BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= 537021.9.peg.1073_1 (162 letters) Database: nr 13,984,884 sequences; 4,792,584,752 total letters Searching..................................................done >gi|315122533|ref|YP_004063022.1| hypothetical protein CKC_03925 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495935|gb|ADR52534.1| hypothetical protein CKC_03925 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 331 Score = 151 bits (382), Expect = 2e-35, Method: Composition-based stats. Identities = 141/161 (87%), Positives = 148/161 (91%) Query: 1 MTKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYA 60 MTKKAKDG QR F+KQ+LD VMQEGYQNG NFRHI+VSPYVKSEFVRFMSDSNVASFRYA Sbjct: 170 MTKKAKDGAQRPFTKQLLDAVMQEGYQNGANFRHIVVSPYVKSEFVRFMSDSNVASFRYA 229 Query: 61 TSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHE 120 S S NNTIVATADIYD PFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLR I E Sbjct: 230 VSDNSKNNTIVATADIYDGPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRNIQE 289 Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADLFGLSKT 161 DKNIAK GDANKGVLIGEG LKV+NEKA+GV++DLFGLSKT Sbjct: 290 DKNIAKTGDANKGVLIGEGTLKVKNEKAIGVISDLFGLSKT 330 >gi|227822441|ref|YP_002826413.1| putative phage major head protein [Sinorhizobium fredii NGR234] gi|227341442|gb|ACP25660.1| putative phage major head protein [Sinorhizobium fredii NGR234] Length = 331 Score = 150 bits (378), Expect = 8e-35, Method: Composition-based stats. Identities = 109/162 (67%), Positives = 124/162 (76%) Query: 1 MTKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYA 60 +T DGTQR F+K ILD VMQ+GYQNG NFRH+ VSPYVKS FV FMSDSNVA FRYA Sbjct: 170 LTVAPTDGTQRPFTKAILDTVMQQGYQNGANFRHVSVSPYVKSVFVTFMSDSNVAPFRYA 229 Query: 61 TSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHE 120 S NTIVATAD Y+ PFG VM+HPNRV A N ARNAF ID +ML FLWLR I E Sbjct: 230 VSQGGERNTIVATADYYEGPFGTVMIHPNRVQAVNGGVARNAFFIDSDMLSFLWLRQIQE 289 Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADLFGLSKTT 162 D+++AK GDA+KGV+IGEG LKV NE+ +GV ADLFGLS + Sbjct: 290 DRDVAKTGDADKGVIIGEGTLKVHNERGLGVAADLFGLSSAS 331 >gi|150397033|ref|YP_001327500.1| hypothetical protein Smed_1830 [Sinorhizobium medicae WSM419] gi|150028548|gb|ABR60665.1| hypothetical protein Smed_1830 [Sinorhizobium medicae WSM419] Length = 331 Score = 146 bits (368), Expect = 1e-33, Method: Composition-based stats. Identities = 110/161 (68%), Positives = 126/161 (78%) Query: 1 MTKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYA 60 +T DG QR F+K ILD VMQ GYQ+G NFRH+ VSPYVKS FV FMSDSNVA FRYA Sbjct: 170 LTVAPTDGEQRAFTKSILDSVMQAGYQSGANFRHVSVSPYVKSVFVTFMSDSNVAPFRYA 229 Query: 61 TSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHE 120 S NTIVATAD Y+ PFG VM+HPNRV A+ A TARNAF +D +MLEFLWLR I E Sbjct: 230 VSQGGERNTIVATADYYEGPFGTVMIHPNRVQAATAATARNAFFLDTDMLEFLWLRKIQE 289 Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADLFGLSKT 161 DK++AK GDA+KGV+IGEG LKV+NEK +GV ADLFGLS+ Sbjct: 290 DKDVAKTGDADKGVIIGEGTLKVKNEKGLGVAADLFGLSEA 330 >gi|291334595|gb|ADD94245.1| putative phage major head protein [uncultured phage MedDCM-OCT-S04-C136] Length = 316 Score = 137 bits (346), Expect = 4e-31, Method: Composition-based stats. Identities = 42/156 (26%), Positives = 73/156 (46%), Gaps = 12/156 (7%) Query: 5 AKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGK 64 DGTQR F++ L ++ Y G N I+V+ + K + F S Sbjct: 167 RTDGTQRAFTEDQLKGILVSCYNEGGNPNMIMVNAFNKQKLSGFTGGSTRF-------DA 219 Query: 65 SHNNTIVATADIYDRPFGKVMVHPNRVMASNAETA----RNAFLIDPNMLEFLWLRTIHE 120 + + ++ + D+Y+ FG + V PNR + TA ++A ++D +LR Sbjct: 220 AEDRRLITSIDVYESDFGTMQVSPNRFIRGANGTAAKIGQDAHILDMEYWAVSFLRDF-A 278 Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADLF 156 + A+ DA++ ++ E L+ RNEKA G++ DL Sbjct: 279 LQTPAQTADADQRFMVAEYTLESRNEKASGLITDLT 314 >gi|160897389|ref|YP_001562971.1| putative phage major head protein [Delftia acidovorans SPH-1] gi|160362973|gb|ABX34586.1| putative phage major head protein [Delftia acidovorans SPH-1] Length = 306 Score = 137 bits (346), Expect = 4e-31, Method: Composition-based stats. Identities = 46/153 (30%), Positives = 72/153 (47%), Gaps = 13/153 (8%) Query: 3 KKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATS 62 GTQR F + + +V+Q+ Y G I++ P K F F ++ Sbjct: 166 TGYTPGTQRAFVESQVKDVLQKCYTAGGEPDTIMMPPAAKQTFSTFTGNATR-------M 218 Query: 63 GKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDK 122 KS ++ + A+ D Y FG + PNR MA+ R+ FL+ + L +LR + Sbjct: 219 DKSEDSKLYASVDYYVSDFGTIQAVPNRFMAT-----RDVFLLQSDKLAVAYLRPF-QTT 272 Query: 123 NIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 +IAK GD++ +I E L+ R KA G V D+ Sbjct: 273 DIAKTGDSDAQQIIVEYTLECRAPKAHGAVYDI 305 >gi|291334838|gb|ADD94478.1| putative phage major head protein [uncultured phage MedDCM-OCT-S06-C1041] Length = 323 Score = 135 bits (340), Expect = 2e-30, Method: Composition-based stats. Identities = 40/152 (26%), Positives = 65/152 (42%), Gaps = 7/152 (4%) Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61 T GT TF + L V+++ + NG + I+ + K F Sbjct: 174 TTDPTAGTAATFVEADLKSVVKQCWDNGGDPGVIMTGSFNKQAASGFAGIGTQFRDV-QP 232 Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121 SG +IV ADIY FG+ + NR M + + +D +LR I + Sbjct: 233 SGPVSPGSIVGAADIYISDFGQHQIVANRFMPAAN-----VYALDMEYWCVAYLRPI-QT 286 Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVA 153 +++ K GD+++ +LI E L +NE A G + Sbjct: 287 EDLGKTGDSDRRMLISEYTLGSKNEAASGKIY 318 >gi|291334638|gb|ADD94286.1| putative phage major head protein [uncultured phage MedDCM-OCT-S04-C64] Length = 323 Score = 134 bits (338), Expect = 3e-30, Method: Composition-based stats. Identities = 39/152 (25%), Positives = 65/152 (42%), Gaps = 7/152 (4%) Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61 + GT TF + L V+++ + NG + I+ + K F Sbjct: 174 STDPTAGTAATFVEADLKSVVKQCWDNGGDPGVIMTGSFNKQAASGFAGIGTQFRDV-QP 232 Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121 SG +IV ADIY FG+ + NR M + + +D +LR I + Sbjct: 233 SGPVSPGSIVGAADIYISDFGQHQIVANRFMPAAN-----VYALDMEYWCVAYLRPI-QT 286 Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVA 153 +++ K GD+++ +LI E L +NE A G + Sbjct: 287 EDLGKTGDSDRRMLISEYTLGSKNEAASGKIY 318 >gi|148257053|ref|YP_001241638.1| putative phage major head protein [Bradyrhizobium sp. BTAi1] gi|146409226|gb|ABQ37732.1| putative phage Major head protein [Bradyrhizobium sp. BTAi1] Length = 320 Score = 131 bits (329), Expect = 4e-29, Method: Composition-based stats. Identities = 41/154 (26%), Positives = 67/154 (43%), Gaps = 13/154 (8%) Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61 T DGTQ F++ L V+ + NG I+ + K F F S Sbjct: 175 TGTRTDGTQIAFTEARLKTVLSSIWINGGKPGTILTGAFNKQVFSTFTGRSTAIEE---- 230 Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTI-HE 120 + + IVA+ D Y+ FGK+ V +R + R+ +++ + +L Sbjct: 231 ---AKSKKIVASVDAYESDFGKLKVVASRFQRA-----RDVLVLEMDKWAVAYLNGRNMI 282 Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVAD 154 +AK GD+++ ++ E AL RNEKA G + D Sbjct: 283 SIPLAKTGDSDRRQILAEYALVARNEKASGGIFD 316 >gi|256377352|ref|YP_003101012.1| hypothetical protein Amir_3259 [Actinosynnema mirum DSM 43827] gi|255921655|gb|ACU37166.1| hypothetical protein Amir_3259 [Actinosynnema mirum DSM 43827] Length = 329 Score = 126 bits (316), Expect = 1e-27, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 58/159 (36%), Gaps = 12/159 (7%) Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGD----NFRHIIVSPYVKSEFVRFMSDSNVASF 57 T + T ++ ++ +++Q+ ++NG ++ + K Sbjct: 173 TNVITNSTPTPLTESMVIDLLQKVWENGGIQISETATLMCGAWQKRRLTDEFVTRKNYQE 232 Query: 58 RYATSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRT 117 + G + + FG+V + NR M ++A + P MLE Sbjct: 233 QTRNVG-------GVSVSTIETDFGRVNIMLNRHMPADAVQVVSLEQCAPVMLEKPG-SG 284 Query: 118 IHEDKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADLF 156 + +AK G +++ + GE L+ E A G + L Sbjct: 285 FLFSEPLAKTGASDRVHIYGEIGLEYGPEIAHGKITGLT 323 >gi|163783849|ref|ZP_02178828.1| hypothetical protein HG1285_12862 [Hydrogenivirga sp. 128-5-R1-1] gi|159880872|gb|EDP74397.1| hypothetical protein HG1285_12862 [Hydrogenivirga sp. 128-5-R1-1] Length = 291 Score = 125 bits (313), Expect = 2e-27, Method: Composition-based stats. Identities = 37/151 (24%), Positives = 74/151 (49%), Gaps = 13/151 (8%) Query: 5 AKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGK 64 A GT R ++ +L++ +Q+ +Q G N ++V+ K F + Sbjct: 153 ANAGTPRPLTETLLNDGLQQAWQTGGNPDVVVVNGTNKRTISGFTAGLTK-------EIA 205 Query: 65 SHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNI 124 + + ++A+ D+Y+ FG V V NR M ++ F+++ + +L H+ + + Sbjct: 206 AKDKRLIASVDVYESDFGLVRVIANRWMPAD-----KVFILEKGRWKVAYLDPFHQ-EEL 259 Query: 125 AKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 +K GD + V++GE L+ R EKA ++ D+ Sbjct: 260 SKVGDRKERVVVGELTLEGRAEKANAIITDI 290 >gi|288817864|ref|YP_003432211.1| putative phage major head protein [Hydrogenobacter thermophilus TK-6] gi|288787263|dbj|BAI69010.1| putative phage major head protein [Hydrogenobacter thermophilus TK-6] gi|308751463|gb|ADO44946.1| putative phage major head protein [Hydrogenobacter thermophilus TK-6] Length = 291 Score = 121 bits (303), Expect = 3e-26, Method: Composition-based stats. Identities = 40/151 (26%), Positives = 75/151 (49%), Gaps = 13/151 (8%) Query: 5 AKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGK 64 A G+ R ++ +L++ +Q+ +Q G N ++VS K F + Sbjct: 153 ANGGSPRALTETLLNDGIQQAWQAGGNPDVVVVSGKNKRTVSSFTAGVTKTV-------A 205 Query: 65 SHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNI 124 + + +V+T D+Y+ FG V + +R M + F+++ + +LR + + I Sbjct: 206 AEDKKLVSTVDVYESDFGLVRIVADRWMPDS-----KVFILEKGRFKIAYLRPFKQQE-I 259 Query: 125 AKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 AK GD + V++GE L+VR EKA ++AD+ Sbjct: 260 AKVGDRIERVVVGELTLEVRAEKANAIIADI 290 >gi|226305754|ref|YP_002765714.1| hypothetical protein RER_22670 [Rhodococcus erythropolis PR4] gi|226184871|dbj|BAH32975.1| hypothetical protein RER_22670 [Rhodococcus erythropolis PR4] Length = 317 Score = 119 bits (298), Expect = 1e-25, Method: Composition-based stats. Identities = 33/158 (20%), Positives = 55/158 (34%), Gaps = 12/158 (7%) Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGD----NFRHIIVSPYVKSEFVRFMSDSNVASF 57 + +GT ++ +L ++MQ ++NG R I+V K + D Sbjct: 167 SNVVTNGTPAALTETMLLDLMQLAWENGGIQESETRTIVVGAKQKRALTKIFIDGKGYKE 226 Query: 58 RYATSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRT 117 G + FG+ + NR M + + P LE + Sbjct: 227 STRNVG-------GVNLQTIETDFGRCNIMLNRYMPAAQLAVVSLEECAPRFLEIPG-KG 278 Query: 118 IHEDKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 +++AK G + K L GE LK NE + L Sbjct: 279 HFFAESLAKTGASEKVQLYGEIGLKYGNELKHAKLTGL 316 >gi|221199511|ref|ZP_03572555.1| major head protein [Burkholderia multivorans CGD2M] gi|221205587|ref|ZP_03578602.1| major head protein [Burkholderia multivorans CGD2] gi|221174425|gb|EEE06857.1| major head protein [Burkholderia multivorans CGD2] gi|221180796|gb|EEE13199.1| major head protein [Burkholderia multivorans CGD2M] Length = 317 Score = 116 bits (291), Expect = 9e-25, Method: Composition-based stats. Identities = 45/153 (29%), Positives = 72/153 (47%), Gaps = 6/153 (3%) Query: 3 KKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATS 62 GT R F++ +L E ++ Y+ G N + + P K F + A+ + + Sbjct: 169 TPPVAGTARPFTEDLLKEALRLAYEAGGNVTQVHMRPVDKQRASGFAGN---ATRMESVN 225 Query: 63 GKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDK 122 G + + IY FG V + PNRVM++ +TA + IDP+M LR + + Sbjct: 226 GTGKTAVLQTSYSIYASDFGNVAMIPNRVMSAVNDTA--VYAIDPSMWALATLRGFEK-E 282 Query: 123 NIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 +AK GDA + EG L+ RNE + + DL Sbjct: 283 ELAKVGDARNWQITWEGTLEARNEASSAQIRDL 315 >gi|167600435|ref|YP_001671935.1| major head protein [Pseudomonas phage LUZ24] gi|161168298|emb|CAP45463.1| major head protein [Pseudomonas phage LUZ24] Length = 317 Score = 115 bits (288), Expect = 2e-24, Method: Composition-based stats. Identities = 39/154 (25%), Positives = 67/154 (43%), Gaps = 11/154 (7%) Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61 + G R ++ +L + ++NG I S +K + N+ Sbjct: 171 SDTGTAGDLRLLTEDMLLNASEAIWRNGGQANSIQTSSSIKKAISK-----NMKGRATEI 225 Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121 + + +N I T D+Y+ FGK + NR N F+ DP M +LR + Sbjct: 226 TLDASDNRIAQTVDVYESDFGKYTIRANRWFHENT-----MFIFDPKMHALCYLRPFFQH 280 Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 + +AK GD+ K L+ E L+V NEK+ ++ D+ Sbjct: 281 E-LAKTGDSEKRQLLVEYTLRVNNEKSGALIRDV 313 >gi|27476049|ref|NP_775251.1| major head protein [Pseudomonas phage PaP3] gi|27414479|gb|AAL85565.1| major head protein [Pseudomonas phage PaP3] Length = 317 Score = 114 bits (284), Expect = 6e-24, Method: Composition-based stats. Identities = 38/154 (24%), Positives = 66/154 (42%), Gaps = 11/154 (7%) Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61 + G R ++ +L + ++NG I S +K + N+ Sbjct: 171 SNTGTAGDLRLLTEDMLLNASESIWRNGGQANSIQTSSSIKKAISK-----NMKGRATEI 225 Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121 + + +N I T D+Y+ FGK + NR N F+ DP M +LR + Sbjct: 226 TLDASDNRIAQTVDVYESDFGKYTIRANRWFHENT-----LFVFDPKMHSLCYLRPFFQH 280 Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 + +AK GD+ K L+ E +V NEK+ ++ D+ Sbjct: 281 E-LAKTGDSEKRQLLVEYTFRVNNEKSGALIRDV 313 >gi|291334405|gb|ADD94061.1| major head protein [uncultured phage MedDCM-OCT-S01-C1] Length = 344 Score = 108 bits (270), Expect = 2e-22, Method: Composition-based stats. Identities = 37/158 (23%), Positives = 69/158 (43%), Gaps = 4/158 (2%) Query: 1 MTKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYA 60 +T K + S Q + + +Q Y+ G H++ +P V +M D+ Sbjct: 180 LTVKYTEDDGVALSFQAVKDAIQGVYEQGGEVTHLMSNPGVIGALSSYMFDNEARVATLT 239 Query: 61 TSGKSHNN---TIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRT 117 + + N T +++ ++ FG + + PNR+ A AFL+DP + +L Sbjct: 240 SDQGAPANSKATALSSVNVLVSDFGTIKLVPNRLQPLAANGTAVAFLLDPEYVSLSYLEG 299 Query: 118 IHEDKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 +AK G A K + + L+V EKA G++ ++ Sbjct: 300 Y-RTDTLAKTGLAEKRQISVDWGLRVHTEKAHGMLVNI 336 >gi|283856246|ref|YP_162122.2| putative phage major head protein [Zymomonas mobilis subsp. mobilis ZM4] gi|283775241|gb|AAV89011.2| putative phage major head protein [Zymomonas mobilis subsp. mobilis ZM4] Length = 304 Score = 107 bits (268), Expect = 4e-22, Method: Composition-based stats. Identities = 43/157 (27%), Positives = 70/157 (44%), Gaps = 11/157 (7%) Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61 T A G R +++ +L EV+ + + +G +I S +K F ++ Sbjct: 159 TTAATTGKARLYTEALLKEVLGDIWVSGGQPNMVITSLKLKQTAAAFPGLASNRR----- 213 Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121 I+A ADIY G+V P+R ++ +A +IDP L I + Sbjct: 214 DTGDQKARIIAGADIYVSDVGEVQFVPDRFCDNS-----SALVIDPEYWSVATLDPIQK- 267 Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVADLFGL 158 +++A G A++ L E AL+ NE A GV+ADL Sbjct: 268 RSLATTGLADRDALYTEIALRCHNEAASGVLADLSAA 304 >gi|241760939|ref|ZP_04759028.1| putative phage major head protein [Zymomonas mobilis subsp. mobilis ATCC 10988] gi|241374558|gb|EER64019.1| putative phage major head protein [Zymomonas mobilis subsp. mobilis ATCC 10988] Length = 238 Score = 107 bits (266), Expect = 8e-22, Method: Composition-based stats. Identities = 42/157 (26%), Positives = 69/157 (43%), Gaps = 11/157 (7%) Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61 T A G R +++ +L EV+ + + +G +I S +K F ++ Sbjct: 93 TTAATAGKARLYTEALLKEVLGDIWVSGGQPNMVITSLKLKQTAAAFPGLASNRR----- 147 Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121 I+A ADIY G+V P+R ++ +A +IDP L I + Sbjct: 148 DTGDQKARIIAGADIYVSDVGEVQFVPDRFCDNS-----SALVIDPEYWSVATLDPIQK- 201 Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVADLFGL 158 +++A G A++ L E AL+ NE GV+ADL Sbjct: 202 RSLATTGLADRDALYTEIALRCHNEATSGVLADLSAA 238 >gi|261337429|ref|ZP_05965313.1| conserved hypothetical protein [Bifidobacterium gallicum DSM 20093] gi|270277816|gb|EFA23670.1| conserved hypothetical protein [Bifidobacterium gallicum DSM 20093] Length = 160 Score = 102 bits (255), Expect = 1e-20, Method: Composition-based stats. Identities = 30/148 (20%), Positives = 56/148 (37%), Gaps = 12/148 (8%) Query: 10 QRTFSKQILDEVMQEGYQNGD----NFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKS 65 +T S+ + ++MQ + NG R I+V+ +K R G Sbjct: 7 AKTLSEDDVLDLMQMAWDNGGIREAETRTIVVNSDLKRALTRIFIKDAGYKEETRNVG-- 64 Query: 66 HNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIA 125 + + FG+ + + ++ + + + P LE + + +A Sbjct: 65 -----GVSLQTIETDFGRCNIMLDSLVLKDKMLVLSLDQLAPRFLEIPG-KGHFFVEPLA 118 Query: 126 KNGDANKGVLIGEGALKVRNEKAVGVVA 153 K G ++K L GE L+ NEKA V+ Sbjct: 119 KTGASDKVQLYGEIGLEYGNEKAHAVLT 146 >gi|316934287|ref|YP_004109269.1| putative phage major head protein [Rhodopseudomonas palustris DX-1] gi|315602001|gb|ADU44536.1| putative phage major head protein [Rhodopseudomonas palustris DX-1] Length = 304 Score = 99.8 bits (247), Expect = 1e-19, Method: Composition-based stats. Identities = 34/149 (22%), Positives = 61/149 (40%), Gaps = 11/149 (7%) Query: 6 KDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKS 65 T ++ + Q Y G +V+P F F + + S Sbjct: 159 TGATGTKITEANYLDAAQALYDAGSEASITLVTPTNSRTFAGFTGAAGRSRVINDDS--- 215 Query: 66 HNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIH-EDKNI 124 IV ++Y PFG+ + +R + + + L DP M + L+ + + + Sbjct: 216 --KKIVNAVNLYVSPFGEEKIVLSRHLKAG-----DTLLFDPAMWARVILQGRNWFRETL 268 Query: 125 AKNGDANKGVLIGEGALKVRNEKAVGVVA 153 AK GD+ K +++GE LK +N+KA +V Sbjct: 269 AKTGDSLKMMIVGEFGLKHKNQKASALVR 297 >gi|307308932|ref|ZP_07588615.1| putative phage major head protein [Sinorhizobium meliloti BL225C] gi|306900566|gb|EFN31179.1| putative phage major head protein [Sinorhizobium meliloti BL225C] Length = 309 Score = 98.6 bits (244), Expect = 3e-19, Method: Composition-based stats. Identities = 35/156 (22%), Positives = 61/156 (39%), Gaps = 13/156 (8%) Query: 1 MTKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYA 60 + DGT R S ++ E+ + + G + ++V P K V + + Sbjct: 167 LVGAVTDGTARALSHALIVEMAEGIHSEGGVPKILLVPPAQKKNIVALSNGTTKFQ---- 222 Query: 61 TSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHE 120 + T+ +Y+ PFG+ + +R + S+ DP + R + + Sbjct: 223 ---DASKKTVFGDVTMYETPFGRFDIVTSRDVRSST-----VIGYDPELWAQAVFRGLTK 274 Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADLF 156 K + + GD +I E L RNE G VADL Sbjct: 275 -KKLPEGGDYEGYQVITEVTLVCRNEAGNGKVADLS 309 >gi|294083943|ref|YP_003550700.1| major head protein [Candidatus Puniceispirillum marinum IMCC1322] gi|292663515|gb|ADE38616.1| major head protein [Candidatus Puniceispirillum marinum IMCC1322] Length = 133 Score = 97.5 bits (241), Expect = 5e-19, Method: Composition-based stats. Identities = 32/143 (22%), Positives = 56/143 (39%), Gaps = 11/143 (7%) Query: 13 FSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVA 72 ++ L + Q Y G + ++ P F S + T+V Sbjct: 1 MTEAKLLDAHQAAYTAGADVSVFMIKPSDSEIVAGFTGSSGRTRNFNDGTT-----TLVN 55 Query: 73 TADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDANK 132 D+ P+G + V NR + AFL+DP+M LR + ++K GD++K Sbjct: 56 AVDVMINPYGTLKVVLNRHQLTTH-----AFLLDPSMWRSAVLRPVSRT-LLSKTGDSDK 109 Query: 133 GVLIGEGALKVRNEKAVGVVADL 155 + EG L+ N+ ++ L Sbjct: 110 HFCVYEGGLQHLNQSGSAMITGL 132 >gi|256751057|ref|ZP_05491940.1| conserved hypothetical protein [Thermoanaerobacter ethanolicus CCSD1] gi|256750167|gb|EEU63188.1| conserved hypothetical protein [Thermoanaerobacter ethanolicus CCSD1] Length = 292 Score = 95.5 bits (236), Expect = 2e-18, Method: Composition-based stats. Identities = 30/151 (19%), Positives = 54/151 (35%), Gaps = 16/151 (10%) Query: 4 KAKDGTQRTFSKQILDEVMQEGYQNG--DNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61 + F + +++ + NG N +V+ +K F +S Sbjct: 152 TENKVSNTAFDEAQFKATVKKLWDNGLGSNQYIALVNADIKEIIDDFYKNSTRYIGNVNE 211 Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121 G + +G V + NR + + + DP + +LR + Sbjct: 212 YGVVVDKVN--------TNYGVVNLVLNRHVPAG-----KMIVFDPAYVRLAFLR-TPSN 257 Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVV 152 + +AK GD G +I E LKV N KA+ + Sbjct: 258 EILAKTGDYISGQVITELTLKVLNAKALALF 288 >gi|229037842|ref|ZP_04189640.1| hypothetical protein bcere0028_57530 [Bacillus cereus AH1271] gi|228727464|gb|EEL78642.1| hypothetical protein bcere0028_57530 [Bacillus cereus AH1271] Length = 315 Score = 94.0 bits (232), Expect = 6e-18, Method: Composition-based stats. Identities = 24/156 (15%), Positives = 51/156 (32%), Gaps = 14/156 (8%) Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFR--HIIVSPYVKSEFVRFMSDSNVASFRY 59 T +K + +++++ NG +F + V+ + K Sbjct: 169 TGNTIAAAGAKLNKDLFNQLLRMMATNGADFNEMVLYVNAFNKQVVSEIFG--------- 219 Query: 60 ATSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIH 119 N + F KV + +R + + + + ++ P + + + Sbjct: 220 --YAPQDRNVGGVNIQQIETDFAKVGIVWDRFVPAGSIGLFDMSVVAPVLQNVPD-KGVL 276 Query: 120 EDKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 + +AK G A KG + GE L G + L Sbjct: 277 FYEELAKTGAAEKGQIYGEIGLAHGPAFMHGSITGL 312 >gi|302389556|ref|YP_003825377.1| hypothetical protein Toce_0992 [Thermosediminibacter oceani DSM 16646] gi|302200184|gb|ADL07754.1| conserved hypothetical protein [Thermosediminibacter oceani DSM 16646] Length = 294 Score = 92.5 bits (228), Expect = 2e-17, Method: Composition-based stats. Identities = 29/146 (19%), Positives = 50/146 (34%), Gaps = 14/146 (9%) Query: 9 TQRTFSKQILDEVMQEGYQNGD--NFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSH 66 T + +++ ++NG N +V+ +K Sbjct: 157 TGANLDEANFKATVKKLWENGLGINEYVALVNADLKEAIDAIYDAKYQY------IAPMQ 210 Query: 67 NNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAK 126 +N +G V + NR M + DP L LR + +AK Sbjct: 211 DNYFGLVVRRVQTNYGNVNIILNRHMP-----VDKMVVFDPAYLRIANLREPVF-EPLAK 264 Query: 127 NGDANKGVLIGEGALKVRNEKAVGVV 152 GD+ KG ++ E LKV + KA+ + Sbjct: 265 TGDSVKGQVVAELTLKVLSPKAIAMF 290 >gi|217961109|ref|YP_002339677.1| hypothetical protein BCAH187_A3735 [Bacillus cereus AH187] gi|229140327|ref|ZP_04268882.1| hypothetical protein bcere0013_34260 [Bacillus cereus BDRD-ST26] gi|217064163|gb|ACJ78413.1| conserved hypothetical protein [Bacillus cereus AH187] gi|228642888|gb|EEK99164.1| hypothetical protein bcere0013_34260 [Bacillus cereus BDRD-ST26] Length = 293 Score = 91.7 bits (226), Expect = 3e-17, Method: Composition-based stats. Identities = 29/155 (18%), Positives = 61/155 (39%), Gaps = 17/155 (10%) Query: 3 KKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHII--VSPYVKSEFVRFMSDSNVASFRYA 60 T+ +++ + E++++ + G+ V +K + D Sbjct: 151 NVVNGATKDVLTEKEIKELVKKLWTAGNENGEFYALVGADLKDQIDELYKDRYSYQHVTT 210 Query: 61 TSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHE 120 G + + +GKV +R M ++ A + L+ +LR + Sbjct: 211 DFG--------IVVNSVETSYGKVNFILDRYMPADKIVAFDVNA-----LKVAFLRQ-PQ 256 Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 + + K GD KG ++ E +L+V ++KAV V +L Sbjct: 257 FEQLGKTGDNVKGQVVAECSLEVGSKKAVA-VYNL 290 >gi|228910960|ref|ZP_04074768.1| hypothetical protein bthur0013_51010 [Bacillus thuringiensis IBL 200] gi|228848615|gb|EEM93461.1| hypothetical protein bthur0013_51010 [Bacillus thuringiensis IBL 200] Length = 363 Score = 91.3 bits (225), Expect = 4e-17, Method: Composition-based stats. Identities = 37/159 (23%), Positives = 53/159 (33%), Gaps = 19/159 (11%) Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNF----RHIIVSPYVKSEFVRFMSDSNVASF 57 T G + L E ++ ++ G IIV K F + S Sbjct: 217 TNVINAGGAAVADE-HLIEAFRQIFEKGGYTVGGNYKIIVGATQKIAISAFDNALIRLSR 275 Query: 58 RYATSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRT 117 G+ D Y FG + N M + +ID N L LRT Sbjct: 276 EDNGRGQV--------VDHYVSDFGNAEILLNNNMHAG-----EILVIDANRLSIRPLRT 322 Query: 118 IHE-DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 + + K GD KG+L+GE L+ E A + L Sbjct: 323 REFAHEFLGKKGDYMKGMLVGEYTLEFLQEAAHAKITGL 361 >gi|257458669|ref|ZP_05623796.1| conserved hypothetical protein [Campylobacter gracilis RM3268] gi|257443942|gb|EEV19058.1| conserved hypothetical protein [Campylobacter gracilis RM3268] Length = 328 Score = 90.2 bits (222), Expect = 9e-17, Method: Composition-based stats. Identities = 27/149 (18%), Positives = 55/149 (36%), Gaps = 14/149 (9%) Query: 8 GTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHN 67 GT ++ L++++Q + G R + + +K F + + Sbjct: 190 GTPTVLTEDALNKILQNIWDAGATPRDVFIGAKLKKAINAFATRQF-----------GNE 238 Query: 68 NTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRT-IHEDKNIAK 126 +I ++ D FGKV +R ++ + D + ++ L + E+ +K Sbjct: 239 KSINSSVVSLDTDFGKVNFRLHRYLSEQNKLDDVLIAGDFSFMKNGLLIPTMIENVTTSK 298 Query: 127 NGDANKGVLIGEGALKVRNEKAVGVVADL 155 A + E L+VRN A + L Sbjct: 299 T--AKQKRYYTEATLEVRNADAFAIGVGL 325 >gi|196048420|ref|ZP_03115595.1| conserved hypothetical protein [Bacillus cereus 03BB108] gi|196020677|gb|EDX59409.1| conserved hypothetical protein [Bacillus cereus 03BB108] Length = 312 Score = 89.8 bits (221), Expect = 1e-16, Method: Composition-based stats. Identities = 34/164 (20%), Positives = 65/164 (39%), Gaps = 12/164 (7%) Query: 1 MTKKAKDGTQ--RTFSKQILDEVMQEGYQNG-DNFRHIIVSPYVKSEFVRFMSDSNVASF 57 + T S L + + ++ G + + V + V Sbjct: 150 LINSRNKFTPKDGKLSADDLIKAFRLCWEKGAGGDKLVQCGSQVAEFIDKLFK---VEKG 206 Query: 58 RYATSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRT 117 + + N I TAD+ +G+ + NR M A T ++D N ++ LR Sbjct: 207 VMIPAIQGGGNIIGLTADVIHTRYGRGNIVLNRHMPDGALT-----IVDLNQVKLRPLRK 261 Query: 118 IHEDKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADLFGLSKT 161 + + +AKNGD+ K +++GE +L+++N A V+ + G Sbjct: 262 MLA-EPLAKNGDSQKRMIVGEYSLELKNSYAGAVINGITGYVDP 304 >gi|331088860|ref|ZP_08337770.1| hypothetical protein HMPREF1025_01353 [Lachnospiraceae bacterium 3_1_46FAA] gi|330407383|gb|EGG86886.1| hypothetical protein HMPREF1025_01353 [Lachnospiraceae bacterium 3_1_46FAA] Length = 314 Score = 89.4 bits (220), Expect = 1e-16, Method: Composition-based stats. Identities = 27/145 (18%), Positives = 51/145 (35%), Gaps = 9/145 (6%) Query: 9 TQRTFSKQILDEVMQEGYQNGDNFRHI-IVSPYVKSEFVRFMSDSNVASFRYATSGKSHN 67 T S+ + +Q+ + +G + V+ K N + S Sbjct: 161 TAGALSEDHFLDALQKMWDHGAQGAYFAFVNATQKRAINALAKTGNNVRWVLDN--GSVT 218 Query: 68 NTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKN 127 N FG++ + +R M +N +D + + LR +++ K Sbjct: 219 NVYGIAVSKIVTDFGEISLVLDRYMDNNT-----ILTLDLDEVRIAELRGT-FYEDLPKA 272 Query: 128 GDANKGVLIGEGALKVRNEKAVGVV 152 GD KG +I E +K+ N A + Sbjct: 273 GDYFKGHVINESTIKLLNSYAGSKI 297 >gi|238909129|ref|YP_002939596.1| hypothetical protein EUBELI_10025 [Eubacterium eligens ATCC 27750] gi|238873366|gb|ACR73075.1| Hypothetical protein EUBELI_10025 [Eubacterium eligens ATCC 27750] Length = 304 Score = 89.4 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 24/145 (16%), Positives = 53/145 (36%), Gaps = 9/145 (6%) Query: 9 TQRTFSKQILDEVMQEGYQNGDNFRHI-IVSPYVKSEFVRFMSDSNVASFRYATSGKSHN 67 T+ ++ + +Q+ + +G + V+ +K N R+ S Sbjct: 161 TKGALTEDHFLDALQKMWDHGAQGEYFSFVNANIKRMINNLAKAGNNV--RFLGDNGSMQ 218 Query: 68 NTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKN 127 N + FG++ + +R + +D ++ LR +++ K Sbjct: 219 NVLGIGVQKIVTDFGEISLVLDRY-----ADTKTILTVDLGEVQIAELRGT-FYEDLPKA 272 Query: 128 GDANKGVLIGEGALKVRNEKAVGVV 152 GD KG ++ E +K+ N A + Sbjct: 273 GDYFKGHVLNESTIKLLNSHAGSKI 297 >gi|281417131|ref|ZP_06248151.1| conserved hypothetical protein [Clostridium thermocellum JW20] gi|281408533|gb|EFB38791.1| conserved hypothetical protein [Clostridium thermocellum JW20] Length = 292 Score = 89.4 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 34/141 (24%), Positives = 56/141 (39%), Gaps = 18/141 (12%) Query: 14 SKQILDEVMQEGYQNG---DNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTI 70 ++ V++ + G F + V+ K F ++ + T Sbjct: 165 TQNDFKAVVKRLWDCGLGSGEF-YCFVNADFKEIIDGFYANQINYNMPMN--------TF 215 Query: 71 VATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDA 130 A+ +G V V NR M ++DP L ++LR + +AK+GD Sbjct: 216 GFVANKIITNYGIVNVVLNRHMP-----VDKILVVDPTYLRLVYLRK-PSFEMLAKDGDN 269 Query: 131 NKGVLIGEGALKVRNEKAVGV 151 KG++I E LK+ N KAV V Sbjct: 270 LKGMVITELTLKLLNSKAVAV 290 >gi|163937921|ref|YP_001642807.1| hypothetical protein BcerKBAB4_5338 [Bacillus weihenstephanensis KBAB4] gi|163865776|gb|ABY46832.1| hypothetical protein BcerKBAB4_5338 [Bacillus weihenstephanensis KBAB4] Length = 391 Score = 89.0 bits (219), Expect = 2e-16, Method: Composition-based stats. Identities = 31/152 (20%), Positives = 54/152 (35%), Gaps = 19/152 (12%) Query: 16 QILDEVMQEGYQNGDNFR----HIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIV 71 +L + ++ G IIV K +F + G+ Sbjct: 233 DMLINAFRSIFEKGGMNSGGNFKIIVGGTQKVAISQFGNAQIRLDRLDNGRGQV------ 286 Query: 72 ATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTI-HEDKNIAKNGDA 130 D Y FG + N + S+ ++D N + L T + + K GD Sbjct: 287 --VDHYMSDFGAAEIVLNNNLRSD-----EILIVDSNRISIRPLTTREFSHEFLGKKGDY 339 Query: 131 NKGVLIGEGALKVRNEKAVGVVADLFGLSKTT 162 KG+L+GE ++ E+A + + G +K T Sbjct: 340 MKGMLVGEYTMEFLQEQAHARITGV-GTAKPT 370 >gi|227821704|ref|YP_002825674.1| putative phage related protein [Sinorhizobium fredii NGR234] gi|227340703|gb|ACP24921.1| putative phage related protein [Sinorhizobium fredii NGR234] Length = 123 Score = 88.6 bits (218), Expect = 2e-16, Method: Composition-based stats. Identities = 76/110 (69%), Positives = 85/110 (77%) Query: 26 YQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATADIYDRPFGKVM 85 YQNG NFRH+ SPYVKS FV FMSDSNVA FRYA S NTIVATAD Y+ PFG VM Sbjct: 7 YQNGANFRHVSASPYVKSVFVTFMSDSNVAPFRYAVSKGGERNTIVATADYYEGPFGTVM 66 Query: 86 VHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDANKGVL 135 +HPNRV A+ A ARNAF ID +ML FLWLR I EDK++AK GDA+KGV+ Sbjct: 67 IHPNRVQATGATQARNAFFIDTDMLSFLWLRKIQEDKDVAKTGDADKGVI 116 >gi|150021335|ref|YP_001306689.1| hypothetical protein Tmel_1457 [Thermosipho melanesiensis BI429] gi|149793856|gb|ABR31304.1| hypothetical protein Tmel_1457 [Thermosipho melanesiensis BI429] Length = 362 Score = 88.6 bits (218), Expect = 3e-16, Method: Composition-based stats. Identities = 28/146 (19%), Positives = 54/146 (36%), Gaps = 14/146 (9%) Query: 9 TQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNN 68 + T+S+ ++ Y NG ++ K F SD + + Sbjct: 228 STNTWSEDNFRAFLKLIYDNGGYITEAWMNASTKQYFNSLNSDKLIVTQ--------DER 279 Query: 69 TIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTI-HEDKNIAKN 127 T D Y +G++ + + + ++D + L+ L + +AK Sbjct: 280 TAGRLVDGYLSEYGQISLKTSPHIPEGM-----IIVVDTSNLKIKPLSGRNMAYEPLAKT 334 Query: 128 GDANKGVLIGEGALKVRNEKAVGVVA 153 GD+ KG ++GE L+ RN G+ Sbjct: 335 GDSVKGQIVGEYTLEFRNPDGAGIFY 360 >gi|152975085|ref|YP_001374602.1| hypothetical protein Bcer98_1285 [Bacillus cereus subsp. cytotoxis NVH 391-98] gi|152023837|gb|ABS21607.1| conserved hypothetical protein [Bacillus cytotoxicus NVH 391-98] Length = 293 Score = 88.6 bits (218), Expect = 3e-16, Method: Composition-based stats. Identities = 29/151 (19%), Positives = 60/151 (39%), Gaps = 16/151 (10%) Query: 3 KKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHII--VSPYVKSEFVRFMSDSNVASFRYA 60 T+ +++ + E++++ + G+ V +K + D Sbjct: 151 NVVNGVTKDVLTEKEVKELVKKLWTAGNENGEFYALVGADLKDQIDELYKDRYSYQHVTT 210 Query: 61 TSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHE 120 G T D + +GK+ +R M ++ A + I + +LR + Sbjct: 211 DFG--------ITVDSVNTSYGKINFILDRYMPADKIVAFDVNAI-----KVAFLRQ-PQ 256 Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGV 151 + + K GD +KG ++ E L+V ++KAV V Sbjct: 257 FQALGKTGDNDKGQVVAEATLEVGSKKAVAV 287 >gi|229162523|ref|ZP_04290484.1| hypothetical protein bcere0009_32950 [Bacillus cereus R309803] gi|228621002|gb|EEK77867.1| hypothetical protein bcere0009_32950 [Bacillus cereus R309803] Length = 293 Score = 87.8 bits (216), Expect = 4e-16, Method: Composition-based stats. Identities = 29/155 (18%), Positives = 58/155 (37%), Gaps = 17/155 (10%) Query: 3 KKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHII--VSPYVKSEFVRFMSDSNVASFRYA 60 T ++ + E++++ + G+ V +K + D Sbjct: 151 NVVNGATANVLQEKEVKELVKKLWTAGNENGEFYALVGADIKDQIDELYKDRYSYQHVTN 210 Query: 61 TSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHE 120 G + + +GKV +R M ++ A + L+ +LR + Sbjct: 211 DFG--------IVVNSVETSYGKVNFILDRYMPADKIVAFDINA-----LKVAFLRQ-PQ 256 Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 + + K GD KG ++ E L+V ++KAV V +L Sbjct: 257 FEALGKTGDNIKGQVVAEATLEVGSKKAVA-VYNL 290 >gi|114566839|ref|YP_753993.1| hypothetical protein Swol_1314 [Syntrophomonas wolfei subsp. wolfei str. Goettingen] gi|114337774|gb|ABI68622.1| hypothetical protein Swol_1314 [Syntrophomonas wolfei subsp. wolfei str. Goettingen] Length = 398 Score = 87.8 bits (216), Expect = 5e-16, Method: Composition-based stats. Identities = 27/143 (18%), Positives = 59/143 (41%), Gaps = 16/143 (11%) Query: 12 TFSKQILDEVMQEGYQNGDNFRH--IIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNT 69 T ++ + E +++ + G + +V+ +K + ++ + + Sbjct: 160 TITEPKIKETVKKLWDAGMSSGTYIAMVNADLKEDIDALYANKYSY--------IAQESL 211 Query: 70 IVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGD 129 +G + + NR M ++ + DP ++ +LR H + +AK GD Sbjct: 212 FGLIVSSIQTNYGTIKLVLNRHMPTD-----KMVVFDPAFVKISYLRQPHF-ELLAKTGD 265 Query: 130 ANKGVLIGEGALKVRNEKAVGVV 152 + G +I E LK+ N+KA+ V Sbjct: 266 SINGHVIAEMTLKMLNQKAIAVF 288 >gi|154174521|ref|YP_001409081.1| hypothetical protein CCV52592_0028 [Campylobacter curvus 525.92] gi|112803013|gb|EAU00357.1| conserved hypothetical protein [Campylobacter curvus 525.92] Length = 327 Score = 87.5 bits (215), Expect = 6e-16, Method: Composition-based stats. Identities = 26/149 (17%), Positives = 54/149 (36%), Gaps = 14/149 (9%) Query: 8 GTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHN 67 G ++ +L +++Q+ + +G + + + +K + + + Sbjct: 190 GANTPLTEDVLSQILQQIWDSGATPKDVFIGAALKPAINKLATRQF-----------GNE 238 Query: 68 NTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFL-WLRTIHEDKNIAK 126 I + D FGKV +R +++ A D + +L T ED +K Sbjct: 239 KAINSRVVSLDTDFGKVNFRMHRFLSAKYGLADTLIAGDFEFAKNGLFLPTEIEDVPTSK 298 Query: 127 NGDANKGVLIGEGALKVRNEKAVGVVADL 155 A + E L++RN A + L Sbjct: 299 T--AKQKRYYTECCLEIRNPAAFAIGVGL 325 >gi|169827502|ref|YP_001697660.1| hypothetical protein Bsph_1941 [Lysinibacillus sphaericus C3-41] gi|168991990|gb|ACA39530.1| conserved hypothetical protein [Lysinibacillus sphaericus C3-41] Length = 288 Score = 87.5 bits (215), Expect = 6e-16, Method: Composition-based stats. Identities = 27/151 (17%), Positives = 60/151 (39%), Gaps = 16/151 (10%) Query: 5 AKDGTQRTFSKQILDEVMQEGYQNGDNFRHI--IVSPYVKSEFVRFMSDSNVASFRYATS 62 A + ++ + +VM+ + + V+ +K + D+ + + Sbjct: 149 ATNAVTAPIDEKAIKQVMRNLWNQNLAEGQVYAFVNADIKEKIDAIYKDNYSYNHVTTSF 208 Query: 63 GKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDK 122 G D + +G V ++ + ++ + ++ +LR H + Sbjct: 209 G--------LLVDEINTNYGSVKFVLSKHVPAD-----KVVFFNDAYVDLAYLREPHF-E 254 Query: 123 NIAKNGDANKGVLIGEGALKVRNEKAVGVVA 153 +AK GD+ KG ++ E LKV ++KAV V+ Sbjct: 255 PLAKTGDSVKGQVVAEATLKVGSKKAVAVLT 285 >gi|167583566|ref|YP_001671756.1| major head protein [Enterobacteria phage phiEco32] gi|164375404|gb|ABY52812.1| major head protein [Enterobacteria phage phiEco32] Length = 352 Score = 85.9 bits (211), Expect = 2e-15, Method: Composition-based stats. Identities = 27/152 (17%), Positives = 55/152 (36%), Gaps = 11/152 (7%) Query: 3 KKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATS 62 + T F + + ++ + Y G I+++P F ++ + R + Sbjct: 203 NASNPTTNIGFDEADIFDMTLQLYTAGSEADIIMINPAHAKIFAGLQENTQGSRKRIFEN 262 Query: 63 GKSHNNTIVATADIYDRPFGK-VMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121 + + P G+ + NR M ++A + + LR Sbjct: 263 T----KQFIYEVNSITDPLGQSYKIIVNRWMPTDA-----VYFFRSADWTQMVLRAPKRT 313 Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVA 153 + +AK+G K ++ E L+ RN A GV+ Sbjct: 314 E-LAKDGSYEKWMIEMEVGLRHRNPYASGVLF 344 >gi|153951462|ref|YP_001398222.1| hypothetical protein JJD26997_1140 [Campylobacter jejuni subsp. doylei 269.97] gi|152938908|gb|ABS43649.1| hypothetical protein JJD26997_1140 [Campylobacter jejuni subsp. doylei 269.97] Length = 344 Score = 85.5 bits (210), Expect = 2e-15, Method: Composition-based stats. Identities = 24/156 (15%), Positives = 56/156 (35%), Gaps = 14/156 (8%) Query: 7 DGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSH 66 GT ++ L+ ++Q + +G + + + +K +F + + Sbjct: 190 SGTSTELTEDKLNIILQTIWNSGVAPKDVFLGADLKGAINKFATRIL-----------GN 238 Query: 67 NNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFL-WLRTIHEDKNIA 125 +V + FG V H +R++++ D ++ ++ T+ E+ Sbjct: 239 ETKLVGQVVSLETDFGTVNFHMHRLLSAKYGLGDVLIAGDFEYMKHGLYIPTMIEEVTTD 298 Query: 126 KNGDANKGVLIGEGALKVRNEKAVGVVADLFGLSKT 161 A + + L+VRN A + L + T Sbjct: 299 ITAKAKRF--YTQSTLEVRNADAFAIGVGLTSTNNT 332 >gi|291526329|emb|CBK91916.1| hypothetical protein EUR_29920 [Eubacterium rectale DSM 17629] Length = 304 Score = 85.5 bits (210), Expect = 2e-15, Method: Composition-based stats. Identities = 24/145 (16%), Positives = 55/145 (37%), Gaps = 9/145 (6%) Query: 9 TQRTFSKQILDEVMQEGYQNGDNFRHI-IVSPYVKSEFVRFMSDSNVASFRYATSGKSHN 67 T+ +++ + +QE +++G + V+ +K N R+ S Sbjct: 161 TKGALTEEHFLDALQEMWKHGAQGEYFSFVNANIKRMINDLAKAGNNV--RFLGDNGSMQ 218 Query: 68 NTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKN 127 N + FG++ + +R + +D ++ LR +++ K Sbjct: 219 NVLGIGVQKIVTDFGEISLVLDRY-----ADTKTILTVDLAEVQIAELRGT-FYEDLPKA 272 Query: 128 GDANKGVLIGEGALKVRNEKAVGVV 152 GD +G ++ E +K+ N A + Sbjct: 273 GDYYRGHVLNESTIKLLNSFAGSKI 297 >gi|229187822|ref|ZP_04314947.1| hypothetical protein bcere0004_53480 [Bacillus cereus BGSC 6E1] gi|228595657|gb|EEK53352.1| hypothetical protein bcere0004_53480 [Bacillus cereus BGSC 6E1] Length = 313 Score = 85.1 bits (209), Expect = 2e-15, Method: Composition-based stats. Identities = 30/154 (19%), Positives = 66/154 (42%), Gaps = 10/154 (6%) Query: 9 TQRTFSKQILDEVMQEGYQNG-DNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHN 67 T S + + + + ++ G + + V + V + + Sbjct: 160 TSGKLSAEDVTKAFRLCWEKGAGGDKLVQCGSAVAEFIDKLFK---VDKGVVIPAIQGGG 216 Query: 68 NTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKN 127 N I T D+ +G+ + NR M A T ++D N + LR + + ++K Sbjct: 217 NIIGITVDVIHTRYGRGNIVLNRHMPDGALT-----IVDLNQTKLRPLRKM-AAQKLSKI 270 Query: 128 GDANKGVLIGEGALKVRNEKAVGVVADLFGLSKT 161 GD++K +++GE +L+++N A V++ + G+ + Sbjct: 271 GDSDKQMIVGEYSLELKNSYAGAVISGITGIVEP 304 >gi|290457630|sp|P85987|CAPSD_BPSK1 RecName: Full=Major capsid protein; AltName: Full=Virion protein G gi|221271431|dbj|BAH15184.1| major capsid protein [Serratia phage KSP100] Length = 306 Score = 84.4 bits (207), Expect = 4e-15, Method: Composition-based stats. Identities = 29/155 (18%), Positives = 56/155 (36%), Gaps = 13/155 (8%) Query: 1 MTKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYA 60 + + T + L E+ + Y G + I+ P + F S + Sbjct: 149 LIDASMKKTA-DLDEATLFEMTAKLYTEGADPTLIMYHPSNANFFASLQEKSGTRMRIFE 207 Query: 61 TSGKSHNNTIVATADIYDRPFG-KVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIH 119 + V + P G ++ PNR +A ++ +P+ L LR Sbjct: 208 N-----DKRFVKQVEYIVDPLGQELKCIPNRWCPEDAT-----YIFNPSDLGMAVLRAPK 257 Query: 120 EDKNIAKNGDANKGVLIGEGALKVRNEKAVGVVAD 154 + +AK+G A K ++ E ++ N KA ++ Sbjct: 258 KV-ALAKSGSAEKYMIEQEVGFRLNNPKAAALIIG 291 >gi|57237589|ref|YP_178603.1| hypothetical protein CJE0587 [Campylobacter jejuni RM1221] gi|57166393|gb|AAW35172.1| hypothetical protein CJE0587 [Campylobacter jejuni RM1221] Length = 344 Score = 84.0 bits (206), Expect = 7e-15, Method: Composition-based stats. Identities = 23/151 (15%), Positives = 53/151 (35%), Gaps = 14/151 (9%) Query: 7 DGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSH 66 GT ++ L++++Q + +G + + + +K +F + + Sbjct: 190 SGTATELTEDKLNQILQTIWNSGVTPKDVFLGADLKGAINKFATRIL-----------GN 238 Query: 67 NNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFL-WLRTIHEDKNIA 125 + + FG V H +R+++ D ++ ++ T+ ED Sbjct: 239 ETKLAGQVVSLETDFGTVNFHMHRLLSPKYGLGDVLIAGDFEYMKHGLYIPTMIEDVPTD 298 Query: 126 KNGDANKGVLIGEGALKVRNEKAVGVVADLF 156 A + + L+VRN A + L Sbjct: 299 ITAKAKRF--YTQSTLEVRNADAFAIGVGLT 327 >gi|283956330|ref|ZP_06373810.1| hypothetical protein C1336_000250101 [Campylobacter jejuni subsp. jejuni 1336] gi|283792050|gb|EFC30839.1| hypothetical protein C1336_000250101 [Campylobacter jejuni subsp. jejuni 1336] Length = 344 Score = 83.6 bits (205), Expect = 8e-15, Method: Composition-based stats. Identities = 23/151 (15%), Positives = 53/151 (35%), Gaps = 14/151 (9%) Query: 7 DGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSH 66 GT ++ L++++Q + +G + + + +K +F + + Sbjct: 190 SGTATELTEDKLNQILQTIWNSGVTPKDVFLGADLKGAINKFATRIL-----------GN 238 Query: 67 NNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFL-WLRTIHEDKNIA 125 + + FG V H +R+++ D ++ ++ T+ ED Sbjct: 239 ETKLAGQVVSLETDFGTVNFHMHRLLSPKYGLGDVLIAGDFEYMKHGLYIPTMIEDVPTD 298 Query: 126 KNGDANKGVLIGEGALKVRNEKAVGVVADLF 156 A + + L+VRN A + L Sbjct: 299 ITAKAKRF--YTQSTLEVRNADAFAIGVGLT 327 >gi|315929828|gb|EFV08993.1| hypothetical protein CSS_0883 [Campylobacter jejuni subsp. jejuni 305] Length = 344 Score = 82.4 bits (202), Expect = 2e-14, Method: Composition-based stats. Identities = 22/151 (14%), Positives = 53/151 (35%), Gaps = 14/151 (9%) Query: 7 DGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSH 66 GT ++ L++++Q + +G + + + +K +F + + Sbjct: 190 SGTATELTEDKLNQILQTIWNSGVTPKDVFLGADLKGAINKFATRIL-----------GN 238 Query: 67 NNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFL-WLRTIHEDKNIA 125 + + FG V H +R+++ D ++ ++ T+ ED Sbjct: 239 ETKLAGQVVSLETDFGTVNFHMHRLLSPKYGLGDVLIAGDFEYMKHGLYIPTMIEDVPTD 298 Query: 126 KNGDANKGVLIGEGALKVRNEKAVGVVADLF 156 A + + L++RN A + L Sbjct: 299 ITAKAKRF--YTQSTLEIRNADAFAIGVGLT 327 >gi|257451764|ref|ZP_05617063.1| hypothetical protein F3_01776 [Fusobacterium sp. 3_1_5R] gi|317058321|ref|ZP_07922806.1| predicted protein [Fusobacterium sp. 3_1_5R] gi|313683997|gb|EFS20832.1| predicted protein [Fusobacterium sp. 3_1_5R] Length = 371 Score = 79.4 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 31/155 (20%), Positives = 67/155 (43%), Gaps = 18/155 (11%) Query: 6 KDGTQRTFSKQILDEVMQEGYQNGDNFR----HIIVSPYVKSEFVRFMSDSNVASFRYAT 61 D + S +I+ V+++ ++ G + + V K + + + D A Sbjct: 228 VDAGGQEISLEIIGNVLRKIFEAGGDVNGGNYALYVPGVQKVKISKLLKDYIQA------ 281 Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTI-HE 120 NT+ A A+ FG + + P + A +++ + + LRT + Sbjct: 282 --PPSENTLGAVANYVATDFGTLPIIP-----TVNLRADEMMILNHDDITLQVLRTRELQ 334 Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 + + K GD KG+++ E +++VRN +G++ +L Sbjct: 335 HEYMGKTGDNTKGLIVTELSVEVRNIPTMGMIMNL 369 >gi|319649918|ref|ZP_08004068.1| hypothetical protein HMPREF1013_00673 [Bacillus sp. 2_A_57_CT2] gi|317398356|gb|EFV79044.1| hypothetical protein HMPREF1013_00673 [Bacillus sp. 2_A_57_CT2] Length = 292 Score = 79.4 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 24/145 (16%), Positives = 58/145 (40%), Gaps = 16/145 (11%) Query: 11 RTFSKQILDEVMQEGYQNGDNFRHI--IVSPYVKSEFVRFMSDSNVASFRYATSGKSHNN 68 ++ + +VM+ + ++ ++ +K + D S + G Sbjct: 157 GAITEDDVKKVMRNLWNQDLAEGNVYAFLNADLKEQIDAIYKDKYGYSHVTTSFG----- 211 Query: 69 TIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNG 128 + + +G V ++ + ++ + + ++ +LR H + +AK G Sbjct: 212 ---LLVESINTNYGTVHFVLSKHVPAD-----KIVVFNDAYVDLAYLREPHF-EPLAKTG 262 Query: 129 DANKGVLIGEGALKVRNEKAVGVVA 153 D+ KG +I E +KV ++K V V+ Sbjct: 263 DSVKGQVIAEATVKVGSKKGVAVLT 287 >gi|291336566|gb|ADD96115.1| hypothetical protein HG1285_12862 [uncultured organism MedDCM-OCT-S04-C6] Length = 347 Score = 77.4 bits (189), Expect = 5e-13, Method: Composition-based stats. Identities = 24/155 (15%), Positives = 55/155 (35%), Gaps = 11/155 (7%) Query: 6 KDGTQRT-FSKQILDEVMQEGYQNGDNFR--HIIVSPYVKSEFVRFMSDSNVASFRYATS 62 +G ++ + ++Q + + ++ F F A + + Sbjct: 188 VNGDPTADITESDIQTMLQTIWSATGMTGDYKLFADATLRRAFTDFTRTIATAGYSQRSL 247 Query: 63 GKSHNNTIV-ATADIYDRPFGKVMVHPNRVMA------SNAETARNAFLIDPNMLEFLWL 115 + T + T I+D FG V V + + + A +L+D + ++ Sbjct: 248 NYDGDGTRISNTTTIFDGDFGSVEVIADNFIGFKDPASGTTQEAGRGYLLDMDKIDLRVN 307 Query: 116 RTIHEDKNIAKNGDANKGVLIGEGALKVRNEKAVG 150 + + + G + ++ G AL+VRN +G Sbjct: 308 KQ-PQIERFEDKGGGERFLIEGRAALQVRNPIGLG 341 >gi|256375780|ref|YP_003099440.1| hypothetical protein Amir_1645 [Actinosynnema mirum DSM 43827] gi|255920083|gb|ACU35594.1| hypothetical protein Amir_1645 [Actinosynnema mirum DSM 43827] Length = 406 Score = 77.4 bits (189), Expect = 5e-13, Method: Composition-based stats. Identities = 21/150 (14%), Positives = 47/150 (31%), Gaps = 8/150 (5%) Query: 9 TQRTFSKQILDEVMQEGYQNGDNFR---HIIVSPYVKSEFVRFMSDSNVASFRYATSGKS 65 + S ++ Q + G + + K + +D+ + Sbjct: 257 SATVLSVDDVNVFAQRVWDGGGLTDGLGVFLCNSSQKIALSKAYADAYGKAVPVT----Q 312 Query: 66 HNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIA 125 + FG + + +R M +A + + L + ++ +A Sbjct: 313 GEKVGGVVVERLLTDFGPINIMVDRAMPQDALLLTSLSQLKLQFLLIPG-KGSFFEEPLA 371 Query: 126 KNGDANKGVLIGEGALKVRNEKAVGVVADL 155 K G + + GE L+ + A GV+ L Sbjct: 372 KTGASEDTQIYGEVGLEYGSPLAHGVLRGL 401 >gi|281355462|ref|ZP_06241956.1| hypothetical protein Vvad_PD3568 [Victivallis vadensis ATCC BAA-548] gi|281318342|gb|EFB02362.1| hypothetical protein Vvad_PD3568 [Victivallis vadensis ATCC BAA-548] Length = 403 Score = 75.9 bits (185), Expect = 1e-12, Method: Composition-based stats. Identities = 17/145 (11%), Positives = 47/145 (32%), Gaps = 11/145 (7%) Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61 T A D + + +++++ + + + G N + +P ++ Sbjct: 228 TALAVDASGGRLTSKLVNDAVMKVTEEGGNPTALYCTPDQARVLANEYKNNITVMRDDKA 287 Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121 G + + + + + +A+L D + W+R + + Sbjct: 288 RG------VYVAVVVNEANGQNIEII-----GDPDFNDADAWLCDDDCFALSWMRPVGDT 336 Query: 122 KNIAKNGDANKGVLIGEGALKVRNE 146 + + D ++GE L+ N Sbjct: 337 DSTPPDADGISRKMVGEFTLEFHNA 361 >gi|257463376|ref|ZP_05627772.1| hypothetical protein FuD12_05953 [Fusobacterium sp. D12] gi|317060946|ref|ZP_07925431.1| predicted protein [Fusobacterium sp. D12] gi|313686622|gb|EFS23457.1| predicted protein [Fusobacterium sp. D12] Length = 369 Score = 74.4 bits (181), Expect = 6e-12, Method: Composition-based stats. Identities = 28/154 (18%), Positives = 64/154 (41%), Gaps = 18/154 (11%) Query: 6 KDGTQRTFSKQILDEVMQEGYQNGDNFR----HIIVSPYVKSEFVRFMSDSNVASFRYAT 61 D + S +++ V+++ ++ G + + V K + + + D A Sbjct: 228 VDAGGQDISLEMIGNVLRKIFEVGGDVDGGNYALYVPGIQKVKISKLLKDYIQA------ 281 Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121 +T+ A A FG + + P + +++ + +E LR Sbjct: 282 --PPAEHTLGAVATYIATDFGTLPIIP-----TTNLRGDEMMILNHDDIEAKVLRG-LSH 333 Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 K + + GD KG+++ E +++VRN +G++ +L Sbjct: 334 KYMGETGDNTKGLIVTELSVQVRNIHTMGMIVNL 367 >gi|319789007|ref|YP_004150640.1| phage protein [Thermovibrio ammonificans HB-1] gi|317113509|gb|ADU95999.1| phage protein [Thermovibrio ammonificans HB-1] Length = 373 Score = 73.6 bits (179), Expect = 8e-12, Method: Composition-based stats. Identities = 22/155 (14%), Positives = 55/155 (35%), Gaps = 17/155 (10%) Query: 6 KDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKS 65 + ++ +L++ +++ Q+G + ++ + + F +++ + Sbjct: 229 VNAAGNDLTQTLLNDTIEKIVQDGGSPNVLVCNTTQARKISAFNANNIIVQR-------- 280 Query: 66 HNNTIVATADIYDRPF--GKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRT--IHED 121 + T + G + + ++D + L + LR I + Sbjct: 281 EDQTAGNFVARFVADLPAGVITTI----VVDTNFPKDKIAILDTSRLRLVPLRGSTIRDF 336 Query: 122 KNIAKNGDANKGVLIGEGALKVRNEK-AVGVVADL 155 D ++GE L+VRN K A G++ L Sbjct: 337 DATPNGADFIARRILGEYTLEVRNAKQAHGLLEGL 371 >gi|134299981|ref|YP_001113477.1| hypothetical protein Dred_2135 [Desulfotomaculum reducens MI-1] gi|134052681|gb|ABO50652.1| hypothetical protein Dred_2135 [Desulfotomaculum reducens MI-1] Length = 285 Score = 72.4 bits (176), Expect = 2e-11, Method: Composition-based stats. Identities = 33/153 (21%), Positives = 54/153 (35%), Gaps = 18/153 (11%) Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHI-IVSPYVKSEFVRFMSDSNVASFRYA 60 A T TF+K +EV+ + Y G N + + +K + F + +A + Sbjct: 148 INPANKVTNATFTKTKFEEVVGKMYDAGVNDEMLCFLPAQMKIQLNSFSNVEFLARDMFL 207 Query: 61 TSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHE 120 + Y P+G V + + F+++PN LE L H Sbjct: 208 GFD----------TERYVTPYGIVTFVLS------EKLNNKLFIVNPNYLELAELIPFHA 251 Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVA 153 A +G L + LK+ N KA A Sbjct: 252 VPQ-AVSGSKQSVYLETQFGLKLLNTKAAASFA 283 >gi|134298256|ref|YP_001111752.1| hypothetical protein Dred_0379 [Desulfotomaculum reducens MI-1] gi|134050956|gb|ABO48927.1| hypothetical protein Dred_0379 [Desulfotomaculum reducens MI-1] Length = 285 Score = 72.0 bits (175), Expect = 3e-11, Method: Composition-based stats. Identities = 31/153 (20%), Positives = 53/153 (34%), Gaps = 18/153 (11%) Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHI-IVSPYVKSEFVRFMSDSNVASFRYA 60 T TF+K +EV+ + Y G N + + +K + F + +A + Sbjct: 148 INPVNQVTNATFTKTKFEEVVSKMYDAGVNDEMLCFLPAQMKIQLNSFSNVEFLARDMF- 206 Query: 61 TSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHE 120 + + Y P+G V + + F+++PN LE L H Sbjct: 207 ---------LGFDTERYVTPYGIVTFVLS------EKLNNKLFIVNPNYLELAELIPFHA 251 Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVA 153 +G L + LK+ N KA A Sbjct: 252 TVE-PVSGSKQSIYLETQFGLKLLNTKAAASFA 283 >gi|30020036|ref|NP_831667.1| Phage protein [Bacillus cereus ATCC 14579] gi|31415788|ref|NP_852528.1| hypothetical protein BC1894 [Bacillus phage phBC6A51] gi|229127327|ref|ZP_04256323.1| hypothetical protein bcere0015_17800 [Bacillus cereus BDRD-Cer4] gi|29895581|gb|AAP08868.1| Phage protein [Bacillus phage phBC6A51] gi|228656160|gb|EEL12002.1| hypothetical protein bcere0015_17800 [Bacillus cereus BDRD-Cer4] Length = 374 Score = 70.5 bits (171), Expect = 7e-11, Method: Composition-based stats. Identities = 26/147 (17%), Positives = 57/147 (38%), Gaps = 18/147 (12%) Query: 18 LDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATADIY 77 L++ +++ + G + I+ P + + + D+ + + +G + Sbjct: 241 LNDAVEQVFTRGGSANTILCGPNIARQITKLGGDTIRTTRQDTAAGYQ--------ILSF 292 Query: 78 DRPF--GKV-MVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDANKGV 134 G + V + M + A L+D + ++ +L I++ D V Sbjct: 293 VSDLPGGAISSVVVDLNMPKD-----RALLLDTDKVKARYLTPIYDQDATPNGADYFSRV 347 Query: 135 LIGEGALKVRNEKAVGVVADLFGLSKT 161 + GE +V+N K +A L +SKT Sbjct: 348 IRGEFGFEVKNAKES--IAVLENISKT 372 >gi|228941057|ref|ZP_04103614.1| hypothetical protein bthur0008_36970 [Bacillus thuringiensis serovar berliner ATCC 10792] gi|228973988|ref|ZP_04134562.1| hypothetical protein bthur0003_37430 [Bacillus thuringiensis serovar thuringiensis str. T01001] gi|228980577|ref|ZP_04140886.1| hypothetical protein bthur0002_37450 [Bacillus thuringiensis Bt407] gi|228779138|gb|EEM27396.1| hypothetical protein bthur0002_37450 [Bacillus thuringiensis Bt407] gi|228785714|gb|EEM33719.1| hypothetical protein bthur0003_37430 [Bacillus thuringiensis serovar thuringiensis str. T01001] gi|228818600|gb|EEM64668.1| hypothetical protein bthur0008_36970 [Bacillus thuringiensis serovar berliner ATCC 10792] gi|326939625|gb|AEA15521.1| Phage protein [Bacillus thuringiensis serovar chinensis CT-43] Length = 374 Score = 70.5 bits (171), Expect = 8e-11, Method: Composition-based stats. Identities = 26/149 (17%), Positives = 58/149 (38%), Gaps = 18/149 (12%) Query: 16 QILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATAD 75 ++L++ +++ + G + I+ P + + + D+ + + +G Sbjct: 239 KLLNDAVEQVFTRGGSANTILCGPNIARQITKLGGDTIRTTRQDTAAGYQ--------IL 290 Query: 76 IYDRPF--GKV-MVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDANK 132 + G + V + M + A L+D ++ +L I++ D Sbjct: 291 SFVSDLPGGAISSVVVDLNMPKD-----RALLLDTEKVKARYLTPIYDQDATPNGADYFS 345 Query: 133 GVLIGEGALKVRNEKAVGVVADLFGLSKT 161 V+ GE +V+N K +A L +SKT Sbjct: 346 RVIRGEFGFEVKNAKES--IAVLENISKT 372 >gi|218897919|ref|YP_002446330.1| phage protein [Bacillus cereus G9842] gi|218542918|gb|ACK95312.1| phage protein [Bacillus cereus G9842] Length = 374 Score = 70.1 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 26/147 (17%), Positives = 56/147 (38%), Gaps = 18/147 (12%) Query: 18 LDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATADIY 77 L++ +++ + G + I+ P + + + D+ + + +G + Sbjct: 241 LNDAVEQVFTRGGSANTILCGPNIARQITKLGGDTIRTTRQDTAAGYQ--------ILSF 292 Query: 78 DRPF--GKV-MVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDANKGV 134 G + V + M + A L+D ++ +L I++ D V Sbjct: 293 VSDLPGGAISSVVVDLNMPKD-----RALLLDTEKVKARYLTPIYDQDATPNGADYFSRV 347 Query: 135 LIGEGALKVRNEKAVGVVADLFGLSKT 161 + GE +V+N K +A L +SKT Sbjct: 348 IRGEFGFEVKNAKES--IAVLENISKT 372 >gi|229020770|ref|ZP_04177493.1| hypothetical protein bcere0030_52440 [Bacillus cereus AH1273] gi|228740571|gb|EEL90846.1| hypothetical protein bcere0030_52440 [Bacillus cereus AH1273] Length = 374 Score = 70.1 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 26/147 (17%), Positives = 56/147 (38%), Gaps = 18/147 (12%) Query: 18 LDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATADIY 77 L++ +++ + G + I+ P + + + D+ + + +G + Sbjct: 241 LNDAVEQVFTRGGSANTILCGPNIARQITKLGGDTIRTTRQDTAAGYQ--------ILSF 292 Query: 78 DRPF--GKV-MVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDANKGV 134 G + V + M + A L+D ++ +L I++ D V Sbjct: 293 VSDLPGGAISSVVVDLNMPKD-----RALLLDTEKVKARYLTPIYDQDATPNGADYFSRV 347 Query: 135 LIGEGALKVRNEKAVGVVADLFGLSKT 161 + GE +V+N K +A L +SKT Sbjct: 348 IRGEFGFEVKNAKES--IAVLENISKT 372 >gi|229190579|ref|ZP_04317576.1| hypothetical protein bcere0002_22460 [Bacillus cereus ATCC 10876] gi|228592924|gb|EEK50746.1| hypothetical protein bcere0002_22460 [Bacillus cereus ATCC 10876] Length = 374 Score = 69.7 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 26/147 (17%), Positives = 56/147 (38%), Gaps = 18/147 (12%) Query: 18 LDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATADIY 77 L++ +++ + G + I+ P + + + D+ + + + T + Sbjct: 241 LNDAVEQVFTRGGSANTILCGPNIARQITKLGGDTIRTTRQ--------DTTAGYQILSF 292 Query: 78 DRPF--GKV-MVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDANKGV 134 G + V + M + A L+D ++ +L I++ D V Sbjct: 293 VSDLPGGAISSVVVDLNMPKD-----RALLLDTEKVKARYLTPIYDQDATPNGADYFSRV 347 Query: 135 LIGEGALKVRNEKAVGVVADLFGLSKT 161 + GE +V+N K +A L +SKT Sbjct: 348 IRGEFGFEVKNAKES--IAVLENISKT 372 >gi|228968787|ref|ZP_04129749.1| hypothetical protein bthur0004_55460 [Bacillus thuringiensis serovar sotto str. T04001] gi|228790850|gb|EEM38489.1| hypothetical protein bthur0004_55460 [Bacillus thuringiensis serovar sotto str. T04001] Length = 374 Score = 69.7 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 26/147 (17%), Positives = 56/147 (38%), Gaps = 18/147 (12%) Query: 18 LDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATADIY 77 L++ +++ + G + I+ P + + + D+ + + +G + Sbjct: 241 LNDAVEQVFTRGGSANTILCGPNIARQITKLGGDTIRTTRQDTAAGYQ--------ILSF 292 Query: 78 DRPF--GKV-MVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDANKGV 134 G + V + M + A L+D ++ +L I++ D V Sbjct: 293 VSDLPGGAISSVVVDLNMPKD-----RALLLDTEKVKARYLTPIYDQDATPNGADYFSRV 347 Query: 135 LIGEGALKVRNEKAVGVVADLFGLSKT 161 + GE +V+N K +A L +SKT Sbjct: 348 IRGEFGFEVKNAKES--IAVLENISKT 372 >gi|323703894|ref|ZP_08115527.1| hypothetical protein DesniDRAFT_2739 [Desulfotomaculum nigrificans DSM 574] gi|323531143|gb|EGB21049.1| hypothetical protein DesniDRAFT_2739 [Desulfotomaculum nigrificans DSM 574] Length = 285 Score = 69.7 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 30/153 (19%), Positives = 51/153 (33%), Gaps = 18/153 (11%) Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHI-IVSPYVKSEFVRFMSDSNVASFRYA 60 A T +F+K +E + + Y G N I + +K F + +A + Sbjct: 148 IDPANQVTNTSFTKAKFEETVAKMYDAGVNDEMICFIPAQMKIALNGFNNVEFLARDMFQ 207 Query: 61 TSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHE 120 + Y P+G + + + F+++PN LE L H Sbjct: 208 GFD----------TEKYVTPYGIITFVLS------EKLQNKLFIVNPNYLELAELIPFHA 251 Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVA 153 +G L + LK+ N KA A Sbjct: 252 VPQ-PVSGSKQSIYLETQFGLKLLNSKAAASFA 283 >gi|188585861|ref|YP_001917406.1| conserved hypothetical protein [Natranaerobius thermophilus JW/NM-WN-LF] gi|179350548|gb|ACB84818.1| conserved hypothetical protein [Natranaerobius thermophilus JW/NM-WN-LF] Length = 289 Score = 67.8 bits (164), Expect = 5e-10, Method: Composition-based stats. Identities = 26/149 (17%), Positives = 53/149 (35%), Gaps = 14/149 (9%) Query: 1 MTKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHI-IVSPYVKSEFVRFMSDSNVASFRY 59 + T+ F++ + + +Q+ Y + I + P K + + ++ + Sbjct: 145 LINSENQKTEDEFNENVFLDTLQKLYDANVHDNMIVFLRPEEKIKINQEFNNVVKYT--- 201 Query: 60 ATSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIH 119 + + A+ Y PFG V R + A F+I+P LE L Sbjct: 202 ----DTLDRVAGFDAEQYISPFGIV-----RFALAPKLPANTMFVINPAYLEMPTLIPFT 252 Query: 120 EDKNIAKNGDANKGVLIGEGALKVRNEKA 148 + +G + L + +K+ N A Sbjct: 253 A-RPQNPSGSKQEVYLETQAGVKLLNSYA 280 >gi|257468183|ref|ZP_05632279.1| hypothetical protein FulcA4_02527 [Fusobacterium ulcerans ATCC 49185] gi|317062468|ref|ZP_07926953.1| predicted protein [Fusobacterium ulcerans ATCC 49185] gi|313688144|gb|EFS24979.1| predicted protein [Fusobacterium ulcerans ATCC 49185] Length = 370 Score = 66.7 bits (161), Expect = 1e-09, Method: Composition-based stats. Identities = 28/155 (18%), Positives = 55/155 (35%), Gaps = 18/155 (11%) Query: 6 KDGTQRTFSKQILDEVMQEGYQNGDNFRH----IIVSPYVKSEFVRFMSDSNVASFRYAT 61 D S + L+ ++++ Y G + ++ + P +S + + V +T Sbjct: 227 VDANNEKISIEKLEALLKKAYLVGGDLQNGYYAFYIPPKQQSNLNKEIEKYIVKEQTSST 286 Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTI-HE 120 G +G V + + N L++ E L Sbjct: 287 LGAVVKEVY--------TSYGIVPLIMTPNLPDN-----EMLLVNHENCEIRPLINRDLF 333 Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 + + K GD KG+++GE L+VR G + +L Sbjct: 334 HEYMGKTGDNTKGLILGELTLEVRGVHEQGKIVNL 368 >gi|194015203|ref|ZP_03053819.1| phage protein [Bacillus pumilus ATCC 7061] gi|194012607|gb|EDW22173.1| phage protein [Bacillus pumilus ATCC 7061] Length = 367 Score = 60.9 bits (146), Expect = 6e-08, Method: Composition-based stats. Identities = 24/150 (16%), Positives = 52/150 (34%), Gaps = 18/150 (12%) Query: 16 QILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATAD 75 + ++ +M++ Y G + I+ + + + +D+ A +G Sbjct: 230 KDINALMEQVYLRGGSVNTILTNTAGARQISKMATDTIRTERTDAATGHR--------IS 281 Query: 76 IYDRPF--GKVMVHPNRVMASNAETARNAFLIDPNMLEFLWL--RTIHEDKNIAKNGDAN 131 + G V + L D ++L L R++++ D Sbjct: 282 TFVSDMVGGGVATV----IVDPNFPKDKIALFDRSILSLHPLTGRSVYDTDASVPGADFV 337 Query: 132 KGVLIGEGALKVRNEKAVGVVADLFGLSKT 161 + GE +KV+N A +A L ++ T Sbjct: 338 ARQIRGEYGIKVKN--ANEKIAILENIATT 365 >gi|294614769|ref|ZP_06694669.1| hypothetical protein EfmE1636_0859 [Enterococcus faecium E1636] gi|291592381|gb|EFF23990.1| hypothetical protein EfmE1636_0859 [Enterococcus faecium E1636] Length = 296 Score = 60.5 bits (145), Expect = 7e-08, Method: Composition-based stats. Identities = 25/154 (16%), Positives = 53/154 (34%), Gaps = 13/154 (8%) Query: 3 KKAKDGTQRTFSKQILDEVMQEGYQNG-DNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61 KA+ T +++ +D + + Y+ G + ++SP ++ + Sbjct: 152 NKAETATAGAVTRKDIDTLFKLMYEKGYMGEKLCLISPDMQELMTDELDG-----KSTKI 206 Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121 T +G + + ID N ++ LR Sbjct: 207 VQFGERVTFGLQIGNIVSNYGTGIALLEPSLPKGT-----IAAIDTNYVKLRPLREW-RA 260 Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 + +AK D+ + L+GE +L+ N G + +L Sbjct: 261 EELAKTTDSRRIGLVGEYSLEY-NASNSGAILNL 293 >gi|257893408|ref|ZP_05673061.1| conserved hypothetical protein [Enterococcus faecium 1,231,408] gi|257829787|gb|EEV56394.1| conserved hypothetical protein [Enterococcus faecium 1,231,408] Length = 296 Score = 60.5 bits (145), Expect = 7e-08, Method: Composition-based stats. Identities = 25/154 (16%), Positives = 53/154 (34%), Gaps = 13/154 (8%) Query: 3 KKAKDGTQRTFSKQILDEVMQEGYQNG-DNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61 KA+ T +++ +D + + Y+ G + ++SP ++ + Sbjct: 152 NKAETATAGAVTRKDIDTLFKLMYEKGYMGEKLCLISPDMQELMTDELDG-----KSTKI 206 Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121 T +G + + ID N ++ LR Sbjct: 207 VQFGERVTFGLQIGNIVSNYGTGIALLEPSLPKGT-----IAAIDTNYVKLRPLREW-RA 260 Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 + +AK D+ + L+GE +L+ N G + +L Sbjct: 261 EELAKTTDSRRIGLVGEYSLEY-NASNSGAILNL 293 >gi|257879565|ref|ZP_05659218.1| conserved hypothetical protein [Enterococcus faecium 1,230,933] gi|257891545|ref|ZP_05671198.1| conserved hypothetical protein [Enterococcus faecium 1,231,410] gi|314940388|ref|ZP_07847550.1| conserved hypothetical protein [Enterococcus faecium TX0133a04] gi|314943205|ref|ZP_07849996.1| conserved hypothetical protein [Enterococcus faecium TX0133C] gi|314949154|ref|ZP_07852509.1| conserved hypothetical protein [Enterococcus faecium TX0082] gi|314951966|ref|ZP_07854992.1| conserved hypothetical protein [Enterococcus faecium TX0133A] gi|314993065|ref|ZP_07858455.1| conserved hypothetical protein [Enterococcus faecium TX0133B] gi|314995396|ref|ZP_07860499.1| conserved hypothetical protein [Enterococcus faecium TX0133a01] gi|257813793|gb|EEV42551.1| conserved hypothetical protein [Enterococcus faecium 1,230,933] gi|257827905|gb|EEV54531.1| conserved hypothetical protein [Enterococcus faecium 1,231,410] gi|313590399|gb|EFR69244.1| conserved hypothetical protein [Enterococcus faecium TX0133a01] gi|313592421|gb|EFR71266.1| conserved hypothetical protein [Enterococcus faecium TX0133B] gi|313595906|gb|EFR74751.1| conserved hypothetical protein [Enterococcus faecium TX0133A] gi|313598089|gb|EFR76934.1| conserved hypothetical protein [Enterococcus faecium TX0133C] gi|313640428|gb|EFS05008.1| conserved hypothetical protein [Enterococcus faecium TX0133a04] gi|313644467|gb|EFS09047.1| conserved hypothetical protein [Enterococcus faecium TX0082] Length = 296 Score = 60.5 bits (145), Expect = 7e-08, Method: Composition-based stats. Identities = 25/154 (16%), Positives = 53/154 (34%), Gaps = 13/154 (8%) Query: 3 KKAKDGTQRTFSKQILDEVMQEGYQNG-DNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61 KA+ T +++ +D + + Y+ G + ++SP ++ + Sbjct: 152 NKAETATAGAVTRKDIDGLFKLMYEKGYMGEKLCLISPDMQELMTDELDG-----KSTKI 206 Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121 T +G + + ID N ++ LR Sbjct: 207 VQFGERVTFGLQIGNIVSNYGTGIALLEPSLPKGT-----IAAIDTNYVKLRPLREW-RA 260 Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 + +AK D+ + L+GE +L+ N G + +L Sbjct: 261 EELAKTTDSRRIGLVGEYSLEY-NASNSGAILNL 293 >gi|257883499|ref|ZP_05663152.1| conserved hypothetical protein [Enterococcus faecium 1,231,502] gi|261208026|ref|ZP_05922703.1| conserved hypothetical protein [Enterococcus faecium TC 6] gi|289567093|ref|ZP_06447488.1| conserved hypothetical protein [Enterococcus faecium D344SRF] gi|294622496|ref|ZP_06701518.1| conserved hypothetical protein [Enterococcus faecium U0317] gi|257819157|gb|EEV46485.1| conserved hypothetical protein [Enterococcus faecium 1,231,502] gi|260077743|gb|EEW65457.1| conserved hypothetical protein [Enterococcus faecium TC 6] gi|289161108|gb|EFD09013.1| conserved hypothetical protein [Enterococcus faecium D344SRF] gi|291598043|gb|EFF29153.1| conserved hypothetical protein [Enterococcus faecium U0317] Length = 296 Score = 60.5 bits (145), Expect = 7e-08, Method: Composition-based stats. Identities = 25/154 (16%), Positives = 53/154 (34%), Gaps = 13/154 (8%) Query: 3 KKAKDGTQRTFSKQILDEVMQEGYQNG-DNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61 KA+ T +++ +D + + Y+ G + ++SP ++ + Sbjct: 152 NKAETATAGAVTRKDIDTLFKLMYEKGYMGEKLCLISPDMQELMTDELDG-----KSTKI 206 Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121 T +G + + ID N ++ LR Sbjct: 207 VQFGERVTFGLQIGNIVSNYGTGIALLEPSLPKGT-----IAAIDTNYVKLRPLREW-RA 260 Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 + +AK D+ + L+GE +L+ N G + +L Sbjct: 261 EELAKTTDSRRIGLVGEYSLEY-NASNSGAILNL 293 >gi|239828160|ref|YP_002950784.1| hypothetical protein GWCH70_2835 [Geobacillus sp. WCH70] gi|239808453|gb|ACS25518.1| conserved hypothetical protein [Geobacillus sp. WCH70] Length = 283 Score = 60.5 bits (145), Expect = 8e-08, Method: Composition-based stats. Identities = 22/137 (16%), Positives = 46/137 (33%), Gaps = 17/137 (12%) Query: 13 FSKQILDEVMQEGYQNGDNFRHI-IVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIV 71 + +E + + Y G + + V +K + + + + + + Sbjct: 155 LTADAFEEAIGKLYDAGVSDEILVFVPARIKKQLNKLGNVEF----------YARDKFLG 204 Query: 72 ATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDAN 131 + Y FG V + +N F ++PN +E L H A +G Sbjct: 205 FDMEQYITVFGTVNFILCEELGNN-----KFFALNPNYIEMPVLIPFHAQVE-AVSGSKK 258 Query: 132 KGVLIGEGALKVRNEKA 148 L + + + N+KA Sbjct: 259 SIYLETQAGVNLLNDKA 275 >gi|308172834|ref|YP_003919539.1| phage protein [Bacillus amyloliquefaciens DSM 7] gi|307605698|emb|CBI42069.1| phage protein [Bacillus amyloliquefaciens DSM 7] Length = 367 Score = 60.5 bits (145), Expect = 8e-08, Method: Composition-based stats. Identities = 25/148 (16%), Positives = 53/148 (35%), Gaps = 14/148 (9%) Query: 16 QILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATAD 75 + ++ +M++ YQ G + I+ + + + +D+ A +G Sbjct: 230 KDINALMEKVYQRGGSANTILTNTAGARQISKMATDTIRTERTDAATGHR--------VS 281 Query: 76 IYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWL--RTIHEDKNIAKNGDANKG 133 + V+ L D ++L L R+++++ D Sbjct: 282 TFVSDMVGGG--VATVVVDPNFPKDKIALFDRSILSLHPLNGRSVYDEDATVPGADFVAR 339 Query: 134 VLIGEGALKVRNEKAVGVVADLFGLSKT 161 + GE +KV+N A +A L +S T Sbjct: 340 QIRGEYGIKVKN--ANEKIAILENISTT 365 >gi|319956911|ref|YP_004168174.1| hypothetical protein Nitsa_1172 [Nitratifractor salsuginis DSM 16511] gi|319419315|gb|ADV46425.1| hypothetical protein Nitsa_1172 [Nitratifractor salsuginis DSM 16511] Length = 308 Score = 60.1 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 22/147 (14%), Positives = 44/147 (29%), Gaps = 12/147 (8%) Query: 7 DGTQRTFSKQILDEVMQEGYQNGD---NFRHIIVSPYVKSEFVRFMSDSNVASFRYATSG 63 +G + L E+++ ++ G +I VK RF D Sbjct: 165 NGNPTVLTYDSLSEIIEPVWERGGIEDESFMLICGTSVKRAINRFAGDQFFRKV------ 218 Query: 64 KSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKN 123 S T + FG V V +R+ + + ++ + + Sbjct: 219 -SGKEKFDPTLFELETDFGTVQVKIHRLFNQEKLRD-KILVGKLSEARIMFQTSTEFTE- 275 Query: 124 IAKNGDANKGVLIGEGALKVRNEKAVG 150 + + A G + L+V+N Sbjct: 276 VPTDKTAKFGRYYTDLTLEVKNPDYFA 302 >gi|291561307|emb|CBL40106.1| hypothetical protein CK3_02480 [butyrate-producing bacterium SS3/4] Length = 338 Score = 57.8 bits (138), Expect = 4e-07, Method: Composition-based stats. Identities = 18/142 (12%), Positives = 38/142 (26%), Gaps = 9/142 (6%) Query: 17 ILDEVMQEGYQNGDNFR--HIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATA 74 + +++++ Y + V + A T + Sbjct: 179 DIADMVKKIYGANAPTDGLVLWCDA------VTLFQVNADAVQNGLTVVPAAREINGIAL 232 Query: 75 DIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDANKGV 134 P G V ++ + + N ++ P + + +AK G K Sbjct: 233 SSVVTPIGVVYLYLGECLPAGTAMLLNLDVLAPVYQPVPG-KGNFFLEPLAKTGAGEKYQ 291 Query: 135 LIGEGALKVRNEKAVGVVADLF 156 L G+ L E G + Sbjct: 292 LFGQIGLDHGPEWYHGKFTGIS 313 >gi|329568771|gb|EGG50571.1| hypothetical protein HMPREF9520_03403 [Enterococcus faecalis TX1467] Length = 295 Score = 56.3 bits (134), Expect = 1e-06, Method: Composition-based stats. Identities = 21/154 (13%), Positives = 50/154 (32%), Gaps = 11/154 (7%) Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61 T K + +++ +D + + +Q G + + E + + Sbjct: 151 TNKVETAAAGAVTRKDIDALFKTMFQKGYMGEKLCLVAPDMQELMT----DQLDEKSTKI 206 Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121 T +G + + + ID N ++ LR Sbjct: 207 VQFGDKLTFGLQLGNIVSNYGSGIALIEPNLPNGT-----IAAIDTNYVKLRPLREW-RA 260 Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 + +AK D+ + L+GE +++ + G + +L Sbjct: 261 EELAKTTDSRRIGLVGEYSIEYKASN-SGAILNL 293 >gi|227517040|ref|ZP_03947089.1| conserved hypothetical protein [Enterococcus faecalis TX0104] gi|229545405|ref|ZP_04434130.1| conserved hypothetical protein [Enterococcus faecalis TX1322] gi|229549652|ref|ZP_04438377.1| conserved hypothetical protein [Enterococcus faecalis ATCC 29200] gi|255972349|ref|ZP_05422935.1| predicted protein [Enterococcus faecalis T1] gi|256619486|ref|ZP_05476332.1| conserved hypothetical protein [Enterococcus faecalis ATCC 4200] gi|257090289|ref|ZP_05584650.1| predicted protein [Enterococcus faecalis CH188] gi|300860939|ref|ZP_07107026.1| conserved hypothetical protein [Enterococcus faecalis TUSoD Ef11] gi|307275949|ref|ZP_07557082.1| hypothetical protein HMPREF9521_01574 [Enterococcus faecalis TX2134] gi|307295873|ref|ZP_07575705.1| hypothetical protein HMPREF9509_02949 [Enterococcus faecalis TX0411] gi|312900152|ref|ZP_07759467.1| conserved hypothetical protein [Enterococcus faecalis TX0470] gi|312902789|ref|ZP_07761993.1| conserved hypothetical protein [Enterococcus faecalis TX0635] gi|227075515|gb|EEI13478.1| conserved hypothetical protein [Enterococcus faecalis TX0104] gi|229305317|gb|EEN71313.1| conserved hypothetical protein [Enterococcus faecalis ATCC 29200] gi|229309512|gb|EEN75499.1| conserved hypothetical protein [Enterococcus faecalis TX1322] gi|255963367|gb|EET95843.1| predicted protein [Enterococcus faecalis T1] gi|256599013|gb|EEU18189.1| conserved hypothetical protein [Enterococcus faecalis ATCC 4200] gi|256999101|gb|EEU85621.1| predicted protein [Enterococcus faecalis CH188] gi|295113266|emb|CBL31903.1| hypothetical protein [Enterococcus sp. 7L76] gi|300849978|gb|EFK77728.1| conserved hypothetical protein [Enterococcus faecalis TUSoD Ef11] gi|306496204|gb|EFM65783.1| hypothetical protein HMPREF9509_02949 [Enterococcus faecalis TX0411] gi|306507279|gb|EFM76416.1| hypothetical protein HMPREF9521_01574 [Enterococcus faecalis TX2134] gi|310633843|gb|EFQ17126.1| conserved hypothetical protein [Enterococcus faecalis TX0635] gi|311292711|gb|EFQ71267.1| conserved hypothetical protein [Enterococcus faecalis TX0470] gi|315149052|gb|EFT93068.1| conserved hypothetical protein [Enterococcus faecalis TX4244] gi|315159923|gb|EFU03940.1| conserved hypothetical protein [Enterococcus faecalis TX0312] gi|315167465|gb|EFU11482.1| conserved hypothetical protein [Enterococcus faecalis TX1341] gi|315169417|gb|EFU13434.1| conserved hypothetical protein [Enterococcus faecalis TX1342] gi|315575405|gb|EFU87596.1| conserved hypothetical protein [Enterococcus faecalis TX0309B] gi|315576720|gb|EFU88911.1| conserved hypothetical protein [Enterococcus faecalis TX0630] gi|315582750|gb|EFU94941.1| conserved hypothetical protein [Enterococcus faecalis TX0309A] gi|323481145|gb|ADX80584.1| hypothetical protein EF62_2373 [Enterococcus faecalis 62] Length = 295 Score = 56.3 bits (134), Expect = 1e-06, Method: Composition-based stats. Identities = 21/154 (13%), Positives = 50/154 (32%), Gaps = 11/154 (7%) Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61 T K + +++ +D + + +Q G + + E + + Sbjct: 151 TNKVETAAAGAVTRKDIDALFKTMFQKGYMGEKLCLVAPDMQELMT----DQLDEKSTKI 206 Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121 T +G + + + ID N ++ LR Sbjct: 207 VQFGDKLTFGLQLGNIVSNYGSGIALIEPNLPNGT-----IAAIDTNYVKLRPLREW-RA 260 Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 + +AK D+ + L+GE +++ + G + +L Sbjct: 261 EELAKTTDSRRIGLVGEYSIEYKASN-SGAILNL 293 >gi|29376526|ref|NP_815680.1| hypothetical protein EF2011 [Enterococcus faecalis V583] gi|227555439|ref|ZP_03985486.1| conserved hypothetical protein [Enterococcus faecalis HH22] gi|29343990|gb|AAO81750.1| hypothetical protein EF_2011 [Enterococcus faecalis V583] gi|227175420|gb|EEI56392.1| conserved hypothetical protein [Enterococcus faecalis HH22] Length = 295 Score = 55.5 bits (132), Expect = 2e-06, Method: Composition-based stats. Identities = 21/154 (13%), Positives = 50/154 (32%), Gaps = 11/154 (7%) Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61 T K + +++ +D + + +Q G + + E + + Sbjct: 151 TNKVETAAAGAVTRKDIDALFKTMFQKGYMGEKLCLVAPDMQELM----IDQLDEKSTKI 206 Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121 T +G + + + ID N ++ LR Sbjct: 207 VQFGDKLTFGLQLGNIVSNYGSGIALIEPNLPNGT-----IAAIDTNYVKLRPLREW-RA 260 Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 + +AK D+ + L+GE +++ + G + +L Sbjct: 261 EELAKTTDSRRIGLVGEYSIEYKASN-SGAILNL 293 >gi|256964718|ref|ZP_05568889.1| conserved hypothetical protein [Enterococcus faecalis HIP11704] gi|307272797|ref|ZP_07554044.1| hypothetical protein HMPREF9514_01561 [Enterococcus faecalis TX0855] gi|256955214|gb|EEU71846.1| conserved hypothetical protein [Enterococcus faecalis HIP11704] gi|306510411|gb|EFM79434.1| hypothetical protein HMPREF9514_01561 [Enterococcus faecalis TX0855] Length = 264 Score = 55.5 bits (132), Expect = 2e-06, Method: Composition-based stats. Identities = 21/154 (13%), Positives = 50/154 (32%), Gaps = 11/154 (7%) Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61 T K + +++ +D + + +Q G + + E + + Sbjct: 120 TNKVETAAAGAVTRKDIDALFKTMFQKGYMGEKLCLVAPDMQELMT----DQLDEKSTKI 175 Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121 T +G + + + ID N ++ LR Sbjct: 176 VQFGDKLTFGLQLGNIVSNYGSGIALIEPNLPNGT-----IAAIDTNYVKLRPLREW-RA 229 Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 + +AK D+ + L+GE +++ + G + +L Sbjct: 230 EELAKTTDSRRIGLVGEYSIEYKASN-SGAILNL 262 >gi|307280635|ref|ZP_07561683.1| hypothetical protein HMPREF9515_01677 [Enterococcus faecalis TX0860] gi|306504001|gb|EFM73218.1| hypothetical protein HMPREF9515_01677 [Enterococcus faecalis TX0860] Length = 300 Score = 54.7 bits (130), Expect = 4e-06, Method: Composition-based stats. Identities = 19/150 (12%), Positives = 45/150 (30%), Gaps = 11/150 (7%) Query: 10 QRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNT 69 +++ +D++ + + G + + + + V Sbjct: 159 ADAVTRKDVDKMFKTMFDKGYAGEKLCLVSTDMVDLMT----DEVDKAGTKVFNFGDQVA 214 Query: 70 IVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGD 129 +G + S +D N +E LR + +AK D Sbjct: 215 FGLQLGKIVSNYGSGTALIEPSLPSGT-----MIALDTNYVELRPLREW-RAEELAKTTD 268 Query: 130 ANKGVLIGEGALKVRNEKAVGVVADLFGLS 159 + + L+GE ++ N G + +L + Sbjct: 269 SKRIGLVGEYTIEY-NASNSGAILNLATAA 297 >gi|315173098|gb|EFU17115.1| conserved hypothetical protein [Enterococcus faecalis TX1346] Length = 300 Score = 54.7 bits (130), Expect = 5e-06, Method: Composition-based stats. Identities = 19/150 (12%), Positives = 45/150 (30%), Gaps = 11/150 (7%) Query: 10 QRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNT 69 +++ +D++ + + G + + + + V Sbjct: 159 ADAVTRKDVDKMFKTMFDKGYAGEKLCLVSTDMVDLMT----DEVDKAGTKVFNFGDRVA 214 Query: 70 IVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGD 129 +G + S +D N +E LR + +AK D Sbjct: 215 FGLQLGKIVSNYGSGTALIEPSLPSGT-----MIALDTNYVELRPLREW-RAEELAKTTD 268 Query: 130 ANKGVLIGEGALKVRNEKAVGVVADLFGLS 159 + + L+GE ++ N G + +L + Sbjct: 269 SKRIGLVGEYTIEY-NASNSGAILNLATAA 297 >gi|307286482|ref|ZP_07566582.1| hypothetical protein HMPREF9505_00059 [Enterococcus faecalis TX0109] gi|306502395|gb|EFM71671.1| hypothetical protein HMPREF9505_00059 [Enterococcus faecalis TX0109] Length = 300 Score = 53.9 bits (128), Expect = 7e-06, Method: Composition-based stats. Identities = 19/150 (12%), Positives = 45/150 (30%), Gaps = 11/150 (7%) Query: 10 QRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNT 69 +++ +D++ + + G + + + + V Sbjct: 159 ADAVTRKDVDKMFKTMFDKGYAGEKLCLVSTDMVDLMT----DEVDKAGTKVFNFGDQVA 214 Query: 70 IVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGD 129 +G + S +D N +E LR + +AK D Sbjct: 215 FGLQLGKIVSNYGSGTALIEPSLPSGT-----MIALDTNYVELRPLREW-RAEELAKTTD 268 Query: 130 ANKGVLIGEGALKVRNEKAVGVVADLFGLS 159 + + L+GE ++ N G + +L + Sbjct: 269 SKRIGLVGEYTIEY-NASNSGAILNLATAA 297 >gi|315028531|gb|EFT40463.1| conserved hypothetical protein [Enterococcus faecalis TX4000] Length = 295 Score = 53.2 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 21/154 (13%), Positives = 49/154 (31%), Gaps = 11/154 (7%) Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61 T K + +++ +D + + Q G + + E + + Sbjct: 151 TNKVETAAAGAVTRKDIDALFKTMLQKGYMGEKLCLVAPDMQELMT----DQLDEKSTKI 206 Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121 T +G + + + ID N ++ LR Sbjct: 207 VQFGDKLTFGLQLGNIVSNYGSGIALIEPNLPNGT-----IAAIDTNYVKLRPLREW-RA 260 Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 + +AK D+ + L+GE +++ + G + +L Sbjct: 261 EELAKTTDSRRIGLVGEYSIEYKASN-SGAILNL 293 >gi|307270079|ref|ZP_07551399.1| hypothetical protein HMPREF9498_02197 [Enterococcus faecalis TX4248] gi|306513574|gb|EFM82186.1| hypothetical protein HMPREF9498_02197 [Enterococcus faecalis TX4248] Length = 300 Score = 53.2 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 19/150 (12%), Positives = 46/150 (30%), Gaps = 11/150 (7%) Query: 10 QRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNT 69 +++ +D++ + + G + + + + V Sbjct: 159 ADAVTRKDVDKMFKTMFDKGYAGEKLCLVSTDMVDLMT----DEVDKAGTKVFNFGDQVA 214 Query: 70 IVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGD 129 +G + S +D N +E LR + +AK D Sbjct: 215 FGLQLGKIVSNYGSGTALIEPSLPSGT-----MIALDTNYVELRPLREW-RAEELAKTTD 268 Query: 130 ANKGVLIGEGALKVRNEKAVGVVADLFGLS 159 + + L+GE +++ N G + +L + Sbjct: 269 SKRIGLVGEYSIEY-NASNSGAILNLATAA 297 >gi|256956794|ref|ZP_05560965.1| conserved hypothetical protein [Enterococcus faecalis DS5] gi|256947290|gb|EEU63922.1| conserved hypothetical protein [Enterococcus faecalis DS5] gi|295113775|emb|CBL32412.1| hypothetical protein [Enterococcus sp. 7L76] gi|315035894|gb|EFT47826.1| conserved hypothetical protein [Enterococcus faecalis TX0027] Length = 300 Score = 52.8 bits (125), Expect = 1e-05, Method: Composition-based stats. Identities = 19/150 (12%), Positives = 46/150 (30%), Gaps = 11/150 (7%) Query: 10 QRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNT 69 +++ +D++ + + G + + + + V Sbjct: 159 ADAVTRKDVDKMFKTMFDKGYAGEKLCLVSTDMVDLMT----DEVDKAGTKVFNFGDQVA 214 Query: 70 IVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGD 129 +G + S +D N +E LR + +AK D Sbjct: 215 FGLQLGKIVSNYGSGTALIEPSLPSGT-----MIALDTNYVELRPLREW-RAEELAKTTD 268 Query: 130 ANKGVLIGEGALKVRNEKAVGVVADLFGLS 159 + + L+GE +++ N G + +L + Sbjct: 269 SKRIGLVGEYSIEY-NASNSGAILNLATAA 297 >gi|257079386|ref|ZP_05573747.1| predicted protein [Enterococcus faecalis JH1] gi|256987416|gb|EEU74718.1| predicted protein [Enterococcus faecalis JH1] Length = 300 Score = 52.4 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 18/146 (12%), Positives = 44/146 (30%), Gaps = 11/146 (7%) Query: 10 QRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNT 69 +++ +D++ + + G + + + + V Sbjct: 159 ADAVTRKDVDKMFKTMFDKGYAGEKLCLVSTDMVDLMT----DEVDKAGTKVFNFGDQVA 214 Query: 70 IVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGD 129 +G + S +D N +E +R + +AK D Sbjct: 215 FGLQLGKIVSNYGSGTALIEPSLPSGT-----MIALDTNYVELRPVREW-RAEELAKTTD 268 Query: 130 ANKGVLIGEGALKVRNEKAVGVVADL 155 + + L+GE ++ N G + +L Sbjct: 269 SKRIGLVGEYTIEY-NASNSGAILNL 293 >gi|315144740|gb|EFT88756.1| conserved hypothetical protein [Enterococcus faecalis TX2141] Length = 300 Score = 51.6 bits (122), Expect = 3e-05, Method: Composition-based stats. Identities = 18/146 (12%), Positives = 44/146 (30%), Gaps = 11/146 (7%) Query: 10 QRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNT 69 +++ +D++ + + G + + + + V Sbjct: 159 ADAVTRKDVDKMFKTMFDKGYAGEKLCLVSTDMVDLMT----DEVDKAGTKVFNFGDQVA 214 Query: 70 IVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGD 129 +G + S +D N +E +R + +AK D Sbjct: 215 FGLQLGKIVSNYGSGTALIEPSLPSGT-----MIALDTNYVELRPVREW-RAEELAKTTD 268 Query: 130 ANKGVLIGEGALKVRNEKAVGVVADL 155 + + L+GE ++ N G + +L Sbjct: 269 SKRIGLVGEYTIEY-NASNSGAILNL 293 >gi|262067743|ref|ZP_06027355.1| hypothetical protein FUSPEROL_02025 [Fusobacterium periodonticum ATCC 33693] gi|291378467|gb|EFE85985.1| hypothetical protein FUSPEROL_02025 [Fusobacterium periodonticum ATCC 33693] Length = 379 Score = 44.7 bits (104), Expect = 0.004, Method: Composition-based stats. Identities = 26/147 (17%), Positives = 46/147 (31%), Gaps = 22/147 (14%) Query: 7 DGTQRTFSKQILDEVMQEGYQNG--------DNFRHIIVSPYVKSEFVRFMSDSNVASFR 58 D + F+ +LD ++ G N + V + + D+ + Sbjct: 234 DANNQPFTLDLLDNAVKAIVDKGNPGSANLKANKYFLCVPYLILRTINKLNKDNVRS--- 290 Query: 59 YATSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTI 118 + T + G V V + + LI N + L I Sbjct: 291 -----NITDKVTGTTIEQIVTTSGTVSV-----FPATSLAPNEFLLIHLNDVSLRQLYPI 340 Query: 119 HEDKNIAKNGDANKGVLIGEGALKVRN 145 E + AK A+ L GE A +++N Sbjct: 341 KE-EEGAKTALADNYFLHGEYAHQIKN 366 >gi|328725488|ref|XP_003248499.1| PREDICTED: probable NADP-dependent mannitol dehydrogenase-like, partial [Acyrthosiphon pisum] Length = 462 Score = 38.5 bits (88), Expect = 0.34, Method: Composition-based stats. Identities = 10/68 (14%), Positives = 24/68 (35%) Query: 20 EVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATADIYDR 79 E+++ + G + I + ++ + S ++I AT D Sbjct: 260 EIVRGLAEAGADVAIIYATSDNAPAIAAEVASQTNVKVQAYQSDVRRRDSIAATVDQIVS 319 Query: 80 PFGKVMVH 87 FG++ + Sbjct: 320 DFGRIDIM 327 >gi|320535449|ref|ZP_08035557.1| hypothetical protein HMPREF9554_00276 [Treponema phagedenis F0421] gi|320147702|gb|EFW39210.1| hypothetical protein HMPREF9554_00276 [Treponema phagedenis F0421] Length = 460 Score = 36.2 bits (82), Expect = 1.7, Method: Composition-based stats. Identities = 10/69 (14%), Positives = 25/69 (36%), Gaps = 7/69 (10%) Query: 15 KQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATA 74 ++I+ + + ++ G + + P + + + D G S + + Sbjct: 192 ERIITDPVGMIFEAGGDANKLFFPPILAQDIQDLIRDRIRF-------GTSGSGAMNLVV 244 Query: 75 DIYDRPFGK 83 D Y P+G Sbjct: 245 DQYPTPYGS 253 >gi|281358094|ref|ZP_06244578.1| hypothetical protein Vvad_PD0658 [Victivallis vadensis ATCC BAA-548] gi|281315467|gb|EFA99496.1| hypothetical protein Vvad_PD0658 [Victivallis vadensis ATCC BAA-548] Length = 84 Score = 35.5 bits (80), Expect = 2.6, Method: Composition-based stats. Identities = 14/58 (24%), Positives = 21/58 (36%), Gaps = 2/58 (3%) Query: 92 MASNAETARNAFLIDPNMLEFLWLRTI--HEDKNIAKNGDANKGVLIGEGALKVRNEK 147 MA +A++IDP L+ + K D K + +GE RN Sbjct: 4 MADPDMPDTDAWVIDPAGFGMSSLKGRSITDTDATPKGFDGIKRMALGELTFVFRNAA 61 >gi|152990358|ref|YP_001356080.1| flagellar basal-body rod protein FlgF [Nitratiruptor sp. SB155-2] gi|151422219|dbj|BAF69723.1| flagellar basal-body rod protein FlgF [Nitratiruptor sp. SB155-2] Length = 238 Score = 34.7 bits (78), Expect = 4.4, Method: Composition-based stats. Identities = 11/59 (18%), Positives = 22/59 (37%) Query: 1 MTKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRY 59 +T + F + ++ E+ Q +NG + H+ V P K + F + Sbjct: 25 VTNNLANVNTTGFKEMVVKEMSQRLDENGGDSNHLFVFPRFKESILNFAQGPLKHTQNR 83 >gi|117619346|ref|YP_856478.1| methionine gamma-lyase [Aeromonas hydrophila subsp. hydrophila ATCC 7966] gi|117560753|gb|ABK37701.1| methionine gamma-lyase [Aeromonas hydrophila subsp. hydrophila ATCC 7966] Length = 387 Score = 33.5 bits (75), Expect = 9.7, Method: Composition-based stats. Identities = 25/142 (17%), Positives = 39/142 (27%), Gaps = 20/142 (14%) Query: 16 QILDEVMQEGYQNGDNFRHIIVSP-------YVKSEFVRFMSDSNVASFRYATSGKSHNN 68 + +M +G + P ++K F R Sbjct: 84 AAVSAIMYALADSG--DEVAYIGPLYGGTDAFLKQTFSRAGIKVTAYESDQDLLTNIRPA 141 Query: 69 TIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHE-------- 120 T V + P + V RV+ A R + D L LR + Sbjct: 142 TKVVLFETLTNP--TLKVVDPRVVVEAARKVRAITVCDNTFLTPYLLRPLELGIDIVMHS 199 Query: 121 -DKNIAKNGDANKGVLIGEGAL 141 K + +GD GV+ G L Sbjct: 200 GTKYLGGHGDIIAGVVAGSHTL 221 Database: nr Posted date: May 13, 2011 4:10 AM Number of letters in database: 999,999,932 Number of sequences in database: 2,987,209 Database: /data/usr2/db/fasta/nr.01 Posted date: May 13, 2011 4:17 AM Number of letters in database: 999,998,956 Number of sequences in database: 2,896,973 Database: /data/usr2/db/fasta/nr.02 Posted date: May 13, 2011 4:23 AM Number of letters in database: 999,999,979 Number of sequences in database: 2,907,862 Database: /data/usr2/db/fasta/nr.03 Posted date: May 13, 2011 4:29 AM Number of letters in database: 999,999,513 Number of sequences in database: 2,932,190 Database: /data/usr2/db/fasta/nr.04 Posted date: May 13, 2011 4:33 AM Number of letters in database: 792,586,372 Number of sequences in database: 2,260,650 Lambda K H 0.313 0.131 0.320 Lambda K H 0.267 0.0397 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,027,113,406 Number of Sequences: 13984884 Number of extensions: 24518826 Number of successful extensions: 86389 Number of sequences better than 10.0: 98 Number of HSP's better than 10.0 without gapping: 69 Number of HSP's successfully gapped in prelim test: 29 Number of HSP's that attempted gapping in prelim test: 86186 Number of HSP's gapped (non-prelim): 104 length of query: 162 length of database: 4,792,584,752 effective HSP length: 123 effective length of query: 39 effective length of database: 3,072,444,020 effective search space: 119825316780 effective search space used: 119825316780 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.4 bits) S2: 75 (33.5 bits)