BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= 537021.9.peg.1073_1 (162 letters) Database: nr 13,984,884 sequences; 4,792,584,752 total letters Searching..................................................done Results from round 1 >gi|315122533|ref|YP_004063022.1| hypothetical protein CKC_03925 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495935|gb|ADR52534.1| hypothetical protein CKC_03925 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 331 Score = 300 bits (767), Expect = 6e-80, Method: Compositional matrix adjust. Identities = 141/161 (87%), Positives = 148/161 (91%) Query: 1 MTKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYA 60 MTKKAKDG QR F+KQ+LD VMQEGYQNG NFRHI+VSPYVKSEFVRFMSDSNVASFRYA Sbjct: 170 MTKKAKDGAQRPFTKQLLDAVMQEGYQNGANFRHIVVSPYVKSEFVRFMSDSNVASFRYA 229 Query: 61 TSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHE 120 S S NNTIVATADIYD PFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLR I E Sbjct: 230 VSDNSKNNTIVATADIYDGPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRNIQE 289 Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADLFGLSKT 161 DKNIAK GDANKGVLIGEG LKV+NEKA+GV++DLFGLSKT Sbjct: 290 DKNIAKTGDANKGVLIGEGTLKVKNEKAIGVISDLFGLSKT 330 >gi|227822441|ref|YP_002826413.1| putative phage major head protein [Sinorhizobium fredii NGR234] gi|227341442|gb|ACP25660.1| putative phage major head protein [Sinorhizobium fredii NGR234] Length = 331 Score = 233 bits (593), Expect = 9e-60, Method: Compositional matrix adjust. Identities = 109/162 (67%), Positives = 124/162 (76%) Query: 1 MTKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYA 60 +T DGTQR F+K ILD VMQ+GYQNG NFRH+ VSPYVKS FV FMSDSNVA FRYA Sbjct: 170 LTVAPTDGTQRPFTKAILDTVMQQGYQNGANFRHVSVSPYVKSVFVTFMSDSNVAPFRYA 229 Query: 61 TSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHE 120 S NTIVATAD Y+ PFG VM+HPNRV A N ARNAF ID +ML FLWLR I E Sbjct: 230 VSQGGERNTIVATADYYEGPFGTVMIHPNRVQAVNGGVARNAFFIDSDMLSFLWLRQIQE 289 Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADLFGLSKTT 162 D+++AK GDA+KGV+IGEG LKV NE+ +GV ADLFGLS + Sbjct: 290 DRDVAKTGDADKGVIIGEGTLKVHNERGLGVAADLFGLSSAS 331 >gi|150397033|ref|YP_001327500.1| hypothetical protein Smed_1830 [Sinorhizobium medicae WSM419] gi|150028548|gb|ABR60665.1| hypothetical protein Smed_1830 [Sinorhizobium medicae WSM419] Length = 331 Score = 226 bits (576), Expect = 8e-58, Method: Compositional matrix adjust. Identities = 110/160 (68%), Positives = 126/160 (78%) Query: 1 MTKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYA 60 +T DG QR F+K ILD VMQ GYQ+G NFRH+ VSPYVKS FV FMSDSNVA FRYA Sbjct: 170 LTVAPTDGEQRAFTKSILDSVMQAGYQSGANFRHVSVSPYVKSVFVTFMSDSNVAPFRYA 229 Query: 61 TSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHE 120 S NTIVATAD Y+ PFG VM+HPNRV A+ A TARNAF +D +MLEFLWLR I E Sbjct: 230 VSQGGERNTIVATADYYEGPFGTVMIHPNRVQAATAATARNAFFLDTDMLEFLWLRKIQE 289 Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADLFGLSK 160 DK++AK GDA+KGV+IGEG LKV+NEK +GV ADLFGLS+ Sbjct: 290 DKDVAKTGDADKGVIIGEGTLKVKNEKGLGVAADLFGLSE 329 >gi|227821704|ref|YP_002825674.1| putative phage related protein [Sinorhizobium fredii NGR234] gi|227340703|gb|ACP24921.1| putative phage related protein [Sinorhizobium fredii NGR234] Length = 123 Score = 166 bits (419), Expect = 1e-39, Method: Compositional matrix adjust. Identities = 77/111 (69%), Positives = 86/111 (77%) Query: 25 GYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATADIYDRPFGKV 84 GYQNG NFRH+ SPYVKS FV FMSDSNVA FRYA S NTIVATAD Y+ PFG V Sbjct: 6 GYQNGANFRHVSASPYVKSVFVTFMSDSNVAPFRYAVSKGGERNTIVATADYYEGPFGTV 65 Query: 85 MVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDANKGVL 135 M+HPNRV A+ A ARNAF ID +ML FLWLR I EDK++AK GDA+KGV+ Sbjct: 66 MIHPNRVQATGATQARNAFFIDTDMLSFLWLRKIQEDKDVAKTGDADKGVI 116 >gi|160897389|ref|YP_001562971.1| putative phage major head protein [Delftia acidovorans SPH-1] gi|160362973|gb|ABX34586.1| putative phage major head protein [Delftia acidovorans SPH-1] Length = 306 Score = 70.9 bits (172), Expect = 6e-11, Method: Compositional matrix adjust. Identities = 46/149 (30%), Positives = 72/149 (48%), Gaps = 13/149 (8%) Query: 8 GTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHN 67 GTQR F + + +V+Q+ Y G I++ P K F F ++ KS + Sbjct: 171 GTQRAFVESQVKDVLQKCYTAGGEPDTIMMPPAAKQTFSTFTGNATRMD-------KSED 223 Query: 68 NTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKN 127 + + A+ D Y FG + PNR MA+ R+ FL+ + L +LR + +IAK Sbjct: 224 SKLYASVDYYVSDFGTIQAVPNRFMAT-----RDVFLLQSDKLAVAYLRPF-QTTDIAKT 277 Query: 128 GDANKGVLIGEGALKVRNEKAVGVVADLF 156 GD++ +I E L+ R KA G V D+ Sbjct: 278 GDSDAQQIIVEYTLECRAPKAHGAVYDIL 306 >gi|221199511|ref|ZP_03572555.1| major head protein [Burkholderia multivorans CGD2M] gi|221205587|ref|ZP_03578602.1| major head protein [Burkholderia multivorans CGD2] gi|221174425|gb|EEE06857.1| major head protein [Burkholderia multivorans CGD2] gi|221180796|gb|EEE13199.1| major head protein [Burkholderia multivorans CGD2M] Length = 317 Score = 68.6 bits (166), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 46/148 (31%), Positives = 72/148 (48%), Gaps = 6/148 (4%) Query: 8 GTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHN 67 GT R F++ +L E ++ Y+ G N + + P K F + A+ + +G Sbjct: 174 GTARPFTEDLLKEALRLAYEAGGNVTQVHMRPVDKQRASGFAGN---ATRMESVNGTGKT 230 Query: 68 NTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKN 127 + + IY FG V + PNRVM++ +TA + IDP+M LR E + +AK Sbjct: 231 AVLQTSYSIYASDFGNVAMIPNRVMSAVNDTA--VYAIDPSMWALATLRGF-EKEELAKV 287 Query: 128 GDANKGVLIGEGALKVRNEKAVGVVADL 155 GDA + EG L+ RNE + + DL Sbjct: 288 GDARNWQITWEGTLEARNEASSAQIRDL 315 >gi|291334595|gb|ADD94245.1| putative phage major head protein [uncultured phage MedDCM-OCT-S04-C136] Length = 316 Score = 63.9 bits (154), Expect = 7e-09, Method: Compositional matrix adjust. Identities = 42/153 (27%), Positives = 76/153 (49%), Gaps = 12/153 (7%) Query: 7 DGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSH 66 DGTQR F++ L ++ Y G N I+V+ + K + F S R+ + Sbjct: 169 DGTQRAFTEDQLKGILVSCYNEGGNPNMIMVNAFNKQKLSGFTGGST----RF---DAAE 221 Query: 67 NNTIVATADIYDRPFGKVMVHPNRVM----ASNAETARNAFLIDPNMLEFLWLRTIHEDK 122 + ++ + D+Y+ FG + V PNR + + A+ ++A ++D +LR + Sbjct: 222 DRRLITSIDVYESDFGTMQVSPNRFIRGANGTAAKIGQDAHILDMEYWAVSFLRDFAL-Q 280 Query: 123 NIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 A+ DA++ ++ E L+ RNEKA G++ DL Sbjct: 281 TPAQTADADQRFMVAEYTLESRNEKASGLITDL 313 >gi|288817864|ref|YP_003432211.1| putative phage major head protein [Hydrogenobacter thermophilus TK-6] gi|288787263|dbj|BAI69010.1| putative phage major head protein [Hydrogenobacter thermophilus TK-6] gi|308751463|gb|ADO44946.1| putative phage major head protein [Hydrogenobacter thermophilus TK-6] Length = 291 Score = 58.9 bits (141), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 42/153 (27%), Positives = 78/153 (50%), Gaps = 17/153 (11%) Query: 5 AKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSG- 63 A G+ R ++ +L++ +Q+ +Q G N ++VS K V+SF + Sbjct: 153 ANGGSPRALTETLLNDGIQQAWQAGGNPDVVVVSGKNKR---------TVSSFTAGVTKT 203 Query: 64 -KSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDK 122 + + +V+T D+Y+ FG V + +R M + F+++ + +LR + + Sbjct: 204 VAAEDKKLVSTVDVYESDFGLVRIVADRWMPDS-----KVFILEKGRFKIAYLRPFKQ-Q 257 Query: 123 NIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 IAK GD + V++GE L+VR EKA ++AD+ Sbjct: 258 EIAKVGDRIERVVVGELTLEVRAEKANAIIADI 290 >gi|283856246|ref|YP_162122.2| putative phage major head protein [Zymomonas mobilis subsp. mobilis ZM4] gi|283775241|gb|AAV89011.2| putative phage major head protein [Zymomonas mobilis subsp. mobilis ZM4] Length = 304 Score = 57.0 bits (136), Expect = 8e-07, Method: Compositional matrix adjust. Identities = 47/154 (30%), Positives = 74/154 (48%), Gaps = 11/154 (7%) Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61 T A G R +++ +L EV+ + + +G +I S +K F +AS R T Sbjct: 159 TTAATTGKARLYTEALLKEVLGDIWVSGGQPNMVITSLKLKQTAAAF---PGLASNRRDT 215 Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121 + I+A ADIY G+V P+R +++ A +IDP L I + Sbjct: 216 GDQKAR--IIAGADIYVSDVGEVQFVPDRFCDNSS-----ALVIDPEYWSVATLDPIQK- 267 Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 +++A G A++ L E AL+ NE A GV+ADL Sbjct: 268 RSLATTGLADRDALYTEIALRCHNEAASGVLADL 301 >gi|163783849|ref|ZP_02178828.1| hypothetical protein HG1285_12862 [Hydrogenivirga sp. 128-5-R1-1] gi|159880872|gb|EDP74397.1| hypothetical protein HG1285_12862 [Hydrogenivirga sp. 128-5-R1-1] Length = 291 Score = 55.8 bits (133), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 39/151 (25%), Positives = 76/151 (50%), Gaps = 13/151 (8%) Query: 5 AKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGK 64 A GT R ++ +L++ +Q+ +Q G N ++V+ K F A + K Sbjct: 153 ANAGTPRPLTETLLNDGLQQAWQTGGNPDVVVVNGTNKRTISGF-----TAGLTKEIAAK 207 Query: 65 SHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNI 124 + ++A+ D+Y+ FG V V NR M ++ F+++ + +L H+++ + Sbjct: 208 --DKRLIASVDVYESDFGLVRVIANRWMPAD-----KVFILEKGRWKVAYLDPFHQEE-L 259 Query: 125 AKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 +K GD + V++GE L+ R EKA ++ D+ Sbjct: 260 SKVGDRKERVVVGELTLEGRAEKANAIITDI 290 >gi|241760939|ref|ZP_04759028.1| putative phage major head protein [Zymomonas mobilis subsp. mobilis ATCC 10988] gi|241374558|gb|EER64019.1| putative phage major head protein [Zymomonas mobilis subsp. mobilis ATCC 10988] Length = 238 Score = 55.1 bits (131), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 46/154 (29%), Positives = 72/154 (46%), Gaps = 11/154 (7%) Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61 T A G R +++ +L EV+ + + +G +I S +K F +AS R T Sbjct: 93 TTAATAGKARLYTEALLKEVLGDIWVSGGQPNMVITSLKLKQTAAAF---PGLASNRRDT 149 Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121 I+A ADIY G+V P+R ++ +A +IDP L I + Sbjct: 150 G--DQKARIIAGADIYVSDVGEVQFVPDRFCDNS-----SALVIDPEYWSVATLDPIQK- 201 Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 +++A G A++ L E AL+ NE GV+ADL Sbjct: 202 RSLATTGLADRDALYTEIALRCHNEATSGVLADL 235 >gi|167600435|ref|YP_001671935.1| major head protein [Pseudomonas phage LUZ24] gi|161168298|emb|CAP45463.1| major head protein [Pseudomonas phage LUZ24] Length = 317 Score = 53.9 bits (128), Expect = 7e-06, Method: Compositional matrix adjust. Identities = 39/148 (26%), Positives = 66/148 (44%), Gaps = 11/148 (7%) Query: 8 GTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHN 67 G R ++ +L + ++NG I S +K + N+ + + + Sbjct: 177 GDLRLLTEDMLLNASEAIWRNGGQANSIQTSSSIKKAISK-----NMKGRATEITLDASD 231 Query: 68 NTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKN 127 N I T D+Y+ FGK + NR N F+ DP M +LR + + +AK Sbjct: 232 NRIAQTVDVYESDFGKYTIRANRWFHENT-----MFIFDPKMHALCYLRPFFQHE-LAKT 285 Query: 128 GDANKGVLIGEGALKVRNEKAVGVVADL 155 GD+ K L+ E L+V NEK+ ++ D+ Sbjct: 286 GDSEKRQLLVEYTLRVNNEKSGALIRDV 313 >gi|291334838|gb|ADD94478.1| putative phage major head protein [uncultured phage MedDCM-OCT-S06-C1041] Length = 323 Score = 52.8 bits (125), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 43/146 (29%), Positives = 66/146 (45%), Gaps = 9/146 (6%) Query: 8 GTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFR-YATSGKSH 66 GT TF + L V+++ + NG + I+ + K F FR SG Sbjct: 180 GTAATFVEADLKSVVKQCWDNGGDPGVIMTGSFNKQAASGFAGIGT--QFRDVQPSGPVS 237 Query: 67 NNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAK 126 +IV ADIY FG+ + NR M A N + +D +LR I + ++ K Sbjct: 238 PGSIVGAADIYISDFGQHQIVANRFMP-----AANVYALDMEYWCVAYLRPIQTE-DLGK 291 Query: 127 NGDANKGVLIGEGALKVRNEKAVGVV 152 GD+++ +LI E L +NE A G + Sbjct: 292 TGDSDRRMLISEYTLGSKNEAASGKI 317 >gi|291334638|gb|ADD94286.1| putative phage major head protein [uncultured phage MedDCM-OCT-S04-C64] Length = 323 Score = 52.8 bits (125), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 43/146 (29%), Positives = 66/146 (45%), Gaps = 9/146 (6%) Query: 8 GTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFR-YATSGKSH 66 GT TF + L V+++ + NG + I+ + K F FR SG Sbjct: 180 GTAATFVEADLKSVVKQCWDNGGDPGVIMTGSFNKQAASGFAGIGT--QFRDVQPSGPVS 237 Query: 67 NNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAK 126 +IV ADIY FG+ + NR M A N + +D +LR I + ++ K Sbjct: 238 PGSIVGAADIYISDFGQHQIVANRFMP-----AANVYALDMEYWCVAYLRPIQTE-DLGK 291 Query: 127 NGDANKGVLIGEGALKVRNEKAVGVV 152 GD+++ +LI E L +NE A G + Sbjct: 292 TGDSDRRMLISEYTLGSKNEAASGKI 317 >gi|27476049|ref|NP_775251.1| major head protein [Pseudomonas phage PaP3] gi|27414479|gb|AAL85565.1| major head protein [Pseudomonas phage PaP3] Length = 317 Score = 52.4 bits (124), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 38/148 (25%), Positives = 65/148 (43%), Gaps = 11/148 (7%) Query: 8 GTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHN 67 G R ++ +L + ++NG I S +K + N+ + + + Sbjct: 177 GDLRLLTEDMLLNASESIWRNGGQANSIQTSSSIKKAISK-----NMKGRATEITLDASD 231 Query: 68 NTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKN 127 N I T D+Y+ FGK + NR N F+ DP M +LR + + +AK Sbjct: 232 NRIAQTVDVYESDFGKYTIRANRWFHENT-----LFVFDPKMHSLCYLRPFFQHE-LAKT 285 Query: 128 GDANKGVLIGEGALKVRNEKAVGVVADL 155 GD+ K L+ E +V NEK+ ++ D+ Sbjct: 286 GDSEKRQLLVEYTFRVNNEKSGALIRDV 313 >gi|148257053|ref|YP_001241638.1| putative phage major head protein [Bradyrhizobium sp. BTAi1] gi|146409226|gb|ABQ37732.1| putative phage Major head protein [Bradyrhizobium sp. BTAi1] Length = 320 Score = 51.6 bits (122), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 43/153 (28%), Positives = 71/153 (46%), Gaps = 21/153 (13%) Query: 7 DGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSH 66 DGTQ F++ L V+ + NG I+ + K F F S ++ Sbjct: 180 DGTQIAFTEARLKTVLSSIWINGGKPGTILTGAFNKQVFSTFTGRSTAIE-------EAK 232 Query: 67 NNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKN--- 123 + IVA+ D Y+ FGK+ +V+AS + AR+ +++ + W +N Sbjct: 233 SKKIVASVDAYESDFGKL-----KVVASRFQRARDVLVLEMDK----WAVAYLNGRNMIS 283 Query: 124 --IAKNGDANKGVLIGEGALKVRNEKAVGVVAD 154 +AK GD+++ ++ E AL RNEKA G + D Sbjct: 284 IPLAKTGDSDRRQILAEYALVARNEKASGGIFD 316 >gi|291334405|gb|ADD94061.1| major head protein [uncultured phage MedDCM-OCT-S01-C1] Length = 344 Score = 51.6 bits (122), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 43/153 (28%), Positives = 72/153 (47%), Gaps = 8/153 (5%) Query: 7 DGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATS--GK 64 DG +F Q + + +Q Y+ G H++ +P V +M D N A TS G Sbjct: 188 DGVALSF--QAVKDAIQGVYEQGGEVTHLMSNPGVIGALSSYMFD-NEARVATLTSDQGA 244 Query: 65 SHNN--TIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDK 122 N+ T +++ ++ FG + + PNR+ A AFL+DP + +L D Sbjct: 245 PANSKATALSSVNVLVSDFGTIKLVPNRLQPLAANGTAVAFLLDPEYVSLSYLEGYRTD- 303 Query: 123 NIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 +AK G A K + + L+V EKA G++ ++ Sbjct: 304 TLAKTGLAEKRQISVDWGLRVHTEKAHGMLVNI 336 >gi|294083943|ref|YP_003550700.1| major head protein [Candidatus Puniceispirillum marinum IMCC1322] gi|292663515|gb|ADE38616.1| major head protein [Candidatus Puniceispirillum marinum IMCC1322] Length = 133 Score = 46.2 bits (108), Expect = 0.001, Method: Compositional matrix adjust. Identities = 36/143 (25%), Positives = 63/143 (44%), Gaps = 21/143 (14%) Query: 18 LDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHN-----NTIVA 72 L + Q Y G + ++ P SDS + + +SG++ N T+V Sbjct: 6 LLDAHQAAYTAGADVSVFMIKP----------SDSEIVAGFTGSSGRTRNFNDGTTTLVN 55 Query: 73 TADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDANK 132 D+ P+G + V NR + +AFL+DP+M LR + ++K GD++K Sbjct: 56 AVDVMINPYGTLKVVLNR-----HQLTTHAFLLDPSMWRSAVLRPVSRTL-LSKTGDSDK 109 Query: 133 GVLIGEGALKVRNEKAVGVVADL 155 + EG L+ N+ ++ L Sbjct: 110 HFCVYEGGLQHLNQSGSAMITGL 132 >gi|316934287|ref|YP_004109269.1| putative phage major head protein [Rhodopseudomonas palustris DX-1] gi|315602001|gb|ADU44536.1| putative phage major head protein [Rhodopseudomonas palustris DX-1] Length = 304 Score = 44.7 bits (104), Expect = 0.005, Method: Compositional matrix adjust. Identities = 29/91 (31%), Positives = 48/91 (52%), Gaps = 14/91 (15%) Query: 70 IVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKN-----I 124 IV ++Y PFG+ +++ S A + L DP M W R I + +N + Sbjct: 218 IVNAVNLYVSPFGE-----EKIVLSRHLKAGDTLLFDPAM----WARVILQGRNWFRETL 268 Query: 125 AKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 AK GD+ K +++GE LK +N+KA +V ++ Sbjct: 269 AKTGDSLKMMIVGEFGLKHKNQKASALVREM 299 >gi|196048420|ref|ZP_03115595.1| conserved hypothetical protein [Bacillus cereus 03BB108] gi|196020677|gb|EDX59409.1| conserved hypothetical protein [Bacillus cereus 03BB108] Length = 312 Score = 39.3 bits (90), Expect = 0.18, Method: Compositional matrix adjust. Identities = 28/90 (31%), Positives = 49/90 (54%), Gaps = 6/90 (6%) Query: 68 NTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKN 127 N I TAD+ +G+ + NR M A T ++D N ++ LR + + +AKN Sbjct: 217 NIIGLTADVIHTRYGRGNIVLNRHMPDGALT-----IVDLNQVKLRPLRKMLAEP-LAKN 270 Query: 128 GDANKGVLIGEGALKVRNEKAVGVVADLFG 157 GD+ K +++GE +L+++N A V+ + G Sbjct: 271 GDSQKRMIVGEYSLELKNSYAGAVINGITG 300 >gi|307308932|ref|ZP_07588615.1| putative phage major head protein [Sinorhizobium meliloti BL225C] gi|306900566|gb|EFN31179.1| putative phage major head protein [Sinorhizobium meliloti BL225C] Length = 309 Score = 38.9 bits (89), Expect = 0.25, Method: Compositional matrix adjust. Identities = 39/152 (25%), Positives = 64/152 (42%), Gaps = 19/152 (12%) Query: 7 DGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSH 66 DGT R S ++ E M EG + I++ P + + + +S+ F+ A+ Sbjct: 173 DGTARALSHALIVE-MAEGIHSEGGVPKILLVPPAQKKNIVALSNGTT-KFQDAS----- 225 Query: 67 NNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHE---DKN 123 T+ +Y+ PFG+ + +R + S+ DP LW + + K Sbjct: 226 KKTVFGDVTMYETPFGRFDIVTSRDVRSS-----TVIGYDPE----LWAQAVFRGLTKKK 276 Query: 124 IAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 + + GD +I E L RNE G VADL Sbjct: 277 LPEGGDYEGYQVITEVTLVCRNEAGNGKVADL 308 >gi|281417131|ref|ZP_06248151.1| conserved hypothetical protein [Clostridium thermocellum JW20] gi|281408533|gb|EFB38791.1| conserved hypothetical protein [Clostridium thermocellum JW20] Length = 292 Score = 37.4 bits (85), Expect = 0.76, Method: Compositional matrix adjust. Identities = 26/71 (36%), Positives = 38/71 (53%), Gaps = 6/71 (8%) Query: 81 FGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDANKGVLIGEGA 140 +G V V NR M + ++DP L ++LR + +AK+GD KG++I E Sbjct: 226 YGIVNVVLNRHMPVD-----KILVVDPTYLRLVYLRKPSFEM-LAKDGDNLKGMVITELT 279 Query: 141 LKVRNEKAVGV 151 LK+ N KAV V Sbjct: 280 LKLLNSKAVAV 290 >gi|90579438|ref|ZP_01235248.1| hypothetical protein VAS14_07013 [Vibrio angustum S14] gi|90440271|gb|EAS65452.1| hypothetical protein VAS14_07013 [Vibrio angustum S14] Length = 424 Score = 36.6 bits (83), Expect = 1.1, Method: Compositional matrix adjust. Identities = 29/114 (25%), Positives = 50/114 (43%), Gaps = 12/114 (10%) Query: 52 SNVASFRYATSGKSHNNTIVATADIYDRPFGKVMVH----PNRVMASNAETARNAFLIDP 107 +N+A+ R KS++ ++ D+Y P + + P+ VM + ET + + DP Sbjct: 175 ANLATMRALLGNKSNSEHLMVMLDLYHLPERENKFNQNPFPHYVMLTETETPEHWMMWDP 234 Query: 108 NMLEFLWLRTIHEDK--NIAKNGDANKGVLIGEGALKVRNEKAVGVVADLFGLS 159 + F W I +D+ N G + E AL ++ + VV D F L Sbjct: 235 D---FRWQGEISQDRIENAISQPTVEGGYIFNENAL---HDAPLNVVQDYFKLC 282 >gi|89072638|ref|ZP_01159203.1| hypothetical protein SKA34_18949 [Photobacterium sp. SKA34] gi|89051458|gb|EAR56912.1| hypothetical protein SKA34_18949 [Photobacterium sp. SKA34] Length = 424 Score = 36.6 bits (83), Expect = 1.2, Method: Compositional matrix adjust. Identities = 30/115 (26%), Positives = 50/115 (43%), Gaps = 14/115 (12%) Query: 52 SNVASFRYATSGKSHNNTIVATADIYDRP-----FGKVMVHPNRVMASNAETARNAFLID 106 +N+AS R + KS+ + ++ D+Y P F + P+ VM + T + + D Sbjct: 175 ANLASMRALLNNKSNQDHLMVMLDLYHLPERENKFNQNPF-PHYVMLTETSTPEHWLMWD 233 Query: 107 PNMLEFLWLRTIHEDK--NIAKNGDANKGVLIGEGALKVRNEKAVGVVADLFGLS 159 P+ F W I +D+ N G + E AL ++ + VV D F L Sbjct: 234 PD---FRWQGEISQDRIENAISQPTVEGGYIFNENAL---HDAPLNVVQDYFKLC 282 >gi|165973117|gb|ABY76238.1| EagI restriction endonuclease [Pantoea agglomerans] Length = 301 Score = 36.2 bits (82), Expect = 1.5, Method: Compositional matrix adjust. Identities = 24/80 (30%), Positives = 42/80 (52%), Gaps = 7/80 (8%) Query: 65 SHNNTIVATADIYDRPFGKVMVHPNRVMASNAET----ARNAFLIDPNML---EFLWLRT 117 +H+ TI+ P+G V++ PNR+ A++ ET +R+AF D L F+ R Sbjct: 42 NHDQTIIYGTCSVTSPYGDVIICPNRLYANDYETLHKVSRDAFGDDVPFLTYSNFIKYRA 101 Query: 118 IHEDKNIAKNGDANKGVLIG 137 ++D +A ++ K V +G Sbjct: 102 TYKDCIVALGKNSGKEVQVG 121 >gi|292669402|ref|ZP_06602828.1| aspartate-tRNA ligase [Selenomonas noxia ATCC 43541] gi|292648963|gb|EFF66935.1| aspartate-tRNA ligase [Selenomonas noxia ATCC 43541] Length = 602 Score = 35.0 bits (79), Expect = 3.3, Method: Compositional matrix adjust. Identities = 24/75 (32%), Positives = 34/75 (45%), Gaps = 8/75 (10%) Query: 41 VKSEFVRFMSDSNVASFRYATSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETAR 100 VKS F +F SD + + A K+ + +V + D+P V + E R Sbjct: 370 VKSPFKKFYSDETFETIKNAVGAKTGDLLLV----VADQP----AVVAQALGELRLEMGR 421 Query: 101 NAFLIDPNMLEFLWL 115 LIDPN L FLW+ Sbjct: 422 RRNLIDPNALSFLWV 436 >gi|229187822|ref|ZP_04314947.1| hypothetical protein bcere0004_53480 [Bacillus cereus BGSC 6E1] gi|228595657|gb|EEK53352.1| hypothetical protein bcere0004_53480 [Bacillus cereus BGSC 6E1] Length = 313 Score = 34.7 bits (78), Expect = 4.4, Method: Compositional matrix adjust. Identities = 24/86 (27%), Positives = 47/86 (54%), Gaps = 6/86 (6%) Query: 73 TADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDANK 132 T D+ +G+ + NR M A T ++D N + LR + K ++K GD++K Sbjct: 222 TVDVIHTRYGRGNIVLNRHMPDGALT-----IVDLNQTKLRPLRKMAAQK-LSKIGDSDK 275 Query: 133 GVLIGEGALKVRNEKAVGVVADLFGL 158 +++GE +L+++N A V++ + G+ Sbjct: 276 QMIVGEYSLELKNSYAGAVISGITGI 301 >gi|114566839|ref|YP_753993.1| hypothetical protein Swol_1314 [Syntrophomonas wolfei subsp. wolfei str. Goettingen] gi|114337774|gb|ABI68622.1| hypothetical protein Swol_1314 [Syntrophomonas wolfei subsp. wolfei str. Goettingen] Length = 398 Score = 34.7 bits (78), Expect = 4.7, Method: Compositional matrix adjust. Identities = 22/71 (30%), Positives = 38/71 (53%), Gaps = 6/71 (8%) Query: 81 FGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDANKGVLIGEGA 140 +G + + NR M ++ + DP ++ +LR H + +AK GD+ G +I E Sbjct: 223 YGTIKLVLNRHMPTD-----KMVVFDPAFVKISYLRQPHFEL-LAKTGDSINGHVIAEMT 276 Query: 141 LKVRNEKAVGV 151 LK+ N+KA+ V Sbjct: 277 LKMLNQKAIAV 287 >gi|238926284|ref|ZP_04658044.1| aspartyl-tRNA synthetase [Selenomonas flueggei ATCC 43531] gi|238885964|gb|EEQ49602.1| aspartyl-tRNA synthetase [Selenomonas flueggei ATCC 43531] Length = 598 Score = 34.3 bits (77), Expect = 5.8, Method: Composition-based stats. Identities = 23/75 (30%), Positives = 35/75 (46%), Gaps = 8/75 (10%) Query: 41 VKSEFVRFMSDSNVASFRYATSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETAR 100 VKS F +F SD + + A K+ + +V + D+P +V + E R Sbjct: 370 VKSPFKKFYSDETFETIKNAVGAKTGDLLLV----VADQP----LVVAQALGELRLEMGR 421 Query: 101 NAFLIDPNMLEFLWL 115 LIDP+ L FLW+ Sbjct: 422 RRGLIDPDALRFLWV 436 >gi|304437074|ref|ZP_07397037.1| aspartate--tRNA ligase [Selenomonas sp. oral taxon 149 str. 67H29BP] gi|304370025|gb|EFM23687.1| aspartate--tRNA ligase [Selenomonas sp. oral taxon 149 str. 67H29BP] Length = 602 Score = 34.3 bits (77), Expect = 6.0, Method: Composition-based stats. Identities = 23/75 (30%), Positives = 35/75 (46%), Gaps = 8/75 (10%) Query: 41 VKSEFVRFMSDSNVASFRYATSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETAR 100 VKS F +F SD + + A K+ + +V + D+P +V + E R Sbjct: 370 VKSPFKKFYSDETFETIKNAVGAKTGDLLLV----VADQP----LVVAQALGELRLEMGR 421 Query: 101 NAFLIDPNMLEFLWL 115 LIDP+ L FLW+ Sbjct: 422 RRGLIDPDALRFLWV 436 >gi|228470849|ref|ZP_04055696.1| signal peptide peptidase SppA, 67K type [Porphyromonas uenonis 60-3] gi|228307416|gb|EEK16426.1| signal peptide peptidase SppA, 67K type [Porphyromonas uenonis 60-3] Length = 594 Score = 33.9 bits (76), Expect = 6.8, Method: Compositional matrix adjust. Identities = 15/38 (39%), Positives = 25/38 (65%) Query: 107 PNMLEFLWLRTIHEDKNIAKNGDANKGVLIGEGALKVR 144 P+ L F+ LR ++ ++ A++G+AN GVL EG + V Sbjct: 286 PDDLYFVSLRDVYATRSSARSGEANIGVLFAEGEINVE 323 >gi|303327249|ref|ZP_07357691.1| succinate-semialdehyde dehydrogenase [NAD(P)+] [Desulfovibrio sp. 3_1_syn3] gi|302863237|gb|EFL86169.1| succinate-semialdehyde dehydrogenase [NAD(P)+] [Desulfovibrio sp. 3_1_syn3] Length = 478 Score = 33.9 bits (76), Expect = 8.0, Method: Compositional matrix adjust. Identities = 19/46 (41%), Positives = 28/46 (60%), Gaps = 2/46 (4%) Query: 57 FRYATSGKSHNNTIVATADIYDRPFGKVMVH-PNRVMASNAETARN 101 F YATS S N ++A AD+Y+R +++ H + V +N ET RN Sbjct: 249 FDYATSCSS-ENAVIAVADVYERFMEEMLAHGCHLVSGANRETLRN 293 Searching..................................................done Results from round 2 >gi|315122533|ref|YP_004063022.1| hypothetical protein CKC_03925 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495935|gb|ADR52534.1| hypothetical protein CKC_03925 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 331 Score = 224 bits (570), Expect = 3e-57, Method: Composition-based stats. Identities = 141/161 (87%), Positives = 148/161 (91%) Query: 1 MTKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYA 60 MTKKAKDG QR F+KQ+LD VMQEGYQNG NFRHI+VSPYVKSEFVRFMSDSNVASFRYA Sbjct: 170 MTKKAKDGAQRPFTKQLLDAVMQEGYQNGANFRHIVVSPYVKSEFVRFMSDSNVASFRYA 229 Query: 61 TSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHE 120 S S NNTIVATADIYD PFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLR I E Sbjct: 230 VSDNSKNNTIVATADIYDGPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRNIQE 289 Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADLFGLSKT 161 DKNIAK GDANKGVLIGEG LKV+NEKA+GV++DLFGLSKT Sbjct: 290 DKNIAKTGDANKGVLIGEGTLKVKNEKAIGVISDLFGLSKT 330 >gi|227822441|ref|YP_002826413.1| putative phage major head protein [Sinorhizobium fredii NGR234] gi|227341442|gb|ACP25660.1| putative phage major head protein [Sinorhizobium fredii NGR234] Length = 331 Score = 221 bits (562), Expect = 3e-56, Method: Composition-based stats. Identities = 109/162 (67%), Positives = 124/162 (76%) Query: 1 MTKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYA 60 +T DGTQR F+K ILD VMQ+GYQNG NFRH+ VSPYVKS FV FMSDSNVA FRYA Sbjct: 170 LTVAPTDGTQRPFTKAILDTVMQQGYQNGANFRHVSVSPYVKSVFVTFMSDSNVAPFRYA 229 Query: 61 TSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHE 120 S NTIVATAD Y+ PFG VM+HPNRV A N ARNAF ID +ML FLWLR I E Sbjct: 230 VSQGGERNTIVATADYYEGPFGTVMIHPNRVQAVNGGVARNAFFIDSDMLSFLWLRQIQE 289 Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADLFGLSKTT 162 D+++AK GDA+KGV+IGEG LKV NE+ +GV ADLFGLS + Sbjct: 290 DRDVAKTGDADKGVIIGEGTLKVHNERGLGVAADLFGLSSAS 331 >gi|150397033|ref|YP_001327500.1| hypothetical protein Smed_1830 [Sinorhizobium medicae WSM419] gi|150028548|gb|ABR60665.1| hypothetical protein Smed_1830 [Sinorhizobium medicae WSM419] Length = 331 Score = 213 bits (542), Expect = 6e-54, Method: Composition-based stats. Identities = 110/161 (68%), Positives = 126/161 (78%) Query: 1 MTKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYA 60 +T DG QR F+K ILD VMQ GYQ+G NFRH+ VSPYVKS FV FMSDSNVA FRYA Sbjct: 170 LTVAPTDGEQRAFTKSILDSVMQAGYQSGANFRHVSVSPYVKSVFVTFMSDSNVAPFRYA 229 Query: 61 TSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHE 120 S NTIVATAD Y+ PFG VM+HPNRV A+ A TARNAF +D +MLEFLWLR I E Sbjct: 230 VSQGGERNTIVATADYYEGPFGTVMIHPNRVQAATAATARNAFFLDTDMLEFLWLRKIQE 289 Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADLFGLSKT 161 DK++AK GDA+KGV+IGEG LKV+NEK +GV ADLFGLS+ Sbjct: 290 DKDVAKTGDADKGVIIGEGTLKVKNEKGLGVAADLFGLSEA 330 >gi|291334595|gb|ADD94245.1| putative phage major head protein [uncultured phage MedDCM-OCT-S04-C136] Length = 316 Score = 166 bits (419), Expect = 1e-39, Method: Composition-based stats. Identities = 41/154 (26%), Positives = 74/154 (48%), Gaps = 12/154 (7%) Query: 6 KDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKS 65 DGTQR F++ L ++ Y G N I+V+ + K + F S + Sbjct: 168 TDGTQRAFTEDQLKGILVSCYNEGGNPNMIMVNAFNKQKLSGFTGGSTRF-------DAA 220 Query: 66 HNNTIVATADIYDRPFGKVMVHPNRVM----ASNAETARNAFLIDPNMLEFLWLRTIHED 121 + ++ + D+Y+ FG + V PNR + + A+ ++A ++D +LR Sbjct: 221 EDRRLITSIDVYESDFGTMQVSPNRFIRGANGTAAKIGQDAHILDMEYWAVSFLRDFAL- 279 Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 + A+ DA++ ++ E L+ RNEKA G++ DL Sbjct: 280 QTPAQTADADQRFMVAEYTLESRNEKASGLITDL 313 >gi|160897389|ref|YP_001562971.1| putative phage major head protein [Delftia acidovorans SPH-1] gi|160362973|gb|ABX34586.1| putative phage major head protein [Delftia acidovorans SPH-1] Length = 306 Score = 162 bits (410), Expect = 1e-38, Method: Composition-based stats. Identities = 46/154 (29%), Positives = 70/154 (45%), Gaps = 13/154 (8%) Query: 3 KKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATS 62 GTQR F + + +V+Q+ Y G I++ P K F F ++ Sbjct: 166 TGYTPGTQRAFVESQVKDVLQKCYTAGGEPDTIMMPPAAKQTFSTFTGNATRM------- 218 Query: 63 GKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDK 122 KS ++ + A+ D Y FG + PNR MA R+ FL+ + L +LR Sbjct: 219 DKSEDSKLYASVDYYVSDFGTIQAVPNRFMA-----TRDVFLLQSDKLAVAYLRPFQTT- 272 Query: 123 NIAKNGDANKGVLIGEGALKVRNEKAVGVVADLF 156 +IAK GD++ +I E L+ R KA G V D+ Sbjct: 273 DIAKTGDSDAQQIIVEYTLECRAPKAHGAVYDIL 306 >gi|163783849|ref|ZP_02178828.1| hypothetical protein HG1285_12862 [Hydrogenivirga sp. 128-5-R1-1] gi|159880872|gb|EDP74397.1| hypothetical protein HG1285_12862 [Hydrogenivirga sp. 128-5-R1-1] Length = 291 Score = 159 bits (402), Expect = 1e-37, Method: Composition-based stats. Identities = 37/151 (24%), Positives = 74/151 (49%), Gaps = 13/151 (8%) Query: 5 AKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGK 64 A GT R ++ +L++ +Q+ +Q G N ++V+ K F + Sbjct: 153 ANAGTPRPLTETLLNDGLQQAWQTGGNPDVVVVNGTNKRTISGFTAGLTK-------EIA 205 Query: 65 SHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNI 124 + + ++A+ D+Y+ FG V V NR M ++ F+++ + +L H+ + + Sbjct: 206 AKDKRLIASVDVYESDFGLVRVIANRWMPAD-----KVFILEKGRWKVAYLDPFHQ-EEL 259 Query: 125 AKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 +K GD + V++GE L+ R EKA ++ D+ Sbjct: 260 SKVGDRKERVVVGELTLEGRAEKANAIITDI 290 >gi|291334838|gb|ADD94478.1| putative phage major head protein [uncultured phage MedDCM-OCT-S06-C1041] Length = 323 Score = 158 bits (400), Expect = 2e-37, Method: Composition-based stats. Identities = 42/152 (27%), Positives = 65/152 (42%), Gaps = 7/152 (4%) Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61 T GT TF + L V+++ + NG + I+ + K F Sbjct: 174 TTDPTAGTAATFVEADLKSVVKQCWDNGGDPGVIMTGSFNKQAASGFAGIGTQF-RDVQP 232 Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121 SG +IV ADIY FG+ + NR M A N + +D +LR I Sbjct: 233 SGPVSPGSIVGAADIYISDFGQHQIVANRFMP-----AANVYALDMEYWCVAYLRPIQ-T 286 Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVA 153 +++ K GD+++ +LI E L +NE A G + Sbjct: 287 EDLGKTGDSDRRMLISEYTLGSKNEAASGKIY 318 >gi|291334638|gb|ADD94286.1| putative phage major head protein [uncultured phage MedDCM-OCT-S04-C64] Length = 323 Score = 157 bits (397), Expect = 4e-37, Method: Composition-based stats. Identities = 41/152 (26%), Positives = 65/152 (42%), Gaps = 7/152 (4%) Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61 + GT TF + L V+++ + NG + I+ + K F Sbjct: 174 STDPTAGTAATFVEADLKSVVKQCWDNGGDPGVIMTGSFNKQAASGFAGIGTQF-RDVQP 232 Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121 SG +IV ADIY FG+ + NR M A N + +D +LR I Sbjct: 233 SGPVSPGSIVGAADIYISDFGQHQIVANRFMP-----AANVYALDMEYWCVAYLRPIQ-T 286 Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVA 153 +++ K GD+++ +LI E L +NE A G + Sbjct: 287 EDLGKTGDSDRRMLISEYTLGSKNEAASGKIY 318 >gi|221199511|ref|ZP_03572555.1| major head protein [Burkholderia multivorans CGD2M] gi|221205587|ref|ZP_03578602.1| major head protein [Burkholderia multivorans CGD2] gi|221174425|gb|EEE06857.1| major head protein [Burkholderia multivorans CGD2] gi|221180796|gb|EEE13199.1| major head protein [Burkholderia multivorans CGD2M] Length = 317 Score = 155 bits (391), Expect = 2e-36, Method: Composition-based stats. Identities = 46/153 (30%), Positives = 72/153 (47%), Gaps = 6/153 (3%) Query: 3 KKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATS 62 GT R F++ +L E ++ Y+ G N + + P K F + A+ + + Sbjct: 169 TPPVAGTARPFTEDLLKEALRLAYEAGGNVTQVHMRPVDKQRASGFAGN---ATRMESVN 225 Query: 63 GKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDK 122 G + + IY FG V + PNRVM++ +TA + IDP+M LR E + Sbjct: 226 GTGKTAVLQTSYSIYASDFGNVAMIPNRVMSAVNDTA--VYAIDPSMWALATLRGF-EKE 282 Query: 123 NIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 +AK GDA + EG L+ RNE + + DL Sbjct: 283 ELAKVGDARNWQITWEGTLEARNEASSAQIRDL 315 >gi|283856246|ref|YP_162122.2| putative phage major head protein [Zymomonas mobilis subsp. mobilis ZM4] gi|283775241|gb|AAV89011.2| putative phage major head protein [Zymomonas mobilis subsp. mobilis ZM4] Length = 304 Score = 153 bits (387), Expect = 6e-36, Method: Composition-based stats. Identities = 47/154 (30%), Positives = 73/154 (47%), Gaps = 11/154 (7%) Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61 T A G R +++ +L EV+ + + +G +I S +K F +AS R T Sbjct: 159 TTAATTGKARLYTEALLKEVLGDIWVSGGQPNMVITSLKLKQTAAAF---PGLASNRRDT 215 Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121 I+A ADIY G+V P+R +++ A +IDP L I + Sbjct: 216 GD--QKARIIAGADIYVSDVGEVQFVPDRFCDNSS-----ALVIDPEYWSVATLDPIQK- 267 Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 +++A G A++ L E AL+ NE A GV+ADL Sbjct: 268 RSLATTGLADRDALYTEIALRCHNEAASGVLADL 301 >gi|241760939|ref|ZP_04759028.1| putative phage major head protein [Zymomonas mobilis subsp. mobilis ATCC 10988] gi|241374558|gb|EER64019.1| putative phage major head protein [Zymomonas mobilis subsp. mobilis ATCC 10988] Length = 238 Score = 153 bits (385), Expect = 1e-35, Method: Composition-based stats. Identities = 46/154 (29%), Positives = 72/154 (46%), Gaps = 11/154 (7%) Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61 T A G R +++ +L EV+ + + +G +I S +K F +AS R T Sbjct: 93 TTAATAGKARLYTEALLKEVLGDIWVSGGQPNMVITSLKLKQTAAAF---PGLASNRRDT 149 Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121 I+A ADIY G+V P+R +++ A +IDP L I + Sbjct: 150 GD--QKARIIAGADIYVSDVGEVQFVPDRFCDNSS-----ALVIDPEYWSVATLDPIQK- 201 Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 +++A G A++ L E AL+ NE GV+ADL Sbjct: 202 RSLATTGLADRDALYTEIALRCHNEATSGVLADL 235 >gi|288817864|ref|YP_003432211.1| putative phage major head protein [Hydrogenobacter thermophilus TK-6] gi|288787263|dbj|BAI69010.1| putative phage major head protein [Hydrogenobacter thermophilus TK-6] gi|308751463|gb|ADO44946.1| putative phage major head protein [Hydrogenobacter thermophilus TK-6] Length = 291 Score = 151 bits (381), Expect = 3e-35, Method: Composition-based stats. Identities = 40/151 (26%), Positives = 76/151 (50%), Gaps = 13/151 (8%) Query: 5 AKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGK 64 A G+ R ++ +L++ +Q+ +Q G N ++VS K F + + Sbjct: 153 ANGGSPRALTETLLNDGIQQAWQAGGNPDVVVVSGKNKRTVSSFTAGVTK-------TVA 205 Query: 65 SHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNI 124 + + +V+T D+Y+ FG V + +R M + F+++ + +LR + + I Sbjct: 206 AEDKKLVSTVDVYESDFGLVRIVADRWMPDS-----KVFILEKGRFKIAYLRPFKQ-QEI 259 Query: 125 AKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 AK GD + V++GE L+VR EKA ++AD+ Sbjct: 260 AKVGDRIERVVVGELTLEVRAEKANAIIADI 290 >gi|167600435|ref|YP_001671935.1| major head protein [Pseudomonas phage LUZ24] gi|161168298|emb|CAP45463.1| major head protein [Pseudomonas phage LUZ24] Length = 317 Score = 149 bits (376), Expect = 1e-34, Method: Composition-based stats. Identities = 39/150 (26%), Positives = 66/150 (44%), Gaps = 11/150 (7%) Query: 6 KDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKS 65 G R ++ +L + ++NG I S +K + N+ + + Sbjct: 175 TAGDLRLLTEDMLLNASEAIWRNGGQANSIQTSSSIKKAISK-----NMKGRATEITLDA 229 Query: 66 HNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIA 125 +N I T D+Y+ FGK + NR N F+ DP M +LR + + +A Sbjct: 230 SDNRIAQTVDVYESDFGKYTIRANRWFHENT-----MFIFDPKMHALCYLRPFFQHE-LA 283 Query: 126 KNGDANKGVLIGEGALKVRNEKAVGVVADL 155 K GD+ K L+ E L+V NEK+ ++ D+ Sbjct: 284 KTGDSEKRQLLVEYTLRVNNEKSGALIRDV 313 >gi|148257053|ref|YP_001241638.1| putative phage major head protein [Bradyrhizobium sp. BTAi1] gi|146409226|gb|ABQ37732.1| putative phage Major head protein [Bradyrhizobium sp. BTAi1] Length = 320 Score = 147 bits (371), Expect = 4e-34, Method: Composition-based stats. Identities = 42/154 (27%), Positives = 70/154 (45%), Gaps = 13/154 (8%) Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61 T DGTQ F++ L V+ + NG I+ + K F F S Sbjct: 175 TGTRTDGTQIAFTEARLKTVLSSIWINGGKPGTILTGAFNKQVFSTFTGRSTAIE----- 229 Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHE- 120 ++ + IVA+ D Y+ FGK+ V +R + AR+ +++ + +L + Sbjct: 230 --EAKSKKIVASVDAYESDFGKLKVVASRF-----QRARDVLVLEMDKWAVAYLNGRNMI 282 Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVAD 154 +AK GD+++ ++ E AL RNEKA G + D Sbjct: 283 SIPLAKTGDSDRRQILAEYALVARNEKASGGIFD 316 >gi|27476049|ref|NP_775251.1| major head protein [Pseudomonas phage PaP3] gi|27414479|gb|AAL85565.1| major head protein [Pseudomonas phage PaP3] Length = 317 Score = 147 bits (370), Expect = 6e-34, Method: Composition-based stats. Identities = 38/150 (25%), Positives = 65/150 (43%), Gaps = 11/150 (7%) Query: 6 KDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKS 65 G R ++ +L + ++NG I S +K + N+ + + Sbjct: 175 TAGDLRLLTEDMLLNASESIWRNGGQANSIQTSSSIKKAISK-----NMKGRATEITLDA 229 Query: 66 HNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIA 125 +N I T D+Y+ FGK + NR N F+ DP M +LR + + +A Sbjct: 230 SDNRIAQTVDVYESDFGKYTIRANRWFHENT-----LFVFDPKMHSLCYLRPFFQHE-LA 283 Query: 126 KNGDANKGVLIGEGALKVRNEKAVGVVADL 155 K GD+ K L+ E +V NEK+ ++ D+ Sbjct: 284 KTGDSEKRQLLVEYTFRVNNEKSGALIRDV 313 >gi|227821704|ref|YP_002825674.1| putative phage related protein [Sinorhizobium fredii NGR234] gi|227340703|gb|ACP24921.1| putative phage related protein [Sinorhizobium fredii NGR234] Length = 123 Score = 146 bits (369), Expect = 8e-34, Method: Composition-based stats. Identities = 77/112 (68%), Positives = 86/112 (76%) Query: 25 GYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATADIYDRPFGKV 84 GYQNG NFRH+ SPYVKS FV FMSDSNVA FRYA S NTIVATAD Y+ PFG V Sbjct: 6 GYQNGANFRHVSASPYVKSVFVTFMSDSNVAPFRYAVSKGGERNTIVATADYYEGPFGTV 65 Query: 85 MVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDANKGVLI 136 M+HPNRV A+ A ARNAF ID +ML FLWLR I EDK++AK GDA+KGV+ Sbjct: 66 MIHPNRVQATGATQARNAFFIDTDMLSFLWLRKIQEDKDVAKTGDADKGVIT 117 >gi|291334405|gb|ADD94061.1| major head protein [uncultured phage MedDCM-OCT-S01-C1] Length = 344 Score = 136 bits (341), Expect = 1e-30, Method: Composition-based stats. Identities = 41/159 (25%), Positives = 71/159 (44%), Gaps = 6/159 (3%) Query: 1 MTKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYA 60 +T K + S Q + + +Q Y+ G H++ +P V +M D N A Sbjct: 180 LTVKYTEDDGVALSFQAVKDAIQGVYEQGGEVTHLMSNPGVIGALSSYMFD-NEARVATL 238 Query: 61 TSGKS----HNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLR 116 TS + T +++ ++ FG + + PNR+ A AFL+DP + +L Sbjct: 239 TSDQGAPANSKATALSSVNVLVSDFGTIKLVPNRLQPLAANGTAVAFLLDPEYVSLSYLE 298 Query: 117 TIHEDKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 D +AK G A K + + L+V EKA G++ ++ Sbjct: 299 GYRTD-TLAKTGLAEKRQISVDWGLRVHTEKAHGMLVNI 336 >gi|294083943|ref|YP_003550700.1| major head protein [Candidatus Puniceispirillum marinum IMCC1322] gi|292663515|gb|ADE38616.1| major head protein [Candidatus Puniceispirillum marinum IMCC1322] Length = 133 Score = 126 bits (316), Expect = 1e-27, Method: Composition-based stats. Identities = 32/142 (22%), Positives = 57/142 (40%), Gaps = 11/142 (7%) Query: 14 SKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVAT 73 ++ L + Q Y G + ++ P F S + T+V Sbjct: 2 TEAKLLDAHQAAYTAGADVSVFMIKPSDSEIVAGFTGSSGRTRNFNDGT-----TTLVNA 56 Query: 74 ADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDANKG 133 D+ P+G + V NR + +AFL+DP+M LR + ++K GD++K Sbjct: 57 VDVMINPYGTLKVVLNR-----HQLTTHAFLLDPSMWRSAVLRPVSRTL-LSKTGDSDKH 110 Query: 134 VLIGEGALKVRNEKAVGVVADL 155 + EG L+ N+ ++ L Sbjct: 111 FCVYEGGLQHLNQSGSAMITGL 132 >gi|316934287|ref|YP_004109269.1| putative phage major head protein [Rhodopseudomonas palustris DX-1] gi|315602001|gb|ADU44536.1| putative phage major head protein [Rhodopseudomonas palustris DX-1] Length = 304 Score = 91.4 bits (225), Expect = 4e-17, Method: Composition-based stats. Identities = 35/151 (23%), Positives = 62/151 (41%), Gaps = 11/151 (7%) Query: 6 KDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKS 65 T ++ + Q Y G +V+P F F + + S Sbjct: 159 TGATGTKITEANYLDAAQALYDAGSEASITLVTPTNSRTFAGFTGAAGRSRVINDDS--- 215 Query: 66 HNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHE-DKNI 124 IV ++Y PFG+ + +R A + L DP M + L+ + + + Sbjct: 216 --KKIVNAVNLYVSPFGEEKIVLSR-----HLKAGDTLLFDPAMWARVILQGRNWFRETL 268 Query: 125 AKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 AK GD+ K +++GE LK +N+KA +V ++ Sbjct: 269 AKTGDSLKMMIVGEFGLKHKNQKASALVREM 299 >gi|307308932|ref|ZP_07588615.1| putative phage major head protein [Sinorhizobium meliloti BL225C] gi|306900566|gb|EFN31179.1| putative phage major head protein [Sinorhizobium meliloti BL225C] Length = 309 Score = 82.2 bits (201), Expect = 2e-14, Method: Composition-based stats. Identities = 35/155 (22%), Positives = 60/155 (38%), Gaps = 13/155 (8%) Query: 1 MTKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYA 60 + DGT R S ++ E+ + + G + ++V P K V + + Sbjct: 167 LVGAVTDGTARALSHALIVEMAEGIHSEGGVPKILLVPPAQKKNIVALSNGTTKFQ---- 222 Query: 61 TSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHE 120 + T+ +Y+ PFG+ + +R + S+ DP + R + Sbjct: 223 ---DASKKTVFGDVTMYETPFGRFDIVTSRDVRSSTVIG-----YDPELWAQAVFRGL-T 273 Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 K + + GD +I E L RNE G VADL Sbjct: 274 KKKLPEGGDYEGYQVITEVTLVCRNEAGNGKVADL 308 >gi|226305754|ref|YP_002765714.1| hypothetical protein RER_22670 [Rhodococcus erythropolis PR4] gi|226184871|dbj|BAH32975.1| hypothetical protein RER_22670 [Rhodococcus erythropolis PR4] Length = 317 Score = 74.1 bits (180), Expect = 6e-12, Method: Composition-based stats. Identities = 35/155 (22%), Positives = 57/155 (36%), Gaps = 12/155 (7%) Query: 6 KDGTQRTFSKQILDEVMQEGYQNGD----NFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61 +GT ++ +L ++MQ ++NG R I+V K + D Sbjct: 171 TNGTPAALTETMLLDLMQLAWENGGIQESETRTIVVGAKQKRALTKIFIDGKGYKESTRN 230 Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121 G + TI + FG+ + NR M + + P LE + Sbjct: 231 VGGVNLQTI-------ETDFGRCNIMLNRYMPAAQLAVVSLEECAPRFLEIPG-KGHFFA 282 Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVADLF 156 +++AK G + K L GE LK NE + L Sbjct: 283 ESLAKTGASEKVQLYGEIGLKYGNELKHAKLTGLL 317 >gi|256377352|ref|YP_003101012.1| hypothetical protein Amir_3259 [Actinosynnema mirum DSM 43827] gi|255921655|gb|ACU37166.1| hypothetical protein Amir_3259 [Actinosynnema mirum DSM 43827] Length = 329 Score = 71.4 bits (173), Expect = 4e-11, Method: Composition-based stats. Identities = 30/159 (18%), Positives = 61/159 (38%), Gaps = 14/159 (8%) Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGD----NFRHIIVSPYVKSEFV-RFMSDSNVAS 56 T + T ++ ++ +++Q+ ++NG ++ + K F++ N Sbjct: 173 TNVITNSTPTPLTESMVIDLLQKVWENGGIQISETATLMCGAWQKRRLTDEFVTRKNYQE 232 Query: 57 FRYATSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLR 116 G + + FG+V + NR M ++A + P MLE Sbjct: 233 QTRNVGG--------VSVSTIETDFGRVNIMLNRHMPADAVQVVSLEQCAPVMLEKPG-S 283 Query: 117 TIHEDKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 + +AK G +++ + GE L+ E A G + L Sbjct: 284 GFLFSEPLAKTGASDRVHIYGEIGLEYGPEIAHGKITGL 322 >gi|290457630|sp|P85987|CAPSD_BPSK1 RecName: Full=Major capsid protein; AltName: Full=Virion protein G gi|221271431|dbj|BAH15184.1| major capsid protein [Serratia phage KSP100] Length = 306 Score = 69.1 bits (167), Expect = 2e-10, Method: Composition-based stats. Identities = 28/141 (19%), Positives = 54/141 (38%), Gaps = 12/141 (8%) Query: 13 FSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVA 72 + L E+ + Y G + I+ P + F S + ++ V Sbjct: 160 LDEATLFEMTAKLYTEGADPTLIMYHPSNANFFASLQEKSGTRMRIFE-----NDKRFVK 214 Query: 73 TADIYDRPFG-KVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDAN 131 + P G ++ PNR +A ++ +P+ L LR + +AK+G A Sbjct: 215 QVEYIVDPLGQELKCIPNRWCPEDAT-----YIFNPSDLGMAVLRA-PKKVALAKSGSAE 268 Query: 132 KGVLIGEGALKVRNEKAVGVV 152 K ++ E ++ N KA ++ Sbjct: 269 KYMIEQEVGFRLNNPKAAALI 289 >gi|228910960|ref|ZP_04074768.1| hypothetical protein bthur0013_51010 [Bacillus thuringiensis IBL 200] gi|228848615|gb|EEM93461.1| hypothetical protein bthur0013_51010 [Bacillus thuringiensis IBL 200] Length = 363 Score = 67.1 bits (162), Expect = 7e-10, Method: Composition-based stats. Identities = 37/134 (27%), Positives = 52/134 (38%), Gaps = 15/134 (11%) Query: 23 QEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATADIYDRPFG 82 + GY G N++ IIV K F D+ + +G+ D Y FG Sbjct: 242 KGGYTVGGNYK-IIVGATQKIAISAF--DNALIRLSREDNGRGQ------VVDHYVSDFG 292 Query: 83 KVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTI-HEDKNIAKNGDANKGVLIGEGAL 141 + N M A +ID N L LRT + + K GD KG+L+GE L Sbjct: 293 NAEILLNNNMH-----AGEILVIDANRLSIRPLRTREFAHEFLGKKGDYMKGMLVGEYTL 347 Query: 142 KVRNEKAVGVVADL 155 + E A + L Sbjct: 348 EFLQEAAHAKITGL 361 >gi|167583566|ref|YP_001671756.1| major head protein [Enterobacteria phage phiEco32] gi|164375404|gb|ABY52812.1| major head protein [Enterobacteria phage phiEco32] Length = 352 Score = 66.0 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 28/155 (18%), Positives = 56/155 (36%), Gaps = 11/155 (7%) Query: 4 KAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSG 63 + T F + + ++ + Y G I+++P F ++ + R + Sbjct: 204 ASNPTTNIGFDEADIFDMTLQLYTAGSEADIIMINPAHAKIFAGLQENTQGSRKRIFEN- 262 Query: 64 KSHNNTIVATADIYDRPFGK-VMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDK 122 + + P G+ + NR M ++A + + LR + Sbjct: 263 ---TKQFIYEVNSITDPLGQSYKIIVNRWMPTDA-----VYFFRSADWTQMVLRAPKRTE 314 Query: 123 NIAKNGDANKGVLIGEGALKVRNEKAVGVVADLFG 157 +AK+G K ++ E L+ RN A GV+ G Sbjct: 315 -LAKDGSYEKWMIEMEVGLRHRNPYASGVLFTAAG 348 >gi|261337429|ref|ZP_05965313.1| conserved hypothetical protein [Bifidobacterium gallicum DSM 20093] gi|270277816|gb|EFA23670.1| conserved hypothetical protein [Bifidobacterium gallicum DSM 20093] Length = 160 Score = 65.6 bits (158), Expect = 2e-09, Method: Composition-based stats. Identities = 31/154 (20%), Positives = 59/154 (38%), Gaps = 22/154 (14%) Query: 9 TQRTFSKQILDEVMQEGYQNGD----NFRHIIVSPYVKSEFVR-FMSDSNVASFRYATSG 63 +T S+ + ++MQ + NG R I+V+ +K R F+ D+ G Sbjct: 6 AAKTLSEDDVLDLMQMAWDNGGIREAETRTIVVNSDLKRALTRIFIKDAGYKEETRNVGG 65 Query: 64 KSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLR----TIH 119 + + FG+ + + ++ ++ + L +L Sbjct: 66 --------VSLQTIETDFGRCNIMLDSLV-----LKDKMLVLSLDQLAPRFLEIPGKGHF 112 Query: 120 EDKNIAKNGDANKGVLIGEGALKVRNEKAVGVVA 153 + +AK G ++K L GE L+ NEKA V+ Sbjct: 113 FVEPLAKTGASDKVQLYGEIGLEYGNEKAHAVLT 146 >gi|256751057|ref|ZP_05491940.1| conserved hypothetical protein [Thermoanaerobacter ethanolicus CCSD1] gi|256750167|gb|EEU63188.1| conserved hypothetical protein [Thermoanaerobacter ethanolicus CCSD1] Length = 292 Score = 64.1 bits (154), Expect = 6e-09, Method: Composition-based stats. Identities = 32/142 (22%), Positives = 56/142 (39%), Gaps = 16/142 (11%) Query: 12 TFSKQILDEVMQEGYQNG--DNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNT 69 F + +++ + NG N +V+ +K F +S + Sbjct: 160 AFDEAQFKATVKKLWDNGLGSNQYIALVNADIKEIIDDFYKNST-----RYIGNVNEYGV 214 Query: 70 IVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGD 129 +V + +G V + NR + A + DP + +LRT ++ +AK GD Sbjct: 215 VVDKVN---TNYGVVNLVLNRHVP-----AGKMIVFDPAYVRLAFLRT-PSNEILAKTGD 265 Query: 130 ANKGVLIGEGALKVRNEKAVGV 151 G +I E LKV N KA+ + Sbjct: 266 YISGQVITELTLKVLNAKALAL 287 >gi|196048420|ref|ZP_03115595.1| conserved hypothetical protein [Bacillus cereus 03BB108] gi|196020677|gb|EDX59409.1| conserved hypothetical protein [Bacillus cereus 03BB108] Length = 312 Score = 63.3 bits (152), Expect = 1e-08, Method: Composition-based stats. Identities = 38/153 (24%), Positives = 68/153 (44%), Gaps = 11/153 (7%) Query: 5 AKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGK 64 KDG S L + + ++ G ++ +EF+ + A G Sbjct: 159 PKDGK---LSADDLIKAFRLCWEKGAGGDKLVQCGSQVAEFIDKLFKVEKGVMIPAIQGG 215 Query: 65 SHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNI 124 N I TAD+ +G+ + NR M A T ++D N ++ LR + + + Sbjct: 216 G--NIIGLTADVIHTRYGRGNIVLNRHMPDGALT-----IVDLNQVKLRPLRKML-AEPL 267 Query: 125 AKNGDANKGVLIGEGALKVRNEKAVGVVADLFG 157 AKNGD+ K +++GE +L+++N A V+ + G Sbjct: 268 AKNGDSQKRMIVGEYSLELKNSYAGAVINGITG 300 >gi|302389556|ref|YP_003825377.1| hypothetical protein Toce_0992 [Thermosediminibacter oceani DSM 16646] gi|302200184|gb|ADL07754.1| conserved hypothetical protein [Thermosediminibacter oceani DSM 16646] Length = 294 Score = 61.7 bits (148), Expect = 3e-08, Method: Composition-based stats. Identities = 30/146 (20%), Positives = 54/146 (36%), Gaps = 16/146 (10%) Query: 9 TQRTFSKQILDEVMQEGYQNGDNFR--HIIVSPYVKSEFVRFMSDSNVASFRYATSGKSH 66 T + +++ ++NG +V+ +K + +Y Sbjct: 157 TGANLDEANFKATVKKLWENGLGINEYVALVNADLKEAIDA------IYDAKYQYIAPMQ 210 Query: 67 NNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLR-TIHEDKNIA 125 +N +G V + NR M + + DP L LR + E +A Sbjct: 211 DNYFGLVVRRVQTNYGNVNIILNRHMPVD-----KMVVFDPAYLRIANLREPVFEP--LA 263 Query: 126 KNGDANKGVLIGEGALKVRNEKAVGV 151 K GD+ KG ++ E LKV + KA+ + Sbjct: 264 KTGDSVKGQVVAELTLKVLSPKAIAM 289 >gi|257458669|ref|ZP_05623796.1| conserved hypothetical protein [Campylobacter gracilis RM3268] gi|257443942|gb|EEV19058.1| conserved hypothetical protein [Campylobacter gracilis RM3268] Length = 328 Score = 61.0 bits (146), Expect = 6e-08, Method: Composition-based stats. Identities = 28/149 (18%), Positives = 55/149 (36%), Gaps = 12/149 (8%) Query: 7 DGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSH 66 GT ++ L++++Q + G R + + +K F AT + Sbjct: 189 TGTPTVLTEDALNKILQNIWDAGATPRDVFIGAKLKKAINAF-----------ATRQFGN 237 Query: 67 NNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAK 126 +I ++ D FGKV +R ++ + D + ++ L +N+ Sbjct: 238 EKSINSSVVSLDTDFGKVNFRLHRYLSEQNKLDDVLIAGDFSFMKNGLLIP-TMIENVTT 296 Query: 127 NGDANKGVLIGEGALKVRNEKAVGVVADL 155 + A + E L+VRN A + L Sbjct: 297 SKTAKQKRYYTEATLEVRNADAFAIGVGL 325 >gi|114566839|ref|YP_753993.1| hypothetical protein Swol_1314 [Syntrophomonas wolfei subsp. wolfei str. Goettingen] gi|114337774|gb|ABI68622.1| hypothetical protein Swol_1314 [Syntrophomonas wolfei subsp. wolfei str. Goettingen] Length = 398 Score = 61.0 bits (146), Expect = 6e-08, Method: Composition-based stats. Identities = 32/142 (22%), Positives = 66/142 (46%), Gaps = 16/142 (11%) Query: 12 TFSKQILDEVMQEGYQNGDNFRHI--IVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNT 69 T ++ + E +++ + G + +V+ +K + D+ A+ + +S Sbjct: 160 TITEPKIKETVKKLWDAGMSSGTYIAMVNADLKEDI-----DALYANKYSYIAQESLFGL 214 Query: 70 IVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGD 129 IV++ +G + + NR M ++ + DP ++ +LR H + +AK GD Sbjct: 215 IVSSI---QTNYGTIKLVLNRHMPTD-----KMVVFDPAFVKISYLRQPHFEL-LAKTGD 265 Query: 130 ANKGVLIGEGALKVRNEKAVGV 151 + G +I E LK+ N+KA+ V Sbjct: 266 SINGHVIAEMTLKMLNQKAIAV 287 >gi|150021335|ref|YP_001306689.1| hypothetical protein Tmel_1457 [Thermosipho melanesiensis BI429] gi|149793856|gb|ABR31304.1| hypothetical protein Tmel_1457 [Thermosipho melanesiensis BI429] Length = 362 Score = 59.8 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 29/143 (20%), Positives = 57/143 (39%), Gaps = 14/143 (9%) Query: 12 TFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIV 71 T+S+ ++ Y NG ++ K F SD + + T+G+ Sbjct: 231 TWSEDNFRAFLKLIYDNGGYITEAWMNASTKQYFNSLNSDKLIVTQDERTAGRL------ 284 Query: 72 ATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED-KNIAKNGDA 130 D Y +G++ + + + ++D + L+ L + + +AK GD+ Sbjct: 285 --VDGYLSEYGQISLKTSPHIPEGM-----IIVVDTSNLKIKPLSGRNMAYEPLAKTGDS 337 Query: 131 NKGVLIGEGALKVRNEKAVGVVA 153 KG ++GE L+ RN G+ Sbjct: 338 VKGQIVGEYTLEFRNPDGAGIFY 360 >gi|257451764|ref|ZP_05617063.1| hypothetical protein F3_01776 [Fusobacterium sp. 3_1_5R] gi|317058321|ref|ZP_07922806.1| predicted protein [Fusobacterium sp. 3_1_5R] gi|313683997|gb|EFS20832.1| predicted protein [Fusobacterium sp. 3_1_5R] Length = 371 Score = 58.7 bits (140), Expect = 3e-07, Method: Composition-based stats. Identities = 34/171 (19%), Positives = 71/171 (41%), Gaps = 31/171 (18%) Query: 3 KKAKDGTQRTFSKQILDE-------------VMQEGYQNGDNFR----HIIVSPYVKSEF 45 K+ DG +R +K L + V+++ ++ G + + V K + Sbjct: 212 KRGMDGVKRFLAKGQLVDAGGQEISLEIIGNVLRKIFEAGGDVNGGNYALYVPGVQKVKI 271 Query: 46 VRFMSDSNVASFRYATSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLI 105 + + D A NT+ A A+ FG + + P + A ++ Sbjct: 272 SKLLKDYIQAP--------PSENTLGAVANYVATDFGTLPIIP-----TVNLRADEMMIL 318 Query: 106 DPNMLEFLWLRTIH-EDKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 + + + LRT + + + K GD KG+++ E +++VRN +G++ +L Sbjct: 319 NHDDITLQVLRTRELQHEYMGKTGDNTKGLIVTELSVEVRNIPTMGMIMNL 369 >gi|229187822|ref|ZP_04314947.1| hypothetical protein bcere0004_53480 [Bacillus cereus BGSC 6E1] gi|228595657|gb|EEK53352.1| hypothetical protein bcere0004_53480 [Bacillus cereus BGSC 6E1] Length = 313 Score = 58.3 bits (139), Expect = 4e-07, Method: Composition-based stats. Identities = 31/156 (19%), Positives = 69/156 (44%), Gaps = 11/156 (7%) Query: 5 AKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGK 64 G S + + + + ++ G ++ +EF+ + V + + Sbjct: 159 PTSGK---LSAEDVTKAFRLCWEKGAGGDKLVQCGSAVAEFIDKLFK--VDKGVVIPAIQ 213 Query: 65 SHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNI 124 N I T D+ +G+ + NR M A T ++D N + LR + + + Sbjct: 214 GGGNIIGITVDVIHTRYGRGNIVLNRHMPDGALT-----IVDLNQTKLRPLRKMA-AQKL 267 Query: 125 AKNGDANKGVLIGEGALKVRNEKAVGVVADLFGLSK 160 +K GD++K +++GE +L+++N A V++ + G+ + Sbjct: 268 SKIGDSDKQMIVGEYSLELKNSYAGAVISGITGIVE 303 >gi|152975085|ref|YP_001374602.1| hypothetical protein Bcer98_1285 [Bacillus cereus subsp. cytotoxis NVH 391-98] gi|152023837|gb|ABS21607.1| conserved hypothetical protein [Bacillus cytotoxicus NVH 391-98] Length = 293 Score = 57.5 bits (137), Expect = 6e-07, Method: Composition-based stats. Identities = 31/148 (20%), Positives = 66/148 (44%), Gaps = 16/148 (10%) Query: 6 KDGTQRTFSKQILDEVMQEGYQNGDNFRHI--IVSPYVKSEFVRFMSDSNVASFRYATSG 63 T+ +++ + E++++ + G+ +V +K + D RY+ Sbjct: 154 NGVTKDVLTEKEVKELVKKLWTAGNENGEFYALVGADLKDQIDELYKD------RYSYQH 207 Query: 64 KSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKN 123 + + I T D + +GK+ +R M ++ D N ++ +LR + + Sbjct: 208 VTTDFGI--TVDSVNTSYGKINFILDRYMPAD-----KIVAFDVNAIKVAFLRQ-PQFQA 259 Query: 124 IAKNGDANKGVLIGEGALKVRNEKAVGV 151 + K GD +KG ++ E L+V ++KAV V Sbjct: 260 LGKTGDNDKGQVVAEATLEVGSKKAVAV 287 >gi|229162523|ref|ZP_04290484.1| hypothetical protein bcere0009_32950 [Bacillus cereus R309803] gi|228621002|gb|EEK77867.1| hypothetical protein bcere0009_32950 [Bacillus cereus R309803] Length = 293 Score = 57.5 bits (137), Expect = 7e-07, Method: Composition-based stats. Identities = 32/148 (21%), Positives = 64/148 (43%), Gaps = 16/148 (10%) Query: 6 KDGTQRTFSKQILDEVMQEGYQNGDNFRHI--IVSPYVKSEFVRFMSDSNVASFRYATSG 63 T ++ + E++++ + G+ +V +K + D RY+ Sbjct: 154 NGATANVLQEKEVKELVKKLWTAGNENGEFYALVGADIKDQIDELYKD------RYSYQH 207 Query: 64 KSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKN 123 +++ IV + + +GKV +R M ++ D N L+ +LR + + Sbjct: 208 VTNDFGIV--VNSVETSYGKVNFILDRYMPAD-----KIVAFDINALKVAFLRQ-PQFEA 259 Query: 124 IAKNGDANKGVLIGEGALKVRNEKAVGV 151 + K GD KG ++ E L+V ++KAV V Sbjct: 260 LGKTGDNIKGQVVAEATLEVGSKKAVAV 287 >gi|281417131|ref|ZP_06248151.1| conserved hypothetical protein [Clostridium thermocellum JW20] gi|281408533|gb|EFB38791.1| conserved hypothetical protein [Clostridium thermocellum JW20] Length = 292 Score = 56.7 bits (135), Expect = 9e-07, Method: Composition-based stats. Identities = 36/141 (25%), Positives = 57/141 (40%), Gaps = 18/141 (12%) Query: 14 SKQILDEVMQEGYQNG---DNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTI 70 ++ V++ + G F V+ K F ++ + T G N I Sbjct: 165 TQNDFKAVVKRLWDCGLGSGEFYCF-VNADFKEIIDGFYANQINYNMPMNTFGFVANKII 223 Query: 71 VATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDA 130 +G V V NR M + ++DP L ++LR + +AK+GD Sbjct: 224 --------TNYGIVNVVLNRHMPVD-----KILVVDPTYLRLVYLRK-PSFEMLAKDGDN 269 Query: 131 NKGVLIGEGALKVRNEKAVGV 151 KG++I E LK+ N KAV V Sbjct: 270 LKGMVITELTLKLLNSKAVAV 290 >gi|217961109|ref|YP_002339677.1| hypothetical protein BCAH187_A3735 [Bacillus cereus AH187] gi|229140327|ref|ZP_04268882.1| hypothetical protein bcere0013_34260 [Bacillus cereus BDRD-ST26] gi|217064163|gb|ACJ78413.1| conserved hypothetical protein [Bacillus cereus AH187] gi|228642888|gb|EEK99164.1| hypothetical protein bcere0013_34260 [Bacillus cereus BDRD-ST26] Length = 293 Score = 56.4 bits (134), Expect = 1e-06, Method: Composition-based stats. Identities = 32/148 (21%), Positives = 66/148 (44%), Gaps = 16/148 (10%) Query: 6 KDGTQRTFSKQILDEVMQEGYQNGDNFRHI--IVSPYVKSEFVRFMSDSNVASFRYATSG 63 T+ +++ + E++++ + G+ +V +K + D RY+ Sbjct: 154 NGATKDVLTEKEIKELVKKLWTAGNENGEFYALVGADLKDQIDELYKD------RYSYQH 207 Query: 64 KSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKN 123 + + IV + + +GKV +R M ++ D N L+ +LR + + Sbjct: 208 VTTDFGIV--VNSVETSYGKVNFILDRYMPAD-----KIVAFDVNALKVAFLRQ-PQFEQ 259 Query: 124 IAKNGDANKGVLIGEGALKVRNEKAVGV 151 + K GD KG ++ E +L+V ++KAV V Sbjct: 260 LGKTGDNVKGQVVAECSLEVGSKKAVAV 287 >gi|331088860|ref|ZP_08337770.1| hypothetical protein HMPREF1025_01353 [Lachnospiraceae bacterium 3_1_46FAA] gi|330407383|gb|EGG86886.1| hypothetical protein HMPREF1025_01353 [Lachnospiraceae bacterium 3_1_46FAA] Length = 314 Score = 56.0 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 28/145 (19%), Positives = 53/145 (36%), Gaps = 9/145 (6%) Query: 9 TQRTFSKQILDEVMQEGYQNGDNFRHI-IVSPYVKSEFVRFMSDSNVASFRYATSGKSHN 67 T S+ + +Q+ + +G + V+ K N + R+ S Sbjct: 161 TAGALSEDHFLDALQKMWDHGAQGAYFAFVNATQKRAINALAKTGN--NVRWVLDNGSVT 218 Query: 68 NTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKN 127 N FG++ + +R M +N +D + + LR +++ K Sbjct: 219 NVYGIAVSKIVTDFGEISLVLDRYMDNNT-----ILTLDLDEVRIAELRGTF-YEDLPKA 272 Query: 128 GDANKGVLIGEGALKVRNEKAVGVV 152 GD KG +I E +K+ N A + Sbjct: 273 GDYFKGHVINESTIKLLNSYAGSKI 297 >gi|163937921|ref|YP_001642807.1| hypothetical protein BcerKBAB4_5338 [Bacillus weihenstephanensis KBAB4] gi|163865776|gb|ABY46832.1| hypothetical protein BcerKBAB4_5338 [Bacillus weihenstephanensis KBAB4] Length = 391 Score = 55.6 bits (132), Expect = 2e-06, Method: Composition-based stats. Identities = 31/148 (20%), Positives = 52/148 (35%), Gaps = 22/148 (14%) Query: 15 KQILDEVMQEGYQ-----NGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNT 69 +L + ++ +G NF+ IIV K +F A R Sbjct: 232 DDMLINAFRSIFEKGGMNSGGNFK-IIVGGTQKVAISQF----GNAQIRLDRLDNGRGQV 286 Query: 70 IVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEF--LWLRTIHEDKNIAKN 127 D Y FG + N N + ++D N + L R + + K Sbjct: 287 ----VDHYMSDFGAAEIVLN-----NNLRSDEILIVDSNRISIRPLTTREFS-HEFLGKK 336 Query: 128 GDANKGVLIGEGALKVRNEKAVGVVADL 155 GD KG+L+GE ++ E+A + + Sbjct: 337 GDYMKGMLVGEYTMEFLQEQAHARITGV 364 >gi|154174521|ref|YP_001409081.1| hypothetical protein CCV52592_0028 [Campylobacter curvus 525.92] gi|112803013|gb|EAU00357.1| conserved hypothetical protein [Campylobacter curvus 525.92] Length = 327 Score = 55.6 bits (132), Expect = 2e-06, Method: Composition-based stats. Identities = 26/148 (17%), Positives = 58/148 (39%), Gaps = 12/148 (8%) Query: 8 GTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHN 67 G ++ +L +++Q+ + +G + + + +K N + R + K+ N Sbjct: 190 GANTPLTEDVLSQILQQIWDSGATPKDVFIGAALKPAI-------NKLATRQFGNEKAIN 242 Query: 68 NTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKN 127 + +V+ D FGKV +R +++ A D + E +++ + Sbjct: 243 SRVVS----LDTDFGKVNFRMHRFLSAKYGLADTLIAGDFEFAKNGLFLP-TEIEDVPTS 297 Query: 128 GDANKGVLIGEGALKVRNEKAVGVVADL 155 A + E L++RN A + L Sbjct: 298 KTAKQKRYYTECCLEIRNPAAFAIGVGL 325 >gi|229037842|ref|ZP_04189640.1| hypothetical protein bcere0028_57530 [Bacillus cereus AH1271] gi|228727464|gb|EEL78642.1| hypothetical protein bcere0028_57530 [Bacillus cereus AH1271] Length = 315 Score = 53.7 bits (127), Expect = 8e-06, Method: Composition-based stats. Identities = 25/147 (17%), Positives = 55/147 (37%), Gaps = 18/147 (12%) Query: 13 FSKQILDEVMQEGYQNGDNFR--HIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTI 70 +K + +++++ NG +F + V+ + K G + + Sbjct: 180 LNKDLFNQLLRMMATNGADFNEMVLYVNAFNKQVVS-------------EIFGYAPQDRN 226 Query: 71 VATADI--YDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNG 128 V +I + F KV + +R + + + + ++ P + + + + +AK G Sbjct: 227 VGGVNIQQIETDFAKVGIVWDRFVPAGSIGLFDMSVVAPVLQNVPD-KGVLFYEELAKTG 285 Query: 129 DANKGVLIGEGALKVRNEKAVGVVADL 155 A KG + GE L G + L Sbjct: 286 AAEKGQIYGEIGLAHGPAFMHGSITGL 312 >gi|238909129|ref|YP_002939596.1| hypothetical protein EUBELI_10025 [Eubacterium eligens ATCC 27750] gi|238873366|gb|ACR73075.1| Hypothetical protein EUBELI_10025 [Eubacterium eligens ATCC 27750] Length = 304 Score = 53.3 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 24/146 (16%), Positives = 55/146 (37%), Gaps = 9/146 (6%) Query: 9 TQRTFSKQILDEVMQEGYQNGDNFRHII-VSPYVKSEFVRFMSDSNVASFRYATSGKSHN 67 T+ ++ + +Q+ + +G + V+ +K N + R+ S Sbjct: 161 TKGALTEDHFLDALQKMWDHGAQGEYFSFVNANIKRMINNLAKAGN--NVRFLGDNGSMQ 218 Query: 68 NTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKN 127 N + FG++ + +R + +D ++ LR +++ K Sbjct: 219 NVLGIGVQKIVTDFGEISLVLDRYADTKT-----ILTVDLGEVQIAELRGTF-YEDLPKA 272 Query: 128 GDANKGVLIGEGALKVRNEKAVGVVA 153 GD KG ++ E +K+ N A ++ Sbjct: 273 GDYFKGHVLNESTIKLLNSHAGSKIS 298 >gi|257463376|ref|ZP_05627772.1| hypothetical protein FuD12_05953 [Fusobacterium sp. D12] gi|317060946|ref|ZP_07925431.1| predicted protein [Fusobacterium sp. D12] gi|313686622|gb|EFS23457.1| predicted protein [Fusobacterium sp. D12] Length = 369 Score = 53.3 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 33/171 (19%), Positives = 69/171 (40%), Gaps = 31/171 (18%) Query: 2 TKKAKDGTQRTFSKQILDE-------------VMQEGYQNGDNFR----HIIVSPYVKSE 44 TK+ DG +R K L + V+++ ++ G + + V K + Sbjct: 211 TKRGMDGVKRFLVKGQLVDAGGQDISLEMIGNVLRKIFEVGGDVDGGNYALYVPGIQKVK 270 Query: 45 FVRFMSDSNVASFRYATSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFL 104 + + D A +T+ A A FG + + P + + Sbjct: 271 ISKLLKDYIQAP--------PAEHTLGAVATYIATDFGTLPIIP-----TTNLRGDEMMI 317 Query: 105 IDPNMLEFLWLRTIHEDKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 ++ + +E LR + K + + GD KG+++ E +++VRN +G++ +L Sbjct: 318 LNHDDIEAKVLRGLS-HKYMGETGDNTKGLIVTELSVQVRNIHTMGMIVNL 367 >gi|153951462|ref|YP_001398222.1| hypothetical protein JJD26997_1140 [Campylobacter jejuni subsp. doylei 269.97] gi|152938908|gb|ABS43649.1| hypothetical protein JJD26997_1140 [Campylobacter jejuni subsp. doylei 269.97] Length = 344 Score = 53.3 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 27/155 (17%), Positives = 58/155 (37%), Gaps = 14/155 (9%) Query: 8 GTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHN 67 GT ++ L+ ++Q + +G + + + +K +F AT + Sbjct: 191 GTSTELTEDKLNIILQTIWNSGVAPKDVFLGADLKGAINKF-----------ATRILGNE 239 Query: 68 NTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEF-LWLRTIHEDKNIAK 126 +V + FG V H +R++++ D ++ L++ T+ E+ Sbjct: 240 TKLVGQVVSLETDFGTVNFHMHRLLSAKYGLGDVLIAGDFEYMKHGLYIPTMIEEVTTDI 299 Query: 127 NGDANKGVLIGEGALKVRNEKAVGVVADLFGLSKT 161 A + + L+VRN A + L + T Sbjct: 300 TAKAKR--FYTQSTLEVRNADAFAIGVGLTSTNNT 332 >gi|57237589|ref|YP_178603.1| hypothetical protein CJE0587 [Campylobacter jejuni RM1221] gi|57166393|gb|AAW35172.1| hypothetical protein CJE0587 [Campylobacter jejuni RM1221] Length = 344 Score = 52.9 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 26/149 (17%), Positives = 55/149 (36%), Gaps = 14/149 (9%) Query: 8 GTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHN 67 GT ++ L++++Q + +G + + + +K +F AT + Sbjct: 191 GTATELTEDKLNQILQTIWNSGVTPKDVFLGADLKGAINKF-----------ATRILGNE 239 Query: 68 NTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEF-LWLRTIHEDKNIAK 126 + + FG V H +R+++ D ++ L++ T+ ED Sbjct: 240 TKLAGQVVSLETDFGTVNFHMHRLLSPKYGLGDVLIAGDFEYMKHGLYIPTMIEDVPTDI 299 Query: 127 NGDANKGVLIGEGALKVRNEKAVGVVADL 155 A + + L+VRN A + L Sbjct: 300 TAKAKR--FYTQSTLEVRNADAFAIGVGL 326 >gi|283956330|ref|ZP_06373810.1| hypothetical protein C1336_000250101 [Campylobacter jejuni subsp. jejuni 1336] gi|283792050|gb|EFC30839.1| hypothetical protein C1336_000250101 [Campylobacter jejuni subsp. jejuni 1336] Length = 344 Score = 52.9 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 26/149 (17%), Positives = 55/149 (36%), Gaps = 14/149 (9%) Query: 8 GTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHN 67 GT ++ L++++Q + +G + + + +K +F AT + Sbjct: 191 GTATELTEDKLNQILQTIWNSGVTPKDVFLGADLKGAINKF-----------ATRILGNE 239 Query: 68 NTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEF-LWLRTIHEDKNIAK 126 + + FG V H +R+++ D ++ L++ T+ ED Sbjct: 240 TKLAGQVVSLETDFGTVNFHMHRLLSPKYGLGDVLIAGDFEYMKHGLYIPTMIEDVPTDI 299 Query: 127 NGDANKGVLIGEGALKVRNEKAVGVVADL 155 A + + L+VRN A + L Sbjct: 300 TAKAKR--FYTQSTLEVRNADAFAIGVGL 326 >gi|315929828|gb|EFV08993.1| hypothetical protein CSS_0883 [Campylobacter jejuni subsp. jejuni 305] Length = 344 Score = 52.1 bits (123), Expect = 3e-05, Method: Composition-based stats. Identities = 25/149 (16%), Positives = 55/149 (36%), Gaps = 14/149 (9%) Query: 8 GTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHN 67 GT ++ L++++Q + +G + + + +K +F AT + Sbjct: 191 GTATELTEDKLNQILQTIWNSGVTPKDVFLGADLKGAINKF-----------ATRILGNE 239 Query: 68 NTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEF-LWLRTIHEDKNIAK 126 + + FG V H +R+++ D ++ L++ T+ ED Sbjct: 240 TKLAGQVVSLETDFGTVNFHMHRLLSPKYGLGDVLIAGDFEYMKHGLYIPTMIEDVPTDI 299 Query: 127 NGDANKGVLIGEGALKVRNEKAVGVVADL 155 A + + L++RN A + L Sbjct: 300 TAKAKR--FYTQSTLEIRNADAFAIGVGL 326 >gi|319789007|ref|YP_004150640.1| phage protein [Thermovibrio ammonificans HB-1] gi|317113509|gb|ADU95999.1| phage protein [Thermovibrio ammonificans HB-1] Length = 373 Score = 51.7 bits (122), Expect = 3e-05, Method: Composition-based stats. Identities = 29/146 (19%), Positives = 62/146 (42%), Gaps = 13/146 (8%) Query: 13 FSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVA 72 ++ +L++ +++ Q+G + ++ + + F ++ N+ R + + VA Sbjct: 236 LTQTLLNDTIEKIVQDGGSPNVLVCNTTQARKISAFNAN-NIIVQREDQTAGNFVARFVA 294 Query: 73 TADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRT--IHEDKNIAKNGDA 130 P G + V+ +N + A ++D + L + LR I + D Sbjct: 295 DL-----PAGVITTI---VVDTNFPKDKIA-ILDTSRLRLVPLRGSTIRDFDATPNGADF 345 Query: 131 NKGVLIGEGALKVRNEK-AVGVVADL 155 ++GE L+VRN K A G++ L Sbjct: 346 IARRILGEYTLEVRNAKQAHGLLEGL 371 >gi|291526329|emb|CBK91916.1| hypothetical protein EUR_29920 [Eubacterium rectale DSM 17629] Length = 304 Score = 50.6 bits (119), Expect = 7e-05, Method: Composition-based stats. Identities = 24/146 (16%), Positives = 57/146 (39%), Gaps = 9/146 (6%) Query: 9 TQRTFSKQILDEVMQEGYQNGDNFRHII-VSPYVKSEFVRFMSDSNVASFRYATSGKSHN 67 T+ +++ + +QE +++G + V+ +K N + R+ S Sbjct: 161 TKGALTEEHFLDALQEMWKHGAQGEYFSFVNANIKRMINDLAKAGN--NVRFLGDNGSMQ 218 Query: 68 NTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKN 127 N + FG++ + +R + +D ++ LR +++ K Sbjct: 219 NVLGIGVQKIVTDFGEISLVLDRYADTKT-----ILTVDLAEVQIAELRGTF-YEDLPKA 272 Query: 128 GDANKGVLIGEGALKVRNEKAVGVVA 153 GD +G ++ E +K+ N A ++ Sbjct: 273 GDYYRGHVLNESTIKLLNSFAGSKIS 298 >gi|169827502|ref|YP_001697660.1| hypothetical protein Bsph_1941 [Lysinibacillus sphaericus C3-41] gi|168991990|gb|ACA39530.1| conserved hypothetical protein [Lysinibacillus sphaericus C3-41] Length = 288 Score = 50.6 bits (119), Expect = 8e-05, Method: Composition-based stats. Identities = 27/151 (17%), Positives = 60/151 (39%), Gaps = 16/151 (10%) Query: 5 AKDGTQRTFSKQILDEVMQEGYQNGDNFRHI--IVSPYVKSEFVRFMSDSNVASFRYATS 62 A + ++ + +VM+ + + V+ +K + D+ + + Sbjct: 149 ATNAVTAPIDEKAIKQVMRNLWNQNLAEGQVYAFVNADIKEKIDAIYKDNYSYNHVTTSF 208 Query: 63 GKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDK 122 G D + +G V ++ + ++ + ++ +LR H + Sbjct: 209 GLL--------VDEINTNYGSVKFVLSKHVPAD-----KVVFFNDAYVDLAYLREPH-FE 254 Query: 123 NIAKNGDANKGVLIGEGALKVRNEKAVGVVA 153 +AK GD+ KG ++ E LKV ++KAV V+ Sbjct: 255 PLAKTGDSVKGQVVAEATLKVGSKKAVAVLT 285 >gi|319649918|ref|ZP_08004068.1| hypothetical protein HMPREF1013_00673 [Bacillus sp. 2_A_57_CT2] gi|317398356|gb|EFV79044.1| hypothetical protein HMPREF1013_00673 [Bacillus sp. 2_A_57_CT2] Length = 292 Score = 48.6 bits (114), Expect = 3e-04, Method: Composition-based stats. Identities = 22/145 (15%), Positives = 56/145 (38%), Gaps = 18/145 (12%) Query: 12 TFSKQILDEVMQEGYQNG---DNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNN 68 ++ + +VM+ + N + + + + + + Sbjct: 158 AITEDDVKKVMRNLWNQDLAEGNVYAFLNADLKEQIDAIYKDKYGYSHVTTSFGL----- 212 Query: 69 TIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNG 128 +V + + +G V ++ + ++ + + ++ +LR H + +AK G Sbjct: 213 -LVESIN---TNYGTVHFVLSKHVPAD-----KIVVFNDAYVDLAYLREPH-FEPLAKTG 262 Query: 129 DANKGVLIGEGALKVRNEKAVGVVA 153 D+ KG +I E +KV ++K V V+ Sbjct: 263 DSVKGQVIAEATVKVGSKKGVAVLT 287 >gi|228941057|ref|ZP_04103614.1| hypothetical protein bthur0008_36970 [Bacillus thuringiensis serovar berliner ATCC 10792] gi|228973988|ref|ZP_04134562.1| hypothetical protein bthur0003_37430 [Bacillus thuringiensis serovar thuringiensis str. T01001] gi|228980577|ref|ZP_04140886.1| hypothetical protein bthur0002_37450 [Bacillus thuringiensis Bt407] gi|228779138|gb|EEM27396.1| hypothetical protein bthur0002_37450 [Bacillus thuringiensis Bt407] gi|228785714|gb|EEM33719.1| hypothetical protein bthur0003_37430 [Bacillus thuringiensis serovar thuringiensis str. T01001] gi|228818600|gb|EEM64668.1| hypothetical protein bthur0008_36970 [Bacillus thuringiensis serovar berliner ATCC 10792] gi|326939625|gb|AEA15521.1| Phage protein [Bacillus thuringiensis serovar chinensis CT-43] Length = 374 Score = 45.6 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 25/145 (17%), Positives = 55/145 (37%), Gaps = 12/145 (8%) Query: 17 ILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATADI 76 +L++ +++ + G + I+ P + + + D + R + I++ Sbjct: 240 LLNDAVEQVFTRGGSANTILCGPNIARQITKLGGD----TIRTTRQDTAAGYQILS---- 291 Query: 77 YDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDANKGVLI 136 + + V+ A L+D ++ +L I++ D V+ Sbjct: 292 FVSDLPGGAISS--VVVDLNMPKDRALLLDTEKVKARYLTPIYDQDATPNGADYFSRVIR 349 Query: 137 GEGALKVRNEKAVGVVADLFGLSKT 161 GE +V+N K +A L +SKT Sbjct: 350 GEFGFEVKNAKES--IAVLENISKT 372 >gi|281355462|ref|ZP_06241956.1| hypothetical protein Vvad_PD3568 [Victivallis vadensis ATCC BAA-548] gi|281318342|gb|EFB02362.1| hypothetical protein Vvad_PD3568 [Victivallis vadensis ATCC BAA-548] Length = 403 Score = 44.8 bits (104), Expect = 0.004, Method: Composition-based stats. Identities = 17/144 (11%), Positives = 45/144 (31%), Gaps = 11/144 (7%) Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61 T A D + + +++++ + + + G N + +P N Sbjct: 228 TALAVDASGGRLTSKLVNDAVMKVTEEGGNPTALYCTPDQARVLA------NEYKNNITV 281 Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121 + + + + + +A+L D + W+R + + Sbjct: 282 MRDDKARGVYVAVVVNEANGQNIEII-----GDPDFNDADAWLCDDDCFALSWMRPVGDT 336 Query: 122 KNIAKNGDANKGVLIGEGALKVRN 145 + + D ++GE L+ N Sbjct: 337 DSTPPDADGISRKMVGEFTLEFHN 360 >gi|229020770|ref|ZP_04177493.1| hypothetical protein bcere0030_52440 [Bacillus cereus AH1273] gi|228740571|gb|EEL90846.1| hypothetical protein bcere0030_52440 [Bacillus cereus AH1273] Length = 374 Score = 44.8 bits (104), Expect = 0.004, Method: Composition-based stats. Identities = 25/144 (17%), Positives = 54/144 (37%), Gaps = 12/144 (8%) Query: 18 LDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATADIY 77 L++ +++ + G + I+ P + + + D + R + I++ + Sbjct: 241 LNDAVEQVFTRGGSANTILCGPNIARQITKLGGD----TIRTTRQDTAAGYQILS----F 292 Query: 78 DRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDANKGVLIG 137 + V+ A L+D ++ +L I++ D V+ G Sbjct: 293 VSDLPGGAISS--VVVDLNMPKDRALLLDTEKVKARYLTPIYDQDATPNGADYFSRVIRG 350 Query: 138 EGALKVRNEKAVGVVADLFGLSKT 161 E +V+N K +A L +SKT Sbjct: 351 EFGFEVKNAKES--IAVLENISKT 372 >gi|218897919|ref|YP_002446330.1| phage protein [Bacillus cereus G9842] gi|218542918|gb|ACK95312.1| phage protein [Bacillus cereus G9842] Length = 374 Score = 44.8 bits (104), Expect = 0.004, Method: Composition-based stats. Identities = 25/144 (17%), Positives = 54/144 (37%), Gaps = 12/144 (8%) Query: 18 LDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATADIY 77 L++ +++ + G + I+ P + + + D + R + I++ + Sbjct: 241 LNDAVEQVFTRGGSANTILCGPNIARQITKLGGD----TIRTTRQDTAAGYQILS----F 292 Query: 78 DRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDANKGVLIG 137 + V+ A L+D ++ +L I++ D V+ G Sbjct: 293 VSDLPGGAISS--VVVDLNMPKDRALLLDTEKVKARYLTPIYDQDATPNGADYFSRVIRG 350 Query: 138 EGALKVRNEKAVGVVADLFGLSKT 161 E +V+N K +A L +SKT Sbjct: 351 EFGFEVKNAKES--IAVLENISKT 372 >gi|30020036|ref|NP_831667.1| Phage protein [Bacillus cereus ATCC 14579] gi|31415788|ref|NP_852528.1| hypothetical protein BC1894 [Bacillus phage phBC6A51] gi|229127327|ref|ZP_04256323.1| hypothetical protein bcere0015_17800 [Bacillus cereus BDRD-Cer4] gi|29895581|gb|AAP08868.1| Phage protein [Bacillus phage phBC6A51] gi|228656160|gb|EEL12002.1| hypothetical protein bcere0015_17800 [Bacillus cereus BDRD-Cer4] Length = 374 Score = 44.8 bits (104), Expect = 0.004, Method: Composition-based stats. Identities = 25/144 (17%), Positives = 55/144 (38%), Gaps = 12/144 (8%) Query: 18 LDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATADIY 77 L++ +++ + G + I+ P + + + D + R + I++ + Sbjct: 241 LNDAVEQVFTRGGSANTILCGPNIARQITKLGGD----TIRTTRQDTAAGYQILS----F 292 Query: 78 DRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDANKGVLIG 137 + V+ A L+D + ++ +L I++ D V+ G Sbjct: 293 VSDLPGGAISS--VVVDLNMPKDRALLLDTDKVKARYLTPIYDQDATPNGADYFSRVIRG 350 Query: 138 EGALKVRNEKAVGVVADLFGLSKT 161 E +V+N K +A L +SKT Sbjct: 351 EFGFEVKNAKES--IAVLENISKT 372 >gi|228968787|ref|ZP_04129749.1| hypothetical protein bthur0004_55460 [Bacillus thuringiensis serovar sotto str. T04001] gi|228790850|gb|EEM38489.1| hypothetical protein bthur0004_55460 [Bacillus thuringiensis serovar sotto str. T04001] Length = 374 Score = 44.8 bits (104), Expect = 0.004, Method: Composition-based stats. Identities = 25/144 (17%), Positives = 54/144 (37%), Gaps = 12/144 (8%) Query: 18 LDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATADIY 77 L++ +++ + G + I+ P + + + D + R + I++ + Sbjct: 241 LNDAVEQVFTRGGSANTILCGPNIARQITKLGGD----TIRTTRQDTAAGYQILS----F 292 Query: 78 DRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDANKGVLIG 137 + V+ A L+D ++ +L I++ D V+ G Sbjct: 293 VSDLPGGAISS--VVVDLNMPKDRALLLDTEKVKARYLTPIYDQDATPNGADYFSRVIRG 350 Query: 138 EGALKVRNEKAVGVVADLFGLSKT 161 E +V+N K +A L +SKT Sbjct: 351 EFGFEVKNAKES--IAVLENISKT 372 >gi|229190579|ref|ZP_04317576.1| hypothetical protein bcere0002_22460 [Bacillus cereus ATCC 10876] gi|228592924|gb|EEK50746.1| hypothetical protein bcere0002_22460 [Bacillus cereus ATCC 10876] Length = 374 Score = 44.8 bits (104), Expect = 0.004, Method: Composition-based stats. Identities = 30/144 (20%), Positives = 60/144 (41%), Gaps = 12/144 (8%) Query: 18 LDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATADIY 77 L++ +++ + G + I+ P + + + D + + R T+ + V+ D+ Sbjct: 241 LNDAVEQVFTRGGSANTILCGPNIARQITKLGGD-TIRTTRQDTTAGYQILSFVS--DLP 297 Query: 78 DRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDANKGVLIG 137 V+V N M + A L+D ++ +L I++ D V+ G Sbjct: 298 GGAISSVVVDLN--MPKD-----RALLLDTEKVKARYLTPIYDQDATPNGADYFSRVIRG 350 Query: 138 EGALKVRNEKAVGVVADLFGLSKT 161 E +V+N K +A L +SKT Sbjct: 351 EFGFEVKNAKES--IAVLENISKT 372 >gi|319956911|ref|YP_004168174.1| hypothetical protein Nitsa_1172 [Nitratifractor salsuginis DSM 16511] gi|319419315|gb|ADV46425.1| hypothetical protein Nitsa_1172 [Nitratifractor salsuginis DSM 16511] Length = 308 Score = 44.0 bits (102), Expect = 0.006, Method: Composition-based stats. Identities = 31/144 (21%), Positives = 53/144 (36%), Gaps = 14/144 (9%) Query: 7 DGTQRTFSKQILDEVMQEGYQNGD---NFRHIIVSPYVKSEFVRFMSDSNVASFRYATSG 63 +G + L E+++ ++ G +I VK RF D F SG Sbjct: 165 NGNPTVLTYDSLSEIIEPVWERGGIEDESFMLICGTSVKRAINRFAGD----QFFRKVSG 220 Query: 64 KSH-NNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDK 122 K + T+ + FG V V +R+ N E R+ L+ + +T E Sbjct: 221 KEKFDPTLFE----LETDFGTVQVKIHRLF--NQEKLRDKILVGKLSEARIMFQTSTEFT 274 Query: 123 NIAKNGDANKGVLIGEGALKVRNE 146 + + A G + L+V+N Sbjct: 275 EVPTDKTAKFGRYYTDLTLEVKNP 298 >gi|134299981|ref|YP_001113477.1| hypothetical protein Dred_2135 [Desulfotomaculum reducens MI-1] gi|134052681|gb|ABO50652.1| hypothetical protein Dred_2135 [Desulfotomaculum reducens MI-1] Length = 285 Score = 42.5 bits (98), Expect = 0.019, Method: Composition-based stats. Identities = 32/147 (21%), Positives = 52/147 (35%), Gaps = 18/147 (12%) Query: 5 AKDGTQRTFSKQILDEVMQEGYQNGDNFRHI-IVSPYVKSEFVRFMSDSNVASFRYATSG 63 A T TF+K +EV+ + Y G N + + +K + F + +A + Sbjct: 151 ANKVTNATFTKTKFEEVVGKMYDAGVNDEMLCFLPAQMKIQLNSFSNVEFLARDMFLGFD 210 Query: 64 KSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKN 123 Y P+G V + + F+++PN LE L H Sbjct: 211 TER----------YVTPYGIVTFV------LSEKLNNKLFIVNPNYLELAELIPFHAVPQ 254 Query: 124 IAKNGDANKGVLIGEGALKVRNEKAVG 150 A +G L + LK+ N KA Sbjct: 255 -AVSGSKQSVYLETQFGLKLLNTKAAA 280 >gi|134298256|ref|YP_001111752.1| hypothetical protein Dred_0379 [Desulfotomaculum reducens MI-1] gi|134050956|gb|ABO48927.1| hypothetical protein Dred_0379 [Desulfotomaculum reducens MI-1] Length = 285 Score = 42.5 bits (98), Expect = 0.021, Method: Composition-based stats. Identities = 30/143 (20%), Positives = 50/143 (34%), Gaps = 18/143 (12%) Query: 9 TQRTFSKQILDEVMQEGYQNGDNFRHI-IVSPYVKSEFVRFMSDSNVASFRYATSGKSHN 67 T TF+K +EV+ + Y G N + + +K + F + +A + Sbjct: 155 TNATFTKTKFEEVVSKMYDAGVNDEMLCFLPAQMKIQLNSFSNVEFLARDMFLGFDTER- 213 Query: 68 NTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKN 127 Y P+G V + + F+++PN LE L H + Sbjct: 214 ---------YVTPYGIVTFV------LSEKLNNKLFIVNPNYLELAELIPFHATVEPV-S 257 Query: 128 GDANKGVLIGEGALKVRNEKAVG 150 G L + LK+ N KA Sbjct: 258 GSKQSIYLETQFGLKLLNTKAAA 280 >gi|323703894|ref|ZP_08115527.1| hypothetical protein DesniDRAFT_2739 [Desulfotomaculum nigrificans DSM 574] gi|323531143|gb|EGB21049.1| hypothetical protein DesniDRAFT_2739 [Desulfotomaculum nigrificans DSM 574] Length = 285 Score = 42.1 bits (97), Expect = 0.026, Method: Composition-based stats. Identities = 29/147 (19%), Positives = 49/147 (33%), Gaps = 18/147 (12%) Query: 5 AKDGTQRTFSKQILDEVMQEGYQNGDNFRHI-IVSPYVKSEFVRFMSDSNVASFRYATSG 63 A T +F+K +E + + Y G N I + +K F + +A + Sbjct: 151 ANQVTNTSFTKAKFEETVAKMYDAGVNDEMICFIPAQMKIALNGFNNVEFLARDMFQGFD 210 Query: 64 KSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKN 123 Y P+G + + + F+++PN LE L H Sbjct: 211 TEK----------YVTPYGIITFV------LSEKLQNKLFIVNPNYLELAELIPFHAVPQ 254 Query: 124 IAKNGDANKGVLIGEGALKVRNEKAVG 150 +G L + LK+ N KA Sbjct: 255 PV-SGSKQSIYLETQFGLKLLNSKAAA 280 >gi|322818742|gb|EFZ26063.1| hypothetical protein TCSYLVIO_7770 [Trypanosoma cruzi] Length = 756 Score = 40.9 bits (94), Expect = 0.055, Method: Composition-based stats. Identities = 21/111 (18%), Positives = 39/111 (35%), Gaps = 3/111 (2%) Query: 18 LDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATA--D 75 L V+ +NG + I PY++ RF+S + R + T+ + A D Sbjct: 119 LVSVVIAILENGGDDGVIASGPYLEWYRSRFVSMDAIYGLRSTLNAVPPRATLSSAAEED 178 Query: 76 IYDRPFGKVMVHPNR-VMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIA 125 +++ G M+ R +A A + L + + A Sbjct: 179 LFEGLLGPYMISIQRAFLAGLFPVAAELISSLVDYLRSCEIDPFDAEDESA 229 >gi|71655038|ref|XP_816128.1| hypothetical protein [Trypanosoma cruzi strain CL Brener] gi|70881234|gb|EAN94277.1| hypothetical protein, conserved [Trypanosoma cruzi] Length = 756 Score = 40.9 bits (94), Expect = 0.055, Method: Composition-based stats. Identities = 21/111 (18%), Positives = 39/111 (35%), Gaps = 3/111 (2%) Query: 18 LDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATA--D 75 L V+ +NG + I PY++ RF+S + R + T+ + A D Sbjct: 119 LVSVVIAILENGGDDGVIASGPYLEWYRSRFVSMDAIYGLRSTLNAVPPRATLSSAAEED 178 Query: 76 IYDRPFGKVMVHPNR-VMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIA 125 +++ G M+ R +A A + L + + A Sbjct: 179 LFEGLLGPYMISIQRAFLAGLFPVAAELISSLVDYLRSCEIDPFDAEDESA 229 >gi|256375780|ref|YP_003099440.1| hypothetical protein Amir_1645 [Actinosynnema mirum DSM 43827] gi|255920083|gb|ACU35594.1| hypothetical protein Amir_1645 [Actinosynnema mirum DSM 43827] Length = 406 Score = 40.9 bits (94), Expect = 0.062, Method: Composition-based stats. Identities = 19/88 (21%), Positives = 34/88 (38%), Gaps = 13/88 (14%) Query: 74 ADIYDRPFGKVMVHPNRVMASNAETARNA------FLIDPNMLEFLWLRTIHEDKNIAKN 127 + FG + + +R M +A + FL+ P F ++ +AK Sbjct: 321 VERLLTDFGPINIMVDRAMPQDALLLTSLSQLKLQFLLIPGKGSFF-------EEPLAKT 373 Query: 128 GDANKGVLIGEGALKVRNEKAVGVVADL 155 G + + GE L+ + A GV+ L Sbjct: 374 GASEDTQIYGEVGLEYGSPLAHGVLRGL 401 >gi|71657449|ref|XP_817240.1| hypothetical protein [Trypanosoma cruzi strain CL Brener] gi|70882418|gb|EAN95389.1| hypothetical protein, conserved [Trypanosoma cruzi] Length = 757 Score = 40.2 bits (92), Expect = 0.11, Method: Composition-based stats. Identities = 21/111 (18%), Positives = 39/111 (35%), Gaps = 3/111 (2%) Query: 18 LDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATA--D 75 L V+ +NG + I PY++ RF+S + R + T+ + A D Sbjct: 120 LVSVVIAILENGGDDAVIASGPYLEWYRSRFVSMDAIYGLRSTLNAVPPRATLSSAAEED 179 Query: 76 IYDRPFGKVMVHPNR-VMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIA 125 +++ G M+ R +A A + L + + A Sbjct: 180 LFEGLLGPYMISIQRAFLAGLFPVAAELISSLVDYLRSCEIDPFDAEDESA 230 >gi|262067743|ref|ZP_06027355.1| hypothetical protein FUSPEROL_02025 [Fusobacterium periodonticum ATCC 33693] gi|291378467|gb|EFE85985.1| hypothetical protein FUSPEROL_02025 [Fusobacterium periodonticum ATCC 33693] Length = 379 Score = 39.0 bits (89), Expect = 0.20, Method: Composition-based stats. Identities = 34/148 (22%), Positives = 54/148 (36%), Gaps = 24/148 (16%) Query: 7 DGTQRTFSKQILDEVMQEGYQNGDNFRH--------IIVSPYVKSEFVRFMSDSNVASFR 58 D + F+ +LD ++ G N + PY+ + ++ NV S Sbjct: 234 DANNQPFTLDLLDNAVKAIVDKG-NPGSANLKANKYFLCVPYLILRTINKLNKDNVRSNI 292 Query: 59 YATSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLI-DPNMLEFLWLRT 117 + IV T+ G V V P A N FL+ N + L Sbjct: 293 TDKVTGTTIEQIVTTS-------GTVSVFP------ATSLAPNEFLLIHLNDVSLRQLYP 339 Query: 118 IHEDKNIAKNGDANKGVLIGEGALKVRN 145 I E++ AK A+ L GE A +++N Sbjct: 340 IKEEEG-AKTALADNYFLHGEYAHQIKN 366 >gi|258590947|emb|CBE67242.1| conserved hypothetical protein; putative mxaE, involved in methanol dehydrogenase (mxaE2) [NC10 bacterium 'Dutch sediment'] Length = 322 Score = 38.6 bits (88), Expect = 0.27, Method: Composition-based stats. Identities = 23/116 (19%), Positives = 43/116 (37%), Gaps = 7/116 (6%) Query: 29 GDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATADIYDRPFGKVMVHP 88 G N R + +P K +V D V+ A ++AT + +RP G V+ H Sbjct: 187 GANPRGVSFTPDGKRAYVGCERDGTVSVIDTAA------RRVIATIPVGERPVGTVVSHD 240 Query: 89 NRVMASNAETARNAFLIDPNMLEFLWLRTIHEDK-NIAKNGDANKGVLIGEGALKV 143 + + + +++D + L E +A D + + G +V Sbjct: 241 GKKVYVAHGRSYEVWVLDTRTHQVLTKIPTQERSWWVALTPDGRELYVTVHGGGRV 296 >gi|194015203|ref|ZP_03053819.1| phage protein [Bacillus pumilus ATCC 7061] gi|194012607|gb|EDW22173.1| phage protein [Bacillus pumilus ATCC 7061] Length = 367 Score = 37.9 bits (86), Expect = 0.44, Method: Composition-based stats. Identities = 24/128 (18%), Positives = 51/128 (39%), Gaps = 8/128 (6%) Query: 18 LDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATADIY 77 ++ +M++ Y G + I+ + + + +D + + R + +T V+ D+ Sbjct: 232 INALMEQVYLRGGSVNTILTNTAGARQISKMATD-TIRTERTDAATGHRISTFVS--DMV 288 Query: 78 DRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDANKGVLIG 137 V+V PN A R+ + P L R++++ D + G Sbjct: 289 GGGVATVIVDPNFPKDKIALFDRSILSLHP-----LTGRSVYDTDASVPGADFVARQIRG 343 Query: 138 EGALKVRN 145 E +KV+N Sbjct: 344 EYGIKVKN 351 >gi|300710357|ref|YP_003736171.1| phospholactate guanylyltransferase [Halalkalicoccus jeotgali B3] gi|299124040|gb|ADJ14379.1| phospholactate guanylyltransferase [Halalkalicoccus jeotgali B3] Length = 203 Score = 37.5 bits (85), Expect = 0.62, Method: Composition-based stats. Identities = 24/140 (17%), Positives = 45/140 (32%), Gaps = 17/140 (12%) Query: 11 RTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTI 70 R FS+ +L +V++ + G + +P A + T Sbjct: 26 REFSRAMLGDVLESVREAGGEPEVLATAPLDVDTPATVDDRP----LTEAVNAALEPETA 81 Query: 71 VATAD-----------IYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIH 119 V AD +++ P G V++ P R +NA +R + +L+ Sbjct: 82 VVMADLPLLTAETIEGLFETP-GDVVLAPGRGGGTNALLSREP-EFRVDYHGGSFLKHRA 139 Query: 120 EDKNIAKNGDANKGVLIGEG 139 I G + L + Sbjct: 140 SASEIGSVGVVDSHRLATDI 159 >gi|217076269|ref|YP_002333985.1| pyrimidine-nucleoside phosphorylase [Thermosipho africanus TCF52B] gi|217036122|gb|ACJ74644.1| pyrimidine-nucleoside phosphorylase [Thermosipho africanus TCF52B] Length = 434 Score = 37.1 bits (84), Expect = 0.78, Method: Composition-based stats. Identities = 24/133 (18%), Positives = 48/133 (36%), Gaps = 15/133 (11%) Query: 28 NGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSG-----KSHNNTIVATADIYDRPFG 82 G + P K+ NV ++ A G + I A D Sbjct: 117 TGGTIDKLESIPGFKTSLSLEKFFENVNKYKIAVVGQTANLAPADKKIYALRDATAT-VD 175 Query: 83 KVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDANKGVLIGEGALK 142 ++ + + +M+ +AF++D + +++ I K +A + E A K Sbjct: 176 EISLIASSIMSKKLAGGADAFVLDVKVGSGAFMKDISSAKELASA--------MVEIA-K 226 Query: 143 VRNEKAVGVVADL 155 N+ AV V+ ++ Sbjct: 227 SHNKTAVAVLTNM 239 >gi|294614769|ref|ZP_06694669.1| hypothetical protein EfmE1636_0859 [Enterococcus faecium E1636] gi|291592381|gb|EFF23990.1| hypothetical protein EfmE1636_0859 [Enterococcus faecium E1636] Length = 296 Score = 36.3 bits (82), Expect = 1.3, Method: Composition-based stats. Identities = 25/152 (16%), Positives = 51/152 (33%), Gaps = 11/152 (7%) Query: 4 KAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSG 63 KA+ T +++ +D + + Y+ G + + E + + Sbjct: 153 KAETATAGAVTRKDIDTLFKLMYEKGYMGEKLCLISPDMQELMT----DELDGKSTKIVQ 208 Query: 64 KSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKN 123 T +G + + A ID N ++ LR + Sbjct: 209 FGERVTFGLQIGNIVSNYGTGIALLEPSLPKGTIAA-----IDTNYVKLRPLREW-RAEE 262 Query: 124 IAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 +AK D+ + L+GE +L+ N G + +L Sbjct: 263 LAKTTDSRRIGLVGEYSLE-YNASNSGAILNL 293 >gi|257893408|ref|ZP_05673061.1| conserved hypothetical protein [Enterococcus faecium 1,231,408] gi|257829787|gb|EEV56394.1| conserved hypothetical protein [Enterococcus faecium 1,231,408] Length = 296 Score = 36.3 bits (82), Expect = 1.3, Method: Composition-based stats. Identities = 25/152 (16%), Positives = 51/152 (33%), Gaps = 11/152 (7%) Query: 4 KAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSG 63 KA+ T +++ +D + + Y+ G + + E + + Sbjct: 153 KAETATAGAVTRKDIDTLFKLMYEKGYMGEKLCLISPDMQELMT----DELDGKSTKIVQ 208 Query: 64 KSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKN 123 T +G + + A ID N ++ LR + Sbjct: 209 FGERVTFGLQIGNIVSNYGTGIALLEPSLPKGTIAA-----IDTNYVKLRPLREW-RAEE 262 Query: 124 IAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 +AK D+ + L+GE +L+ N G + +L Sbjct: 263 LAKTTDSRRIGLVGEYSLE-YNASNSGAILNL 293 >gi|257883499|ref|ZP_05663152.1| conserved hypothetical protein [Enterococcus faecium 1,231,502] gi|261208026|ref|ZP_05922703.1| conserved hypothetical protein [Enterococcus faecium TC 6] gi|289567093|ref|ZP_06447488.1| conserved hypothetical protein [Enterococcus faecium D344SRF] gi|294622496|ref|ZP_06701518.1| conserved hypothetical protein [Enterococcus faecium U0317] gi|257819157|gb|EEV46485.1| conserved hypothetical protein [Enterococcus faecium 1,231,502] gi|260077743|gb|EEW65457.1| conserved hypothetical protein [Enterococcus faecium TC 6] gi|289161108|gb|EFD09013.1| conserved hypothetical protein [Enterococcus faecium D344SRF] gi|291598043|gb|EFF29153.1| conserved hypothetical protein [Enterococcus faecium U0317] Length = 296 Score = 36.3 bits (82), Expect = 1.3, Method: Composition-based stats. Identities = 25/152 (16%), Positives = 51/152 (33%), Gaps = 11/152 (7%) Query: 4 KAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSG 63 KA+ T +++ +D + + Y+ G + + E + + Sbjct: 153 KAETATAGAVTRKDIDTLFKLMYEKGYMGEKLCLISPDMQELMT----DELDGKSTKIVQ 208 Query: 64 KSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKN 123 T +G + + A ID N ++ LR + Sbjct: 209 FGERVTFGLQIGNIVSNYGTGIALLEPSLPKGTIAA-----IDTNYVKLRPLREW-RAEE 262 Query: 124 IAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 +AK D+ + L+GE +L+ N G + +L Sbjct: 263 LAKTTDSRRIGLVGEYSLE-YNASNSGAILNL 293 >gi|257879565|ref|ZP_05659218.1| conserved hypothetical protein [Enterococcus faecium 1,230,933] gi|257891545|ref|ZP_05671198.1| conserved hypothetical protein [Enterococcus faecium 1,231,410] gi|314940388|ref|ZP_07847550.1| conserved hypothetical protein [Enterococcus faecium TX0133a04] gi|314943205|ref|ZP_07849996.1| conserved hypothetical protein [Enterococcus faecium TX0133C] gi|314949154|ref|ZP_07852509.1| conserved hypothetical protein [Enterococcus faecium TX0082] gi|314951966|ref|ZP_07854992.1| conserved hypothetical protein [Enterococcus faecium TX0133A] gi|314993065|ref|ZP_07858455.1| conserved hypothetical protein [Enterococcus faecium TX0133B] gi|314995396|ref|ZP_07860499.1| conserved hypothetical protein [Enterococcus faecium TX0133a01] gi|257813793|gb|EEV42551.1| conserved hypothetical protein [Enterococcus faecium 1,230,933] gi|257827905|gb|EEV54531.1| conserved hypothetical protein [Enterococcus faecium 1,231,410] gi|313590399|gb|EFR69244.1| conserved hypothetical protein [Enterococcus faecium TX0133a01] gi|313592421|gb|EFR71266.1| conserved hypothetical protein [Enterococcus faecium TX0133B] gi|313595906|gb|EFR74751.1| conserved hypothetical protein [Enterococcus faecium TX0133A] gi|313598089|gb|EFR76934.1| conserved hypothetical protein [Enterococcus faecium TX0133C] gi|313640428|gb|EFS05008.1| conserved hypothetical protein [Enterococcus faecium TX0133a04] gi|313644467|gb|EFS09047.1| conserved hypothetical protein [Enterococcus faecium TX0082] Length = 296 Score = 35.9 bits (81), Expect = 1.6, Method: Composition-based stats. Identities = 25/152 (16%), Positives = 51/152 (33%), Gaps = 11/152 (7%) Query: 4 KAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSG 63 KA+ T +++ +D + + Y+ G + + E + + Sbjct: 153 KAETATAGAVTRKDIDGLFKLMYEKGYMGEKLCLISPDMQELMT----DELDGKSTKIVQ 208 Query: 64 KSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKN 123 T +G + + A ID N ++ LR + Sbjct: 209 FGERVTFGLQIGNIVSNYGTGIALLEPSLPKGTIAA-----IDTNYVKLRPLREW-RAEE 262 Query: 124 IAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 +AK D+ + L+GE +L+ N G + +L Sbjct: 263 LAKTTDSRRIGLVGEYSLE-YNASNSGAILNL 293 >gi|114707526|ref|ZP_01440422.1| hypothetical protein FP2506_11672 [Fulvimarina pelagi HTCC2506] gi|114537085|gb|EAU40213.1| hypothetical protein FP2506_11672 [Fulvimarina pelagi HTCC2506] Length = 375 Score = 35.9 bits (81), Expect = 2.0, Method: Composition-based stats. Identities = 21/127 (16%), Positives = 41/127 (32%), Gaps = 13/127 (10%) Query: 16 QILDEVMQEGYQNGDNFRHIIVSPYVKSEFVR--FMSDSNVASFRYATSGKSHNNTIVAT 73 ++DE + + + ++ P F +D+ ++ +N IV Sbjct: 122 SMVDEALGQDLMSFGTPILVLGDPGQLPPVSGGGFFTDAEPDFLLTDIHRQARDNPIVKL 181 Query: 74 A-------DIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHE----DK 122 A I +G V V R + A + L+ N + R + E Sbjct: 182 ALDVREGRQIMHGDYGTVQVIGKRDVDQAKVLAVDQVLVGTNKTRRAYNRRLRELKGFTH 241 Query: 123 NIAKNGD 129 + + GD Sbjct: 242 ELPQVGD 248 >gi|257079386|ref|ZP_05573747.1| predicted protein [Enterococcus faecalis JH1] gi|256987416|gb|EEU74718.1| predicted protein [Enterococcus faecalis JH1] Length = 300 Score = 35.6 bits (80), Expect = 2.2, Method: Composition-based stats. Identities = 28/155 (18%), Positives = 55/155 (35%), Gaps = 19/155 (12%) Query: 3 KKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATS 62 A D R ++ + +GY G+ + + + V +F + Sbjct: 156 TSAADAVTRKDVDKMFKTMFDKGY-AGEKLCLVSTDMVDLMTDEVDKAGTKVFNFGDQVA 214 Query: 63 GKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHE-- 120 IV+ Y G ++ P + + +D N +E LR + E Sbjct: 215 FGLQLGKIVSN---YGS--GTALIEP-------SLPSGTMIALDTNYVE---LRPVREWR 259 Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 + +AK D+ + L+GE ++ N G + +L Sbjct: 260 AEELAKTTDSKRIGLVGEYTIE-YNASNSGAILNL 293 >gi|307280635|ref|ZP_07561683.1| hypothetical protein HMPREF9515_01677 [Enterococcus faecalis TX0860] gi|306504001|gb|EFM73218.1| hypothetical protein HMPREF9515_01677 [Enterococcus faecalis TX0860] Length = 300 Score = 35.2 bits (79), Expect = 2.8, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 53/153 (34%), Gaps = 15/153 (9%) Query: 3 KKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATS 62 A D R ++ + +GY G+ + + + V +F + Sbjct: 156 TSAADAVTRKDVDKMFKTMFDKGY-AGEKLCLVSTDMVDLMTDEVDKAGTKVFNFGDQVA 214 Query: 63 GKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDK 122 IV+ Y G ++ P + + +D N +E LR + Sbjct: 215 FGLQLGKIVSN---YGS--GTALIEP-------SLPSGTMIALDTNYVELRPLREW-RAE 261 Query: 123 NIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 +AK D+ + L+GE ++ N G + +L Sbjct: 262 ELAKTTDSKRIGLVGEYTIE-YNASNSGAILNL 293 >gi|307286482|ref|ZP_07566582.1| hypothetical protein HMPREF9505_00059 [Enterococcus faecalis TX0109] gi|306502395|gb|EFM71671.1| hypothetical protein HMPREF9505_00059 [Enterococcus faecalis TX0109] Length = 300 Score = 35.2 bits (79), Expect = 2.8, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 53/153 (34%), Gaps = 15/153 (9%) Query: 3 KKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATS 62 A D R ++ + +GY G+ + + + V +F + Sbjct: 156 TSAADAVTRKDVDKMFKTMFDKGY-AGEKLCLVSTDMVDLMTDEVDKAGTKVFNFGDQVA 214 Query: 63 GKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDK 122 IV+ Y G ++ P + + +D N +E LR + Sbjct: 215 FGLQLGKIVSN---YGS--GTALIEP-------SLPSGTMIALDTNYVELRPLREW-RAE 261 Query: 123 NIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 +AK D+ + L+GE ++ N G + +L Sbjct: 262 ELAKTTDSKRIGLVGEYTIE-YNASNSGAILNL 293 >gi|315144740|gb|EFT88756.1| conserved hypothetical protein [Enterococcus faecalis TX2141] Length = 300 Score = 35.2 bits (79), Expect = 3.4, Method: Composition-based stats. Identities = 28/155 (18%), Positives = 55/155 (35%), Gaps = 19/155 (12%) Query: 3 KKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATS 62 A D R ++ + +GY G+ + + + V +F + Sbjct: 156 TSAADAVTRKDVDKMFKTMFDKGY-AGEKLCLVSTDMVDLMTDEVDKAGTKVFNFGDQVA 214 Query: 63 GKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHE-- 120 IV+ Y G ++ P + + +D N +E LR + E Sbjct: 215 FGLQLGKIVSN---YGS--GTALIEP-------SLPSGTMIALDTNYVE---LRPVREWR 259 Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 + +AK D+ + L+GE ++ N G + +L Sbjct: 260 AEELAKTTDSKRIGLVGEYTIE-YNASNSGAILNL 293 >gi|315173098|gb|EFU17115.1| conserved hypothetical protein [Enterococcus faecalis TX1346] Length = 300 Score = 34.8 bits (78), Expect = 4.4, Method: Composition-based stats. Identities = 14/54 (25%), Positives = 25/54 (46%), Gaps = 2/54 (3%) Query: 102 AFLIDPNMLEFLWLRTIHEDKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 +D N +E LR + +AK D+ + L+GE ++ N G + +L Sbjct: 242 MIALDTNYVELRPLREW-RAEELAKTTDSKRIGLVGEYTIE-YNASNSGAILNL 293 >gi|307270079|ref|ZP_07551399.1| hypothetical protein HMPREF9498_02197 [Enterococcus faecalis TX4248] gi|306513574|gb|EFM82186.1| hypothetical protein HMPREF9498_02197 [Enterococcus faecalis TX4248] Length = 300 Score = 34.4 bits (77), Expect = 5.2, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 54/153 (35%), Gaps = 15/153 (9%) Query: 3 KKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATS 62 A D R ++ + +GY G+ + + + V +F + Sbjct: 156 TSAADAVTRKDVDKMFKTMFDKGY-AGEKLCLVSTDMVDLMTDEVDKAGTKVFNFGDQVA 214 Query: 63 GKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDK 122 IV+ Y G ++ P + + +D N +E LR + Sbjct: 215 FGLQLGKIVSN---YGS--GTALIEP-------SLPSGTMIALDTNYVELRPLREW-RAE 261 Query: 123 NIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 +AK D+ + L+GE +++ N G + +L Sbjct: 262 ELAKTTDSKRIGLVGEYSIE-YNASNSGAILNL 293 >gi|328725488|ref|XP_003248499.1| PREDICTED: probable NADP-dependent mannitol dehydrogenase-like, partial [Acyrthosiphon pisum] Length = 462 Score = 34.0 bits (76), Expect = 6.4, Method: Composition-based stats. Identities = 12/88 (13%), Positives = 31/88 (35%), Gaps = 5/88 (5%) Query: 19 DEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATADIYD 78 E+++ + G + I + ++ + S ++I AT D Sbjct: 259 LEIVRGLAEAGADVAIIYATSDNAPAIAAEVASQTNVKVQAYQSDVRRRDSIAATVDQIV 318 Query: 79 RPFGKVMVHPNRVMASNAETARNAFLID 106 FG++ + M +N+ + ++ Sbjct: 319 SDFGRIDI-----MVANSGVCADVPALE 341 >gi|256956794|ref|ZP_05560965.1| conserved hypothetical protein [Enterococcus faecalis DS5] gi|256947290|gb|EEU63922.1| conserved hypothetical protein [Enterococcus faecalis DS5] gi|295113775|emb|CBL32412.1| hypothetical protein [Enterococcus sp. 7L76] gi|315035894|gb|EFT47826.1| conserved hypothetical protein [Enterococcus faecalis TX0027] Length = 300 Score = 34.0 bits (76), Expect = 7.4, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 54/153 (35%), Gaps = 15/153 (9%) Query: 3 KKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATS 62 A D R ++ + +GY G+ + + + V +F + Sbjct: 156 TSAADAVTRKDVDKMFKTMFDKGY-AGEKLCLVSTDMVDLMTDEVDKAGTKVFNFGDQVA 214 Query: 63 GKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDK 122 IV+ Y G ++ P + + +D N +E LR + Sbjct: 215 FGLQLGKIVSN---YGS--GTALIEP-------SLPSGTMIALDTNYVELRPLREW-RAE 261 Query: 123 NIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155 +AK D+ + L+GE +++ N G + +L Sbjct: 262 ELAKTTDSKRIGLVGEYSIE-YNASNSGAILNL 293 >gi|308172834|ref|YP_003919539.1| phage protein [Bacillus amyloliquefaciens DSM 7] gi|307605698|emb|CBI42069.1| phage protein [Bacillus amyloliquefaciens DSM 7] Length = 367 Score = 34.0 bits (76), Expect = 7.6, Method: Composition-based stats. Identities = 26/146 (17%), Positives = 50/146 (34%), Gaps = 14/146 (9%) Query: 18 LDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATADIY 77 ++ +M++ YQ G + I+ + + + +D+ A +G + Sbjct: 232 INALMEKVYQRGGSANTILTNTAGARQISKMATDTIRTERTDAATGHR--------VSTF 283 Query: 78 DRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDAN--KGVL 135 V V + + L D ++L L A A+ + Sbjct: 284 VSDMVGGGVATVVVDPNFPK--DKIALFDRSILSLHPLNGRSVYDEDATVPGADFVARQI 341 Query: 136 IGEGALKVRNEKAVGVVADLFGLSKT 161 GE +KV+N A +A L +S T Sbjct: 342 RGEYGIKVKN--ANEKIAILENISTT 365 Database: nr Posted date: May 13, 2011 4:10 AM Number of letters in database: 999,999,932 Number of sequences in database: 2,987,209 Database: /data/usr2/db/fasta/nr.01 Posted date: May 13, 2011 4:17 AM Number of letters in database: 999,998,956 Number of sequences in database: 2,896,973 Database: /data/usr2/db/fasta/nr.02 Posted date: May 13, 2011 4:23 AM Number of letters in database: 999,999,979 Number of sequences in database: 2,907,862 Database: /data/usr2/db/fasta/nr.03 Posted date: May 13, 2011 4:29 AM Number of letters in database: 999,999,513 Number of sequences in database: 2,932,190 Database: /data/usr2/db/fasta/nr.04 Posted date: May 13, 2011 4:33 AM Number of letters in database: 792,586,372 Number of sequences in database: 2,260,650 Lambda K H 0.306 0.123 0.299 Lambda K H 0.267 0.0383 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 2,299,913,160 Number of Sequences: 13984884 Number of extensions: 73107684 Number of successful extensions: 180898 Number of sequences better than 10.0: 87 Number of HSP's better than 10.0 without gapping: 48 Number of HSP's successfully gapped in prelim test: 77 Number of HSP's that attempted gapping in prelim test: 180739 Number of HSP's gapped (non-prelim): 129 length of query: 162 length of database: 4,792,584,752 effective HSP length: 123 effective length of query: 39 effective length of database: 3,072,444,020 effective search space: 119825316780 effective search space used: 119825316780 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.1 bits) S2: 75 (33.6 bits)