BLASTP 2.2.22 [Sep-27-2009]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= 537021.9.peg.1073_1
(162 letters)
Database: nr
13,984,884 sequences; 4,792,584,752 total letters
Searching..................................................done
>gi|315122533|ref|YP_004063022.1| hypothetical protein CKC_03925 [Candidatus Liberibacter
solanacearum CLso-ZC1]
gi|313495935|gb|ADR52534.1| hypothetical protein CKC_03925 [Candidatus Liberibacter
solanacearum CLso-ZC1]
Length = 331
Score = 151 bits (382), Expect = 2e-35, Method: Composition-based stats.
Identities = 141/161 (87%), Positives = 148/161 (91%)
Query: 1 MTKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYA 60
MTKKAKDG QR F+KQ+LD VMQEGYQNG NFRHI+VSPYVKSEFVRFMSDSNVASFRYA
Sbjct: 170 MTKKAKDGAQRPFTKQLLDAVMQEGYQNGANFRHIVVSPYVKSEFVRFMSDSNVASFRYA 229
Query: 61 TSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHE 120
S S NNTIVATADIYD PFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLR I E
Sbjct: 230 VSDNSKNNTIVATADIYDGPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRNIQE 289
Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADLFGLSKT 161
DKNIAK GDANKGVLIGEG LKV+NEKA+GV++DLFGLSKT
Sbjct: 290 DKNIAKTGDANKGVLIGEGTLKVKNEKAIGVISDLFGLSKT 330
>gi|227822441|ref|YP_002826413.1| putative phage major head protein [Sinorhizobium fredii NGR234]
gi|227341442|gb|ACP25660.1| putative phage major head protein [Sinorhizobium fredii NGR234]
Length = 331
Score = 150 bits (378), Expect = 8e-35, Method: Composition-based stats.
Identities = 109/162 (67%), Positives = 124/162 (76%)
Query: 1 MTKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYA 60
+T DGTQR F+K ILD VMQ+GYQNG NFRH+ VSPYVKS FV FMSDSNVA FRYA
Sbjct: 170 LTVAPTDGTQRPFTKAILDTVMQQGYQNGANFRHVSVSPYVKSVFVTFMSDSNVAPFRYA 229
Query: 61 TSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHE 120
S NTIVATAD Y+ PFG VM+HPNRV A N ARNAF ID +ML FLWLR I E
Sbjct: 230 VSQGGERNTIVATADYYEGPFGTVMIHPNRVQAVNGGVARNAFFIDSDMLSFLWLRQIQE 289
Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADLFGLSKTT 162
D+++AK GDA+KGV+IGEG LKV NE+ +GV ADLFGLS +
Sbjct: 290 DRDVAKTGDADKGVIIGEGTLKVHNERGLGVAADLFGLSSAS 331
>gi|150397033|ref|YP_001327500.1| hypothetical protein Smed_1830 [Sinorhizobium medicae WSM419]
gi|150028548|gb|ABR60665.1| hypothetical protein Smed_1830 [Sinorhizobium medicae WSM419]
Length = 331
Score = 146 bits (368), Expect = 1e-33, Method: Composition-based stats.
Identities = 110/161 (68%), Positives = 126/161 (78%)
Query: 1 MTKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYA 60
+T DG QR F+K ILD VMQ GYQ+G NFRH+ VSPYVKS FV FMSDSNVA FRYA
Sbjct: 170 LTVAPTDGEQRAFTKSILDSVMQAGYQSGANFRHVSVSPYVKSVFVTFMSDSNVAPFRYA 229
Query: 61 TSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHE 120
S NTIVATAD Y+ PFG VM+HPNRV A+ A TARNAF +D +MLEFLWLR I E
Sbjct: 230 VSQGGERNTIVATADYYEGPFGTVMIHPNRVQAATAATARNAFFLDTDMLEFLWLRKIQE 289
Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADLFGLSKT 161
DK++AK GDA+KGV+IGEG LKV+NEK +GV ADLFGLS+
Sbjct: 290 DKDVAKTGDADKGVIIGEGTLKVKNEKGLGVAADLFGLSEA 330
>gi|291334595|gb|ADD94245.1| putative phage major head protein [uncultured phage
MedDCM-OCT-S04-C136]
Length = 316
Score = 137 bits (346), Expect = 4e-31, Method: Composition-based stats.
Identities = 42/156 (26%), Positives = 73/156 (46%), Gaps = 12/156 (7%)
Query: 5 AKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGK 64
DGTQR F++ L ++ Y G N I+V+ + K + F S
Sbjct: 167 RTDGTQRAFTEDQLKGILVSCYNEGGNPNMIMVNAFNKQKLSGFTGGSTRF-------DA 219
Query: 65 SHNNTIVATADIYDRPFGKVMVHPNRVMASNAETA----RNAFLIDPNMLEFLWLRTIHE 120
+ + ++ + D+Y+ FG + V PNR + TA ++A ++D +LR
Sbjct: 220 AEDRRLITSIDVYESDFGTMQVSPNRFIRGANGTAAKIGQDAHILDMEYWAVSFLRDF-A 278
Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADLF 156
+ A+ DA++ ++ E L+ RNEKA G++ DL
Sbjct: 279 LQTPAQTADADQRFMVAEYTLESRNEKASGLITDLT 314
>gi|160897389|ref|YP_001562971.1| putative phage major head protein [Delftia acidovorans SPH-1]
gi|160362973|gb|ABX34586.1| putative phage major head protein [Delftia acidovorans SPH-1]
Length = 306
Score = 137 bits (346), Expect = 4e-31, Method: Composition-based stats.
Identities = 46/153 (30%), Positives = 72/153 (47%), Gaps = 13/153 (8%)
Query: 3 KKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATS 62
GTQR F + + +V+Q+ Y G I++ P K F F ++
Sbjct: 166 TGYTPGTQRAFVESQVKDVLQKCYTAGGEPDTIMMPPAAKQTFSTFTGNATR-------M 218
Query: 63 GKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDK 122
KS ++ + A+ D Y FG + PNR MA+ R+ FL+ + L +LR +
Sbjct: 219 DKSEDSKLYASVDYYVSDFGTIQAVPNRFMAT-----RDVFLLQSDKLAVAYLRPF-QTT 272
Query: 123 NIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155
+IAK GD++ +I E L+ R KA G V D+
Sbjct: 273 DIAKTGDSDAQQIIVEYTLECRAPKAHGAVYDI 305
>gi|291334838|gb|ADD94478.1| putative phage major head protein [uncultured phage
MedDCM-OCT-S06-C1041]
Length = 323
Score = 135 bits (340), Expect = 2e-30, Method: Composition-based stats.
Identities = 40/152 (26%), Positives = 65/152 (42%), Gaps = 7/152 (4%)
Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61
T GT TF + L V+++ + NG + I+ + K F
Sbjct: 174 TTDPTAGTAATFVEADLKSVVKQCWDNGGDPGVIMTGSFNKQAASGFAGIGTQFRDV-QP 232
Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121
SG +IV ADIY FG+ + NR M + + +D +LR I +
Sbjct: 233 SGPVSPGSIVGAADIYISDFGQHQIVANRFMPAAN-----VYALDMEYWCVAYLRPI-QT 286
Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVA 153
+++ K GD+++ +LI E L +NE A G +
Sbjct: 287 EDLGKTGDSDRRMLISEYTLGSKNEAASGKIY 318
>gi|291334638|gb|ADD94286.1| putative phage major head protein [uncultured phage
MedDCM-OCT-S04-C64]
Length = 323
Score = 134 bits (338), Expect = 3e-30, Method: Composition-based stats.
Identities = 39/152 (25%), Positives = 65/152 (42%), Gaps = 7/152 (4%)
Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61
+ GT TF + L V+++ + NG + I+ + K F
Sbjct: 174 STDPTAGTAATFVEADLKSVVKQCWDNGGDPGVIMTGSFNKQAASGFAGIGTQFRDV-QP 232
Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121
SG +IV ADIY FG+ + NR M + + +D +LR I +
Sbjct: 233 SGPVSPGSIVGAADIYISDFGQHQIVANRFMPAAN-----VYALDMEYWCVAYLRPI-QT 286
Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVA 153
+++ K GD+++ +LI E L +NE A G +
Sbjct: 287 EDLGKTGDSDRRMLISEYTLGSKNEAASGKIY 318
>gi|148257053|ref|YP_001241638.1| putative phage major head protein [Bradyrhizobium sp. BTAi1]
gi|146409226|gb|ABQ37732.1| putative phage Major head protein [Bradyrhizobium sp. BTAi1]
Length = 320
Score = 131 bits (329), Expect = 4e-29, Method: Composition-based stats.
Identities = 41/154 (26%), Positives = 67/154 (43%), Gaps = 13/154 (8%)
Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61
T DGTQ F++ L V+ + NG I+ + K F F S
Sbjct: 175 TGTRTDGTQIAFTEARLKTVLSSIWINGGKPGTILTGAFNKQVFSTFTGRSTAIEE---- 230
Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTI-HE 120
+ + IVA+ D Y+ FGK+ V +R + R+ +++ + +L
Sbjct: 231 ---AKSKKIVASVDAYESDFGKLKVVASRFQRA-----RDVLVLEMDKWAVAYLNGRNMI 282
Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVAD 154
+AK GD+++ ++ E AL RNEKA G + D
Sbjct: 283 SIPLAKTGDSDRRQILAEYALVARNEKASGGIFD 316
>gi|256377352|ref|YP_003101012.1| hypothetical protein Amir_3259 [Actinosynnema mirum DSM 43827]
gi|255921655|gb|ACU37166.1| hypothetical protein Amir_3259 [Actinosynnema mirum DSM 43827]
Length = 329
Score = 126 bits (316), Expect = 1e-27, Method: Composition-based stats.
Identities = 28/159 (17%), Positives = 58/159 (36%), Gaps = 12/159 (7%)
Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGD----NFRHIIVSPYVKSEFVRFMSDSNVASF 57
T + T ++ ++ +++Q+ ++NG ++ + K
Sbjct: 173 TNVITNSTPTPLTESMVIDLLQKVWENGGIQISETATLMCGAWQKRRLTDEFVTRKNYQE 232
Query: 58 RYATSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRT 117
+ G + + FG+V + NR M ++A + P MLE
Sbjct: 233 QTRNVG-------GVSVSTIETDFGRVNIMLNRHMPADAVQVVSLEQCAPVMLEKPG-SG 284
Query: 118 IHEDKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADLF 156
+ +AK G +++ + GE L+ E A G + L
Sbjct: 285 FLFSEPLAKTGASDRVHIYGEIGLEYGPEIAHGKITGLT 323
>gi|163783849|ref|ZP_02178828.1| hypothetical protein HG1285_12862 [Hydrogenivirga sp. 128-5-R1-1]
gi|159880872|gb|EDP74397.1| hypothetical protein HG1285_12862 [Hydrogenivirga sp. 128-5-R1-1]
Length = 291
Score = 125 bits (313), Expect = 2e-27, Method: Composition-based stats.
Identities = 37/151 (24%), Positives = 74/151 (49%), Gaps = 13/151 (8%)
Query: 5 AKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGK 64
A GT R ++ +L++ +Q+ +Q G N ++V+ K F +
Sbjct: 153 ANAGTPRPLTETLLNDGLQQAWQTGGNPDVVVVNGTNKRTISGFTAGLTK-------EIA 205
Query: 65 SHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNI 124
+ + ++A+ D+Y+ FG V V NR M ++ F+++ + +L H+ + +
Sbjct: 206 AKDKRLIASVDVYESDFGLVRVIANRWMPAD-----KVFILEKGRWKVAYLDPFHQ-EEL 259
Query: 125 AKNGDANKGVLIGEGALKVRNEKAVGVVADL 155
+K GD + V++GE L+ R EKA ++ D+
Sbjct: 260 SKVGDRKERVVVGELTLEGRAEKANAIITDI 290
>gi|288817864|ref|YP_003432211.1| putative phage major head protein [Hydrogenobacter thermophilus
TK-6]
gi|288787263|dbj|BAI69010.1| putative phage major head protein [Hydrogenobacter thermophilus
TK-6]
gi|308751463|gb|ADO44946.1| putative phage major head protein [Hydrogenobacter thermophilus
TK-6]
Length = 291
Score = 121 bits (303), Expect = 3e-26, Method: Composition-based stats.
Identities = 40/151 (26%), Positives = 75/151 (49%), Gaps = 13/151 (8%)
Query: 5 AKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGK 64
A G+ R ++ +L++ +Q+ +Q G N ++VS K F +
Sbjct: 153 ANGGSPRALTETLLNDGIQQAWQAGGNPDVVVVSGKNKRTVSSFTAGVTKTV-------A 205
Query: 65 SHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNI 124
+ + +V+T D+Y+ FG V + +R M + F+++ + +LR + + I
Sbjct: 206 AEDKKLVSTVDVYESDFGLVRIVADRWMPDS-----KVFILEKGRFKIAYLRPFKQQE-I 259
Query: 125 AKNGDANKGVLIGEGALKVRNEKAVGVVADL 155
AK GD + V++GE L+VR EKA ++AD+
Sbjct: 260 AKVGDRIERVVVGELTLEVRAEKANAIIADI 290
>gi|226305754|ref|YP_002765714.1| hypothetical protein RER_22670 [Rhodococcus erythropolis PR4]
gi|226184871|dbj|BAH32975.1| hypothetical protein RER_22670 [Rhodococcus erythropolis PR4]
Length = 317
Score = 119 bits (298), Expect = 1e-25, Method: Composition-based stats.
Identities = 33/158 (20%), Positives = 55/158 (34%), Gaps = 12/158 (7%)
Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGD----NFRHIIVSPYVKSEFVRFMSDSNVASF 57
+ +GT ++ +L ++MQ ++NG R I+V K + D
Sbjct: 167 SNVVTNGTPAALTETMLLDLMQLAWENGGIQESETRTIVVGAKQKRALTKIFIDGKGYKE 226
Query: 58 RYATSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRT 117
G + FG+ + NR M + + P LE +
Sbjct: 227 STRNVG-------GVNLQTIETDFGRCNIMLNRYMPAAQLAVVSLEECAPRFLEIPG-KG 278
Query: 118 IHEDKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155
+++AK G + K L GE LK NE + L
Sbjct: 279 HFFAESLAKTGASEKVQLYGEIGLKYGNELKHAKLTGL 316
>gi|221199511|ref|ZP_03572555.1| major head protein [Burkholderia multivorans CGD2M]
gi|221205587|ref|ZP_03578602.1| major head protein [Burkholderia multivorans CGD2]
gi|221174425|gb|EEE06857.1| major head protein [Burkholderia multivorans CGD2]
gi|221180796|gb|EEE13199.1| major head protein [Burkholderia multivorans CGD2M]
Length = 317
Score = 116 bits (291), Expect = 9e-25, Method: Composition-based stats.
Identities = 45/153 (29%), Positives = 72/153 (47%), Gaps = 6/153 (3%)
Query: 3 KKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATS 62
GT R F++ +L E ++ Y+ G N + + P K F + A+ + +
Sbjct: 169 TPPVAGTARPFTEDLLKEALRLAYEAGGNVTQVHMRPVDKQRASGFAGN---ATRMESVN 225
Query: 63 GKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDK 122
G + + IY FG V + PNRVM++ +TA + IDP+M LR + +
Sbjct: 226 GTGKTAVLQTSYSIYASDFGNVAMIPNRVMSAVNDTA--VYAIDPSMWALATLRGFEK-E 282
Query: 123 NIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155
+AK GDA + EG L+ RNE + + DL
Sbjct: 283 ELAKVGDARNWQITWEGTLEARNEASSAQIRDL 315
>gi|167600435|ref|YP_001671935.1| major head protein [Pseudomonas phage LUZ24]
gi|161168298|emb|CAP45463.1| major head protein [Pseudomonas phage LUZ24]
Length = 317
Score = 115 bits (288), Expect = 2e-24, Method: Composition-based stats.
Identities = 39/154 (25%), Positives = 67/154 (43%), Gaps = 11/154 (7%)
Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61
+ G R ++ +L + ++NG I S +K + N+
Sbjct: 171 SDTGTAGDLRLLTEDMLLNASEAIWRNGGQANSIQTSSSIKKAISK-----NMKGRATEI 225
Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121
+ + +N I T D+Y+ FGK + NR N F+ DP M +LR +
Sbjct: 226 TLDASDNRIAQTVDVYESDFGKYTIRANRWFHENT-----MFIFDPKMHALCYLRPFFQH 280
Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155
+ +AK GD+ K L+ E L+V NEK+ ++ D+
Sbjct: 281 E-LAKTGDSEKRQLLVEYTLRVNNEKSGALIRDV 313
>gi|27476049|ref|NP_775251.1| major head protein [Pseudomonas phage PaP3]
gi|27414479|gb|AAL85565.1| major head protein [Pseudomonas phage PaP3]
Length = 317
Score = 114 bits (284), Expect = 6e-24, Method: Composition-based stats.
Identities = 38/154 (24%), Positives = 66/154 (42%), Gaps = 11/154 (7%)
Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61
+ G R ++ +L + ++NG I S +K + N+
Sbjct: 171 SNTGTAGDLRLLTEDMLLNASESIWRNGGQANSIQTSSSIKKAISK-----NMKGRATEI 225
Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121
+ + +N I T D+Y+ FGK + NR N F+ DP M +LR +
Sbjct: 226 TLDASDNRIAQTVDVYESDFGKYTIRANRWFHENT-----LFVFDPKMHSLCYLRPFFQH 280
Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155
+ +AK GD+ K L+ E +V NEK+ ++ D+
Sbjct: 281 E-LAKTGDSEKRQLLVEYTFRVNNEKSGALIRDV 313
>gi|291334405|gb|ADD94061.1| major head protein [uncultured phage MedDCM-OCT-S01-C1]
Length = 344
Score = 108 bits (270), Expect = 2e-22, Method: Composition-based stats.
Identities = 37/158 (23%), Positives = 69/158 (43%), Gaps = 4/158 (2%)
Query: 1 MTKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYA 60
+T K + S Q + + +Q Y+ G H++ +P V +M D+
Sbjct: 180 LTVKYTEDDGVALSFQAVKDAIQGVYEQGGEVTHLMSNPGVIGALSSYMFDNEARVATLT 239
Query: 61 TSGKSHNN---TIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRT 117
+ + N T +++ ++ FG + + PNR+ A AFL+DP + +L
Sbjct: 240 SDQGAPANSKATALSSVNVLVSDFGTIKLVPNRLQPLAANGTAVAFLLDPEYVSLSYLEG 299
Query: 118 IHEDKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155
+AK G A K + + L+V EKA G++ ++
Sbjct: 300 Y-RTDTLAKTGLAEKRQISVDWGLRVHTEKAHGMLVNI 336
>gi|283856246|ref|YP_162122.2| putative phage major head protein [Zymomonas mobilis subsp. mobilis
ZM4]
gi|283775241|gb|AAV89011.2| putative phage major head protein [Zymomonas mobilis subsp. mobilis
ZM4]
Length = 304
Score = 107 bits (268), Expect = 4e-22, Method: Composition-based stats.
Identities = 43/157 (27%), Positives = 70/157 (44%), Gaps = 11/157 (7%)
Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61
T A G R +++ +L EV+ + + +G +I S +K F ++
Sbjct: 159 TTAATTGKARLYTEALLKEVLGDIWVSGGQPNMVITSLKLKQTAAAFPGLASNRR----- 213
Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121
I+A ADIY G+V P+R ++ +A +IDP L I +
Sbjct: 214 DTGDQKARIIAGADIYVSDVGEVQFVPDRFCDNS-----SALVIDPEYWSVATLDPIQK- 267
Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVADLFGL 158
+++A G A++ L E AL+ NE A GV+ADL
Sbjct: 268 RSLATTGLADRDALYTEIALRCHNEAASGVLADLSAA 304
>gi|241760939|ref|ZP_04759028.1| putative phage major head protein [Zymomonas mobilis subsp. mobilis
ATCC 10988]
gi|241374558|gb|EER64019.1| putative phage major head protein [Zymomonas mobilis subsp. mobilis
ATCC 10988]
Length = 238
Score = 107 bits (266), Expect = 8e-22, Method: Composition-based stats.
Identities = 42/157 (26%), Positives = 69/157 (43%), Gaps = 11/157 (7%)
Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61
T A G R +++ +L EV+ + + +G +I S +K F ++
Sbjct: 93 TTAATAGKARLYTEALLKEVLGDIWVSGGQPNMVITSLKLKQTAAAFPGLASNRR----- 147
Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121
I+A ADIY G+V P+R ++ +A +IDP L I +
Sbjct: 148 DTGDQKARIIAGADIYVSDVGEVQFVPDRFCDNS-----SALVIDPEYWSVATLDPIQK- 201
Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVADLFGL 158
+++A G A++ L E AL+ NE GV+ADL
Sbjct: 202 RSLATTGLADRDALYTEIALRCHNEATSGVLADLSAA 238
>gi|261337429|ref|ZP_05965313.1| conserved hypothetical protein [Bifidobacterium gallicum DSM 20093]
gi|270277816|gb|EFA23670.1| conserved hypothetical protein [Bifidobacterium gallicum DSM 20093]
Length = 160
Score = 102 bits (255), Expect = 1e-20, Method: Composition-based stats.
Identities = 30/148 (20%), Positives = 56/148 (37%), Gaps = 12/148 (8%)
Query: 10 QRTFSKQILDEVMQEGYQNGD----NFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKS 65
+T S+ + ++MQ + NG R I+V+ +K R G
Sbjct: 7 AKTLSEDDVLDLMQMAWDNGGIREAETRTIVVNSDLKRALTRIFIKDAGYKEETRNVG-- 64
Query: 66 HNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIA 125
+ + FG+ + + ++ + + + P LE + + +A
Sbjct: 65 -----GVSLQTIETDFGRCNIMLDSLVLKDKMLVLSLDQLAPRFLEIPG-KGHFFVEPLA 118
Query: 126 KNGDANKGVLIGEGALKVRNEKAVGVVA 153
K G ++K L GE L+ NEKA V+
Sbjct: 119 KTGASDKVQLYGEIGLEYGNEKAHAVLT 146
>gi|316934287|ref|YP_004109269.1| putative phage major head protein [Rhodopseudomonas palustris DX-1]
gi|315602001|gb|ADU44536.1| putative phage major head protein [Rhodopseudomonas palustris DX-1]
Length = 304
Score = 99.8 bits (247), Expect = 1e-19, Method: Composition-based stats.
Identities = 34/149 (22%), Positives = 61/149 (40%), Gaps = 11/149 (7%)
Query: 6 KDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKS 65
T ++ + Q Y G +V+P F F + + S
Sbjct: 159 TGATGTKITEANYLDAAQALYDAGSEASITLVTPTNSRTFAGFTGAAGRSRVINDDS--- 215
Query: 66 HNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIH-EDKNI 124
IV ++Y PFG+ + +R + + + L DP M + L+ + + +
Sbjct: 216 --KKIVNAVNLYVSPFGEEKIVLSRHLKAG-----DTLLFDPAMWARVILQGRNWFRETL 268
Query: 125 AKNGDANKGVLIGEGALKVRNEKAVGVVA 153
AK GD+ K +++GE LK +N+KA +V
Sbjct: 269 AKTGDSLKMMIVGEFGLKHKNQKASALVR 297
>gi|307308932|ref|ZP_07588615.1| putative phage major head protein [Sinorhizobium meliloti BL225C]
gi|306900566|gb|EFN31179.1| putative phage major head protein [Sinorhizobium meliloti BL225C]
Length = 309
Score = 98.6 bits (244), Expect = 3e-19, Method: Composition-based stats.
Identities = 35/156 (22%), Positives = 61/156 (39%), Gaps = 13/156 (8%)
Query: 1 MTKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYA 60
+ DGT R S ++ E+ + + G + ++V P K V + +
Sbjct: 167 LVGAVTDGTARALSHALIVEMAEGIHSEGGVPKILLVPPAQKKNIVALSNGTTKFQ---- 222
Query: 61 TSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHE 120
+ T+ +Y+ PFG+ + +R + S+ DP + R + +
Sbjct: 223 ---DASKKTVFGDVTMYETPFGRFDIVTSRDVRSST-----VIGYDPELWAQAVFRGLTK 274
Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADLF 156
K + + GD +I E L RNE G VADL
Sbjct: 275 -KKLPEGGDYEGYQVITEVTLVCRNEAGNGKVADLS 309
>gi|294083943|ref|YP_003550700.1| major head protein [Candidatus Puniceispirillum marinum IMCC1322]
gi|292663515|gb|ADE38616.1| major head protein [Candidatus Puniceispirillum marinum IMCC1322]
Length = 133
Score = 97.5 bits (241), Expect = 5e-19, Method: Composition-based stats.
Identities = 32/143 (22%), Positives = 56/143 (39%), Gaps = 11/143 (7%)
Query: 13 FSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVA 72
++ L + Q Y G + ++ P F S + T+V
Sbjct: 1 MTEAKLLDAHQAAYTAGADVSVFMIKPSDSEIVAGFTGSSGRTRNFNDGTT-----TLVN 55
Query: 73 TADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDANK 132
D+ P+G + V NR + AFL+DP+M LR + ++K GD++K
Sbjct: 56 AVDVMINPYGTLKVVLNRHQLTTH-----AFLLDPSMWRSAVLRPVSRT-LLSKTGDSDK 109
Query: 133 GVLIGEGALKVRNEKAVGVVADL 155
+ EG L+ N+ ++ L
Sbjct: 110 HFCVYEGGLQHLNQSGSAMITGL 132
>gi|256751057|ref|ZP_05491940.1| conserved hypothetical protein [Thermoanaerobacter ethanolicus
CCSD1]
gi|256750167|gb|EEU63188.1| conserved hypothetical protein [Thermoanaerobacter ethanolicus
CCSD1]
Length = 292
Score = 95.5 bits (236), Expect = 2e-18, Method: Composition-based stats.
Identities = 30/151 (19%), Positives = 54/151 (35%), Gaps = 16/151 (10%)
Query: 4 KAKDGTQRTFSKQILDEVMQEGYQNG--DNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61
+ F + +++ + NG N +V+ +K F +S
Sbjct: 152 TENKVSNTAFDEAQFKATVKKLWDNGLGSNQYIALVNADIKEIIDDFYKNSTRYIGNVNE 211
Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121
G + +G V + NR + + + DP + +LR +
Sbjct: 212 YGVVVDKVN--------TNYGVVNLVLNRHVPAG-----KMIVFDPAYVRLAFLR-TPSN 257
Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVV 152
+ +AK GD G +I E LKV N KA+ +
Sbjct: 258 EILAKTGDYISGQVITELTLKVLNAKALALF 288
>gi|229037842|ref|ZP_04189640.1| hypothetical protein bcere0028_57530 [Bacillus cereus AH1271]
gi|228727464|gb|EEL78642.1| hypothetical protein bcere0028_57530 [Bacillus cereus AH1271]
Length = 315
Score = 94.0 bits (232), Expect = 6e-18, Method: Composition-based stats.
Identities = 24/156 (15%), Positives = 51/156 (32%), Gaps = 14/156 (8%)
Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFR--HIIVSPYVKSEFVRFMSDSNVASFRY 59
T +K + +++++ NG +F + V+ + K
Sbjct: 169 TGNTIAAAGAKLNKDLFNQLLRMMATNGADFNEMVLYVNAFNKQVVSEIFG--------- 219
Query: 60 ATSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIH 119
N + F KV + +R + + + + ++ P + + +
Sbjct: 220 --YAPQDRNVGGVNIQQIETDFAKVGIVWDRFVPAGSIGLFDMSVVAPVLQNVPD-KGVL 276
Query: 120 EDKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155
+ +AK G A KG + GE L G + L
Sbjct: 277 FYEELAKTGAAEKGQIYGEIGLAHGPAFMHGSITGL 312
>gi|302389556|ref|YP_003825377.1| hypothetical protein Toce_0992 [Thermosediminibacter oceani DSM
16646]
gi|302200184|gb|ADL07754.1| conserved hypothetical protein [Thermosediminibacter oceani DSM
16646]
Length = 294
Score = 92.5 bits (228), Expect = 2e-17, Method: Composition-based stats.
Identities = 29/146 (19%), Positives = 50/146 (34%), Gaps = 14/146 (9%)
Query: 9 TQRTFSKQILDEVMQEGYQNGD--NFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSH 66
T + +++ ++NG N +V+ +K
Sbjct: 157 TGANLDEANFKATVKKLWENGLGINEYVALVNADLKEAIDAIYDAKYQY------IAPMQ 210
Query: 67 NNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAK 126
+N +G V + NR M + DP L LR + +AK
Sbjct: 211 DNYFGLVVRRVQTNYGNVNIILNRHMP-----VDKMVVFDPAYLRIANLREPVF-EPLAK 264
Query: 127 NGDANKGVLIGEGALKVRNEKAVGVV 152
GD+ KG ++ E LKV + KA+ +
Sbjct: 265 TGDSVKGQVVAELTLKVLSPKAIAMF 290
>gi|217961109|ref|YP_002339677.1| hypothetical protein BCAH187_A3735 [Bacillus cereus AH187]
gi|229140327|ref|ZP_04268882.1| hypothetical protein bcere0013_34260 [Bacillus cereus BDRD-ST26]
gi|217064163|gb|ACJ78413.1| conserved hypothetical protein [Bacillus cereus AH187]
gi|228642888|gb|EEK99164.1| hypothetical protein bcere0013_34260 [Bacillus cereus BDRD-ST26]
Length = 293
Score = 91.7 bits (226), Expect = 3e-17, Method: Composition-based stats.
Identities = 29/155 (18%), Positives = 61/155 (39%), Gaps = 17/155 (10%)
Query: 3 KKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHII--VSPYVKSEFVRFMSDSNVASFRYA 60
T+ +++ + E++++ + G+ V +K + D
Sbjct: 151 NVVNGATKDVLTEKEIKELVKKLWTAGNENGEFYALVGADLKDQIDELYKDRYSYQHVTT 210
Query: 61 TSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHE 120
G + + +GKV +R M ++ A + L+ +LR +
Sbjct: 211 DFG--------IVVNSVETSYGKVNFILDRYMPADKIVAFDVNA-----LKVAFLRQ-PQ 256
Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155
+ + K GD KG ++ E +L+V ++KAV V +L
Sbjct: 257 FEQLGKTGDNVKGQVVAECSLEVGSKKAVA-VYNL 290
>gi|228910960|ref|ZP_04074768.1| hypothetical protein bthur0013_51010 [Bacillus thuringiensis IBL
200]
gi|228848615|gb|EEM93461.1| hypothetical protein bthur0013_51010 [Bacillus thuringiensis IBL
200]
Length = 363
Score = 91.3 bits (225), Expect = 4e-17, Method: Composition-based stats.
Identities = 37/159 (23%), Positives = 53/159 (33%), Gaps = 19/159 (11%)
Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNF----RHIIVSPYVKSEFVRFMSDSNVASF 57
T G + L E ++ ++ G IIV K F + S
Sbjct: 217 TNVINAGGAAVADE-HLIEAFRQIFEKGGYTVGGNYKIIVGATQKIAISAFDNALIRLSR 275
Query: 58 RYATSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRT 117
G+ D Y FG + N M + +ID N L LRT
Sbjct: 276 EDNGRGQV--------VDHYVSDFGNAEILLNNNMHAG-----EILVIDANRLSIRPLRT 322
Query: 118 IHE-DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155
+ + K GD KG+L+GE L+ E A + L
Sbjct: 323 REFAHEFLGKKGDYMKGMLVGEYTLEFLQEAAHAKITGL 361
>gi|257458669|ref|ZP_05623796.1| conserved hypothetical protein [Campylobacter gracilis RM3268]
gi|257443942|gb|EEV19058.1| conserved hypothetical protein [Campylobacter gracilis RM3268]
Length = 328
Score = 90.2 bits (222), Expect = 9e-17, Method: Composition-based stats.
Identities = 27/149 (18%), Positives = 55/149 (36%), Gaps = 14/149 (9%)
Query: 8 GTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHN 67
GT ++ L++++Q + G R + + +K F + +
Sbjct: 190 GTPTVLTEDALNKILQNIWDAGATPRDVFIGAKLKKAINAFATRQF-----------GNE 238
Query: 68 NTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRT-IHEDKNIAK 126
+I ++ D FGKV +R ++ + D + ++ L + E+ +K
Sbjct: 239 KSINSSVVSLDTDFGKVNFRLHRYLSEQNKLDDVLIAGDFSFMKNGLLIPTMIENVTTSK 298
Query: 127 NGDANKGVLIGEGALKVRNEKAVGVVADL 155
A + E L+VRN A + L
Sbjct: 299 T--AKQKRYYTEATLEVRNADAFAIGVGL 325
>gi|196048420|ref|ZP_03115595.1| conserved hypothetical protein [Bacillus cereus 03BB108]
gi|196020677|gb|EDX59409.1| conserved hypothetical protein [Bacillus cereus 03BB108]
Length = 312
Score = 89.8 bits (221), Expect = 1e-16, Method: Composition-based stats.
Identities = 34/164 (20%), Positives = 65/164 (39%), Gaps = 12/164 (7%)
Query: 1 MTKKAKDGTQ--RTFSKQILDEVMQEGYQNG-DNFRHIIVSPYVKSEFVRFMSDSNVASF 57
+ T S L + + ++ G + + V + V
Sbjct: 150 LINSRNKFTPKDGKLSADDLIKAFRLCWEKGAGGDKLVQCGSQVAEFIDKLFK---VEKG 206
Query: 58 RYATSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRT 117
+ + N I TAD+ +G+ + NR M A T ++D N ++ LR
Sbjct: 207 VMIPAIQGGGNIIGLTADVIHTRYGRGNIVLNRHMPDGALT-----IVDLNQVKLRPLRK 261
Query: 118 IHEDKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADLFGLSKT 161
+ + +AKNGD+ K +++GE +L+++N A V+ + G
Sbjct: 262 MLA-EPLAKNGDSQKRMIVGEYSLELKNSYAGAVINGITGYVDP 304
>gi|331088860|ref|ZP_08337770.1| hypothetical protein HMPREF1025_01353 [Lachnospiraceae bacterium
3_1_46FAA]
gi|330407383|gb|EGG86886.1| hypothetical protein HMPREF1025_01353 [Lachnospiraceae bacterium
3_1_46FAA]
Length = 314
Score = 89.4 bits (220), Expect = 1e-16, Method: Composition-based stats.
Identities = 27/145 (18%), Positives = 51/145 (35%), Gaps = 9/145 (6%)
Query: 9 TQRTFSKQILDEVMQEGYQNGDNFRHI-IVSPYVKSEFVRFMSDSNVASFRYATSGKSHN 67
T S+ + +Q+ + +G + V+ K N + S
Sbjct: 161 TAGALSEDHFLDALQKMWDHGAQGAYFAFVNATQKRAINALAKTGNNVRWVLDN--GSVT 218
Query: 68 NTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKN 127
N FG++ + +R M +N +D + + LR +++ K
Sbjct: 219 NVYGIAVSKIVTDFGEISLVLDRYMDNNT-----ILTLDLDEVRIAELRGT-FYEDLPKA 272
Query: 128 GDANKGVLIGEGALKVRNEKAVGVV 152
GD KG +I E +K+ N A +
Sbjct: 273 GDYFKGHVINESTIKLLNSYAGSKI 297
>gi|238909129|ref|YP_002939596.1| hypothetical protein EUBELI_10025 [Eubacterium eligens ATCC 27750]
gi|238873366|gb|ACR73075.1| Hypothetical protein EUBELI_10025 [Eubacterium eligens ATCC 27750]
Length = 304
Score = 89.4 bits (220), Expect = 2e-16, Method: Composition-based stats.
Identities = 24/145 (16%), Positives = 53/145 (36%), Gaps = 9/145 (6%)
Query: 9 TQRTFSKQILDEVMQEGYQNGDNFRHI-IVSPYVKSEFVRFMSDSNVASFRYATSGKSHN 67
T+ ++ + +Q+ + +G + V+ +K N R+ S
Sbjct: 161 TKGALTEDHFLDALQKMWDHGAQGEYFSFVNANIKRMINNLAKAGNNV--RFLGDNGSMQ 218
Query: 68 NTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKN 127
N + FG++ + +R + +D ++ LR +++ K
Sbjct: 219 NVLGIGVQKIVTDFGEISLVLDRY-----ADTKTILTVDLGEVQIAELRGT-FYEDLPKA 272
Query: 128 GDANKGVLIGEGALKVRNEKAVGVV 152
GD KG ++ E +K+ N A +
Sbjct: 273 GDYFKGHVLNESTIKLLNSHAGSKI 297
>gi|281417131|ref|ZP_06248151.1| conserved hypothetical protein [Clostridium thermocellum JW20]
gi|281408533|gb|EFB38791.1| conserved hypothetical protein [Clostridium thermocellum JW20]
Length = 292
Score = 89.4 bits (220), Expect = 2e-16, Method: Composition-based stats.
Identities = 34/141 (24%), Positives = 56/141 (39%), Gaps = 18/141 (12%)
Query: 14 SKQILDEVMQEGYQNG---DNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTI 70
++ V++ + G F + V+ K F ++ + T
Sbjct: 165 TQNDFKAVVKRLWDCGLGSGEF-YCFVNADFKEIIDGFYANQINYNMPMN--------TF 215
Query: 71 VATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDA 130
A+ +G V V NR M ++DP L ++LR + +AK+GD
Sbjct: 216 GFVANKIITNYGIVNVVLNRHMP-----VDKILVVDPTYLRLVYLRK-PSFEMLAKDGDN 269
Query: 131 NKGVLIGEGALKVRNEKAVGV 151
KG++I E LK+ N KAV V
Sbjct: 270 LKGMVITELTLKLLNSKAVAV 290
>gi|163937921|ref|YP_001642807.1| hypothetical protein BcerKBAB4_5338 [Bacillus weihenstephanensis
KBAB4]
gi|163865776|gb|ABY46832.1| hypothetical protein BcerKBAB4_5338 [Bacillus weihenstephanensis
KBAB4]
Length = 391
Score = 89.0 bits (219), Expect = 2e-16, Method: Composition-based stats.
Identities = 31/152 (20%), Positives = 54/152 (35%), Gaps = 19/152 (12%)
Query: 16 QILDEVMQEGYQNGDNFR----HIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIV 71
+L + ++ G IIV K +F + G+
Sbjct: 233 DMLINAFRSIFEKGGMNSGGNFKIIVGGTQKVAISQFGNAQIRLDRLDNGRGQV------ 286
Query: 72 ATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTI-HEDKNIAKNGDA 130
D Y FG + N + S+ ++D N + L T + + K GD
Sbjct: 287 --VDHYMSDFGAAEIVLNNNLRSD-----EILIVDSNRISIRPLTTREFSHEFLGKKGDY 339
Query: 131 NKGVLIGEGALKVRNEKAVGVVADLFGLSKTT 162
KG+L+GE ++ E+A + + G +K T
Sbjct: 340 MKGMLVGEYTMEFLQEQAHARITGV-GTAKPT 370
>gi|227821704|ref|YP_002825674.1| putative phage related protein [Sinorhizobium fredii NGR234]
gi|227340703|gb|ACP24921.1| putative phage related protein [Sinorhizobium fredii NGR234]
Length = 123
Score = 88.6 bits (218), Expect = 2e-16, Method: Composition-based stats.
Identities = 76/110 (69%), Positives = 85/110 (77%)
Query: 26 YQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATADIYDRPFGKVM 85
YQNG NFRH+ SPYVKS FV FMSDSNVA FRYA S NTIVATAD Y+ PFG VM
Sbjct: 7 YQNGANFRHVSASPYVKSVFVTFMSDSNVAPFRYAVSKGGERNTIVATADYYEGPFGTVM 66
Query: 86 VHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDANKGVL 135
+HPNRV A+ A ARNAF ID +ML FLWLR I EDK++AK GDA+KGV+
Sbjct: 67 IHPNRVQATGATQARNAFFIDTDMLSFLWLRKIQEDKDVAKTGDADKGVI 116
>gi|150021335|ref|YP_001306689.1| hypothetical protein Tmel_1457 [Thermosipho melanesiensis BI429]
gi|149793856|gb|ABR31304.1| hypothetical protein Tmel_1457 [Thermosipho melanesiensis BI429]
Length = 362
Score = 88.6 bits (218), Expect = 3e-16, Method: Composition-based stats.
Identities = 28/146 (19%), Positives = 54/146 (36%), Gaps = 14/146 (9%)
Query: 9 TQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNN 68
+ T+S+ ++ Y NG ++ K F SD + +
Sbjct: 228 STNTWSEDNFRAFLKLIYDNGGYITEAWMNASTKQYFNSLNSDKLIVTQ--------DER 279
Query: 69 TIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTI-HEDKNIAKN 127
T D Y +G++ + + + ++D + L+ L + +AK
Sbjct: 280 TAGRLVDGYLSEYGQISLKTSPHIPEGM-----IIVVDTSNLKIKPLSGRNMAYEPLAKT 334
Query: 128 GDANKGVLIGEGALKVRNEKAVGVVA 153
GD+ KG ++GE L+ RN G+
Sbjct: 335 GDSVKGQIVGEYTLEFRNPDGAGIFY 360
>gi|152975085|ref|YP_001374602.1| hypothetical protein Bcer98_1285 [Bacillus cereus subsp. cytotoxis
NVH 391-98]
gi|152023837|gb|ABS21607.1| conserved hypothetical protein [Bacillus cytotoxicus NVH 391-98]
Length = 293
Score = 88.6 bits (218), Expect = 3e-16, Method: Composition-based stats.
Identities = 29/151 (19%), Positives = 60/151 (39%), Gaps = 16/151 (10%)
Query: 3 KKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHII--VSPYVKSEFVRFMSDSNVASFRYA 60
T+ +++ + E++++ + G+ V +K + D
Sbjct: 151 NVVNGVTKDVLTEKEVKELVKKLWTAGNENGEFYALVGADLKDQIDELYKDRYSYQHVTT 210
Query: 61 TSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHE 120
G T D + +GK+ +R M ++ A + I + +LR +
Sbjct: 211 DFG--------ITVDSVNTSYGKINFILDRYMPADKIVAFDVNAI-----KVAFLRQ-PQ 256
Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGV 151
+ + K GD +KG ++ E L+V ++KAV V
Sbjct: 257 FQALGKTGDNDKGQVVAEATLEVGSKKAVAV 287
>gi|229162523|ref|ZP_04290484.1| hypothetical protein bcere0009_32950 [Bacillus cereus R309803]
gi|228621002|gb|EEK77867.1| hypothetical protein bcere0009_32950 [Bacillus cereus R309803]
Length = 293
Score = 87.8 bits (216), Expect = 4e-16, Method: Composition-based stats.
Identities = 29/155 (18%), Positives = 58/155 (37%), Gaps = 17/155 (10%)
Query: 3 KKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHII--VSPYVKSEFVRFMSDSNVASFRYA 60
T ++ + E++++ + G+ V +K + D
Sbjct: 151 NVVNGATANVLQEKEVKELVKKLWTAGNENGEFYALVGADIKDQIDELYKDRYSYQHVTN 210
Query: 61 TSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHE 120
G + + +GKV +R M ++ A + L+ +LR +
Sbjct: 211 DFG--------IVVNSVETSYGKVNFILDRYMPADKIVAFDINA-----LKVAFLRQ-PQ 256
Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155
+ + K GD KG ++ E L+V ++KAV V +L
Sbjct: 257 FEALGKTGDNIKGQVVAEATLEVGSKKAVA-VYNL 290
>gi|114566839|ref|YP_753993.1| hypothetical protein Swol_1314 [Syntrophomonas wolfei subsp. wolfei
str. Goettingen]
gi|114337774|gb|ABI68622.1| hypothetical protein Swol_1314 [Syntrophomonas wolfei subsp. wolfei
str. Goettingen]
Length = 398
Score = 87.8 bits (216), Expect = 5e-16, Method: Composition-based stats.
Identities = 27/143 (18%), Positives = 59/143 (41%), Gaps = 16/143 (11%)
Query: 12 TFSKQILDEVMQEGYQNGDNFRH--IIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNT 69
T ++ + E +++ + G + +V+ +K + ++ + +
Sbjct: 160 TITEPKIKETVKKLWDAGMSSGTYIAMVNADLKEDIDALYANKYSY--------IAQESL 211
Query: 70 IVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGD 129
+G + + NR M ++ + DP ++ +LR H + +AK GD
Sbjct: 212 FGLIVSSIQTNYGTIKLVLNRHMPTD-----KMVVFDPAFVKISYLRQPHF-ELLAKTGD 265
Query: 130 ANKGVLIGEGALKVRNEKAVGVV 152
+ G +I E LK+ N+KA+ V
Sbjct: 266 SINGHVIAEMTLKMLNQKAIAVF 288
>gi|154174521|ref|YP_001409081.1| hypothetical protein CCV52592_0028 [Campylobacter curvus 525.92]
gi|112803013|gb|EAU00357.1| conserved hypothetical protein [Campylobacter curvus 525.92]
Length = 327
Score = 87.5 bits (215), Expect = 6e-16, Method: Composition-based stats.
Identities = 26/149 (17%), Positives = 54/149 (36%), Gaps = 14/149 (9%)
Query: 8 GTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHN 67
G ++ +L +++Q+ + +G + + + +K + + +
Sbjct: 190 GANTPLTEDVLSQILQQIWDSGATPKDVFIGAALKPAINKLATRQF-----------GNE 238
Query: 68 NTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFL-WLRTIHEDKNIAK 126
I + D FGKV +R +++ A D + +L T ED +K
Sbjct: 239 KAINSRVVSLDTDFGKVNFRMHRFLSAKYGLADTLIAGDFEFAKNGLFLPTEIEDVPTSK 298
Query: 127 NGDANKGVLIGEGALKVRNEKAVGVVADL 155
A + E L++RN A + L
Sbjct: 299 T--AKQKRYYTECCLEIRNPAAFAIGVGL 325
>gi|169827502|ref|YP_001697660.1| hypothetical protein Bsph_1941 [Lysinibacillus sphaericus C3-41]
gi|168991990|gb|ACA39530.1| conserved hypothetical protein [Lysinibacillus sphaericus C3-41]
Length = 288
Score = 87.5 bits (215), Expect = 6e-16, Method: Composition-based stats.
Identities = 27/151 (17%), Positives = 60/151 (39%), Gaps = 16/151 (10%)
Query: 5 AKDGTQRTFSKQILDEVMQEGYQNGDNFRHI--IVSPYVKSEFVRFMSDSNVASFRYATS 62
A + ++ + +VM+ + + V+ +K + D+ + +
Sbjct: 149 ATNAVTAPIDEKAIKQVMRNLWNQNLAEGQVYAFVNADIKEKIDAIYKDNYSYNHVTTSF 208
Query: 63 GKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDK 122
G D + +G V ++ + ++ + ++ +LR H +
Sbjct: 209 G--------LLVDEINTNYGSVKFVLSKHVPAD-----KVVFFNDAYVDLAYLREPHF-E 254
Query: 123 NIAKNGDANKGVLIGEGALKVRNEKAVGVVA 153
+AK GD+ KG ++ E LKV ++KAV V+
Sbjct: 255 PLAKTGDSVKGQVVAEATLKVGSKKAVAVLT 285
>gi|167583566|ref|YP_001671756.1| major head protein [Enterobacteria phage phiEco32]
gi|164375404|gb|ABY52812.1| major head protein [Enterobacteria phage phiEco32]
Length = 352
Score = 85.9 bits (211), Expect = 2e-15, Method: Composition-based stats.
Identities = 27/152 (17%), Positives = 55/152 (36%), Gaps = 11/152 (7%)
Query: 3 KKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATS 62
+ T F + + ++ + Y G I+++P F ++ + R +
Sbjct: 203 NASNPTTNIGFDEADIFDMTLQLYTAGSEADIIMINPAHAKIFAGLQENTQGSRKRIFEN 262
Query: 63 GKSHNNTIVATADIYDRPFGK-VMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121
+ + P G+ + NR M ++A + + LR
Sbjct: 263 T----KQFIYEVNSITDPLGQSYKIIVNRWMPTDA-----VYFFRSADWTQMVLRAPKRT 313
Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVA 153
+ +AK+G K ++ E L+ RN A GV+
Sbjct: 314 E-LAKDGSYEKWMIEMEVGLRHRNPYASGVLF 344
>gi|153951462|ref|YP_001398222.1| hypothetical protein JJD26997_1140 [Campylobacter jejuni subsp.
doylei 269.97]
gi|152938908|gb|ABS43649.1| hypothetical protein JJD26997_1140 [Campylobacter jejuni subsp.
doylei 269.97]
Length = 344
Score = 85.5 bits (210), Expect = 2e-15, Method: Composition-based stats.
Identities = 24/156 (15%), Positives = 56/156 (35%), Gaps = 14/156 (8%)
Query: 7 DGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSH 66
GT ++ L+ ++Q + +G + + + +K +F + +
Sbjct: 190 SGTSTELTEDKLNIILQTIWNSGVAPKDVFLGADLKGAINKFATRIL-----------GN 238
Query: 67 NNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFL-WLRTIHEDKNIA 125
+V + FG V H +R++++ D ++ ++ T+ E+
Sbjct: 239 ETKLVGQVVSLETDFGTVNFHMHRLLSAKYGLGDVLIAGDFEYMKHGLYIPTMIEEVTTD 298
Query: 126 KNGDANKGVLIGEGALKVRNEKAVGVVADLFGLSKT 161
A + + L+VRN A + L + T
Sbjct: 299 ITAKAKRF--YTQSTLEVRNADAFAIGVGLTSTNNT 332
>gi|291526329|emb|CBK91916.1| hypothetical protein EUR_29920 [Eubacterium rectale DSM 17629]
Length = 304
Score = 85.5 bits (210), Expect = 2e-15, Method: Composition-based stats.
Identities = 24/145 (16%), Positives = 55/145 (37%), Gaps = 9/145 (6%)
Query: 9 TQRTFSKQILDEVMQEGYQNGDNFRHI-IVSPYVKSEFVRFMSDSNVASFRYATSGKSHN 67
T+ +++ + +QE +++G + V+ +K N R+ S
Sbjct: 161 TKGALTEEHFLDALQEMWKHGAQGEYFSFVNANIKRMINDLAKAGNNV--RFLGDNGSMQ 218
Query: 68 NTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKN 127
N + FG++ + +R + +D ++ LR +++ K
Sbjct: 219 NVLGIGVQKIVTDFGEISLVLDRY-----ADTKTILTVDLAEVQIAELRGT-FYEDLPKA 272
Query: 128 GDANKGVLIGEGALKVRNEKAVGVV 152
GD +G ++ E +K+ N A +
Sbjct: 273 GDYYRGHVLNESTIKLLNSFAGSKI 297
>gi|229187822|ref|ZP_04314947.1| hypothetical protein bcere0004_53480 [Bacillus cereus BGSC 6E1]
gi|228595657|gb|EEK53352.1| hypothetical protein bcere0004_53480 [Bacillus cereus BGSC 6E1]
Length = 313
Score = 85.1 bits (209), Expect = 2e-15, Method: Composition-based stats.
Identities = 30/154 (19%), Positives = 66/154 (42%), Gaps = 10/154 (6%)
Query: 9 TQRTFSKQILDEVMQEGYQNG-DNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHN 67
T S + + + + ++ G + + V + V + +
Sbjct: 160 TSGKLSAEDVTKAFRLCWEKGAGGDKLVQCGSAVAEFIDKLFK---VDKGVVIPAIQGGG 216
Query: 68 NTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKN 127
N I T D+ +G+ + NR M A T ++D N + LR + + ++K
Sbjct: 217 NIIGITVDVIHTRYGRGNIVLNRHMPDGALT-----IVDLNQTKLRPLRKM-AAQKLSKI 270
Query: 128 GDANKGVLIGEGALKVRNEKAVGVVADLFGLSKT 161
GD++K +++GE +L+++N A V++ + G+ +
Sbjct: 271 GDSDKQMIVGEYSLELKNSYAGAVISGITGIVEP 304
>gi|290457630|sp|P85987|CAPSD_BPSK1 RecName: Full=Major capsid protein; AltName: Full=Virion protein G
gi|221271431|dbj|BAH15184.1| major capsid protein [Serratia phage KSP100]
Length = 306
Score = 84.4 bits (207), Expect = 4e-15, Method: Composition-based stats.
Identities = 29/155 (18%), Positives = 56/155 (36%), Gaps = 13/155 (8%)
Query: 1 MTKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYA 60
+ + T + L E+ + Y G + I+ P + F S +
Sbjct: 149 LIDASMKKTA-DLDEATLFEMTAKLYTEGADPTLIMYHPSNANFFASLQEKSGTRMRIFE 207
Query: 61 TSGKSHNNTIVATADIYDRPFG-KVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIH 119
+ V + P G ++ PNR +A ++ +P+ L LR
Sbjct: 208 N-----DKRFVKQVEYIVDPLGQELKCIPNRWCPEDAT-----YIFNPSDLGMAVLRAPK 257
Query: 120 EDKNIAKNGDANKGVLIGEGALKVRNEKAVGVVAD 154
+ +AK+G A K ++ E ++ N KA ++
Sbjct: 258 KV-ALAKSGSAEKYMIEQEVGFRLNNPKAAALIIG 291
>gi|57237589|ref|YP_178603.1| hypothetical protein CJE0587 [Campylobacter jejuni RM1221]
gi|57166393|gb|AAW35172.1| hypothetical protein CJE0587 [Campylobacter jejuni RM1221]
Length = 344
Score = 84.0 bits (206), Expect = 7e-15, Method: Composition-based stats.
Identities = 23/151 (15%), Positives = 53/151 (35%), Gaps = 14/151 (9%)
Query: 7 DGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSH 66
GT ++ L++++Q + +G + + + +K +F + +
Sbjct: 190 SGTATELTEDKLNQILQTIWNSGVTPKDVFLGADLKGAINKFATRIL-----------GN 238
Query: 67 NNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFL-WLRTIHEDKNIA 125
+ + FG V H +R+++ D ++ ++ T+ ED
Sbjct: 239 ETKLAGQVVSLETDFGTVNFHMHRLLSPKYGLGDVLIAGDFEYMKHGLYIPTMIEDVPTD 298
Query: 126 KNGDANKGVLIGEGALKVRNEKAVGVVADLF 156
A + + L+VRN A + L
Sbjct: 299 ITAKAKRF--YTQSTLEVRNADAFAIGVGLT 327
>gi|283956330|ref|ZP_06373810.1| hypothetical protein C1336_000250101 [Campylobacter jejuni subsp.
jejuni 1336]
gi|283792050|gb|EFC30839.1| hypothetical protein C1336_000250101 [Campylobacter jejuni subsp.
jejuni 1336]
Length = 344
Score = 83.6 bits (205), Expect = 8e-15, Method: Composition-based stats.
Identities = 23/151 (15%), Positives = 53/151 (35%), Gaps = 14/151 (9%)
Query: 7 DGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSH 66
GT ++ L++++Q + +G + + + +K +F + +
Sbjct: 190 SGTATELTEDKLNQILQTIWNSGVTPKDVFLGADLKGAINKFATRIL-----------GN 238
Query: 67 NNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFL-WLRTIHEDKNIA 125
+ + FG V H +R+++ D ++ ++ T+ ED
Sbjct: 239 ETKLAGQVVSLETDFGTVNFHMHRLLSPKYGLGDVLIAGDFEYMKHGLYIPTMIEDVPTD 298
Query: 126 KNGDANKGVLIGEGALKVRNEKAVGVVADLF 156
A + + L+VRN A + L
Sbjct: 299 ITAKAKRF--YTQSTLEVRNADAFAIGVGLT 327
>gi|315929828|gb|EFV08993.1| hypothetical protein CSS_0883 [Campylobacter jejuni subsp. jejuni
305]
Length = 344
Score = 82.4 bits (202), Expect = 2e-14, Method: Composition-based stats.
Identities = 22/151 (14%), Positives = 53/151 (35%), Gaps = 14/151 (9%)
Query: 7 DGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSH 66
GT ++ L++++Q + +G + + + +K +F + +
Sbjct: 190 SGTATELTEDKLNQILQTIWNSGVTPKDVFLGADLKGAINKFATRIL-----------GN 238
Query: 67 NNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFL-WLRTIHEDKNIA 125
+ + FG V H +R+++ D ++ ++ T+ ED
Sbjct: 239 ETKLAGQVVSLETDFGTVNFHMHRLLSPKYGLGDVLIAGDFEYMKHGLYIPTMIEDVPTD 298
Query: 126 KNGDANKGVLIGEGALKVRNEKAVGVVADLF 156
A + + L++RN A + L
Sbjct: 299 ITAKAKRF--YTQSTLEIRNADAFAIGVGLT 327
>gi|257451764|ref|ZP_05617063.1| hypothetical protein F3_01776 [Fusobacterium sp. 3_1_5R]
gi|317058321|ref|ZP_07922806.1| predicted protein [Fusobacterium sp. 3_1_5R]
gi|313683997|gb|EFS20832.1| predicted protein [Fusobacterium sp. 3_1_5R]
Length = 371
Score = 79.4 bits (194), Expect = 2e-13, Method: Composition-based stats.
Identities = 31/155 (20%), Positives = 67/155 (43%), Gaps = 18/155 (11%)
Query: 6 KDGTQRTFSKQILDEVMQEGYQNGDNFR----HIIVSPYVKSEFVRFMSDSNVASFRYAT 61
D + S +I+ V+++ ++ G + + V K + + + D A
Sbjct: 228 VDAGGQEISLEIIGNVLRKIFEAGGDVNGGNYALYVPGVQKVKISKLLKDYIQA------ 281
Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTI-HE 120
NT+ A A+ FG + + P + A +++ + + LRT +
Sbjct: 282 --PPSENTLGAVANYVATDFGTLPIIP-----TVNLRADEMMILNHDDITLQVLRTRELQ 334
Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155
+ + K GD KG+++ E +++VRN +G++ +L
Sbjct: 335 HEYMGKTGDNTKGLIVTELSVEVRNIPTMGMIMNL 369
>gi|319649918|ref|ZP_08004068.1| hypothetical protein HMPREF1013_00673 [Bacillus sp. 2_A_57_CT2]
gi|317398356|gb|EFV79044.1| hypothetical protein HMPREF1013_00673 [Bacillus sp. 2_A_57_CT2]
Length = 292
Score = 79.4 bits (194), Expect = 2e-13, Method: Composition-based stats.
Identities = 24/145 (16%), Positives = 58/145 (40%), Gaps = 16/145 (11%)
Query: 11 RTFSKQILDEVMQEGYQNGDNFRHI--IVSPYVKSEFVRFMSDSNVASFRYATSGKSHNN 68
++ + +VM+ + ++ ++ +K + D S + G
Sbjct: 157 GAITEDDVKKVMRNLWNQDLAEGNVYAFLNADLKEQIDAIYKDKYGYSHVTTSFG----- 211
Query: 69 TIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNG 128
+ + +G V ++ + ++ + + ++ +LR H + +AK G
Sbjct: 212 ---LLVESINTNYGTVHFVLSKHVPAD-----KIVVFNDAYVDLAYLREPHF-EPLAKTG 262
Query: 129 DANKGVLIGEGALKVRNEKAVGVVA 153
D+ KG +I E +KV ++K V V+
Sbjct: 263 DSVKGQVIAEATVKVGSKKGVAVLT 287
>gi|291336566|gb|ADD96115.1| hypothetical protein HG1285_12862 [uncultured organism
MedDCM-OCT-S04-C6]
Length = 347
Score = 77.4 bits (189), Expect = 5e-13, Method: Composition-based stats.
Identities = 24/155 (15%), Positives = 55/155 (35%), Gaps = 11/155 (7%)
Query: 6 KDGTQRT-FSKQILDEVMQEGYQNGDNFR--HIIVSPYVKSEFVRFMSDSNVASFRYATS 62
+G ++ + ++Q + + ++ F F A + +
Sbjct: 188 VNGDPTADITESDIQTMLQTIWSATGMTGDYKLFADATLRRAFTDFTRTIATAGYSQRSL 247
Query: 63 GKSHNNTIV-ATADIYDRPFGKVMVHPNRVMA------SNAETARNAFLIDPNMLEFLWL 115
+ T + T I+D FG V V + + + A +L+D + ++
Sbjct: 248 NYDGDGTRISNTTTIFDGDFGSVEVIADNFIGFKDPASGTTQEAGRGYLLDMDKIDLRVN 307
Query: 116 RTIHEDKNIAKNGDANKGVLIGEGALKVRNEKAVG 150
+ + + G + ++ G AL+VRN +G
Sbjct: 308 KQ-PQIERFEDKGGGERFLIEGRAALQVRNPIGLG 341
>gi|256375780|ref|YP_003099440.1| hypothetical protein Amir_1645 [Actinosynnema mirum DSM 43827]
gi|255920083|gb|ACU35594.1| hypothetical protein Amir_1645 [Actinosynnema mirum DSM 43827]
Length = 406
Score = 77.4 bits (189), Expect = 5e-13, Method: Composition-based stats.
Identities = 21/150 (14%), Positives = 47/150 (31%), Gaps = 8/150 (5%)
Query: 9 TQRTFSKQILDEVMQEGYQNGDNFR---HIIVSPYVKSEFVRFMSDSNVASFRYATSGKS 65
+ S ++ Q + G + + K + +D+ +
Sbjct: 257 SATVLSVDDVNVFAQRVWDGGGLTDGLGVFLCNSSQKIALSKAYADAYGKAVPVT----Q 312
Query: 66 HNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIA 125
+ FG + + +R M +A + + L + ++ +A
Sbjct: 313 GEKVGGVVVERLLTDFGPINIMVDRAMPQDALLLTSLSQLKLQFLLIPG-KGSFFEEPLA 371
Query: 126 KNGDANKGVLIGEGALKVRNEKAVGVVADL 155
K G + + GE L+ + A GV+ L
Sbjct: 372 KTGASEDTQIYGEVGLEYGSPLAHGVLRGL 401
>gi|281355462|ref|ZP_06241956.1| hypothetical protein Vvad_PD3568 [Victivallis vadensis ATCC
BAA-548]
gi|281318342|gb|EFB02362.1| hypothetical protein Vvad_PD3568 [Victivallis vadensis ATCC
BAA-548]
Length = 403
Score = 75.9 bits (185), Expect = 1e-12, Method: Composition-based stats.
Identities = 17/145 (11%), Positives = 47/145 (32%), Gaps = 11/145 (7%)
Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61
T A D + + +++++ + + + G N + +P ++
Sbjct: 228 TALAVDASGGRLTSKLVNDAVMKVTEEGGNPTALYCTPDQARVLANEYKNNITVMRDDKA 287
Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121
G + + + + + +A+L D + W+R + +
Sbjct: 288 RG------VYVAVVVNEANGQNIEII-----GDPDFNDADAWLCDDDCFALSWMRPVGDT 336
Query: 122 KNIAKNGDANKGVLIGEGALKVRNE 146
+ + D ++GE L+ N
Sbjct: 337 DSTPPDADGISRKMVGEFTLEFHNA 361
>gi|257463376|ref|ZP_05627772.1| hypothetical protein FuD12_05953 [Fusobacterium sp. D12]
gi|317060946|ref|ZP_07925431.1| predicted protein [Fusobacterium sp. D12]
gi|313686622|gb|EFS23457.1| predicted protein [Fusobacterium sp. D12]
Length = 369
Score = 74.4 bits (181), Expect = 6e-12, Method: Composition-based stats.
Identities = 28/154 (18%), Positives = 64/154 (41%), Gaps = 18/154 (11%)
Query: 6 KDGTQRTFSKQILDEVMQEGYQNGDNFR----HIIVSPYVKSEFVRFMSDSNVASFRYAT 61
D + S +++ V+++ ++ G + + V K + + + D A
Sbjct: 228 VDAGGQDISLEMIGNVLRKIFEVGGDVDGGNYALYVPGIQKVKISKLLKDYIQA------ 281
Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121
+T+ A A FG + + P + +++ + +E LR
Sbjct: 282 --PPAEHTLGAVATYIATDFGTLPIIP-----TTNLRGDEMMILNHDDIEAKVLRG-LSH 333
Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155
K + + GD KG+++ E +++VRN +G++ +L
Sbjct: 334 KYMGETGDNTKGLIVTELSVQVRNIHTMGMIVNL 367
>gi|319789007|ref|YP_004150640.1| phage protein [Thermovibrio ammonificans HB-1]
gi|317113509|gb|ADU95999.1| phage protein [Thermovibrio ammonificans HB-1]
Length = 373
Score = 73.6 bits (179), Expect = 8e-12, Method: Composition-based stats.
Identities = 22/155 (14%), Positives = 55/155 (35%), Gaps = 17/155 (10%)
Query: 6 KDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKS 65
+ ++ +L++ +++ Q+G + ++ + + F +++ +
Sbjct: 229 VNAAGNDLTQTLLNDTIEKIVQDGGSPNVLVCNTTQARKISAFNANNIIVQR-------- 280
Query: 66 HNNTIVATADIYDRPF--GKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRT--IHED 121
+ T + G + + ++D + L + LR I +
Sbjct: 281 EDQTAGNFVARFVADLPAGVITTI----VVDTNFPKDKIAILDTSRLRLVPLRGSTIRDF 336
Query: 122 KNIAKNGDANKGVLIGEGALKVRNEK-AVGVVADL 155
D ++GE L+VRN K A G++ L
Sbjct: 337 DATPNGADFIARRILGEYTLEVRNAKQAHGLLEGL 371
>gi|134299981|ref|YP_001113477.1| hypothetical protein Dred_2135 [Desulfotomaculum reducens MI-1]
gi|134052681|gb|ABO50652.1| hypothetical protein Dred_2135 [Desulfotomaculum reducens MI-1]
Length = 285
Score = 72.4 bits (176), Expect = 2e-11, Method: Composition-based stats.
Identities = 33/153 (21%), Positives = 54/153 (35%), Gaps = 18/153 (11%)
Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHI-IVSPYVKSEFVRFMSDSNVASFRYA 60
A T TF+K +EV+ + Y G N + + +K + F + +A +
Sbjct: 148 INPANKVTNATFTKTKFEEVVGKMYDAGVNDEMLCFLPAQMKIQLNSFSNVEFLARDMFL 207
Query: 61 TSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHE 120
+ Y P+G V + + F+++PN LE L H
Sbjct: 208 GFD----------TERYVTPYGIVTFVLS------EKLNNKLFIVNPNYLELAELIPFHA 251
Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVA 153
A +G L + LK+ N KA A
Sbjct: 252 VPQ-AVSGSKQSVYLETQFGLKLLNTKAAASFA 283
>gi|134298256|ref|YP_001111752.1| hypothetical protein Dred_0379 [Desulfotomaculum reducens MI-1]
gi|134050956|gb|ABO48927.1| hypothetical protein Dred_0379 [Desulfotomaculum reducens MI-1]
Length = 285
Score = 72.0 bits (175), Expect = 3e-11, Method: Composition-based stats.
Identities = 31/153 (20%), Positives = 53/153 (34%), Gaps = 18/153 (11%)
Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHI-IVSPYVKSEFVRFMSDSNVASFRYA 60
T TF+K +EV+ + Y G N + + +K + F + +A +
Sbjct: 148 INPVNQVTNATFTKTKFEEVVSKMYDAGVNDEMLCFLPAQMKIQLNSFSNVEFLARDMF- 206
Query: 61 TSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHE 120
+ + Y P+G V + + F+++PN LE L H
Sbjct: 207 ---------LGFDTERYVTPYGIVTFVLS------EKLNNKLFIVNPNYLELAELIPFHA 251
Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVA 153
+G L + LK+ N KA A
Sbjct: 252 TVE-PVSGSKQSIYLETQFGLKLLNTKAAASFA 283
>gi|30020036|ref|NP_831667.1| Phage protein [Bacillus cereus ATCC 14579]
gi|31415788|ref|NP_852528.1| hypothetical protein BC1894 [Bacillus phage phBC6A51]
gi|229127327|ref|ZP_04256323.1| hypothetical protein bcere0015_17800 [Bacillus cereus BDRD-Cer4]
gi|29895581|gb|AAP08868.1| Phage protein [Bacillus phage phBC6A51]
gi|228656160|gb|EEL12002.1| hypothetical protein bcere0015_17800 [Bacillus cereus BDRD-Cer4]
Length = 374
Score = 70.5 bits (171), Expect = 7e-11, Method: Composition-based stats.
Identities = 26/147 (17%), Positives = 57/147 (38%), Gaps = 18/147 (12%)
Query: 18 LDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATADIY 77
L++ +++ + G + I+ P + + + D+ + + +G +
Sbjct: 241 LNDAVEQVFTRGGSANTILCGPNIARQITKLGGDTIRTTRQDTAAGYQ--------ILSF 292
Query: 78 DRPF--GKV-MVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDANKGV 134
G + V + M + A L+D + ++ +L I++ D V
Sbjct: 293 VSDLPGGAISSVVVDLNMPKD-----RALLLDTDKVKARYLTPIYDQDATPNGADYFSRV 347
Query: 135 LIGEGALKVRNEKAVGVVADLFGLSKT 161
+ GE +V+N K +A L +SKT
Sbjct: 348 IRGEFGFEVKNAKES--IAVLENISKT 372
>gi|228941057|ref|ZP_04103614.1| hypothetical protein bthur0008_36970 [Bacillus thuringiensis
serovar berliner ATCC 10792]
gi|228973988|ref|ZP_04134562.1| hypothetical protein bthur0003_37430 [Bacillus thuringiensis
serovar thuringiensis str. T01001]
gi|228980577|ref|ZP_04140886.1| hypothetical protein bthur0002_37450 [Bacillus thuringiensis Bt407]
gi|228779138|gb|EEM27396.1| hypothetical protein bthur0002_37450 [Bacillus thuringiensis Bt407]
gi|228785714|gb|EEM33719.1| hypothetical protein bthur0003_37430 [Bacillus thuringiensis
serovar thuringiensis str. T01001]
gi|228818600|gb|EEM64668.1| hypothetical protein bthur0008_36970 [Bacillus thuringiensis
serovar berliner ATCC 10792]
gi|326939625|gb|AEA15521.1| Phage protein [Bacillus thuringiensis serovar chinensis CT-43]
Length = 374
Score = 70.5 bits (171), Expect = 8e-11, Method: Composition-based stats.
Identities = 26/149 (17%), Positives = 58/149 (38%), Gaps = 18/149 (12%)
Query: 16 QILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATAD 75
++L++ +++ + G + I+ P + + + D+ + + +G
Sbjct: 239 KLLNDAVEQVFTRGGSANTILCGPNIARQITKLGGDTIRTTRQDTAAGYQ--------IL 290
Query: 76 IYDRPF--GKV-MVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDANK 132
+ G + V + M + A L+D ++ +L I++ D
Sbjct: 291 SFVSDLPGGAISSVVVDLNMPKD-----RALLLDTEKVKARYLTPIYDQDATPNGADYFS 345
Query: 133 GVLIGEGALKVRNEKAVGVVADLFGLSKT 161
V+ GE +V+N K +A L +SKT
Sbjct: 346 RVIRGEFGFEVKNAKES--IAVLENISKT 372
>gi|218897919|ref|YP_002446330.1| phage protein [Bacillus cereus G9842]
gi|218542918|gb|ACK95312.1| phage protein [Bacillus cereus G9842]
Length = 374
Score = 70.1 bits (170), Expect = 1e-10, Method: Composition-based stats.
Identities = 26/147 (17%), Positives = 56/147 (38%), Gaps = 18/147 (12%)
Query: 18 LDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATADIY 77
L++ +++ + G + I+ P + + + D+ + + +G +
Sbjct: 241 LNDAVEQVFTRGGSANTILCGPNIARQITKLGGDTIRTTRQDTAAGYQ--------ILSF 292
Query: 78 DRPF--GKV-MVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDANKGV 134
G + V + M + A L+D ++ +L I++ D V
Sbjct: 293 VSDLPGGAISSVVVDLNMPKD-----RALLLDTEKVKARYLTPIYDQDATPNGADYFSRV 347
Query: 135 LIGEGALKVRNEKAVGVVADLFGLSKT 161
+ GE +V+N K +A L +SKT
Sbjct: 348 IRGEFGFEVKNAKES--IAVLENISKT 372
>gi|229020770|ref|ZP_04177493.1| hypothetical protein bcere0030_52440 [Bacillus cereus AH1273]
gi|228740571|gb|EEL90846.1| hypothetical protein bcere0030_52440 [Bacillus cereus AH1273]
Length = 374
Score = 70.1 bits (170), Expect = 1e-10, Method: Composition-based stats.
Identities = 26/147 (17%), Positives = 56/147 (38%), Gaps = 18/147 (12%)
Query: 18 LDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATADIY 77
L++ +++ + G + I+ P + + + D+ + + +G +
Sbjct: 241 LNDAVEQVFTRGGSANTILCGPNIARQITKLGGDTIRTTRQDTAAGYQ--------ILSF 292
Query: 78 DRPF--GKV-MVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDANKGV 134
G + V + M + A L+D ++ +L I++ D V
Sbjct: 293 VSDLPGGAISSVVVDLNMPKD-----RALLLDTEKVKARYLTPIYDQDATPNGADYFSRV 347
Query: 135 LIGEGALKVRNEKAVGVVADLFGLSKT 161
+ GE +V+N K +A L +SKT
Sbjct: 348 IRGEFGFEVKNAKES--IAVLENISKT 372
>gi|229190579|ref|ZP_04317576.1| hypothetical protein bcere0002_22460 [Bacillus cereus ATCC 10876]
gi|228592924|gb|EEK50746.1| hypothetical protein bcere0002_22460 [Bacillus cereus ATCC 10876]
Length = 374
Score = 69.7 bits (169), Expect = 1e-10, Method: Composition-based stats.
Identities = 26/147 (17%), Positives = 56/147 (38%), Gaps = 18/147 (12%)
Query: 18 LDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATADIY 77
L++ +++ + G + I+ P + + + D+ + + + T +
Sbjct: 241 LNDAVEQVFTRGGSANTILCGPNIARQITKLGGDTIRTTRQ--------DTTAGYQILSF 292
Query: 78 DRPF--GKV-MVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDANKGV 134
G + V + M + A L+D ++ +L I++ D V
Sbjct: 293 VSDLPGGAISSVVVDLNMPKD-----RALLLDTEKVKARYLTPIYDQDATPNGADYFSRV 347
Query: 135 LIGEGALKVRNEKAVGVVADLFGLSKT 161
+ GE +V+N K +A L +SKT
Sbjct: 348 IRGEFGFEVKNAKES--IAVLENISKT 372
>gi|228968787|ref|ZP_04129749.1| hypothetical protein bthur0004_55460 [Bacillus thuringiensis
serovar sotto str. T04001]
gi|228790850|gb|EEM38489.1| hypothetical protein bthur0004_55460 [Bacillus thuringiensis
serovar sotto str. T04001]
Length = 374
Score = 69.7 bits (169), Expect = 1e-10, Method: Composition-based stats.
Identities = 26/147 (17%), Positives = 56/147 (38%), Gaps = 18/147 (12%)
Query: 18 LDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATADIY 77
L++ +++ + G + I+ P + + + D+ + + +G +
Sbjct: 241 LNDAVEQVFTRGGSANTILCGPNIARQITKLGGDTIRTTRQDTAAGYQ--------ILSF 292
Query: 78 DRPF--GKV-MVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDANKGV 134
G + V + M + A L+D ++ +L I++ D V
Sbjct: 293 VSDLPGGAISSVVVDLNMPKD-----RALLLDTEKVKARYLTPIYDQDATPNGADYFSRV 347
Query: 135 LIGEGALKVRNEKAVGVVADLFGLSKT 161
+ GE +V+N K +A L +SKT
Sbjct: 348 IRGEFGFEVKNAKES--IAVLENISKT 372
>gi|323703894|ref|ZP_08115527.1| hypothetical protein DesniDRAFT_2739 [Desulfotomaculum nigrificans
DSM 574]
gi|323531143|gb|EGB21049.1| hypothetical protein DesniDRAFT_2739 [Desulfotomaculum nigrificans
DSM 574]
Length = 285
Score = 69.7 bits (169), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/153 (19%), Positives = 51/153 (33%), Gaps = 18/153 (11%)
Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHI-IVSPYVKSEFVRFMSDSNVASFRYA 60
A T +F+K +E + + Y G N I + +K F + +A +
Sbjct: 148 IDPANQVTNTSFTKAKFEETVAKMYDAGVNDEMICFIPAQMKIALNGFNNVEFLARDMFQ 207
Query: 61 TSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHE 120
+ Y P+G + + + F+++PN LE L H
Sbjct: 208 GFD----------TEKYVTPYGIITFVLS------EKLQNKLFIVNPNYLELAELIPFHA 251
Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVA 153
+G L + LK+ N KA A
Sbjct: 252 VPQ-PVSGSKQSIYLETQFGLKLLNSKAAASFA 283
>gi|188585861|ref|YP_001917406.1| conserved hypothetical protein [Natranaerobius thermophilus
JW/NM-WN-LF]
gi|179350548|gb|ACB84818.1| conserved hypothetical protein [Natranaerobius thermophilus
JW/NM-WN-LF]
Length = 289
Score = 67.8 bits (164), Expect = 5e-10, Method: Composition-based stats.
Identities = 26/149 (17%), Positives = 53/149 (35%), Gaps = 14/149 (9%)
Query: 1 MTKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHI-IVSPYVKSEFVRFMSDSNVASFRY 59
+ T+ F++ + + +Q+ Y + I + P K + + ++ +
Sbjct: 145 LINSENQKTEDEFNENVFLDTLQKLYDANVHDNMIVFLRPEEKIKINQEFNNVVKYT--- 201
Query: 60 ATSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIH 119
+ + A+ Y PFG V R + A F+I+P LE L
Sbjct: 202 ----DTLDRVAGFDAEQYISPFGIV-----RFALAPKLPANTMFVINPAYLEMPTLIPFT 252
Query: 120 EDKNIAKNGDANKGVLIGEGALKVRNEKA 148
+ +G + L + +K+ N A
Sbjct: 253 A-RPQNPSGSKQEVYLETQAGVKLLNSYA 280
>gi|257468183|ref|ZP_05632279.1| hypothetical protein FulcA4_02527 [Fusobacterium ulcerans ATCC
49185]
gi|317062468|ref|ZP_07926953.1| predicted protein [Fusobacterium ulcerans ATCC 49185]
gi|313688144|gb|EFS24979.1| predicted protein [Fusobacterium ulcerans ATCC 49185]
Length = 370
Score = 66.7 bits (161), Expect = 1e-09, Method: Composition-based stats.
Identities = 28/155 (18%), Positives = 55/155 (35%), Gaps = 18/155 (11%)
Query: 6 KDGTQRTFSKQILDEVMQEGYQNGDNFRH----IIVSPYVKSEFVRFMSDSNVASFRYAT 61
D S + L+ ++++ Y G + ++ + P +S + + V +T
Sbjct: 227 VDANNEKISIEKLEALLKKAYLVGGDLQNGYYAFYIPPKQQSNLNKEIEKYIVKEQTSST 286
Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTI-HE 120
G +G V + + N L++ E L
Sbjct: 287 LGAVVKEVY--------TSYGIVPLIMTPNLPDN-----EMLLVNHENCEIRPLINRDLF 333
Query: 121 DKNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155
+ + K GD KG+++GE L+VR G + +L
Sbjct: 334 HEYMGKTGDNTKGLILGELTLEVRGVHEQGKIVNL 368
>gi|194015203|ref|ZP_03053819.1| phage protein [Bacillus pumilus ATCC 7061]
gi|194012607|gb|EDW22173.1| phage protein [Bacillus pumilus ATCC 7061]
Length = 367
Score = 60.9 bits (146), Expect = 6e-08, Method: Composition-based stats.
Identities = 24/150 (16%), Positives = 52/150 (34%), Gaps = 18/150 (12%)
Query: 16 QILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATAD 75
+ ++ +M++ Y G + I+ + + + +D+ A +G
Sbjct: 230 KDINALMEQVYLRGGSVNTILTNTAGARQISKMATDTIRTERTDAATGHR--------IS 281
Query: 76 IYDRPF--GKVMVHPNRVMASNAETARNAFLIDPNMLEFLWL--RTIHEDKNIAKNGDAN 131
+ G V + L D ++L L R++++ D
Sbjct: 282 TFVSDMVGGGVATV----IVDPNFPKDKIALFDRSILSLHPLTGRSVYDTDASVPGADFV 337
Query: 132 KGVLIGEGALKVRNEKAVGVVADLFGLSKT 161
+ GE +KV+N A +A L ++ T
Sbjct: 338 ARQIRGEYGIKVKN--ANEKIAILENIATT 365
>gi|294614769|ref|ZP_06694669.1| hypothetical protein EfmE1636_0859 [Enterococcus faecium E1636]
gi|291592381|gb|EFF23990.1| hypothetical protein EfmE1636_0859 [Enterococcus faecium E1636]
Length = 296
Score = 60.5 bits (145), Expect = 7e-08, Method: Composition-based stats.
Identities = 25/154 (16%), Positives = 53/154 (34%), Gaps = 13/154 (8%)
Query: 3 KKAKDGTQRTFSKQILDEVMQEGYQNG-DNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61
KA+ T +++ +D + + Y+ G + ++SP ++ +
Sbjct: 152 NKAETATAGAVTRKDIDTLFKLMYEKGYMGEKLCLISPDMQELMTDELDG-----KSTKI 206
Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121
T +G + + ID N ++ LR
Sbjct: 207 VQFGERVTFGLQIGNIVSNYGTGIALLEPSLPKGT-----IAAIDTNYVKLRPLREW-RA 260
Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155
+ +AK D+ + L+GE +L+ N G + +L
Sbjct: 261 EELAKTTDSRRIGLVGEYSLEY-NASNSGAILNL 293
>gi|257893408|ref|ZP_05673061.1| conserved hypothetical protein [Enterococcus faecium 1,231,408]
gi|257829787|gb|EEV56394.1| conserved hypothetical protein [Enterococcus faecium 1,231,408]
Length = 296
Score = 60.5 bits (145), Expect = 7e-08, Method: Composition-based stats.
Identities = 25/154 (16%), Positives = 53/154 (34%), Gaps = 13/154 (8%)
Query: 3 KKAKDGTQRTFSKQILDEVMQEGYQNG-DNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61
KA+ T +++ +D + + Y+ G + ++SP ++ +
Sbjct: 152 NKAETATAGAVTRKDIDTLFKLMYEKGYMGEKLCLISPDMQELMTDELDG-----KSTKI 206
Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121
T +G + + ID N ++ LR
Sbjct: 207 VQFGERVTFGLQIGNIVSNYGTGIALLEPSLPKGT-----IAAIDTNYVKLRPLREW-RA 260
Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155
+ +AK D+ + L+GE +L+ N G + +L
Sbjct: 261 EELAKTTDSRRIGLVGEYSLEY-NASNSGAILNL 293
>gi|257879565|ref|ZP_05659218.1| conserved hypothetical protein [Enterococcus faecium 1,230,933]
gi|257891545|ref|ZP_05671198.1| conserved hypothetical protein [Enterococcus faecium 1,231,410]
gi|314940388|ref|ZP_07847550.1| conserved hypothetical protein [Enterococcus faecium TX0133a04]
gi|314943205|ref|ZP_07849996.1| conserved hypothetical protein [Enterococcus faecium TX0133C]
gi|314949154|ref|ZP_07852509.1| conserved hypothetical protein [Enterococcus faecium TX0082]
gi|314951966|ref|ZP_07854992.1| conserved hypothetical protein [Enterococcus faecium TX0133A]
gi|314993065|ref|ZP_07858455.1| conserved hypothetical protein [Enterococcus faecium TX0133B]
gi|314995396|ref|ZP_07860499.1| conserved hypothetical protein [Enterococcus faecium TX0133a01]
gi|257813793|gb|EEV42551.1| conserved hypothetical protein [Enterococcus faecium 1,230,933]
gi|257827905|gb|EEV54531.1| conserved hypothetical protein [Enterococcus faecium 1,231,410]
gi|313590399|gb|EFR69244.1| conserved hypothetical protein [Enterococcus faecium TX0133a01]
gi|313592421|gb|EFR71266.1| conserved hypothetical protein [Enterococcus faecium TX0133B]
gi|313595906|gb|EFR74751.1| conserved hypothetical protein [Enterococcus faecium TX0133A]
gi|313598089|gb|EFR76934.1| conserved hypothetical protein [Enterococcus faecium TX0133C]
gi|313640428|gb|EFS05008.1| conserved hypothetical protein [Enterococcus faecium TX0133a04]
gi|313644467|gb|EFS09047.1| conserved hypothetical protein [Enterococcus faecium TX0082]
Length = 296
Score = 60.5 bits (145), Expect = 7e-08, Method: Composition-based stats.
Identities = 25/154 (16%), Positives = 53/154 (34%), Gaps = 13/154 (8%)
Query: 3 KKAKDGTQRTFSKQILDEVMQEGYQNG-DNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61
KA+ T +++ +D + + Y+ G + ++SP ++ +
Sbjct: 152 NKAETATAGAVTRKDIDGLFKLMYEKGYMGEKLCLISPDMQELMTDELDG-----KSTKI 206
Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121
T +G + + ID N ++ LR
Sbjct: 207 VQFGERVTFGLQIGNIVSNYGTGIALLEPSLPKGT-----IAAIDTNYVKLRPLREW-RA 260
Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155
+ +AK D+ + L+GE +L+ N G + +L
Sbjct: 261 EELAKTTDSRRIGLVGEYSLEY-NASNSGAILNL 293
>gi|257883499|ref|ZP_05663152.1| conserved hypothetical protein [Enterococcus faecium 1,231,502]
gi|261208026|ref|ZP_05922703.1| conserved hypothetical protein [Enterococcus faecium TC 6]
gi|289567093|ref|ZP_06447488.1| conserved hypothetical protein [Enterococcus faecium D344SRF]
gi|294622496|ref|ZP_06701518.1| conserved hypothetical protein [Enterococcus faecium U0317]
gi|257819157|gb|EEV46485.1| conserved hypothetical protein [Enterococcus faecium 1,231,502]
gi|260077743|gb|EEW65457.1| conserved hypothetical protein [Enterococcus faecium TC 6]
gi|289161108|gb|EFD09013.1| conserved hypothetical protein [Enterococcus faecium D344SRF]
gi|291598043|gb|EFF29153.1| conserved hypothetical protein [Enterococcus faecium U0317]
Length = 296
Score = 60.5 bits (145), Expect = 7e-08, Method: Composition-based stats.
Identities = 25/154 (16%), Positives = 53/154 (34%), Gaps = 13/154 (8%)
Query: 3 KKAKDGTQRTFSKQILDEVMQEGYQNG-DNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61
KA+ T +++ +D + + Y+ G + ++SP ++ +
Sbjct: 152 NKAETATAGAVTRKDIDTLFKLMYEKGYMGEKLCLISPDMQELMTDELDG-----KSTKI 206
Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121
T +G + + ID N ++ LR
Sbjct: 207 VQFGERVTFGLQIGNIVSNYGTGIALLEPSLPKGT-----IAAIDTNYVKLRPLREW-RA 260
Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155
+ +AK D+ + L+GE +L+ N G + +L
Sbjct: 261 EELAKTTDSRRIGLVGEYSLEY-NASNSGAILNL 293
>gi|239828160|ref|YP_002950784.1| hypothetical protein GWCH70_2835 [Geobacillus sp. WCH70]
gi|239808453|gb|ACS25518.1| conserved hypothetical protein [Geobacillus sp. WCH70]
Length = 283
Score = 60.5 bits (145), Expect = 8e-08, Method: Composition-based stats.
Identities = 22/137 (16%), Positives = 46/137 (33%), Gaps = 17/137 (12%)
Query: 13 FSKQILDEVMQEGYQNGDNFRHI-IVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIV 71
+ +E + + Y G + + V +K + + + + + +
Sbjct: 155 LTADAFEEAIGKLYDAGVSDEILVFVPARIKKQLNKLGNVEF----------YARDKFLG 204
Query: 72 ATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDAN 131
+ Y FG V + +N F ++PN +E L H A +G
Sbjct: 205 FDMEQYITVFGTVNFILCEELGNN-----KFFALNPNYIEMPVLIPFHAQVE-AVSGSKK 258
Query: 132 KGVLIGEGALKVRNEKA 148
L + + + N+KA
Sbjct: 259 SIYLETQAGVNLLNDKA 275
>gi|308172834|ref|YP_003919539.1| phage protein [Bacillus amyloliquefaciens DSM 7]
gi|307605698|emb|CBI42069.1| phage protein [Bacillus amyloliquefaciens DSM 7]
Length = 367
Score = 60.5 bits (145), Expect = 8e-08, Method: Composition-based stats.
Identities = 25/148 (16%), Positives = 53/148 (35%), Gaps = 14/148 (9%)
Query: 16 QILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATAD 75
+ ++ +M++ YQ G + I+ + + + +D+ A +G
Sbjct: 230 KDINALMEKVYQRGGSANTILTNTAGARQISKMATDTIRTERTDAATGHR--------VS 281
Query: 76 IYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWL--RTIHEDKNIAKNGDANKG 133
+ V+ L D ++L L R+++++ D
Sbjct: 282 TFVSDMVGGG--VATVVVDPNFPKDKIALFDRSILSLHPLNGRSVYDEDATVPGADFVAR 339
Query: 134 VLIGEGALKVRNEKAVGVVADLFGLSKT 161
+ GE +KV+N A +A L +S T
Sbjct: 340 QIRGEYGIKVKN--ANEKIAILENISTT 365
>gi|319956911|ref|YP_004168174.1| hypothetical protein Nitsa_1172 [Nitratifractor salsuginis DSM
16511]
gi|319419315|gb|ADV46425.1| hypothetical protein Nitsa_1172 [Nitratifractor salsuginis DSM
16511]
Length = 308
Score = 60.1 bits (144), Expect = 1e-07, Method: Composition-based stats.
Identities = 22/147 (14%), Positives = 44/147 (29%), Gaps = 12/147 (8%)
Query: 7 DGTQRTFSKQILDEVMQEGYQNGD---NFRHIIVSPYVKSEFVRFMSDSNVASFRYATSG 63
+G + L E+++ ++ G +I VK RF D
Sbjct: 165 NGNPTVLTYDSLSEIIEPVWERGGIEDESFMLICGTSVKRAINRFAGDQFFRKV------ 218
Query: 64 KSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKN 123
S T + FG V V +R+ + + ++ + +
Sbjct: 219 -SGKEKFDPTLFELETDFGTVQVKIHRLFNQEKLRD-KILVGKLSEARIMFQTSTEFTE- 275
Query: 124 IAKNGDANKGVLIGEGALKVRNEKAVG 150
+ + A G + L+V+N
Sbjct: 276 VPTDKTAKFGRYYTDLTLEVKNPDYFA 302
>gi|291561307|emb|CBL40106.1| hypothetical protein CK3_02480 [butyrate-producing bacterium SS3/4]
Length = 338
Score = 57.8 bits (138), Expect = 4e-07, Method: Composition-based stats.
Identities = 18/142 (12%), Positives = 38/142 (26%), Gaps = 9/142 (6%)
Query: 17 ILDEVMQEGYQNGDNFR--HIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATA 74
+ +++++ Y + V + A T +
Sbjct: 179 DIADMVKKIYGANAPTDGLVLWCDA------VTLFQVNADAVQNGLTVVPAAREINGIAL 232
Query: 75 DIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGDANKGV 134
P G V ++ + + N ++ P + + +AK G K
Sbjct: 233 SSVVTPIGVVYLYLGECLPAGTAMLLNLDVLAPVYQPVPG-KGNFFLEPLAKTGAGEKYQ 291
Query: 135 LIGEGALKVRNEKAVGVVADLF 156
L G+ L E G +
Sbjct: 292 LFGQIGLDHGPEWYHGKFTGIS 313
>gi|329568771|gb|EGG50571.1| hypothetical protein HMPREF9520_03403 [Enterococcus faecalis
TX1467]
Length = 295
Score = 56.3 bits (134), Expect = 1e-06, Method: Composition-based stats.
Identities = 21/154 (13%), Positives = 50/154 (32%), Gaps = 11/154 (7%)
Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61
T K + +++ +D + + +Q G + + E + +
Sbjct: 151 TNKVETAAAGAVTRKDIDALFKTMFQKGYMGEKLCLVAPDMQELMT----DQLDEKSTKI 206
Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121
T +G + + + ID N ++ LR
Sbjct: 207 VQFGDKLTFGLQLGNIVSNYGSGIALIEPNLPNGT-----IAAIDTNYVKLRPLREW-RA 260
Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155
+ +AK D+ + L+GE +++ + G + +L
Sbjct: 261 EELAKTTDSRRIGLVGEYSIEYKASN-SGAILNL 293
>gi|227517040|ref|ZP_03947089.1| conserved hypothetical protein [Enterococcus faecalis TX0104]
gi|229545405|ref|ZP_04434130.1| conserved hypothetical protein [Enterococcus faecalis TX1322]
gi|229549652|ref|ZP_04438377.1| conserved hypothetical protein [Enterococcus faecalis ATCC 29200]
gi|255972349|ref|ZP_05422935.1| predicted protein [Enterococcus faecalis T1]
gi|256619486|ref|ZP_05476332.1| conserved hypothetical protein [Enterococcus faecalis ATCC 4200]
gi|257090289|ref|ZP_05584650.1| predicted protein [Enterococcus faecalis CH188]
gi|300860939|ref|ZP_07107026.1| conserved hypothetical protein [Enterococcus faecalis TUSoD Ef11]
gi|307275949|ref|ZP_07557082.1| hypothetical protein HMPREF9521_01574 [Enterococcus faecalis
TX2134]
gi|307295873|ref|ZP_07575705.1| hypothetical protein HMPREF9509_02949 [Enterococcus faecalis
TX0411]
gi|312900152|ref|ZP_07759467.1| conserved hypothetical protein [Enterococcus faecalis TX0470]
gi|312902789|ref|ZP_07761993.1| conserved hypothetical protein [Enterococcus faecalis TX0635]
gi|227075515|gb|EEI13478.1| conserved hypothetical protein [Enterococcus faecalis TX0104]
gi|229305317|gb|EEN71313.1| conserved hypothetical protein [Enterococcus faecalis ATCC 29200]
gi|229309512|gb|EEN75499.1| conserved hypothetical protein [Enterococcus faecalis TX1322]
gi|255963367|gb|EET95843.1| predicted protein [Enterococcus faecalis T1]
gi|256599013|gb|EEU18189.1| conserved hypothetical protein [Enterococcus faecalis ATCC 4200]
gi|256999101|gb|EEU85621.1| predicted protein [Enterococcus faecalis CH188]
gi|295113266|emb|CBL31903.1| hypothetical protein [Enterococcus sp. 7L76]
gi|300849978|gb|EFK77728.1| conserved hypothetical protein [Enterococcus faecalis TUSoD Ef11]
gi|306496204|gb|EFM65783.1| hypothetical protein HMPREF9509_02949 [Enterococcus faecalis
TX0411]
gi|306507279|gb|EFM76416.1| hypothetical protein HMPREF9521_01574 [Enterococcus faecalis
TX2134]
gi|310633843|gb|EFQ17126.1| conserved hypothetical protein [Enterococcus faecalis TX0635]
gi|311292711|gb|EFQ71267.1| conserved hypothetical protein [Enterococcus faecalis TX0470]
gi|315149052|gb|EFT93068.1| conserved hypothetical protein [Enterococcus faecalis TX4244]
gi|315159923|gb|EFU03940.1| conserved hypothetical protein [Enterococcus faecalis TX0312]
gi|315167465|gb|EFU11482.1| conserved hypothetical protein [Enterococcus faecalis TX1341]
gi|315169417|gb|EFU13434.1| conserved hypothetical protein [Enterococcus faecalis TX1342]
gi|315575405|gb|EFU87596.1| conserved hypothetical protein [Enterococcus faecalis TX0309B]
gi|315576720|gb|EFU88911.1| conserved hypothetical protein [Enterococcus faecalis TX0630]
gi|315582750|gb|EFU94941.1| conserved hypothetical protein [Enterococcus faecalis TX0309A]
gi|323481145|gb|ADX80584.1| hypothetical protein EF62_2373 [Enterococcus faecalis 62]
Length = 295
Score = 56.3 bits (134), Expect = 1e-06, Method: Composition-based stats.
Identities = 21/154 (13%), Positives = 50/154 (32%), Gaps = 11/154 (7%)
Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61
T K + +++ +D + + +Q G + + E + +
Sbjct: 151 TNKVETAAAGAVTRKDIDALFKTMFQKGYMGEKLCLVAPDMQELMT----DQLDEKSTKI 206
Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121
T +G + + + ID N ++ LR
Sbjct: 207 VQFGDKLTFGLQLGNIVSNYGSGIALIEPNLPNGT-----IAAIDTNYVKLRPLREW-RA 260
Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155
+ +AK D+ + L+GE +++ + G + +L
Sbjct: 261 EELAKTTDSRRIGLVGEYSIEYKASN-SGAILNL 293
>gi|29376526|ref|NP_815680.1| hypothetical protein EF2011 [Enterococcus faecalis V583]
gi|227555439|ref|ZP_03985486.1| conserved hypothetical protein [Enterococcus faecalis HH22]
gi|29343990|gb|AAO81750.1| hypothetical protein EF_2011 [Enterococcus faecalis V583]
gi|227175420|gb|EEI56392.1| conserved hypothetical protein [Enterococcus faecalis HH22]
Length = 295
Score = 55.5 bits (132), Expect = 2e-06, Method: Composition-based stats.
Identities = 21/154 (13%), Positives = 50/154 (32%), Gaps = 11/154 (7%)
Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61
T K + +++ +D + + +Q G + + E + +
Sbjct: 151 TNKVETAAAGAVTRKDIDALFKTMFQKGYMGEKLCLVAPDMQELM----IDQLDEKSTKI 206
Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121
T +G + + + ID N ++ LR
Sbjct: 207 VQFGDKLTFGLQLGNIVSNYGSGIALIEPNLPNGT-----IAAIDTNYVKLRPLREW-RA 260
Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155
+ +AK D+ + L+GE +++ + G + +L
Sbjct: 261 EELAKTTDSRRIGLVGEYSIEYKASN-SGAILNL 293
>gi|256964718|ref|ZP_05568889.1| conserved hypothetical protein [Enterococcus faecalis HIP11704]
gi|307272797|ref|ZP_07554044.1| hypothetical protein HMPREF9514_01561 [Enterococcus faecalis
TX0855]
gi|256955214|gb|EEU71846.1| conserved hypothetical protein [Enterococcus faecalis HIP11704]
gi|306510411|gb|EFM79434.1| hypothetical protein HMPREF9514_01561 [Enterococcus faecalis
TX0855]
Length = 264
Score = 55.5 bits (132), Expect = 2e-06, Method: Composition-based stats.
Identities = 21/154 (13%), Positives = 50/154 (32%), Gaps = 11/154 (7%)
Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61
T K + +++ +D + + +Q G + + E + +
Sbjct: 120 TNKVETAAAGAVTRKDIDALFKTMFQKGYMGEKLCLVAPDMQELMT----DQLDEKSTKI 175
Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121
T +G + + + ID N ++ LR
Sbjct: 176 VQFGDKLTFGLQLGNIVSNYGSGIALIEPNLPNGT-----IAAIDTNYVKLRPLREW-RA 229
Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155
+ +AK D+ + L+GE +++ + G + +L
Sbjct: 230 EELAKTTDSRRIGLVGEYSIEYKASN-SGAILNL 262
>gi|307280635|ref|ZP_07561683.1| hypothetical protein HMPREF9515_01677 [Enterococcus faecalis
TX0860]
gi|306504001|gb|EFM73218.1| hypothetical protein HMPREF9515_01677 [Enterococcus faecalis
TX0860]
Length = 300
Score = 54.7 bits (130), Expect = 4e-06, Method: Composition-based stats.
Identities = 19/150 (12%), Positives = 45/150 (30%), Gaps = 11/150 (7%)
Query: 10 QRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNT 69
+++ +D++ + + G + + + + V
Sbjct: 159 ADAVTRKDVDKMFKTMFDKGYAGEKLCLVSTDMVDLMT----DEVDKAGTKVFNFGDQVA 214
Query: 70 IVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGD 129
+G + S +D N +E LR + +AK D
Sbjct: 215 FGLQLGKIVSNYGSGTALIEPSLPSGT-----MIALDTNYVELRPLREW-RAEELAKTTD 268
Query: 130 ANKGVLIGEGALKVRNEKAVGVVADLFGLS 159
+ + L+GE ++ N G + +L +
Sbjct: 269 SKRIGLVGEYTIEY-NASNSGAILNLATAA 297
>gi|315173098|gb|EFU17115.1| conserved hypothetical protein [Enterococcus faecalis TX1346]
Length = 300
Score = 54.7 bits (130), Expect = 5e-06, Method: Composition-based stats.
Identities = 19/150 (12%), Positives = 45/150 (30%), Gaps = 11/150 (7%)
Query: 10 QRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNT 69
+++ +D++ + + G + + + + V
Sbjct: 159 ADAVTRKDVDKMFKTMFDKGYAGEKLCLVSTDMVDLMT----DEVDKAGTKVFNFGDRVA 214
Query: 70 IVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGD 129
+G + S +D N +E LR + +AK D
Sbjct: 215 FGLQLGKIVSNYGSGTALIEPSLPSGT-----MIALDTNYVELRPLREW-RAEELAKTTD 268
Query: 130 ANKGVLIGEGALKVRNEKAVGVVADLFGLS 159
+ + L+GE ++ N G + +L +
Sbjct: 269 SKRIGLVGEYTIEY-NASNSGAILNLATAA 297
>gi|307286482|ref|ZP_07566582.1| hypothetical protein HMPREF9505_00059 [Enterococcus faecalis
TX0109]
gi|306502395|gb|EFM71671.1| hypothetical protein HMPREF9505_00059 [Enterococcus faecalis
TX0109]
Length = 300
Score = 53.9 bits (128), Expect = 7e-06, Method: Composition-based stats.
Identities = 19/150 (12%), Positives = 45/150 (30%), Gaps = 11/150 (7%)
Query: 10 QRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNT 69
+++ +D++ + + G + + + + V
Sbjct: 159 ADAVTRKDVDKMFKTMFDKGYAGEKLCLVSTDMVDLMT----DEVDKAGTKVFNFGDQVA 214
Query: 70 IVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGD 129
+G + S +D N +E LR + +AK D
Sbjct: 215 FGLQLGKIVSNYGSGTALIEPSLPSGT-----MIALDTNYVELRPLREW-RAEELAKTTD 268
Query: 130 ANKGVLIGEGALKVRNEKAVGVVADLFGLS 159
+ + L+GE ++ N G + +L +
Sbjct: 269 SKRIGLVGEYTIEY-NASNSGAILNLATAA 297
>gi|315028531|gb|EFT40463.1| conserved hypothetical protein [Enterococcus faecalis TX4000]
Length = 295
Score = 53.2 bits (126), Expect = 1e-05, Method: Composition-based stats.
Identities = 21/154 (13%), Positives = 49/154 (31%), Gaps = 11/154 (7%)
Query: 2 TKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYAT 61
T K + +++ +D + + Q G + + E + +
Sbjct: 151 TNKVETAAAGAVTRKDIDALFKTMLQKGYMGEKLCLVAPDMQELMT----DQLDEKSTKI 206
Query: 62 SGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHED 121
T +G + + + ID N ++ LR
Sbjct: 207 VQFGDKLTFGLQLGNIVSNYGSGIALIEPNLPNGT-----IAAIDTNYVKLRPLREW-RA 260
Query: 122 KNIAKNGDANKGVLIGEGALKVRNEKAVGVVADL 155
+ +AK D+ + L+GE +++ + G + +L
Sbjct: 261 EELAKTTDSRRIGLVGEYSIEYKASN-SGAILNL 293
>gi|307270079|ref|ZP_07551399.1| hypothetical protein HMPREF9498_02197 [Enterococcus faecalis
TX4248]
gi|306513574|gb|EFM82186.1| hypothetical protein HMPREF9498_02197 [Enterococcus faecalis
TX4248]
Length = 300
Score = 53.2 bits (126), Expect = 1e-05, Method: Composition-based stats.
Identities = 19/150 (12%), Positives = 46/150 (30%), Gaps = 11/150 (7%)
Query: 10 QRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNT 69
+++ +D++ + + G + + + + V
Sbjct: 159 ADAVTRKDVDKMFKTMFDKGYAGEKLCLVSTDMVDLMT----DEVDKAGTKVFNFGDQVA 214
Query: 70 IVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGD 129
+G + S +D N +E LR + +AK D
Sbjct: 215 FGLQLGKIVSNYGSGTALIEPSLPSGT-----MIALDTNYVELRPLREW-RAEELAKTTD 268
Query: 130 ANKGVLIGEGALKVRNEKAVGVVADLFGLS 159
+ + L+GE +++ N G + +L +
Sbjct: 269 SKRIGLVGEYSIEY-NASNSGAILNLATAA 297
>gi|256956794|ref|ZP_05560965.1| conserved hypothetical protein [Enterococcus faecalis DS5]
gi|256947290|gb|EEU63922.1| conserved hypothetical protein [Enterococcus faecalis DS5]
gi|295113775|emb|CBL32412.1| hypothetical protein [Enterococcus sp. 7L76]
gi|315035894|gb|EFT47826.1| conserved hypothetical protein [Enterococcus faecalis TX0027]
Length = 300
Score = 52.8 bits (125), Expect = 1e-05, Method: Composition-based stats.
Identities = 19/150 (12%), Positives = 46/150 (30%), Gaps = 11/150 (7%)
Query: 10 QRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNT 69
+++ +D++ + + G + + + + V
Sbjct: 159 ADAVTRKDVDKMFKTMFDKGYAGEKLCLVSTDMVDLMT----DEVDKAGTKVFNFGDQVA 214
Query: 70 IVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGD 129
+G + S +D N +E LR + +AK D
Sbjct: 215 FGLQLGKIVSNYGSGTALIEPSLPSGT-----MIALDTNYVELRPLREW-RAEELAKTTD 268
Query: 130 ANKGVLIGEGALKVRNEKAVGVVADLFGLS 159
+ + L+GE +++ N G + +L +
Sbjct: 269 SKRIGLVGEYSIEY-NASNSGAILNLATAA 297
>gi|257079386|ref|ZP_05573747.1| predicted protein [Enterococcus faecalis JH1]
gi|256987416|gb|EEU74718.1| predicted protein [Enterococcus faecalis JH1]
Length = 300
Score = 52.4 bits (124), Expect = 2e-05, Method: Composition-based stats.
Identities = 18/146 (12%), Positives = 44/146 (30%), Gaps = 11/146 (7%)
Query: 10 QRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNT 69
+++ +D++ + + G + + + + V
Sbjct: 159 ADAVTRKDVDKMFKTMFDKGYAGEKLCLVSTDMVDLMT----DEVDKAGTKVFNFGDQVA 214
Query: 70 IVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGD 129
+G + S +D N +E +R + +AK D
Sbjct: 215 FGLQLGKIVSNYGSGTALIEPSLPSGT-----MIALDTNYVELRPVREW-RAEELAKTTD 268
Query: 130 ANKGVLIGEGALKVRNEKAVGVVADL 155
+ + L+GE ++ N G + +L
Sbjct: 269 SKRIGLVGEYTIEY-NASNSGAILNL 293
>gi|315144740|gb|EFT88756.1| conserved hypothetical protein [Enterococcus faecalis TX2141]
Length = 300
Score = 51.6 bits (122), Expect = 3e-05, Method: Composition-based stats.
Identities = 18/146 (12%), Positives = 44/146 (30%), Gaps = 11/146 (7%)
Query: 10 QRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNT 69
+++ +D++ + + G + + + + V
Sbjct: 159 ADAVTRKDVDKMFKTMFDKGYAGEKLCLVSTDMVDLMT----DEVDKAGTKVFNFGDQVA 214
Query: 70 IVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHEDKNIAKNGD 129
+G + S +D N +E +R + +AK D
Sbjct: 215 FGLQLGKIVSNYGSGTALIEPSLPSGT-----MIALDTNYVELRPVREW-RAEELAKTTD 268
Query: 130 ANKGVLIGEGALKVRNEKAVGVVADL 155
+ + L+GE ++ N G + +L
Sbjct: 269 SKRIGLVGEYTIEY-NASNSGAILNL 293
>gi|262067743|ref|ZP_06027355.1| hypothetical protein FUSPEROL_02025 [Fusobacterium periodonticum
ATCC 33693]
gi|291378467|gb|EFE85985.1| hypothetical protein FUSPEROL_02025 [Fusobacterium periodonticum
ATCC 33693]
Length = 379
Score = 44.7 bits (104), Expect = 0.004, Method: Composition-based stats.
Identities = 26/147 (17%), Positives = 46/147 (31%), Gaps = 22/147 (14%)
Query: 7 DGTQRTFSKQILDEVMQEGYQNG--------DNFRHIIVSPYVKSEFVRFMSDSNVASFR 58
D + F+ +LD ++ G N + V + + D+ +
Sbjct: 234 DANNQPFTLDLLDNAVKAIVDKGNPGSANLKANKYFLCVPYLILRTINKLNKDNVRS--- 290
Query: 59 YATSGKSHNNTIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTI 118
+ T + G V V + + LI N + L I
Sbjct: 291 -----NITDKVTGTTIEQIVTTSGTVSV-----FPATSLAPNEFLLIHLNDVSLRQLYPI 340
Query: 119 HEDKNIAKNGDANKGVLIGEGALKVRN 145
E + AK A+ L GE A +++N
Sbjct: 341 KE-EEGAKTALADNYFLHGEYAHQIKN 366
>gi|328725488|ref|XP_003248499.1| PREDICTED: probable NADP-dependent mannitol dehydrogenase-like,
partial [Acyrthosiphon pisum]
Length = 462
Score = 38.5 bits (88), Expect = 0.34, Method: Composition-based stats.
Identities = 10/68 (14%), Positives = 24/68 (35%)
Query: 20 EVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATADIYDR 79
E+++ + G + I + ++ + S ++I AT D
Sbjct: 260 EIVRGLAEAGADVAIIYATSDNAPAIAAEVASQTNVKVQAYQSDVRRRDSIAATVDQIVS 319
Query: 80 PFGKVMVH 87
FG++ +
Sbjct: 320 DFGRIDIM 327
>gi|320535449|ref|ZP_08035557.1| hypothetical protein HMPREF9554_00276 [Treponema phagedenis F0421]
gi|320147702|gb|EFW39210.1| hypothetical protein HMPREF9554_00276 [Treponema phagedenis F0421]
Length = 460
Score = 36.2 bits (82), Expect = 1.7, Method: Composition-based stats.
Identities = 10/69 (14%), Positives = 25/69 (36%), Gaps = 7/69 (10%)
Query: 15 KQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRYATSGKSHNNTIVATA 74
++I+ + + ++ G + + P + + + D G S + +
Sbjct: 192 ERIITDPVGMIFEAGGDANKLFFPPILAQDIQDLIRDRIRF-------GTSGSGAMNLVV 244
Query: 75 DIYDRPFGK 83
D Y P+G
Sbjct: 245 DQYPTPYGS 253
>gi|281358094|ref|ZP_06244578.1| hypothetical protein Vvad_PD0658 [Victivallis vadensis ATCC
BAA-548]
gi|281315467|gb|EFA99496.1| hypothetical protein Vvad_PD0658 [Victivallis vadensis ATCC
BAA-548]
Length = 84
Score = 35.5 bits (80), Expect = 2.6, Method: Composition-based stats.
Identities = 14/58 (24%), Positives = 21/58 (36%), Gaps = 2/58 (3%)
Query: 92 MASNAETARNAFLIDPNMLEFLWLRTI--HEDKNIAKNGDANKGVLIGEGALKVRNEK 147
MA +A++IDP L+ + K D K + +GE RN
Sbjct: 4 MADPDMPDTDAWVIDPAGFGMSSLKGRSITDTDATPKGFDGIKRMALGELTFVFRNAA 61
>gi|152990358|ref|YP_001356080.1| flagellar basal-body rod protein FlgF [Nitratiruptor sp. SB155-2]
gi|151422219|dbj|BAF69723.1| flagellar basal-body rod protein FlgF [Nitratiruptor sp. SB155-2]
Length = 238
Score = 34.7 bits (78), Expect = 4.4, Method: Composition-based stats.
Identities = 11/59 (18%), Positives = 22/59 (37%)
Query: 1 MTKKAKDGTQRTFSKQILDEVMQEGYQNGDNFRHIIVSPYVKSEFVRFMSDSNVASFRY 59
+T + F + ++ E+ Q +NG + H+ V P K + F +
Sbjct: 25 VTNNLANVNTTGFKEMVVKEMSQRLDENGGDSNHLFVFPRFKESILNFAQGPLKHTQNR 83
>gi|117619346|ref|YP_856478.1| methionine gamma-lyase [Aeromonas hydrophila subsp. hydrophila ATCC
7966]
gi|117560753|gb|ABK37701.1| methionine gamma-lyase [Aeromonas hydrophila subsp. hydrophila ATCC
7966]
Length = 387
Score = 33.5 bits (75), Expect = 9.7, Method: Composition-based stats.
Identities = 25/142 (17%), Positives = 39/142 (27%), Gaps = 20/142 (14%)
Query: 16 QILDEVMQEGYQNGDNFRHIIVSP-------YVKSEFVRFMSDSNVASFRYATSGKSHNN 68
+ +M +G + P ++K F R
Sbjct: 84 AAVSAIMYALADSG--DEVAYIGPLYGGTDAFLKQTFSRAGIKVTAYESDQDLLTNIRPA 141
Query: 69 TIVATADIYDRPFGKVMVHPNRVMASNAETARNAFLIDPNMLEFLWLRTIHE-------- 120
T V + P + V RV+ A R + D L LR +
Sbjct: 142 TKVVLFETLTNP--TLKVVDPRVVVEAARKVRAITVCDNTFLTPYLLRPLELGIDIVMHS 199
Query: 121 -DKNIAKNGDANKGVLIGEGAL 141
K + +GD GV+ G L
Sbjct: 200 GTKYLGGHGDIIAGVVAGSHTL 221
Database: nr
Posted date: May 13, 2011 4:10 AM
Number of letters in database: 999,999,932
Number of sequences in database: 2,987,209
Database: /data/usr2/db/fasta/nr.01
Posted date: May 13, 2011 4:17 AM
Number of letters in database: 999,998,956
Number of sequences in database: 2,896,973
Database: /data/usr2/db/fasta/nr.02
Posted date: May 13, 2011 4:23 AM
Number of letters in database: 999,999,979
Number of sequences in database: 2,907,862
Database: /data/usr2/db/fasta/nr.03
Posted date: May 13, 2011 4:29 AM
Number of letters in database: 999,999,513
Number of sequences in database: 2,932,190
Database: /data/usr2/db/fasta/nr.04
Posted date: May 13, 2011 4:33 AM
Number of letters in database: 792,586,372
Number of sequences in database: 2,260,650
Lambda K H
0.313 0.131 0.320
Lambda K H
0.267 0.0397 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,027,113,406
Number of Sequences: 13984884
Number of extensions: 24518826
Number of successful extensions: 86389
Number of sequences better than 10.0: 98
Number of HSP's better than 10.0 without gapping: 69
Number of HSP's successfully gapped in prelim test: 29
Number of HSP's that attempted gapping in prelim test: 86186
Number of HSP's gapped (non-prelim): 104
length of query: 162
length of database: 4,792,584,752
effective HSP length: 123
effective length of query: 39
effective length of database: 3,072,444,020
effective search space: 119825316780
effective search space used: 119825316780
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.4 bits)
S2: 75 (33.5 bits)