BLASTP 2.2.22 [Sep-27-2009]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= 537021.9.peg.1074_1
(169 letters)
Database: nr
13,984,884 sequences; 4,792,584,752 total letters
Searching..................................................done
>gi|148257053|ref|YP_001241638.1| putative phage major head protein [Bradyrhizobium sp. BTAi1]
gi|146409226|gb|ABQ37732.1| putative phage Major head protein [Bradyrhizobium sp. BTAi1]
Length = 320
Score = 181 bits (459), Expect = 3e-44, Method: Composition-based stats.
Identities = 65/168 (38%), Positives = 92/168 (54%), Gaps = 5/168 (2%)
Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASP-G 59
MT +TF+T + N+E LSD++ RI P DTP S + K +++ EW LA G
Sbjct: 1 MTTPTSTFVTYQAVGNREDLSDMIYRIDPVDTPFMSGVDKEKATAVNHEWQTQALAPADG 60
Query: 60 PNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALE 119
NAQLEGD+ + R+GN QI K +SGTQ+AVD G + Q++ K LE
Sbjct: 61 TNAQLEGDDPNTNVTTPTVRLGNQCQISYKVARVSGTQQAVDHAGRDNELAYQEMLKGLE 120
Query: 120 IRKDVEFALVSSQG----SEKTSPRKMAALSSWIKKNASRGTGGVLED 163
+++D+E L + T+PRK A++ SWI N S+GT G D
Sbjct: 121 LKRDLETILCGTNQAKVVGNTTTPRKTASILSWIVSNTSKGTAGGAAD 168
>gi|150397033|ref|YP_001327500.1| hypothetical protein Smed_1830 [Sinorhizobium medicae WSM419]
gi|150028548|gb|ABR60665.1| hypothetical protein Smed_1830 [Sinorhizobium medicae WSM419]
Length = 331
Score = 179 bits (455), Expect = 9e-44, Method: Composition-based stats.
Identities = 93/161 (57%), Positives = 118/161 (73%)
Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASPGP 60
M + NT++T+ + N+E LSDVVSRITPEDTPIYS I+KG SIHPEW D+LA+PG
Sbjct: 1 MAALANTYMTTQAVGNREELSDVVSRITPEDTPIYSFIEKGKCVSIHPEWETDELAAPGE 60
Query: 61 NAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEI 120
N + EGDEY+F I PER+GNYTQIMRK WI+SGTQE V + G + K K QKLKK +EI
Sbjct: 61 NIKSEGDEYAFGAITPPERLGNYTQIMRKDWIISGTQEVVSEAGNVQKRKYQKLKKGIEI 120
Query: 121 RKDVEFALVSSQGSEKTSPRKMAALSSWIKKNASRGTGGVL 161
RKDVE+A+V + S + R+ +L++WI+ N SRG GG
Sbjct: 121 RKDVEYAIVDTNASVAGATREFGSLNTWIETNVSRGAGGAN 161
>gi|288817864|ref|YP_003432211.1| putative phage major head protein [Hydrogenobacter thermophilus
TK-6]
gi|288787263|dbj|BAI69010.1| putative phage major head protein [Hydrogenobacter thermophilus
TK-6]
gi|308751463|gb|ADO44946.1| putative phage major head protein [Hydrogenobacter thermophilus
TK-6]
Length = 291
Score = 176 bits (447), Expect = 8e-43, Method: Composition-based stats.
Identities = 58/166 (34%), Positives = 92/166 (55%), Gaps = 5/166 (3%)
Query: 7 TFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASPGPNAQLEG 66
T ++ N+E LSD+++ I+P +TP+YSM K T + + EW+ D LA+PG NA +EG
Sbjct: 2 ALTTYTAVGNREDLSDIITNISPTETPLYSMFGKATAKATYHEWIEDSLAAPGTNAMVEG 61
Query: 67 DEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEF 126
Y T R GNYTQI K + +S TQEAV G + Q K EI +DVE+
Sbjct: 62 ANYPIADPQTRVRKGNYTQIFAKGYGISETQEAVLKAGIKSEIAYQMQKAMKEIARDVEY 121
Query: 127 ALVSSQ---GSEKTSPRKMAALSSWIKKNA--SRGTGGVLEDMILS 167
A++++ T+ R+M + +++ N + G+ L + +L+
Sbjct: 122 AIINNTAAVAGNATTARQMGGIQAFVITNVLANGGSPRALTETLLN 167
>gi|27476049|ref|NP_775251.1| major head protein [Pseudomonas phage PaP3]
gi|27414479|gb|AAL85565.1| major head protein [Pseudomonas phage PaP3]
Length = 317
Score = 175 bits (444), Expect = 2e-42, Method: Composition-based stats.
Identities = 53/166 (31%), Positives = 82/166 (49%), Gaps = 4/166 (2%)
Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASPGP 60
M N T +E L D++ I P DTP S I KG +I EW D+L PG
Sbjct: 1 MATPTNAVSTVEINGKREDLIDIIYNIAPYDTPFMSAIGKGVATAITHEWQTDELRQPGK 60
Query: 61 NAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEI 120
N ++EG++ + K + + NY QI ++ ++GT + V G + Q KK+ E+
Sbjct: 61 NTRVEGEDATIKAGSFTTMLNNYCQISDETLQVTGTADRVKKAGRKNELAYQLAKKSKEL 120
Query: 121 RKDVEFALVSS----QGSEKTSPRKMAALSSWIKKNASRGTGGVLE 162
+ D+E+ALV + T+P +MA + ++ K N S G GV
Sbjct: 121 KLDMEYALVGAPQAKVQRNTTTPGQMANIFAYYKTNGSLGANGVAP 166
>gi|167600435|ref|YP_001671935.1| major head protein [Pseudomonas phage LUZ24]
gi|161168298|emb|CAP45463.1| major head protein [Pseudomonas phage LUZ24]
Length = 317
Score = 174 bits (441), Expect = 4e-42, Method: Composition-based stats.
Identities = 51/167 (30%), Positives = 83/167 (49%), Gaps = 4/167 (2%)
Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASPGP 60
M N T +E L D++ I P DTP + I KG +I EW D+L PG
Sbjct: 1 MATPTNAVSTVEINGKREDLIDIIYNIAPYDTPFMTAIGKGVATAITHEWQTDELRQPGK 60
Query: 61 NAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEI 120
N ++EG++ + K + + NY QI ++ ++GT + V G + Q KK+ E+
Sbjct: 61 NTRVEGEDATIKAGSFTTMLNNYCQISDETLQVTGTADKVKKAGRKNELAYQLAKKSKEL 120
Query: 121 RKDVEFALVSSQGS----EKTSPRKMAALSSWIKKNASRGTGGVLED 163
+ D+E+A+V + + T+P +MA + ++ K N S G G L
Sbjct: 121 KLDMEYAMVGAPQAKIQRNTTTPGQMANIFAYYKTNGSVGANGTLPT 167
>gi|291334638|gb|ADD94286.1| putative phage major head protein [uncultured phage
MedDCM-OCT-S04-C64]
Length = 323
Score = 172 bits (436), Expect = 1e-41, Method: Composition-based stats.
Identities = 51/172 (29%), Positives = 83/172 (48%), Gaps = 3/172 (1%)
Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASPGP 60
M + TF T + +E L+D++ I+P DTP S + + ++ EW D L +
Sbjct: 1 MAVPAQTFTTYGAVGEREDLTDIIYDISPMDTPFLSNASRESATAVFYEWQTDSLDTAAV 60
Query: 61 NAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEI 120
NAQLEGD+ T + R+GNYTQI K ++GT AV G + Q K+ E+
Sbjct: 61 NAQLEGDDGVTSTSSATTRLGNYTQISTKVPRVTGTLRAVATAGRADELAYQISKRGREL 120
Query: 121 RKDVEFALV---SSQGSEKTSPRKMAALSSWIKKNASRGTGGVLEDMILSLA 169
++D+E AL ++ + R +A + +W+ N + + S A
Sbjct: 121 KRDMETALTGTQAASAGGAGTARNLAGIGAWLSTNQVQKGANATTPPVSSGA 172
>gi|291334838|gb|ADD94478.1| putative phage major head protein [uncultured phage
MedDCM-OCT-S06-C1041]
Length = 323
Score = 172 bits (435), Expect = 2e-41, Method: Composition-based stats.
Identities = 51/172 (29%), Positives = 83/172 (48%), Gaps = 3/172 (1%)
Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASPGP 60
M + TF T + +E L+D++ I+P DTP S + + ++ EW D L +
Sbjct: 1 MAVPAQTFTTYGAVGEREDLTDIIYDISPMDTPFLSNASRESATAVFYEWQTDSLDTAAV 60
Query: 61 NAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEI 120
NAQLEGD+ T + R+GNYTQI K ++GT AV G + Q K+ E+
Sbjct: 61 NAQLEGDDGVTSTSSATTRLGNYTQISTKVPRVTGTLRAVATAGRADELAYQISKRGREL 120
Query: 121 RKDVEFALV---SSQGSEKTSPRKMAALSSWIKKNASRGTGGVLEDMILSLA 169
++D+E AL ++ + R +A + +W+ N + + S A
Sbjct: 121 KRDMETALTGTQAASAGGAGTARNLAGIGAWLSTNQVQKGANATTPPVTSGA 172
>gi|163783849|ref|ZP_02178828.1| hypothetical protein HG1285_12862 [Hydrogenivirga sp. 128-5-R1-1]
gi|159880872|gb|EDP74397.1| hypothetical protein HG1285_12862 [Hydrogenivirga sp. 128-5-R1-1]
Length = 291
Score = 171 bits (434), Expect = 3e-41, Method: Composition-based stats.
Identities = 59/172 (34%), Positives = 94/172 (54%), Gaps = 10/172 (5%)
Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASPGP 60
M + T ++ N+E LSD+++ I P +TP+YSM K T S + EW+ DDL PG
Sbjct: 1 MAV-----TTYTAVGNREDLSDLITNIAPTETPLYSMFGKTTAKSTYHEWLEDDLNPPGV 55
Query: 61 NAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEI 120
NA++EG +++ T R GNYTQI K + +S TQE V G + Q K EI
Sbjct: 56 NAKVEGADFTIDTPTNRVRKGNYTQIFSKGYGVSRTQEKVLKAGIKSELAYQMAKAMKEI 115
Query: 121 RKDVEFALVSS---QGSEKTSPRKMAALSSWIKKNA--SRGTGGVLEDMILS 167
+DVE+A++++ T+ R+M + +++ N + GT L + +L+
Sbjct: 116 ARDVEYAIINNTAASAGSATTARQMGGVQAFVSTNVLANAGTPRPLTETLLN 167
>gi|227822441|ref|YP_002826413.1| putative phage major head protein [Sinorhizobium fredii NGR234]
gi|227341442|gb|ACP25660.1| putative phage major head protein [Sinorhizobium fredii NGR234]
Length = 331
Score = 169 bits (428), Expect = 1e-40, Method: Composition-based stats.
Identities = 88/155 (56%), Positives = 113/155 (72%)
Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASPGP 60
M ++ NTF T+ + N+E LSDVVSRITPEDTPIYS+I+KG + HPEW D+LA+PG
Sbjct: 1 MAVLTNTFQTTQAVGNREELSDVVSRITPEDTPIYSLIEKGKCTTYHPEWETDELAAPGA 60
Query: 61 NAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEI 120
N + EG+EY+F I P+R+GNYTQIMRK WI+S TQE + G + K K QKLKK +EI
Sbjct: 61 NVREEGEEYAFGAITPPKRLGNYTQIMRKDWIISATQEVTAEAGNVQKRKYQKLKKGVEI 120
Query: 121 RKDVEFALVSSQGSEKTSPRKMAALSSWIKKNASR 155
RKDVEFA+V + + S R+ +LS+WI NASR
Sbjct: 121 RKDVEFAIVDTNATVAGSTREFGSLSTWIVSNASR 155
>gi|315122533|ref|YP_004063022.1| hypothetical protein CKC_03925 [Candidatus Liberibacter
solanacearum CLso-ZC1]
gi|313495935|gb|ADR52534.1| hypothetical protein CKC_03925 [Candidatus Liberibacter
solanacearum CLso-ZC1]
Length = 331
Score = 167 bits (423), Expect = 5e-40, Method: Composition-based stats.
Identities = 134/160 (83%), Positives = 150/160 (93%)
Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASPGP 60
MT + NTFI++SS+TNKESLSDVVSRITPEDTPIYSMIKKG+T SIHPEWVVDDL+SPGP
Sbjct: 1 MTEITNTFISTSSSTNKESLSDVVSRITPEDTPIYSMIKKGSTRSIHPEWVVDDLSSPGP 60
Query: 61 NAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEI 120
NAQLEGDEYSF++I+TPERMGNYTQIMRKSWILSGTQE++DD G +LKYKEQKLKKALEI
Sbjct: 61 NAQLEGDEYSFESISTPERMGNYTQIMRKSWILSGTQESIDDTGSLLKYKEQKLKKALEI 120
Query: 121 RKDVEFALVSSQGSEKTSPRKMAALSSWIKKNASRGTGGV 160
RKDVEFALVS+Q SEK SPRK+A+LSSWIK N +RGTGG
Sbjct: 121 RKDVEFALVSAQESEKKSPRKLASLSSWIKTNVNRGTGGA 160
>gi|160897389|ref|YP_001562971.1| putative phage major head protein [Delftia acidovorans SPH-1]
gi|160362973|gb|ABX34586.1| putative phage major head protein [Delftia acidovorans SPH-1]
Length = 306
Score = 167 bits (423), Expect = 5e-40, Method: Composition-based stats.
Identities = 61/169 (36%), Positives = 92/169 (54%), Gaps = 2/169 (1%)
Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASPGP 60
M + TF+T+++ N+E L+DV+ RI+P TP +M K + EW DLA+
Sbjct: 1 MAAPSGTFLTTAAIGNREDLTDVIYRISPTQTPTLNMASKAKATNTLHEWQTQDLAAAAS 60
Query: 61 NAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEI 120
NA +EGD+ + KT+ R+ N TQI K+ +SGTQ A++ G + Q +LEI
Sbjct: 61 NAAVEGDDAAAKTVTPTVRLNNRTQISTKTVRVSGTQRAMNPAGRKDELAYQLSLASLEI 120
Query: 121 RKDVEFALVSSQGSEKTSPRKMAALSSWIKKNASRGTGGVLEDMILSLA 169
++D+E L S + TSPRK L W+ N +R GG L D + +
Sbjct: 121 KRDMELDLTQSDVA-ATSPRKSRGLRGWVVDNVNRN-GGTLADYVANTG 167
>gi|221199511|ref|ZP_03572555.1| major head protein [Burkholderia multivorans CGD2M]
gi|221205587|ref|ZP_03578602.1| major head protein [Burkholderia multivorans CGD2]
gi|221174425|gb|EEE06857.1| major head protein [Burkholderia multivorans CGD2]
gi|221180796|gb|EEE13199.1| major head protein [Burkholderia multivorans CGD2M]
Length = 317
Score = 161 bits (407), Expect = 4e-38, Method: Composition-based stats.
Identities = 55/164 (33%), Positives = 86/164 (52%), Gaps = 5/164 (3%)
Query: 4 VNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASP-GPNA 62
NT+ T ++ N+E L + V +I+P DTP S I+K ++ EW D L +P NA
Sbjct: 2 PANTYTTYTAVGNREDLINKVFQISPTDTPFTSAIEKTDAEGVYHEWQTDSLRAPTDSNA 61
Query: 63 QLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRK 122
+EG + ++ + +R+GN QI++ ++ +SGTQEAV G + KKA+E++K
Sbjct: 62 AVEGADATYNEQDPTKRIGNRCQIVQDTFSVSGTQEAVKRAGP-KEVARLSAKKAIELKK 120
Query: 123 DVEFALVSSQG---SEKTSPRKMAALSSWIKKNASRGTGGVLED 163
D+E + S KT RKM + W + N G G D
Sbjct: 121 DIEATSLVSGAAVVGSKTVARKMRGVKGWCETNFLGGAGAAAPD 164
>gi|291334595|gb|ADD94245.1| putative phage major head protein [uncultured phage
MedDCM-OCT-S04-C136]
Length = 316
Score = 153 bits (387), Expect = 6e-36, Method: Composition-based stats.
Identities = 44/150 (29%), Positives = 73/150 (48%), Gaps = 3/150 (2%)
Query: 8 FITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASPGPNAQLEGD 67
+ T + +E L+D++ I+P +TP S + K + +W D LA NA +EG
Sbjct: 4 YQTYQTIGIREDLADIIYSISPTETPFMSGVAKTKATNTLHQWQTDALADVAANAAVEGA 63
Query: 68 EYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFA 127
+ S+ T+ N+TQI K ++ T EAV G + Q K A E+++D+E A
Sbjct: 64 DISYGTMAPTVLENNHTQISTKGIQVTATNEAVTSAGRNNEMAYQVAKAAKELKRDMETA 123
Query: 128 LVSS---QGSEKTSPRKMAALSSWIKKNAS 154
L+S+ T+ RK+ +W + N
Sbjct: 124 LLSNVAKTAGNATTARKLGGCPTWYETNVD 153
>gi|307308932|ref|ZP_07588615.1| putative phage major head protein [Sinorhizobium meliloti BL225C]
gi|306900566|gb|EFN31179.1| putative phage major head protein [Sinorhizobium meliloti BL225C]
Length = 309
Score = 148 bits (373), Expect = 3e-34, Method: Composition-based stats.
Identities = 49/165 (29%), Positives = 76/165 (46%), Gaps = 5/165 (3%)
Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASPGP 60
M T T+ + +E L D++S I+PEDTP + I K + EW D L +
Sbjct: 1 MA----TLKTTDVSHVREDLEDIISNISPEDTPFLTSIAKVSASQKTHEWTQDKLRARNK 56
Query: 61 NAQLEGDEYSFKTINTPE-RMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALE 119
N + N+ R+ N+ QI ++ +SG+ A D VG + Q K +
Sbjct: 57 NNAAIEGAEAAAASNSAPVRLRNHAQIFTETVQVSGSLIASDTVGSKNELAYQLAKSIKQ 116
Query: 120 IRKDVEFALVSSQGSEKTSPRKMAALSSWIKKNASRGTGGVLEDM 164
++ D+E VS + S PR+M + +W+K NA GTGG
Sbjct: 117 VKGDIEATAVSEKASSLGEPREMGGMEAWVKTNALHGTGGATAGY 161
>gi|290457630|sp|P85987|CAPSD_BPSK1 RecName: Full=Major capsid protein; AltName: Full=Virion protein G
gi|221271431|dbj|BAH15184.1| major capsid protein [Serratia phage KSP100]
Length = 306
Score = 146 bits (369), Expect = 8e-34, Method: Composition-based stats.
Identities = 45/163 (27%), Positives = 67/163 (41%), Gaps = 9/163 (5%)
Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASPGP 60
M + T + KE +D VS I+PE TP+ SMI+K H+ +W D L
Sbjct: 1 MA----AYQTYTMAGIKEDFADWVSNISPEYTPLISMIRKFPVHNTMFQWQWDVLKDVDT 56
Query: 61 -NAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALE 119
N E + + + NY QIMRK +S + AV G + Q K A E
Sbjct: 57 ENQHNEASDAKDVELTPTTVVQNYVQIMRKVVFVSDSANAVSSHGREKELFYQLKKAAKE 116
Query: 120 IRKDVEFALV----SSQGSEKTSPRKMAALSSWIKKNASRGTG 158
+++D E + + T PR A+ S I + +
Sbjct: 117 LKRDNEGIFLLKDRAGDAGSATKPRLTASFGSLIDASMKKTAD 159
>gi|316934287|ref|YP_004109269.1| putative phage major head protein [Rhodopseudomonas palustris DX-1]
gi|315602001|gb|ADU44536.1| putative phage major head protein [Rhodopseudomonas palustris DX-1]
Length = 304
Score = 141 bits (355), Expect = 4e-32, Method: Composition-based stats.
Identities = 46/162 (28%), Positives = 75/162 (46%), Gaps = 5/162 (3%)
Query: 8 FITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASPGPNAQLEGD 67
+ + + KE +SD++S ITP TP S+ K T H+ + EW D+L + NAQ EG
Sbjct: 5 YTSYDAVGTKEDVSDIISMITPTKTPFTSLTKSETVHNTYYEWQEDELRATADNAQPEGF 64
Query: 68 EYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFA 127
+ GN TQIM ++ +SGT +AV G + + K + ++ D+E A
Sbjct: 65 TATPVARTPTIMRGNVTQIMSDTFEVSGTNDAVTKYGRGKESAREASKASAALKLDLEAA 124
Query: 128 LVSSQ-----GSEKTSPRKMAALSSWIKKNASRGTGGVLEDM 164
+ + ++PRK A + I + TG +
Sbjct: 125 FTKNDSDMVKPTVASTPRKFAGVQKQIDPDNIVYTGATGTKI 166
>gi|283856246|ref|YP_162122.2| putative phage major head protein [Zymomonas mobilis subsp. mobilis
ZM4]
gi|283775241|gb|AAV89011.2| putative phage major head protein [Zymomonas mobilis subsp. mobilis
ZM4]
Length = 304
Score = 130 bits (327), Expect = 6e-29, Method: Composition-based stats.
Identities = 42/139 (30%), Positives = 68/139 (48%), Gaps = 5/139 (3%)
Query: 31 DTPIYSMIKKGTTHSIHPEWVVDDLASP-GPNAQLEGDEYSFKTINTPERMGNYTQIMRK 89
+TP + I + T + + EW D+LAS N Q+EG + + ++ R+GNYTQIM K
Sbjct: 10 ETPFVTAIGQTTAKNTYTEWQTDNLASANAQNKQVEGADLANESRQPTVRVGNYTQIMTK 69
Query: 90 SWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS----EKTSPRKMAAL 145
S T AV + G ++ Q + E+++D+E + + R+ A
Sbjct: 70 VVGTSTTDRAVHNAGRGDEHAYQLARAGQELKRDIEARFTGNFAAIPGDGAVVARETAGA 129
Query: 146 SSWIKKNASRGTGGVLEDM 164
+W++ NA RG GG M
Sbjct: 130 LAWLRSNAHRGDGGANPVM 148
>gi|167583566|ref|YP_001671756.1| major head protein [Enterobacteria phage phiEco32]
gi|164375404|gb|ABY52812.1| major head protein [Enterobacteria phage phiEco32]
Length = 352
Score = 129 bits (325), Expect = 1e-28, Method: Composition-based stats.
Identities = 41/137 (29%), Positives = 61/137 (44%), Gaps = 2/137 (1%)
Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASP-G 59
M F++ K S ++ +S ++P+DTP SM K + + W D LAS G
Sbjct: 1 MANPT-LFVSYDQNGKKLSFANWISVLSPQDTPFVSMTGKESINQTIFSWQTDALASVDG 59
Query: 60 PNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALE 119
NA +EG + N TQI+RK +S T + G + Q KK E
Sbjct: 60 NNAHVEGSRAEDGEMKPTVIKSNVTQILRKVVRVSDTANTTANYGRGRELMYQLEKKGKE 119
Query: 120 IRKDVEFALVSSQGSEK 136
I++D+E L+S Q
Sbjct: 120 IKRDLEKILLSGQARTD 136
>gi|154174521|ref|YP_001409081.1| hypothetical protein CCV52592_0028 [Campylobacter curvus 525.92]
gi|112803013|gb|EAU00357.1| conserved hypothetical protein [Campylobacter curvus 525.92]
Length = 327
Score = 119 bits (297), Expect = 2e-25, Method: Composition-based stats.
Identities = 44/181 (24%), Positives = 79/181 (43%), Gaps = 17/181 (9%)
Query: 1 MTIVNNTFIT--SSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASP 58
M I + F + S+ D + I ++TP+ S+I SI W+ D + P
Sbjct: 1 MAITSTGFQAPATKRVGLVPSVYDKIILIGADETPMLSLIGTSKVKSIKHSWITDTIGEP 60
Query: 59 GPNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKAL 118
NAQ+E ++S +T +++ N TQI +S T + G + + + KKA
Sbjct: 61 KKNAQIEISDFSGAGKSTKKQLDNDTQIFTTEVSVSKTMQTAQTYG-GKELENEITKKAK 119
Query: 119 EIRKDVEFALV--------------SSQGSEKTSPRKMAALSSWIKKNASRGTGGVLEDM 164
E + D+E+AL ++ T+ +MA + ++ AS TGG ++
Sbjct: 120 EHKLDIEYALFGLGRDADAKKSVFKAATPRTDTTASEMAGIFYYVANGASAFTGGKCGNV 179
Query: 165 I 165
+
Sbjct: 180 L 180
>gi|291334405|gb|ADD94061.1| major head protein [uncultured phage MedDCM-OCT-S01-C1]
Length = 344
Score = 118 bits (295), Expect = 3e-25, Method: Composition-based stats.
Identities = 34/152 (22%), Positives = 66/152 (43%), Gaps = 4/152 (2%)
Query: 16 NKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASPG-PNAQLEGDEYSFKTI 74
E + + I+ P ++ T + +WVVD+L +P NA+++G + +
Sbjct: 20 INEDVMQKIFDISKIPLPFTDLVGSTTHKNERFDWVVDELRAPDVTNARVDGSDAGTASE 79
Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQ-- 132
R+GN++QI + +S +A D +G + + + +IR+DVE +++Q
Sbjct: 80 AGGARVGNHSQISDEVIAVSYRADASDTIGRTKELAYRITRGNQQIRRDVEAMALNNQAS 139
Query: 133 -GSEKTSPRKMAALSSWIKKNASRGTGGVLED 163
T L +WI+ +G G
Sbjct: 140 VAGTDTVAGVTGGLPTWIETTVMQGDGSAAVT 171
>gi|256751057|ref|ZP_05491940.1| conserved hypothetical protein [Thermoanaerobacter ethanolicus
CCSD1]
gi|256750167|gb|EEU63188.1| conserved hypothetical protein [Thermoanaerobacter ethanolicus
CCSD1]
Length = 292
Score = 118 bits (295), Expect = 3e-25, Method: Composition-based stats.
Identities = 36/165 (21%), Positives = 70/165 (42%), Gaps = 12/165 (7%)
Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTT----HSIHPEWVVDDLA 56
M +N + K L++ ++ + P DTP+++ + +S W L
Sbjct: 1 MIQTSN-----FTVGEKIDLTNEIALVQPLDTPLFTYLMSRKAYDKANSTIVTWREKTLD 55
Query: 57 SPGPNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKK 116
+ + EG E + + N +I +K+ +SGT EA++ G Y + +
Sbjct: 56 TTEDISVPEGSETNVFYKSDRVEKNNVCEIFKKAVQISGTAEAINIKGIGDLYASEMADR 115
Query: 117 ALEIRKDVEFALVSSQGSEKTS--PRKMAALSSWIKK-NASRGTG 158
EI+ ++E L++ + ++ RKMA L S++ N T
Sbjct: 116 LAEIKVNIEKKLINGVKDDGSTSGIRKMAGLLSFVLTENKVSNTA 160
>gi|257458669|ref|ZP_05623796.1| conserved hypothetical protein [Campylobacter gracilis RM3268]
gi|257443942|gb|EEV19058.1| conserved hypothetical protein [Campylobacter gracilis RM3268]
Length = 328
Score = 115 bits (288), Expect = 2e-24, Method: Composition-based stats.
Identities = 43/175 (24%), Positives = 77/175 (44%), Gaps = 17/175 (9%)
Query: 1 MTIVNNTFITSSST--TNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASP 58
M I + F ++ K S+ D + I +DTP+ S+I + W+ D++A+P
Sbjct: 1 MAITSTGFQAPATKREGLKPSVYDSIILIGADDTPVLSLIGTSNVTNTEHSWLTDNIAAP 60
Query: 59 GPNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKAL 118
NAQLE +++ +T ++ N QI + +S T + V G + + + K+A
Sbjct: 61 KKNAQLEISDFADDRKSTIQKTTNSVQIFTTNISVSYTMQKVATYG-GKEMERETTKRAK 119
Query: 119 EIRKDVEFALV--------------SSQGSEKTSPRKMAALSSWIKKNASRGTGG 159
E ++D+E+AL + T +MA + +I K S G
Sbjct: 120 EHKRDMEYALFGLGRDTDTKVSIFKAPTSRADTVAGEMAGMFYYISKGESAFVNG 174
>gi|331088860|ref|ZP_08337770.1| hypothetical protein HMPREF1025_01353 [Lachnospiraceae bacterium
3_1_46FAA]
gi|330407383|gb|EGG86886.1| hypothetical protein HMPREF1025_01353 [Lachnospiraceae bacterium
3_1_46FAA]
Length = 314
Score = 111 bits (278), Expect = 3e-23, Method: Composition-based stats.
Identities = 35/151 (23%), Positives = 68/151 (45%), Gaps = 6/151 (3%)
Query: 19 SLSDVVSRITPEDTPIYSMIKK----GTTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTI 74
L++ + ++P DTP+ +M+ I W +L + +LEG E
Sbjct: 17 DLTEEIKLVSPTDTPLTTMLMGRGAVEPATDITVTWRERELNANRGTLKLEGAEAGAVIT 76
Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS 134
+T + N QI+ K +SGT A+ G + + + +E ++D+E+ ++ +
Sbjct: 77 STRGSLSNVCQIIEKVTQVSGTARALHPKGIGDTFTAEVQDRLIETKRDLEWYFLNGTKT 136
Query: 135 --EKTSPRKMAALSSWIKKNASRGTGGVLED 163
++PR+MA L + + N T G L +
Sbjct: 137 LEADSTPRQMAGLINLVNDNNVVSTAGALSE 167
>gi|291526329|emb|CBK91916.1| hypothetical protein EUR_29920 [Eubacterium rectale DSM 17629]
Length = 304
Score = 110 bits (276), Expect = 6e-23, Method: Composition-based stats.
Identities = 31/151 (20%), Positives = 68/151 (45%), Gaps = 6/151 (3%)
Query: 19 SLSDVVSRITPEDTPIYSMIKKGT----THSIHPEWVVDDLASPGPNAQLEGDEYSFKTI 74
L++ + ++P DTP+ +++ + I W +L S +LEG E
Sbjct: 17 DLTEEIKLVSPTDTPLTTLLMGRGQVVPANDITVTWREKELNSDRGTLKLEGSEAGEAIT 76
Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS 134
+ + + N QI+ K +SGT +++ G + + + +E ++D+E+ ++ +
Sbjct: 77 SGRKTLSNVCQIIEKVTQVSGTARSLNPKGIGDVFNSEVQDRLVETKRDMEWYFLNGTKA 136
Query: 135 --EKTSPRKMAALSSWIKKNASRGTGGVLED 163
++PR+M L + + T G L +
Sbjct: 137 LESGSTPRQMNGLVNLVASGNVVETKGALTE 167
>gi|196048420|ref|ZP_03115595.1| conserved hypothetical protein [Bacillus cereus 03BB108]
gi|196020677|gb|EDX59409.1| conserved hypothetical protein [Bacillus cereus 03BB108]
Length = 312
Score = 110 bits (275), Expect = 7e-23, Method: Composition-based stats.
Identities = 34/143 (23%), Positives = 64/143 (44%), Gaps = 7/143 (4%)
Query: 16 NKESLSDVVSRITPEDTPIYSMIK----KGTTHSIHPEWVVDDLASPGPNAQLEGDEYSF 71
K LS+ ++ +P DTP +++ + S W L S QLEG + +
Sbjct: 11 EKIDLSEAIAYASPMDTPFTTLLLQNGLTADSTSTEISWREAALDSNRKGPQLEGADATD 70
Query: 72 KTINTPERMGNYTQIMRKSWILSGTQEAVDDVG-YILKYKEQKLKKALEIRKDVEFALVS 130
T E + N QI +++ +SG+ EAV G + + + +E + D+E+ +
Sbjct: 71 PNKTTRELIKNNQQIFQRTAEVSGSLEAVKVPGVPGGEMASEINDRMIESKVDLEWYALQ 130
Query: 131 SQGSE--KTSPRKMAALSSWIKK 151
++ ++PR+M L + I
Sbjct: 131 GTKADESGSTPRQMNGLINLINS 153
>gi|229187822|ref|ZP_04314947.1| hypothetical protein bcere0004_53480 [Bacillus cereus BGSC 6E1]
gi|228595657|gb|EEK53352.1| hypothetical protein bcere0004_53480 [Bacillus cereus BGSC 6E1]
Length = 313
Score = 108 bits (271), Expect = 2e-22, Method: Composition-based stats.
Identities = 37/156 (23%), Positives = 63/156 (40%), Gaps = 8/156 (5%)
Query: 16 NKESLSDVVSRITPEDTPIYSMIK----KGTTHSIHPEWVVDDLASPGPNAQLEGDEYSF 71
K LS ++ +P DTP +++ S W L S QLEG +
Sbjct: 11 EKIDLSQAIAYASPMDTPFTTLLLQNGLTADATSTEISWREAALDSNRKGPQLEGANATD 70
Query: 72 KTINTPERMGNYTQIMRKSWILSGTQEAVDDVG-YILKYKEQKLKKALEIRKDVEFALVS 130
E + N QI +++ +SG+ EAV G + + + +E + D+E+ +
Sbjct: 71 PNKTVRELIKNNQQIFQRTAEVSGSLEAVKVPGVPGGEMASEINDRMIEAKVDLEWYALQ 130
Query: 131 SQGSE--KTSPRKMAALSSWIKK-NASRGTGGVLED 163
++ +PR+M L + I N T G L
Sbjct: 131 GTKADESGATPRQMNGLINLINSRNKFTPTSGKLSA 166
>gi|291336566|gb|ADD96115.1| hypothetical protein HG1285_12862 [uncultured organism
MedDCM-OCT-S04-C6]
Length = 347
Score = 103 bits (256), Expect = 9e-21, Method: Composition-based stats.
Identities = 40/158 (25%), Positives = 73/158 (46%), Gaps = 10/158 (6%)
Query: 12 SSTTNKESLSDVVSRITPEDTPIYSMIKKGTT-HSIHPEWVVDDLASPGPNAQLEGDEYS 70
KE L D+++R+ + TP S++ KG+T H+ +W VD A ++G + +
Sbjct: 8 DQVAKKEDLLDLITRVDEKATPFMSLVNKGSTPHNTFIQWPVDTYADAALGGTVDGTDVA 67
Query: 71 FKTINTPERM--GNYTQIMRKSWILSGTQEAVDDV---GYILKYKEQKLKKALEIRKDVE 125
+ R +Y Q RK++ +S + V DV G + E K +E+ +++E
Sbjct: 68 SYANHAENRTLLSSYLQTFRKAYQVSRLAQEVSDVAGLGAGNEIAEASAKAGVELVRNME 127
Query: 126 FALVSSQ----GSEKTSPRKMAALSSWIKKNASRGTGG 159
L+S Q + ++ + L WI+ +A T G
Sbjct: 128 ATLLSDQEHQVDNGSSNAYLLRGLGVWIRDSARLTTPG 165
>gi|315144740|gb|EFT88756.1| conserved hypothetical protein [Enterococcus faecalis TX2141]
Length = 300
Score = 102 bits (253), Expect = 2e-20, Method: Composition-based stats.
Identities = 38/152 (25%), Positives = 67/152 (44%), Gaps = 7/152 (4%)
Query: 19 SLSDVVSRITPEDTPIYS----MIKKGTTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTI 74
+S ++ + TP S K S +W +L +AQLEG +Y+
Sbjct: 13 DISQEINALQRPSTPFLSWLLGAGKTSPATSTEIKWRESELDGEDSSAQLEGGDYTD-AD 71
Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS 134
+ + NYT+I RKS +SGT +A++ G + Q ++ALE+++D+ L+ +
Sbjct: 72 SGRKWFNNYTEIFRKSTSVSGTLDAINVNGVGSELANQVSQRALEMKRDLNKKLLIGVKA 131
Query: 135 --EKTSPRKMAALSSWIKKNASRGTGGVLEDM 164
T R+MA + + I + T
Sbjct: 132 DENGTKGRQMAGVINLINSDNLVKTSAADAVT 163
>gi|217961109|ref|YP_002339677.1| hypothetical protein BCAH187_A3735 [Bacillus cereus AH187]
gi|229140327|ref|ZP_04268882.1| hypothetical protein bcere0013_34260 [Bacillus cereus BDRD-ST26]
gi|217064163|gb|ACJ78413.1| conserved hypothetical protein [Bacillus cereus AH187]
gi|228642888|gb|EEK99164.1| hypothetical protein bcere0013_34260 [Bacillus cereus BDRD-ST26]
Length = 293
Score = 102 bits (253), Expect = 2e-20, Method: Composition-based stats.
Identities = 25/151 (16%), Positives = 63/151 (41%), Gaps = 9/151 (5%)
Query: 20 LSDVVSRITPEDTPIYSMIKKG----TTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTIN 75
L+D ++ + P TP ++++ + W L + EG + + +
Sbjct: 15 LTDEIALVAPIATPFFTLLMSKGLYVDSKGKFHTWREKTLDGTADISVDEGIDATQFVQS 74
Query: 76 TPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGSE 135
+ N +I K+ +SGT +A VG + ++ + +E+ +E L++ ++
Sbjct: 75 GRAELNNVMEIFYKATSVSGTAQATGAVG--DLFAQEINDRLIELAIGMEKKLINGVKND 132
Query: 136 -KTSPRKMAALSSWIKKNASRGTGGVLEDMI 165
+ R+M + ++ + G +D++
Sbjct: 133 GASGKRQMDGILKFVDADNVV--NGATKDVL 161
>gi|307286482|ref|ZP_07566582.1| hypothetical protein HMPREF9505_00059 [Enterococcus faecalis
TX0109]
gi|306502395|gb|EFM71671.1| hypothetical protein HMPREF9505_00059 [Enterococcus faecalis
TX0109]
Length = 300
Score = 102 bits (253), Expect = 3e-20, Method: Composition-based stats.
Identities = 38/152 (25%), Positives = 66/152 (43%), Gaps = 7/152 (4%)
Query: 19 SLSDVVSRITPEDTPIYS----MIKKGTTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTI 74
+S ++ + TP S K S +W +L +AQLEG +Y+
Sbjct: 13 DISQEINALQRPSTPFLSWLLGAGKTSPATSTEIKWRESELDGEDSSAQLEGGDYTD-AD 71
Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS 134
+ + NYT+I RKS +SGT +A++ G + Q ++ALE++ D+ L+ +
Sbjct: 72 SGRKWFNNYTEIFRKSTSVSGTLDAINVNGVGSELANQVSQRALEMKLDLNKKLLIGVKA 131
Query: 135 --EKTSPRKMAALSSWIKKNASRGTGGVLEDM 164
T R+MA + + I + T
Sbjct: 132 NENGTKGRQMAGVINLINSDNLVKTSAADAVT 163
>gi|226305754|ref|YP_002765714.1| hypothetical protein RER_22670 [Rhodococcus erythropolis PR4]
gi|226184871|dbj|BAH32975.1| hypothetical protein RER_22670 [Rhodococcus erythropolis PR4]
Length = 317
Score = 101 bits (252), Expect = 3e-20, Method: Composition-based stats.
Identities = 33/190 (17%), Positives = 60/190 (31%), Gaps = 26/190 (13%)
Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKK----GTTHSIHPEWVVDDLA 56
M + T + + + +++ EDTP S I T S W DL
Sbjct: 1 MPGITGMGTTYNL----PNYVGELFQLSTEDTPFLSAIGGLTGGEDTGSTIFTWQTADLR 56
Query: 57 SPGPNAQL-EGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVD---------DVG-- 104
Q EG + N +I ++ +S T++ G
Sbjct: 57 DADETRQRLEGADAPTAEGRKRSSGSNVLEIHQEQVSVSYTKQGATRQLTGTDPMQAGVQ 116
Query: 105 -YILKYKEQKLKKALEIRKDVEFALVSSQ---GSEKTSPRKMAALSSWIKKNASRGTGGV 160
+ Q + +I +DVE + + ++ T+ R+ + + N
Sbjct: 117 PVTDELTFQTAAEIKQIARDVEKSFIVGTYNLPTDNTTKRRTRGILEAVTSNVVTNGTPA 176
Query: 161 --LEDMILSL 168
E M+L L
Sbjct: 177 ALTETMLLDL 186
>gi|57237589|ref|YP_178603.1| hypothetical protein CJE0587 [Campylobacter jejuni RM1221]
gi|57166393|gb|AAW35172.1| hypothetical protein CJE0587 [Campylobacter jejuni RM1221]
Length = 344
Score = 101 bits (252), Expect = 3e-20, Method: Composition-based stats.
Identities = 43/183 (23%), Positives = 76/183 (41%), Gaps = 20/183 (10%)
Query: 1 MTIVN--NTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIH-PEWVVDDLAS 57
M + + +T + + K+S+ + + +I +TPI + I + W+ D
Sbjct: 1 MALPSMGHTSPATENVKLKQSIYETIIKIGATETPILNKIGTSKVTNPLTHSWITDTFEE 60
Query: 58 PGPNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKA 117
P NA LE ++ +T NT ++ N TQI ++S + G + + Q KK
Sbjct: 61 PKKNANLELSKFVGETKNTAQKTTNATQIFITEAMVSKALLKANQYG-GNEMEYQIGKKT 119
Query: 118 LEIRKDVEFALVS---------------SQGSEKTSPRKMAALSSWIKKNASRGTGGVLE 162
E + D+E+AL Q E TS +MA L +I K + G
Sbjct: 120 KEHKMDMEYALFGLGRDSDVKKSVFKDYVQAQEATSG-EMAGLFHYIAKGKDSFSDGKRG 178
Query: 163 DMI 165
+++
Sbjct: 179 NVL 181
>gi|315929828|gb|EFV08993.1| hypothetical protein CSS_0883 [Campylobacter jejuni subsp. jejuni
305]
Length = 344
Score = 100 bits (250), Expect = 5e-20, Method: Composition-based stats.
Identities = 43/183 (23%), Positives = 75/183 (40%), Gaps = 20/183 (10%)
Query: 1 MTIVN--NTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIH-PEWVVDDLAS 57
M + + +T + + K+S+ + + +I +TPI + I + W+ D
Sbjct: 1 MALPSMGHTSPATENVKLKQSIYETIIKIGATETPILNKIGTSKVTNPLTHSWITDTFEE 60
Query: 58 PGPNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKA 117
P NA LE ++ +T NT ++ N TQI ++S + G + + Q KK
Sbjct: 61 PKKNANLELSKFVGETKNTAQKTTNATQIFITEAMVSKALLKANQYG-GNEMEYQIGKKT 119
Query: 118 LEIRKDVEFALVS---------------SQGSEKTSPRKMAALSSWIKKNASRGTGGVLE 162
E + D+E+AL Q E TS +MA L +I K G
Sbjct: 120 KEHKMDMEYALFGLGRDSDVKKSVFKDYVQAQEATSG-EMAGLFHYIAKGKDNFADGKRG 178
Query: 163 DMI 165
+++
Sbjct: 179 NVL 181
>gi|283956330|ref|ZP_06373810.1| hypothetical protein C1336_000250101 [Campylobacter jejuni subsp.
jejuni 1336]
gi|283792050|gb|EFC30839.1| hypothetical protein C1336_000250101 [Campylobacter jejuni subsp.
jejuni 1336]
Length = 344
Score = 100 bits (249), Expect = 6e-20, Method: Composition-based stats.
Identities = 43/183 (23%), Positives = 75/183 (40%), Gaps = 20/183 (10%)
Query: 1 MTIVN--NTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIH-PEWVVDDLAS 57
M + + +T + + K+S+ + + +I +TPI + I + W+ D
Sbjct: 1 MALPSMGHTAPATENVKLKQSIYETIIKIGATETPILNKIGTSKVTNPLTHSWITDTFEE 60
Query: 58 PGPNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKA 117
P NA LE ++ +T NT ++ N TQI ++S + G + + Q KK
Sbjct: 61 PKKNANLELSKFVGETKNTAQKTTNATQIFITEAMVSKALLKANQYG-GNEMEYQIGKKT 119
Query: 118 LEIRKDVEFALVS---------------SQGSEKTSPRKMAALSSWIKKNASRGTGGVLE 162
E + D+E+AL Q E TS +MA L +I K G
Sbjct: 120 KEHKMDMEYALFGLGRDSDVKKSVFKDYVQAQEATSG-EMAGLFHYIAKGKDSFADGKRG 178
Query: 163 DMI 165
+++
Sbjct: 179 NVL 181
>gi|281417131|ref|ZP_06248151.1| conserved hypothetical protein [Clostridium thermocellum JW20]
gi|281408533|gb|EFB38791.1| conserved hypothetical protein [Clostridium thermocellum JW20]
Length = 292
Score = 100 bits (249), Expect = 7e-20, Method: Composition-based stats.
Identities = 28/155 (18%), Positives = 61/155 (39%), Gaps = 12/155 (7%)
Query: 3 IVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGT----THSIHPEWVVDDLASP 58
I + F T + LS + I+P DTP+ +++ S+ W L
Sbjct: 2 IKTSHFTTHENI----DLSKEIVLISPSDTPLTTLLMNKKLVETAGSVTINWREKTLDDT 57
Query: 59 GPNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKAL 118
++ EG + N +I K+ +SG+ +A + G + + +
Sbjct: 58 EDISKTEGFTVDTFVSSGRAEKSNVMEIFSKAVQVSGSAQASNITGINDLFASEISDRLT 117
Query: 119 EIRKDVEFALVS----SQGSEKTSPRKMAALSSWI 149
E++ ++E +++ + GS R+M ++ +
Sbjct: 118 EVKVNIEKKMLAPKNYNDGSSAPFIRRMKSIFEQV 152
>gi|238909129|ref|YP_002939596.1| hypothetical protein EUBELI_10025 [Eubacterium eligens ATCC 27750]
gi|238873366|gb|ACR73075.1| Hypothetical protein EUBELI_10025 [Eubacterium eligens ATCC 27750]
Length = 304
Score = 100 bits (249), Expect = 7e-20, Method: Composition-based stats.
Identities = 32/151 (21%), Positives = 69/151 (45%), Gaps = 6/151 (3%)
Query: 19 SLSDVVSRITPEDTPIYSMIKKGT----THSIHPEWVVDDLASPGPNAQLEGDEYSFKTI 74
L++ + + +P DTP+ +++ I W +L S +LEG E
Sbjct: 17 DLTEEIKQTSPTDTPLTTLLMSRGQVVPAKDITVTWREKELNSERGTLKLEGSEAGEVIT 76
Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS 134
++ + + N QI+ K +SGT +++ +G + + + +E ++D+E+ ++ +
Sbjct: 77 SSRKTLSNVCQIIEKVTQVSGTARSLNPMGINDVFNAEVQDRLVETKRDMEWYFLNGTKA 136
Query: 135 --EKTSPRKMAALSSWIKKNASRGTGGVLED 163
+PR+M L + + N T G L +
Sbjct: 137 LESGATPRQMNGLVNLVNANNVVETKGALTE 167
>gi|239828160|ref|YP_002950784.1| hypothetical protein GWCH70_2835 [Geobacillus sp. WCH70]
gi|239808453|gb|ACS25518.1| conserved hypothetical protein [Geobacillus sp. WCH70]
Length = 283
Score = 99.4 bits (246), Expect = 2e-19, Method: Composition-based stats.
Identities = 25/146 (17%), Positives = 48/146 (32%), Gaps = 4/146 (2%)
Query: 7 TFITSS-STTNKESLSDVVSRITPEDTPIYS--MIKKGTTHSIHPEWVVDDLASPGPNAQ 63
F + + L DV+ + + P + M K S W+ +++A
Sbjct: 1 MFTSQDFAVGQNYDLKDVLIEVNKKQNPFVTFLMSKTVKATSPQVHWITEEIADSAVTLA 60
Query: 64 LEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKD 123
GD +F R NY +I + ++ T + VG + KK I++
Sbjct: 61 EGGDAPAFVKDTLAPRE-NYLEIFAATATVTNTAQYSKAVGINDLLAHEVEKKTKAIKRR 119
Query: 124 VEFALVSSQGSEKTSPRKMAALSSWI 149
+E + + + I
Sbjct: 120 MENKFIHGTKGYSNGVYTTDGILAQI 145
>gi|256956794|ref|ZP_05560965.1| conserved hypothetical protein [Enterococcus faecalis DS5]
gi|256947290|gb|EEU63922.1| conserved hypothetical protein [Enterococcus faecalis DS5]
gi|295113775|emb|CBL32412.1| hypothetical protein [Enterococcus sp. 7L76]
gi|315035894|gb|EFT47826.1| conserved hypothetical protein [Enterococcus faecalis TX0027]
Length = 300
Score = 99.0 bits (245), Expect = 2e-19, Method: Composition-based stats.
Identities = 40/152 (26%), Positives = 66/152 (43%), Gaps = 7/152 (4%)
Query: 19 SLSDVVSRITPEDTPIYS---MIKKGT-THSIHPEWVVDDLASPGPNAQLEGDEYSFKTI 74
+S V+ + TP S K + S +W +L +AQLEG EY
Sbjct: 13 DISQEVNALQRPSTPFLSWLLGAGKTSPATSTEIKWRESELDGEDSSAQLEGGEY-RDAD 71
Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS 134
+ + NYT+I RKS +SGT +A++ G + Q ++ALE++ D+ L+ +
Sbjct: 72 SGRKWFNNYTEIFRKSTSVSGTLDAINVNGVGSELANQVSQRALEMKLDLNKKLLIGVKA 131
Query: 135 --EKTSPRKMAALSSWIKKNASRGTGGVLEDM 164
T R+MA + + I + T
Sbjct: 132 DENGTKGRQMAGVINLINSDNLVKTSAADAVT 163
>gi|134298256|ref|YP_001111752.1| hypothetical protein Dred_0379 [Desulfotomaculum reducens MI-1]
gi|134050956|gb|ABO48927.1| hypothetical protein Dred_0379 [Desulfotomaculum reducens MI-1]
Length = 285
Score = 98.6 bits (244), Expect = 3e-19, Method: Composition-based stats.
Identities = 24/156 (15%), Positives = 55/156 (35%), Gaps = 5/156 (3%)
Query: 13 STTNKESLSDVVSRITPEDTPIYSMI--KKGTTHSIHPEWVVDDLASPGPNAQLEGDEYS 70
+ DV+ + TP TP +++ K + W+ + + EG +
Sbjct: 8 VAGQSIDMKDVLIQTTPILTPFTTLLLPKTVKAENATLNWIEEAINESAAVTLGEGADAP 67
Query: 71 FKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVS 130
+T + NY +++ + +S T +A + G + +KK ++ +E L++
Sbjct: 68 NPVDDTLAPISNYCELIGATATVSNTAQATNAKGISDLLAHEIVKKTKAMKIKMENILIN 127
Query: 131 SQGS--EKTSPRKMAALSSWIKKNASRGTGGVLEDM 164
T + + I ++ T
Sbjct: 128 GTKGYVSATKTYTTDGILAQINP-VNQVTNATFTKT 162
>gi|307270079|ref|ZP_07551399.1| hypothetical protein HMPREF9498_02197 [Enterococcus faecalis
TX4248]
gi|306513574|gb|EFM82186.1| hypothetical protein HMPREF9498_02197 [Enterococcus faecalis
TX4248]
Length = 300
Score = 98.6 bits (244), Expect = 3e-19, Method: Composition-based stats.
Identities = 36/152 (23%), Positives = 65/152 (42%), Gaps = 7/152 (4%)
Query: 19 SLSDVVSRITPEDTPIYS----MIKKGTTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTI 74
+S ++ + TP S K S +W ++ +AQLEG EY+
Sbjct: 13 DISQEINALQRPSTPFLSWLLGAGKTRPATSTEIKWREYEMNGEDSSAQLEGGEYNE-AE 71
Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS 134
+ + NY +I RKS +SGT +A++ G + Q ++ALE++ D+ L+ +
Sbjct: 72 SGRKWFNNYAEIFRKSTSVSGTLDAINVNGVGSELANQVSQRALEMKLDLNKKLLIGVKA 131
Query: 135 --EKTSPRKMAALSSWIKKNASRGTGGVLEDM 164
+ R+MA + + I + T
Sbjct: 132 DENGSKGRQMAGVINLINSDNLVKTSAADAVT 163
>gi|256375780|ref|YP_003099440.1| hypothetical protein Amir_1645 [Actinosynnema mirum DSM 43827]
gi|255920083|gb|ACU35594.1| hypothetical protein Amir_1645 [Actinosynnema mirum DSM 43827]
Length = 406
Score = 98.6 bits (244), Expect = 3e-19, Method: Composition-based stats.
Identities = 40/191 (20%), Positives = 62/191 (32%), Gaps = 31/191 (16%)
Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKK----GTTHSIHPEWVVDDLA 56
M + T + L +TPEDTP+ S I S EW DL
Sbjct: 1 MAGITGMGTTFNLPNYHGEL----FGLTPEDTPLLSAIGGLGSGSEITSKEWEWQAYDLR 56
Query: 57 SPGPNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAV------------DDVG 104
P LEG N QI+ + S T++A
Sbjct: 57 DPAQRVALEGQTAPTGEARVRTNFSNVVQIVHERVSTSYTKQAAIGQFAANSAPISGANP 116
Query: 105 YILKYKEQKLKKALEIRKDVEFALVSSQ---GSEKTSPRKMAALSSWIK--------KNA 153
++ Q + +I +DV + ++ Q S+ +SPRK L + N
Sbjct: 117 ITDEHDWQVTQAVKQIARDVNWTCINGQYAKPSDNSSPRKTRGLMQAVSAANTVDRGSNV 176
Query: 154 SRGTGGVLEDM 164
+ G V + +
Sbjct: 177 ATGASSVTDTI 187
>gi|315173098|gb|EFU17115.1| conserved hypothetical protein [Enterococcus faecalis TX1346]
Length = 300
Score = 98.6 bits (244), Expect = 3e-19, Method: Composition-based stats.
Identities = 40/152 (26%), Positives = 66/152 (43%), Gaps = 7/152 (4%)
Query: 19 SLSDVVSRITPEDTPIYS---MIKKGT-THSIHPEWVVDDLASPGPNAQLEGDEYSFKTI 74
+S V+ + TP S K + S +W +L +AQLEG EY
Sbjct: 13 DISQEVNALQRPSTPFLSWLLGAGKTSPATSTEIKWRESELDGEDSSAQLEGGEY-KDAD 71
Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS 134
+ + NYT+I RKS +SGT +A++ G + Q ++ALE++ D+ L+ +
Sbjct: 72 SGRKWFNNYTEIFRKSTSVSGTLDAINVNGVGSELANQVSQRALEMKLDLNKKLLIGVKA 131
Query: 135 --EKTSPRKMAALSSWIKKNASRGTGGVLEDM 164
T R+MA + + I + T
Sbjct: 132 DENGTKGRQMAGVINLINSDNLVKTSAADAVT 163
>gi|134299981|ref|YP_001113477.1| hypothetical protein Dred_2135 [Desulfotomaculum reducens MI-1]
gi|134052681|gb|ABO50652.1| hypothetical protein Dred_2135 [Desulfotomaculum reducens MI-1]
Length = 285
Score = 98.2 bits (243), Expect = 4e-19, Method: Composition-based stats.
Identities = 26/156 (16%), Positives = 56/156 (35%), Gaps = 5/156 (3%)
Query: 13 STTNKESLSDVVSRITPEDTPIYSMI--KKGTTHSIHPEWVVDDLASPGPNAQLEGDEYS 70
T + DV+ + TP TP +++ K + W+ + + EG +
Sbjct: 8 VTGQSIDMKDVLIQTTPILTPFTTLLLPKTVKAENATLNWIEEAINENAAVTLGEGADAP 67
Query: 71 FKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVS 130
+T NY +++ + +S T +A + G + +KK ++ +E L++
Sbjct: 68 NPVDDTLTPCSNYCELVGATATVSNTAQATNAKGISDLLAHETVKKTKAMKIRMENILIN 127
Query: 131 SQGS--EKTSPRKMAALSSWIKKNASRGTGGVLEDM 164
T + + I A++ T
Sbjct: 128 GTKGYVSATKTYTTDGILAQINP-ANKVTNATFTKT 162
>gi|257079386|ref|ZP_05573747.1| predicted protein [Enterococcus faecalis JH1]
gi|256987416|gb|EEU74718.1| predicted protein [Enterococcus faecalis JH1]
Length = 300
Score = 97.8 bits (242), Expect = 4e-19, Method: Composition-based stats.
Identities = 36/152 (23%), Positives = 65/152 (42%), Gaps = 7/152 (4%)
Query: 19 SLSDVVSRITPEDTPIYS----MIKKGTTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTI 74
+S ++ + TP S K S +W ++ +AQLEG EY+
Sbjct: 13 DISQEINALQRPSTPFLSWLLGAGKTRPATSTEIKWREYEMNGEDSSAQLEGGEYNE-AE 71
Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS 134
+ + NY +I RKS +SGT +A++ G + Q ++ALE++ D+ L+ +
Sbjct: 72 SGRKWFNNYAEIFRKSTSVSGTLDAINVNGVGSELANQVSQRALEMKLDLNKKLLIGVKA 131
Query: 135 --EKTSPRKMAALSSWIKKNASRGTGGVLEDM 164
+ R+MA + + I + T
Sbjct: 132 DENGSKGRQMAGVINLINSDNLVKTSAADAVT 163
>gi|307280635|ref|ZP_07561683.1| hypothetical protein HMPREF9515_01677 [Enterococcus faecalis
TX0860]
gi|306504001|gb|EFM73218.1| hypothetical protein HMPREF9515_01677 [Enterococcus faecalis
TX0860]
Length = 300
Score = 97.8 bits (242), Expect = 5e-19, Method: Composition-based stats.
Identities = 39/152 (25%), Positives = 65/152 (42%), Gaps = 7/152 (4%)
Query: 19 SLSDVVSRITPEDTPIYS---MIKKGT-THSIHPEWVVDDLASPGPNAQLEGDEYSFKTI 74
+S V+ + TP S K + S +W +L +AQLEG EY
Sbjct: 13 DISQEVNALQRPSTPFLSWLLGAGKTSPATSTEIKWRESELDGEDSSAQLEGGEY-KDAD 71
Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS 134
+ + NYT+I RKS +SGT +A++ G + Q ++ALE++ D+ L+ +
Sbjct: 72 SGRKWFSNYTEIFRKSTSVSGTLDAINVNGVGSELANQVSQRALEMKLDLNKKLLIGVKA 131
Query: 135 --EKTSPRKMAALSSWIKKNASRGTGGVLEDM 164
R+MA + + I + T
Sbjct: 132 DENGDKGRQMAGVINLINSDNLVKTSAADAVT 163
>gi|152975085|ref|YP_001374602.1| hypothetical protein Bcer98_1285 [Bacillus cereus subsp. cytotoxis
NVH 391-98]
gi|152023837|gb|ABS21607.1| conserved hypothetical protein [Bacillus cytotoxicus NVH 391-98]
Length = 293
Score = 97.4 bits (241), Expect = 6e-19, Method: Composition-based stats.
Identities = 26/151 (17%), Positives = 62/151 (41%), Gaps = 9/151 (5%)
Query: 20 LSDVVSRITPEDTPIYSMIKKG----TTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTIN 75
L+D ++ + P TP ++++ + W L EG + + +
Sbjct: 15 LTDEIALVAPIATPFFTLLMSKGLYVDSKGKFHTWREKTLDGTADITVDEGVDATQFVQS 74
Query: 76 TPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGSE 135
+ N +I K+ +SGT ++ VG + ++ + +E+ +E L++ ++
Sbjct: 75 GRAELNNVMEIFYKATSVSGTAQSTGAVG--DLFAQEINDRLVELAIGIENKLINGVKND 132
Query: 136 -KTSPRKMAALSSWIKKNASRGTGGVLEDMI 165
+ R+M L ++ GV +D++
Sbjct: 133 GASGKRQMDGLLKFVDAGNVV--NGVTKDVL 161
>gi|256377352|ref|YP_003101012.1| hypothetical protein Amir_3259 [Actinosynnema mirum DSM 43827]
gi|255921655|gb|ACU37166.1| hypothetical protein Amir_3259 [Actinosynnema mirum DSM 43827]
Length = 329
Score = 96.7 bits (239), Expect = 1e-18, Method: Composition-based stats.
Identities = 34/193 (17%), Positives = 68/193 (35%), Gaps = 35/193 (18%)
Query: 4 VNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMI----KKGTTHSIHPEWVVDDLASPG 59
+ NT+ + + +TP DTP S I ++ W V DL P
Sbjct: 7 IANTYNAPNFVGE-------LFSLTPSDTPFLSAIGGLTGGRRATAVIHTWTVYDLRPPD 59
Query: 60 PNAQL-EGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVG-------------- 104
P+ Q EG + + N +I ++S ++ T++A +
Sbjct: 60 PDRQRAEGADAPPAEGRIRGQERNVVEIHQESVGVTYTRQATQAMFAGTGAANPNAAAIG 119
Query: 105 ----YILKYKEQKLKKALEIRKDVEFALVSSQ---GSEKTSPRKMAALSSWIKKNASRGT 157
+ Q + ++I +DVE + + ++ ++ RK + + N +
Sbjct: 120 GTNAVANEMDWQTQQALVQIARDVEATFLVGRYQEPTDNSTVRKTRGILEATRTNVITNS 179
Query: 158 GGVL--EDMILSL 168
E M++ L
Sbjct: 180 TPTPLTESMVIDL 192
>gi|257879565|ref|ZP_05659218.1| conserved hypothetical protein [Enterococcus faecium 1,230,933]
gi|257891545|ref|ZP_05671198.1| conserved hypothetical protein [Enterococcus faecium 1,231,410]
gi|314940388|ref|ZP_07847550.1| conserved hypothetical protein [Enterococcus faecium TX0133a04]
gi|314943205|ref|ZP_07849996.1| conserved hypothetical protein [Enterococcus faecium TX0133C]
gi|314949154|ref|ZP_07852509.1| conserved hypothetical protein [Enterococcus faecium TX0082]
gi|314951966|ref|ZP_07854992.1| conserved hypothetical protein [Enterococcus faecium TX0133A]
gi|314993065|ref|ZP_07858455.1| conserved hypothetical protein [Enterococcus faecium TX0133B]
gi|314995396|ref|ZP_07860499.1| conserved hypothetical protein [Enterococcus faecium TX0133a01]
gi|257813793|gb|EEV42551.1| conserved hypothetical protein [Enterococcus faecium 1,230,933]
gi|257827905|gb|EEV54531.1| conserved hypothetical protein [Enterococcus faecium 1,231,410]
gi|313590399|gb|EFR69244.1| conserved hypothetical protein [Enterococcus faecium TX0133a01]
gi|313592421|gb|EFR71266.1| conserved hypothetical protein [Enterococcus faecium TX0133B]
gi|313595906|gb|EFR74751.1| conserved hypothetical protein [Enterococcus faecium TX0133A]
gi|313598089|gb|EFR76934.1| conserved hypothetical protein [Enterococcus faecium TX0133C]
gi|313640428|gb|EFS05008.1| conserved hypothetical protein [Enterococcus faecium TX0133a04]
gi|313644467|gb|EFS09047.1| conserved hypothetical protein [Enterococcus faecium TX0082]
Length = 296
Score = 95.5 bits (236), Expect = 2e-18, Method: Composition-based stats.
Identities = 32/152 (21%), Positives = 63/152 (41%), Gaps = 7/152 (4%)
Query: 19 SLSDVVSRITPEDTPIYS----MIKKGTTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTI 74
+S ++ + +TP S K +S +W D+ + + +LEG +Y
Sbjct: 13 DISPAINAMQVPNTPFLSYLLGAGKTEQANSTEIKWREYDINNDDSSEKLEGGDYPD-AE 71
Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS 134
+ NYT+I RKS +SGT +A++ G + Q + +E++ D+ L++ +
Sbjct: 72 SGRNWFNNYTEIFRKSTSVSGTLDAINVNGVGNELTNQVALRGMEMKIDLNRKLITGVKA 131
Query: 135 --EKTSPRKMAALSSWIKKNASRGTGGVLEDM 164
+ R+M + + I T
Sbjct: 132 DENGSKGRRMNGILNLINSANKAETATAGAVT 163
>gi|188585861|ref|YP_001917406.1| conserved hypothetical protein [Natranaerobius thermophilus
JW/NM-WN-LF]
gi|179350548|gb|ACB84818.1| conserved hypothetical protein [Natranaerobius thermophilus
JW/NM-WN-LF]
Length = 289
Score = 95.1 bits (235), Expect = 3e-18, Method: Composition-based stats.
Identities = 29/165 (17%), Positives = 55/165 (33%), Gaps = 6/165 (3%)
Query: 6 NTFITSSSTTNKESLSDVVSRITPEDTPIYS--MIKKGTTHSIHPEWVVDDLASPGPNAQ 63
N F+ S +S V+ TPI S M+++ + WV ++ Q
Sbjct: 5 NNFLQYESI----DMSGVLEVTNVPQTPITSLLMVRQVQAQAPQVHWVEVEIDESSAVTQ 60
Query: 64 LEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKD 123
EGD+ + E NY +I + +S T + + K IR
Sbjct: 61 GEGDDAPEHKTDNRELKENYLEIFGATAKVSNTAQYSTSETVNDLLAHEVELKTQSIRNR 120
Query: 124 VEFALVSSQGSEKTSPRKMAALSSWIKKNASRGTGGVLEDMILSL 168
+E ++ + + + + I + E++ L
Sbjct: 121 MENKFINGNKNFADGVYETDGILNLINSENQKTEDEFNENVFLDT 165
>gi|229162523|ref|ZP_04290484.1| hypothetical protein bcere0009_32950 [Bacillus cereus R309803]
gi|228621002|gb|EEK77867.1| hypothetical protein bcere0009_32950 [Bacillus cereus R309803]
Length = 293
Score = 94.7 bits (234), Expect = 3e-18, Method: Composition-based stats.
Identities = 23/152 (15%), Positives = 57/152 (37%), Gaps = 9/152 (5%)
Query: 20 LSDVVSRITPEDTPIYSMIKKG----TTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTIN 75
L+D ++ + P TP ++++ + W L EG + + +
Sbjct: 15 LTDEIALVAPIATPFFALLMSKGLYVDSKGKFHTWREKTLDGTADITVDEGVDATQFVQS 74
Query: 76 TPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGSE 135
+ N +I K+ +SGT +A + ++ + +E+ +E L+S ++
Sbjct: 75 GRAELNNVMEIFYKATSVSGTAQATG--AVSDLFAQEINDRLVELAIGIEKKLISGIKND 132
Query: 136 -KTSPRKMAALSSWIKKNASRGTGGVLEDMIL 166
+ R+M + + G +++
Sbjct: 133 GASGKRQMDGILKFADAGNVV--NGATANVLQ 162
>gi|323703894|ref|ZP_08115527.1| hypothetical protein DesniDRAFT_2739 [Desulfotomaculum nigrificans
DSM 574]
gi|323531143|gb|EGB21049.1| hypothetical protein DesniDRAFT_2739 [Desulfotomaculum nigrificans
DSM 574]
Length = 285
Score = 94.0 bits (232), Expect = 6e-18, Method: Composition-based stats.
Identities = 27/149 (18%), Positives = 55/149 (36%), Gaps = 5/149 (3%)
Query: 15 TNKESLSDVVSRITPEDTPIYSMI--KKGTTHSIHPEWVVDDLASPGPNAQLEGDEYSFK 72
+ DV+ + TP TP +++ K ++ W+ + + EG +
Sbjct: 10 GQSIDMKDVLIQTTPVLTPFTTLLLDKTVKAENVTLNWIEEAINESAAVTLGEGADAPAV 69
Query: 73 TINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQ 132
+T M NY +++ + +S T +A G + +KK ++ +E L++
Sbjct: 70 VDDTLAPMTNYCELIGATATVSNTAQATTAKGISDLLAHEVVKKTKAMKMRMENILINGT 129
Query: 133 GS--EKTSPRKMAALSSWIK-KNASRGTG 158
S T + + I N T
Sbjct: 130 KSYDATTKTYTTDGILAQIDPANQVTNTS 158
>gi|153951462|ref|YP_001398222.1| hypothetical protein JJD26997_1140 [Campylobacter jejuni subsp.
doylei 269.97]
gi|152938908|gb|ABS43649.1| hypothetical protein JJD26997_1140 [Campylobacter jejuni subsp.
doylei 269.97]
Length = 344
Score = 93.6 bits (231), Expect = 8e-18, Method: Composition-based stats.
Identities = 43/176 (24%), Positives = 72/176 (40%), Gaps = 20/176 (11%)
Query: 1 MTIV--NNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIH-PEWVVDDLAS 57
M + +T + + K+S+ + + +I +TPI + I + W+ D
Sbjct: 1 MALPSMAHTPPATENVKLKQSIYETIIKIGATETPILNKIGTSKVSNPLTHSWITDTFEE 60
Query: 58 PGPNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKA 117
P NA LE ++ +T NT ++ N TQI ++S + G + + Q KK
Sbjct: 61 PKKNANLELSKFVGETKNTTQKTTNATQIFITEAMVSKALLKANQYG-GNEMEYQIGKKT 119
Query: 118 LEIRKDVEFALVSS---------------QGSEKTSPRKMAALSSWIKKNASRGTG 158
E + D+E+AL+ Q E TS +MA L +I K T
Sbjct: 120 KEHKMDMEYALLGLGRDNDVKTSVFKDYIQAQEATSG-EMAGLFHYIAKGKDSFTD 174
>gi|257883499|ref|ZP_05663152.1| conserved hypothetical protein [Enterococcus faecium 1,231,502]
gi|261208026|ref|ZP_05922703.1| conserved hypothetical protein [Enterococcus faecium TC 6]
gi|289567093|ref|ZP_06447488.1| conserved hypothetical protein [Enterococcus faecium D344SRF]
gi|294622496|ref|ZP_06701518.1| conserved hypothetical protein [Enterococcus faecium U0317]
gi|257819157|gb|EEV46485.1| conserved hypothetical protein [Enterococcus faecium 1,231,502]
gi|260077743|gb|EEW65457.1| conserved hypothetical protein [Enterococcus faecium TC 6]
gi|289161108|gb|EFD09013.1| conserved hypothetical protein [Enterococcus faecium D344SRF]
gi|291598043|gb|EFF29153.1| conserved hypothetical protein [Enterococcus faecium U0317]
Length = 296
Score = 93.2 bits (230), Expect = 1e-17, Method: Composition-based stats.
Identities = 33/152 (21%), Positives = 63/152 (41%), Gaps = 7/152 (4%)
Query: 19 SLSDVVSRITPEDTPIYS----MIKKGTTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTI 74
+S ++ + +TP S K +S +W D+ + + +LEG EY
Sbjct: 13 DISPAINAMQVPNTPFLSYLFGAGKTEPANSTEIKWREYDINNDDSSEKLEGGEYPD-AE 71
Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS 134
+ NYT+I RKS +SGT +A++ G + Q + +E++ D+ L++ +
Sbjct: 72 SGRTWFNNYTEIFRKSTSVSGTLDAINVNGVGNELTNQVALRGMEMKIDLNRKLITGVKA 131
Query: 135 --EKTSPRKMAALSSWIKKNASRGTGGVLEDM 164
+ R+M + + I T
Sbjct: 132 DENGSKGRRMNGILNLINSANKAETATAGAVT 163
>gi|257893408|ref|ZP_05673061.1| conserved hypothetical protein [Enterococcus faecium 1,231,408]
gi|257829787|gb|EEV56394.1| conserved hypothetical protein [Enterococcus faecium 1,231,408]
Length = 296
Score = 93.2 bits (230), Expect = 1e-17, Method: Composition-based stats.
Identities = 33/152 (21%), Positives = 63/152 (41%), Gaps = 7/152 (4%)
Query: 19 SLSDVVSRITPEDTPIYS----MIKKGTTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTI 74
+S ++ + +TP S K +S +W D+ + + +LEG EY
Sbjct: 13 DISPAINAMQVPNTPFLSYLLGAGKTEPANSTEIKWREYDINNDDSSEKLEGGEYPD-AE 71
Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS 134
+ NYT+I RKS +SGT +A++ G + Q + +E++ D+ L++ +
Sbjct: 72 SGRTWFNNYTEIFRKSTSVSGTLDAINVNGVGNELTNQVALRGMEMKIDLNRKLITGVKA 131
Query: 135 --EKTSPRKMAALSSWIKKNASRGTGGVLEDM 164
+ R+M + + I T
Sbjct: 132 DENGSKGRRMNGILNLINSANKAETATAGAVT 163
>gi|294614769|ref|ZP_06694669.1| hypothetical protein EfmE1636_0859 [Enterococcus faecium E1636]
gi|291592381|gb|EFF23990.1| hypothetical protein EfmE1636_0859 [Enterococcus faecium E1636]
Length = 296
Score = 92.8 bits (229), Expect = 1e-17, Method: Composition-based stats.
Identities = 33/152 (21%), Positives = 63/152 (41%), Gaps = 7/152 (4%)
Query: 19 SLSDVVSRITPEDTPIYS----MIKKGTTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTI 74
+S ++ + +TP S K +S +W D+ + + +LEG EY
Sbjct: 13 DISPAINAMQVPNTPFLSYLLGAGKTEPANSTEIKWREYDINNDDSSEKLEGGEYPD-AE 71
Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS 134
+ NYT+I RKS +SGT +A++ G + Q + +E++ D+ L++ +
Sbjct: 72 SGRTWFNNYTEIFRKSTSVSGTLDAINVNGVGNELTNQVALRGMEMKIDLNRKLITGVKA 131
Query: 135 --EKTSPRKMAALSSWIKKNASRGTGGVLEDM 164
+ R+M + + I T
Sbjct: 132 DENSSKGRRMNGILNLINSANKAETATAGAVT 163
>gi|114566839|ref|YP_753993.1| hypothetical protein Swol_1314 [Syntrophomonas wolfei subsp. wolfei
str. Goettingen]
gi|114337774|gb|ABI68622.1| hypothetical protein Swol_1314 [Syntrophomonas wolfei subsp. wolfei
str. Goettingen]
Length = 398
Score = 91.7 bits (226), Expect = 3e-17, Method: Composition-based stats.
Identities = 32/154 (20%), Positives = 65/154 (42%), Gaps = 11/154 (7%)
Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKG---TTHSIHPEWVVDDLAS 57
M N +L+ +S ++P D P+ ++I TT S W L +
Sbjct: 1 MIKTTNFTD-----LENINLTKEISLVSPMDCPLTTIIMGKGYDTTGSKIVTWREKTLDN 55
Query: 58 PGPNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKA 117
+Q+EG + + N +I +K+ +SGT +A G + E+ +
Sbjct: 56 TEDISQVEGSTTNTFQSSARAEKSNVCEIFKKATSISGTADASSITGVSNLFAEEINDRL 115
Query: 118 LEIRKDVEFALVSSQGSEKTS---PRKMAALSSW 148
+E++ ++E L++ + ++ RKM L ++
Sbjct: 116 IEMKVNIEKKLINGTKDDGSTSPYVRKMDGLLAF 149
>gi|29376526|ref|NP_815680.1| hypothetical protein EF2011 [Enterococcus faecalis V583]
gi|227555439|ref|ZP_03985486.1| conserved hypothetical protein [Enterococcus faecalis HH22]
gi|29343990|gb|AAO81750.1| hypothetical protein EF_2011 [Enterococcus faecalis V583]
gi|227175420|gb|EEI56392.1| conserved hypothetical protein [Enterococcus faecalis HH22]
Length = 295
Score = 91.7 bits (226), Expect = 3e-17, Method: Composition-based stats.
Identities = 36/152 (23%), Positives = 63/152 (41%), Gaps = 7/152 (4%)
Query: 19 SLSDVVSRITPEDTPIYS---MIKK-GTTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTI 74
+S V+ + +TP S K S +W + + +AQLEG EY+
Sbjct: 13 DISQEVNALQVPNTPFLSYLLGAGKVEAAKSTEIKWREYGMNNDDSSAQLEGGEYAD-AE 71
Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS 134
+ NYT+I RKS +SGT +A++ G + Q +A E++ D+ L+ +
Sbjct: 72 SDRTWFNNYTEIFRKSTSVSGTLDAINVDGVGNELNSQVALRATEMKIDLNRKLIVGVKA 131
Query: 135 E--KTSPRKMAALSSWIKKNASRGTGGVLEDM 164
+ + R+M + + I T
Sbjct: 132 DESGSKGRQMNGILNLISSTNKVETAAAGAVT 163
>gi|169827502|ref|YP_001697660.1| hypothetical protein Bsph_1941 [Lysinibacillus sphaericus C3-41]
gi|168991990|gb|ACA39530.1| conserved hypothetical protein [Lysinibacillus sphaericus C3-41]
Length = 288
Score = 91.3 bits (225), Expect = 4e-17, Method: Composition-based stats.
Identities = 26/149 (17%), Positives = 61/149 (40%), Gaps = 12/149 (8%)
Query: 16 NKESLSDVVSRITPEDTPIYSMIKK----GTTHSIHPEWVVDDLASPGPNAQLEGDEYSF 71
+ SL++ ++ I + TP S++ S W L++ + +EG + +
Sbjct: 11 ERISLANEIAVIGVQATPFTSLLMAKGNIEKALSTVYTWREKSLSNDEDISAVEGADTTV 70
Query: 72 KTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSS 131
+ + N +I +K +SGT EA+ + + + LE++ ++E ++
Sbjct: 71 FYESARAELSNILEIFKKGVQVSGTAEAMQSTQFSAE----VADRLLELKVNMEKKFING 126
Query: 132 QGSEKTSP---RKMAALSSWI-KKNASRG 156
++ + R+++ L NA
Sbjct: 127 LKADGSKAPFKRQLSGLIEMADATNAVTA 155
>gi|319956911|ref|YP_004168174.1| hypothetical protein Nitsa_1172 [Nitratifractor salsuginis DSM
16511]
gi|319419315|gb|ADV46425.1| hypothetical protein Nitsa_1172 [Nitratifractor salsuginis DSM
16511]
Length = 308
Score = 90.9 bits (224), Expect = 5e-17, Method: Composition-based stats.
Identities = 36/155 (23%), Positives = 60/155 (38%), Gaps = 11/155 (7%)
Query: 7 TFITSS-STTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASPGPNAQLE 65
T + + K S+ D + P P +G ++ W+ D L P PN LE
Sbjct: 2 ALTTYNNTVNQKPSVLDSIILQGPSQVPFLKWFGRGDVNAPKHAWITDRLRDPKPNYNLE 61
Query: 66 GDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVE 125
T +T + N TQI++ + LS + + G ++ + K E KD+E
Sbjct: 62 ITGLEEDTEDTKVMLDNVTQIVKNEFGLSRKERSTARYG-QKEWPYRVGKVGKEHAKDLE 120
Query: 126 FALVSSQ---------GSEKTSPRKMAALSSWIKK 151
F L+ Q T+ +MA + +I
Sbjct: 121 FNLLGLQNDSVFDNYVPGSDTTEARMAGIFHFIPS 155
>gi|227517040|ref|ZP_03947089.1| conserved hypothetical protein [Enterococcus faecalis TX0104]
gi|229545405|ref|ZP_04434130.1| conserved hypothetical protein [Enterococcus faecalis TX1322]
gi|229549652|ref|ZP_04438377.1| conserved hypothetical protein [Enterococcus faecalis ATCC 29200]
gi|255972349|ref|ZP_05422935.1| predicted protein [Enterococcus faecalis T1]
gi|256619486|ref|ZP_05476332.1| conserved hypothetical protein [Enterococcus faecalis ATCC 4200]
gi|257090289|ref|ZP_05584650.1| predicted protein [Enterococcus faecalis CH188]
gi|300860939|ref|ZP_07107026.1| conserved hypothetical protein [Enterococcus faecalis TUSoD Ef11]
gi|307275949|ref|ZP_07557082.1| hypothetical protein HMPREF9521_01574 [Enterococcus faecalis
TX2134]
gi|307295873|ref|ZP_07575705.1| hypothetical protein HMPREF9509_02949 [Enterococcus faecalis
TX0411]
gi|312900152|ref|ZP_07759467.1| conserved hypothetical protein [Enterococcus faecalis TX0470]
gi|312902789|ref|ZP_07761993.1| conserved hypothetical protein [Enterococcus faecalis TX0635]
gi|227075515|gb|EEI13478.1| conserved hypothetical protein [Enterococcus faecalis TX0104]
gi|229305317|gb|EEN71313.1| conserved hypothetical protein [Enterococcus faecalis ATCC 29200]
gi|229309512|gb|EEN75499.1| conserved hypothetical protein [Enterococcus faecalis TX1322]
gi|255963367|gb|EET95843.1| predicted protein [Enterococcus faecalis T1]
gi|256599013|gb|EEU18189.1| conserved hypothetical protein [Enterococcus faecalis ATCC 4200]
gi|256999101|gb|EEU85621.1| predicted protein [Enterococcus faecalis CH188]
gi|295113266|emb|CBL31903.1| hypothetical protein [Enterococcus sp. 7L76]
gi|300849978|gb|EFK77728.1| conserved hypothetical protein [Enterococcus faecalis TUSoD Ef11]
gi|306496204|gb|EFM65783.1| hypothetical protein HMPREF9509_02949 [Enterococcus faecalis
TX0411]
gi|306507279|gb|EFM76416.1| hypothetical protein HMPREF9521_01574 [Enterococcus faecalis
TX2134]
gi|310633843|gb|EFQ17126.1| conserved hypothetical protein [Enterococcus faecalis TX0635]
gi|311292711|gb|EFQ71267.1| conserved hypothetical protein [Enterococcus faecalis TX0470]
gi|315149052|gb|EFT93068.1| conserved hypothetical protein [Enterococcus faecalis TX4244]
gi|315159923|gb|EFU03940.1| conserved hypothetical protein [Enterococcus faecalis TX0312]
gi|315167465|gb|EFU11482.1| conserved hypothetical protein [Enterococcus faecalis TX1341]
gi|315169417|gb|EFU13434.1| conserved hypothetical protein [Enterococcus faecalis TX1342]
gi|315575405|gb|EFU87596.1| conserved hypothetical protein [Enterococcus faecalis TX0309B]
gi|315576720|gb|EFU88911.1| conserved hypothetical protein [Enterococcus faecalis TX0630]
gi|315582750|gb|EFU94941.1| conserved hypothetical protein [Enterococcus faecalis TX0309A]
gi|323481145|gb|ADX80584.1| hypothetical protein EF62_2373 [Enterococcus faecalis 62]
Length = 295
Score = 90.9 bits (224), Expect = 5e-17, Method: Composition-based stats.
Identities = 36/152 (23%), Positives = 63/152 (41%), Gaps = 7/152 (4%)
Query: 19 SLSDVVSRITPEDTPIYS---MIKK-GTTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTI 74
+S V+ + +TP S K S +W + + +AQLEG EY+
Sbjct: 13 DISQEVNALQVPNTPFLSYLLGAGKVEAAKSTEIKWREYGMNNDDSSAQLEGGEYAD-AE 71
Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS 134
+ NYT+I RKS +SGT +A++ G + Q +A E++ D+ L+ +
Sbjct: 72 SDRTWFNNYTEIFRKSTSVSGTLDAINVDGVGNELNSQVALRATEMKIDLNRKLIVGVKA 131
Query: 135 E--KTSPRKMAALSSWIKKNASRGTGGVLEDM 164
+ + R+M + + I T
Sbjct: 132 DESGSKGRQMNGILNLISSTNKVETAAAGAVT 163
>gi|315028531|gb|EFT40463.1| conserved hypothetical protein [Enterococcus faecalis TX4000]
Length = 295
Score = 90.9 bits (224), Expect = 5e-17, Method: Composition-based stats.
Identities = 36/152 (23%), Positives = 63/152 (41%), Gaps = 7/152 (4%)
Query: 19 SLSDVVSRITPEDTPIYS---MIKK-GTTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTI 74
+S V+ + +TP S K S +W + + +AQLEG EY+
Sbjct: 13 DISQEVNALQVPNTPFLSYLLGAGKVEAAKSTEIKWREYGMNNDDSSAQLEGGEYAD-AE 71
Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS 134
+ NYT+I RKS +SGT +A++ G + Q +A E++ D+ L+ +
Sbjct: 72 SDRTWFNNYTEIFRKSTSVSGTLDAINVDGVGNELNSQVALRATEMKIDLNRKLIVGVKA 131
Query: 135 E--KTSPRKMAALSSWIKKNASRGTGGVLEDM 164
+ + R+M + + I T
Sbjct: 132 DESGSKGRQMNGILNLISSTNKVETAAAGAVT 163
>gi|302389556|ref|YP_003825377.1| hypothetical protein Toce_0992 [Thermosediminibacter oceani DSM
16646]
gi|302200184|gb|ADL07754.1| conserved hypothetical protein [Thermosediminibacter oceani DSM
16646]
Length = 294
Score = 90.1 bits (222), Expect = 1e-16, Method: Composition-based stats.
Identities = 34/164 (20%), Positives = 63/164 (38%), Gaps = 13/164 (7%)
Query: 3 IVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKK----GTTHSIHPEWVVDDLASP 58
I ++F SL+ + + P DTP+YS+I S W L +
Sbjct: 2 IKTDSFTNLEKV----SLATEIGLVAPTDTPLYSLILNLGQVDQATSPVVVWREKTLDTT 57
Query: 59 GPNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKAL 118
+ EG + + NY +I K +SG+ A G + +
Sbjct: 58 NDISVPEGAN-PVFYQSNRAEISNYCEIFLKGVEVSGSASASSIAGIPDLMASEVADRLA 116
Query: 119 EIRKDVEFALVSSQGSEKTSP---RKMAALSSWI-KKNASRGTG 158
E++ ++E AL++ ++ + R+M L S++ + N G
Sbjct: 117 EMKVNIEKALINGVKNDGSQTPYIRRMGGLISFVPEGNKVTGAN 160
>gi|329568771|gb|EGG50571.1| hypothetical protein HMPREF9520_03403 [Enterococcus faecalis
TX1467]
Length = 295
Score = 89.3 bits (220), Expect = 2e-16, Method: Composition-based stats.
Identities = 36/152 (23%), Positives = 61/152 (40%), Gaps = 7/152 (4%)
Query: 19 SLSDVVSRITPEDTPIYS---MIKK-GTTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTI 74
+S V+ + +TP S K S +W + + +AQLEG EY+
Sbjct: 13 DISQEVNALQVPNTPFLSYLLGAGKVEAAKSTEIKWREYGMNNDDSSAQLEGGEYAD-AE 71
Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS 134
+ NYT+I RKS +SGT A + G + Q +A E++ D+ L+ +
Sbjct: 72 SDRTWFNNYTEIFRKSTSVSGTLIASNVDGVGNELNSQVALRATEMKIDLNRKLIVGVKA 131
Query: 135 E--KTSPRKMAALSSWIKKNASRGTGGVLEDM 164
+ + R+M + + I T
Sbjct: 132 DESGSKGRQMNGILNLISSTNKVETAAAGAVT 163
>gi|163937921|ref|YP_001642807.1| hypothetical protein BcerKBAB4_5338 [Bacillus weihenstephanensis
KBAB4]
gi|163865776|gb|ABY46832.1| hypothetical protein BcerKBAB4_5338 [Bacillus weihenstephanensis
KBAB4]
Length = 391
Score = 85.9 bits (211), Expect = 2e-15, Method: Composition-based stats.
Identities = 33/105 (31%), Positives = 55/105 (52%), Gaps = 1/105 (0%)
Query: 64 LEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKD 123
EG + +R+ N TQI +S L+GT AV G +Y+++K KK LE+
Sbjct: 133 SEGADARDSRYKPRKRVSNITQIFDESVELTGTAMAVAQYGVNNEYEKEKQKKQLELALA 192
Query: 124 VEFALVSSQGSEKTSPRKMAALSSWIKKNASRGTG-GVLEDMILS 167
+E A+++ E S R M + S+I+ N + G V +DM+++
Sbjct: 193 LEKAVINGIRYEAGSKRMMRGIRSFIETNVIKAEGESVNDDMLIN 237
Score = 51.6 bits (122), Expect = 3e-05, Method: Composition-based stats.
Identities = 24/144 (16%), Positives = 45/144 (31%), Gaps = 13/144 (9%)
Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIK-KGTTHSIHPEWVVDDLASPG 59
MT+V KES+ D + P TP+ S++ ++ W D++ +
Sbjct: 1 MTVVTEKVYNEDLVGKKESVVDEFLLLNPLQTPMLSLVGFGQAVTAVEHIWFEDEMFAQE 60
Query: 60 PNAQLEGDEYSFKTINTPERMGNYTQIMRK------SWILSGTQEAVDDVGYILKYKEQK 113
A E + + Q++R ++G + V GY E
Sbjct: 61 STATKEATATATEIEVADSEAFRKLQVVRAGDELILVVSVAGNKLTVA-RGYADTTAEAI 119
Query: 114 LKKALEIRKDVEFALVSSQGSEKT 137
+ + +E V
Sbjct: 120 AEGDV-----IEVMFVEGSEGADA 138
>gi|228910960|ref|ZP_04074768.1| hypothetical protein bthur0013_51010 [Bacillus thuringiensis IBL
200]
gi|228848615|gb|EEM93461.1| hypothetical protein bthur0013_51010 [Bacillus thuringiensis IBL
200]
Length = 363
Score = 84.7 bits (208), Expect = 3e-15, Method: Composition-based stats.
Identities = 28/99 (28%), Positives = 49/99 (49%)
Query: 65 EGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDV 124
EG R+ N TQI ++ L+GT +A+ G +Y+++K KK LE+ +
Sbjct: 131 EGSNARDARYKPRNRVSNITQIFDETVELTGTAQAIAQYGVDNEYEKEKQKKQLELALQL 190
Query: 125 EFALVSSQGSEKTSPRKMAALSSWIKKNASRGTGGVLED 163
E A+++ E+ + R M + S+I+ N G + D
Sbjct: 191 EKAVINGVRYEQGNRRMMRGIRSFIETNVINAGGAAVAD 229
>gi|256964718|ref|ZP_05568889.1| conserved hypothetical protein [Enterococcus faecalis HIP11704]
gi|307272797|ref|ZP_07554044.1| hypothetical protein HMPREF9514_01561 [Enterococcus faecalis
TX0855]
gi|256955214|gb|EEU71846.1| conserved hypothetical protein [Enterococcus faecalis HIP11704]
gi|306510411|gb|EFM79434.1| hypothetical protein HMPREF9514_01561 [Enterococcus faecalis
TX0855]
Length = 264
Score = 81.3 bits (199), Expect = 5e-14, Method: Composition-based stats.
Identities = 30/126 (23%), Positives = 53/126 (42%), Gaps = 3/126 (2%)
Query: 41 GTTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAV 100
S +W + + +AQLEG EY+ + NYT+I RKS +SGT +A+
Sbjct: 8 EAAKSTEIKWREYGMNNDDSSAQLEGGEYAD-AESDRTWFNNYTEIFRKSTSVSGTLDAI 66
Query: 101 DDVGYILKYKEQKLKKALEIRKDVEFALVSSQGSE--KTSPRKMAALSSWIKKNASRGTG 158
+ G + Q +A E++ D+ L+ ++ + R+M + + I T
Sbjct: 67 NVDGVGNELNSQVALRATEMKIDLNRKLIVGVKADESGSKGRQMNGILNLISSTNKVETA 126
Query: 159 GVLEDM 164
Sbjct: 127 AAGAVT 132
>gi|319649918|ref|ZP_08004068.1| hypothetical protein HMPREF1013_00673 [Bacillus sp. 2_A_57_CT2]
gi|317398356|gb|EFV79044.1| hypothetical protein HMPREF1013_00673 [Bacillus sp. 2_A_57_CT2]
Length = 292
Score = 80.5 bits (197), Expect = 8e-14, Method: Composition-based stats.
Identities = 33/168 (19%), Positives = 70/168 (41%), Gaps = 13/168 (7%)
Query: 7 TFITSSSTTNKE-SLSDVVSRITPEDTPIYSMIKK----GTTHSIHPEWVVDDLASPGPN 61
F +++ T ++ SL+ ++ I + TP+ SM+ S W L
Sbjct: 1 MFKSTNFTEIEQISLAKEIAVIGVQATPLTSMLMAKGNIEKALSTVYTWREKSLDHAEDL 60
Query: 62 AQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIR 121
+ +EG + + N +I +K +SGT A+ ++ E+ + LE++
Sbjct: 61 SAVEGSDEVVFYETARAELNNILEIFKKGASISGTAVAMK----STQFAEEVNDRLLELK 116
Query: 122 KDVEFALVSSQGSEKTSP---RKMAALSSWIK-KNASRGTGGVLEDMI 165
++E ++ ++ + R+++ L NA TG + ED +
Sbjct: 117 INMEKKFINGLRNDGSVTPFKRQLSGLIQMADPSNAVPVTGAITEDDV 164
>gi|291561307|emb|CBL40106.1| hypothetical protein CK3_02480 [butyrate-producing bacterium SS3/4]
Length = 338
Score = 79.3 bits (194), Expect = 2e-13, Method: Composition-based stats.
Identities = 34/171 (19%), Positives = 60/171 (35%), Gaps = 23/171 (13%)
Query: 5 NNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVV-DDLASPGPNAQ 63
+TF TS N S ++ TP+ S+I + H E+V + S G +Q
Sbjct: 2 ADTFATSFGVLN---YSGMLFNKGNVRTPLSSIIGSKAKTTNHVEFVTGQEYTSNGNGSQ 58
Query: 64 LEGDE-----YSFKTINTPERMGNYTQIMRKSWILS-----------GTQEAVDDVGYIL 107
E + T + N TQI ++S +S G A +
Sbjct: 59 PAISESASLTAPDADVVTRSQKTNVTQIFQESVGISYGKQSNMGTLSGINIAEQQANPMS 118
Query: 108 KYKEQKLKKALEIRKDVEFALVSS---QGSEKTSPRKMAALSSWIKKNASR 155
+ Q K ++ +D+E+ ++ + + K L + I N
Sbjct: 119 ELDFQVAAKIQKVNRDIEYTFINGEYNKATSDAEVNKTRGLVNAITTNTLA 169
>gi|241760939|ref|ZP_04759028.1| putative phage major head protein [Zymomonas mobilis subsp. mobilis
ATCC 10988]
gi|241374558|gb|EER64019.1| putative phage major head protein [Zymomonas mobilis subsp. mobilis
ATCC 10988]
Length = 238
Score = 73.2 bits (178), Expect = 1e-11, Method: Composition-based stats.
Identities = 21/82 (25%), Positives = 35/82 (42%), Gaps = 4/82 (4%)
Query: 87 MRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS----EKTSPRKM 142
M K S T AV + G ++ Q + E+++D+E + + R+
Sbjct: 1 MTKVVGTSTTDRAVHNAGRGDEHAYQLARAGQELKRDIEARFTGNFAAIPGDGAVVARET 60
Query: 143 AALSSWIKKNASRGTGGVLEDM 164
A +W++ NA RG GG M
Sbjct: 61 AGALAWLRSNAHRGDGGANPVM 82
>gi|229037842|ref|ZP_04189640.1| hypothetical protein bcere0028_57530 [Bacillus cereus AH1271]
gi|228727464|gb|EEL78642.1| hypothetical protein bcere0028_57530 [Bacillus cereus AH1271]
Length = 315
Score = 67.8 bits (164), Expect = 5e-10, Method: Composition-based stats.
Identities = 30/160 (18%), Positives = 49/160 (30%), Gaps = 22/160 (13%)
Query: 23 VVSRITPEDTPIYSMIKKGTTHSIHP---EWVVDDL---ASPGPNAQLEGDEYSFKTINT 76
+ E+TP SMI T + E+ D L +P A E + T +
Sbjct: 19 ELFTADSENTPFLSMIGGLTGGGLQTANKEFATDSLYEYPAPSQPAISEQASGTAPTAVS 78
Query: 77 PER--MGNYTQIMRKSWIL-----------SGTQEAVDDVGYILKYKEQKLKKALEIRKD 123
R N TQI +S + SG A + Q + +I +D
Sbjct: 79 YARGQNKNVTQIFHESVNVTYRKLSNGGRLSGINTAGASNNAPSEKDFQIARALTKIARD 138
Query: 124 VEFALVSSQ---GSEKTSPRKMAALSSWIKKNASRGTGGV 160
E ++ ++ T K + + G
Sbjct: 139 AEHTFLNGTYALATKDTEADKTRGMFELCSTGNTIAAAGA 178
>gi|228968787|ref|ZP_04129749.1| hypothetical protein bthur0004_55460 [Bacillus thuringiensis
serovar sotto str. T04001]
gi|228790850|gb|EEM38489.1| hypothetical protein bthur0004_55460 [Bacillus thuringiensis
serovar sotto str. T04001]
Length = 374
Score = 63.5 bits (153), Expect = 1e-08, Method: Composition-based stats.
Identities = 20/92 (21%), Positives = 38/92 (41%), Gaps = 3/92 (3%)
Query: 60 PNAQLEGDEY-SFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKAL 118
+ EG++ IN N++QI + +S TQ+ V+ G + Q +
Sbjct: 129 ARPRPEGEDAFRKNEINDRLVSHNFSQIFSRYASVSRTQQQVNTYGVSNELDYQVNLRLQ 188
Query: 119 EIRKDVEFALVSS--QGSEKTSPRKMAALSSW 148
E+ ++ +L+ G T PR L ++
Sbjct: 189 EMIREANTSLIYGRRNGGSPTQPRTTGGLFAF 220
>gi|228941057|ref|ZP_04103614.1| hypothetical protein bthur0008_36970 [Bacillus thuringiensis
serovar berliner ATCC 10792]
gi|228973988|ref|ZP_04134562.1| hypothetical protein bthur0003_37430 [Bacillus thuringiensis
serovar thuringiensis str. T01001]
gi|228980577|ref|ZP_04140886.1| hypothetical protein bthur0002_37450 [Bacillus thuringiensis Bt407]
gi|228779138|gb|EEM27396.1| hypothetical protein bthur0002_37450 [Bacillus thuringiensis Bt407]
gi|228785714|gb|EEM33719.1| hypothetical protein bthur0003_37430 [Bacillus thuringiensis
serovar thuringiensis str. T01001]
gi|228818600|gb|EEM64668.1| hypothetical protein bthur0008_36970 [Bacillus thuringiensis
serovar berliner ATCC 10792]
gi|326939625|gb|AEA15521.1| Phage protein [Bacillus thuringiensis serovar chinensis CT-43]
Length = 374
Score = 63.5 bits (153), Expect = 1e-08, Method: Composition-based stats.
Identities = 20/92 (21%), Positives = 38/92 (41%), Gaps = 3/92 (3%)
Query: 60 PNAQLEGDEY-SFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKAL 118
+ EG++ IN N++QI + +S TQ+ V+ G + Q +
Sbjct: 129 ARPRPEGEDAFRKNEINDRLVSHNFSQIFSRYASVSRTQQQVNTYGVSNELDYQVNLRLQ 188
Query: 119 EIRKDVEFALVSS--QGSEKTSPRKMAALSSW 148
E+ ++ +L+ G T PR L ++
Sbjct: 189 EMIREANTSLIYGRRNGGSPTQPRTTGGLFAF 220
>gi|30020036|ref|NP_831667.1| Phage protein [Bacillus cereus ATCC 14579]
gi|31415788|ref|NP_852528.1| hypothetical protein BC1894 [Bacillus phage phBC6A51]
gi|229127327|ref|ZP_04256323.1| hypothetical protein bcere0015_17800 [Bacillus cereus BDRD-Cer4]
gi|29895581|gb|AAP08868.1| Phage protein [Bacillus phage phBC6A51]
gi|228656160|gb|EEL12002.1| hypothetical protein bcere0015_17800 [Bacillus cereus BDRD-Cer4]
Length = 374
Score = 63.5 bits (153), Expect = 1e-08, Method: Composition-based stats.
Identities = 20/92 (21%), Positives = 38/92 (41%), Gaps = 3/92 (3%)
Query: 60 PNAQLEGDEY-SFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKAL 118
+ EG++ IN N++QI + +S TQ+ V+ G + Q +
Sbjct: 129 ARPRPEGEDAFRKNEINDRLVSHNFSQIFSRYASVSRTQQQVNTYGVSNELDYQVNLRLQ 188
Query: 119 EIRKDVEFALVSS--QGSEKTSPRKMAALSSW 148
E+ ++ +L+ G T PR L ++
Sbjct: 189 EMIREANTSLIYGRRNGGSPTQPRTTGGLFAF 220
>gi|229020770|ref|ZP_04177493.1| hypothetical protein bcere0030_52440 [Bacillus cereus AH1273]
gi|228740571|gb|EEL90846.1| hypothetical protein bcere0030_52440 [Bacillus cereus AH1273]
Length = 374
Score = 62.8 bits (151), Expect = 2e-08, Method: Composition-based stats.
Identities = 19/92 (20%), Positives = 37/92 (40%), Gaps = 3/92 (3%)
Query: 60 PNAQLEGDEY-SFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKAL 118
+ EG++ IN N++QI + +S TQ+ V+ G + Q +
Sbjct: 129 ARPRPEGEDAFRKNEINDRLVSHNFSQIFSRYASVSRTQQQVNTYGVSNELDYQVNLRLQ 188
Query: 119 EIRKDVEFALVSS--QGSEKTSPRKMAALSSW 148
E+ ++ +L+ T PR L ++
Sbjct: 189 EMIREANTSLIYGRRNVGSPTQPRTTGGLFAF 220
>gi|229190579|ref|ZP_04317576.1| hypothetical protein bcere0002_22460 [Bacillus cereus ATCC 10876]
gi|228592924|gb|EEK50746.1| hypothetical protein bcere0002_22460 [Bacillus cereus ATCC 10876]
Length = 374
Score = 62.8 bits (151), Expect = 2e-08, Method: Composition-based stats.
Identities = 19/92 (20%), Positives = 37/92 (40%), Gaps = 3/92 (3%)
Query: 60 PNAQLEGDEY-SFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKAL 118
+ EG++ IN N++QI + +S TQ+ V+ G + Q +
Sbjct: 129 ARPRPEGEDAFRKNEINDRLVSHNFSQIFSRYASVSRTQQQVNTYGVSNELDYQVNLRLQ 188
Query: 119 EIRKDVEFALVSS--QGSEKTSPRKMAALSSW 148
E+ ++ +L+ T PR L ++
Sbjct: 189 EMIREANTSLIYGRRNVGSPTQPRTTGGLFAF 220
>gi|218897919|ref|YP_002446330.1| phage protein [Bacillus cereus G9842]
gi|218542918|gb|ACK95312.1| phage protein [Bacillus cereus G9842]
Length = 374
Score = 62.8 bits (151), Expect = 2e-08, Method: Composition-based stats.
Identities = 19/92 (20%), Positives = 37/92 (40%), Gaps = 3/92 (3%)
Query: 60 PNAQLEGDEY-SFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKAL 118
+ EG++ IN N++QI + +S TQ+ V+ G + Q +
Sbjct: 129 ARPRPEGEDAFRKNEINDRLVSHNFSQIFSRYASVSRTQQQVNTYGVSNELDYQVNLRLQ 188
Query: 119 EIRKDVEFALVSS--QGSEKTSPRKMAALSSW 148
E+ ++ +L+ T PR L ++
Sbjct: 189 EMIREANTSLIYGRRNVGSPTQPRTTGGLFAF 220
>gi|257451764|ref|ZP_05617063.1| hypothetical protein F3_01776 [Fusobacterium sp. 3_1_5R]
gi|317058321|ref|ZP_07922806.1| predicted protein [Fusobacterium sp. 3_1_5R]
gi|313683997|gb|EFS20832.1| predicted protein [Fusobacterium sp. 3_1_5R]
Length = 371
Score = 59.7 bits (143), Expect = 1e-07, Method: Composition-based stats.
Identities = 25/102 (24%), Positives = 43/102 (42%), Gaps = 2/102 (1%)
Query: 60 PNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEA--VDDVGYILKYKEQKLKKA 117
+ EG + T N QI+R+ +S + EA V G I Y +++KK
Sbjct: 131 NDNIEEGADLQGATYKKGVNYDNNVQIIREEISVSASAEAITVPSAGGIDAYSLEQMKKM 190
Query: 118 LEIRKDVEFALVSSQGSEKTSPRKMAALSSWIKKNASRGTGG 159
++ +E A++S + E R M + ++ K GG
Sbjct: 191 DKVLGKIEKAIISGKKFESGLKRGMDGVKRFLAKGQLVDAGG 232
>gi|150021335|ref|YP_001306689.1| hypothetical protein Tmel_1457 [Thermosipho melanesiensis BI429]
gi|149793856|gb|ABR31304.1| hypothetical protein Tmel_1457 [Thermosipho melanesiensis BI429]
Length = 362
Score = 59.7 bits (143), Expect = 1e-07, Method: Composition-based stats.
Identities = 24/130 (18%), Positives = 42/130 (32%), Gaps = 7/130 (5%)
Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIK--KGTTHSIHPEWVVDDLAS- 57
M +N T NK +S V+S + +TP+ + I T S EW D L
Sbjct: 1 MGTINGMVTTYDVAENKIDVSPVLSMLKLPNTPLLNAIGISNETVDSTRYEWWDDVLPVL 60
Query: 58 ----PGPNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQK 113
G ++GN ++ + ++ V V + +
Sbjct: 61 KVKLAAAYTAGGGSLTVETGAGKKFKVGNVIKVENSIYRVTAINGDVLSVAVVSGDADHA 120
Query: 114 LKKALEIRKD 123
+E+ D
Sbjct: 121 ANVDVELIGD 130
Score = 55.5 bits (132), Expect = 3e-06, Method: Composition-based stats.
Identities = 30/158 (18%), Positives = 58/158 (36%), Gaps = 15/158 (9%)
Query: 14 TTNKESLSDVVSRITPEDTPIYS-MIKKGTTH---SIHPEWVVDDLASPGPNAQLEGDEY 69
N + + + R+T + + S + G ++ E + D AQ EG +Y
Sbjct: 87 VGNVIKVENSIYRVTAINGDVLSVAVVSGDADHAANVDVELIGD--------AQPEGQDY 138
Query: 70 SFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFAL- 128
+ + N TQI SG+Q AV + + +K +++ +E
Sbjct: 139 NDSNYEQKVKRYNVTQIFSDYVKFSGSQLAVKQYVNEDVFLNEVQRKLKKLKILLERTAW 198
Query: 129 --VSSQGSEKTSPRKMAALSSWIKKNASRGTGGVLEDM 164
+ ++ + PR M + +I + T ED
Sbjct: 199 LGIRVDPNDNSGPRMMGGIKYFIDSDGITSTNTWSEDN 236
>gi|291335186|gb|ADD94810.1| hypothetical protein [uncultured phage MedDCM-OCT-S12-C102]
Length = 74
Score = 59.3 bits (142), Expect = 2e-07, Method: Composition-based stats.
Identities = 17/67 (25%), Positives = 32/67 (47%), Gaps = 1/67 (1%)
Query: 12 SSTTNKESLSDVVSRITPEDTPIYSMIKKGTT-HSIHPEWVVDDLASPGPNAQLEGDEYS 70
KE L D+++R+ + TP S++ KG+T H+ +W VD A ++G + +
Sbjct: 8 DQVAKKEDLLDLITRVDEKATPFMSLVNKGSTPHNTFIQWPVDTYADAALGGTVDGTDVA 67
Query: 71 FKTINTP 77
+
Sbjct: 68 SYANHAE 74
>gi|194015203|ref|ZP_03053819.1| phage protein [Bacillus pumilus ATCC 7061]
gi|194012607|gb|EDW22173.1| phage protein [Bacillus pumilus ATCC 7061]
Length = 367
Score = 58.5 bits (140), Expect = 3e-07, Method: Composition-based stats.
Identities = 18/86 (20%), Positives = 34/86 (39%), Gaps = 1/86 (1%)
Query: 63 QLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRK 122
Q EG + N+TQI+ + +S TQ+AV + Q + E+ +
Sbjct: 127 QNEGAGVGMDEGHDRYVDYNFTQIIERYAAVSNTQQAVRTHNVTDELNYQVQLRLKEMAR 186
Query: 123 DVEFALVSSQGSEKTSPRKMAALSSW 148
+ L+ + + PR L ++
Sbjct: 187 EFNDWLIYGRRID-GKPRMTGGLLNF 211
>gi|308172834|ref|YP_003919539.1| phage protein [Bacillus amyloliquefaciens DSM 7]
gi|307605698|emb|CBI42069.1| phage protein [Bacillus amyloliquefaciens DSM 7]
Length = 367
Score = 57.8 bits (138), Expect = 5e-07, Method: Composition-based stats.
Identities = 18/86 (20%), Positives = 34/86 (39%), Gaps = 1/86 (1%)
Query: 63 QLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRK 122
Q EG + N+TQI+ + +S TQ+AV + Q + E+ +
Sbjct: 127 QNEGAGVGIDEGHDRYVDYNFTQIIERYAAVSNTQQAVRTHNVSNELDYQVKLRLKEMAR 186
Query: 123 DVEFALVSSQGSEKTSPRKMAALSSW 148
+ L+ + + PR L ++
Sbjct: 187 EFNDWLIYGRRID-GKPRMTGGLLNF 211
>gi|257463376|ref|ZP_05627772.1| hypothetical protein FuD12_05953 [Fusobacterium sp. D12]
gi|317060946|ref|ZP_07925431.1| predicted protein [Fusobacterium sp. D12]
gi|313686622|gb|EFS23457.1| predicted protein [Fusobacterium sp. D12]
Length = 369
Score = 56.2 bits (134), Expect = 1e-06, Method: Composition-based stats.
Identities = 25/102 (24%), Positives = 44/102 (43%), Gaps = 2/102 (1%)
Query: 60 PNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEA--VDDVGYILKYKEQKLKKA 117
+ EG + T N TQI+R+ +SGT EA V G + Y ++ +K
Sbjct: 131 NDNIAEGADLQGTTYKKGVNYDNNTQIIREEISVSGTSEAINVPSSGGVDVYTLEQTRKM 190
Query: 118 LEIRKDVEFALVSSQGSEKTSPRKMAALSSWIKKNASRGTGG 159
+ +E A++ + E+ + R M + ++ K GG
Sbjct: 191 DTVLGKIEKAIIKGKKFEEGTKRGMDGVKRFLVKGQLVDAGG 232
>gi|257468183|ref|ZP_05632279.1| hypothetical protein FulcA4_02527 [Fusobacterium ulcerans ATCC
49185]
gi|317062468|ref|ZP_07926953.1| predicted protein [Fusobacterium ulcerans ATCC 49185]
gi|313688144|gb|EFS24979.1| predicted protein [Fusobacterium ulcerans ATCC 49185]
Length = 370
Score = 53.9 bits (128), Expect = 8e-06, Method: Composition-based stats.
Identities = 21/101 (20%), Positives = 38/101 (37%), Gaps = 2/101 (1%)
Query: 60 PNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKE--QKLKKA 117
+ EG + + E NYTQI+R+ +SGT +A+ + +K
Sbjct: 130 NDNIEEGADLLGASYKPGENFTNYTQIIREEISISGTAQALTVPSGEGLDPYSLEMTRKM 189
Query: 118 LEIRKDVEFALVSSQGSEKTSPRKMAALSSWIKKNASRGTG 158
+ VE A+V+ + R M + + + K
Sbjct: 190 DKAVGKVEKAIVAGKKFATGKNRGMDGIRTILDKGQIVDAN 230
>gi|56551280|ref|YP_162119.1| hypothetical protein ZMO0384 [Zymomonas mobilis subsp. mobilis
ZM4]
gi|241760937|ref|ZP_04759026.1| hypothetical protein ZmobDRAFT_0102 [Zymomonas mobilis subsp.
mobilis ATCC 10988]
gi|56542854|gb|AAV89008.1| hypothetical protein ZMO0384 [Zymomonas mobilis subsp. mobilis
ZM4]
gi|241374556|gb|EER64017.1| hypothetical protein ZmobDRAFT_0102 [Zymomonas mobilis subsp.
mobilis ATCC 10988]
Length = 35
Score = 51.6 bits (122), Expect = 3e-05, Method: Composition-based stats.
Identities = 11/30 (36%), Positives = 18/30 (60%)
Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPE 30
M++ +NT T S +E LSD++ I+P
Sbjct: 1 MSVASNTVQTYSRVGIREDLSDIIYNISPT 30
>gi|34763997|ref|ZP_00144887.1| Phage protein [Fusobacterium nucleatum subsp. vincentii ATCC 49256]
gi|27886234|gb|EAA23520.1| Phage protein [Fusobacterium nucleatum subsp. vincentii ATCC 49256]
Length = 378
Score = 44.7 bits (104), Expect = 0.004, Method: Composition-based stats.
Identities = 18/90 (20%), Positives = 36/90 (40%), Gaps = 2/90 (2%)
Query: 63 QLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQK--LKKALEI 120
EG E T+ P R+ N T I+ + + ++ T + ++ G + KK E+
Sbjct: 137 MEEGGELKTSTVRLPVRITNNTGIIYEQYKVTETAKHLNPHGQGSLSVRELESQKKKDEL 196
Query: 121 RKDVEFALVSSQGSEKTSPRKMAALSSWIK 150
+E ++ + R + + IK
Sbjct: 197 LGIMENKFLNGVKFTSGNLRMSGGVKALIK 226
>gi|156344548|ref|XP_001621225.1| hypothetical protein NEMVEDRAFT_v1g222228 [Nematostella vectensis]
gi|156206955|gb|EDO29125.1| predicted protein [Nematostella vectensis]
Length = 400
Score = 42.0 bits (97), Expect = 0.027, Method: Composition-based stats.
Identities = 30/118 (25%), Positives = 45/118 (38%), Gaps = 8/118 (6%)
Query: 59 GPNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKAL 118
A EG T E + NYTQI R +W ++ T A I E K +
Sbjct: 138 AGTAFEEGSNRPTARRLTTEYIPNYTQIFRNAWAMTDTARASYAEMGISNIAENKADCMM 197
Query: 119 EIRKDVEFALVSSQGSEKTSPRK--------MAALSSWIKKNASRGTGGVLEDMILSL 168
D+E A++ SQ TS + AL ++ N + G D +++L
Sbjct: 198 FHSVDIESAMIFSQPKMDTSGATPMHATQGILDALRQYVPGNVNAAGGTTTFDQLVAL 255
>gi|281355462|ref|ZP_06241956.1| hypothetical protein Vvad_PD3568 [Victivallis vadensis ATCC
BAA-548]
gi|281318342|gb|EFB02362.1| hypothetical protein Vvad_PD3568 [Victivallis vadensis ATCC
BAA-548]
Length = 403
Score = 40.0 bits (92), Expect = 0.097, Method: Composition-based stats.
Identities = 21/105 (20%), Positives = 38/105 (36%), Gaps = 5/105 (4%)
Query: 70 SFKTINTPERMGNYTQIMRKSWILSGTQEAVDDV-GYILKYKEQKLKKALEIRKDVEFAL 128
T T N+TQI+RK +S + A + Q EI +D+
Sbjct: 143 GEYTRRTVGSAYNHTQIIRKDLGISNSALATKTIDQVENSIARQTEFALQEIDRDMNRQA 202
Query: 129 VSSQGSEKTSPR----KMAALSSWIKKNASRGTGGVLEDMILSLA 169
+ +E+ + L ++ A +GG L +++ A
Sbjct: 203 IWGIRTERDEANDVFGEAGGLYNFATALAVDASGGRLTSKLVNDA 247
>gi|256027862|ref|ZP_05441696.1| hypothetical protein PrD11_07671 [Fusobacterium sp. D11]
gi|289765813|ref|ZP_06525191.1| phage protein [Fusobacterium sp. D11]
gi|289717368|gb|EFD81380.1| phage protein [Fusobacterium sp. D11]
Length = 378
Score = 39.7 bits (91), Expect = 0.13, Method: Composition-based stats.
Identities = 17/90 (18%), Positives = 35/90 (38%), Gaps = 2/90 (2%)
Query: 63 QLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQK--LKKALEI 120
EG E T+ P + N T I+ + + ++ T + ++ G + KK E+
Sbjct: 137 MEEGGELKASTVRLPVHITNNTGIIYEQYKVTETAKHLNPHGQGGLSVRELESQKKKDEL 196
Query: 121 RKDVEFALVSSQGSEKTSPRKMAALSSWIK 150
+E ++ + R + + IK
Sbjct: 197 LGIMENKFLNGVKFTSGNLRMSGGVKALIK 226
>gi|262067743|ref|ZP_06027355.1| hypothetical protein FUSPEROL_02025 [Fusobacterium periodonticum
ATCC 33693]
gi|291378467|gb|EFE85985.1| hypothetical protein FUSPEROL_02025 [Fusobacterium periodonticum
ATCC 33693]
Length = 379
Score = 38.5 bits (88), Expect = 0.35, Method: Composition-based stats.
Identities = 22/113 (19%), Positives = 41/113 (36%), Gaps = 6/113 (5%)
Query: 40 KGTTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEA 99
T +I +V L EG E ++ P + N T I+ + + ++ T +
Sbjct: 119 TSTAGNIVANTIVQSL----GIEMEEGGELKKSSVRLPVHITNNTGIIYEEYEVTETAKH 174
Query: 100 VDDVGYILKYKEQK--LKKALEIRKDVEFALVSSQGSEKTSPRKMAALSSWIK 150
++ G + KK E+ +E L++ R + S IK
Sbjct: 175 INPHGQSGLSVREVESQKKKDEMLGIMENKLLNGVKYVNGKLRMSGGIKSLIK 227
>gi|115403015|ref|XP_001217584.1| hypothetical protein ATEG_08998 [Aspergillus terreus NIH2624]
gi|114189430|gb|EAU31130.1| hypothetical protein ATEG_08998 [Aspergillus terreus NIH2624]
Length = 522
Score = 38.5 bits (88), Expect = 0.35, Method: Composition-based stats.
Identities = 24/119 (20%), Positives = 37/119 (31%), Gaps = 23/119 (19%)
Query: 29 PEDTPIYSMIKKGTTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTINTPERMGNYTQIMR 88
P DTP+ S++ +H LAS D + NY I +
Sbjct: 27 PADTPLSSLVASAKSH----------LASGSAR-----DALLYFDAAIARDPTNYLTIFQ 71
Query: 89 KSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGSEKTSPRKMAALSS 147
+ A + + LE++ D E AL+ +S ALS
Sbjct: 72 RG--------AAYLSLRRNTQALEDFDRVLELKPDFESALLQRSRLRASSADWTGALSD 122
>gi|225016607|ref|ZP_03705799.1| hypothetical protein CLOSTMETH_00514 [Clostridium methylpentosum
DSM 5476]
gi|224950571|gb|EEG31780.1| hypothetical protein CLOSTMETH_00514 [Clostridium methylpentosum
DSM 5476]
Length = 169
Score = 37.3 bits (85), Expect = 0.69, Method: Composition-based stats.
Identities = 15/88 (17%), Positives = 31/88 (35%), Gaps = 15/88 (17%)
Query: 67 DEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEF 126
+Y+ + R N I++ +S +Y+ Q ++ D E
Sbjct: 76 SDYTSGLLADYMRDKNACAIVKGLRAISDF-----------EYEFQMALANRKLNPDAET 124
Query: 127 ALVSSQGSE----KTSPRKMAALSSWIK 150
+++QG + R++A L I
Sbjct: 125 VFLTTQGENMYLSSSLVRQIAGLGGDIS 152
>gi|170056995|ref|XP_001864283.1| serine/threonine-protein kinase SBK1 [Culex quinquefasciatus]
gi|167876570|gb|EDS39953.1| serine/threonine-protein kinase SBK1 [Culex quinquefasciatus]
Length = 459
Score = 37.3 bits (85), Expect = 0.75, Method: Composition-based stats.
Identities = 13/90 (14%), Positives = 29/90 (32%), Gaps = 9/90 (10%)
Query: 27 ITPEDTPIYSMIKKGTTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTINTPERMGNYT-- 84
+TPE TP+++ + T + +W L S N D+ + + Y
Sbjct: 372 LTPEPTPVFTGVDPETARNKVWDW----LESNDLNRHDSQDDVVDFSFWSKSESKTYQYA 427
Query: 85 ---QIMRKSWILSGTQEAVDDVGYILKYKE 111
I+ + + + + +
Sbjct: 428 KRESIIGATSTTTASLAVTREASNASSVQR 457
>gi|256845901|ref|ZP_05551359.1| phage protein [Fusobacterium sp. 3_1_36A2]
gi|256719460|gb|EEU33015.1| phage protein [Fusobacterium sp. 3_1_36A2]
Length = 397
Score = 36.6 bits (83), Expect = 1.1, Method: Composition-based stats.
Identities = 19/90 (21%), Positives = 33/90 (36%), Gaps = 2/90 (2%)
Query: 63 QLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQK--LKKALEI 120
EG E T+ + + N T I+ + ++ T + G + KK E+
Sbjct: 156 IEEGGELKDSTVRLSKHITNITGIIYDKYEITETMKHTHPQGQGGLSAREIESQKKKDEL 215
Query: 121 RKDVEFALVSSQGSEKTSPRKMAALSSWIK 150
+E L++ R A + S IK
Sbjct: 216 LGTMENKLLNGIKYINGDIRHSAGIKSLIK 245
>gi|118197693|ref|YP_874086.1| major structural protein [Thermus phage phiYS40]
gi|116266384|gb|ABJ91467.1| major structural protein [Thermus phage phiYS40]
Length = 470
Score = 36.6 bits (83), Expect = 1.2, Method: Composition-based stats.
Identities = 18/121 (14%), Positives = 38/121 (31%), Gaps = 5/121 (4%)
Query: 16 NKESLSDVVSRITPEDTPIYSMIKK--GTTHSIHPEWVVDDLA--SPGPNAQLEGDEYSF 71
+E L V+++ DTP+ ++ K + E+ V G A EG
Sbjct: 28 EREDLEPEVTQLNVLDTPLTDLLSKNAVKAKAYEHEYNVVTARHDKIGYAAFREGG-LPR 86
Query: 72 KTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSS 131
R ++ ++ G + + K +K + + + E+
Sbjct: 87 TVEVNVVRRRIRPMLVGHRITVTELATRTTQNGVMQIDELVKREKMIAVANEFEYLAFYG 146
Query: 132 Q 132
Sbjct: 147 D 147
>gi|317130456|ref|YP_004096738.1| hypothetical protein Bcell_3767 [Bacillus cellulosilyticus DSM
2522]
gi|315475404|gb|ADU32007.1| hypothetical protein Bcell_3767 [Bacillus cellulosilyticus DSM
2522]
Length = 557
Score = 36.2 bits (82), Expect = 1.8, Method: Composition-based stats.
Identities = 18/101 (17%), Positives = 36/101 (35%), Gaps = 10/101 (9%)
Query: 57 SPGPNAQLEGDEYSFKTINTPERMGNYTQ----IMRKSWILSGTQEAVDDVGYILKYKEQ 112
+ G N + + I N M +S + ++E + G + + K++
Sbjct: 368 AVGANHIVYSSVFGPPYIIQIVDESNVDHYESLTMHESGATTDSEEIEESAGSMDEDKQE 427
Query: 113 KLKKALEIRKDVEFALVSSQGSEKTSPRKMAALSSWIKKNA 153
+ + E D+E AL R + +L I +N
Sbjct: 428 EERAPKEKEVDIETAL------SNAVGRYLNSLIKAINEND 462
>gi|242013050|ref|XP_002427232.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212511544|gb|EEB14494.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 1398
Score = 35.8 bits (81), Expect = 2.3, Method: Composition-based stats.
Identities = 25/92 (27%), Positives = 38/92 (41%), Gaps = 4/92 (4%)
Query: 5 NNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASPGPNAQL 64
NT + S ++ ES + VVS P TP S + T ++ + LA P +
Sbjct: 1021 TNTSGSLHSVSSDESTAAVVSL--PVTTPFRSNVAGTTVGTVQHSASLMSLAKPKIPETV 1078
Query: 65 EGDEYSFKTINTPERMGNYTQIMRKSWILSGT 96
+ + I+ +R N IM ILSG
Sbjct: 1079 SLNALTE--ISNIKRSNNSVNIMSSGSILSGN 1108
>gi|326532742|dbj|BAJ89216.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 806
Score = 35.0 bits (79), Expect = 3.8, Method: Composition-based stats.
Identities = 16/83 (19%), Positives = 33/83 (39%)
Query: 60 PNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALE 119
N Q ++ + +G Q+ K + S ++ + + K +L K E
Sbjct: 275 SNLQDAESAFNSALWSDSSVLGGALQLHSKEMMESNLKQVMVEAEGSRKEAFLELLKRKE 334
Query: 120 IRKDVEFALVSSQGSEKTSPRKM 142
I V+ A + + +E + R+M
Sbjct: 335 IESKVDSAFIRVKAAESSKKREM 357
>gi|194697698|gb|ACF82933.1| unknown [Zea mays]
gi|195657807|gb|ACG48371.1| O-methyltransferase ZRP4 [Zea mays]
Length = 364
Score = 34.7 bits (78), Expect = 4.7, Method: Composition-based stats.
Identities = 21/116 (18%), Positives = 34/116 (29%), Gaps = 24/116 (20%)
Query: 3 IVNNTFITSSSTTNKESLSDVVSRITPE-------------DTPIYSMIKKGTTHSIHPE 49
N F T + S+ V +TP TP+ +M+ T S E
Sbjct: 79 TTTNVFGTQQPAGGSDDDSEPVYTLTPVSRLLIASQSSQLAQTPLAAMVLDPTIVSPFFE 138
Query: 50 ---WVVDDLASPGPNAQLEG--------DEYSFKTINTPERMGNYTQIMRKSWILS 94
W +L P G D+ +F + + I+ + S
Sbjct: 139 LAAWFQHELPDPCIFKHTHGRGIWELTKDDATFDALVNDGLASDSQLIVDVAIKQS 194
>gi|302392162|ref|YP_003827982.1| methyl-accepting chemotaxis sensory transducer with Cache sensor
[Acetohalobium arabaticum DSM 5501]
gi|302204239|gb|ADL12917.1| methyl-accepting chemotaxis sensory transducer with Cache sensor
[Acetohalobium arabaticum DSM 5501]
Length = 550
Score = 34.7 bits (78), Expect = 5.2, Method: Composition-based stats.
Identities = 21/82 (25%), Positives = 30/82 (36%), Gaps = 5/82 (6%)
Query: 52 VDDLAS----PGPNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGT-QEAVDDVGYI 106
VDDL++ +AQ T E N QI + ++G QEA
Sbjct: 293 VDDLSAYSEELSASAQEGNAAIETTTQLIEEMSTNIQQISASAQEVTGLAQEANSQAEIG 352
Query: 107 LKYKEQKLKKALEIRKDVEFAL 128
+ EQ + EI VE +
Sbjct: 353 SENIEQAVSSMKEINNAVEETV 374
>gi|327183179|gb|AEA31626.1| protease [Lactobacillus amylovorus GRL 1118]
Length = 404
Score = 33.9 bits (76), Expect = 7.2, Method: Composition-based stats.
Identities = 17/84 (20%), Positives = 29/84 (34%)
Query: 49 EWVVDDLASPGPNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILK 108
++ VDDL P N LE E + Q + +SG + G
Sbjct: 216 QFEVDDLTIPAENNPLEKTEEQDNAQAQLLMGFGFKQKITYQGQISGLLLSQYLAGDQSS 275
Query: 109 YKEQKLKKALEIRKDVEFALVSSQ 132
++++ L DVE ++
Sbjct: 276 KLFNQIREELGAAYDVEANSFANN 299
>gi|223042595|ref|ZP_03612644.1| NADPH:quinone reductase [Staphylococcus capitis SK14]
gi|222444258|gb|EEE50354.1| NADPH:quinone reductase [Staphylococcus capitis SK14]
Length = 337
Score = 33.9 bits (76), Expect = 7.5, Method: Composition-based stats.
Identities = 15/109 (13%), Positives = 32/109 (29%), Gaps = 3/109 (2%)
Query: 11 SSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDL--ASPGPNAQLEGDE 68
+ E + + V+ P D YS + + + ++L +P +
Sbjct: 63 FDAAGIVEQVGEDVTMFEPGDYVFYSGSPNQHGSNEEYQLIEEELVAKAPSNLKPEQAAS 122
Query: 69 YSFKTINTPERMGNYTQIMRKSWILSGT-QEAVDDVGYILKYKEQKLKK 116
+ E + + QI G ++ G + Q K
Sbjct: 123 LPLTGLTASETLFDVFQISHDPEKNKGKSLLIINGAGGVGSIATQIAKA 171
Database: nr
Posted date: May 13, 2011 4:10 AM
Number of letters in database: 999,999,932
Number of sequences in database: 2,987,209
Database: /data/usr2/db/fasta/nr.01
Posted date: May 13, 2011 4:17 AM
Number of letters in database: 999,998,956
Number of sequences in database: 2,896,973
Database: /data/usr2/db/fasta/nr.02
Posted date: May 13, 2011 4:23 AM
Number of letters in database: 999,999,979
Number of sequences in database: 2,907,862
Database: /data/usr2/db/fasta/nr.03
Posted date: May 13, 2011 4:29 AM
Number of letters in database: 999,999,513
Number of sequences in database: 2,932,190
Database: /data/usr2/db/fasta/nr.04
Posted date: May 13, 2011 4:33 AM
Number of letters in database: 792,586,372
Number of sequences in database: 2,260,650
Lambda K H
0.308 0.134 0.377
Lambda K H
0.267 0.0411 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,309,193,696
Number of Sequences: 13984884
Number of extensions: 44428082
Number of successful extensions: 102725
Number of sequences better than 10.0: 148
Number of HSP's better than 10.0 without gapping: 91
Number of HSP's successfully gapped in prelim test: 57
Number of HSP's that attempted gapping in prelim test: 102445
Number of HSP's gapped (non-prelim): 228
length of query: 169
length of database: 4,792,584,752
effective HSP length: 128
effective length of query: 41
effective length of database: 3,002,519,600
effective search space: 123103303600
effective search space used: 123103303600
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.5 bits)
S2: 76 (33.9 bits)