BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= 537021.9.peg.1074_1 (169 letters) Database: nr 13,984,884 sequences; 4,792,584,752 total letters Searching..................................................done >gi|148257053|ref|YP_001241638.1| putative phage major head protein [Bradyrhizobium sp. BTAi1] gi|146409226|gb|ABQ37732.1| putative phage Major head protein [Bradyrhizobium sp. BTAi1] Length = 320 Score = 181 bits (459), Expect = 3e-44, Method: Composition-based stats. Identities = 65/168 (38%), Positives = 92/168 (54%), Gaps = 5/168 (2%) Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASP-G 59 MT +TF+T + N+E LSD++ RI P DTP S + K +++ EW LA G Sbjct: 1 MTTPTSTFVTYQAVGNREDLSDMIYRIDPVDTPFMSGVDKEKATAVNHEWQTQALAPADG 60 Query: 60 PNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALE 119 NAQLEGD+ + R+GN QI K +SGTQ+AVD G + Q++ K LE Sbjct: 61 TNAQLEGDDPNTNVTTPTVRLGNQCQISYKVARVSGTQQAVDHAGRDNELAYQEMLKGLE 120 Query: 120 IRKDVEFALVSSQG----SEKTSPRKMAALSSWIKKNASRGTGGVLED 163 +++D+E L + T+PRK A++ SWI N S+GT G D Sbjct: 121 LKRDLETILCGTNQAKVVGNTTTPRKTASILSWIVSNTSKGTAGGAAD 168 >gi|150397033|ref|YP_001327500.1| hypothetical protein Smed_1830 [Sinorhizobium medicae WSM419] gi|150028548|gb|ABR60665.1| hypothetical protein Smed_1830 [Sinorhizobium medicae WSM419] Length = 331 Score = 179 bits (455), Expect = 9e-44, Method: Composition-based stats. Identities = 93/161 (57%), Positives = 118/161 (73%) Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASPGP 60 M + NT++T+ + N+E LSDVVSRITPEDTPIYS I+KG SIHPEW D+LA+PG Sbjct: 1 MAALANTYMTTQAVGNREELSDVVSRITPEDTPIYSFIEKGKCVSIHPEWETDELAAPGE 60 Query: 61 NAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEI 120 N + EGDEY+F I PER+GNYTQIMRK WI+SGTQE V + G + K K QKLKK +EI Sbjct: 61 NIKSEGDEYAFGAITPPERLGNYTQIMRKDWIISGTQEVVSEAGNVQKRKYQKLKKGIEI 120 Query: 121 RKDVEFALVSSQGSEKTSPRKMAALSSWIKKNASRGTGGVL 161 RKDVE+A+V + S + R+ +L++WI+ N SRG GG Sbjct: 121 RKDVEYAIVDTNASVAGATREFGSLNTWIETNVSRGAGGAN 161 >gi|288817864|ref|YP_003432211.1| putative phage major head protein [Hydrogenobacter thermophilus TK-6] gi|288787263|dbj|BAI69010.1| putative phage major head protein [Hydrogenobacter thermophilus TK-6] gi|308751463|gb|ADO44946.1| putative phage major head protein [Hydrogenobacter thermophilus TK-6] Length = 291 Score = 176 bits (447), Expect = 8e-43, Method: Composition-based stats. Identities = 58/166 (34%), Positives = 92/166 (55%), Gaps = 5/166 (3%) Query: 7 TFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASPGPNAQLEG 66 T ++ N+E LSD+++ I+P +TP+YSM K T + + EW+ D LA+PG NA +EG Sbjct: 2 ALTTYTAVGNREDLSDIITNISPTETPLYSMFGKATAKATYHEWIEDSLAAPGTNAMVEG 61 Query: 67 DEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEF 126 Y T R GNYTQI K + +S TQEAV G + Q K EI +DVE+ Sbjct: 62 ANYPIADPQTRVRKGNYTQIFAKGYGISETQEAVLKAGIKSEIAYQMQKAMKEIARDVEY 121 Query: 127 ALVSSQ---GSEKTSPRKMAALSSWIKKNA--SRGTGGVLEDMILS 167 A++++ T+ R+M + +++ N + G+ L + +L+ Sbjct: 122 AIINNTAAVAGNATTARQMGGIQAFVITNVLANGGSPRALTETLLN 167 >gi|27476049|ref|NP_775251.1| major head protein [Pseudomonas phage PaP3] gi|27414479|gb|AAL85565.1| major head protein [Pseudomonas phage PaP3] Length = 317 Score = 175 bits (444), Expect = 2e-42, Method: Composition-based stats. Identities = 53/166 (31%), Positives = 82/166 (49%), Gaps = 4/166 (2%) Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASPGP 60 M N T +E L D++ I P DTP S I KG +I EW D+L PG Sbjct: 1 MATPTNAVSTVEINGKREDLIDIIYNIAPYDTPFMSAIGKGVATAITHEWQTDELRQPGK 60 Query: 61 NAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEI 120 N ++EG++ + K + + NY QI ++ ++GT + V G + Q KK+ E+ Sbjct: 61 NTRVEGEDATIKAGSFTTMLNNYCQISDETLQVTGTADRVKKAGRKNELAYQLAKKSKEL 120 Query: 121 RKDVEFALVSS----QGSEKTSPRKMAALSSWIKKNASRGTGGVLE 162 + D+E+ALV + T+P +MA + ++ K N S G GV Sbjct: 121 KLDMEYALVGAPQAKVQRNTTTPGQMANIFAYYKTNGSLGANGVAP 166 >gi|167600435|ref|YP_001671935.1| major head protein [Pseudomonas phage LUZ24] gi|161168298|emb|CAP45463.1| major head protein [Pseudomonas phage LUZ24] Length = 317 Score = 174 bits (441), Expect = 4e-42, Method: Composition-based stats. Identities = 51/167 (30%), Positives = 83/167 (49%), Gaps = 4/167 (2%) Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASPGP 60 M N T +E L D++ I P DTP + I KG +I EW D+L PG Sbjct: 1 MATPTNAVSTVEINGKREDLIDIIYNIAPYDTPFMTAIGKGVATAITHEWQTDELRQPGK 60 Query: 61 NAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEI 120 N ++EG++ + K + + NY QI ++ ++GT + V G + Q KK+ E+ Sbjct: 61 NTRVEGEDATIKAGSFTTMLNNYCQISDETLQVTGTADKVKKAGRKNELAYQLAKKSKEL 120 Query: 121 RKDVEFALVSSQGS----EKTSPRKMAALSSWIKKNASRGTGGVLED 163 + D+E+A+V + + T+P +MA + ++ K N S G G L Sbjct: 121 KLDMEYAMVGAPQAKIQRNTTTPGQMANIFAYYKTNGSVGANGTLPT 167 >gi|291334638|gb|ADD94286.1| putative phage major head protein [uncultured phage MedDCM-OCT-S04-C64] Length = 323 Score = 172 bits (436), Expect = 1e-41, Method: Composition-based stats. Identities = 51/172 (29%), Positives = 83/172 (48%), Gaps = 3/172 (1%) Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASPGP 60 M + TF T + +E L+D++ I+P DTP S + + ++ EW D L + Sbjct: 1 MAVPAQTFTTYGAVGEREDLTDIIYDISPMDTPFLSNASRESATAVFYEWQTDSLDTAAV 60 Query: 61 NAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEI 120 NAQLEGD+ T + R+GNYTQI K ++GT AV G + Q K+ E+ Sbjct: 61 NAQLEGDDGVTSTSSATTRLGNYTQISTKVPRVTGTLRAVATAGRADELAYQISKRGREL 120 Query: 121 RKDVEFALV---SSQGSEKTSPRKMAALSSWIKKNASRGTGGVLEDMILSLA 169 ++D+E AL ++ + R +A + +W+ N + + S A Sbjct: 121 KRDMETALTGTQAASAGGAGTARNLAGIGAWLSTNQVQKGANATTPPVSSGA 172 >gi|291334838|gb|ADD94478.1| putative phage major head protein [uncultured phage MedDCM-OCT-S06-C1041] Length = 323 Score = 172 bits (435), Expect = 2e-41, Method: Composition-based stats. Identities = 51/172 (29%), Positives = 83/172 (48%), Gaps = 3/172 (1%) Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASPGP 60 M + TF T + +E L+D++ I+P DTP S + + ++ EW D L + Sbjct: 1 MAVPAQTFTTYGAVGEREDLTDIIYDISPMDTPFLSNASRESATAVFYEWQTDSLDTAAV 60 Query: 61 NAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEI 120 NAQLEGD+ T + R+GNYTQI K ++GT AV G + Q K+ E+ Sbjct: 61 NAQLEGDDGVTSTSSATTRLGNYTQISTKVPRVTGTLRAVATAGRADELAYQISKRGREL 120 Query: 121 RKDVEFALV---SSQGSEKTSPRKMAALSSWIKKNASRGTGGVLEDMILSLA 169 ++D+E AL ++ + R +A + +W+ N + + S A Sbjct: 121 KRDMETALTGTQAASAGGAGTARNLAGIGAWLSTNQVQKGANATTPPVTSGA 172 >gi|163783849|ref|ZP_02178828.1| hypothetical protein HG1285_12862 [Hydrogenivirga sp. 128-5-R1-1] gi|159880872|gb|EDP74397.1| hypothetical protein HG1285_12862 [Hydrogenivirga sp. 128-5-R1-1] Length = 291 Score = 171 bits (434), Expect = 3e-41, Method: Composition-based stats. Identities = 59/172 (34%), Positives = 94/172 (54%), Gaps = 10/172 (5%) Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASPGP 60 M + T ++ N+E LSD+++ I P +TP+YSM K T S + EW+ DDL PG Sbjct: 1 MAV-----TTYTAVGNREDLSDLITNIAPTETPLYSMFGKTTAKSTYHEWLEDDLNPPGV 55 Query: 61 NAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEI 120 NA++EG +++ T R GNYTQI K + +S TQE V G + Q K EI Sbjct: 56 NAKVEGADFTIDTPTNRVRKGNYTQIFSKGYGVSRTQEKVLKAGIKSELAYQMAKAMKEI 115 Query: 121 RKDVEFALVSS---QGSEKTSPRKMAALSSWIKKNA--SRGTGGVLEDMILS 167 +DVE+A++++ T+ R+M + +++ N + GT L + +L+ Sbjct: 116 ARDVEYAIINNTAASAGSATTARQMGGVQAFVSTNVLANAGTPRPLTETLLN 167 >gi|227822441|ref|YP_002826413.1| putative phage major head protein [Sinorhizobium fredii NGR234] gi|227341442|gb|ACP25660.1| putative phage major head protein [Sinorhizobium fredii NGR234] Length = 331 Score = 169 bits (428), Expect = 1e-40, Method: Composition-based stats. Identities = 88/155 (56%), Positives = 113/155 (72%) Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASPGP 60 M ++ NTF T+ + N+E LSDVVSRITPEDTPIYS+I+KG + HPEW D+LA+PG Sbjct: 1 MAVLTNTFQTTQAVGNREELSDVVSRITPEDTPIYSLIEKGKCTTYHPEWETDELAAPGA 60 Query: 61 NAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEI 120 N + EG+EY+F I P+R+GNYTQIMRK WI+S TQE + G + K K QKLKK +EI Sbjct: 61 NVREEGEEYAFGAITPPKRLGNYTQIMRKDWIISATQEVTAEAGNVQKRKYQKLKKGVEI 120 Query: 121 RKDVEFALVSSQGSEKTSPRKMAALSSWIKKNASR 155 RKDVEFA+V + + S R+ +LS+WI NASR Sbjct: 121 RKDVEFAIVDTNATVAGSTREFGSLSTWIVSNASR 155 >gi|315122533|ref|YP_004063022.1| hypothetical protein CKC_03925 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495935|gb|ADR52534.1| hypothetical protein CKC_03925 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 331 Score = 167 bits (423), Expect = 5e-40, Method: Composition-based stats. Identities = 134/160 (83%), Positives = 150/160 (93%) Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASPGP 60 MT + NTFI++SS+TNKESLSDVVSRITPEDTPIYSMIKKG+T SIHPEWVVDDL+SPGP Sbjct: 1 MTEITNTFISTSSSTNKESLSDVVSRITPEDTPIYSMIKKGSTRSIHPEWVVDDLSSPGP 60 Query: 61 NAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEI 120 NAQLEGDEYSF++I+TPERMGNYTQIMRKSWILSGTQE++DD G +LKYKEQKLKKALEI Sbjct: 61 NAQLEGDEYSFESISTPERMGNYTQIMRKSWILSGTQESIDDTGSLLKYKEQKLKKALEI 120 Query: 121 RKDVEFALVSSQGSEKTSPRKMAALSSWIKKNASRGTGGV 160 RKDVEFALVS+Q SEK SPRK+A+LSSWIK N +RGTGG Sbjct: 121 RKDVEFALVSAQESEKKSPRKLASLSSWIKTNVNRGTGGA 160 >gi|160897389|ref|YP_001562971.1| putative phage major head protein [Delftia acidovorans SPH-1] gi|160362973|gb|ABX34586.1| putative phage major head protein [Delftia acidovorans SPH-1] Length = 306 Score = 167 bits (423), Expect = 5e-40, Method: Composition-based stats. Identities = 61/169 (36%), Positives = 92/169 (54%), Gaps = 2/169 (1%) Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASPGP 60 M + TF+T+++ N+E L+DV+ RI+P TP +M K + EW DLA+ Sbjct: 1 MAAPSGTFLTTAAIGNREDLTDVIYRISPTQTPTLNMASKAKATNTLHEWQTQDLAAAAS 60 Query: 61 NAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEI 120 NA +EGD+ + KT+ R+ N TQI K+ +SGTQ A++ G + Q +LEI Sbjct: 61 NAAVEGDDAAAKTVTPTVRLNNRTQISTKTVRVSGTQRAMNPAGRKDELAYQLSLASLEI 120 Query: 121 RKDVEFALVSSQGSEKTSPRKMAALSSWIKKNASRGTGGVLEDMILSLA 169 ++D+E L S + TSPRK L W+ N +R GG L D + + Sbjct: 121 KRDMELDLTQSDVA-ATSPRKSRGLRGWVVDNVNRN-GGTLADYVANTG 167 >gi|221199511|ref|ZP_03572555.1| major head protein [Burkholderia multivorans CGD2M] gi|221205587|ref|ZP_03578602.1| major head protein [Burkholderia multivorans CGD2] gi|221174425|gb|EEE06857.1| major head protein [Burkholderia multivorans CGD2] gi|221180796|gb|EEE13199.1| major head protein [Burkholderia multivorans CGD2M] Length = 317 Score = 161 bits (407), Expect = 4e-38, Method: Composition-based stats. Identities = 55/164 (33%), Positives = 86/164 (52%), Gaps = 5/164 (3%) Query: 4 VNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASP-GPNA 62 NT+ T ++ N+E L + V +I+P DTP S I+K ++ EW D L +P NA Sbjct: 2 PANTYTTYTAVGNREDLINKVFQISPTDTPFTSAIEKTDAEGVYHEWQTDSLRAPTDSNA 61 Query: 63 QLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRK 122 +EG + ++ + +R+GN QI++ ++ +SGTQEAV G + KKA+E++K Sbjct: 62 AVEGADATYNEQDPTKRIGNRCQIVQDTFSVSGTQEAVKRAGP-KEVARLSAKKAIELKK 120 Query: 123 DVEFALVSSQG---SEKTSPRKMAALSSWIKKNASRGTGGVLED 163 D+E + S KT RKM + W + N G G D Sbjct: 121 DIEATSLVSGAAVVGSKTVARKMRGVKGWCETNFLGGAGAAAPD 164 >gi|291334595|gb|ADD94245.1| putative phage major head protein [uncultured phage MedDCM-OCT-S04-C136] Length = 316 Score = 153 bits (387), Expect = 6e-36, Method: Composition-based stats. Identities = 44/150 (29%), Positives = 73/150 (48%), Gaps = 3/150 (2%) Query: 8 FITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASPGPNAQLEGD 67 + T + +E L+D++ I+P +TP S + K + +W D LA NA +EG Sbjct: 4 YQTYQTIGIREDLADIIYSISPTETPFMSGVAKTKATNTLHQWQTDALADVAANAAVEGA 63 Query: 68 EYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFA 127 + S+ T+ N+TQI K ++ T EAV G + Q K A E+++D+E A Sbjct: 64 DISYGTMAPTVLENNHTQISTKGIQVTATNEAVTSAGRNNEMAYQVAKAAKELKRDMETA 123 Query: 128 LVSS---QGSEKTSPRKMAALSSWIKKNAS 154 L+S+ T+ RK+ +W + N Sbjct: 124 LLSNVAKTAGNATTARKLGGCPTWYETNVD 153 >gi|307308932|ref|ZP_07588615.1| putative phage major head protein [Sinorhizobium meliloti BL225C] gi|306900566|gb|EFN31179.1| putative phage major head protein [Sinorhizobium meliloti BL225C] Length = 309 Score = 148 bits (373), Expect = 3e-34, Method: Composition-based stats. Identities = 49/165 (29%), Positives = 76/165 (46%), Gaps = 5/165 (3%) Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASPGP 60 M T T+ + +E L D++S I+PEDTP + I K + EW D L + Sbjct: 1 MA----TLKTTDVSHVREDLEDIISNISPEDTPFLTSIAKVSASQKTHEWTQDKLRARNK 56 Query: 61 NAQLEGDEYSFKTINTPE-RMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALE 119 N + N+ R+ N+ QI ++ +SG+ A D VG + Q K + Sbjct: 57 NNAAIEGAEAAAASNSAPVRLRNHAQIFTETVQVSGSLIASDTVGSKNELAYQLAKSIKQ 116 Query: 120 IRKDVEFALVSSQGSEKTSPRKMAALSSWIKKNASRGTGGVLEDM 164 ++ D+E VS + S PR+M + +W+K NA GTGG Sbjct: 117 VKGDIEATAVSEKASSLGEPREMGGMEAWVKTNALHGTGGATAGY 161 >gi|290457630|sp|P85987|CAPSD_BPSK1 RecName: Full=Major capsid protein; AltName: Full=Virion protein G gi|221271431|dbj|BAH15184.1| major capsid protein [Serratia phage KSP100] Length = 306 Score = 146 bits (369), Expect = 8e-34, Method: Composition-based stats. Identities = 45/163 (27%), Positives = 67/163 (41%), Gaps = 9/163 (5%) Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASPGP 60 M + T + KE +D VS I+PE TP+ SMI+K H+ +W D L Sbjct: 1 MA----AYQTYTMAGIKEDFADWVSNISPEYTPLISMIRKFPVHNTMFQWQWDVLKDVDT 56 Query: 61 -NAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALE 119 N E + + + NY QIMRK +S + AV G + Q K A E Sbjct: 57 ENQHNEASDAKDVELTPTTVVQNYVQIMRKVVFVSDSANAVSSHGREKELFYQLKKAAKE 116 Query: 120 IRKDVEFALV----SSQGSEKTSPRKMAALSSWIKKNASRGTG 158 +++D E + + T PR A+ S I + + Sbjct: 117 LKRDNEGIFLLKDRAGDAGSATKPRLTASFGSLIDASMKKTAD 159 >gi|316934287|ref|YP_004109269.1| putative phage major head protein [Rhodopseudomonas palustris DX-1] gi|315602001|gb|ADU44536.1| putative phage major head protein [Rhodopseudomonas palustris DX-1] Length = 304 Score = 141 bits (355), Expect = 4e-32, Method: Composition-based stats. Identities = 46/162 (28%), Positives = 75/162 (46%), Gaps = 5/162 (3%) Query: 8 FITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASPGPNAQLEGD 67 + + + KE +SD++S ITP TP S+ K T H+ + EW D+L + NAQ EG Sbjct: 5 YTSYDAVGTKEDVSDIISMITPTKTPFTSLTKSETVHNTYYEWQEDELRATADNAQPEGF 64 Query: 68 EYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFA 127 + GN TQIM ++ +SGT +AV G + + K + ++ D+E A Sbjct: 65 TATPVARTPTIMRGNVTQIMSDTFEVSGTNDAVTKYGRGKESAREASKASAALKLDLEAA 124 Query: 128 LVSSQ-----GSEKTSPRKMAALSSWIKKNASRGTGGVLEDM 164 + + ++PRK A + I + TG + Sbjct: 125 FTKNDSDMVKPTVASTPRKFAGVQKQIDPDNIVYTGATGTKI 166 >gi|283856246|ref|YP_162122.2| putative phage major head protein [Zymomonas mobilis subsp. mobilis ZM4] gi|283775241|gb|AAV89011.2| putative phage major head protein [Zymomonas mobilis subsp. mobilis ZM4] Length = 304 Score = 130 bits (327), Expect = 6e-29, Method: Composition-based stats. Identities = 42/139 (30%), Positives = 68/139 (48%), Gaps = 5/139 (3%) Query: 31 DTPIYSMIKKGTTHSIHPEWVVDDLASP-GPNAQLEGDEYSFKTINTPERMGNYTQIMRK 89 +TP + I + T + + EW D+LAS N Q+EG + + ++ R+GNYTQIM K Sbjct: 10 ETPFVTAIGQTTAKNTYTEWQTDNLASANAQNKQVEGADLANESRQPTVRVGNYTQIMTK 69 Query: 90 SWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS----EKTSPRKMAAL 145 S T AV + G ++ Q + E+++D+E + + R+ A Sbjct: 70 VVGTSTTDRAVHNAGRGDEHAYQLARAGQELKRDIEARFTGNFAAIPGDGAVVARETAGA 129 Query: 146 SSWIKKNASRGTGGVLEDM 164 +W++ NA RG GG M Sbjct: 130 LAWLRSNAHRGDGGANPVM 148 >gi|167583566|ref|YP_001671756.1| major head protein [Enterobacteria phage phiEco32] gi|164375404|gb|ABY52812.1| major head protein [Enterobacteria phage phiEco32] Length = 352 Score = 129 bits (325), Expect = 1e-28, Method: Composition-based stats. Identities = 41/137 (29%), Positives = 61/137 (44%), Gaps = 2/137 (1%) Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASP-G 59 M F++ K S ++ +S ++P+DTP SM K + + W D LAS G Sbjct: 1 MANPT-LFVSYDQNGKKLSFANWISVLSPQDTPFVSMTGKESINQTIFSWQTDALASVDG 59 Query: 60 PNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALE 119 NA +EG + N TQI+RK +S T + G + Q KK E Sbjct: 60 NNAHVEGSRAEDGEMKPTVIKSNVTQILRKVVRVSDTANTTANYGRGRELMYQLEKKGKE 119 Query: 120 IRKDVEFALVSSQGSEK 136 I++D+E L+S Q Sbjct: 120 IKRDLEKILLSGQARTD 136 >gi|154174521|ref|YP_001409081.1| hypothetical protein CCV52592_0028 [Campylobacter curvus 525.92] gi|112803013|gb|EAU00357.1| conserved hypothetical protein [Campylobacter curvus 525.92] Length = 327 Score = 119 bits (297), Expect = 2e-25, Method: Composition-based stats. Identities = 44/181 (24%), Positives = 79/181 (43%), Gaps = 17/181 (9%) Query: 1 MTIVNNTFIT--SSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASP 58 M I + F + S+ D + I ++TP+ S+I SI W+ D + P Sbjct: 1 MAITSTGFQAPATKRVGLVPSVYDKIILIGADETPMLSLIGTSKVKSIKHSWITDTIGEP 60 Query: 59 GPNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKAL 118 NAQ+E ++S +T +++ N TQI +S T + G + + + KKA Sbjct: 61 KKNAQIEISDFSGAGKSTKKQLDNDTQIFTTEVSVSKTMQTAQTYG-GKELENEITKKAK 119 Query: 119 EIRKDVEFALV--------------SSQGSEKTSPRKMAALSSWIKKNASRGTGGVLEDM 164 E + D+E+AL ++ T+ +MA + ++ AS TGG ++ Sbjct: 120 EHKLDIEYALFGLGRDADAKKSVFKAATPRTDTTASEMAGIFYYVANGASAFTGGKCGNV 179 Query: 165 I 165 + Sbjct: 180 L 180 >gi|291334405|gb|ADD94061.1| major head protein [uncultured phage MedDCM-OCT-S01-C1] Length = 344 Score = 118 bits (295), Expect = 3e-25, Method: Composition-based stats. Identities = 34/152 (22%), Positives = 66/152 (43%), Gaps = 4/152 (2%) Query: 16 NKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASPG-PNAQLEGDEYSFKTI 74 E + + I+ P ++ T + +WVVD+L +P NA+++G + + Sbjct: 20 INEDVMQKIFDISKIPLPFTDLVGSTTHKNERFDWVVDELRAPDVTNARVDGSDAGTASE 79 Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQ-- 132 R+GN++QI + +S +A D +G + + + +IR+DVE +++Q Sbjct: 80 AGGARVGNHSQISDEVIAVSYRADASDTIGRTKELAYRITRGNQQIRRDVEAMALNNQAS 139 Query: 133 -GSEKTSPRKMAALSSWIKKNASRGTGGVLED 163 T L +WI+ +G G Sbjct: 140 VAGTDTVAGVTGGLPTWIETTVMQGDGSAAVT 171 >gi|256751057|ref|ZP_05491940.1| conserved hypothetical protein [Thermoanaerobacter ethanolicus CCSD1] gi|256750167|gb|EEU63188.1| conserved hypothetical protein [Thermoanaerobacter ethanolicus CCSD1] Length = 292 Score = 118 bits (295), Expect = 3e-25, Method: Composition-based stats. Identities = 36/165 (21%), Positives = 70/165 (42%), Gaps = 12/165 (7%) Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTT----HSIHPEWVVDDLA 56 M +N + K L++ ++ + P DTP+++ + +S W L Sbjct: 1 MIQTSN-----FTVGEKIDLTNEIALVQPLDTPLFTYLMSRKAYDKANSTIVTWREKTLD 55 Query: 57 SPGPNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKK 116 + + EG E + + N +I +K+ +SGT EA++ G Y + + Sbjct: 56 TTEDISVPEGSETNVFYKSDRVEKNNVCEIFKKAVQISGTAEAINIKGIGDLYASEMADR 115 Query: 117 ALEIRKDVEFALVSSQGSEKTS--PRKMAALSSWIKK-NASRGTG 158 EI+ ++E L++ + ++ RKMA L S++ N T Sbjct: 116 LAEIKVNIEKKLINGVKDDGSTSGIRKMAGLLSFVLTENKVSNTA 160 >gi|257458669|ref|ZP_05623796.1| conserved hypothetical protein [Campylobacter gracilis RM3268] gi|257443942|gb|EEV19058.1| conserved hypothetical protein [Campylobacter gracilis RM3268] Length = 328 Score = 115 bits (288), Expect = 2e-24, Method: Composition-based stats. Identities = 43/175 (24%), Positives = 77/175 (44%), Gaps = 17/175 (9%) Query: 1 MTIVNNTFITSSST--TNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASP 58 M I + F ++ K S+ D + I +DTP+ S+I + W+ D++A+P Sbjct: 1 MAITSTGFQAPATKREGLKPSVYDSIILIGADDTPVLSLIGTSNVTNTEHSWLTDNIAAP 60 Query: 59 GPNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKAL 118 NAQLE +++ +T ++ N QI + +S T + V G + + + K+A Sbjct: 61 KKNAQLEISDFADDRKSTIQKTTNSVQIFTTNISVSYTMQKVATYG-GKEMERETTKRAK 119 Query: 119 EIRKDVEFALV--------------SSQGSEKTSPRKMAALSSWIKKNASRGTGG 159 E ++D+E+AL + T +MA + +I K S G Sbjct: 120 EHKRDMEYALFGLGRDTDTKVSIFKAPTSRADTVAGEMAGMFYYISKGESAFVNG 174 >gi|331088860|ref|ZP_08337770.1| hypothetical protein HMPREF1025_01353 [Lachnospiraceae bacterium 3_1_46FAA] gi|330407383|gb|EGG86886.1| hypothetical protein HMPREF1025_01353 [Lachnospiraceae bacterium 3_1_46FAA] Length = 314 Score = 111 bits (278), Expect = 3e-23, Method: Composition-based stats. Identities = 35/151 (23%), Positives = 68/151 (45%), Gaps = 6/151 (3%) Query: 19 SLSDVVSRITPEDTPIYSMIKK----GTTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTI 74 L++ + ++P DTP+ +M+ I W +L + +LEG E Sbjct: 17 DLTEEIKLVSPTDTPLTTMLMGRGAVEPATDITVTWRERELNANRGTLKLEGAEAGAVIT 76 Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS 134 +T + N QI+ K +SGT A+ G + + + +E ++D+E+ ++ + Sbjct: 77 STRGSLSNVCQIIEKVTQVSGTARALHPKGIGDTFTAEVQDRLIETKRDLEWYFLNGTKT 136 Query: 135 --EKTSPRKMAALSSWIKKNASRGTGGVLED 163 ++PR+MA L + + N T G L + Sbjct: 137 LEADSTPRQMAGLINLVNDNNVVSTAGALSE 167 >gi|291526329|emb|CBK91916.1| hypothetical protein EUR_29920 [Eubacterium rectale DSM 17629] Length = 304 Score = 110 bits (276), Expect = 6e-23, Method: Composition-based stats. Identities = 31/151 (20%), Positives = 68/151 (45%), Gaps = 6/151 (3%) Query: 19 SLSDVVSRITPEDTPIYSMIKKGT----THSIHPEWVVDDLASPGPNAQLEGDEYSFKTI 74 L++ + ++P DTP+ +++ + I W +L S +LEG E Sbjct: 17 DLTEEIKLVSPTDTPLTTLLMGRGQVVPANDITVTWREKELNSDRGTLKLEGSEAGEAIT 76 Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS 134 + + + N QI+ K +SGT +++ G + + + +E ++D+E+ ++ + Sbjct: 77 SGRKTLSNVCQIIEKVTQVSGTARSLNPKGIGDVFNSEVQDRLVETKRDMEWYFLNGTKA 136 Query: 135 --EKTSPRKMAALSSWIKKNASRGTGGVLED 163 ++PR+M L + + T G L + Sbjct: 137 LESGSTPRQMNGLVNLVASGNVVETKGALTE 167 >gi|196048420|ref|ZP_03115595.1| conserved hypothetical protein [Bacillus cereus 03BB108] gi|196020677|gb|EDX59409.1| conserved hypothetical protein [Bacillus cereus 03BB108] Length = 312 Score = 110 bits (275), Expect = 7e-23, Method: Composition-based stats. Identities = 34/143 (23%), Positives = 64/143 (44%), Gaps = 7/143 (4%) Query: 16 NKESLSDVVSRITPEDTPIYSMIK----KGTTHSIHPEWVVDDLASPGPNAQLEGDEYSF 71 K LS+ ++ +P DTP +++ + S W L S QLEG + + Sbjct: 11 EKIDLSEAIAYASPMDTPFTTLLLQNGLTADSTSTEISWREAALDSNRKGPQLEGADATD 70 Query: 72 KTINTPERMGNYTQIMRKSWILSGTQEAVDDVG-YILKYKEQKLKKALEIRKDVEFALVS 130 T E + N QI +++ +SG+ EAV G + + + +E + D+E+ + Sbjct: 71 PNKTTRELIKNNQQIFQRTAEVSGSLEAVKVPGVPGGEMASEINDRMIESKVDLEWYALQ 130 Query: 131 SQGSE--KTSPRKMAALSSWIKK 151 ++ ++PR+M L + I Sbjct: 131 GTKADESGSTPRQMNGLINLINS 153 >gi|229187822|ref|ZP_04314947.1| hypothetical protein bcere0004_53480 [Bacillus cereus BGSC 6E1] gi|228595657|gb|EEK53352.1| hypothetical protein bcere0004_53480 [Bacillus cereus BGSC 6E1] Length = 313 Score = 108 bits (271), Expect = 2e-22, Method: Composition-based stats. Identities = 37/156 (23%), Positives = 63/156 (40%), Gaps = 8/156 (5%) Query: 16 NKESLSDVVSRITPEDTPIYSMIK----KGTTHSIHPEWVVDDLASPGPNAQLEGDEYSF 71 K LS ++ +P DTP +++ S W L S QLEG + Sbjct: 11 EKIDLSQAIAYASPMDTPFTTLLLQNGLTADATSTEISWREAALDSNRKGPQLEGANATD 70 Query: 72 KTINTPERMGNYTQIMRKSWILSGTQEAVDDVG-YILKYKEQKLKKALEIRKDVEFALVS 130 E + N QI +++ +SG+ EAV G + + + +E + D+E+ + Sbjct: 71 PNKTVRELIKNNQQIFQRTAEVSGSLEAVKVPGVPGGEMASEINDRMIEAKVDLEWYALQ 130 Query: 131 SQGSE--KTSPRKMAALSSWIKK-NASRGTGGVLED 163 ++ +PR+M L + I N T G L Sbjct: 131 GTKADESGATPRQMNGLINLINSRNKFTPTSGKLSA 166 >gi|291336566|gb|ADD96115.1| hypothetical protein HG1285_12862 [uncultured organism MedDCM-OCT-S04-C6] Length = 347 Score = 103 bits (256), Expect = 9e-21, Method: Composition-based stats. Identities = 40/158 (25%), Positives = 73/158 (46%), Gaps = 10/158 (6%) Query: 12 SSTTNKESLSDVVSRITPEDTPIYSMIKKGTT-HSIHPEWVVDDLASPGPNAQLEGDEYS 70 KE L D+++R+ + TP S++ KG+T H+ +W VD A ++G + + Sbjct: 8 DQVAKKEDLLDLITRVDEKATPFMSLVNKGSTPHNTFIQWPVDTYADAALGGTVDGTDVA 67 Query: 71 FKTINTPERM--GNYTQIMRKSWILSGTQEAVDDV---GYILKYKEQKLKKALEIRKDVE 125 + R +Y Q RK++ +S + V DV G + E K +E+ +++E Sbjct: 68 SYANHAENRTLLSSYLQTFRKAYQVSRLAQEVSDVAGLGAGNEIAEASAKAGVELVRNME 127 Query: 126 FALVSSQ----GSEKTSPRKMAALSSWIKKNASRGTGG 159 L+S Q + ++ + L WI+ +A T G Sbjct: 128 ATLLSDQEHQVDNGSSNAYLLRGLGVWIRDSARLTTPG 165 >gi|315144740|gb|EFT88756.1| conserved hypothetical protein [Enterococcus faecalis TX2141] Length = 300 Score = 102 bits (253), Expect = 2e-20, Method: Composition-based stats. Identities = 38/152 (25%), Positives = 67/152 (44%), Gaps = 7/152 (4%) Query: 19 SLSDVVSRITPEDTPIYS----MIKKGTTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTI 74 +S ++ + TP S K S +W +L +AQLEG +Y+ Sbjct: 13 DISQEINALQRPSTPFLSWLLGAGKTSPATSTEIKWRESELDGEDSSAQLEGGDYTD-AD 71 Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS 134 + + NYT+I RKS +SGT +A++ G + Q ++ALE+++D+ L+ + Sbjct: 72 SGRKWFNNYTEIFRKSTSVSGTLDAINVNGVGSELANQVSQRALEMKRDLNKKLLIGVKA 131 Query: 135 --EKTSPRKMAALSSWIKKNASRGTGGVLEDM 164 T R+MA + + I + T Sbjct: 132 DENGTKGRQMAGVINLINSDNLVKTSAADAVT 163 >gi|217961109|ref|YP_002339677.1| hypothetical protein BCAH187_A3735 [Bacillus cereus AH187] gi|229140327|ref|ZP_04268882.1| hypothetical protein bcere0013_34260 [Bacillus cereus BDRD-ST26] gi|217064163|gb|ACJ78413.1| conserved hypothetical protein [Bacillus cereus AH187] gi|228642888|gb|EEK99164.1| hypothetical protein bcere0013_34260 [Bacillus cereus BDRD-ST26] Length = 293 Score = 102 bits (253), Expect = 2e-20, Method: Composition-based stats. Identities = 25/151 (16%), Positives = 63/151 (41%), Gaps = 9/151 (5%) Query: 20 LSDVVSRITPEDTPIYSMIKKG----TTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTIN 75 L+D ++ + P TP ++++ + W L + EG + + + Sbjct: 15 LTDEIALVAPIATPFFTLLMSKGLYVDSKGKFHTWREKTLDGTADISVDEGIDATQFVQS 74 Query: 76 TPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGSE 135 + N +I K+ +SGT +A VG + ++ + +E+ +E L++ ++ Sbjct: 75 GRAELNNVMEIFYKATSVSGTAQATGAVG--DLFAQEINDRLIELAIGMEKKLINGVKND 132 Query: 136 -KTSPRKMAALSSWIKKNASRGTGGVLEDMI 165 + R+M + ++ + G +D++ Sbjct: 133 GASGKRQMDGILKFVDADNVV--NGATKDVL 161 >gi|307286482|ref|ZP_07566582.1| hypothetical protein HMPREF9505_00059 [Enterococcus faecalis TX0109] gi|306502395|gb|EFM71671.1| hypothetical protein HMPREF9505_00059 [Enterococcus faecalis TX0109] Length = 300 Score = 102 bits (253), Expect = 3e-20, Method: Composition-based stats. Identities = 38/152 (25%), Positives = 66/152 (43%), Gaps = 7/152 (4%) Query: 19 SLSDVVSRITPEDTPIYS----MIKKGTTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTI 74 +S ++ + TP S K S +W +L +AQLEG +Y+ Sbjct: 13 DISQEINALQRPSTPFLSWLLGAGKTSPATSTEIKWRESELDGEDSSAQLEGGDYTD-AD 71 Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS 134 + + NYT+I RKS +SGT +A++ G + Q ++ALE++ D+ L+ + Sbjct: 72 SGRKWFNNYTEIFRKSTSVSGTLDAINVNGVGSELANQVSQRALEMKLDLNKKLLIGVKA 131 Query: 135 --EKTSPRKMAALSSWIKKNASRGTGGVLEDM 164 T R+MA + + I + T Sbjct: 132 NENGTKGRQMAGVINLINSDNLVKTSAADAVT 163 >gi|226305754|ref|YP_002765714.1| hypothetical protein RER_22670 [Rhodococcus erythropolis PR4] gi|226184871|dbj|BAH32975.1| hypothetical protein RER_22670 [Rhodococcus erythropolis PR4] Length = 317 Score = 101 bits (252), Expect = 3e-20, Method: Composition-based stats. Identities = 33/190 (17%), Positives = 60/190 (31%), Gaps = 26/190 (13%) Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKK----GTTHSIHPEWVVDDLA 56 M + T + + + +++ EDTP S I T S W DL Sbjct: 1 MPGITGMGTTYNL----PNYVGELFQLSTEDTPFLSAIGGLTGGEDTGSTIFTWQTADLR 56 Query: 57 SPGPNAQL-EGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVD---------DVG-- 104 Q EG + N +I ++ +S T++ G Sbjct: 57 DADETRQRLEGADAPTAEGRKRSSGSNVLEIHQEQVSVSYTKQGATRQLTGTDPMQAGVQ 116 Query: 105 -YILKYKEQKLKKALEIRKDVEFALVSSQ---GSEKTSPRKMAALSSWIKKNASRGTGGV 160 + Q + +I +DVE + + ++ T+ R+ + + N Sbjct: 117 PVTDELTFQTAAEIKQIARDVEKSFIVGTYNLPTDNTTKRRTRGILEAVTSNVVTNGTPA 176 Query: 161 --LEDMILSL 168 E M+L L Sbjct: 177 ALTETMLLDL 186 >gi|57237589|ref|YP_178603.1| hypothetical protein CJE0587 [Campylobacter jejuni RM1221] gi|57166393|gb|AAW35172.1| hypothetical protein CJE0587 [Campylobacter jejuni RM1221] Length = 344 Score = 101 bits (252), Expect = 3e-20, Method: Composition-based stats. Identities = 43/183 (23%), Positives = 76/183 (41%), Gaps = 20/183 (10%) Query: 1 MTIVN--NTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIH-PEWVVDDLAS 57 M + + +T + + K+S+ + + +I +TPI + I + W+ D Sbjct: 1 MALPSMGHTSPATENVKLKQSIYETIIKIGATETPILNKIGTSKVTNPLTHSWITDTFEE 60 Query: 58 PGPNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKA 117 P NA LE ++ +T NT ++ N TQI ++S + G + + Q KK Sbjct: 61 PKKNANLELSKFVGETKNTAQKTTNATQIFITEAMVSKALLKANQYG-GNEMEYQIGKKT 119 Query: 118 LEIRKDVEFALVS---------------SQGSEKTSPRKMAALSSWIKKNASRGTGGVLE 162 E + D+E+AL Q E TS +MA L +I K + G Sbjct: 120 KEHKMDMEYALFGLGRDSDVKKSVFKDYVQAQEATSG-EMAGLFHYIAKGKDSFSDGKRG 178 Query: 163 DMI 165 +++ Sbjct: 179 NVL 181 >gi|315929828|gb|EFV08993.1| hypothetical protein CSS_0883 [Campylobacter jejuni subsp. jejuni 305] Length = 344 Score = 100 bits (250), Expect = 5e-20, Method: Composition-based stats. Identities = 43/183 (23%), Positives = 75/183 (40%), Gaps = 20/183 (10%) Query: 1 MTIVN--NTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIH-PEWVVDDLAS 57 M + + +T + + K+S+ + + +I +TPI + I + W+ D Sbjct: 1 MALPSMGHTSPATENVKLKQSIYETIIKIGATETPILNKIGTSKVTNPLTHSWITDTFEE 60 Query: 58 PGPNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKA 117 P NA LE ++ +T NT ++ N TQI ++S + G + + Q KK Sbjct: 61 PKKNANLELSKFVGETKNTAQKTTNATQIFITEAMVSKALLKANQYG-GNEMEYQIGKKT 119 Query: 118 LEIRKDVEFALVS---------------SQGSEKTSPRKMAALSSWIKKNASRGTGGVLE 162 E + D+E+AL Q E TS +MA L +I K G Sbjct: 120 KEHKMDMEYALFGLGRDSDVKKSVFKDYVQAQEATSG-EMAGLFHYIAKGKDNFADGKRG 178 Query: 163 DMI 165 +++ Sbjct: 179 NVL 181 >gi|283956330|ref|ZP_06373810.1| hypothetical protein C1336_000250101 [Campylobacter jejuni subsp. jejuni 1336] gi|283792050|gb|EFC30839.1| hypothetical protein C1336_000250101 [Campylobacter jejuni subsp. jejuni 1336] Length = 344 Score = 100 bits (249), Expect = 6e-20, Method: Composition-based stats. Identities = 43/183 (23%), Positives = 75/183 (40%), Gaps = 20/183 (10%) Query: 1 MTIVN--NTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIH-PEWVVDDLAS 57 M + + +T + + K+S+ + + +I +TPI + I + W+ D Sbjct: 1 MALPSMGHTAPATENVKLKQSIYETIIKIGATETPILNKIGTSKVTNPLTHSWITDTFEE 60 Query: 58 PGPNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKA 117 P NA LE ++ +T NT ++ N TQI ++S + G + + Q KK Sbjct: 61 PKKNANLELSKFVGETKNTAQKTTNATQIFITEAMVSKALLKANQYG-GNEMEYQIGKKT 119 Query: 118 LEIRKDVEFALVS---------------SQGSEKTSPRKMAALSSWIKKNASRGTGGVLE 162 E + D+E+AL Q E TS +MA L +I K G Sbjct: 120 KEHKMDMEYALFGLGRDSDVKKSVFKDYVQAQEATSG-EMAGLFHYIAKGKDSFADGKRG 178 Query: 163 DMI 165 +++ Sbjct: 179 NVL 181 >gi|281417131|ref|ZP_06248151.1| conserved hypothetical protein [Clostridium thermocellum JW20] gi|281408533|gb|EFB38791.1| conserved hypothetical protein [Clostridium thermocellum JW20] Length = 292 Score = 100 bits (249), Expect = 7e-20, Method: Composition-based stats. Identities = 28/155 (18%), Positives = 61/155 (39%), Gaps = 12/155 (7%) Query: 3 IVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGT----THSIHPEWVVDDLASP 58 I + F T + LS + I+P DTP+ +++ S+ W L Sbjct: 2 IKTSHFTTHENI----DLSKEIVLISPSDTPLTTLLMNKKLVETAGSVTINWREKTLDDT 57 Query: 59 GPNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKAL 118 ++ EG + N +I K+ +SG+ +A + G + + + Sbjct: 58 EDISKTEGFTVDTFVSSGRAEKSNVMEIFSKAVQVSGSAQASNITGINDLFASEISDRLT 117 Query: 119 EIRKDVEFALVS----SQGSEKTSPRKMAALSSWI 149 E++ ++E +++ + GS R+M ++ + Sbjct: 118 EVKVNIEKKMLAPKNYNDGSSAPFIRRMKSIFEQV 152 >gi|238909129|ref|YP_002939596.1| hypothetical protein EUBELI_10025 [Eubacterium eligens ATCC 27750] gi|238873366|gb|ACR73075.1| Hypothetical protein EUBELI_10025 [Eubacterium eligens ATCC 27750] Length = 304 Score = 100 bits (249), Expect = 7e-20, Method: Composition-based stats. Identities = 32/151 (21%), Positives = 69/151 (45%), Gaps = 6/151 (3%) Query: 19 SLSDVVSRITPEDTPIYSMIKKGT----THSIHPEWVVDDLASPGPNAQLEGDEYSFKTI 74 L++ + + +P DTP+ +++ I W +L S +LEG E Sbjct: 17 DLTEEIKQTSPTDTPLTTLLMSRGQVVPAKDITVTWREKELNSERGTLKLEGSEAGEVIT 76 Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS 134 ++ + + N QI+ K +SGT +++ +G + + + +E ++D+E+ ++ + Sbjct: 77 SSRKTLSNVCQIIEKVTQVSGTARSLNPMGINDVFNAEVQDRLVETKRDMEWYFLNGTKA 136 Query: 135 --EKTSPRKMAALSSWIKKNASRGTGGVLED 163 +PR+M L + + N T G L + Sbjct: 137 LESGATPRQMNGLVNLVNANNVVETKGALTE 167 >gi|239828160|ref|YP_002950784.1| hypothetical protein GWCH70_2835 [Geobacillus sp. WCH70] gi|239808453|gb|ACS25518.1| conserved hypothetical protein [Geobacillus sp. WCH70] Length = 283 Score = 99.4 bits (246), Expect = 2e-19, Method: Composition-based stats. Identities = 25/146 (17%), Positives = 48/146 (32%), Gaps = 4/146 (2%) Query: 7 TFITSS-STTNKESLSDVVSRITPEDTPIYS--MIKKGTTHSIHPEWVVDDLASPGPNAQ 63 F + + L DV+ + + P + M K S W+ +++A Sbjct: 1 MFTSQDFAVGQNYDLKDVLIEVNKKQNPFVTFLMSKTVKATSPQVHWITEEIADSAVTLA 60 Query: 64 LEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKD 123 GD +F R NY +I + ++ T + VG + KK I++ Sbjct: 61 EGGDAPAFVKDTLAPRE-NYLEIFAATATVTNTAQYSKAVGINDLLAHEVEKKTKAIKRR 119 Query: 124 VEFALVSSQGSEKTSPRKMAALSSWI 149 +E + + + I Sbjct: 120 MENKFIHGTKGYSNGVYTTDGILAQI 145 >gi|256956794|ref|ZP_05560965.1| conserved hypothetical protein [Enterococcus faecalis DS5] gi|256947290|gb|EEU63922.1| conserved hypothetical protein [Enterococcus faecalis DS5] gi|295113775|emb|CBL32412.1| hypothetical protein [Enterococcus sp. 7L76] gi|315035894|gb|EFT47826.1| conserved hypothetical protein [Enterococcus faecalis TX0027] Length = 300 Score = 99.0 bits (245), Expect = 2e-19, Method: Composition-based stats. Identities = 40/152 (26%), Positives = 66/152 (43%), Gaps = 7/152 (4%) Query: 19 SLSDVVSRITPEDTPIYS---MIKKGT-THSIHPEWVVDDLASPGPNAQLEGDEYSFKTI 74 +S V+ + TP S K + S +W +L +AQLEG EY Sbjct: 13 DISQEVNALQRPSTPFLSWLLGAGKTSPATSTEIKWRESELDGEDSSAQLEGGEY-RDAD 71 Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS 134 + + NYT+I RKS +SGT +A++ G + Q ++ALE++ D+ L+ + Sbjct: 72 SGRKWFNNYTEIFRKSTSVSGTLDAINVNGVGSELANQVSQRALEMKLDLNKKLLIGVKA 131 Query: 135 --EKTSPRKMAALSSWIKKNASRGTGGVLEDM 164 T R+MA + + I + T Sbjct: 132 DENGTKGRQMAGVINLINSDNLVKTSAADAVT 163 >gi|134298256|ref|YP_001111752.1| hypothetical protein Dred_0379 [Desulfotomaculum reducens MI-1] gi|134050956|gb|ABO48927.1| hypothetical protein Dred_0379 [Desulfotomaculum reducens MI-1] Length = 285 Score = 98.6 bits (244), Expect = 3e-19, Method: Composition-based stats. Identities = 24/156 (15%), Positives = 55/156 (35%), Gaps = 5/156 (3%) Query: 13 STTNKESLSDVVSRITPEDTPIYSMI--KKGTTHSIHPEWVVDDLASPGPNAQLEGDEYS 70 + DV+ + TP TP +++ K + W+ + + EG + Sbjct: 8 VAGQSIDMKDVLIQTTPILTPFTTLLLPKTVKAENATLNWIEEAINESAAVTLGEGADAP 67 Query: 71 FKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVS 130 +T + NY +++ + +S T +A + G + +KK ++ +E L++ Sbjct: 68 NPVDDTLAPISNYCELIGATATVSNTAQATNAKGISDLLAHEIVKKTKAMKIKMENILIN 127 Query: 131 SQGS--EKTSPRKMAALSSWIKKNASRGTGGVLEDM 164 T + + I ++ T Sbjct: 128 GTKGYVSATKTYTTDGILAQINP-VNQVTNATFTKT 162 >gi|307270079|ref|ZP_07551399.1| hypothetical protein HMPREF9498_02197 [Enterococcus faecalis TX4248] gi|306513574|gb|EFM82186.1| hypothetical protein HMPREF9498_02197 [Enterococcus faecalis TX4248] Length = 300 Score = 98.6 bits (244), Expect = 3e-19, Method: Composition-based stats. Identities = 36/152 (23%), Positives = 65/152 (42%), Gaps = 7/152 (4%) Query: 19 SLSDVVSRITPEDTPIYS----MIKKGTTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTI 74 +S ++ + TP S K S +W ++ +AQLEG EY+ Sbjct: 13 DISQEINALQRPSTPFLSWLLGAGKTRPATSTEIKWREYEMNGEDSSAQLEGGEYNE-AE 71 Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS 134 + + NY +I RKS +SGT +A++ G + Q ++ALE++ D+ L+ + Sbjct: 72 SGRKWFNNYAEIFRKSTSVSGTLDAINVNGVGSELANQVSQRALEMKLDLNKKLLIGVKA 131 Query: 135 --EKTSPRKMAALSSWIKKNASRGTGGVLEDM 164 + R+MA + + I + T Sbjct: 132 DENGSKGRQMAGVINLINSDNLVKTSAADAVT 163 >gi|256375780|ref|YP_003099440.1| hypothetical protein Amir_1645 [Actinosynnema mirum DSM 43827] gi|255920083|gb|ACU35594.1| hypothetical protein Amir_1645 [Actinosynnema mirum DSM 43827] Length = 406 Score = 98.6 bits (244), Expect = 3e-19, Method: Composition-based stats. Identities = 40/191 (20%), Positives = 62/191 (32%), Gaps = 31/191 (16%) Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKK----GTTHSIHPEWVVDDLA 56 M + T + L +TPEDTP+ S I S EW DL Sbjct: 1 MAGITGMGTTFNLPNYHGEL----FGLTPEDTPLLSAIGGLGSGSEITSKEWEWQAYDLR 56 Query: 57 SPGPNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAV------------DDVG 104 P LEG N QI+ + S T++A Sbjct: 57 DPAQRVALEGQTAPTGEARVRTNFSNVVQIVHERVSTSYTKQAAIGQFAANSAPISGANP 116 Query: 105 YILKYKEQKLKKALEIRKDVEFALVSSQ---GSEKTSPRKMAALSSWIK--------KNA 153 ++ Q + +I +DV + ++ Q S+ +SPRK L + N Sbjct: 117 ITDEHDWQVTQAVKQIARDVNWTCINGQYAKPSDNSSPRKTRGLMQAVSAANTVDRGSNV 176 Query: 154 SRGTGGVLEDM 164 + G V + + Sbjct: 177 ATGASSVTDTI 187 >gi|315173098|gb|EFU17115.1| conserved hypothetical protein [Enterococcus faecalis TX1346] Length = 300 Score = 98.6 bits (244), Expect = 3e-19, Method: Composition-based stats. Identities = 40/152 (26%), Positives = 66/152 (43%), Gaps = 7/152 (4%) Query: 19 SLSDVVSRITPEDTPIYS---MIKKGT-THSIHPEWVVDDLASPGPNAQLEGDEYSFKTI 74 +S V+ + TP S K + S +W +L +AQLEG EY Sbjct: 13 DISQEVNALQRPSTPFLSWLLGAGKTSPATSTEIKWRESELDGEDSSAQLEGGEY-KDAD 71 Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS 134 + + NYT+I RKS +SGT +A++ G + Q ++ALE++ D+ L+ + Sbjct: 72 SGRKWFNNYTEIFRKSTSVSGTLDAINVNGVGSELANQVSQRALEMKLDLNKKLLIGVKA 131 Query: 135 --EKTSPRKMAALSSWIKKNASRGTGGVLEDM 164 T R+MA + + I + T Sbjct: 132 DENGTKGRQMAGVINLINSDNLVKTSAADAVT 163 >gi|134299981|ref|YP_001113477.1| hypothetical protein Dred_2135 [Desulfotomaculum reducens MI-1] gi|134052681|gb|ABO50652.1| hypothetical protein Dred_2135 [Desulfotomaculum reducens MI-1] Length = 285 Score = 98.2 bits (243), Expect = 4e-19, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 56/156 (35%), Gaps = 5/156 (3%) Query: 13 STTNKESLSDVVSRITPEDTPIYSMI--KKGTTHSIHPEWVVDDLASPGPNAQLEGDEYS 70 T + DV+ + TP TP +++ K + W+ + + EG + Sbjct: 8 VTGQSIDMKDVLIQTTPILTPFTTLLLPKTVKAENATLNWIEEAINENAAVTLGEGADAP 67 Query: 71 FKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVS 130 +T NY +++ + +S T +A + G + +KK ++ +E L++ Sbjct: 68 NPVDDTLTPCSNYCELVGATATVSNTAQATNAKGISDLLAHETVKKTKAMKIRMENILIN 127 Query: 131 SQGS--EKTSPRKMAALSSWIKKNASRGTGGVLEDM 164 T + + I A++ T Sbjct: 128 GTKGYVSATKTYTTDGILAQINP-ANKVTNATFTKT 162 >gi|257079386|ref|ZP_05573747.1| predicted protein [Enterococcus faecalis JH1] gi|256987416|gb|EEU74718.1| predicted protein [Enterococcus faecalis JH1] Length = 300 Score = 97.8 bits (242), Expect = 4e-19, Method: Composition-based stats. Identities = 36/152 (23%), Positives = 65/152 (42%), Gaps = 7/152 (4%) Query: 19 SLSDVVSRITPEDTPIYS----MIKKGTTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTI 74 +S ++ + TP S K S +W ++ +AQLEG EY+ Sbjct: 13 DISQEINALQRPSTPFLSWLLGAGKTRPATSTEIKWREYEMNGEDSSAQLEGGEYNE-AE 71 Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS 134 + + NY +I RKS +SGT +A++ G + Q ++ALE++ D+ L+ + Sbjct: 72 SGRKWFNNYAEIFRKSTSVSGTLDAINVNGVGSELANQVSQRALEMKLDLNKKLLIGVKA 131 Query: 135 --EKTSPRKMAALSSWIKKNASRGTGGVLEDM 164 + R+MA + + I + T Sbjct: 132 DENGSKGRQMAGVINLINSDNLVKTSAADAVT 163 >gi|307280635|ref|ZP_07561683.1| hypothetical protein HMPREF9515_01677 [Enterococcus faecalis TX0860] gi|306504001|gb|EFM73218.1| hypothetical protein HMPREF9515_01677 [Enterococcus faecalis TX0860] Length = 300 Score = 97.8 bits (242), Expect = 5e-19, Method: Composition-based stats. Identities = 39/152 (25%), Positives = 65/152 (42%), Gaps = 7/152 (4%) Query: 19 SLSDVVSRITPEDTPIYS---MIKKGT-THSIHPEWVVDDLASPGPNAQLEGDEYSFKTI 74 +S V+ + TP S K + S +W +L +AQLEG EY Sbjct: 13 DISQEVNALQRPSTPFLSWLLGAGKTSPATSTEIKWRESELDGEDSSAQLEGGEY-KDAD 71 Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS 134 + + NYT+I RKS +SGT +A++ G + Q ++ALE++ D+ L+ + Sbjct: 72 SGRKWFSNYTEIFRKSTSVSGTLDAINVNGVGSELANQVSQRALEMKLDLNKKLLIGVKA 131 Query: 135 --EKTSPRKMAALSSWIKKNASRGTGGVLEDM 164 R+MA + + I + T Sbjct: 132 DENGDKGRQMAGVINLINSDNLVKTSAADAVT 163 >gi|152975085|ref|YP_001374602.1| hypothetical protein Bcer98_1285 [Bacillus cereus subsp. cytotoxis NVH 391-98] gi|152023837|gb|ABS21607.1| conserved hypothetical protein [Bacillus cytotoxicus NVH 391-98] Length = 293 Score = 97.4 bits (241), Expect = 6e-19, Method: Composition-based stats. Identities = 26/151 (17%), Positives = 62/151 (41%), Gaps = 9/151 (5%) Query: 20 LSDVVSRITPEDTPIYSMIKKG----TTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTIN 75 L+D ++ + P TP ++++ + W L EG + + + Sbjct: 15 LTDEIALVAPIATPFFTLLMSKGLYVDSKGKFHTWREKTLDGTADITVDEGVDATQFVQS 74 Query: 76 TPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGSE 135 + N +I K+ +SGT ++ VG + ++ + +E+ +E L++ ++ Sbjct: 75 GRAELNNVMEIFYKATSVSGTAQSTGAVG--DLFAQEINDRLVELAIGIENKLINGVKND 132 Query: 136 -KTSPRKMAALSSWIKKNASRGTGGVLEDMI 165 + R+M L ++ GV +D++ Sbjct: 133 GASGKRQMDGLLKFVDAGNVV--NGVTKDVL 161 >gi|256377352|ref|YP_003101012.1| hypothetical protein Amir_3259 [Actinosynnema mirum DSM 43827] gi|255921655|gb|ACU37166.1| hypothetical protein Amir_3259 [Actinosynnema mirum DSM 43827] Length = 329 Score = 96.7 bits (239), Expect = 1e-18, Method: Composition-based stats. Identities = 34/193 (17%), Positives = 68/193 (35%), Gaps = 35/193 (18%) Query: 4 VNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMI----KKGTTHSIHPEWVVDDLASPG 59 + NT+ + + +TP DTP S I ++ W V DL P Sbjct: 7 IANTYNAPNFVGE-------LFSLTPSDTPFLSAIGGLTGGRRATAVIHTWTVYDLRPPD 59 Query: 60 PNAQL-EGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVG-------------- 104 P+ Q EG + + N +I ++S ++ T++A + Sbjct: 60 PDRQRAEGADAPPAEGRIRGQERNVVEIHQESVGVTYTRQATQAMFAGTGAANPNAAAIG 119 Query: 105 ----YILKYKEQKLKKALEIRKDVEFALVSSQ---GSEKTSPRKMAALSSWIKKNASRGT 157 + Q + ++I +DVE + + ++ ++ RK + + N + Sbjct: 120 GTNAVANEMDWQTQQALVQIARDVEATFLVGRYQEPTDNSTVRKTRGILEATRTNVITNS 179 Query: 158 GGVL--EDMILSL 168 E M++ L Sbjct: 180 TPTPLTESMVIDL 192 >gi|257879565|ref|ZP_05659218.1| conserved hypothetical protein [Enterococcus faecium 1,230,933] gi|257891545|ref|ZP_05671198.1| conserved hypothetical protein [Enterococcus faecium 1,231,410] gi|314940388|ref|ZP_07847550.1| conserved hypothetical protein [Enterococcus faecium TX0133a04] gi|314943205|ref|ZP_07849996.1| conserved hypothetical protein [Enterococcus faecium TX0133C] gi|314949154|ref|ZP_07852509.1| conserved hypothetical protein [Enterococcus faecium TX0082] gi|314951966|ref|ZP_07854992.1| conserved hypothetical protein [Enterococcus faecium TX0133A] gi|314993065|ref|ZP_07858455.1| conserved hypothetical protein [Enterococcus faecium TX0133B] gi|314995396|ref|ZP_07860499.1| conserved hypothetical protein [Enterococcus faecium TX0133a01] gi|257813793|gb|EEV42551.1| conserved hypothetical protein [Enterococcus faecium 1,230,933] gi|257827905|gb|EEV54531.1| conserved hypothetical protein [Enterococcus faecium 1,231,410] gi|313590399|gb|EFR69244.1| conserved hypothetical protein [Enterococcus faecium TX0133a01] gi|313592421|gb|EFR71266.1| conserved hypothetical protein [Enterococcus faecium TX0133B] gi|313595906|gb|EFR74751.1| conserved hypothetical protein [Enterococcus faecium TX0133A] gi|313598089|gb|EFR76934.1| conserved hypothetical protein [Enterococcus faecium TX0133C] gi|313640428|gb|EFS05008.1| conserved hypothetical protein [Enterococcus faecium TX0133a04] gi|313644467|gb|EFS09047.1| conserved hypothetical protein [Enterococcus faecium TX0082] Length = 296 Score = 95.5 bits (236), Expect = 2e-18, Method: Composition-based stats. Identities = 32/152 (21%), Positives = 63/152 (41%), Gaps = 7/152 (4%) Query: 19 SLSDVVSRITPEDTPIYS----MIKKGTTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTI 74 +S ++ + +TP S K +S +W D+ + + +LEG +Y Sbjct: 13 DISPAINAMQVPNTPFLSYLLGAGKTEQANSTEIKWREYDINNDDSSEKLEGGDYPD-AE 71 Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS 134 + NYT+I RKS +SGT +A++ G + Q + +E++ D+ L++ + Sbjct: 72 SGRNWFNNYTEIFRKSTSVSGTLDAINVNGVGNELTNQVALRGMEMKIDLNRKLITGVKA 131 Query: 135 --EKTSPRKMAALSSWIKKNASRGTGGVLEDM 164 + R+M + + I T Sbjct: 132 DENGSKGRRMNGILNLINSANKAETATAGAVT 163 >gi|188585861|ref|YP_001917406.1| conserved hypothetical protein [Natranaerobius thermophilus JW/NM-WN-LF] gi|179350548|gb|ACB84818.1| conserved hypothetical protein [Natranaerobius thermophilus JW/NM-WN-LF] Length = 289 Score = 95.1 bits (235), Expect = 3e-18, Method: Composition-based stats. Identities = 29/165 (17%), Positives = 55/165 (33%), Gaps = 6/165 (3%) Query: 6 NTFITSSSTTNKESLSDVVSRITPEDTPIYS--MIKKGTTHSIHPEWVVDDLASPGPNAQ 63 N F+ S +S V+ TPI S M+++ + WV ++ Q Sbjct: 5 NNFLQYESI----DMSGVLEVTNVPQTPITSLLMVRQVQAQAPQVHWVEVEIDESSAVTQ 60 Query: 64 LEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKD 123 EGD+ + E NY +I + +S T + + K IR Sbjct: 61 GEGDDAPEHKTDNRELKENYLEIFGATAKVSNTAQYSTSETVNDLLAHEVELKTQSIRNR 120 Query: 124 VEFALVSSQGSEKTSPRKMAALSSWIKKNASRGTGGVLEDMILSL 168 +E ++ + + + + I + E++ L Sbjct: 121 MENKFINGNKNFADGVYETDGILNLINSENQKTEDEFNENVFLDT 165 >gi|229162523|ref|ZP_04290484.1| hypothetical protein bcere0009_32950 [Bacillus cereus R309803] gi|228621002|gb|EEK77867.1| hypothetical protein bcere0009_32950 [Bacillus cereus R309803] Length = 293 Score = 94.7 bits (234), Expect = 3e-18, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 57/152 (37%), Gaps = 9/152 (5%) Query: 20 LSDVVSRITPEDTPIYSMIKKG----TTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTIN 75 L+D ++ + P TP ++++ + W L EG + + + Sbjct: 15 LTDEIALVAPIATPFFALLMSKGLYVDSKGKFHTWREKTLDGTADITVDEGVDATQFVQS 74 Query: 76 TPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGSE 135 + N +I K+ +SGT +A + ++ + +E+ +E L+S ++ Sbjct: 75 GRAELNNVMEIFYKATSVSGTAQATG--AVSDLFAQEINDRLVELAIGIEKKLISGIKND 132 Query: 136 -KTSPRKMAALSSWIKKNASRGTGGVLEDMIL 166 + R+M + + G +++ Sbjct: 133 GASGKRQMDGILKFADAGNVV--NGATANVLQ 162 >gi|323703894|ref|ZP_08115527.1| hypothetical protein DesniDRAFT_2739 [Desulfotomaculum nigrificans DSM 574] gi|323531143|gb|EGB21049.1| hypothetical protein DesniDRAFT_2739 [Desulfotomaculum nigrificans DSM 574] Length = 285 Score = 94.0 bits (232), Expect = 6e-18, Method: Composition-based stats. Identities = 27/149 (18%), Positives = 55/149 (36%), Gaps = 5/149 (3%) Query: 15 TNKESLSDVVSRITPEDTPIYSMI--KKGTTHSIHPEWVVDDLASPGPNAQLEGDEYSFK 72 + DV+ + TP TP +++ K ++ W+ + + EG + Sbjct: 10 GQSIDMKDVLIQTTPVLTPFTTLLLDKTVKAENVTLNWIEEAINESAAVTLGEGADAPAV 69 Query: 73 TINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQ 132 +T M NY +++ + +S T +A G + +KK ++ +E L++ Sbjct: 70 VDDTLAPMTNYCELIGATATVSNTAQATTAKGISDLLAHEVVKKTKAMKMRMENILINGT 129 Query: 133 GS--EKTSPRKMAALSSWIK-KNASRGTG 158 S T + + I N T Sbjct: 130 KSYDATTKTYTTDGILAQIDPANQVTNTS 158 >gi|153951462|ref|YP_001398222.1| hypothetical protein JJD26997_1140 [Campylobacter jejuni subsp. doylei 269.97] gi|152938908|gb|ABS43649.1| hypothetical protein JJD26997_1140 [Campylobacter jejuni subsp. doylei 269.97] Length = 344 Score = 93.6 bits (231), Expect = 8e-18, Method: Composition-based stats. Identities = 43/176 (24%), Positives = 72/176 (40%), Gaps = 20/176 (11%) Query: 1 MTIV--NNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIH-PEWVVDDLAS 57 M + +T + + K+S+ + + +I +TPI + I + W+ D Sbjct: 1 MALPSMAHTPPATENVKLKQSIYETIIKIGATETPILNKIGTSKVSNPLTHSWITDTFEE 60 Query: 58 PGPNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKA 117 P NA LE ++ +T NT ++ N TQI ++S + G + + Q KK Sbjct: 61 PKKNANLELSKFVGETKNTTQKTTNATQIFITEAMVSKALLKANQYG-GNEMEYQIGKKT 119 Query: 118 LEIRKDVEFALVSS---------------QGSEKTSPRKMAALSSWIKKNASRGTG 158 E + D+E+AL+ Q E TS +MA L +I K T Sbjct: 120 KEHKMDMEYALLGLGRDNDVKTSVFKDYIQAQEATSG-EMAGLFHYIAKGKDSFTD 174 >gi|257883499|ref|ZP_05663152.1| conserved hypothetical protein [Enterococcus faecium 1,231,502] gi|261208026|ref|ZP_05922703.1| conserved hypothetical protein [Enterococcus faecium TC 6] gi|289567093|ref|ZP_06447488.1| conserved hypothetical protein [Enterococcus faecium D344SRF] gi|294622496|ref|ZP_06701518.1| conserved hypothetical protein [Enterococcus faecium U0317] gi|257819157|gb|EEV46485.1| conserved hypothetical protein [Enterococcus faecium 1,231,502] gi|260077743|gb|EEW65457.1| conserved hypothetical protein [Enterococcus faecium TC 6] gi|289161108|gb|EFD09013.1| conserved hypothetical protein [Enterococcus faecium D344SRF] gi|291598043|gb|EFF29153.1| conserved hypothetical protein [Enterococcus faecium U0317] Length = 296 Score = 93.2 bits (230), Expect = 1e-17, Method: Composition-based stats. Identities = 33/152 (21%), Positives = 63/152 (41%), Gaps = 7/152 (4%) Query: 19 SLSDVVSRITPEDTPIYS----MIKKGTTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTI 74 +S ++ + +TP S K +S +W D+ + + +LEG EY Sbjct: 13 DISPAINAMQVPNTPFLSYLFGAGKTEPANSTEIKWREYDINNDDSSEKLEGGEYPD-AE 71 Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS 134 + NYT+I RKS +SGT +A++ G + Q + +E++ D+ L++ + Sbjct: 72 SGRTWFNNYTEIFRKSTSVSGTLDAINVNGVGNELTNQVALRGMEMKIDLNRKLITGVKA 131 Query: 135 --EKTSPRKMAALSSWIKKNASRGTGGVLEDM 164 + R+M + + I T Sbjct: 132 DENGSKGRRMNGILNLINSANKAETATAGAVT 163 >gi|257893408|ref|ZP_05673061.1| conserved hypothetical protein [Enterococcus faecium 1,231,408] gi|257829787|gb|EEV56394.1| conserved hypothetical protein [Enterococcus faecium 1,231,408] Length = 296 Score = 93.2 bits (230), Expect = 1e-17, Method: Composition-based stats. Identities = 33/152 (21%), Positives = 63/152 (41%), Gaps = 7/152 (4%) Query: 19 SLSDVVSRITPEDTPIYS----MIKKGTTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTI 74 +S ++ + +TP S K +S +W D+ + + +LEG EY Sbjct: 13 DISPAINAMQVPNTPFLSYLLGAGKTEPANSTEIKWREYDINNDDSSEKLEGGEYPD-AE 71 Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS 134 + NYT+I RKS +SGT +A++ G + Q + +E++ D+ L++ + Sbjct: 72 SGRTWFNNYTEIFRKSTSVSGTLDAINVNGVGNELTNQVALRGMEMKIDLNRKLITGVKA 131 Query: 135 --EKTSPRKMAALSSWIKKNASRGTGGVLEDM 164 + R+M + + I T Sbjct: 132 DENGSKGRRMNGILNLINSANKAETATAGAVT 163 >gi|294614769|ref|ZP_06694669.1| hypothetical protein EfmE1636_0859 [Enterococcus faecium E1636] gi|291592381|gb|EFF23990.1| hypothetical protein EfmE1636_0859 [Enterococcus faecium E1636] Length = 296 Score = 92.8 bits (229), Expect = 1e-17, Method: Composition-based stats. Identities = 33/152 (21%), Positives = 63/152 (41%), Gaps = 7/152 (4%) Query: 19 SLSDVVSRITPEDTPIYS----MIKKGTTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTI 74 +S ++ + +TP S K +S +W D+ + + +LEG EY Sbjct: 13 DISPAINAMQVPNTPFLSYLLGAGKTEPANSTEIKWREYDINNDDSSEKLEGGEYPD-AE 71 Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS 134 + NYT+I RKS +SGT +A++ G + Q + +E++ D+ L++ + Sbjct: 72 SGRTWFNNYTEIFRKSTSVSGTLDAINVNGVGNELTNQVALRGMEMKIDLNRKLITGVKA 131 Query: 135 --EKTSPRKMAALSSWIKKNASRGTGGVLEDM 164 + R+M + + I T Sbjct: 132 DENSSKGRRMNGILNLINSANKAETATAGAVT 163 >gi|114566839|ref|YP_753993.1| hypothetical protein Swol_1314 [Syntrophomonas wolfei subsp. wolfei str. Goettingen] gi|114337774|gb|ABI68622.1| hypothetical protein Swol_1314 [Syntrophomonas wolfei subsp. wolfei str. Goettingen] Length = 398 Score = 91.7 bits (226), Expect = 3e-17, Method: Composition-based stats. Identities = 32/154 (20%), Positives = 65/154 (42%), Gaps = 11/154 (7%) Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKG---TTHSIHPEWVVDDLAS 57 M N +L+ +S ++P D P+ ++I TT S W L + Sbjct: 1 MIKTTNFTD-----LENINLTKEISLVSPMDCPLTTIIMGKGYDTTGSKIVTWREKTLDN 55 Query: 58 PGPNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKA 117 +Q+EG + + N +I +K+ +SGT +A G + E+ + Sbjct: 56 TEDISQVEGSTTNTFQSSARAEKSNVCEIFKKATSISGTADASSITGVSNLFAEEINDRL 115 Query: 118 LEIRKDVEFALVSSQGSEKTS---PRKMAALSSW 148 +E++ ++E L++ + ++ RKM L ++ Sbjct: 116 IEMKVNIEKKLINGTKDDGSTSPYVRKMDGLLAF 149 >gi|29376526|ref|NP_815680.1| hypothetical protein EF2011 [Enterococcus faecalis V583] gi|227555439|ref|ZP_03985486.1| conserved hypothetical protein [Enterococcus faecalis HH22] gi|29343990|gb|AAO81750.1| hypothetical protein EF_2011 [Enterococcus faecalis V583] gi|227175420|gb|EEI56392.1| conserved hypothetical protein [Enterococcus faecalis HH22] Length = 295 Score = 91.7 bits (226), Expect = 3e-17, Method: Composition-based stats. Identities = 36/152 (23%), Positives = 63/152 (41%), Gaps = 7/152 (4%) Query: 19 SLSDVVSRITPEDTPIYS---MIKK-GTTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTI 74 +S V+ + +TP S K S +W + + +AQLEG EY+ Sbjct: 13 DISQEVNALQVPNTPFLSYLLGAGKVEAAKSTEIKWREYGMNNDDSSAQLEGGEYAD-AE 71 Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS 134 + NYT+I RKS +SGT +A++ G + Q +A E++ D+ L+ + Sbjct: 72 SDRTWFNNYTEIFRKSTSVSGTLDAINVDGVGNELNSQVALRATEMKIDLNRKLIVGVKA 131 Query: 135 E--KTSPRKMAALSSWIKKNASRGTGGVLEDM 164 + + R+M + + I T Sbjct: 132 DESGSKGRQMNGILNLISSTNKVETAAAGAVT 163 >gi|169827502|ref|YP_001697660.1| hypothetical protein Bsph_1941 [Lysinibacillus sphaericus C3-41] gi|168991990|gb|ACA39530.1| conserved hypothetical protein [Lysinibacillus sphaericus C3-41] Length = 288 Score = 91.3 bits (225), Expect = 4e-17, Method: Composition-based stats. Identities = 26/149 (17%), Positives = 61/149 (40%), Gaps = 12/149 (8%) Query: 16 NKESLSDVVSRITPEDTPIYSMIKK----GTTHSIHPEWVVDDLASPGPNAQLEGDEYSF 71 + SL++ ++ I + TP S++ S W L++ + +EG + + Sbjct: 11 ERISLANEIAVIGVQATPFTSLLMAKGNIEKALSTVYTWREKSLSNDEDISAVEGADTTV 70 Query: 72 KTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSS 131 + + N +I +K +SGT EA+ + + + LE++ ++E ++ Sbjct: 71 FYESARAELSNILEIFKKGVQVSGTAEAMQSTQFSAE----VADRLLELKVNMEKKFING 126 Query: 132 QGSEKTSP---RKMAALSSWI-KKNASRG 156 ++ + R+++ L NA Sbjct: 127 LKADGSKAPFKRQLSGLIEMADATNAVTA 155 >gi|319956911|ref|YP_004168174.1| hypothetical protein Nitsa_1172 [Nitratifractor salsuginis DSM 16511] gi|319419315|gb|ADV46425.1| hypothetical protein Nitsa_1172 [Nitratifractor salsuginis DSM 16511] Length = 308 Score = 90.9 bits (224), Expect = 5e-17, Method: Composition-based stats. Identities = 36/155 (23%), Positives = 60/155 (38%), Gaps = 11/155 (7%) Query: 7 TFITSS-STTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASPGPNAQLE 65 T + + K S+ D + P P +G ++ W+ D L P PN LE Sbjct: 2 ALTTYNNTVNQKPSVLDSIILQGPSQVPFLKWFGRGDVNAPKHAWITDRLRDPKPNYNLE 61 Query: 66 GDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVE 125 T +T + N TQI++ + LS + + G ++ + K E KD+E Sbjct: 62 ITGLEEDTEDTKVMLDNVTQIVKNEFGLSRKERSTARYG-QKEWPYRVGKVGKEHAKDLE 120 Query: 126 FALVSSQ---------GSEKTSPRKMAALSSWIKK 151 F L+ Q T+ +MA + +I Sbjct: 121 FNLLGLQNDSVFDNYVPGSDTTEARMAGIFHFIPS 155 >gi|227517040|ref|ZP_03947089.1| conserved hypothetical protein [Enterococcus faecalis TX0104] gi|229545405|ref|ZP_04434130.1| conserved hypothetical protein [Enterococcus faecalis TX1322] gi|229549652|ref|ZP_04438377.1| conserved hypothetical protein [Enterococcus faecalis ATCC 29200] gi|255972349|ref|ZP_05422935.1| predicted protein [Enterococcus faecalis T1] gi|256619486|ref|ZP_05476332.1| conserved hypothetical protein [Enterococcus faecalis ATCC 4200] gi|257090289|ref|ZP_05584650.1| predicted protein [Enterococcus faecalis CH188] gi|300860939|ref|ZP_07107026.1| conserved hypothetical protein [Enterococcus faecalis TUSoD Ef11] gi|307275949|ref|ZP_07557082.1| hypothetical protein HMPREF9521_01574 [Enterococcus faecalis TX2134] gi|307295873|ref|ZP_07575705.1| hypothetical protein HMPREF9509_02949 [Enterococcus faecalis TX0411] gi|312900152|ref|ZP_07759467.1| conserved hypothetical protein [Enterococcus faecalis TX0470] gi|312902789|ref|ZP_07761993.1| conserved hypothetical protein [Enterococcus faecalis TX0635] gi|227075515|gb|EEI13478.1| conserved hypothetical protein [Enterococcus faecalis TX0104] gi|229305317|gb|EEN71313.1| conserved hypothetical protein [Enterococcus faecalis ATCC 29200] gi|229309512|gb|EEN75499.1| conserved hypothetical protein [Enterococcus faecalis TX1322] gi|255963367|gb|EET95843.1| predicted protein [Enterococcus faecalis T1] gi|256599013|gb|EEU18189.1| conserved hypothetical protein [Enterococcus faecalis ATCC 4200] gi|256999101|gb|EEU85621.1| predicted protein [Enterococcus faecalis CH188] gi|295113266|emb|CBL31903.1| hypothetical protein [Enterococcus sp. 7L76] gi|300849978|gb|EFK77728.1| conserved hypothetical protein [Enterococcus faecalis TUSoD Ef11] gi|306496204|gb|EFM65783.1| hypothetical protein HMPREF9509_02949 [Enterococcus faecalis TX0411] gi|306507279|gb|EFM76416.1| hypothetical protein HMPREF9521_01574 [Enterococcus faecalis TX2134] gi|310633843|gb|EFQ17126.1| conserved hypothetical protein [Enterococcus faecalis TX0635] gi|311292711|gb|EFQ71267.1| conserved hypothetical protein [Enterococcus faecalis TX0470] gi|315149052|gb|EFT93068.1| conserved hypothetical protein [Enterococcus faecalis TX4244] gi|315159923|gb|EFU03940.1| conserved hypothetical protein [Enterococcus faecalis TX0312] gi|315167465|gb|EFU11482.1| conserved hypothetical protein [Enterococcus faecalis TX1341] gi|315169417|gb|EFU13434.1| conserved hypothetical protein [Enterococcus faecalis TX1342] gi|315575405|gb|EFU87596.1| conserved hypothetical protein [Enterococcus faecalis TX0309B] gi|315576720|gb|EFU88911.1| conserved hypothetical protein [Enterococcus faecalis TX0630] gi|315582750|gb|EFU94941.1| conserved hypothetical protein [Enterococcus faecalis TX0309A] gi|323481145|gb|ADX80584.1| hypothetical protein EF62_2373 [Enterococcus faecalis 62] Length = 295 Score = 90.9 bits (224), Expect = 5e-17, Method: Composition-based stats. Identities = 36/152 (23%), Positives = 63/152 (41%), Gaps = 7/152 (4%) Query: 19 SLSDVVSRITPEDTPIYS---MIKK-GTTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTI 74 +S V+ + +TP S K S +W + + +AQLEG EY+ Sbjct: 13 DISQEVNALQVPNTPFLSYLLGAGKVEAAKSTEIKWREYGMNNDDSSAQLEGGEYAD-AE 71 Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS 134 + NYT+I RKS +SGT +A++ G + Q +A E++ D+ L+ + Sbjct: 72 SDRTWFNNYTEIFRKSTSVSGTLDAINVDGVGNELNSQVALRATEMKIDLNRKLIVGVKA 131 Query: 135 E--KTSPRKMAALSSWIKKNASRGTGGVLEDM 164 + + R+M + + I T Sbjct: 132 DESGSKGRQMNGILNLISSTNKVETAAAGAVT 163 >gi|315028531|gb|EFT40463.1| conserved hypothetical protein [Enterococcus faecalis TX4000] Length = 295 Score = 90.9 bits (224), Expect = 5e-17, Method: Composition-based stats. Identities = 36/152 (23%), Positives = 63/152 (41%), Gaps = 7/152 (4%) Query: 19 SLSDVVSRITPEDTPIYS---MIKK-GTTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTI 74 +S V+ + +TP S K S +W + + +AQLEG EY+ Sbjct: 13 DISQEVNALQVPNTPFLSYLLGAGKVEAAKSTEIKWREYGMNNDDSSAQLEGGEYAD-AE 71 Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS 134 + NYT+I RKS +SGT +A++ G + Q +A E++ D+ L+ + Sbjct: 72 SDRTWFNNYTEIFRKSTSVSGTLDAINVDGVGNELNSQVALRATEMKIDLNRKLIVGVKA 131 Query: 135 E--KTSPRKMAALSSWIKKNASRGTGGVLEDM 164 + + R+M + + I T Sbjct: 132 DESGSKGRQMNGILNLISSTNKVETAAAGAVT 163 >gi|302389556|ref|YP_003825377.1| hypothetical protein Toce_0992 [Thermosediminibacter oceani DSM 16646] gi|302200184|gb|ADL07754.1| conserved hypothetical protein [Thermosediminibacter oceani DSM 16646] Length = 294 Score = 90.1 bits (222), Expect = 1e-16, Method: Composition-based stats. Identities = 34/164 (20%), Positives = 63/164 (38%), Gaps = 13/164 (7%) Query: 3 IVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKK----GTTHSIHPEWVVDDLASP 58 I ++F SL+ + + P DTP+YS+I S W L + Sbjct: 2 IKTDSFTNLEKV----SLATEIGLVAPTDTPLYSLILNLGQVDQATSPVVVWREKTLDTT 57 Query: 59 GPNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKAL 118 + EG + + NY +I K +SG+ A G + + Sbjct: 58 NDISVPEGAN-PVFYQSNRAEISNYCEIFLKGVEVSGSASASSIAGIPDLMASEVADRLA 116 Query: 119 EIRKDVEFALVSSQGSEKTSP---RKMAALSSWI-KKNASRGTG 158 E++ ++E AL++ ++ + R+M L S++ + N G Sbjct: 117 EMKVNIEKALINGVKNDGSQTPYIRRMGGLISFVPEGNKVTGAN 160 >gi|329568771|gb|EGG50571.1| hypothetical protein HMPREF9520_03403 [Enterococcus faecalis TX1467] Length = 295 Score = 89.3 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 36/152 (23%), Positives = 61/152 (40%), Gaps = 7/152 (4%) Query: 19 SLSDVVSRITPEDTPIYS---MIKK-GTTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTI 74 +S V+ + +TP S K S +W + + +AQLEG EY+ Sbjct: 13 DISQEVNALQVPNTPFLSYLLGAGKVEAAKSTEIKWREYGMNNDDSSAQLEGGEYAD-AE 71 Query: 75 NTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS 134 + NYT+I RKS +SGT A + G + Q +A E++ D+ L+ + Sbjct: 72 SDRTWFNNYTEIFRKSTSVSGTLIASNVDGVGNELNSQVALRATEMKIDLNRKLIVGVKA 131 Query: 135 E--KTSPRKMAALSSWIKKNASRGTGGVLEDM 164 + + R+M + + I T Sbjct: 132 DESGSKGRQMNGILNLISSTNKVETAAAGAVT 163 >gi|163937921|ref|YP_001642807.1| hypothetical protein BcerKBAB4_5338 [Bacillus weihenstephanensis KBAB4] gi|163865776|gb|ABY46832.1| hypothetical protein BcerKBAB4_5338 [Bacillus weihenstephanensis KBAB4] Length = 391 Score = 85.9 bits (211), Expect = 2e-15, Method: Composition-based stats. Identities = 33/105 (31%), Positives = 55/105 (52%), Gaps = 1/105 (0%) Query: 64 LEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKD 123 EG + +R+ N TQI +S L+GT AV G +Y+++K KK LE+ Sbjct: 133 SEGADARDSRYKPRKRVSNITQIFDESVELTGTAMAVAQYGVNNEYEKEKQKKQLELALA 192 Query: 124 VEFALVSSQGSEKTSPRKMAALSSWIKKNASRGTG-GVLEDMILS 167 +E A+++ E S R M + S+I+ N + G V +DM+++ Sbjct: 193 LEKAVINGIRYEAGSKRMMRGIRSFIETNVIKAEGESVNDDMLIN 237 Score = 51.6 bits (122), Expect = 3e-05, Method: Composition-based stats. Identities = 24/144 (16%), Positives = 45/144 (31%), Gaps = 13/144 (9%) Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIK-KGTTHSIHPEWVVDDLASPG 59 MT+V KES+ D + P TP+ S++ ++ W D++ + Sbjct: 1 MTVVTEKVYNEDLVGKKESVVDEFLLLNPLQTPMLSLVGFGQAVTAVEHIWFEDEMFAQE 60 Query: 60 PNAQLEGDEYSFKTINTPERMGNYTQIMRK------SWILSGTQEAVDDVGYILKYKEQK 113 A E + + Q++R ++G + V GY E Sbjct: 61 STATKEATATATEIEVADSEAFRKLQVVRAGDELILVVSVAGNKLTVA-RGYADTTAEAI 119 Query: 114 LKKALEIRKDVEFALVSSQGSEKT 137 + + +E V Sbjct: 120 AEGDV-----IEVMFVEGSEGADA 138 >gi|228910960|ref|ZP_04074768.1| hypothetical protein bthur0013_51010 [Bacillus thuringiensis IBL 200] gi|228848615|gb|EEM93461.1| hypothetical protein bthur0013_51010 [Bacillus thuringiensis IBL 200] Length = 363 Score = 84.7 bits (208), Expect = 3e-15, Method: Composition-based stats. Identities = 28/99 (28%), Positives = 49/99 (49%) Query: 65 EGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDV 124 EG R+ N TQI ++ L+GT +A+ G +Y+++K KK LE+ + Sbjct: 131 EGSNARDARYKPRNRVSNITQIFDETVELTGTAQAIAQYGVDNEYEKEKQKKQLELALQL 190 Query: 125 EFALVSSQGSEKTSPRKMAALSSWIKKNASRGTGGVLED 163 E A+++ E+ + R M + S+I+ N G + D Sbjct: 191 EKAVINGVRYEQGNRRMMRGIRSFIETNVINAGGAAVAD 229 >gi|256964718|ref|ZP_05568889.1| conserved hypothetical protein [Enterococcus faecalis HIP11704] gi|307272797|ref|ZP_07554044.1| hypothetical protein HMPREF9514_01561 [Enterococcus faecalis TX0855] gi|256955214|gb|EEU71846.1| conserved hypothetical protein [Enterococcus faecalis HIP11704] gi|306510411|gb|EFM79434.1| hypothetical protein HMPREF9514_01561 [Enterococcus faecalis TX0855] Length = 264 Score = 81.3 bits (199), Expect = 5e-14, Method: Composition-based stats. Identities = 30/126 (23%), Positives = 53/126 (42%), Gaps = 3/126 (2%) Query: 41 GTTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAV 100 S +W + + +AQLEG EY+ + NYT+I RKS +SGT +A+ Sbjct: 8 EAAKSTEIKWREYGMNNDDSSAQLEGGEYAD-AESDRTWFNNYTEIFRKSTSVSGTLDAI 66 Query: 101 DDVGYILKYKEQKLKKALEIRKDVEFALVSSQGSE--KTSPRKMAALSSWIKKNASRGTG 158 + G + Q +A E++ D+ L+ ++ + R+M + + I T Sbjct: 67 NVDGVGNELNSQVALRATEMKIDLNRKLIVGVKADESGSKGRQMNGILNLISSTNKVETA 126 Query: 159 GVLEDM 164 Sbjct: 127 AAGAVT 132 >gi|319649918|ref|ZP_08004068.1| hypothetical protein HMPREF1013_00673 [Bacillus sp. 2_A_57_CT2] gi|317398356|gb|EFV79044.1| hypothetical protein HMPREF1013_00673 [Bacillus sp. 2_A_57_CT2] Length = 292 Score = 80.5 bits (197), Expect = 8e-14, Method: Composition-based stats. Identities = 33/168 (19%), Positives = 70/168 (41%), Gaps = 13/168 (7%) Query: 7 TFITSSSTTNKE-SLSDVVSRITPEDTPIYSMIKK----GTTHSIHPEWVVDDLASPGPN 61 F +++ T ++ SL+ ++ I + TP+ SM+ S W L Sbjct: 1 MFKSTNFTEIEQISLAKEIAVIGVQATPLTSMLMAKGNIEKALSTVYTWREKSLDHAEDL 60 Query: 62 AQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIR 121 + +EG + + N +I +K +SGT A+ ++ E+ + LE++ Sbjct: 61 SAVEGSDEVVFYETARAELNNILEIFKKGASISGTAVAMK----STQFAEEVNDRLLELK 116 Query: 122 KDVEFALVSSQGSEKTSP---RKMAALSSWIK-KNASRGTGGVLEDMI 165 ++E ++ ++ + R+++ L NA TG + ED + Sbjct: 117 INMEKKFINGLRNDGSVTPFKRQLSGLIQMADPSNAVPVTGAITEDDV 164 >gi|291561307|emb|CBL40106.1| hypothetical protein CK3_02480 [butyrate-producing bacterium SS3/4] Length = 338 Score = 79.3 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 34/171 (19%), Positives = 60/171 (35%), Gaps = 23/171 (13%) Query: 5 NNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVV-DDLASPGPNAQ 63 +TF TS N S ++ TP+ S+I + H E+V + S G +Q Sbjct: 2 ADTFATSFGVLN---YSGMLFNKGNVRTPLSSIIGSKAKTTNHVEFVTGQEYTSNGNGSQ 58 Query: 64 LEGDE-----YSFKTINTPERMGNYTQIMRKSWILS-----------GTQEAVDDVGYIL 107 E + T + N TQI ++S +S G A + Sbjct: 59 PAISESASLTAPDADVVTRSQKTNVTQIFQESVGISYGKQSNMGTLSGINIAEQQANPMS 118 Query: 108 KYKEQKLKKALEIRKDVEFALVSS---QGSEKTSPRKMAALSSWIKKNASR 155 + Q K ++ +D+E+ ++ + + K L + I N Sbjct: 119 ELDFQVAAKIQKVNRDIEYTFINGEYNKATSDAEVNKTRGLVNAITTNTLA 169 >gi|241760939|ref|ZP_04759028.1| putative phage major head protein [Zymomonas mobilis subsp. mobilis ATCC 10988] gi|241374558|gb|EER64019.1| putative phage major head protein [Zymomonas mobilis subsp. mobilis ATCC 10988] Length = 238 Score = 73.2 bits (178), Expect = 1e-11, Method: Composition-based stats. Identities = 21/82 (25%), Positives = 35/82 (42%), Gaps = 4/82 (4%) Query: 87 MRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGS----EKTSPRKM 142 M K S T AV + G ++ Q + E+++D+E + + R+ Sbjct: 1 MTKVVGTSTTDRAVHNAGRGDEHAYQLARAGQELKRDIEARFTGNFAAIPGDGAVVARET 60 Query: 143 AALSSWIKKNASRGTGGVLEDM 164 A +W++ NA RG GG M Sbjct: 61 AGALAWLRSNAHRGDGGANPVM 82 >gi|229037842|ref|ZP_04189640.1| hypothetical protein bcere0028_57530 [Bacillus cereus AH1271] gi|228727464|gb|EEL78642.1| hypothetical protein bcere0028_57530 [Bacillus cereus AH1271] Length = 315 Score = 67.8 bits (164), Expect = 5e-10, Method: Composition-based stats. Identities = 30/160 (18%), Positives = 49/160 (30%), Gaps = 22/160 (13%) Query: 23 VVSRITPEDTPIYSMIKKGTTHSIHP---EWVVDDL---ASPGPNAQLEGDEYSFKTINT 76 + E+TP SMI T + E+ D L +P A E + T + Sbjct: 19 ELFTADSENTPFLSMIGGLTGGGLQTANKEFATDSLYEYPAPSQPAISEQASGTAPTAVS 78 Query: 77 PER--MGNYTQIMRKSWIL-----------SGTQEAVDDVGYILKYKEQKLKKALEIRKD 123 R N TQI +S + SG A + Q + +I +D Sbjct: 79 YARGQNKNVTQIFHESVNVTYRKLSNGGRLSGINTAGASNNAPSEKDFQIARALTKIARD 138 Query: 124 VEFALVSSQ---GSEKTSPRKMAALSSWIKKNASRGTGGV 160 E ++ ++ T K + + G Sbjct: 139 AEHTFLNGTYALATKDTEADKTRGMFELCSTGNTIAAAGA 178 >gi|228968787|ref|ZP_04129749.1| hypothetical protein bthur0004_55460 [Bacillus thuringiensis serovar sotto str. T04001] gi|228790850|gb|EEM38489.1| hypothetical protein bthur0004_55460 [Bacillus thuringiensis serovar sotto str. T04001] Length = 374 Score = 63.5 bits (153), Expect = 1e-08, Method: Composition-based stats. Identities = 20/92 (21%), Positives = 38/92 (41%), Gaps = 3/92 (3%) Query: 60 PNAQLEGDEY-SFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKAL 118 + EG++ IN N++QI + +S TQ+ V+ G + Q + Sbjct: 129 ARPRPEGEDAFRKNEINDRLVSHNFSQIFSRYASVSRTQQQVNTYGVSNELDYQVNLRLQ 188 Query: 119 EIRKDVEFALVSS--QGSEKTSPRKMAALSSW 148 E+ ++ +L+ G T PR L ++ Sbjct: 189 EMIREANTSLIYGRRNGGSPTQPRTTGGLFAF 220 >gi|228941057|ref|ZP_04103614.1| hypothetical protein bthur0008_36970 [Bacillus thuringiensis serovar berliner ATCC 10792] gi|228973988|ref|ZP_04134562.1| hypothetical protein bthur0003_37430 [Bacillus thuringiensis serovar thuringiensis str. T01001] gi|228980577|ref|ZP_04140886.1| hypothetical protein bthur0002_37450 [Bacillus thuringiensis Bt407] gi|228779138|gb|EEM27396.1| hypothetical protein bthur0002_37450 [Bacillus thuringiensis Bt407] gi|228785714|gb|EEM33719.1| hypothetical protein bthur0003_37430 [Bacillus thuringiensis serovar thuringiensis str. T01001] gi|228818600|gb|EEM64668.1| hypothetical protein bthur0008_36970 [Bacillus thuringiensis serovar berliner ATCC 10792] gi|326939625|gb|AEA15521.1| Phage protein [Bacillus thuringiensis serovar chinensis CT-43] Length = 374 Score = 63.5 bits (153), Expect = 1e-08, Method: Composition-based stats. Identities = 20/92 (21%), Positives = 38/92 (41%), Gaps = 3/92 (3%) Query: 60 PNAQLEGDEY-SFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKAL 118 + EG++ IN N++QI + +S TQ+ V+ G + Q + Sbjct: 129 ARPRPEGEDAFRKNEINDRLVSHNFSQIFSRYASVSRTQQQVNTYGVSNELDYQVNLRLQ 188 Query: 119 EIRKDVEFALVSS--QGSEKTSPRKMAALSSW 148 E+ ++ +L+ G T PR L ++ Sbjct: 189 EMIREANTSLIYGRRNGGSPTQPRTTGGLFAF 220 >gi|30020036|ref|NP_831667.1| Phage protein [Bacillus cereus ATCC 14579] gi|31415788|ref|NP_852528.1| hypothetical protein BC1894 [Bacillus phage phBC6A51] gi|229127327|ref|ZP_04256323.1| hypothetical protein bcere0015_17800 [Bacillus cereus BDRD-Cer4] gi|29895581|gb|AAP08868.1| Phage protein [Bacillus phage phBC6A51] gi|228656160|gb|EEL12002.1| hypothetical protein bcere0015_17800 [Bacillus cereus BDRD-Cer4] Length = 374 Score = 63.5 bits (153), Expect = 1e-08, Method: Composition-based stats. Identities = 20/92 (21%), Positives = 38/92 (41%), Gaps = 3/92 (3%) Query: 60 PNAQLEGDEY-SFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKAL 118 + EG++ IN N++QI + +S TQ+ V+ G + Q + Sbjct: 129 ARPRPEGEDAFRKNEINDRLVSHNFSQIFSRYASVSRTQQQVNTYGVSNELDYQVNLRLQ 188 Query: 119 EIRKDVEFALVSS--QGSEKTSPRKMAALSSW 148 E+ ++ +L+ G T PR L ++ Sbjct: 189 EMIREANTSLIYGRRNGGSPTQPRTTGGLFAF 220 >gi|229020770|ref|ZP_04177493.1| hypothetical protein bcere0030_52440 [Bacillus cereus AH1273] gi|228740571|gb|EEL90846.1| hypothetical protein bcere0030_52440 [Bacillus cereus AH1273] Length = 374 Score = 62.8 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 19/92 (20%), Positives = 37/92 (40%), Gaps = 3/92 (3%) Query: 60 PNAQLEGDEY-SFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKAL 118 + EG++ IN N++QI + +S TQ+ V+ G + Q + Sbjct: 129 ARPRPEGEDAFRKNEINDRLVSHNFSQIFSRYASVSRTQQQVNTYGVSNELDYQVNLRLQ 188 Query: 119 EIRKDVEFALVSS--QGSEKTSPRKMAALSSW 148 E+ ++ +L+ T PR L ++ Sbjct: 189 EMIREANTSLIYGRRNVGSPTQPRTTGGLFAF 220 >gi|229190579|ref|ZP_04317576.1| hypothetical protein bcere0002_22460 [Bacillus cereus ATCC 10876] gi|228592924|gb|EEK50746.1| hypothetical protein bcere0002_22460 [Bacillus cereus ATCC 10876] Length = 374 Score = 62.8 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 19/92 (20%), Positives = 37/92 (40%), Gaps = 3/92 (3%) Query: 60 PNAQLEGDEY-SFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKAL 118 + EG++ IN N++QI + +S TQ+ V+ G + Q + Sbjct: 129 ARPRPEGEDAFRKNEINDRLVSHNFSQIFSRYASVSRTQQQVNTYGVSNELDYQVNLRLQ 188 Query: 119 EIRKDVEFALVSS--QGSEKTSPRKMAALSSW 148 E+ ++ +L+ T PR L ++ Sbjct: 189 EMIREANTSLIYGRRNVGSPTQPRTTGGLFAF 220 >gi|218897919|ref|YP_002446330.1| phage protein [Bacillus cereus G9842] gi|218542918|gb|ACK95312.1| phage protein [Bacillus cereus G9842] Length = 374 Score = 62.8 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 19/92 (20%), Positives = 37/92 (40%), Gaps = 3/92 (3%) Query: 60 PNAQLEGDEY-SFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKAL 118 + EG++ IN N++QI + +S TQ+ V+ G + Q + Sbjct: 129 ARPRPEGEDAFRKNEINDRLVSHNFSQIFSRYASVSRTQQQVNTYGVSNELDYQVNLRLQ 188 Query: 119 EIRKDVEFALVSS--QGSEKTSPRKMAALSSW 148 E+ ++ +L+ T PR L ++ Sbjct: 189 EMIREANTSLIYGRRNVGSPTQPRTTGGLFAF 220 >gi|257451764|ref|ZP_05617063.1| hypothetical protein F3_01776 [Fusobacterium sp. 3_1_5R] gi|317058321|ref|ZP_07922806.1| predicted protein [Fusobacterium sp. 3_1_5R] gi|313683997|gb|EFS20832.1| predicted protein [Fusobacterium sp. 3_1_5R] Length = 371 Score = 59.7 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 25/102 (24%), Positives = 43/102 (42%), Gaps = 2/102 (1%) Query: 60 PNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEA--VDDVGYILKYKEQKLKKA 117 + EG + T N QI+R+ +S + EA V G I Y +++KK Sbjct: 131 NDNIEEGADLQGATYKKGVNYDNNVQIIREEISVSASAEAITVPSAGGIDAYSLEQMKKM 190 Query: 118 LEIRKDVEFALVSSQGSEKTSPRKMAALSSWIKKNASRGTGG 159 ++ +E A++S + E R M + ++ K GG Sbjct: 191 DKVLGKIEKAIISGKKFESGLKRGMDGVKRFLAKGQLVDAGG 232 >gi|150021335|ref|YP_001306689.1| hypothetical protein Tmel_1457 [Thermosipho melanesiensis BI429] gi|149793856|gb|ABR31304.1| hypothetical protein Tmel_1457 [Thermosipho melanesiensis BI429] Length = 362 Score = 59.7 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 24/130 (18%), Positives = 42/130 (32%), Gaps = 7/130 (5%) Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIK--KGTTHSIHPEWVVDDLAS- 57 M +N T NK +S V+S + +TP+ + I T S EW D L Sbjct: 1 MGTINGMVTTYDVAENKIDVSPVLSMLKLPNTPLLNAIGISNETVDSTRYEWWDDVLPVL 60 Query: 58 ----PGPNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQK 113 G ++GN ++ + ++ V V + + Sbjct: 61 KVKLAAAYTAGGGSLTVETGAGKKFKVGNVIKVENSIYRVTAINGDVLSVAVVSGDADHA 120 Query: 114 LKKALEIRKD 123 +E+ D Sbjct: 121 ANVDVELIGD 130 Score = 55.5 bits (132), Expect = 3e-06, Method: Composition-based stats. Identities = 30/158 (18%), Positives = 58/158 (36%), Gaps = 15/158 (9%) Query: 14 TTNKESLSDVVSRITPEDTPIYS-MIKKGTTH---SIHPEWVVDDLASPGPNAQLEGDEY 69 N + + + R+T + + S + G ++ E + D AQ EG +Y Sbjct: 87 VGNVIKVENSIYRVTAINGDVLSVAVVSGDADHAANVDVELIGD--------AQPEGQDY 138 Query: 70 SFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFAL- 128 + + N TQI SG+Q AV + + +K +++ +E Sbjct: 139 NDSNYEQKVKRYNVTQIFSDYVKFSGSQLAVKQYVNEDVFLNEVQRKLKKLKILLERTAW 198 Query: 129 --VSSQGSEKTSPRKMAALSSWIKKNASRGTGGVLEDM 164 + ++ + PR M + +I + T ED Sbjct: 199 LGIRVDPNDNSGPRMMGGIKYFIDSDGITSTNTWSEDN 236 >gi|291335186|gb|ADD94810.1| hypothetical protein [uncultured phage MedDCM-OCT-S12-C102] Length = 74 Score = 59.3 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 17/67 (25%), Positives = 32/67 (47%), Gaps = 1/67 (1%) Query: 12 SSTTNKESLSDVVSRITPEDTPIYSMIKKGTT-HSIHPEWVVDDLASPGPNAQLEGDEYS 70 KE L D+++R+ + TP S++ KG+T H+ +W VD A ++G + + Sbjct: 8 DQVAKKEDLLDLITRVDEKATPFMSLVNKGSTPHNTFIQWPVDTYADAALGGTVDGTDVA 67 Query: 71 FKTINTP 77 + Sbjct: 68 SYANHAE 74 >gi|194015203|ref|ZP_03053819.1| phage protein [Bacillus pumilus ATCC 7061] gi|194012607|gb|EDW22173.1| phage protein [Bacillus pumilus ATCC 7061] Length = 367 Score = 58.5 bits (140), Expect = 3e-07, Method: Composition-based stats. Identities = 18/86 (20%), Positives = 34/86 (39%), Gaps = 1/86 (1%) Query: 63 QLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRK 122 Q EG + N+TQI+ + +S TQ+AV + Q + E+ + Sbjct: 127 QNEGAGVGMDEGHDRYVDYNFTQIIERYAAVSNTQQAVRTHNVTDELNYQVQLRLKEMAR 186 Query: 123 DVEFALVSSQGSEKTSPRKMAALSSW 148 + L+ + + PR L ++ Sbjct: 187 EFNDWLIYGRRID-GKPRMTGGLLNF 211 >gi|308172834|ref|YP_003919539.1| phage protein [Bacillus amyloliquefaciens DSM 7] gi|307605698|emb|CBI42069.1| phage protein [Bacillus amyloliquefaciens DSM 7] Length = 367 Score = 57.8 bits (138), Expect = 5e-07, Method: Composition-based stats. Identities = 18/86 (20%), Positives = 34/86 (39%), Gaps = 1/86 (1%) Query: 63 QLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRK 122 Q EG + N+TQI+ + +S TQ+AV + Q + E+ + Sbjct: 127 QNEGAGVGIDEGHDRYVDYNFTQIIERYAAVSNTQQAVRTHNVSNELDYQVKLRLKEMAR 186 Query: 123 DVEFALVSSQGSEKTSPRKMAALSSW 148 + L+ + + PR L ++ Sbjct: 187 EFNDWLIYGRRID-GKPRMTGGLLNF 211 >gi|257463376|ref|ZP_05627772.1| hypothetical protein FuD12_05953 [Fusobacterium sp. D12] gi|317060946|ref|ZP_07925431.1| predicted protein [Fusobacterium sp. D12] gi|313686622|gb|EFS23457.1| predicted protein [Fusobacterium sp. D12] Length = 369 Score = 56.2 bits (134), Expect = 1e-06, Method: Composition-based stats. Identities = 25/102 (24%), Positives = 44/102 (43%), Gaps = 2/102 (1%) Query: 60 PNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEA--VDDVGYILKYKEQKLKKA 117 + EG + T N TQI+R+ +SGT EA V G + Y ++ +K Sbjct: 131 NDNIAEGADLQGTTYKKGVNYDNNTQIIREEISVSGTSEAINVPSSGGVDVYTLEQTRKM 190 Query: 118 LEIRKDVEFALVSSQGSEKTSPRKMAALSSWIKKNASRGTGG 159 + +E A++ + E+ + R M + ++ K GG Sbjct: 191 DTVLGKIEKAIIKGKKFEEGTKRGMDGVKRFLVKGQLVDAGG 232 >gi|257468183|ref|ZP_05632279.1| hypothetical protein FulcA4_02527 [Fusobacterium ulcerans ATCC 49185] gi|317062468|ref|ZP_07926953.1| predicted protein [Fusobacterium ulcerans ATCC 49185] gi|313688144|gb|EFS24979.1| predicted protein [Fusobacterium ulcerans ATCC 49185] Length = 370 Score = 53.9 bits (128), Expect = 8e-06, Method: Composition-based stats. Identities = 21/101 (20%), Positives = 38/101 (37%), Gaps = 2/101 (1%) Query: 60 PNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKE--QKLKKA 117 + EG + + E NYTQI+R+ +SGT +A+ + +K Sbjct: 130 NDNIEEGADLLGASYKPGENFTNYTQIIREEISISGTAQALTVPSGEGLDPYSLEMTRKM 189 Query: 118 LEIRKDVEFALVSSQGSEKTSPRKMAALSSWIKKNASRGTG 158 + VE A+V+ + R M + + + K Sbjct: 190 DKAVGKVEKAIVAGKKFATGKNRGMDGIRTILDKGQIVDAN 230 >gi|56551280|ref|YP_162119.1| hypothetical protein ZMO0384 [Zymomonas mobilis subsp. mobilis ZM4] gi|241760937|ref|ZP_04759026.1| hypothetical protein ZmobDRAFT_0102 [Zymomonas mobilis subsp. mobilis ATCC 10988] gi|56542854|gb|AAV89008.1| hypothetical protein ZMO0384 [Zymomonas mobilis subsp. mobilis ZM4] gi|241374556|gb|EER64017.1| hypothetical protein ZmobDRAFT_0102 [Zymomonas mobilis subsp. mobilis ATCC 10988] Length = 35 Score = 51.6 bits (122), Expect = 3e-05, Method: Composition-based stats. Identities = 11/30 (36%), Positives = 18/30 (60%) Query: 1 MTIVNNTFITSSSTTNKESLSDVVSRITPE 30 M++ +NT T S +E LSD++ I+P Sbjct: 1 MSVASNTVQTYSRVGIREDLSDIIYNISPT 30 >gi|34763997|ref|ZP_00144887.1| Phage protein [Fusobacterium nucleatum subsp. vincentii ATCC 49256] gi|27886234|gb|EAA23520.1| Phage protein [Fusobacterium nucleatum subsp. vincentii ATCC 49256] Length = 378 Score = 44.7 bits (104), Expect = 0.004, Method: Composition-based stats. Identities = 18/90 (20%), Positives = 36/90 (40%), Gaps = 2/90 (2%) Query: 63 QLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQK--LKKALEI 120 EG E T+ P R+ N T I+ + + ++ T + ++ G + KK E+ Sbjct: 137 MEEGGELKTSTVRLPVRITNNTGIIYEQYKVTETAKHLNPHGQGSLSVRELESQKKKDEL 196 Query: 121 RKDVEFALVSSQGSEKTSPRKMAALSSWIK 150 +E ++ + R + + IK Sbjct: 197 LGIMENKFLNGVKFTSGNLRMSGGVKALIK 226 >gi|156344548|ref|XP_001621225.1| hypothetical protein NEMVEDRAFT_v1g222228 [Nematostella vectensis] gi|156206955|gb|EDO29125.1| predicted protein [Nematostella vectensis] Length = 400 Score = 42.0 bits (97), Expect = 0.027, Method: Composition-based stats. Identities = 30/118 (25%), Positives = 45/118 (38%), Gaps = 8/118 (6%) Query: 59 GPNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKAL 118 A EG T E + NYTQI R +W ++ T A I E K + Sbjct: 138 AGTAFEEGSNRPTARRLTTEYIPNYTQIFRNAWAMTDTARASYAEMGISNIAENKADCMM 197 Query: 119 EIRKDVEFALVSSQGSEKTSPRK--------MAALSSWIKKNASRGTGGVLEDMILSL 168 D+E A++ SQ TS + AL ++ N + G D +++L Sbjct: 198 FHSVDIESAMIFSQPKMDTSGATPMHATQGILDALRQYVPGNVNAAGGTTTFDQLVAL 255 >gi|281355462|ref|ZP_06241956.1| hypothetical protein Vvad_PD3568 [Victivallis vadensis ATCC BAA-548] gi|281318342|gb|EFB02362.1| hypothetical protein Vvad_PD3568 [Victivallis vadensis ATCC BAA-548] Length = 403 Score = 40.0 bits (92), Expect = 0.097, Method: Composition-based stats. Identities = 21/105 (20%), Positives = 38/105 (36%), Gaps = 5/105 (4%) Query: 70 SFKTINTPERMGNYTQIMRKSWILSGTQEAVDDV-GYILKYKEQKLKKALEIRKDVEFAL 128 T T N+TQI+RK +S + A + Q EI +D+ Sbjct: 143 GEYTRRTVGSAYNHTQIIRKDLGISNSALATKTIDQVENSIARQTEFALQEIDRDMNRQA 202 Query: 129 VSSQGSEKTSPR----KMAALSSWIKKNASRGTGGVLEDMILSLA 169 + +E+ + L ++ A +GG L +++ A Sbjct: 203 IWGIRTERDEANDVFGEAGGLYNFATALAVDASGGRLTSKLVNDA 247 >gi|256027862|ref|ZP_05441696.1| hypothetical protein PrD11_07671 [Fusobacterium sp. D11] gi|289765813|ref|ZP_06525191.1| phage protein [Fusobacterium sp. D11] gi|289717368|gb|EFD81380.1| phage protein [Fusobacterium sp. D11] Length = 378 Score = 39.7 bits (91), Expect = 0.13, Method: Composition-based stats. Identities = 17/90 (18%), Positives = 35/90 (38%), Gaps = 2/90 (2%) Query: 63 QLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQK--LKKALEI 120 EG E T+ P + N T I+ + + ++ T + ++ G + KK E+ Sbjct: 137 MEEGGELKASTVRLPVHITNNTGIIYEQYKVTETAKHLNPHGQGGLSVRELESQKKKDEL 196 Query: 121 RKDVEFALVSSQGSEKTSPRKMAALSSWIK 150 +E ++ + R + + IK Sbjct: 197 LGIMENKFLNGVKFTSGNLRMSGGVKALIK 226 >gi|262067743|ref|ZP_06027355.1| hypothetical protein FUSPEROL_02025 [Fusobacterium periodonticum ATCC 33693] gi|291378467|gb|EFE85985.1| hypothetical protein FUSPEROL_02025 [Fusobacterium periodonticum ATCC 33693] Length = 379 Score = 38.5 bits (88), Expect = 0.35, Method: Composition-based stats. Identities = 22/113 (19%), Positives = 41/113 (36%), Gaps = 6/113 (5%) Query: 40 KGTTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEA 99 T +I +V L EG E ++ P + N T I+ + + ++ T + Sbjct: 119 TSTAGNIVANTIVQSL----GIEMEEGGELKKSSVRLPVHITNNTGIIYEEYEVTETAKH 174 Query: 100 VDDVGYILKYKEQK--LKKALEIRKDVEFALVSSQGSEKTSPRKMAALSSWIK 150 ++ G + KK E+ +E L++ R + S IK Sbjct: 175 INPHGQSGLSVREVESQKKKDEMLGIMENKLLNGVKYVNGKLRMSGGIKSLIK 227 >gi|115403015|ref|XP_001217584.1| hypothetical protein ATEG_08998 [Aspergillus terreus NIH2624] gi|114189430|gb|EAU31130.1| hypothetical protein ATEG_08998 [Aspergillus terreus NIH2624] Length = 522 Score = 38.5 bits (88), Expect = 0.35, Method: Composition-based stats. Identities = 24/119 (20%), Positives = 37/119 (31%), Gaps = 23/119 (19%) Query: 29 PEDTPIYSMIKKGTTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTINTPERMGNYTQIMR 88 P DTP+ S++ +H LAS D + NY I + Sbjct: 27 PADTPLSSLVASAKSH----------LASGSAR-----DALLYFDAAIARDPTNYLTIFQ 71 Query: 89 KSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSSQGSEKTSPRKMAALSS 147 + A + + LE++ D E AL+ +S ALS Sbjct: 72 RG--------AAYLSLRRNTQALEDFDRVLELKPDFESALLQRSRLRASSADWTGALSD 122 >gi|225016607|ref|ZP_03705799.1| hypothetical protein CLOSTMETH_00514 [Clostridium methylpentosum DSM 5476] gi|224950571|gb|EEG31780.1| hypothetical protein CLOSTMETH_00514 [Clostridium methylpentosum DSM 5476] Length = 169 Score = 37.3 bits (85), Expect = 0.69, Method: Composition-based stats. Identities = 15/88 (17%), Positives = 31/88 (35%), Gaps = 15/88 (17%) Query: 67 DEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEF 126 +Y+ + R N I++ +S +Y+ Q ++ D E Sbjct: 76 SDYTSGLLADYMRDKNACAIVKGLRAISDF-----------EYEFQMALANRKLNPDAET 124 Query: 127 ALVSSQGSE----KTSPRKMAALSSWIK 150 +++QG + R++A L I Sbjct: 125 VFLTTQGENMYLSSSLVRQIAGLGGDIS 152 >gi|170056995|ref|XP_001864283.1| serine/threonine-protein kinase SBK1 [Culex quinquefasciatus] gi|167876570|gb|EDS39953.1| serine/threonine-protein kinase SBK1 [Culex quinquefasciatus] Length = 459 Score = 37.3 bits (85), Expect = 0.75, Method: Composition-based stats. Identities = 13/90 (14%), Positives = 29/90 (32%), Gaps = 9/90 (10%) Query: 27 ITPEDTPIYSMIKKGTTHSIHPEWVVDDLASPGPNAQLEGDEYSFKTINTPERMGNYT-- 84 +TPE TP+++ + T + +W L S N D+ + + Y Sbjct: 372 LTPEPTPVFTGVDPETARNKVWDW----LESNDLNRHDSQDDVVDFSFWSKSESKTYQYA 427 Query: 85 ---QIMRKSWILSGTQEAVDDVGYILKYKE 111 I+ + + + + + Sbjct: 428 KRESIIGATSTTTASLAVTREASNASSVQR 457 >gi|256845901|ref|ZP_05551359.1| phage protein [Fusobacterium sp. 3_1_36A2] gi|256719460|gb|EEU33015.1| phage protein [Fusobacterium sp. 3_1_36A2] Length = 397 Score = 36.6 bits (83), Expect = 1.1, Method: Composition-based stats. Identities = 19/90 (21%), Positives = 33/90 (36%), Gaps = 2/90 (2%) Query: 63 QLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQK--LKKALEI 120 EG E T+ + + N T I+ + ++ T + G + KK E+ Sbjct: 156 IEEGGELKDSTVRLSKHITNITGIIYDKYEITETMKHTHPQGQGGLSAREIESQKKKDEL 215 Query: 121 RKDVEFALVSSQGSEKTSPRKMAALSSWIK 150 +E L++ R A + S IK Sbjct: 216 LGTMENKLLNGIKYINGDIRHSAGIKSLIK 245 >gi|118197693|ref|YP_874086.1| major structural protein [Thermus phage phiYS40] gi|116266384|gb|ABJ91467.1| major structural protein [Thermus phage phiYS40] Length = 470 Score = 36.6 bits (83), Expect = 1.2, Method: Composition-based stats. Identities = 18/121 (14%), Positives = 38/121 (31%), Gaps = 5/121 (4%) Query: 16 NKESLSDVVSRITPEDTPIYSMIKK--GTTHSIHPEWVVDDLA--SPGPNAQLEGDEYSF 71 +E L V+++ DTP+ ++ K + E+ V G A EG Sbjct: 28 EREDLEPEVTQLNVLDTPLTDLLSKNAVKAKAYEHEYNVVTARHDKIGYAAFREGG-LPR 86 Query: 72 KTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALEIRKDVEFALVSS 131 R ++ ++ G + + K +K + + + E+ Sbjct: 87 TVEVNVVRRRIRPMLVGHRITVTELATRTTQNGVMQIDELVKREKMIAVANEFEYLAFYG 146 Query: 132 Q 132 Sbjct: 147 D 147 >gi|317130456|ref|YP_004096738.1| hypothetical protein Bcell_3767 [Bacillus cellulosilyticus DSM 2522] gi|315475404|gb|ADU32007.1| hypothetical protein Bcell_3767 [Bacillus cellulosilyticus DSM 2522] Length = 557 Score = 36.2 bits (82), Expect = 1.8, Method: Composition-based stats. Identities = 18/101 (17%), Positives = 36/101 (35%), Gaps = 10/101 (9%) Query: 57 SPGPNAQLEGDEYSFKTINTPERMGNYTQ----IMRKSWILSGTQEAVDDVGYILKYKEQ 112 + G N + + I N M +S + ++E + G + + K++ Sbjct: 368 AVGANHIVYSSVFGPPYIIQIVDESNVDHYESLTMHESGATTDSEEIEESAGSMDEDKQE 427 Query: 113 KLKKALEIRKDVEFALVSSQGSEKTSPRKMAALSSWIKKNA 153 + + E D+E AL R + +L I +N Sbjct: 428 EERAPKEKEVDIETAL------SNAVGRYLNSLIKAINEND 462 >gi|242013050|ref|XP_002427232.1| conserved hypothetical protein [Pediculus humanus corporis] gi|212511544|gb|EEB14494.1| conserved hypothetical protein [Pediculus humanus corporis] Length = 1398 Score = 35.8 bits (81), Expect = 2.3, Method: Composition-based stats. Identities = 25/92 (27%), Positives = 38/92 (41%), Gaps = 4/92 (4%) Query: 5 NNTFITSSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDLASPGPNAQL 64 NT + S ++ ES + VVS P TP S + T ++ + LA P + Sbjct: 1021 TNTSGSLHSVSSDESTAAVVSL--PVTTPFRSNVAGTTVGTVQHSASLMSLAKPKIPETV 1078 Query: 65 EGDEYSFKTINTPERMGNYTQIMRKSWILSGT 96 + + I+ +R N IM ILSG Sbjct: 1079 SLNALTE--ISNIKRSNNSVNIMSSGSILSGN 1108 >gi|326532742|dbj|BAJ89216.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 806 Score = 35.0 bits (79), Expect = 3.8, Method: Composition-based stats. Identities = 16/83 (19%), Positives = 33/83 (39%) Query: 60 PNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILKYKEQKLKKALE 119 N Q ++ + +G Q+ K + S ++ + + K +L K E Sbjct: 275 SNLQDAESAFNSALWSDSSVLGGALQLHSKEMMESNLKQVMVEAEGSRKEAFLELLKRKE 334 Query: 120 IRKDVEFALVSSQGSEKTSPRKM 142 I V+ A + + +E + R+M Sbjct: 335 IESKVDSAFIRVKAAESSKKREM 357 >gi|194697698|gb|ACF82933.1| unknown [Zea mays] gi|195657807|gb|ACG48371.1| O-methyltransferase ZRP4 [Zea mays] Length = 364 Score = 34.7 bits (78), Expect = 4.7, Method: Composition-based stats. Identities = 21/116 (18%), Positives = 34/116 (29%), Gaps = 24/116 (20%) Query: 3 IVNNTFITSSSTTNKESLSDVVSRITPE-------------DTPIYSMIKKGTTHSIHPE 49 N F T + S+ V +TP TP+ +M+ T S E Sbjct: 79 TTTNVFGTQQPAGGSDDDSEPVYTLTPVSRLLIASQSSQLAQTPLAAMVLDPTIVSPFFE 138 Query: 50 ---WVVDDLASPGPNAQLEG--------DEYSFKTINTPERMGNYTQIMRKSWILS 94 W +L P G D+ +F + + I+ + S Sbjct: 139 LAAWFQHELPDPCIFKHTHGRGIWELTKDDATFDALVNDGLASDSQLIVDVAIKQS 194 >gi|302392162|ref|YP_003827982.1| methyl-accepting chemotaxis sensory transducer with Cache sensor [Acetohalobium arabaticum DSM 5501] gi|302204239|gb|ADL12917.1| methyl-accepting chemotaxis sensory transducer with Cache sensor [Acetohalobium arabaticum DSM 5501] Length = 550 Score = 34.7 bits (78), Expect = 5.2, Method: Composition-based stats. Identities = 21/82 (25%), Positives = 30/82 (36%), Gaps = 5/82 (6%) Query: 52 VDDLAS----PGPNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGT-QEAVDDVGYI 106 VDDL++ +AQ T E N QI + ++G QEA Sbjct: 293 VDDLSAYSEELSASAQEGNAAIETTTQLIEEMSTNIQQISASAQEVTGLAQEANSQAEIG 352 Query: 107 LKYKEQKLKKALEIRKDVEFAL 128 + EQ + EI VE + Sbjct: 353 SENIEQAVSSMKEINNAVEETV 374 >gi|327183179|gb|AEA31626.1| protease [Lactobacillus amylovorus GRL 1118] Length = 404 Score = 33.9 bits (76), Expect = 7.2, Method: Composition-based stats. Identities = 17/84 (20%), Positives = 29/84 (34%) Query: 49 EWVVDDLASPGPNAQLEGDEYSFKTINTPERMGNYTQIMRKSWILSGTQEAVDDVGYILK 108 ++ VDDL P N LE E + Q + +SG + G Sbjct: 216 QFEVDDLTIPAENNPLEKTEEQDNAQAQLLMGFGFKQKITYQGQISGLLLSQYLAGDQSS 275 Query: 109 YKEQKLKKALEIRKDVEFALVSSQ 132 ++++ L DVE ++ Sbjct: 276 KLFNQIREELGAAYDVEANSFANN 299 >gi|223042595|ref|ZP_03612644.1| NADPH:quinone reductase [Staphylococcus capitis SK14] gi|222444258|gb|EEE50354.1| NADPH:quinone reductase [Staphylococcus capitis SK14] Length = 337 Score = 33.9 bits (76), Expect = 7.5, Method: Composition-based stats. Identities = 15/109 (13%), Positives = 32/109 (29%), Gaps = 3/109 (2%) Query: 11 SSSTTNKESLSDVVSRITPEDTPIYSMIKKGTTHSIHPEWVVDDL--ASPGPNAQLEGDE 68 + E + + V+ P D YS + + + ++L +P + Sbjct: 63 FDAAGIVEQVGEDVTMFEPGDYVFYSGSPNQHGSNEEYQLIEEELVAKAPSNLKPEQAAS 122 Query: 69 YSFKTINTPERMGNYTQIMRKSWILSGT-QEAVDDVGYILKYKEQKLKK 116 + E + + QI G ++ G + Q K Sbjct: 123 LPLTGLTASETLFDVFQISHDPEKNKGKSLLIINGAGGVGSIATQIAKA 171 Database: nr Posted date: May 13, 2011 4:10 AM Number of letters in database: 999,999,932 Number of sequences in database: 2,987,209 Database: /data/usr2/db/fasta/nr.01 Posted date: May 13, 2011 4:17 AM Number of letters in database: 999,998,956 Number of sequences in database: 2,896,973 Database: /data/usr2/db/fasta/nr.02 Posted date: May 13, 2011 4:23 AM Number of letters in database: 999,999,979 Number of sequences in database: 2,907,862 Database: /data/usr2/db/fasta/nr.03 Posted date: May 13, 2011 4:29 AM Number of letters in database: 999,999,513 Number of sequences in database: 2,932,190 Database: /data/usr2/db/fasta/nr.04 Posted date: May 13, 2011 4:33 AM Number of letters in database: 792,586,372 Number of sequences in database: 2,260,650 Lambda K H 0.308 0.134 0.377 Lambda K H 0.267 0.0411 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,309,193,696 Number of Sequences: 13984884 Number of extensions: 44428082 Number of successful extensions: 102725 Number of sequences better than 10.0: 148 Number of HSP's better than 10.0 without gapping: 91 Number of HSP's successfully gapped in prelim test: 57 Number of HSP's that attempted gapping in prelim test: 102445 Number of HSP's gapped (non-prelim): 228 length of query: 169 length of database: 4,792,584,752 effective HSP length: 128 effective length of query: 41 effective length of database: 3,002,519,600 effective search space: 123103303600 effective search space used: 123103303600 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.5 bits) S2: 76 (33.9 bits)