BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781209|ref|YP_003065622.1| hypothetical protein CLIBASIA_05580 [Candidatus Liberibacter asiaticus str. psy62] (175 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done Results from round 1 >gi|254781209|ref|YP_003065622.1| hypothetical protein CLIBASIA_05580 [Candidatus Liberibacter asiaticus str. psy62] gi|254040886|gb|ACT57682.1| hypothetical protein CLIBASIA_05580 [Candidatus Liberibacter asiaticus str. psy62] gi|317120674|gb|ADV02497.1| hypothetical protein SC1_gp085 [Liberibacter phage SC1] gi|317120818|gb|ADV02639.1| hypothetical protein SC1_gp085 [Candidatus Liberibacter asiaticus] Length = 175 Score = 360 bits (924), Expect = 5e-98, Method: Compositional matrix adjust. Identities = 175/175 (100%), Positives = 175/175 (100%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP Sbjct: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 Query: 61 LYFTYNGELKYYIPLKCLKVLNTSEETQLRGRCIVARGDTPRPLSLKYIGEVPLSDCDPL 120 LYFTYNGELKYYIPLKCLKVLNTSEETQLRGRCIVARGDTPRPLSLKYIGEVPLSDCDPL Sbjct: 61 LYFTYNGELKYYIPLKCLKVLNTSEETQLRGRCIVARGDTPRPLSLKYIGEVPLSDCDPL 120 Query: 121 YQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPEDRGYHNG 175 YQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPEDRGYHNG Sbjct: 121 YQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPEDRGYHNG 175 >gi|315121934|ref|YP_004062423.1| hypothetical protein CKC_00920 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495336|gb|ADR51935.1| hypothetical protein CKC_00920 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 172 Score = 208 bits (530), Expect = 2e-52, Method: Compositional matrix adjust. Identities = 100/172 (58%), Positives = 124/172 (72%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 MTE EICNWALLKLGQRP+ +E S+KA YC+ LL PIH+ LLRSF+W FAT+S L Sbjct: 1 MTECEICNWALLKLGQRPIAKLDEGSIKAKYCKFLLSPIHRSLLRSFAWGFATKSSFLSS 60 Query: 61 LYFTYNGELKYYIPLKCLKVLNTSEETQLRGRCIVARGDTPRPLSLKYIGEVPLSDCDPL 120 L G ++Y +P KCLKVL + L G CIV + PLS+KYI EV L+DCDPL Sbjct: 61 LVSEDQGRVRYPLPKKCLKVLKIDRDGILDGDCIVVHDNKSIPLSIKYIEEVSLADCDPL 120 Query: 121 YQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPEDRGY 172 YQE +LKLA ELCP L+ D+Q+R+YLK ES+RVL A++MDG+E+ EDR Y Sbjct: 121 YQETLSLKLASELCPTLVADEQLRAYLKAESDRVLKTAIEMDGIELSEDREY 172 >gi|315122896|ref|YP_004063385.1| hypothetical protein CKC_05760 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496298|gb|ADR52897.1| hypothetical protein CKC_05760 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 172 Score = 196 bits (499), Expect = 7e-49, Method: Compositional matrix adjust. Identities = 98/172 (56%), Positives = 118/172 (68%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 MTE EICNWALLKLGQRP+ +E S+KA YC+ LL PIH+ LLRSF+W FAT+S L Sbjct: 1 MTECEICNWALLKLGQRPIAKLDEGSIKAKYCKFLLSPIHRSLLRSFAWGFATKSSFLSS 60 Query: 61 LYFTYNGELKYYIPLKCLKVLNTSEETQLRGRCIVARGDTPRPLSLKYIGEVPLSDCDPL 120 L G ++Y +P KCLKVL + L G CIV + PLS+KYI EV L+DCDPL Sbjct: 61 LVSEDQGRVRYPLPKKCLKVLKIDRDGILDGDCIVVHDNKSIPLSIKYIEEVSLADCDPL 120 Query: 121 YQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPEDRGY 172 YQE +LKLA ELCP L D Q+ LKGESE + A+DMDG+EI E+R Y Sbjct: 121 YQETLSLKLASELCPTLTADSQLTLRLKGESEELKKSAIDMDGIEIVEEREY 172 >gi|317120717|gb|ADV02539.1| hypothetical protein SC2_gp085 [Liberibacter phage SC2] gi|317120778|gb|ADV02599.1| hypothetical protein SC2_gp085 [Candidatus Liberibacter asiaticus] Length = 172 Score = 99.8 bits (247), Expect = 1e-19, Method: Compositional matrix adjust. Identities = 57/134 (42%), Positives = 78/134 (58%), Gaps = 3/134 (2%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 MT+LE+CNWAL K+G+ P+E + S A C+ L P+H LLR F WSFAT L P Sbjct: 2 MTDLEVCNWALRKIGEEPVETLEDNSSLALSCKAQLQPLHSSLLRRFDWSFATFCYYLSP 61 Query: 61 LYFTYNGELKYYIPLKCLKVLNTSEETQLRGRCIVARGDTPRPLSLKYIGEVPLSDCDPL 120 L +G+ Y +P CL++L + LRG ++ L+L +I +V + DCDPL Sbjct: 62 LEQEDDGQWIYPLPRDCLRILKP-QNPLLRG--LMLYSSMEEFLALTFIKKVKIEDCDPL 118 Query: 121 YQEAFALKLAGELC 134 Y E A KLA E+C Sbjct: 119 YCEVLACKLALEVC 132 >gi|221201504|ref|ZP_03574543.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] gi|221207940|ref|ZP_03580946.1| conserved hypothetical protein [Burkholderia multivorans CGD2] gi|221172125|gb|EEE04566.1| conserved hypothetical protein [Burkholderia multivorans CGD2] gi|221178772|gb|EEE11180.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] Length = 207 Score = 70.1 bits (170), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 53/182 (29%), Positives = 82/182 (45%), Gaps = 20/182 (10%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLL-- 59 +++ ICN AL KLG + + + +E S A + + LR WSFA L Sbjct: 3 SQVGICNRALTKLGDKRITSLDEDSKAAAVLNSMYDDVLDACLREHVWSFAKTRAQLAAL 62 Query: 60 ---PLY-FTYNGELKYYIPLKCLKVLNTSE-----ETQLRGRCIVARG----DTPRPLSL 106 PL+ F Y +Y +P L++L + T RG + G D P PL + Sbjct: 63 ADAPLFGFGY----QYRLPADFLRLLQVGQFLIYPRTDTRGLFSIENGNILTDLPAPLYV 118 Query: 107 KYIGEVPLSDC-DPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVE 165 +Y + + DPL++E FA +LA E C L R E +R +A A+ ++ +E Sbjct: 119 RYTKRITDPNVMDPLFREVFACRLAAEACETLTQSSTKRQAAWAEHDRAIAQAIRVNAIE 178 Query: 166 IP 167 P Sbjct: 179 RP 180 >gi|221213948|ref|ZP_03586921.1| conserved hypothetical protein [Burkholderia multivorans CGD1] gi|221166125|gb|EED98598.1| conserved hypothetical protein [Burkholderia multivorans CGD1] Length = 207 Score = 68.6 bits (166), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 51/182 (28%), Positives = 83/182 (45%), Gaps = 20/182 (10%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLL-- 59 +++ ICN AL KLG++ + + +E S A + + LR WSFA L Sbjct: 3 SQVGICNRALTKLGEKRITSLDEDSKPAAALNSMYDDVLDACLREHVWSFAKTRTQLAAL 62 Query: 60 ---PLY-FTYNGELKYYIPLKCLKVLNTSE-----ETQLRGRCIVARG----DTPRPLSL 106 PL+ F Y +Y +P ++++ + T RG + G D P PL + Sbjct: 63 ADAPLFGFGY----QYRLPTDFIRLIQIGQFLVYPRTDTRGLFSIENGNILTDLPAPLYV 118 Query: 107 KYIGEVPLSDC-DPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVE 165 +Y + + DPL++E FA +LA E C L R E +R +A A+ ++ +E Sbjct: 119 RYTKRITDPNAMDPLFREVFACRLAAEACESLTQSSTKRQAAWAEHDRAIAQAIRVNAIE 178 Query: 166 IP 167 P Sbjct: 179 RP 180 >gi|332160973|ref|YP_004297550.1| hypothetical protein YE105_C1351 [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|325665203|gb|ADZ41847.1| hypothetical protein YE105_C1351 [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|330862129|emb|CBX72293.1| hypothetical protein YEW_AK02300 [Yersinia enterocolitica W22703] Length = 186 Score = 65.9 bits (159), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 46/175 (26%), Positives = 78/175 (44%), Gaps = 14/175 (8%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPL 61 +++ IC+ ALL LG P+ ++ E + A C + P + LLR W+ A + L P Sbjct: 3 SDISICSNALLMLGAHPINSFTENTDHARLCSNIYPSVRDDLLRKHPWNCAVSRVSLAPS 62 Query: 62 ----YFTYNGELKYYIPLKCLKVLNTSEET-----QLRGRCIVARGDTPRPLSLKYIGEV 112 F Y + + IP CL++L+ E Q+ G+ I+A ++L+YI Sbjct: 63 TSLPVFGYANQ--FPIPSDCLRILSVGAEGYEIPYQVEGKNILANTTV---VALRYIKRT 117 Query: 113 PLSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIP 167 + DP + +A +L + +R + E+ L A +DG E P Sbjct: 118 DEASWDPALAHVAEMAMAAKLAYAVTSSASLRDSMAQEAAYALRQAKAIDGQEEP 172 >gi|296532339|ref|ZP_06895076.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957] gi|296267335|gb|EFH13223.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957] Length = 189 Score = 64.3 bits (155), Expect = 5e-09, Method: Compositional matrix adjust. Identities = 45/146 (30%), Positives = 71/146 (48%), Gaps = 12/146 (8%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 ++ L +C+ ALLK+G +P+ + E + +A+ L P + LL + WSFAT +L Sbjct: 3 LSALVLCSRALLKIGAQPVASLEEGTAEAEVAANLYPAVRDALLSAHPWSFATGQAELPR 62 Query: 61 LYFTYNGE--LKYYIPLKCLKVLNTSEETQLRGRCIVAR-------GDTPRPLSLKYIGE 111 L + + Y +P L+VL+ E RGR ++ R D+PR + L Y+ Sbjct: 63 LLAVPHADYAFAYQLPADFLRVLSAGGER--RGRGLLYRIHENRLHTDSPR-VVLTYLFR 119 Query: 112 VPLSDCDPLYQEAFALKLAGELCPPL 137 S P + A +LA E C PL Sbjct: 120 PAESALPPFFASALVARLAAEFCVPL 145 >gi|187736541|ref|YP_001878653.1| hypothetical protein Amuc_2062 [Akkermansia muciniphila ATCC BAA-835] gi|187426593|gb|ACD05872.1| hypothetical protein Amuc_2062 [Akkermansia muciniphila ATCC BAA-835] Length = 206 Score = 64.3 bits (155), Expect = 6e-09, Method: Compositional matrix adjust. Identities = 49/160 (30%), Positives = 78/160 (48%), Gaps = 9/160 (5%) Query: 2 TELEICNWALLKLGQRPLENW-NERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 + L+ICN AL K+G+ PL+ S + C L P + L W+FA L Sbjct: 13 SALDICNAALSKIGEAPLDALIANESTASRLCVLHYHPARRETLCMARWTFAATQTTLDS 72 Query: 61 LYFTYNGEL---KYYIPLKCLKVLNTS-EETQLRGRCIVARGDTPRPLSLKYIGEVPLSD 116 + L ++ +P CL+VL+ E +++GR I A PL L YI ++ +D Sbjct: 73 VSAQAPNSLTPYQFTLPADCLRVLDVECSEWKMQGRRIHASC---APLPLSYIADIENTD 129 Query: 117 -CDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVL 155 DPL+ +A A +LA +L PL + +R L E +++ Sbjct: 130 QFDPLFMDALATRLAEKLAMPLTGNQSLRQNLNQEFHKII 169 >gi|48697201|ref|YP_024931.1| hypothetical protein BcepC6B_gp11 [Burkholderia phage BcepC6B] gi|47779007|gb|AAT38370.1| gp11 [Burkholderia phage BcepC6B] Length = 207 Score = 64.3 bits (155), Expect = 7e-09, Method: Compositional matrix adjust. Identities = 49/182 (26%), Positives = 82/182 (45%), Gaps = 20/182 (10%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLL-- 59 +++ ICN AL K+G + + + +E S A + + LR+ WSF L Sbjct: 3 SQVGICNRALTKIGDKRITSLDEDSKAAATLNSMYDDVLDACLRAHVWSFTKARAQLAAL 62 Query: 60 ---PLY-FTYNGELKYYIPLKCLKVLNTSE-----ETQLRGRCIVARG----DTPRPLSL 106 PL+ F+Y +Y +P +++L + T RG + G D PL + Sbjct: 63 AEAPLFGFSY----QYRLPTDFIRLLQVGQFDVYPRTDTRGLFSIENGNILTDMQAPLYI 118 Query: 107 KYIGEVPLSDC-DPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVE 165 +Y V + D L++EAFA +LA E C L R E ++ +A A+ ++ +E Sbjct: 119 RYAKRVTDPNAMDALFREAFACRLAAEACESLTQSATKRQGAWAEHDQAIAAAIRVNAIE 178 Query: 166 IP 167 P Sbjct: 179 RP 180 >gi|242278912|ref|YP_002991041.1| hypothetical protein Desal_1440 [Desulfovibrio salexigens DSM 2638] gi|242121806|gb|ACS79502.1| conserved hypothetical protein [Desulfovibrio salexigens DSM 2638] Length = 197 Score = 55.5 bits (132), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 42/140 (30%), Positives = 69/140 (49%), Gaps = 9/140 (6%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 ++E++ICN AL KLG R +E+ ++ S +A C L + +LR W+FA + L Sbjct: 3 VSEVKICNLALRKLGARLIESLSDSSQEAVTCNLFYDQVRDAVLREHPWNFAASRIRLAK 62 Query: 61 LYFT--YNGELKYYIPLKCLKV--LNT-SEETQLRGRCIVARGDTPRPL-SLKYIGEVPL 114 L + +Y +P CL + LN+ +E + G I+ D + ++K V Sbjct: 63 LADNPAFGFVHQYQLPSDCLHLRQLNSPDDEFVVEGDKILTDRDGASAVYTMKVTNPVLF 122 Query: 115 SDCDPLYQEAFALKLAGELC 134 DP + AFA +LA E+ Sbjct: 123 ---DPSFVMAFAARLAAEMA 139 >gi|83313368|ref|YP_423632.1| hypothetical protein amb4269 [Magnetospirillum magneticum AMB-1] gi|82948209|dbj|BAE53073.1| hypothetical protein [Magnetospirillum magneticum AMB-1] Length = 189 Score = 54.7 bits (130), Expect = 5e-06, Method: Compositional matrix adjust. Identities = 43/159 (27%), Positives = 68/159 (42%), Gaps = 18/159 (11%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 ++ + +C+ AL++LG P+ +++ S +++ L PP+ G L S WSFAT + L Sbjct: 3 LSAIALCSRALIRLGAAPIAGFDDGSAESEVAANLYPPLRDGALSSHPWSFATGQVSLAR 62 Query: 61 LY------FTYNGELKYYIPLKCLKVLNTSEETQLRGRCIVARGDTPRPL------SLKY 108 L F Y Y +P L+VL+ RG +V R R L L Y Sbjct: 63 LAGEPLADFAY----AYQLPTDFLRVLSAGPSG--RGSGLVFRIAGSRLLCDADEVVLTY 116 Query: 109 IGEVPLSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYL 147 + S P + + +L E C PL +L Sbjct: 117 VYRPDESAFPPYFDQMLMARLTAEFCIPLTESTTRAQFL 155 >gi|212710811|ref|ZP_03318939.1| hypothetical protein PROVALCAL_01879 [Providencia alcalifaciens DSM 30120] gi|212686508|gb|EEB46036.1| hypothetical protein PROVALCAL_01879 [Providencia alcalifaciens DSM 30120] Length = 201 Score = 52.8 bits (125), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 46/158 (29%), Positives = 74/158 (46%), Gaps = 26/158 (16%) Query: 2 TELEICNWALLKLGQ-RPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 +E+EICN AL ++G R + + +E+S +A+ C L +L F W+FA++ + L Sbjct: 3 SEIEICNMALSRIGNSRFINSLSEKSKEAEQCNLHYGHCRDTVLSDFPWNFASKRVALAD 62 Query: 61 LYFTYN----GELKYYIPLKCLKVLNTSEETQ-----------LRGRCIVARG-----DT 100 T N + Y P CLK + + Q + G + G D Sbjct: 63 ---TNNPPPDWKFAYSYPTDCLKAIAIIQSGQKYPQPNNAIQFIVGSDVSGTGKLIYCDQ 119 Query: 101 PRPLSLKYIGEVP-LSDCDPLYQEAFALKLAGELCPPL 137 P+ L+Y+ V ++ D L+ +A A +LAGEL PL Sbjct: 120 PQAW-LQYVASVTDVNMFDSLFIDALAWRLAGELARPL 156 >gi|167041088|gb|ABZ05849.1| hypothetical protein ALOHA_HF400048F7ctg1g16 [uncultured marine microorganism HF4000_48F7] Length = 198 Score = 52.0 bits (123), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 44/168 (26%), Positives = 75/168 (44%), Gaps = 9/168 (5%) Query: 4 LEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPLYF 63 ++ICN AL LG+ + + + +A C L +LRS W+ A + L Sbjct: 5 VQICNIALSNLGEAKIAALTDENERARQCNLRYEDCRDAVLRSHPWNAAVTRVALAASVT 64 Query: 64 --TYNGELKYYIPLKCLKVLNTSEETQ---LRGRCIVARGDTPRPLSLKYIGEVPL-SDC 117 + K+ +P CL+VL+ + Q + GR + ++L YI +V + Sbjct: 65 APAWGFAKKFALPADCLRVLDIEDFYQSYKVEGRFVFTDATA---VNLLYIAKVTDPTQF 121 Query: 118 DPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVE 165 D L A A+KLA E+ L ++R + + ++LA A +D E Sbjct: 122 DSLLLHAVAMKLASEIAEALTGRAELRDRMLSKYLQILAEARGVDSQE 169 >gi|304398396|ref|ZP_07380270.1| conserved hypothetical protein [Pantoea sp. aB] gi|304354262|gb|EFM18635.1| conserved hypothetical protein [Pantoea sp. aB] Length = 201 Score = 51.2 bits (121), Expect = 5e-05, Method: Compositional matrix adjust. Identities = 42/155 (27%), Positives = 73/155 (47%), Gaps = 24/155 (15%) Query: 4 LEICNWALLKLGQ-RPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPLY 62 +EICN AL ++G R + +++E+S +A C L + +L F W+FAT+ L Sbjct: 5 IEICNIALSRIGNSRTINSFSEQSKEAGLCSLHFDTAREEVLADFDWNFATKRAALADTG 64 Query: 63 F-TYNGELKYYIPLKCLKVL--------NTSEETQLR----------GRCIVARGDTPRP 103 + Y P C+++ N E +++ GR ++ D P Sbjct: 65 LPPSDWRFAYRYPTDCMRITGIMLPGMRNPPERLRIQYEVGSDPDGEGRLVLC--DEPD- 121 Query: 104 LSLKYIGEVPLSDC-DPLYQEAFALKLAGELCPPL 137 + +KY+ + ++ DPL++ A A KLA E+ PL Sbjct: 122 VWIKYVANITNANMFDPLFRSALAWKLASEIAMPL 156 >gi|227355853|ref|ZP_03840246.1| conserved hypothetical protein [Proteus mirabilis ATCC 29906] gi|227164172|gb|EEI49069.1| conserved hypothetical protein [Proteus mirabilis ATCC 29906] Length = 201 Score = 50.4 bits (119), Expect = 8e-05, Method: Compositional matrix adjust. Identities = 47/161 (29%), Positives = 69/161 (42%), Gaps = 32/161 (19%) Query: 2 TELEICNWALLKLGQ-RPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLL- 59 +E+EICN AL ++G R + + E S +A C L +L F W+FAT+ + L Sbjct: 3 SEIEICNIALSRIGNSRSINSMTEASKEAVQCNLHYAQCRDSVLADFPWNFATKKVALAN 62 Query: 60 ----PLYFTYNGELKYYIPLKCLKVLNTSEETQL------------------RGRCIVAR 97 P + Y Y P CLK + E Q GR I Sbjct: 63 TNNPPPNWAY----AYRYPNDCLKAIGIVEPHQKYRRPDTAIHFHVGSDENGTGRLIFT- 117 Query: 98 GDTPRPLSLKYIGEVP-LSDCDPLYQEAFALKLAGELCPPL 137 D P L+Y+ + ++ D L+++A A +LA EL PL Sbjct: 118 -DHPSAW-LEYVARITDVNMFDALFKDALAWRLAAELARPL 156 >gi|303257569|ref|ZP_07343581.1| conserved hypothetical protein [Burkholderiales bacterium 1_1_47] gi|302859539|gb|EFL82618.1| conserved hypothetical protein [Burkholderiales bacterium 1_1_47] Length = 214 Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 44/156 (28%), Positives = 70/156 (44%), Gaps = 24/156 (15%) Query: 4 LEICNWALLKLGQRPLENW---NERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 ++ICN AL +LG R E S +AD+CR P K +L +++WSFAT +L Sbjct: 5 VDICNIALSRLGDRATVTSIDPPEGSAQADHCRRFYPIALKTILATYNWSFATTRKELAR 64 Query: 61 LYFT-YNGELKYYIPLKCLKVL---------NTSEETQLRGR-------CIVARGDTPRP 103 L G + IP C+K++ N + +T R C+VA + Sbjct: 65 LTAEPIGGGYAFPIPADCVKIIYAYPVDENGNATRQTLHYVRELINGQVCLVAE---QKR 121 Query: 104 LSLKYI-GEVPLSDCDPLYQEAFALKLAGELCPPLI 138 + ++YI EV ++ +A A LA L ++ Sbjct: 122 IWIRYITTEVKPEKFSDVFSDALAFLLASNLAGTVV 157 >gi|209966376|ref|YP_002299291.1| hypothetical protein RC1_3114 [Rhodospirillum centenum SW] gi|209959842|gb|ACJ00479.1| conserved hypothetical protein [Rhodospirillum centenum SW] Length = 198 Score = 50.1 bits (118), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 37/147 (25%), Positives = 61/147 (41%), Gaps = 10/147 (6%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 +T + +C ALLK+G + ++ + + +A+ L P LL + WSFAT L Sbjct: 3 LTAIGLCARALLKIGAAGIASFADGTAEAEVAAALYAPTRDALLSAHGWSFATLHAGLAR 62 Query: 61 LYFTYNGEL------KYYIPLKCLKVLNTSEETQLRGRCIVARGDT----PRPLSLKYIG 110 L G + +P L+ L+ ++RG +G T + L Y+ Sbjct: 63 LADPPQGTAVAGPAHAFALPDGFLRALSAGRGGRMRGLEYRIQGGTLLADADAVILTYLH 122 Query: 111 EVPLSDCDPLYQEAFALKLAGELCPPL 137 P + +A +LA E C PL Sbjct: 123 RPAEESFPPFFDQALIARLAAEFCLPL 149 >gi|268589381|ref|ZP_06123602.1| conserved hypothetical protein [Providencia rettgeri DSM 1131] gi|291315408|gb|EFE55861.1| conserved hypothetical protein [Providencia rettgeri DSM 1131] Length = 201 Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 47/158 (29%), Positives = 71/158 (44%), Gaps = 26/158 (16%) Query: 2 TELEICNWALLKLGQ-RPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 +E+EICN AL ++G R + + +E+S +A+ C L L F W+FA++ + L Sbjct: 3 SEIEICNLALSRIGNSRFINSLSEKSKEAEQCNLHFNHCRDTTLSDFPWNFASKRVALAD 62 Query: 61 LYFTYN----GELKYYIPLKCLKVLNTSEETQ-----------LRGRCIVARG-----DT 100 T N + Y P CLK + + Q L G G D Sbjct: 63 ---TNNPPPDWKFAYSYPTDCLKAIAIIQSGQKYPQPHNAINFLVGSDADGTGKLIYCDQ 119 Query: 101 PRPLSLKYIGEVP-LSDCDPLYQEAFALKLAGELCPPL 137 P+ L+Y V ++ D L+ +A A +LAGEL PL Sbjct: 120 PQAW-LQYTASVTDVNMFDSLFIDALAWRLAGELARPL 156 >gi|262043558|ref|ZP_06016671.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259039092|gb|EEW40250.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 201 Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 40/151 (26%), Positives = 66/151 (43%), Gaps = 24/151 (15%) Query: 4 LEICNWALLKLGQ-RPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPLY 62 +EICN AL +G R + + E S +A C L I +L F W+FAT+++ L Sbjct: 5 IEICNRALSNIGNNRSINSLEEASKEAGQCSLYYESIRDAVLADFDWNFATKNIALADTN 64 Query: 63 F-TYNGELKYYIPLKCLKVLN------------------TSEETQLRGRCIVARGDTPRP 103 + Y P CLK++ ++ GR I D P+ Sbjct: 65 NPPQDWAFAYTYPTDCLKIIEIPVPGVRYPTAAMRVQYVVGADSAGTGRLIYT--DLPQA 122 Query: 104 LSLKYIGEVP-LSDCDPLYQEAFALKLAGEL 133 L+Y+ + ++ DP++QEA + +LA + Sbjct: 123 W-LRYVARITDVNMFDPIFQEALSWRLAAAI 152 >gi|315122634|ref|YP_004063123.1| hypothetical protein CKC_04430 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496036|gb|ADR52635.1| hypothetical protein CKC_04430 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 228 Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 37/128 (28%), Positives = 60/128 (46%), Gaps = 5/128 (3%) Query: 43 LLRSFSWSFATQSMDLLPLY--FTYNGELKYYIPLKCLKVLNTSEETQLRGRCIVARGDT 100 +LR W FA + L L + Y P C+ +++ G+ I+A +T Sbjct: 1 MLRKARWHFAIKRKLLQELDEPLVEDRYHTYISPHDCICIIDHPPSWIFEGKKILA-AET 59 Query: 101 PRPLSLKYIGEVP-LSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAM 159 P PL ++YI + L DCD L+ E A +A ELC L+ + + L+ E E ++ A Sbjct: 60 P-PLKIRYIARLQDLEDCDSLFLEILATNMALELCETLLGSESRKQTLRKEFEMLMDKAK 118 Query: 160 DMDGVEIP 167 +E+P Sbjct: 119 TTGAIEMP 126 >gi|169795390|ref|YP_001713183.1| putative phage related protein [Acinetobacter baumannii AYE] gi|169148317|emb|CAM86182.1| conserved hypothetical protein; putative phage related protein [Acinetobacter baumannii AYE] Length = 186 Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 43/171 (25%), Positives = 77/171 (45%), Gaps = 7/171 (4%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 ++++ ICN AL +G + + ++ E S A CR + +LRS WS A + L P Sbjct: 3 VSKVTICNNALSMIGGQQIASFEEDSKLAQTCRNIYDTTRLSILRSHPWSCAKKRQILSP 62 Query: 61 LYFTYNGELKYYIPLKCLKVLNTSEET---QLRGRCIVARGDTPRPLSLKYIGEVPLSDC 117 + + + PL VL S T ++ R I+A + + L+Y+ + Sbjct: 63 ISTYPSFGYAHAFPLPSDYVLIISANTERYEVENRYILADTEV---VYLEYVFDNDNEQT 119 Query: 118 -DPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIP 167 D + EA K+A +LC P+ D + + + ++ A ++G E P Sbjct: 120 WDAMLVEAMTYKMASKLCKPITGSDAAGQSAEAQFQFLIKQARTVNGQERP 170 >gi|293609615|ref|ZP_06691917.1| predicted protein [Acinetobacter sp. SH024] gi|292828067|gb|EFF86430.1| predicted protein [Acinetobacter sp. SH024] Length = 186 Score = 49.3 bits (116), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 40/171 (23%), Positives = 81/171 (47%), Gaps = 7/171 (4%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 ++++ ICN AL +G + + ++ E S A CR + +LRS WS A + L P Sbjct: 3 VSKVTICNNALSMIGGQQIASFEEDSKLAQTCRNIYDTTRLSILRSHPWSCAKKRQILSP 62 Query: 61 L--YFTYNGELKYYIPLKCLKVLN-TSEETQLRGRCIVARGDTPRPLSLKYIGEVPLSDC 117 + Y ++ + +P + +++ +E ++ R I+A + + L+Y+ + Sbjct: 63 ISPYPSFGYAHAFPLPSDYVLIISANTERYEVENRYILADTEV---VYLEYVFDNDNEQT 119 Query: 118 -DPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIP 167 D + EA K+A +LC P+ D + + + ++ A ++G E P Sbjct: 120 WDAMLVEAMTYKMASKLCKPITGSDAAGQSAEAQFQFLIKQARTVNGQERP 170 >gi|298485989|ref|ZP_07004063.1| predicted phage protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] gi|298159466|gb|EFI00513.1| predicted phage protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] Length = 208 Score = 48.9 bits (115), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 47/176 (26%), Positives = 77/176 (43%), Gaps = 15/176 (8%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWS----FATQSMD 57 T + IC+ ALL LG +P+ ++ E +A L P LLR+ W+ A + D Sbjct: 16 TAVSICSNALLMLGAQPINDFQEAVDRAKIAANLYPSTRDDLLRNHPWNCCIRRAVLAPD 75 Query: 58 LLPLYFTYNGELKYYIPLKCLKVLNT-SEETQL----RGRCIVARGDTPRPLSLKYIGEV 112 P F Y+ + + +P CL+VL S + Q+ G+ I A L LKY+ Sbjct: 76 AEPPAFGYDQQFE--LPSDCLRVLEVGSADAQIDYVVEGKRIQANTTV---LELKYVFRN 130 Query: 113 PLSDC-DPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIP 167 + + D E L +A ++ + ++ +K + E A +DG E P Sbjct: 131 EVENTWDAALVELMTLSMAAKMAYAITQSSAVQQSVKQDLEMAKRRARAIDGQEDP 186 >gi|295096863|emb|CBK85953.1| hypothetical protein ENC_24260 [Enterobacter cloacae subsp. cloacae NCTC 9394] Length = 187 Score = 48.1 bits (113), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 44/176 (25%), Positives = 80/176 (45%), Gaps = 15/176 (8%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPL 61 + + IC+ ALL LG P+ +++E + A C L P + LLR+ W+ A + + L P+ Sbjct: 3 SSVSICSNALLALGAHPINDFDEDTDHARLCANLYPTVRNKLLRAHPWNCAIKRVVLSPV 62 Query: 62 ----YFTYNGELKYYIPLKCLKVLNTSE-----ETQLRGRCIVARGDTPRPLSLKYI-GE 111 F Y ++ +P ++VL+ E + ++ G +++ D R L+YI Sbjct: 63 SAAPVFGYG--YQFSLPGDLIRVLSVGEPRDDIDYRIEGTRLLSNVDVIR---LRYIYRN 117 Query: 112 VPLSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIP 167 S D + + + +L + +R L E+ +L A +DG E P Sbjct: 118 EDESTWDAALVDVAEMMMQSKLAYAVTGSASLRDSLAQEAAFLLKQAKAVDGQEEP 173 >gi|85059167|ref|YP_454869.1| hypothetical protein SG1189 [Sodalis glossinidius str. 'morsitans'] gi|84779687|dbj|BAE74464.1| hypothetical protein [Sodalis glossinidius str. 'morsitans'] Length = 187 Score = 47.8 bits (112), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 43/177 (24%), Positives = 76/177 (42%), Gaps = 11/177 (6%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPL 61 + + IC+ ALL LG P+ +++E + A C + P + LLR+ W A + + L P+ Sbjct: 3 SSVSICSNALLALGAHPINSFDEATPHARLCANIYPSVRNNLLRAHPWHCAIKRVVLSPV 62 Query: 62 YFTYNGELKYYIPLKC--LKVLNTSEE-----TQLRGRCIVARGDTPRPLSLKYIGEVP- 113 Y PL ++VL+ E ++ G+ ++A R L+YI Sbjct: 63 STAPAFGFGYQFPLPADLIRVLSVGERYDDIPYRIEGKRLLANVQEAR---LRYIFRSED 119 Query: 114 LSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPEDR 170 S D + + +L + +R L E+ +L A ++G E P +R Sbjct: 120 ESTWDVALVNVAEVTMQAKLAYAVTGSASLRDSLTQEAAFLLQQAKSINGQEDPPER 176 >gi|48696642|ref|YP_024421.1| hypothetical protein VP2p14 [Vibrio phage VP2] gi|40950040|gb|AAR97631.1| hypothetical protein [Vibrio phage VP2] Length = 741 Score = 47.8 bits (112), Expect = 6e-04, Method: Composition-based stats. Identities = 45/180 (25%), Positives = 80/180 (44%), Gaps = 19/180 (10%) Query: 4 LEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP--- 60 + I N AL +G +P+ + + + +A+ C + + L + WSFA + LLP Sbjct: 5 VSIYNKALAHVGAKPVLSETDANTRAEMCNAMYEFALRSALEAREWSFAAMRVRLLPDPL 64 Query: 61 LYFTYNGELKYYIPLKCLKVLNTSEETQLRGRCIVARG--DTPRP-LSLKY--IGEVPLS 115 + + Y +P C++V + + G RP + +KY + E P + Sbjct: 65 ITPAWGYNYGYRLPNDCIRVCEVRDGNSYNIDWAIESGFLYATRPTVDVKYNRLVEDP-N 123 Query: 116 DCDPLYQEAFALKLAGELCPPL-----IMDDQMRSYLKGESERVLAVAMDMDGVEIPEDR 170 P + EA +LKLA ++C PL + +D + YL SE +DG++ +R Sbjct: 124 AFTPTFVEALSLKLASDICIPLTENRGLGNDLISRYLMAVSE-----GAALDGLQASRER 178 >gi|48696686|ref|YP_024980.1| major tail subunit [Vibrio phage VP5] gi|40806149|gb|AAR92067.1| major tail subunit [Vibrio phage VP5] Length = 743 Score = 47.8 bits (112), Expect = 6e-04, Method: Composition-based stats. Identities = 45/180 (25%), Positives = 80/180 (44%), Gaps = 19/180 (10%) Query: 4 LEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP--- 60 + I N AL +G +P+ + + + +A+ C + + L + WSFA + LLP Sbjct: 5 VSIYNKALAHVGAKPVLSETDANTRAEMCNAMYEFALRSALEAREWSFAAMRVRLLPDPL 64 Query: 61 LYFTYNGELKYYIPLKCLKVLNTSEETQLRGRCIVARG--DTPRP-LSLKY--IGEVPLS 115 + + Y +P C++V + + G RP + +KY + E P + Sbjct: 65 ITPAWGYNYGYRLPNDCIRVCEVRDGNSYNIDWAIESGFLYATRPTVDVKYNRLVEDP-N 123 Query: 116 DCDPLYQEAFALKLAGELCPPL-----IMDDQMRSYLKGESERVLAVAMDMDGVEIPEDR 170 P + EA +LKLA ++C PL + +D + YL SE +DG++ +R Sbjct: 124 AFTPTFVEALSLKLASDICIPLTENRGLGNDLISRYLMAVSE-----GAALDGLQASRER 178 >gi|225158771|ref|ZP_03725088.1| GP11 [Opitutaceae bacterium TAV2] gi|224802606|gb|EEG20861.1| GP11 [Opitutaceae bacterium TAV2] Length = 203 Score = 47.0 bits (110), Expect = 9e-04, Method: Compositional matrix adjust. Identities = 44/187 (23%), Positives = 79/187 (42%), Gaps = 18/187 (9%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 MTE++ICN AL + + + + ++ + A C+L + +LR W A + + L P Sbjct: 1 MTEVDICNQALGLVSENGIASLSDATKSARLCQLHFAATVREVLRDGQWRCARRRVALAP 60 Query: 61 LYF--TYNGELKYYIPLKCLKV--LNTSEET-------QLRGRCIVARGDTPRPLSLKYI 109 L + + +P L+ LN E ++ GR ++ T R + + + Sbjct: 61 LAQPPAFEWSYGFQLPGDFLRTIRLNGHEVGSPDMPLFEIEGRTLLMNEGTARIVYTRDV 120 Query: 110 -------GEVPLSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMD 162 G ++D DPL A LA +L P +R L E + V+A A + + Sbjct: 121 TQQIDDPGAAHIADLDPLCARAIVTALASKLAWPFQQSRTLRESLLSEYDGVIARARNAN 180 Query: 163 GVEIPED 169 + E+ Sbjct: 181 SRDAFEN 187 >gi|78219993|gb|ABB39342.1| hypothetical protein Dde_2546 [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] Length = 320 Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust. Identities = 49/199 (24%), Positives = 77/199 (38%), Gaps = 29/199 (14%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFAT------QS 55 +E+ ICN AL +G + + ++ S A C P + LL + W+FAT Q Sbjct: 119 SEVSICNKALRYIGAEEIASLSQESRGARLCAQYYPEVRDELLEAHHWNFATRYAVLPQV 178 Query: 56 MDLLPLYFTYNGELKYYIPLKCLKVLNTSEETQ--LRGRCIVARGDT------------- 100 P F + Y +P CL+V + L+G VA +T Sbjct: 179 AQEPPFGFLH----AYRLPQDCLRVRRVLQHAAGVLQGGAAVADDETGIRTFEVVEGGIL 234 Query: 101 ---PRPLSLKYIGEVPLSDCDP-LYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLA 156 P V + C P L+ EA A KLA ++ L ++ + L+ +++ L Sbjct: 235 YTDASPARAVMTVRVTETACFPALFAEALARKLAADMAVALTGSTRLYAGLRDGADKALD 294 Query: 157 VAMDMDGVEIPEDRGYHNG 175 A D E D + G Sbjct: 295 TARTADAGEEHADPQEYEG 313 >gi|262043404|ref|ZP_06016529.1| gp11 family protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259039230|gb|EEW40376.1| gp11 family protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 187 Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust. Identities = 41/172 (23%), Positives = 75/172 (43%), Gaps = 11/172 (6%) Query: 4 LEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPLYF 63 + IC+ ALL LG P+ +++E + A C L P + LLR+ W+ + + L P+ Sbjct: 5 ISICSNALLALGAHPINSFDENTDHARLCSNLYPTVRNKLLRAHPWNCVVKRVVLSPVST 64 Query: 64 --TYNGELKYYIPLKCLKVLNTSEET-----QLRGRCIVARGDTPRPLSLKYI-GEVPLS 115 + ++ +P L+VL+ E ++ G ++A L L+Y+ S Sbjct: 65 APVFGFRFQFALPGDLLRVLSVGEPCDDIPYRVEGNRLLANVQV---LKLRYVFRNEDES 121 Query: 116 DCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIP 167 D + + + ++ + +R L ES +L A +DG E P Sbjct: 122 TWDAALVDVAEMMMQAKMAYAVTGSTSLRDSLAQESLVLLKQAKAVDGQEDP 173 >gi|144898782|emb|CAM75646.1| conserved hypothetical protein [Magnetospirillum gryphiswaldense MSR-1] Length = 189 Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust. Identities = 38/158 (24%), Positives = 67/158 (42%), Gaps = 6/158 (3%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 ++ + +C+ ALLKLG + +++E + +++ L P + L S+ W FAT + L Sbjct: 3 LSAIALCSRALLKLGATTIASFDEGTAESEVAANLYPSVRDATLSSYPWKFATGQVTLPR 62 Query: 61 LYF--TYNGELKYYIPLKCLKVLNTSEETQLRG---RCIVARGD-TPRPLSLKYIGEVPL 114 L + + Y +P L+ ++ Q G R +R ++L YI Sbjct: 63 LVAQPVADYSMAYQLPADFLRAMSAGVAGQGYGLSYRIAESRLHCDADEVTLTYIFRPDE 122 Query: 115 SDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESE 152 P + + +LA E C PL +L SE Sbjct: 123 MAFPPFFDQVLIARLAAEFCIPLTESTSRAEFLYRLSE 160 >gi|282848882|ref|ZP_06258272.1| hypothetical protein HMPREF1035_1391 [Veillonella parvula ATCC 17745] gi|282581387|gb|EFB86780.1| hypothetical protein HMPREF1035_1391 [Veillonella parvula ATCC 17745] Length = 197 Score = 45.8 bits (107), Expect = 0.002, Method: Compositional matrix adjust. Identities = 49/190 (25%), Positives = 77/190 (40%), Gaps = 21/190 (11%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 MT +ICN AL + L ++ E + + CR+ P + +LR F W+FA ++ L+ Sbjct: 1 MTPTDICNQALALINAGLLYSFEEETEQGRQCRMQYDPTRQLVLRQFEWNFARKNERLVL 60 Query: 61 LYFTYNG-ELKYYIPLKCLKVLNT-------SEETQ---------LRGRCIVARGDTPRP 103 NG Y P +C+++L E+Q +CIV+ D P Sbjct: 61 SAHKINGWNYVYAYPERCIRILGVIPQGDRFHAESQPEYNIFNIGNNKKCIVS--DVPLA 118 Query: 104 LSLKYIGEVPLSDC-DPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMD 162 + YI +V D D + KLA L PL D + + + A M+ Sbjct: 119 F-IDYIYDVTDLDVWDSISLYMLQCKLASALAMPLTGDRGLFDQAYKLYQAAVQEAKGMN 177 Query: 163 GVEIPEDRGY 172 E +D Y Sbjct: 178 AKERKQDTVY 187 >gi|320175039|gb|EFW50152.1| 11 [Shigella dysenteriae CDC 74-1112] Length = 201 Score = 45.1 bits (105), Expect = 0.004, Method: Compositional matrix adjust. Identities = 45/176 (25%), Positives = 71/176 (40%), Gaps = 30/176 (17%) Query: 4 LEICNWALLKLGQ-RPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPLY 62 +EICN AL +G R + + E S +A C L +L F W+FAT+ + L Sbjct: 5 VEICNRALSNIGNSRSINSLTEASKEAGECSLHFEACRDAVLSDFDWNFATKRVSLAD-- 62 Query: 63 FTYN----GELKYYIPLKCLKVL------------------NTSEETQLRGRCIVARGDT 100 T N E Y P CL++ +T G+ I D Sbjct: 63 -TSNPPPDWEYAYQYPSDCLRITEIMLPGVRNPTAAMRVQYEVGADTNGTGKLIYT--DQ 119 Query: 101 PRPLSLKYIGEVP-LSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVL 155 P+ LKY+ V ++ D ++ EA A +LA + L + + ++ RV+ Sbjct: 120 PQAW-LKYVSRVTDVNMFDAIFMEALAWRLAAAINMALTGNADLGTFALNMYNRVI 174 >gi|288959324|ref|YP_003449665.1| hypothetical protein AZL_024830 [Azospirillum sp. B510] gi|288911632|dbj|BAI73121.1| hypothetical protein AZL_024830 [Azospirillum sp. B510] Length = 189 Score = 44.7 bits (104), Expect = 0.004, Method: Compositional matrix adjust. Identities = 41/173 (23%), Positives = 72/173 (41%), Gaps = 14/173 (8%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 +T + +C AL+KLG + ++++ S +A+ L P LL + +WSFA+ L Sbjct: 3 LTAIGLCGRALIKLGATAIASFDDGSAEAEVAAALYGPTRDALLSASAWSFASVQATLPR 62 Query: 61 LYF--TYNGELKYYIPLKCLKVLNTSEET-------QLRGRCIVARGDTPRPLSLKYIGE 111 L + + +P L+ L ++ GR ++ ++L YIG Sbjct: 63 LAEPPVADYAFAFQLPADLLRALGAGGNGRGRGLPYRIDGRRLLCDAGA---VTLSYIGR 119 Query: 112 VPLSDCDP-LYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDG 163 P + P + +A +LA E C PL +L +E A +D Sbjct: 120 -PAEEAFPAFFDQALIARLAAEFCLPLTESSTRAEFLAQLAESEFRRARQIDA 171 >gi|323699390|ref|ZP_08111302.1| hypothetical protein DND132_1981 [Desulfovibrio sp. ND132] gi|323459322|gb|EGB15187.1| hypothetical protein DND132_1981 [Desulfovibrio desulfuricans ND132] Length = 193 Score = 44.7 bits (104), Expect = 0.005, Method: Compositional matrix adjust. Identities = 43/176 (24%), Positives = 74/176 (42%), Gaps = 13/176 (7%) Query: 4 LEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPLYF 63 +EICN ALL LG+ + + + S A C P + +LR+ W+ A +L Sbjct: 6 IEICNNALLDLGEDAIMSLTDDSKAAGLCNHRWPAVRDAVLRAHPWNCAMAQAELAAGTT 65 Query: 64 T--YNGELKYYIPLKCLKVLNTS-------EETQLRGRCIVARGDTPRPLSLKYI-GEVP 113 + K+ +P L+++ + E+ +++ I+ D P+ + Y+ E Sbjct: 66 APLWKWAYKFVLPTDFLRIIAITGVDGEDVEDWEIQSGIILC--DEEAPIFIAYVRRETD 123 Query: 114 LSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVE-IPE 168 D L EA + +LA L PL + + LA A +D E +PE Sbjct: 124 PKKYDALLSEALSARLAATLAYPLSGSTALAQSCWKTYQEKLAEARGVDAREGVPE 179 >gi|218700983|ref|YP_002408612.1| hypothetical protein ECIAI39_2673 [Escherichia coli IAI39] gi|218370969|emb|CAR18796.1| conserved hypothetical protein from phage origin [Escherichia coli IAI39] Length = 201 Score = 44.7 bits (104), Expect = 0.005, Method: Compositional matrix adjust. Identities = 45/176 (25%), Positives = 71/176 (40%), Gaps = 30/176 (17%) Query: 4 LEICNWALLKLGQ-RPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPLY 62 +EICN AL +G R + + E S +A C L +L F W+FAT+ + L Sbjct: 5 VEICNRALSNIGNSRSINSLTEASKEAGECSLHFEACRDAVLSDFDWNFATKRVALAD-- 62 Query: 63 FTYN----GELKYYIPLKCLKVL------------------NTSEETQLRGRCIVARGDT 100 T N E Y P CL++ +T G+ I D Sbjct: 63 -TSNPPPDWEYAYQYPSDCLRITEIMLPGVRNPTAAMRVQYEVGADTNGTGKLIYT--DQ 119 Query: 101 PRPLSLKYIGEVP-LSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVL 155 P+ LKY+ V ++ D ++ EA A +LA + L + + ++ RV+ Sbjct: 120 PQAW-LKYVSRVSDVNMFDAIFMEALAWRLAAAINMALTGNADLGTFALNMYNRVI 174 >gi|309702805|emb|CBJ02136.1| hypothetical phage protein [Escherichia coli ETEC H10407] Length = 201 Score = 44.7 bits (104), Expect = 0.005, Method: Compositional matrix adjust. Identities = 45/176 (25%), Positives = 71/176 (40%), Gaps = 30/176 (17%) Query: 4 LEICNWALLKLGQ-RPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPLY 62 +EICN AL +G R + + E S +A C L +L F W+FAT+ + L Sbjct: 5 VEICNRALSNIGNSRSINSLTEASKEAGECSLHFEACRDAVLSDFDWNFATKRVALAD-- 62 Query: 63 FTYN----GELKYYIPLKCLKVL------------------NTSEETQLRGRCIVARGDT 100 T N E Y P CL++ +T G+ I D Sbjct: 63 -TSNPPPDWEYAYQYPSDCLRITEIMLPGVRNPTAAMRVQYEVGADTNGTGKLIYT--DQ 119 Query: 101 PRPLSLKYIGEVP-LSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVL 155 P+ LKY+ V ++ D ++ EA A +LA + L + + ++ RV+ Sbjct: 120 PQAW-LKYVSRVTDVNMFDAIFMEALAWRLAAAINMALTGNTDLGTFALNMYNRVI 174 >gi|117624705|ref|YP_853618.1| hypothetical protein APECO1_4048 [Escherichia coli APEC O1] gi|301046401|ref|ZP_07193561.1| conserved hypothetical protein [Escherichia coli MS 185-1] gi|115513829|gb|ABJ01904.1| conserved hypothetical protein [Escherichia coli APEC O1] gi|300301627|gb|EFJ58012.1| conserved hypothetical protein [Escherichia coli MS 185-1] gi|324008553|gb|EGB77772.1| hypothetical protein HMPREF9532_01740 [Escherichia coli MS 57-2] gi|332344347|gb|AEE57681.1| conserved hypothetical protein [Escherichia coli UMNK88] Length = 201 Score = 44.3 bits (103), Expect = 0.006, Method: Compositional matrix adjust. Identities = 45/176 (25%), Positives = 71/176 (40%), Gaps = 30/176 (17%) Query: 4 LEICNWALLKLGQ-RPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPLY 62 +EICN AL +G R + + E S +A C L +L F W+FAT+ + L Sbjct: 5 VEICNRALSNIGNSRSINSLTEASKEAGECSLHFEACRDAVLSDFDWNFATKRVALAD-- 62 Query: 63 FTYN----GELKYYIPLKCLKVL------------------NTSEETQLRGRCIVARGDT 100 T N E Y P CL++ +T G+ I D Sbjct: 63 -TSNPPPDWEYAYQYPSDCLRITEIMLPGVRNPTAAMRVQYEVGADTNGTGKLIYT--DQ 119 Query: 101 PRPLSLKYIGEVP-LSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVL 155 P+ LKY+ V ++ D ++ EA A +LA + L + + ++ RV+ Sbjct: 120 PQAW-LKYVSRVSDVNMFDAIFMEALAWRLAAAINMALTGNADLGTFALNMYNRVI 174 >gi|83718530|ref|YP_441473.1| gp11 [Burkholderia thailandensis E264] gi|257139845|ref|ZP_05588107.1| gp11 [Burkholderia thailandensis E264] gi|83652355|gb|ABC36418.1| gp11 [Burkholderia thailandensis E264] Length = 200 Score = 44.3 bits (103), Expect = 0.006, Method: Compositional matrix adjust. Identities = 20/60 (33%), Positives = 36/60 (60%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPL 61 + + IC+ ALL+LG +P+ ++ E + +A C L P + +LR+ W+ T+ + L PL Sbjct: 3 SSISICSNALLQLGDKPIASFTEPTDRAMVCSNLYPEVRDAVLRAHPWNSCTKRVVLAPL 62 >gi|89152435|ref|YP_512268.1| hypothetical protein PhiV10p14 [Escherichia phage phiV10] gi|74055458|gb|AAZ95907.1| hypothetical protein PhiV10p14 [Escherichia phage phiV10] gi|294493853|gb|ADE92609.1| conserved hypothetical protein [Escherichia coli IHE3034] gi|323156126|gb|EFZ42285.1| hypothetical protein ECEPECA14_1901 [Escherichia coli EPECa14] Length = 201 Score = 44.3 bits (103), Expect = 0.007, Method: Compositional matrix adjust. Identities = 45/176 (25%), Positives = 71/176 (40%), Gaps = 30/176 (17%) Query: 4 LEICNWALLKLGQ-RPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPLY 62 +EICN AL +G R + + E S +A C L +L F W+FAT+ + L Sbjct: 5 VEICNRALSNIGNSRSINSLTEASKEAGECSLHFEACRDAVLSDFDWNFATKRVALAD-- 62 Query: 63 FTYN----GELKYYIPLKCLKVL------------------NTSEETQLRGRCIVARGDT 100 T N E Y P CL++ +T G+ I D Sbjct: 63 -TSNPPPDWEYAYQYPSDCLRITEIMLPGVRNPTAAMRVQYEVGADTNGTGKLIYT--DQ 119 Query: 101 PRPLSLKYIGEVP-LSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVL 155 P+ LKY+ V ++ D ++ EA A +LA + L + + ++ RV+ Sbjct: 120 PQAW-LKYVSRVTDVNMFDAIFMEALAWRLAAAINMALTGNADLGTFALNMYNRVI 174 >gi|323948678|gb|EGB44583.1| hypothetical protein ERKG_04901 [Escherichia coli H252] gi|327252177|gb|EGE63849.1| hypothetical protein ECSTEC7V_3024 [Escherichia coli STEC_7v] Length = 201 Score = 44.3 bits (103), Expect = 0.007, Method: Compositional matrix adjust. Identities = 45/176 (25%), Positives = 71/176 (40%), Gaps = 30/176 (17%) Query: 4 LEICNWALLKLGQ-RPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPLY 62 +EICN AL +G R + + E S +A C L +L F W+FAT+ + L Sbjct: 5 VEICNRALSNIGNSRSINSLTEASKEAGECSLHFEACRDAVLSDFDWNFATKRVALAD-- 62 Query: 63 FTYN----GELKYYIPLKCLKVL------------------NTSEETQLRGRCIVARGDT 100 T N E Y P CL++ +T G+ I D Sbjct: 63 -TSNPPPDWEYAYQYPSDCLRITEIMLPGVRNPTAAMRVQYEVGADTDGTGKLIYT--DQ 119 Query: 101 PRPLSLKYIGEVP-LSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVL 155 P+ LKY+ V ++ D ++ EA A +LA + L + + ++ RV+ Sbjct: 120 PQAW-LKYVSRVSDVNMFDAIFMEALAWRLAAAINMALTGNADLGTFALNMYNRVI 174 >gi|300898434|ref|ZP_07116775.1| conserved hypothetical protein [Escherichia coli MS 198-1] gi|331648169|ref|ZP_08349259.1| conserved hypothetical protein [Escherichia coli M605] gi|300357901|gb|EFJ73771.1| conserved hypothetical protein [Escherichia coli MS 198-1] gi|331043029|gb|EGI15169.1| conserved hypothetical protein [Escherichia coli M605] Length = 201 Score = 43.9 bits (102), Expect = 0.009, Method: Compositional matrix adjust. Identities = 45/176 (25%), Positives = 71/176 (40%), Gaps = 30/176 (17%) Query: 4 LEICNWALLKLGQ-RPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPLY 62 +EICN AL +G R + + E S +A C L +L F W+FAT+ + L Sbjct: 5 VEICNRALSNIGNSRSINSLTEASKEAGECSLHFEACRDAVLSDFDWNFATKRVALAD-- 62 Query: 63 FTYN----GELKYYIPLKCLKVL------------------NTSEETQLRGRCIVARGDT 100 T N E Y P CL++ +T G+ I D Sbjct: 63 -TSNPPPDWEYAYQYPSDCLRITEIMLPGVRNPTAAMRVQYEVGVDTNGTGKLIYT--DQ 119 Query: 101 PRPLSLKYIGEVP-LSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVL 155 P+ LKY+ V ++ D ++ EA A +LA + L + + ++ RV+ Sbjct: 120 PQAW-LKYVSRVTDVNMFDAIFMEALAWRLAAAINMALTGNADLGTFALNMYNRVI 174 >gi|298381711|ref|ZP_06991310.1| hypothetical protein ECFG_01448 [Escherichia coli FVEC1302] gi|298279153|gb|EFI20667.1| hypothetical protein ECFG_01448 [Escherichia coli FVEC1302] Length = 201 Score = 43.5 bits (101), Expect = 0.013, Method: Compositional matrix adjust. Identities = 44/177 (24%), Positives = 71/177 (40%), Gaps = 32/177 (18%) Query: 4 LEICNWALLKLGQ-RPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLL--- 59 +EICN AL +G R + + E S +A C L +L F W+FAT+ + L Sbjct: 5 VEICNRALSNIGNSRSINSLTEASKEAGECSLHFDACRDAVLSDFDWNFATKRVALADTS 64 Query: 60 --PLYFTYNGELKYYIPLKCLKVL------------------NTSEETQLRGRCIVARGD 99 P + Y Y P CL++ +T G+ I D Sbjct: 65 NPPPDWAY----AYQYPSDCLRITEIMLPGVRNPTAAMRVQYEVGADTDGTGKLIYT--D 118 Query: 100 TPRPLSLKYIGEVP-LSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVL 155 P+ LKY+ V ++ D ++ EA A +LA + L + + ++ RV+ Sbjct: 119 QPQAW-LKYVSRVSDVNMFDAIFMEALAWRLAAAINMALTGNADLGTFALNMYNRVI 174 >gi|215487814|ref|YP_002330245.1| hypothetical protein E2348C_2747 [Escherichia coli O127:H6 str. E2348/69] gi|215265886|emb|CAS10295.1| predicted protein [Escherichia coli O127:H6 str. E2348/69] Length = 201 Score = 43.1 bits (100), Expect = 0.015, Method: Compositional matrix adjust. Identities = 43/157 (27%), Positives = 67/157 (42%), Gaps = 36/157 (22%) Query: 4 LEICNWALLKLGQ-RPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLL--- 59 +EICN AL +G R + + NE S +A C L L F W+FAT+ + L Sbjct: 5 IEICNRALSNIGNSRSINSLNEASKEAGQCSLHFDACRDAALADFDWNFATKRVALADTN 64 Query: 60 --PLYFTYNGELKYYIPLKCLK--------------------VLNTSEETQLRGRCIVAR 97 P + Y Y P C++ V+ ++EE L G+ I Sbjct: 65 NPPPDWQY----AYQYPSDCVRITEIMPTGIRNPTAAQRIEYVVGSNEE--LTGKLIYT- 117 Query: 98 GDTPRPLSLKYIGEVP-LSDCDPLYQEAFALKLAGEL 133 D P+ LKY+ V ++ D ++ EA + +LA + Sbjct: 118 -DQPKAW-LKYVARVTDVNMYDAIFMEALSWRLAAAI 152 >gi|330007162|ref|ZP_08305904.1| hypothetical protein HMPREF9538_03593 [Klebsiella sp. MS 92-3] gi|328535509|gb|EGF61969.1| hypothetical protein HMPREF9538_03593 [Klebsiella sp. MS 92-3] Length = 203 Score = 43.1 bits (100), Expect = 0.015, Method: Compositional matrix adjust. Identities = 40/178 (22%), Positives = 67/178 (37%), Gaps = 24/178 (13%) Query: 4 LEICNWALLKLGQ-RPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPLY 62 + ICN AL ++G R + + E++ +A C L I +L W+FA + L L Sbjct: 5 INICNIALARIGNSRTINSLTEKTKEAYTCNLFYESIRDAVLADNDWNFAMSRVVLADLS 64 Query: 63 FTYNGEL-KYYIPLKCLK--------------------VLNTSEETQLRGRCIVARGDTP 101 G L +Y P C + V GR I Sbjct: 65 DPAPGWLFRYQYPTDCARIAAILPKWFTGSHIVLQDKPVFEVGSNEDGTGRVIHTNESQA 124 Query: 102 RPLSLKYIGEVPLSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAM 159 L +K I + + D L+ +A + ++A E+ P+ + + ++VL AM Sbjct: 125 VLLYVKRITDPTM--FDALFADALSWRMAAEIAMPITANASLGQQAMANYQQVLTAAM 180 >gi|332875219|ref|ZP_08443052.1| hypothetical protein HMPREF0022_02685 [Acinetobacter baumannii 6014059] gi|332736663|gb|EGJ67657.1| hypothetical protein HMPREF0022_02685 [Acinetobacter baumannii 6014059] Length = 172 Score = 42.7 bits (99), Expect = 0.018, Method: Compositional matrix adjust. Identities = 35/158 (22%), Positives = 72/158 (45%), Gaps = 7/158 (4%) Query: 14 LGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPL--YFTYNGELKY 71 +G + + ++ E S A CR + +LRS WS A + L P+ Y ++ + Sbjct: 2 IGGQQIASFEEDSKLAQTCRNIYDTTRLSILRSHPWSCAKKRQILSPISTYPSFGYAHAF 61 Query: 72 YIPLKCLKVLN-TSEETQLRGRCIVARGDTPRPLSLKYIGEVPLSDC-DPLYQEAFALKL 129 +P + +++ +E ++ R I+A + + L+Y+ + D + EA K+ Sbjct: 62 PLPSDYVLIISANTEHYEVENRHILANAEV---IHLEYVFDNDNEQTWDAMLVEAMTYKM 118 Query: 130 AGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIP 167 A +LC P+ D + + + ++ A ++G E P Sbjct: 119 ASKLCKPITGSDAAGQSAEAQFQFLIKQARTVNGQERP 156 >gi|120601702|ref|YP_966102.1| hypothetical protein Dvul_0652 [Desulfovibrio vulgaris DP4] gi|120561931|gb|ABM27675.1| conserved hypothetical protein [Desulfovibrio vulgaris DP4] Length = 184 Score = 42.4 bits (98), Expect = 0.024, Method: Compositional matrix adjust. Identities = 26/83 (31%), Positives = 40/83 (48%), Gaps = 2/83 (2%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPL 61 +++ ICN AL LG + + ++ S +AD C LL S W+FAT+ L P+ Sbjct: 3 SDVAICNRALRYLGAPEILSLDQHSREADLCARHYVETRDELLESHHWNFATRYARLAPM 62 Query: 62 YFT--YNGELKYYIPLKCLKVLN 82 + Y +P CL+V N Sbjct: 63 ESAPPFGYARSYRLPGDCLRVRN 85 >gi|220903986|ref|YP_002479298.1| hypothetical protein Ddes_0712 [Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774] gi|219868285|gb|ACL48620.1| conserved hypothetical protein [Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774] Length = 196 Score = 42.4 bits (98), Expect = 0.026, Method: Compositional matrix adjust. Identities = 31/110 (28%), Positives = 48/110 (43%), Gaps = 15/110 (13%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQ----SM 56 ++++++ N AL LG R + + E +A C L + +LR F WSFA + ++ Sbjct: 3 ISQIDVWNRALGFLGARSVASERESIPEALQCSLYWDSARRQVLRDFPWSFAQRRAWLAL 62 Query: 57 DLLPLYFTYNGELKYYIPLKCLKVLNTSEETQLRGRCIVARGDTPRPLSL 106 LP + Y +P CLKV E G +PRP L Sbjct: 63 QALPQGYAPEYRFAYALPENCLKVHEVRHE-----------GLSPRPFCL 101 >gi|41179375|ref|NP_958683.1| Bbp14 [Bordetella phage BPP-1] gi|45569507|ref|NP_996576.1| hypothetical protein BMP-1p13 [Bordetella phage BMP-1] gi|45580758|ref|NP_996624.1| hypothetical protein BIP-1p13 [Bordetella phage BIP-1] gi|40950114|gb|AAR97680.1| Bbp14 [Bordetella phage BPP-1] Length = 223 Score = 42.0 bits (97), Expect = 0.033, Method: Compositional matrix adjust. Identities = 43/164 (26%), Positives = 62/164 (37%), Gaps = 27/164 (16%) Query: 2 TELEICNWALLKLGQRPLE---NWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDL 58 +E++ICN AL LG N E S++A+YC P LL +W FAT+ L Sbjct: 3 SEVDICNLALAYLGDEATVAGINPPEGSVQAEYCARFYPFARDSLLELHTWGFATKCAQL 62 Query: 59 LPLYFTY-NGELKYYIPLKCLKVLNTSEETQLRGRCIVARGDTPRPLS------------ 105 + + Y P +K++ I A D +P S Sbjct: 63 AAMGISRPEWRFAYAQPADAIKIVAVLPHDAAN---IEAGIDNAQPFSCEIDNTGADIIL 119 Query: 106 -------LKYIGEVP-LSDCDPLYQEAFALKLAGELCPPLIMDD 141 +YI V + PL+ +A A LA L PL+ D Sbjct: 120 TNQVNAVARYISLVKDTTKFSPLFVQALAWHLASMLAGPLLKGD 163 >gi|291336929|gb|ADD96457.1| hypothetical protein ALOHA_HF400048F7ctg1g16 [uncultured organism MedDCM-OCT-S09-C787] Length = 193 Score = 42.0 bits (97), Expect = 0.034, Method: Compositional matrix adjust. Identities = 30/114 (26%), Positives = 52/114 (45%), Gaps = 8/114 (7%) Query: 4 LEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPLYF 63 ++ICN AL +LG + E S A C + L RS W+ + ++L Sbjct: 5 VDICNGALNQLGASTILTLTEDSKNARLCNARYTQVRDSLFRSHPWNCLIKRVELAKDTA 64 Query: 64 T--YNGELKYYIPLKCLKV---LNTSEETQLRGRCIVARGDTPRPLSLKYIGEV 112 T + ++ +P CL+V LN + ++ GR IVA T + ++Y+ + Sbjct: 65 TPSWGFSYQFTLPADCLRVLTILNYDYDYKIEGRKIVANHGT---VKIQYVSRI 115 >gi|218886167|ref|YP_002435488.1| hypothetical protein DvMF_1066 [Desulfovibrio vulgaris str. 'Miyazaki F'] gi|218757121|gb|ACL08020.1| conserved hypothetical protein [Desulfovibrio vulgaris str. 'Miyazaki F'] Length = 198 Score = 41.6 bits (96), Expect = 0.038, Method: Compositional matrix adjust. Identities = 26/81 (32%), Positives = 37/81 (45%), Gaps = 2/81 (2%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPL 61 +E+ ICN AL LG + ++ S +AD C LL W+FAT+ L PL Sbjct: 17 SEVSICNKALRYLGAPEIVALDQPSREADLCSRYYAEARDELLEHHHWNFATRYTSLAPL 76 Query: 62 YFT--YNGELKYYIPLKCLKV 80 + Y +P CL+V Sbjct: 77 AAVPPFGFACAYRLPGDCLRV 97 >gi|212703341|ref|ZP_03311469.1| hypothetical protein DESPIG_01384 [Desulfovibrio piger ATCC 29098] gi|212673187|gb|EEB33670.1| hypothetical protein DESPIG_01384 [Desulfovibrio piger ATCC 29098] Length = 187 Score = 41.6 bits (96), Expect = 0.038, Method: Compositional matrix adjust. Identities = 32/106 (30%), Positives = 43/106 (40%), Gaps = 17/106 (16%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 +T++ I N AL LG R + +E + +A CRL + LR + W+FA LP Sbjct: 4 LTQIAIWNRALGFLGARSIAAEDENTPEALQCRLYWDSARRQALRDYPWAFA------LP 57 Query: 61 LYFTYNGELKYYIPLKCLKVLNTSEETQLRGRCIVARGDTPRPLSL 106 F Y +P CLK E G PRP L Sbjct: 58 TGFELEYRFAYALPESCLKAHEVRHE-----------GLLPRPFCL 92 >gi|46581002|ref|YP_011810.1| hypothetical protein DVU2598 [Desulfovibrio vulgaris str. Hildenborough] gi|46450423|gb|AAS97070.1| hypothetical protein DVU_2598 [Desulfovibrio vulgaris str. Hildenborough] gi|311234687|gb|ADP87541.1| hypothetical protein Deval_2398 [Desulfovibrio vulgaris RCH1] Length = 184 Score = 41.6 bits (96), Expect = 0.038, Method: Compositional matrix adjust. Identities = 42/172 (24%), Positives = 69/172 (40%), Gaps = 5/172 (2%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPL 61 +++ ICN AL LG + + ++ S +AD C LL S W+FAT+ L P+ Sbjct: 3 SDVAICNRALRYLGAPEILSLDQHSREADLCARHYVETRDELLESHHWNFATRYARLAPM 62 Query: 62 YFT--YNGELKYYIPLKCLKV--LNTSEETQLRGRCIVARGDTPRPLSLKYIGEVPLSDC 117 + Y +P CL+V L + ++ ++ TP L P Sbjct: 63 ESAPPFGYARSYRLPGDCLRVRHLRAGDPFEVVEDRVLYTDATPAEAVLTVRVTDP-GRF 121 Query: 118 DPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPED 169 L+ E A +LA L PL+ + ++ + +A D E D Sbjct: 122 PALFTEMLARRLASSLAVPLMGNARLEQTMLQRFAAAFDLARMADAAEGAAD 173 >gi|187476935|ref|YP_784959.1| phage protein [Bordetella avium 197N] gi|115421521|emb|CAJ48030.1| phage protein [Bordetella avium 197N] Length = 223 Score = 41.6 bits (96), Expect = 0.039, Method: Compositional matrix adjust. Identities = 43/164 (26%), Positives = 62/164 (37%), Gaps = 27/164 (16%) Query: 2 TELEICNWALLKLGQRPLE---NWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDL 58 +E++ICN AL LG N E S++A+YC P LL +W FAT+ L Sbjct: 3 SEVDICNLALAYLGDEATVAGINPPEGSVQAEYCARFYPFARDSLLELHTWGFATKCAQL 62 Query: 59 LPLYFTY-NGELKYYIPLKCLKVLNTSEETQLRGRCIVARGDTPRPLS------------ 105 + + Y P +K++ I A D +P S Sbjct: 63 AAMDISRPEWHFAYAQPADAIKIVAVLPHDAAN---IEAGIDNAQPFSCEIDDTGADIIL 119 Query: 106 -------LKYIGEVP-LSDCDPLYQEAFALKLAGELCPPLIMDD 141 +YI V + PL+ +A A LA L PL+ D Sbjct: 120 SNQVNAVARYISLVKDTTKFSPLFVQALAWHLASMLAGPLLKGD 163 >gi|291334664|gb|ADD94311.1| conserved hypothetical protein [uncultured phage MedDCM-OCT-S04-C695] Length = 132 Score = 41.6 bits (96), Expect = 0.045, Method: Compositional matrix adjust. Identities = 30/114 (26%), Positives = 52/114 (45%), Gaps = 8/114 (7%) Query: 4 LEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPLYF 63 ++ICN AL +LG + E S A C + L RS W+ + ++L Sbjct: 5 VDICNGALNQLGASTILTLTEDSKNARLCNARYTQVRDSLFRSHPWNCLIKRVELAKDTA 64 Query: 64 T--YNGELKYYIPLKCLKV---LNTSEETQLRGRCIVARGDTPRPLSLKYIGEV 112 T + ++ +P CL+V LN + ++ GR IVA T + ++Y+ + Sbjct: 65 TPSWGFSYQFTLPADCLRVLTILNYDYDYKIEGRKIVANHGT---VKIQYVSRI 115 >gi|291334459|gb|ADD94113.1| hypothetical protein Desal_1440 [uncultured phage MedDCM-OCT-S04-C1161] gi|291334516|gb|ADD94169.1| hypothetical protein Desal_1440 [uncultured phage MedDCM-OCT-S04-C1201] Length = 164 Score = 41.2 bits (95), Expect = 0.048, Method: Compositional matrix adjust. Identities = 30/114 (26%), Positives = 52/114 (45%), Gaps = 8/114 (7%) Query: 4 LEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPLYF 63 ++ICN AL +LG + E S A C + L RS W+ + ++L Sbjct: 5 VDICNGALNQLGASTILTLTEDSKNARLCNARYTQVRDSLFRSHPWNCLIKRVELAKDTA 64 Query: 64 T--YNGELKYYIPLKCLKV---LNTSEETQLRGRCIVARGDTPRPLSLKYIGEV 112 T + ++ +P CL+V LN + ++ GR IVA T + ++Y+ + Sbjct: 65 TPSWGFSYQFTLPADCLRVLTILNYDYDYKIEGRKIVANHGT---VKIQYVSRI 115 >gi|291336444|gb|ADD95999.1| conserved hypothetical protein [uncultured organism MedDCM-OCT-S04-C1073] Length = 129 Score = 41.2 bits (95), Expect = 0.049, Method: Compositional matrix adjust. Identities = 30/114 (26%), Positives = 52/114 (45%), Gaps = 8/114 (7%) Query: 4 LEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPLYF 63 ++ICN AL +LG + E S A C + L RS W+ + ++L Sbjct: 5 VDICNGALNQLGASTILTLTEDSKNARLCNARYTQVRDSLFRSHPWNCLIKRVELAKDTA 64 Query: 64 T--YNGELKYYIPLKCLKV---LNTSEETQLRGRCIVARGDTPRPLSLKYIGEV 112 T + ++ +P CL+V LN + ++ GR IVA T + ++Y+ + Sbjct: 65 TPSWGFSYQFTLPADCLRVLTILNYDYDYKIEGRKIVANHGT---VKIQYVSRI 115 >gi|317151945|ref|YP_004119993.1| hypothetical protein Daes_0220 [Desulfovibrio aespoeensis Aspo-2] gi|316942196|gb|ADU61247.1| hypothetical protein Daes_0220 [Desulfovibrio aespoeensis Aspo-2] Length = 194 Score = 41.2 bits (95), Expect = 0.056, Method: Compositional matrix adjust. Identities = 24/80 (30%), Positives = 41/80 (51%), Gaps = 2/80 (2%) Query: 4 LEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPLYF 63 +EICN ALL LG+ + + + S A C P + +LR+ W+ AT +L Sbjct: 6 IEICNNALLDLGEDAIMSLGDESKAAGLCNHRWPAVRDAVLRAHPWNCATAQAELAAGAA 65 Query: 64 T--YNGELKYYIPLKCLKVL 81 T + E +Y +P+ L+++ Sbjct: 66 TPLWKWECRYMLPVDFLRIV 85 >gi|301019349|ref|ZP_07183535.1| conserved domain protein [Escherichia coli MS 196-1] gi|299882266|gb|EFI90477.1| conserved domain protein [Escherichia coli MS 196-1] Length = 128 Score = 40.4 bits (93), Expect = 0.094, Method: Compositional matrix adjust. Identities = 27/82 (32%), Positives = 38/82 (46%), Gaps = 8/82 (9%) Query: 4 LEICNWALLKLGQ-RPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPLY 62 +EICN AL +G R + + E S +A C L +L F W+FAT+ + L Sbjct: 5 VEICNRALSNIGNSRSINSLTEASKEAGECSLHFEACRDAVLSDFDWNFATKRVALAD-- 62 Query: 63 FTYNG----ELKYYIPLKCLKV 80 T N E Y P CL++ Sbjct: 63 -TSNPPPDWEYAYQYPSDCLRI 83 >gi|288959383|ref|YP_003449724.1| hypothetical protein AZL_025420 [Azospirillum sp. B510] gi|288911691|dbj|BAI73180.1| hypothetical protein AZL_025420 [Azospirillum sp. B510] Length = 187 Score = 39.7 bits (91), Expect = 0.17, Method: Compositional matrix adjust. Identities = 34/141 (24%), Positives = 58/141 (41%), Gaps = 8/141 (5%) Query: 4 LEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPLYF 63 +++ N AL + + E ++ CR LR++ W+ A L L Sbjct: 5 IDLFNRALGWCRAQAMVTLEEDTVSGRACRRFYATSRDATLRAYPWNSAASRAVLAALVD 64 Query: 64 T--YNGELKYYIPLKCLKVLNTSEETQ----LRGRCIVARGDTPRPLSLKYIGEVPLSDC 117 + + +Y +P CL V ++ Q + GR ++ + PL LKY + Sbjct: 65 APEFEFDHQYRLPDDCLAVRQLWDDPQADYVVEGRSLLT--NLSAPLRLKYTARIEPDGM 122 Query: 118 DPLYQEAFALKLAGELCPPLI 138 DPL A A +LA ++ P L Sbjct: 123 DPLLFNAVAARLAYDIAPGLT 143 >gi|46580125|ref|YP_010933.1| hypothetical protein DVU1715 [Desulfovibrio vulgaris str. Hildenborough] gi|46449541|gb|AAS96192.1| hypothetical protein DVU_1715 [Desulfovibrio vulgaris str. Hildenborough] gi|311233882|gb|ADP86736.1| hypothetical protein Deval_1581 [Desulfovibrio vulgaris RCH1] Length = 195 Score = 39.3 bits (90), Expect = 0.19, Method: Compositional matrix adjust. Identities = 41/161 (25%), Positives = 69/161 (42%), Gaps = 27/161 (16%) Query: 6 ICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFAT------QSMDLL 59 I N AL +GQ + + +++ A C + +LR+ W+ AT +S Sbjct: 7 IMNKALRLIGQTEIASPEQQTPSAVRCSAAWRDVVDEVLRAHPWAHATVWAALSKSDTPP 66 Query: 60 PLYFTYNGELKYYIPLKCLKVLNTSEETQLRGRCI---VARGDT---PRPLSLKYIGEVP 113 P F + Y +P C++V++ + L + + V RG+ P + +Y+ Sbjct: 67 PFGFAH----AYVLPADCVRVIDVRADGNLARQGLPFEVVRGNIYTDAAPCNTRYVAR-- 120 Query: 114 LSDCDPL-----YQEAFALKLAGELCPPLIMDD--QMRSYL 147 D DP + +A A KLA E+ PL D MR+ L Sbjct: 121 --DADPTFWPPDFYDAVAAKLAAEVALPLARDGGAMMRAML 159 >gi|290968642|ref|ZP_06560180.1| hypothetical protein HMPREF0889_0282 [Megasphaera genomosp. type_1 str. 28L] gi|290781295|gb|EFD93885.1| hypothetical protein HMPREF0889_0282 [Megasphaera genomosp. type_1 str. 28L] Length = 232 Score = 37.7 bits (86), Expect = 0.60, Method: Compositional matrix adjust. Identities = 17/52 (32%), Positives = 27/52 (51%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFA 52 +TE +CN AL LG+ ++ + A C L + +LR+F+W FA Sbjct: 25 LTETHLCNMALSYLGKGSIQELTNMNENAVTCSLFYDITRQEVLRTFNWGFA 76 >gi|302339300|ref|YP_003804506.1| hypothetical protein Spirs_2809 [Spirochaeta smaragdinae DSM 11293] gi|301636485|gb|ADK81912.1| hypothetical protein Spirs_2809 [Spirochaeta smaragdinae DSM 11293] Length = 197 Score = 37.4 bits (85), Expect = 0.81, Method: Compositional matrix adjust. Identities = 18/52 (34%), Positives = 26/52 (50%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFA 52 MT LE+ N AL ++ + L + + AD CRL L P+ + W FA Sbjct: 1 MTALELINIALTRMNESKLTSITIDTYVADLCRLFLKPVADEVTLEEDWQFA 52 >gi|30387390|ref|NP_848219.1| hypothetical protein epsilon15p11 [Enterobacteria phage epsilon15] gi|30266045|gb|AAO06074.1| 11 [Salmonella phage epsilon15] Length = 201 Score = 37.4 bits (85), Expect = 0.82, Method: Compositional matrix adjust. Identities = 39/154 (25%), Positives = 61/154 (39%), Gaps = 30/154 (19%) Query: 4 LEICNWALLKLGQ-RPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPLY 62 +EICN AL +G R + + E S +A C L L F W+FAT+ + L Sbjct: 5 IEICNRALSNIGNSRSINSLIEASKEAGQCSLHFDACRDAALADFDWNFATKRVALAD-- 62 Query: 63 FTYN----GELKYYIPLKCLKVLN------------------TSEETQLRGRCIVARGDT 100 T N + Y P C+++ L G+ I D Sbjct: 63 -TNNPPPDWQYAYQYPSDCVRITEIMPTGIRNPTAAQRIEYVVGSNEDLTGKLIYT--DQ 119 Query: 101 PRPLSLKYIGEVP-LSDCDPLYQEAFALKLAGEL 133 P+ LKY+ V ++ D ++ EA + +LA + Sbjct: 120 PKAW-LKYMARVTDVNMYDAIFMEALSWRLAAAI 152 >gi|319952748|ref|YP_004164015.1| tonb-dependent receptor [Cellulophaga algicola DSM 14237] gi|319421408|gb|ADV48517.1| TonB-dependent receptor [Cellulophaga algicola DSM 14237] Length = 809 Score = 35.8 bits (81), Expect = 2.1, Method: Composition-based stats. Identities = 26/86 (30%), Positives = 37/86 (43%), Gaps = 8/86 (9%) Query: 11 LLKLGQRPLENWN--------ERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPLY 62 L K+ +P NW+ E S + Y RL+ P LRS W + Q + L Sbjct: 330 LQKIAFKPNSNWDINWSNYYSETSDYSRYDRLIRPNSAGDGLRSAEWYYGPQKWLMSNLQ 389 Query: 63 FTYNGELKYYIPLKCLKVLNTSEETQ 88 F NG+ K+Y LK EE++ Sbjct: 390 FNKNGKGKFYDGLKVTTAYQHFEESR 415 >gi|320164153|gb|EFW41052.1| B3GNTL1 protein [Capsaspora owczarzaki ATCC 30864] Length = 331 Score = 35.8 bits (81), Expect = 2.6, Method: Compositional matrix adjust. Identities = 32/100 (32%), Positives = 43/100 (43%), Gaps = 9/100 (9%) Query: 69 LKYYIPLKCLKVLNTSEETQLRGRCIVARGDTPRPLSLKYIGEVPLSD-CDPLYQEAFAL 127 L + + + V N S Q IVA+ T RPL + + D CD L Sbjct: 11 LGFLLAVVWTSVYNASRFLQATIESIVAQ--TYRPLEICVFNDGSTDDSCDLLR------ 62 Query: 128 KLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIP 167 LA L ++ DD R+YLK E ER+ V+ D E P Sbjct: 63 ALAQRLGIAVVQDDSGRAYLKEEVERLFVVSRDQSLPETP 102 >gi|325184199|emb|CCA18660.1| hypothetical protein TRIADDRAFT_27796 [Albugo laibachii Nc14] Length = 500 Score = 34.7 bits (78), Expect = 4.7, Method: Compositional matrix adjust. Identities = 17/56 (30%), Positives = 30/56 (53%), Gaps = 4/56 (7%) Query: 21 NWNERSLKADYCRLLLPPIHKGLLRSFSW----SFATQSMDLLPLYFTYNGELKYY 72 WN LK CR+ LP + + +L W +F T+ MDL + + ++G+ K++ Sbjct: 147 TWNLNDLKTILCRIELPGVTRSMLYFGMWRAMFAFHTEDMDLYSINYLHHGKPKFW 202 >gi|325184198|emb|CCA18657.1| hypothetical protein TRIADDRAFT_27796 [Albugo laibachii Nc14] Length = 494 Score = 34.7 bits (78), Expect = 4.7, Method: Compositional matrix adjust. Identities = 17/56 (30%), Positives = 30/56 (53%), Gaps = 4/56 (7%) Query: 21 NWNERSLKADYCRLLLPPIHKGLLRSFSW----SFATQSMDLLPLYFTYNGELKYY 72 WN LK CR+ LP + + +L W +F T+ MDL + + ++G+ K++ Sbjct: 147 TWNLNDLKTILCRIELPGVTRSMLYFGMWRAMFAFHTEDMDLYSINYLHHGKPKFW 202 >gi|86570109|ref|NP_500438.3| C-type LECtin family member (clec-68) [Caenorhabditis elegans] Length = 443 Score = 34.7 bits (78), Expect = 4.9, Method: Compositional matrix adjust. Identities = 19/64 (29%), Positives = 31/64 (48%), Gaps = 3/64 (4%) Query: 111 EVPLSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPEDR 170 E P ++C P Q+ + +L P ++ D + SYL S + + A+ + PED Sbjct: 156 EPPTNNCQPGGQQTILFAYSNDLDPSVVTDTLINSYL---SSQPVTFAVSRFDLRQPEDI 212 Query: 171 GYHN 174 GY N Sbjct: 213 GYFN 216 >gi|291334271|gb|ADD93934.1| hypothetical protein [uncultured marine bacterium MedDCM-OCT-S08-C235] Length = 119 Score = 34.7 bits (78), Expect = 5.6, Method: Compositional matrix adjust. Identities = 15/57 (26%), Positives = 32/57 (56%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDL 58 T+++IC AL+ +G P+ ++++ + ++ L K +L S+ W FA++ L Sbjct: 4 TKIDICARALVMIGASPISSFSDGTTESTVASNLYLDTIKNMLSSYRWRFASKQSQL 60 >gi|227452781|gb|ABA54415.2| C-type lectin protein 68, partially confirmed by transcript evidence [Caenorhabditis elegans] Length = 451 Score = 34.7 bits (78), Expect = 5.8, Method: Compositional matrix adjust. Identities = 19/64 (29%), Positives = 31/64 (48%), Gaps = 3/64 (4%) Query: 111 EVPLSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPEDR 170 E P ++C P Q+ + +L P ++ D + SYL S + + A+ + PED Sbjct: 156 EPPTNNCQPGGQQTILFAYSNDLDPSVVTDTLINSYL---SSQPVTFAVSRFDLRQPEDI 212 Query: 171 GYHN 174 GY N Sbjct: 213 GYFN 216 >gi|322827991|gb|EFZ31951.1| hypothetical protein TCSYLVIO_1715 [Trypanosoma cruzi] Length = 621 Score = 34.3 bits (77), Expect = 7.2, Method: Compositional matrix adjust. Identities = 20/62 (32%), Positives = 27/62 (43%) Query: 80 VLNTSEETQLRGRCIVARGDTPRPLSLKYIGEVPLSDCDPLYQEAFALKLAGELCPPLIM 139 VL+ S E ++R AR D L + V +S P AF + L GE C P + Sbjct: 502 VLSNSREEEVRRGAKRARLDNDEALRISMADNVGVSKLAPYISTAFCVSLPGEFCAPPLS 561 Query: 140 DD 141 D Sbjct: 562 SD 563 >gi|71406838|ref|XP_805926.1| hypothetical protein [Trypanosoma cruzi strain CL Brener] gi|70869519|gb|EAN84075.1| hypothetical protein, conserved [Trypanosoma cruzi] Length = 627 Score = 34.3 bits (77), Expect = 7.3, Method: Compositional matrix adjust. Identities = 20/62 (32%), Positives = 27/62 (43%) Query: 80 VLNTSEETQLRGRCIVARGDTPRPLSLKYIGEVPLSDCDPLYQEAFALKLAGELCPPLIM 139 VL+ S E ++R AR D L + V +S P AF + L GE C P + Sbjct: 508 VLSNSREEEVRRGAKRARLDNDEALRMSMADNVGVSKLAPYISTAFCVSLPGEFCAPPLS 567 Query: 140 DD 141 D Sbjct: 568 SD 569 >gi|254251748|ref|ZP_04945066.1| hypothetical protein BDAG_00945 [Burkholderia dolosa AUO158] gi|124894357|gb|EAY68237.1| hypothetical protein BDAG_00945 [Burkholderia dolosa AUO158] Length = 189 Score = 33.9 bits (76), Expect = 7.9, Method: Compositional matrix adjust. Identities = 15/51 (29%), Positives = 30/51 (58%) Query: 11 LLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPL 61 +L+LG +P+ +++E + +A C L + +LR+ W+ T+ + L PL Sbjct: 1 MLQLGDKPISSFDEPTDRALVCSNLYAEVRDAMLRAHPWNSCTKRVVLAPL 51 >gi|146319023|ref|YP_001198735.1| glycosyltransferase [Streptococcus suis 05ZYH33] gi|146321231|ref|YP_001200942.1| glycosyltransferase [Streptococcus suis 98HAH33] gi|253752088|ref|YP_003025229.1| glycosyltransferase [Streptococcus suis SC84] gi|253753913|ref|YP_003027054.1| glycosyltransferase [Streptococcus suis P1/7] gi|253755212|ref|YP_003028352.1| glycosyltransferase [Streptococcus suis BM407] gi|145689829|gb|ABP90335.1| Glycosyltransferase [Streptococcus suis 05ZYH33] gi|145692037|gb|ABP92542.1| Glycosyltransferase [Streptococcus suis 98HAH33] gi|251816377|emb|CAZ52008.1| putative glycosyltransferase [Streptococcus suis SC84] gi|251817676|emb|CAZ55424.1| putative glycosyltransferase [Streptococcus suis BM407] gi|251820159|emb|CAR46500.1| putative glycosyltransferase [Streptococcus suis P1/7] gi|292558671|gb|ADE31672.1| Glycosyl transferase, group 1 [Streptococcus suis GZ1] Length = 437 Score = 33.9 bits (76), Expect = 8.6, Method: Compositional matrix adjust. Identities = 22/74 (29%), Positives = 35/74 (47%), Gaps = 12/74 (16%) Query: 95 VARGDTPRPLSLKYIGEVPLSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERV 154 +A+G RP +KY+ L D D G +CP I++D + SY S+RV Sbjct: 127 IAKGRIIRPGMVKYLVRAFLKDLD------------GVICPSEIVEDLLMSYNVPISKRV 174 Query: 155 LAVAMDMDGVEIPE 168 + +D+ + PE Sbjct: 175 IPTGIDLAKFDRPE 188 >gi|302024089|ref|ZP_07249300.1| 1,2-diacylglycerol 3-glucosyltransferase [Streptococcus suis 05HAS68] gi|330832436|ref|YP_004401261.1| glycosyltransferase [Streptococcus suis ST3] gi|329306659|gb|AEB81075.1| glycosyltransferase [Streptococcus suis ST3] Length = 437 Score = 33.9 bits (76), Expect = 8.8, Method: Compositional matrix adjust. Identities = 22/74 (29%), Positives = 35/74 (47%), Gaps = 12/74 (16%) Query: 95 VARGDTPRPLSLKYIGEVPLSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERV 154 +A+G RP +KY+ L D D G +CP I++D + SY S+RV Sbjct: 127 IAKGRIIRPGMVKYLVRAFLKDLD------------GVICPSEIVEDLLMSYNVPISKRV 174 Query: 155 LAVAMDMDGVEIPE 168 + +D+ + PE Sbjct: 175 IPTGIDLAKFDRPE 188 >gi|223932676|ref|ZP_03624675.1| glycosyl transferase group 1 [Streptococcus suis 89/1591] gi|223898646|gb|EEF65008.1| glycosyl transferase group 1 [Streptococcus suis 89/1591] Length = 437 Score = 33.9 bits (76), Expect = 8.9, Method: Compositional matrix adjust. Identities = 22/74 (29%), Positives = 35/74 (47%), Gaps = 12/74 (16%) Query: 95 VARGDTPRPLSLKYIGEVPLSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERV 154 +A+G RP +KY+ L D D G +CP I++D + SY S+RV Sbjct: 127 IAKGRIIRPGMVKYLVRAFLKDLD------------GVICPSEIVEDLLMSYNVPISKRV 174 Query: 155 LAVAMDMDGVEIPE 168 + +D+ + PE Sbjct: 175 IPTGIDLAKFDRPE 188 Searching..................................................done Results from round 2 >gi|254781209|ref|YP_003065622.1| hypothetical protein CLIBASIA_05580 [Candidatus Liberibacter asiaticus str. psy62] gi|254040886|gb|ACT57682.1| hypothetical protein CLIBASIA_05580 [Candidatus Liberibacter asiaticus str. psy62] gi|317120674|gb|ADV02497.1| hypothetical protein SC1_gp085 [Liberibacter phage SC1] gi|317120818|gb|ADV02639.1| hypothetical protein SC1_gp085 [Candidatus Liberibacter asiaticus] Length = 175 Score = 232 bits (591), Expect = 2e-59, Method: Composition-based stats. Identities = 175/175 (100%), Positives = 175/175 (100%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP Sbjct: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 Query: 61 LYFTYNGELKYYIPLKCLKVLNTSEETQLRGRCIVARGDTPRPLSLKYIGEVPLSDCDPL 120 LYFTYNGELKYYIPLKCLKVLNTSEETQLRGRCIVARGDTPRPLSLKYIGEVPLSDCDPL Sbjct: 61 LYFTYNGELKYYIPLKCLKVLNTSEETQLRGRCIVARGDTPRPLSLKYIGEVPLSDCDPL 120 Query: 121 YQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPEDRGYHNG 175 YQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPEDRGYHNG Sbjct: 121 YQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPEDRGYHNG 175 >gi|315121934|ref|YP_004062423.1| hypothetical protein CKC_00920 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495336|gb|ADR51935.1| hypothetical protein CKC_00920 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 172 Score = 215 bits (548), Expect = 2e-54, Method: Composition-based stats. Identities = 100/172 (58%), Positives = 124/172 (72%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 MTE EICNWALLKLGQRP+ +E S+KA YC+ LL PIH+ LLRSF+W FAT+S L Sbjct: 1 MTECEICNWALLKLGQRPIAKLDEGSIKAKYCKFLLSPIHRSLLRSFAWGFATKSSFLSS 60 Query: 61 LYFTYNGELKYYIPLKCLKVLNTSEETQLRGRCIVARGDTPRPLSLKYIGEVPLSDCDPL 120 L G ++Y +P KCLKVL + L G CIV + PLS+KYI EV L+DCDPL Sbjct: 61 LVSEDQGRVRYPLPKKCLKVLKIDRDGILDGDCIVVHDNKSIPLSIKYIEEVSLADCDPL 120 Query: 121 YQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPEDRGY 172 YQE +LKLA ELCP L+ D+Q+R+YLK ES+RVL A++MDG+E+ EDR Y Sbjct: 121 YQETLSLKLASELCPTLVADEQLRAYLKAESDRVLKTAIEMDGIELSEDREY 172 >gi|315122896|ref|YP_004063385.1| hypothetical protein CKC_05760 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496298|gb|ADR52897.1| hypothetical protein CKC_05760 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 172 Score = 213 bits (542), Expect = 8e-54, Method: Composition-based stats. Identities = 98/172 (56%), Positives = 118/172 (68%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 MTE EICNWALLKLGQRP+ +E S+KA YC+ LL PIH+ LLRSF+W FAT+S L Sbjct: 1 MTECEICNWALLKLGQRPIAKLDEGSIKAKYCKFLLSPIHRSLLRSFAWGFATKSSFLSS 60 Query: 61 LYFTYNGELKYYIPLKCLKVLNTSEETQLRGRCIVARGDTPRPLSLKYIGEVPLSDCDPL 120 L G ++Y +P KCLKVL + L G CIV + PLS+KYI EV L+DCDPL Sbjct: 61 LVSEDQGRVRYPLPKKCLKVLKIDRDGILDGDCIVVHDNKSIPLSIKYIEEVSLADCDPL 120 Query: 121 YQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPEDRGY 172 YQE +LKLA ELCP L D Q+ LKGESE + A+DMDG+EI E+R Y Sbjct: 121 YQETLSLKLASELCPTLTADSQLTLRLKGESEELKKSAIDMDGIEIVEEREY 172 >gi|332160973|ref|YP_004297550.1| hypothetical protein YE105_C1351 [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|325665203|gb|ADZ41847.1| hypothetical protein YE105_C1351 [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|330862129|emb|CBX72293.1| hypothetical protein YEW_AK02300 [Yersinia enterocolitica W22703] Length = 186 Score = 203 bits (517), Expect = 6e-51, Method: Composition-based stats. Identities = 44/175 (25%), Positives = 78/175 (44%), Gaps = 10/175 (5%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPL 61 +++ IC+ ALL LG P+ ++ E + A C + P + LLR W+ A + L P Sbjct: 3 SDISICSNALLMLGAHPINSFTENTDHARLCSNIYPSVRDDLLRKHPWNCAVSRVSLAPS 62 Query: 62 Y--FTYNGELKYYIPLKCLKVLNTSEET-----QLRGRCIVARGDTPRPLSLKYIGEVPL 114 + ++ IP CL++L+ E Q+ G+ I+A ++L+YI Sbjct: 63 TSLPVFGYANQFPIPSDCLRILSVGAEGYEIPYQVEGKNILANTTV---VALRYIKRTDE 119 Query: 115 SDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPED 169 + DP + +A +L + +R + E+ L A +DG E P + Sbjct: 120 ASWDPALAHVAEMAMAAKLAYAVTSSASLRDSMAQEAAYALRQAKAIDGQEEPPE 174 >gi|169795390|ref|YP_001713183.1| putative phage related protein [Acinetobacter baumannii AYE] gi|169148317|emb|CAM86182.1| conserved hypothetical protein; putative phage related protein [Acinetobacter baumannii AYE] Length = 186 Score = 203 bits (516), Expect = 9e-51, Method: Composition-based stats. Identities = 41/179 (22%), Positives = 83/179 (46%), Gaps = 8/179 (4%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 ++++ ICN AL +G + + ++ E S A CR + +LRS WS A + L P Sbjct: 3 VSKVTICNNALSMIGGQQIASFEEDSKLAQTCRNIYDTTRLSILRSHPWSCAKKRQILSP 62 Query: 61 LY--FTYNGELKYYIPLKCLKVLNT-SEETQLRGRCIVARGDTPRPLSLKYIGEVPLSD- 116 + ++ + +P + +++ +E ++ R I+A + + L+Y+ + Sbjct: 63 ISTYPSFGYAHAFPLPSDYVLIISANTERYEVENRYILADTEV---VYLEYVFDNDNEQT 119 Query: 117 CDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIP-EDRGYHN 174 D + EA K+A +LC P+ D + + + ++ A ++G E P +D Y Sbjct: 120 WDAMLVEAMTYKMASKLCKPITGSDAAGQSAEAQFQFLIKQARTVNGQERPSQDVQYAE 178 >gi|293609615|ref|ZP_06691917.1| predicted protein [Acinetobacter sp. SH024] gi|292828067|gb|EFF86430.1| predicted protein [Acinetobacter sp. SH024] Length = 186 Score = 202 bits (515), Expect = 1e-50, Method: Composition-based stats. Identities = 41/179 (22%), Positives = 83/179 (46%), Gaps = 8/179 (4%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 ++++ ICN AL +G + + ++ E S A CR + +LRS WS A + L P Sbjct: 3 VSKVTICNNALSMIGGQQIASFEEDSKLAQTCRNIYDTTRLSILRSHPWSCAKKRQILSP 62 Query: 61 LY--FTYNGELKYYIPLKCLKVLNT-SEETQLRGRCIVARGDTPRPLSLKYIGEVPLSD- 116 + ++ + +P + +++ +E ++ R I+A + + L+Y+ + Sbjct: 63 ISPYPSFGYAHAFPLPSDYVLIISANTERYEVENRYILADTEV---VYLEYVFDNDNEQT 119 Query: 117 CDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIP-EDRGYHN 174 D + EA K+A +LC P+ D + + + ++ A ++G E P +D Y Sbjct: 120 WDAMLVEAMTYKMASKLCKPITGSDAAGQSAEAQFQFLIKQARTVNGQERPSQDVQYAE 178 >gi|262043404|ref|ZP_06016529.1| gp11 family protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259039230|gb|EEW40376.1| gp11 family protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 187 Score = 202 bits (515), Expect = 1e-50, Method: Composition-based stats. Identities = 41/175 (23%), Positives = 76/175 (43%), Gaps = 11/175 (6%) Query: 3 ELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPLY 62 + IC+ ALL LG P+ +++E + A C L P + LLR+ W+ + + L P+ Sbjct: 4 SISICSNALLALGAHPINSFDENTDHARLCSNLYPTVRNKLLRAHPWNCVVKRVVLSPVS 63 Query: 63 FT--YNGELKYYIPLKCLKVLNTSE-----ETQLRGRCIVARGDTPRPLSLKYIGEV-PL 114 + ++ +P L+VL+ E ++ G ++A L L+Y+ Sbjct: 64 TAPVFGFRFQFALPGDLLRVLSVGEPCDDIPYRVEGNRLLANVQV---LKLRYVFRNEDE 120 Query: 115 SDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPED 169 S D + + + ++ + +R L ES +L A +DG E P + Sbjct: 121 STWDAALVDVAEMMMQAKMAYAVTGSTSLRDSLAQESLVLLKQAKAVDGQEDPPE 175 >gi|295096863|emb|CBK85953.1| hypothetical protein ENC_24260 [Enterobacter cloacae subsp. cloacae NCTC 9394] Length = 187 Score = 202 bits (514), Expect = 2e-50, Method: Composition-based stats. Identities = 42/176 (23%), Positives = 80/176 (45%), Gaps = 11/176 (6%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPL 61 + + IC+ ALL LG P+ +++E + A C L P + LLR+ W+ A + + L P+ Sbjct: 3 SSVSICSNALLALGAHPINDFDEDTDHARLCANLYPTVRNKLLRAHPWNCAIKRVVLSPV 62 Query: 62 YFT--YNGELKYYIPLKCLKVLNTSE-----ETQLRGRCIVARGDTPRPLSLKYIGEV-P 113 + ++ +P ++VL+ E + ++ G +++ D R L+YI Sbjct: 63 SAAPVFGYGYQFSLPGDLIRVLSVGEPRDDIDYRIEGTRLLSNVDVIR---LRYIYRNED 119 Query: 114 LSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPED 169 S D + + + +L + +R L E+ +L A +DG E P + Sbjct: 120 ESTWDAALVDVAEMMMQSKLAYAVTGSASLRDSLAQEAAFLLKQAKAVDGQEEPPE 175 >gi|85059167|ref|YP_454869.1| hypothetical protein SG1189 [Sodalis glossinidius str. 'morsitans'] gi|84779687|dbj|BAE74464.1| hypothetical protein [Sodalis glossinidius str. 'morsitans'] Length = 187 Score = 199 bits (506), Expect = 1e-49, Method: Composition-based stats. Identities = 41/177 (23%), Positives = 78/177 (44%), Gaps = 11/177 (6%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPL 61 + + IC+ ALL LG P+ +++E + A C + P + LLR+ W A + + L P+ Sbjct: 3 SSVSICSNALLALGAHPINSFDEATPHARLCANIYPSVRNNLLRAHPWHCAIKRVVLSPV 62 Query: 62 --YFTYNGELKYYIPLKCLKVLNTSEE-----TQLRGRCIVARGDTPRPLSLKYIGE-VP 113 + ++ +P ++VL+ E ++ G+ ++A R L+YI Sbjct: 63 STAPAFGFGYQFPLPADLIRVLSVGERYDDIPYRIEGKRLLANVQEAR---LRYIFRSED 119 Query: 114 LSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPEDR 170 S D + + +L + +R L E+ +L A ++G E P +R Sbjct: 120 ESTWDVALVNVAEVTMQAKLAYAVTGSASLRDSLTQEAAFLLQQAKSINGQEDPPER 176 >gi|221201504|ref|ZP_03574543.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] gi|221207940|ref|ZP_03580946.1| conserved hypothetical protein [Burkholderia multivorans CGD2] gi|221172125|gb|EEE04566.1| conserved hypothetical protein [Burkholderia multivorans CGD2] gi|221178772|gb|EEE11180.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] Length = 207 Score = 194 bits (492), Expect = 6e-48, Method: Composition-based stats. Identities = 50/181 (27%), Positives = 79/181 (43%), Gaps = 12/181 (6%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPL 61 +++ ICN AL KLG + + + +E S A + + LR WSFA L L Sbjct: 3 SQVGICNRALTKLGDKRITSLDEDSKAAAVLNSMYDDVLDACLREHVWSFAKTRAQLAAL 62 Query: 62 YFT--YNGELKYYIPLKCLKVLNTSE-----ETQLRGRCIVARG----DTPRPLSLKYIG 110 + +Y +P L++L + T RG + G D P PL ++Y Sbjct: 63 ADAPLFGFGYQYRLPADFLRLLQVGQFLIYPRTDTRGLFSIENGNILTDLPAPLYVRYTK 122 Query: 111 EVPLSD-CDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPED 169 + + DPL++E FA +LA E C L R E +R +A A+ ++ +E P Sbjct: 123 RITDPNVMDPLFREVFACRLAAEACETLTQSSTKRQAAWAEHDRAIAQAIRVNAIERPSQ 182 Query: 170 R 170 Sbjct: 183 P 183 >gi|221213948|ref|ZP_03586921.1| conserved hypothetical protein [Burkholderia multivorans CGD1] gi|221166125|gb|EED98598.1| conserved hypothetical protein [Burkholderia multivorans CGD1] Length = 207 Score = 190 bits (482), Expect = 8e-47, Method: Composition-based stats. Identities = 48/181 (26%), Positives = 80/181 (44%), Gaps = 12/181 (6%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPL 61 +++ ICN AL KLG++ + + +E S A + + LR WSFA L L Sbjct: 3 SQVGICNRALTKLGEKRITSLDEDSKPAAALNSMYDDVLDACLREHVWSFAKTRTQLAAL 62 Query: 62 YFT--YNGELKYYIPLKCLKVLNTSE-----ETQLRGRCIVARG----DTPRPLSLKYIG 110 + +Y +P ++++ + T RG + G D P PL ++Y Sbjct: 63 ADAPLFGFGYQYRLPTDFIRLIQIGQFLVYPRTDTRGLFSIENGNILTDLPAPLYVRYTK 122 Query: 111 EVPL-SDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPED 169 + + DPL++E FA +LA E C L R E +R +A A+ ++ +E P Sbjct: 123 RITDPNAMDPLFREVFACRLAAEACESLTQSSTKRQAAWAEHDRAIAQAIRVNAIERPSQ 182 Query: 170 R 170 Sbjct: 183 P 183 >gi|48697201|ref|YP_024931.1| hypothetical protein BcepC6B_gp11 [Burkholderia phage BcepC6B] gi|47779007|gb|AAT38370.1| gp11 [Burkholderia phage BcepC6B] Length = 207 Score = 182 bits (463), Expect = 1e-44, Method: Composition-based stats. Identities = 46/181 (25%), Positives = 78/181 (43%), Gaps = 12/181 (6%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPL 61 +++ ICN AL K+G + + + +E S A + + LR+ WSF L L Sbjct: 3 SQVGICNRALTKIGDKRITSLDEDSKAAATLNSMYDDVLDACLRAHVWSFTKARAQLAAL 62 Query: 62 YFT--YNGELKYYIPLKCLKVLNTSE-----ETQLRGRCIVARG----DTPRPLSLKYIG 110 + +Y +P +++L + T RG + G D PL ++Y Sbjct: 63 AEAPLFGFSYQYRLPTDFIRLLQVGQFDVYPRTDTRGLFSIENGNILTDMQAPLYIRYAK 122 Query: 111 EVPL-SDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPED 169 V + D L++EAFA +LA E C L R E ++ +A A+ ++ +E P Sbjct: 123 RVTDPNAMDALFREAFACRLAAEACESLTQSATKRQGAWAEHDQAIAAAIRVNAIERPAQ 182 Query: 170 R 170 Sbjct: 183 P 183 >gi|298485989|ref|ZP_07004063.1| predicted phage protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] gi|298159466|gb|EFI00513.1| predicted phage protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] Length = 208 Score = 181 bits (459), Expect = 3e-44, Method: Composition-based stats. Identities = 42/176 (23%), Positives = 71/176 (40%), Gaps = 11/176 (6%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPL 61 T + IC+ ALL LG +P+ ++ E +A L P LLR+ W+ + L P Sbjct: 16 TAVSICSNALLMLGAQPINDFQEAVDRAKIAANLYPSTRDDLLRNHPWNCCIRRAVLAPD 75 Query: 62 YFTYNGEL--KYYIPLKCLKVLNTSE-----ETQLRGRCIVARGDTPRPLSLKYIGEVPL 114 ++ +P CL+VL + + G+ I A L LKY+ + Sbjct: 76 AEPPAFGYDQQFELPSDCLRVLEVGSADAQIDYVVEGKRIQANTTV---LELKYVFRNEV 132 Query: 115 SD-CDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPED 169 + D E L +A ++ + ++ +K + E A +DG E P Sbjct: 133 ENTWDAALVELMTLSMAAKMAYAITQSSAVQQSVKQDLEMAKRRARAIDGQEDPPQ 188 >gi|332875219|ref|ZP_08443052.1| hypothetical protein HMPREF0022_02685 [Acinetobacter baumannii 6014059] gi|332736663|gb|EGJ67657.1| hypothetical protein HMPREF0022_02685 [Acinetobacter baumannii 6014059] Length = 172 Score = 177 bits (450), Expect = 4e-43, Method: Composition-based stats. Identities = 36/167 (21%), Positives = 74/167 (44%), Gaps = 8/167 (4%) Query: 13 KLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPLY--FTYNGELK 70 +G + + ++ E S A CR + +LRS WS A + L P+ ++ Sbjct: 1 MIGGQQIASFEEDSKLAQTCRNIYDTTRLSILRSHPWSCAKKRQILSPISTYPSFGYAHA 60 Query: 71 YYIPLKCLKVLNT-SEETQLRGRCIVARGDTPRPLSLKYIGEVPLSD-CDPLYQEAFALK 128 + +P + +++ +E ++ R I+A + + L+Y+ + D + EA K Sbjct: 61 FPLPSDYVLIISANTEHYEVENRHILANAEV---IHLEYVFDNDNEQTWDAMLVEAMTYK 117 Query: 129 LAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIP-EDRGYHN 174 +A +LC P+ D + + + ++ A ++G E P +D Y Sbjct: 118 MASKLCKPITGSDAAGQSAEAQFQFLIKQARTVNGQERPSQDVQYAE 164 >gi|78219993|gb|ABB39342.1| hypothetical protein Dde_2546 [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] Length = 320 Score = 170 bits (432), Expect = 5e-41, Method: Composition-based stats. Identities = 47/195 (24%), Positives = 77/195 (39%), Gaps = 21/195 (10%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPL 61 +E+ ICN AL +G + + ++ S A C P + LL + W+FAT+ L + Sbjct: 119 SEVSICNKALRYIGAEEIASLSQESRGARLCAQYYPEVRDELLEAHHWNFATRYAVLPQV 178 Query: 62 Y--FTYNGELKYYIPLKCLKVLNTSEE--TQLRGRCIVARGDT----------------P 101 + Y +P CL+V + L+G VA +T Sbjct: 179 AQEPPFGFLHAYRLPQDCLRVRRVLQHAAGVLQGGAAVADDETGIRTFEVVEGGILYTDA 238 Query: 102 RPLSLKYIGEVPLSDCDP-LYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMD 160 P V + C P L+ EA A KLA ++ L ++ + L+ +++ L A Sbjct: 239 SPARAVMTVRVTETACFPALFAEALARKLAADMAVALTGSTRLYAGLRDGADKALDTART 298 Query: 161 MDGVEIPEDRGYHNG 175 D E D + G Sbjct: 299 ADAGEEHADPQEYEG 313 >gi|242278912|ref|YP_002991041.1| hypothetical protein Desal_1440 [Desulfovibrio salexigens DSM 2638] gi|242121806|gb|ACS79502.1| conserved hypothetical protein [Desulfovibrio salexigens DSM 2638] Length = 197 Score = 169 bits (429), Expect = 1e-40, Method: Composition-based stats. Identities = 45/175 (25%), Positives = 74/175 (42%), Gaps = 9/175 (5%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 ++E++ICN AL KLG R +E+ ++ S +A C L + +LR W+FA + L Sbjct: 3 VSEVKICNLALRKLGARLIESLSDSSQEAVTCNLFYDQVRDAVLREHPWNFAASRIRLAK 62 Query: 61 LY--FTYNGELKYYIPLKCLKVL---NTSEETQLRGRCIVARGDTPRPLSLKYIGEVPLS 115 L + +Y +P CL + + +E + G I+ D + Y +V Sbjct: 63 LADNPAFGFVHQYQLPSDCLHLRQLNSPDDEFVVEGDKILTDRDGASAV---YTMKVTNP 119 Query: 116 D-CDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPED 169 DP + AFA +LA E+ + + + A D E ED Sbjct: 120 VLFDPSFVMAFAARLAAEMADDITGSTTKAREMWTLYLNAMQSARLADSAEGRED 174 >gi|167041088|gb|ABZ05849.1| hypothetical protein ALOHA_HF400048F7ctg1g16 [uncultured marine microorganism HF4000_48F7] Length = 198 Score = 169 bits (429), Expect = 1e-40, Method: Composition-based stats. Identities = 42/177 (23%), Positives = 76/177 (42%), Gaps = 9/177 (5%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPL 61 + ++ICN AL LG+ + + + +A C L +LRS W+ A + L Sbjct: 3 SVVQICNIALSNLGEAKIAALTDENERARQCNLRYEDCRDAVLRSHPWNAAVTRVALAAS 62 Query: 62 YFTYNGELK--YYIPLKCLKVLNTS---EETQLRGRCIVARGDTPRPLSLKYIGEVPL-S 115 + +P CL+VL+ + ++ GR + ++L YI +V + Sbjct: 63 VTAPAWGFAKKFALPADCLRVLDIEDFYQSYKVEGRFVFTD---ATAVNLLYIAKVTDPT 119 Query: 116 DCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPEDRGY 172 D L A A+KLA E+ L ++R + + ++LA A +D E + + Sbjct: 120 QFDSLLLHAVAMKLASEIAEALTGRAELRDRMLSKYLQILAEARGVDSQERSQAGEF 176 >gi|144898782|emb|CAM75646.1| conserved hypothetical protein [Magnetospirillum gryphiswaldense MSR-1] Length = 189 Score = 169 bits (429), Expect = 1e-40, Method: Composition-based stats. Identities = 38/174 (21%), Positives = 68/174 (39%), Gaps = 6/174 (3%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 ++ + +C+ ALLKLG + +++E + +++ L P + L S+ W FAT + L Sbjct: 3 LSAIALCSRALLKLGATTIASFDEGTAESEVAANLYPSVRDATLSSYPWKFATGQVTLPR 62 Query: 61 LYFTY--NGELKYYIPLKCLKVLNTSEETQLRGRCIVARGD----TPRPLSLKYIGEVPL 114 L + + Y +P L+ ++ Q G ++L YI Sbjct: 63 LVAQPVADYSMAYQLPADFLRAMSAGVAGQGYGLSYRIAESRLHCDADEVTLTYIFRPDE 122 Query: 115 SDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPE 168 P + + +LA E C PL +L SE A +D + Sbjct: 123 MAFPPFFDQVLIARLAAEFCIPLTESTSRAEFLYRLSEDEYRRAKLIDAQQDVP 176 >gi|83313368|ref|YP_423632.1| hypothetical protein amb4269 [Magnetospirillum magneticum AMB-1] gi|82948209|dbj|BAE53073.1| hypothetical protein [Magnetospirillum magneticum AMB-1] Length = 189 Score = 169 bits (428), Expect = 1e-40, Method: Composition-based stats. Identities = 40/173 (23%), Positives = 72/173 (41%), Gaps = 12/173 (6%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 ++ + +C+ AL++LG P+ +++ S +++ L PP+ G L S WSFAT + L Sbjct: 3 LSAIALCSRALIRLGAAPIAGFDDGSAESEVAANLYPPLRDGALSSHPWSFATGQVSLAR 62 Query: 61 LYFTY--NGELKYYIPLKCLKVLNTSEET-------QLRGRCIVARGDTPRPLSLKYIGE 111 L + Y +P L+VL+ ++ G ++ D + L Y+ Sbjct: 63 LAGEPLADFAYAYQLPTDFLRVLSAGPSGRGSGLVFRIAGSRLLCDADE---VVLTYVYR 119 Query: 112 VPLSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGV 164 S P + + +L E C PL +L +E A +D Sbjct: 120 PDESAFPPYFDQMLMARLTAEFCIPLTESTTRAQFLFRLAEDEFRRAKLIDAQ 172 >gi|296532339|ref|ZP_06895076.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957] gi|296267335|gb|EFH13223.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957] Length = 189 Score = 165 bits (417), Expect = 2e-39, Method: Composition-based stats. Identities = 45/172 (26%), Positives = 72/172 (41%), Gaps = 6/172 (3%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 ++ L +C+ ALLK+G +P+ + E + +A+ L P + LL + WSFAT +L Sbjct: 3 LSALVLCSRALLKIGAQPVASLEEGTAEAEVAANLYPAVRDALLSAHPWSFATGQAELPR 62 Query: 61 LYFTY--NGELKYYIPLKCLKVLNTSEETQLRGRCIVARGD----TPRPLSLKYIGEVPL 114 L + Y +P L+VL+ E + RG + + L Y+ Sbjct: 63 LLAVPHADYAFAYQLPADFLRVLSAGGERRGRGLLYRIHENRLHTDSPRVVLTYLFRPAE 122 Query: 115 SDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEI 166 S P + A +LA E C PL L +E L A +D + Sbjct: 123 SALPPFFASALVARLAAEFCVPLTESTSRAEMLHRLAEAELRHARLVDSQQD 174 >gi|212710811|ref|ZP_03318939.1| hypothetical protein PROVALCAL_01879 [Providencia alcalifaciens DSM 30120] gi|212686508|gb|EEB46036.1| hypothetical protein PROVALCAL_01879 [Providencia alcalifaciens DSM 30120] Length = 201 Score = 165 bits (417), Expect = 3e-39, Method: Composition-based stats. Identities = 46/191 (24%), Positives = 81/191 (42%), Gaps = 21/191 (10%) Query: 2 TELEICNWALLKLGQ-RPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 +E+EICN AL ++G R + + +E+S +A+ C L +L F W+FA++ + L Sbjct: 3 SEIEICNMALSRIGNSRFINSLSEKSKEAEQCNLHYGHCRDTVLSDFPWNFASKRVALAD 62 Query: 61 L-YFTYNGELKYYIPLKCLKVLNTSEETQ----------------LRGRCIVARGDTPRP 103 + + Y P CLK + + Q + G + D P+ Sbjct: 63 TNNPPPDWKFAYSYPTDCLKAIAIIQSGQKYPQPNNAIQFIVGSDVSGTGKLIYCDQPQA 122 Query: 104 LSLKYIGEVPL-SDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMD 162 L+Y+ V + D L+ +A A +LAGEL PL + + S +A A Sbjct: 123 W-LQYVASVTDVNMFDSLFIDALAWRLAGELARPLASNAGIGSEAFQMYTMAIASA-GAH 180 Query: 163 GVEIPEDRGYH 173 ++ + + Sbjct: 181 SLDESSEPNDY 191 >gi|317120717|gb|ADV02539.1| hypothetical protein SC2_gp085 [Liberibacter phage SC2] gi|317120778|gb|ADV02599.1| hypothetical protein SC2_gp085 [Candidatus Liberibacter asiaticus] Length = 172 Score = 164 bits (416), Expect = 3e-39, Method: Composition-based stats. Identities = 62/174 (35%), Positives = 93/174 (53%), Gaps = 3/174 (1%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 MT+LE+CNWAL K+G+ P+E + S A C+ L P+H LLR F WSFAT L P Sbjct: 2 MTDLEVCNWALRKIGEEPVETLEDNSSLALSCKAQLQPLHSSLLRRFDWSFATFCYYLSP 61 Query: 61 LYFTYNGELKYYIPLKCLKVLNTSEETQLRGRCIVARGDTPRPLSLKYIGEVPLSDCDPL 120 L +G+ Y +P CL++L + LRG ++ L+L +I +V + DCDPL Sbjct: 62 LEQEDDGQWIYPLPRDCLRILK-PQNPLLRG--LMLYSSMEEFLALTFIKKVKIEDCDPL 118 Query: 121 YQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPEDRGYHN 174 Y E A KLA E+C ++ +++ L + E A+ E + Y++ Sbjct: 119 YCEVLACKLALEVCVQARENEGLKNSLIRDFEVAWDNAVLSSSKERQGEPIYYD 172 >gi|209966376|ref|YP_002299291.1| hypothetical protein RC1_3114 [Rhodospirillum centenum SW] gi|209959842|gb|ACJ00479.1| conserved hypothetical protein [Rhodospirillum centenum SW] Length = 198 Score = 163 bits (412), Expect = 1e-38, Method: Composition-based stats. Identities = 42/180 (23%), Positives = 72/180 (40%), Gaps = 10/180 (5%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 +T + +C ALLK+G + ++ + + +A+ L P LL + WSFAT L Sbjct: 3 LTAIGLCARALLKIGAAGIASFADGTAEAEVAAALYAPTRDALLSAHGWSFATLHAGLAR 62 Query: 61 LYFTYNG------ELKYYIPLKCLKVLNTSEETQLRGRCIVARGDT----PRPLSLKYIG 110 L G + +P L+ L+ ++RG +G T + L Y+ Sbjct: 63 LADPPQGTAVAGPAHAFALPDGFLRALSAGRGGRMRGLEYRIQGGTLLADADAVILTYLH 122 Query: 111 EVPLSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPEDR 170 P + +A +LA E C PL + L ++ A +D + +DR Sbjct: 123 RPAEESFPPFFDQALIARLAAEFCLPLTENSSRAEMLWRLADAEFRRARLIDTQQDSQDR 182 >gi|83718530|ref|YP_441473.1| gp11 [Burkholderia thailandensis E264] gi|257139845|ref|ZP_05588107.1| gp11 [Burkholderia thailandensis E264] gi|83652355|gb|ABC36418.1| gp11 [Burkholderia thailandensis E264] Length = 200 Score = 160 bits (406), Expect = 6e-38, Method: Composition-based stats. Identities = 36/176 (20%), Positives = 71/176 (40%), Gaps = 11/176 (6%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPL 61 + + IC+ ALL+LG +P+ ++ E + +A C L P + +LR+ W+ T+ + L PL Sbjct: 3 SSISICSNALLQLGDKPIASFTEPTDRAMVCSNLYPEVRDAVLRAHPWNSCTKRVVLAPL 62 Query: 62 --YFTYNGELKYYIPLKCLKVLNTSE-----ETQLRGRCIVARGDTPRPLSLKYIGEVP- 113 ++ ++ +P LK + G+ I+ + L L YI Sbjct: 63 ADAPAFDYPYQFQLPSDWLKTIQVGSARCPLNFTAEGQRILTFV---QALPLVYIFRNEI 119 Query: 114 LSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPED 169 + + + + L P+ M E +L A ++G + + Sbjct: 120 ETTWESTLVDVVTAAMKMRLAYPITQSAAMTQTATTEFSTILKQAKAINGQDDDTE 175 >gi|227355853|ref|ZP_03840246.1| conserved hypothetical protein [Proteus mirabilis ATCC 29906] gi|227164172|gb|EEI49069.1| conserved hypothetical protein [Proteus mirabilis ATCC 29906] Length = 201 Score = 159 bits (403), Expect = 1e-37, Method: Composition-based stats. Identities = 46/191 (24%), Positives = 74/191 (38%), Gaps = 21/191 (10%) Query: 2 TELEICNWALLKLGQ-RPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 +E+EICN AL ++G R + + E S +A C L +L F W+FAT+ + L Sbjct: 3 SEIEICNIALSRIGNSRSINSMTEASKEAVQCNLHYAQCRDSVLADFPWNFATKKVALAN 62 Query: 61 L-YFTYNGELKYYIPLKCLKVLNTSEETQLR----------------GRCIVARGDTPRP 103 N Y P CLK + E Q G + D P Sbjct: 63 TNNPPPNWAYAYRYPNDCLKAIGIVEPHQKYRRPDTAIHFHVGSDENGTGRLIFTDHPSA 122 Query: 104 LSLKYIGEVPL-SDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMD 162 L+Y+ + + D L+++A A +LA EL PL + + + V+ A Sbjct: 123 W-LEYVARITDVNMFDALFKDALAWRLAAELARPLASNAGIGGEALQIYQGVIKSA-AAH 180 Query: 163 GVEIPEDRGYH 173 + + + Sbjct: 181 SLSESAEPTDY 191 >gi|48696642|ref|YP_024421.1| hypothetical protein VP2p14 [Vibrio phage VP2] gi|40950040|gb|AAR97631.1| hypothetical protein [Vibrio phage VP2] Length = 741 Score = 159 bits (402), Expect = 2e-37, Method: Composition-based stats. Identities = 40/176 (22%), Positives = 77/176 (43%), Gaps = 7/176 (3%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP- 60 + + I N AL +G +P+ + + + +A+ C + + L + WSFA + LLP Sbjct: 3 SLVSIYNKALAHVGAKPVLSETDANTRAEMCNAMYEFALRSALEAREWSFAAMRVRLLPD 62 Query: 61 --LYFTYNGELKYYIPLKCLKVLNTSEETQLRGRCIVARG--DTPRP-LSLKYIGEV-PL 114 + + Y +P C++V + + G RP + +KY V Sbjct: 63 PLITPAWGYNYGYRLPNDCIRVCEVRDGNSYNIDWAIESGFLYATRPTVDVKYNRLVEDP 122 Query: 115 SDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPEDR 170 + P + EA +LKLA ++C PL + + + L ++ +DG++ +R Sbjct: 123 NAFTPTFVEALSLKLASDICIPLTENRGLGNDLISRYLMAVSEGAALDGLQASRER 178 >gi|48696686|ref|YP_024980.1| major tail subunit [Vibrio phage VP5] gi|40806149|gb|AAR92067.1| major tail subunit [Vibrio phage VP5] Length = 743 Score = 159 bits (402), Expect = 2e-37, Method: Composition-based stats. Identities = 40/176 (22%), Positives = 77/176 (43%), Gaps = 7/176 (3%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP- 60 + + I N AL +G +P+ + + + +A+ C + + L + WSFA + LLP Sbjct: 3 SLVSIYNKALAHVGAKPVLSETDANTRAEMCNAMYEFALRSALEAREWSFAAMRVRLLPD 62 Query: 61 --LYFTYNGELKYYIPLKCLKVLNTSEETQLRGRCIVARG--DTPRP-LSLKYIGEV-PL 114 + + Y +P C++V + + G RP + +KY V Sbjct: 63 PLITPAWGYNYGYRLPNDCIRVCEVRDGNSYNIDWAIESGFLYATRPTVDVKYNRLVEDP 122 Query: 115 SDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPEDR 170 + P + EA +LKLA ++C PL + + + L ++ +DG++ +R Sbjct: 123 NAFTPTFVEALSLKLASDICIPLTENRGLGNDLISRYLMAVSEGAALDGLQASRER 178 >gi|304398396|ref|ZP_07380270.1| conserved hypothetical protein [Pantoea sp. aB] gi|304354262|gb|EFM18635.1| conserved hypothetical protein [Pantoea sp. aB] Length = 201 Score = 158 bits (401), Expect = 2e-37, Method: Composition-based stats. Identities = 41/193 (21%), Positives = 77/193 (39%), Gaps = 26/193 (13%) Query: 2 TELEICNWALLKLGQ-RPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 + +EICN AL ++G R + +++E+S +A C L + +L F W+FAT+ L Sbjct: 3 SIIEICNIALSRIGNSRTINSFSEQSKEAGLCSLHFDTAREEVLADFDWNFATKRAALAD 62 Query: 61 LYFTY-NGELKYYIPLKCLKVL------------------NTSEETQLRGRCIVARGDTP 101 + Y P C+++ + GR ++ Sbjct: 63 TGLPPSDWRFAYRYPTDCMRITGIMLPGMRNPPERLRIQYEVGSDPDGEGRLVLCDEPD- 121 Query: 102 RPLSLKYIGEVPL-SDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVA-- 158 + +KY+ + + DPL++ A A KLA E+ PL + +++ A Sbjct: 122 --VWIKYVANITNANMFDPLFRSALAWKLASEIAMPLATAPNLVQNALTMYSQIIRSAGS 179 Query: 159 MDMDGVEIPEDRG 171 M+ + P + Sbjct: 180 HSMNESQEPVEPE 192 >gi|225158771|ref|ZP_03725088.1| GP11 [Opitutaceae bacterium TAV2] gi|224802606|gb|EEG20861.1| GP11 [Opitutaceae bacterium TAV2] Length = 203 Score = 158 bits (400), Expect = 2e-37, Method: Composition-based stats. Identities = 40/187 (21%), Positives = 78/187 (41%), Gaps = 18/187 (9%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 MTE++ICN AL + + + + ++ + A C+L + +LR W A + + L P Sbjct: 1 MTEVDICNQALGLVSENGIASLSDATKSARLCQLHFAATVREVLRDGQWRCARRRVALAP 60 Query: 61 LY--FTYNGELKYYIPLKCLKVLNTSEE---------TQLRGRCIVARGDTPRPLSLKYI 109 L + + +P L+ + + ++ GR ++ T R + + + Sbjct: 61 LAQPPAFEWSYGFQLPGDFLRTIRLNGHEVGSPDMPLFEIEGRTLLMNEGTARIVYTRDV 120 Query: 110 GE-------VPLSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMD 162 + ++D DPL A LA +L P +R L E + V+A A + + Sbjct: 121 TQQIDDPGAAHIADLDPLCARAIVTALASKLAWPFQQSRTLRESLLSEYDGVIARARNAN 180 Query: 163 GVEIPED 169 + E+ Sbjct: 181 SRDAFEN 187 >gi|187736541|ref|YP_001878653.1| hypothetical protein Amuc_2062 [Akkermansia muciniphila ATCC BAA-835] gi|187426593|gb|ACD05872.1| hypothetical protein Amuc_2062 [Akkermansia muciniphila ATCC BAA-835] Length = 206 Score = 158 bits (399), Expect = 3e-37, Method: Composition-based stats. Identities = 50/175 (28%), Positives = 83/175 (47%), Gaps = 10/175 (5%) Query: 2 TELEICNWALLKLGQRPLENW-NERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 + L+ICN AL K+G+ PL+ S + C L P + L W+FA L Sbjct: 13 SALDICNAALSKIGEAPLDALIANESTASRLCVLHYHPARRETLCMARWTFAATQTTLDS 72 Query: 61 LYFTYNGE---LKYYIPLKCLKVLNTS-EETQLRGRCIVARGDTPRPLSLKYIGEVPLSD 116 + ++ +P CL+VL+ E +++GR I A PL L YI ++ +D Sbjct: 73 VSAQAPNSLTPYQFTLPADCLRVLDVECSEWKMQGRRIHASC---APLPLSYIADIENTD 129 Query: 117 -CDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVL-AVAMDMDGVEIPED 169 DPL+ +A A +LA +L PL + +R L E +++ A ++ V+ + Sbjct: 130 QFDPLFMDALATRLAEKLAMPLTGNQSLRQNLNQEFHKIILPQAATVNAVQCFSN 184 >gi|268589381|ref|ZP_06123602.1| conserved hypothetical protein [Providencia rettgeri DSM 1131] gi|291315408|gb|EFE55861.1| conserved hypothetical protein [Providencia rettgeri DSM 1131] Length = 201 Score = 157 bits (398), Expect = 5e-37, Method: Composition-based stats. Identities = 46/191 (24%), Positives = 78/191 (40%), Gaps = 21/191 (10%) Query: 2 TELEICNWALLKLGQ-RPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 +E+EICN AL ++G R + + +E+S +A+ C L L F W+FA++ + L Sbjct: 3 SEIEICNLALSRIGNSRFINSLSEKSKEAEQCNLHFNHCRDTTLSDFPWNFASKRVALAD 62 Query: 61 L-YFTYNGELKYYIPLKCLKVLNTSEETQLR----------------GRCIVARGDTPRP 103 + + Y P CLK + + Q G + D P+ Sbjct: 63 TNNPPPDWKFAYSYPTDCLKAIAIIQSGQKYPQPHNAINFLVGSDADGTGKLIYCDQPQA 122 Query: 104 LSLKYIGEVPL-SDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMD 162 L+Y V + D L+ +A A +LAGEL PL + + S +A A Sbjct: 123 W-LQYTASVTDVNMFDSLFIDALAWRLAGELARPLASNAGIGSEAFQMYTMSIANA-GAH 180 Query: 163 GVEIPEDRGYH 173 ++ + + Sbjct: 181 SLDESSEPNDY 191 >gi|309702805|emb|CBJ02136.1| hypothetical phage protein [Escherichia coli ETEC H10407] Length = 201 Score = 156 bits (395), Expect = 9e-37, Method: Composition-based stats. Identities = 42/191 (21%), Positives = 70/191 (36%), Gaps = 25/191 (13%) Query: 2 TELEICNWALLKLGQ-RPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 + +EICN AL +G R + + E S +A C L +L F W+FAT+ + L Sbjct: 3 SVVEICNRALSNIGNSRSINSLTEASKEAGECSLHFEACRDAVLSDFDWNFATKRVALAD 62 Query: 61 LY-FTYNGELKYYIPLKCLKVL------------------NTSEETQLRGRCIVARGDTP 101 + E Y P CL++ +T G+ I Sbjct: 63 TSNPPPDWEYAYQYPSDCLRITEIMLPGVRNPTAAMRVQYEVGADTNGTGKLIYTDQPQA 122 Query: 102 RPLSLKYIGEVPL-SDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMD 160 LKY+ V + D ++ EA A +LA + L + + ++ RV+ A Sbjct: 123 ---WLKYVSRVTDVNMFDAIFMEALAWRLAAAINMALTGNTDLGTFALNMYNRVILSA-G 178 Query: 161 MDGVEIPEDRG 171 ++ Sbjct: 179 SHSQNESQEPQ 189 >gi|300898434|ref|ZP_07116775.1| conserved hypothetical protein [Escherichia coli MS 198-1] gi|331648169|ref|ZP_08349259.1| conserved hypothetical protein [Escherichia coli M605] gi|300357901|gb|EFJ73771.1| conserved hypothetical protein [Escherichia coli MS 198-1] gi|331043029|gb|EGI15169.1| conserved hypothetical protein [Escherichia coli M605] Length = 201 Score = 156 bits (394), Expect = 1e-36, Method: Composition-based stats. Identities = 42/189 (22%), Positives = 70/189 (37%), Gaps = 21/189 (11%) Query: 2 TELEICNWALLKLGQ-RPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 + +EICN AL +G R + + E S +A C L +L F W+FAT+ + L Sbjct: 3 SVVEICNRALSNIGNSRSINSLTEASKEAGECSLHFEACRDAVLSDFDWNFATKRVALAD 62 Query: 61 LY-FTYNGELKYYIPLKCLKVLNTSEET----------------QLRGRCIVARGDTPRP 103 + E Y P CL++ G + D P+ Sbjct: 63 TSNPPPDWEYAYQYPSDCLRITEIMLPGVRNPTAAMRVQYEVGVDTNGTGKLIYTDQPQA 122 Query: 104 LSLKYIGEVPL-SDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMD 162 LKY+ V + D ++ EA A +LA + L + + ++ RV+ A Sbjct: 123 W-LKYVSRVTDVNMFDAIFMEALAWRLAAAINMALTGNADLGTFALNMYNRVILSA-GSH 180 Query: 163 GVEIPEDRG 171 ++ Sbjct: 181 SQNESQEPQ 189 >gi|298381711|ref|ZP_06991310.1| hypothetical protein ECFG_01448 [Escherichia coli FVEC1302] gi|298279153|gb|EFI20667.1| hypothetical protein ECFG_01448 [Escherichia coli FVEC1302] Length = 201 Score = 156 bits (394), Expect = 1e-36, Method: Composition-based stats. Identities = 41/191 (21%), Positives = 70/191 (36%), Gaps = 25/191 (13%) Query: 2 TELEICNWALLKLGQ-RPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 + +EICN AL +G R + + E S +A C L +L F W+FAT+ + L Sbjct: 3 SVVEICNRALSNIGNSRSINSLTEASKEAGECSLHFDACRDAVLSDFDWNFATKRVALAD 62 Query: 61 LY-FTYNGELKYYIPLKCLKVL------------------NTSEETQLRGRCIVARGDTP 101 + Y P CL++ +T G+ I Sbjct: 63 TSNPPPDWAYAYQYPSDCLRITEIMLPGVRNPTAAMRVQYEVGADTDGTGKLIYTDQPQA 122 Query: 102 RPLSLKYIGEV-PLSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMD 160 LKY+ V ++ D ++ EA A +LA + L + + ++ RV+ A Sbjct: 123 ---WLKYVSRVSDVNMFDAIFMEALAWRLAAAINMALTGNADLGTFALNMYNRVILSA-G 178 Query: 161 MDGVEIPEDRG 171 ++ Sbjct: 179 SHSQNESQEPQ 189 >gi|320175039|gb|EFW50152.1| 11 [Shigella dysenteriae CDC 74-1112] Length = 201 Score = 155 bits (393), Expect = 2e-36, Method: Composition-based stats. Identities = 42/191 (21%), Positives = 70/191 (36%), Gaps = 25/191 (13%) Query: 2 TELEICNWALLKLGQ-RPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 + +EICN AL +G R + + E S +A C L +L F W+FAT+ + L Sbjct: 3 SVVEICNRALSNIGNSRSINSLTEASKEAGECSLHFEACRDAVLSDFDWNFATKRVSLAD 62 Query: 61 LY-FTYNGELKYYIPLKCLKVL------------------NTSEETQLRGRCIVARGDTP 101 + E Y P CL++ +T G+ I Sbjct: 63 TSNPPPDWEYAYQYPSDCLRITEIMLPGVRNPTAAMRVQYEVGADTNGTGKLIYTDQPQA 122 Query: 102 RPLSLKYIGEVPL-SDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMD 160 LKY+ V + D ++ EA A +LA + L + + ++ RV+ A Sbjct: 123 ---WLKYVSRVTDVNMFDAIFMEALAWRLAAAINMALTGNADLGTFALNMYNRVILSA-G 178 Query: 161 MDGVEIPEDRG 171 ++ Sbjct: 179 SHSQNESQEPQ 189 >gi|323948678|gb|EGB44583.1| hypothetical protein ERKG_04901 [Escherichia coli H252] gi|327252177|gb|EGE63849.1| hypothetical protein ECSTEC7V_3024 [Escherichia coli STEC_7v] Length = 201 Score = 155 bits (393), Expect = 2e-36, Method: Composition-based stats. Identities = 42/191 (21%), Positives = 71/191 (37%), Gaps = 25/191 (13%) Query: 2 TELEICNWALLKLGQ-RPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 + +EICN AL +G R + + E S +A C L +L F W+FAT+ + L Sbjct: 3 SVVEICNRALSNIGNSRSINSLTEASKEAGECSLHFEACRDAVLSDFDWNFATKRVALAD 62 Query: 61 LY-FTYNGELKYYIPLKCLKVL------------------NTSEETQLRGRCIVARGDTP 101 + E Y P CL++ +T G+ I Sbjct: 63 TSNPPPDWEYAYQYPSDCLRITEIMLPGVRNPTAAMRVQYEVGADTDGTGKLIYTDQPQA 122 Query: 102 RPLSLKYIGEV-PLSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMD 160 LKY+ V ++ D ++ EA A +LA + L + + ++ RV+ A Sbjct: 123 ---WLKYVSRVSDVNMFDAIFMEALAWRLAAAINMALTGNADLGTFALNMYNRVILSA-G 178 Query: 161 MDGVEIPEDRG 171 ++ Sbjct: 179 SHSQNESQEPQ 189 >gi|89152435|ref|YP_512268.1| hypothetical protein PhiV10p14 [Escherichia phage phiV10] gi|74055458|gb|AAZ95907.1| hypothetical protein PhiV10p14 [Escherichia phage phiV10] gi|294493853|gb|ADE92609.1| conserved hypothetical protein [Escherichia coli IHE3034] gi|323156126|gb|EFZ42285.1| hypothetical protein ECEPECA14_1901 [Escherichia coli EPECa14] Length = 201 Score = 155 bits (393), Expect = 2e-36, Method: Composition-based stats. Identities = 42/191 (21%), Positives = 70/191 (36%), Gaps = 25/191 (13%) Query: 2 TELEICNWALLKLGQ-RPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 + +EICN AL +G R + + E S +A C L +L F W+FAT+ + L Sbjct: 3 SVVEICNRALSNIGNSRSINSLTEASKEAGECSLHFEACRDAVLSDFDWNFATKRVALAD 62 Query: 61 LY-FTYNGELKYYIPLKCLKVL------------------NTSEETQLRGRCIVARGDTP 101 + E Y P CL++ +T G+ I Sbjct: 63 TSNPPPDWEYAYQYPSDCLRITEIMLPGVRNPTAAMRVQYEVGADTNGTGKLIYTDQPQA 122 Query: 102 RPLSLKYIGEVPL-SDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMD 160 LKY+ V + D ++ EA A +LA + L + + ++ RV+ A Sbjct: 123 ---WLKYVSRVTDVNMFDAIFMEALAWRLAAAINMALTGNADLGTFALNMYNRVILSA-G 178 Query: 161 MDGVEIPEDRG 171 ++ Sbjct: 179 SHSQNESQEPQ 189 >gi|117624705|ref|YP_853618.1| hypothetical protein APECO1_4048 [Escherichia coli APEC O1] gi|301046401|ref|ZP_07193561.1| conserved hypothetical protein [Escherichia coli MS 185-1] gi|115513829|gb|ABJ01904.1| conserved hypothetical protein [Escherichia coli APEC O1] gi|300301627|gb|EFJ58012.1| conserved hypothetical protein [Escherichia coli MS 185-1] gi|324008553|gb|EGB77772.1| hypothetical protein HMPREF9532_01740 [Escherichia coli MS 57-2] gi|332344347|gb|AEE57681.1| conserved hypothetical protein [Escherichia coli UMNK88] Length = 201 Score = 155 bits (392), Expect = 2e-36, Method: Composition-based stats. Identities = 42/191 (21%), Positives = 71/191 (37%), Gaps = 25/191 (13%) Query: 2 TELEICNWALLKLGQ-RPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 + +EICN AL +G R + + E S +A C L +L F W+FAT+ + L Sbjct: 3 SVVEICNRALSNIGNSRSINSLTEASKEAGECSLHFEACRDAVLSDFDWNFATKRVALAD 62 Query: 61 LY-FTYNGELKYYIPLKCLKVL------------------NTSEETQLRGRCIVARGDTP 101 + E Y P CL++ +T G+ I Sbjct: 63 TSNPPPDWEYAYQYPSDCLRITEIMLPGVRNPTAAMRVQYEVGADTNGTGKLIYTDQPQA 122 Query: 102 RPLSLKYIGEV-PLSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMD 160 LKY+ V ++ D ++ EA A +LA + L + + ++ RV+ A Sbjct: 123 ---WLKYVSRVSDVNMFDAIFMEALAWRLAAAINMALTGNADLGTFALNMYNRVILSA-G 178 Query: 161 MDGVEIPEDRG 171 ++ Sbjct: 179 SHSQNESQEPQ 189 >gi|262043558|ref|ZP_06016671.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259039092|gb|EEW40250.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 201 Score = 154 bits (390), Expect = 3e-36, Method: Composition-based stats. Identities = 43/193 (22%), Positives = 76/193 (39%), Gaps = 26/193 (13%) Query: 2 TELEICNWALLKLGQ-RPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 + +EICN AL +G R + + E S +A C L I +L F W+FAT+++ L Sbjct: 3 SVIEICNRALSNIGNNRSINSLEEASKEAGQCSLYYESIRDAVLADFDWNFATKNIALAD 62 Query: 61 LYFTY-NGELKYYIPLKCLKVLN------------------TSEETQLRGRCIVARGDTP 101 + Y P CLK++ ++ GR I Sbjct: 63 TNNPPQDWAFAYTYPTDCLKIIEIPVPGVRYPTAAMRVQYVVGADSAGTGRLIYTDLPQA 122 Query: 102 RPLSLKYIGEVPL-SDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVA-- 158 L+Y+ + + DP++QEA + +LA + L + + + R++ A Sbjct: 123 ---WLRYVARITDVNMFDPIFQEALSWRLAAAINMVLTGNADLGNNALSMYSRIILSAGS 179 Query: 159 MDMDGVEIPEDRG 171 M+ + P+ Sbjct: 180 HSMNESQEPQMPD 192 >gi|323699390|ref|ZP_08111302.1| hypothetical protein DND132_1981 [Desulfovibrio sp. ND132] gi|323459322|gb|EGB15187.1| hypothetical protein DND132_1981 [Desulfovibrio desulfuricans ND132] Length = 193 Score = 154 bits (389), Expect = 5e-36, Method: Composition-based stats. Identities = 39/178 (21%), Positives = 71/178 (39%), Gaps = 12/178 (6%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPL 61 + +EICN ALL LG+ + + + S A C P + +LR+ W+ A +L Sbjct: 4 SVIEICNNALLDLGEDAIMSLTDDSKAAGLCNHRWPAVRDAVLRAHPWNCAMAQAELAAG 63 Query: 62 YFTYNGELKYY--IPLKCLKVLNTS-------EETQLRGRCIVARGDTPRPLSLKYIGEV 112 + Y +P L+++ + E+ +++ I+ + P+ + Y+ Sbjct: 64 TTAPLWKWAYKFVLPTDFLRIIAITGVDGEDVEDWEIQSGIILCDEE--APIFIAYVRRE 121 Query: 113 PLS-DCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPED 169 D L EA + +LA L PL + + LA A +D E + Sbjct: 122 TDPKKYDALLSEALSARLAATLAYPLSGSTALAQSCWKTYQEKLAEARGVDAREGVPE 179 >gi|218700983|ref|YP_002408612.1| hypothetical protein ECIAI39_2673 [Escherichia coli IAI39] gi|218370969|emb|CAR18796.1| conserved hypothetical protein from phage origin [Escherichia coli IAI39] Length = 201 Score = 153 bits (388), Expect = 6e-36, Method: Composition-based stats. Identities = 42/190 (22%), Positives = 71/190 (37%), Gaps = 25/190 (13%) Query: 2 TELEICNWALLKLGQ-RPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 + +EICN AL +G R + + E S +A C L +L F W+FAT+ + L Sbjct: 3 SVVEICNRALSNIGNSRSINSLTEASKEAGECSLHFEACRDAVLSDFDWNFATKRVALAD 62 Query: 61 LY-FTYNGELKYYIPLKCLKVL------------------NTSEETQLRGRCIVARGDTP 101 + E Y P CL++ +T G+ I Sbjct: 63 TSNPPPDWEYAYQYPSDCLRITEIMLPGVRNPTAAMRVQYEVGADTNGTGKLIYTDQPQA 122 Query: 102 RPLSLKYIGEV-PLSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMD 160 LKY+ V ++ D ++ EA A +LA + L + + ++ RV+ A Sbjct: 123 ---WLKYVSRVSDVNMFDAIFMEALAWRLAAAINMALTGNADLGTFALNMYNRVILSA-G 178 Query: 161 MDGVEIPEDR 170 ++ Sbjct: 179 SHSQNESQEP 188 >gi|215487814|ref|YP_002330245.1| hypothetical protein E2348C_2747 [Escherichia coli O127:H6 str. E2348/69] gi|215265886|emb|CAS10295.1| predicted protein [Escherichia coli O127:H6 str. E2348/69] Length = 201 Score = 151 bits (382), Expect = 3e-35, Method: Composition-based stats. Identities = 40/191 (20%), Positives = 68/191 (35%), Gaps = 25/191 (13%) Query: 2 TELEICNWALLKLGQ-RPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 + +EICN AL +G R + + NE S +A C L L F W+FAT+ + L Sbjct: 3 SVIEICNRALSNIGNSRSINSLNEASKEAGQCSLHFDACRDAALADFDWNFATKRVALAD 62 Query: 61 L-YFTYNGELKYYIPLKCLKVLN------------------TSEETQLRGRCIVARGDTP 101 + + Y P C+++ +L G+ I Sbjct: 63 TNNPPPDWQYAYQYPSDCVRITEIMPTGIRNPTAAQRIEYVVGSNEELTGKLIYTDQPKA 122 Query: 102 RPLSLKYIGEVPL-SDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMD 160 LKY+ V + D ++ EA + +LA + L + + RV+ A Sbjct: 123 ---WLKYVARVTDVNMYDAIFMEALSWRLAAAINMALTGSADLGNNALTMYNRVILSA-G 178 Query: 161 MDGVEIPEDRG 171 ++ Sbjct: 179 SHSQNESQEPQ 189 >gi|330007162|ref|ZP_08305904.1| hypothetical protein HMPREF9538_03593 [Klebsiella sp. MS 92-3] gi|328535509|gb|EGF61969.1| hypothetical protein HMPREF9538_03593 [Klebsiella sp. MS 92-3] Length = 203 Score = 150 bits (378), Expect = 8e-35, Method: Composition-based stats. Identities = 40/192 (20%), Positives = 68/192 (35%), Gaps = 24/192 (12%) Query: 2 TELEICNWALLKLGQ-RPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 + + ICN AL ++G R + + E++ +A C L I +L W+FA + L Sbjct: 3 SVINICNIALARIGNSRTINSLTEKTKEAYTCNLFYESIRDAVLADNDWNFAMSRVVLAD 62 Query: 61 LY-FTYNGELKYYIPLKCLK--------------------VLNTSEETQLRGRCIVARGD 99 L +Y P C + V GR I Sbjct: 63 LSDPAPGWLFRYQYPTDCARIAAILPKWFTGSHIVLQDKPVFEVGSNEDGTGRVIHTNES 122 Query: 100 TPRPLSLKYIGEVPLSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAM 159 L +K I + D L+ +A + ++A E+ P+ + + ++VL AM Sbjct: 123 QAVLLYVKRITDPT--MFDALFADALSWRMAAEIAMPITANASLGQQAMANYQQVLTAAM 180 Query: 160 DMDGVEIPEDRG 171 E E + Sbjct: 181 QRSLDEAHEPQQ 192 >gi|30387390|ref|NP_848219.1| hypothetical protein epsilon15p11 [Enterobacteria phage epsilon15] gi|30266045|gb|AAO06074.1| 11 [Salmonella phage epsilon15] Length = 201 Score = 148 bits (374), Expect = 3e-34, Method: Composition-based stats. Identities = 39/191 (20%), Positives = 66/191 (34%), Gaps = 25/191 (13%) Query: 2 TELEICNWALLKLGQ-RPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 + +EICN AL +G R + + E S +A C L L F W+FAT+ + L Sbjct: 3 SVIEICNRALSNIGNSRSINSLIEASKEAGQCSLHFDACRDAALADFDWNFATKRVALAD 62 Query: 61 L-YFTYNGELKYYIPLKCLKVLN------------------TSEETQLRGRCIVARGDTP 101 + + Y P C+++ L G+ I Sbjct: 63 TNNPPPDWQYAYQYPSDCVRITEIMPTGIRNPTAAQRIEYVVGSNEDLTGKLIYTDQPKA 122 Query: 102 RPLSLKYIGEVPL-SDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMD 160 LKY+ V + D ++ EA + +LA + L + + RV+ A Sbjct: 123 ---WLKYMARVTDVNMYDAIFMEALSWRLAAAINMALTGSADLGNNALTMYNRVILSA-G 178 Query: 161 MDGVEIPEDRG 171 ++ Sbjct: 179 SHSQNESQEPQ 189 >gi|46581002|ref|YP_011810.1| hypothetical protein DVU2598 [Desulfovibrio vulgaris str. Hildenborough] gi|46450423|gb|AAS97070.1| hypothetical protein DVU_2598 [Desulfovibrio vulgaris str. Hildenborough] gi|311234687|gb|ADP87541.1| hypothetical protein Deval_2398 [Desulfovibrio vulgaris RCH1] Length = 184 Score = 145 bits (367), Expect = 2e-33, Method: Composition-based stats. Identities = 42/177 (23%), Positives = 69/177 (38%), Gaps = 5/177 (2%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPL 61 +++ ICN AL LG + + ++ S +AD C LL S W+FAT+ L P+ Sbjct: 3 SDVAICNRALRYLGAPEILSLDQHSREADLCARHYVETRDELLESHHWNFATRYARLAPM 62 Query: 62 YFTYNGELK--YYIPLKCLKV--LNTSEETQLRGRCIVARGDTPRPLSLKYIGEVPLSDC 117 Y +P CL+V L + ++ ++ TP L + Sbjct: 63 ESAPPFGYARSYRLPGDCLRVRHLRAGDPFEVVEDRVLYTDATPAEAVLT-VRVTDPGRF 121 Query: 118 DPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPEDRGYHN 174 L+ E A +LA L PL+ + ++ + +A D E D N Sbjct: 122 PALFTEMLARRLASSLAVPLMGNARLEQTMLQRFAAAFDLARMADAAEGAADPDEGN 178 >gi|120601702|ref|YP_966102.1| hypothetical protein Dvul_0652 [Desulfovibrio vulgaris DP4] gi|120561931|gb|ABM27675.1| conserved hypothetical protein [Desulfovibrio vulgaris DP4] Length = 184 Score = 145 bits (366), Expect = 2e-33, Method: Composition-based stats. Identities = 42/177 (23%), Positives = 69/177 (38%), Gaps = 5/177 (2%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPL 61 +++ ICN AL LG + + ++ S +AD C LL S W+FAT+ L P+ Sbjct: 3 SDVAICNRALRYLGAPEILSLDQHSREADLCARHYVETRDELLESHHWNFATRYARLAPM 62 Query: 62 YFTYNGELK--YYIPLKCLKV--LNTSEETQLRGRCIVARGDTPRPLSLKYIGEVPLSDC 117 Y +P CL+V L + ++ ++ TP L + Sbjct: 63 ESAPPFGYARSYRLPGDCLRVRNLRAGDPFEVVEDRVLYTDATPAEAVLT-VRVTDPGRF 121 Query: 118 DPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPEDRGYHN 174 L+ E A +LA L PL+ + ++ + +A D E D N Sbjct: 122 PALFTEMLARRLASSLAVPLMGNARLEQTMLQRFAAAFDLARMADAAEGAADPDEGN 178 >gi|288959324|ref|YP_003449665.1| hypothetical protein AZL_024830 [Azospirillum sp. B510] gi|288911632|dbj|BAI73121.1| hypothetical protein AZL_024830 [Azospirillum sp. B510] Length = 189 Score = 144 bits (363), Expect = 6e-33, Method: Composition-based stats. Identities = 38/177 (21%), Positives = 70/177 (39%), Gaps = 12/177 (6%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 +T + +C AL+KLG + ++++ S +A+ L P LL + +WSFA+ L Sbjct: 3 LTAIGLCGRALIKLGATAIASFDDGSAEAEVAAALYGPTRDALLSASAWSFASVQATLPR 62 Query: 61 LYFTY--NGELKYYIPLKCLK-------VLNTSEETQLRGRCIVARGDTPRPLSLKYIGE 111 L + + +P L+ ++ GR ++ ++L YIG Sbjct: 63 LAEPPVADYAFAFQLPADLLRALGAGGNGRGRGLPYRIDGRRLLCDAG---AVTLSYIGR 119 Query: 112 VPLSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPE 168 + +A +LA E C PL +L +E A +D + + Sbjct: 120 PAEEAFPAFFDQALIARLAAEFCLPLTESSTRAEFLAQLAESEFRRARQIDAQQDSQ 176 >gi|291336929|gb|ADD96457.1| hypothetical protein ALOHA_HF400048F7ctg1g16 [uncultured organism MedDCM-OCT-S09-C787] Length = 193 Score = 140 bits (354), Expect = 5e-32, Method: Composition-based stats. Identities = 38/170 (22%), Positives = 73/170 (42%), Gaps = 9/170 (5%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP- 60 + ++ICN AL +LG + E S A C + L RS W+ + ++L Sbjct: 3 SIVDICNGALNQLGASTILTLTEDSKNARLCNARYTQVRDSLFRSHPWNCLIKRVELAKD 62 Query: 61 -LYFTYNGELKYYIPLKCLKVLNT---SEETQLRGRCIVARGDTPRPLSLKYIGEVPL-S 115 ++ ++ +P CL+VL + ++ GR IVA T + ++Y+ + + Sbjct: 63 TATPSWGFSYQFTLPADCLRVLTILNYDYDYKIEGRKIVANHGT---VKIQYVSRIDDPN 119 Query: 116 DCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVE 165 D L +E + LA ++ + + + S + + L A +D E Sbjct: 120 QYDELLRETISAALAADIAYAVTSSNPVASNMYNLFQDKLKEARFVDATE 169 >gi|254251748|ref|ZP_04945066.1| hypothetical protein BDAG_00945 [Burkholderia dolosa AUO158] gi|124894357|gb|EAY68237.1| hypothetical protein BDAG_00945 [Burkholderia dolosa AUO158] Length = 189 Score = 137 bits (344), Expect = 9e-31, Method: Composition-based stats. Identities = 28/167 (16%), Positives = 64/167 (38%), Gaps = 11/167 (6%) Query: 11 LLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPL--YFTYNGE 68 +L+LG +P+ +++E + +A C L + +LR+ W+ T+ + L PL ++ Sbjct: 1 MLQLGDKPISSFDEPTDRALVCSNLYAEVRDAMLRAHPWNSCTKRVVLAPLAEAPPFDYP 60 Query: 69 LKYYIPLKCLKVLNTSE-----ETQLRGRCIVARGDTPRPLSLKYIGEVPLSD-CDPLYQ 122 ++ +P L+ + + G I+A + + YI + Sbjct: 61 YQFQLPADWLRTIQVGSRKCPLDYAAEGNRILAFVNAMPFV---YIFRNENEATWESTLV 117 Query: 123 EAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPED 169 + + + P+ M + E L A ++G + + Sbjct: 118 DVVTKAIKAAIAYPITQSAAMAQTAQTEFAAFLKQAKAINGQDDDTE 164 >gi|294648404|ref|ZP_06725903.1| phage protein [Acinetobacter haemolyticus ATCC 19194] gi|292825709|gb|EFF84413.1| phage protein [Acinetobacter haemolyticus ATCC 19194] Length = 185 Score = 133 bits (334), Expect = 1e-29, Method: Composition-based stats. Identities = 33/177 (18%), Positives = 73/177 (41%), Gaps = 7/177 (3%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPL 61 T+ +I N +LL +G + ++ E ++ A R + K LLR + AT+ + L P+ Sbjct: 3 TKTDIANQSLLMIGADSITSFEENTITARRMRTVYDSSRKALLRLHPFQCATKRIKLNPI 62 Query: 62 Y--FTYNGELKYYIPLKCLKVLNT-SEETQLRGRCIVARGDTPRPLSLKYIGEVP-LSDC 117 + ++ +P ++++ +E+ + +++ D L+L Y+ + Sbjct: 63 STQPEFGYLYQFQLPDDLIRIITANTEDYVVETDRLLSNSDQ---LNLVYVFDNKNEESY 119 Query: 118 DPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPEDRGYHN 174 D L+ E L LA ++ + + +L A E+P + + Sbjct: 120 DSLFIECLVLYLAYKVTKATTGSQGTSDSYYMQCQELLKQAKATQAQEVPSQQFFKE 176 >gi|317151945|ref|YP_004119993.1| hypothetical protein Daes_0220 [Desulfovibrio aespoeensis Aspo-2] gi|316942196|gb|ADU61247.1| hypothetical protein Daes_0220 [Desulfovibrio aespoeensis Aspo-2] Length = 194 Score = 133 bits (334), Expect = 1e-29, Method: Composition-based stats. Identities = 39/178 (21%), Positives = 73/178 (41%), Gaps = 12/178 (6%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPL 61 + +EICN ALL LG+ + + + S A C P + +LR+ W+ AT +L Sbjct: 4 SVIEICNNALLDLGEDAIMSLGDESKAAGLCNHRWPAVRDAVLRAHPWNCATAQAELAAG 63 Query: 62 YFTYNGEL--KYYIPLKCLKVLNT-------SEETQLRGRCIVARGDTPRPLSLKYIGEV 112 T + +Y +P+ L+++ ++ +++G ++ + P+ + Y+ Sbjct: 64 AATPLWKWECRYMLPVDFLRIVRIVGGGGSQIQDWEIQGGVVLCNEE--APIYIAYVRRE 121 Query: 113 PLS-DCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPED 169 D L E A +LA L PL + + L A +D E + Sbjct: 122 TDPRRYDALLDETLAARLAATLAYPLTGSTALAQSCWNLYQDKLNEARGVDAREGVPE 179 >gi|291334459|gb|ADD94113.1| hypothetical protein Desal_1440 [uncultured phage MedDCM-OCT-S04-C1161] gi|291334516|gb|ADD94169.1| hypothetical protein Desal_1440 [uncultured phage MedDCM-OCT-S04-C1201] Length = 164 Score = 132 bits (331), Expect = 2e-29, Method: Composition-based stats. Identities = 35/162 (21%), Positives = 69/162 (42%), Gaps = 9/162 (5%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP- 60 + ++ICN AL +LG + E S A C + L RS W+ + ++L Sbjct: 3 SIVDICNGALNQLGASTILTLTEDSKNARLCNARYTQVRDSLFRSHPWNCLIKRVELAKD 62 Query: 61 -LYFTYNGELKYYIPLKCLKVLNT---SEETQLRGRCIVARGDTPRPLSLKYIGEVPL-S 115 ++ ++ +P CL+VL + ++ GR IVA T + ++Y+ + + Sbjct: 63 TATPSWGFSYQFTLPADCLRVLTILNYDYDYKIEGRKIVANHGT---VKIQYVSRIDDPN 119 Query: 116 DCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAV 157 D L +E + LA ++ + + + S + + L Sbjct: 120 QYDELLRETISAALAADIAYAVTSSNPVASNMYNLFQDKLKK 161 >gi|195541812|gb|ACF98015.1| hypothetical protein [uncultured bacterium 878] Length = 202 Score = 128 bits (323), Expect = 2e-28, Method: Composition-based stats. Identities = 34/177 (19%), Positives = 67/177 (37%), Gaps = 8/177 (4%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQS--MDLL 59 ++ ICN AL LG P+ + ++ + +A R + + L W FA + + Sbjct: 3 SQTAICNRALEYLGDAPIVSIDDDTKQAKALRRVYDNTRRAFLSDHPWHFAKKRASLPAS 62 Query: 60 PLYFTYNGELKYYIPLKCLKVLNT--SEETQLRGRCIVAR---GDTPRPLSLKYIGEVPL 114 + Y +P+ L++L L ++ D PL + Y+ +V Sbjct: 63 AAAPVWGFSRGYPVPVDFLRLLAVKDGPAFSLEADATGSQWILSDAAAPLKILYLTDVTD 122 Query: 115 S-DCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPEDR 170 + P EA A LA +L L + + ++ A ++G++ + Sbjct: 123 AGRFPPHAVEALARWLAYDLAEDLTQSNTKKQDAAQSLAIAISRAKRINGMQKQPEP 179 >gi|315122634|ref|YP_004063123.1| hypothetical protein CKC_04430 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496036|gb|ADR52635.1| hypothetical protein CKC_04430 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 228 Score = 127 bits (320), Expect = 5e-28, Method: Composition-based stats. Identities = 35/129 (27%), Positives = 57/129 (44%), Gaps = 5/129 (3%) Query: 43 LLRSFSWSFATQSMDLLPLYFT--YNGELKYYIPLKCLKVLNTSEETQLRGRCIVARGDT 100 +LR W FA + L L + Y P C+ +++ G+ I+A Sbjct: 1 MLRKARWHFAIKRKLLQELDEPLVEDRYHTYISPHDCICIIDHPPSWIFEGKKILA--AE 58 Query: 101 PRPLSLKYIGEV-PLSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAM 159 PL ++YI + L DCD L+ E A +A ELC L+ + + L+ E E ++ A Sbjct: 59 TPPLKIRYIARLQDLEDCDSLFLEILATNMALELCETLLGSESRKQTLRKEFEMLMDKAK 118 Query: 160 DMDGVEIPE 168 +E+P Sbjct: 119 TTGAIEMPP 127 >gi|260557973|ref|ZP_05830185.1| predicted protein [Acinetobacter baumannii ATCC 19606] gi|260408483|gb|EEX01789.1| predicted protein [Acinetobacter baumannii ATCC 19606] Length = 184 Score = 126 bits (316), Expect = 1e-27, Method: Composition-based stats. Identities = 39/180 (21%), Positives = 71/180 (39%), Gaps = 10/180 (5%) Query: 2 TELEICNWALLKLGQ-RPLENWNERSLKADYCRLLLPPIHKGLL-RSFSWSFATQSMDLL 59 + +++CN AL L Q + E + A C P + LL W+FA + + L Sbjct: 3 SIVDLCNLALSHLAQGYVVNELTEPTKHARLCNTFYPICRRELLDNEHQWTFAVKRVRLN 62 Query: 60 PLYFTYNGELKYYIPLKCLKV--LNTSEETQLRGRCIVARGDTPRPLSLKYIGEV-PLSD 116 + Y Y +P + + L + + G + P L+Y+ +V L+ Sbjct: 63 -VDAGYEFGTAYVLPSDKVHIFQLESGSRFYVEGNLLFTEDTAP---ILRYVHDVKDLAL 118 Query: 117 CDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVE-IPEDRGYHNG 175 ++ A + LA + PL ++Q + + E A+ +D + E R H G Sbjct: 119 MPDSFKTALSYLLAARIAGPLTQNEQKQISMMQLYEIEKNKAIFIDLQQHRIEARPEHTG 178 >gi|291336444|gb|ADD95999.1| conserved hypothetical protein [uncultured organism MedDCM-OCT-S04-C1073] Length = 129 Score = 125 bits (315), Expect = 2e-27, Method: Composition-based stats. Identities = 28/120 (23%), Positives = 53/120 (44%), Gaps = 8/120 (6%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP- 60 + ++ICN AL +LG + E S A C + L RS W+ + ++L Sbjct: 3 SIVDICNGALNQLGASTILTLTEDSKNARLCNARYTQVRDSLFRSHPWNCLIKRVELAKD 62 Query: 61 -LYFTYNGELKYYIPLKCLKVLNT---SEETQLRGRCIVARGDTPRPLSLKYIGEVPLSD 116 ++ ++ +P CL+VL + ++ GR IVA T + ++Y+ + + Sbjct: 63 TATPSWGFSYQFTLPADCLRVLTILNYDYDYKIEGRKIVANHGT---VKIQYVSRIDDPN 119 >gi|291334664|gb|ADD94311.1| conserved hypothetical protein [uncultured phage MedDCM-OCT-S04-C695] Length = 132 Score = 125 bits (315), Expect = 2e-27, Method: Composition-based stats. Identities = 31/128 (24%), Positives = 57/128 (44%), Gaps = 9/128 (7%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP- 60 + ++ICN AL +LG + E S A C + L RS W+ + ++L Sbjct: 3 SIVDICNGALNQLGASTILTLTEDSKNARLCNARYTQVRDSLFRSHPWNCLIKRVELAKD 62 Query: 61 -LYFTYNGELKYYIPLKCLKVLNT---SEETQLRGRCIVARGDTPRPLSLKYIGEVPL-S 115 ++ ++ +P CL+VL + ++ GR IVA T + ++Y+ + + Sbjct: 63 TATPSWGFSYQFTLPADCLRVLTILNYDYDYKIEGRKIVANHGT---VKIQYVSRIDDPN 119 Query: 116 DCDPLYQE 123 D L +E Sbjct: 120 QYDELLRE 127 >gi|303257569|ref|ZP_07343581.1| conserved hypothetical protein [Burkholderiales bacterium 1_1_47] gi|302859539|gb|EFL82618.1| conserved hypothetical protein [Burkholderiales bacterium 1_1_47] Length = 214 Score = 125 bits (314), Expect = 2e-27, Method: Composition-based stats. Identities = 44/189 (23%), Positives = 78/189 (41%), Gaps = 18/189 (9%) Query: 2 TELEICNWALLKLGQR-PLENWN--ERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDL 58 + ++ICN AL +LG R + + + E S +AD+CR P K +L +++WSFAT +L Sbjct: 3 SVVDICNIALSRLGDRATVTSIDPPEGSAQADHCRRFYPIALKTILATYNWSFATTRKEL 62 Query: 59 LPLYFTY-NGELKYYIPLKCLKVL---------NTSEETQLRGRCI----VARGDTPRPL 104 L G + IP C+K++ N + +T R + V + + Sbjct: 63 ARLTAEPIGGGYAFPIPADCVKIIYAYPVDENGNATRQTLHYVRELINGQVCLVAEQKRI 122 Query: 105 SLKYI-GEVPLSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDG 163 ++YI EV ++ +A A LA L ++ + Sbjct: 123 WIRYITTEVKPEKFSDVFSDALAFLLASNLAGTVVPGMTGVQMAAEMMRFYEDRLLKAQA 182 Query: 164 VEIPEDRGY 172 + +DR + Sbjct: 183 QDAVQDRDH 191 >gi|218886167|ref|YP_002435488.1| hypothetical protein DvMF_1066 [Desulfovibrio vulgaris str. 'Miyazaki F'] gi|218757121|gb|ACL08020.1| conserved hypothetical protein [Desulfovibrio vulgaris str. 'Miyazaki F'] Length = 198 Score = 125 bits (314), Expect = 2e-27, Method: Composition-based stats. Identities = 43/172 (25%), Positives = 58/172 (33%), Gaps = 3/172 (1%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPL 61 +E+ ICN AL LG + ++ S +AD C LL W+FAT+ L PL Sbjct: 17 SEVSICNKALRYLGAPEIVALDQPSREADLCSRYYAEARDELLEHHHWNFATRYTSLAPL 76 Query: 62 YFTYNGEL--KYYIPLKCLKVLNTSEETQLRGRCIVARGDTPRPLSLKYIGEVPLSD-CD 118 Y +P CL+V E P V Sbjct: 77 AAVPPFGFACAYRLPGDCLRVRRLRGEPPFEVVEARTLYTDAAPAEAVLTVRVTDPARFP 136 Query: 119 PLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPEDR 170 L+ EA A +LA L PL+ + + L A D E D Sbjct: 137 ALFVEALARRLAAALAVPLMNSSRTEQAMLQRFGDALEAARVADAAEGASDP 188 >gi|260549510|ref|ZP_05823728.1| conserved hypothetical protein [Acinetobacter sp. RUH2624] gi|260407303|gb|EEX00778.1| conserved hypothetical protein [Acinetobacter sp. RUH2624] Length = 184 Score = 123 bits (309), Expect = 9e-27, Method: Composition-based stats. Identities = 41/180 (22%), Positives = 74/180 (41%), Gaps = 10/180 (5%) Query: 2 TELEICNWALLKLGQRPLEN-WNERSLKADYCRLLLPPIHKGLL-RSFSWSFATQSMDLL 59 + +++CN AL L Q + NE + A+ C P + LL W+FA + + L Sbjct: 3 SIVDLCNLALSHLAQGYVVKELNEPTKHANLCNTFYPVCRRELLDNEHQWTFAVKRVRLN 62 Query: 60 PLYFTYNGELKYYIPLKCLKV--LNTSEETQLRGRCIVARGDTPRPLSLKYIGEVPLSDC 117 + Y Y +P +++ L + + G + P P+ L+Y+ +V Sbjct: 63 -VDAGYEFGRAYVLPSDKIRLFQLESGASYYVEGNLLFTDD--PAPV-LRYVHDVKDLAL 118 Query: 118 DP-LYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVE-IPEDRGYHNG 175 P +Q A + LA + PL ++ + + L A+ +D + E R H G Sbjct: 119 LPETFQVALSYLLAARIAGPLTQSEEKQRQMMAGYAMNLGKAIFIDLQQHRIEKRPEHVG 178 >gi|292670775|ref|ZP_06604201.1| conserved hypothetical protein [Selenomonas noxia ATCC 43541] gi|292647396|gb|EFF65368.1| conserved hypothetical protein [Selenomonas noxia ATCC 43541] Length = 199 Score = 123 bits (308), Expect = 1e-26, Method: Composition-based stats. Identities = 40/188 (21%), Positives = 72/188 (38%), Gaps = 19/188 (10%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 M ++++CN AL ++G +E E S A C + +LR + W++AT+ + L Sbjct: 2 MDKIDVCNLALSRIGIDTVEALTEASEPARVCSQFYDHCRRVVLRKYPWTWATRRVQLAE 61 Query: 61 LYFTY-NGELKYYIPLKC--LKVLNTSEETQL------------RGRCIVARGDTPRPLS 105 L Y P C L+ L + GR I +S Sbjct: 62 LTDKPQGYSYAYRYPASCVALRKLYNGHFDNIPAYTGYQIVSDKEGRVIYTDV---ANVS 118 Query: 106 LKYIGEVPLSD-CDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGV 164 +Y ++ + D + EA + KLAG + L + Q+ Y + + + A+ + Sbjct: 119 AEYTADIEDTGLFDDQFVEALSWKLAGAIAFKLTGNAQLPGYCEEQYTALFLDAVANNED 178 Query: 165 EIPEDRGY 172 E + Sbjct: 179 EQNAEEKE 186 >gi|282848882|ref|ZP_06258272.1| hypothetical protein HMPREF1035_1391 [Veillonella parvula ATCC 17745] gi|282581387|gb|EFB86780.1| hypothetical protein HMPREF1035_1391 [Veillonella parvula ATCC 17745] Length = 197 Score = 120 bits (300), Expect = 9e-26, Method: Composition-based stats. Identities = 44/190 (23%), Positives = 72/190 (37%), Gaps = 21/190 (11%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDL-L 59 MT +ICN AL + L ++ E + + CR+ P + +LR F W+FA ++ L L Sbjct: 1 MTPTDICNQALALINAGLLYSFEEETEQGRQCRMQYDPTRQLVLRQFEWNFARKNERLVL 60 Query: 60 PLYFTYNGELKYYIPLKCLKVLNTSEETQL----------------RGRCIVARGDTPRP 103 + Y P +C+++L + +CIV+ Sbjct: 61 SAHKINGWNYVYAYPERCIRILGVIPQGDRFHAESQPEYNIFNIGNNKKCIVSDVPLA-- 118 Query: 104 LSLKYIGEVPLSD-CDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMD 162 + YI +V D D + KLA L PL D + + + A M+ Sbjct: 119 -FIDYIYDVTDLDVWDSISLYMLQCKLASALAMPLTGDRGLFDQAYKLYQAAVQEAKGMN 177 Query: 163 GVEIPEDRGY 172 E +D Y Sbjct: 178 AKERKQDTVY 187 >gi|262043658|ref|ZP_06016767.1| hypothetical protein HMPREF0484_3786 [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259038996|gb|EEW40158.1| hypothetical protein HMPREF0484_3786 [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 202 Score = 118 bits (297), Expect = 2e-25, Method: Composition-based stats. Identities = 41/189 (21%), Positives = 81/189 (42%), Gaps = 23/189 (12%) Query: 3 ELEICNWAL-LKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPL 61 + +I N AL LG L + +E+ L+ P+ + L R ++W+F +S L PL Sbjct: 5 QTDIMNLALVSFLGSDHLMDRDEQDKNTRVMNLIYNPLLEKLQREYAWNFCARSTQLTPL 64 Query: 62 --YFTYNGELKYYIPLKCLKVLNTSEET-----------------QLRGRCIVARGDTPR 102 + + Y +P+ + +++ + ++ GR ++ P Sbjct: 65 NKTPVIDYQYAYQLPVDLMVLVSVGDRYYGRDFAEYDPRLVTAEYRIEGRELLTN--LPP 122 Query: 103 PLSLKYIGEVPL-SDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDM 161 PLSL+Y +V S D + +A A LA C + D + L + + ++ A+ Sbjct: 123 PLSLRYRAKVTDASKFDSTFVDALACMLAVRSCKAITGKDTLVESLLQQFQMIITGAVRA 182 Query: 162 DGVEIPEDR 170 + +E P ++ Sbjct: 183 NAIEKPSEK 191 >gi|226940468|ref|YP_002795542.1| hypothetical protein LHK_01545 [Laribacter hongkongensis HLHK9] gi|226715395|gb|ACO74533.1| hypothetical protein LHK_01545 [Laribacter hongkongensis HLHK9] Length = 201 Score = 116 bits (290), Expect = 1e-24, Method: Composition-based stats. Identities = 42/191 (21%), Positives = 71/191 (37%), Gaps = 20/191 (10%) Query: 2 TELEICNWALLKLGQ-RPLENWN--ERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDL 58 + ++ICN AL +LG + + + E S +A+ C P LL + W FA + + L Sbjct: 3 SVVDICNLALARLGDDATVASIDPPEGSPQAEQCARFYPVARDALLEAHPWQFAVRRVRL 62 Query: 59 LPLY-FTYNGELKYYIPLKCLKVLNTSE------------ETQLRGRCIVARGDTPRPLS 105 L T+N Y P L++L ET+ + + Sbjct: 63 AQLSLPTWNWRHVYAAPNNALQLLGVLPEAAASDDETESFETESDDSGAILIRTDQAAAT 122 Query: 106 LKYIGEVPLSD-CDPLYQEAFALKLAGELCPPLI---MDDQMRSYLKGESERVLAVAMDM 161 L+Y V + PL+ +A LA L PLI +M V + A+ Sbjct: 123 LRYTARVTDASRFSPLFVDALGWLLASHLAGPLIKGAEGAKMAQQCYANFRTVFSQAVVS 182 Query: 162 DGVEIPEDRGY 172 + + + + Sbjct: 183 NANQRHVEPSH 193 >gi|301019349|ref|ZP_07183535.1| conserved domain protein [Escherichia coli MS 196-1] gi|299882266|gb|EFI90477.1| conserved domain protein [Escherichia coli MS 196-1] Length = 128 Score = 115 bits (288), Expect = 2e-24, Method: Composition-based stats. Identities = 25/84 (29%), Positives = 38/84 (45%), Gaps = 2/84 (2%) Query: 2 TELEICNWALLKLGQ-RPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 + +EICN AL +G R + + E S +A C L +L F W+FAT+ + L Sbjct: 3 SVVEICNRALSNIGNSRSINSLTEASKEAGECSLHFEACRDAVLSDFDWNFATKRVALAD 62 Query: 61 LY-FTYNGELKYYIPLKCLKVLNT 83 + E Y P CL++ Sbjct: 63 TSNPPPDWEYAYQYPSDCLRITEI 86 >gi|288959383|ref|YP_003449724.1| hypothetical protein AZL_025420 [Azospirillum sp. B510] gi|288911691|dbj|BAI73180.1| hypothetical protein AZL_025420 [Azospirillum sp. B510] Length = 187 Score = 113 bits (284), Expect = 7e-24, Method: Composition-based stats. Identities = 36/168 (21%), Positives = 62/168 (36%), Gaps = 8/168 (4%) Query: 4 LEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPL-- 61 +++ N AL + + E ++ CR LR++ W+ A L L Sbjct: 5 IDLFNRALGWCRAQAMVTLEEDTVSGRACRRFYATSRDATLRAYPWNSAASRAVLAALVD 64 Query: 62 YFTYNGELKYYIPLKCLKVL----NTSEETQLRGRCIVARGDTPRPLSLKYIGEVPLSDC 117 + + +Y +P CL V + + + GR ++ PL LKY + Sbjct: 65 APEFEFDHQYRLPDDCLAVRQLWDDPQADYVVEGRSLLTN--LSAPLRLKYTARIEPDGM 122 Query: 118 DPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVE 165 DPL A A +LA ++ P L + + A +D E Sbjct: 123 DPLLFNAVAARLAYDIAPGLTESSTVTDKSLERFAAAVREARAVDAAE 170 >gi|167032762|ref|YP_001667993.1| hypothetical protein PputGB1_1754 [Pseudomonas putida GB-1] gi|166859250|gb|ABY97657.1| conserved hypothetical protein [Pseudomonas putida GB-1] Length = 211 Score = 110 bits (275), Expect = 7e-23, Method: Composition-based stats. Identities = 37/178 (20%), Positives = 63/178 (35%), Gaps = 28/178 (15%) Query: 2 TELEICNWALLKLGQR-PLENWNERSLKADYCRLLLPPIHKGLLRSF-SWSFATQSMDLL 59 + ++ICN AL ++G +++ E S A C+ +LR+ W FAT L Sbjct: 3 SVIDICNMALFRIGNSIRIDDLEENSTAARICKQFYESCRDFVLRADCDWGFATAFAQLA 62 Query: 60 PLY--FTYNGELKYYIPLKCLKVLNTSEETQLRGRC-IVARGDTP--------------- 101 + Y +P C++V +G P Sbjct: 63 EVASNPNPEYPYAYAVPNDCMRVRRIVNPVFPQGVWPAGMDCQVPEIPRIPFRIINGTSQ 122 Query: 102 -------RPLSLKYIGEVP-LSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGES 151 P +L+Y +V DP++ A + LA ++ PL D + + GE Sbjct: 123 RLISTNVSPATLEYTLKVESPEMFDPIFVSALSWYLASQIAGPLAKDAGIATTCAGEY 180 >gi|220903986|ref|YP_002479298.1| hypothetical protein Ddes_0712 [Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774] gi|219868285|gb|ACL48620.1| conserved hypothetical protein [Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774] Length = 196 Score = 108 bits (271), Expect = 2e-22, Method: Composition-based stats. Identities = 47/188 (25%), Positives = 69/188 (36%), Gaps = 17/188 (9%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLL- 59 ++++++ N AL LG R + + E +A C L + +LR F WSFA + L Sbjct: 3 ISQIDVWNRALGFLGARSVASERESIPEALQCSLYWDSARRQVLRDFPWSFAQRRAWLAL 62 Query: 60 ---PLYFTYNGELKYYIPLKCLKVLNTSEETQL----------RGRCIVARGDTPRPLSL 106 P + Y +P CLKV E G + + R L L Sbjct: 63 QALPQGYAPEYRFAYALPENCLKVHEVRHEGLSPRPFCLAMSPAGDSSLLVTNASRALVL 122 Query: 107 KYIGEVPLSD-CDPLYQEAFALKLAGELCPPLI-MDDQMRSYLKGESERVLAVAMDMDGV 164 Y +V S D L+ A KLA + PL+ Q + L+ L A Sbjct: 123 -YTEDVRNSRLFDDLFAHMLARKLAALVSVPLLKGSGQKAAELEQLYAASLPPARGAAAS 181 Query: 165 EIPEDRGY 172 E E Sbjct: 182 ERSEKPAE 189 >gi|46580125|ref|YP_010933.1| hypothetical protein DVU1715 [Desulfovibrio vulgaris str. Hildenborough] gi|46449541|gb|AAS96192.1| hypothetical protein DVU_1715 [Desulfovibrio vulgaris str. Hildenborough] gi|311233882|gb|ADP86736.1| hypothetical protein Deval_1581 [Desulfovibrio vulgaris RCH1] Length = 195 Score = 105 bits (263), Expect = 2e-21, Method: Composition-based stats. Identities = 35/172 (20%), Positives = 65/172 (37%), Gaps = 10/172 (5%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLL-- 59 + + I N AL +GQ + + +++ A C + +LR+ W+ AT L Sbjct: 3 SIVGIMNKALRLIGQTEIASPEQQTPSAVRCSAAWRDVVDEVLRAHPWAHATVWAALSKS 62 Query: 60 PLYFTYNGELKYYIPLKCLKVLNTSEETQLRGRCIVARG------DTPRPLSLKYIGEVP 113 + Y +P C++V++ + L + + P + +Y+ Sbjct: 63 DTPPPFGFAHAYVLPADCVRVIDVRADGNLARQGLPFEVVRGNIYTDAAPCNTRYVARDA 122 Query: 114 LSD-CDPLYQEAFALKLAGELCPPLIMD-DQMRSYLKGESERVLAVAMDMDG 163 P + +A A KLA E+ PL D M + + V+ A D Sbjct: 123 DPTFWPPDFYDAVAAKLAAEVALPLARDGGAMMRAMLDRYQWVVDAARLNDA 174 >gi|41179375|ref|NP_958683.1| Bbp14 [Bordetella phage BPP-1] gi|45569507|ref|NP_996576.1| hypothetical protein BMP-1p13 [Bordetella phage BMP-1] gi|45580758|ref|NP_996624.1| hypothetical protein BIP-1p13 [Bordetella phage BIP-1] gi|40950114|gb|AAR97680.1| Bbp14 [Bordetella phage BPP-1] Length = 223 Score = 105 bits (261), Expect = 4e-21, Method: Composition-based stats. Identities = 42/195 (21%), Positives = 66/195 (33%), Gaps = 24/195 (12%) Query: 2 TELEICNWALLKLGQRPLEN---WNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDL 58 +E++ICN AL LG E S++A+YC P LL +W FAT+ L Sbjct: 3 SEVDICNLALAYLGDEATVAGINPPEGSVQAEYCARFYPFARDSLLELHTWGFATKCAQL 62 Query: 59 LPLYFT-YNGELKYYIPLKCLKVLNTSEE----------------TQLRGRCIVARGDTP 101 + + Y P +K++ ++ Sbjct: 63 AAMGISRPEWRFAYAQPADAIKIVAVLPHDAANIEAGIDNAQPFSCEIDNTGADIILTNQ 122 Query: 102 RPLSLKYIGEVPLSD-CDPLYQEAFALKLAGELCPPLIMDD---QMRSYLKGESERVLAV 157 +YI V + PL+ +A A LA L PL+ D G + L+ Sbjct: 123 VNAVARYISLVKDTTKFSPLFVQALAWHLASMLAGPLLKGDVGAAESKRCVGAMQAYLSQ 182 Query: 158 AMDMDGVEIPEDRGY 172 AM D + + Sbjct: 183 AMVSDANQRKTKPAH 197 >gi|187476935|ref|YP_784959.1| phage protein [Bordetella avium 197N] gi|115421521|emb|CAJ48030.1| phage protein [Bordetella avium 197N] Length = 223 Score = 102 bits (255), Expect = 2e-20, Method: Composition-based stats. Identities = 45/188 (23%), Positives = 68/188 (36%), Gaps = 24/188 (12%) Query: 2 TELEICNWALLKLGQRPLEN---WNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDL 58 +E++ICN AL LG E S++A+YC P LL +W FAT+ L Sbjct: 3 SEVDICNLALAYLGDEATVAGINPPEGSVQAEYCARFYPFARDSLLELHTWGFATKCAQL 62 Query: 59 LPLYFT-YNGELKYYIPLKCLKVLNTSEETQLRGRCIVAR--------GDTPRPLSL--- 106 + + Y P +K++ + DT + L Sbjct: 63 AAMDISRPEWHFAYAQPADAIKIVAVLPHDAANIEAGIDNAQPFSCEIDDTGADIILSNQ 122 Query: 107 -----KYIGEVPLSD-CDPLYQEAFALKLAGELCPPLI-MDDQMRSYLKGESER--VLAV 157 +YI V + PL+ +A A LA L PL+ D + S L+ Sbjct: 123 VNAVARYISLVKDTTKFSPLFVQALAWHLASMLAGPLLKGDVGAAESKRCVSAMQAYLSQ 182 Query: 158 AMDMDGVE 165 AM D + Sbjct: 183 AMVSDANQ 190 >gi|118590939|ref|ZP_01548339.1| hypothetical protein SIAM614_19801 [Stappia aggregata IAM 12614] gi|118436461|gb|EAV43102.1| hypothetical protein SIAM614_19801 [Stappia aggregata IAM 12614] Length = 195 Score = 100 bits (249), Expect = 7e-20, Method: Composition-based stats. Identities = 34/176 (19%), Positives = 70/176 (39%), Gaps = 14/176 (7%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPL 61 T +E+ N AL L + P+ +++ S+ + + + L W FA + ++L Sbjct: 7 TAVEMANLALAHLKEAPIRDFDFSSVASRWFKNHYAAHRDAYLAMHDWDFAIELVNLPAE 66 Query: 62 Y--FTYNGELKYYIPLKCLKVLNTSEET---------QLRGRCIVARGDTPRPLSLKYIG 110 + +Y +P L++ S ++ G+ I++ D P P L+++ Sbjct: 67 TEKPPFRWAYQYKLPAGTLRIPQQSVGGSPGGTIIPFEVVGQRIMS--DHPPPFPLRHVR 124 Query: 111 E-VPLSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVE 165 V ++ PL+ FAL LA + + L+ + A D ++ Sbjct: 125 RVVREAEFAPLFVNGFALFLAAGCAHSITGKNSRAEALRQAAREAFDQAGTSDALQ 180 >gi|303328571|ref|ZP_07359006.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] gi|302861337|gb|EFL84276.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] Length = 199 Score = 96.6 bits (239), Expect = 1e-18, Method: Composition-based stats. Identities = 31/170 (18%), Positives = 60/170 (35%), Gaps = 7/170 (4%) Query: 5 EICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP--LY 62 I N AL+++G + + + + A + L WSFA + L Sbjct: 17 SIINTALMRVGAQGVTAAFQDAPAAQTAAAAYDRVLAYCLSLHPWSFAMRHAALARGTES 76 Query: 63 FTYNGELKYYIPLKCLKVLNTSEETQLRGRCIVARGDT----PRPLSLKYIGEVPLSDCD 118 + +P CL+V+ + + + C GD + + Y+ + Sbjct: 77 PAPGWRYAWPLPADCLRVVTAAADGGEQHICYSLAGDALQADAAAVIVSYVSS-EAASFP 135 Query: 119 PLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPE 168 + +A A ++A E+ P + ER L A + ++ PE Sbjct: 136 DAFADALAWRVALEIAPYVQQGGGNARDYLEMFERALDRAKVENDMQRPE 185 >gi|291336966|gb|ADD96492.1| conserved hypothetical protein [uncultured organism MedDCM-OCT-S11-C1587] Length = 211 Score = 96.2 bits (238), Expect = 2e-18, Method: Composition-based stats. Identities = 38/196 (19%), Positives = 62/196 (31%), Gaps = 33/196 (16%) Query: 2 TELEICNWALLKLGQRPLENWNERS-LKADYCRLLLPPIHKGLLRSFSWSFATQSMDL-- 58 T + ICN AL LG + ++++ + A C L I K L + WSFA L Sbjct: 6 TNVTICNQALTLLGADTISSFSDTTNDAAAVCNNLYTTIKKKTLSLYPWSFALVKEQLNK 65 Query: 59 -LPLYFTYNGELKYYIPLKCL-------------KVLNTSE---ETQLRGRCIVARGDTP 101 + ++ +P + ++L G I D Sbjct: 66 SAGVTPVNEWTNQFILPSNSVSGTPLEVYNSSSTRILPIQSYELGYTSSGPAIFTNEDK- 124 Query: 102 RPLSLKYIGEV-PLSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVL----- 155 + + YI V + + LA L P+ Y +G + L Sbjct: 125 --IYVDYISNVITEGLMPSYFVQLLVYMLAWHLAEPVTDQITKAEYWRGVALGSLTENGR 182 Query: 156 ----AVAMDMDGVEIP 167 A ++DG P Sbjct: 183 GGYFRQACNVDGRGKP 198 >gi|54302253|ref|YP_132246.1| hypothetical protein PBPRB0573 [Photobacterium profundum SS9] gi|46915674|emb|CAG22446.1| hypothetical protein PBPRB0573 [Photobacterium profundum SS9] Length = 207 Score = 93.1 bits (230), Expect = 1e-17, Method: Composition-based stats. Identities = 35/185 (18%), Positives = 67/185 (36%), Gaps = 25/185 (13%) Query: 2 TELEICNWALLKLG-QRPLENWN-ERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLL 59 + +ICN +L +G + ++ ER+ A +CR K L F WSFA + L Sbjct: 4 SRTDICNMSLGNIGHAEKIADYELERTPAAKHCRTYYEICVKTTLEDFDWSFARRVSPLA 63 Query: 60 PLYFTY--NGELKYYIPLKCLKVLNTSE-----ETQL--------------RGRCIVARG 98 L + + Y P CL V ++ Q+ G+ V Sbjct: 64 LLSGESSRSYKYVYAFPDDCLVVRRINQVMFAGHNQIPNNLVIGYEKGVSSDGKRKVLFC 123 Query: 99 DTPRPLSLKYIGEVPLSD-CDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAV 157 D P + + Y + +A ++ L L + ++ + + +++ Sbjct: 124 DEPD-IEIVYTSSDTNEALFTTTFVDAMTWRMGARLANVLPKNSKLSAQCQQTYSAIISQ 182 Query: 158 AMDMD 162 A++ D Sbjct: 183 AVEQD 187 >gi|290968642|ref|ZP_06560180.1| hypothetical protein HMPREF0889_0282 [Megasphaera genomosp. type_1 str. 28L] gi|290781295|gb|EFD93885.1| hypothetical protein HMPREF0889_0282 [Megasphaera genomosp. type_1 str. 28L] Length = 232 Score = 92.7 bits (229), Expect = 2e-17, Method: Composition-based stats. Identities = 38/191 (19%), Positives = 66/191 (34%), Gaps = 26/191 (13%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 +TE +CN AL LG+ ++ + A C L + +LR+F+W FA + L Sbjct: 25 LTETHLCNMALSYLGKGSIQELTNMNENAVTCSLFYDITRQEVLRTFNWGFAHRIEPLSV 84 Query: 61 LYF----------------TYNGELKYYIPLKCLKV----LNTSEETQLRGRCIVARGDT 100 L N Y P KC+K+ ++ + ++ + G Sbjct: 85 LVTERVDENGNKTDIKDKLPSNFSNFYMYPYKCIKLNAVTIHGDDSQRMEFETLSVGGIK 144 Query: 101 PRP-----LSLKYIGEVPLSD-CDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERV 154 Y+ + + D L+ AF LA ++ L Q+ + Sbjct: 145 AIATNIGNAYADYVCDETNPNEYDSLFITAFTHLLASKIAVRLTGMPQLEQAQMQIYMQT 204 Query: 155 LAVAMDMDGVE 165 + A D E Sbjct: 205 VQQAHLYDSRE 215 >gi|302339300|ref|YP_003804506.1| hypothetical protein Spirs_2809 [Spirochaeta smaragdinae DSM 11293] gi|301636485|gb|ADK81912.1| hypothetical protein Spirs_2809 [Spirochaeta smaragdinae DSM 11293] Length = 197 Score = 92.0 bits (227), Expect = 3e-17, Method: Composition-based stats. Identities = 36/188 (19%), Positives = 71/188 (37%), Gaps = 17/188 (9%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDL-- 58 MT LE+ N AL ++ + L + + AD CRL L P+ + W FA + L Sbjct: 1 MTALELINIALTRMNESKLTSITIDTYVADLCRLFLKPVADEVTLEEDWQFARKREALRT 60 Query: 59 -LPLYFTYNGELKYYIPLKCL--KVLNTSEETQLRGRCIVARGDTPRPLSLKYIGEVP-- 113 + Y +P CL + L + + ++ + + + + + E+ Sbjct: 61 DDEGENQTDYGYMYRLPADCLFPRQLRSKGQYEIEDGRLYTGDEAAVLIYTRSMVEIAHD 120 Query: 114 ----------LSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDG 163 ++ + +A A +L ++ P + + + S L E +L DG Sbjct: 121 DENGLDIPVLVTSFPVTFAQAVACRLGSQIGPKISDNFNLASALGQEYMVMLEKTKAFDG 180 Query: 164 VEIPEDRG 171 + P + Sbjct: 181 MLSPAEDE 188 >gi|212703240|ref|ZP_03311368.1| hypothetical protein DESPIG_01282 [Desulfovibrio piger ATCC 29098] gi|212673506|gb|EEB33989.1| hypothetical protein DESPIG_01282 [Desulfovibrio piger ATCC 29098] Length = 195 Score = 91.2 bits (225), Expect = 5e-17, Method: Composition-based stats. Identities = 26/178 (14%), Positives = 57/178 (32%), Gaps = 18/178 (10%) Query: 5 EICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPLYFT 64 + N AL+++G R + + + L + WSFA + L Sbjct: 6 SVLNTALVRVGARMGNAGFQDTPAMQVAGAVYDKCRTLCLSLYPWSFALRRQRLALSAEA 65 Query: 65 YN--GELKYYIPLKCLKVLNTSE------------ETQLRGRCIVARGDTPRPLSLKYIG 110 + +P ++V+ + ++ G + D + L Y+ Sbjct: 66 AQGPWPFAFVLPSDSVRVVGAARPDPFTARPGPDVRFEIVGDRLETDADG---VLLTYVS 122 Query: 111 EVPLSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPE 168 + ++ L+ + A ++A E+ P + E+ L A D + Sbjct: 123 GL-VTTFPDLFADMLAWRVAFEMAPYISQGGAQAEGYLQMFEQALDRAKVEDDAQASP 179 >gi|212703341|ref|ZP_03311469.1| hypothetical protein DESPIG_01384 [Desulfovibrio piger ATCC 29098] gi|212673187|gb|EEB33670.1| hypothetical protein DESPIG_01384 [Desulfovibrio piger ATCC 29098] Length = 187 Score = 90.0 bits (222), Expect = 1e-16, Method: Composition-based stats. Identities = 47/179 (26%), Positives = 67/179 (37%), Gaps = 17/179 (9%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 +T++ I N AL LG R + +E + +A CRL + LR + W+FA LP Sbjct: 4 LTQIAIWNRALGFLGARSIAAEDENTPEALQCRLYWDSARRQALRDYPWAFA------LP 57 Query: 61 LYFTYNGELKYYIPLKCLKVLNTSEETQLRGRCIVARG---------DTPRPLSLKYIGE 111 F Y +P CLK E L +AR + L Y + Sbjct: 58 TGFELEYRFAYALPESCLKAHEVRHEGLLPRPFCLARDPAGDATILLTSASRALLLYTED 117 Query: 112 V-PLSDCDPLYQEAFALKLAGELCPPLIMDDQMR-SYLKGESERVLAVAMDMDGVEIPE 168 V D L+ A KLA L PL+ + + + L+ L A D E Sbjct: 118 VRHCHQFDDLFAHMLARKLACLLAAPLLKSNSQKIAELEQLYSASLPQARQSDASERRP 176 >gi|291334271|gb|ADD93934.1| hypothetical protein [uncultured marine bacterium MedDCM-OCT-S08-C235] Length = 119 Score = 89.7 bits (221), Expect = 1e-16, Method: Composition-based stats. Identities = 19/79 (24%), Positives = 40/79 (50%), Gaps = 2/79 (2%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPL 61 T+++IC AL+ +G P+ ++++ + ++ L K +L S+ W FA++ L L Sbjct: 4 TKIDICARALVMIGASPISSFSDGTTESTVASNLYLDTIKNMLSSYRWRFASKQSQLSRL 63 Query: 62 YFTYN--GELKYYIPLKCL 78 T + + Y +P + Sbjct: 64 TDTPDHRWDSAYQLPSDLV 82 >gi|303328387|ref|ZP_07358824.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] gi|302861381|gb|EFL84318.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] Length = 196 Score = 83.1 bits (204), Expect = 1e-14, Method: Composition-based stats. Identities = 41/187 (21%), Positives = 67/187 (35%), Gaps = 25/187 (13%) Query: 9 WALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLL----PLYFT 64 AL +G R + +ER +A C L + LR + ++FA + L P + Sbjct: 1 MALGYVGTRTVAAEDERCEEAVQCSLYWDAARRQALRDYPYNFARRRAWLAEAALPEVYA 60 Query: 65 YNGELKYYIPLKCLKV-----------LNTSEETQLRGRCIVARGDTPRPLSLK------ 107 Y +P LKV ++ + R ++A D L L Sbjct: 61 VEWTHAYALPDGWLKVHHLVNAGWRRLIHADAAHERRAPFLLAATDEGAELLLTDVAPAL 120 Query: 108 --YIGEVPLSD-CDPLYQEAFALKLAGELCPPLIMDDQMR-SYLKGESERVLAVAMDMDG 163 Y +VP + D L+ A KLA + PL+ ++ + + L L A + Sbjct: 121 ISYTADVPDTSRFDDLFTGMLARKLAVMIAVPLLKNNTGKVNELAQLYSASLPDARQANA 180 Query: 164 VEIPEDR 170 E E Sbjct: 181 SEHKERP 187 >gi|119386473|ref|YP_917528.1| hypothetical protein Pden_3766 [Paracoccus denitrificans PD1222] gi|119377068|gb|ABL71832.1| hypothetical protein Pden_3766 [Paracoccus denitrificans PD1222] Length = 202 Score = 81.2 bits (199), Expect = 5e-14, Method: Composition-based stats. Identities = 39/191 (20%), Positives = 65/191 (34%), Gaps = 20/191 (10%) Query: 2 TELEICNWALL-KLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 + + ICN AL LG R + + + S +A+ C+L I LL + W FA++ L+ Sbjct: 3 SAIAICNVALTTYLGSRTITAFEDTSPEAEQCKLHYDRIRLSLLERWPWVFASRREKLVE 62 Query: 61 LYFTY---NGELKYYIPLKCLKV--------------LNTSEETQLRGRCIVARGDTPRP 103 +Y P + L S +T D P Sbjct: 63 ETLNDRAGAWTYRYARPGHMAAIRWVNDVTAARMAMQLGQSPDTLRELTADSIYSDVPGA 122 Query: 104 LSLKYIGEVPLSD-CDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMD 162 + ++Y + + + A LA + P+ D + K E+ +L AM D Sbjct: 123 V-IEYTRDETDPTVFTSAFADVLAAHLAAAIAMPITRDAAKVTGAKREASELLDQAMVAD 181 Query: 163 GVEIPEDRGYH 173 P Sbjct: 182 FNSRPATEQTF 192 >gi|13186159|emb|CAC33470.1| hypothetical protein [Legionella pneumophila] Length = 201 Score = 78.5 bits (192), Expect = 4e-13, Method: Composition-based stats. Identities = 35/180 (19%), Positives = 64/180 (35%), Gaps = 9/180 (5%) Query: 2 TELEICNWALLKLG-QRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 T++EI + A+ +G Q+ + + A L + L S W FA + Sbjct: 10 TKIEIISAAISMVGKQQTVNTIDGGGALAIDAEKLYDTLVSAELGSNRWRFAQAFQQISI 69 Query: 61 LYFT----YNGELKYYIPLKCLKV--LNTSEETQLRGRCIVARGDTPRPLSLKYIGEVPL 114 + + IP C+ V L + + + G I+ + + +L Y VP+ Sbjct: 70 ITTLNPTFDGWLYECQIPADCIMVQYLYPNIQYIVFGDKILTKSNQT--FTLIYSRNVPV 127 Query: 115 SDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPEDRGYHN 174 S P + LA L + D+M + + + + A+ D HN Sbjct: 128 SKWPPPFCLYIVYHLASMLGISVTNSDRMLARISQGMQMWESRALFADAQSSVTLPFRHN 187 >gi|42526656|ref|NP_971754.1| hypothetical protein TDE1146 [Treponema denticola ATCC 35405] gi|41816849|gb|AAS11635.1| hypothetical protein TDE_1146 [Treponema denticola ATCC 35405] Length = 196 Score = 78.1 bits (191), Expect = 4e-13, Method: Composition-based stats. Identities = 24/171 (14%), Positives = 57/171 (33%), Gaps = 17/171 (9%) Query: 5 EICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPLYFT 64 ++ N AL +G+ L ++ S + P L W+ + L Sbjct: 6 QLANRALSAVGEPQLNETDKESKSFLLVKKFYLPTMLESLEMIPWTSGKKRARLEKDLSE 65 Query: 65 Y--NGELKYYIPLKCLKVLNTSEE--TQLRGRCIVARGDTPRPLSLK------------- 107 + + +P+ C K++ ++ + G+ + L + Sbjct: 66 NHTDYSGAFKLPIDCGKIIELKDKSYYIVEGKTLYTDAAEAVLLYITNGKLPEGMGNVDD 125 Query: 108 YIGEVPLSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVA 158 + + + ++ +AF L+LA + L ++ L E+ + A Sbjct: 126 DFPDYDPPEYEAMFYQAFELRLASKFALELSGKPELHKMLLQEAAMIEEAA 176 >gi|146276491|ref|YP_001166650.1| hypothetical protein Rsph17025_0439 [Rhodobacter sphaeroides ATCC 17025] gi|145554732|gb|ABP69345.1| hypothetical protein Rsph17025_0439 [Rhodobacter sphaeroides ATCC 17025] Length = 198 Score = 77.3 bits (189), Expect = 8e-13, Method: Composition-based stats. Identities = 36/173 (20%), Positives = 59/173 (34%), Gaps = 8/173 (4%) Query: 6 ICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPLYFTY 65 I A + P+ ++ + S A + P + L W FA+ + L Sbjct: 10 IAAQACRYMEVAPISSFGDGSELAAAMAEMYPIALRACLEQADWEFASVLVRLPAAVPAA 69 Query: 66 NGELK------YYIPLKCLKVLNTSE-ETQLRGRCIVARGDTPRPLSLKYIGEV-PLSDC 117 Y +P +KV + +T R PL ++Y G + + Sbjct: 70 TVATDDEMPNLYTLPGDLVKVREVGDADTIWHIDREGLRASDAAPLRVRYTGTIETEARL 129 Query: 118 DPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPEDR 170 +Q A A KLA + P + L+ E+E +L AM E R Sbjct: 130 PAEFQLAVAAKLASLMGPRWLGAQSKIDALRAEAEDLLKTAMRSSARAASEMR 182 >gi|256845614|ref|ZP_05551072.1| predicted protein [Fusobacterium sp. 3_1_36A2] gi|256719173|gb|EEU32728.1| predicted protein [Fusobacterium sp. 3_1_36A2] Length = 174 Score = 58.8 bits (141), Expect = 3e-07, Method: Composition-based stats. Identities = 30/165 (18%), Positives = 65/165 (39%), Gaps = 5/165 (3%) Query: 5 EICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPLYFT 64 EI + LL LG+ + N + +S C +L + + S ++ F ++ L + Sbjct: 14 EIISGTLLMLGENSIYN-DNKSDMYKICEKMLNSVIDNIATSSAFLFNAITVKLTSVGQV 72 Query: 65 YNGELKYYIPLKCLKVLNTSEETQLRGRCIVARGDTPRPLSLKYIGEVPLSDCDPLYQEA 124 NGE K+ +P+ CL VL ++ +L I + + ++Y + ++ Sbjct: 73 -NGENKFNLPVDCLNVLRCNKSYRLENEFIYSS---ESEIKIQYCRRIDFTEIPDNLFNL 128 Query: 125 FALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGVEIPED 169 ++ + + L+ E R+ + G + E+ Sbjct: 129 IVAMTGRKMALAVNTYNSRLEVLEAEVTRLKNNIIAQQGFQYWEE 173 >gi|325971690|ref|YP_004247881.1| hypothetical protein SpiBuddy_1863 [Spirochaeta sp. Buddy] gi|324026928|gb|ADY13687.1| hypothetical protein SpiBuddy_1863 [Spirochaeta sp. Buddy] Length = 213 Score = 50.4 bits (119), Expect = 1e-04, Method: Composition-based stats. Identities = 29/188 (15%), Positives = 61/188 (32%), Gaps = 18/188 (9%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 M +L + N AL L + +E+ S + L + ++++ + F + L Sbjct: 1 MNKLNLYNLALS-LFDQRIEDLQTPSKELTLLDLNYGKVVSFCIKAWDFPFLIKRTQLTE 59 Query: 61 LYFT--------YNGELKYYIPLKCLKVLNTSEE----TQLRGRCIVARGDTPRPLSLKY 108 + Y +P + + + R + + P + Sbjct: 60 FATDVVGNPMTWNSYPYGYDVPADFGRAIQINASKKNAYAYRFGKLWCKILNPELEYMPS 119 Query: 109 IGEVPLSDCDPL---YQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDGV- 164 V + P + A +LA + P L + Q +S + LA M+ + Sbjct: 120 TLSVDENGTFPYPDDFLALVAYQLALHIAPMLDPESQAQSVAAQMYQLTLASIMESETRS 179 Query: 165 -EIPEDRG 171 + PE+ Sbjct: 180 NDRPENWE 187 >gi|253583143|ref|ZP_04860351.1| predicted protein [Fusobacterium varium ATCC 27725] gi|251835035|gb|EES63588.1| predicted protein [Fusobacterium varium ATCC 27725] Length = 195 Score = 47.7 bits (112), Expect = 7e-04, Method: Composition-based stats. Identities = 31/154 (20%), Positives = 62/154 (40%), Gaps = 29/154 (18%) Query: 5 EICNWALLKLGQRPLENWNE-RSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPLYF 63 +I ALLK+G + ++N+ RS K C ++ + + + ++ F + ++ L Sbjct: 8 KIIQQALLKVGN--VNDYNDTRSKKYRACEAMMDGVIQQVCSDNTFRFQSTTVKLTYFEI 65 Query: 64 TYN-GELKYYIPLKC-------------------LKVLNTSE---ETQLRGRCIVARGDT 100 + + GE + +P L V++ +++ I A Sbjct: 66 SKDTGEYIFNLPQDFEGMQLKPKRSKSVGKYPERLSVMDNDRKFNNFRIQNEFIFADTPK 125 Query: 101 PRPLSLKYIGEVPLSDCDPLYQEAFALKLAGELC 134 + L Y+ +PL++ E +L LA ELC Sbjct: 126 ---VVLNYVRRIPLTEIPMYMSEYLSLMLARELC 156 >gi|307946247|ref|ZP_07661582.1| hypothetical protein TRICHSKD4_4952 [Roseibium sp. TrichSKD4] gi|307769911|gb|EFO29137.1| hypothetical protein TRICHSKD4_4952 [Roseibium sp. TrichSKD4] Length = 207 Score = 42.7 bits (99), Expect = 0.019, Method: Composition-based stats. Identities = 30/190 (15%), Positives = 63/190 (33%), Gaps = 20/190 (10%) Query: 1 MTELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLP 60 ++ L+I N A ++G PL++ +E ++ L+ + L + F L Sbjct: 4 LSTLDIVNAACAEIGADPLQDLDEETIGGQSVTLIYENLVDFNLSIEPFEFKFDLRQLTR 63 Query: 61 LYFTY---NGELKYYIPLKCL---KVL--NTSEETQLRGRCIVARGDTPR---PLSLKYI 109 + Y +P + L + ++ R + +G P+ Sbjct: 64 IDGEPPLSGFTYLYDVPGPKSGPPRWLTDDPTDPDCHYTRFRLIKGQVHAMDMPIFASVP 123 Query: 110 GEVPLSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERV---------LAVAMD 160 + +P +++A LA L L D + LK + + A++ Sbjct: 124 FRPDPQNWEPAFRKATVSGLAAHLALSLASDANLHELLKMRAYGTPQENYRGGEMRTAIN 183 Query: 161 MDGVEIPEDR 170 + P R Sbjct: 184 NNAQATPPQR 193 >gi|158425208|ref|YP_001526500.1| tail tubular protein A [Azorhizobium caulinodans ORS 571] gi|158332097|dbj|BAF89582.1| tail tubular protein A [Azorhizobium caulinodans ORS 571] Length = 200 Score = 38.4 bits (88), Expect = 0.33, Method: Composition-based stats. Identities = 26/170 (15%), Positives = 67/170 (39%), Gaps = 12/170 (7%) Query: 2 TELEICNWALLKLGQRPLENWNE-RSLKADYCRLLLPPIHKGLLRSFSWSFAT-QSMDLL 59 TEL+ N L +G+ P+ + + A R +L + + +S W + T +++ L+ Sbjct: 8 TELDAINTMLSIIGEAPVNRVEDTGLVDAVMARQILDETMRQV-QSHGWHWNTDKNVVLV 66 Query: 60 PLYFTYNGELKYYIPLKCLKV--LNTSEETQLRGRCIVARGDT----PRPLSLKYIGEVP 113 P + +P L+V ++ S + RG + R + P+ ++ + +P Sbjct: 67 PSFPAPGDVY---VPDNTLQVDAMDPSIDVVQRGNRLWDRRNNTFKFSVPVEVEIVRLLP 123 Query: 114 LSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDG 163 + + + ++ ++ + + + + R L + + Sbjct: 124 FEELPQIARHYIMVRAGRIFQDRVVGSETLSGFNDKDETRALIALRNAEA 173 >gi|26989007|ref|NP_744432.1| tail tubular protein A [Pseudomonas putida KT2440] gi|24983828|gb|AAN67896.1|AE016421_8 tail tubular protein A [Pseudomonas putida KT2440] Length = 195 Score = 38.4 bits (88), Expect = 0.38, Method: Composition-based stats. Identities = 33/177 (18%), Positives = 62/177 (35%), Gaps = 24/177 (13%) Query: 2 TELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLR-------SFSWSFATQ 54 TEL+ N L +G+ P+ + ++ AD I + +LR S W F T+ Sbjct: 8 TELDAVNIMLNTIGESPVNSLDDGLPVAD------AAIARSVLREVTIDVQSPGWQFNTE 61 Query: 55 SMDLLPLYFTYNGELKYYIPLKCLKVLNTSEET----QLRGRCIVARGDTP----RPLSL 106 L + +P CL+V + + LRG I R + +++ Sbjct: 62 RNY--KLTPALLTKYI-TVPGNCLEVTTSGKSATLDITLRGTRIYDRQNHTYEFGSSITV 118 Query: 107 KYIGEVPLSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERVLAVAMDMDG 163 I + S+ + ++ A + D + + + + R L D Sbjct: 119 DMIVLLDFSELPQSVRHYITVRAARVFQQRNVGSDVLNGFTEKDETRALVAIRKYDS 175 >gi|78050004|ref|YP_366179.1| acetyltransferase (GNAT) family protein [Xanthomonas campestris pv. vesicatoria str. 85-10] gi|78038434|emb|CAJ26179.1| acetyltransferase (GNAT) family protein [Xanthomonas campestris pv. vesicatoria str. 85-10] Length = 185 Score = 37.7 bits (86), Expect = 0.70, Method: Composition-based stats. Identities = 13/82 (15%), Positives = 21/82 (25%), Gaps = 5/82 (6%) Query: 15 GQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPLYFTYNGELKYYIP 74 G + E+ + + L + L W AT+ L + + Sbjct: 23 GAHAIALREEQDADLPWLQDLYASTRREELAPVPWPEATKRAFLQQQFALQRAHYRQQFA 82 Query: 75 -LKCL----KVLNTSEETQLRG 91 L K L RG Sbjct: 83 DADFLIVQTKALAIGRLYLHRG 104 >gi|295675461|ref|YP_003603985.1| protein of unknown function DUF971 [Burkholderia sp. CCGE1002] gi|295435304|gb|ADG14474.1| protein of unknown function DUF971 [Burkholderia sp. CCGE1002] Length = 137 Score = 37.3 bits (85), Expect = 0.78, Method: Composition-based stats. Identities = 19/102 (18%), Positives = 37/102 (36%), Gaps = 7/102 (6%) Query: 61 LYFTYNGELKYYIPLKCLKVLNTSEETQLRGRCIVARGDTPRPLSLKYIGEVPLSDCDPL 120 L Y Y +P + L+V + S E Q G R +++ I V P Sbjct: 23 LELQYANGEAYRLPFELLRVYSPSAEVQGHGPGQETLQTGKRDVTITMIEGVGNYALQPT 82 Query: 121 YQE-----AFALKLAGELCPPLIMDDQMRSYLKGESERVLAV 157 + + ++ L ++ + D+ R+YL + + Sbjct: 83 FSDGHSTGIYSWDLLYDMA--VRQDELWRAYLAKLAAAGIDR 122 >gi|167561579|ref|ZP_02354495.1| hypothetical protein BoklE_03381 [Burkholderia oklahomensis EO147] Length = 137 Score = 37.3 bits (85), Expect = 0.91, Method: Composition-based stats. Identities = 24/132 (18%), Positives = 48/132 (36%), Gaps = 21/132 (15%) Query: 52 ATQSMDLLPLYFTYNGELKYYIPLKCLKVLNTSEETQLRGRCIVARGDTPRPLSLKYIGE 111 A + L Y Y +P + ++V + S E + G R +++ + Sbjct: 18 AVSRV----LELQYPNGESYRVPFELMRVCSPSAEVRGHGPGQETLQTGKRDVTITALEP 73 Query: 112 VPLSDCDPLYQE-----AFALKLAGELCPPLIMDDQMRSYLKGE-SERVLAVAMDMDGVE 165 V P + + ++ L EL + L E +++ A ++ D Sbjct: 74 VGNYALKPTFSDGHSTGIYSWDLLYELA-------TRQDALWREYFDKLKAAGVERDA-P 125 Query: 166 IPED---RGYHN 174 +P D RG+H+ Sbjct: 126 MPADSLPRGHHH 137 >gi|317487277|ref|ZP_07946072.1| hypothetical protein HMPREF0179_03435 [Bilophila wadsworthia 3_1_6] gi|316921467|gb|EFV42758.1| hypothetical protein HMPREF0179_03435 [Bilophila wadsworthia 3_1_6] Length = 211 Score = 36.9 bits (84), Expect = 1.1, Method: Composition-based stats. Identities = 13/49 (26%), Positives = 24/49 (48%), Gaps = 1/49 (2%) Query: 3 ELEICNWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGL-LRSFSWS 50 ELE N L +G+ P+ + +E + R +L + + + L F W+ Sbjct: 11 ELEAVNTMLSGIGEAPVNSLSEVTADVSLARHILNEVSREIQLEGFQWN 59 >gi|167568815|ref|ZP_02361689.1| hypothetical protein BoklC_03146 [Burkholderia oklahomensis C6786] Length = 131 Score = 36.9 bits (84), Expect = 1.1, Method: Composition-based stats. Identities = 24/132 (18%), Positives = 48/132 (36%), Gaps = 21/132 (15%) Query: 52 ATQSMDLLPLYFTYNGELKYYIPLKCLKVLNTSEETQLRGRCIVARGDTPRPLSLKYIGE 111 A + L Y Y +P + ++V + S E + G R +++ + Sbjct: 12 AVSRV----LELQYPNGESYRVPFELMRVCSPSAEVRGHGPGQETLQTGKRDVTITALEP 67 Query: 112 VPLSDCDPLYQE-----AFALKLAGELCPPLIMDDQMRSYLKGE-SERVLAVAMDMDGVE 165 V P + + ++ L EL + L E +++ A ++ D Sbjct: 68 VGNYALKPTFSDGHSTGIYSWDLLYELA-------TRQDALWREYFDKLKAAGVERDA-P 119 Query: 166 IPED---RGYHN 174 +P D RG+H+ Sbjct: 120 MPADSLPRGHHH 131 >gi|209521832|ref|ZP_03270510.1| protein of unknown function DUF971 [Burkholderia sp. H160] gi|209497736|gb|EDZ97913.1| protein of unknown function DUF971 [Burkholderia sp. H160] Length = 137 Score = 35.7 bits (81), Expect = 2.3, Method: Composition-based stats. Identities = 20/103 (19%), Positives = 34/103 (33%), Gaps = 12/103 (11%) Query: 61 LYFTYNGELKYYIPLKCLKVLNTSEETQLRGRCIVARGDTPRPLSLKYIGEVPLSDCDPL 120 L Y Y +P + L+V + S E Q G R +++ I V P Sbjct: 23 LELQYANGEAYRVPFELLRVYSPSAEVQGHGPGQGTLQTGKREVTITMIEGVGNYALQPT 82 Query: 121 YQE-----AFALKLAGELCPPLIMDDQMRSYLKGESERVLAVA 158 + + ++ L ++ + L E LA A Sbjct: 83 FSDGHSTGIYSWDLLYDMAV-------RQDELWREYLDKLAAA 118 >gi|308071880|emb|CBW54801.1| putative tail tubular protein A [Pantoea phage LIMElight] Length = 184 Score = 35.0 bits (79), Expect = 4.0, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 45/153 (29%), Gaps = 15/153 (9%) Query: 8 NWALLKLGQRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPLYFTYNG 67 N +L +G+ + + + + A + + + LL + W F T + P Sbjct: 8 NISLRAIGESEVVSESTSNPTAGIIKSAITQHRRSLLATG-WWFNTVEQTITPTAD---- 62 Query: 68 ELKYYIPLKCLKV-------LNTSEETQLRGRCIVARGDTPRPLSLKYIGEVPLSDCDPL 120 + P L + L + I +K + D Sbjct: 63 -YRITPPSNALAIYGITGNKLAVRDGWLY--DLINQSNRFTEATYIKAFIDYDFEDLPEY 119 Query: 121 YQEAFALKLAGELCPPLIMDDQMRSYLKGESER 153 + A ++A E I D L +SE Sbjct: 120 AAQYIANRVAAETYRNDIGVDSNFQSLTSDSEE 152 >gi|56477677|ref|YP_159266.1| SAM-dependent methyltransferase [Aromatoleum aromaticum EbN1] gi|56313720|emb|CAI08365.1| predicted SAM-dependent methyltransferases [Aromatoleum aromaticum EbN1] Length = 396 Score = 35.0 bits (79), Expect = 4.4, Method: Composition-based stats. Identities = 22/114 (19%), Positives = 38/114 (33%), Gaps = 14/114 (12%) Query: 41 KGLLRSFSWSFATQSMDLLPLYFTYNGELKYYIPLKCLKVLNTSEETQLRGRCIVARGDT 100 + LLR W FA L +G + P ++V+ R Sbjct: 12 RSLLRRHPWVFAGSVARL-------DGRAR---PGDTVEVVAADGRFLGRAAWSPESQIR 61 Query: 101 PRPLSLKYIGEVPLSDCDPLYQEAFALKLAGELCPPLIMDDQMRSYLKGESERV 154 R S V ++ ++ A +A PL+ D+ + GES+ + Sbjct: 62 ARMWS----FAVDITIDHAFFKRRVAESVARRATHPLLADESGVRLIHGESDGL 111 >gi|325927583|ref|ZP_08188815.1| acetyltransferase [Xanthomonas perforans 91-118] gi|325542048|gb|EGD13558.1| acetyltransferase [Xanthomonas perforans 91-118] Length = 185 Score = 35.0 bits (79), Expect = 4.6, Method: Composition-based stats. Identities = 12/81 (14%), Positives = 20/81 (24%), Gaps = 5/81 (6%) Query: 16 QRPLENWNERSLKADYCRLLLPPIHKGLLRSFSWSFATQSMDLLPLYFTYNGELKYYIP- 74 + E+ + + L + L W AT+ L + + Sbjct: 24 AHAIALREEQDADLPWLQDLYASTRREELAPVPWPEATKRAFLQQQFALQRAHYRQQFAD 83 Query: 75 LKCL----KVLNTSEETQLRG 91 L K L RG Sbjct: 84 ADFLIVQTKALAIGRLYLHRG 104 >gi|156398729|ref|XP_001638340.1| predicted protein [Nematostella vectensis] gi|156225460|gb|EDO46277.1| predicted protein [Nematostella vectensis] Length = 758 Score = 34.2 bits (77), Expect = 6.4, Method: Composition-based stats. Identities = 17/79 (21%), Positives = 31/79 (39%), Gaps = 6/79 (7%) Query: 55 SMDLLPLYFTYNGELKYYIPLKCLKVLNTSEETQLRGRCIVARGDTPRPLSL------KY 108 + + G+L+Y IP L+V + + + T P + KY Sbjct: 265 RIAVPLSASVDAGDLQYPIPDGVLRVEVIEAKDLIAADMALLSKPTSDPYCIVEVGAQKY 324 Query: 109 IGEVPLSDCDPLYQEAFAL 127 + S+CDP+++E F Sbjct: 325 RTKTKKSNCDPVWKETFEA 343 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.309 0.142 0.400 Lambda K H 0.267 0.0434 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 3,288,102,214 Number of Sequences: 14124377 Number of extensions: 122185764 Number of successful extensions: 271664 Number of sequences better than 10.0: 115 Number of HSP's better than 10.0 without gapping: 127 Number of HSP's successfully gapped in prelim test: 66 Number of HSP's that attempted gapping in prelim test: 271317 Number of HSP's gapped (non-prelim): 201 length of query: 175 length of database: 4,842,793,630 effective HSP length: 129 effective length of query: 46 effective length of database: 3,020,748,997 effective search space: 138954453862 effective search space used: 138954453862 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.1 bits) S2: 76 (33.8 bits)