BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254780410|ref|YP_003064823.1| hypothetical protein CLIBASIA_01475 [Candidatus Liberibacter asiaticus str. psy62] (362 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done >gi|254780410|ref|YP_003064823.1| hypothetical protein CLIBASIA_01475 [Candidatus Liberibacter asiaticus str. psy62] gi|254040087|gb|ACT56883.1| hypothetical protein CLIBASIA_01475 [Candidatus Liberibacter asiaticus str. psy62] Length = 362 Score = 369 bits (946), Expect = e-100, Method: Composition-based stats. Identities = 362/362 (100%), Positives = 362/362 (100%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG Sbjct: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE Sbjct: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID Sbjct: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 Query: 181 IDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 IDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID Sbjct: 181 IDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 Query: 241 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK Sbjct: 241 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 Query: 301 FLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMPF 360 FLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMPF Sbjct: 301 FLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMPF 360 Query: 361 VN 362 VN Sbjct: 361 VN 362 >gi|150397453|ref|YP_001327920.1| hypothetical protein Smed_2253 [Sinorhizobium medicae WSM419] gi|150028968|gb|ABR61085.1| protein of unknown function DUF185 [Sinorhizobium medicae WSM419] Length = 367 Score = 339 bits (869), Expect = 4e-91, Method: Composition-based stats. Identities = 178/364 (48%), Positives = 235/364 (64%), Gaps = 6/364 (1%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 M N L KI LI+ NG ++V YF+LC+ADP+ GYY PFGA GDF TAPEISQ+FG Sbjct: 1 MTNPLADKIKALIRANGPISVTDYFSLCLADPQHGYYRVREPFGAAGDFTTAPEISQLFG 60 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 EM+ IFL+ AW+QHG P+ + E+GPGRG MM D+LRVI +L P + ++++VETS+ Sbjct: 61 EMIGIFLVHAWQQHGAPANAIISEIGPGRGTMMSDMLRVIRRLAPTLYGTATVHLVETSD 120 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 RL +Q K LA + KI W+ S +P GF L ANE FD++PI+QFV T G RERMI Sbjct: 121 RLREVQAKGLAEHEGKIRWHESFDSLPSGFLLLAANELFDAIPIRQFVRTPQGFRERMIG 180 Query: 181 IDQHDSLVFNIGDHEIKSNF--LTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 +D D L F G I + G IFE +P RD M ++ +RL GGTA++ Sbjct: 181 LDTEDRLTFAAGAGGIDPTLLPTPAASVPEGTIFEIAPARDAVMAALCERLRAGGGTALI 240 Query: 239 IDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQ 298 IDYG+L + GDTLQAV+ H Y PL NPG AD++SHVDF++L+S A + +NGL Q Sbjct: 241 IDYGHLATGYGDTLQAVRNHEYDPPLANPGFADMTSHVDFEQLASRAKAEGVQVNGLVRQ 300 Query: 299 GKFLEGLGIWQRAFSL--MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE 356 G FL GLG+ +RA +L K + ++ + D V+RL + MGELFK+LVVS +V Sbjct: 301 GDFLVGLGLLERAAALGRDKGESTQEGIRDDVERL--AGSGPGKMGELFKVLVVSSPEVA 358 Query: 357 LMPF 360 L PF Sbjct: 359 LAPF 362 >gi|227822833|ref|YP_002826805.1| putative transcriptional regulator, TetR family [Sinorhizobium fredii NGR234] gi|227341834|gb|ACP26052.1| putative transcriptional regulator, TetR family [Sinorhizobium fredii NGR234] Length = 367 Score = 334 bits (857), Expect = 1e-89, Method: Composition-based stats. Identities = 181/364 (49%), Positives = 235/364 (64%), Gaps = 6/364 (1%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 M N L KI LI+ NG ++V YF+LC+ADP+ GYY PFG GDF TAPEISQ+FG Sbjct: 1 MTNPLADKIKALIRTNGPISVTDYFSLCLADPQHGYYRVREPFGRAGDFTTAPEISQLFG 60 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 EM+ IFL+ AW++HG P+ V + E+GPGRG MM D+LRVI +L PD ++ +++VETSE Sbjct: 61 EMIGIFLVHAWQEHGSPAQVVIAEIGPGRGTMMSDMLRVIGRLAPDLYAAADVHLVETSE 120 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 RL +Q + LAS+ KI W+ S +P GF L ANE FD++PI+QFV T G RERM+ Sbjct: 121 RLRKVQAETLASHDGKIQWHASFDSLPSGFLLLAANELFDAIPIRQFVRTAQGFRERMVG 180 Query: 181 IDQHDSLVFNIGDHEIKSNF--LTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 +D D L F G I S+ G I E +P RD M ++ +RL D GTAI+ Sbjct: 181 LDADDELTFAAGVAGIDSSLLPTPAQSVAEGTILEVAPARDAVMAALCERLRADAGTAIL 240 Query: 239 IDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQ 298 IDYG+L + GDTLQAV+ H Y PL NPG+ADL+SHVDF++L+ A L +NGL Q Sbjct: 241 IDYGHLATGYGDTLQAVRNHRYDPPLANPGRADLTSHVDFEQLALRAKTEGLQVNGLARQ 300 Query: 299 GKFLEGLGIWQRAFSL--MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE 356 G FL GLG+ RA +L K A ++ + D V+RL A MGELFK+L VS +V Sbjct: 301 GDFLIGLGLLDRAAALGRDKDLATQERIRDDVERL--AGAGAGKMGELFKVLAVSSPEVA 358 Query: 357 LMPF 360 L PF Sbjct: 359 LAPF 362 >gi|222086587|ref|YP_002545121.1| hypothetical protein Arad_3178 [Agrobacterium radiobacter K84] gi|221724035|gb|ACM27191.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 365 Score = 334 bits (857), Expect = 1e-89, Method: Composition-based stats. Identities = 182/363 (50%), Positives = 248/363 (68%), Gaps = 5/363 (1%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 M L KI +I+ NG +++ YF+LC+ADP+ GYY T PFG+VGDFVTAPEISQ+FG Sbjct: 1 MTTPLGEKIKAIIRANGPVSITDYFSLCLADPQHGYYKTREPFGSVGDFVTAPEISQLFG 60 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 EM+ IF++ AW++HG PS V+LVE+GPGRG MM D+LRVI KL P + +S+++VETS+ Sbjct: 61 EMIGIFMVHAWQRHGAPSEVQLVEIGPGRGTMMADMLRVISKLAPPLYDAMSVHLVETSD 120 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 RL Q++ LA +GDK++W++ DVP GFT L ANE FD++PI+QFV T +G RER + Sbjct: 121 RLQEFQRQTLADHGDKVSWHSDFNDVPAGFTLLAANELFDAIPIRQFVRTANGFRERTVG 180 Query: 181 IDQHDSLVFNIGDHEIKSNFLTCSDYFL-GAIFENSPCRDREMQSISDRLACDGGTAIVI 239 +D +D L F +G + S FL G IFE +P R M ++ DR+A GGTA+ I Sbjct: 181 LDANDELTFAVGVAGLDSAFLPDGQLPPIGTIFEIAPARQAVMTTVCDRIAAHGGTALAI 240 Query: 240 DYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQG 299 DYG++ + GDTLQAV+ H Y PL +PG+ADL+SHVDFQ L++ A L ING QG Sbjct: 241 DYGHMATGFGDTLQAVRMHEYDPPLEHPGEADLTSHVDFQHLAATAAASGLQINGCCHQG 300 Query: 300 KFLEGLGIWQRAFSL--MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVEL 357 FL GLG+ +RA +L + A +D + +V+RL A + MGELFK+L VS ++L Sbjct: 301 DFLIGLGLLERAAALGRDRDAATQDGIRAAVERL--AGAGEGKMGELFKVLAVSSPAIDL 358 Query: 358 MPF 360 PF Sbjct: 359 QPF 361 >gi|86358621|ref|YP_470513.1| hypothetical protein RHE_CH03019 [Rhizobium etli CFN 42] gi|86282723|gb|ABC91786.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 366 Score = 332 bits (852), Expect = 4e-89, Method: Composition-based stats. Identities = 177/364 (48%), Positives = 241/364 (66%), Gaps = 6/364 (1%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 M L KI +I+ NG ++V YF+LC+ADPE GYY T PFG GDFVTAPE+SQIFG Sbjct: 1 MTTALGEKIKAIIQANGPISVTDYFSLCLADPEHGYYRTREPFGRSGDFVTAPEVSQIFG 60 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 EM+ +F++ AW++HG P+ VRLVE+GPGRG MM D+LRVI ++ P F +S+++VETSE Sbjct: 61 EMIGVFVVHAWQRHGTPADVRLVEIGPGRGTMMADMLRVIARIAPPLFDTMSVHLVETSE 120 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 RL +Q + L +YG++I W+ +VP GFT + ANE FD++PI+QFV T+ G RERM+ Sbjct: 121 RLRDVQSQTLEAYGERIAWHGGFDEVPPGFTLIAANELFDAIPIRQFVRTQTGFRERMVG 180 Query: 181 IDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 +D L F G + L + LG +FE SP R M +I +RL GGTA++ Sbjct: 181 LDADGELTFAAGVAGLDPALLPEPVQNLPLGTLFEISPARQAVMMAICERLRAYGGTALI 240 Query: 239 IDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQ 298 IDYG+L + GDTLQAV+ H + PL +PG+ADL+SHVDFQ+L+ A L++NG Q Sbjct: 241 IDYGHLVTGFGDTLQAVRMHEFDPPLAHPGEADLTSHVDFQQLAETARAAGLHLNGALHQ 300 Query: 299 GKFLEGLGIWQRAFSLMKQTARKDI--LLDSVKRLVSTSADKKSMGELFKILVVSHEKVE 356 G FL GLGI +RA +L + + + +V RL A + MGELFK++ VS V+ Sbjct: 301 GDFLTGLGILERAAALGRDREPQTQQVIQSAVDRL--AGAGEGRMGELFKVMAVSDPAVD 358 Query: 357 LMPF 360 LMPF Sbjct: 359 LMPF 362 >gi|315122147|ref|YP_004062636.1| hypothetical protein CKC_01985 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495549|gb|ADR52148.1| hypothetical protein CKC_01985 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 368 Score = 332 bits (850), Expect = 7e-89, Method: Composition-based stats. Identities = 265/362 (73%), Positives = 304/362 (83%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 ++ L +KIV+LIK+NGQ+T+DQYF+LC++D EFGYY TCNPFG GDFVTAPEISQIFG Sbjct: 3 VKTGLYQKIVDLIKRNGQITIDQYFSLCLSDSEFGYYKTCNPFGVDGDFVTAPEISQIFG 62 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 EMLAIFLI AWEQHGFP CVRL+E+GPGRG MMLD+LR ICKL+PDFF++LSIYM+E SE Sbjct: 63 EMLAIFLIFAWEQHGFPRCVRLIEMGPGRGTMMLDVLRTICKLRPDFFAILSIYMIENSE 122 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 RL IQKK L+ YGDKINW ++DVP GFTFL+ANEFFDSLPIKQFV+T G+RERMID Sbjct: 123 RLVSIQKKNLSFYGDKINWCVGISDVPPGFTFLMANEFFDSLPIKQFVITNDGMRERMID 182 Query: 181 IDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 ID H+ LVF IG + I S +DYF G I E SPCRD +QSI+DRL C+GGTAIVID Sbjct: 183 IDHHELLVFGIGKNAITSPVSPFNDYFPGMILETSPCRDSAIQSIADRLVCEGGTAIVID 242 Query: 241 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 YG+LQS +GDTLQAVKGH Y PL++PGQADLSSHVDFQRLSSI+IL KLYING QGK Sbjct: 243 YGHLQSGMGDTLQAVKGHKYDPPLMHPGQADLSSHVDFQRLSSISILRKLYINGCVKQGK 302 Query: 301 FLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMPF 360 FLE LGIWQR FSLMK T R DILLDSV+RLV +D KSMGELFK+LVVSH KV+L+PF Sbjct: 303 FLECLGIWQRVFSLMKNTDRADILLDSVRRLVGMPSDDKSMGELFKVLVVSHRKVDLVPF 362 Query: 361 VN 362 VN Sbjct: 363 VN 364 >gi|15889490|ref|NP_355171.1| hypothetical protein Atu2214 [Agrobacterium tumefaciens str. C58] gi|15157362|gb|AAK87956.1| conserved hypothetical protein [Agrobacterium tumefaciens str. C58] Length = 366 Score = 331 bits (849), Expect = 9e-89, Method: Composition-based stats. Identities = 168/364 (46%), Positives = 241/364 (66%), Gaps = 6/364 (1%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 M L ++I +LI+ NG ++V +F+LC+ADPE GYY + PFG GDF+TAPE+SQ+FG Sbjct: 1 MTTPLAQRIKSLIRLNGPLSVTDFFSLCLADPEHGYYKSREPFGRSGDFITAPEVSQLFG 60 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 EML +F++ AW++HG P+ +LVE+GPGRG MM D+LRVI ++ P + + +++VETS Sbjct: 61 EMLGVFVVHAWQRHGAPAQTQLVEIGPGRGTMMSDMLRVIRRIAPPLYETMRVHLVETSP 120 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 RL+ IQK+ L ++ D++ W+ S DVP GF LVANE FD++PI+QFV T G RER++ Sbjct: 121 RLSAIQKETLTAHADRLTWHDSFDDVPEGFLLLVANELFDAIPIRQFVRTPQGFRERVVS 180 Query: 181 IDQHDSLVFNIGDHEIKSNFLTCSD--YFLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 +D + LVF+ G I L LG +FE SP R+ M +I RL+ GGTA+ Sbjct: 181 LDANGELVFSTGLAGIDPTLLPPQPERQQLGTVFEVSPAREAVMTAICQRLSVHGGTALA 240 Query: 239 IDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQ 298 IDYG+L + GDTLQA++ H + PL +PG+ADL+SHVDF+ L A +++NG Q Sbjct: 241 IDYGHLVAGYGDTLQAMRNHAFDPPLAHPGEADLTSHVDFESLVKTAQATGVHVNGALRQ 300 Query: 299 GKFLEGLGIWQRAFSLMKQ--TARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE 356 G FL GLG+ +RA +L + + + ++V RL A K MGELFK++ VS + Sbjct: 301 GDFLHGLGLKERASALAAKATPDQTLEIAEAVNRLAGEGAGK--MGELFKVIAVSSPALH 358 Query: 357 LMPF 360 L+PF Sbjct: 359 LLPF 362 >gi|218463849|ref|ZP_03503940.1| hypothetical protein RetlK5_32579 [Rhizobium etli Kim 5] Length = 366 Score = 331 bits (849), Expect = 9e-89, Method: Composition-based stats. Identities = 179/364 (49%), Positives = 242/364 (66%), Gaps = 6/364 (1%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 M L KI +I+ NG ++V YF+LC+ADPE GYY T PFG GDFVTAPEISQIFG Sbjct: 1 MSTALGEKIKAIIQANGPISVTDYFSLCLADPEHGYYRTREPFGRSGDFVTAPEISQIFG 60 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 EM+ +F++ AW++HG P+ VRLVE+GPGRG M+ D+LRVI ++ P F +++++VETSE Sbjct: 61 EMIGVFIVHAWQRHGTPADVRLVEIGPGRGTMISDMLRVISRIAPPLFDTMTVHLVETSE 120 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 RL +QK+ L YG+KI W+ +VP GFT + ANE FD++PI+QF+ T+ G RERM+ Sbjct: 121 RLRDVQKQTLEDYGEKIAWHDGFDEVPAGFTLIAANELFDAIPIRQFIRTQTGFRERMVG 180 Query: 181 IDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 +D L F G + L + LG +FE SP R M +I +RL GGTA+V Sbjct: 181 LDADGELTFAAGVAGLDPALLPEPVQNLPLGTLFEISPARQAVMMAICERLRAFGGTALV 240 Query: 239 IDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQ 298 IDYG+L + GDTLQAV+ H + PL +PG+ADL+SHVDFQ+L+ A+ L++NG Q Sbjct: 241 IDYGHLVTGFGDTLQAVRMHEFDPPLAHPGEADLTSHVDFQQLAETALAAGLHLNGALHQ 300 Query: 299 GKFLEGLGIWQRAFSLMKQTARKDI--LLDSVKRLVSTSADKKSMGELFKILVVSHEKVE 356 G FL GLGI +RA +L + + + +V RL A + MGELFK + VSH V+ Sbjct: 301 GDFLTGLGILERAAALGRDREPQTQQVIQTAVDRL--AGAGEGRMGELFKAMAVSHPAVD 358 Query: 357 LMPF 360 LMPF Sbjct: 359 LMPF 362 >gi|327188366|gb|EGE55583.1| hypothetical protein RHECNPAF_900038 [Rhizobium etli CNPAF512] Length = 366 Score = 331 bits (848), Expect = 1e-88, Method: Composition-based stats. Identities = 178/364 (48%), Positives = 242/364 (66%), Gaps = 6/364 (1%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 M L KI +I+ NG ++V YF+LC+ADPE GYY T PFG GDFVTAPEISQIFG Sbjct: 1 MNTALGEKIKAIIQANGPISVTDYFSLCLADPEHGYYRTREPFGRSGDFVTAPEISQIFG 60 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 EM+ +F++ AW++HG P+ VRLVE+GPGRG M+ D+LRVI ++ P F +++++VETSE Sbjct: 61 EMIGVFIVHAWQRHGTPADVRLVEIGPGRGTMISDMLRVISRIAPPLFDAMTVHLVETSE 120 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 RL +Q + L +YG+KI W+ +VP GFT + ANE FD++PI+QFV T+ G RERM+ Sbjct: 121 RLRDVQNQTLEAYGEKIAWHDGFDEVPPGFTLIAANELFDAIPIRQFVRTQTGFRERMVG 180 Query: 181 IDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 +D L F G + L + LG +FE SP R M +I +RL GGTA+V Sbjct: 181 LDADGELTFAAGVAGLDPALLPEPVQNLPLGTLFEISPARQAVMMAICERLRAFGGTALV 240 Query: 239 IDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQ 298 IDYG+L + GDTLQAV+ H + PL +PG+ADL+SHVDFQ+L+ A+ L++NG Q Sbjct: 241 IDYGHLVTGFGDTLQAVRMHEFDPPLAHPGEADLTSHVDFQQLAETALTSGLHLNGALHQ 300 Query: 299 GKFLEGLGIWQRAFSLMKQTARKDI--LLDSVKRLVSTSADKKSMGELFKILVVSHEKVE 356 G FL GLGI +RA +L + + + +V RL A + MGELFK++ VSH V+ Sbjct: 301 GDFLTGLGILERAAALGRDREPQTQQVIQTAVDRL--AGAGEGRMGELFKVMAVSHPAVD 358 Query: 357 LMPF 360 L PF Sbjct: 359 LTPF 362 >gi|153008968|ref|YP_001370183.1| hypothetical protein Oant_1638 [Ochrobactrum anthropi ATCC 49188] gi|151560856|gb|ABS14354.1| protein of unknown function DUF185 [Ochrobactrum anthropi ATCC 49188] Length = 363 Score = 331 bits (848), Expect = 1e-88, Method: Composition-based stats. Identities = 147/361 (40%), Positives = 207/361 (57%), Gaps = 9/361 (2%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L ++ LI +G ++V Y A C+ D E GYY+T PFG GDF+TAPE+SQ+FGE++ Sbjct: 5 SLKDRLKRLIATSGPISVADYMAACLGDRESGYYTTREPFGRDGDFITAPEVSQMFGELI 64 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 I+ + W+ G P + L E+GPGRG +M D+LR I +L P + + MVETS RL Sbjct: 65 GIWCVSEWDALGRPDNIVLCEIGPGRGTLMSDMLRTIGRLAPQMLGHVRVAMVETSPRLA 124 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 QK++L+ G KI+W+ +++ G LV NE FD++P +QFV + ERMI +D+ Sbjct: 125 EKQKEKLSDAGAKIDWFERFSNIADGPLILVTNELFDAIPFRQFVKVDGRFVERMIALDE 184 Query: 184 HDSLVFNIGDHEIKSNFLTC--SDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDY 241 F G I L ++ GAIFE +P R MQ I+ R+A G A+ IDY Sbjct: 185 KGEFHFVSGLGGIDPALLPAGHAEAPEGAIFEAAPARTALMQEIASRIATTRGAALNIDY 244 Query: 242 GYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKF 301 G+L++ GDTLQA+ H + NPG ADL+SHVDF L A G+TTQG+F Sbjct: 245 GHLKAGFGDTLQAMLKHGFDDVFANPGIADLTSHVDFDILDKTARASGCKT-GMTTQGEF 303 Query: 302 LEGLGIWQRAFSL--MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMP 359 L +G+ RA L K A ++ + V+RL A MG LFK+L +S +L+P Sbjct: 304 LLAMGLLDRAGRLGTGKDAAFQEKIRQDVERL----AAPDQMGTLFKVLAISDSDTKLIP 359 Query: 360 F 360 F Sbjct: 360 F 360 >gi|190892761|ref|YP_001979303.1| hypothetical protein RHECIAT_CH0003177 [Rhizobium etli CIAT 652] gi|190698040|gb|ACE92125.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 366 Score = 330 bits (847), Expect = 2e-88, Method: Composition-based stats. Identities = 178/364 (48%), Positives = 242/364 (66%), Gaps = 6/364 (1%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 M L KI +I+ NG ++V YF+LC+ADPE GYY T PFG GDFVTAPEISQIFG Sbjct: 1 MNTALGEKIKAIIQANGPISVTDYFSLCLADPEHGYYRTREPFGRSGDFVTAPEISQIFG 60 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 EM+ +F++ AW++HG P+ VRLVE+GPGRG M+ D+LRVI ++ P F +++++VETSE Sbjct: 61 EMIGVFIVHAWQRHGTPADVRLVEIGPGRGTMISDMLRVISRIAPPLFDTMTVHLVETSE 120 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 RL +Q + L +YG+KI W+ +VP GFT + ANE FD++PI+QFV T+ G RER + Sbjct: 121 RLRDVQNQTLEAYGEKIAWHDGFDEVPPGFTLIAANELFDAIPIRQFVRTQTGFRERTVG 180 Query: 181 IDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 +D L F G + L + LG +FE SP R M +I +RL GGTA+V Sbjct: 181 LDADGELTFAAGVAGLDPALLPEPVQNLPLGTLFEISPARQAVMMAICERLRAFGGTALV 240 Query: 239 IDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQ 298 IDYG+L + GDTLQAV+ H + PL +PG+ADL+SHVDFQ+L+ A+ L++NG Q Sbjct: 241 IDYGHLVTGFGDTLQAVRMHEFDPPLAHPGEADLTSHVDFQQLAETALTAGLHLNGALHQ 300 Query: 299 GKFLEGLGIWQRAFSLMKQTARKDI--LLDSVKRLVSTSADKKSMGELFKILVVSHEKVE 356 G FL GLGI +RA +L + + + +V RL A + MGELFK++ VSH V+ Sbjct: 301 GDFLTGLGILERAAALGRDREPQTQQVIQTAVDRL--AGAGEGRMGELFKVMAVSHPAVD 358 Query: 357 LMPF 360 LMPF Sbjct: 359 LMPF 362 >gi|325293572|ref|YP_004279436.1| hypothetical protein AGROH133_07719 [Agrobacterium sp. H13-3] gi|325061425|gb|ADY65116.1| hypothetical protein AGROH133_07719 [Agrobacterium sp. H13-3] Length = 366 Score = 330 bits (847), Expect = 2e-88, Method: Composition-based stats. Identities = 167/364 (45%), Positives = 241/364 (66%), Gaps = 6/364 (1%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 M L ++I +LI+ NG ++V +F+LC+ADPE GYY + PFG +GDF+TAPE+SQ+FG Sbjct: 1 MTTPLAQRIKSLIRLNGPLSVTDFFSLCLADPEHGYYRSREPFGRLGDFITAPEVSQLFG 60 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 EML +F++ AW++HG P+ +RLVE+GPGRG MM D+LRVI ++ P + + +++VETS Sbjct: 61 EMLGVFVVHAWQRHGAPADIRLVEIGPGRGTMMSDMLRVIRRIAPPLYETMRVHLVETSP 120 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 RL IQK+ LA++ +++ W+ S VP GF LVANE FD++PI+QFV T G RER++ Sbjct: 121 RLCAIQKETLAAHAERLTWHDSFDAVPEGFLLLVANELFDAIPIRQFVKTPQGFRERVVS 180 Query: 181 IDQHDSLVFNIGDHEIKSNFLTCSD--YFLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 + LVF+ G I L +G+IFE +P R+ M +I RL+ GGTA+ Sbjct: 181 LGTDGELVFSTGLAGIDPTLLPPGPERQPIGSIFEIAPAREAVMTAICQRLSAYGGTALA 240 Query: 239 IDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQ 298 IDYG+L + GDTLQA++ H + PL +PGQADL+SHVDF+ L A +++NG Q Sbjct: 241 IDYGHLVAGYGDTLQAMRNHAFDPPLSHPGQADLTSHVDFESLIRTAEANGVHVNGGIRQ 300 Query: 299 GKFLEGLGIWQRAFSLMKQ--TARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE 356 G FL GLG+ +RA +L + + + ++V RL A + MGELFK++ VS + Sbjct: 301 GDFLYGLGLKERATALAAKATPDQTLEIAEAVNRLAGEGAGR--MGELFKVIAVSSPALH 358 Query: 357 LMPF 360 LMPF Sbjct: 359 LMPF 362 >gi|209550338|ref|YP_002282255.1| hypothetical protein Rleg2_2759 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209536094|gb|ACI56029.1| protein of unknown function DUF185 [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 366 Score = 330 bits (846), Expect = 2e-88, Method: Composition-based stats. Identities = 177/364 (48%), Positives = 241/364 (66%), Gaps = 6/364 (1%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 M L KI +I+ NG ++V YF+LC+ADPE GYY T PFG GDFVTAPE+SQIFG Sbjct: 1 MTTALGEKIKAIIQANGPISVTDYFSLCLADPEHGYYRTREPFGRSGDFVTAPEVSQIFG 60 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 EM+ +F++ AW++HG P+ VRLVE+GPGRG MM D+LRVI ++ P +++++VETSE Sbjct: 61 EMIGVFIVHAWQRHGTPAGVRLVEIGPGRGTMMADMLRVISRIAPPLLDAMTVHLVETSE 120 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 RL +Q + L +YG+KI W+ +VP GFT + ANE FD++PI+QFV G RERMI Sbjct: 121 RLRDVQSQTLEAYGEKIAWHDGFDEVPPGFTLIAANELFDAIPIRQFVRMPTGFRERMIG 180 Query: 181 IDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 ID L F G + L + LG +FE SP R M +I +RL GGTA+ Sbjct: 181 IDADGELTFAAGVAGLDPALLPEPVQNLPLGTLFEISPARQAVMIAICERLRAFGGTALA 240 Query: 239 IDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQ 298 IDYG+L + GDTLQAV+ H + PL +PG+ADL+SHVDFQ+L+ A+ +++NG Q Sbjct: 241 IDYGHLVTGFGDTLQAVRMHEFDPPLAHPGEADLTSHVDFQQLAETALAAGVHLNGALHQ 300 Query: 299 GKFLEGLGIWQRAFSL--MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE 356 G FL GLGI +RA +L ++ + ++ +V RL A + MGELFK++ VSH V+ Sbjct: 301 GDFLAGLGILERAAALGHDREPQTQQVIQAAVDRL--AGAGEGRMGELFKVMAVSHPAVD 358 Query: 357 LMPF 360 LMPF Sbjct: 359 LMPF 362 >gi|239832410|ref|ZP_04680739.1| Hypothetical protein, conserved [Ochrobactrum intermedium LMG 3301] gi|239824677|gb|EEQ96245.1| Hypothetical protein, conserved [Ochrobactrum intermedium LMG 3301] Length = 364 Score = 329 bits (844), Expect = 3e-88, Method: Composition-based stats. Identities = 152/363 (41%), Positives = 206/363 (56%), Gaps = 9/363 (2%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGE 61 E+ L ++ LI +G ++V Y A C+ D E GYY+T PFG GDF+TAPE+SQ+FGE Sbjct: 3 ESSLKDRLKRLIAASGPISVADYMAACLGDREAGYYTTREPFGRDGDFITAPEVSQMFGE 62 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 ++ I+ + W+ G P V L E+GPGRG +M D+LR I +L P I MVETS R Sbjct: 63 LIGIWCVSEWDALGRPDNVVLCEIGPGRGTLMSDMLRTIGRLAPQMLGAARIAMVETSPR 122 Query: 122 LTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 L QK++LA G KI+W+ +D+ G LV NE FD++P +QFV ERMI + Sbjct: 123 LVERQKEKLAGAGVKIDWFERFSDIADGPLILVTNELFDAIPFRQFVKVGGRFVERMIAL 182 Query: 182 DQHDSLVFNIGDHEIKSNFLTC--SDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVI 239 D D F G I L + GAIFE +P R MQ I+ R+A G A+ I Sbjct: 183 DDKDEFHFVSGLGGIDPALLPQDHATAEEGAIFEAAPARTALMQEIASRIATTRGAALNI 242 Query: 240 DYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQG 299 DYG+L+S GDTLQA+ H++ +PG+ADL+SHVDF L A G TQG Sbjct: 243 DYGHLESGFGDTLQAMLKHSFDDVFAHPGEADLTSHVDFDMLEKAARASNCKT-GTMTQG 301 Query: 300 KFLEGLGIWQRAFSL--MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVEL 357 +FL +G+ RA L K A ++ + V+RL A MG LFK+L +S +L Sbjct: 302 EFLLAMGLIDRAGRLGAGKDAAFQEKIRQDVERL----AAPDQMGTLFKVLAISDSDTKL 357 Query: 358 MPF 360 +PF Sbjct: 358 IPF 360 >gi|307305651|ref|ZP_07585398.1| protein of unknown function DUF185 [Sinorhizobium meliloti BL225C] gi|307317654|ref|ZP_07597093.1| protein of unknown function DUF185 [Sinorhizobium meliloti AK83] gi|306896812|gb|EFN27559.1| protein of unknown function DUF185 [Sinorhizobium meliloti AK83] gi|306902354|gb|EFN32950.1| protein of unknown function DUF185 [Sinorhizobium meliloti BL225C] Length = 367 Score = 328 bits (842), Expect = 6e-88, Method: Composition-based stats. Identities = 179/364 (49%), Positives = 232/364 (63%), Gaps = 6/364 (1%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 M N L KI LI+ NG ++V YF+LC+ADP+ GYY PFG GDF TAPEISQ+FG Sbjct: 1 MTNPLADKIEALIRTNGPISVTDYFSLCLADPQHGYYRVREPFGRAGDFTTAPEISQLFG 60 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 EM+ IFL+ AW+QHG P + E+GPGRG MM D+LRVI +L P + ++++VETS+ Sbjct: 61 EMIGIFLVHAWQQHGTPGDAIIAEIGPGRGTMMSDMLRVIRRLAPALYRTATVHLVETSD 120 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 RL +Q + LA + K+ W+ S +P GF L ANE FD++PI+QFV T G RERM+ Sbjct: 121 RLRRLQAETLAEHEGKVRWHESFDSLPSGFLLLAANELFDAIPIRQFVRTAQGFRERMVG 180 Query: 181 IDQHDSLVFNIGDHEIKSNF--LTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 +D L F G I G IFE SP RD M ++ +RL GGTAI+ Sbjct: 181 LDAEGRLTFAAGIAGIDPALLPSPAPAVAEGTIFEISPARDAVMAALCERLRAGGGTAII 240 Query: 239 IDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQ 298 IDYG+L + GDTLQAV+ H Y PL NPG ADL+SHVDF++L+S A + INGL Q Sbjct: 241 IDYGHLATGYGDTLQAVRNHEYDPPLANPGLADLTSHVDFEQLASRAKAEGVQINGLARQ 300 Query: 299 GKFLEGLGIWQRAFSL--MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE 356 G FL GLG+ +RA +L K ++ + D V+RL + A K MGELFK+LVVS +V Sbjct: 301 GDFLVGLGLLERASTLGRDKDETTQESIRDDVERLAGSGAGK--MGELFKVLVVSSPEVA 358 Query: 357 LMPF 360 L PF Sbjct: 359 LAPF 362 >gi|15966097|ref|NP_386450.1| hypothetical protein SMc02682 [Sinorhizobium meliloti 1021] gi|15075367|emb|CAC46923.1| Conserved hypothetical protein [Sinorhizobium meliloti 1021] Length = 372 Score = 328 bits (842), Expect = 6e-88, Method: Composition-based stats. Identities = 179/364 (49%), Positives = 232/364 (63%), Gaps = 6/364 (1%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 M N L KI LI+ NG ++V YF+LC+ADP+ GYY PFG GDF TAPEISQ+FG Sbjct: 6 MTNPLADKIEALIRTNGPISVTDYFSLCLADPQHGYYRVREPFGRAGDFTTAPEISQLFG 65 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 EM+ IFL+ AW+QHG P + E+GPGRG MM D+LRVI +L P + ++++VETS+ Sbjct: 66 EMIGIFLVHAWQQHGTPGDAIIAEIGPGRGTMMSDMLRVIRRLAPALYRTATVHLVETSD 125 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 RL +Q + LA + K+ W+ S +P GF L ANE FD++PI+QFV T G RERM+ Sbjct: 126 RLRRLQAETLAEHEGKVRWHESFDSLPSGFLLLAANELFDAIPIRQFVRTAQGFRERMVG 185 Query: 181 IDQHDSLVFNIGDHEIKSNF--LTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 +D L F G I G IFE SP RD M ++ +RL GGTAI+ Sbjct: 186 LDAEGRLTFAAGIAGIDPALLPSPAPAVAEGTIFEISPARDAVMAALCERLRAGGGTAII 245 Query: 239 IDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQ 298 IDYG+L + GDTLQAV+ H Y PL NPG ADL+SHVDF++L+S A + INGL Q Sbjct: 246 IDYGHLATGYGDTLQAVRNHEYDPPLANPGLADLTSHVDFEQLASRAKAEGVQINGLARQ 305 Query: 299 GKFLEGLGIWQRAFSL--MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE 356 G FL GLG+ +RA +L K ++ + D V+RL + A K MGELFK+LVVS +V Sbjct: 306 GDFLVGLGLLERASTLGRDKDETTQESIRDDVERLAGSGAGK--MGELFKVLVVSSPEVA 363 Query: 357 LMPF 360 L PF Sbjct: 364 LAPF 367 >gi|146342824|ref|YP_001207872.1| hypothetical protein BRADO6003 [Bradyrhizobium sp. ORS278] gi|146195630|emb|CAL79657.1| conserved hypothetical protein [Bradyrhizobium sp. ORS278] Length = 375 Score = 328 bits (841), Expect = 8e-88, Method: Composition-based stats. Identities = 141/363 (38%), Positives = 207/363 (57%), Gaps = 12/363 (3%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGE 61 + L +I LIK +G M V +Y LC+ PE GYY + +P G GDF TAPE+SQ+FGE Sbjct: 4 TSPLQPEIKRLIKASGPMPVWRYMELCLMHPEHGYYISRDPLGREGDFTTAPEVSQMFGE 63 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 +L ++ W+ G P RL+ELGPGRG MM D LR + L P + +S+++VE + Sbjct: 64 LLGLWAASIWKAAGSPQQFRLIELGPGRGTMMSDALRALRVLPPL-YQTISVHLVEINPV 122 Query: 122 LTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 L QK L + + W+ S +VP G + + ANE+FD LP+ Q V E G ER++++ Sbjct: 123 LREKQKATLTGLRN-VTWHDSFDEVPEGPSVIFANEYFDVLPVHQMVRRETGWHERVVEL 181 Query: 182 DQHDSLVFNIGDHEIKSNFLTCSDY----FLGAIFENSPCRDREMQSISDRLACDGGTAI 237 D ++ V+ L S GAIFE P M +I+ RL G A+ Sbjct: 182 DDDENFVYGTAADPTPGFELLLSPLVRAAPAGAIFEWRPDTQ--MMAIARRLREQRGAAV 239 Query: 238 VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTT 297 +IDYG+++S VGDT QA+ H++ PL PG AD+++HVDF LS A ++G T Sbjct: 240 IIDYGHVRSDVGDTFQAIARHSFADPLKTPGLADITAHVDFDALSRTAEAVGARVHGPIT 299 Query: 298 QGKFLEGLGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVSHEKV 355 QG+FL+ LGI RA +LM++ + + + + +KRL S + MG LFK+L VS + Sbjct: 300 QGEFLQRLGIETRALTLMQKASPEVSEDIASGLKRLTS--GGRGGMGSLFKVLGVSDPSI 357 Query: 356 ELM 358 ++ Sbjct: 358 PVL 360 >gi|241205722|ref|YP_002976818.1| hypothetical protein Rleg_3022 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240859612|gb|ACS57279.1| protein of unknown function DUF185 [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 366 Score = 328 bits (840), Expect = 1e-87, Method: Composition-based stats. Identities = 176/364 (48%), Positives = 240/364 (65%), Gaps = 6/364 (1%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 M L KI +I+ NG ++V YF+LC+ADPE GYY T PFG GDFVTAPE+SQIFG Sbjct: 1 MTTALGEKIKAIIQANGPISVTDYFSLCLADPEHGYYRTREPFGRSGDFVTAPEVSQIFG 60 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 EM+ +F++ AW++HG P+ VRLVE+GPGRG M+ D+LRVI ++ P F V+++++VETSE Sbjct: 61 EMIGVFIVHAWQRHGTPTDVRLVEIGPGRGTMISDMLRVISRIAPPLFDVMTVHLVETSE 120 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 RL +Q + L +G+KI W+ +VP GFT + ANE FD++PI+QFV G RERM+ Sbjct: 121 RLRDVQSQTLEPHGEKITWHNGFDEVPPGFTLIAANELFDAIPIRQFVRMATGFRERMVG 180 Query: 181 IDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 ID L F G I L + +G +FE SP R M +I +RL GGTA+ Sbjct: 181 IDADGELTFAPGVAGIDPTLLPEPVQNVPVGTLFEISPARQAVMMAICERLRAFGGTALA 240 Query: 239 IDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQ 298 IDYG+L + GDTLQAV+ H + PL +PG+ADL+SHVDFQ+L+ A+ LY+NG Q Sbjct: 241 IDYGHLVTGFGDTLQAVRMHEFDPPLAHPGEADLTSHVDFQQLAETALAAGLYLNGALHQ 300 Query: 299 GKFLEGLGIWQRAFSLMKQTARKDI--LLDSVKRLVSTSADKKSMGELFKILVVSHEKVE 356 G FL GLGI +RA +L + + +V+RL A + MGELFK++ VS+ ++ Sbjct: 301 GDFLTGLGILERATALGRDREPHTQQVIQAAVERL--AGAGEGRMGELFKVMAVSYPAID 358 Query: 357 LMPF 360 LMPF Sbjct: 359 LMPF 362 >gi|254504952|ref|ZP_05117103.1| conserved hypothetical protein [Labrenzia alexandrii DFL-11] gi|222441023|gb|EEE47702.1| conserved hypothetical protein [Labrenzia alexandrii DFL-11] Length = 362 Score = 326 bits (836), Expect = 3e-87, Method: Composition-based stats. Identities = 144/362 (39%), Positives = 211/362 (58%), Gaps = 9/362 (2%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 +L K+ I G +TV Y A C+ DPE+GYY+T PFG GDF+TAPE+SQ+FGE+ Sbjct: 2 TELKEKLRQQIAAEGPITVATYMARCLGDPEYGYYTTREPFGRKGDFITAPEVSQMFGEL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + + A+E G PS +LVELGPGRG +M D LRV +P+F ++ +VE S RL Sbjct: 62 IGAVCLKAYETLGAPSNFQLVELGPGRGTLMADFLRV-AFHRPEFLEAATLNLVEISPRL 120 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 +Q + L + N+ + DVP G ++ANEFFD+LPI QFV T +G ERM+ +D Sbjct: 121 RQVQTQTLRNTQLPPNFRNTFQDVPDGPLIVIANEFFDALPIHQFVKTVNGWNERMVGLD 180 Query: 183 QHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 L F +G ++ ++ + + G+I E P + + I+ R+ GG A+ ID Sbjct: 181 ASGRLEFGVGPAQLPTDAIPPAAMKAPEGSILETQPAANAVAEEIATRIVEHGGFALFID 240 Query: 241 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 YGY Q+ GDTLQA+ H Y L +PG+ADL++HV+F+ L++ A L +QG Sbjct: 241 YGYAQTAPGDTLQALYRHAYNDVLAHPGEADLTAHVNFEALATAARRAGAVPLPLLSQGT 300 Query: 301 FLEGLGIWQRAFSL--MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM 358 FL G+ +RA +L K A +D + D+V+RL A MG+LFK+L ++ + Sbjct: 301 FLLQSGLLERAGALGAGKSPAEQDAIRDAVERL----AAPDQMGDLFKVLAIAPKGHVFA 356 Query: 359 PF 360 PF Sbjct: 357 PF 358 >gi|222149326|ref|YP_002550283.1| hypothetical protein Avi_3172 [Agrobacterium vitis S4] gi|221736310|gb|ACM37273.1| conserved hypothetical protein [Agrobacterium vitis S4] Length = 377 Score = 326 bits (835), Expect = 4e-87, Method: Composition-based stats. Identities = 172/364 (47%), Positives = 233/364 (64%), Gaps = 6/364 (1%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 M L KI LI+ NG ++V YFALC+ADPEFGYY T PFG GDF+T PEISQIFG Sbjct: 8 MTTTLGDKIKALIRLNGPLSVTDYFALCLADPEFGYYKTREPFGTSGDFITGPEISQIFG 67 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 EM+ +F++ AW++HG P+ VRL E+GPGRG MM D+LRVI +L PD + ++++VETS+ Sbjct: 68 EMIGVFIVHAWQRHGLPAPVRLAEVGPGRGTMMSDMLRVIARLAPDLYRDSTVHLVETSD 127 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 RL IQ+ L + +KI+W+ S DVP GF +VANE FD++PI+QFV RERM+ Sbjct: 128 RLRQIQRNSLEPHIEKIDWHDSFGDVPEGFVLVVANELFDAIPIRQFVKLGPHYRERMVS 187 Query: 181 IDQHDSLVFNIGDHEIKSNFLTC--SDYFLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 + D L+F+ G + L S G +FE SP R M ++ +RL +GG A++ Sbjct: 188 LGLDDELIFSTGVATLDPALLPPGASAQPDGTVFEFSPARRAVMAAMCERLKAEGGAALI 247 Query: 239 IDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQ 298 IDYG++ + GDTLQA++ H Y PL NPG ADL+SHVDF+ L+ A+ + +G Q Sbjct: 248 IDYGHISTGFGDTLQALRAHDYDPPLANPGIADLTSHVDFEDLARTALAAGITPSGCLRQ 307 Query: 299 GKFLEGLGIWQRAFSL--MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE 356 G FL GLGI +RA L K A + L ++V RL + MGELFK+L +S + Sbjct: 308 GDFLLGLGIKERAGILGRGKDEATQLELAEAVNRLAGAGVGR--MGELFKVLALSSPTLS 365 Query: 357 LMPF 360 L PF Sbjct: 366 LAPF 369 >gi|116253206|ref|YP_769044.1| hypothetical protein RL3464 [Rhizobium leguminosarum bv. viciae 3841] gi|115257854|emb|CAK08952.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae 3841] Length = 366 Score = 325 bits (834), Expect = 5e-87, Method: Composition-based stats. Identities = 174/364 (47%), Positives = 239/364 (65%), Gaps = 6/364 (1%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 M L KI +I+ NG ++V YF+LC+ADPE GYY T PFG GDFVTAPE+SQIFG Sbjct: 1 MTTALGEKIKAIIQANGPISVTDYFSLCLADPEHGYYRTREPFGRSGDFVTAPEVSQIFG 60 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 EM+ +F++ AW++HG P+ VRLVE+GPGRG M+ D+LRVI ++ P F +++++VETSE Sbjct: 61 EMIGVFIVHAWQRHGTPTDVRLVEIGPGRGTMISDMLRVISRIAPPLFDAMTVHLVETSE 120 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 RL +Q + L YG+KI W+ +VP GFT + ANE FD++PI+QFV G RERM+ Sbjct: 121 RLRDVQSQTLEPYGEKITWHDGFDEVPPGFTLIAANELFDAIPIRQFVRMPTGFRERMVG 180 Query: 181 IDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 ID L F G + L + +G +FE SP R M +I +RL GGTA+ Sbjct: 181 IDADGELTFAAGVAGLDPAVLPEPVQNVPVGTLFEISPARQAVMIAICERLRAFGGTALT 240 Query: 239 IDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQ 298 IDYG+L + GDTLQAV+ H + PL +PG+ADL+SHVDFQ+L+ A+ L++NG Q Sbjct: 241 IDYGHLVTGFGDTLQAVRMHEFDPPLAHPGEADLTSHVDFQQLAETALAAGLHLNGALHQ 300 Query: 299 GKFLEGLGIWQRAFSLMKQTARKDI--LLDSVKRLVSTSADKKSMGELFKILVVSHEKVE 356 G FL GLGI +RA L + + + +V+RL A + MGELFK++ VS+ ++ Sbjct: 301 GDFLTGLGILERATVLGRDREPQTQHLIQAAVERL--AGAGEGRMGELFKVMAVSYPALD 358 Query: 357 LMPF 360 LMPF Sbjct: 359 LMPF 362 >gi|323136895|ref|ZP_08071975.1| protein of unknown function DUF185 [Methylocystis sp. ATCC 49242] gi|322397656|gb|EFY00178.1| protein of unknown function DUF185 [Methylocystis sp. ATCC 49242] Length = 360 Score = 324 bits (830), Expect = 1e-86, Method: Composition-based stats. Identities = 152/361 (42%), Positives = 212/361 (58%), Gaps = 7/361 (1%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 N L ++I I +G MT++ + +LC+ P GYY T +PFGA GDF+TAPEISQ+FGE+ Sbjct: 2 NPLKQEIAAAIAHDGPMTLEHFMSLCLGHPLHGYYMTRDPFGAGGDFITAPEISQMFGEL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + ++ AW G PS RL+ELGPGRG +M D+LRV + P F ++ ++VE S L Sbjct: 62 IGVWASEAWRAAGSPSPARLIELGPGRGTLMSDVLRVAR-ISPQFLDAITAHLVEMSPAL 120 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 IQ++ LAS ++W T A P G F++ANEFFD+LP++ FV T G RER++ +D Sbjct: 121 RDIQRQTLASAAKPVDWATDFAHTPHGPAFILANEFFDALPVRHFVKTIGGWRERLVGLD 180 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 L F + D + G+I E P R M I+ RL +GG +VIDYG Sbjct: 181 AGAELAFGLSDRVEP---TLTAAAREGSIIEVCPAGQRLMGDIAARLVAEGGAMLVIDYG 237 Query: 243 YLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 Y Q+ +GD+LQAV HTYV PL PG+ADL++HVDF L A + G TQ FL Sbjct: 238 YTQTSLGDSLQAVARHTYVDPLSAPGEADLTAHVDFAALGRAARAQGAKVMGPVTQAHFL 297 Query: 303 EGLGIWQRAFSLMKQ--TARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE-LMP 359 LGI +RA SL K+ + + + + RL T ++ MG LFK++ V+H + L Sbjct: 298 LQLGIERRAQSLSKKATAEQAEEIASAFDRLTGTQDPRRHMGALFKVMAVTHPDMPDLPG 357 Query: 360 F 360 F Sbjct: 358 F 358 >gi|118591907|ref|ZP_01549302.1| hypothetical protein SIAM614_20945 [Stappia aggregata IAM 12614] gi|118435550|gb|EAV42196.1| hypothetical protein SIAM614_20945 [Stappia aggregata IAM 12614] Length = 362 Score = 324 bits (830), Expect = 1e-86, Method: Composition-based stats. Identities = 148/362 (40%), Positives = 212/362 (58%), Gaps = 9/362 (2%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L +I I G ++V QY ++C+ DP+ GYY T PFG+ GDF+TAPE+SQ+FGE+ Sbjct: 2 TGLKDRIKARIATEGPLSVAQYMSVCLGDPDAGYYMTREPFGSEGDFITAPEVSQMFGEL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + + AW+ G P+ +LVELGPGRG +M D+LR + L+P F + MVETS RL Sbjct: 62 IGAACLSAWQALGEPAEFQLVELGPGRGTLMADLLR-MASLRPAFIKAARLNMVETSPRL 120 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 IQ L+ ++ DVP G LVANEFFD+LPI QFV T G +ER I + Sbjct: 121 REIQTATLSRGPLTPHFRNRFQDVPGGPLILVANEFFDALPIHQFVKTARGWQERQIGLS 180 Query: 183 QHDSLVFNIGDHEIKSNFLT--CSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 Q L+F +G + + + S GAIFE P + + I R+A +GG AI+ID Sbjct: 181 QDGELMFGVGTARLPDDAIPADLSSAPEGAIFETQPAANAIAEEIGHRIAGNGGAAILID 240 Query: 241 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 YGYL + GDTLQA+ H Y L +PG+ADL++HV+F+ L++ + TQG+ Sbjct: 241 YGYLNTAAGDTLQALYKHAYDDVLAHPGEADLTAHVNFEALAAATVRAGAQALAPLTQGE 300 Query: 301 FLEGLGIWQRAFSL--MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM 358 FL G+ +RA +L K + ++ + D+V+RL A MG+LFK+L V++ + Sbjct: 301 FLLRSGLLERAGALGAGKSHSEQEAIRDAVERL----AAPGQMGDLFKVLAVTNSGISFP 356 Query: 359 PF 360 PF Sbjct: 357 PF 358 >gi|254719562|ref|ZP_05181373.1| hypothetical protein Bru83_08465 [Brucella sp. 83/13] gi|265984571|ref|ZP_06097306.1| conserved hypothetical protein [Brucella sp. 83/13] gi|264663163|gb|EEZ33424.1| conserved hypothetical protein [Brucella sp. 83/13] Length = 365 Score = 323 bits (827), Expect = 3e-86, Method: Composition-based stats. Identities = 148/365 (40%), Positives = 199/365 (54%), Gaps = 13/365 (3%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L ++ LI G ++V Y A C+ D E GYY+T PFG GDF+TAPE+SQ+FGE++ Sbjct: 5 TLKERLKRLIATTGPISVADYMAACLGDREAGYYTTREPFGREGDFITAPEVSQMFGELI 64 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 I+ + W+ P+ L E+GPGRG +M D+LR I +L P I MVETS RL Sbjct: 65 GIWCLREWDALARPANFVLCEIGPGRGTLMSDMLRTIGRLAPQMLGGARIAMVETSPRLA 124 Query: 124 LIQKKQLASYGDKINWYTSLADVP----LGFTFLVANEFFDSLPIKQFVMTEHGIRERMI 179 Q+++LA I W+ AD+P G LV NE FD++P +QFV + ERM+ Sbjct: 125 EKQRQKLAGTKAHIEWFERFADIPADTVHGPLILVTNELFDAIPFRQFVKADGRFVERMV 184 Query: 180 DIDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRLACDGGTAI 237 +++ D F G I L GAIFE +P R MQ I+ R+A G A+ Sbjct: 185 ALNEQDEFHFVSGAGGIDPALLPKDHVKAEEGAIFEAAPARTALMQEIASRIAATRGAAL 244 Query: 238 VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTT 297 IDYG+L+S GDTLQA+ H Y +PG ADL+SHVDF L A G T Sbjct: 245 NIDYGHLESGFGDTLQAMLKHAYDDVFAHPGAADLTSHVDFDILQKTAKACGCKT-GTMT 303 Query: 298 QGKFLEGLGIWQRAFSL--MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKV 355 QG+FL +G+ RA L K A ++ + V+RL A MG LFK+L S E+ Sbjct: 304 QGEFLLAMGLVDRAGQLGAGKDAAFQEKIRQDVERL----AAPDQMGTLFKVLAFSDEQT 359 Query: 356 ELMPF 360 L+PF Sbjct: 360 RLLPF 364 >gi|148253293|ref|YP_001237878.1| hypothetical protein BBta_1764 [Bradyrhizobium sp. BTAi1] gi|146405466|gb|ABQ33972.1| hypothetical protein BBta_1764 [Bradyrhizobium sp. BTAi1] Length = 375 Score = 322 bits (826), Expect = 4e-86, Method: Composition-based stats. Identities = 137/362 (37%), Positives = 205/362 (56%), Gaps = 12/362 (3%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 + L +I +IK +G M V +Y LC+ PE GYY + +P G GDF TAPE+SQ+FGE+ Sbjct: 5 SPLHSEIKRVIKASGPMPVWRYMELCLMHPEHGYYISRDPLGREGDFTTAPEVSQMFGEL 64 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L ++ W+ G P RL+ELGPGRG MM D LR + L P + +S++MVE + L Sbjct: 65 LGLWAASVWKASGSPQQFRLIELGPGRGTMMSDALRALRVLPPL-YQTISVHMVEINPVL 123 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 QK L + I W+ S DVP G + + ANE+FD LPI Q + E G ER++++D Sbjct: 124 REKQKATLTGLRN-ITWHESFDDVPEGPSVIFANEYFDVLPIHQMLKRETGWHERVVELD 182 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDY----FLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 ++ + L LGAIFE P + E+ +I+ R+ G A++ Sbjct: 183 AEENFAYGTAAEPTPGFELLLPPLVRAAPLGAIFEWRP--NNEIMAIAKRIREQRGAAVI 240 Query: 239 IDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQ 298 IDYG+++S VGDT QA+ H++ PL PG AD+++HVDF+ L+ A ++G TQ Sbjct: 241 IDYGHVRSDVGDTFQAIARHSFADPLKTPGLADITAHVDFEALARAADAVGARVHGPITQ 300 Query: 299 GKFLEGLGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE 356 +FL LGI RA +LM++ + + +KRL+ + MG LFK+L +S + Sbjct: 301 SEFLRRLGIETRALTLMQKASPDISRDIASGLKRLIE--GGRGGMGSLFKVLGLSDASIP 358 Query: 357 LM 358 ++ Sbjct: 359 VL 360 >gi|260467107|ref|ZP_05813286.1| protein of unknown function DUF185 [Mesorhizobium opportunistum WSM2075] gi|259029119|gb|EEW30416.1| protein of unknown function DUF185 [Mesorhizobium opportunistum WSM2075] Length = 362 Score = 322 bits (825), Expect = 5e-86, Method: Composition-based stats. Identities = 151/362 (41%), Positives = 221/362 (61%), Gaps = 9/362 (2%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 +L +IV+LI+ G + V++Y ALC+ DP GYY+T PFGA GDF+TAPEISQ+FGE+ Sbjct: 2 TRLKTRIVDLIEATGPIPVNEYMALCLFDPADGYYTTREPFGAAGDFITAPEISQMFGEL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 +A++L AW G P V + E+GPGRG +M D+LR + +L+PD + + MVETS RL Sbjct: 62 VAVWLYQAWAAIGRPMPVTIAEIGPGRGTLMKDMLRTLSRLEPDLANGAAFAMVETSPRL 121 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 IQK+ L++ + W+ ++ +P +V NE FD++PI+QFV G RERM+ +D Sbjct: 122 AEIQKQTLSATPFAVGWHETIDTLPRQPLLIVGNELFDAVPIRQFVRAGSGWRERMVGLD 181 Query: 183 QHDSLVFNIGDHEIKSNFLT--CSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 D L F G + L + GAI E +P R M +I+ R++ GG + D Sbjct: 182 DKDELCFFAGAGSVDPTLLPADAGEAPQGAIAEVAPARSALMAAIAARISSHGGAGLFPD 241 Query: 241 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 YG+L+ VGDTLQA++ H + L NPG+ADL+SHVDF L++I + L + L+ QG Sbjct: 242 YGHLRPGVGDTLQALRKHNHEDVLANPGEADLTSHVDFAALAAIVRAHGLDAH-LSRQGD 300 Query: 301 FLEGLGIWQRAFSLMKQTAR--KDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM 358 FL G+GI +RA L + +D + V+RL A ++MG+LFK+L + V + Sbjct: 301 FLLGMGILERAGRLGADAGQAARDKIASDVERL----AGPQAMGDLFKVLAIVPRGVTIR 356 Query: 359 PF 360 PF Sbjct: 357 PF 358 >gi|75676687|ref|YP_319108.1| hypothetical protein Nwi_2503 [Nitrobacter winogradskyi Nb-255] gi|74421557|gb|ABA05756.1| Protein of unknown function DUF185 [Nitrobacter winogradskyi Nb-255] Length = 390 Score = 322 bits (824), Expect = 7e-86, Method: Composition-based stats. Identities = 139/361 (38%), Positives = 205/361 (56%), Gaps = 12/361 (3%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L+ I LIK +G + V +Y LC+ PE GYY +P G GDFVT+PE+SQ+FGE+L Sbjct: 21 PLLTYIKKLIKTSGPLPVWRYMQLCLTHPEHGYYIARDPLGREGDFVTSPEVSQMFGELL 80 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 ++ W G P +RL+ELGPGRG +M D LR + L P + LS++MVE + L Sbjct: 81 GLWAASVWRMMGSPDPLRLIELGPGRGTLMADALRALRVLPP-MYESLSVHMVEINPVLV 139 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 Q L+ I W+TSL VP G ++ANE+FD LP+ Q V + G ER++DID Sbjct: 140 EKQMAALSD-APNIEWHTSLDQVPQGPAIILANEYFDVLPVHQMVRRDGGWHERVVDIDG 198 Query: 184 HDSLVFNIGDHEIKSNFLTCSDY----FLGAIFENSPCRDREMQSISDRLACDGGTAIVI 239 LVF + + +GAIFE P + M SI+ R+ +GG A++I Sbjct: 199 SGQLVFGVSAEPTPRFDVLLPPLVRAAPVGAIFEWRPDAE--MMSIATRVRDNGGAALII 256 Query: 240 DYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQG 299 DYG+++S GDT QA+ H++ PL PG+ D+++HVDF+ L+ A ++G QG Sbjct: 257 DYGHVRSDAGDTFQAISRHSFADPLKYPGRVDVTAHVDFEALARAAEDVGARVHGPVPQG 316 Query: 300 KFLEGLGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVSHEKVEL 357 +FL LGI RA +LM + + D + ++KRL + MG +FK++ VS + + Sbjct: 317 EFLRRLGIEARAVNLMAKATPELSDDIATALKRLTE--GGRGGMGSMFKVIGVSDPSLSV 374 Query: 358 M 358 + Sbjct: 375 L 375 >gi|306843125|ref|ZP_07475747.1| Hypothetical protein BIBO2_2887 [Brucella sp. BO2] gi|306286730|gb|EFM58283.1| Hypothetical protein BIBO2_2887 [Brucella sp. BO2] Length = 365 Score = 321 bits (823), Expect = 9e-86, Method: Composition-based stats. Identities = 148/365 (40%), Positives = 198/365 (54%), Gaps = 13/365 (3%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L ++ LI G ++V Y A C+ D E GYY+T PFG GDF+TAPE+SQ+FGE++ Sbjct: 5 ALKERLKRLIATTGPISVADYMAACLGDREAGYYTTREPFGREGDFITAPEVSQMFGELI 64 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 I+ + W+ P+ L E+GPGRG +M D+LR I +L P I MVETS RL Sbjct: 65 GIWCLSEWDALARPANFVLCEIGPGRGTLMSDMLRTIGRLAPQMLGGARIAMVETSPRLA 124 Query: 124 LIQKKQLASYGDKINWYTSLADVP----LGFTFLVANEFFDSLPIKQFVMTEHGIRERMI 179 QK++LA I W+ AD+P G LV NE FD++P +QFV + ERM+ Sbjct: 125 EKQKQKLAGTKAHIEWFERFADIPADTVNGPLILVTNELFDAIPFRQFVKADGRFVERMV 184 Query: 180 DIDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRLACDGGTAI 237 +++ D F G I L GAIFE +P R MQ I+ R+A G A+ Sbjct: 185 ALNEQDEFHFVSGAGGIDPALLPKDHVKAEEGAIFEAAPARTALMQEIASRIAATRGAAL 244 Query: 238 VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTT 297 IDYG+L+S GDTLQA+ H Y +PG ADL+SHVDF L A G T Sbjct: 245 NIDYGHLESGFGDTLQAMLKHAYDDVFAHPGVADLTSHVDFDILQKTAKACGCKT-GTMT 303 Query: 298 QGKFLEGLGIWQRAFSL--MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKV 355 QG+FL +G+ RA L K A ++ + V+RL A MG LFK+L S + Sbjct: 304 QGEFLLAMGLVDRAGRLGAGKDAAFQEKIRQDVERL----AAPDQMGTLFKVLAFSDGQT 359 Query: 356 ELMPF 360 L+PF Sbjct: 360 RLLPF 364 >gi|256160255|ref|ZP_05457949.1| hypothetical protein BcetM4_14706 [Brucella ceti M490/95/1] gi|256255461|ref|ZP_05460997.1| hypothetical protein BcetB_14478 [Brucella ceti B1/94] gi|261222667|ref|ZP_05936948.1| conserved hypothetical protein [Brucella ceti B1/94] gi|265998631|ref|ZP_06111188.1| conserved hypothetical protein [Brucella ceti M490/95/1] gi|260921251|gb|EEX87904.1| conserved hypothetical protein [Brucella ceti B1/94] gi|262553255|gb|EEZ09089.1| conserved hypothetical protein [Brucella ceti M490/95/1] Length = 365 Score = 320 bits (821), Expect = 2e-85, Method: Composition-based stats. Identities = 148/365 (40%), Positives = 199/365 (54%), Gaps = 13/365 (3%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L ++ LI G ++V Y A C+ D E GYY+T PFG GDF+TAPE+SQ+FGE++ Sbjct: 5 SLKERLKRLIATTGPISVADYMAACLGDREAGYYTTREPFGREGDFITAPEVSQMFGELI 64 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 I+ + W+ P+ L E+GPGRG +M D+LR I +L P I MVETS RL Sbjct: 65 GIWCLSEWDALARPANFVLCEIGPGRGTLMSDMLRTIGRLAPQMLGGARIAMVETSPRLA 124 Query: 124 LIQKKQLASYGDKINWYTSLADVP----LGFTFLVANEFFDSLPIKQFVMTEHGIRERMI 179 QK++LA + W+ AD+P G LV NE FD++P +QFV + ERMI Sbjct: 125 EKQKQKLAGTKAHVEWFERFADIPADTVHGPLILVTNELFDAIPFRQFVKADGRFVERMI 184 Query: 180 DIDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRLACDGGTAI 237 +++ D F G I L GAIFE +P R MQ I++R+A G A+ Sbjct: 185 ALNEQDEFQFVSGAGGIDPALLPKDHVKAEEGAIFEAAPARTALMQEIANRIAATRGAAL 244 Query: 238 VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTT 297 IDYG+L+S GDTLQA+ Y +PG ADL+SHVDF L A G T Sbjct: 245 NIDYGHLESGFGDTLQAMLKQAYDDVFAHPGVADLTSHVDFDILQKTAKACGCKT-GTMT 303 Query: 298 QGKFLEGLGIWQRAFSL--MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKV 355 QG+FL +G+ RA L K A ++ + V+RL A MG LFK+L S E+ Sbjct: 304 QGEFLLAMGLVDRAGRLGAGKDAAFQEKIRQDVERL----AAPDQMGTLFKVLAFSDEQT 359 Query: 356 ELMPF 360 L+PF Sbjct: 360 RLLPF 364 >gi|307944195|ref|ZP_07659536.1| putative C2orf56-like protein [Roseibium sp. TrichSKD4] gi|307772541|gb|EFO31761.1| putative C2orf56-like protein [Roseibium sp. TrichSKD4] Length = 366 Score = 320 bits (820), Expect = 2e-85, Method: Composition-based stats. Identities = 137/364 (37%), Positives = 207/364 (56%), Gaps = 9/364 (2%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 M L+ K+ I G ++V Y + C+ADP+ GYY+T PFG +GDF TAPE+SQ+FG Sbjct: 1 MTTPLLAKLKKRIVATGPLSVVDYMSACLADPDHGYYTTKEPFGEMGDFTTAPEVSQMFG 60 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 E+L F + A + +LVELGPG G +M D LR+ L+P F + +VE S Sbjct: 61 ELLGAFCLQASDILQLGEPFQLVELGPGGGTLMADFLRMAA-LQPGFMENAQLNLVEMSP 119 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 RL Q L + ++VP G ++ANEFFD+LPI+QF+ TE G ERM+ Sbjct: 120 RLREKQADTLKHAPLAPTFRDMFSEVPDGPLIVIANEFFDALPIRQFIKTELGWSERMVG 179 Query: 181 IDQHDSLVFNIGDHEIKSNFLT--CSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 ++ +L F IG +++ + L + A+ E P + I++R+ GG A+ Sbjct: 180 LNDEGNLSFGIGVAQLEQSALPAYAAHAHRDAVLETQPAANAIASQIAERITRFGGLALF 239 Query: 239 IDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQ 298 IDYGY +S +GDTLQA+ H Y +PG AD+++HV+F+ L+ A +++ TQ Sbjct: 240 IDYGYTKSALGDTLQALYKHAYDDVFAHPGAADITAHVNFEALARSAAQAGAHVHAPLTQ 299 Query: 299 GKFLEGLGIWQRAFSL--MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE 356 G FL LG+ +RA +L K ++ + D+V+RL A MG+LFK+L ++H+ + Sbjct: 300 GDFLVKLGLLERAGALGYGKDPKTQECIRDAVERL----AAPDQMGDLFKVLAITHKPLA 355 Query: 357 LMPF 360 L PF Sbjct: 356 LPPF 359 >gi|62290414|ref|YP_222207.1| hypothetical protein BruAb1_1518 [Brucella abortus bv. 1 str. 9-941] gi|82700337|ref|YP_414911.1| hypothetical protein BAB1_1545 [Brucella melitensis biovar Abortus 2308] gi|237815921|ref|ZP_04594918.1| Protein of unknown function DUF185 [Brucella abortus str. 2308 A] gi|254689714|ref|ZP_05152968.1| hypothetical protein Babob68_05974 [Brucella abortus bv. 6 str. 870] gi|254697857|ref|ZP_05159685.1| hypothetical protein Babob28_09140 [Brucella abortus bv. 2 str. 86/8/59] gi|254730748|ref|ZP_05189326.1| hypothetical protein Babob42_06004 [Brucella abortus bv. 4 str. 292] gi|256257965|ref|ZP_05463501.1| hypothetical protein Babob9C_11581 [Brucella abortus bv. 9 str. C68] gi|260546950|ref|ZP_05822689.1| conserved hypothetical protein [Brucella abortus NCTC 8038] gi|260755247|ref|ZP_05867595.1| conserved hypothetical protein [Brucella abortus bv. 6 str. 870] gi|260758468|ref|ZP_05870816.1| conserved hypothetical protein [Brucella abortus bv. 4 str. 292] gi|260762293|ref|ZP_05874636.1| conserved hypothetical protein [Brucella abortus bv. 2 str. 86/8/59] gi|260884262|ref|ZP_05895876.1| conserved hypothetical protein [Brucella abortus bv. 9 str. C68] gi|297248800|ref|ZP_06932518.1| hypothetical protein BAYG_01766 [Brucella abortus bv. 5 str. B3196] gi|62196546|gb|AAX74846.1| conserved hypothetical protein [Brucella abortus bv. 1 str. 9-941] gi|82616438|emb|CAJ11501.1| Protein of unknown function DUF185 [Brucella melitensis biovar Abortus 2308] gi|237789219|gb|EEP63430.1| Protein of unknown function DUF185 [Brucella abortus str. 2308 A] gi|260096000|gb|EEW79877.1| conserved hypothetical protein [Brucella abortus NCTC 8038] gi|260668786|gb|EEX55726.1| conserved hypothetical protein [Brucella abortus bv. 4 str. 292] gi|260672725|gb|EEX59546.1| conserved hypothetical protein [Brucella abortus bv. 2 str. 86/8/59] gi|260675355|gb|EEX62176.1| conserved hypothetical protein [Brucella abortus bv. 6 str. 870] gi|260873790|gb|EEX80859.1| conserved hypothetical protein [Brucella abortus bv. 9 str. C68] gi|297175969|gb|EFH35316.1| hypothetical protein BAYG_01766 [Brucella abortus bv. 5 str. B3196] Length = 365 Score = 320 bits (819), Expect = 3e-85, Method: Composition-based stats. Identities = 148/365 (40%), Positives = 198/365 (54%), Gaps = 13/365 (3%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L ++ LI G ++V Y A C+ D E GYY+T PFG GDF+TAPE+SQ+FGE++ Sbjct: 5 SLKERLKRLIATTGPISVADYMAACLGDREAGYYTTREPFGREGDFITAPEVSQMFGELI 64 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 I+ + W+ P+ L E+GPGRG +M D+LR I +L P I MVETS RL Sbjct: 65 GIWCLSEWDALARPANFVLCEIGPGRGTLMSDMLRTIGRLAPQMLGGAQIAMVETSPRLA 124 Query: 124 LIQKKQLASYGDKINWYTSLADVP----LGFTFLVANEFFDSLPIKQFVMTEHGIRERMI 179 QK++LA + W+ AD+P G LV NE FD++P +QFV + ERMI Sbjct: 125 EKQKQKLAGTKAHVEWFERFADIPADTVHGPLILVTNELFDAIPFRQFVKADGRFVERMI 184 Query: 180 DIDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRLACDGGTAI 237 +++ D F G I L GAIFE +P R MQ I+ R+A G A+ Sbjct: 185 ALNEQDEFQFVSGAGGIDPALLPKDHVKAEEGAIFEAAPARTALMQEIASRIAATRGAAL 244 Query: 238 VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTT 297 IDYG+L+S GDTLQA+ Y +PG ADL+SHVDF L A G T Sbjct: 245 NIDYGHLESGFGDTLQAMLKQAYDDVFAHPGVADLTSHVDFDILQKTAKACGCKT-GTMT 303 Query: 298 QGKFLEGLGIWQRAFSL--MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKV 355 QG+FL +G+ RA L K A ++ + V+RL A MG LFK+L S E+ Sbjct: 304 QGEFLLAMGLVDRAGRLGAGKDAAFQEKIRQDVERL----AAPDQMGTLFKVLAFSDEQT 359 Query: 356 ELMPF 360 L+PF Sbjct: 360 RLLPF 364 >gi|254714748|ref|ZP_05176559.1| hypothetical protein BcetM6_15699 [Brucella ceti M644/93/1] gi|254717808|ref|ZP_05179619.1| hypothetical protein BcetM_15676 [Brucella ceti M13/05/1] gi|261219656|ref|ZP_05933937.1| conserved hypothetical protein [Brucella ceti M13/05/1] gi|261322544|ref|ZP_05961741.1| conserved hypothetical protein [Brucella ceti M644/93/1] gi|260924745|gb|EEX91313.1| conserved hypothetical protein [Brucella ceti M13/05/1] gi|261295234|gb|EEX98730.1| conserved hypothetical protein [Brucella ceti M644/93/1] Length = 365 Score = 320 bits (819), Expect = 3e-85, Method: Composition-based stats. Identities = 147/365 (40%), Positives = 197/365 (53%), Gaps = 13/365 (3%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L ++ LI G ++V Y A C+ D E GYY+T PFG GDF+TAPE+SQ+FGE++ Sbjct: 5 SLKERLKRLIATTGPISVADYMAACLGDREAGYYTTREPFGREGDFITAPEVSQMFGELI 64 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 I+ + W+ P+ L E+GPGRG +M D+LR I +L P I MVETS RL Sbjct: 65 GIWCLSEWDALARPANFVLCEIGPGRGTLMSDMLRTIGRLAPQMLGGARIAMVETSPRLA 124 Query: 124 LIQKKQLASYGDKINWYTSLADVP----LGFTFLVANEFFDSLPIKQFVMTEHGIRERMI 179 QK++LA + W+ AD+P G LV NE FD++P +QFV + ERMI Sbjct: 125 EKQKQKLAGTKAHVEWFERFADIPADTVHGPLILVTNELFDAIPFRQFVKADGRFVERMI 184 Query: 180 DIDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRLACDGGTAI 237 +++ D F G I L GAIFE +P R MQ I+ R+A G A+ Sbjct: 185 ALNEQDEFQFVSGAGGIDPALLPKDHVKAEEGAIFEAAPARTALMQEIASRIAATRGAAL 244 Query: 238 VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTT 297 IDYG+L+S GDTLQA+ Y +P ADL+SHVDF L A G T Sbjct: 245 NIDYGHLESGFGDTLQAMLKQAYDDVFAHPSVADLTSHVDFDILQKTAKACGCKT-GTMT 303 Query: 298 QGKFLEGLGIWQRAFSL--MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKV 355 QG+FL +G+ RA L K A ++ + V+RL A MG LFK+L S E+ Sbjct: 304 QGEFLLAMGLVDRAGRLGAGKDAAFQEKIRQDVERL----AAPDQMGTLFKVLAFSDEQT 359 Query: 356 ELMPF 360 L+PF Sbjct: 360 RLLPF 364 >gi|23502397|ref|NP_698524.1| hypothetical protein BR1529 [Brucella suis 1330] gi|161619475|ref|YP_001593362.1| hypothetical protein BCAN_A1566 [Brucella canis ATCC 23365] gi|225627970|ref|ZP_03786006.1| Hypothetical protein, conserved [Brucella ceti str. Cudo] gi|225853007|ref|YP_002733240.1| hypothetical protein BMEA_A1583 [Brucella melitensis ATCC 23457] gi|254704782|ref|ZP_05166610.1| hypothetical protein Bsuib36_12849 [Brucella suis bv. 3 str. 686] gi|254708195|ref|ZP_05170023.1| hypothetical protein BpinM_14884 [Brucella pinnipedialis M163/99/10] gi|254710565|ref|ZP_05172376.1| hypothetical protein BpinB_09916 [Brucella pinnipedialis B2/94] gi|256032059|ref|ZP_05445673.1| hypothetical protein BpinM2_15688 [Brucella pinnipedialis M292/94/1] gi|256045151|ref|ZP_05448050.1| hypothetical protein Bmelb1R_11716 [Brucella melitensis bv. 1 str. Rev.1] gi|256061582|ref|ZP_05451723.1| hypothetical protein Bneo5_14615 [Brucella neotomae 5K33] gi|256114091|ref|ZP_05454856.1| hypothetical protein Bmelb3E_14912 [Brucella melitensis bv. 3 str. Ether] gi|256263513|ref|ZP_05466045.1| conserved hypothetical protein [Brucella melitensis bv. 2 str. 63/9] gi|256369945|ref|YP_003107456.1| hypothetical protein BMI_I1543 [Brucella microti CCM 4915] gi|260169194|ref|ZP_05756005.1| hypothetical protein BruF5_12698 [Brucella sp. F5/99] gi|260565251|ref|ZP_05835735.1| conserved hypothetical protein [Brucella melitensis bv. 1 str. 16M] gi|260565975|ref|ZP_05836445.1| conserved hypothetical protein [Brucella suis bv. 4 str. 40] gi|261315700|ref|ZP_05954897.1| conserved hypothetical protein [Brucella pinnipedialis M163/99/10] gi|261318138|ref|ZP_05957335.1| conserved hypothetical protein [Brucella pinnipedialis B2/94] gi|261325589|ref|ZP_05964786.1| conserved hypothetical protein [Brucella neotomae 5K33] gi|261755476|ref|ZP_05999185.1| conserved hypothetical protein [Brucella suis bv. 3 str. 686] gi|261758707|ref|ZP_06002416.1| conserved hypothetical protein [Brucella sp. F5/99] gi|265989169|ref|ZP_06101726.1| conserved hypothetical protein [Brucella pinnipedialis M292/94/1] gi|265991583|ref|ZP_06104140.1| conserved hypothetical protein [Brucella melitensis bv. 1 str. Rev.1] gi|265995419|ref|ZP_06107976.1| conserved hypothetical protein [Brucella melitensis bv. 3 str. Ether] gi|23348382|gb|AAN30439.1| conserved hypothetical protein [Brucella suis 1330] gi|161336286|gb|ABX62591.1| protein of unknown function DUF185 [Brucella canis ATCC 23365] gi|225617133|gb|EEH14179.1| Hypothetical protein, conserved [Brucella ceti str. Cudo] gi|225641372|gb|ACO01286.1| Hypothetical protein, conserved [Brucella melitensis ATCC 23457] gi|256000108|gb|ACU48507.1| hypothetical protein BMI_I1543 [Brucella microti CCM 4915] gi|260151319|gb|EEW86413.1| conserved hypothetical protein [Brucella melitensis bv. 1 str. 16M] gi|260155493|gb|EEW90573.1| conserved hypothetical protein [Brucella suis bv. 4 str. 40] gi|261297361|gb|EEY00858.1| conserved hypothetical protein [Brucella pinnipedialis B2/94] gi|261301569|gb|EEY05066.1| conserved hypothetical protein [Brucella neotomae 5K33] gi|261304726|gb|EEY08223.1| conserved hypothetical protein [Brucella pinnipedialis M163/99/10] gi|261738691|gb|EEY26687.1| conserved hypothetical protein [Brucella sp. F5/99] gi|261745229|gb|EEY33155.1| conserved hypothetical protein [Brucella suis bv. 3 str. 686] gi|262766532|gb|EEZ12321.1| conserved hypothetical protein [Brucella melitensis bv. 3 str. Ether] gi|263002367|gb|EEZ14942.1| conserved hypothetical protein [Brucella melitensis bv. 1 str. Rev.1] gi|263093534|gb|EEZ17568.1| conserved hypothetical protein [Brucella melitensis bv. 2 str. 63/9] gi|264661366|gb|EEZ31627.1| conserved hypothetical protein [Brucella pinnipedialis M292/94/1] gi|326409547|gb|ADZ66612.1| conserved hypothetical protein [Brucella melitensis M28] gi|326539253|gb|ADZ87468.1| conserved hypothetical protein [Brucella melitensis M5-90] Length = 365 Score = 320 bits (819), Expect = 3e-85, Method: Composition-based stats. Identities = 148/365 (40%), Positives = 198/365 (54%), Gaps = 13/365 (3%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L ++ LI G ++V Y A C+ D E GYY+T PFG GDF+TAPE+SQ+FGE++ Sbjct: 5 SLKERLKRLIATTGPISVADYMAACLGDREAGYYTTREPFGREGDFITAPEVSQMFGELI 64 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 I+ + W+ P+ L E+GPGRG +M D+LR I +L P I MVETS RL Sbjct: 65 GIWCLSEWDALARPANFVLCEIGPGRGTLMSDMLRTIGRLAPQMLGGARIAMVETSPRLA 124 Query: 124 LIQKKQLASYGDKINWYTSLADVP----LGFTFLVANEFFDSLPIKQFVMTEHGIRERMI 179 QK++LA + W+ AD+P G LV NE FD++P +QFV + ERMI Sbjct: 125 EKQKQKLAGTKAHVEWFERFADIPADTVHGPLILVTNELFDAIPFRQFVKADGRFVERMI 184 Query: 180 DIDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRLACDGGTAI 237 +++ D F G I L GAIFE +P R MQ I+ R+A G A+ Sbjct: 185 ALNEQDEFQFVSGAGGIDPALLPKDHVKAEEGAIFEAAPARTALMQEIASRIAATRGAAL 244 Query: 238 VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTT 297 IDYG+L+S GDTLQA+ Y +PG ADL+SHVDF L A G T Sbjct: 245 NIDYGHLESGFGDTLQAMLKQAYDDVFAHPGVADLTSHVDFDILQKTAKACGCKT-GTMT 303 Query: 298 QGKFLEGLGIWQRAFSL--MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKV 355 QG+FL +G+ RA L K A ++ + V+RL A MG LFK+L S E+ Sbjct: 304 QGEFLLAMGLVDRAGRLGAGKDAAFQEKIRQDVERL----AAPDQMGTLFKVLAFSDEQT 359 Query: 356 ELMPF 360 L+PF Sbjct: 360 RLLPF 364 >gi|17986770|ref|NP_539404.1| ATP synthase beta subunit/transription termination factor Rho [Brucella melitensis bv. 1 str. 16M] gi|17982399|gb|AAL51668.1| ATP synthase beta subunit/transription termination factor rho [Brucella melitensis bv. 1 str. 16M] Length = 401 Score = 320 bits (819), Expect = 3e-85, Method: Composition-based stats. Identities = 148/365 (40%), Positives = 198/365 (54%), Gaps = 13/365 (3%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L ++ LI G ++V Y A C+ D E GYY+T PFG GDF+TAPE+SQ+FGE++ Sbjct: 41 SLKERLKRLIATTGPISVADYMAACLGDREAGYYTTREPFGREGDFITAPEVSQMFGELI 100 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 I+ + W+ P+ L E+GPGRG +M D+LR I +L P I MVETS RL Sbjct: 101 GIWCLSEWDALARPANFVLCEIGPGRGTLMSDMLRTIGRLAPQMLGGARIAMVETSPRLA 160 Query: 124 LIQKKQLASYGDKINWYTSLADVP----LGFTFLVANEFFDSLPIKQFVMTEHGIRERMI 179 QK++LA + W+ AD+P G LV NE FD++P +QFV + ERMI Sbjct: 161 EKQKQKLAGTKAHVEWFERFADIPADTVHGPLILVTNELFDAIPFRQFVKADGRFVERMI 220 Query: 180 DIDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRLACDGGTAI 237 +++ D F G I L GAIFE +P R MQ I+ R+A G A+ Sbjct: 221 ALNEQDEFQFVSGAGGIDPALLPKDHVKAEEGAIFEAAPARTALMQEIASRIAATRGAAL 280 Query: 238 VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTT 297 IDYG+L+S GDTLQA+ Y +PG ADL+SHVDF L A G T Sbjct: 281 NIDYGHLESGFGDTLQAMLKQAYDDVFAHPGVADLTSHVDFDILQKTAKACGCKT-GTMT 339 Query: 298 QGKFLEGLGIWQRAFSL--MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKV 355 QG+FL +G+ RA L K A ++ + V+RL A MG LFK+L S E+ Sbjct: 340 QGEFLLAMGLVDRAGRLGAGKDAAFQEKIRQDVERL----AAPDQMGTLFKVLAFSDEQT 395 Query: 356 ELMPF 360 L+PF Sbjct: 396 RLLPF 400 >gi|294852846|ref|ZP_06793519.1| ATP synthase beta subunit/transcription termination factor Rho [Brucella sp. NVSL 07-0026] gi|294821435|gb|EFG38434.1| ATP synthase beta subunit/transcription termination factor Rho [Brucella sp. NVSL 07-0026] Length = 404 Score = 319 bits (818), Expect = 4e-85, Method: Composition-based stats. Identities = 147/365 (40%), Positives = 197/365 (53%), Gaps = 13/365 (3%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L ++ LI G ++V Y A C+ D E GYY+T PFG GDF+TAPE+SQ+FGE++ Sbjct: 44 SLKERLKRLIATTGPISVADYMAACLGDREAGYYTTREPFGREGDFITAPEVSQMFGELI 103 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 I+ + W+ P+ L E+GPGRG +M D+LR I +L P I MVETS RL Sbjct: 104 GIWCLSEWDALARPANFVLCEIGPGRGTLMSDMLRTIGRLAPQMLGGARIAMVETSPRLA 163 Query: 124 LIQKKQLASYGDKINWYTSLADV----PLGFTFLVANEFFDSLPIKQFVMTEHGIRERMI 179 QK++LA + W+ AD+ G LV NE FD++P +QFV + ERMI Sbjct: 164 EKQKQKLAGTKAHVEWFERFADISADTVHGPLILVTNELFDAIPFRQFVKADGRFVERMI 223 Query: 180 DIDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRLACDGGTAI 237 +++ D F G I L GAIFE +P R MQ I+ R+A G A+ Sbjct: 224 ALNEQDEFQFVSGAGGIDPALLPKDHVKAEEGAIFEAAPARTALMQEIASRIAATRGAAL 283 Query: 238 VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTT 297 IDYG+L+S GDTLQA+ Y +PG ADL+SHVDF L A G T Sbjct: 284 NIDYGHLESGFGDTLQAMLKQAYDDVFAHPGVADLTSHVDFDILQKTAKACGCKT-GTMT 342 Query: 298 QGKFLEGLGIWQRAFSL--MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKV 355 QG+FL +G+ RA L K A ++ + V+RL A MG LFK+L S E+ Sbjct: 343 QGEFLLAMGLVDRAGRLGAGKDAAFQEKIRQDVERL----AAPDQMGTLFKVLAFSDEQT 398 Query: 356 ELMPF 360 L+PF Sbjct: 399 RLLPF 403 >gi|254694204|ref|ZP_05156032.1| hypothetical protein Babob3T_05979 [Brucella abortus bv. 3 str. Tulya] gi|261214511|ref|ZP_05928792.1| conserved hypothetical protein [Brucella abortus bv. 3 str. Tulya] gi|260916118|gb|EEX82979.1| conserved hypothetical protein [Brucella abortus bv. 3 str. Tulya] Length = 365 Score = 319 bits (817), Expect = 5e-85, Method: Composition-based stats. Identities = 148/365 (40%), Positives = 198/365 (54%), Gaps = 13/365 (3%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L ++ LI G ++V Y A C+ D E GYY+T PFG GDF+TAPE+SQ+FGE++ Sbjct: 5 SLKERLKRLIATTGPISVADYMAACLGDREAGYYTTREPFGREGDFITAPEVSQMFGELI 64 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 I+ + W+ P+ L E+GPGRG +M D+LR I +L P I MVETS RL Sbjct: 65 GIWCLSKWDALARPANFVLCEIGPGRGTLMSDMLRTIGRLAPQMLGGAQIAMVETSPRLA 124 Query: 124 LIQKKQLASYGDKINWYTSLADVP----LGFTFLVANEFFDSLPIKQFVMTEHGIRERMI 179 QK++LA + W+ AD+P G LV NE FD++P +QFV + ERMI Sbjct: 125 EKQKQKLAGTKAHVEWFERFADIPADTVHGPLILVTNELFDAIPFRQFVKADGRFVERMI 184 Query: 180 DIDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRLACDGGTAI 237 +++ D F G I L GAIFE +P R MQ I+ R+A G A+ Sbjct: 185 ALNEQDEFQFVSGAGGIDPALLPKDHVKAEEGAIFEAAPARTALMQEIASRIAATRGAAL 244 Query: 238 VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTT 297 IDYG+L+S GDTLQA+ Y +PG ADL+SHVDF L A G T Sbjct: 245 NIDYGHLESGFGDTLQAMLKQAYDDVFAHPGVADLTSHVDFDILQKTAKACGCKT-GTMT 303 Query: 298 QGKFLEGLGIWQRAFSL--MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKV 355 QG+FL +G+ RA L K A ++ + V+RL A MG LFK+L S E+ Sbjct: 304 QGEFLLAMGLVDRAGRLGAGKDAAFQEKIRQDVERL----AAPDQMGTLFKVLAFSDEQT 359 Query: 356 ELMPF 360 L+PF Sbjct: 360 RLLPF 364 >gi|110634500|ref|YP_674708.1| hypothetical protein Meso_2151 [Mesorhizobium sp. BNC1] gi|110285484|gb|ABG63543.1| protein of unknown function DUF185 [Chelativorans sp. BNC1] Length = 358 Score = 318 bits (816), Expect = 6e-85, Method: Composition-based stats. Identities = 155/362 (42%), Positives = 212/362 (58%), Gaps = 9/362 (2%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 +L +I+ LI+ G M +D Y ALC+ DP+ GYY+T PFGA GDF TAPE+SQ+FGE+ Sbjct: 2 TRLHERILRLIEATGPMGIDAYMALCLFDPDDGYYTTREPFGAAGDFTTAPEVSQMFGEL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 +A++L AW+ G P E+GPGRG +M DILR + KL P S M+E S RL Sbjct: 62 VAVWLYAAWKACGTPDSPLFAEIGPGRGTLMKDILRTLSKLDPQLVSTHRFAMIEVSPRL 121 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 T IQKK L K W++ + D+P G F+V NE FD++PIK++V T G RER++ Sbjct: 122 TAIQKKTLEEAQAKPAWFSRVEDLPDGPLFIVGNELFDAVPIKEYVKTPAGWRERVVGHT 181 Query: 183 QHDSLVFNIGDHEIKSNFLT--CSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 +L F IG + + L G IFE +P R+ M I++RL+ G + ID Sbjct: 182 DDGALAFGIGPGALDPSLLPKDAEQAPEGTIFETAPAREALMDVIAERLSRQFGAGLFID 241 Query: 241 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 YGY +GDTLQAV+ H Y PL +PG+ADL++HVDF L S+A + L L QG+ Sbjct: 242 YGYSDPALGDTLQAVRRHAYDDPLAHPGEADLTAHVDFAALGSVARTHGLET-RLMPQGE 300 Query: 301 FLEGLGIWQRAFSL--MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM 358 FL LG+ +RA L K + + V+RL A + MGELFK + V +E+ Sbjct: 301 FLLDLGLLERAGRLGAGKSAEEQQRIRGEVERL----AGPQEMGELFKAMAVLPVGIEVP 356 Query: 359 PF 360 PF Sbjct: 357 PF 358 >gi|27382556|ref|NP_774085.1| hypothetical protein blr7445 [Bradyrhizobium japonicum USDA 110] gi|27355728|dbj|BAC52710.1| blr7445 [Bradyrhizobium japonicum USDA 110] Length = 371 Score = 318 bits (815), Expect = 8e-85, Method: Composition-based stats. Identities = 140/360 (38%), Positives = 209/360 (58%), Gaps = 12/360 (3%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGE 61 E L+ +I LIK +G M V +Y LC+ P +GYY + +P G GDF TAPE+SQ+FGE Sbjct: 3 EQPLLNEIKALIKSSGPMPVWRYMELCLMHPRYGYYVSRDPLGREGDFTTAPEVSQMFGE 62 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 +L ++ W+Q G P +RL+ELGPGRG MM D LR + L P + L I++VE + Sbjct: 63 LLGLWTASVWKQMGSPQSLRLIELGPGRGTMMADALRALRVLPPL-YQALQIHLVEVNPV 121 Query: 122 LTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 L Q L+ + + W+ S+ DVP G + ++ANE+FD LPI Q V E+G ER+I+I Sbjct: 122 LRERQSATLSGARN-VAWHDSIDDVPEGPSIILANEYFDVLPIHQMVKRENGWHERVIEI 180 Query: 182 DQHDSLVFNIGDHEIKSNFLTCSDY----FLGAIFENSPCRDREMQSISDRLACDGGTAI 237 D + L F + +GA+FE P + + ++ R+ G A+ Sbjct: 181 DPNGKLQFGAASEPTPRFDVLLPPLVRAAPVGAVFEWRPDGE--VMKLATRVRDQDGAAL 238 Query: 238 VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTT 297 +IDYG+L+S GDT QA+ HT+ PL PGQAD+++HVDFQ L+ A ++G T Sbjct: 239 IIDYGHLRSDAGDTFQAIARHTFTDPLKAPGQADVTAHVDFQALARAAEDVGARVHGPVT 298 Query: 298 QGKFLEGLGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVSHEKV 355 QG FL+ +GI RA +LM++ + + ++KRL T + MG +FK+L +S ++ Sbjct: 299 QGDFLKRVGIDTRAAALMQKATPEVATDISVALKRLTDTG--RSGMGSMFKVLGISEPRL 356 >gi|163843785|ref|YP_001628189.1| hypothetical protein BSUIS_A1588 [Brucella suis ATCC 23445] gi|163674508|gb|ABY38619.1| protein of unknown function DUF185 [Brucella suis ATCC 23445] Length = 365 Score = 318 bits (814), Expect = 9e-85, Method: Composition-based stats. Identities = 147/365 (40%), Positives = 198/365 (54%), Gaps = 13/365 (3%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L ++ LI G ++V Y A C+ D + GYY+T PFG GDF+TAPE+SQ+FGE++ Sbjct: 5 SLKERLKRLIAITGPISVADYMAACLGDRKAGYYTTREPFGREGDFITAPEVSQMFGELI 64 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 I+ + W+ P+ L E+GPGRG +M D+LR I +L P I MVETS RL Sbjct: 65 GIWCLSEWDALARPANFVLCEIGPGRGTLMSDMLRTIGRLAPQMLGGARIAMVETSPRLA 124 Query: 124 LIQKKQLASYGDKINWYTSLADVP----LGFTFLVANEFFDSLPIKQFVMTEHGIRERMI 179 QK++LA + W+ AD+P G LV NE FD++P +QFV + ERMI Sbjct: 125 EKQKQKLAGTKAHVEWFERFADIPADTVHGPLILVTNELFDAIPFRQFVKADGRFVERMI 184 Query: 180 DIDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRLACDGGTAI 237 +++ D F G I L GAIFE +P R MQ I+ R+A G A+ Sbjct: 185 ALNEQDEFQFVSGAGGIDPALLPKDHVKAEEGAIFEAAPARTALMQEIASRIAATRGAAL 244 Query: 238 VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTT 297 IDYG+L+S GDTLQA+ Y +PG ADL+SHVDF L A G T Sbjct: 245 NIDYGHLESGFGDTLQAMLKQAYDDVFAHPGVADLTSHVDFDILQKTAKACGCKT-GTMT 303 Query: 298 QGKFLEGLGIWQRAFSL--MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKV 355 QG+FL +G+ RA L K A ++ + V+RL A MG LFK+L S E+ Sbjct: 304 QGEFLLAMGLVDRAGRLGAGKDAAFQEKIRQDVERL----AAPDQMGTLFKVLAFSDEQT 359 Query: 356 ELMPF 360 L+PF Sbjct: 360 RLLPF 364 >gi|148559414|ref|YP_001259406.1| hypothetical protein BOV_1478 [Brucella ovis ATCC 25840] gi|148370671|gb|ABQ60650.1| conserved hypothetical protein [Brucella ovis ATCC 25840] Length = 365 Score = 317 bits (812), Expect = 2e-84, Method: Composition-based stats. Identities = 148/365 (40%), Positives = 198/365 (54%), Gaps = 13/365 (3%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L ++ LI G ++V Y A C+ D E GYY+T PFG GDF+TAPE+SQ+FGE++ Sbjct: 5 SLKERLKRLIATTGPISVADYIAACLGDREAGYYTTREPFGREGDFITAPEVSQMFGELI 64 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 I+ + W+ P+ L E+GPGRG +M D+LR I +L P I MVETS RL Sbjct: 65 GIWCLSEWDALARPANFVLCEIGPGRGTLMSDMLRTIGRLAPQMLGGARIAMVETSPRLA 124 Query: 124 LIQKKQLASYGDKINWYTSLADVP----LGFTFLVANEFFDSLPIKQFVMTEHGIRERMI 179 QK++LA + W+ AD+P G LV NE FD++P +QFV + ERMI Sbjct: 125 EKQKQKLAGTKAHVEWFERFADIPADTVHGPLILVTNELFDAIPFRQFVKADGRFVERMI 184 Query: 180 DIDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRLACDGGTAI 237 +++ D F G I L GAIFE +P R MQ I+ R+A G A+ Sbjct: 185 ALNEQDEFQFVSGAGGIDPALLPKDHVKAEEGAIFEAAPARTALMQEIASRIAATRGAAL 244 Query: 238 VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTT 297 IDYG+L+S GDTLQA+ Y +PG ADL+SHVDF L A G T Sbjct: 245 NIDYGHLESGFGDTLQAMLKQAYDDVFAHPGVADLTSHVDFDILQKTAKACGCKT-GTMT 303 Query: 298 QGKFLEGLGIWQRAFSL--MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKV 355 QG+FL +G+ RA L K A ++ + V+RL A MG LFK+L S E+ Sbjct: 304 QGEFLLAMGLVDRAGRLGAGKDAAFQEKIRQDVERL----AAPDQMGTLFKVLAFSDEQT 359 Query: 356 ELMPF 360 L+PF Sbjct: 360 RLLPF 364 >gi|85716018|ref|ZP_01046995.1| hypothetical protein NB311A_14415 [Nitrobacter sp. Nb-311A] gi|85697216|gb|EAQ35097.1| hypothetical protein NB311A_14415 [Nitrobacter sp. Nb-311A] Length = 374 Score = 316 bits (810), Expect = 3e-84, Method: Composition-based stats. Identities = 138/363 (38%), Positives = 200/363 (55%), Gaps = 15/363 (4%) Query: 1 MENK--LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQI 58 M L+ I LIK +G + V +Y LC+ PE GYY +P G GDF+T+PE+SQ+ Sbjct: 1 MTEPAPLLADIKRLIKTSGPLPVWRYMQLCLTHPEHGYYIARDPLGREGDFITSPEVSQM 60 Query: 59 FGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVET 118 FGE+L ++ W G P +RL+ELGPGRG MM D LR + L P + LS++MVE Sbjct: 61 FGELLGLWGASVWRTIGSPLTLRLIELGPGRGTMMADALRALRVLPP-MYESLSVHMVEI 119 Query: 119 SERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERM 178 + L Q L+ I W+ SL +VP G + ANE+FD LP+ Q V + G ER+ Sbjct: 120 NPVLREKQMAALSD-APNIQWHASLDEVPQGPAIIFANEYFDVLPVHQMVKGDDGWHERV 178 Query: 179 IDIDQHDSLVFNIGDHEIKSNFLTCSDY----FLGAIFENSPCRDREMQSISDRLACDGG 234 +DI LVF + + +GAIFE P + SI+ R+ GG Sbjct: 179 VDI-DGGQLVFGVSATPTPRFDVLLPPLVRAAPVGAIFEWRPDAEIM--SIATRVRDQGG 235 Query: 235 TAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYING 294 A++IDYG+ +S GDT QA+ H++ PL PG+ D+++HVDF+ L+ A ++G Sbjct: 236 AALIIDYGHERSDAGDTFQAIARHSFADPLKYPGRVDVTAHVDFEALARAAEDVGARVHG 295 Query: 295 LTTQGKFLEGLGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVSH 352 TQG+FL LGI RA +LM + + D + ++KRL + MG +FK++ VS Sbjct: 296 PVTQGEFLRRLGIEARAVNLMAKATAEVSDGIASALKRLTE--GGRGGMGSMFKVIGVSS 353 Query: 353 EKV 355 + Sbjct: 354 PGL 356 >gi|154252708|ref|YP_001413532.1| hypothetical protein Plav_2261 [Parvibaculum lavamentivorans DS-1] gi|154156658|gb|ABS63875.1| protein of unknown function DUF185 [Parvibaculum lavamentivorans DS-1] Length = 356 Score = 316 bits (809), Expect = 4e-84, Method: Composition-based stats. Identities = 140/366 (38%), Positives = 202/366 (55%), Gaps = 16/366 (4%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 M + L R+I LI++ G + + QY AL + PE GYY T +P GA GDFVTAPEISQ+FG Sbjct: 1 MTSPLARQIARLIEQTGPIPLSQYMALALGHPEHGYYMTRDPLGARGDFVTAPEISQMFG 60 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 E++ ++L W + G P L ELGPGRG +M D LR I + P SI++VETS Sbjct: 61 ELVGLWLADQWLEQGSPKPFVLAELGPGRGTLMADALRAIAAV-PHMVEAASIHLVETSP 119 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 L Q K++ + +W+ + D+P FLVANEFFD+LP+ Q+ TE G ER + Sbjct: 120 VLRNAQSKRIP----QAHWHEHVDDLPDLPLFLVANEFFDALPVTQYQRTERGWCERFVS 175 Query: 181 IDQHDSLVFNIGDHEIKSNFLTC---SDYFLGAIFENSPCRDREMQSISDRLACDGGTAI 237 V + + + G+I E SP ++I+ R+A GG A+ Sbjct: 176 -MAEGRFVPVLAPVPLADDSGLPAAMKAAQEGSIAEVSPASTSITETIAHRIARRGGAAL 234 Query: 238 VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTT 297 VIDYG++ S GDTLQA++ H + P PG+ADL++HVDF+ LS A +G Sbjct: 235 VIDYGHVSSAPGDTLQALRDHKFADPFEAPGEADLTAHVDFEALSHAASAAGAAAHGAVE 294 Query: 298 QGKFLEGLGIWQRAFSLMKQ--TARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKV 355 QG+FL LGI RA +L + A+++ + +++RL + MG LFK+L ++ Sbjct: 295 QGRFLMALGIEARAEALSRNATPAQREDIASAMQRLTAR----DGMGSLFKVLGITPRGA 350 Query: 356 E-LMPF 360 F Sbjct: 351 PSPAGF 356 >gi|92118562|ref|YP_578291.1| hypothetical protein Nham_3094 [Nitrobacter hamburgensis X14] gi|91801456|gb|ABE63831.1| protein of unknown function DUF185 [Nitrobacter hamburgensis X14] Length = 375 Score = 315 bits (807), Expect = 7e-84, Method: Composition-based stats. Identities = 140/355 (39%), Positives = 204/355 (57%), Gaps = 12/355 (3%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 + L+ I LIK +G + V +Y LC+ PE GYY +P G GDF+T+PE+SQ+FGE+ Sbjct: 5 SPLLPDIKKLIKTSGPLPVWRYMQLCLTHPEHGYYIARDPLGREGDFITSPEVSQMFGEL 64 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + ++ W G P+ +RL+ELGPGRG MM D LR + L P LS+++VE + L Sbjct: 65 IGLWAASVWRAMGSPTTLRLIELGPGRGTMMADALRALRVLPPMH-QALSVHLVEINPVL 123 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 QK L+ I W+ SL +VP G ++ANE+FD LP+ Q V + G ER++DID Sbjct: 124 REKQKAALSDAR-TIQWHASLDEVPQGPAIILANEYFDVLPVHQMVKRDDGWYERVVDID 182 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDY----FLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 LVF +GAIFE P + M +I+ R+ GG A++ Sbjct: 183 GSGQLVFGTTAAPTPRFDALLPPLVRAAPVGAIFEWRPDAE--MMTIATRVRDHGGAALI 240 Query: 239 IDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQ 298 IDYG+++S GDT QA+ GH++ PL PGQAD+++HVDFQ L+ A ++G TQ Sbjct: 241 IDYGHVRSDAGDTFQAIAGHSFADPLKYPGQADVTAHVDFQALARAAEDIGARVHGPVTQ 300 Query: 299 GKFLEGLGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVS 351 G+FL+ LGI RA +LM + + + + ++KRL + MG +FK + VS Sbjct: 301 GEFLQRLGIEARAVNLMAKATPEISEGISTALKRLTE--GGRGGMGSMFKAIGVS 353 >gi|328542815|ref|YP_004302924.1| ATP synthase beta subunit/transription termination factor rho [polymorphum gilvum SL003B-26A1] gi|326412561|gb|ADZ69624.1| ATP synthase beta subunit/transription termination factor rho [Polymorphum gilvum SL003B-26A1] Length = 364 Score = 315 bits (806), Expect = 9e-84, Method: Composition-based stats. Identities = 148/366 (40%), Positives = 211/366 (57%), Gaps = 11/366 (3%) Query: 1 ME-NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQI 58 M L +++ I G +TV Y A C+ DP+ GYY+T PFG GDF+TAPE+SQ+ Sbjct: 1 MSATPLCDRLIRRIALYGPITVADYMAACLGDPDHGYYTTAAEPFGRAGDFITAPEVSQM 60 Query: 59 FGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVET 118 FGE++ + + AW+ G P+ VRLVELGPGRG +M D+LR L+PDF + ++++VET Sbjct: 61 FGELIGAWTVAAWQAMGAPASVRLVELGPGRGTLMADLLRTAA-LRPDFLAAATLHLVET 119 Query: 119 SERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERM 178 S RL +Q K LA W+ L DVP G LVANEFFD+LPI Q+V T G RER Sbjct: 120 SPRLGAVQAKTLAGAALAPIWHDRLDDVPDGPLLLVANEFFDALPIHQYVRTPTGWRERC 179 Query: 179 IDIDQHDSLVFNIGDHEIKSNFLT--CSDYFLGAIFENSPCRDREMQSISDRLACDGGTA 236 + + + SL F IG + + GAI E +P + I RL GG A Sbjct: 180 VGLSEEGSLAFGIGVARLPDTAIPGTARAAPDGAILETAPMAAGIARRIGARLKAQGGAA 239 Query: 237 IVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLT 296 +++DYG++ + GDTLQA++ H + L +PG ADL++HVDF+ L+ A + G Sbjct: 240 LIVDYGHMHTAPGDTLQALRRHAHDDVLASPGVADLTAHVDFESLAQAARDGGAHSFGPL 299 Query: 297 TQGKFLEGLGIWQRAFSL--MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 QG FL LG+ +RA +L K ++ + V+RL A + MG LFK+L ++ + Sbjct: 300 EQGDFLLRLGLLERAGALGAGKSPDVQERIRADVERL----AAPERMGSLFKVLALTDGR 355 Query: 355 VELMPF 360 +PF Sbjct: 356 FVPVPF 361 >gi|91978339|ref|YP_570998.1| hypothetical protein RPD_3876 [Rhodopseudomonas palustris BisB5] gi|91684795|gb|ABE41097.1| protein of unknown function DUF185 [Rhodopseudomonas palustris BisB5] Length = 376 Score = 314 bits (805), Expect = 1e-83, Method: Composition-based stats. Identities = 142/360 (39%), Positives = 212/360 (58%), Gaps = 12/360 (3%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGE 61 + L+ +I LIK G M V +Y LC+ P +GYY + +P G GDF T+PEISQ+FGE Sbjct: 4 NSPLLAEIKRLIKSTGPMPVWRYMELCLNHPLYGYYVSRDPLGREGDFTTSPEISQMFGE 63 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 ++ ++ W+ G P +RL+E+GPGRG M+ D LR + L P + LS+++VE + Sbjct: 64 LIGLWAASVWKATGEPDVLRLIEIGPGRGTMIADALRALRVLPPL-YQSLSVHLVEINPV 122 Query: 122 LTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 L QK LA + I+W+ + ADVP G ++ANE+FD LPI Q V + G ER+I+I Sbjct: 123 LREKQKATLAGIRN-IHWHDTFADVPDGPAVILANEYFDVLPIHQAVKRDGGWHERVIEI 181 Query: 182 DQHDSLVFNIGDHEIKSNFLTCSDY----FLGAIFENSPCRDREMQSISDRLACDGGTAI 237 LVF + I + GA+FE D E+ +I+ RL GG A+ Sbjct: 182 SASGELVFGVAPDPIPRFDILLPHLVRMAPAGAVFEWR--SDAEIMAIATRLRDQGGAAL 239 Query: 238 VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTT 297 +IDYG+++S VGDT QA+ H++ PL NPG+ADL++HVDFQ L A ++G T Sbjct: 240 IIDYGHIRSDVGDTFQAIARHSFADPLQNPGRADLTAHVDFQALGRAAEDVGARLHGPVT 299 Query: 298 QGKFLEGLGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVSHEKV 355 QG+FL+ LGI RA SLM + + + + + +++RL + +MG +FK++ VS + Sbjct: 300 QGEFLKRLGIETRALSLMAKASPQVSEDISGALRRLT--GEGRGAMGSMFKVIGVSDPNI 357 >gi|90422923|ref|YP_531293.1| hypothetical protein RPC_1412 [Rhodopseudomonas palustris BisB18] gi|90104937|gb|ABD86974.1| protein of unknown function DUF185 [Rhodopseudomonas palustris BisB18] Length = 375 Score = 314 bits (804), Expect = 1e-83, Method: Composition-based stats. Identities = 145/364 (39%), Positives = 209/364 (57%), Gaps = 14/364 (3%) Query: 1 MENK--LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQI 58 M L I LIK G M V +Y LC+ PEFGYY + +P G GDF TAPE+SQ+ Sbjct: 1 MTEPFSLQDVIKKLIKSAGPMPVWRYMELCLTHPEFGYYVSRDPLGREGDFTTAPEVSQM 60 Query: 59 FGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVET 118 FGE+L ++ W G P VRL+E GPGRG MM D LR + + P F L ++++E Sbjct: 61 FGELLGLWAASVWRSIGSPQLVRLIEFGPGRGTMMADALRALRVVPPL-FQALHVHLIEI 119 Query: 119 SERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERM 178 + L QK LA + ++W+ SL +VP G T + ANE+FD LPI Q V EHG ER Sbjct: 120 NPVLREKQKATLAGAQN-LHWHASLDEVPGGSTIIFANEYFDVLPIHQMVRGEHGWHERT 178 Query: 179 IDIDQHDSLVFNIGDHEIKSNFLTCSDY----FLGAIFENSPCRDREMQSISDRLACDGG 234 ++ID + LVF + + GA+FE P + I+ R+ +GG Sbjct: 179 VEIDAAERLVFGVAPEPVPHFEQLLPPLVRAAPQGAVFEWRPDAEIM--KIASRVRDEGG 236 Query: 235 TAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYING 294 A++IDYG+ +S GDT QA+ H++ PL NPG+AD+++HVDFQ L+ A ++G Sbjct: 237 AALIIDYGHPRSDAGDTFQAIARHSFADPLQNPGRADVTAHVDFQALARGAQDVGARVHG 296 Query: 295 LTTQGKFLEGLGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVSH 352 TQG+FL+ LGI RA +LM + + + + + ++KRLV + MG +FK++ VS Sbjct: 297 PVTQGEFLKRLGIENRAVALMAKASLEVSEDVASALKRLVE--GGRGGMGSMFKVMAVSE 354 Query: 353 EKVE 356 ++E Sbjct: 355 PEIE 358 >gi|163742925|ref|ZP_02150309.1| hypothetical protein RG210_12425 [Phaeobacter gallaeciensis 2.10] gi|161383889|gb|EDQ08274.1| hypothetical protein RG210_12425 [Phaeobacter gallaeciensis 2.10] Length = 355 Score = 312 bits (800), Expect = 5e-83, Method: Composition-based stats. Identities = 126/361 (34%), Positives = 187/361 (51%), Gaps = 10/361 (2%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L +I + I+ G + V Y A + P +GYY+T +P G GDF+TAPEISQ+FGE++ Sbjct: 2 SLSDRIASRIRTEGPIPVADYMAEALLHPTYGYYTTRDPLGRAGDFITAPEISQMFGELI 61 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 + L W G P L ELGPGRG +M D+LR ++ P F + I ++E S L Sbjct: 62 GLALAQCWLDQGSPKPFTLAELGPGRGTLMADLLRATKQV-PGFHDAMQIALLEASPTLR 120 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 Q + L+ W +L +P FL+ANEFFD+LP++QF+ G RE+ + + Sbjct: 121 SRQAETLSGSTPL--WLDTLEALPEQPLFLIANEFFDALPVRQFLRDGDGWREKSVGLQD 178 Query: 184 HDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY 243 + D + E +Q+I+ R+A GG A+++DYG Sbjct: 179 GKLSFGLGAAAPQPALAHRLEDTRDDDLVELCEAAQPMVQTIAARIATHGGAALIVDYGD 238 Query: 244 LQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLE 303 +S +GDTLQA++ H PL +PG ADL++HVDF+ L+ A + LT QG FLE Sbjct: 239 WRS-LGDTLQALRAHAPSDPLKDPGSADLTTHVDFEALALTAKAAGCTHSRLTPQGVFLE 297 Query: 304 GLGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMPFV 361 LGI RA +L + L+ + +RL + MG LFK+L + + P + Sbjct: 298 RLGITDRAQALAARLEGDSLQSLIAAHRRLTH----PEEMGNLFKVLGIFPMQASPPPGL 353 Query: 362 N 362 N Sbjct: 354 N 354 >gi|154246490|ref|YP_001417448.1| hypothetical protein Xaut_2549 [Xanthobacter autotrophicus Py2] gi|154160575|gb|ABS67791.1| protein of unknown function DUF185 [Xanthobacter autotrophicus Py2] Length = 368 Score = 312 bits (799), Expect = 5e-83, Method: Composition-based stats. Identities = 140/364 (38%), Positives = 200/364 (54%), Gaps = 11/364 (3%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 M L ++I LI G M + +Y ALC+ P GYY T +P GA GDF TAPEISQ+FG Sbjct: 1 MTTPLSKEISALIAAEGPMPLSRYMALCLGHPRHGYYMTRDPLGARGDFTTAPEISQMFG 60 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 E+L ++ + W+ G P RLVELGPGRG +M D LR L PDF + I++VETS Sbjct: 61 ELLGLWAVAQWQAMGSPPAFRLVELGPGRGTLMADALRAAR-LVPDFGAAARIHLVETSP 119 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 L Q + LA++ D+++W+ + +VP G ++ANEFFD+LPI Q+V ER + Sbjct: 120 VLRAAQARTLAAHADRVSWHDRVEEVPDGPALVLANEFFDALPIDQYVFHAGHWHERRVG 179 Query: 181 IDQHDSLVFNIGDHEIKSNFLTCSDYF---LGAIFENSPCRDREMQSISDRLACDGGTAI 237 +D LV + ++ + G + E+ +++S+RL GG A+ Sbjct: 180 LDDGGRLVLGLDPAPSRAAPAFAAHLPPPAEGVVLEHLESGPA--RALSERLKTQGGAAL 237 Query: 238 VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTT 297 +IDYG+ GDT QA++ H + PL PG ADL++HVDF L+ I L G Sbjct: 238 IIDYGHAGG-YGDTFQALEQHRFADPLAAPGNADLTAHVDFSALARIGRAAGLRAFGPLE 296 Query: 298 QGKFLEGLGIWQRAFSLMKQTARKDIL--LDSVKRLVSTSADKKSMGELFKILVVSHEKV 355 QG FL LG+ QRA L + + + +RL A MG LFK+LV++H ++ Sbjct: 297 QGAFLARLGLAQRAERLKRDATDELRAGVDAAARRLAGDGA--GEMGRLFKVLVLAHPEI 354 Query: 356 ELMP 359 L P Sbjct: 355 GLPP 358 >gi|86751272|ref|YP_487768.1| hypothetical protein RPB_4165 [Rhodopseudomonas palustris HaA2] gi|86574300|gb|ABD08857.1| Protein of unknown function DUF185 [Rhodopseudomonas palustris HaA2] Length = 376 Score = 312 bits (799), Expect = 6e-83, Method: Composition-based stats. Identities = 142/360 (39%), Positives = 213/360 (59%), Gaps = 12/360 (3%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGE 61 ++ L+ +I LI+ G M V +Y LC+A PE+GYY + +P G GDF T+PEISQ+FGE Sbjct: 4 DSPLLAEIKRLIETAGPMPVWRYMELCLAHPEYGYYVSRDPLGREGDFTTSPEISQMFGE 63 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 ++ ++ W+ G P +RL+E+GPGRG M+ D LR + L P + LS+++VE + Sbjct: 64 LIGLWTASVWKAVGEPGVLRLIEIGPGRGTMIADALRALRVLPPL-YQSLSVHLVEINPV 122 Query: 122 LTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 L Q+ LA + ++W+ A+VP G ++ANE+FD LPI Q V + G ER+I+I Sbjct: 123 LRAKQQATLAGIRN-VHWHEDFAEVPEGPAVVLANEYFDVLPIHQAVKRDGGWHERVIEI 181 Query: 182 DQHDSLVFNIGDHEIKSNFLTCSDY----FLGAIFENSPCRDREMQSISDRLACDGGTAI 237 LVF + D I + G +FE P D E+ +I+ RL GG A+ Sbjct: 182 SASGDLVFGVADDPIPRFEVLLPPLVQMAPAGTVFEWRP--DNEIMAIAARLRDQGGAAL 239 Query: 238 VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTT 297 +IDYG+++S VGDT QA+ H++ PL +PG ADL++HVDFQ L A I+G T Sbjct: 240 IIDYGHVRSDVGDTFQAIARHSFADPLQHPGGADLTAHVDFQALGRAAETIGARIHGPVT 299 Query: 298 QGKFLEGLGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVSHEKV 355 QG+FL+ LGI RA SLM + + + + + ++KRL + MG +FK++ VS + Sbjct: 300 QGEFLKRLGIETRALSLMAKASAQVSEDIAGALKRLT--GEGRGGMGAMFKVIGVSDPSI 357 >gi|13472396|ref|NP_103963.1| hypothetical protein mlr2680 [Mesorhizobium loti MAFF303099] gi|14023142|dbj|BAB49749.1| mlr2680 [Mesorhizobium loti MAFF303099] Length = 362 Score = 312 bits (798), Expect = 7e-83, Method: Composition-based stats. Identities = 154/362 (42%), Positives = 220/362 (60%), Gaps = 9/362 (2%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 +L +IV+LI G + V++Y A+C+ DP GYY+T PFGA GDFVTAPEISQ+FGE+ Sbjct: 2 TRLKTRIVDLIDALGPLPVNEYMAMCLFDPADGYYTTREPFGAAGDFVTAPEISQMFGEL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 +A++L AW P V + E+GPGRG +M D+LR + +L P + M+ETS RL Sbjct: 62 VAVWLYQAWAAIARPMPVTIAEIGPGRGTLMKDMLRTLSRLDPALANGAVFAMIETSPRL 121 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 IQK+ L + + W+ ++ +P +V NE FD++PI+QFV T G RERM+ +D Sbjct: 122 AEIQKQTLGATPFAVRWHETIETLPDQPLLIVGNELFDAVPIRQFVRTATGWRERMVSLD 181 Query: 183 QHDSLVFNIGDHEIKSNFLTC--SDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 D L F G I L ++ GAI E +P R M +I+ R+A +GG + +D Sbjct: 182 DKDELRFFAGAGSIDPTLLPLDAAEAPQGAIVEVAPARAALMAAIAGRMARNGGAGLFLD 241 Query: 241 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 YG+LQ VGDTLQA++ H + L NPG+ADL+SHVDF L++ + L ++ L+TQG Sbjct: 242 YGHLQPGVGDTLQALRRHNHEDVLANPGEADLTSHVDFAALAATVRAHGLDVH-LSTQGS 300 Query: 301 FLEGLGIWQRAFSLMKQTAR--KDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM 358 FL G+GI +RA L + +D + V+RL A ++MGELFK+L V V + Sbjct: 301 FLLGMGILERAGRLGADGDQAARDKITGEVERL----AGPQAMGELFKVLAVLPRGVPIR 356 Query: 359 PF 360 PF Sbjct: 357 PF 358 >gi|316935878|ref|YP_004110860.1| hypothetical protein Rpdx1_4577 [Rhodopseudomonas palustris DX-1] gi|315603592|gb|ADU46127.1| protein of unknown function DUF185 [Rhodopseudomonas palustris DX-1] Length = 379 Score = 311 bits (796), Expect = 1e-82, Method: Composition-based stats. Identities = 144/360 (40%), Positives = 203/360 (56%), Gaps = 12/360 (3%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L +I LIK G M V +Y LC+ PE GYY T +P G GDF T+PEISQ+FGE+ Sbjct: 5 TALATEIKRLIKAAGPMPVWRYMELCLGHPEHGYYVTRDPLGREGDFTTSPEISQMFGEL 64 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L ++ W+ P +RL+E+GPGRG MM D LR + L P + LS+++VE + L Sbjct: 65 LGLWSASVWKAADEPQTLRLIEIGPGRGTMMADALRALRVL-PILYQSLSVHLVEINPVL 123 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 Q+ LA + I+W+ S DVP G ++ANE+FD LPI Q + E G ER+I+I Sbjct: 124 RQKQQTTLAGIRN-IHWHDSFDDVPEGPAVVLANEYFDVLPIHQAIKRETGWHERVIEIG 182 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDY----FLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 LVF + I GA+FE P + + I+ R+ GG A++ Sbjct: 183 SAGELVFGVAADPIPGFEALLPPLVRLAPPGAVFEWRPDAE--ILKIASRVRDQGGAALI 240 Query: 239 IDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQ 298 IDYG+L+S VGDT QA+ H+Y PL +PG+ADL++HVDF L A +G TQ Sbjct: 241 IDYGHLRSDVGDTFQAIASHSYADPLQHPGRADLTAHVDFDALGRAAESVGARAHGPVTQ 300 Query: 299 GKFLEGLGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE 356 G FL LGI RA SLM + + + + +++RL + +MG +FK++ VS K+E Sbjct: 301 GTFLRRLGIETRALSLMAKATPQVSEDIAGALQRLT--GEGRGAMGSMFKVIGVSDPKIE 358 >gi|209886306|ref|YP_002290163.1| Aby [Oligotropha carboxidovorans OM5] gi|209874502|gb|ACI94298.1| Aby [Oligotropha carboxidovorans OM5] Length = 369 Score = 310 bits (795), Expect = 2e-82, Method: Composition-based stats. Identities = 141/359 (39%), Positives = 204/359 (56%), Gaps = 13/359 (3%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 + L + +LI G M V +Y LC+ P++GYY + +P G GDF+TAPEISQ+FGE+ Sbjct: 5 SPLEHYLRHLIATAGPMPVARYMQLCMTHPDYGYYVSRDPLGRGGDFITAPEISQMFGEL 64 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + ++ W G P V+L+ELGPGRG MM D LR I L P F+ + ++++E S L Sbjct: 65 IGLWAASVWNAMGMPERVQLIELGPGRGTMMADALRAIRIL-PAFYEAIEVHLIELSPSL 123 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 +Q+ LA W+ L DVP G ++ANE+FD+LPI Q V E G ERM+ + Sbjct: 124 RAVQRDTLADVKPF-QWHHLLGDVPDGPAIILANEYFDALPIHQMVKQETGWHERMVGL- 181 Query: 183 QHDSLVFNIGDHEIKSNFLTCS----DYFLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 D+ + I ++ + G IFE E+ +I+ R+ GG A++ Sbjct: 182 VDDAFAYTIAPEPTPRFGVSVPPHVRNAPNGTIFEWRE--TNEIMAIARRIREFGGAALL 239 Query: 239 IDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQ 298 IDYG+++S GDT QAV H Y PL PG ADL++HVDFQ L+ A + +G Q Sbjct: 240 IDYGHIRSDAGDTFQAVARHEYADPLRTPGTADLTAHVDFQALADAAQTMGVLTHGPVEQ 299 Query: 299 GKFLEGLGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVSHEKV 355 G FL LGI RA +L+K +A + + ++KRLV + K+MG LFK+L +SH + Sbjct: 300 GAFLTQLGIETRAQTLIKHSASQSASDVASALKRLVESG--PKAMGSLFKVLGLSHPSI 356 >gi|192293202|ref|YP_001993807.1| hypothetical protein Rpal_4843 [Rhodopseudomonas palustris TIE-1] gi|192286951|gb|ACF03332.1| protein of unknown function DUF185 [Rhodopseudomonas palustris TIE-1] Length = 379 Score = 310 bits (795), Expect = 2e-82, Method: Composition-based stats. Identities = 144/360 (40%), Positives = 204/360 (56%), Gaps = 12/360 (3%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L +I LIK G M V +Y LC+ PE GYY T +P G GDF T+PEISQ+FGE+ Sbjct: 5 TALATEIKRLIKAAGPMPVWRYMELCLGHPEHGYYVTRDPLGREGDFTTSPEISQMFGEL 64 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L ++ W+ P +RL+E+GPGRG MM D LR + L P + LS+++VE + L Sbjct: 65 LGLWSASVWKAADEPQTLRLIEIGPGRGTMMADALRALRVL-PILYQSLSVHLVEINPVL 123 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 Q+ LA + I+W+ S DVP G ++ANE+FD LPI Q + E G ER+I+I Sbjct: 124 RQKQQTLLAGIRN-IHWHDSFEDVPEGPAVILANEYFDVLPIHQAIKRETGWHERVIEIG 182 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDY----FLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 LVF + I GA+FE P + + I+ R+ GG A++ Sbjct: 183 ASGELVFGVAADPIPGFEALLPPLVRLSPPGAVFEWRPDTE--ILKIASRVRDQGGAALI 240 Query: 239 IDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQ 298 IDYG+L+S VGDT QA+ H+Y PL +PG+ADL++HVDF L A +G TQ Sbjct: 241 IDYGHLRSDVGDTFQAIASHSYADPLQHPGRADLTAHVDFDALGRAAESIGARAHGPVTQ 300 Query: 299 GKFLEGLGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE 356 G FL+ LGI RA SLM + + + + +++RL + +MG +FK++ VS K+E Sbjct: 301 GAFLKRLGIETRALSLMAKATPQVSEDIAGALQRLT--GEGRGAMGSMFKVIGVSDPKIE 358 >gi|39937419|ref|NP_949695.1| hypothetical protein RPA4359 [Rhodopseudomonas palustris CGA009] gi|39651278|emb|CAE29800.1| DUF185 [Rhodopseudomonas palustris CGA009] Length = 379 Score = 310 bits (795), Expect = 2e-82, Method: Composition-based stats. Identities = 144/360 (40%), Positives = 204/360 (56%), Gaps = 12/360 (3%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L +I LIK G M V +Y LC+ PE GYY T +P G GDF T+PEISQ+FGE+ Sbjct: 5 TALATEIKRLIKAAGPMPVWRYMELCLGHPEHGYYVTRDPLGREGDFTTSPEISQMFGEL 64 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L ++ W+ P +RL+E+GPGRG MM D LR + L P + LS+++VE + L Sbjct: 65 LGLWSASVWKAADEPQTLRLIEIGPGRGTMMADALRALRVL-PILYQSLSVHLVEINPVL 123 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 Q+ LA + I+W+ S DVP G ++ANE+FD LPI Q + E G ER+I+I Sbjct: 124 RQKQQTLLAGIRN-IHWHDSFEDVPEGPAVILANEYFDVLPIHQAIKRETGWHERVIEIG 182 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDY----FLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 LVF + I GA+FE P + + I+ R+ GG A++ Sbjct: 183 ASGELVFGVAADPIPGFEALLPPLARLSPPGAVFEWRPDTE--ILKIASRVRDQGGAALI 240 Query: 239 IDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQ 298 IDYG+L+S VGDT QA+ H+Y PL +PG+ADL++HVDF L A +G TQ Sbjct: 241 IDYGHLRSDVGDTFQAIASHSYADPLQHPGRADLTAHVDFDALGRAAESIGARAHGPVTQ 300 Query: 299 GKFLEGLGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE 356 G FL+ LGI RA SLM + + + + +++RL + +MG +FK++ VS K+E Sbjct: 301 GAFLKRLGIETRALSLMAKATPQVSEDIAGALQRLT--GEGRGAMGSMFKVIGVSDPKIE 358 >gi|46200845|ref|ZP_00207869.1| COG1565: Uncharacterized conserved protein [Magnetospirillum magnetotacticum MS-1] Length = 357 Score = 310 bits (795), Expect = 2e-82, Method: Composition-based stats. Identities = 129/359 (35%), Positives = 189/359 (52%), Gaps = 11/359 (3%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L + IK G + V ++ A + PE+GYY +PFG GDF TAPEISQ+FGE++ Sbjct: 2 SLSALLSERIKATGPIPVSEFMAEALGHPEYGYYRGRDPFGMAGDFTTAPEISQMFGELI 61 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 ++ W+ G P V L E+GPGRG +M D+LR L P L ++++ETS L Sbjct: 62 GLWCALVWQSMGSPERVVLAEIGPGRGTLMADLLRAAKALAPF-ARALDVHLIETSPSLR 120 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 Q + LA + W+ D+P G LVANE FD+LPI+Q ER++ +D Sbjct: 121 NRQAQALAD--QSVTWHERFEDLPDGPLLLVANELFDALPIRQLEKVGGVWHERVVGLDD 178 Query: 184 HDSLVFNIGDHEIKSNFLTC-SDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 +LV +G + G++ E P ++++ RLA GG A++IDYG Sbjct: 179 QGALVLALGPVVADPPLAPAVLNAPDGSLAEVCPQGRVLAEAVARRLAHQGGAALIIDYG 238 Query: 243 YLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 Y S GD+LQAVK H + L PG AD+++HVDFQ L+ A + G QG+FL Sbjct: 239 YETSAAGDSLQAVKSHRHHPVLSAPGTADITAHVDFQALAEAASGL-ARVYGPVPQGRFL 297 Query: 303 EGLGIWQRAFSLMKQTA--RKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMP 359 LG+ +R LM+ + + L +RL+ D MG LFK+L +++ + P Sbjct: 298 ARLGLEERVRMLMQHASVEQAAHLASGARRLI----DPAEMGTLFKVLALANPLLPAPP 352 >gi|319781893|ref|YP_004141369.1| hypothetical protein Mesci_2167 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317167781|gb|ADV11319.1| protein of unknown function DUF185 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 362 Score = 310 bits (795), Expect = 2e-82, Method: Composition-based stats. Identities = 154/362 (42%), Positives = 224/362 (61%), Gaps = 9/362 (2%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 +L +IV+LI+ G + V++Y ALC+ DP GYY+T PFGA GDFVTAPEISQ+FGE+ Sbjct: 2 TRLKTRIVDLIEALGPLPVNEYMALCLFDPADGYYTTREPFGAGGDFVTAPEISQMFGEL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 +A+++ W G P V + E+GPGRG +M D+LR + +L PD + + M+ETS RL Sbjct: 62 VAVWMYQVWAASGRPLPVTIAEIGPGRGTLMKDMLRTLSRLDPDLANGATFAMIETSPRL 121 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 T +QKK L + W+ ++ +P F+V NE FD++PI+QF+ G RERM+ +D Sbjct: 122 TEVQKKTLGVTPFAVGWHETIETLPQQSLFIVGNELFDAVPIRQFIRAGAGWRERMVGLD 181 Query: 183 QHDSLVFNIGDHEIKSNFLT--CSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 + + L F G + L +D GAI E +P R M +I++R++ GG + +D Sbjct: 182 ETNDLCFFAGAGSVDPTLLPADAADAPQGAIAEVAPARTALMATIAERISRHGGAGLFLD 241 Query: 241 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 YG+ Q VGDTLQA++ H + L NPG+ADL+SHVDF L++IA + L + LTTQG Sbjct: 242 YGHFQPGVGDTLQALRSHDHEDVLANPGEADLTSHVDFAALAAIARAHGLEAH-LTTQGD 300 Query: 301 FLEGLGIWQRAFSLMKQTAR--KDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM 358 FL G+GI +RA L + ++ + V+RL A ++MGELFK+L V V + Sbjct: 301 FLLGMGILERAGRLGADAGQAARERIAGDVERL----AGPQAMGELFKVLAVLPRGVAVR 356 Query: 359 PF 360 PF Sbjct: 357 PF 358 >gi|83309693|ref|YP_419957.1| hypothetical protein amb0594 [Magnetospirillum magneticum AMB-1] gi|82944534|dbj|BAE49398.1| Uncharacterized conserved protein [Magnetospirillum magneticum AMB-1] Length = 357 Score = 310 bits (794), Expect = 2e-82, Method: Composition-based stats. Identities = 128/359 (35%), Positives = 188/359 (52%), Gaps = 11/359 (3%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L + + I+ G + V ++ A + PE+GYY +PFG GDF+TAPEISQ+FGE++ Sbjct: 2 TLADILADRIRATGPIPVSEFMAEALGHPEYGYYMGRDPFGMAGDFITAPEISQMFGELI 61 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 ++ W+ G P V L E+GPGRG +M D+LR L P L+++++ETS L Sbjct: 62 GLWCALVWQSMGAPKRVVLAEIGPGRGTLMADLLRAAQALPPFAL-ALNVHLIETSPSLR 120 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 Q + L + W+ D+P G LVANE FD+LPI+Q RER++ +D+ Sbjct: 121 NRQAQALTDR--SVEWHERFEDLPDGPLLLVANELFDALPIRQLEKAGGVWRERVVALDE 178 Query: 184 HDSLVFNIGDHEIKSNFLTC-SDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 + F G + GA+ E P +I+ RLA GG A++IDYG Sbjct: 179 AGAFAFAQGPVVAEPPLAPAVLGAADGAVAELCPQGRALAGTIARRLAHQGGAALIIDYG 238 Query: 243 YLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 Y +S GD+LQA+K H L PG AD+++HVDFQ L+ A +G QG FL Sbjct: 239 YGKSAAGDSLQALKSHKRHPVLSGPGTADITAHVDFQALAEAASGL-ARAHGPVPQGSFL 297 Query: 303 EGLGIWQRAFSLMKQ--TARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMP 359 LG+ +R LM+ + L +RL+ D MG LFK+L ++ + + P Sbjct: 298 ARLGLEERVRMLMQNATPEQAAHLASGARRLI----DPGEMGTLFKVLALAAPLLPVPP 352 >gi|56695809|ref|YP_166160.1| hypothetical protein SPO0907 [Ruegeria pomeroyi DSS-3] gi|56677546|gb|AAV94212.1| conserved hypothetical protein [Ruegeria pomeroyi DSS-3] Length = 355 Score = 309 bits (791), Expect = 5e-82, Method: Composition-based stats. Identities = 132/358 (36%), Positives = 193/358 (53%), Gaps = 10/358 (2%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L ++ I + G +++ Y A C+ PE+GYY+T +P G GDF TAPEISQ+FGE++ Sbjct: 2 SLTGLLLERIAQQGPLSLADYMAECLLHPEYGYYTTRDPLGVAGDFTTAPEISQMFGELI 61 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 + L AW G P+ LVELGPGRG +M D LR + P F +++VE S L Sbjct: 62 GLALAQAWMDQGRPAPFTLVELGPGRGTLMADALRATRAV-PGFHEAARLWLVEASPVLR 120 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 Q + LA + W +++D+P G F VANEFFD+LP++QF RER++ Sbjct: 121 ATQAQALAGH--DPQWCDTVSDLPAGPLFGVANEFFDALPVRQFQRAGAVWRERLVGARD 178 Query: 184 HDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY 243 + + D G + E P + ++ R+A DGG A+++DYG Sbjct: 179 GALCWGLGAEALQPALAHRLEDTREGDLVELCPAAGLILSELASRIAADGGAALIVDYGD 238 Query: 244 LQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLE 303 +S +GDT+QA++ H PL +PGQADL++HVDF+ L+ A + L+TQG FLE Sbjct: 239 WRS-LGDTVQALRNHAPADPLADPGQADLTAHVDFEVLAMTARAAGCAHSRLSTQGVFLE 297 Query: 304 GLGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMP 359 LGI QRA +L + D L+ + +RL + MG LFK+L + P Sbjct: 298 RLGIAQRAQALARHADEAALDRLITAHRRLTH----PEEMGNLFKVLGLYPSDATPPP 351 >gi|163737769|ref|ZP_02145186.1| hypothetical protein RGBS107_19598 [Phaeobacter gallaeciensis BS107] gi|161389295|gb|EDQ13647.1| hypothetical protein RGBS107_19598 [Phaeobacter gallaeciensis BS107] Length = 355 Score = 308 bits (788), Expect = 1e-81, Method: Composition-based stats. Identities = 126/361 (34%), Positives = 189/361 (52%), Gaps = 10/361 (2%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L +I + I+ G ++V Y A + P +GYY+T +P G GDF+TAPEISQ+FGE++ Sbjct: 2 SLSERIASRIRTEGPISVADYMAEALLHPTYGYYTTRDPLGRAGDFITAPEISQMFGEVI 61 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 + L W G P L ELGPGRG +M D+LR ++ P F + I ++E S L Sbjct: 62 GLALAQCWLDQGSPKPFTLAELGPGRGTLMADLLRATKQV-PGFHDAMQIALLEASPTLR 120 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 Q + L+ + W +L +P FL+ANEFFD+LP++QF+ G RE+ + + Sbjct: 121 SRQAETLSGHTPL--WLDTLEALPEQPLFLIANEFFDALPVRQFLRDGDGWREKSVGLQD 178 Query: 184 HDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY 243 + D + E +Q+I+ R+A GG A+++DYG Sbjct: 179 GKLSFGLGAAAPQPALAHRLEDTRDDDLVELCEAAQPMVQTIAARIATHGGAALIVDYGD 238 Query: 244 LQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLE 303 +S +GDTLQA++ H PL +PG ADL++HVDF+ L+ A + LT QG FLE Sbjct: 239 WRS-LGDTLQALRAHAPSDPLNDPGSADLTTHVDFEALTLAAKAAGCTHSRLTPQGVFLE 297 Query: 304 GLGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMPFV 361 LGI RA +L + L+ + +RL + MG LFK+L + + P + Sbjct: 298 RLGITDRAQALAARLEGDSLQSLIVAHRRLTH----PEEMGNLFKVLGIFPMQASPPPGL 353 Query: 362 N 362 N Sbjct: 354 N 354 >gi|163761592|ref|ZP_02168663.1| hypothetical protein HPDFL43_13802 [Hoeflea phototrophica DFL-43] gi|162281188|gb|EDQ31488.1| hypothetical protein HPDFL43_13802 [Hoeflea phototrophica DFL-43] Length = 378 Score = 307 bits (786), Expect = 2e-81, Method: Composition-based stats. Identities = 161/364 (44%), Positives = 234/364 (64%), Gaps = 6/364 (1%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 + L K+ +I++ G + + +FALC+ADP+ GYY T PFG GDF+TAPE+SQ+FG Sbjct: 12 LRTPLAEKMARIIEQAGPLKISDFFALCLADPDHGYYKTREPFGRSGDFITAPEVSQLFG 71 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 EM+ +FL+ AW+ G P VR+ E+GPGRG +M D LRVI KL PD ++ +I+MVETS+ Sbjct: 72 EMIGVFLVHAWQAQGAPDQVRIAEIGPGRGTLMSDALRVIAKLAPDLYANATIHMVETSD 131 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 RL Q++ L D+I W+ + ++P GFT +VANE FD++PI QFV T +G RER++ Sbjct: 132 RLRNEQRQTLVRIKDRICWHQAFEEIPAGFTLMVANELFDAIPIHQFVKTPNGFRERVVG 191 Query: 181 IDQHDSLVFNIGDHEIKSNFLTCSDY--FLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 +D++ L F IG + L + G IFE +P R MQ+++ +L DGGTA+ Sbjct: 192 LDENGRLAFGIGTGSFDPSLLPVDEAAVPEGEIFELAPARSAVMQAVASKLVRDGGTALS 251 Query: 239 IDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQ 298 IDYG+L + GDTLQAV H + +PL PG+ADL+SHVDFQ L+ A+ +++ TQ Sbjct: 252 IDYGHLVTGFGDTLQAVYRHEFDTPLARPGEADLTSHVDFQALAEAAVAAGAHLHRPLTQ 311 Query: 299 GKFLEGLGIWQRAFSL--MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE 356 G+FL GLG+ +RA +L + + + D+V RL + MG LFK+L +S +V Sbjct: 312 GEFLVGLGLVERAGALGSGRDALTQAAIRDAVNRL--AGEGEGRMGALFKVLAISGAQVR 369 Query: 357 LMPF 360 + PF Sbjct: 370 IAPF 373 >gi|319408906|emb|CBI82563.1| conserved hypothetical protein [Bartonella schoenbuchensis R1] Length = 362 Score = 307 bits (785), Expect = 3e-81, Method: Composition-based stats. Identities = 148/362 (40%), Positives = 210/362 (58%), Gaps = 5/362 (1%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L KI +I NG +TV QY L + DP+FGYY T PFG+ GDF+TAPEISQ+FGEM Sbjct: 2 TTLKEKIKEIIAANGPITVSQYMTLALTDPQFGYYQTQEPFGSTGDFITAPEISQLFGEM 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + I++ +W+ G P+ L E+GPGRG +M D+LR I KL F+ I++VE S+RL Sbjct: 62 IGIWVFASWKAQGSPNPFILAEIGPGRGTLMDDVLRTIQKLCKTAFNAAEIFLVEISQRL 121 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 QKK+L+SY I+ +P G FL+ANE FD+LPI Q++ RER I +D Sbjct: 122 ATEQKKRLSSYQKHIHTIEHFNQIPSGPLFLIANELFDALPIHQYIKINGEWRERCITLD 181 Query: 183 QHDSLVFNIGDHEIKSNFLT--CSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 Q F G H+ S L C+ G I+E +P R++ MQ IS+RL D G+A++ID Sbjct: 182 QDGHFTFIAGAHKFSSGDLPVYCAQMPDGTIWEYAPLRNQLMQQISNRLIQDKGSALLID 241 Query: 241 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 YG GDTLQA+ H + NPG+ DL+SHVDF L +IA+ + + QG Sbjct: 242 YGASDCAFGDTLQAISKHKFRDVFANPGENDLTSHVDFFTLKTIALQEGCFAE-ILEQGD 300 Query: 301 FLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMPF 360 FL +GI +RA L + ++ + +++ + MG+LFK+L VS + + F Sbjct: 301 FLVKMGILERAKQLS--INKDTLIQNKIRQDIERLVSPDQMGKLFKVLHVSDQSTTISNF 358 Query: 361 VN 362 + Sbjct: 359 FD 360 >gi|86136505|ref|ZP_01055084.1| hypothetical protein MED193_20319 [Roseobacter sp. MED193] gi|85827379|gb|EAQ47575.1| hypothetical protein MED193_20319 [Roseobacter sp. MED193] Length = 356 Score = 306 bits (783), Expect = 5e-81, Method: Composition-based stats. Identities = 140/362 (38%), Positives = 198/362 (54%), Gaps = 10/362 (2%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 + L +++ I +G +++ + A C+ PE GYY+T +PFG GDF TAPEISQ+FGE+ Sbjct: 2 SPLTDQLLARISSDGPISLADFMAECLLHPEHGYYTTRSPFGTQGDFTTAPEISQMFGEL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L + L +W G P L ELGPGRG +M D+LR + P F + L +Y+VE S L Sbjct: 62 LGLSLAQSWLNQGAPDTFTLAELGPGRGTLMADLLRATRGV-PGFHTALQLYLVEASPNL 120 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 Q K LA Y W + +P FLVANEFFD+LPI+QFV G RE+ I + Sbjct: 121 QEQQAKALARY--DATWVDTADALPQQPLFLVANEFFDALPIRQFVRDGDGWREKRIGLV 178 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 + D G + E SP + S++ R+A GG A+++DYG Sbjct: 179 DGGLGFGLGPAAPQPALEHRLRDTTDGDLVELSPGAAPILSSLAQRIASHGGAALIVDYG 238 Query: 243 YLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 +S +GDTLQA+K HT V PL PG+ADL++HVDF+ L S+A + +T QG FL Sbjct: 239 DWRS-LGDTLQALKSHTPVEPLETPGEADLTAHVDFEVLCSVAAGAGCAHSKVTPQGVFL 297 Query: 303 EGLGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMPF 360 E LGI RA +L + + L+ + +RL MG LFK+L ++ + P Sbjct: 298 ERLGITDRARNLAAGLEGEALESLIAAHRRLTH----PSEMGNLFKVLGLTPADIAPPPG 353 Query: 361 VN 362 +N Sbjct: 354 LN 355 >gi|126739895|ref|ZP_01755586.1| hypothetical protein RSK20926_14444 [Roseobacter sp. SK209-2-6] gi|126719127|gb|EBA15838.1| hypothetical protein RSK20926_14444 [Roseobacter sp. SK209-2-6] Length = 356 Score = 305 bits (782), Expect = 6e-81, Method: Composition-based stats. Identities = 138/359 (38%), Positives = 202/359 (56%), Gaps = 10/359 (2%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 + L +++V I++NG +++ +Y + C+ PEFGYYST +PFG GDFVTAPEISQ+FGE+ Sbjct: 2 SSLEQQLVARIQENGPISLAEYMSECLLHPEFGYYSTRDPFGQSGDFVTAPEISQMFGEL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L + L W G PS LVELGPGRG +M D+LR + + +++VE S +L Sbjct: 62 LGLCLAQCWLDQGAPSPFALVELGPGRGTLMRDLLRATAGVTGFH-QAMQVFLVEASPKL 120 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 Q K L Y ++W ++P FLVANEFFD+LP +QFV G RER+I ++ Sbjct: 121 QREQAKALEEY--DVSWVAEPMELPNLPVFLVANEFFDALPARQFVRDSDGWRERLIGLE 178 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 + + + D G + E ++ I+DR++ GG A++IDYG Sbjct: 179 EGKLGFGLGSATDQPALAYRLEDTRPGDLVELCSPAATLLEPIADRISTFGGAALIIDYG 238 Query: 243 YLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 +S +GDTLQA++ H PL NPG+ADL+ HVDF+ L+ + +T QG FL Sbjct: 239 DWRS-LGDTLQALRAHKSTPPLENPGKADLTLHVDFEFLAQSTKSTGCAHSRVTPQGVFL 297 Query: 303 EGLGIWQRAFSLMK--QTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMP 359 E LGI +RA +L + Q D L+ + +RL + MG LFK+L + + P Sbjct: 298 ERLGITERAQALSRALQGEALDTLIAAHRRLTH----PEEMGNLFKVLGLYPAQFSPPP 352 >gi|115523453|ref|YP_780364.1| hypothetical protein RPE_1433 [Rhodopseudomonas palustris BisA53] gi|115517400|gb|ABJ05384.1| protein of unknown function DUF185 [Rhodopseudomonas palustris BisA53] Length = 372 Score = 305 bits (781), Expect = 7e-81, Method: Composition-based stats. Identities = 137/356 (38%), Positives = 205/356 (57%), Gaps = 12/356 (3%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGE 61 + L I LI+ G M V +Y LC+ PE GYY + +P G GDF+T+PE+SQ+FGE Sbjct: 3 DQPLHDTIKKLIRSAGPMPVWRYMELCLTHPEHGYYVSRDPLGREGDFITSPEVSQMFGE 62 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 +L ++ W+ G P VRL+ELGPGRG +M D +R + L P + +S+++VE + Sbjct: 63 LLGLWAASVWKAIGSPQQVRLIELGPGRGTLMADAMRALRVLPPL-YQAISVHLVEINPV 121 Query: 122 LTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 L Q+ LA+ + + W+ L +VP G + + ANE+FD LP+ Q V EHG ER+I+I Sbjct: 122 LRDKQRDTLANLSN-VAWHADLDEVPGGTSIIFANEYFDVLPVHQAVRGEHGWHERVIEI 180 Query: 182 DQHDSLVFNIGDHEIKSNFLTCSDY----FLGAIFENSPCRDREMQSISDRLACDGGTAI 237 D L F I + GA+FE D E+ I+ R+ +GG A+ Sbjct: 181 DAEGDLTFGAAAEPIPQFEVLLPPLVRAAPPGAVFEWR--ADSEIMKIASRVRDEGGAAL 238 Query: 238 VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTT 297 +IDYG+L+S GDT QA+ H++ PL NPGQAD+++HVDFQ L+ A ++G T Sbjct: 239 IIDYGHLRSDAGDTFQAIAKHSFADPLANPGQADVTAHVDFQALAQAAEAVGARVHGPVT 298 Query: 298 QGKFLEGLGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVS 351 QG+FL LGI RA +LM + + + + + +++KRL +FK++ VS Sbjct: 299 QGEFLRRLGIETRALALMAKASHEISEDVANALKRLTGGGRGGMG--SMFKVIGVS 352 >gi|254464326|ref|ZP_05077737.1| ATP synthase beta subunit/transription termination factor rho [Rhodobacterales bacterium Y4I] gi|206685234|gb|EDZ45716.1| ATP synthase beta subunit/transription termination factor rho [Rhodobacterales bacterium Y4I] Length = 355 Score = 305 bits (781), Expect = 7e-81, Method: Composition-based stats. Identities = 138/362 (38%), Positives = 196/362 (54%), Gaps = 12/362 (3%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L+ + I+ +G M+V +Y C+ P+FGYY+T +P GA GDF TAPEISQ+FGE+L Sbjct: 2 SLMDHLSARIRADGPMSVAEYMGDCLLHPQFGYYTTRDPLGAQGDFTTAPEISQMFGELL 61 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 + L AW G P+ L ELGPGRG +M D+LR + P F + + I++VE S L Sbjct: 62 GLALAQAWMDQGSPAPFTLAELGPGRGTLMADLLRATRSV-PGFHAAMQIHLVEASPALR 120 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 Q K L Y W S ++P FL+ANEFFD+LPI+QF+ G E+ I + Sbjct: 121 AAQAKALEGY--APAWLDSADNLPDQPLFLIANEFFDALPIRQFLRAGGGWSEKRIGL-T 177 Query: 184 HDSLVFNIGDHEIKSNF-LTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 +L F + + +D G + E QSI+ R+A GG A+++DYG Sbjct: 178 DGALSFGLTPAAPQPALAHRLADTRDGDLVEICEPAAPITQSIAARIAAHGGAALIVDYG 237 Query: 243 YLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 +GDTLQA++ H PL PG+ADL++HVDF+ L++ A LT QG FL Sbjct: 238 -DWRALGDTLQALRAHEPADPLQTPGEADLTAHVDFEALANAAKTAGCAFTRLTPQGVFL 296 Query: 303 EGLGIWQRAFSLMK--QTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMPF 360 E LGI RA +L Q + L+ + +RL + MG LFK+L + + P Sbjct: 297 ERLGITDRARALAAPLQGGSLETLIAAHRRLTH----PEEMGNLFKVLGLYPSQAAPPPG 352 Query: 361 VN 362 +N Sbjct: 353 LN 354 >gi|163867945|ref|YP_001609149.1| hypothetical protein Btr_0732 [Bartonella tribocorum CIP 105476] gi|161017596|emb|CAK01154.1| conserved hypothetical protein [Bartonella tribocorum CIP 105476] Length = 359 Score = 305 bits (781), Expect = 7e-81, Method: Composition-based stats. Identities = 141/361 (39%), Positives = 204/361 (56%), Gaps = 9/361 (2%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L KI +I +G +TV +Y L + D +FGYY T PFG GDF+TAPEISQ+FGEM Sbjct: 2 TPLKEKIKEIIASHGPITVSEYMTLALTDHQFGYYQTQRPFGRTGDFITAPEISQLFGEM 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + I+ + +W+ G P L E+GPGRG +M DILR I KL F+ I+++E S++L Sbjct: 62 IGIWALASWKAQGCPHPFILAEIGPGRGTLMDDILRTIQKLSAIAFNAAEIFLIEISKKL 121 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 QK++L SY KI+ +L +P FL+ NEF D+LPI Q++ +ER I I+ Sbjct: 122 AKEQKQRLFSYQKKIHSIENLNQIPPKPLFLIGNEFLDTLPINQYIKVNGEWKERCITIN 181 Query: 183 QHDSLVFNIGDHEIK--SNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 Q +F H++ CS G IFE++P R + MQ IS+ L G+A++ID Sbjct: 182 QDGDFIFIAAPHKLPSSCLQTYCSKVPDGTIFEHAPLRHQFMQQISNHLVQVTGSALLID 241 Query: 241 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 YG GDTLQA+ H + PG+ DL+SHVDF L +IA+ + QG+ Sbjct: 242 YGARDLAFGDTLQALSKHRFRDVFDAPGEHDLTSHVDFSFLKNIALEQGCFAEIF-EQGE 300 Query: 301 FLEGLGIWQRAFSL--MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM 358 FL +G+ +RA L K + +D + ++RL A MG+LFK+L S + + + Sbjct: 301 FLLKMGLLERAQQLGAGKSASLQDKIRQDIERL----ASPDQMGKLFKVLHFSDKNIPIP 356 Query: 359 P 359 P Sbjct: 357 P 357 >gi|49475904|ref|YP_033945.1| hypothetical protein BH11810 [Bartonella henselae str. Houston-1] gi|49238712|emb|CAF27964.1| hypothetical protein BH11810 [Bartonella henselae str. Houston-1] Length = 359 Score = 304 bits (779), Expect = 1e-80, Method: Composition-based stats. Identities = 148/360 (41%), Positives = 206/360 (57%), Gaps = 9/360 (2%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L +KI +I NG +TV QY L + DP+FGYY T PFG GDF+TAPEISQ+FGEM+ Sbjct: 3 TLKQKIKEMIALNGPITVSQYMTLALTDPQFGYYKTQTPFGRTGDFITAPEISQLFGEMI 62 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 I+ + W+ HG P+ L E+GPGRG +M DILR I KL F+ I+++E S++L Sbjct: 63 GIWALANWKAHGCPAPFILAEIGPGRGTLMDDILRTIQKLSIKAFNAAEIFLIEISKKLA 122 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 QKK+LA Y +I + +PL L+ANEF D+LPI Q++ RER I ++Q Sbjct: 123 KEQKKRLAPYQKEIYSIENFDQLPLKPLLLIANEFLDTLPINQYIKINGEWRERRITVNQ 182 Query: 184 HDSLVFNIGDHEIKSNFLTCSD--YFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDY 241 + VF + FL D G IFE++P R + MQ IS+RL G+A++IDY Sbjct: 183 NGDFVFIAAPGKFSFPFLQFCDSEIPDGKIFEHAPSRHQFMQQISNRLIQVKGSALLIDY 242 Query: 242 GYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKF 301 G GDTLQA+ H + PG+ DL++HV F L IA+ + QG F Sbjct: 243 GASNLAFGDTLQALSKHRFRDIFDAPGEHDLTTHVGFSFLKKIALEQGC-FAKILEQGDF 301 Query: 302 LEGLGIWQRAFSLM--KQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMP 359 L +G+ +RA L K A +D + ++RL A MG+LFK+L VS++ + L P Sbjct: 302 LVKMGLLERAKQLAADKNAALQDKIHQDIERL----AGPDQMGKLFKVLHVSNQNIPLPP 357 >gi|158425607|ref|YP_001526899.1| hypothetical protein AZC_3983 [Azorhizobium caulinodans ORS 571] gi|158332496|dbj|BAF89981.1| protein of unknown function [Azorhizobium caulinodans ORS 571] Length = 367 Score = 304 bits (779), Expect = 1e-80, Method: Composition-based stats. Identities = 141/364 (38%), Positives = 198/364 (54%), Gaps = 12/364 (3%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 M N L +I LI+ G ++V +Y ALC+ P GYY T +PFGA GDF+TAPEISQ+FG Sbjct: 1 MTNPLKDEIRALIEVEGPISVGRYMALCLGHPRHGYYVTRDPFGAGGDFITAPEISQMFG 60 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 E++ ++ + W+Q G PS RLVELGPGRG +M D LR L P F + + +++VE S Sbjct: 61 ELIGLWAVACWQQMGEPSSFRLVELGPGRGTLMADALRAAR-LVPAFGAAMRLHLVEMSP 119 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 L Q + L + W+ + DVP G ++ANEFFD+LP+ QFV G ER + Sbjct: 120 VLRRRQAETLKDH--APQWHDRIEDVPEGPAIVIANEFFDALPVDQFVRGPTGWHERRVG 177 Query: 181 IDQHDSLVFNIGDHEIKSNFLTCSDYF---LGAIFENSPCRDREMQSISDRLACDGGTAI 237 +D SLVF + + + + G + E + A GGTA+ Sbjct: 178 LDVTGSLVFGLDPRPFRPIEAFAAGFPRPAEGDLLERMESGPARALAARL--AAQGGTAL 235 Query: 238 VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTT 297 +DYG+ +S GDTLQA+K H + +PL PG+ADL++HVDF L+ +A G T Sbjct: 236 ALDYGHARSGFGDTLQAMKDHRFTNPLAEPGEADLTAHVDFAALAGMARAAGARAFGPLT 295 Query: 298 QGKFLEGLGIWQRAFSL--MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKV 355 QG FL LGI RA +L A+K + ++ RL MG LFK+L ++ Sbjct: 296 QGDFLRRLGIEARAATLMGSASAAQKAAIGSALTRLTGAGT--GEMGTLFKVLALAAPTF 353 Query: 356 ELMP 359 P Sbjct: 354 GPPP 357 >gi|299134263|ref|ZP_07027456.1| protein of unknown function DUF185 [Afipia sp. 1NLS2] gi|298591010|gb|EFI51212.1| protein of unknown function DUF185 [Afipia sp. 1NLS2] Length = 369 Score = 304 bits (778), Expect = 1e-80, Method: Composition-based stats. Identities = 144/362 (39%), Positives = 200/362 (55%), Gaps = 13/362 (3%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 + LI G M V +Y LC+ PE+GYY +P G GDF TAPEISQ+FGE+ Sbjct: 7 TPFELYLRQLIATAGPMPVSRYMQLCMTHPEYGYYVNRDPLGRGGDFTTAPEISQMFGEL 66 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + ++ W G P V+L+ELGPGRG MM D LR I L P FF + +++VE S L Sbjct: 67 IGLWAASVWNAMGMPEHVQLIELGPGRGTMMADALRAIRIL-PAFFDAIDVHLVEVSPSL 125 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 IQ+ LA W+ L+DVP G ++ANE+FD LPI Q V + G ERM+D+ Sbjct: 126 RAIQRDTLADVKPF-QWHHLLSDVPDGPAIILANEYFDVLPIHQMVKKDTGWHERMVDV- 183 Query: 183 QHDSLVFNIGDHEIKSNFLTCS----DYFLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 D VF ++ + G IFE E+ +++ R+ GG A++ Sbjct: 184 DDDIFVFTTAPEPTPRFEVSVPPHVRNAPNGTIFEWRE--TNEIMALARRIREFGGAALL 241 Query: 239 IDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQ 298 IDYG+++S GDT QAV H Y PL PG ADL++HVDFQ L+ A + +G Q Sbjct: 242 IDYGHIRSDAGDTFQAVAKHKYTDPLRAPGTADLTAHVDFQALADAAQSMGVLAHGPVEQ 301 Query: 299 GKFLEGLGIWQRAFSLMKQTARKD--ILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE 356 G FL LGI RA +L+K +A + + ++KRLV + K+MG LFK+L +SH + Sbjct: 302 GTFLGHLGIETRAQTLIKHSASQSAGDVASALKRLVESG--PKAMGSLFKVLGLSHPSIP 359 Query: 357 LM 358 + Sbjct: 360 AL 361 >gi|114707030|ref|ZP_01439929.1| hypothetical protein FP2506_03224 [Fulvimarina pelagi HTCC2506] gi|114537580|gb|EAU40705.1| hypothetical protein FP2506_03224 [Fulvimarina pelagi HTCC2506] Length = 370 Score = 303 bits (777), Expect = 2e-80, Method: Composition-based stats. Identities = 139/366 (37%), Positives = 199/366 (54%), Gaps = 8/366 (2%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 M + L KI I++ G MTV++++ + + D GYYS PFG GDF TAPEISQ+FG Sbjct: 1 MTSVLSEKITQEIRERGPMTVERFWEIALFDRRHGYYSAHEPFGRAGDFTTAPEISQMFG 60 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 E++ + AW Q G P L E+GPGRG +M D+LR + + PD + I +VETSE Sbjct: 61 ELIGAWCAGAWVQLGRPVPFLLTEIGPGRGTLMADLLRTLRRTAPDCLAAARIRLVETSE 120 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 RL +Q +L + I + ++ +VANE FD++PI+Q V E ER I+ Sbjct: 121 RLAALQASRLEGFDLPIKRVRRIGELEEMPAVVVANELFDAVPIRQTVFHEGRWHERTIE 180 Query: 181 IDQHDSLVFNIGDH----EIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTA 236 + SL F +G + + GAI E SP R+ + RL G Sbjct: 181 LQNDGSLGFAMGKTLDSLPACLSVIKMHGIEDGAIAEFSPAREGLAAELGARLRRTSGAG 240 Query: 237 IVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLT 296 + IDYG+ Q+ VGDT QAV H YV L NPG+ADL+SHVDF+ L+ L + + Sbjct: 241 LFIDYGHAQTAVGDTFQAVAAHRYVPVLDNPGEADLTSHVDFESLTRRFGEAGLTSSPVR 300 Query: 297 TQGKFLEGLGIWQRAFSLMK--QTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 TQG+FL LG+ +RA L A +D + V+RL + + MG+LFK+ V+ Sbjct: 301 TQGEFLLSLGLLERAGRLGAPLDEAGRDRIRMDVQRL--AGSGPEDMGDLFKVFAVASAP 358 Query: 355 VELMPF 360 +++ PF Sbjct: 359 LDIPPF 364 >gi|296447398|ref|ZP_06889324.1| protein of unknown function DUF185 [Methylosinus trichosporium OB3b] gi|296255101|gb|EFH02202.1| protein of unknown function DUF185 [Methylosinus trichosporium OB3b] Length = 359 Score = 303 bits (777), Expect = 2e-80, Method: Composition-based stats. Identities = 146/362 (40%), Positives = 218/362 (60%), Gaps = 8/362 (2%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 + L +I+ +I++ G +T+++Y ++ +A P GYY T +PFGA GDF+TAPEISQ+FGE+ Sbjct: 2 SALRDEIIAMIEQEGPITLERYMSIALAHPTLGYYMTRDPFGAGGDFITAPEISQMFGEL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L ++ AW G PS V+L+ELGPGRG +M D+LRV + P F +++VETS L Sbjct: 62 LGLWAQEAWRAAGSPSPVQLIELGPGRGTLMSDVLRVAR-IAPSFLFSSEVHLVETSPVL 120 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 Q++ LA + ++W +A +P G ++ANEFFD+LP++ +V T G ER++ +D Sbjct: 121 EAAQRRTLAEATN-VSWSADIAAIPPGPAIILANEFFDALPVRHYVRTARGWSERLLGLD 179 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 +L F +G+ + D G+I E R M I+ RL GG +VIDYG Sbjct: 180 DAGALAFGVGEAIEPGLTV---DAPEGSIIEIGAVGARLMSEIAARLVAHGGAMLVIDYG 236 Query: 243 YLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 Y Q+ +G++LQAV H YV PL PG+ADL++HVDF L+ A + G TQG FL Sbjct: 237 YTQTALGESLQAVARHAYVDPLEAPGEADLTAHVDFAALARAATAAGARVQGPVTQGAFL 296 Query: 303 EGLGIWQRAFSLMKQ--TARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE-LMP 359 LG+ QRA +L K+ + + +++RL K+ MGELFK++ V+H + L Sbjct: 297 TNLGVVQRAEALQKRATPEQAADIAAALQRLTGADDHKRDMGELFKVMAVTHPAMPELPG 356 Query: 360 FV 361 FV Sbjct: 357 FV 358 >gi|90418278|ref|ZP_01226190.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] gi|90337950|gb|EAS51601.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] Length = 380 Score = 302 bits (772), Expect = 8e-80, Method: Composition-based stats. Identities = 131/364 (35%), Positives = 199/364 (54%), Gaps = 8/364 (2%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 N L RKI ++ G + VD+Y+ + D E GYY+ +PFG GDFVTAPE+SQ+FGE+ Sbjct: 2 NALARKIAEHVRVEGPLGVDRYWNFALYDREHGYYARRDPFGRAGDFVTAPEVSQMFGEL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L ++ AW G P LVE GPGRG MM D+LR + PD + +VETS+RL Sbjct: 62 LGAWVASAWTGLGGPDAFLLVECGPGRGTMMADMLRALRSAAPDCMRAADVRLVETSDRL 121 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 Q + L + I + D+ +VANE FD++ ++Q+V RER + Sbjct: 122 ADEQMQTLGRFDLPIRRVRRIEDLERRPMVVVANELFDAVAVRQYVFDGSEWRERCVSTT 181 Query: 183 QHDSLVFNIGDHEIKSNF----LTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 F + + + + + + GA+ E SP R+R +++RLA DGG ++ Sbjct: 182 DSGRFEFVLCEARPQVSDAVRAMGLVEPKAGAVLEVSPARERIAAGLAERLATDGGASLF 241 Query: 239 IDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQ 298 IDYG+ +S GDTLQA+ GH + PL PG+ D++SHVDF R+++ LY++ TQ Sbjct: 242 IDYGHSRSGYGDTLQAMHGHAFADPLDRPGENDITSHVDFDRIAAPFRASGLYVSPTVTQ 301 Query: 299 GKFLEGLGIWQRAFSL--MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE 356 FL LG+ +RA +L + + ++ +V+RL MG++FK+L + + Sbjct: 302 SDFLLTLGLLERAGALGAARDEKTRTEIIGAVQRL--AGTGPGDMGQVFKVLCAASRPMR 359 Query: 357 LMPF 360 L PF Sbjct: 360 LSPF 363 >gi|254475299|ref|ZP_05088685.1| ATP synthase beta subunit/transription termination factor rho [Ruegeria sp. R11] gi|214029542|gb|EEB70377.1| ATP synthase beta subunit/transription termination factor rho [Ruegeria sp. R11] Length = 356 Score = 300 bits (768), Expect = 2e-79, Method: Composition-based stats. Identities = 129/362 (35%), Positives = 203/362 (56%), Gaps = 11/362 (3%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L I+ I+ +G ++V Y A + P +GYY+T +P GA GDF TAPEISQ+FGE++ Sbjct: 2 SLQDHIIARIRTDGPISVADYMAEALLHPTYGYYTTRDPLGASGDFTTAPEISQMFGELI 61 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 + L W G P+ L ELGPGRG +M D+LR + P F + + I ++E S L Sbjct: 62 GLALAQTWIDQGSPTPFTLAELGPGRGTLMADVLRATKAV-PGFHAAMRISLLEASPTLR 120 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 Q + L+ Y W+ ++ ++P FLVANEFFD+LP++QF+ G RE+ + + + Sbjct: 121 KRQAEALSGY--AATWHENIEELPDQALFLVANEFFDALPVRQFLRDGEGWREKSVGLSE 178 Query: 184 HDSLVFNIGDHEIKSNF-LTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 +L F +G + D + E + +++ R+A GG A+++DYG Sbjct: 179 DGALQFGLGPVAPQPALSHRIEDTSDNDLVELCEAAQPIVSTLAQRIAAYGGCALIVDYG 238 Query: 243 YLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 +S +GDTLQA++ H PL++PG+ADL++HVDF+ L+ A + +T QG FL Sbjct: 239 DWRS-LGDTLQALRSHAPSDPLLSPGEADLTTHVDFEALALAAKSAGAEFSRVTPQGVFL 297 Query: 303 EGLGIWQRAFSLMKQTA--RKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMPF 360 E LGI RA +L + + + D L+ + +RL + MG LFK+L + ++ P Sbjct: 298 ERLGITARAQALAARLSGQQLDQLIAAHRRLTH----PEEMGNLFKVLALYPAQMTPPPG 353 Query: 361 VN 362 +N Sbjct: 354 LN 355 >gi|126735170|ref|ZP_01750916.1| hypothetical protein RCCS2_14874 [Roseobacter sp. CCS2] gi|126715725|gb|EBA12590.1| hypothetical protein RCCS2_14874 [Roseobacter sp. CCS2] Length = 352 Score = 299 bits (766), Expect = 4e-79, Method: Composition-based stats. Identities = 132/359 (36%), Positives = 190/359 (52%), Gaps = 14/359 (3%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L ++ I + G +++ Y A C+ PE GYY+T +PFGA GDF TAPEISQ+FGE+ Sbjct: 2 TALGDLLIARIARTGPISLADYMADCLMHPEHGYYATRDPFGAAGDFTTAPEISQMFGEL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + + L AW G P+ L ELGPGRG +M D LR + P F L++++VETS L Sbjct: 62 IGLSLAQAWIDQGCPAPFTLAELGPGRGTLMADALRATKAV-PGFHDALTVHLVETSPVL 120 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 Q K + W+ S+ +P FL+ANEFFD+LPI+QF RE+M+ + Sbjct: 121 RAAQAKLIPD----ATWHDSVDHLPDAPLFLIANEFFDALPIRQFTRDGDAWREKMVGVT 176 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 I +D G + E+ + +IS ++ GG A++IDYG Sbjct: 177 DGKLGFGLSAAAPIALLEDRLADTKDGDLVEHCLALPSIVSTISAKIEMHGGCAVIIDYG 236 Query: 243 YLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 QS+ GDTLQA+K + PL PG ADL++HVDF +++ L K + LT QG FL Sbjct: 237 DWQSQ-GDTLQALKSQDHADPLATPGDADLTAHVDFAAIAANTGLAKF--SRLTPQGVFL 293 Query: 303 EGLGIWQRAFSLMKQTARKD--ILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMP 359 E LGI QRA +L + + + + +RL MG+LFK++ + P Sbjct: 294 ERLGITQRAQALATKLSGDALTSHIAAHRRLTH----PAEMGDLFKVISIHPSNANPPP 348 >gi|319403826|emb|CBI77413.1| conserved hypothetical protein [Bartonella rochalimae ATCC BAA-1498] Length = 358 Score = 299 bits (766), Expect = 4e-79, Method: Composition-based stats. Identities = 138/362 (38%), Positives = 204/362 (56%), Gaps = 9/362 (2%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 + L +I +I +G ++V QY L + DP+FGYY PFG GDF+TAPEISQ+FGEM Sbjct: 2 SNLKERIKEIIILDGPISVSQYMTLALTDPQFGYYQKQKPFGRAGDFITAPEISQLFGEM 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + I+ I +W+ G P+ L E+GPGRG +M DILR I K+ F+ I+++E S+RL Sbjct: 62 IGIWAIMSWQAQGCPNPFILAEIGPGRGTLMDDILRTIRKICITAFNAADIFLIEISQRL 121 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 QKK+L ++ KI + +PL ++ANE FD+LPI Q+V +ER I ++ Sbjct: 122 ATEQKKRLFAHQKKIYSVENFEQIPLKPLIVIANELFDALPINQYVKVNGEWKERRITLN 181 Query: 183 QHDSLVFNIGDHEIKSNFLTC--SDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 + F H+ S FL + G I E P R++ Q IS RL G+A++ID Sbjct: 182 KEGGFTFTTDIHKFPSTFLPPQCAQMPNGTILEYGPSRNQLAQKISSRLMQTQGSALLID 241 Query: 241 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 YG GDTLQA+ H + PG+ DL+SHVDF L +IA ++ + QG Sbjct: 242 YGASDFAFGDTLQAISRHKFCDIFSAPGEHDLTSHVDFFSLKTIAAQQGCFVE-ILEQGN 300 Query: 301 FLEGLGIWQRAFSLMKQTAR--KDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM 358 FL +G+ +RA L + K + + +KRL D K MG+LFK+L ++ + +++ Sbjct: 301 FLIKMGVLERAEKLSINKSTIIKKKIYEDIKRLT----DPKQMGKLFKVLHINDKNIQIP 356 Query: 359 PF 360 F Sbjct: 357 YF 358 >gi|126725164|ref|ZP_01741007.1| hypothetical protein RB2150_15051 [Rhodobacterales bacterium HTCC2150] gi|126706328|gb|EBA05418.1| hypothetical protein RB2150_15051 [Rhodobacterales bacterium HTCC2150] Length = 354 Score = 299 bits (765), Expect = 5e-79, Method: Composition-based stats. Identities = 131/357 (36%), Positives = 186/357 (52%), Gaps = 8/357 (2%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 KL + I +G +++ Y A C+ PE GYYS +PFGA GDF TAPEISQ+FGE+ Sbjct: 2 TKLNAILQARIGNHGPISIADYMAECLLHPELGYYSRRDPFGAKGDFTTAPEISQMFGEL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L ++L +W G P L E+GPGRG +M D+ R + P F Y++E S L Sbjct: 62 LGLWLAQSWIDAGQPPSFVLAEIGPGRGTLMADVWRATKGV-PGFHDAAKPYLIEASAHL 120 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 +QK L NW ++ ++P FL+ANEFFD+LPI+Q+ + G E +I + Sbjct: 121 RSVQKATLGG--VNANWVGTIDELPDAPLFLLANEFFDALPIRQYKRQKSGWSELLIGGN 178 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 S D G + E M++++DR+ +GG A+V+DYG Sbjct: 179 ADGLCFGQAAPSPNTSLDHRLDDTNEGDLVEVCSAMQGVMETVNDRIKTNGGVALVVDYG 238 Query: 243 YLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 +S +G TLQA+K H +V L N G+ADL++HVDF+ L++ A L + LT QG FL Sbjct: 239 DWRS-LGSTLQAIKSHAFVDVLTNSGEADLTAHVDFEALANAAPD--LNYSRLTPQGIFL 295 Query: 303 EGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMP 359 + LGI RA L Q + L+S K MG LFK++ L P Sbjct: 296 QRLGIDHRAEVLGNQLSGAA--LESHKAAHHRLTAADEMGTLFKVMAFYAPNSPLPP 350 >gi|240850150|ref|YP_002971543.1| hypothetical protein Bgr_05410 [Bartonella grahamii as4aup] gi|240267273|gb|ACS50861.1| hypothetical protein Bgr_05410 [Bartonella grahamii as4aup] Length = 359 Score = 299 bits (765), Expect = 5e-79, Method: Composition-based stats. Identities = 144/360 (40%), Positives = 201/360 (55%), Gaps = 9/360 (2%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L KI +I NG +TV QY L + DP+FGYY T PFG GDF+TAPEISQ+FGEM+ Sbjct: 3 NLKEKIKEIIALNGPITVSQYMTLALTDPQFGYYQTQTPFGRTGDFITAPEISQLFGEMI 62 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 I+++ +W HG P L E+GPGRG +M D+LR I KL F ++++E S++L Sbjct: 63 GIWVLASWNAHGCPRPFILAEIGPGRGTLMDDVLRTIQKLSTTAFESSEVFLLEISKKLA 122 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 QKK+L+SY +I+ S +P FL+ANEF D+LPI Q++ + RER I IDQ Sbjct: 123 EEQKKRLSSYQKQIHSIESFDQIPSKPLFLIANEFLDTLPINQYIKIKGEWRERRITIDQ 182 Query: 184 HDSLVFN--IGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDY 241 + F + CS G IFE +P R + MQ IS L G+A++IDY Sbjct: 183 NGDFTFIAALHKLPSSYLQTYCSKVPDGTIFEYAPLRHQFMQQISHHLVQVTGSALLIDY 242 Query: 242 GYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKF 301 G GDTLQA+ H + PG+ DL+SHV F L +IA+ + QG F Sbjct: 243 GAADLAFGDTLQALSKHRFRDIFDAPGEHDLTSHVGFSFLKNIALEQGCFAEIF-EQGDF 301 Query: 302 LEGLGIWQRAFSL--MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMP 359 L +G+ +RA L K +D + ++RL A + MG+LFK+L S + + + P Sbjct: 302 LLKMGLLERAKQLGVGKSAPLQDKIRQDIERL----AGQDQMGKLFKVLHFSDKNISIPP 357 >gi|319406834|emb|CBI80469.1| conserved hypothetical protein [Bartonella sp. 1-1C] Length = 358 Score = 299 bits (765), Expect = 5e-79, Method: Composition-based stats. Identities = 136/362 (37%), Positives = 201/362 (55%), Gaps = 9/362 (2%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 + L +I +I +G ++V QY L + DP+FGYY PFG GDF+TAPEISQ+FGEM Sbjct: 2 SNLKERIKEIIILDGPISVSQYMTLALTDPQFGYYQKQKPFGRTGDFITAPEISQLFGEM 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + I+ I +W+ G P+ L E+GPGRG +M DILR I K+ F+ I+++E S+RL Sbjct: 62 IGIWAIMSWQAQGCPNPFILAEIGPGRGTLMDDILRTIRKICITAFNAADIFLIEISQRL 121 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 QKK+L ++ KI S +PL ++ANE FD+LPI Q+V +ER I ++ Sbjct: 122 ATEQKKRLFAHQKKIYSVESFEQIPLKPLIVIANELFDALPINQYVKVNGEWKERRITLN 181 Query: 183 QHDSLVF--NIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 + F +I C+ G I E P R++ Q IS RL G+A++ID Sbjct: 182 KEGGFTFTTDIHKFPSTFLLPQCAQMPNGTILEYGPSRNQLAQKISSRLMQTQGSALLID 241 Query: 241 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 YG GDTLQA+ H + PG+ DL+SHVDF L +IA ++ + Q Sbjct: 242 YGASDFAFGDTLQAISRHKFCDIFSAPGEHDLTSHVDFFSLKTIAAQQGCFVE-ILEQRD 300 Query: 301 FLEGLGIWQRAFSLMKQTAR--KDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM 358 FL +G+ +RA L + K + + +KRL D K MG+LFK+L ++ + +++ Sbjct: 301 FLIKMGVLERAEKLSINKSTIIKKKIYEDIKRLT----DPKQMGKLFKVLHINDKNIQIP 356 Query: 359 PF 360 F Sbjct: 357 YF 358 >gi|49473977|ref|YP_032019.1| hypothetical protein BQ03330 [Bartonella quintana str. Toulouse] gi|49239480|emb|CAF25833.1| hypothetical protein BQ03330 [Bartonella quintana str. Toulouse] Length = 363 Score = 298 bits (764), Expect = 7e-79, Method: Composition-based stats. Identities = 149/361 (41%), Positives = 214/361 (59%), Gaps = 10/361 (2%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 M+ L KI +I NG + V QY L + DP+FGYY T PFG GDF+TAPEISQ+FG Sbjct: 1 MDT-LKEKIEEIIALNGPIPVSQYITLALTDPQFGYYQTQTPFGRAGDFITAPEISQLFG 59 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 EM+ I+ + W+ G P L E+GPGRG +M DILR I KL P F I+++E S+ Sbjct: 60 EMIGIWALANWKVQGCPHPFILAEIGPGRGTLMDDILRTIQKLSPKAFDAAEIFLIEISK 119 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 +L + Q+++L+SY +I+ + + +PL FL+ANEFFD+LPI Q++ +ER I Sbjct: 120 KLAVEQQERLSSYQKQIHSIENFSQIPLSPLFLIANEFFDTLPINQYIKINGEWKERRIT 179 Query: 181 IDQHDSLVFNIGDHEIKSNF--LTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 ++Q + +F H + S + + S+ G IFE++P R + MQ ISDRL G+A++ Sbjct: 180 VNQDGNFMFIATPHTLPSYYLQFSLSEVPDGTIFEHAPSRYQFMQQISDRLVQIKGSALL 239 Query: 239 IDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQ 298 IDYG GDTLQA+ H + PG+ DL+SHVDF L +IA+ + + Q Sbjct: 240 IDYGSSDLAFGDTLQALSKHRFRDIFEAPGEHDLTSHVDFSFLKTIALEQGCFAE-ILEQ 298 Query: 299 GKFLEGLGIWQRAFSL--MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE 356 G FL +G+ +RA L K A +D + VKRL A MG+LFK+L V+ + + Sbjct: 299 GDFLFTMGLLERARQLGVNKSAALQDKICQDVKRL----AGPDQMGKLFKVLHVNDKNIP 354 Query: 357 L 357 L Sbjct: 355 L 355 >gi|259418837|ref|ZP_05742754.1| ATP synthase beta subunit/transcription termination factor rho [Silicibacter sp. TrichCH4B] gi|259345059|gb|EEW56913.1| ATP synthase beta subunit/transcription termination factor rho [Silicibacter sp. TrichCH4B] Length = 357 Score = 298 bits (763), Expect = 7e-79, Method: Composition-based stats. Identities = 136/359 (37%), Positives = 196/359 (54%), Gaps = 12/359 (3%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L+ + I +G M+V +Y + C+ +PE GYY+T GA GDF+TAPEISQ+FGE+L Sbjct: 2 SLMDTLRRRIHLDGPMSVAEYMSECLLNPEQGYYTTATAIGAEGDFITAPEISQMFGELL 61 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 + L+ AW G P+ L ELGPGRG +M D+LR + P F + + ++E S RL Sbjct: 62 GLALVQAWLDQGSPAPFTLAELGPGRGTLMADMLRATRAV-PGFHDAMDLTLIEASPRLR 120 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 +Q+ LA Y W S+ D+P FLVANEFFD+LPI+QF E RER + + Sbjct: 121 NLQEIALAPY--APRWLPSVEDLPQQPLFLVANEFFDALPIRQFQRDETQWRERRVGLTD 178 Query: 184 H--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDY 241 + + D G + E ++I+ R+A GG A+V+DY Sbjct: 179 DASALALGLGAAAPQPALAHRIEDTKHGDLVEYCEIAAVVTEAIAQRIADHGGAALVVDY 238 Query: 242 GYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKF 301 G +S +GDTLQA++ H PL NPGQADL++HVDF+ + + A + LT QG F Sbjct: 239 GDWRS-LGDTLQALRAHAPTDPLQNPGQADLTAHVDFEAICNAARISGCAHTRLTPQGVF 297 Query: 302 LEGLGIWQRAFSLMK--QTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM 358 LE LGI +RA +L + A D ++ + +RL + MG LFK+L + + Sbjct: 298 LERLGITERARALASGLEGAPLDQIVSAHRRLTH----PEEMGNLFKVLGLYPTQSPPP 352 >gi|254460068|ref|ZP_05073484.1| ATP synthase beta subunit/transription termination factor rho [Rhodobacterales bacterium HTCC2083] gi|206676657|gb|EDZ41144.1| ATP synthase beta subunit/transription termination factor rho [Rhodobacteraceae bacterium HTCC2083] Length = 354 Score = 298 bits (762), Expect = 1e-78, Method: Composition-based stats. Identities = 129/358 (36%), Positives = 190/358 (53%), Gaps = 12/358 (3%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L + + + I G M+V +Y + C+ PE+GYYST +PFGA GDF+TAPEISQ+FGE+ Sbjct: 2 TPLEQILKSQITTQGPMSVAEYMSTCLLHPEYGYYSTRDPFGAGGDFITAPEISQMFGEL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + + L W G P+ + L ELGPGRG +M DILR + P F I+++E S +L Sbjct: 62 IGLTLAQVWMDQGQPAKIALAELGPGRGTLMADILRTAKAV-PSFAQACEIHLIEASPKL 120 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 +Q L++Y + + + + +ANEFFD+LPI+Q + G RER I +D Sbjct: 121 REVQAATLSAYTPVWHDHVNQL-PSDLPLYAIANEFFDALPIRQMIRDGEGWRERQIGLD 179 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 + + SD G + E M+++++R+ +GG A++ DYG Sbjct: 180 NDALAFGLSISAPLAALDHRLSDTKDGDLVELCAQAPLIMRTLAERIQANGGAAVIFDYG 239 Query: 243 YLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 +S +GDTLQAV H V L+ PGQ+DL+SHVDF+ L + LTTQG FL Sbjct: 240 DWRS-LGDTLQAVYAHEKVPALLKPGQSDLTSHVDFEALI---TDLPCTHSRLTTQGVFL 295 Query: 303 EGLGIWQRAFSLMKQTARKDI--LLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM 358 E LGI RA +L K + + + +RL MG +FK L + + Sbjct: 296 ERLGITDRAQALAKSLGGTALEHHIAAHRRLTH----PDEMGTIFKTLALFPKGAHPP 349 >gi|166064237|gb|ABY79036.1| hypothetical protein [endosymbiont of Ridgeia piscesae] Length = 355 Score = 297 bits (761), Expect = 2e-78, Method: Composition-based stats. Identities = 127/359 (35%), Positives = 201/359 (55%), Gaps = 11/359 (3%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 +L + ++ IK +G +++ +Y C+ P+ GYYST +PFGA GDF TAPEISQ+FGE+ Sbjct: 2 TELAKLLIEHIKNSGPISLAEYMGECLLHPKHGYYSTRDPFGADGDFTTAPEISQMFGEL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L + + W Q G P+ L E+GPGRG +M D+LR + + I ++E S L Sbjct: 62 LGLCMAQTWLQQGSPNAFTLAEIGPGRGTLMADVLRATKGVAGFH-TAAQITLIEASPAL 120 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 IQ++QLA Y + W +++ P +L+ANEFFD+LPI Q++M + G RER++ + Sbjct: 121 QKIQREQLADY--DVTWLGDISETPKAPLYLLANEFFDALPIHQYIMEDDGWRERLVGVA 178 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 + + ++ D G + E I+ R+A GG AI+IDYG Sbjct: 179 DDELVFGASAAADLPPLEHRRKDCRTGDLVEVCGAASAIAGEIATRIAEHGGAAIIIDYG 238 Query: 243 YLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 +S +GDTLQA++ H + SPL +PG+AD+++HVDF+ L+ A+ + ++ + QG+ L Sbjct: 239 DWRS-LGDTLQALRNHEFDSPLAHPGEADITAHVDFEALAVSAVSH-TPVSKMIPQGELL 296 Query: 303 EGLGIWQRAFSLMK--QTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMP 359 + LGI QRA L + + A + + + +RL MG LFK + + P Sbjct: 297 KRLGIDQRAEVLAQGLEGAELENHMSTHRRLTE----PTEMGTLFKAIAFYPNGQQPPP 351 >gi|254450658|ref|ZP_05064095.1| ATP synthase beta subunit/transription termination factor rho [Octadecabacter antarcticus 238] gi|198265064|gb|EDY89334.1| ATP synthase beta subunit/transription termination factor rho [Octadecabacter antarcticus 238] Length = 364 Score = 297 bits (760), Expect = 2e-78, Method: Composition-based stats. Identities = 130/364 (35%), Positives = 188/364 (51%), Gaps = 14/364 (3%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L I I+ +G +T+ Y A + P++GYY+T +P GA GDF TAPEISQ+FGE+ Sbjct: 2 TALHDLITARIQTSGPITLADYMADALMHPKYGYYATRDPLGAAGDFTTAPEISQMFGEL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + + L AW G PS L ELGPGRG +M DILR K+ F +++VETS L Sbjct: 62 IGLSLAQAWIDQGQPSTFALAELGPGRGTLMADILRATAKVV-GFVDAAQVHLVETSPAL 120 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVM----TEHGIRERM 178 Q + ++ + W+ ++ +P FLVANEFFD+LPI+QF G RE Sbjct: 121 RKKQAELMSGPHTNVTWHDDVSTLPDMPLFLVANEFFDALPIRQFHRATPADGSGWRELQ 180 Query: 179 IDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 I + + I + I E+ ++I++R+A GG A++ Sbjct: 181 IGLQDGTLVAGLSAAAPIAFLEHRLGNTKASDIVEHCTALAAITENIANRIATSGGAALI 240 Query: 239 IDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAI--LYKLYINGLT 296 +DYG +S +GDTLQA++ H P NPG+AD+++HVDF+ ++ A LT Sbjct: 241 VDYGDWRS-LGDTLQALQNHASTDPFANPGEADITAHVDFEAIAMTAASTNGGAIYTQLT 299 Query: 297 TQGKFLEGLGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 TQG FLE LGI QRA +L + D + + +RL MG LFK+L + Sbjct: 300 TQGVFLERLGIAQRAQTLAANLSGDALDAHITAHRRLTH----PSEMGSLFKVLGLYPTN 355 Query: 355 VELM 358 Sbjct: 356 APPP 359 >gi|254441180|ref|ZP_05054673.1| conserved hypothetical protein [Octadecabacter antarcticus 307] gi|198251258|gb|EDY75573.1| conserved hypothetical protein [Octadecabacter antarcticus 307] Length = 368 Score = 296 bits (758), Expect = 3e-78, Method: Composition-based stats. Identities = 128/368 (34%), Positives = 185/368 (50%), Gaps = 18/368 (4%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 KL I I G +T+ Y A + P++GYY+T +P GA GDF TAPEISQ+FGE+ Sbjct: 2 TKLNDLIAARIAATGPITLADYMADALMHPKYGYYATRDPLGAAGDFTTAPEISQMFGEL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + + L AW G PS LVELGPGRG +M DILR F +++VETS L Sbjct: 62 IGLSLAQAWIDQGHPSAFALVELGPGRGTLMSDILRAT-TSVSGFADAAQVHLVETSPTL 120 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVM--------TEHGI 174 Q +L + K+ W+ + +P +LVANEFFD+LPI+QF Sbjct: 121 RHEQATRLNARQSKVTWHDDIGTLPDLPLYLVANEFFDALPIRQFHRAALADRFADGAVW 180 Query: 175 RERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGG 234 RE + + I D +G + E+ P + I++R+A GG Sbjct: 181 REVQFGLQNDTLVAGLSAAAPIAFLDHRIHDTNVGDVVEHCPALAAITEDIANRIATHGG 240 Query: 235 TAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSS--IAILYKLYI 292 A++IDYG +S +GDTLQA++ H P +PG+ D+++HVDF+ ++ + Sbjct: 241 VALIIDYGDWRS-LGDTLQALQNHAAADPFAHPGETDITAHVDFEAIAMATTSTNGGAKY 299 Query: 293 NGLTTQGKFLEGLGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVV 350 LTTQG FLE LGI +RA +L + + + + +RL MG LFK+L + Sbjct: 300 TRLTTQGVFLERLGITKRAQTLAATLSGDALETHIAAHRRLTH----PSEMGSLFKVLGL 355 Query: 351 SHEKVELM 358 Sbjct: 356 YPTNAPPP 363 >gi|163732263|ref|ZP_02139709.1| hypothetical membrane protein [Roseobacter litoralis Och 149] gi|161394561|gb|EDQ18884.1| hypothetical membrane protein [Roseobacter litoralis Och 149] Length = 352 Score = 296 bits (758), Expect = 3e-78, Method: Composition-based stats. Identities = 130/357 (36%), Positives = 194/357 (54%), Gaps = 13/357 (3%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L +++ I +G M+V +Y C+ P GYY+T PFG+ GDF TAPEISQ+FGE++ Sbjct: 2 TLKDQLIARITAHGPMSVAEYMGECLLHPTLGYYTTQMPFGSAGDFTTAPEISQMFGELI 61 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 + L+ W G P+ LVELGPGRG +M D+LR ++ P F I +VE S RL Sbjct: 62 GLCLVQTWIDQGQPTPFSLVELGPGRGTLMADVLRATSQV-PAFLHAAEIILVEASPRLQ 120 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 IQ+ L + I + T ++ +P F++ANEFFD+LP++QFV + RER I D Sbjct: 121 SIQRDTLKDH--DIAFVTEVSTLPQQPLFVIANEFFDALPVRQFVRSGAHWRERQIGSDG 178 Query: 184 HDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY 243 + + + + SD + E +P + + R+ GG ++IDYG Sbjct: 179 DELIFGLGAETPQPALNDRLSDTKDNDVVEYTPAAAPILSQLGGRIEAHGGVGLIIDYGD 238 Query: 244 LQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLE 303 S +GDTLQAV+ H + L +PG++DL++HVDF+ L+ A + LT QG FLE Sbjct: 239 WHS-LGDTLQAVRRHQFTGILDHPGESDLTAHVDFEALAQAAR---CAYSRLTPQGVFLE 294 Query: 304 GLGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM 358 LGI QRA L + +++ +I + + +RL MG LFK+L + Sbjct: 295 RLGIAQRAQHLAQNLSKEQLEIHIQAHRRLTH----PAEMGNLFKVLGLFPHGKAPP 347 >gi|163745737|ref|ZP_02153097.1| hypothetical protein OIHEL45_09100 [Oceanibulbus indolifex HEL-45] gi|161382555|gb|EDQ06964.1| hypothetical protein OIHEL45_09100 [Oceanibulbus indolifex HEL-45] Length = 352 Score = 296 bits (758), Expect = 3e-78, Method: Composition-based stats. Identities = 126/358 (35%), Positives = 184/358 (51%), Gaps = 13/358 (3%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L ++ I G M +D Y C+ P++GYY+T PFG GDF TAPEISQ+FGE++ Sbjct: 2 SLKDHLLARIALEGPMRLDDYMQSCLLHPDWGYYTTRMPFGVQGDFTTAPEISQMFGELI 61 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 + L W G P+ L ELGPGRG +M D+LR ++ P F + + ++E S L Sbjct: 62 GLSLAQCWLDQGAPAPFTLAELGPGRGTLMADVLRACARV-PGFLAAAQVRLIEASPALR 120 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 +Q++ L + + W+ ++ ++P FL+ANEFFD+LPI+QF+ G ER I + Sbjct: 121 DLQRQTLEGF--EATWHDTVTELPDVPLFLIANEFFDALPIRQFLRQGAGWAERRIGAAE 178 Query: 184 HDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY 243 + + D G + E M I +R+A GG A++IDYG Sbjct: 179 NGLCFGLAPVAPHPAIAHRLDDTKDGDLVEVCAPAADIMLEIGNRIAQQGGAALIIDYG- 237 Query: 244 LQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLE 303 +GDTLQA++ H PL NPG ADL++HVDF+ L A+ + L TQG FLE Sbjct: 238 DWRALGDTLQAMEHHAPADPLANPGCADLTAHVDFEAL---ALACPCQYSRLVTQGVFLE 294 Query: 304 GLGIWQRAFSLMKQTARKD--ILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMP 359 LGI RA L + + + +RL MG LFK++ + + P Sbjct: 295 RLGITARAQKLAEALTDDALHSHIAAHRRLTH----PAEMGNLFKVMGLYPKGQTPPP 348 >gi|99080460|ref|YP_612614.1| hypothetical protein TM1040_0619 [Ruegeria sp. TM1040] gi|99036740|gb|ABF63352.1| protein of unknown function DUF185 [Ruegeria sp. TM1040] Length = 357 Score = 296 bits (758), Expect = 4e-78, Method: Composition-based stats. Identities = 131/359 (36%), Positives = 194/359 (54%), Gaps = 12/359 (3%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L++ + I+ +G MTV Y + C+ P++GYY+T GA GDF+TAPEISQ+FGE+L Sbjct: 2 SLMQSLRRRIELDGPMTVADYMSECLLHPDYGYYTTAPAIGAEGDFITAPEISQMFGELL 61 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 + L+ +W G P L ELGPGRG +M D+LR + P F + + ++E S RL Sbjct: 62 GLVLVQSWLDQGRPQPFTLAELGPGRGTLMADMLRATRAV-PGFHEAMELLLIEASPRLR 120 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 +Q++ LA Y W S+ D+P FLVANEFFD+LPI+QF + RER + + + Sbjct: 121 DLQRQALAPY--APRWVPSVEDLPQHPLFLVANEFFDALPIRQFQREGNQWRERRVGLAE 178 Query: 184 H--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDY 241 + + D G + E+ ++I+ R+ GG A+++DY Sbjct: 179 DASGLTLGLGAPAPQPALAHRLEDTKDGDLVEHCEVAAVVTEAIAQRIGDHGGVALLVDY 238 Query: 242 GYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKF 301 G +S +GDTLQA++ H PL PGQADL++HVDF+ + + A LT QG F Sbjct: 239 GDWRS-LGDTLQALRAHAPTDPLAEPGQADLTAHVDFEAICTAASATGCAHTRLTPQGVF 297 Query: 302 LEGLGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM 358 LE LGI RA +L A + ++ + +RL + MG LFK+L + K Sbjct: 298 LERLGITDRANALASGAAGEPLAQIIAAHRRLTH----PEEMGNLFKVLGLYPAKFAPP 352 >gi|304391301|ref|ZP_07373245.1| hypothetical protein R2A130_2682 [Ahrensia sp. R2A130] gi|303296657|gb|EFL91013.1| hypothetical protein R2A130_2682 [Ahrensia sp. R2A130] Length = 383 Score = 295 bits (756), Expect = 5e-78, Method: Composition-based stats. Identities = 137/368 (37%), Positives = 197/368 (53%), Gaps = 14/368 (3%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFG-------AVGDFVTAPEI 55 + L +I + I+++G +++ +Y ALC+ PE+GYY+T P G GDF+TAPEI Sbjct: 2 SPLETEIRSRIEQDGPLSIAEYMALCLLHPEYGYYTTGTPVGGRASASREGGDFITAPEI 61 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 SQ+FGEM+ ++ + W+ G PS LVELGPGRG +M D+LRV L P F + IY+ Sbjct: 62 SQMFGEMIGVWCMEVWQALGEPSPFALVELGPGRGTLMADLLRVAKAL-PGFAAAADIYL 120 Query: 116 VETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIR 175 VE S L Q L G + W +P ++ NEF D+LP +Q+V E Sbjct: 121 VEVSGTLAEQQSLTLEKSGASLKWLRDTGQLPDMPAIIIGNEFLDALPFRQWVRLEGQWL 180 Query: 176 ERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGT 235 ER I I D L F + + ++ G IFE +P R+ ++ I+ L G Sbjct: 181 ERAIGIR-DDKLAFVAKANVLPQEDEPEGEHEDGTIFETAPAREAQIAQIAAHLKQHNGA 239 Query: 236 AIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGL 295 A+ IDYG+L+S GDT QAV+ H Y PL PGQ+DL+SHVDF+ L +IA + Sbjct: 240 ALFIDYGHLKSGTGDTFQAVRDHAYADPLAAPGQSDLTSHVDFETLLAIAKTAGCAVPPA 299 Query: 296 TTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHE-- 353 T QGKFL LG+ +RA +L + + D + + ++ V A +MG LFK+ Sbjct: 300 TAQGKFLVELGLLERAGTL--GSGKNDHIQNQLRSAVERLAGPDAMGNLFKVAAFGAPAS 357 Query: 354 -KVELMPF 360 F Sbjct: 358 LGNRWPGF 365 >gi|182680434|ref|YP_001834580.1| hypothetical protein Bind_3534 [Beijerinckia indica subsp. indica ATCC 9039] gi|182636317|gb|ACB97091.1| protein of unknown function DUF185 [Beijerinckia indica subsp. indica ATCC 9039] Length = 386 Score = 295 bits (756), Expect = 6e-78, Method: Composition-based stats. Identities = 125/379 (32%), Positives = 193/379 (50%), Gaps = 25/379 (6%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L R I +I++ G ++++++ L + P FGYY P GA GDFVTAPEISQ+FGE+ Sbjct: 9 TPLQRLIAEMIEQEGPISLERFMDLALYHPAFGYYCAKMPLGAEGDFVTAPEISQMFGEL 68 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + ++ W G P+ + LVE GPGRG +M D+LR + P F + + +++VE + L Sbjct: 69 IGLWAAEVWRTMGAPARIALVEFGPGRGTLMADLLRAARAV-PAFSAAIEVHLVEANPVL 127 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLG----FTFLVANEFFDSLPIKQFVMTEHGIRERM 178 +Q++ LA G + W+ S+ G +ANEFFD LP++QF+ G ER+ Sbjct: 128 RRVQEQILAGTGHPLIWHESMDMFLAGGEETPVLCIANEFFDCLPLRQFIRGRAGWHERL 187 Query: 179 IDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 + + F + ++ GA+ E + M ++ R++ G + Sbjct: 188 VGLAAGGGFQFGLAAEAAPELTGIAAE--PGAVLEINAGAVSAMHRLATRISRVSGVLLA 245 Query: 239 IDYGYLQS--RVGD--TLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYING 294 IDYG+ R G TLQA++ H V PL G+ADL++HVDF RL+ A I G Sbjct: 246 IDYGHAAPSGRFGGGETLQALRHHRRVDPLEAAGEADLTAHVDFTRLTEAARAGGAEIYG 305 Query: 295 LTTQGKFLEGLGIWQRAFSLMK--QTARKDILLDSVKRLV------------STSADKKS 340 TQG+FL LG+ +RA +L + + + L +V RL + + Sbjct: 306 PVTQGEFLCRLGLVERAEALARRANPQQMEALHMAVARLAGDDFPSSGLAGDQGMSLQAG 365 Query: 341 MGELFKILVVSHEKVELMP 359 MG LFK+L V+ L P Sbjct: 366 MGTLFKVLAVTAPGFPLPP 384 >gi|121602661|ref|YP_989273.1| hypothetical protein BARBAKC583_0995 [Bartonella bacilliformis KC583] gi|120614838|gb|ABM45439.1| conserved hypothetical protein [Bartonella bacilliformis KC583] Length = 366 Score = 295 bits (756), Expect = 6e-78, Method: Composition-based stats. Identities = 149/362 (41%), Positives = 210/362 (58%), Gaps = 9/362 (2%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L KI +I G +TV Q+ +L + D +FGYY T PFG GDF+TAPEISQ+FGEM+ Sbjct: 3 TLKHKIQKIIALEGPITVSQFMSLVLTDSQFGYYQTQTPFGRTGDFITAPEISQLFGEMI 62 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 I++I WE G P L E+GPGRGI+M D+LR I KL F+ I++VE S+ L Sbjct: 63 GIWIIANWEAQGCPHPFILAEIGPGRGILMDDVLRTIQKLCITAFNAAEIFLVEISQNLA 122 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 QKK+L+SY KI+ S +P G L+ANE FD+LPI Q++ + RER+I ++ Sbjct: 123 TEQKKRLSSYQKKIHNIESFDQIPPGLLILIANELFDALPIHQYIKIDGEWRERLITLNP 182 Query: 184 HDSLVFNIGDHEIKSNFLT--CSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDY 241 L+F H++ S L C++ G I+E +P R++ MQ IS RL G+A++IDY Sbjct: 183 SGHLIFIADSHKLSSESLPNYCAEMPDGTIWEYAPLRNQLMQKISSRLMQTQGSALLIDY 242 Query: 242 GYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKF 301 G GDTLQA+ H + PG+ DL+SHVDF L IA+ + + QG F Sbjct: 243 GASDIAFGDTLQAISKHQFRDIFSAPGEHDLTSHVDFFSLKEIALQKGCFAT-ILEQGDF 301 Query: 302 LEGLGIWQRAFSLM--KQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMP 359 L +GI +RA L K A ++ + ++RL S MG+LFK+L S + + L Sbjct: 302 LFKMGILERAKKLSSHKNVAIQNKIHQDIERLTS----PNQMGKLFKVLYASDKNIPLFS 357 Query: 360 FV 361 F+ Sbjct: 358 FL 359 >gi|84499690|ref|ZP_00997978.1| hypothetical protein OB2597_07165 [Oceanicola batsensis HTCC2597] gi|84392834|gb|EAQ05045.1| hypothetical protein OB2597_07165 [Oceanicola batsensis HTCC2597] Length = 362 Score = 295 bits (754), Expect = 8e-78, Method: Composition-based stats. Identities = 135/359 (37%), Positives = 190/359 (52%), Gaps = 12/359 (3%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L+ +I + I G MT+ +Y + + DP+ GYY+T +PFG GDF+TAPE SQ+FGE+ Sbjct: 4 TPLLDRIRHRIGAQGPMTLAEYMQIALLDPDHGYYATRDPFGTAGDFITAPETSQMFGEL 63 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + + L +W G P+ L E GPGRG +M DILR + P F LS+ ++E S L Sbjct: 64 VGLALAQSWIDQGRPAPFILAEPGPGRGTLMADILRATRSV-PGFHDGLSLVLIEASPVL 122 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 IQ + L+ Y + W L +P FLVANEFFD+LPI+QF G E M+ + Sbjct: 123 RDIQARTLSGY--RAEWIDDLGALPEAPLFLVANEFFDALPIRQFRRRGDGWAEVMVTVS 180 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 + D + E P R I R+A GG A+++DYG Sbjct: 181 GSGLATALAAPVPLPELAHRLGDTREDDVVELCPAAARAAAHIGARIADQGGAAVIVDYG 240 Query: 243 YLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 +S +GDT QA+KGH V PL PG ADL++HVDF+RL+ A + +G+ QG FL Sbjct: 241 DWRS-LGDTFQALKGHAPVDPLAAPGTADLTAHVDFERLAKAATPA--WASGMIPQGVFL 297 Query: 303 EGLGIWQRAFSLMKQT--ARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMP 359 E LGI RA +L + D + + +RL + MG LFK+L +S +P Sbjct: 298 ERLGITARAQALATRLQGPDLDAHVAAHRRLTH----PEEMGTLFKVLALSPPDAPPVP 352 >gi|319898499|ref|YP_004158592.1| hypothetical protein BARCL_0325 [Bartonella clarridgeiae 73] gi|319402463|emb|CBI76006.1| conserved protein of unknown function [Bartonella clarridgeiae 73] Length = 361 Score = 295 bits (754), Expect = 1e-77, Method: Composition-based stats. Identities = 138/362 (38%), Positives = 208/362 (57%), Gaps = 9/362 (2%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 + L +I +I +G ++V QY AL + DP+ GYY PFG GDF+TAPEISQ+FGEM Sbjct: 2 SNLKERIKEIIILDGPISVSQYMALALTDPQSGYYQKQKPFGHTGDFITAPEISQLFGEM 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + I+ I +W+ G P+ L E+GPGRG +M D+LR I K+ ++ I+++E S+RL Sbjct: 62 IGIWTIMSWQAQGCPNPFILAEIGPGRGTLMDDVLRTIRKICMAAYNAADIFLIEISQRL 121 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 QKK+L+S+ +I+ + +P L+ANE FD+LPI Q++ RER I ++ Sbjct: 122 ATEQKKRLSSHKKQIHNIENFEQIPCKPLILIANELFDALPIDQYIKVNEEWRERRITLN 181 Query: 183 QHDSLVFNIGDHEIKSNFLT--CSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 Q + F + H+ S L C+ G I E +P R++ +Q IS L G+A++ID Sbjct: 182 QEGNFTFIVDAHKFPSTDLPAHCAQMPNGTILEYAPSRNQLIQKISSHLIHTKGSALLID 241 Query: 241 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 YG GDTLQA+ H + PG+ DL+SHV+F L +IAI + + QG Sbjct: 242 YGTSDFAFGDTLQAISKHKFCDIFSAPGKHDLTSHVNFFSLKTIAIQQGCFAE-ILEQGD 300 Query: 301 FLEGLGIWQRAFSLM--KQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM 358 FL +G+ +RA L K K+ + ++RL A K MG+LFK+L VS + +++ Sbjct: 301 FLFKMGLLERAKQLSINKSIVIKEKIYQDIERL----AGPKQMGKLFKVLHVSDKNIQIP 356 Query: 359 PF 360 F Sbjct: 357 HF 358 >gi|110680664|ref|YP_683671.1| hypothetical protein RD1_3502 [Roseobacter denitrificans OCh 114] gi|109456780|gb|ABG32985.1| hypothetical membrane protein [Roseobacter denitrificans OCh 114] Length = 352 Score = 294 bits (753), Expect = 1e-77, Method: Composition-based stats. Identities = 132/357 (36%), Positives = 194/357 (54%), Gaps = 13/357 (3%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L +++ IK +G M+V +Y C+ P GYY+T +PFG GDF+TAPE SQ+FGE++ Sbjct: 2 SLKDQLIARIKAHGPMSVAEYMGDCLLHPTLGYYTTQHPFGGSGDFITAPETSQMFGELI 61 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 + L+ AW G PS LVELGPGRG++M DILR ++ PDF + +VE S +L Sbjct: 62 GLCLVQAWVDQGRPSPFALVELGPGRGVLMADILRAAAQV-PDFARAAEVILVEASPKLQ 120 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 IQ+ L ++ + + +A +P F+VANEFFD+LPI+QFV + RER + D Sbjct: 121 EIQRDTLKAH--AVTFVKDVASLPQCPLFVVANEFFDALPIRQFVRSGPHWRERQVGCDA 178 Query: 184 HDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY 243 + + SD + E +P M + R+ GG ++IDYG Sbjct: 179 EQLIFGMGAQTPQPALNARLSDTKEHDLVEYAPAAAPIMSELGSRIDTHGGAGLIIDYGD 238 Query: 244 LQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLE 303 +S +GDTLQAV+ H Y L +PG++DL++HVDF+ L+ + LT QG FLE Sbjct: 239 WRS-LGDTLQAVRQHEYTGVLDHPGESDLTAHVDFEALAQ---AVPCAFSRLTPQGVFLE 294 Query: 304 GLGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM 358 LGI QRA L + + + + + +RL + MG LFK+L + Sbjct: 295 RLGITQRAQRLAQNLPKDLLEQHIKAHRRLTH----PEEMGNLFKVLGLFPHGKAPP 347 >gi|83859400|ref|ZP_00952921.1| hypothetical protein OA2633_13385 [Oceanicaulis alexandrii HTCC2633] gi|83852847|gb|EAP90700.1| hypothetical protein OA2633_13385 [Oceanicaulis alexandrii HTCC2633] Length = 378 Score = 294 bits (752), Expect = 1e-77, Method: Composition-based stats. Identities = 123/361 (34%), Positives = 182/361 (50%), Gaps = 9/361 (2%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLA 64 + ++ I+ G ++V + A + P G+Y+T +P GA DF+TAPEISQ+FGE+L Sbjct: 12 ISERLAERIRTEGSLSVAAFMAEALFHPMAGFYATKDPLGAANDFITAPEISQMFGELLG 71 Query: 65 IFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTL 124 ++ W Q G PS L+ELGPG G MM D+LR + P F + + ++E S L + Sbjct: 72 LWAAECWMQMGAPSRFELIELGPGTGRMMSDMLRA-GRAAPGFLDAVHVTLIEASPALKM 130 Query: 125 IQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQH 184 +Q + LAS INW P G ++ NEF D LPI+Q + + RER++ + Sbjct: 131 VQGQTLASASVPINWAKDFDKAPSGPAVVIGNEFLDCLPIRQAIRHKGQWRERVVTLHPE 190 Query: 185 DSLVF------NIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 D F +G+ ++ + G + E P +++ ++ R D G A+ Sbjct: 191 DEARFVYGLGPVLGEADVAFIAPGLREADDGTLVELRPGDQQQIDQLAARFDRDPGYALF 250 Query: 239 IDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQ 298 +DYG + GDTLQA++ H V PL PG ADL++ VDF RL + L G TQ Sbjct: 251 VDYGSAKPETGDTLQAIRAHQKVDPLDAPGTADLTAWVDFDRLLRLGEDAGLSAFGPMTQ 310 Query: 299 GKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM 358 G FL LGI QRA L + +KR + + MG LFK+ S E + Sbjct: 311 GDFLTELGIEQRAAVLSRSVDEAGQ--AKLKRQMHRLVSPEDMGTLFKLAAFSSEGLPPA 368 Query: 359 P 359 P Sbjct: 369 P 369 >gi|126733062|ref|ZP_01748818.1| hypothetical protein SSE37_14379 [Sagittula stellata E-37] gi|126706472|gb|EBA05553.1| hypothetical protein SSE37_14379 [Sagittula stellata E-37] Length = 357 Score = 293 bits (749), Expect = 3e-77, Method: Composition-based stats. Identities = 134/359 (37%), Positives = 190/359 (52%), Gaps = 14/359 (3%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 + L I I + G +T+ Y ALC++ PE GYY+T +P GA GDF TAPEISQ+FGE+ Sbjct: 2 SALKDIITRQISRTGPLTLADYMALCLSHPEHGYYATRDPLGAEGDFTTAPEISQMFGEL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + + L +W G P+ L ELGPGRG +M D LR ++ P F L +++VETS L Sbjct: 62 IGLALAQSWMDQGAPTRFVLSELGPGRGTLMADALRATTRV-PGFHDALELHLVETSPAL 120 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 Q +L W+ S+A +P FL+ANEFFD+LPI+QF+ G +ER++ + Sbjct: 121 RAEQAARLPD----ATWHESVASLPEAPLFLIANEFFDALPIRQFLRHAQGWQERVVGLK 176 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 + + +D G I EN +Q R+A GGTA+++DYG Sbjct: 177 DGQPTLGLTDPAPHDALDHRLADTEPGQIVENCAPAQAIVQETGRRIASHGGTALIVDYG 236 Query: 243 YLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 +SR GDT QA+ H P PG+ADL++HVDF+ L+ A T QG FL Sbjct: 237 DWRSR-GDTFQALYRHKPAEPFARPGEADLTAHVDFEALAKAAHPAAHSAL--TPQGVFL 293 Query: 303 EGLGIWQRAFSLMKQT--ARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMP 359 E LGI RA +L ++ A + + + +RL MG LFK+L + P Sbjct: 294 EHLGITARAQALARRLGGAALESHVAAHRRLTH----PGEMGSLFKVLALFPHDAPPPP 348 >gi|254486401|ref|ZP_05099606.1| ATP synthase beta subunit/transription termination factor rho [Roseobacter sp. GAI101] gi|214043270|gb|EEB83908.1| ATP synthase beta subunit/transription termination factor rho [Roseobacter sp. GAI101] Length = 354 Score = 293 bits (749), Expect = 4e-77, Method: Composition-based stats. Identities = 140/363 (38%), Positives = 201/363 (55%), Gaps = 14/363 (3%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L I+ I G M +D+Y A+C+ P GYY+T +PFGA GDFVTAPEISQ+FGE+ Sbjct: 2 TSLRDHIIERIHTTGPMRIDEYMAMCLLHPTRGYYTTRDPFGAEGDFVTAPEISQMFGEL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + + L W G P+ L ELGPGRGI+M DILR + P F I +VE S+ L Sbjct: 62 IGLCLAQTWLSQGAPARFTLAELGPGRGILMADILRATRAV-PGFAQAAEITLVEASQTL 120 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 +Q+ LA + ++ W S +P +LVANEFFD+LPI+QFV G RER I + Sbjct: 121 RDVQRTTLAGH--QVQWCDSADALPDQPLYLVANEFFDALPIRQFVRDGTGWRERQIGL- 177 Query: 183 QHDSLVFNIGDH-EIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDY 241 +L F +G + D G + E+ +Q+I++R++ GG A++IDY Sbjct: 178 TDGALSFGLGPMLPQPAFADRLEDTQDGDLIEDCAQLAPTVQAIANRISTHGGAALIIDY 237 Query: 242 GYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKF 301 G +S +GDT QA++GH V PL PG +DL++HVDF++L+ + +T QG F Sbjct: 238 GDWRS-LGDTFQALRGHQTVDPLSAPGSSDLTAHVDFEKLA--IAAAPAQHSRITPQGVF 294 Query: 302 LEGLGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMP 359 LE LGI QRA +L K + + + +RL MG LFK++ + P Sbjct: 295 LERLGITQRAQTLATTMTGKVLEGHVAAHRRLTH----PSEMGNLFKVIGLFPPTQNPPP 350 Query: 360 FVN 362 ++ Sbjct: 351 GLD 353 >gi|260576455|ref|ZP_05844445.1| protein of unknown function DUF185 [Rhodobacter sp. SW2] gi|259021338|gb|EEW24644.1| protein of unknown function DUF185 [Rhodobacter sp. SW2] Length = 351 Score = 292 bits (748), Expect = 4e-77, Method: Composition-based stats. Identities = 134/359 (37%), Positives = 191/359 (53%), Gaps = 14/359 (3%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L ++ ++ G +T+ + A C+ P+ GYYST +PFG GDF TAPEISQ+FGE+ Sbjct: 2 TPLADLLIRRVQATGPVTLADFMADCLMHPQHGYYSTRDPFGRAGDFTTAPEISQMFGEL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L + L AW G P+ + L ELGPGRG +M D+LR + P F + + ++E S L Sbjct: 62 LGLCLAQAWLDQGSPAPITLAELGPGRGTLMADVLRATSGV-PGFHAAAQVVLLEASPTL 120 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 +Q + L + W T++AD+P F++ANEFFD+LPI QF + G R+RM+ + Sbjct: 121 RAVQAQTLGAR--AATWITTVADLPDQPLFVLANEFFDALPIHQFQRDDSGWRQRMVGVQ 178 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 + G + E P M I+ RL GG A+++DYG Sbjct: 179 DGSLAFGLSDPVPALMVGKAFVNDPPGTVVEVCPLARGIMDQIAHRLGNFGGAALIVDYG 238 Query: 243 YLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 +S+ GDTLQA++GH PL NPGQADL++HVDF+ L + +GLT QGK L Sbjct: 239 GWRSK-GDTLQALRGHAPEHPLANPGQADLTAHVDFEAL----CPPGIPHSGLTDQGKLL 293 Query: 303 EGLGIWQRAFSLMKQT--ARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMP 359 + LGI RA L + A + L + +RL D + MG LFK L + P Sbjct: 294 KRLGIDARAAKLATKLTGAALESHLAAHRRLT----DPEEMGTLFKALALHAPGTPPPP 348 >gi|260432873|ref|ZP_05786844.1| protein C2orf56 [Silicibacter lacuscaerulensis ITI-1157] gi|260416701|gb|EEX09960.1| protein C2orf56 [Silicibacter lacuscaerulensis ITI-1157] Length = 355 Score = 292 bits (747), Expect = 6e-77, Method: Composition-based stats. Identities = 134/358 (37%), Positives = 188/358 (52%), Gaps = 10/358 (2%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L ++ I++NG M++ Y A C+ P GYY+T +P GA GDFVTAPEISQ+FGE++ Sbjct: 2 SLRDHLIARIRQNGPMSIADYMAECLLHPTHGYYTTRDPLGAQGDFVTAPEISQMFGELI 61 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 + L AW G P L ELGPGRG +M DILR + P F + ++E S L Sbjct: 62 GLCLAQAWINQGKPERFALAELGPGRGTLMADILRATKGV-PGFHDAAQVVLLEASPVLR 120 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 +Q + LA + +W + +P +LVANEFFD+LPI+QF+ G RER++ + Sbjct: 121 GLQAEALAGH--APDWIIQVGALPDLPLYLVANEFFDALPIRQFLRDGDGWRERLVGLKG 178 Query: 184 HDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY 243 D + D G + E + I+ R+ GG ++IDYG Sbjct: 179 DDLAFGLGAWAAQPALAARLEDTRDGDLVELCAAAVPVVTEIARRIRTHGGVGLIIDYGD 238 Query: 244 LQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLE 303 S +GDT QA+KGH PL NPGQADL++HVDF+ L+ A LT QG FLE Sbjct: 239 WHS-LGDTFQALKGHERTDPLANPGQADLTAHVDFELLAQAARAAGCAHTRLTQQGVFLE 297 Query: 304 GLGIWQRAFSLMKQTARKD--ILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMP 359 LGI +RA +L +L+ + +RL + MG LFK+L + + P Sbjct: 298 RLGITRRAQALAGPLTGDALNMLVAAHRRLTH----PEEMGNLFKVLGLYCKPAAPPP 351 >gi|84683545|ref|ZP_01011448.1| hypothetical protein 1099457000264_RB2654_19268 [Maritimibacter alkaliphilus HTCC2654] gi|84668288|gb|EAQ14755.1| hypothetical protein RB2654_19268 [Rhodobacterales bacterium HTCC2654] Length = 352 Score = 292 bits (746), Expect = 8e-77, Method: Composition-based stats. Identities = 148/358 (41%), Positives = 193/358 (53%), Gaps = 13/358 (3%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L K+ I+ G M+V + A C+ DPE GYY+T +PFG+ GDF TAPEISQ+FGE++ Sbjct: 2 SLADKLRARIEGTGPMSVADFMAECLLDPEHGYYTTRDPFGSAGDFTTAPEISQMFGELV 61 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 + L W G P+ L ELGPGRG +M DILR + P F I +VE S RL Sbjct: 62 GLCLAQGWMDQGSPAPFVLAELGPGRGTLMADILRATRGV-PGFHDAARIVLVEASPRLR 120 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 Q+ L Y + W SL D P G FLVANEFFD+LP++QF RER + + + Sbjct: 121 ERQQATLTGY--GVTWVDSLEDAPDGPLFLVANEFFDALPVRQFQRDADDWRERQVGL-K 177 Query: 184 HDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY 243 +L F +G + +D G + E P D M I R+A GG A+VIDYG Sbjct: 178 DGALTFGLGGPTAHAPLDRWTDAQPGDLVELRPAADAVMAEIDRRIAAQGGAALVIDYGD 237 Query: 244 LQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLE 303 S +GDTLQAV H PL NPG ADL++HVDF+ L+ A L + LT QG FLE Sbjct: 238 WHS-LGDTLQAVAKHEAADPLANPGAADLTAHVDFEALALAATR--LTHSRLTPQGVFLE 294 Query: 304 GLGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMP 359 LGI RA +L + + + + +RL MG LFK L E ++P Sbjct: 295 RLGITARAQALAARLEGEPLTRHVAAHRRLTH----PDEMGTLFKTLAFVPEGAAMLP 348 >gi|254511961|ref|ZP_05124028.1| hypothetical protein RKLH11_2504 [Rhodobacteraceae bacterium KLH11] gi|221535672|gb|EEE38660.1| hypothetical protein RKLH11_2504 [Rhodobacteraceae bacterium KLH11] Length = 355 Score = 291 bits (744), Expect = 1e-76, Method: Composition-based stats. Identities = 135/358 (37%), Positives = 189/358 (52%), Gaps = 10/358 (2%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L ++ I+++G MTV Y C+ P GYY+T +P GA GDF+TAPEISQ+FGE++ Sbjct: 2 TLKDHLLRRIREHGPMTVADYMNACLLHPIHGYYTTRDPLGAQGDFITAPEISQMFGELV 61 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 + L +W G P+ + L ELGPGRG +M DILR + P F I ++E S L Sbjct: 62 GLCLAQSWIGQGQPARIALAELGPGRGTLMADILRATRNV-PGFHDAAEITLLEASPALR 120 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 IQ + L + W ++ D+P FLVANEFFD+LPI+QF+ RER++ D Sbjct: 121 HIQSETLRDH--TPRWIDAIDDLPDLPLFLVANEFFDALPIRQFLREGSSWRERLVGGDG 178 Query: 184 HDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY 243 + SD G + E + I+ ++ GG A++IDYG Sbjct: 179 TSLTFGLGPQTAQPALSERLSDTQDGDLVELCSATAPMLGFIAGQITRHGGVAMIIDYGD 238 Query: 244 LQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLE 303 +S +GDTLQAV+ H PL +PGQADL++HVDF+ L+ A LT QG FLE Sbjct: 239 WRS-LGDTLQAVRSHEVTDPLKDPGQADLTAHVDFETLALAAKAAGCAYTKLTPQGVFLE 297 Query: 304 GLGIWQRAFSLMKQT--ARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMP 359 LGI RA SL ++ + L+ + +RL MG LFK+L + P Sbjct: 298 RLGITGRARSLAAPLGGSQLETLVAAHRRLTH----PDEMGNLFKVLGLYPPNAAPPP 351 >gi|83941322|ref|ZP_00953784.1| hypothetical protein EE36_03798 [Sulfitobacter sp. EE-36] gi|83847142|gb|EAP85017.1| hypothetical protein EE36_03798 [Sulfitobacter sp. EE-36] Length = 354 Score = 291 bits (744), Expect = 1e-76, Method: Composition-based stats. Identities = 137/359 (38%), Positives = 185/359 (51%), Gaps = 12/359 (3%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L + + I NG M +D+Y A C+ P GYY+T +PFG GDF TAPEISQ+FGE+ Sbjct: 2 TTLRDILHSRIASNGPMRIDEYMATCLLHPTQGYYTTRDPFGTQGDFTTAPEISQMFGEL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L + L +W PS L ELGPGRG +M DILR + P F I +VE S L Sbjct: 62 LGLCLAQSWLAQDAPSAFTLAELGPGRGTLMADILRATRNV-PGFIEAARITLVEASPTL 120 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 +Q K LA + T +P FLVANEFFD+LPI+QFV E RER I + Sbjct: 121 RDVQAKTLAGHQVIWADGTD--ALPDQPLFLVANEFFDALPIRQFVRGETSWRERQIGLA 178 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 + + +D G + E+ + +S+R+A GG A+++DYG Sbjct: 179 DGALSFGLGPELPQPALADRLADTKPGDLVEDCTQLAPILHPVSERIATHGGAALIVDYG 238 Query: 243 YLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 S +GDTLQA++GH PL PGQADL++HVDF++L+ +T QG FL Sbjct: 239 DWHS-LGDTLQALQGHEKADPLAAPGQADLTAHVDFEKLA--LAAAPASHTRITPQGVFL 295 Query: 303 EGLGIWQRAFSLMKQTARKD--ILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMP 359 E LGI QRA +L K + + +RL MG LFK++ + P Sbjct: 296 ERLGITQRAQTLAKGMTEDALNAHVAAHRRLTH----PSEMGNLFKVMGIYPPHHSPPP 350 >gi|255264600|ref|ZP_05343942.1| ATP synthase beta subunit/transcription termination factor rho [Thalassiobium sp. R2A62] gi|255106935|gb|EET49609.1| ATP synthase beta subunit/transcription termination factor rho [Thalassiobium sp. R2A62] Length = 355 Score = 290 bits (743), Expect = 2e-76, Method: Composition-based stats. Identities = 127/361 (35%), Positives = 180/361 (49%), Gaps = 11/361 (3%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L ++ I G ++V Y A C+ P+ GYY+T PFG GDF TAPEISQ+FGE++ Sbjct: 2 TLTDYLLRRIAAQGPLSVADYMAECLLHPDLGYYTTQQPFGRDGDFTTAPEISQMFGELV 61 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 + L AW G P L ELG GRG +M D LR + P F +++ MVE S + Sbjct: 62 GLSLAQAWIDAGAPDAFTLCELGGGRGTLMADALRAARAV-PRFIDAMTVIMVEASPQRQ 120 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 Q+ L Y +L D+P L+ANEFFD LP +QFV G ER+I Sbjct: 121 ADQETLLMDYAPIFRD--TLTDLPDQPLLLIANEFFDCLPPRQFVRDGAGWAERVIGAVD 178 Query: 184 HDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY 243 +D G + E I+ +A GGTA+++DYG Sbjct: 179 GVLSWGLAPAQPRAELEHRLADTKDGDLVELHTLATAVTDEIAHHIASHGGTALIVDYGD 238 Query: 244 LQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLE 303 +S+ GDTLQA+KGH + PL PG +DL+ VDF+ ++ A + +T QG +LE Sbjct: 239 WRSQ-GDTLQALKGHAPIDPLAAPGLSDLTVQVDFEVIALAAQNVGARHSRVTPQGVWLE 297 Query: 304 GLGIWQRAFSLMKQT--ARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM-PF 360 LGI RA +L ++ A+ D + + +RL MG LFK++ ++ L F Sbjct: 298 RLGITDRAQALAQKMQGAQLDAHIAAHRRLTH----PDEMGNLFKVIAITALDAPLPAGF 353 Query: 361 V 361 V Sbjct: 354 V 354 >gi|312114462|ref|YP_004012058.1| hypothetical protein Rvan_1711 [Rhodomicrobium vannielii ATCC 17100] gi|311219591|gb|ADP70959.1| protein of unknown function DUF185 [Rhodomicrobium vannielii ATCC 17100] Length = 373 Score = 290 bits (743), Expect = 2e-76, Method: Composition-based stats. Identities = 127/364 (34%), Positives = 183/364 (50%), Gaps = 11/364 (3%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L K+ I + G + + Y C+ D + GYY +P G GDF+TAPEISQ+FGE+ Sbjct: 11 TPLALKLRRDIAERGPIPLHDYMEACLYDLQHGYYRKRDPLGRGGDFITAPEISQVFGEL 70 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + ++ W Q G P V LVELGPGRG +M D LR + P F +++++VE+SE L Sbjct: 71 IGLWAAQVWMQMGQPQSVCLVELGPGRGTLMADALRAAR-VMPGFLQSIAVHLVESSEVL 129 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 QK LA I W+ + +VP G ++ANEFFD LP++QF R + Sbjct: 130 REAQKATLAGVPVPIQWHGDMGEVPSGPAIVIANEFFDCLPVRQFAFDGAAEVWRERVVA 189 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDY---FLGAIFENSPCRDREMQSISDRLACDGGTAIVI 239 D ++ LT + Y G I E+ P + + R A+VI Sbjct: 190 FEDGAFHLATSADVAQPPLTAASYGEPRDGDILEHCPGVGPLLAKFAARAGDAPLAALVI 249 Query: 240 DYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQG 299 DYGY + G+TLQAV+ H + PG+ DL++HVDF RL+ +A + G G Sbjct: 250 DYGYAKPAFGETLQAVRRHKFAGLFDAPGETDLTAHVDFSRLARLAEEASFVVFGPMAMG 309 Query: 300 KFLEGLGIWQRAFSLMKQTARKD--ILLDSVKRLVSTSADKKSMGELFKILVVSHEKV-E 356 ++L LG+ RA L+ T+ ++ + S+ RLV D MG LFKIL + Sbjct: 310 EWLLRLGVEARANQLLASTSAEEARAIAQSIARLV----DPAQMGALFKILSWTRGISEP 365 Query: 357 LMPF 360 PF Sbjct: 366 PPPF 369 >gi|159045190|ref|YP_001533984.1| hypothetical protein Dshi_2650 [Dinoroseobacter shibae DFL 12] gi|157912950|gb|ABV94383.1| conserved hypothetical protein [Dinoroseobacter shibae DFL 12] Length = 363 Score = 290 bits (743), Expect = 2e-76, Method: Composition-based stats. Identities = 126/349 (36%), Positives = 175/349 (50%), Gaps = 13/349 (3%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L + I G +TV ++ A C+ P GYY+T PFG GDF TAPEISQ+FGE+ Sbjct: 2 TPLAEILAARIAATGPITVAEFMAECLLHPTHGYYTTRTPFGQAGDFTTAPEISQMFGEL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L + L AW G P L E+GPGRG +M DI RV+ ++ + L ++VE S L Sbjct: 62 LGLALAQAWHDQGAPPGAILAEIGPGRGTLMADIRRVLKQVP--GAATLRPHLVEASPAL 119 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 Q ++ + D+P LVANEFFD+LPI+QF G ER I + Sbjct: 120 RAEQATRVPE----AVRLDRVEDLPDAPLLLVANEFFDALPIRQFERHAAGWAERQIGLA 175 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 + + S +D G + E + I+ R+A GG AI+ DYG Sbjct: 176 EGALAFGRAQPAALASLAHRMADTGPGDLVETCAPAQPIIAEIAGRIARHGGAAIIADYG 235 Query: 243 YLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 +S+ GDTLQAV+ H L +PGQADL++HVDF+ L+ A ++ + QG FL Sbjct: 236 DWRSK-GDTLQAVRAHRPDPVLAHPGQADLTAHVDFEPLAQAARTAGASVSAMIPQGVFL 294 Query: 303 EGLGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILV 349 E LGI RA +L + + +RL + MG LFK+L Sbjct: 295 ERLGITTRAQALATGLEGAALQSHIAAHRRLTH----PEEMGTLFKVLC 339 >gi|319405266|emb|CBI78880.1| conserved hypothetical protein [Bartonella sp. AR 15-3] Length = 374 Score = 290 bits (742), Expect = 2e-76, Method: Composition-based stats. Identities = 143/362 (39%), Positives = 206/362 (56%), Gaps = 9/362 (2%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 + L +I +I NG ++V QY AL + DP+FGYY PFG GDF+TAPEISQ+FGE+ Sbjct: 2 SNLKERIKEIIILNGPISVSQYMALALTDPQFGYYKKQKPFGHDGDFITAPEISQLFGEI 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + I+ I +W HG P+ L E+GPGRG +M D+LR I K+ F+ I+++E S+RL Sbjct: 62 IGIWAIMSWRAHGSPNSFILAEIGPGRGTLMDDVLRTIRKICMTAFNAADIFLIEISQRL 121 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 QKK+L SY +I + +PL ++ANE FD+LPI Q++ RER I ++ Sbjct: 122 AAKQKKRLLSYQKQIYSIENFEQIPLKPLIIIANELFDALPINQYIKINEEWRERRITLN 181 Query: 183 QHDSLVFNIGDHEIKSNFLTC--SDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 Q F I H+ + FL + G I E +P R++ Q IS RL G+A++ID Sbjct: 182 QEGDFTFTIDVHKFPATFLPPHCAQMPNGTIVEYAPLRNQLAQKISSRLMQTQGSALLID 241 Query: 241 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 YG GDTLQAV H + PG+ DL+SHVDF L +IAI + + Q Sbjct: 242 YGASDFAFGDTLQAVSKHKFCDIFSAPGEHDLTSHVDFFSLKTIAIQQGCFAE-ILEQRD 300 Query: 301 FLEGLGIWQRAFSLM--KQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM 358 FL +G+ +RA L K T K + ++RL K MG+LFK+L +S++ +++ Sbjct: 301 FLFKMGVLERAKKLSINKSTVMKKKIHQDIERLTGL----KQMGKLFKVLHISNKNLQIP 356 Query: 359 PF 360 F Sbjct: 357 YF 358 >gi|83854800|ref|ZP_00948330.1| hypothetical protein NAS141_08731 [Sulfitobacter sp. NAS-14.1] gi|83842643|gb|EAP81810.1| hypothetical protein NAS141_08731 [Sulfitobacter sp. NAS-14.1] Length = 354 Score = 290 bits (741), Expect = 3e-76, Method: Composition-based stats. Identities = 137/359 (38%), Positives = 186/359 (51%), Gaps = 12/359 (3%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L + + I NG M +D+Y A C+ P GYY+T +PFG GDF TAPEISQ+FGE+ Sbjct: 2 TTLRDILHSRIASNGPMRIDEYMATCLLHPTQGYYTTRDPFGTQGDFTTAPEISQMFGEL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L + L +W PS L ELGPGRG +M DILR + P F I +VE S L Sbjct: 62 LGLCLAQSWIAQDAPSAFTLAELGPGRGTLMADILRATRNV-PGFIEAAQITLVEASPTL 120 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 +Q K LA + T +P FLVANEFFD+LPI+QFV E RER + + Sbjct: 121 RDVQAKTLAEHQVIWADGTD--ALPDQPLFLVANEFFDALPIRQFVRGETSWRERQVGLA 178 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 + + +D G + E+ + +S+R+A GG A+++DYG Sbjct: 179 DGALSFGLGPELPQPALADRLADTTPGDLVEDCTQLAPILHPVSERIATHGGAALIVDYG 238 Query: 243 YLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 S +GDTLQA++GH PLV PGQADL++HVDF++L+ +T QG FL Sbjct: 239 DWHS-LGDTLQALQGHEKADPLVAPGQADLTAHVDFEKLA--LAAAPASYTRITPQGVFL 295 Query: 303 EGLGIWQRAFSLMKQTARKD--ILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMP 359 E LGI QRA +L K + + +RL MG LFK++ + P Sbjct: 296 ERLGITQRAQTLAKGMTEDALNAHVAAHRRLTH----PSEMGNLFKVMGIYPPHHSPPP 350 >gi|67464609|pdb|1ZKD|A Chain A, X-Ray Structure Of The Putative Protein Q6n1p6 From Rhodopseudomonas Palustris At The Resolution 2.1 A , Northeast Structural Genomics Consortium Target Rpr58 gi|67464610|pdb|1ZKD|B Chain B, X-Ray Structure Of The Putative Protein Q6n1p6 From Rhodopseudomonas Palustris At The Resolution 2.1 A , Northeast Structural Genomics Consortium Target Rpr58 Length = 387 Score = 289 bits (740), Expect = 4e-76, Method: Composition-based stats. Identities = 139/360 (38%), Positives = 195/360 (54%), Gaps = 12/360 (3%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L +I LIK G V +Y LC+ PE GYY T +P G GDF T+PEISQ FGE+ Sbjct: 5 TALATEIKRLIKAAGPXPVWRYXELCLGHPEHGYYVTRDPLGREGDFTTSPEISQXFGEL 64 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L ++ W+ P +RL+E+GPGRG D LR + L P + LS+++VE + L Sbjct: 65 LGLWSASVWKAADEPQTLRLIEIGPGRGTXXADALRALRVL-PILYQSLSVHLVEINPVL 123 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 Q+ LA + I+W+ S DVP G ++ANE+FD LPI Q + E G ER+I+I Sbjct: 124 RQKQQTLLAGIRN-IHWHDSFEDVPEGPAVILANEYFDVLPIHQAIKRETGWHERVIEIG 182 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDY----FLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 LVF + I GA+FE P + + I+ R+ GG A++ Sbjct: 183 ASGELVFGVAADPIPGFEALLPPLARLSPPGAVFEWRPDTE--ILKIASRVRDQGGAALI 240 Query: 239 IDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQ 298 IDYG+L+S VGDT QA+ H+Y PL +PG+ADL++HVDF L A +G TQ Sbjct: 241 IDYGHLRSDVGDTFQAIASHSYADPLQHPGRADLTAHVDFDALGRAAESIGARAHGPVTQ 300 Query: 299 GKFLEGLGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE 356 G FL+ LGI RA SL + + + + +++RL G FK++ VS K+E Sbjct: 301 GAFLKRLGIETRALSLXAKATPQVSEDIAGALQRLTGEGRGAX--GSXFKVIGVSDPKIE 358 >gi|163794552|ref|ZP_02188523.1| hypothetical protein BAL199_05044 [alpha proteobacterium BAL199] gi|159180276|gb|EDP64799.1| hypothetical protein BAL199_05044 [alpha proteobacterium BAL199] Length = 360 Score = 289 bits (740), Expect = 4e-76, Method: Composition-based stats. Identities = 122/361 (33%), Positives = 170/361 (47%), Gaps = 14/361 (3%) Query: 6 IRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAI 65 + I+ G ++V A + PE GYY+T +PFGA GDFVTAPEISQ+FGE++ + Sbjct: 4 ADHLRRRIRAEGPLSVADMMASALVHPEHGYYTTRDPFGAAGDFVTAPEISQMFGELIGL 63 Query: 66 FLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLI 125 + W+ G P V LVELGPGRG +M D LR P F + +++VE S L Sbjct: 64 WAAVVWQGMGAPDPVALVELGPGRGTLMADALRA-AVGVPAFRAAAQVHLVEASPTLRQH 122 Query: 126 QKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHD 185 Q +L + W+ L +P ++ANEFFD+LPI Q V RER + + Sbjct: 123 QATRLEK--ARPIWHDGLDTLPDQPAIVIANEFFDALPIVQLVRDGRNWRERRLAVVADA 180 Query: 186 SLVFNIGDHEIKSNFLTC-----SDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 F + + GA+ E P + ++DRL GG + ID Sbjct: 181 EEFFWTLTPGASPHAGLLDPGLRQNAPDGALAEICPSGLSIARHLADRLNRFGGAVLAID 240 Query: 241 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 YG+ S GDTLQAV+ H L G ADL++HVDF L +G QG Sbjct: 241 YGHAVSAAGDTLQAVQRHKPADILATLGNADLTAHVDFGALGRAVAEAGAVQHGPLGQGA 300 Query: 301 FLEGLGIWQRAFSLMKQTARKD--ILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM 358 FL LGI RA +L + + + + RL+ MG LFK++ + Sbjct: 301 FLRQLGIEARAEALCRAATTEQCAAVHSACARLIH----PDEMGTLFKVVAWTRADASPP 356 Query: 359 P 359 P Sbjct: 357 P 357 >gi|260427542|ref|ZP_05781521.1| ATP synthase beta subunit/transcription termination factor rho [Citreicella sp. SE45] gi|260422034|gb|EEX15285.1| ATP synthase beta subunit/transcription termination factor rho [Citreicella sp. SE45] Length = 353 Score = 288 bits (738), Expect = 6e-76, Method: Composition-based stats. Identities = 130/359 (36%), Positives = 180/359 (50%), Gaps = 13/359 (3%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L + I G M+V +Y C+ P+ GYY+T +P GA GDF TAPEISQ+FGE+ Sbjct: 2 TPLAEILHRRIAAEGPMSVAEYMTACLLHPQHGYYATRDPLGAAGDFTTAPEISQMFGEL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L + L +W G P+ L ELGPGRG +M D R + + P +++VE S L Sbjct: 62 LGLCLAQSWIDQGRPAPFVLAELGPGRGTLMADATRAMRAV-PGMLEAARVHLVEASPTL 120 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 Q ++LA W+ S+AD+P FL+ANEFFD+LPI+QF+ G ER++ + Sbjct: 121 RDAQHQRLAPL--MPVWHESVADLPEASLFLLANEFFDALPIRQFLRVGTGWAERVVGVQ 178 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 + S +D G + E I R+A GG A+++DYG Sbjct: 179 DGALAFGLAEPVALASLESRLADTQEGDMVETCAPATGIAAEIGRRIAEHGGAALIVDYG 238 Query: 243 YLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 QS +GDT QAV+ H + PL PG+ADL++HVDF L+ L QG FL Sbjct: 239 SDQS-LGDTFQAVRRHRKLGPLDCPGEADLTAHVDFGALAQ---AAPCATAPLVPQGIFL 294 Query: 303 EGLGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMP 359 E LGI RA +L + D + + +RL + MG LFK L E P Sbjct: 295 ERLGITDRARALAAKLEGPALDSHIAAHRRLTH----PEEMGSLFKTLGFFPESATPPP 349 >gi|144899708|emb|CAM76572.1| protein containing DUF185 [Magnetospirillum gryphiswaldense MSR-1] Length = 350 Score = 288 bits (737), Expect = 8e-76, Method: Composition-based stats. Identities = 129/363 (35%), Positives = 182/363 (50%), Gaps = 19/363 (5%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFG-YYSTCNPFGAVGDFVTAPEISQIFGEM 62 L K+ I + G +TV + G YY+T +PFG GDF TAPE+SQ+FGE+ Sbjct: 3 SLAEKLAARIAQGGPITVADFMHEA-----VGQYYATRDPFGRQGDFTTAPEVSQMFGEL 57 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + ++ + W+ G P V L ELGPGRG +M D+LR + P F I +VETS RL Sbjct: 58 IGLWCVMVWQMMGAPDKVVLAELGPGRGTLMNDLLRA-AGVVPAFLKAADIRLVETSPRL 116 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 T +Q++ L+ + W ++ +P G ++ANE FD+LPI+QFV + ERM+ +D Sbjct: 117 TALQRQTLSGR--DVQWCENVDQLPDGPLIVIANELFDALPIRQFVKADGQWCERMVGLD 174 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 F G GAI E S+ RL G A++IDYG Sbjct: 175 GDG-FCFVAGPAADPDLPAEVLATPDGAIVETCDGGRALAASLGKRLNRQPGFALIIDYG 233 Query: 243 YLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 + +S GDTLQAV+ H + L PG AD+++HVDFQ L+ G QG FL Sbjct: 234 HGRSGTGDTLQAVRHHRFHPVLDQPGLADITAHVDFQALA--GAAVPARAWGPVDQGGFL 291 Query: 303 EGLGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE-LMP 359 LGI RA L + K ++ ++RL+ D MG LFK L ++ Sbjct: 292 RALGIETRAHLLAQAGGDKVAADIMGQLRRLI----DPGEMGTLFKALALASPHFPAPPG 347 Query: 360 FVN 362 FV+ Sbjct: 348 FVS 350 >gi|325192008|emb|CCA26474.1| conserved hypothetical protein [Albugo laibachii Nc14] Length = 476 Score = 288 bits (737), Expect = 9e-76, Method: Composition-based stats. Identities = 131/388 (33%), Positives = 197/388 (50%), Gaps = 36/388 (9%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 N+L + + I G +T+ +Y +A P GYY + FGA GDF TAPEISQ+FGE+ Sbjct: 91 NELFSILSSFIDVRGPLTLAEYMQRALAHPTHGYYMKKDVFGAQGDFTTAPEISQMFGEL 150 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 +AI+ I W++ G P + +VELGPGRG +M D LR P F S L ++MVE S L Sbjct: 151 IAIWCIATWKEMGSPDPIHIVELGPGRGSLMSDFLRSSRSF-PTFHSALQVHMVEISPAL 209 Query: 123 TLIQKKQLASYGDK--INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 IQ+ L + W+TSL VP G ++A EFFD++P+ QF TE G ER+ID Sbjct: 210 RKIQEGMLKDVDGIRSLQWHTSLTHVPEGPLLVIAQEFFDAMPVHQFEYTERGWCERLID 269 Query: 181 IDQHDS---LVFNIGDHEIK--------------------SNFLTCSDYFLGAIFENSPC 217 + Q + F + + + +G E SP Sbjct: 270 VHQKNGERFFRFVLSPGPTPAARVLIGHQNLQGLRSGVSEESSVITEQAQIGDQLEISPA 329 Query: 218 RDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVD 277 MQ I+ R++ + G A+++DYG+ +L+ ++ H +V L PG+ DLS++VD Sbjct: 330 SFTIMQEIARRISANRGAALIVDYGHNHP-STFSLRGIRNHAFVDALEEPGEIDLSANVD 388 Query: 278 FQRLSSIAILY-KLYINGLTTQGKFLEGLGIWQRAFSLMKQ---TARKDILLDSVKRLVS 333 F+ L+ A + G QG FL+ LG+ R L++ A+ L + KRLV Sbjct: 389 FKTLARYATAEPNISSLGPVPQGLFLKTLGVEHRLAVLLENCESDAQAQELYSAYKRLV- 447 Query: 334 TSADKKSMGELFKILVVSHEKVE-LMPF 360 D MG +FK++ +SH + ++ F Sbjct: 448 ---DSDQMGTIFKVMAISHSDISNIVGF 472 >gi|83952694|ref|ZP_00961424.1| hypothetical protein ISM_11090 [Roseovarius nubinhibens ISM] gi|83835829|gb|EAP75128.1| hypothetical protein ISM_11090 [Roseovarius nubinhibens ISM] Length = 353 Score = 287 bits (735), Expect = 1e-75, Method: Composition-based stats. Identities = 133/362 (36%), Positives = 189/362 (52%), Gaps = 16/362 (4%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L I I G +++ Y LC+ PE GYY T +P G GDF+TAPEISQ+FGE+ Sbjct: 2 TPLKTLISRQIAATGPISIADYMTLCLLHPEHGYYPTRDPLGVSGDFITAPEISQMFGEL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + + L +W G P+ L ELGPGRG +M DILR K+ P F + +MVE S L Sbjct: 62 IGLALAQSWLDQGAPAPFALAELGPGRGTLMADILRATSKI-PGFHAAARPHMVEASPAL 120 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 +Q K + + + + +P FLVANEFFD+LP++QF+ RER++ +D Sbjct: 121 RALQAKAVPG----VTHHDHIDTLPELPLFLVANEFFDALPLRQFLRNGDQWRERLVGLD 176 Query: 183 QHDSLVFNI-GDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDY 241 + +L F + +++ +D G + SP + +++ R+A GG A++IDY Sbjct: 177 EGGALCFGLAAPLPLRALDHRLADTTEGDMVTLSPASEAMAETLGTRIATHGGVALLIDY 236 Query: 242 GYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKF 301 G QS GDT QAVK H PL PG ADL++HV+F L+ A T QG F Sbjct: 237 GDWQS-AGDTFQAVKSHAKTDPLEAPGTADLTAHVEFASLARAAAPAAHSRA--TPQGVF 293 Query: 302 LEGLGIWQRAFSLMKQT--ARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEK-VELM 358 LE LGI RA +L A D + + +RL + MG LFK++V+ Sbjct: 294 LERLGITARAQALASGLTGAALDAHIAAHRRLTH----PEEMGTLFKVMVLYPAGTAPPA 349 Query: 359 PF 360 F Sbjct: 350 GF 351 >gi|149916086|ref|ZP_01904608.1| hypothetical protein RAZWK3B_10522 [Roseobacter sp. AzwK-3b] gi|149809941|gb|EDM69790.1| hypothetical protein RAZWK3B_10522 [Roseobacter sp. AzwK-3b] Length = 366 Score = 287 bits (734), Expect = 2e-75, Method: Composition-based stats. Identities = 133/367 (36%), Positives = 187/367 (50%), Gaps = 20/367 (5%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLA 64 L ++ I+ +G MT+ +Y A C+ PE GYY+T +P GA GDF TAPEISQ+FGE+L Sbjct: 4 LRDILIARIRADGPMTLAEYMADCLMHPEHGYYATRDPLGAAGDFTTAPEISQMFGELLG 63 Query: 65 IFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTL 124 + L AW G P + L E GPGRG +M D+LR + P F + + +++VETS L Sbjct: 64 LSLAQAWMDQGSPEGITLAECGPGRGTLMADVLRATRAV-PGFHAAMRVHLVETSATLRA 122 Query: 125 IQKKQLASY----------GDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGI 174 +Q L D I W+ + +P FL+ANEFFD+LPI+Q Sbjct: 123 VQGATLGKSLGKSLGKSLGRDDITWHDHVDALPDAPLFLLANEFFDALPIRQLQREGGMW 182 Query: 175 RERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGG 234 RER + + + + + D G I E +I R+ GG Sbjct: 183 RERCVGLSGDALALGVSAPKPVAALAHRMEDTRDGDIVEICAGAQAMAGAIGARIGARGG 242 Query: 235 TAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYING 294 A+++DYG +S +GDT QAV GH PL +PG ADL++HVDF+ L+ A K + Sbjct: 243 AALIVDYGDWRS-LGDTFQAVAGHEAADPLADPGGADLTAHVDFEALAQAAAPAK--HSR 299 Query: 295 LTTQGKFLEGLGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVSH 352 LT QG FLE LGI RA +L + D + + +RL MG +FK+L + Sbjct: 300 LTPQGVFLERLGITARAQALAQGLEGAALDAHVAAHRRLTH----PAEMGTVFKVLGLFP 355 Query: 353 EKVELMP 359 E P Sbjct: 356 ETAPPPP 362 >gi|288959074|ref|YP_003449415.1| hypothetical protein AZL_022330 [Azospirillum sp. B510] gi|288911382|dbj|BAI72871.1| hypothetical protein AZL_022330 [Azospirillum sp. B510] Length = 385 Score = 287 bits (734), Expect = 2e-75, Method: Composition-based stats. Identities = 138/375 (36%), Positives = 197/375 (52%), Gaps = 22/375 (5%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L R + I +G ++V + A + P FGYY +PFG+ GDF TAPEISQ+FGE+ Sbjct: 16 ESLARLLARRILMDGPISVATFMAEALGHPRFGYYMRRDPFGSGGDFTTAPEISQMFGEL 75 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + ++ + +W + G P LVELGPGRG +M D+LR L P F +I++VETS L Sbjct: 76 VGLWCVDSWARLGGPGPFHLVELGPGRGTLMADVLRAAAVL-PLFRDSATIHLVETSPAL 134 Query: 123 TLIQKKQLAS-YGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 Q++ L G+ + W+ L DVP G T L+ANEFFD+LPI+Q T HG ER++D+ Sbjct: 135 RERQRETLRPILGEAVRWHDRLEDVPDGPTILIANEFFDALPIRQVQKTNHGWFERLVDV 194 Query: 182 DQH---DSLVFNIG-----DHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDG 233 D D F + + D G + E SP + I RLA Sbjct: 195 DPDSLEDDPRFRFVLEAFGSSGNRLVPDSLRDAPEGCVVEVSPASQAVARLIGARLAAAP 254 Query: 234 GTAIVIDYGYLQ-SRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYI 292 G A+VIDYGY + VGD+LQA++ H Y L PG+ADL++HVDF ++ A Sbjct: 255 GAALVIDYGYGRGPAVGDSLQAMRRHAYAPVLEAPGEADLTAHVDFATIAVAAREGGAQP 314 Query: 293 NGLTTQGKFLEGLGIWQRAFSLMKQTA--RKDILLDSVKRLVSTSADKKSMGELFKILVV 350 G QG +L LGI QRA +L + + + + ++ RL+ D MG LFK++ + Sbjct: 315 FGPVEQGDWLTRLGIRQRASALAAKASPAQARDIGAALDRLI----DPAQMGRLFKLVAL 370 Query: 351 SHEKV-----ELMPF 360 + F Sbjct: 371 ATPGAFADATPPAGF 385 >gi|114767217|ref|ZP_01446082.1| hypothetical protein 1100011001181_R2601_09240 [Pelagibaca bermudensis HTCC2601] gi|114540627|gb|EAU43698.1| hypothetical protein R2601_09240 [Roseovarius sp. HTCC2601] Length = 354 Score = 287 bits (733), Expect = 2e-75, Method: Composition-based stats. Identities = 134/359 (37%), Positives = 187/359 (52%), Gaps = 13/359 (3%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 + L + I G MT+ +Y A C+ P +GYY T +P GA GDF TAPEISQ+FGE+ Sbjct: 2 SPLEEILHRRIAAEGPMTIAEYMATCLGHPRYGYYPTRDPLGAAGDFTTAPEISQMFGEL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L + L W + G PS L ELGPGRG +M D R + + P +++VETS RL Sbjct: 62 LGLCLAQCWLEQGRPSSFVLAELGPGRGTLMADATRAMRGV-PGMLEAARLHLVETSPRL 120 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 Q ++LA W+ S+A++P +L+ANEFFD+LPI+QF+ + G ER++ + Sbjct: 121 RDEQHRRLAPL--MPVWHDSVANLPEAPLYLLANEFFDALPIRQFLRSGEGWCERVVGLS 178 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 + +D G + E + I R+A GG A+++DYG Sbjct: 179 EGRLAFGLTEPAPHGELEHRLADTREGDLVETCAPATGIAEDIGRRIASQGGAALIVDYG 238 Query: 243 YLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 +S +GDT QAV+ H VSPL PG ADL++HVDF L A LT QG FL Sbjct: 239 SARS-LGDTFQAVRRHDKVSPLDAPGTADLTAHVDFGAL---ATAMPCATTTLTPQGVFL 294 Query: 303 EGLGIWQRAFSLMKQTA--RKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMP 359 E LGI RA +L + + + D + + +RL + MG LFK L E P Sbjct: 295 ERLGITDRARALAARLSGAQLDSHVAAHRRLTH----PEEMGSLFKTLGAFPEGAAPPP 349 >gi|298292440|ref|YP_003694379.1| hypothetical protein Snov_2465 [Starkeya novella DSM 506] gi|296928951|gb|ADH89760.1| protein of unknown function DUF185 [Starkeya novella DSM 506] Length = 369 Score = 286 bits (731), Expect = 4e-75, Method: Composition-based stats. Identities = 138/361 (38%), Positives = 185/361 (51%), Gaps = 7/361 (1%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L I I + G +TV Y C+ P GYY+T PFGA GDFVTAPEISQ+FGE+ Sbjct: 2 TPLGEIIARQIGQTGPITVADYMQQCLFHPTLGYYTTHEPFGAQGDFVTAPEISQMFGEL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L ++ W + G PS L ELGPGRG MM D+LR + P + +VE S RL Sbjct: 62 LGLWAADTWMRLGSPSRFVLAELGPGRGTMMADMLRATR-IVPGLREAARVVLVEASPRL 120 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 Q + LA +++W +S+ D+P G L+ANEF D+LP++QFV T G+ ERM+ ++ Sbjct: 121 REKQAETLAG--QEVDWASSVDDLPAGPLILLANEFIDALPVRQFVRTPEGLAERMVGLE 178 Query: 183 QHDSLVFNIGDHE--IKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 + +L F + S D GAI E P + + RLA GG A+ ID Sbjct: 179 EDGALAFGLRPGARLDASAEARLRDAPPGAILEICPAGLVVAEKLGARLAATGGAALFID 238 Query: 241 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 YG+ GDTLQA+ H Y L +PG+ADL++HVDF L+ A G QG Sbjct: 239 YGHAGG-FGDTLQALHRHAYDDVLAHPGEADLTAHVDFAALARAATAAGARAFGPIGQGV 297 Query: 301 FLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKV-ELMP 359 LE LG+ RA L + R +K MG LFK L + H + + Sbjct: 298 LLERLGLDARAERLKRDADPAARASIDAARARLAGTGEKEMGALFKALALVHPALGPVAG 357 Query: 360 F 360 F Sbjct: 358 F 358 >gi|85704768|ref|ZP_01035869.1| hypothetical protein ROS217_06800 [Roseovarius sp. 217] gi|85670586|gb|EAQ25446.1| hypothetical protein ROS217_06800 [Roseovarius sp. 217] Length = 353 Score = 286 bits (731), Expect = 5e-75, Method: Composition-based stats. Identities = 134/360 (37%), Positives = 196/360 (54%), Gaps = 13/360 (3%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLA 64 L ++ I + G +++ Y A C+ PEFGYY+T +PFGA GDFVTAPEISQ+FGE+L Sbjct: 4 LEAQLRARIAEAGPISLADYMAACLMHPEFGYYATRDPFGAGGDFVTAPEISQMFGELLG 63 Query: 65 IFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTL 124 + L W G P+ L ELGPGRG +M D+LR ++ P F +++VE S L Sbjct: 64 LCLAQVWLDQGRPARFVLAELGPGRGTLMADVLRATQRV-PGFRDAAEVHLVEGSAVLRA 122 Query: 125 IQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQH 184 Q++ +A + W+ + +P G +L+ANEFFD+LPI+QF G RER++ + Sbjct: 123 AQRRAIAG---DVIWHERVESLPEGPLYLLANEFFDALPIRQFQRFGDGWRERVVGLSDD 179 Query: 185 DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYL 244 + G + ++ G + E + I R+A GG A+++DYG Sbjct: 180 RLALGLSGPVAPPALVERLAETREGDVVEICGPGEAVAAEIGARIAGHGGAALIVDYGDW 239 Query: 245 QSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEG 304 +S +GDT QAVKGH V PL PG ADL++HVDF+ L+ A LT QG FLE Sbjct: 240 RS-LGDTFQAVKGHAPVDPLAAPGLADLTAHVDFEALARAASPA--VYTRLTPQGVFLER 296 Query: 305 LGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMPFVN 362 LGI R+ L + + + + L + +RL + MG LFK+L + E P ++ Sbjct: 297 LGIGARSEVLARNLSGQALENHLAAYQRLTGA----EEMGTLFKVLGLYPEGTTPPPGLD 352 >gi|254470484|ref|ZP_05083888.1| ATP synthase beta subunit/transription termination factor rho [Pseudovibrio sp. JE062] gi|211960795|gb|EEA95991.1| ATP synthase beta subunit/transription termination factor rho [Pseudovibrio sp. JE062] Length = 363 Score = 285 bits (729), Expect = 8e-75, Method: Composition-based stats. Identities = 152/363 (41%), Positives = 213/363 (58%), Gaps = 7/363 (1%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGE 61 L KI I +G MTV +Y ALC++DPE GYY PFGA GDF TAPEISQ+FGE Sbjct: 6 STPLQEKIKKRIADHGPMTVAEYMALCLSDPEHGYYMRQQPFGAKGDFTTAPEISQLFGE 65 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 ++ + + W V LVE+GPGRG +M DILRVI L+P + + I++VETS Sbjct: 66 LIGAWFLHQWLSQDLKGPVHLVEMGPGRGTLMKDILRVIS-LRPQMLANIQIHLVETSPS 124 Query: 122 LTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 L Q+K L +Y I W+ +L +P G T LVANE FD+LPI Q+ +T+ G RER + + Sbjct: 125 LRKAQRKLLKAY--SIKWHDTLETIPEGPTLLVANELFDALPIHQYQLTDTGWRERCVGL 182 Query: 182 DQHDSLVFNIGDHEIKSNFLTCSDY--FLGAIFENSPCRDREMQSISDRLACDGGTAIVI 239 D+ +L F IG + + ++ +G E SP + I RL +GG A++I Sbjct: 183 DEDGNLTFGIGSGTLSPADVAKANLQAKIGDTLELSPASNAIASQIGHRLKANGGAALLI 242 Query: 240 DYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQG 299 DYGY ++ GDT QA+K H YVS L + G+ADL++HV+FQ L++ A +G QG Sbjct: 243 DYGYAKTATGDTFQALKKHEYVSTLEHCGEADLTAHVNFQALANAAAAEGATAHGPIGQG 302 Query: 300 KFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMP 359 +FL GLG+ +RA L + D D +++ V A MG LFK+L +S+ + +P Sbjct: 303 QFLLGLGLLERAGQLGAGKSTLDQ--DQIRKDVERLAADDQMGTLFKVLCISNTSEQPIP 360 Query: 360 FVN 362 F N Sbjct: 361 FQN 363 >gi|84515351|ref|ZP_01002713.1| hypothetical protein SKA53_01796 [Loktanella vestfoldensis SKA53] gi|84510634|gb|EAQ07089.1| hypothetical protein SKA53_01796 [Loktanella vestfoldensis SKA53] Length = 352 Score = 284 bits (727), Expect = 1e-74, Method: Composition-based stats. Identities = 131/362 (36%), Positives = 192/362 (53%), Gaps = 14/362 (3%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L ++ I ++G +++ + + P GYY+T +PFGA GDF+TAPEISQ+FGE+ Sbjct: 2 TTLADLLLTRIARDGPISIASFMTDALMHPAHGYYATRDPFGAAGDFITAPEISQMFGEL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + + L AW G P V L ELGPGRG +M DILR + P F + ++++ VETS L Sbjct: 62 IGLSLAQAWLDQGAPDPVTLAELGPGRGTLMADILRATAAV-PGFHAAVTVHFVETSPHL 120 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 +Q +++ + W+ + +P LVANEFFD+LPI+QFV G RERM+ Sbjct: 121 RALQAERVP----QATWHDRIDTLPDAPLLLVANEFFDALPIRQFVRAGAGWRERMVGAQ 176 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 + D G + E+ P + +I+ R+A +GG A+VIDYG Sbjct: 177 DGTLCFGLSDAAALAVLTPRLDDTQDGDLVEHCPALPGIVAAIAGRIATNGGAALVIDYG 236 Query: 243 YLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 QS +GDT QA+ GH PL PG ADL++HVDF +++ A + LT QG FL Sbjct: 237 DWQS-LGDTFQALAGHAPTDPLAAPGAADLTAHVDFAAIAAHAAPA--RHSRLTPQGVFL 293 Query: 303 EGLGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMPF 360 E LGI RA L + D + + +RL MG+LFK++ + P Sbjct: 294 ERLGITARALKLASGLTGEALDAHVAAHRRLTH----PAEMGDLFKVMALYPATAMPPPG 349 Query: 361 VN 362 ++ Sbjct: 350 LD 351 >gi|119384944|ref|YP_916000.1| hypothetical protein Pden_2212 [Paracoccus denitrificans PD1222] gi|119374711|gb|ABL70304.1| protein of unknown function DUF185 [Paracoccus denitrificans PD1222] Length = 355 Score = 284 bits (726), Expect = 2e-74, Method: Composition-based stats. Identities = 126/359 (35%), Positives = 177/359 (49%), Gaps = 16/359 (4%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L R I I+ +G M +D+Y LC+ PE GYY+T +PFG GDF TAPEISQ+FGEM Sbjct: 2 TPLARLIATRIRLSGPMALDEYMRLCLLHPEHGYYATRDPFGTAGDFTTAPEISQMFGEM 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + + L AW G P+ L E+GPGRG +M DILR I + P + +VE S L Sbjct: 62 IGLALGQAWLDQGRPAPFTLAEIGPGRGTLMADILRAIR-IVPGMAEAARVALVEASPHL 120 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 +Q+ +L I ++ +P LVANEFFD+LPI+QF G ER++ +D Sbjct: 121 RRVQRDRLGE----IVHLDDVSQLPQAPLLLVANEFFDALPIRQFQRGAQGWAERVVALD 176 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 L + + G I E P + ++ R+A GG AIV+DYG Sbjct: 177 AQGGLEMGLLPIP---GDASLPLAPEGVIRETCPEAAPIVAQVAGRIAAHGGCAIVVDYG 233 Query: 243 YLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 + GDT QA++ H L NPG+ADL++HVDF L++ + + + QG +L Sbjct: 234 GWDGQ-GDTFQALRRHRPEDVLANPGEADLTAHVDFAPLAAAGRAAGVRASRMVAQGDWL 292 Query: 303 EGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM-PF 360 LGI RA L + + + MG LFK+L + F Sbjct: 293 LRLGIESRAQQLARAGDEGAMAA------LHRLTAPGEMGHLFKVLAFWARHAPVPAGF 345 >gi|329888131|ref|ZP_08266729.1| hypothetical protein BDIM_00510 [Brevundimonas diminuta ATCC 11568] gi|328846687|gb|EGF96249.1| hypothetical protein BDIM_00510 [Brevundimonas diminuta ATCC 11568] Length = 357 Score = 283 bits (724), Expect = 3e-74, Method: Composition-based stats. Identities = 124/362 (34%), Positives = 179/362 (49%), Gaps = 7/362 (1%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 M L ++V I G MTV Y C+ DP+ GYY+T G GDF+TAP +SQ+FG Sbjct: 1 MAETLKDRLVREIVLTGPMTVADYVTRCLHDPKGGYYATRPALGERGDFITAPMVSQMFG 60 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 E++ ++ + W + G P VRLVE+GPG G +M D+LR L P F + + ++E S Sbjct: 61 ELIGLWAVETWTRLGAPERVRLVEVGPGDGTLMSDVLRAAR-LVPGFLQAVDLILIEPSA 119 Query: 121 RLTLIQKKQLASYGDKINWYTSLADV-PLGFTFLVANEFFDSLPIKQFVMTEHGIRERMI 179 L Q ++LA W ++L + L+ANE D LP +QF+ TE G ER I Sbjct: 120 PLRAEQARRLADADVHPRWLSALHKIETDAPVILIANEVLDCLPARQFIKTEGGWAERRI 179 Query: 180 DIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVI 239 + D L F + D G I E S + + ++ + G A++I Sbjct: 180 GVTDADELTFGLTAIA-DGFEAPGFDVEPGQIIEISEQQAAFGRDLASLIRAASGAALLI 238 Query: 240 DYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQG 299 DYG + GDTLQA++ H V PL PG+ADL+ DF + A+ + G +QG Sbjct: 239 DYGRSKPEAGDTLQALRRHQKVDPLSTPGEADLTQWADFPLVLEAAVRGGADVTGCVSQG 298 Query: 300 KFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKI-LVVSHEKVELM 358 FL+ LGI RA LM+ + ++R + MGELFK + S + L Sbjct: 299 AFLQALGIEARAQRLMQGRPEAAPV---IQRQLDRLTAPDQMGELFKATAIFSPRSLALP 355 Query: 359 PF 360 F Sbjct: 356 GF 357 >gi|114770207|ref|ZP_01447745.1| hypothetical protein OM2255_11240 [alpha proteobacterium HTCC2255] gi|114549044|gb|EAU51927.1| hypothetical protein OM2255_11240 [alpha proteobacterium HTCC2255] Length = 354 Score = 281 bits (720), Expect = 9e-74, Method: Composition-based stats. Identities = 130/361 (36%), Positives = 196/361 (54%), Gaps = 13/361 (3%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L I IK+ G M V +Y LC+ PE GYY+ + GA GDF TAPEISQ+FGE+ Sbjct: 2 TALSNIIKKQIKRFGPMPVSEYMTLCLLHPEHGYYTNRDALGATGDFTTAPEISQMFGEL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + + + +W P+ L ELGPG G +M DILR + P+F + + ++++E S + Sbjct: 62 IGLSIAQSWIDQEMPTPFILAELGPGNGTLMADILRATKSV-PNFHASMDLHLIEASPEM 120 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 Q+ L + + W +++P FL+ANEFFD LPIKQ+ T+ G +E+MI ++ Sbjct: 121 RKRQETALNGFN--VTWLNYFSELPQKPLFLIANEFFDCLPIKQYRRTDEGWQEQMIAVE 178 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 ++ L F +G + F +D + E SP +I + + +GG AI++DYG Sbjct: 179 -NEQLHFILGTATSEEVFSKTNDVPSADMLEVSPPTVAFASAIGEHIQGNGGCAIIVDYG 237 Query: 243 YLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 S GD+LQA+K H + PL + G ADL++HV F+ L++ A Y ++ QG L Sbjct: 238 EWDS-DGDSLQALKDHRKIDPLTHCGTADLTAHVSFKDLTNAASKY-AKVSSTIPQGILL 295 Query: 303 EGLGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM-P 359 E LGI QRA +L K + K + + + KRL MG LFK + + E +L Sbjct: 296 ERLGITQRAQTLAKNMSGKKLENHISAHKRLTH----PDEMGSLFKAIAIIPENTDLPAG 351 Query: 360 F 360 F Sbjct: 352 F 352 >gi|297815160|ref|XP_002875463.1| hypothetical protein ARALYDRAFT_905140 [Arabidopsis lyrata subsp. lyrata] gi|297321301|gb|EFH51722.1| hypothetical protein ARALYDRAFT_905140 [Arabidopsis lyrata subsp. lyrata] Length = 471 Score = 280 bits (716), Expect = 2e-73, Method: Composition-based stats. Identities = 121/398 (30%), Positives = 202/398 (50%), Gaps = 41/398 (10%) Query: 2 ENKLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 E +L++ + ++IK + G ++V +Y + +P+ G+Y + FGA GDF+T+PE+SQ+FG Sbjct: 75 ETELVKHLKSIIKFRGGPISVAEYMEEVLTNPKAGFYMNRDVFGAQGDFITSPEVSQMFG 134 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 EM+ ++ +C WEQ G P V LVELGPGRG +M D+LR K + L I++VE S Sbjct: 135 EMIGVWTVCLWEQMGRPERVNLVELGPGRGTLMADLLRGTSKFRNFT-ESLHIHLVECSP 193 Query: 121 RLTLIQKKQLASYGD--------------KINWYTSLADVPLG-FTFLVANEFFDSLPIK 165 L +Q + L + ++W+ +L +VP G T ++A+EF+D+LP+ Sbjct: 194 ALQKLQHQNLKCIDESSLEKKVISSLAGTPVHWHATLEEVPSGVPTLIIAHEFYDALPVH 253 Query: 166 QFVMTEHGIRERMIDIDQHDSLVFNIGDHEIK--------SNFLTCSDYFLGAIFENSPC 217 QF + G E+M+D+ + F + + T + E SP Sbjct: 254 QFQKSSRGWCEKMVDVGEDSKFHFVLSPQPTPAALYLMKRCTWATPEEREKLEHVEISPK 313 Query: 218 RDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVD 277 Q ++ R+ DGG A++IDYG + D+LQA++ H +V+ L +PG ADLS++VD Sbjct: 314 SMDLTQEMAKRIGSDGGGALIIDYGMN-EIISDSLQAIRKHKFVNILDDPGSADLSAYVD 372 Query: 278 FQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLMK--QTARKDILLDSVKRLVS 333 F + A + ++G TQ +FL LGI R +L++ + + L +LV Sbjct: 373 FPSIKHSAEEASENVSVHGPMTQSQFLGSLGINFRVDALLQNCNDEQAESLRAGYWQLVG 432 Query: 334 TSAD----------KKSMGELFKILVVSHEKVELM-PF 360 MG + + + ++ + PF Sbjct: 433 DGEAPFWEGPDEQTPIGMGTRYLAMTIVNKNQGIPAPF 470 >gi|217977318|ref|YP_002361465.1| protein of unknown function DUF185 [Methylocella silvestris BL2] gi|217502694|gb|ACK50103.1| protein of unknown function DUF185 [Methylocella silvestris BL2] Length = 369 Score = 280 bits (716), Expect = 2e-73, Method: Composition-based stats. Identities = 132/362 (36%), Positives = 201/362 (55%), Gaps = 9/362 (2%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 N L + +I ++G ++V++Y AL + P +GYY T GA GDF+TAPEISQ+FGE+ Sbjct: 2 NPLETLLQEMILESGPISVERYMALALGHPVYGYYRTHVAVGAEGDFITAPEISQMFGEL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + ++ + W G P ++LVELGPGRG +M D LR + + PDF +S+++VE S L Sbjct: 62 IGLWAVEVWRLMGAPKELKLVELGPGRGTLMADALRAVK-IAPDFRDAISVHLVEISLPL 120 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 Q+ L G KI W+ S+ +VP G +ANEFFD+LP++ +V + G RER I +D Sbjct: 121 REKQRAALEGQGIKIVWHASVDEVPPGPAIFIANEFFDALPVRHYVHRDGGWRERQIGVD 180 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 + L F + + I E R M + R+ GG + IDYG Sbjct: 181 ESGRLFFGVSGA---HESAIAAKGEPDDILEVGAGAARLMTQLGVRVVTQGGAVLAIDYG 237 Query: 243 YLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 Y + G+TLQA++ H +V PL +PG+ADL++HV+F L+ A ++G TQG FL Sbjct: 238 YEEPARGETLQAMRAHKFVDPLESPGEADLTAHVNFSALARAARGAGAAVHGPVTQGDFL 297 Query: 303 EGLGIWQRAFSLM--KQTARKDILLDSVKRLVSTSADKK---SMGELFKILVVSHEKVEL 357 LGI++RA +L A++ + +++RL M LFK+L V+ + + Sbjct: 298 ARLGIFERASALERAAAPAQRAAINSALERLAGEGLGFDRATDMARLFKVLAVTRREFDP 357 Query: 358 MP 359 P Sbjct: 358 PP 359 >gi|71082800|ref|YP_265519.1| cyclopropane-fatty-acyl-phospholipid synthase [Candidatus Pelagibacter ubique HTCC1062] gi|71061913|gb|AAZ20916.1| possible cyclopropane-fatty-acyl-phospholipid synthase [Candidatus Pelagibacter ubique HTCC1062] Length = 347 Score = 280 bits (715), Expect = 3e-73, Method: Composition-based stats. Identities = 113/353 (32%), Positives = 179/353 (50%), Gaps = 11/353 (3%) Query: 12 LIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAW 71 IK N T+D++ + + GYY NPFG GDF+T+P IS +F EM+AI+++ W Sbjct: 2 KIKNNQSFTLDKFIEESLYNKTSGYYMKKNPFGKKGDFITSPNISVLFSEMIAIWVVSFW 61 Query: 72 EQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLA 131 + G P L+ELG G G MM ++ K + F + I ++E S+ L K+++ Sbjct: 62 QNLGCPKKFNLIELGAGNGEMMKVLVNTFEKFQ-IFKNSCHIKILERSKLLRK--KQKIN 118 Query: 132 SYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFN- 190 I W L+++ +ANEFFD+LPIKQF+ E ER + + F+ Sbjct: 119 INKKNIQWLNDLSELDNSPCIFLANEFFDALPIKQFIKKERKWFERHVRFFNNKFEYFDV 178 Query: 191 IGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGD 250 D E N + E SP ++ I +++ + G ++IDY Y ++ + Sbjct: 179 PFDMEKFVNKIKFKITKQQNFIEYSPQSTEYLEIIFNKIKRNNGGILIIDYAYTDKKMKN 238 Query: 251 TLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQR 310 TLQAV H Y L G +D++ ++ F L+ I + TTQG+FL LGI +R Sbjct: 239 TLQAVSKHKYCDVLKGFGNSDITYNLSFSLLNRIVKELSSLTSMNTTQGEFLTKLGILER 298 Query: 311 AFSLMKQT--ARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVEL-MPF 360 A L K+ + K + +KRL+ DK MGELFK+++++ K + + F Sbjct: 299 AEILSKKMLFSEKADIYFRIKRLI----DKNQMGELFKVMLITTHKNKFKLGF 347 >gi|220925101|ref|YP_002500403.1| hypothetical protein Mnod_5254 [Methylobacterium nodulans ORS 2060] gi|219949708|gb|ACL60100.1| protein of unknown function DUF185 [Methylobacterium nodulans ORS 2060] Length = 367 Score = 279 bits (713), Expect = 5e-73, Method: Composition-based stats. Identities = 133/361 (36%), Positives = 184/361 (50%), Gaps = 5/361 (1%) Query: 1 ME-NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIF 59 M L+ ++ LI +NG + V+ Y ALC+ P FGYY T +P GA GDF TAPEISQIF Sbjct: 1 MSPTPLLAELRALIAENGPIPVEHYMALCLGHPRFGYYRTRDPLGAAGDFTTAPEISQIF 60 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 GE+L ++ W + G PS RLVELGPGRG +M D LR I P F + +++VETS Sbjct: 61 GELLGLWAASVWHEMGRPSPCRLVELGPGRGTLMADALRAIRTALPAFAEAVDLHLVETS 120 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMI 179 L Q+++LA G I W+ + DVP G ++ANEFFD+LP++Q+ T G R + Sbjct: 121 PSLRAAQRERLAPIGRPIAWHDRVEDVPAGPLLILANEFFDALPVRQYERTARGWCMRRV 180 Query: 180 DIDQHD-SLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 + L F + + + GA+ M+ ++ RL DGG + Sbjct: 181 GLAADGTGLAFGLDPDPVPDLAVAA---PEGAVLTVPSVALALMRILAGRLVRDGGALLA 237 Query: 239 IDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQ 298 IDYG + DTLQAV H + L PG+ DL++ VDF L+ A I+G Q Sbjct: 238 IDYGEAGLGLTDTLQAVSRHRRIGVLDAPGETDLTAQVDFGGLARAASEAGAAIHGPVMQ 297 Query: 299 GKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM 358 FL LG+ R L + + T MG LFK+L VS + + Sbjct: 298 RDFLLALGLGSRVERLSARARPDQAAAIAAGAARLTDDAPTGMGRLFKVLGVSGPGLPSL 357 Query: 359 P 359 P Sbjct: 358 P 358 >gi|114569311|ref|YP_755991.1| hypothetical protein Mmar10_0760 [Maricaulis maris MCS10] gi|114339773|gb|ABI65053.1| protein of unknown function DUF185 [Maricaulis maris MCS10] Length = 366 Score = 278 bits (711), Expect = 1e-72, Method: Composition-based stats. Identities = 119/364 (32%), Positives = 183/364 (50%), Gaps = 13/364 (3%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLA 64 + + + I+ G +++ + + DP G+Y+T +P GAV DF+TAPEISQ+FGE++ Sbjct: 2 IADTLRDRIRSGGPISIAAFMTEALFDPRHGFYATKDPIGAVADFITAPEISQMFGELIG 61 Query: 65 IFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTL 124 + W G P+ +L+E+GPGRG MM D LR + P F I ++E S L Sbjct: 62 LVAAQTWLDMGRPAAFKLIEMGPGRGTMMSDALRAARTV-PGFMDATEIMLIEASAALKA 120 Query: 125 IQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDI--- 181 +Q + L G +I W L G F+V NEF D LP++Q + + ER++ + Sbjct: 121 VQAQTLGPSGAQIRWIDRLDAAAPGPCFIVGNEFLDCLPVRQALRHKGEWHERLVGLAPT 180 Query: 182 ---DQHDSLVFNIGDH---EIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGT 235 D VF +G + D GA+ E P + + +++R A G Sbjct: 181 YGNDSESDFVFVLGPPLGQDTDLIPERLRDAEDGALVELRPGDRQVVDQLAERFANQPGR 240 Query: 236 AIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGL 295 A+ IDYG S +GDTLQA++ H PL +PG ADL++ VDF+ L L +G Sbjct: 241 ALFIDYGPATSEIGDTLQAIRAHKKQPPLQDPGTADLTARVDFESLMQAGRTAGLTAHGP 300 Query: 296 TTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKV 355 TQG++L LG+ RA L + K + R + D + MGELFK++ + ++ Sbjct: 301 QTQGQWLRDLGLEARAAVLSQSQPGK---RSEIARQIWRLTDTEQMGELFKLVCLDSAEL 357 Query: 356 ELMP 359 P Sbjct: 358 PPPP 361 >gi|42565270|ref|NP_189511.2| unknown protein [Arabidopsis thaliana] gi|95147286|gb|ABF57278.1| At3g28700 [Arabidopsis thaliana] gi|332643957|gb|AEE77478.1| uncharacterized protein [Arabidopsis thaliana] Length = 471 Score = 278 bits (711), Expect = 1e-72, Method: Composition-based stats. Identities = 121/398 (30%), Positives = 204/398 (51%), Gaps = 41/398 (10%) Query: 2 ENKLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 +++L++ + ++IK + G ++V +Y + +P+ G+Y + FGA GDF+T+PE+SQ+FG Sbjct: 75 DSELVKHLKSIIKFRGGPISVAEYMEEVLTNPKAGFYMNRDVFGAQGDFITSPEVSQMFG 134 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 EM+ ++ +C WEQ G P V LVELGPGRG +M D+LR K K L I++VE S Sbjct: 135 EMIGVWTVCLWEQMGRPERVNLVELGPGRGTLMADLLRGTSKFKNFT-ESLHIHLVECSP 193 Query: 121 RLTLIQKKQLASYGD--------------KINWYTSLADVPLG-FTFLVANEFFDSLPIK 165 L +Q + L + ++W+ +L +VP G T ++A+EF+D+LP+ Sbjct: 194 ALQKLQHQNLKCTDESSSEKKAVSSLAGTPVHWHATLQEVPSGVPTLIIAHEFYDALPVH 253 Query: 166 QFVMTEHGIRERMIDIDQHDSLVFNIGDHEIK--------SNFLTCSDYFLGAIFENSPC 217 QF + G E+M+D+ + F + + T + E SP Sbjct: 254 QFQKSTRGWCEKMVDVGEDSKFRFVLSPQPTPAALYLMKRCTWATPEEREKMEHVEISPK 313 Query: 218 RDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVD 277 Q ++ R+ DGG A++IDYG + + D+LQA++ H +V+ L +PG ADLS++VD Sbjct: 314 SMDLTQEMAKRIGSDGGGALIIDYGMN-AIISDSLQAIRKHKFVNILDDPGSADLSAYVD 372 Query: 278 FQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLMK--QTARKDILLDSVKRLVS 333 F + A + ++G TQ +FL LGI R +L++ + + L +LV Sbjct: 373 FPSIKHSAEEASENVSVHGPMTQSQFLGSLGINFRVDALLQNCNDEQAESLRAGYWQLVG 432 Query: 334 TSAD----------KKSMGELFKILVVSHEKVELM-PF 360 MG + + + ++ + PF Sbjct: 433 DGEAPFWEGPNEQTPIGMGTRYLAMSIVNKNQGIPAPF 470 >gi|321461519|gb|EFX72550.1| hypothetical protein DAPPUDRAFT_308204 [Daphnia pulex] Length = 430 Score = 277 bits (709), Expect = 1e-72, Method: Composition-based stats. Identities = 133/395 (33%), Positives = 192/395 (48%), Gaps = 39/395 (9%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGE 61 NKL+++I I G +TV Y + +P GYY + + FG GDF+T+PEISQ+FGE Sbjct: 38 SNKLLKQIEARILATGPITVADYMKEVLTNPSAGYYMSKDVFGEKGDFITSPEISQMFGE 97 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 ++AI+L+ W + G P+ ++VELGPGRG +M D+LRV K K S+ +VE S Sbjct: 98 LIAIWLMNEWTKCGKPTPFQIVELGPGRGTLMSDVLRVFSKFKMAESD-FSVSLVEVSPY 156 Query: 122 LTLIQKKQLAS----------------------YGDKINWYTSLADVPLGFTFLVANEFF 159 L+ IQ+K L YG + WY ++D+P FT +A+EFF Sbjct: 157 LSQIQEKCLCKTQNEKISELPIDSQHYKESKSLYGSPVRWYNHISDLPRTFTLFLAHEFF 216 Query: 160 DSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRD 219 D+LPI + V + G RE +ID+++ +S + + E L C + + E SP Sbjct: 217 DALPIHKLVKVDQGWREVLIDLNREESTLRYVLSRERTPASLYCRENEVRKDLEISPQSS 276 Query: 220 REMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 +Q ++ R+ DGG +VIDYG+ + DT + K H PLV PG ADL++ VDF Sbjct: 277 VMIQVMASRIHQDGGIGLVIDYGHEGDKT-DTFRGFKNHKLHDPLVEPGTADLTADVDFS 335 Query: 280 RLSSIAILYK-------LYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLV 332 L A K G Q FL +GI R L+ R + L + +K Sbjct: 336 ALKWAATHPKSVEDWKANLTFGTVDQKDFLTRMGINMRLEKLLASC-RNEKLANDLKSAH 394 Query: 333 STSADKKSMGELFKILVVSHE-------KVELMPF 360 D MG FK L + K F Sbjct: 395 RMLTDPIEMGSKFKFLALYPAVMEKILLKYPPPGF 429 >gi|91762777|ref|ZP_01264742.1| possible cyclopropane-fatty-acyl-phospholipid synthase [Candidatus Pelagibacter ubique HTCC1002] gi|91718579|gb|EAS85229.1| possible cyclopropane-fatty-acyl-phospholipid synthase [Candidatus Pelagibacter ubique HTCC1002] Length = 347 Score = 277 bits (709), Expect = 2e-72, Method: Composition-based stats. Identities = 113/353 (32%), Positives = 178/353 (50%), Gaps = 11/353 (3%) Query: 12 LIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAW 71 IK N T+D++ + + GYY NPFG GDF+T+P IS +F EM+AI+++ W Sbjct: 2 KIKNNQSFTLDKFIEESLYNETSGYYMKKNPFGKKGDFITSPNISVLFSEMIAIWVVSFW 61 Query: 72 EQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLA 131 + G P L+ELG G G MM ++ K + F + I ++E S+ L K+++ Sbjct: 62 QNLGCPKKFNLIELGAGNGEMMKVLVNTFEKFQ-IFKNSCHIKILERSKLLRK--KQKIN 118 Query: 132 SYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFN- 190 I W L+++ +ANEFFD+LPIKQF+ E ER + + F+ Sbjct: 119 INKKNIQWLNDLSELDNSPCIFLANEFFDALPIKQFIKKERKWFERHVRFFNNKFEYFDV 178 Query: 191 IGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGD 250 D E N + E SP ++ I +++ + G ++IDY Y ++ + Sbjct: 179 PFDIEKFENKIKFKITNQQNFIEYSPQSTEYLKIIFNKIKRNNGGILIIDYAYTDKKMKN 238 Query: 251 TLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQR 310 TLQAV H Y L G +D++ ++ F L+ I + TTQG+FL LGI +R Sbjct: 239 TLQAVSKHKYCDVLKGFGNSDITYNLSFSLLNRIVKELSSLTSMNTTQGEFLTKLGILER 298 Query: 311 AFSLMKQT--ARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVEL-MPF 360 A L K+ + K + +KRL+ DK MGELFK ++++ K + + F Sbjct: 299 AEILSKKMLFSEKADIYFRIKRLI----DKNQMGELFKAMLITTNKNKFKLGF 347 >gi|254292683|ref|YP_003058706.1| hypothetical protein Hbal_0307 [Hirschia baltica ATCC 49814] gi|254041214|gb|ACT58009.1| protein of unknown function DUF185 [Hirschia baltica ATCC 49814] Length = 361 Score = 276 bits (705), Expect = 4e-72, Method: Composition-based stats. Identities = 127/361 (35%), Positives = 183/361 (50%), Gaps = 12/361 (3%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L ++ +LIK +G +++ + L + D + GYY+T G DF TAPEISQIFGEML Sbjct: 9 TLEERLKSLIKTDGPISLSVFMQLALFDRKQGYYATRPGLGK--DFTTAPEISQIFGEML 66 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 ++ W+Q G PS L+E+GPGRGIMM DI R K+ Y++E S L Sbjct: 67 GVWAAHEWQQMGCPSPFYLIEMGPGRGIMMKDIWRATAKIAGFH-DAAHPYLIEPSPSLR 125 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 IQ K+L++ W L D+P G + ++ANE D LPI+QF+ + ER I +D+ Sbjct: 126 KIQAKRLSAL-KNPQWVNELTDIPNGPSIILANEVLDCLPIRQFIRQDGAWCERKIGLDK 184 Query: 184 HDSLVFNIGDHEIKSNFLTCSDY--FLGAIFENSPCRDREMQSISDRLACDGGTAIVIDY 241 + + + I T + + + E S D ++ ISDRL D A+ IDY Sbjct: 185 NGNFMLGISSPISTETENTPDNLSDMVQDVVEISSALDAFIELISDRLKHDNSRALFIDY 244 Query: 242 GYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKF 301 G S GDTL+A V PL N G DL++ VDF ++ A + L I G + QG F Sbjct: 245 GPANSTPGDTLRAYSDGKQVDPLSNVGNVDLTADVDFAKVVQSAKTHGLEIAGPSPQGWF 304 Query: 302 LEGLGIWQRAFSLMKQTARK-DILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE-LMP 359 L LG +R +L+ Q K D + + R MGE F+ L +S + + Sbjct: 305 LNALGGVERVNALINQNPDKIDEISEGAMR----IMAPDQMGERFQALCLSTKDLPSPAG 360 Query: 360 F 360 F Sbjct: 361 F 361 >gi|254420131|ref|ZP_05033855.1| conserved hypothetical protein [Brevundimonas sp. BAL3] gi|196186308|gb|EDX81284.1| conserved hypothetical protein [Brevundimonas sp. BAL3] Length = 361 Score = 276 bits (705), Expect = 5e-72, Method: Composition-based stats. Identities = 119/358 (33%), Positives = 172/358 (48%), Gaps = 6/358 (1%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L ++ I G MTV Y C+ DP+ GYY++ G GDF+TAP +SQ+FGE+ Sbjct: 5 ETLKTRLAREIALTGPMTVADYVTRCLHDPKGGYYASRPALGEGGDFITAPLVSQMFGEL 64 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + ++ + W + G P RLVE+GPG G +M D LR L P F + ++E S L Sbjct: 65 IGLWAVETWNRLGAPERFRLVEVGPGDGTLMSDALRAAR-LVPGFLEACDLILIEPSAPL 123 Query: 123 TLIQKKQLASYGDKINWYTSL-ADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 +Q K LA W L L+ANE D LP +QFV T+ G ER I + Sbjct: 124 RDLQAKALAGADLSPRWVRDLTRIETDAPVILIANEVLDCLPARQFVRTDGGWAERRIGV 183 Query: 182 DQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDY 241 + L+F + D G +FE S + + ++ L G A++IDY Sbjct: 184 TDDNDLIFGLTAIS-GGFEKPAFDIEPGEVFEISEQQAIFGRDLAGLLKAASGAALLIDY 242 Query: 242 GYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKF 301 G + GDTLQA++ H V PL +PG+ADL+ DF R+ A+ + G TQG+F Sbjct: 243 GRARPEAGDTLQALRRHQKVDPLDSPGEADLTQWADFPRVLEAAVRAGADVTGCLTQGEF 302 Query: 302 LEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMP 359 L LGI RA L + + R + + MG LFK + + ++P Sbjct: 303 LRRLGIEARAERLKAGRPDAAPV---IDRQLHRLTAEDQMGSLFKAAAIFSPRALIVP 357 >gi|304321295|ref|YP_003854938.1| hypothetical protein PB2503_08704 [Parvularcula bermudensis HTCC2503] gi|303300197|gb|ADM09796.1| hypothetical protein PB2503_08704 [Parvularcula bermudensis HTCC2503] Length = 362 Score = 274 bits (701), Expect = 1e-71, Method: Composition-based stats. Identities = 132/361 (36%), Positives = 183/361 (50%), Gaps = 9/361 (2%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L + I K G ++V Y A + PEFGYY T +P GA GDF TAPEISQ+FGEM Sbjct: 4 TPLAPLLAKRIDKEGPLSVGAYMAEALGHPEFGYYMTRDPLGAEGDFTTAPEISQLFGEM 63 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + FL+ +W P V L E GPGRG +M D+LRV L+PDF ++ETS L Sbjct: 64 IGGFLLASWAAMAAPRPVTLAEFGPGRGTLMADMLRVAK-LRPDFLEAAEAVLLETSPVL 122 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVM--TEHGIRERMID 180 Q++ L+S + W A +P G L+ANEFFD LPI+QFV T RER++ Sbjct: 123 RSRQRETLSSPPLPLRWIEDAAALPSGPLLLIANEFFDCLPIRQFVRAGTGPLFRERLVT 182 Query: 181 I-DQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVI 239 + L + + + + I E ++ + RL G ++I Sbjct: 183 TGEMPGQLAYCLSEETYSPPPGAAAHGPPEEIVETCAPAHALIEVLIPRLTAAPGLVLII 242 Query: 240 DYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQG 299 DYG + GDT QAVK H + PL PG+ADL++HV+F L+ A L + G +Q Sbjct: 243 DYGAGRRGSGDTFQAVKNHQFHHPLALPGEADLTAHVNFAALADTARRGGLGVYGPISQS 302 Query: 300 KFLEGLGIWQRAFSL-MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM 358 +FL LG+ RA L A K + V+RLV MG LF++L ++ + Sbjct: 303 RFLTALGLPARADQLAGANPAAKAAIAQQVERLVGQ----DQMGTLFQVLCLTSPGSPVP 358 Query: 359 P 359 P Sbjct: 359 P 359 >gi|225452686|ref|XP_002276826.1| PREDICTED: hypothetical protein [Vitis vinifera] gi|296087782|emb|CBI35038.3| unnamed protein product [Vitis vinifera] Length = 483 Score = 274 bits (701), Expect = 1e-71, Method: Composition-based stats. Identities = 129/399 (32%), Positives = 204/399 (51%), Gaps = 44/399 (11%) Query: 2 ENKLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 +++L++ + +IK + G ++V +Y + +P+ G+Y + FG GDF+T+PE+SQ+FG Sbjct: 83 DSELVKHLKGIIKFRGGPISVAEYMEEVLTNPKAGFYINRDVFGTEGDFITSPEVSQMFG 142 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 EM+ ++++C WEQ G PS V LVELGPGRG +M D+LR K K +F L I+MVE S Sbjct: 143 EMIGVWVMCLWEQMGQPSKVNLVELGPGRGTLMADLLRGTSKFK-NFIESLQIHMVECSP 201 Query: 121 RLTLIQKKQLASYGD------------------KINWYTSLADVPLG-FTFLVANEFFDS 161 L +Q K L + ++W+ +L VP G T ++A+EF+D+ Sbjct: 202 TLQKLQHKNLKCVDEDSHNGNVDKRTISMLTGTPVSWHAALEQVPSGLPTIIIAHEFYDA 261 Query: 162 LPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIK--------SNFLTCSDYFLGAIFE 213 LP+ QF G E+MID+ + S F + + + E Sbjct: 262 LPVHQFQRASRGWCEKMIDVAEDSSFRFVLSPQSTPAKLYLMERCKWAGAEEIAKLDQIE 321 Query: 214 NSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLS 273 P +I+ R++ DGG A+VIDYG L V D+LQA++ H +V+ L NPG ADLS Sbjct: 322 VCPKAIELTHTIAKRISSDGGGALVIDYG-LDGIVSDSLQAIRKHKFVNILDNPGSADLS 380 Query: 274 SHVDFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLMKQ--TARKDILLDSVK 329 ++VDF + A + ++G TQ +FL LGI R +L+K + + L Sbjct: 381 AYVDFASIRHSAEEASDDVIVHGPITQSQFLGSLGINFRVEALLKNCTDEQAESLRTGYW 440 Query: 330 RLVSTSAD----------KKSMGELFKILVVSHEKVELM 358 RLV MG + ++ + ++K + Sbjct: 441 RLVGEGEAPFWEGPDDQVPIGMGTRYLVMAIVNKKQGIP 479 >gi|149201041|ref|ZP_01878016.1| hypothetical protein RTM1035_15487 [Roseovarius sp. TM1035] gi|149145374|gb|EDM33400.1| hypothetical protein RTM1035_15487 [Roseovarius sp. TM1035] Length = 353 Score = 274 bits (701), Expect = 1e-71, Method: Composition-based stats. Identities = 134/357 (37%), Positives = 191/357 (53%), Gaps = 13/357 (3%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLA 64 L + I + G M++ Y A+C+ PEFGYY+T +PFGA GDF+TAPEISQ+FGE+L Sbjct: 4 LRAHFLARIAEAGPMSLADYMAVCLMHPEFGYYATRDPFGARGDFITAPEISQMFGELLG 63 Query: 65 IFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTL 124 + L W G P+ L ELGPGRG +M D+LR ++ P F +++VE S L Sbjct: 64 LCLAQVWLDQGRPARFLLAELGPGRGTLMADVLRATQRV-PGFREAAEVHLVEGSAVLRA 122 Query: 125 IQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQH 184 Q++ +A + W+ + +P G +L+ANEFFD+LPI+QF + G RER++ Sbjct: 123 AQRRAIAG---DVIWHERVESLPEGPLYLLANEFFDALPIRQFQRSGEGWRERVVGQSAG 179 Query: 185 DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYL 244 L+ G + D G I E M I R+ GG A+++DYG Sbjct: 180 QLLLGLGGPVAPPALAERLVDTREGDIVETCGPAAAVMAEIGARIEGQGGAALIVDYGDW 239 Query: 245 QSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEG 304 +S +GDT QA+K H V PL PG ADL++HVDF+ L+ A LT QG FLE Sbjct: 240 RS-LGDTFQALKAHQPVDPLAEPGAADLTAHVDFEALALAAAPA--LHTRLTPQGVFLER 296 Query: 305 LGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMP 359 LGI RA +L + + + L + +RL + MG L K++ + E P Sbjct: 297 LGIAARAEALARNLSGPALETHLAAYQRLTGA----EEMGTLLKVMGLYPEGAAPPP 349 >gi|190571091|ref|YP_001975449.1| hypothetical protein WPa_0686 [Wolbachia endosymbiont of Culex quinquefasciatus Pel] gi|213019611|ref|ZP_03335417.1| conserved hypothetical protein [Wolbachia endosymbiont of Culex quinquefasciatus JHB] gi|190357363|emb|CAQ54794.1| conserved hypothetical protein [Wolbachia endosymbiont of Culex quinquefasciatus Pel] gi|212995033|gb|EEB55675.1| conserved hypothetical protein [Wolbachia endosymbiont of Culex quinquefasciatus JHB] Length = 347 Score = 274 bits (700), Expect = 2e-71, Method: Composition-based stats. Identities = 119/355 (33%), Positives = 191/355 (53%), Gaps = 17/355 (4%) Query: 3 NKLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGE 61 N ++ I LI K G +++ + + + E+GYY P G GDF+TAPEISQ+FGE Sbjct: 4 NNMLTYIHELIDKSQGSISISDFISAALYHKEYGYYMNKLPLGKDGDFITAPEISQLFGE 63 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 +A++++ WE+ G PS LVELGPG+G ++ DI+RV K FS ++I++VE S Sbjct: 64 TIAVWIMNTWEKLGKPSKFSLVELGPGKGTLIHDIIRVTKKYSCF-FSSMNIHLVEISPL 122 Query: 122 LTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 L IQK++L INW+T + ++P T ANE FD+LPI QF+ + E + Sbjct: 123 LQKIQKEKLKGL--DINWHTDVDNLPNQPTIFFANELFDALPIDQFIYRDEQWYENRVIK 180 Query: 182 DQHDSLVFNIGD--HEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVI 239 SL ++ + F GA+ E ++ + +++ +GG A++I Sbjct: 181 QDDGSLSLSLQCLTRPKTGFQTGMTGGFNGAVVEVCLAGIEILKKLENKIVNNGGAALII 240 Query: 240 DYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQG 299 DYGY+ TLQ+++ H Y + L N G +D+++ V+FQ L + TQ Sbjct: 241 DYGYVYPEYKSTLQSIRQHKYTNFLENVGNSDITALVNFQSLKDSLRHVNCE---ILTQR 297 Query: 300 KFLEGLGIWQRAFSLMKQTA--RKDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 +FL GI +RA +LMK + +K+ + RL ++MG LFK++++ H Sbjct: 298 EFLHLFGIKERAQALMKNASNEQKNKIFSEFLRLT------ENMGTLFKVMLIHH 346 >gi|315498178|ref|YP_004086982.1| hypothetical protein Astex_1155 [Asticcacaulis excentricus CB 48] gi|315416190|gb|ADU12831.1| protein of unknown function DUF185 [Asticcacaulis excentricus CB 48] Length = 357 Score = 273 bits (699), Expect = 3e-71, Method: Composition-based stats. Identities = 116/358 (32%), Positives = 178/358 (49%), Gaps = 13/358 (3%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L +++ IK G +T+ Y C+ DP+ GYY+T G GDF+TAP ++Q+FGE L Sbjct: 3 SLKTRLIEQIKLEGPLTIADYMWACLFDPQEGYYATRPALGEAGDFITAPLVTQMFGERL 62 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 A++ + AW+ G P+ +R++E+GPG G +M D+LR L P F I ++E S+ L Sbjct: 63 ALWAMQAWQDMGAPAKIRVLEIGPGDGTLMGDLLRTFRAL-PAFVKAAEIGLIEPSQPLR 121 Query: 124 LIQKKQLASYGDKINWYTSLAD-VPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 +Q +L + Y SL ++ANE D LP +QF +T G ER + + Sbjct: 122 ALQTDRLGE----VLHYDSLDHVPTDAPLLIIANEVLDCLPARQFQLTPDGWFERCVGMH 177 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 IG +F G + E S ++R ++++ + G A+ IDYG Sbjct: 178 ---EGELVIGLVPAPQDFKAPFAAETGQVCEVSLAQNRLIEAVGALIHEATGAALFIDYG 234 Query: 243 YLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 + GDTLQA+ H PL PG DL+ DF L+ A+ L ++ +T QG FL Sbjct: 235 RDRPEPGDTLQALYRHEKTDPLAEPGAHDLTQWADFPSLAITALNMGLGVSQITPQGVFL 294 Query: 303 EGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVS-HEKVELMP 359 + LGI +R L + + + R V + MGELFK+L +S + L Sbjct: 295 QRLGIIERFDELRAENPEETD---RLARQVHRLIAGEEMGELFKVLALSYPRDLPLAG 349 >gi|72013976|ref|XP_781178.1| PREDICTED: hypothetical protein [Strongylocentrotus purpuratus] gi|115958909|ref|XP_001182552.1| PREDICTED: hypothetical protein [Strongylocentrotus purpuratus] Length = 384 Score = 273 bits (698), Expect = 3e-71, Method: Composition-based stats. Identities = 119/376 (31%), Positives = 182/376 (48%), Gaps = 37/376 (9%) Query: 9 IVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLI 68 + I+ G +TV Y + P GYY + FG GDF+T+PEISQ+FGE++A+++I Sbjct: 4 LKQRIRTLGAITVADYMKEVLTSPVGGYYMQGDVFGERGDFITSPEISQMFGELIALWII 63 Query: 69 CAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKK 128 W + G P ++LVELGPGRG + D+LRV + LS+++VE S ++ +Q K Sbjct: 64 HEWSRLGCPRPLQLVELGPGRGTLADDVLRVFKQFPQLPLDTLSLHLVEVSPGMSDVQHK 123 Query: 129 QLASYGD-------------------------KINWYTSLADVPLGFTFLVANEFFDSLP 163 L + ++WYTSL+ VP GFT +A+EFFD+LP Sbjct: 124 TLTGHQQRLKEEVSGGIVDGIPYRSASVKGGIPVSWYTSLSQVPNGFTCFLAHEFFDALP 183 Query: 164 IKQFVMTEHGIRERMIDIDQH----DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRD 219 I +F + RE M+D+D + L F + ++ E P Sbjct: 184 IHKFQKSSSRWREIMVDVDDDSNSPNDLRFVLSPAPTPASNSFIQASETRDHVEVCPTAA 243 Query: 220 REMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 Q ++ R+ DGG A++IDYG+ ++ DT + K H L+ PG ADL++ VDF Sbjct: 244 VIAQEMASRIYSDGGMALIIDYGHDGTKT-DTFRGFKDHKLHDVLIEPGTADLTADVDFA 302 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQ--TARKDILLDSVKRLVSTSAD 337 L + ++ G QG FL+ +GI R +L+K + L+ K L Sbjct: 303 YLRRMVGD-RVATYGPIHQGLFLQMMGIDTRLKALLKATPSEEHTNLISGYKMLTE---- 357 Query: 338 KKSMGELFKILVVSHE 353 MGE FK + + Sbjct: 358 PDQMGERFKFFSILPQ 373 >gi|217074438|gb|ACJ85579.1| unknown [Medicago truncatula] Length = 449 Score = 272 bits (696), Expect = 5e-71, Method: Composition-based stats. Identities = 129/402 (32%), Positives = 206/402 (51%), Gaps = 45/402 (11%) Query: 2 ENKLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 +++L++ + +IK + G +++ +Y + + +P+ GYY + FGA GDF+T+PE+SQ+FG Sbjct: 49 DSELVKHLKGIIKFRGGPISLGEYMSEVLTNPKAGYYINRDIFGAQGDFITSPEVSQMFG 108 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 EM+ ++++C WEQ G P V LVELGPGRG +M D+LR K K L +++VE S Sbjct: 109 EMVGVWVMCLWEQMGRPERVNLVELGPGRGTLMADLLRGASKFKNFT-ESLHVHLVECSP 167 Query: 121 RLTLIQKKQLASYGD------------------KINWYTSLADVPLG-FTFLVANEFFDS 161 L +Q K L + ++W+ +L VP G T ++A+EFFD+ Sbjct: 168 ALKTLQHKNLKCVDEENADGDTDKRTVSSFVGTPVSWHATLEQVPSGSPTIIIAHEFFDA 227 Query: 162 LPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAI--------FE 213 LP+ QF G E+M+D+ + SL F + H + + E Sbjct: 228 LPVHQFQKGSRGWCEKMVDVAEDSSLHFVLSPHPTPATLYLLKRAKWAGVEEIAKFNQIE 287 Query: 214 NSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLS 273 P Q+I +R++ DGG A++IDYG V D+LQA++ H +V L +PG ADLS Sbjct: 288 ICPKAMDLTQTIVERISSDGGGALIIDYG-SDGVVSDSLQAIRKHRFVDLLDDPGSADLS 346 Query: 274 SHVDFQRLSSIAILYK--LYINGLTTQGKFLEGLGIWQRAFSLMKQ--TARKDILLDSVK 329 ++VDF + A + ++G TQ +FL LGI RA SL++ + + L Sbjct: 347 AYVDFASIRHSAEEASGEVSVHGPMTQSQFLGALGINFRAESLLQNCTEEQAESLRTGYW 406 Query: 330 RLVSTS----------ADKKSMGELFKILVVSHEKVELM-PF 360 RLV + MG +K + + + + PF Sbjct: 407 RLVGDGEAPFWEGADDSAPIGMGTRYKAMAIVDKNQGVPVPF 448 >gi|197106320|ref|YP_002131697.1| hypothetical protein PHZ_c2859 [Phenylobacterium zucineum HLK1] gi|196479740|gb|ACG79268.1| conserved hypothetical protein [Phenylobacterium zucineum HLK1] Length = 359 Score = 272 bits (695), Expect = 6e-71, Method: Composition-based stats. Identities = 127/361 (35%), Positives = 182/361 (50%), Gaps = 8/361 (2%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 M L K+ I G +TV QY C+ DP+FGYY+T G GDFVTAP +SQ+FG Sbjct: 1 MSRNLAEKLAAQIAAGGPLTVAQYMTACLHDPQFGYYATRPALGEGGDFVTAPLVSQMFG 60 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 E++ ++ +WE G P VRLVE+GPG G +M D+LR + P F +++VETSE Sbjct: 61 ELVGVWAAVSWELMGRPETVRLVEMGPGDGTLMGDVLRAAR-MAPGFLDAADVWLVETSE 119 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 L Q+++L + L ANE D LP++QFV T G E+++ Sbjct: 120 PLKARQRERLGDGPRWAASLAEVP--GEAPLILFANELLDCLPVRQFVRTATGWAEQVVG 177 Query: 181 IDQHDSLVFNIGDHEIKSNFLT-CSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVI 239 +D + + + T D GA+FE S ++ I R+ DGG A++I Sbjct: 178 LDDQGGSGGRLAFGRVATPAGTLLPDAREGAVFEQSAAQEALGSEIGARVVRDGGAALLI 237 Query: 240 DYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQG 299 DYG + GDTLQA++ H V PL PG+ADL+ H DF + + A + TQ Sbjct: 238 DYGRARPGFGDTLQALRRHERVDPLACPGEADLTVHADFPAVMAAAEGE-GAQAAILTQA 296 Query: 300 KFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMP 359 +FL LGI +RA +L++ K + V R ++ MGELFK + Sbjct: 297 EFLARLGIGERAEALVRARPDKAPV---VGRQLNRLVAADQMGELFKACCLHSPGWTPPA 353 Query: 360 F 360 F Sbjct: 354 F 354 >gi|195607964|gb|ACG25812.1| uncharacterized ACR, COG1565 family protein [Zea mays] Length = 500 Score = 272 bits (695), Expect = 6e-71, Method: Composition-based stats. Identities = 126/400 (31%), Positives = 203/400 (50%), Gaps = 43/400 (10%) Query: 2 ENKLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 E++L++ I ++IK ++G +++ +Y + +P+ G+Y + FG GDF+T+PE+SQ+FG Sbjct: 102 ESELVKHIKSIIKFRSGPISIAEYMEEVLTNPQSGFYINRDVFGESGDFITSPEVSQMFG 161 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 EM+ ++ +C WEQ G P+ V L+ELGPGRG ++ D+LR K LSI +VE S Sbjct: 162 EMIGVWAMCLWEQMGKPAKVNLIELGPGRGTLLADLLRGSAKFANFT-KALSINLVECSP 220 Query: 121 RLTLIQKKQLASYGDK---------------INWYTSLADVPLG-FTFLVANEFFDSLPI 164 L IQ L + + W+ SL VP G T ++A+EF+D+LPI Sbjct: 221 TLQKIQYNTLKCEDEHVGDGKRTVSKICGAPVCWHASLEQVPSGSPTIIIAHEFYDALPI 280 Query: 165 KQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI--------KSNFLTCSDYFLGAIFENSP 216 QF G E+M+DI + F + H + + + + E P Sbjct: 281 HQFQKASRGWCEKMVDIAEDSLFRFVLSPHPTASLIYLAKRCGWASSEELEKIEHIEVCP 340 Query: 217 CRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHV 276 + I+DR++ DGG A++IDYG V ++LQA++ H +V L +PG ADLS++V Sbjct: 341 KAMELTEQIADRISSDGGGALIIDYGKNG-IVSNSLQAIRKHKFVDILDDPGSADLSAYV 399 Query: 277 DFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLMKQ--TARKDILLDSVKRLV 332 DF + A + ++G TQ +FL LGI R +L++ + + L RLV Sbjct: 400 DFASIKRSAEEASDDISVHGPMTQSQFLGSLGINFRVEALLQNCTEEQAESLRTGYWRLV 459 Query: 333 STS-----------ADKKSMGELFKILVVSHEKV-ELMPF 360 A MG + + + ++K +PF Sbjct: 460 GDGEAPFWEGPEDQAAPVGMGTRYLAMAIVNKKQGTPIPF 499 >gi|242053431|ref|XP_002455861.1| hypothetical protein SORBIDRAFT_03g026410 [Sorghum bicolor] gi|241927836|gb|EES00981.1| hypothetical protein SORBIDRAFT_03g026410 [Sorghum bicolor] Length = 499 Score = 272 bits (695), Expect = 7e-71, Method: Composition-based stats. Identities = 128/399 (32%), Positives = 201/399 (50%), Gaps = 42/399 (10%) Query: 2 ENKLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 +++L++ I ++IK ++G +++ +Y + +P+ G+Y + FG GDF+T+PE+SQ+FG Sbjct: 102 DSELVKHIKSIIKFRSGPISIAEYMEEVLTNPQSGFYINRDVFGESGDFITSPEVSQMFG 161 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 EM+ I+ +C WEQ G P+ V L+ELGPGRG ++ D+LR K LSI +VE S Sbjct: 162 EMIGIWAMCLWEQMGKPAMVNLIELGPGRGTLLADLLRGSAKFVNFT-KALSINLVECSP 220 Query: 121 RLTLIQKKQLASYGDK---------------INWYTSLADVPLG-FTFLVANEFFDSLPI 164 L IQ L + I W+ SL VP G T ++A+EF+D+LPI Sbjct: 221 TLQKIQYNTLKCEDEHVDDGKRTVSKLCGAPICWHASLEQVPSGSPTIIIAHEFYDALPI 280 Query: 165 KQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCS--------DYFLGAIFENSP 216 QF G E+M+D+ + S F + H S + E P Sbjct: 281 HQFQKASRGWCEKMVDLAEDSSFRFVLSPHPTPSLIYLAKRSGWASSEELERIEHIEVCP 340 Query: 217 CRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHV 276 + I+DR++ DGG A++IDYG V D+LQA++ H +V L +PG ADLS++V Sbjct: 341 KAMELTEQIADRISSDGGGALIIDYGKNG-IVSDSLQAIRKHKFVDILDDPGSADLSAYV 399 Query: 277 DFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLMKQ--TARKDILLDSVKRLV 332 DF + A + ++G TQ +FL LGI R +L++ + + L RLV Sbjct: 400 DFASIKHSAEEASDDISVHGPMTQSQFLGSLGINFRVEALLQNCTEEQAESLRTGYWRLV 459 Query: 333 STSAD----------KKSMGELFKILVVSHEKV-ELMPF 360 MG + + + ++K +PF Sbjct: 460 GDGEAPFWEGPEDQTPIGMGTRYLAMAIVNKKQGTPIPF 498 >gi|226505940|ref|NP_001141575.1| hypothetical protein LOC100273691 [Zea mays] gi|194705134|gb|ACF86651.1| unknown [Zea mays] Length = 500 Score = 272 bits (695), Expect = 7e-71, Method: Composition-based stats. Identities = 127/400 (31%), Positives = 203/400 (50%), Gaps = 43/400 (10%) Query: 2 ENKLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 E++L++ I ++IK ++G +++ +Y + +P+ G+Y + FG GDF+T+PE+SQ+FG Sbjct: 102 ESELVKHIKSIIKFRSGPISIAEYMEEVLTNPQSGFYINRDVFGESGDFITSPEVSQMFG 161 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 EM+ ++ +C WEQ G P+ V L+ELGPGRG ++ D+LR K LSI +VE S Sbjct: 162 EMIGVWAMCLWEQMGKPAKVNLIELGPGRGTLLADLLRGSAKFANFT-KALSINLVECSP 220 Query: 121 RLTLIQKKQLASYGDK---------------INWYTSLADVPLG-FTFLVANEFFDSLPI 164 L IQ L + + W+ SL VP G T ++A+EF+D+LPI Sbjct: 221 TLQKIQYNTLKCEDEHVGDGKRTVSKICGAPVCWHASLEQVPSGSPTIIIAHEFYDALPI 280 Query: 165 KQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI--------KSNFLTCSDYFLGAIFENSP 216 QF G E+M+DI + F + H + + + + E P Sbjct: 281 HQFQKASRGWCEKMVDIAEDSLFRFVLSPHPTASLIYLAKRCGWASSEELEKIEHIEVCP 340 Query: 217 CRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHV 276 + I+DR++ DGG A++IDYG V D+LQA++ H +V L +PG ADLS++V Sbjct: 341 KAMELTEQIADRISSDGGGALIIDYGKNG-IVSDSLQAIRKHKFVDILDDPGSADLSAYV 399 Query: 277 DFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLMKQ--TARKDILLDSVKRLV 332 DF + A + ++G TQ +FL LGI R +L++ + + L RLV Sbjct: 400 DFASIKRSAEEASDDISVHGPMTQSQFLGSLGINFRVEALLQNCTEEQAESLRTGYWRLV 459 Query: 333 STS-----------ADKKSMGELFKILVVSHEKV-ELMPF 360 A MG + + + ++K +PF Sbjct: 460 GDGEAPFWEGPEDQAAPVGMGTRYLAMAIVNKKQGTPIPF 499 >gi|241568973|ref|XP_002402617.1| conserved hypothetical protein [Ixodes scapularis] gi|215500058|gb|EEC09552.1| conserved hypothetical protein [Ixodes scapularis] Length = 409 Score = 271 bits (694), Expect = 8e-71, Method: Composition-based stats. Identities = 124/394 (31%), Positives = 187/394 (47%), Gaps = 43/394 (10%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGE 61 E +L++++ + I G +TV +Y + +P GYY + FG+ GDF T+PEISQ+FGE Sbjct: 20 ETRLLQQLRSRILATGPITVAEYMKEVLTNPMSGYYMHRDVFGSSGDFTTSPEISQMFGE 79 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 ++A++ + W + G P + +VELGPGRG + D+LRV K D V+S+++VE S Sbjct: 80 LVAVWFLNEWVKAGKPKPLYIVELGPGRGTLSDDMLRVFSK-YSDAMEVVSLHLVEISPH 138 Query: 122 LTLIQ----------------------KKQLASYGDKINWYTSLADVPLGFTFLVANEFF 159 L+ +Q K+ + +G + WY L DVP GF+ VA+EF Sbjct: 139 LSQVQELKLCGTVSVVKDVLDHSPVTYKQSITKHGVPVGWYRHLHDVPRGFSCFVAHEFL 198 Query: 160 DSLPIKQFVMTEHGIRERMIDIDQ---HDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSP 216 D+LP+ +F T G RE ID+D L + + ++F E P Sbjct: 199 DALPVHKFQRTPEGWREVFIDLDDGPGPHHLRYVLSRGPTPASFF-ADVCKPRDHVEVCP 257 Query: 217 CRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHV 276 Q ++ R+ GG +V+DYG+ ++ DT +A K HT L PG ADL++ V Sbjct: 258 EAGVIAQELASRMHEHGGCGLVVDYGHDGTKT-DTFRAFKNHTLHPVLSEPGTADLTADV 316 Query: 277 DFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR---KDILLDSVKRLVS 333 DF L + K G TQ +FL+ +GI R L+ + LL L Sbjct: 317 DFSYLKR-ILAGKALTFGPVTQEQFLKNMGINIRLQKLLDNCQDANLRQELLSGYDMLT- 374 Query: 334 TSADKKSMGELFKILVVSH-------EKVELMPF 360 + + MGE FK V E F Sbjct: 375 ---NPEKMGERFKFFGVFPLDMQKPLETHPPAGF 405 >gi|89055825|ref|YP_511276.1| hypothetical protein Jann_3334 [Jannaschia sp. CCS1] gi|88865374|gb|ABD56251.1| protein of unknown function DUF185 [Jannaschia sp. CCS1] Length = 371 Score = 271 bits (692), Expect = 2e-70, Method: Composition-based stats. Identities = 127/354 (35%), Positives = 189/354 (53%), Gaps = 13/354 (3%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L +++ I + G +++ Y A C+ DP+FGYY+T +P G GDF+TAPEISQ+FGE++ Sbjct: 7 SLKARLLARIARLGPISLADYMAECLHDPQFGYYATRDPLGRGGDFITAPEISQMFGELI 66 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 ++L W G LVELGPGRG +M D++R + P F +++++VE S L Sbjct: 67 GLWLAQVWMDQGG-GAAALVELGPGRGTLMADVMRATRGV-PGFHDAVTVHLVEASPVLR 124 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 +Q + LA+Y ++ +LADVP G LVANEFFD+LPI+QF M++ G + Sbjct: 125 AMQTEALAAY--APRFHDNLADVPEGPILLVANEFFDALPIRQFQMSDAGDWQERQIGAS 182 Query: 184 HDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY 243 +L++ + + G I E + I R+ GG A+++DYG Sbjct: 183 DGALIWGLAPPAPLDVRDGFAPGMPGMIVETCAPAEAIAAEIGRRV-AQGGAALIVDYGD 241 Query: 244 LQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHV-DFQRLSSIAILYKLYINGLTTQGKFL 302 S GDT QA+ H Y PL PG+ADL++HV + + +GLT QG FL Sbjct: 242 WHS-AGDTFQALAKHAYTDPLDAPGEADLTAHVAFAPIARAARAASGVVASGLTRQGMFL 300 Query: 303 EGLGIWQRAFSLMK--QTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 E LGI RA +L + D + + +RL MG LFK+L ++ E Sbjct: 301 ERLGITARAQALARGLDGTALDTHIAAHRRLTHG----DEMGTLFKVLALTPEG 350 >gi|302383938|ref|YP_003819761.1| hypothetical protein Bresu_2831 [Brevundimonas subvibrioides ATCC 15264] gi|302194566|gb|ADL02138.1| protein of unknown function DUF185 [Brevundimonas subvibrioides ATCC 15264] Length = 357 Score = 270 bits (690), Expect = 2e-70, Method: Composition-based stats. Identities = 117/359 (32%), Positives = 169/359 (47%), Gaps = 7/359 (1%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L ++ I G MTV Y C+ DP GYY+T G GDF+TAP ISQ+FGE++ Sbjct: 2 ALKDRLAREIALTGPMTVADYVTRCLHDPTDGYYATRPALGEGGDFITAPLISQMFGELI 61 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 ++ + W++ G P RLVE+GPG G +M D LR + P F + ++E S L Sbjct: 62 GLWAVETWQRLGAPERFRLVEVGPGDGTLMDDALRAAR-VAPGFLEACDLILIEPSGPLR 120 Query: 124 LIQKKQLASYGDKINWYTSLADVP-LGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 +Q ++LA W SL + L+ANE D LP +QFV TE G ER + + Sbjct: 121 EVQARRLAQADVSPRWVRSLGQIDTDAPVILIANEVLDCLPARQFVRTEGGWAERRVGVT 180 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 L F + + G + E S + + ++ LA G A++IDYG Sbjct: 181 DGGDLTFGLVGI-TGGFERPGFEVEPGQVIEASEQQAAFGRDLAAMLAEASGAALLIDYG 239 Query: 243 YLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 + VGDTLQA++ H V L PG+AD++ DF + A+ + G QG FL Sbjct: 240 RDRPGVGDTLQALRRHAKVDVLATPGEADVTQWADFPAVLEAAVRAGADVTGCVGQGDFL 299 Query: 303 EGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVV-SHEKVELMPF 360 LGI RA L ++ R ++ MG LFK + S + + F Sbjct: 300 RRLGIEARAERLRTGRPEAAPVIG---RQLARLTGPDQMGALFKAAAIFSPRSLSVPGF 355 >gi|89070837|ref|ZP_01158082.1| hypothetical protein OG2516_14036 [Oceanicola granulosus HTCC2516] gi|89043575|gb|EAR49784.1| hypothetical protein OG2516_14036 [Oceanicola granulosus HTCC2516] Length = 362 Score = 270 bits (690), Expect = 3e-70, Method: Composition-based stats. Identities = 131/364 (35%), Positives = 189/364 (51%), Gaps = 19/364 (5%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L I I G +TV Y ALC+ PE G Y+ +P GA G F TAPEISQ+FGE+L Sbjct: 2 SLAEIIRRQIAMAGPLTVADYMALCLNHPEHGVYAGIDPLGAGGHFTTAPEISQMFGELL 61 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 + L AW G P+ L ELGPGRG +M D+LR + P F + +++VETS L Sbjct: 62 GLALAQAWLDQGAPAPFALAELGPGRGTLMADVLRAARGV-PGFAAAAELHLVETSPALR 120 Query: 124 LIQKKQL--ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 Q+ +L A++ D + FL+ANEFFD LP++QF+ G RER+I + Sbjct: 121 DAQRDRLGAATWHDTVATLPD-----DRPLFLLANEFFDVLPVRQFLRDGEGWRERVIAL 175 Query: 182 DQHDSLVFNIGD-HEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 D+H + F + ++ +D G + E+ P ++I R+A GG A+++D Sbjct: 176 DEHGAPTFGLTPAAPLERLADRLADTAEGDMVEHCPALAPVAEAIGARIAAQGGAALIVD 235 Query: 241 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 YG +GD+LQA++ H + PL PG ADL++HVDF L++ LT QG Sbjct: 236 YG-DWRPLGDSLQALRRHEKIDPLDAPGSADLTAHVDFAALAAARPAG---HTALTPQGV 291 Query: 301 FLEGLGIWQRAFSLMKQTA--RKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM 358 FLE LGI +RA L + L + +RL MG LFK++ + Sbjct: 292 FLERLGIAERARQLAAGLEGAALEAHLAAHRRLTH----PAEMGHLFKVMGLYPPNAAPP 347 Query: 359 PFVN 362 P ++ Sbjct: 348 PGLD 351 >gi|294011547|ref|YP_003545007.1| hypothetical protein SJA_C1-15610 [Sphingobium japonicum UT26S] gi|292674877|dbj|BAI96395.1| conserved hypothetical protein [Sphingobium japonicum UT26S] Length = 355 Score = 270 bits (689), Expect = 4e-70, Method: Composition-based stats. Identities = 120/365 (32%), Positives = 172/365 (47%), Gaps = 19/365 (5%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 +E L ++ I+ G +++ Y A YY T +P GA GDF TAPEISQ+FG Sbjct: 3 VELTLSERLARQIEAGGPISIAHYMAEANQH----YYGTRDPLGAAGDFTTAPEISQMFG 58 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 E++ + L W + G VELGPGRG + D LR + + ++ VETS Sbjct: 59 ELIGLCLADIWMRSGSRPAAHYVELGPGRGTLASDALRAMASVTFHP----RVHFVETSP 114 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 L Q + + + +SL + G +VANEFFD+LP +Q V RER++ Sbjct: 115 SLRERQGALIPNVAHH-DSVSSLPE--QGPLLVVANEFFDALPARQLVRVGSEWRERVVV 171 Query: 181 IDQHDSLVFNIGDHEIKSNFL----TCSDYFLGAIFENSPCRDREMQSISDRLACDGGTA 236 D + +D GAI E ++ R+A GG A Sbjct: 172 RPDPDQPDRFAPMAGYRRVESGIPAMAADAPEGAILEMPLAGSAIALELAHRIAKQGGAA 231 Query: 237 IVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLT 296 IVIDYGY GDTLQAV+ H Y P + PG++DL++HVDF + ++A L + Sbjct: 232 IVIDYGYEGPATGDTLQAVRAHRYADPFLEPGESDLTTHVDFTMIGNMARQAGLRVTRTV 291 Query: 297 TQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKV- 355 QG FL LGI RA L + T + +++ +R D +MG LFK + H Sbjct: 292 GQGAFLRQLGIDARADQLSRTTPARAEEVEAARR---RLTDDDAMGTLFKAMAWVHPDWA 348 Query: 356 ELMPF 360 + F Sbjct: 349 DPAGF 353 >gi|291242927|ref|XP_002741333.1| PREDICTED: protein midA homolog, mitochondrial-like [Saccoglossus kowalevskii] Length = 429 Score = 269 bits (688), Expect = 4e-70, Method: Composition-based stats. Identities = 128/385 (33%), Positives = 190/385 (49%), Gaps = 45/385 (11%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGE 61 +N L+ +I I NG ++V Y + P GYY + FG GDF+T+PEISQ+F E Sbjct: 38 KNLLLNQIKQTIHINGPISVADYMQTVLTSPLSGYYMKKDVFGVQGDFITSPEISQMFSE 97 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 ++ I+++ W G P +++VELGPGRG + DILRV K + +S+++VE S + Sbjct: 98 LIGIWIVHEWLISGKPKTLQVVELGPGRGTLSDDILRVFAKFQGIH-DAVSLHLVEVSPK 156 Query: 122 LTLIQKKQLASYGDK----------------------------INWYTSLADVPLG-FTF 152 L+ +Q+++L + I W+TS++D+P G T Sbjct: 157 LSQMQEEKLTGDTKQNPSTNNKDNEHAVLSGSYKTALSKTGIPITWHTSISDIPKGVPTC 216 Query: 153 LVANEFFDSLPIKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGA 210 +ANEFFD+LPI + T G RE +ID+ D L F + E S F+ Sbjct: 217 FIANEFFDALPIHKIQKTTKGWREILIDVANDSSDQLRFVLSPTET-----PASQLFIED 271 Query: 211 IFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQA 270 E ++ I R+ GG A+++DYG+ ++ DT +A K H L PG A Sbjct: 272 HVEVCTMGGVIVEEIVKRIDHSGGNALIVDYGHNGNKT-DTFRAFKEHELHDVLKEPGSA 330 Query: 271 DLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQT--ARKDILLDSV 328 DL++ VDF L L G TQ KFL +GI R +LMK+ ++ L+ Sbjct: 331 DLTADVDFSYLRKTIGDTAL-CYGPITQEKFLLNMGIEVRLEALMKKATKTQQKNLMSGY 389 Query: 329 KRLVSTSADKKSMGELFKILVVSHE 353 K LV D KSMGE FK + + Sbjct: 390 KMLV----DPKSMGERFKFFSILPK 410 >gi|255552842|ref|XP_002517464.1| conserved hypothetical protein [Ricinus communis] gi|223543475|gb|EEF45006.1| conserved hypothetical protein [Ricinus communis] Length = 490 Score = 269 bits (688), Expect = 4e-70, Method: Composition-based stats. Identities = 130/402 (32%), Positives = 198/402 (49%), Gaps = 45/402 (11%) Query: 2 ENKLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 E++L + + +IK + G +TV +Y + +P+ G+Y + FGA GDF+T+PE+SQ+FG Sbjct: 90 ESELFKHLKGIIKFRGGPITVAEYMEEVLTNPKAGFYINRDVFGAEGDFITSPEVSQMFG 149 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 EM+ ++ +C WEQ P V LVELGPGRG +M D+LR K K L I+MVE S Sbjct: 150 EMVGVWALCLWEQMEQPKSVNLVELGPGRGTLMADLLRGASKFKSFT-ESLHIHMVECSP 208 Query: 121 RLTLIQKKQLASYGDK------------------INWYTSLADVPLG-FTFLVANEFFDS 161 L +Q L D I+W+TSL VP G T ++A+EF+D+ Sbjct: 209 ALQKLQHHNLKCVDDNNSCGSGEERTISTLAGTPISWHTSLEQVPTGSPTIIIAHEFYDA 268 Query: 162 LPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIK--------SNFLTCSDYFLGAIFE 213 LP+ QF G E+M+D+ + F + + + E Sbjct: 269 LPVHQFQRASRGWCEKMVDVAEDSMFRFVLSPQPTPATLYLVKRCKWAAPEEIEKLNHIE 328 Query: 214 NSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLS 273 P +I+ R++ DGG A++IDYG L V D+LQA++ H +V L NPG ADLS Sbjct: 329 VCPKAIDLTCTIAKRISSDGGGALIIDYG-LNGVVSDSLQAIRKHKFVDILDNPGSADLS 387 Query: 274 SHVDFQRLSSIAILYK--LYINGLTTQGKFLEGLGIWQRAFSLMKQTA--RKDILLDSVK 329 ++VDF + A + ++G TQ +FL LGI R +L++ + + L Sbjct: 388 AYVDFASIRHSAEEASEAVSVHGPITQSQFLGSLGINFRVEALLQNCTEVQAEFLRTGYW 447 Query: 330 RLVSTSAD----------KKSMGELFKILVVSHEKVELM-PF 360 RLV MG + + + ++K + PF Sbjct: 448 RLVGEGEAPFWEGPEEQVPIGMGTRYLAMAIVNKKQGIPVPF 489 >gi|327262391|ref|XP_003216008.1| PREDICTED: protein midA homolog, mitochondrial-like [Anolis carolinensis] Length = 457 Score = 269 bits (687), Expect = 6e-70, Method: Composition-based stats. Identities = 122/373 (32%), Positives = 181/373 (48%), Gaps = 25/373 (6%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 +++ +V IK G +TV +Y + +P GYY + G GDF+T+PEISQIFGE+ Sbjct: 63 TPMLKHLVMKIKSTGPITVAEYMREVLTNPVKGYYIQHDMLGESGDFITSPEISQIFGEL 122 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV-LSIYMVETSER 121 + I+ + W +G P+ +LVELGPGRG + DI+RV +L +S+++VE S + Sbjct: 123 IGIWFVSEWIANGKPNKFQLVELGPGRGSLTSDIIRVFNQLNSLLHKCDISVHLVEISPK 182 Query: 122 LTLIQ------------------KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLP 163 L+ IQ + L G I WY L DVP GF F +A+EFFD+LP Sbjct: 183 LSEIQASVLTEGKIKLQESCLAYMQGLTKTGLPIFWYRDLNDVPGGFNFYLAHEFFDTLP 242 Query: 164 IKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDRE 221 I +F TE G RE ++DID D L F + + D E SP Sbjct: 243 IHKFQKTEKGWRELLVDIDPEAPDKLRFVLAPSATPAAEAFIHDKESRDHVEVSPDAGVT 302 Query: 222 MQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRL 281 +Q ++ +GG A++IDYG+ ++ DT + +GH L++PG ADL++ VDF L Sbjct: 303 VQKLAHNTEKNGGAALIIDYGHDGTKT-DTFRGFRGHKLHDVLISPGMADLTADVDFSYL 361 Query: 282 SSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSM 341 +A + G Q FL +GI R L++ + + R D M Sbjct: 362 RRMAQ--GVAALGPIKQQDFLRNMGIDIRLQVLLQNAKDANN-RKQLLRSYEMLMDLDKM 418 Query: 342 GELFKILVVSHEK 354 G F + Sbjct: 419 GGRFNFFAMLPSN 431 >gi|295687993|ref|YP_003591686.1| hypothetical protein Cseg_0556 [Caulobacter segnis ATCC 21756] gi|295429896|gb|ADG09068.1| protein of unknown function DUF185 [Caulobacter segnis ATCC 21756] Length = 383 Score = 268 bits (686), Expect = 6e-70, Method: Composition-based stats. Identities = 122/386 (31%), Positives = 184/386 (47%), Gaps = 33/386 (8%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L+ ++ I ++G + ++F C+ DP GYY+T GA GDF+TAP +SQ+FGE++ Sbjct: 2 SLLDRLKAQIAQDGPIGAPEFFTRCLHDPRDGYYATRPDLGASGDFITAPLVSQMFGELI 61 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 +++I W + G P+ RLVE+GPG G +M D+LR L PDF + +++VE S+ L Sbjct: 62 GLWVIETWTRMGRPAPFRLVEMGPGDGALMSDLLRAAR-LAPDFLAATDVWLVEVSQPLK 120 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 Q ++L + + LVANE D LP +QFV T+ G ER+I + + Sbjct: 121 ARQAERLGERPRWASRLDEVP--GGAPMILVANELLDCLPARQFVRTKDGWAERVIGLGE 178 Query: 184 HDSLVFNIGD---------------------------HEIKSNFLTCSDYFLGAIFENSP 216 L F + + D+ +GA+ E SP Sbjct: 179 DGDLAFGLRSLSPPPRGGGGRGATGGGSSSGPASSPSDASRHLPPQGEDFPVGAVVETSP 238 Query: 217 CRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHV 276 + I+ RL DGG A++IDYG + GDTLQAV+ H V PL G ADL+ Sbjct: 239 AQAALASEIAHRLVTDGGAALLIDYGRAEPEAGDTLQAVQNHQKVDPLKTAGLADLTVWA 298 Query: 277 DFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSA 336 DF + + A + TQG+FL LGI RA +L + K + R + Sbjct: 299 DFPSVVAAARDTGAKAGPILTQGQFLVALGILDRAEALAARQPEKTD---QIGRQLDRLL 355 Query: 337 DKKSMGELFKILVVSHEKVELMPFVN 362 + MG LFK+ + + F + Sbjct: 356 GEAQMGTLFKVACLCAPDLSPPLFED 381 >gi|146276315|ref|YP_001166474.1| hypothetical protein Rsph17025_0259 [Rhodobacter sphaeroides ATCC 17025] gi|145554556|gb|ABP69169.1| protein of unknown function DUF185 [Rhodobacter sphaeroides ATCC 17025] Length = 353 Score = 268 bits (686), Expect = 8e-70, Method: Composition-based stats. Identities = 142/359 (39%), Positives = 189/359 (52%), Gaps = 10/359 (2%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L + I G +TV Y A C+ PE GYYST PFGA GDF TAPEISQ+FGE+ Sbjct: 2 TALAGILARRIGATGPITVADYMAECLLHPEHGYYSTREPFGAAGDFTTAPEISQMFGEL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L + L AW G P+ L ELGPGRG +M D+LR + P F + + +VE S RL Sbjct: 62 LGLCLAQAWLDQGAPARFTLAELGPGRGTLMADVLRATRGV-PGFHAAAQVRLVEASPRL 120 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 +Q+++L ++ W AD+P FL+ANEFFD+LPI+QFV G RERMI +D Sbjct: 121 RTLQRQRLGNH--PAEWLDRAADLPEAPLFLLANEFFDALPIRQFVRGLSGWRERMIGLD 178 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 + + S D G I E P M I+ R+ GG A+V+DYG Sbjct: 179 GGRPAFGLGPETGLASLEHRLKDTQPGEIVELCPAAGPIMAEIARRIDGHGGLALVVDYG 238 Query: 243 YLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 +S +GDTLQA++ H + PL PG+ADL++HVDF+ L++ A TTQG L Sbjct: 239 GWRS-LGDTLQALRSHQFDDPLAAPGEADLTAHVDFEALATAARPC---ATAFTTQGALL 294 Query: 303 EGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM-PF 360 LG+ +RA L + + L S + D MG LFK L V + F Sbjct: 295 LRLGLAERAERLARALEGEA--LASHRAASHRLTDAAEMGTLFKALAVFPPQGPAPAGF 351 >gi|195453489|ref|XP_002073810.1| GK12946 [Drosophila willistoni] gi|194169895|gb|EDW84796.1| GK12946 [Drosophila willistoni] Length = 441 Score = 268 bits (685), Expect = 8e-70, Method: Composition-based stats. Identities = 115/396 (29%), Positives = 190/396 (47%), Gaps = 45/396 (11%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 + L ++ I G +TV Y + +P+ GYY + FG GDF+T+PEISQIFGE+ Sbjct: 52 SSLANQLKAKILATGPITVADYMREVLTNPQGGYYMKRDVFGREGDFITSPEISQIFGEL 111 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + I+L+ W + G PS + VELGPGRG + D+L+V+ K + S+++VE S L Sbjct: 112 VGIWLVNEWRKLGSPSPFQFVELGPGRGTLARDVLKVLTKF--KLGAEFSMHLVEISPYL 169 Query: 123 TLIQKKQLASYGD----------------------KINWYTSLADVPLGFTFLVANEFFD 160 + +Q ++ + K W+ L DVP GF+ ++A+E+FD Sbjct: 170 SKLQAQRFCYQHETLTEDAAAQLPHYQVGTTATGSKAFWHKRLEDVPEGFSLVLAHEYFD 229 Query: 161 SLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG------AIFEN 214 +LP + + E +ID+++ N + + + S F + E Sbjct: 230 ALPTHKLQLVNGKWHEVLIDVEEKPQNKENEFRYVLSKSQTPVSHVFRPLTEETRSCLEY 289 Query: 215 SPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSS 274 S +R++ ++ RL +GG ++++DYG+ + DT +A K H PL+ PG ADL++ Sbjct: 290 SLETERQVGLLAQRLEKNGGISLIMDYGHFGEKT-DTFRAFKQHALHDPLLEPGSADLTA 348 Query: 275 HVDFQRLSSIAILYK-LYINGLTTQGKFLEGLGIWQRAFSLMKQ--TARKDILLDSVKRL 331 VDF+ + + A +Y G QG FL+ + R L+ ++I+ + L Sbjct: 349 DVDFKLVKNTAESQGNIYCCGPIEQGDFLKRMQGDVRLEQLLAHALPENQEIIRSGYQML 408 Query: 332 VSTSADKKSMGELFKILVVSH-------EKVELMPF 360 D K MG FK L + EK + F Sbjct: 409 T----DPKQMGSRFKFLAMFPGILHEHLEKYPVAGF 440 >gi|57239058|ref|YP_180194.1| hypothetical protein Erum3300 [Ehrlichia ruminantium str. Welgevonden] gi|57161137|emb|CAH58050.1| conserved hypothetical protein [Ehrlichia ruminantium str. Welgevonden] Length = 339 Score = 268 bits (685), Expect = 1e-69, Method: Composition-based stats. Identities = 118/355 (33%), Positives = 190/355 (53%), Gaps = 19/355 (5%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 M + L + I G ++V+Q+ + + D GYY T PFG GDFVT+PEISQ+FG Sbjct: 1 MHSYLKKVI---FDNGGAISVEQFMRIALYDMNCGYYMTQMPFGVFGDFVTSPEISQLFG 57 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 E++A++++ WE+ G PS L+ELGPGRG ++ DI+RV+ K K +S + IY++E S Sbjct: 58 EVIALWVLLYWEKMGSPSKFVLLELGPGRGTLISDIIRVLKKFK-QCYSAVDIYLLEVSP 116 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 +L +Q L G+K+ W ++ +P ++ANEFFD+LPIKQF+ ER I Sbjct: 117 KLQEVQYNTLQDVGEKVLWCRNINSIPNYPILVIANEFFDALPIKQFICISDSWYERYIT 176 Query: 181 IDQHDSLVFNIGDHEIKSNFLTCS-DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVI 239 ++ + F + I NF + + I E ++ I ++ + G A++I Sbjct: 177 VEDNK---FRFINKLIDKNFQILNVNNINDPIIEVCDDAISIIKLIEHKILQNKGAAVII 233 Query: 240 DYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQG 299 DYGY+ T+Q+VK H Y + N G +D++ HVDF L + + TQ Sbjct: 234 DYGYIDPPYKSTMQSVKNHQYNNIFENVGNSDITVHVDFTALRK---SLSFLNSYIMTQR 290 Query: 300 KFLEGLGIWQRAFSLMKQTA--RKDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 FL GI +R L++ ++ L+ RL ++MG +FK+L+++ Sbjct: 291 DFLYNFGIRERLQILIENATEVQQQNLMTGFLRLT------ENMGSMFKVLLINP 339 >gi|294678327|ref|YP_003578942.1| hypothetical protein RCAP_rcc02806 [Rhodobacter capsulatus SB 1003] gi|294477147|gb|ADE86535.1| protein of unknown function DUF185 [Rhodobacter capsulatus SB 1003] Length = 356 Score = 268 bits (685), Expect = 1e-69, Method: Composition-based stats. Identities = 126/346 (36%), Positives = 191/346 (55%), Gaps = 10/346 (2%) Query: 16 NGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHG 75 G + +DQY A C+ PE GYY+T +PFG GDF+TAPEISQ+FGEML + L W G Sbjct: 15 EGPIGLDQYMAACLLHPEHGYYATRDPFGRAGDFITAPEISQMFGEMLGLCLAQVWLDQG 74 Query: 76 FPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGD 135 P+ L E+GPGRG ++ D+ RVI ++ P ++++E S L +Q++ LA++ Sbjct: 75 RPAPFILAEIGPGRGTLLADVTRVIARV-PGMADAARLHLIEASPTLRAVQRQTLAAH-- 131 Query: 136 KINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHE 195 ++W+ S+A +P FL+ANEFFD+LPI+QF+ TE G ER + + + Sbjct: 132 PVSWHDSVATLPEAPLFLLANEFFDALPIRQFLRTEAGWAERQVGLVGECLVPGLAPPTR 191 Query: 196 IKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAV 255 + D G + E P M I+ R+A GG A+VIDYG+ +S +GDT QAV Sbjct: 192 FAALEHRLVDTTPGDVVETCPAAAPIMGEIARRIATHGGVALVIDYGHWRS-LGDTFQAV 250 Query: 256 KGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIW--QRAFS 313 + H + P PG+ADL++HV F+ L+ A + +T QG LE LGI A + Sbjct: 251 RAHGFCDPFATPGEADLTAHVAFEPLAEAARAAGAQASAMTAQGVLLERLGITARAEALA 310 Query: 314 LMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMP 359 A ++ + + +RL + MG++F+ L + + P Sbjct: 311 ARLSGAAREAHVAAHRRLTH----PEEMGQVFQSLAIFPATAPVPP 352 >gi|195107411|ref|XP_001998307.1| GI23700 [Drosophila mojavensis] gi|193914901|gb|EDW13768.1| GI23700 [Drosophila mojavensis] Length = 436 Score = 268 bits (685), Expect = 1e-69, Method: Composition-based stats. Identities = 118/392 (30%), Positives = 193/392 (49%), Gaps = 43/392 (10%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L +++ I+ G +TV +Y + +P+ GYY + FG GDF+T+PEISQIFGE++ Sbjct: 52 NLAKQLAAKIQATGPITVAEYMREVLTNPQGGYYMNRDVFGREGDFITSPEISQIFGELV 111 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 I+L+ W++ G PS +LVELGPGRG + D+L+V+ K K + +++MVE S L+ Sbjct: 112 GIWLMNEWQKLGSPSPFQLVELGPGRGTLARDVLKVLSKFKSG--AQFTMHMVEISPYLS 169 Query: 124 LIQKKQLASYGD--------------------KINWYTSLADVPLGFTFLVANEFFDSLP 163 Q ++ + ++ W+ L DVP GF+ ++A+EFFD+LP Sbjct: 170 QAQAQRFCYKHEVLPEGEQLSHYQLGTTATGTQVFWHRRLEDVPAGFSLVLAHEFFDALP 229 Query: 164 IKQFVMTEHGIRERMIDIDQ-----HDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCR 218 + + + + +E +ID+D V + + F E S Sbjct: 230 VHKLQLIDGQWQEVLIDVDTKSTTADFRYVLSKAQTPVSQLFKPVQQ-EQRTCLEYSLEA 288 Query: 219 DREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDF 278 +R ++ +S+RL GG A+++DYG+ + DT +A K H PL+ PG ADL++ VDF Sbjct: 289 ERHVRLLSERLEQHGGIALIMDYGHFGDKT-DTFRAFKQHALHEPLLAPGTADLTADVDF 347 Query: 279 QRLSSIAILYK-LYINGLTTQGKFLEGLGIWQRAFSLMKQ--TARKDILLDSVKRLVSTS 335 + L +A ++ ++ G QG FL + R L+ + I+ + L Sbjct: 348 RHLKHVAEMHGDIHCCGPVQQGAFLRNMQGEVRLEQLLAHALPENQSIIRSGYEMLT--- 404 Query: 336 ADKKSMGELFKILVVSH-------EKVELMPF 360 D MG FK L + EK + F Sbjct: 405 -DPNQMGSRFKFLAMFPGVVAAHLEKYPVAGF 435 >gi|310815440|ref|YP_003963404.1| hypothetical protein EIO_0956 [Ketogulonicigenium vulgare Y25] gi|308754175|gb|ADO42104.1| conserved hypothetical protein [Ketogulonicigenium vulgare Y25] Length = 351 Score = 268 bits (684), Expect = 1e-69, Method: Composition-based stats. Identities = 129/348 (37%), Positives = 187/348 (53%), Gaps = 15/348 (4%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L +I +I + G M + Y +LC+ DPE GYY+T GA GDF+TAPE+SQ+FGE++ Sbjct: 2 SLAARIKRMIAQGGPMRLSDYMSLCLLDPEAGYYTTRTAIGAGGDFITAPEVSQVFGELI 61 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 + L AW G P L ELGPGRG +M DILR K+ P F + + +VE S + Sbjct: 62 GLALAQAWLDQGAPDPCILAELGPGRGTLMADILRATRKV-PGFHAAAQVVLVEASPLMR 120 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 +Q + + W S+ +P G FL+ANEF D+LPI+QF + G ER++ + Q Sbjct: 121 TLQAANVPA----ARWCDSVEALPAGPLFLIANEFLDALPIRQFQRSSDGWHERLVTV-Q 175 Query: 184 HDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY 243 +L +G + D +FE + + M+ I+ R+A GG AI IDYG Sbjct: 176 DGALTLGLGPQ------IALPDAPDADVFEQNTMAESVMRIIASRIAVAGGAAIFIDYGA 229 Query: 244 LQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLE 303 +SR GDT QAV+ H Y P +PG ADL++HV F L+ A + L TQ FL Sbjct: 230 DESR-GDTFQAVQNHAYADPFSDPGTADLTAHVAFGPLARAATNAGAAASALITQADFLL 288 Query: 304 GLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVS 351 LG+ R +L + + LD+ + ++ MG LFK+L ++ Sbjct: 289 TLGLSARGAALARHLSGAA--LDAHQAALNRLTHPGEMGTLFKVLGIT 334 >gi|170751782|ref|YP_001758042.1| hypothetical protein Mrad2831_5412 [Methylobacterium radiotolerans JCM 2831] gi|170658304|gb|ACB27359.1| protein of unknown function DUF185 [Methylobacterium radiotolerans JCM 2831] Length = 356 Score = 268 bits (684), Expect = 1e-69, Method: Composition-based stats. Identities = 133/361 (36%), Positives = 187/361 (51%), Gaps = 12/361 (3%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L +I LI++NG + VD+Y ALC+ P GYY T +P GA GDF TAPEISQ+FGE+ Sbjct: 2 TPLGAEIAALIRQNGPIGVDRYMALCLGHPVHGYYRTRDPLGAQGDFTTAPEISQMFGEL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L + G P + LVELGPGRG +M D LR + P ++ ++VETS L Sbjct: 62 LGAWTAYVRGSIGAPDPLLLVELGPGRGTLMADALRALRAALPGV--RVAPHLVETSPVL 119 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 Q + L+ W+ S+ +P G T ++ANEFFD LP++QF G ER I +D Sbjct: 120 RAAQARALSG--TGAVWHDSVDTLPEGPTIILANEFFDCLPVRQFERRPSGWHERQIGLD 177 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 L F + + +D GA+ +++++ RL GG +V+DYG Sbjct: 178 SAGGLAFGLSPEPVPGLV---ADGPDGALMSVPAAGLALIRALARRLVSGGGALLVVDYG 234 Query: 243 YLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 +++ GDTLQA+ GH + PL PG+ADL+ HVDF L+ A I+G QG FL Sbjct: 235 HVRPGFGDTLQALAGHRFADPLAEPGEADLTHHVDFAALAQAARAEGAAIHGPVDQGDFL 294 Query: 303 EGLGI--WQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKV-ELMP 359 LG+ A+ + +V RL + MG LFK+L VS + L Sbjct: 295 AALGLGARAERLRARANPAQVTAIDAAVTRLTDPG--RGGMGRLFKVLAVSGPSLGPLPG 352 Query: 360 F 360 F Sbjct: 353 F 353 >gi|170741474|ref|YP_001770129.1| hypothetical protein M446_3290 [Methylobacterium sp. 4-46] gi|168195748|gb|ACA17695.1| protein of unknown function DUF185 [Methylobacterium sp. 4-46] Length = 367 Score = 267 bits (683), Expect = 2e-69, Method: Composition-based stats. Identities = 134/360 (37%), Positives = 184/360 (51%), Gaps = 3/360 (0%) Query: 1 ME-NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIF 59 M L+ ++ LI +NG + V++Y ALC+ P GYY+T +P GA GDF TAPEISQIF Sbjct: 1 MSGTPLLAELRALIAQNGPIPVERYMALCLGHPLHGYYTTRDPLGAAGDFTTAPEISQIF 60 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 GE+L ++ W G PS R+VELGPGRG ++ D LR I P F L +++VETS Sbjct: 61 GELLGLWAAEVWHGMGRPSPCRVVELGPGRGTLIADALRAIRAALPPFAEALDLHLVETS 120 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMI 179 L Q +LA+ G + W+ + DVP G ++ANEFFD+LP++QF ER I Sbjct: 121 PVLRAAQAARLAAIGREAAWHARIEDVPEGPAIVLANEFFDALPVRQFARGAGAWHERRI 180 Query: 180 DIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVI 239 +D + D + + GA+ M++I+ RLA GG + I Sbjct: 181 GLDPEGGGLVVGLDPDPTPEIAAAA--PEGAVLTLPSAALAAMRAIAGRLARQGGALLAI 238 Query: 240 DYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQG 299 DYG + DTLQAV H V L PG+ DL+ VDF L+ A ++G Q Sbjct: 239 DYGEATLGLTDTLQAVSRHRAVGILDAPGETDLTVPVDFGALARAAREAGAALHGPVPQR 298 Query: 300 KFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMP 359 FL LG+ QRA L + + T MG LFK+L VS + +P Sbjct: 299 DFLLALGLAQRAERLAARATPDQARAVAEAAARLTDPAPTGMGRLFKVLGVSDAGMAGLP 358 >gi|45825109|gb|AAS77462.1| AT11512p [Drosophila melanogaster] Length = 437 Score = 267 bits (683), Expect = 2e-69, Method: Composition-based stats. Identities = 116/393 (29%), Positives = 193/393 (49%), Gaps = 43/393 (10%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L +++ I G + V +Y + +P+ GYY + FG GDF+T+PEISQIFGE++ Sbjct: 51 SLAKQLRAKILSTGPIPVAEYMREVLTNPQAGYYMNRDVFGREGDFITSPEISQIFGELV 110 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 I+L+ W + G PS +LVELGPGRG + D+L+V+ K + S++MVE S L+ Sbjct: 111 GIWLVSEWRKMGSPSPFQLVELGPGRGTLARDVLKVLTKF--KQDAEFSMHMVEVSPFLS 168 Query: 124 LIQKKQLASYG--------------------DKINWYTSLADVPLGFTFLVANEFFDSLP 163 Q ++ K W+ L DVP GF+ ++A+EFFD+LP Sbjct: 169 KAQAQRFCYSHQTLPEDAQLPHYQEGTTASGTKAFWHRRLEDVPQGFSLVLAHEFFDALP 228 Query: 164 IKQFVMTEHGIRERMIDIDQHD-----SLVFNIGDHEIKS-NFLTCSDYFLGAIFENSPC 217 + + + + +E +ID+ D S + + + + + E+S Sbjct: 229 VHKLQLVDGKWQEVLIDVASSDGAQEASFRYVLSRSQTPVSSLYRPPPGETRSCLEHSLE 288 Query: 218 RDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVD 277 +R++ +++R+ DGG A+++DYG+ + DT +A K H PLV PG+ADL++ VD Sbjct: 289 TERQVGLLAERIERDGGIALIMDYGHFGEKP-DTFRAFKQHELHDPLVEPGRADLTADVD 347 Query: 278 FQRLSSIAILYK-LYINGLTTQGKFLEGLGIWQRAFSLMKQ--TARKDILLDSVKRLVST 334 F+ + IA ++ G QG FL+ + R L+ ++I+ + L Sbjct: 348 FKLVRHIAETRGNVHCCGPVEQGLFLQRMQGEARLEQLLAHALPENQEIIRSGYEMLT-- 405 Query: 335 SADKKSMGELFKILVVSH-------EKVELMPF 360 D MG FK L + +K ++ F Sbjct: 406 --DPAQMGTRFKFLAMFPGVLAAHLDKYPVVGF 436 >gi|195571791|ref|XP_002103886.1| GD20670 [Drosophila simulans] gi|194199813|gb|EDX13389.1| GD20670 [Drosophila simulans] Length = 437 Score = 267 bits (683), Expect = 2e-69, Method: Composition-based stats. Identities = 117/393 (29%), Positives = 192/393 (48%), Gaps = 43/393 (10%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L +++ I G + V +Y + +P+ GYY + FG GDF+T+PEISQIFGE++ Sbjct: 51 SLAKQLRAKILSTGPIPVAEYMREVLTNPQAGYYMNRDVFGREGDFITSPEISQIFGELV 110 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 I+L+ W + G PS +LVELGPGRG + D+L+V+ K + S++MVE S L+ Sbjct: 111 GIWLVSEWRKMGSPSPFQLVELGPGRGTLARDVLKVLTKF--KQDAEFSMHMVEVSPFLS 168 Query: 124 LIQKKQLASYG--------------------DKINWYTSLADVPLGFTFLVANEFFDSLP 163 Q ++ K W+ L DVP GF+ ++A+EFFD+LP Sbjct: 169 KAQAQRFCYSHQTLPEDAQLPHYQEGTTASGTKAFWHHRLEDVPQGFSLVLAHEFFDALP 228 Query: 164 IKQFVMTEHGIRERMIDI-----DQHDSLVFNIGDHEIKS-NFLTCSDYFLGAIFENSPC 217 + + + + +E +ID+ Q S + + + + + E+S Sbjct: 229 VHKLQLVDGKWQEVLIDVASSDGAQEGSFRYVLSRSQTPVSSLYRPLPGETRSCLEHSLE 288 Query: 218 RDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVD 277 +R++ +++R+ DGG A+V+DYG+ + DT +A K H PLV PG ADL++ VD Sbjct: 289 TERQVGLLAERIERDGGIALVMDYGHFGEKT-DTFRAFKQHKLHDPLVEPGSADLTADVD 347 Query: 278 FQRLSSIAILYK-LYINGLTTQGKFLEGLGIWQRAFSLMKQ--TARKDILLDSVKRLVST 334 F+ + IA ++ G QG FL+ + R L+ ++I+ + L Sbjct: 348 FKLVRHIAETRGNVHCCGPVEQGLFLQRMQGEARLEQLLAHALPENQEIIRSGYEMLT-- 405 Query: 335 SADKKSMGELFKILVVSH-------EKVELMPF 360 D MG FK L + +K ++ F Sbjct: 406 --DPSQMGSRFKFLAMFPGVLATHLDKYPVVGF 436 >gi|194743862|ref|XP_001954419.1| GF16742 [Drosophila ananassae] gi|190627456|gb|EDV42980.1| GF16742 [Drosophila ananassae] Length = 440 Score = 267 bits (683), Expect = 2e-69, Method: Composition-based stats. Identities = 116/392 (29%), Positives = 192/392 (48%), Gaps = 42/392 (10%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L +++ I G ++V +Y + +P+ GYY + FG GDF+T+PEISQIFGE++ Sbjct: 55 SLAKQLRAKILATGPISVAEYMREVLTNPQAGYYMARDVFGREGDFITSPEISQIFGELV 114 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 ++L+ W + G PS +LVELGPGRG + D+L+V+ K + LSI+MVE S L+ Sbjct: 115 GVWLVSEWRKMGSPSPFQLVELGPGRGTLARDVLKVLSKF--KLGAELSIHMVEVSPFLS 172 Query: 124 LIQKKQLASYGD--------------------KINWYTSLADVPLGFTFLVANEFFDSLP 163 IQ ++ + K W+ L DVP GF+ ++A+EFFD+LP Sbjct: 173 KIQAQRFCYTHETLPEDSQLPHYQTGTTASGTKAFWHRRLEDVPQGFSLILAHEFFDALP 232 Query: 164 IKQFVMTEHGIRERMIDIDQHDS----LVFNIGDHEIKSN-FLTCSDYFLGAIFENSPCR 218 + + + +E +ID+ + + + + + + E S Sbjct: 233 VHKLQWLDGQWQEVLIDVASKEEDQPGFRYVLSRSQTPVSRVFQPIPGEKRSCLEYSLET 292 Query: 219 DREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDF 278 +R++ +++R+ DGG A+++DYG+ + DT +A K H PLV+PG ADL++ VDF Sbjct: 293 ERQVGLLAERIERDGGIALIMDYGHFGEKT-DTFRAFKNHALHDPLVDPGSADLTADVDF 351 Query: 279 QRLSSIAILYK-LYINGLTTQGKFLEGLGIWQRAFSLMKQ--TARKDILLDSVKRLVSTS 335 + + A ++ G QG FL+ + R L+ + I+ + L Sbjct: 352 KLIRHAAEKRGSIHCCGPVEQGLFLQRMQGEARLEQLLAHALPENQQIIRSGYQMLT--- 408 Query: 336 ADKKSMGELFKILVVSH-------EKVELMPF 360 D MG FK L + EK ++ F Sbjct: 409 -DAAQMGTRFKFLAMFPGVVAPHLEKFPVVGF 439 >gi|99034193|ref|ZP_01314271.1| hypothetical protein Wendoof_01000933 [Wolbachia endosymbiont of Drosophila willistoni TSC#14030-0811.24] Length = 348 Score = 266 bits (681), Expect = 3e-69, Method: Composition-based stats. Identities = 116/360 (32%), Positives = 191/360 (53%), Gaps = 26/360 (7%) Query: 5 LIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 ++ I LI K G +++ + + ++GYY++ P G GDF TAPEISQ+FGE++ Sbjct: 1 MLTYIHELIDKSQGSISISDFMNAVLYHEKYGYYTSKLPLGKDGDFTTAPEISQLFGEVI 60 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 A++++ WE+ G PS LVELGPG+G ++ DI+RV K FF+ + I++VE S L Sbjct: 61 AVWIMHTWEKLGKPSKFSLVELGPGKGTLIHDIIRVTKK-YSSFFNSMLIHLVEISPTLR 119 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 IQK++L S +NW+ ++ ++P T +ANEFFD+LPI QFV + G E M+ Sbjct: 120 KIQKEKLKSL--DVNWHKNIDNLPEQPTIFLANEFFDALPIDQFVYHDEGWYENMVTKQD 177 Query: 184 HD-----------SLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACD 232 + + +T +F GA+ E ++ + ++ + Sbjct: 178 DGSLLVSCQCVTLESRKKESWIPVSATQMTNGKFFNGAVVEICSVGVEILKKLEKKIYNN 237 Query: 233 GGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYI 292 G A+++DYGY+ TLQ++K H Y + L N G +D+++ V+FQ L Sbjct: 238 KGAALIVDYGYVYPAYKSTLQSIKQHKYANFLENVGNSDITALVNFQALRDPLKHVDCE- 296 Query: 293 NGLTTQGKFLEGLGIWQRAFSLMKQTA--RKDILLDSVKRLVSTSADKKSMGELFKILVV 350 + TQ +FL GI +R +LMK + +K+ + RL ++MG LFK +++ Sbjct: 297 --ILTQREFLYLFGIKERTQALMKSASDEQKNRIFSEFLRLT------ENMGTLFKAMLL 348 >gi|195329908|ref|XP_002031652.1| GM23928 [Drosophila sechellia] gi|194120595|gb|EDW42638.1| GM23928 [Drosophila sechellia] Length = 437 Score = 266 bits (681), Expect = 3e-69, Method: Composition-based stats. Identities = 116/393 (29%), Positives = 191/393 (48%), Gaps = 43/393 (10%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L +++ I G + V +Y + +P+ GYY + FG GDF+T+PEISQIFGE++ Sbjct: 51 SLAKQLRAKILSTGPIPVAEYMREVLTNPQAGYYMNRDVFGREGDFITSPEISQIFGELV 110 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 I+L+ W + G PS +LVELGPGRG + D+L+V+ K + S++MVE S L+ Sbjct: 111 GIWLVSEWRKMGSPSPFQLVELGPGRGTLARDVLKVLTKF--KQDAEFSMHMVEVSPFLS 168 Query: 124 LIQKKQLASYG--------------------DKINWYTSLADVPLGFTFLVANEFFDSLP 163 Q ++ K W+ L DVP GF+ ++A+EFFD+LP Sbjct: 169 KAQAQRFCYSHQTLPEDAQLPHYQEGTTASGTKAFWHRRLEDVPQGFSLVLAHEFFDALP 228 Query: 164 IKQFVMTEHGIRERMIDI-----DQHDSLVFNIGDHEIKS-NFLTCSDYFLGAIFENSPC 217 + + + + +E +ID+ Q S + + + + + E+S Sbjct: 229 VHKLQLVDGKWQEVLIDVASSDGAQEGSFRYVLSRSQTPVSSLYRPLPGETRSCLEHSLE 288 Query: 218 RDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVD 277 +R++ +++R+ DGG +V+DYG+ + DT +A K H PLV PG ADL++ VD Sbjct: 289 TERQVGLLAERIERDGGITLVMDYGHFGEKT-DTFRAFKQHKLHDPLVEPGSADLTADVD 347 Query: 278 FQRLSSIAILYK-LYINGLTTQGKFLEGLGIWQRAFSLMKQ--TARKDILLDSVKRLVST 334 F+ + IA ++ G QG FL+ + R L+ ++I+ + L Sbjct: 348 FKLVRHIAETRGNVHCCGPVEQGLFLQRMQGEARLEQLLAHALPENQEIIRSGYEMLT-- 405 Query: 335 SADKKSMGELFKILVVSH-------EKVELMPF 360 D MG FK L + +K ++ F Sbjct: 406 --DPAQMGSRFKFLAMFPGVLATHLDKYPVVGF 436 >gi|103485980|ref|YP_615541.1| hypothetical protein Sala_0487 [Sphingopyxis alaskensis RB2256] gi|98976057|gb|ABF52208.1| protein of unknown function DUF185 [Sphingopyxis alaskensis RB2256] Length = 356 Score = 266 bits (681), Expect = 3e-69, Method: Composition-based stats. Identities = 123/355 (34%), Positives = 171/355 (48%), Gaps = 15/355 (4%) Query: 7 RKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIF 66 R+++N I +TV Y A A YY+T +P GA GDF TAPEISQ+FGEM+ I+ Sbjct: 11 RQLINDIAAARPVTVADYMAAANAH----YYATRDPLGAAGDFTTAPEISQMFGEMVGIW 66 Query: 67 LICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQ 126 + W + G P R VELGPGRG + D LR + + + + ++VETS L Q Sbjct: 67 IADLWTRAGNP-AFRYVELGPGRGTLAADALRTMARFGCEPVGI---HLVETSPALRAAQ 122 Query: 127 KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDS 186 +L + A +VANEFFD+LPI Q+V T G RERM+ Sbjct: 123 LARLPAAQHH---DEVDALPGDAPLLIVANEFFDALPIHQYVRTADGWRERMVGRAGDAR 179 Query: 187 LVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS 246 + ++ G I E P MQ + RL+ GG + IDYGY Sbjct: 180 MAVAGDVSADEAIPAALRGAAEGTIVETMPVAAAIMQRCAFRLSRQGGAMLAIDYGYTGP 239 Query: 247 RVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 GDTLQAVK H + P +PG+ADL++HVDF L+ A + + G + QG +L +G Sbjct: 240 AAGDTLQAVKAHGFADPFADPGEADLTAHVDFAALADAARSGGVAVAGPSPQGAWLRRMG 299 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE-LMPF 360 I R SL+ + L + + +MGELFK + + F Sbjct: 300 IDARLASLVAAAPARADELQGRR---DRLVNADAMGELFKAIAFTAPNWPTPAGF 351 >gi|163852302|ref|YP_001640345.1| hypothetical protein Mext_2884 [Methylobacterium extorquens PA1] gi|163663907|gb|ABY31274.1| protein of unknown function DUF185 [Methylobacterium extorquens PA1] Length = 361 Score = 266 bits (680), Expect = 3e-69, Method: Composition-based stats. Identities = 130/361 (36%), Positives = 186/361 (51%), Gaps = 15/361 (4%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L+ + I+ +G + +D+Y A C+ P GYY+T +PFG GDFVTAPEISQ+FGE+ Sbjct: 6 TPLLAILAREIRASGPLGLDRYMAFCLGHPLHGYYATRDPFGRGGDFVTAPEISQMFGEL 65 Query: 63 LAIFLICAWEQHGFP-SCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 + + LVELGPGRG +M D LR + DF +++VETS Sbjct: 66 VGAWAAAVLAMMPATGVRPCLVELGPGRGTLMADALRALRAAGSDF----ELHLVETSPV 121 Query: 122 LTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 L +Q +LA ++ S+A +P +VANEFFD+LP +QFV TE G ER + + Sbjct: 122 LRRLQSARLAD--AAPTFHDSVASLPDAPLLIVANEFFDALPARQFVRTELGWCERRVGL 179 Query: 182 DQH-DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 D+L F + ++ GA+ M+ ++ RL GG + ID Sbjct: 180 APEGDALAFGLDPEPDP---RLTAEAPAGAVLTLPSQGLAVMRDLARRLVARGGALLAID 236 Query: 241 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 YG+ + GDT QAV GH + PL PG+ADL+ HVDF L+ A ++G Q Sbjct: 237 YGHDRPGFGDTFQAVAGHRFADPLARPGEADLTLHVDFGALARAAAAEGAALHGPVMQRD 296 Query: 301 FLEGLGIWQRAFSLMKQ--TARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM 358 FL GLG+ RA L + + + +V RL D + MG LFK+L SH + + Sbjct: 297 FLLGLGLAMRAERLKARATPDQAQAIDAAVLRLTDP--DPRGMGALFKVLCASHPALGPL 354 Query: 359 P 359 P Sbjct: 355 P 355 >gi|311252815|ref|XP_003125280.1| PREDICTED: protein midA homolog, mitochondrial-like isoform 2 [Sus scrofa] gi|311252817|ref|XP_003125279.1| PREDICTED: protein midA homolog, mitochondrial-like isoform 1 [Sus scrofa] Length = 441 Score = 266 bits (680), Expect = 3e-69, Method: Composition-based stats. Identities = 125/378 (33%), Positives = 189/378 (50%), Gaps = 33/378 (8%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P GYY T + G GDF+T+PEISQIFGE+ Sbjct: 41 TPMLRHLIYKIKSTGPITVAEYMREVLTNPAKGYYVTHDMLGEKGDFITSPEISQIFGEL 100 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV-LSIYMVETSER 121 L I+ I W G + +LVELGPGRG + DILRV +L + +SI++VE S++ Sbjct: 101 LGIWFISEWIATGKNAAFQLVELGPGRGTLSGDILRVFSQLGSVLKNCDISIHLVEVSQK 160 Query: 122 LTLIQ--------------------KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDS 161 L+ IQ K + G I+WY L DVP G++F +A+EFFD Sbjct: 161 LSEIQALTLTEEKVPLEREAGSPVYMKGVTKSGIPISWYRDLQDVPKGYSFYLAHEFFDV 220 Query: 162 LPIKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRD 219 LP+ +F T G RE +IDID D L F + + D E P Sbjct: 221 LPVHKFQKTPQGWREVLIDIDPQVSDKLRFVLAPCATPAEAFIQKD-ETRDHVEVCPDAG 279 Query: 220 REMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 +Q ++ R++ GG A++ DYG+ ++ DT + GH L+ PG ADL++ VDF Sbjct: 280 VIIQELAQRISLTGGAALIADYGHDGTKT-DTFRGFCGHQLHDVLIAPGTADLTADVDFS 338 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR---KDILLDSVKRLVSTSA 336 L + K+ G +Q FL +GI R L+ ++ ++ LL + L+ Sbjct: 339 YLRRM-SQGKVASLGPISQQTFLRNMGIDVRLKILLDKSDEPSLREQLLQGYRMLM---- 393 Query: 337 DKKSMGELFKILVVSHEK 354 + + MGE F + + Sbjct: 394 NPEKMGERFNFFALVPHQ 411 >gi|16124746|ref|NP_419310.1| hypothetical protein CC_0491 [Caulobacter crescentus CB15] gi|221233461|ref|YP_002515897.1| cytosolic protein [Caulobacter crescentus NA1000] gi|13421668|gb|AAK22478.1| conserved hypothetical protein [Caulobacter crescentus CB15] gi|220962633|gb|ACL93989.1| conserved hypothetical cytosolic protein [Caulobacter crescentus NA1000] Length = 374 Score = 266 bits (680), Expect = 3e-69, Method: Composition-based stats. Identities = 122/378 (32%), Positives = 185/378 (48%), Gaps = 26/378 (6%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L+ ++ I ++G + V ++F C+ DP GYY+T GA GDF+TAP +SQ+FGE++ Sbjct: 2 SLLDRLKAQIAQDGPIGVPEFFTRCLHDPRDGYYATRPDLGAGGDFITAPLVSQMFGELI 61 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 ++++ W + G P+ RLVE+GPG G +M D+LR +L P F +++VE SE L Sbjct: 62 GLWVLETWTRMGRPAPFRLVEMGPGDGTLMSDLLRA-GRLDPAFLEAAQVWLVEVSEPLK 120 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 Q +L + + LVANE D LP +QF+ T G ER+I + + Sbjct: 121 ARQAARLGEGPRWASRLDEVP--GGAPMILVANELLDCLPARQFIRTRTGWAERVIGLGE 178 Query: 184 HDSLVFNIGDH------------------EIKSNFLTCSDYFLGAIFENSPCRDREMQSI 225 +L F + + GA+ E+SP + I Sbjct: 179 GGALAFGLRAINPPPRGRGPVGHAPSPSGPSDHLPRWGEELEAGAVVESSPAQAALASDI 238 Query: 226 SDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIA 285 + RL DGG A++IDYG + GDTLQA++ H V PL G ADL+ DF + + A Sbjct: 239 AHRLVIDGGAALLIDYGRAELEPGDTLQAIQNHRKVDPLETAGLADLTVWADFPSVITAA 298 Query: 286 ILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARK-DILLDSVKRLVSTSADKKSMGEL 344 + TQG FL LGI QRA +L + + D + + RL+ + MGEL Sbjct: 299 RDTGAKAGPILTQGAFLVALGIIQRAEALAARQPERGDQIARQLDRLI----GEAQMGEL 354 Query: 345 FKILVVSHEKVELMPFVN 362 FK+ + + F + Sbjct: 355 FKVACLCAPDLSPPLFED 372 >gi|42520561|ref|NP_966476.1| hypothetical protein WD0717 [Wolbachia endosymbiont of Drosophila melanogaster] gi|42410300|gb|AAS14410.1| conserved hypothetical protein [Wolbachia endosymbiont of Drosophila melanogaster] Length = 349 Score = 266 bits (680), Expect = 3e-69, Method: Composition-based stats. Identities = 116/360 (32%), Positives = 191/360 (53%), Gaps = 26/360 (7%) Query: 5 LIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 ++ I LI K G +++ + + ++GYY++ P G GDF TAPEISQ+FGE++ Sbjct: 1 MLTYIHELIDKSQGSISISDFMNAVLYHEKYGYYTSKLPLGKDGDFTTAPEISQLFGEVI 60 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 A++++ WE+ G PS LVELGPG+G ++ DI+RV K FF+ + I++VE S L Sbjct: 61 AVWIMHTWEKLGKPSKFSLVELGPGKGTLIHDIIRVTKK-YSSFFNSMLIHLVEISPTLR 119 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 IQK++L S +NW+ ++ ++P T +ANEFFD+LPI QFV + G E M+ Sbjct: 120 KIQKEKLKSL--DVNWHKNIDNLPEQPTIFLANEFFDALPIDQFVYHDEGWYENMVTKQD 177 Query: 184 HD-----------SLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACD 232 + + +T +F GA+ E ++ + ++ + Sbjct: 178 DGSLLVSCQCVTLESRKKESWIPVSATQMTNGKFFNGAVVEICSVGVEILKKLEKKIYNN 237 Query: 233 GGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYI 292 G A+++DYGY+ TLQ++K H Y + L N G +D+++ V+FQ L Sbjct: 238 KGAALIVDYGYVYPAYKSTLQSIKQHKYANFLENVGNSDITALVNFQALRDPLKHVDCE- 296 Query: 293 NGLTTQGKFLEGLGIWQRAFSLMKQTA--RKDILLDSVKRLVSTSADKKSMGELFKILVV 350 + TQ +FL GI +R +LMK + +K+ + RL ++MG LFK +++ Sbjct: 297 --ILTQREFLYLFGIKERTQALMKSASDEQKNRIFSEFLRLT------ENMGTLFKAMLL 348 >gi|58699083|ref|ZP_00373918.1| Uncharacterized ACR, COG1565 superfamily [Wolbachia endosymbiont of Drosophila ananassae] gi|225630423|ref|YP_002727214.1| hypothetical protein WRi_006580 [Wolbachia sp. wRi] gi|58534395|gb|EAL58559.1| Uncharacterized ACR, COG1565 superfamily [Wolbachia endosymbiont of Drosophila ananassae] gi|225592404|gb|ACN95423.1| hypothetical protein WRi_006580 [Wolbachia sp. wRi] Length = 349 Score = 266 bits (680), Expect = 4e-69, Method: Composition-based stats. Identities = 116/360 (32%), Positives = 191/360 (53%), Gaps = 26/360 (7%) Query: 5 LIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 ++ I LI K G +++ + + ++GYY++ P G GDF TAPEISQ+FGE++ Sbjct: 1 MLTYIHELIDKSQGSISISDFMNAVLYHEKYGYYTSKLPLGKDGDFTTAPEISQLFGEVI 60 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 A++++ WE+ G PS LVELGPG+G ++ DI+RV K FF+ + I++VE S L Sbjct: 61 AVWIMHTWEKLGKPSKFSLVELGPGKGTLIHDIIRVTKK-YSSFFNSMLIHLVEISPTLR 119 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 IQK++L S +NW+ ++ ++P T +ANEFFD+LPI QFV + G E M+ Sbjct: 120 KIQKEKLKSL--DVNWHKNIDNLPEQPTIFLANEFFDALPIDQFVYHDEGWYENMVTKQD 177 Query: 184 HD-----------SLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACD 232 + + +T +F GA+ E ++ + ++ + Sbjct: 178 DGSLLVSCQCVTLESRKKESWIPVSATQMTNGKFFNGAVVEICSVGVEILKKLEKKIYNN 237 Query: 233 GGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYI 292 G A+++DYGY+ TLQ++K H Y + L N G +D+++ V+FQ L Sbjct: 238 KGAALIVDYGYVYPAYKSTLQSIKQHKYANFLENVGNSDITALVNFQALRDSLKHVDCE- 296 Query: 293 NGLTTQGKFLEGLGIWQRAFSLMKQTA--RKDILLDSVKRLVSTSADKKSMGELFKILVV 350 + TQ +FL GI +R +LMK + +K+ + RL ++MG LFK +++ Sbjct: 297 --ILTQREFLYLFGIKERTQALMKSASDEQKNRIFSEFLRLT------ENMGTLFKAMLL 348 >gi|16768500|gb|AAL28469.1| GM06493p [Drosophila melanogaster] Length = 406 Score = 266 bits (679), Expect = 4e-69, Method: Composition-based stats. Identities = 117/393 (29%), Positives = 192/393 (48%), Gaps = 43/393 (10%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L +++ I G + V +Y + +P+ GYY + FG GDF+T+PEISQIFGE++ Sbjct: 20 SLAKQLRAKILSTGPIPVAEYMREVLTNPQAGYYMNRDVFGREGDFITSPEISQIFGELV 79 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 I+L+ W + G PS +LVELGPGRG + D+L+V+ K + S++MVE S L+ Sbjct: 80 GIWLVSEWRKMGSPSPFQLVELGPGRGTLARDVLKVLTKF--KQDAEFSMHMVEVSPFLS 137 Query: 124 LIQKKQLASYG--------------------DKINWYTSLADVPLGFTFLVANEFFDSLP 163 Q ++ K W+ L DVP GF+ ++A+EFFD+LP Sbjct: 138 KAQAQRFCYSHQTLPEDAQLPHYQEGTTASGTKAFWHRRLEDVPQGFSLVLAHEFFDALP 197 Query: 164 IKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG------AIFENSPC 217 + + + + +E +ID+ D + + + S + + E+S Sbjct: 198 VHKLQLVDGKWQEVLIDVASSDGAQEASFRYVLSRSQTPVSSLYRPLPGETRSCLEHSLE 257 Query: 218 RDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVD 277 +R++ +++R+ DGG A+++DYG+ + DT +A K H PLV PG ADL++ VD Sbjct: 258 TERQVGLLAERIERDGGIALIMDYGHFGEKT-DTFRAFKQHKLHDPLVEPGSADLTADVD 316 Query: 278 FQRLSSIAILYK-LYINGLTTQGKFLEGLGIWQRAFSLMKQ--TARKDILLDSVKRLVST 334 F+ + IA ++ G QG FL+ + R L+ K+I+ + L Sbjct: 317 FKLVRHIAETRGNVHCCGPVEQGLFLQRMQGEARLEQLLAHALPENKEIIRSGYEMLT-- 374 Query: 335 SADKKSMGELFKILVVSH-------EKVELMPF 360 D MG FK L + +K ++ F Sbjct: 375 --DPAQMGTRFKFLAMFPGVLAAHLDKYPVVGF 405 >gi|24645885|ref|NP_650054.2| CG17726 [Drosophila melanogaster] gi|74868997|sp|Q9VGR2|MIDA_DROME RecName: Full=Protein midA homolog, mitochondrial gi|7299424|gb|AAF54614.1| CG17726 [Drosophila melanogaster] gi|202027956|gb|ACH95262.1| FI02863p [Drosophila melanogaster] Length = 437 Score = 266 bits (679), Expect = 5e-69, Method: Composition-based stats. Identities = 116/393 (29%), Positives = 192/393 (48%), Gaps = 43/393 (10%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L +++ I G + V +Y + +P+ GYY + FG GDF+T+PEISQIFGE++ Sbjct: 51 SLAKQLRAKILSTGPIPVAEYMREVLTNPQAGYYMNRDVFGREGDFITSPEISQIFGELV 110 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 I+L+ W + G PS +LVELGPGRG + D+L+V+ K + S++MVE S L+ Sbjct: 111 GIWLVSEWRKMGSPSPFQLVELGPGRGTLARDVLKVLTKF--KQDAEFSMHMVEVSPFLS 168 Query: 124 LIQKKQLASYG--------------------DKINWYTSLADVPLGFTFLVANEFFDSLP 163 Q ++ K W+ L DVP GF+ ++A+EFFD+LP Sbjct: 169 KAQAQRFCYSHQTLPEDAQLPHYQEGTTASGTKAFWHRRLEDVPQGFSLVLAHEFFDALP 228 Query: 164 IKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG------AIFENSPC 217 + + + + +E +ID+ D + + + S + + E+S Sbjct: 229 VHKLQLVDGKWQEVLIDVASSDGAQEASFRYVLSRSQTPVSSLYRPLPGETRSCLEHSLE 288 Query: 218 RDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVD 277 +R++ +++R+ DGG A+++DYG+ + DT +A K H PLV PG ADL++ VD Sbjct: 289 TERQVGLLAERIERDGGIALIMDYGHFGEKT-DTFRAFKQHKLHDPLVEPGSADLTADVD 347 Query: 278 FQRLSSIAILYK-LYINGLTTQGKFLEGLGIWQRAFSLMKQ--TARKDILLDSVKRLVST 334 F+ + IA ++ G QG FL+ + R L+ ++I+ + L Sbjct: 348 FKLVRHIAETRGNVHCCGPVEQGLFLQRMQGEARLEQLLAHALPENQEIIRSGYEMLT-- 405 Query: 335 SADKKSMGELFKILVVSH-------EKVELMPF 360 D MG FK L + +K ++ F Sbjct: 406 --DPAQMGTRFKFLAMFPGVLAAHLDKYPVVGF 436 >gi|188582252|ref|YP_001925697.1| hypothetical protein Mpop_3007 [Methylobacterium populi BJ001] gi|179345750|gb|ACB81162.1| protein of unknown function DUF185 [Methylobacterium populi BJ001] Length = 362 Score = 266 bits (679), Expect = 5e-69, Method: Composition-based stats. Identities = 129/356 (36%), Positives = 187/356 (52%), Gaps = 18/356 (5%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L+ + I+ +G + +D+Y ALC+ P GYY+T +PFG GDFVTAPEISQ+FGE+ Sbjct: 6 TPLLAILAREIRASGPIGLDRYMALCLGHPRHGYYATRDPFGRGGDFVTAPEISQMFGEL 65 Query: 63 LAIF---LICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 + + ++ + + LVELGPGRG +M D LR + F +++VETS Sbjct: 66 IGAWAGAVLATMQAASPAARPCLVELGPGRGTLMADALRALRAAGAAF----DLHLVETS 121 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMI 179 L +Q ++LA+ + SL D ++ANEFFD+LP +QFV T HG ER + Sbjct: 122 PVLRRLQAERLAAAPVFHDSVESLPD---APLLVIANEFFDALPARQFVRTGHGWCERRV 178 Query: 180 DIDQH-DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 + D+L F + ++ GA+ M+ ++ RLA GG + Sbjct: 179 GLTPEGDALAFGLDPEPDP---RLAAEAPEGAVLTVPRQGLAVMRDLARRLAARGGALLA 235 Query: 239 IDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQ 298 IDYG+ + GDT QA+ GH + PL PG+ADL+ HVDF L+ A ++G TQ Sbjct: 236 IDYGHDRPGFGDTFQALVGHRFADPLSRPGEADLTLHVDFGALARAASAEGAAVHGPATQ 295 Query: 299 GKFLEGLGIWQRAFSLM--KQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 FL GLG+ RA L + + + RL D + MG LFK+L VSH Sbjct: 296 RDFLLGLGLLTRAERLKVRATPDQAAAIDAAAARLTDP--DPRGMGALFKVLGVSH 349 >gi|332643958|gb|AEE77479.1| uncharacterized protein [Arabidopsis thaliana] Length = 471 Score = 266 bits (679), Expect = 5e-69, Method: Composition-based stats. Identities = 119/381 (31%), Positives = 196/381 (51%), Gaps = 40/381 (10%) Query: 2 ENKLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 +++L++ + ++IK + G ++V +Y + +P+ G+Y + FGA GDF+T+PE+SQ+FG Sbjct: 75 DSELVKHLKSIIKFRGGPISVAEYMEEVLTNPKAGFYMNRDVFGAQGDFITSPEVSQMFG 134 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 EM+ ++ +C WEQ G P V LVELGPGRG +M D+LR K K L I++VE S Sbjct: 135 EMIGVWTVCLWEQMGRPERVNLVELGPGRGTLMADLLRGTSKFKNFT-ESLHIHLVECSP 193 Query: 121 RLTLIQKKQLASYGD--------------KINWYTSLADVPLG-FTFLVANEFFDSLPIK 165 L +Q + L + ++W+ +L +VP G T ++A+EF+D+LP+ Sbjct: 194 ALQKLQHQNLKCTDESSSEKKAVSSLAGTPVHWHATLQEVPSGVPTLIIAHEFYDALPVH 253 Query: 166 QFVMTEHGIRERMIDIDQHDSLVFNIGDHEIK--------SNFLTCSDYFLGAIFENSPC 217 QF + G E+M+D+ + F + + T + E SP Sbjct: 254 QFQKSTRGWCEKMVDVGEDSKFRFVLSPQPTPAALYLMKRCTWATPEEREKMEHVEISPK 313 Query: 218 RDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVD 277 Q ++ R+ DGG A++IDYG + + D+LQA++ H +V+ L +PG ADLS++VD Sbjct: 314 SMDLTQEMAKRIGSDGGGALIIDYGMN-AIISDSLQAIRKHKFVNILDDPGSADLSAYVD 372 Query: 278 FQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLMK--QTARKDILLDSVKRLVS 333 F + A + ++G TQ +FL LGI R +L++ + + L +LV Sbjct: 373 FPSIKHSAEEASENVSVHGPMTQSQFLGSLGINFRVDALLQNCNDEQAESLRAGYWQLVG 432 Query: 334 TSAD----------KKSMGEL 344 MG Sbjct: 433 DGEAPFWEGPNEQTPIGMGTR 453 >gi|156371594|ref|XP_001628848.1| predicted protein [Nematostella vectensis] gi|156215834|gb|EDO36785.1| predicted protein [Nematostella vectensis] Length = 425 Score = 266 bits (679), Expect = 5e-69, Method: Composition-based stats. Identities = 120/383 (31%), Positives = 191/383 (49%), Gaps = 37/383 (9%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 + L++ I+ I +G ++V Y + +P GYY + FG GDF+T+PEI+Q+FGE+ Sbjct: 48 SALMKHIIQRITISGAISVAAYMQEVLTNPLAGYYMKKDVFGQAGDFITSPEITQVFGEL 107 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + ++ + W Q G +++VELGPGRG +M DILRV+ K K LS+++VE S L Sbjct: 108 IGVWFVHQWMQTG-ERDIQIVELGPGRGTLMADILRVVKKFKALQ-ECLSVHLVEVSPAL 165 Query: 123 TLIQKKQLASYGDK-----------------------INWYTSLADVPLGFTFLVANEFF 159 + IQK L + + WY+S+ D+P ++F +A+EFF Sbjct: 166 SDIQKTTLTGISEMTNKEPSKENKPYYKQCCSKDGIPVFWYSSIKDIPKAYSFFLAHEFF 225 Query: 160 DSLPIKQFVMTEHGIRERMIDIDQHDSL--VFNIGDHEIKSNFLTCSDYFLGAIFENSPC 217 D+LPI QF T+ G RE ++D+D+ + F + ++ E P Sbjct: 226 DALPIHQFQRTDRGWREVLVDVDKERTHSLRFVLAPGPTLASQTYVPKGTSSRQLEVCPQ 285 Query: 218 RDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVD 277 M+ I +R+ DGG+A++IDYG TL++ H L PG AD++++VD Sbjct: 286 GGVIMEEIGERIRHDGGSALIIDYG-EDGNNRHTLRSFSKHKLHDVLEAPGTADITANVD 344 Query: 278 FQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTA--RKDILLDSVKRLVSTS 335 F+ L A + G TQ FL+ +GI QR L+ + + + L K L Sbjct: 345 FRFLRQ-AAGSDVNTYGPVTQVSFLKNMGIDQRMRILLSKASPDQAKQLYSGYKMLT--- 400 Query: 336 ADKKSMGELFKILVVSHEKVELM 358 + MGE FK+ V+ + Sbjct: 401 ---EEMGEKFKVFAVTDRQSPEP 420 >gi|58617060|ref|YP_196259.1| hypothetical protein ERGA_CDS_03330 [Ehrlichia ruminantium str. Gardel] gi|58416672|emb|CAI27785.1| Conserved hypothetical protein [Ehrlichia ruminantium str. Gardel] Length = 367 Score = 266 bits (679), Expect = 5e-69, Method: Composition-based stats. Identities = 120/355 (33%), Positives = 191/355 (53%), Gaps = 19/355 (5%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 M + L + I G ++V+Q+ + + D GYY T PFG GDFVT+PEISQ+FG Sbjct: 29 MHSYLKKVI---FDNGGAISVEQFMRIALYDMNCGYYMTQMPFGVFGDFVTSPEISQLFG 85 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 E++A++++ WE+ G PS L+ELGPGRG ++ DI+RV+ K K +S + IY++E S Sbjct: 86 EVIALWVLLYWEKMGSPSKFVLLELGPGRGTLISDIIRVLKKFK-QCYSAVDIYLLEVSP 144 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 +L +Q L G+K+ W + +P ++ANEFFD+LPIKQF+ ER I Sbjct: 145 KLQEVQYNTLQDVGEKVLWCRDINSIPNYPILVIANEFFDALPIKQFICISDSWYERYIT 204 Query: 181 IDQHDSLVFNIGDHEIKSNFLTCS-DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVI 239 ++ + F + I NF + + I E ++ I ++ + G A++I Sbjct: 205 VEDNK---FRFINKLIDKNFQILNVNNINDPIIEVCDDAISIIKLIEHKILQNKGAAVII 261 Query: 240 DYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQG 299 DYGY+ T+Q+VK H Y + N G +D++ HVDF L + + TQ Sbjct: 262 DYGYIDPPYKSTMQSVKNHQYNNIFENVGNSDITVHVDFTALRK---SLSFLNSYIMTQR 318 Query: 300 KFLEGLGIWQRAFSLMKQT--ARKDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 FL GI +R L++ A++ L+ RL ++MG +FK+L+++H Sbjct: 319 DFLYNFGIRERLQILIENATEAQQQNLMTGFLRLT------ENMGSMFKVLLINH 367 >gi|224047646|ref|XP_002192694.1| PREDICTED: hypothetical protein [Taeniopygia guttata] Length = 446 Score = 265 bits (678), Expect = 5e-69, Method: Composition-based stats. Identities = 122/374 (32%), Positives = 188/374 (50%), Gaps = 31/374 (8%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 ++R + ++ +G +TV +Y + +P GYYS G GDF+T+PEISQ+FGE++ Sbjct: 48 TMLRHLTRKLRASGPVTVAEYMREALTNPGQGYYSRRGGVGESGDFITSPEISQVFGELI 107 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV-LSIYMVETSERL 122 I+ I W G P+ +LVELGPGRG + DILRV +L + +S+++VE S +L Sbjct: 108 GIWYISEWMAMGKPTTFQLVELGPGRGTLTEDILRVFKQLASVLSTCDVSVHLVEVSPKL 167 Query: 123 TLIQ-------------------KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLP 163 + IQ K ++ G I WY + DVP G++F +A+EFFD+LP Sbjct: 168 SEIQAVMLTGGKVQPSPEDETAYMKGISKTGIPIFWYRDIQDVPPGYSFYLAHEFFDALP 227 Query: 164 IKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDRE 221 I +F TE G RE ++DID D L F + + E P Sbjct: 228 IHKFQRTEKGWREVLVDIDPEVPDQLRFVLSPSRTPATQNFIQPEETRDHVEVCPEAGVI 287 Query: 222 MQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRL 281 +Q ++ R+ DGG A+V DYG+ ++ DT + + H L PG ADL++ VDF L Sbjct: 288 VQRLASRIEKDGGAALVADYGHDGTKT-DTFRGFRNHKLHDVLSAPGTADLTADVDFSYL 346 Query: 282 SSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR---KDILLDSVKRLVSTSADK 338 +A K G Q +FL+ +GI R L++ ++ + LL S L+ + Sbjct: 347 RKMA-QGKTATLGPIKQREFLKNMGIELRLQVLLQNSSDTATHEQLLHSYDMLM----NP 401 Query: 339 KSMGELFKILVVSH 352 + MG+ F + Sbjct: 402 EKMGDCFNFFALLP 415 >gi|218188556|gb|EEC70983.1| hypothetical protein OsI_02631 [Oryza sativa Indica Group] Length = 504 Score = 265 bits (678), Expect = 5e-69, Method: Composition-based stats. Identities = 124/400 (31%), Positives = 201/400 (50%), Gaps = 43/400 (10%) Query: 2 ENKLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 +++L++ + ++IK ++G ++V +Y + +P+ G+Y + FG GDF+T+PE+SQ+FG Sbjct: 106 DSELVKHLKSIIKFRSGPISVAEYMEEVLTNPQSGFYINRDVFGTSGDFITSPEVSQMFG 165 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 EM ++ +C WEQ G P V L+ELGPGRG ++ D+LR K L+I +VE S Sbjct: 166 EMTGVWAMCLWEQMGQPEKVNLIELGPGRGTLLADLLRGSSKFVNFT-KALNINLVECSP 224 Query: 121 RLTLIQKKQLASYGDKI---------------NWYTSLADVPLG-FTFLVANEFFDSLPI 164 L +Q L + I +W+ SL VP G T ++A+EF+D+LPI Sbjct: 225 TLQKVQYNTLKCEDEPIGDKTRTVSKLCGAPVHWHASLEQVPSGLPTIIIAHEFYDALPI 284 Query: 165 KQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI--------KSNFLTCSDYFLGAIFENSP 216 QF G E+M+D+ + S F + + + + + E P Sbjct: 285 HQFQKASRGWCEKMVDLAEDSSFRFVLSPQPTASLLFLSKRCGWASSEELEKVEHIEVCP 344 Query: 217 CRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHV 276 + I+DR++ DGG A++IDYG V D+LQA++ H +V L NPG ADLS++V Sbjct: 345 KAMEITEQIADRISSDGGGALIIDYG-KDGIVSDSLQAIRKHKFVHILDNPGSADLSAYV 403 Query: 277 DFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLMKQ---TARKDILLDSVKRL 331 DF + A + ++G TQ +FL LGI R +L++ + + L RL Sbjct: 404 DFASIRHSAKEASDDISVHGPMTQSQFLGSLGINFRVEALLQNCATDEQAESLRTGYWRL 463 Query: 332 VSTSAD----------KKSMGELFKILVVSHEKV-ELMPF 360 V MG + + + ++K +PF Sbjct: 464 VGDGEAPFWEGPDDQTPIGMGTRYLAMAIVNKKQGTPVPF 503 >gi|115438020|ref|NP_001043438.1| Os01g0588800 [Oryza sativa Japonica Group] gi|53792246|dbj|BAD52879.1| ATP synthase beta subunit/transcription termination factor rho-like [Oryza sativa Japonica Group] gi|113532969|dbj|BAF05352.1| Os01g0588800 [Oryza sativa Japonica Group] gi|215704112|dbj|BAG92952.1| unnamed protein product [Oryza sativa Japonica Group] gi|222618761|gb|EEE54893.1| hypothetical protein OsJ_02410 [Oryza sativa Japonica Group] Length = 504 Score = 265 bits (678), Expect = 5e-69, Method: Composition-based stats. Identities = 124/400 (31%), Positives = 201/400 (50%), Gaps = 43/400 (10%) Query: 2 ENKLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 +++L++ + ++IK ++G ++V +Y + +P+ G+Y + FG GDF+T+PE+SQ+FG Sbjct: 106 DSELVKHLKSIIKFRSGPISVAEYMEEVLTNPQSGFYINRDVFGTSGDFITSPEVSQMFG 165 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 EM ++ +C WEQ G P V L+ELGPGRG ++ D+LR K L+I +VE S Sbjct: 166 EMTGVWAMCLWEQMGQPEKVNLIELGPGRGTLLADLLRGSSKFVNFT-KALNINLVECSP 224 Query: 121 RLTLIQKKQLASYGDKI---------------NWYTSLADVPLG-FTFLVANEFFDSLPI 164 L +Q L + I +W+ SL VP G T ++A+EF+D+LPI Sbjct: 225 TLQKVQYNTLKCEDEPIGDKTRTVSKLCGAPVHWHASLEQVPSGLPTIIIAHEFYDALPI 284 Query: 165 KQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI--------KSNFLTCSDYFLGAIFENSP 216 QF G E+M+D+ + S F + + + + + E P Sbjct: 285 HQFQKASRGWCEKMVDLAEDSSFRFVLSPQPTASLLFLSKRCGWASSEELEKVEHIEVCP 344 Query: 217 CRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHV 276 + I+DR++ DGG A++IDYG V D+LQA++ H +V L NPG ADLS++V Sbjct: 345 KAMEITEQIADRISSDGGGALIIDYG-KDGIVSDSLQAIRKHKFVHILDNPGSADLSAYV 403 Query: 277 DFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLMKQ---TARKDILLDSVKRL 331 DF + A + ++G TQ +FL LGI R +L++ + + L RL Sbjct: 404 DFASIRHSAKEASDDISVHGPMTQSQFLGSLGINFRVEALLQNCATDEQAESLRTGYWRL 463 Query: 332 VSTSAD----------KKSMGELFKILVVSHEKV-ELMPF 360 V MG + + + ++K +PF Sbjct: 464 VGDGEAPFWEGPDDQTPIGMGTRYLAMAIVNKKQGTPVPF 503 >gi|296482565|gb|DAA24680.1| protein midA homolog, mitochondrial [Bos taurus] Length = 441 Score = 265 bits (677), Expect = 7e-69, Method: Composition-based stats. Identities = 120/378 (31%), Positives = 183/378 (48%), Gaps = 33/378 (8%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P GYY + G GDF+T+PEISQ+FGE+ Sbjct: 41 TPMLRHLIYKIKSTGPITVAEYMKEVLTNPAKGYYMNRDMLGEEGDFITSPEISQMFGEL 100 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSI-YMVETSER 121 L I+ I W G + +LVELGPG+G ++ DILRV +L + ++VE S++ Sbjct: 101 LGIWFISEWIAAGKNAAFQLVELGPGKGTLLGDILRVFSQLGSLLKNCDISLHLVEVSQK 160 Query: 122 LTLIQ--------------------KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDS 161 L+ IQ K + G ++WY L DVP ++F +A+EFFD Sbjct: 161 LSEIQALTLTEEKVPLERNAESPVYMKGVTKSGIPVSWYRDLQDVPKEYSFYLAHEFFDV 220 Query: 162 LPIKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRD 219 LP+ +F T HG RE ++DID D L F + + +D E P Sbjct: 221 LPVHKFQKTPHGWREVLVDIDPQVSDKLRFVLAPCATPAEAFIQND-ETRDHVEVCPEAG 279 Query: 220 REMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 +Q +S R++ GG A++ DYG+ ++ DT + GH L PG ADL++ VDF Sbjct: 280 VVIQELSQRISLTGGAALIADYGHDGTKT-DTFRGFCGHRLHDVLTAPGTADLTADVDFS 338 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQ---TARKDILLDSVKRLVSTSA 336 L + K+ G Q FL +GI R L+ + + + LL L+ Sbjct: 339 YLRRM-SQGKVASLGPVEQQTFLRNMGIDVRLKILLDKTDDPSLRQQLLQGYNMLM---- 393 Query: 337 DKKSMGELFKILVVSHEK 354 + MGE F L + + Sbjct: 394 NPMKMGERFNFLALVPHQ 411 >gi|240139638|ref|YP_002964114.1| hypothetical protein MexAM1_META1p3090 [Methylobacterium extorquens AM1] gi|240009611|gb|ACS40837.1| conserved hypothetical protein [Methylobacterium extorquens AM1] Length = 361 Score = 265 bits (677), Expect = 8e-69, Method: Composition-based stats. Identities = 131/354 (37%), Positives = 185/354 (52%), Gaps = 15/354 (4%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L+ + I+ +G + +D+Y ALC+ P GYY+T +PFG GDFVTAPEISQ+FGE+ Sbjct: 6 TPLLAILAREIRASGPLGLDRYMALCLGHPLHGYYATRDPFGRGGDFVTAPEISQMFGEL 65 Query: 63 LAIFLICAWEQHGFP-SCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 + + LVELGPGRG +M D LR + DF +++VETS Sbjct: 66 VGAWAAAVLAMMPATGVRPCLVELGPGRGTLMADALRALRAAGTDF----ELHLVETSPV 121 Query: 122 LTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 L +Q ++LA ++ S+A +P ++ANEFFD+LP +QFV TE G ER + + Sbjct: 122 LRGLQAERLAD--AAPIFHDSVASLPDAPLLIIANEFFDALPARQFVRTELGWCERRVGL 179 Query: 182 DQH-DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 D+L F + ++ GA+ M+ ++ RL GG + ID Sbjct: 180 TPEGDALAFGLDPEPDPR---LTAEAPAGAVLTLPSQGLAVMRDLARRLVARGGALLAID 236 Query: 241 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 YG+ + GDT QAV GH + PL PG+ADL+ HVDF L+ A ++G TQ Sbjct: 237 YGHDRPGFGDTFQAVAGHRFADPLARPGEADLTLHVDFGALARAAAAEGAALHGPVTQRD 296 Query: 301 FLEGLGIWQRAFSLMKQTARKDILL--DSVKRLVSTSADKKSMGELFKILVVSH 352 FL GLG+ RA L + L +V RL D + MG LFK+L SH Sbjct: 297 FLLGLGLAMRAERLKARATPDQALAIDAAVLRLTDP--DPRGMGALFKVLCASH 348 >gi|254562048|ref|YP_003069143.1| hypothetical protein METDI3653 [Methylobacterium extorquens DM4] gi|254269326|emb|CAX25292.1| conserved hypothetical protein [Methylobacterium extorquens DM4] Length = 361 Score = 265 bits (677), Expect = 8e-69, Method: Composition-based stats. Identities = 129/354 (36%), Positives = 185/354 (52%), Gaps = 15/354 (4%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L+ + I+ +G + +D+Y ALC+ P GYY+T +PFG GDFVTAPEISQ+FGE+ Sbjct: 6 TPLLAILAREIRASGPLGLDRYMALCLGHPLHGYYATRDPFGRGGDFVTAPEISQMFGEL 65 Query: 63 LAIFLICAWEQHGFP-SCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 + + LVELGPGRG +M D LR + DF +++VETS Sbjct: 66 VGAWAAAVLAMMPATGVRPCLVELGPGRGTLMADALRALRAAGSDF----ELHLVETSPV 121 Query: 122 LTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 L +Q ++LA ++ S+A +P ++ANEFFD+LP +QFV TE G ER + + Sbjct: 122 LRRLQAERLAD--AAPTFHDSVASLPDAPLLVIANEFFDALPARQFVRTELGWCERRVGL 179 Query: 182 DQH-DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 D+L F + ++ GA+ M+ ++ RL GG + ID Sbjct: 180 APDGDALAFGLDPEPDP---RLTAEAPAGAVLTLPSQGLAVMRDLARRLVARGGALLAID 236 Query: 241 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 YG+ + GDT QAV H + PL PG+ADL+ HVDF L+ A ++G TQ Sbjct: 237 YGHDRPGFGDTFQAVAAHRFADPLARPGEADLTLHVDFGALARAAAAEGAALHGPVTQRD 296 Query: 301 FLEGLGIWQRAFSLMKQ--TARKDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 FL GLG+ RA L + + + +V RL D + MG LFK+L SH Sbjct: 297 FLLGLGLAMRAERLKARATPDQAQAIDAAVLRLTDP--DPRGMGALFKVLCASH 348 >gi|58579001|ref|YP_197213.1| hypothetical protein ERWE_CDS_03370 [Ehrlichia ruminantium str. Welgevonden] gi|58417627|emb|CAI26831.1| Conserved hypothetical protein [Ehrlichia ruminantium str. Welgevonden] Length = 367 Score = 265 bits (677), Expect = 8e-69, Method: Composition-based stats. Identities = 118/355 (33%), Positives = 190/355 (53%), Gaps = 19/355 (5%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 M + L + I G ++V+Q+ + + D GYY T PFG GDFVT+PEISQ+FG Sbjct: 29 MHSYLKKVI---FDNGGAISVEQFMRIALYDMNCGYYMTQMPFGVFGDFVTSPEISQLFG 85 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 E++A++++ WE+ G PS L+ELGPGRG ++ DI+RV+ K K +S + IY++E S Sbjct: 86 EVIALWVLLYWEKMGSPSKFVLLELGPGRGTLISDIIRVLKKFK-QCYSAVDIYLLEVSP 144 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 +L +Q L G+K+ W ++ +P ++ANEFFD+LPIKQF+ ER I Sbjct: 145 KLQEVQYNTLQDVGEKVLWCRNINSIPNYPILVIANEFFDALPIKQFICISDSWYERYIT 204 Query: 181 IDQHDSLVFNIGDHEIKSNFLTCS-DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVI 239 ++ + F + I NF + + I E ++ I ++ + G A++I Sbjct: 205 VEDNK---FRFINKLIDKNFQILNVNNINDPIIEVCDDAISIIKLIEHKILQNKGAAVII 261 Query: 240 DYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQG 299 DYGY+ T+Q+VK H Y + N G +D++ HVDF L + + TQ Sbjct: 262 DYGYIDPPYKSTMQSVKNHQYNNIFENVGNSDITVHVDFTALRK---SLSFLNSYIMTQR 318 Query: 300 KFLEGLGIWQRAFSLMKQTA--RKDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 FL GI +R L++ ++ L+ RL ++MG +FK+L+++ Sbjct: 319 DFLYNFGIRERLQILIENATEVQQQNLMTGFLRLT------ENMGSMFKVLLINP 367 >gi|114799553|ref|YP_760100.1| hypothetical protein HNE_1383 [Hyphomonas neptunium ATCC 15444] gi|114739727|gb|ABI77852.1| conserved hypothetical protein [Hyphomonas neptunium ATCC 15444] Length = 348 Score = 265 bits (677), Expect = 8e-69, Method: Composition-based stats. Identities = 119/360 (33%), Positives = 178/360 (49%), Gaps = 16/360 (4%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L +++ LI+ G + + Y + + DP+ GYY+ G DF TAPE SQIFGEM+ Sbjct: 2 SLEDRLIRLIETEGPIPLSAYMQIALHDPKEGYYAARPGIGR--DFTTAPETSQIFGEMI 59 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFF-SVLSIYMVETSERL 122 ++++ W G PS LVE+GPGR ++M D L++ D F S L + ++E S L Sbjct: 60 GLWIVHEWRAMGAPSPFHLVEIGPGRALLMHDALKIAALAGGDAFLSALQLTLIEPSPAL 119 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 L+Q ++L + + L+DVP G LVANE+ D LP +QF RE +I + Sbjct: 120 RLLQTERLQRFKPQFA--AQLSDVPAGPMLLVANEYLDCLPARQFRRDGDQWRECVIGLS 177 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 L+ + E + T G + E D + + R D A+ +DYG Sbjct: 178 PERRLIMGLAADEPRPPMGTA---LTGDVVEVQSGLDLIIADLVSR--TDPFRALFVDYG 232 Query: 243 YLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 + GDTL+A + VSP+ PG +DL+ VDF R + IA L ++G T QG FL Sbjct: 233 PVDRAPGDTLRAYREGQQVSPMETPGASDLTVDVDFGRFARIAATIGLDVSGPTPQGMFL 292 Query: 303 EGLGIWQRAFSL-MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM-PF 360 GLG R L + L ++ +RL+ D + MGE FK + +S + F Sbjct: 293 LGLGAQARLNQLIQANPDEAEALYNAAQRLI----DPQQMGERFKAICLSSAGLPKPAGF 348 >gi|307294381|ref|ZP_07574225.1| protein of unknown function DUF185 [Sphingobium chlorophenolicum L-1] gi|306880532|gb|EFN11749.1| protein of unknown function DUF185 [Sphingobium chlorophenolicum L-1] Length = 361 Score = 265 bits (677), Expect = 9e-69, Method: Composition-based stats. Identities = 115/365 (31%), Positives = 169/365 (46%), Gaps = 19/365 (5%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 +E L ++ I G ++V Y A YY T +P GA GDF TAPEISQ+FG Sbjct: 8 VELTLSERLARQIAAGGPISVAHYMAEANQH----YYGTRDPLGAAGDFTTAPEISQMFG 63 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 E++ + L W + G VELGPGRG + D LR + ++ VETS Sbjct: 64 ELIGLCLADIWMRSGGRPEAHYVELGPGRGTLASDALRSMASAGLRP----RVHFVETSP 119 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 L Q + + + +SL + G +VANEFFD+LP++Q + + RER++ Sbjct: 120 SLRERQSALIPNVSHH-DAVSSLPER--GPLLVVANEFFDALPVRQLIRVGNEWRERVVV 176 Query: 181 IDQHDSLVFNIGDHEIKSNFL----TCSDYFLGAIFENSPCRDREMQSISDRLACDGGTA 236 D + +D G I E ++ R+A GG A Sbjct: 177 RPDPDEPDRFAPMAGYRRVESGIPAMAADAPEGTILEMPLAGSAIALELAHRIAKQGGAA 236 Query: 237 IVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLT 296 I++DYGY GDTLQAV+ H Y P + PG++DL++HVDF + ++A L + Sbjct: 237 IIVDYGYEGPATGDTLQAVRAHRYADPFLEPGESDLTTHVDFTMIGNMARQAGLRVTQTV 296 Query: 297 TQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKV- 355 QG FL LGI RA L + T + +++ + +MG LFK + H Sbjct: 297 GQGAFLRQLGIDARADQLSRTTPARAEEVEAAR---HRLTADDAMGTLFKAMAWVHPDWA 353 Query: 356 ELMPF 360 + F Sbjct: 354 DPAGF 358 >gi|170044535|ref|XP_001849900.1| conserved hypothetical protein [Culex quinquefasciatus] gi|167867640|gb|EDS31023.1| conserved hypothetical protein [Culex quinquefasciatus] Length = 428 Score = 265 bits (676), Expect = 1e-68, Method: Composition-based stats. Identities = 124/381 (32%), Positives = 187/381 (49%), Gaps = 35/381 (9%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEM 62 L + + I G + V Y + +P GYY + FG+ GDFVT+PEI QIFGE+ Sbjct: 44 SLKNEFKSRILATGPIPVAAYMKQVLTNPAAGYYMNEADVFGSKGDFVTSPEIGQIFGEL 103 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 +A + + W + G P+ +L+ELGPG+G MM D+LRV +L LS+++VE SE L Sbjct: 104 VAAWCLNEWTKFGRPAPYQLIELGPGKGTMMRDVLRVFRRLGAS--DGLSVHLVEMSEHL 161 Query: 123 TLIQKKQLASYGD-----------------KINWYTSLADVPLGFTFLVANEFFDSLPIK 165 + +Q + L + K+ WY L DVP GF+ ++A+EFFD+LPI Sbjct: 162 SEVQAELLCRSSEECVDKAYYRAGVTRAGTKVFWYRHLEDVPAGFSIVLAHEFFDALPIH 221 Query: 166 QFVMTEHGIRERMIDIDQHDSLV--FNIGDHEIKSNFLTCSDYFL----GAIFENSPCRD 219 +F ++ +E ++D+D + F + E L ++Y E S + Sbjct: 222 KFQKQDNVWKEVLVDVDSDNKDKLRFVLSKAETPMLKLVLNNYPELVKDREHIEISLDSE 281 Query: 220 REMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 ++ I+ R GG A+V+DYG+ + GDT +A K H PL PG ADL++ VDF Sbjct: 282 SIIRQIAQRFNATGGYALVVDYGHTGEK-GDTFRAFKNHKLHDPLEEPGSADLTADVDFS 340 Query: 280 RLSSIAILYK-LYINGLTTQGKFLEGLGIWQRAFSLMKQT---ARKDILLDSVKRLVSTS 335 L+ ++ G QG FLE G R L+ K L D K L Sbjct: 341 LLNRFCENTGQIFTMGPIEQGSFLEMAGAKDRLQVLLANAKSEEEKHRLSDGYKMLT--- 397 Query: 336 ADKKSMGELFKILVVSHEKVE 356 D+ MG FK + +++E Sbjct: 398 -DRDQMGSRFKFFALFPKELE 417 >gi|218531060|ref|YP_002421876.1| hypothetical protein Mchl_3110 [Methylobacterium chloromethanicum CM4] gi|218523363|gb|ACK83948.1| protein of unknown function DUF185 [Methylobacterium chloromethanicum CM4] Length = 361 Score = 265 bits (676), Expect = 1e-68, Method: Composition-based stats. Identities = 128/354 (36%), Positives = 185/354 (52%), Gaps = 15/354 (4%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L+ + I+ +G + +D+Y A C+ P GYY+T +PFG GDFVTAPEISQ+FGE+ Sbjct: 6 TPLLAILAREIRASGPLGLDRYMAFCLGHPLHGYYATRDPFGRGGDFVTAPEISQMFGEL 65 Query: 63 LAIFLICAWEQHGFP-SCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 + + LVELGPGRG +M D LR + DF +++VETS Sbjct: 66 VGAWAAAVLAMMPATGVRPCLVELGPGRGTLMADALRALRAAGSDF----ELHLVETSPV 121 Query: 122 LTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 L +Q ++LA ++ S+A +P ++ANEFFD+LP +QFV TE G ER + + Sbjct: 122 LRRLQAERLAD--AAPTFHDSVASLPDAPLLVIANEFFDALPARQFVRTELGWCERRVGL 179 Query: 182 DQH-DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 D+L F + ++ GA+ M+ ++ RL GG + ID Sbjct: 180 APDGDALAFGLDPEPDPR---LTAEAPAGAVLTLPSQGLAVMRDLARRLVARGGALLAID 236 Query: 241 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 YG+ + GDT QAV GH + PL PG+ADL+ HVDF L+ A ++G TQ Sbjct: 237 YGHDRPGFGDTFQAVAGHRFADPLARPGEADLTLHVDFGALARAAAAEGAALHGPVTQRD 296 Query: 301 FLEGLGIWQRAFSLMKQ--TARKDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 FL LG+ RA L + + + +V RL + D + MG LFK+L SH Sbjct: 297 FLLALGLAMRAERLKARATPDQAQAIDAAVLRLTDS--DPRGMGALFKVLCASH 348 >gi|195499996|ref|XP_002097185.1| GE26081 [Drosophila yakuba] gi|194183286|gb|EDW96897.1| GE26081 [Drosophila yakuba] Length = 437 Score = 265 bits (676), Expect = 1e-68, Method: Composition-based stats. Identities = 114/393 (29%), Positives = 190/393 (48%), Gaps = 43/393 (10%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L +++ I G + V +Y + +P+ GYY + FG GDF+T+PEISQIFGE++ Sbjct: 51 SLAKQLRAKILATGPIPVAEYMREVLTNPQAGYYMNRDVFGREGDFITSPEISQIFGELV 110 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 I+L+ W + G PS +LVELGPGRG + D+L+V+ K + S++MVE S L+ Sbjct: 111 GIWLVSEWRKMGSPSPFQLVELGPGRGTLARDVLKVLTKF--KQDAEFSMHMVEVSPFLS 168 Query: 124 LIQKKQLASYG--------------------DKINWYTSLADVPLGFTFLVANEFFDSLP 163 Q ++ K W+ L DVP GF+ ++A+EFFD+LP Sbjct: 169 KAQAQRFCYSHQTLPEDAQQPHYQEGTTASGTKAFWHRRLEDVPQGFSLVLAHEFFDALP 228 Query: 164 IKQFVMTEHGIRERMIDIDQHD-----SLVFNIGDHEIKS-NFLTCSDYFLGAIFENSPC 217 + + + + +E +ID+ D + + + + + E+S Sbjct: 229 VHKLQLVDGKWQEVLIDVASSDGAEDAGFRYVLSRSQTPVSSLYRPMPGETRSCLEHSLE 288 Query: 218 RDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVD 277 +R++ +++R+ DGG A+++DYG+ + DT +A K H PL+ PG ADL++ VD Sbjct: 289 TERQVGLLAERIERDGGIALIMDYGHFGEKT-DTFRAFKQHKLHDPLLEPGSADLTADVD 347 Query: 278 FQRLSSIAILYK-LYINGLTTQGKFLEGLGIWQRAFSLMKQ--TARKDILLDSVKRLVST 334 F+ + IA ++ G QG FL+ + R L+ ++I+ + L Sbjct: 348 FKLVRHIAETRGNIHCCGPVEQGLFLQRMQGEARLEQLLAHALPENQEIIRSGYEMLT-- 405 Query: 335 SADKKSMGELFKILVVSH-------EKVELMPF 360 D MG FK L + +K + F Sbjct: 406 --DPAQMGSRFKFLAMFPGVLAPHLDKYPIAGF 436 >gi|126304576|ref|XP_001366666.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 513 Score = 264 bits (675), Expect = 1e-68, Method: Composition-based stats. Identities = 124/378 (32%), Positives = 186/378 (49%), Gaps = 33/378 (8%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R + IK G +TV +Y + +P GYY + G GDF+T+PEISQIFGE+ Sbjct: 117 TPMLRHLTYKIKATGPITVAEYMKEVLTNPVKGYYVHQDMIGERGDFITSPEISQIFGEL 176 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV-LSIYMVETSER 121 L I+ I W G S +LVELGPGRG + DILRV +L + +S+++VE S++ Sbjct: 177 LGIWYISEWMASGKSSTFQLVELGPGRGTLTGDILRVFSQLGSVLKNCDISVHLVEVSQK 236 Query: 122 LTLIQ--------------------KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDS 161 L+ IQ K + G I WY SL DVP G++F +A+EFFD+ Sbjct: 237 LSEIQALTLADETVTLEHNAESPVYMKGITKSGIPIYWYRSLQDVPQGYSFYLAHEFFDA 296 Query: 162 LPIKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRD 219 LP+ +F T+ G RE ID+D D L F + + D + E P Sbjct: 297 LPVHKFQKTQQGWREVFIDVDPQDSDKLRFVLAPSATPAETFIQPDEKRDHV-EVCPDAG 355 Query: 220 REMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 +Q +S + +GG A++ DYG+ ++ DT + GH L+ PG ADL++ VDF Sbjct: 356 VIIQILSKCIEENGGAALIADYGHDGTKT-DTFRGFCGHKLHDVLIAPGTADLTADVDFS 414 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVS---TSA 336 L + K+ G Q FL+ +GI R L+ ++ + K+L+ Sbjct: 415 YLRRM-TQGKVASLGPIQQCSFLKNMGIDVRLKVLLDNSSDTT----TRKQLIHGYDMLM 469 Query: 337 DKKSMGELFKILVVSHEK 354 + K MGE F + + Sbjct: 470 NPKKMGERFNFFALLPHQ 487 >gi|195389881|ref|XP_002053602.1| GJ23258 [Drosophila virilis] gi|194151688|gb|EDW67122.1| GJ23258 [Drosophila virilis] Length = 444 Score = 264 bits (674), Expect = 2e-68, Method: Composition-based stats. Identities = 113/398 (28%), Positives = 189/398 (47%), Gaps = 47/398 (11%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGE 61 + L R++ + G +TV +Y + +P+ GYY + FG GDFVT+PEISQIFGE Sbjct: 54 KTTLTRQLTAKMLATGPITVAEYMREVLTNPQGGYYMNRDVFGREGDFVTSPEISQIFGE 113 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 ++ ++L+ W++ G PS +LVELGPGRG + D+L+V+ K + +++MVE S Sbjct: 114 LVGVWLMNEWQKLGSPSPFQLVELGPGRGTLARDVLKVLSKF--KTGAQFTMHMVEISPF 171 Query: 122 LTLIQKKQLASYGDKI--------------------NWYTSLADVPLGFTFLVANEFFDS 161 L+ Q ++ + + W+ L DVP GF+ ++A+EFFD+ Sbjct: 172 LSKAQAQRFCYKHETVPDEAQLPYYQIGTTASGTQAYWHHRLEDVPPGFSLVLAHEFFDA 231 Query: 162 LPIKQFVMTEHGIRERMIDIDQ---------HDSLVFNIGDHEIKSNFLTCSDYFLGAIF 212 LP+ + + +E +ID+ Q V + + F Sbjct: 232 LPVHKLRLVNDQWQEVLIDVAQAQSTGSKSADFRYVVSKAQTPVSRLFKPVPQETRNY-L 290 Query: 213 ENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADL 272 E S +R + +++RL GG A+++DYG+ + DT + K H PL+ PG ADL Sbjct: 291 EYSLEAERHVGILAERLEQHGGIALIMDYGHFGDKK-DTFRGFKQHALHDPLLAPGTADL 349 Query: 273 SSHVDFQRLSSIAILYK-LYINGLTTQGKFLEGLGIWQRAFSLMKQ--TARKDILLDSVK 329 ++ VDF+ + +A ++ G QG FL+ + R L+ ++I+ + Sbjct: 350 TADVDFRLIKHVAETRGHIHCCGPVQQGDFLKRMQGEVRLEQLLAHALPENQNIIRSGYE 409 Query: 330 RLVSTSADKKSMGELFKILVVSH-------EKVELMPF 360 L + K MG FK L + +K + F Sbjct: 410 MLT----NPKQMGSRFKFLAMFPGVMADHLDKYPVAGF 443 >gi|194902098|ref|XP_001980588.1| GG18000 [Drosophila erecta] gi|190652291|gb|EDV49546.1| GG18000 [Drosophila erecta] Length = 437 Score = 264 bits (674), Expect = 2e-68, Method: Composition-based stats. Identities = 114/393 (29%), Positives = 191/393 (48%), Gaps = 43/393 (10%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L +++ I G + V +Y + +P+ GYY + FG GDF+T+PEISQIFGE++ Sbjct: 51 SLGKQLRAKILATGPIPVAEYMREVLTNPQAGYYMNRDVFGREGDFITSPEISQIFGELV 110 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 I+L+ W + G PS + VELGPGRG + D+L+V+ K + S++MVE S L+ Sbjct: 111 GIWLVSEWRKMGSPSPFQFVELGPGRGTLARDVLKVLTKF--KQDAEFSMHMVEVSPFLS 168 Query: 124 LIQKKQLASYGD--------------------KINWYTSLADVPLGFTFLVANEFFDSLP 163 Q ++ + K W+ L DVP GF+ ++A+EFFD+LP Sbjct: 169 KAQAQRFCYSHNALPEDAQLPHYQEGTTASGTKAFWHRRLQDVPQGFSLVLAHEFFDALP 228 Query: 164 IKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGA------IFENSPC 217 + + + + +E +ID+ D + + + S + E+S Sbjct: 229 VHKLQLVDGKWQEVLIDVASSDGAQEAGFRYVLSRSQTPVSSLYRPMPGETRSCLEHSLE 288 Query: 218 RDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVD 277 +R++ +++R+ DGG A+++DYG+ + DT +A K H PL+ PG ADL++ VD Sbjct: 289 TERQVGLLAERIERDGGIALIMDYGHFGEKS-DTFRAFKQHKLHDPLLEPGSADLTADVD 347 Query: 278 FQRLSSIAILYK-LYINGLTTQGKFLEGLGIWQRAFSLMKQ--TARKDILLDSVKRLVST 334 F+ + IA ++ G QG FL+ + R L+ ++I+ + L Sbjct: 348 FKLVRHIAETRGNIHCCGPVEQGLFLQRMQGEARLEQLLAHALPENQEIIRSGYEMLT-- 405 Query: 335 SADKKSMGELFKILVVSH-------EKVELMPF 360 D MG FK L + +K ++ F Sbjct: 406 --DPAQMGSRFKFLAMFPGVLAPHLDKYPVVGF 436 >gi|73980164|ref|XP_532933.2| PREDICTED: similar to CG17726-PA isoform 1 [Canis familiaris] Length = 440 Score = 263 bits (673), Expect = 2e-68, Method: Composition-based stats. Identities = 129/377 (34%), Positives = 185/377 (49%), Gaps = 32/377 (8%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R +V IK G +TV +Y + +P GYY + G GDF+T+PEISQIFGE+ Sbjct: 41 TPMLRHLVYKIKATGPITVAEYMKEVLTNPAKGYYVHRDMLGEKGDFITSPEISQIFGEL 100 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV-LSIYMVETSER 121 L I+ I W G + +LVELGPG+G + DILRV +L + +SI+MVE SE+ Sbjct: 101 LGIWFISEWMATGKNAAFQLVELGPGKGTLAGDILRVFSQLGSVLKNCDISIHMVEVSEK 160 Query: 122 LTLIQ--------------------KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDS 161 L+ IQ K + G I+WY L DVP G++F +A+EFFD Sbjct: 161 LSEIQALTLTEEKIPLERNAGSSVYMKGVTKSGIPISWYRDLHDVPKGYSFYLAHEFFDV 220 Query: 162 LPIKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRD 219 LP+ +F T G RE IDID D L F + + D + E P Sbjct: 221 LPVHKFQKTPQGWREVFIDIDPQVSDKLRFVLAPCVTPAEVFIQRD-EIRDHVEVCPEAG 279 Query: 220 REMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 +Q +S R+A GG A++ DYG+ ++ DT + GH L PG ADL++ VDF Sbjct: 280 VIIQELSQRIALTGGAALIADYGHDGTKT-DTFRGFCGHKLHDVLTAPGTADLTADVDFS 338 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR--KDILLDSVKRLVSTSAD 337 L +A + G Q FL+ +GI R L+ ++ + LL L+ + Sbjct: 339 YLRRMAEGQ-VASLGPIKQQTFLKNMGIDVRLKVLLDKSDEPARQQLLQGYDMLM----N 393 Query: 338 KKSMGELFKILVVSHEK 354 K MGE F + + Sbjct: 394 PKKMGERFNFFALLPHQ 410 >gi|91079168|ref|XP_967572.1| PREDICTED: similar to CG17726 CG17726-PA [Tribolium castaneum] gi|270004240|gb|EFA00688.1| hypothetical protein TcasGA2_TC003565 [Tribolium castaneum] Length = 412 Score = 263 bits (672), Expect = 3e-68, Method: Composition-based stats. Identities = 134/386 (34%), Positives = 189/386 (48%), Gaps = 38/386 (9%) Query: 6 IRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAI 65 ++I + IK G +TV +Y + +P GYY + FG GDF+T+PE++Q+FGEM+AI Sbjct: 33 AKQIYSKIKATGPITVAEYMKEVLINPLGGYYMHKDVFGESGDFITSPELNQMFGEMVAI 92 Query: 66 FLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLI 125 + + W + G P +++VELGPGRG + D+LRV ++++VE S L+ + Sbjct: 93 WFLNEWSKVGSPKPIQIVELGPGRGTLSQDLLRVFDHFGA--LQSATLHLVEVSPLLSDL 150 Query: 126 QKKQLASYGDKI-------------------NWYTSLADVPLGFTFLVANEFFDSLPIKQ 166 Q ++L D I WY L DVP FT LVA+EFFD+LP+ + Sbjct: 151 QARKLCIQSDNIIDKKSVIHRQGISHQGIPVKWYRQLDDVPNCFTLLVAHEFFDALPVHK 210 Query: 167 FVMTEHGIRERMIDIDQHDSLVF--NIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQS 224 F T+ G RE +IDID F I E ++ L FE SP + Sbjct: 211 FQKTKDGYREILIDIDLSKECSFRYVIAREETPASKLYIRPNETREHFEISPESLVLAKQ 270 Query: 225 ISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSI 284 I++RL DGG A++ DYG+ S DT +A K H PLV PG ADL++ VDF LS Sbjct: 271 IAERLEIDGGLALIADYGHNGS-GTDTFRAFKKHKLHDPLVEPGTADLTADVDFDALSKS 329 Query: 285 AILY-KLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDI--LLDSVKRLVSTSADKKSM 341 A + G TTQ FL +GI R +L I L K + + M Sbjct: 330 ATEAGGVITFGPTTQRDFLLKMGIEHRFKALKANAKPDQIEGLEFGYKMMTES----NQM 385 Query: 342 GELFKILVVSH-------EKVELMPF 360 GE FK L + +K + F Sbjct: 386 GERFKFLALLPAVLEKLLKKYSVAGF 411 >gi|254994910|ref|ZP_05277100.1| hypothetical protein AmarM_02202 [Anaplasma marginale str. Mississippi] gi|255003046|ref|ZP_05278010.1| hypothetical protein AmarPR_01940 [Anaplasma marginale str. Puerto Rico] Length = 343 Score = 263 bits (672), Expect = 3e-68, Method: Composition-based stats. Identities = 116/340 (34%), Positives = 182/340 (53%), Gaps = 17/340 (5%) Query: 19 MTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPS 78 +T+D++ +L + E GYY T PFG GDFVT+ EISQ+FGE++A++++ E G Sbjct: 16 VTMDRFMSLALYHEEHGYYMTRVPFGRAGDFVTSAEISQLFGEVIALWILSCLESAGISE 75 Query: 79 CVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASY--GDK 136 L+ELGPGRG +M DILRV + P + ++L ++++E S L Q+ L S+ + Sbjct: 76 KFSLLELGPGRGTLMHDILRVFEQF-PRYDALLEVHLLEISPLLRNTQRATLESFSARKE 134 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 I+W+ L ++P T +VANEFFD+LP++QF+ T +E + +D I + Sbjct: 135 ISWHCKLEELPERPTIVVANEFFDALPVRQFIRTGGAWKECCV---CNDGGNLGIVAVDT 191 Query: 197 KSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVK 256 + N D G I E + + + +GG A + DYGYLQ T+Q+VK Sbjct: 192 QYNLDEYGDVPEGGIIERCEAASDVLACLEKIIVRNGGAAAIFDYGYLQPPYRSTIQSVK 251 Query: 257 GHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMK 316 H Y L N G+ D+++HVDF L A + TQ +FL GI +R L + Sbjct: 252 SHHYCDFLDNIGECDITAHVDFGLLQKHAQRLNSKV---VTQREFLYQFGIRERLACLER 308 Query: 317 QTARKD--ILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 + L + RL ++MG +FK+L+++HE+ Sbjct: 309 NATERQRRELKGAFLRLT------ENMGTMFKVLLLNHER 342 >gi|62859273|ref|NP_001016145.1| protein midA homolog, mitochondrial precursor [Xenopus (Silurana) tropicalis] gi|82178636|sp|Q5BKM6|MIDA_XENTR RecName: Full=Protein midA homolog, mitochondrial; Flags: Precursor gi|60552371|gb|AAH91018.1| hypothetical protein LOC548899 [Xenopus (Silurana) tropicalis] gi|89268182|emb|CAJ81481.1| novel protein [Xenopus (Silurana) tropicalis] Length = 430 Score = 263 bits (672), Expect = 3e-68, Method: Composition-based stats. Identities = 125/386 (32%), Positives = 191/386 (49%), Gaps = 35/386 (9%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 N L+ ++ IK G +TV +Y + +P GYY + G GDFVT+PE+SQIFGE+ Sbjct: 41 NALLNHLIFKIKSTGPITVSEYMREVLTNPVKGYYMHHDMLGEHGDFVTSPELSQIFGEL 100 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV-LSIYMVETSER 121 L ++ I W G P ++LVELGPGRG + D+LRV S +S+++VE S + Sbjct: 101 LGVWCISEWMSAGKPKSLQLVELGPGRGTLTDDLLRVFSNFGRLLNSCDISVHLVEVSPK 160 Query: 122 LTLIQKKQLASY--------------------GDKINWYTSLADVPLGFTFLVANEFFDS 161 L+ IQ ++L G + WY + DVP GF+F +A+EFFD+ Sbjct: 161 LSDIQAQRLTGKAIEVELDKNSPVYKKGITKTGFPVCWYQDIQDVPTGFSFYIAHEFFDA 220 Query: 162 LPIKQFVMTEHGIRERMIDIDQ--HDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRD 219 LPI + T+ G RE +IDID D L F +G + D E P Sbjct: 221 LPIHKLQKTKDGWREILIDIDPGIPDKLRFVLGPNVSLVANTFVQDDEPRDHVEVCPSAA 280 Query: 220 REMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 +Q +++++ GG A++ DYG++ R DT + + H L NPG ADL++ VDF Sbjct: 281 VIIQKLANQINSYGGAALIADYGHMGERT-DTFRGFRAHKLHDVLSNPGTADLTADVDFN 339 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR---KDILLDSVKRLVSTSA 336 + I G TQ +FL+ +GI R L+++++ + L+ L+ Sbjct: 340 FMRRIVGEA-ASCLGPVTQHEFLKNMGIDIRLKVLLEKSSDVAVQKQLIHGYNILM---- 394 Query: 337 DKKSMGELFKILVVSHE---KVELMP 359 + MG+ FK V + K + P Sbjct: 395 NADQMGQRFKFFSVVPQSRLKTTMPP 420 >gi|300024011|ref|YP_003756622.1| hypothetical protein Hden_2505 [Hyphomicrobium denitrificans ATCC 51888] gi|299525832|gb|ADJ24301.1| protein of unknown function DUF185 [Hyphomicrobium denitrificans ATCC 51888] Length = 379 Score = 263 bits (671), Expect = 4e-68, Method: Composition-based stats. Identities = 126/362 (34%), Positives = 183/362 (50%), Gaps = 11/362 (3%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGE 61 + L R I I+++G MTV Y A C+ D FGYY FGA GDF+TA +ISQ+FGE Sbjct: 10 DTPLGRLIKEGIQRDGPMTVQAYMARCLWDEPFGYYRRQRVFGASGDFITAADISQVFGE 69 Query: 62 MLAIFLICAWEQ-HGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 ++ ++ W+ G P + L E GPGRG MM D LR + P F + Y++E S+ Sbjct: 70 LIGVWTGVVWQTVFGAPGTITLAEYGPGRGTMMRDALRAARVV-PGFIEAVHPYLIEASQ 128 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 L+ +Q LA + + W L + +VANEF DS P+ Q+V T G R R + Sbjct: 129 TLSQLQATTLADFRSRATWGAKLDE-FSPPAIIVANEFLDSWPVAQWVKTVDGWRIRGVG 187 Query: 181 IDQHDSLVFNIGDHEIKSNFL--TCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 ++ L F D + D +GA+ E DR ++ + ++ Sbjct: 188 LNASGHLEFTAVDGDCPHEAFDALLPDAQVGAVVETQ-RLDRLADALQSLMQRGPVVMLL 246 Query: 239 IDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQ 298 IDYG+ + GDTLQAV+ H Y SPL +PG+ADL+ HV+F L+S L ++G TQ Sbjct: 247 IDYGHTAAAAGDTLQAVREHKYESPLASPGEADLTVHVNFYDLASTLHRAGLALDGPVTQ 306 Query: 299 GKFLEGLGIWQRAFSL-MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVEL 357 +FL +GI +RA L R + V RL++ MG FK+L + Sbjct: 307 AEFLGAVGIVERASRLMSANPQRAGEIEAGVARLLA----PNGMGSRFKVLAARSPDLPP 362 Query: 358 MP 359 +P Sbjct: 363 LP 364 >gi|56416701|ref|YP_153775.1| hypothetical protein AM487 [Anaplasma marginale str. St. Maries] gi|222475067|ref|YP_002563482.1| hypothetical protein AMF_360 [Anaplasma marginale str. Florida] gi|56387933|gb|AAV86520.1| hypothetical protein AM487 [Anaplasma marginale str. St. Maries] gi|222419203|gb|ACM49226.1| Conserved hypothetical protein [Anaplasma marginale str. Florida] Length = 343 Score = 262 bits (670), Expect = 5e-68, Method: Composition-based stats. Identities = 116/340 (34%), Positives = 182/340 (53%), Gaps = 17/340 (5%) Query: 19 MTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPS 78 +T+D++ +L + E GYY T PFG GDFVT+ EISQ+FGE++A++++ E G Sbjct: 16 VTMDRFMSLALYHEEHGYYMTRVPFGRAGDFVTSAEISQLFGEVVALWILSYLESAGISE 75 Query: 79 CVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASY--GDK 136 L+ELGPGRG +M DILRV + P + ++L ++++E S L Q+ L S+ + Sbjct: 76 KFSLLELGPGRGTLMHDILRVFEQF-PRYDALLEVHLLEISPLLRNTQRATLESFSARKE 134 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 I+W+ L ++P T +VANEFFD+LP++QF+ T +E + +D I + Sbjct: 135 ISWHCKLEELPERPTIVVANEFFDALPVRQFIRTSGAWKECCV---CNDGGNLGIVAVDT 191 Query: 197 KSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVK 256 + N D G I E + + + +GG A + DYGYLQ T+Q+VK Sbjct: 192 QYNLDEYGDVPEGGIIERCEAASDVLACLEKIIVRNGGAAAIFDYGYLQPPYRSTIQSVK 251 Query: 257 GHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMK 316 H Y L N G+ D+++HVDF L A + TQ +FL GI +R L + Sbjct: 252 SHHYCDFLDNIGECDITAHVDFGLLQKHAQRLNSKV---VTQREFLYQFGIRERLACLER 308 Query: 317 QTARKD--ILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 + L + RL ++MG +FK+L+++HE+ Sbjct: 309 NATERQRRELKGAFLRLT------ENMGTMFKVLLLNHER 342 >gi|291386928|ref|XP_002709807.1| PREDICTED: hypothetical protein [Oryctolagus cuniculus] Length = 442 Score = 262 bits (670), Expect = 5e-68, Method: Composition-based stats. Identities = 128/379 (33%), Positives = 189/379 (49%), Gaps = 33/379 (8%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGE 61 ++R +V IK G +TV +Y + +P GYY + G GDFVT+PEISQIFGE Sbjct: 41 TTPMLRHLVYKIKSTGPITVAEYMKEVLTNPAKGYYVYRDMLGEKGDFVTSPEISQIFGE 100 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV-LSIYMVETSE 120 +L I+ I W G + +LVELGPGRG + DILRV +L + +SI++VE S+ Sbjct: 101 LLGIWFISEWMATGKSAAFQLVELGPGRGTLAGDILRVFNQLGSVLKNCDISIHLVEVSQ 160 Query: 121 RLTLIQ--------------------KKQLASYGDKINWYTSLADVPLGFTFLVANEFFD 160 +L+ IQ K ++ G + WY +L DVP G++F +A+EFFD Sbjct: 161 KLSEIQAVTLTEEKVPLERNADSPVYMKGVSKTGIPVCWYRNLQDVPKGYSFYLAHEFFD 220 Query: 161 SLPIKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCR 218 LP+ +F T G RE +ID D D L F + + D + E P Sbjct: 221 VLPVHKFQKTPQGWREVLIDTDPQVSDKLRFVLAPCATPAEAFIHHDEKRDHV-EVCPDA 279 Query: 219 DREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDF 278 +Q +S R+A GG A++ DYG+ ++ DT + GH L+ PG ADL++ VDF Sbjct: 280 GVVIQELSQRIALTGGAALIADYGHDGTKT-DTFRGFCGHKLHDVLIAPGMADLTADVDF 338 Query: 279 QRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQ---TARKDILLDSVKRLVSTS 335 L +A ++ G TQ FL+ +GI R L + + ++ LL L+ Sbjct: 339 SYLRRMA-QGQVASLGPITQQTFLKNMGIDVRLKILADKSHEPSVREQLLQGYDMLM--- 394 Query: 336 ADKKSMGELFKILVVSHEK 354 + K MGE F + + Sbjct: 395 -NPKKMGERFHFFALLPHQ 412 >gi|297667835|ref|XP_002812170.1| PREDICTED: protein midA homolog, mitochondrial-like isoform 1 [Pongo abelii] Length = 441 Score = 262 bits (669), Expect = 7e-68, Method: Composition-based stats. Identities = 126/378 (33%), Positives = 188/378 (49%), Gaps = 33/378 (8%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P GYY + G GDF+T+PEISQIFGE+ Sbjct: 41 TPMLRHLIYKIKSTGPITVAEYMKEVLTNPAKGYYVYRDMLGEKGDFITSPEISQIFGEL 100 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV-LSIYMVETSER 121 L I+ I W G + +LVELGPGRG ++ DILRV +L + +S+++VE S++ Sbjct: 101 LGIWFISEWMATGKSTAFQLVELGPGRGTLVGDILRVFTQLGSVLKNCDISVHLVEVSQK 160 Query: 122 LTLIQ--------------------KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDS 161 L+ IQ K + G ++WY L DVP G++F +A+EFFD Sbjct: 161 LSEIQALTLTEEKVPLERNAGSPVYMKGVTKSGIPVSWYRDLQDVPKGYSFYLAHEFFDV 220 Query: 162 LPIKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRD 219 LP+ +F T G RE +DID D L F + + D E P Sbjct: 221 LPVHKFQKTPQGWREVFVDIDPQVSDKLRFVLAPSATPAEAFIQHD-ETRDHVEVCPDAG 279 Query: 220 REMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 ++ +S+R+A GG A+V DYG+ +R DT + GH L+ PG ADL++ VDF Sbjct: 280 VIIEELSERIALTGGAALVADYGHDGTRT-DTFRGFCGHKLHDVLIAPGTADLTADVDFS 338 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR---KDILLDSVKRLVSTSA 336 L +A K+ G Q FL+ +GI R L+ ++ + LL L+ Sbjct: 339 YLRRMA-QGKVASVGPIKQHTFLKNMGIDVRLKVLLDKSNEPSVRQQLLQGYDMLM---- 393 Query: 337 DKKSMGELFKILVVSHEK 354 + K MGE F + + Sbjct: 394 NPKKMGERFNFFALLPHQ 411 >gi|195157774|ref|XP_002019769.1| GL12571 [Drosophila persimilis] gi|194116360|gb|EDW38403.1| GL12571 [Drosophila persimilis] Length = 437 Score = 261 bits (668), Expect = 9e-68, Method: Composition-based stats. Identities = 118/392 (30%), Positives = 187/392 (47%), Gaps = 43/392 (10%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLA 64 L +++ I G +TV +Y + +P+ GYY + FG GDF+T+PEISQIFGE++ Sbjct: 52 LAKQLRAKILATGPITVAEYMREVLTNPQAGYYMNRDVFGREGDFITSPEISQIFGELVG 111 Query: 65 IFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTL 124 I+L+ W + G PS +LVELGPGRG + D+L+++ K + S++MVE S L+ Sbjct: 112 IWLVAEWRKMGSPSPFQLVELGPGRGTLARDVLKILTKF--KLGAEFSMHMVEVSPFLSK 169 Query: 125 IQKKQLASYGD--------------------KINWYTSLADVPLGFTFLVANEFFDSLPI 164 Q ++ + K W+ L DVP GF+ ++A+EFFD+LP+ Sbjct: 170 AQAQRFCYTHETLPEEAQLPHYQVGTTATGTKAYWHRRLEDVPQGFSLVLAHEFFDALPV 229 Query: 165 KQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG------AIFENSPCR 218 + + +E +ID+ + + + S F E S Sbjct: 230 HKLQLANGQWQEVLIDVAPKSEPEAANFHYVLSKSQTPVSRVFHPMPGETRQTLEYSLET 289 Query: 219 DREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDF 278 +R++ +++RL DGG +++DYG+ + DT +A K H PL+ PG ADL++ VDF Sbjct: 290 ERQVGLLAERLERDGGIGLIMDYGHFGEKT-DTFRAFKQHALHEPLLEPGTADLTADVDF 348 Query: 279 QRLSSIAILYK-LYINGLTTQGKFLEGLGIWQRAFSLMKQ--TARKDILLDSVKRLVSTS 335 + + S A L+ G QG FL + R L+ + I+ + L Sbjct: 349 KLVKSTAETRGHLHCCGPIEQGLFLSRMQGEARLEQLLANALPENEAIIRSGYEMLT--- 405 Query: 336 ADKKSMGELFKILVVSH-------EKVELMPF 360 D K MG FK L + EK + F Sbjct: 406 -DPKQMGSRFKFLAMFPGVVAPHLEKFPVAGF 436 >gi|125778362|ref|XP_001359939.1| GA14629 [Drosophila pseudoobscura pseudoobscura] gi|54639689|gb|EAL29091.1| GA14629 [Drosophila pseudoobscura pseudoobscura] Length = 437 Score = 261 bits (668), Expect = 9e-68, Method: Composition-based stats. Identities = 118/394 (29%), Positives = 187/394 (47%), Gaps = 45/394 (11%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L +++ I G +TV +Y + +P+ GYY + FG GDF+T+PEISQIFGE++ Sbjct: 51 SLAKQLRAQILATGPITVAEYMREVLTNPQAGYYMNRDVFGREGDFITSPEISQIFGELV 110 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 I+L+ W + G PS +LVELGPGRG + D+L+++ K + S++MVE S L+ Sbjct: 111 GIWLVAEWRKMGSPSPFQLVELGPGRGTLARDVLKILTKF--KLGAEFSMHMVEVSPFLS 168 Query: 124 LIQKKQLASYGD--------------------KINWYTSLADVPLGFTFLVANEFFDSLP 163 Q ++ + K W+ L DVP GF+ ++A+EFFD+LP Sbjct: 169 KAQAQRFCYTHETLPEEAQLPHYQVGTTATGTKAYWHRRLEDVPQGFSLVLAHEFFDALP 228 Query: 164 IKQFVMTEHGIRERMIDIDQ-------HDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSP 216 + + + +E +ID+ + V + + F E S Sbjct: 229 VHKLQLANGQWQEVLIDVAPESEPEAANFRYVLSKSQTPVSRVFHPMP-GETRQTLEYSL 287 Query: 217 CRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHV 276 +R++ +++RL DGG +++DYG+ + DT +A K H PL+ PG ADL++ V Sbjct: 288 ETERQVGLLAERLERDGGIGLIMDYGHFGEKT-DTFRAFKQHALHEPLLEPGTADLTADV 346 Query: 277 DFQRLSSIAILYK-LYINGLTTQGKFLEGLGIWQRAFSLMKQ--TARKDILLDSVKRLVS 333 DF+ + S A L+ G QG FL + R L+ + I+ + L Sbjct: 347 DFKLVKSTAETRGHLHCCGPIEQGLFLSRMQGEARLEQLLANALPENEAIIRSGYEMLT- 405 Query: 334 TSADKKSMGELFKILVVSH-------EKVELMPF 360 D K MG FK L + EK + F Sbjct: 406 ---DPKQMGSRFKFLAMFPGVVAPHLEKFPVAGF 436 >gi|261490670|ref|NP_001094519.2| protein midA homolog, mitochondrial [Bos taurus] Length = 441 Score = 261 bits (668), Expect = 9e-68, Method: Composition-based stats. Identities = 119/378 (31%), Positives = 183/378 (48%), Gaps = 33/378 (8%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P GYY + G GDF+T+PEISQ+FGE+ Sbjct: 41 TPMLRHLIYKIKSTGPITVAEYMKEVLTNPAKGYYMNRDMLGEEGDFITSPEISQMFGEL 100 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSI-YMVETSER 121 L I+ I W G + +LVELGPG+G ++ DILRV +L + ++VE S++ Sbjct: 101 LGIWFISEWIAAGKNAAFQLVELGPGKGTLLGDILRVFSQLGSLLKNCDISLHLVEVSQK 160 Query: 122 LTLIQ--------------------KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDS 161 L+ IQ K + G ++WY L DVP ++F +A+EFFD Sbjct: 161 LSEIQALTLTEEKVPLERNAESPVYMKGVTKSGIPVSWYRDLQDVPKEYSFYLAHEFFDV 220 Query: 162 LPIKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRD 219 LP+ +F T HG RE ++DID D L F + + +D E P Sbjct: 221 LPVHKFQKTPHGWREVLVDIDPQVSDKLRFVLAPCATPAEAFIQND-ETRDHVEVCPEAG 279 Query: 220 REMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 +Q +S R++ GG A++ DYG+ ++ DT + G+ L PG ADL++ VDF Sbjct: 280 VVIQELSQRISLTGGAALIADYGHDGTKT-DTFRGFCGYRLHDVLTAPGTADLTADVDFS 338 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQ---TARKDILLDSVKRLVSTSA 336 L + K+ G Q FL +GI R L+ + + + LL L+ Sbjct: 339 YLRRM-SQGKVASLGPVEQQTFLRNMGIDVRLKILLDKTDDPSLRQQLLQGYNMLM---- 393 Query: 337 DKKSMGELFKILVVSHEK 354 + MGE F L + + Sbjct: 394 NPMKMGERFNFLALVPHQ 411 >gi|260830669|ref|XP_002610283.1| hypothetical protein BRAFLDRAFT_126844 [Branchiostoma floridae] gi|229295647|gb|EEN66293.1| hypothetical protein BRAFLDRAFT_126844 [Branchiostoma floridae] Length = 387 Score = 261 bits (668), Expect = 1e-67, Method: Composition-based stats. Identities = 106/385 (27%), Positives = 177/385 (45%), Gaps = 69/385 (17%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGE 61 E +L++ + + +K G +TV Y + +P GYY + FG GDF+T+PEISQ+FGE Sbjct: 29 ETELLKHLRSQLKAAGPITVADYMREVLTNPTAGYYMHKDVFGTQGDFITSPEISQMFGE 88 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 +L ++ + W G P +++VELGPGRG + D+LRV + P ++S+++VE S + Sbjct: 89 LLGVWCVNEWMLGGSPRSMQVVELGPGRGTLAQDMLRVFQQF-PMMQDMVSLHLVEVSPK 147 Query: 122 LTLIQKKQLASYGD------------------------KINWYTSLADVPLGFTFLVANE 157 + +Q+++L + I+WY+ + DVP GF+ +A+E Sbjct: 148 MAAMQEERLTGVIEDDKRKNAASGGDIVYKKRKTKAGVPISWYSDIHDVPRGFSCYIAHE 207 Query: 158 FFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPC 217 F D+LP+ +F E E P Sbjct: 208 FLDALPVHKFQAGEKRDH------------------------------------LEVCPQ 231 Query: 218 RDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVD 277 +Q +++R+ GG A++ DYG+ ++ DTL+ + H L PG ADL++ VD Sbjct: 232 AGVLVQHLANRIVEHGGAALLADYGHDGTKT-DTLRGFRNHQLHEVLQEPGSADLTADVD 290 Query: 278 FQRLSS-IAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQT--ARKDILLDSVKRLVST 334 F L A K +G TQ +FLE + I R L + +++ LL L Sbjct: 291 FSYLRHMCAGKDKALTHGPITQREFLENMAIELRLQVLQRNADESQRKDLLSGYDMLT-- 348 Query: 335 SADKKSMGELFKILVVSHEKVELMP 359 + MG+ FK ++ ++ Sbjct: 349 --NPDKMGDRFKFFSITRQRTPPKG 371 >gi|87198703|ref|YP_495960.1| hypothetical protein Saro_0679 [Novosphingobium aromaticivorans DSM 12444] gi|87134384|gb|ABD25126.1| protein of unknown function DUF185 [Novosphingobium aromaticivorans DSM 12444] Length = 351 Score = 261 bits (667), Expect = 1e-67, Method: Composition-based stats. Identities = 124/358 (34%), Positives = 178/358 (49%), Gaps = 18/358 (5%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 M L+ V LI G +++ Y A YY+ +PFG GDF+TAPEISQ+FG Sbjct: 1 MTTSLLDTFVRLIANTGPISMAHYMAES----NARYYAAQDPFGVAGDFITAPEISQMFG 56 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 E++ ++L W + G P V VELGPGRG + D LR + V + VETS Sbjct: 57 ELIGLYLADIWIRAGRPEPVHYVELGPGRGTLARDALRAARRYG----LVPRTHFVETST 112 Query: 121 RLTLIQKKQLASYGDKINWYTSLAD-VPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMI 179 L +Q W+ L+ G +VANEF D+LP++Q V T G RERM+ Sbjct: 113 ALKALQLDMHPD----ARWHADLSTLPVDGPLLIVANEFLDALPVRQMVKTAAGWRERMV 168 Query: 180 DIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVI 239 +D + + + + GAI E SP + ++ RLA GGTA+ I Sbjct: 169 GLDDGRLVPVSGSAPMDAAVPAGRQEAPEGAILETSPACAAVIYEVAGRLAAQGGTALFI 228 Query: 240 DYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQG 299 DYG+ + R+G +LQAV+ H V PG+ADL++HVDF L+ IA ++ G QG Sbjct: 229 DYGHAEPRLGSSLQAVRAHRKVDVFAAPGEADLTAHVDFSALAPIAQSREVRWLGTVEQG 288 Query: 300 KFLEGLGIWQRAFSLMK-QTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE 356 +L LGI RA +L + + RL+ D+ MG LFK++ ++ Sbjct: 289 DWLRALGIEARAEALATFSPPHAQAIHAARDRLI----DEGQMGSLFKVMGLAGPTWP 342 >gi|166225925|sp|Q2KHV5|MIDA_BOVIN RecName: Full=Protein midA homolog, mitochondrial; Flags: Precursor Length = 441 Score = 261 bits (667), Expect = 1e-67, Method: Composition-based stats. Identities = 119/378 (31%), Positives = 183/378 (48%), Gaps = 33/378 (8%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P GYY + G GDF+T+PEISQ+FGE+ Sbjct: 41 TPMLRHLIYKIKSTGPITVAEYMKEVLTNPAKGYYMNRDMLGEEGDFITSPEISQMFGEL 100 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSI-YMVETSER 121 L I+ I W G + +LVELGPG+G ++ DILRV +L + ++VE S++ Sbjct: 101 LGIWFISEWIAAGKNAAFQLVELGPGKGTLLGDILRVFSQLGSLLKNCDISLHLVEVSQK 160 Query: 122 LTLIQ--------------------KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDS 161 L+ IQ K + G ++WY L DVP ++F +A+EFFD Sbjct: 161 LSEIQALTLTEEKVPLERNAESPVYMKGVTKSGIPVSWYRDLQDVPKEYSFYLAHEFFDV 220 Query: 162 LPIKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRD 219 LP+ +F T HG RE ++DID D L F + + +D E P Sbjct: 221 LPVHKFQKTPHGWREVLVDIDPQVSDKLRFVLAPCATPAGAFIQND-ETRDHVEVCPEAG 279 Query: 220 REMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 +Q +S R++ GG A++ DYG+ ++ DT + G+ L PG ADL++ VDF Sbjct: 280 VVIQELSQRISLTGGAALIADYGHDGTKT-DTFRGFCGYRLHDVLTAPGTADLTADVDFS 338 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQ---TARKDILLDSVKRLVSTSA 336 L + K+ G Q FL +GI R L+ + + + LL L+ Sbjct: 339 YLRRM-SQGKVASLGPVEQQTFLRNMGIDVRLKILLDKTDDPSLRQQLLQGYNMLM---- 393 Query: 337 DKKSMGELFKILVVSHEK 354 + MGE F L + + Sbjct: 394 NPMKMGERFNFLALVPHQ 411 >gi|114576970|ref|XP_001167268.1| PREDICTED: protein midA homolog, mitochondrial-like isoform 5 [Pan troglodytes] Length = 441 Score = 261 bits (666), Expect = 2e-67, Method: Composition-based stats. Identities = 125/378 (33%), Positives = 186/378 (49%), Gaps = 33/378 (8%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P GYY + G GDF+T+PEISQIFGE+ Sbjct: 41 TPMLRHLMYKIKSTGPITVAEYMKEVLTNPAKGYYVYRDMLGEKGDFITSPEISQIFGEL 100 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV-LSIYMVETSER 121 L I+ I W G + +LVELGPGRG ++ DILRV +L + +S+++VE S++ Sbjct: 101 LGIWFISEWMATGKSTAFQLVELGPGRGTLVGDILRVFTQLGSVLKNCDISVHLVEVSQK 160 Query: 122 LTLIQ--------------------KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDS 161 L+ IQ K + G I+WY L DVP G++F +A+EFFD Sbjct: 161 LSEIQALTLTEEKVPLERNAGSPVYMKGVTKSGIPISWYRDLHDVPKGYSFYLAHEFFDV 220 Query: 162 LPIKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRD 219 LP+ +F T G RE +DID D L F + + D E P Sbjct: 221 LPVHKFQKTPQGWREVFVDIDPQVSDKLRFVLAPSATPAEAFIQHD-ETRDHVEVCPDAG 279 Query: 220 REMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 ++ +S R+A GG A+V DYG+ ++ DT + H L+ PG ADL++ VDF Sbjct: 280 VIIEELSQRIALTGGAALVADYGHDGTKT-DTFRGFCDHKLHDVLIAPGTADLTADVDFS 338 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR---KDILLDSVKRLVSTSA 336 L +A K+ G Q FL+ +GI R L+ ++ + LL L+ Sbjct: 339 YLRRMA-QGKVASLGPIKQHTFLKNMGIDVRLKVLLDKSNEPSVRQQLLQGYDMLM---- 393 Query: 337 DKKSMGELFKILVVSHEK 354 + K MGE F + + Sbjct: 394 NPKKMGERFNFFALLPHQ 411 >gi|21396487|ref|NP_653337.1| protein midA homolog, mitochondrial isoform 1 [Homo sapiens] gi|74749891|sp|Q7L592|MIDA_HUMAN RecName: Full=Protein midA homolog, mitochondrial; Flags: Precursor gi|38197076|gb|AAH04548.2| Chromosome 2 open reading frame 56 [Homo sapiens] gi|62822267|gb|AAY14816.1| unknown [Homo sapiens] gi|119620807|gb|EAX00402.1| hypothetical protein PRO1853, isoform CRA_c [Homo sapiens] Length = 441 Score = 261 bits (666), Expect = 2e-67, Method: Composition-based stats. Identities = 125/378 (33%), Positives = 186/378 (49%), Gaps = 33/378 (8%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P GYY + G GDF+T+PEISQIFGE+ Sbjct: 41 TPMLRHLMYKIKSTGPITVAEYMKEVLTNPAKGYYVYRDMLGEKGDFITSPEISQIFGEL 100 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV-LSIYMVETSER 121 L I+ I W G + +LVELGPGRG ++ DILRV +L + +S+++VE S++ Sbjct: 101 LGIWFISEWMATGKSTAFQLVELGPGRGTLVGDILRVFTQLGSVLKNCDISVHLVEVSQK 160 Query: 122 LTLIQ--------------------KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDS 161 L+ IQ K + G I+WY L DVP G++F +A+EFFD Sbjct: 161 LSEIQALTLTKEKVPLERNAGSPVYMKGVTKSGIPISWYRDLHDVPKGYSFYLAHEFFDV 220 Query: 162 LPIKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRD 219 LP+ +F T G RE +DID D L F + + D E P Sbjct: 221 LPVHKFQKTPQGWREVFVDIDPQVSDKLRFVLAPSATPAEAFIQHD-ETRDHVEVCPDAG 279 Query: 220 REMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 ++ +S R+A GG A+V DYG+ ++ DT + H L+ PG ADL++ VDF Sbjct: 280 VIIEELSQRIALTGGAALVADYGHDGTKT-DTFRGFCDHKLHDVLIAPGTADLTADVDFS 338 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR---KDILLDSVKRLVSTSA 336 L +A K+ G Q FL+ +GI R L+ ++ + LL L+ Sbjct: 339 YLRRMA-QGKVASLGPIKQHTFLKNMGIDVRLKVLLDKSNEPSVRQQLLQGYDMLM---- 393 Query: 337 DKKSMGELFKILVVSHEK 354 + K MGE F + + Sbjct: 394 NPKKMGERFNFFALLPHQ 411 >gi|332227210|ref|XP_003262784.1| PREDICTED: protein midA homolog, mitochondrial-like isoform 1 [Nomascus leucogenys] Length = 441 Score = 261 bits (666), Expect = 2e-67, Method: Composition-based stats. Identities = 126/378 (33%), Positives = 188/378 (49%), Gaps = 33/378 (8%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P GYY + G GDF+T+PEISQIFGE+ Sbjct: 41 TPMLRHLIYKIKSTGPITVAEYMKEVLTNPAKGYYVYRDMLGEKGDFITSPEISQIFGEL 100 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV-LSIYMVETSER 121 L I+ I W G + +LVELGPGRG ++ DILRV +L + +S+++VE S++ Sbjct: 101 LGIWFISEWMATGKSTAFQLVELGPGRGTLVGDILRVFTQLGSVLKNCDISVHLVEVSQK 160 Query: 122 LTLIQ--------------------KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDS 161 L+ IQ K + G ++WY L DVP G++F +A+EFFD Sbjct: 161 LSEIQALTLTEEKVPLERNAGSPVYMKGVTKSGIPVSWYRDLHDVPKGYSFYLAHEFFDV 220 Query: 162 LPIKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRD 219 LP+ +F T G RE +DID D L F + + D E P Sbjct: 221 LPVHKFQKTPQGWREVFVDIDPQVSDKLRFVLAPSATPAEAFIQHD-ETRDHVEVCPDAG 279 Query: 220 REMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 ++ +S R+A GG A+V DYG+ ++ DTL+ GH L+ PG ADL++ VDF Sbjct: 280 VIIEELSQRIALTGGAALVADYGHEGTKT-DTLRGFCGHKLHDVLIAPGTADLTADVDFS 338 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR---KDILLDSVKRLVSTSA 336 L +A K+ G Q FL+ +GI R L+ ++ + LL L+ Sbjct: 339 YLRRMA-QGKVASLGPIKQHTFLKNMGIDVRLKVLLDKSNEPSVRQQLLQGYDMLM---- 393 Query: 337 DKKSMGELFKILVVSHEK 354 + K MGE F + + Sbjct: 394 NPKKMGERFNFFALLPHQ 411 >gi|326386423|ref|ZP_08208046.1| hypothetical protein Y88_2317 [Novosphingobium nitrogenifigens DSM 19370] gi|326209084|gb|EGD59878.1| hypothetical protein Y88_2317 [Novosphingobium nitrogenifigens DSM 19370] Length = 364 Score = 261 bits (666), Expect = 2e-67, Method: Composition-based stats. Identities = 122/354 (34%), Positives = 178/354 (50%), Gaps = 16/354 (4%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 +L + V LI G ++V Q+ A YY + +P G+ GDF+TAPEISQ+FGE++ Sbjct: 17 RLKDRFVRLIAATGPISVAQFVAES----NARYYDSRDPLGSAGDFITAPEISQMFGELI 72 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 ++L W++ G P + VELGPGRG + D L + I+ VE S L Sbjct: 73 GLWLADMWDRAGRPGPIHYVELGPGRGTLARDALGAARRFG----LSPEIHFVEGSTALR 128 Query: 124 LIQKKQLASYGDKINWYTSLADVPL-GFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 +Q+ +W+ LA +P G +VANEF D+LPI+Q VMT G RERM+ I+ Sbjct: 129 AVQQSHFPK----AHWHDDLASLPETGPLLIVANEFLDALPIRQLVMTASGWRERMVGIE 184 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 + + + GAI E SP + I+ RLA GG A+VIDYG Sbjct: 185 GDRLVPIAGTQPMDAAVPAELASAHEGAILETSPAAAAVTREIARRLATQGGAALVIDYG 244 Query: 243 YLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 + G +LQA++ HT V+P PG+ADL++HVDF L + G QG FL Sbjct: 245 RAEPAYGSSLQALRAHTKVNPFECPGEADLTAHVDFSVLRPVVEAEGARWLGTVEQGAFL 304 Query: 303 EGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE 356 LG+ R +L + K +V+ A ++MG LFK++ ++ Sbjct: 305 ISLGLGPRMEALCRAAPDK---AQAVRAAAHRLAAPEAMGSLFKVMGLAAPGWP 355 >gi|158937256|ref|NP_082887.2| protein midA homolog, mitochondrial precursor [Mus musculus] Length = 436 Score = 261 bits (666), Expect = 2e-67, Method: Composition-based stats. Identities = 123/378 (32%), Positives = 183/378 (48%), Gaps = 33/378 (8%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P GYY + G GDF+T+PEISQIFGE+ Sbjct: 36 TPMLRHLMYKIKSTGPITVAEYMKEVLTNPAKGYYVHQDMLGEKGDFITSPEISQIFGEL 95 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFF-SVLSIYMVETSER 121 L ++ + W G +LVELGPGRG + DILRV +L +SI++VE S++ Sbjct: 96 LGVWFVSEWIASGKSPAFQLVELGPGRGTLTADILRVFSQLGSVLKTCAISIHLVEVSQK 155 Query: 122 LTLIQ--------------------KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDS 161 L+ IQ K + G ++WY L DVP G++ +A+EFFD Sbjct: 156 LSEIQALTLAEEKVPLERDAESLVYMKGVTKSGIPVSWYRDLKDVPEGYSLYLAHEFFDV 215 Query: 162 LPIKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRD 219 LP+ +F T G RE +D+D D L F + + D E P Sbjct: 216 LPVHKFQKTPRGWREVFVDVDPQASDKLRFVLAPCATPAEAFIQRD-ERREHVEVCPDAG 274 Query: 220 REMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 +Q +S R+A GG A++ DYG+ ++ DTL+ GH L+ PG ADL++ VDF Sbjct: 275 VIIQELSQRIASTGGAALIADYGHDGTKT-DTLRGFYGHQLHDVLIAPGTADLTADVDFS 333 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR---KDILLDSVKRLVSTSA 336 L +A K+ G Q FL+ +GI R L+ + K LL L+ Sbjct: 334 YLRRMA-QGKVASLGPVEQRTFLKNMGIDVRLKVLLDKAGEPSAKQQLLGGYDMLM---- 388 Query: 337 DKKSMGELFKILVVSHEK 354 + + MGE F + + Sbjct: 389 NPQKMGERFHFFALLPHQ 406 >gi|149727989|ref|XP_001501018.1| PREDICTED: similar to CG17726 CG17726-PA [Equus caballus] Length = 442 Score = 261 bits (666), Expect = 2e-67, Method: Composition-based stats. Identities = 127/378 (33%), Positives = 183/378 (48%), Gaps = 33/378 (8%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P GYY + G GDF+T+PEISQIFGE+ Sbjct: 42 TPMLRHLMYKIKSTGPITVAEYMREVLTNPAKGYYVYRDMLGEKGDFITSPEISQIFGEL 101 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV-LSIYMVETSER 121 L I+ I W G + +LVELGPGRG + DILRV +L + +SI++VE S++ Sbjct: 102 LGIWFISEWMATGKSAAFQLVELGPGRGTLAGDILRVFSQLGSVLKNCDISIHLVEVSQK 161 Query: 122 LTLIQ--------------------KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDS 161 L+ IQ K + G I+WY L DVP ++F +A+EFFD Sbjct: 162 LSEIQALTLAEEKIPLERNAGSPAYMKGVTKSGIPISWYRDLQDVPKEYSFYLAHEFFDV 221 Query: 162 LPIKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRD 219 LP+ +F T G RE IDID D L F + + D E P Sbjct: 222 LPVHKFQKTPQGWREVFIDIDPQVSDKLRFVLAPCATPAEAFIQCD-ETRDHVEVCPDAG 280 Query: 220 REMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 +Q +S R+A GG A++ DYG+ ++ DT + GH L+ PG ADL++ VDF Sbjct: 281 VIIQELSQRIALTGGAALIADYGHDGTKT-DTFRGFCGHKLHDVLIAPGTADLTADVDFS 339 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSL---MKQTARKDILLDSVKRLVSTSA 336 L +A K+ G Q FL+ +GI R L + + LL L+ Sbjct: 340 YLRRMA-QGKVASLGPVQQQTFLKNMGIDVRLKVLLDKSDDPSMRQQLLQGYDMLM---- 394 Query: 337 DKKSMGELFKILVVSHEK 354 + K MGE F + + Sbjct: 395 NPKKMGERFNFFALLPHQ 412 >gi|115529407|ref|NP_001070231.1| protein midA homolog, mitochondrial precursor [Danio rerio] gi|123908270|sp|Q08BY0|MIDA_DANRE RecName: Full=Protein midA homolog, mitochondrial; Flags: Precursor gi|115313407|gb|AAI24509.1| Zgc:153989 [Danio rerio] gi|220672997|emb|CAX14543.1| novel protein (zgc:153989) [Danio rerio] Length = 422 Score = 261 bits (666), Expect = 2e-67, Method: Composition-based stats. Identities = 124/376 (32%), Positives = 190/376 (50%), Gaps = 30/376 (7%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 + +++ + + I G ++V +Y + +P GYY + GA GDF+T+PEISQIFG Sbjct: 26 INKSILKHLASKIIATGPISVAEYMREALTNPVLGYYVKNDMLGAGGDFITSPEISQIFG 85 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV-LSIYMVETS 119 E+L ++ I W G S ++LVELGPGRG + DILRV +LK +SI++VE S Sbjct: 86 ELLGVWCISEWMAAGKSSALQLVELGPGRGSLTSDILRVFSQLKGVLGETGISIHLVEVS 145 Query: 120 ERLTLIQKKQLASYGDK-------------------INWYTSLADVPLGFTFLVANEFFD 160 +L+ +Q + L + I WY S+ DVP GF+ +A+EFFD Sbjct: 146 PKLSQVQAECLTGNQTQTYDNNHTFYRSGTTCTGLPIYWYHSIEDVPRGFSIFLAHEFFD 205 Query: 161 SLPIKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCR 218 +LPI +F TE+G RE ++DID L F + ++ E Sbjct: 206 ALPIHKFQRTENGWREVLVDIDPENPGKLRFVVSHRPTLASSTLIQKDESRRHVEVCAEA 265 Query: 219 DREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDF 278 +Q ++ R+A DGG A+++DYG+ ++ DT + KGH L PG ADL++ VDF Sbjct: 266 GVIVQKLASRIAEDGGAALIVDYGHDGTKT-DTFRGFKGHQIHDVLEAPGLADLTADVDF 324 Query: 279 QRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSL--MKQTARKDILLDSVKRLVSTSA 336 L +A + G TQ FL+ +GI R L + + L+ S L+ Sbjct: 325 SYLRKMAGDQ-VICLGPITQRSFLKNMGIDSRMQVLLSSNDPSIRAQLIHSYDMLI---- 379 Query: 337 DKKSMGELFKILVVSH 352 + + MGE F+ V + Sbjct: 380 NPEKMGERFQFFSVLN 395 >gi|56605714|ref|NP_001008319.1| protein midA homolog, mitochondrial precursor [Rattus norvegicus] gi|81883713|sp|Q5XI79|MIDA_RAT RecName: Full=Protein midA homolog, mitochondrial; Flags: Precursor gi|54035316|gb|AAH83810.1| Similar to PRO1853 homolog [Rattus norvegicus] gi|149050627|gb|EDM02800.1| similar to PRO1853 homolog, isoform CRA_a [Rattus norvegicus] Length = 436 Score = 261 bits (666), Expect = 2e-67, Method: Composition-based stats. Identities = 123/375 (32%), Positives = 183/375 (48%), Gaps = 27/375 (7%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P GYY + G GDF+T+PEISQIFGE+ Sbjct: 36 TPMLRHLMYKIKSTGPITVAEYMKEVLTNPAKGYYVHHDMLGEKGDFITSPEISQIFGEL 95 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV-LSIYMVETSER 121 L ++ + W G + +LVELGPGRG + DILRV +L + +SI++VE S++ Sbjct: 96 LGVWFVSEWMASGKSTAFQLVELGPGRGTLTADILRVFSQLGSVLKTCDISIHLVEVSQK 155 Query: 122 LTLIQ--------------------KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDS 161 L+ IQ K + G I+WY L DVP G++F +A+EFFD Sbjct: 156 LSEIQALTLTEETVPLERDAESLVYMKGVTKSGIPISWYRDLKDVPTGYSFYLAHEFFDV 215 Query: 162 LPIKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRD 219 LP+ +F T HG RE +DID D L F + + D E P Sbjct: 216 LPVHKFQKTPHGWREVFVDIDPQSPDKLRFVLAPCATPAEAFIQRD-ERREHVEVCPDAG 274 Query: 220 REMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 +Q +S R+A GG A++ DYG+ ++ DTL+ H L PG ADL++ VDF Sbjct: 275 VVIQELSQRIASTGGAALIADYGHDGTKT-DTLRGFYEHQLHDVLTAPGTADLTADVDFS 333 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKK 339 L +A ++ G Q FL+ +GI R L+ + L + R + + Sbjct: 334 YLRRMA-QGRVASLGPVEQRTFLKNMGIDVRLKVLLDKAGDPS-LQQQLLRGYDMLMNPQ 391 Query: 340 SMGELFKILVVSHEK 354 MGE F + + Sbjct: 392 KMGERFHFFALLPHQ 406 >gi|269958880|ref|YP_003328669.1| hypothetical protein ACIS_00816 [Anaplasma centrale str. Israel] gi|269848711|gb|ACZ49355.1| hypothetical protein ACIS_00816 [Anaplasma centrale str. Israel] Length = 342 Score = 260 bits (663), Expect = 3e-67, Method: Composition-based stats. Identities = 114/342 (33%), Positives = 178/342 (52%), Gaps = 16/342 (4%) Query: 16 NGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHG 75 + +T+D++ L + + GYY T PFG GDF+T+ EISQ+FGE++A++++ E G Sbjct: 13 SKYVTMDRFMDLALYHEKHGYYMTRVPFGRAGDFITSAEISQLFGEVVALWILSYLESAG 72 Query: 76 FPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASY-- 133 L+ELGPGRG +M D+LRV + P + ++L ++++E S L Q+ L S+ Sbjct: 73 ISEKFSLLELGPGRGTLMCDVLRVFERF-PKYDALLEVHLLEISPLLRNTQRATLESFSA 131 Query: 134 GDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGD 193 +I W+ L ++P T +VANEFFD+LP+KQFV G+ + +L D Sbjct: 132 RKEIFWHDKLEELPERPTVVVANEFFDALPVKQFVYAGSGMWKECCVYSDIGNLSVVALD 191 Query: 194 HEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQ 253 E + +D G I E ++ + + +GG A + DYGYLQ T+Q Sbjct: 192 TE--YSLNEYNDVPEGGIIERCEAAKDVLECLEGIIVRNGGAAAIFDYGYLQPPYRSTIQ 249 Query: 254 AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFS 313 +VKGH L N G+ D+++HVDF L + TQ +FL GI +R Sbjct: 250 SVKGHHRCDFLYNVGECDITAHVDFGFLQGHVRRLNSRV---VTQREFLYQFGIRERLAH 306 Query: 314 LMKQTA--RKDILLDSVKRLVSTSADKKSMGELFKILVVSHE 353 L +K L + RL ++MG LFK+L++S + Sbjct: 307 LACNATERQKRELKSAFLRLT------ENMGTLFKVLLLSDK 342 >gi|166225927|sp|Q9CWG8|MIDA_MOUSE RecName: Full=Protein midA homolog, mitochondrial; Flags: Precursor Length = 436 Score = 260 bits (663), Expect = 3e-67, Method: Composition-based stats. Identities = 124/378 (32%), Positives = 183/378 (48%), Gaps = 33/378 (8%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P GYY + G GDF+T+PEISQIFGE+ Sbjct: 36 TPMLRHLMYKIKSTGPITVAEYMKEVLTNPAKGYYVHQDMLGEKGDFITSPEISQIFGEL 95 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFF-SVLSIYMVETSER 121 L ++ + W G +LVELGPGRG + DILRV +L +SI++VE S++ Sbjct: 96 LGVWFVSEWIASGKSPAFQLVELGPGRGTLTADILRVFSQLGSVLKTCAISIHLVEVSQK 155 Query: 122 LTLIQ--------------------KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDS 161 L+ IQ K + G I+WY L DVP G++ +A+EFFD Sbjct: 156 LSEIQALTLAEEKVPLERDAESLVYMKGVTKSGIPISWYRDLKDVPEGYSLYLAHEFFDV 215 Query: 162 LPIKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRD 219 LP+ +F T G RE +D+D D L F + + D E P Sbjct: 216 LPVHKFQKTPRGWREVFVDVDPQASDKLRFVLAPCATPAEAFIQRD-ERREHVEVCPDAG 274 Query: 220 REMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 +Q +S R+A GG A++ DYG+ ++ DTL+ GH L+ PG ADL++ VDF Sbjct: 275 VIIQELSQRIASTGGAALIADYGHDGTKT-DTLRGFYGHQLHDVLIAPGTADLTADVDFS 333 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR---KDILLDSVKRLVSTSA 336 L +A K+ G Q FL+ +GI R L+ + K LL L+ Sbjct: 334 YLHRMA-QGKVASLGPVEQRTFLKNMGIDVRLKVLLDKAGEPSAKQQLLGGYDMLM---- 388 Query: 337 DKKSMGELFKILVVSHEK 354 + + MGE F + + Sbjct: 389 NPQKMGERFHFFALLPHQ 406 >gi|149596613|ref|XP_001514487.1| PREDICTED: hypothetical protein, partial [Ornithorhynchus anatinus] Length = 426 Score = 260 bits (663), Expect = 4e-67, Method: Composition-based stats. Identities = 124/376 (32%), Positives = 182/376 (48%), Gaps = 33/376 (8%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 ++R ++ +K G +TV +Y + +P GYY + G GDFVT+PEISQIFGE+L Sbjct: 27 SMLRHLLAKVKATGPITVAEYMREALTNPAKGYYVHHDVLGEKGDFVTSPEISQIFGELL 86 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV-LSIYMVETSERL 122 I+ I W G S +LVELGPGRG + DILRV +L + +S++MVE S++L Sbjct: 87 GIWYISEWMAAGKSSTFQLVELGPGRGTLTGDILRVFNQLGSVLKNCDISVHMVEVSQKL 146 Query: 123 TLIQKKQLASYGD--------------------KINWYTSLADVPLGFTFLVANEFFDSL 162 + IQ L INWY L DVP G++F +A+EFFD+L Sbjct: 147 SEIQASTLTGEKTPLERDDGSPVYMSGVTKTGIPINWYRDLQDVPQGYSFYLAHEFFDAL 206 Query: 163 PIKQFVMTEHGIRERMIDIDQ--HDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDR 220 P+ +F T G RE ID+D D L F + + + E P Sbjct: 207 PVHKFQKTPQGWREIFIDVDPLVSDKLRFVLAPSSTPAELFIQKE-ETRDHVEVCPDAGV 265 Query: 221 EMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQR 280 +Q +S R+ GG A++ DYG+ +++ DT + +GH L PG ADL++ VDF Sbjct: 266 IVQRLSQRIEETGGAALIADYGHDGTKM-DTFRGFQGHKLHDVLTAPGTADLTADVDFSY 324 Query: 281 LSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR---KDILLDSVKRLVSTSAD 337 L + K+ G Q FL+ +GI R L+ + + LL L+ + Sbjct: 325 LRRMIR-GKVASLGPIKQQTFLQNMGIDARLKVLLDNASDPSLRQQLLCGYDMLM----N 379 Query: 338 KKSMGELFKILVVSHE 353 MGE F + + Sbjct: 380 PAKMGERFHFFALLPQ 395 >gi|296224076|ref|XP_002757896.1| PREDICTED: protein midA homolog, mitochondrial-like isoform 1 [Callithrix jacchus] Length = 449 Score = 259 bits (662), Expect = 4e-67, Method: Composition-based stats. Identities = 129/385 (33%), Positives = 187/385 (48%), Gaps = 40/385 (10%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P GYY + G GDF+T+PEISQIFGE+ Sbjct: 42 TPMLRHLMYKIKSTGPITVAEYMKEVLTNPAKGYYVHRDMLGEKGDFITSPEISQIFGEL 101 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV-LSIYMVETSER 121 L I+ I W G + +LVELGPGRG ++ DILRV +L + +S+++VE S++ Sbjct: 102 LGIWFISEWMATGKSTAFQLVELGPGRGTLVGDILRVFSQLGSVLKNCDISVHLVEVSQK 161 Query: 122 LTLIQ--------------------KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDS 161 L+ +Q K + G ++WY L DVP G +F +A+EFFD Sbjct: 162 LSEVQALTLTEEKVPLERNAESPVYMKGVTKSGIPVSWYRDLHDVPKGHSFYLAHEFFDV 221 Query: 162 LPIKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRD 219 LP+ +F T G RE IDID D L F + + D I E P Sbjct: 222 LPVHKFQKTPQGWREVFIDIDPQVSDKLRFVLAPCATPAEVFIQHDETRDHI-EVCPDAG 280 Query: 220 REMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 ++ +S R+A GG A+V DYG+ ++ DT + GH L+ PG ADL++ VDF Sbjct: 281 VIIEELSRRIALTGGAALVADYGHDGTKT-DTFRGFCGHKLHDVLIAPGTADLTADVDFS 339 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFS--------LMKQTAR--KDILLDSVK 329 L +A K+ G TQ FL+ +GI R L K + K LL Sbjct: 340 FLRRMA-QGKVASLGPITQHTFLKNMGIDVRLKVRIFFFPVLLDKSNEQSVKQQLLQGYD 398 Query: 330 RLVSTSADKKSMGELFKILVVSHEK 354 L+ + K MGE F + + Sbjct: 399 MLM----NPKKMGERFNFFALLPHQ 419 >gi|157123049|ref|XP_001653802.1| hypothetical protein AaeL_AAEL009374 [Aedes aegypti] gi|108874528|gb|EAT38753.1| conserved hypothetical protein [Aedes aegypti] Length = 426 Score = 259 bits (662), Expect = 5e-67, Method: Composition-based stats. Identities = 130/382 (34%), Positives = 190/382 (49%), Gaps = 36/382 (9%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYS-TCNPFGAVGDFVTAPEISQIFGEM 62 L + + I+ G + V Y + +P GYY + + FG+ GDF+T+PEI QIFGEM Sbjct: 45 SLKHDLQSRIRATGPIPVATYMKQVLTNPSAGYYMTSRDVFGSKGDFITSPEIGQIFGEM 104 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 +A++ + W + G P +L+ELGPG+G MM D+LRV KL +++ MVE SE L Sbjct: 105 IAVWCVNEWSKFGRPVPFQLIELGPGKGTMMRDVLRVFDKL--KVSQGMAVQMVEMSEHL 162 Query: 123 TLIQKKQLASYGD-----------------KINWYTSLADVPLGFTFLVANEFFDSLPIK 165 + +Q + L KI WY L DVP GF ++A+EFFD+LP+ Sbjct: 163 SEVQARLLCRSSMEYTDKPYYRSGITASGTKIYWYRQLEDVPEGFAVVLAHEFFDALPVH 222 Query: 166 QFVMTEHGIRERMIDIDQ--HDSLVFNIGDHEIKSNFLTCSDYF----LGAIFENSPCRD 219 +FV ++ +E +IDI+ D F + E L ++Y E S + Sbjct: 223 KFVKQDNAWKEVLIDIEPKSEDGFRFIVSKSETPMLRLFLNNYPDLVKDRNQIEISFEAE 282 Query: 220 REMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 ++ I R +GG +V+DYG+L + GDT +A K H PL PG ADL++ VDF Sbjct: 283 TIIRQIGSRFNGNGGFGLVVDYGHLGEK-GDTFRAFKNHKLHDPLQEPGSADLTADVDFS 341 Query: 280 RLSSIAIL-YKLYINGLTTQGKFLEGLGIWQRAFSLMKQ----TARKDILLDSVKRLVST 334 LS +L G T+Q FLE G +R LM K L D + L Sbjct: 342 LLSRFCEDTTQLVTIGPTSQRAFLEAAGAQERLNVLMGNGGLSEEEKQRLSDGFRMLT-- 399 Query: 335 SADKKSMGELFKILVVSHEKVE 356 D + MGE FK + +++E Sbjct: 400 --DPEQMGERFKFFGLYPKELE 419 >gi|148706528|gb|EDL38475.1| RIKEN cDNA 2410091C18, isoform CRA_b [Mus musculus] Length = 445 Score = 259 bits (662), Expect = 5e-67, Method: Composition-based stats. Identities = 124/378 (32%), Positives = 184/378 (48%), Gaps = 33/378 (8%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P GYY + G GDF+T+PEISQIFGE+ Sbjct: 45 TPMLRHLMYKIKSTGPITVAEYMKEVLTNPAKGYYVHQDMLGEKGDFITSPEISQIFGEL 104 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFF-SVLSIYMVETSER 121 L ++ + W G +LVELGPGRG + DILRV +L +SI++VE S++ Sbjct: 105 LGVWFVSEWIASGKSPAFQLVELGPGRGTLTADILRVFSQLGSVLKTCAISIHLVEVSQK 164 Query: 122 LTLIQ--------------------KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDS 161 L+ IQ K + G I+WY L DVP G++ +A+EFFD Sbjct: 165 LSEIQALTLAEEKVPLERDAESLVYMKGVTKSGIPISWYRDLKDVPEGYSLYLAHEFFDV 224 Query: 162 LPIKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRD 219 LP+ +F T G RE +D+D D L F + + D E P Sbjct: 225 LPVHKFQKTPRGWREVFVDVDPQASDKLRFVLAPCATPAEAFIQRD-ERREHVEVCPDAG 283 Query: 220 REMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 +Q +S R+A GG A++ DYG+ ++ DTL+ GH L+ PG ADL++ VDF Sbjct: 284 VIIQELSQRIASTGGAALIADYGHDGTKT-DTLRGFYGHQLHDVLIAPGTADLTADVDFS 342 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMK---QTARKDILLDSVKRLVSTSA 336 L +A K+ G Q FL+ +GI R L+ + + K LL L+ Sbjct: 343 YLHRMA-QGKVASLGPVEQRTFLKNMGIDVRLKVLLDRAGEPSAKQQLLGGYDMLM---- 397 Query: 337 DKKSMGELFKILVVSHEK 354 + + MGE F + + Sbjct: 398 NPQKMGERFHFFALLPHQ 415 >gi|147904192|ref|NP_001085543.1| protein midA homolog, mitochondrial precursor [Xenopus laevis] gi|82184559|sp|Q6GQ37|MIDA_XENLA RecName: Full=Protein midA homolog, mitochondrial; Flags: Precursor gi|49118763|gb|AAH72911.1| MGC80371 protein [Xenopus laevis] Length = 437 Score = 259 bits (662), Expect = 5e-67, Method: Composition-based stats. Identities = 120/390 (30%), Positives = 186/390 (47%), Gaps = 39/390 (10%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 N L+ ++ IK G +TV +Y + +P GYY + G GDFVT+PEISQIFGE+ Sbjct: 44 NALLNHLIFKIKSTGPITVSEYMREVLTNPVKGYYMHNDMLGEHGDFVTSPEISQIFGEL 103 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV-LSIYMVETSER 121 L ++ I W G P ++LVELGPGRG + D+LRV S +S+++VE S + Sbjct: 104 LGVWCISEWVSAGKPKAIQLVELGPGRGTLTDDLLRVFSNFGRLLDSCDISVHLVEVSPK 163 Query: 122 LTLIQKKQLASYGD--------------------KINWYTSLADVPLGFTFLVANEFFDS 161 L+ IQ ++L + WY + DVP G++F +A+EFFD+ Sbjct: 164 LSDIQAQRLTGKSIEVELDSNSPVYKNGITKTGRPVCWYQDIQDVPNGYSFYIAHEFFDA 223 Query: 162 LPIKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRD 219 LPI + + G RE +IDID D L F +G + D E P Sbjct: 224 LPIHKLQKIKDGWREMLIDIDPKLPDKLRFVLGSNMSLVAKTFVQDDEPRDHVEVCPSAA 283 Query: 220 REMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 +Q ++ ++ GG A++ DYG++ + DT + + H L +PG ADL++ VDF Sbjct: 284 VIIQKLAQQINSYGGAALIADYGHMGEKT-DTFRGFRAHQLHDVLTDPGTADLTADVDFN 342 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVST---SA 336 + + G TQ FL+ +GI R L++++ + K+L+ Sbjct: 343 FMRRMVGEA-ASCLGPVTQHVFLKNMGIDIRLKVLLEKSNDVTVQ----KQLIHGYNVLM 397 Query: 337 DKKSMGELFKILVVSHE-------KVELMP 359 + MG+ FK V K ++ P Sbjct: 398 NPDQMGQRFKFFSVVPHSRLKNTLKTKMPP 427 >gi|198428937|ref|XP_002122246.1| PREDICTED: similar to CG17726 CG17726-PA isoform 1 [Ciona intestinalis] gi|198428939|ref|XP_002122317.1| PREDICTED: similar to CG17726 CG17726-PA isoform 2 [Ciona intestinalis] Length = 418 Score = 258 bits (660), Expect = 7e-67, Method: Composition-based stats. Identities = 120/386 (31%), Positives = 193/386 (50%), Gaps = 34/386 (8%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 N ++ + IK G ++V ++ + +P+ GYY + G GDFVT+PE++QIFGE+ Sbjct: 30 NPVLEYFHSKIKATGPISVAEFMKETLTNPQSGYYMNRDMLGNDGDFVTSPELNQIFGEI 89 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV-LSIYMVETSER 121 +AI+ I W G P+ +++VELGPGRG + DILR +L L VE S Sbjct: 90 IAIWFINEWNALGSPAELQIVELGPGRGTLAEDILRTFHQLGHVLKDTKLWYSFVEVSPT 149 Query: 122 LTLIQKKQL--------------------ASYGDKINWYTSLADVPLG-FTFLVANEFFD 160 L+ IQ ++L +++G + WY SL DVP G T VA+EFFD Sbjct: 150 LSKIQHERLLDSTSSKTSNGEEKWYLSGKSTHGVNLQWYKSLQDVPNGKVTIFVAHEFFD 209 Query: 161 SLPIKQFVMTEHGIRERMIDIDQHD---SLVFNIGDHEIKSNFLTCSDYFLGAIFENSPC 217 +LP+ +FV ++ G +E +DI D L + + ++ E SP Sbjct: 210 ALPVHKFVNSDKGWQEVYVDICPDDAAMKLRYVVLPKPTIASRTLIKKDENRNQIEVSPQ 269 Query: 218 RDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVD 277 +Q ++ R+ D G A+++DYG+ ++ DTL+A K H G+ADL++ VD Sbjct: 270 SGIIVQEMAQRIVADKGAALIVDYGHYGTK-QDTLRAFKSHQLCEVFSTVGEADLTADVD 328 Query: 278 FQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLV---ST 334 F+ L Y + G Q FL +GI R L++ T D+ ++L+ Sbjct: 329 FKYLKQSIEDYNVTTMGPIPQHVFLRNMGIDTRLMMLLRSTTDDDV----RRKLIGSYEM 384 Query: 335 SADKKSMGELFKIL-VVSHEKVELMP 359 + K MGE F+ ++S +++E +P Sbjct: 385 IMNPKQMGERFQFFSLLSKQRLEEVP 410 >gi|301758064|ref|XP_002914877.1| PREDICTED: protein midA homolog, mitochondrial-like [Ailuropoda melanoleuca] gi|281341951|gb|EFB17535.1| hypothetical protein PANDA_002819 [Ailuropoda melanoleuca] Length = 440 Score = 258 bits (658), Expect = 1e-66, Method: Composition-based stats. Identities = 128/377 (33%), Positives = 186/377 (49%), Gaps = 32/377 (8%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P GYY + G GDF+T+PEISQIFGE+ Sbjct: 41 TPMLRHLIYKIKATGPITVAEYMKEVLTNPAKGYYVYRDMLGEKGDFITSPEISQIFGEL 100 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV-LSIYMVETSER 121 L I+ I W G + +LVELGPGRG + DILRV +L + +SI++VE SE+ Sbjct: 101 LGIWFISEWMAAGKNAAFQLVELGPGRGTLAGDILRVFSQLGSVLKNCDISIHLVEVSEK 160 Query: 122 LTLIQ--------------------KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDS 161 L+ IQ K + G I+WY L DVP G++F +A+EFFD Sbjct: 161 LSEIQALTLTEEKVPVERNAGSPVYMKGVTKSGIPISWYRDLHDVPKGYSFYLAHEFFDV 220 Query: 162 LPIKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRD 219 LP+ +F T G RE +IDID D L F + + D + E P Sbjct: 221 LPVHKFQKTPQGWREVVIDIDPQVSDKLRFVLAPCVTPAEVFIQRDEMRDHV-EVCPEAG 279 Query: 220 REMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 +Q +S R+A GG A++ DYG+ ++ DT + GH L PG ADL++ VDF Sbjct: 280 VIVQELSQRIAIAGGAALIADYGHDGTKT-DTFRGFCGHKLHDVLTAPGTADLTADVDFS 338 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR--KDILLDSVKRLVSTSAD 337 L +A + G Q FL+ +GI R L+ ++ + LL L+ + Sbjct: 339 YLRRMAEGQ-VASLGPIKQQTFLKNMGIDVRLKVLLAKSDEPARQQLLQGYDMLM----N 393 Query: 338 KKSMGELFKILVVSHEK 354 K MGE F + + Sbjct: 394 PKKMGERFNFFALLPHQ 410 >gi|26371288|dbj|BAB27158.2| unnamed protein product [Mus musculus] Length = 436 Score = 258 bits (658), Expect = 1e-66, Method: Composition-based stats. Identities = 122/378 (32%), Positives = 183/378 (48%), Gaps = 33/378 (8%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P GYY + G GDF+T+P+ISQIFGE+ Sbjct: 36 TPMLRHLMYKIKSTGPITVAEYMKEVLTNPAKGYYVHQDMLGEKGDFITSPDISQIFGEL 95 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFF-SVLSIYMVETSER 121 L ++ + W G +LVELGPGRG + DILRV +L +SI++VE S++ Sbjct: 96 LGVWFVSEWIASGKSPAFQLVELGPGRGTLTADILRVFSQLGSVLKTCAISIHLVEVSQK 155 Query: 122 LTLIQ--------------------KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDS 161 L+ IQ K + G ++WY L DVP G++ +A+EFFD Sbjct: 156 LSEIQALTLAEEKVPLERDAESLVYMKGVTKSGIPVSWYRDLKDVPEGYSLYLAHEFFDV 215 Query: 162 LPIKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRD 219 LP+ +F T G RE +D+D D L F + + D E P Sbjct: 216 LPVHKFQKTPRGWREVFVDVDPQASDKLRFVLAPCATPAEAFIQRD-ERREHVEVCPDAG 274 Query: 220 REMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 +Q +S R+A GG A++ DYG+ ++ DTL+ GH L+ PG ADL++ VDF Sbjct: 275 VIIQELSQRIASTGGAALIADYGHDGTKT-DTLRGFYGHQLHDVLIAPGTADLTADVDFS 333 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR---KDILLDSVKRLVSTSA 336 L +A K+ G Q FL+ +GI R L+ + K LL L+ Sbjct: 334 YLRRMA-QGKVASLGPVEQRTFLKNMGIDVRLKVLLDKAGEPSAKQQLLGGYDMLM---- 388 Query: 337 DKKSMGELFKILVVSHEK 354 + + MGE F + + Sbjct: 389 NPQKMGERFHFFALLPHQ 406 >gi|73666939|ref|YP_302955.1| hypothetical protein Ecaj_0313 [Ehrlichia canis str. Jake] gi|72394080|gb|AAZ68357.1| protein of unknown function DUF185 [Ehrlichia canis str. Jake] Length = 335 Score = 258 bits (658), Expect = 1e-66, Method: Composition-based stats. Identities = 111/352 (31%), Positives = 194/352 (55%), Gaps = 21/352 (5%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 M + L I G ++++Q+ + + D GYY T PFGA GDF+TAPEISQ+FG Sbjct: 1 MYSYLKEVI---FSSGGAISIEQFMQVALYDVHHGYYMTQMPFGAHGDFITAPEISQLFG 57 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 E++A++++ +W++ G PS +VELGPGRG ++ D++R++ + + ++ +S+Y+VE S Sbjct: 58 EIIALWVLLSWQKIGAPSKFVVVELGPGRGTLINDVIRILKRFE-QCYAAMSVYLVEISP 116 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 L +Q+ L + +K+ W +++DVP ++ANEFFD+LPI+QF ++ E + Sbjct: 117 VLENVQRDILKN--EKVFWCRNVSDVPDCPILIIANEFFDALPIRQFTYFDNTWYETYVT 174 Query: 181 IDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 ++ + F I + F SD + E ++ I +++ + G A++ID Sbjct: 175 LENDE---FKIIYKSVDERFEVSSDIEKP-VVETCNEAISIVKYIENKILQNSGAAVIID 230 Query: 241 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 YGY+ T+Q+VK H Y L N G++D++SHV+F L + + Q Sbjct: 231 YGYVNCPYKSTIQSVKCHKYNDLLKNIGKSDITSHVNFAVLRDSLSTLD---SVIMNQRD 287 Query: 301 FLEGLGIWQRAFSLMKQTA--RKDILLDSVKRLVSTSADKKSMGELFKILVV 350 FL GI +R L++ ++ L+ RL ++MG +FK+L++ Sbjct: 288 FLYSFGIKERLRILIENATEVQRQNLITGFLRLT------ENMGSMFKVLLI 333 >gi|68171253|ref|ZP_00544656.1| Protein of unknown function DUF185 [Ehrlichia chaffeensis str. Sapulpa] gi|67999335|gb|EAM85981.1| Protein of unknown function DUF185 [Ehrlichia chaffeensis str. Sapulpa] Length = 335 Score = 258 bits (658), Expect = 1e-66, Method: Composition-based stats. Identities = 113/354 (31%), Positives = 194/354 (54%), Gaps = 21/354 (5%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 M + L + I + G ++V+Q+ + + D +GYY T PFG GDF+TAPEISQ+FG Sbjct: 1 MHSYLKKIIFDC---GGAISVEQFMRIALYDVHYGYYMTQMPFGTYGDFITAPEISQLFG 57 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 E++A++++ W++ G PS +VELGPGRG ++ D++RV+ K + ++ + +Y+VE S Sbjct: 58 EVIALWILLNWQKMGSPSKFIIVELGPGRGTLISDVVRVLRKFE-QCYAAMVVYLVEISP 116 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 L +Q+ L +K+ W + D+P ++ANEFFD+LP+KQFV T E + Sbjct: 117 ILEKLQRDVLKD--EKVFWCKDIKDLPDYPVLIIANEFFDALPVKQFVYTNDSWCETYVT 174 Query: 181 IDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 ++ + F I ++ F +D + E ++ + +++ GG A+VID Sbjct: 175 VENDE---FKIAYKKVNKIFEMSNDMK-NPVIEICDEAVSIVKCMENKILQSGGAAVVID 230 Query: 241 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 YGY+ T+Q+VK H Y + L N G++D++++V+F L + + + TQ Sbjct: 231 YGYIDCPYKSTIQSVKNHQYNNLLKNVGESDITAYVNFAVLHNSLSTL---SSVIMTQRD 287 Query: 301 FLEGLGIWQRAFSLMKQTA--RKDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 FL GI +R L+ +K L+ RL ++MG +FK+ +V+ Sbjct: 288 FLYNFGIKERLQVLIANATELQKQNLIAGFLRLT------ENMGSMFKVFLVNP 335 >gi|328867001|gb|EGG15384.1| DUF185 family protein [Dictyostelium fasciculatum] Length = 490 Score = 258 bits (658), Expect = 1e-66, Method: Composition-based stats. Identities = 116/383 (30%), Positives = 176/383 (45%), Gaps = 28/383 (7%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 + + + ++ + G VD C+ +P++GYY + FG GDF+TAPEISQ+FGEM Sbjct: 97 TEFEKYLQSIAQVRGPFPVDTLMKECLTNPKYGYYMNRDVFGRGGDFITAPEISQLFGEM 156 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L I+ + WE G PS +++VE GPGRG +M DILR K DF+ + ++MVE S L Sbjct: 157 LGIWCVATWESMGKPSKLQIVECGPGRGTLMHDILRSTKVFK-DFYQSIEVHMVEVSTHL 215 Query: 123 TLIQKKQLASYGD--------------KINWYTSLADVPLGFTFLVANEFFDSLPIKQFV 168 +QK +L Y D I W+ S+ VP G T + EF D+LPI F Sbjct: 216 KSMQKTRLLYYRDDKPEASQGKSPEGINITWHQSIDTVPNGPTLYIGQEFLDALPINVFQ 275 Query: 169 MTEHGIRERMIDIDQHDSL-----VFNIGDHEIKSNFLTCSDYF----LGAIFENSPCRD 219 T+ ++ + F + + G E Sbjct: 276 FTKAKGWCEVMVDEDISKDGPHHLRFVLSNGPTAMTKAVQYLLPEFGVEGYTVELGVAGL 335 Query: 220 REMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 Q I+ R+A G A+ IDYG + ++LQA++ H +V L PG ADLS+ VDF Sbjct: 336 GISQKIALRIAEHSGAALFIDYG-KDKILNNSLQAIRNHKFVDILDKPGSADLSTWVDFS 394 Query: 280 RLSSIAILYKLYI--NGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSAD 337 + K + G QG FL+ +G R L+ + K + + K Sbjct: 395 AIRKCIKHLKKDVTSVGPVDQGIFLKEMGAEHRLQRLVGKIDEKSKIEELAKS-YHKLVS 453 Query: 338 KKSMGELFKILVVSHEKVELMPF 360 MG +K++ + +K+E + F Sbjct: 454 PDEMGTTYKVITILDKKLEPVGF 476 >gi|148554151|ref|YP_001261733.1| hypothetical protein Swit_1230 [Sphingomonas wittichii RW1] gi|148499341|gb|ABQ67595.1| protein of unknown function DUF185 [Sphingomonas wittichii RW1] Length = 351 Score = 257 bits (657), Expect = 2e-66, Method: Composition-based stats. Identities = 117/365 (32%), Positives = 173/365 (47%), Gaps = 22/365 (6%) Query: 1 MEN--KLIRKIVNLIKKNGQMTVDQYFALCVADPEFG-YYSTCNPFGAVGDFVTAPEISQ 57 ME +L +V +I+ NG + V Y G YY+ +PFG GDF+T+PEISQ Sbjct: 1 METAAELADALVRVIQANGPIPVADYMEAA-----NGLYYAAHDPFGVKGDFITSPEISQ 55 Query: 58 IFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVE 117 +FGE++ I++ W + VELGPGRG + D LR + L+ ++ VE Sbjct: 56 MFGELIGIWIADLWTRSRALGAY-YVELGPGRGTLAADALRAMGALRRHP----EVHFVE 110 Query: 118 TSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 TS L +QK+++ +L +VANEFFD+LP +Q++ T G RER Sbjct: 111 TSPVLRRLQKERVPDAVWH-EGIETLP--TDAPLIIVANEFFDALPYRQYIKTYSGWRER 167 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAI 237 ++ D D G+I+E SP ++++ R+ GG + Sbjct: 168 VVTHDADGFRPVPGDAPAEDVVPEHLHDAVAGSIYETSPAGLAVARALAARIVKQGGALL 227 Query: 238 VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTT 297 IDYG+ GDTLQA+ H Y NPG D+++HVDF L L I G + Sbjct: 228 AIDYGHENYAAGDTLQALNAHAYADVFSNPGANDITAHVDFTALGEAGRLGGARIEGPVS 287 Query: 298 QGKFLEGLGIWQRAFSLMK-QTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE 356 Q FL LGI RA +L + R + + + RL S ++ MG LF+ + + K Sbjct: 288 QSYFLATLGIAARAAALSRHHPDRTEEIGAAYHRLTS----EEEMGTLFRAIAMVSPKWP 343 Query: 357 LM-PF 360 F Sbjct: 344 KPAGF 348 >gi|156542526|ref|XP_001600721.1| PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis] Length = 440 Score = 257 bits (657), Expect = 2e-66, Method: Composition-based stats. Identities = 119/390 (30%), Positives = 187/390 (47%), Gaps = 36/390 (9%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGE 61 + L + + IK +G ++V Y + P GYY+T + FG GDF+T+PE+SQ+FGE Sbjct: 49 NSDLSKDLYTRIKLSGPISVANYMKTVLTHPTKGYYTTKDVFGQKGDFITSPEVSQLFGE 108 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 M+ +++I + + ++VELGPGRG + DILRV +L + +I VE S Sbjct: 109 MIGLWIITECNKIHY-KSFQIVELGPGRGTLTHDILRVFRQLGW-TDRISAINYVEVSPV 166 Query: 122 LTLIQKKQLASYGD---------------------KINWYTSLADVPLGFTFLVANEFFD 160 L IQK+ L S + +I WY S+AD+P GFT +A EFFD Sbjct: 167 LAKIQKENLCSTVNSEEITAPSNKSYQFGKTKDKIEIYWYKSIADLPEGFTVFIAQEFFD 226 Query: 161 SLPIKQFVMTEHGIRERMIDIDQHDS--LVFNIGDHEIKSNFLTCSDYFLGAIFENSPCR 218 +LPI +F T+ G E ++D+D + F + + E SP Sbjct: 227 ALPIHKFQKTKDGWFEVLVDVDPNSEKVPKFRYVLAKTDACDSILDKSDKREHVEISPEA 286 Query: 219 DREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDF 278 M+ IS + +GG ++ +DYG+ + DT +A + H PL +PG ADL++ VDF Sbjct: 287 MNIMRYISSAITQNGGFSLFVDYGHNGEKT-DTFRAFRDHKQCDPLKDPGTADLTADVDF 345 Query: 279 QRLSSIAIL-YKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSAD 337 L +A KL G Q +FL+ GI R +L K ++ ++ + Sbjct: 346 ALLKKVAKESNKLLCYGPLAQREFLKDTGIDIRLMNLCKNATEEEK--QQLQSGYDKIMN 403 Query: 338 KKSMGELFKILVVSH-------EKVELMPF 360 MG FK++ + +K+ + F Sbjct: 404 PNEMGTCFKVVSMFPYVLKDYLKKLPVNGF 433 >gi|66811954|ref|XP_640156.1| DUF185 family protein [Dictyostelium discoideum AX4] gi|74854952|sp|Q54S83|MIDA_DICDI RecName: Full=Protein midA, mitochondrial; AltName: Full=Mitochondrial dysfunction gene A; Flags: Precursor gi|60468157|gb|EAL66167.1| DUF185 family protein [Dictyostelium discoideum AX4] Length = 484 Score = 257 bits (656), Expect = 2e-66, Method: Composition-based stats. Identities = 118/391 (30%), Positives = 188/391 (48%), Gaps = 39/391 (9%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 + + ++ K G M++D + + +P++GYY + FG GDF+TAPE+SQ+FGEM Sbjct: 87 TDFEKYLQDITKVRGPMSIDTFIKEVLTNPKYGYYMNKDVFGKGGDFITAPEVSQLFGEM 146 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + I+ + WE G P +++VE+GPGRG +M DILR K +F+ +S+++VE S Sbjct: 147 IGIWCVATWEAMGKPKKLQIVEMGPGRGTLMKDILRSTKVFK-EFYDSISVHLVEASPAN 205 Query: 123 TLIQKKQLASYGDKI--NWYTSLADVPLG----------------FTFLVANEFFDSLPI 164 QK+ L + DK + ++ + P G T +A EFFD+LPI Sbjct: 206 KKTQKQNLLYFKDKAINFDHKTIGETPNGIKVTWVGKLEEVPTDIPTLFLAQEFFDALPI 265 Query: 165 KQFV--MTEHGIRERMI--DIDQHDSLVFNIGDHEIKSNFLTCSDYFL------GAIFEN 214 F ++ E ++ DI +H + + T + L G E Sbjct: 266 HVFRFSREKNDWCEVLVDEDITEHGEYYLRFVQSKGPTLMTTAVKHLLPEFGLDGYQVEL 325 Query: 215 SPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSS 274 Q I++R+ GG A++IDYGY V +LQA++ H +V L PG ADLS Sbjct: 326 GLAGLAISQQIANRIDKSGGAALIIDYGY-DKIVKSSLQAIRDHEFVDILDKPGTADLSV 384 Query: 275 HVDFQRLSSIAI--LYKLYINGLTTQGKFLEGLGIWQRAFSL---MKQTARKDILLDSVK 329 VDFQ + K G QG FL+ +GI R + + + + L+ K Sbjct: 385 WVDFQTIRKTVKLLKNKSTAIGPVDQGIFLKEMGIEHRLAQIGRKLDSNEKFEELVMGYK 444 Query: 330 RLVSTSADKKSMGELFKILVVSHEKVELMPF 360 +LV D K MG +K++ + + + + F Sbjct: 445 KLV----DPKEMGTNYKVITICDKNITPIGF 471 >gi|221641076|ref|YP_002527338.1| hypothetical protein RSKD131_2977 [Rhodobacter sphaeroides KD131] gi|221161857|gb|ACM02837.1| Hypothetical Protein RSKD131_2977 [Rhodobacter sphaeroides KD131] Length = 353 Score = 257 bits (656), Expect = 2e-66, Method: Composition-based stats. Identities = 143/359 (39%), Positives = 187/359 (52%), Gaps = 10/359 (2%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L + I G +TV Y A C+ PE GYYST PFGA GDF TAPEISQ+FGE+ Sbjct: 2 TALAVLLARRIGATGPVTVADYMAECLLHPEHGYYSTREPFGAAGDFTTAPEISQMFGEL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L + L AW G PS V L ELGPGRG +M D+LR + P F +++VE S RL Sbjct: 62 LGLCLAQAWLDQGQPSPVTLAELGPGRGTLMADLLRATRGV-PGFHDAARVHLVEASPRL 120 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 +Q++ L + W AD+P G FLVANEFFD+LPI+QFV G RERM+ + Sbjct: 121 RALQRETLGGH--PAAWLDRAADLPEGPLFLVANEFFDALPIRQFVRGPEGWRERMVGLT 178 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 + + + + D G + E P M I+ R+A GG A+ +DYG Sbjct: 179 EGRLTWGLGPETSLAALAYRLEDTAPGDVVELCPAAGPIMAEIARRIATAGGLALAVDYG 238 Query: 243 YLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 +SR GDTLQA++ H + PL PG+ADL++HVDF+ L+ A L QG L Sbjct: 239 GWRSR-GDTLQALRAHRFDDPLAAPGEADLTAHVDFEALAQAAAPCG---TALVPQGALL 294 Query: 303 EGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM-PF 360 LG+ QRA L + + L S + D MG LFK L V + F Sbjct: 295 LRLGLAQRAARLARSLTGEA--LASHEAASRRLTDATEMGTLFKALAVFPPQGPAPAGF 351 >gi|328789699|ref|XP_623890.2| PREDICTED: protein midA homolog, mitochondrial-like [Apis mellifera] Length = 389 Score = 256 bits (655), Expect = 3e-66, Method: Composition-based stats. Identities = 121/393 (30%), Positives = 188/393 (47%), Gaps = 44/393 (11%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLA 64 L + + I G +TV Y + P GYY + FG GDF+T+PEISQ+FGEMLA Sbjct: 3 LYHHLYSKILACGPITVADYMKEVLTHPIIGYYMNKDVFGKQGDFITSPEISQLFGEMLA 62 Query: 65 IFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTL 124 +++ W++ ++VELGPGRG ++ DILRV + + +SI++VE S L+ Sbjct: 63 VWMKYEWQKI-SKDSFQIVELGPGRGTLIKDILRVFKQF--KSLNDISIHLVEVSPILSQ 119 Query: 125 IQKKQLASYG--------------------------DKINWYTSLADVPLGFTFLVANEF 158 IQ K L K+ WY S+ DVP F+ +A+EF Sbjct: 120 IQAKNLCKTIIEYDQKKNKSKNNSTSYYKEGITEDGIKLYWYHSIKDVPKKFSIFLAHEF 179 Query: 159 FDSLPIKQFVMTEHGIRERMIDI---DQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENS 215 FD+LPI +F ++ RE +IDI + + + + + +D I E S Sbjct: 180 FDALPIHKFQKIDNEWREVLIDIIQGCNEEKFRYVLSNTPTPATLFISNDEKREHI-EIS 238 Query: 216 PCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSH 275 P + ++D L GG A++ DYG+ + DT + H PL++PG ADL++ Sbjct: 239 PESLIIVDYLADFLWECGGFALICDYGHNGDKT-DTFRGFSQHKVHDPLLHPGTADLTAD 297 Query: 276 VDFQRLSSIAILYK-LYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVST 334 VDF + IA L G +Q FL+ LGI R L++ ++++ S++ Sbjct: 298 VDFAAIKKIAEKDNRLITFGPVSQSNFLQNLGINVRLQILLQNASKEE--RKSLESGYHM 355 Query: 335 SADKKSMGELFKILVVSH-------EKVELMPF 360 DK MG FK+L + +K+ + F Sbjct: 356 IMDKDKMGIRFKVLSLFPSILKEYFKKIPIAGF 388 >gi|114576978|ref|XP_515411.2| PREDICTED: hypothetical protein isoform 6 [Pan troglodytes] Length = 399 Score = 256 bits (655), Expect = 3e-66, Method: Composition-based stats. Identities = 125/376 (33%), Positives = 186/376 (49%), Gaps = 33/376 (8%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLA 64 ++R ++ IK G +TV +Y + +P GYY + G GDF+T+PEISQIFGE+L Sbjct: 1 MLRHLMYKIKSTGPITVAEYMKEVLTNPAKGYYVYRDMLGEKGDFITSPEISQIFGELLG 60 Query: 65 IFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV-LSIYMVETSERLT 123 I+ I W G + +LVELGPGRG ++ DILRV +L + +S+++VE S++L+ Sbjct: 61 IWFISEWMATGKSTAFQLVELGPGRGTLVGDILRVFTQLGSVLKNCDISVHLVEVSQKLS 120 Query: 124 LIQ--------------------KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLP 163 IQ K + G I+WY L DVP G++F +A+EFFD LP Sbjct: 121 EIQALTLTEEKVPLERNAGSPVYMKGVTKSGIPISWYRDLHDVPKGYSFYLAHEFFDVLP 180 Query: 164 IKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDRE 221 + +F T G RE +DID D L F + + D E P Sbjct: 181 VHKFQKTPQGWREVFVDIDPQVSDKLRFVLAPSATPAEAFIQHD-ETRDHVEVCPDAGVI 239 Query: 222 MQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRL 281 ++ +S R+A GG A+V DYG+ ++ DT + H L+ PG ADL++ VDF L Sbjct: 240 IEELSQRIALTGGAALVADYGHDGTKT-DTFRGFCDHKLHDVLIAPGTADLTADVDFSYL 298 Query: 282 SSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR---KDILLDSVKRLVSTSADK 338 +A K+ G Q FL+ +GI R L+ ++ + LL L+ + Sbjct: 299 RRMA-QGKVASLGPIKQHTFLKNMGIDVRLKVLLDKSNEPSVRQQLLQGYDMLM----NP 353 Query: 339 KSMGELFKILVVSHEK 354 K MGE F + + Sbjct: 354 KKMGERFNFFALLPHQ 369 >gi|331249458|ref|XP_003337346.1| hypothetical protein PGTG_19045 [Puccinia graminis f. sp. tritici CRL 75-36-700-3] gi|309316336|gb|EFP92927.1| hypothetical protein PGTG_19045 [Puccinia graminis f. sp. tritici CRL 75-36-700-3] Length = 438 Score = 256 bits (655), Expect = 3e-66, Method: Composition-based stats. Identities = 114/379 (30%), Positives = 180/379 (47%), Gaps = 31/379 (8%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYST------CNPFGAVGDFVTAPEI 55 L++ I I +G ++V + LC+ P GYYS +PFG GDF+T+PEI Sbjct: 54 STSLLKIINQQILASGPISVPVWMKLCLHHPTLGYYSRTDRSNQADPFGKQGDFITSPEI 113 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 SQ+FGE++AI+ I W+ G P R++ELGPGRG +M DI+R +K SI+ Sbjct: 114 SQVFGELIAIWFISRWQAAGCPRRTRIIELGPGRGTLMADIIRTFKSIKAFDDVDFSIHF 173 Query: 116 VETSERLTLIQKKQLASYG-------DKINWYTSLADVPLGFTFLVANEFFDSLPIKQFV 168 +E S + +Q ++L+++ + + +T ++A+EFFD+LP+ F Sbjct: 174 IENSPFMRALQDQKLSTFDGLKKENVSWFDRIDQVGKENDQWTMVIAHEFFDALPVHIFQ 233 Query: 169 MTEHGIRERMIDIDQ------HDSLVFNIGDHEIKSNFLTCSD----YFLGAIFENSPCR 218 T G RE MIDI+ SL F + ++ + S+ + A E SP Sbjct: 234 KTPRGFREVMIDINNADMSPTEKSLRFALSPGPTLASQMLISEEHQKLPVDAKLEVSPSA 293 Query: 219 DREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDF 278 ++ IS L D G +I + +L+ H V PL PG D++++VDF Sbjct: 294 NQIAGQISQLLNSDAGGTGLIIDYGAEHHFSHSLRGFYQHQIVDPLSRPGLTDITANVDF 353 Query: 279 QRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADK 338 L + G TQ +FL +GI R L + ++ + S +RL+S Sbjct: 354 ASLKRSM-SPNVLTYGPITQRQFLLSMGIEVRTKRLNQSSSLTED---SSQRLIS----P 405 Query: 339 KSMGELFKILVVSHEKVEL 357 MG+ +K L H +L Sbjct: 406 FGMGDQYKFLGFEHPPSQL 424 >gi|332187872|ref|ZP_08389605.1| hypothetical protein SUS17_2997 [Sphingomonas sp. S17] gi|332012033|gb|EGI54105.1| hypothetical protein SUS17_2997 [Sphingomonas sp. S17] Length = 360 Score = 256 bits (653), Expect = 4e-66, Method: Composition-based stats. Identities = 123/359 (34%), Positives = 174/359 (48%), Gaps = 21/359 (5%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGE 61 E+ L ++ I G + V Q+ A A YY T +P GA GDF T+PEISQ+FGE Sbjct: 16 EDALPERLARAIALAGPIPVAQFMAAANAH----YYGTRDPLGAGGDFTTSPEISQMFGE 71 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 ++ ++ W++ G P V VELGPGRG + D R + K + + VETS Sbjct: 72 LVGLWCADLWDRAGRPE-VHWVELGPGRGTLAADARRAMAKAG----LTPTTHFVETSAT 126 Query: 122 LTLIQKKQLASYGDKINWYTSLAD-VPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 L Q +++ W+ S+ +VANEFFD+LPI+Q V G ER++ Sbjct: 127 LRSAQGERVPD----AEWHDSVDTLPTDRPLIVVANEFFDALPIRQLVRRGDGWHERLVA 182 Query: 181 IDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 L + + G++ E SP M+ ++ R+A GG A++ID Sbjct: 183 AQDLLFLPIAGPPVPSEIIPEPLREAEAGSVIEVSPASVAVMRQLAARIAAQGGAALIID 242 Query: 241 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 YGY + DTLQAV+GH + +P PG+ DLS+HVDF L++ A L G TQ Sbjct: 243 YGYEGPAIADTLQAVRGHAFANPFDRPGEQDLSAHVDFTTLAAAAQGSGLAAFGPVTQRD 302 Query: 301 FLEGLGIWQRAFSLMK-QTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM 358 L LGI QR SL + R D LL RL+ MG LF+ L ++ Sbjct: 303 LLGALGIDQRTASLARAHPDRADALLADRNRLMQD------MGTLFRALAITRPDWPAP 355 >gi|332560047|ref|ZP_08414369.1| hypothetical protein RSWS8N_13340 [Rhodobacter sphaeroides WS8N] gi|332277759|gb|EGJ23074.1| hypothetical protein RSWS8N_13340 [Rhodobacter sphaeroides WS8N] Length = 353 Score = 256 bits (653), Expect = 5e-66, Method: Composition-based stats. Identities = 143/359 (39%), Positives = 187/359 (52%), Gaps = 10/359 (2%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L + I G +TV Y A C+ PE GYYST PFGA GDF TAPEISQ+FGE+ Sbjct: 2 TALAVLLARRIGAAGPVTVADYMAECLLHPEHGYYSTREPFGAAGDFTTAPEISQMFGEL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L + L AW G PS V L ELGPGRG +M D+LR + P F +++VE S RL Sbjct: 62 LGLCLAQAWLDQGQPSPVTLAELGPGRGTLMADLLRATRGV-PGFHDAARVHLVEASPRL 120 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 +Q++ L + W AD+P G FLVANEFFD+LPI+QFV G RERM+ + Sbjct: 121 RALQREMLGGH--PAAWLDRAADLPEGPLFLVANEFFDALPIRQFVRGPEGWRERMVGLT 178 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 + + + + D G + E P M I+ R+A GG A+ +DYG Sbjct: 179 EGRLTWGLGPETALAALAHRLEDTAPGDVVELCPAAGPIMAEIARRIATAGGLALAVDYG 238 Query: 243 YLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 +SR GDTLQA++ H + PL PG+ADL++HVDF+ L+ A L QG L Sbjct: 239 GWRSR-GDTLQALRAHRFDDPLAAPGEADLTAHVDFEALAQAAAPCG---TALVPQGALL 294 Query: 303 EGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM-PF 360 LG+ QRA L + + L S + D MG LFK L V + F Sbjct: 295 LRLGLAQRAARLARSLTGEA--LASHEAASRRLTDATEMGTLFKALAVFSPQGPAPAGF 351 >gi|21434891|gb|AAM53573.1| Aby [Azospirillum brasilense] Length = 350 Score = 256 bits (653), Expect = 5e-66, Method: Composition-based stats. Identities = 133/342 (38%), Positives = 183/342 (53%), Gaps = 14/342 (4%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L + I +G ++V + A + P FGYY +PFG GDF TAPEISQ+FGE+ Sbjct: 8 SLAHHLARRILMDGPLSVAAFMAEALGHPRFGYYMRQDPFGVSGDFTTAPEISQMFGELA 67 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 ++ + W + G P+ V LVELGPGRG +M D LR L P F +++VETS L Sbjct: 68 GLWCVDTWARLGGPAPVHLVELGPGRGTLMQDALRAAA-LVPAFREATRVHLVETSPTLR 126 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 QK+ LA + W+ L DVP G T ++ANEFFD+LPI+Q T HG ER+IDID Sbjct: 127 ARQKETLAG--IPVAWHDRLEDVPEGPTLILANEFFDALPIRQVQKTNHGWFERLIDIDN 184 Query: 184 HDSL--------VFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGT 235 +S+ + G + D G++ E SP + I +RLA G Sbjct: 185 TESMDTPRFRFVLEAFGSAGARLIPPALRDAPEGSVVEVSPASQPVARLIGERLAAHPGA 244 Query: 236 AIVIDYGYL-QSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYING 294 A+VIDYGY VGDTLQA++ H Y L PG+ADL++HVDF +++ A G Sbjct: 245 ALVIDYGYRGGPAVGDTLQALRRHAYAPVLDAPGEADLTAHVDFAAIAAAAREGGAESFG 304 Query: 295 LTTQGKFLEGLGIWQRAFSL--MKQTARKDILLDSVKRLVST 334 QG +L LGI RA +L T + + ++ RL+ Sbjct: 305 PVDQGDWLVRLGIQPRATALKRSATTKQAADIDSALARLIHR 346 >gi|167628561|ref|YP_001679060.1| hypothetical protein HM1_0432 [Heliobacterium modesticaldum Ice1] gi|167591301|gb|ABZ83049.1| conserved hypothetical protein [Heliobacterium modesticaldum Ice1] Length = 389 Score = 255 bits (652), Expect = 6e-66, Method: Composition-based stats. Identities = 95/389 (24%), Positives = 153/389 (39%), Gaps = 39/389 (10%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEISQIF 59 M + L + I + I+ G + Y L + P GYY+ +P G GDF+TAPEIS +F Sbjct: 1 MSSPLQQAIGDRIRAEGPIPFRDYMELALYHPRHGYYTAGDPPMGRRGDFITAPEISPLF 60 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 G ++ L WE P ++E GPGRG++ +L + L ++VE S Sbjct: 61 GRVIGRQLTEMWEHLKRPDRFDIIEFGPGRGLLAKAVLEALTAGP--LADRLVYHLVEIS 118 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPLGFT------------FLVANEFFDSLPIKQF 167 L Q++ LA + + A +++NEF D+LP+ + Sbjct: 119 PTLRAHQRESLAGLPLTVYPDPAEAIPGSYGWSAAKALPCGLTGVVLSNEFLDALPVHRL 178 Query: 168 VMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCS----------DYFLGAIFENSPC 217 + + E + + F + L G +FE + Sbjct: 179 IHKDGRPWELYVAKGTAHAGPFCWHYGPLSDPCLEDWIARHITGKGVTLEEGQLFEVNLA 238 Query: 218 RDREMQSISDRLACDGGTAIVIDYG------YLQSRVGDTLQAVKGHTYV-SPLVNPGQA 270 M+++ L G + +DYG Y R TL + H PL + G+ Sbjct: 239 AADWMKAVDRLLTR--GFVLTVDYGHPVEKLYSPERYEGTLVCYRRHRADADPLEDVGEK 296 Query: 271 DLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKR 330 D+++H+DF L S+A GL +Q FL W R L+ A + Sbjct: 297 DMTAHLDFTSLQSVASELGWQNLGLISQMWFLAH---WLRPEDLLTGPAMTVEDFRRHQA 353 Query: 331 LVSTSADKKSMGELFKILVVSHEKVELMP 359 L MGE+FK+L+ S + +P Sbjct: 354 LKKVLL-PGGMGEIFKVLIQSK-GLPPLP 380 >gi|297265811|ref|XP_001108260.2| PREDICTED: protein midA homolog, mitochondrial-like isoform 2 [Macaca mulatta] Length = 442 Score = 255 bits (652), Expect = 6e-66, Method: Composition-based stats. Identities = 124/376 (32%), Positives = 184/376 (48%), Gaps = 33/376 (8%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P GYY + G GDF+T+PEISQIFGE+ Sbjct: 42 TPMLRHLMYKIKSTGPITVAEYMKEVLTNPAKGYYVYRDMLGKQGDFITSPEISQIFGEL 101 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV-LSIYMVETSER 121 L I+ I W G + +LVELGPGRG ++ DILRV +L + +S+++VE S++ Sbjct: 102 LGIWFISEWMATGKSTAFQLVELGPGRGTLVGDILRVFTQLGSVLKNCDISVHLVEVSQK 161 Query: 122 LTLIQ--------------------KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDS 161 L+ IQ K + G I+WY + DVP G++F +A+EFFD Sbjct: 162 LSEIQALTLTEEKVPLERNAGSPVYMKGVTKSGIPISWYRHVHDVPKGYSFYLAHEFFDV 221 Query: 162 LPIKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRD 219 LP+ +F T G RE IDID D L F + + D E P Sbjct: 222 LPVHKFQKTPQGWREVFIDIDPQVSDKLRFVLAPSATPAEAFIQHD-ETRDHVEVCPDAG 280 Query: 220 REMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 ++ +S R+A GG A+V DYG+ ++ + GH L+ PG ADL++ VDF Sbjct: 281 VIIEELSQRIALTGGAALVADYGHDGTKTXM-FKGFCGHKLHDVLIAPGTADLTADVDFS 339 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR---KDILLDSVKRLVSTSA 336 L +A K+ G Q FL+ +GI R L+ ++ + LL L+ Sbjct: 340 YLRRMA-QGKVASLGPIKQHTFLKNMGIDVRLKVLLDKSNEPSVRQQLLQGYDMLM---- 394 Query: 337 DKKSMGELFKILVVSH 352 + K MGE F + Sbjct: 395 NPKKMGERFNFFALLP 410 >gi|307189530|gb|EFN73907.1| UPF0511 protein C2orf56-like protein, mitochondrial [Camponotus floridanus] Length = 421 Score = 255 bits (652), Expect = 7e-66, Method: Composition-based stats. Identities = 127/400 (31%), Positives = 190/400 (47%), Gaps = 47/400 (11%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGE 61 + L R++ I G +T+ +Y + P GYY+T + G GDF T+PEISQ+FGE Sbjct: 28 TSDLYRQLYAKILACGPITLAEYMKEILTHPTVGYYTTKDTIGQRGDFTTSPEISQLFGE 87 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 ++A+++I W + ++LVELGPGRG ++ DILRV KL + +S+++VE S Sbjct: 88 IIAVWIINEWRKITKE-SIQLVELGPGRGTLISDILRVFKKL--NVLDKISVHLVEVSPV 144 Query: 122 LTLIQKKQLASYGDK--------------------------INWYTSLADVPLGFTFLVA 155 L++IQ K+L I WY S+ DVP F+ +A Sbjct: 145 LSMIQAKKLCIESKNSELKVNENQKNSVTHYREGVTKDGVKIYWYYSINDVPREFSIFIA 204 Query: 156 NEFFDSLPIKQFVMTEHGIRERMIDIDQH---DSLVFNIGDHEIKSNFLTCSDYFLGAIF 212 EFFD+LPI +F T+ G RE ++DI Q + + + + + S Sbjct: 205 QEFFDALPIHKFQKTDKGWREILVDIIQEVKQEKFRYVLSQMPTAACKVYLSPNEKREHV 264 Query: 213 ENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADL 272 E SP + +S L GG A+VIDYG+ + DT +A H PL+NPG ADL Sbjct: 265 EVSPQCSIIIDYMSQFLWECGGFALVIDYGHEGEKT-DTFRAFYQHKLHDPLLNPGTADL 323 Query: 273 SSHVDFQRLSSIAILYK-LYINGLTTQGKFLEGLGIWQRAFSLMKQTA--RKDILLDSVK 329 ++ VDF + IA L G TQ KFL+ LGI R +++ +K + Sbjct: 324 TADVDFSLMKEIAEKDNRLITFGPVTQRKFLKSLGIDLRLKMILQNATNVQKQQIESGY- 382 Query: 330 RLVSTSADKKSMGELFKILVVSH-------EKVELMPFVN 362 D+ MG F++L +K + F N Sbjct: 383 ---HMITDEDKMGNCFQVLSFFPFVLKDHLKKWPVAGFEN 419 >gi|321254782|ref|XP_003193196.1| hypothetical protein CGB_C9220C [Cryptococcus gattii WM276] gi|317459665|gb|ADV21409.1| conserved hypothetical protein [Cryptococcus gattii WM276] Length = 449 Score = 255 bits (651), Expect = 9e-66, Method: Composition-based stats. Identities = 109/388 (28%), Positives = 173/388 (44%), Gaps = 35/388 (9%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 N+L + I + IK G + +Y C++ P GYYS + FG GDF+T+PEISQIFGE+ Sbjct: 53 NELAKVIRDSIKSTGPIPASRYMQFCLSHPVHGYYSKGDVFGQKGDFITSPEISQIFGEL 112 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 +AI+ + W + P+ VR+VELGPGRG +M D+LR + S+ S+++VE SE + Sbjct: 113 VAIWFLTRWMEVDSPTRVRIVELGPGRGTLMDDVLRTLFNFPGIAASINSVHLVENSEAM 172 Query: 123 TLIQKKQLASYG-------DKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIR 175 +Q + L+ + + + FT VA+EFFD++PI F T+ G R Sbjct: 173 REVQSQTLSPRIEGKDVKLNWYTSVEEIPETKDEFTLFVAHEFFDAMPINVFEKTDMGWR 232 Query: 176 ERMIDIDQHDSLVFNIGDHE-----------------IKSNFLTCSDYFLGAIFENSPCR 218 E +ID D S + S ++ G+ E S Sbjct: 233 EVLIDRDPSYSPDLPTSSSPSGLRFTLSSSPTTLSTILPSTSPRFANLPSGSRIEVSQDS 292 Query: 219 DREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDF 278 + M + + G ++ + +A + H V +PG DL+++VDF Sbjct: 293 YKIMHRLGQVINQGLGGCGLVVDYGADKAFASSFRAFRKHEIVDVFEDPGSCDLTANVDF 352 Query: 279 QRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTA--RKDILLDSVKRLVSTSA 336 L G +Q +FL LGI R L+ R++ + KRL+ Sbjct: 353 AYLRESLTGI-ATSLGPISQAQFLLSLGIQPRLRKLLDTAPLDRREAIEKGAKRLIDVL- 410 Query: 337 DKKSMGELFKILVVSHE----KVELMPF 360 MG ++++ V K + PF Sbjct: 411 ---GMGSQYQVMGVVSGEPEMKEGIYPF 435 >gi|88657760|ref|YP_507560.1| hypothetical protein ECH_0762 [Ehrlichia chaffeensis str. Arkansas] gi|88599217|gb|ABD44686.1| conserved hypothetical protein [Ehrlichia chaffeensis str. Arkansas] Length = 335 Score = 255 bits (651), Expect = 9e-66, Method: Composition-based stats. Identities = 112/354 (31%), Positives = 194/354 (54%), Gaps = 21/354 (5%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 M + L + I + G ++V+Q+ + + D +GYY T PFG GDF+TAP+ISQ+FG Sbjct: 1 MHSYLKKIIFDC---GGAISVEQFMRIALYDVHYGYYMTQMPFGTYGDFITAPDISQLFG 57 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 E++A++++ W++ G PS +VELGPGRG ++ D++RV+ K + ++ + +Y+VE S Sbjct: 58 EVIALWILLNWQKMGSPSKFIIVELGPGRGTLISDVVRVLRKFE-QCYAAMVVYLVEISP 116 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 L +Q+ L +K+ W + D+P ++ANEFFD+LP+KQFV T E + Sbjct: 117 ILEKLQRDVLKD--EKVFWCKDIKDLPDYPVLIIANEFFDALPVKQFVYTNDSWCETYVT 174 Query: 181 IDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 ++ + F I ++ F +D + E ++ + +++ GG A+VID Sbjct: 175 VENDE---FKIAYKKVNKIFEMSNDMK-NPVIEICDEAVSIVKCMENKILQSGGAAVVID 230 Query: 241 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 YGY+ T+Q+VK H Y + L N G++D++++V+F L + + + TQ Sbjct: 231 YGYIDCPYKSTIQSVKNHQYNNLLKNVGESDITAYVNFAVLHNSLSTL---SSVIMTQRD 287 Query: 301 FLEGLGIWQRAFSLMKQTA--RKDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 FL GI +R L+ +K L+ RL ++MG +FK+ +V+ Sbjct: 288 FLYNFGIKERLQVLIANATELQKQNLIAGFLRLT------ENMGSMFKVFLVNP 335 >gi|255004169|ref|ZP_05278970.1| hypothetical protein AmarV_02142 [Anaplasma marginale str. Virginia] Length = 342 Score = 255 bits (651), Expect = 9e-66, Method: Composition-based stats. Identities = 116/340 (34%), Positives = 181/340 (53%), Gaps = 18/340 (5%) Query: 19 MTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPS 78 +T+D++ +L E GYY T PFG GDFVT+ EISQ+FGE++A++++ E G Sbjct: 16 VTMDRFMSLV-YHEEHGYYMTRVPFGRAGDFVTSAEISQLFGEVVALWILSYLESAGISE 74 Query: 79 CVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASY--GDK 136 L+ELGPGRG +M DILRV + P + ++L ++++E S L Q+ L S+ + Sbjct: 75 KFSLLELGPGRGTLMHDILRVFEQF-PRYDALLEVHLLEISPLLRNTQRATLESFSARKE 133 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 I+W+ L ++P T +VANEFFD+LP++QF+ T +E + +D I + Sbjct: 134 ISWHCKLEELPERPTIVVANEFFDALPVRQFIRTGGAWKECCV---CNDGGNLGIVAVDT 190 Query: 197 KSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVK 256 + N D G I E + + + +GG A + DYGYLQ T+Q+VK Sbjct: 191 QYNLDEYGDVPEGGIIERCEAASDVLARLEKIIVRNGGAAAIFDYGYLQPPYCSTIQSVK 250 Query: 257 GHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMK 316 H Y L N G+ D+++HVDF L A + TQ +FL GI +R L + Sbjct: 251 SHHYCDFLDNIGECDITAHVDFGLLQKHAQRLNSKV---VTQREFLYQFGIRERLACLER 307 Query: 317 QTARKD--ILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 + L + RL ++MG +FK+L+++HE+ Sbjct: 308 NATERQRRELKGAFLRLT------ENMGTMFKVLLLNHER 341 >gi|330846878|ref|XP_003295218.1| hypothetical protein DICPUDRAFT_160448 [Dictyostelium purpureum] gi|325074101|gb|EGC28255.1| hypothetical protein DICPUDRAFT_160448 [Dictyostelium purpureum] Length = 483 Score = 255 bits (650), Expect = 1e-65, Method: Composition-based stats. Identities = 131/391 (33%), Positives = 189/391 (48%), Gaps = 39/391 (9%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 + + + ++ K G M+VD + + +P+FGYY + FG GDFVTAPEIS +FGE+ Sbjct: 84 TEFEKYLQDVTKVKGPMSVDTFIREVLTNPKFGYYMNRDVFGKGGDFVTAPEISNLFGEI 143 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L I+ + WEQ G P + +VE+GPGRG +M DILR K DF+S +S+YM+E S L Sbjct: 144 LGIWCVATWEQMGRPKKLNIVEMGPGRGTLMKDILRSTKVFK-DFYSAISVYMLEASPAL 202 Query: 123 TLIQKKQLASYGDK-----------------INWYTSLADVPL-GFTFLVANEFFDSLPI 164 IQK++L + D I W + L DVP T +A EF+D+LPI Sbjct: 203 KKIQKEKLLYFKDPAINFDDKTVGKTPEGVKITWVSRLDDVPDTTPTLFLAQEFYDALPI 262 Query: 165 KQFV--MTEHGIRERMIDIDQHDSLVFNIGD--------HEIKSNFLTCSDYFLGAIFEN 214 F + E ++D D S +++ G E Sbjct: 263 HVFRFSKDLNTWCEVLVDEDITASNDYHLRFVQSRGSTAMATAVKNYLPEFGIDGYQVEL 322 Query: 215 SPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSS 274 Q IS R+ GG A++IDYGY V ++LQA++ H +V L PG ADLS Sbjct: 323 GVAGLAISQLISKRIEKSGGAALIIDYGY-DKIVKNSLQAIRNHEFVELLDKPGSADLSV 381 Query: 275 HVDFQRLSSIAILYKLYI--NGLTTQGKFLEGLGIWQRAFSLMKQTARKDI---LLDSVK 329 VDFQ L + K G QG FL+ GI R +L+ + K+ L+ K Sbjct: 382 WVDFQTLRRCVKMMKNKTTAIGPVDQGIFLKECGIEPRLMNLLDKLDSKEKMEELILGYK 441 Query: 330 RLVSTSADKKSMGELFKILVVSHEKVELMPF 360 RLV D MG +K++ + + + + F Sbjct: 442 RLV----DPAEMGTTYKVITICDKSIVPVGF 468 >gi|168033894|ref|XP_001769449.1| predicted protein [Physcomitrella patens subsp. patens] gi|162679369|gb|EDQ65818.1| predicted protein [Physcomitrella patens subsp. patens] Length = 391 Score = 255 bits (650), Expect = 1e-65, Method: Composition-based stats. Identities = 126/402 (31%), Positives = 191/402 (47%), Gaps = 58/402 (14%) Query: 5 LIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 + + + LI+ + G +TV +Y + +P G+Y + FG GDFVT+P+ISQ+FGEM+ Sbjct: 1 MAKHLKALIRFRGGPITVAEYMEEVLTNPNAGFYMNRDVFGTHGDFVTSPDISQMFGEMV 60 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 ++ +C W Q G P V ++ELGPGRG +M D+LR K K DF LS+++VE S L Sbjct: 61 GVWSMCLWHQMGQPEAVNIIELGPGRGTLMADLLRGTAKFK-DFSQTLSVHLVECSPALR 119 Query: 124 LIQKKQLASYGD---------------------------KINWYTSLADVPLG-FTFLVA 155 IQ + L + W+ L VP G T ++A Sbjct: 120 KIQHETLKCVYKGGAEEKPTADGQNSEVVDDRISQISGVPVAWHFDLDQVPRGVPTIIIA 179 Query: 156 NEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENS 215 +EF+D+LPI QF + G E+++D+ + D + + + + E Sbjct: 180 HEFYDALPIHQFQKSPRGWCEKLVDVAEDD----------WRMKWASLQEKAEIEHVEVC 229 Query: 216 PCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSH 275 P + I+ R+ DGG A+++DYG V D+LQA+K H +V L +PG ADLS++ Sbjct: 230 PQAMKVTADIAKRVGGDGGGALIVDYGDS-KIVSDSLQAIKKHEFVHVLDSPGNADLSAY 288 Query: 276 VDFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLMKQT--ARKDILLDSVKRL 331 VDF L + + G TQ +FL LGI R SL++ + + L RL Sbjct: 289 VDFAALKHVVEDAAVGAAVYGPITQSQFLGALGINFRLESLVQNATDEQAEALQLGYWRL 348 Query: 332 VSTSAD------------KKSMGELFKILVVSHEKVELM-PF 360 V MG +K LVV ++K F Sbjct: 349 VGDGPAPWLDSDDDVNRVPPGMGSRYKALVVVNDKYGAPVGF 390 >gi|296282188|ref|ZP_06860186.1| hypothetical protein CbatJ_01140 [Citromicrobium bathyomarinum JL354] Length = 331 Score = 255 bits (650), Expect = 1e-65, Method: Composition-based stats. Identities = 127/341 (37%), Positives = 184/341 (53%), Gaps = 17/341 (4%) Query: 19 MTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPS 78 M V ++ A YY+ +P G+ GDF TAPEISQ+FGE++ ++L W + G P Sbjct: 1 MPVARFMGESNAH----YYAARDPLGSAGDFTTAPEISQMFGELIGLWLADIWTRAGSPP 56 Query: 79 CVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKIN 138 VELGPGRG + D LR + + ++ VE S L +Q + + ++ Sbjct: 57 DAIYVELGPGRGTLAADALRSMARFGLQP----EVHFVEGSPALRSLQAEAVPG----VH 108 Query: 139 WYTSLADVPLG-FTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIK 197 ++ +P G LVANEFFD+LP++Q V TE G RERM+ + + D F GD + Sbjct: 109 FHDDPTSLPNGRPLLLVANEFFDALPVRQLVRTEKGWRERMVGLGEEDDFRFVAGDQPMD 168 Query: 198 SNFLTC-SDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVK 256 S +D +G I E P +Q I+ RLA GGTA++IDYG+L R G TLQA+ Sbjct: 169 SAVPADRADAEVGVIVETCPAATAILQDIAQRLAVQGGTALMIDYGHLTPRTGSTLQAIT 228 Query: 257 GHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMK 316 H V PLV PG ADL++HVDF L+ +A + G TQG FL LGI RA +L K Sbjct: 229 RHEKVDPLVMPGAADLTAHVDFAALAEVARREGARVLGSATQGAFLSALGIDARAAALAK 288 Query: 317 QTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVEL 357 ++ + ++ + + MG+LFK+L ++H + Sbjct: 289 AAPQR---AEEIETALHRLVSGQQMGDLFKVLAIAHPQWPA 326 >gi|195055476|ref|XP_001994645.1| GH14945 [Drosophila grimshawi] gi|193892408|gb|EDV91274.1| GH14945 [Drosophila grimshawi] Length = 445 Score = 255 bits (650), Expect = 1e-65, Method: Composition-based stats. Identities = 116/396 (29%), Positives = 187/396 (47%), Gaps = 47/396 (11%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L +++ + G +TV +Y + +P+ GYY + FG GDF+T+PEISQIFGE++ Sbjct: 57 NLTKQLTAKMLATGPITVAEYMREVLTNPQSGYYMHRDVFGREGDFITSPEISQIFGELV 116 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 I+L+ W++ G PS +LVELGPGRG + D+L+V+ K K + +MVE S L+ Sbjct: 117 GIWLLAEWQKLGSPSPFQLVELGPGRGTLARDVLKVLTKFKAGADFTM--HMVEISPYLS 174 Query: 124 LIQKKQLASYGDKI--------------------NWYTSLADVPLGFTFLVANEFFDSLP 163 Q ++ + + W+ L DVP GF+ ++A+EFFD+LP Sbjct: 175 QAQAQRFCYKHETVPEEAQLPHYQVGTTATGVQAFWHRHLEDVPPGFSLVLAHEFFDALP 234 Query: 164 IKQFVMTEHGIRERMIDIDQ---------HDSLVFNIGDHEIKSNFLTCSDYFLGAIFEN 214 + + + E +ID+ + S V + F + E Sbjct: 235 VHKLQLVNGQWLEVLIDVPRTQETETKNADFSYVLPKSQTPVSRLFKPVPQ-ETRSCLEY 293 Query: 215 SPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSS 274 S +R + ++DRL GG A+++DYG+L + DT +A K H PL+ PG ADL++ Sbjct: 294 SLETERHVGLLADRLERQGGIALIMDYGHLGDKT-DTFRAFKQHALHEPLLAPGTADLTA 352 Query: 275 HVDFQRLSSIAILYK-LYINGLTTQGKFLEGLGIWQRAFSLMKQ--TARKDILLDSVKRL 331 VDF+ + + + ++ G QG FL + R L+ +DI+ K L Sbjct: 353 DVDFRHIKHVVETHNQVHCCGPVQQGDFLSRMQGELRLEQLLANALPENQDIIRSGYKML 412 Query: 332 VSTSADKKSMGELFKILVVSH-------EKVELMPF 360 D MG +K L + +K + F Sbjct: 413 T----DANQMGSRYKFLAMFPGVMAEHLKKYPVAGF 444 >gi|294085763|ref|YP_003552523.1| hypothetical protein SAR116_2196 [Candidatus Puniceispirillum marinum IMCC1322] gi|292665338|gb|ADE40439.1| protein of unknown function DUF185 [Candidatus Puniceispirillum marinum IMCC1322] Length = 389 Score = 254 bits (648), Expect = 2e-65, Method: Composition-based stats. Identities = 109/370 (29%), Positives = 176/370 (47%), Gaps = 21/370 (5%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLA 64 + +V I +G +++ +Y + ++ + GYY + +PFG GDF+TAPEIS +FGEM Sbjct: 14 MTASLVAQIVADGPLSLARYIEIALSTADAGYYQSSDPFGHKGDFITAPEISGLFGEMCG 73 Query: 65 IFLICAWEQHGFP-------SCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVE 117 +FL +E P ++E GPGRG +M D+ V +L P+ + +++++E Sbjct: 74 LFLAHMFELGKAPEETESGRKKPVIIECGPGRGTLMADMRHVWGQLMPE-LAACTVHLIE 132 Query: 118 TSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 TS L +Q++ L I+W+ L+ +P + +ANEFFD+LP+ + + R R Sbjct: 133 TSPYLRTLQEQALPD--AVIHWHDDLSALPAAPLYGIANEFFDALPVAHAICRKGIWRHR 190 Query: 178 MIDIDQ---HDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGG 234 ++ + + + G + E + MQ ++ +A GG Sbjct: 191 LVTATPALGFGEGAPLTTAELDRWHLSHKAASPDGTVAEFCVMGEDIMQVLAAHIARFGG 250 Query: 235 TAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYING 294 ++IDYG GDT+QAV H V PGQAD+S VDF L++ A + G Sbjct: 251 AILIIDYGKT-DNFGDTVQAVAAHKPVDLFYQPGQADISHWVDFGALAACASEAGARLIG 309 Query: 295 LTTQGKFLEGLGIWQRAFSLMKQTAR--KDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 QG FL +G+ RA K + LL + RLVS MG FK+ ++ Sbjct: 310 PVEQGSFLTQIGLKARAEQAAKHADPEMRRALLAAYDRLVS----PAQMGSAFKVALLVP 365 Query: 353 EKV-ELMPFV 361 + FV Sbjct: 366 QGDGTPPGFV 375 >gi|300112772|ref|YP_003759347.1| hypothetical protein Nwat_0035 [Nitrosococcus watsonii C-113] gi|299538709|gb|ADJ27026.1| protein of unknown function DUF185 [Nitrosococcus watsonii C-113] Length = 393 Score = 253 bits (646), Expect = 3e-65, Method: Composition-based stats. Identities = 93/382 (24%), Positives = 147/382 (38%), Gaps = 40/382 (10%) Query: 2 ENKLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIF 59 KL I I + GQ+ ++ L + P GYY T G GDF+TAPE+S +F Sbjct: 20 SQKLENLIQTAIEQAGGQIPFARFMELALYAPGLGYYMTGLRKLGTSGDFITAPELSPLF 79 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 +A +E G ++E G G G + D+L + +++E S Sbjct: 80 ARCIARQCQQIFEMLG---TGNILEFGAGSGRLAADLLSELNLSGHLP---ERYFILELS 133 Query: 120 ERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIR 175 L Q++ L ++NW L D G ++ANE D++P F + Sbjct: 134 ADLRHRQQETLYQRVPLLAPRVNWLDRLPDSIDG--LVIANEVCDAMPAHCFQLENRHDW 191 Query: 176 ERMIDIDQHDSLVFNIGDHEIKSNF--------LTCSDYFLGAIFENSPCRDREMQSISD 227 ER + + F + + + E + + ++ Sbjct: 192 ERYVGY---EKDKFVWKKGPLSHSLLKDRIAKIRLLLKHVNNYESEINLAMEGWTTEVAH 248 Query: 228 RLACDGGTAIVIDY------GYLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQR 280 RL G ++IDY Y RV TL H +PL+ G D+++HVDF Sbjct: 249 RLQK--GMLLIIDYGFPRHEYYHPERVMGTLMCHYRHQAHPNPLILTGLQDITTHVDFTA 306 Query: 281 LSSIAILYKLYINGLTTQGKFLEGLGIWQRAFS-LMKQTARKDILLDSVKRLVSTSADKK 339 L+ L + G TQ FL G+ + A + + + +KRLV Sbjct: 307 LAEAGYSSGLRVAGYCTQADFLLACGLDKLAAAEIAAGGKQALETSQQIKRLVL----PS 362 Query: 340 SMGELFKILVVSHE-KVELMPF 360 MGELFK L ++ L+ F Sbjct: 363 EMGELFKALALTRGINQPLLGF 384 >gi|332024460|gb|EGI64658.1| Protein midA-like protein, mitochondrial [Acromyrmex echinatior] Length = 410 Score = 253 bits (645), Expect = 4e-65, Method: Composition-based stats. Identities = 129/400 (32%), Positives = 195/400 (48%), Gaps = 46/400 (11%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGE 61 + L R++ I G +T+ +Y + P GYY+T + FG GD+ T+PEISQ+FGE Sbjct: 12 KTDLYRQLYAKILACGPITLAEYMKEILLHPTAGYYTTRDVFGQRGDYTTSPEISQLFGE 71 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 ++AI++I W + ++LVELGPGRG ++ DILRV KL + S++++E S Sbjct: 72 IIAIWIINEWGKISR-DSIQLVELGPGRGTLINDILRVFKKLN-FSNKIRSVHLIEISPV 129 Query: 122 LTLIQKKQLASYGD--------------------------KINWYTSLADVPLGFTFLVA 155 L+ IQ ++L + KI WY S+ DVP F+ +A Sbjct: 130 LSAIQAEKLCTKSKSIEPRVNEDQKNSITHYREGVTRDNVKIYWYYSINDVPRKFSVFIA 189 Query: 156 NEFFDSLPIKQFVMTEHGIRERMIDIDQH---DSLVFNIGDHEIKSNFLTCSDYFLGAIF 212 EFFD+LPI +F T+ G RE +IDI Q + + + + + S + Sbjct: 190 QEFFDALPIHKFQKTDKGWREILIDIVQDSKEERFRYVLSQMPTAACKVYLSLHEKRDHV 249 Query: 213 ENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADL 272 E SP + +S L GG A+VIDYG+ + + DT +A H PL+NPG ADL Sbjct: 250 EISPQCSVIIDYMSQFLWEHGGFALVIDYGHEKEKT-DTFRAFCEHKLHDPLLNPGTADL 308 Query: 273 SSHVDFQRLSSIAILYK-LYINGLTTQGKFLEGLGIWQRAFSLMKQTA--RKDILLDSVK 329 ++ VDF L IA L G TQ KFL+ LGI R +++ + +K+ + Sbjct: 309 TADVDFLLLKEIAQKDNRLITFGPVTQRKFLKSLGIDLRLKMILQNASNNQKEHIESGY- 367 Query: 330 RLVSTSADKKSMGELFKILVVSH-------EKVELMPFVN 362 D+ MG FK+L + K + F + Sbjct: 368 ---HMIIDEDKMGNCFKVLSLFPFVLKDHLNKWPVAGFED 404 >gi|195998351|ref|XP_002109044.1| hypothetical protein TRIADDRAFT_52674 [Trichoplax adhaerens] gi|190589820|gb|EDV29842.1| hypothetical protein TRIADDRAFT_52674 [Trichoplax adhaerens] Length = 434 Score = 253 bits (645), Expect = 4e-65, Method: Composition-based stats. Identities = 118/398 (29%), Positives = 191/398 (47%), Gaps = 46/398 (11%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGE 61 + L++ +++ IK +G +++ Y + P GYY + + FG+ GDF T+PE++Q+FGE Sbjct: 38 KTPLVKDLISQIKADGPISIASYMRQVLTGPMGGYYMSSDVFGSKGDFTTSPEVNQMFGE 97 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 ++ I+L W Q S +++ELGPGRG + DILR I + + + LS+++VE S + Sbjct: 98 LIGIWLYYQWMQTRPKSHAQIIELGPGRGTLSADILRTIKQFR-NLQEGLSLHLVEISPK 156 Query: 122 LTLIQKKQLASYGD--------------------------KINWYTSLAD-VPLGFTFLV 154 L+ IQ+ + + I WY L D ++ +V Sbjct: 157 LSKIQEDTICMHDTKTTQSVKELDVKPAGCYKALMSSDGIPIYWYYHLKDVPNNDYSLVV 216 Query: 155 ANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLV---FNIGDHEIKSN------FLTCSD 205 ANEFFD+LPI QF E MID+D+ D F + + Sbjct: 217 ANEFFDALPIHQFRKVNGNWNEVMIDVDEGDGKHHLKFVLAPKPTLQTKLYTQDVMFAKS 276 Query: 206 YFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLV 265 + I E SP + I+DRL GG A+++DYG + DT++A + H V L Sbjct: 277 SKVKDIMEVSPDSATIYKEIADRLRVHGGCALIVDYGEFGT-GTDTIRAFRKHKQVHVLD 335 Query: 266 NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQT--ARKDI 323 PG ADL++ VDF L + + G QG+FL +GI R L+K+ +++D Sbjct: 336 APGSADLTADVDFAFLK-YTVENTVKFYGPIPQGQFLLQMGIQARLKMLIKELEKSQRDD 394 Query: 324 LLDSVKRLVSTSADKKSMGELFKILVVSHEKV-ELMPF 360 LL + L+ + MG FK+ + + + + F Sbjct: 395 LLSAYYMLI----NPNKMGLRFKVACMVYPGLGDPPGF 428 >gi|58264370|ref|XP_569341.1| hypothetical protein [Cryptococcus neoformans var. neoformans JEC21] gi|134110145|ref|XP_776283.1| hypothetical protein CNBC6720 [Cryptococcus neoformans var. neoformans B-3501A] gi|50258955|gb|EAL21636.1| hypothetical protein CNBC6720 [Cryptococcus neoformans var. neoformans B-3501A] gi|57225573|gb|AAW42034.1| conserved hypothetical protein [Cryptococcus neoformans var. neoformans JEC21] Length = 449 Score = 252 bits (644), Expect = 5e-65, Method: Composition-based stats. Identities = 106/389 (27%), Positives = 173/389 (44%), Gaps = 37/389 (9%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 N+L + I + IK G ++ +Y C++ P GYYS + FG GDF+T+PEISQIFGE+ Sbjct: 53 NELAKVIRDSIKSTGPISASRYMQFCLSHPVHGYYSKGDVFGQKGDFITSPEISQIFGEL 112 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 +AI+ + W + P+ VR++ELGPGRG +M D+LR + S+ S+++VE SE + Sbjct: 113 VAIWFLTRWMEVDSPTRVRIIELGPGRGTLMDDVLRTLFNFPGIAASINSVHLVENSEAM 172 Query: 123 TLIQKKQLASYG-------DKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIR 175 +Q + L + + + FT VA+EFFD++PI F T+ G R Sbjct: 173 REVQSQTLTPRIKGKDVKLNWYTSIEEIPETKDEFTLFVAHEFFDAMPINVFEKTDMGWR 232 Query: 176 ERMIDIDQHDSL-----------VFNIGDHE------IKSNFLTCSDYFLGAIFENSPCR 218 E +ID D + F + + S + G+ E S Sbjct: 233 EVLIDRDPSYTPNLPTSSTPSGLRFTLSPSPTTLSTILPSTSPRFAKLPSGSRIEVSQDS 292 Query: 219 DREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDF 278 + M + + G ++ + +A + H V +PG DL+++VDF Sbjct: 293 YKIMHRLGQVVNQGLGGCGLVVDYGADKAFASSFRAFRKHEIVDVFEDPGNCDLTANVDF 352 Query: 279 QRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQ--TARKDILLDSVKRLVSTSA 336 L G +Q +FL LG+ R L+ R++ + KRL+ Sbjct: 353 AYLRESLTGT-ATPLGPISQAQFLISLGLQPRLRKLLDTAPPERREAIEKGAKRLIDVL- 410 Query: 337 DKKSMGELFKILVVS------HEKVELMP 359 MG ++++ V E + P Sbjct: 411 ---GMGSQYQVMGVVSGEPEMKEGIYPFP 436 >gi|307195477|gb|EFN77363.1| UPF0511 protein CG17726 [Harpegnathos saltator] Length = 419 Score = 252 bits (644), Expect = 5e-65, Method: Composition-based stats. Identities = 130/398 (32%), Positives = 191/398 (47%), Gaps = 48/398 (12%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLA 64 L R++ I G +T+ +Y + P GYY T + FG GDF T+PEISQ+FGE++A Sbjct: 18 LYRQLYAKILACGPITLAEYMKEILIHPTAGYYMTRDVFGQKGDFTTSPEISQLFGEIIA 77 Query: 65 IFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTL 124 I++I W + V+LVELGPGRG ++ DILRV KL + +S+++VE S L+ Sbjct: 78 IWIINEWRKI-TNGPVQLVELGPGRGTLINDILRVFKKL--NLLDKVSVHLVEISPVLSQ 134 Query: 125 IQKKQL---------------------------ASYGDKINWYTSLADVPLGFTFLVANE 157 +Q ++L + G K+ WY S+ DVP F+ VA+E Sbjct: 135 LQAEKLCTESRNNESIADANEKSSVTYYKEGIAKNGGVKMYWYYSINDVPRNFSIFVAHE 194 Query: 158 FFDSLPIKQFVMTEHGIRERMIDI---DQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFEN 214 FFD+LPI +F T+ G RE ++DI + + + + + E Sbjct: 195 FFDALPIHKFQKTDKGWREVLVDIVQETNEERFRYVLSQTVTAACKVYLPPNEKRDHVEI 254 Query: 215 SPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSS 274 SP M +S L GG A+VIDYG+ + + DT +A H PL+ PG ADL++ Sbjct: 255 SPQCLVIMDYMSQFLWECGGFALVIDYGHEREKS-DTFRAFYQHKLHDPLLRPGTADLTA 313 Query: 275 HVDFQRLSSIAILYK-LYINGLTTQGKFLEGLGIWQRAFSLMKQTA--RKDILLDSVKRL 331 VDF + IA L G TQ +FL LGI R L++ T +K + Sbjct: 314 DVDFLLMKEIAQKNNRLITFGPVTQRRFLRNLGIDLRLKILLQNTTSIQKQQIESGY--- 370 Query: 332 VSTSADKKSMGELFKILVVSH-------EKVELMPFVN 362 DK MG FK+L + K ++ F + Sbjct: 371 -HMITDKDKMGNCFKVLTLFPFVLKDHLTKWPVVGFED 407 >gi|188996566|ref|YP_001930817.1| protein of unknown function DUF185 [Sulfurihydrogenibium sp. YO3AOP1] gi|188931633|gb|ACD66263.1| protein of unknown function DUF185 [Sulfurihydrogenibium sp. YO3AOP1] Length = 384 Score = 252 bits (643), Expect = 6e-65, Method: Composition-based stats. Identities = 98/364 (26%), Positives = 152/364 (41%), Gaps = 12/364 (3%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFG 60 + +LI I I++ G ++ + + + P GYY++ G +GDF T+ E+ FG Sbjct: 6 KEELINIIKQKIQQEGAISFKDFMEMALYYPNLGYYTSEKEKIGGLGDFYTSSELDPAFG 65 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 +LA +E + ++ ELG G+G++ D+L I P+ + L VE S Sbjct: 66 NLLAKQFNEIYENYFKNQKFQIAELGSGKGLLAYDVLSYIKNNYPNLYKTLEFISVEKSP 125 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 QKK L + D +++Y L ++ + +NE FD+LP+ I E I Sbjct: 126 YHRDYQKKLLKDF-DNVSFYEDLTEIDNINGIIYSNELFDALPVHLIRKIGGKIFEVYIT 184 Query: 181 IDQHD-SLVFNIGDHEIKSNFLTCS-DYFLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 ++ D V +I + D G E + +Q I ++L I Sbjct: 185 LEGDDIKEVLKEPQKDILQYLKDLNIDIPEGMTTEINLYAKDLIQEIGNKLEKGFVFTID 244 Query: 239 IDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSIAILYKLYIN 293 Y Y R+ TL HTY + N G D++SHV+F L KL Sbjct: 245 YGYPSKELYKPYRMRGTLLCYYKHTYNENFYQNVGLQDITSHVNFSALVYYGKKSKLDFV 304 Query: 294 GLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHE 353 G T Q FL LG+ L ++ K + + RL T K MGE FKIL+ S Sbjct: 305 GFTDQAHFLISLGLMDIFQELQEKGDYKS--YERLNRL-KTLILPKGMGEKFKILIQSKN 361 Query: 354 KVEL 357 Sbjct: 362 IQNP 365 >gi|94496200|ref|ZP_01302778.1| hypothetical protein SKA58_03780 [Sphingomonas sp. SKA58] gi|94424379|gb|EAT09402.1| hypothetical protein SKA58_03780 [Sphingomonas sp. SKA58] Length = 354 Score = 252 bits (643), Expect = 7e-65, Method: Composition-based stats. Identities = 120/360 (33%), Positives = 176/360 (48%), Gaps = 17/360 (4%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L ++ I G ++V Y A YY+T +P GA GDF TAPEISQ+FGE++ Sbjct: 7 PLADRLARQIASGGPISVAHYIAEANQH----YYATRDPLGAEGDFTTAPEISQMFGELV 62 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 + L W + G VELGPGRG + D LR + + +++VETS L Sbjct: 63 GLALADIWMRSGRSGQAAYVELGPGRGTLASDALRAMQRA----ALAPPVHLVETSPALR 118 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 Q+ L + + SL + G +VANEFFD+LP++Q + RER++ + Sbjct: 119 GRQQALLPTAIHH-DTIASLPE--QGPLLVVANEFFDALPVRQCIRVGDEWRERVLLPRE 175 Query: 184 H-DSLVFNIGDHEIKSNFLT-CSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDY 241 + G I+S +D GAI E+ +++ ++A GG AI++DY Sbjct: 176 EPGRFIAVAGYRRIESGLPPIAADAPDGAILESPIAAAEIAYALAQKIARQGGAAIIVDY 235 Query: 242 GYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKF 301 GY +GDTLQAVK H + P +PG+ DL++HVDF L ++A L ++G QG F Sbjct: 236 GYEGPALGDTLQAVKAHRFADPFADPGEVDLTTHVDFTMLGNMARQAGLRVHGPVGQGSF 295 Query: 302 LEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKV-ELMPF 360 L LGI RA L + V+ D +MG LFK + +H + F Sbjct: 296 LRQLGIDARAAQLAAGAPAR---AQDVQAAAHRLTDADAMGTLFKAMAWTHPDWADPAGF 352 >gi|258576949|ref|XP_002542656.1| conserved hypothetical protein [Uncinocarpus reesii 1704] gi|237902922|gb|EEP77323.1| conserved hypothetical protein [Uncinocarpus reesii 1704] Length = 486 Score = 252 bits (643), Expect = 8e-65, Method: Composition-based stats. Identities = 122/433 (28%), Positives = 184/433 (42%), Gaps = 83/433 (19%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-----NPFGAVGDFVTAPEIS 56 L + I I G +++ Y C+ PE GYY++ FG GDF+T+PEIS Sbjct: 40 STPLAKTIAEAINTTGPISIAAYMRQCLTSPEGGYYTSRGSPGAEVFGRRGDFITSPEIS 99 Query: 57 QIFGEMLAIFLICAWEQHGFPSC-VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 Q+FGE+L ++ + W G S V+L+E+GPGRG +M D+LR + K S+ +IY+ Sbjct: 100 QMFGELLGVWTVTEWMAQGRRSRGVQLIEVGPGRGTLMADMLRSVRNFKSFASSIEAIYL 159 Query: 116 VETSERLTLIQKKQLASYGD--------------------KINWYTSLADVPLGFTFLVA 155 VE S L IQK+ L SL F++A Sbjct: 160 VEASPTLRAIQKQMLCGDAPMEEIEAGYKSTSKHLGVPVIWAEHIRSLPQGDTDVPFIIA 219 Query: 156 NEFFDSLPIKQFV------------------------MTEHGIRERMIDI------DQHD 185 +EFFD+LPI F + RE ++ + + Sbjct: 220 HEFFDALPIHAFQSVASPPSDTIVTPTGPTKLRQPLASSPTQWRELVVSVNPAAEAHAEN 279 Query: 186 SLVFNI----------GDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRL------ 229 L F + S G+ E SP +Q + R+ Sbjct: 280 RLEFRLSLAKSTTPAAMVMPEMSERYKALKSTRGSTIEISPESHAYVQEFARRIGGKADG 339 Query: 230 ----ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIA 285 G A+++DYG S ++L+ +K H VSP +PGQ DLS+ VDF L+ A Sbjct: 340 RSPGRKPAGAALILDYGPSHSIPVNSLRGIKDHQLVSPFTSPGQVDLSADVDFVGLAEAA 399 Query: 286 ILY--KLYINGLTTQGKFLEGLGIWQRAFSLMK---QTARKDILLDSVKRLVSTSADKKS 340 I + ++G T QG FL+ LGI +RA LMK +++ + KRLV Sbjct: 400 IKASPGVEVHGPTEQGSFLQSLGIMERAAQLMKRAEDESKRKSIETGWKRLVERGG--GG 457 Query: 341 MGELFKILVVSHE 353 MG+++K + + E Sbjct: 458 MGKIYKAMAIVPE 470 >gi|281205305|gb|EFA79497.1| DUF185 family protein [Polysphondylium pallidum PN500] Length = 502 Score = 251 bits (642), Expect = 1e-64, Method: Composition-based stats. Identities = 119/384 (30%), Positives = 181/384 (47%), Gaps = 29/384 (7%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 + + + + G VD C+ +P++GYY + FG+ GDF+TAPEISQ+FGEM Sbjct: 108 TEFEKYLQTSAQIRGPFPVDTLIKECLTNPKYGYYMNKDVFGSGGDFITAPEISQLFGEM 167 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + I+ + WE G PS + +VELGPGRG +M DILR K DF+ +S +MVE S L Sbjct: 168 IGIWCVATWESMGMPSKLNIVELGPGRGTLMHDILRSTKVFK-DFYKAISCHMVEVSPHL 226 Query: 123 TLIQKKQLASYGD--------------KINWYTSLADVPLG-FTFLVANEFFDSLPIKQF 167 +QK +L + D +++WY ++ VP T +A EFFD+LPI F Sbjct: 227 RGMQKTKLLYFKDDKEGATTGKTPEGVQVSWYDNIDQVPNKVPTLYIAQEFFDALPINVF 286 Query: 168 VMTEHGIRERM---IDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAI------FENSPCR 218 ++ + DI + + Y L E Sbjct: 287 KFSKAKGWCEVLVDEDISKDGPYHLRFVMSSGPTLMTNAIQYLLPEFGVEDFTVELGVAG 346 Query: 219 DREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDF 278 Q I+ R+ + G A++IDYG +V +LQA++ H +V L PG ADLSS VDF Sbjct: 347 LGIAQKIALRIQENSGAALIIDYGQ-DKQVQTSLQAIRNHEFVDILDKPGSADLSSWVDF 405 Query: 279 QRLSSIAILYKLYIN--GLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSA 336 + K ++ G QG FL+ +GI R + K+ + + D VK Sbjct: 406 SSIRKCVKHLKKNVSAIGPVDQGIFLKEMGIEHRIERIAKKITDESKVEDLVKS-YHKLV 464 Query: 337 DKKSMGELFKILVVSHEKVELMPF 360 MG +K++ + +K+ + F Sbjct: 465 SPDEMGSTYKVITIIDKKLTPIGF 488 >gi|225851499|ref|YP_002731733.1| hypothetical protein PERMA_1980 [Persephonella marina EX-H1] gi|225646678|gb|ACO04864.1| hypothetical protein PERMA_1980 [Persephonella marina EX-H1] Length = 388 Score = 251 bits (641), Expect = 1e-64, Method: Composition-based stats. Identities = 98/376 (26%), Positives = 161/376 (42%), Gaps = 23/376 (6%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFG 60 + +L+ I N IKK G ++ + + + PE GYY++ G GDF TA E+ + FG Sbjct: 10 KQQLVNIIKNRIKKEGSISFRDFMDIALYYPELGYYTSPKAKIGGYGDFFTASELDKAFG 69 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 E+L + +++ G ++VE+G G+G + DIL + D + ++E S Sbjct: 70 ELLGKQFVEIYQKLG-EKNFQIVEIGAGKGYLAYDILNFLRANFEDVYRNSEYIIIEKSP 128 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFT--FLVANEFFDSLPIKQFVMTEHGIRERM 178 +QK+ L S+ D + W + D + +NE FDS P+ I E Sbjct: 129 YHVNLQKEILKSF-DNVRWVQDIIDFEDESITGVIFSNELFDSFPVHLIRKINGKIYEIY 187 Query: 179 IDIDQHDSLVFNIGDHE---IKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGT 235 I +DQ D++ + D I+ + G E + +Q I +L Sbjct: 188 ITVDQDDNVKEILKDPSEDIIRYLKELNINIPEGMTTEINLDAADYIQKIGKKLKKGYVI 247 Query: 236 AIVIDYG----YLQSRVGDTLQAVKGHTY-VSPLVNPGQADLSSHVDFQRLSSIAILYKL 290 I Y Y R+ TL H Y + N G D++SHV+F L+ + L Sbjct: 248 TIDYGYPSAELYKYYRMKGTLLCYYKHRYSENYYENVGMQDITSHVNFSALNYYGKIAGL 307 Query: 291 YINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVV 350 + G T Q FL LG+ L ++ + + + RL T K MGE FK+L+ Sbjct: 308 ELTGFTDQAHFLTNLGLMDIFAQLQEKGDYES--YERLNRL-KTLVLPKGMGEKFKVLIQ 364 Query: 351 SH-------EKVELMP 359 + +E++P Sbjct: 365 HKNVENPHLKGLEILP 380 >gi|126461008|ref|YP_001042122.1| hypothetical protein Rsph17029_0231 [Rhodobacter sphaeroides ATCC 17029] gi|126102672|gb|ABN75350.1| protein of unknown function DUF185 [Rhodobacter sphaeroides ATCC 17029] Length = 353 Score = 251 bits (640), Expect = 2e-64, Method: Composition-based stats. Identities = 143/359 (39%), Positives = 187/359 (52%), Gaps = 10/359 (2%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L + I G +TV Y A C+ PE GYYST PFGA GDF TAPEISQ+FGE+ Sbjct: 2 TALAALLARRIGATGPVTVADYMAECLLHPEHGYYSTREPFGAAGDFTTAPEISQMFGEL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L + L AW G PS V L ELGPGRG +M D+LR + P F +++VE S RL Sbjct: 62 LGLCLAQAWLDQGQPSPVTLAELGPGRGTLMADLLRATRGV-PGFHDAARVHLVEASPRL 120 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 +Q++ L + W AD+P G FLVANEFFD+LPI+QFV G RERM+ + Sbjct: 121 RALQREMLGGH--PAAWLDRAADLPEGPLFLVANEFFDALPIRQFVRGPEGWRERMVGLT 178 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 + + + + D G + E P M I+ R+A GG A+ +DYG Sbjct: 179 EGRLTWGLGPETALAALAHRLEDTAPGDVVELCPAAGPIMAEIARRIAAAGGLALAVDYG 238 Query: 243 YLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 +SR GDTLQA++ H + PL PG+ADL++HVDF+ L+ A L QG L Sbjct: 239 GWRSR-GDTLQALRAHRFDDPLAAPGEADLTAHVDFEALAQAAAPCG---TALVPQGALL 294 Query: 303 EGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM-PF 360 LG+ QRA L + + L S + D MG LFK L V + F Sbjct: 295 LRLGLAQRAARLARSLTGEA--LASHEAASRRLTDATEMGTLFKALAVFPPQGPAPAGF 351 >gi|294657281|ref|XP_459587.2| DEHA2E06072p [Debaryomyces hansenii CBS767] gi|199432572|emb|CAG87814.2| DEHA2E06072p [Debaryomyces hansenii] Length = 522 Score = 250 bits (639), Expect = 2e-64, Method: Composition-based stats. Identities = 112/416 (26%), Positives = 181/416 (43%), Gaps = 63/416 (15%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAV-GDFVTAPEISQIFGEML 63 L + IK G +++ + C+ P+FGYY+T +P A GDF+T+PEIS +FGEM+ Sbjct: 106 LTNLLSETIKTTGPISLSAFMRQCLTHPQFGYYTTRDPLNASSGDFITSPEISSMFGEMI 165 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFF--SVLSIYMVETSER 121 I+L W P + ++E GPGRG +M D L+ K K + + I M+E S Sbjct: 166 GIWLFSTWLNQNKPQKLNIIEFGPGRGTLMYDCLKSFNKFKKNLIQEENIEITMIEASSI 225 Query: 122 LTLIQKKQLASYG--------------------DKINWYTSLADVPLGFTFLVANEFFDS 161 L Q K L ++ T + ++VA+EFFD+ Sbjct: 226 LRKEQWKLLCGSNEFITNSDGFNISRTQWGNRVKWVDNETDITKDENVANYIVAHEFFDA 285 Query: 162 LPIKQFVMTEHGIRERMIDIDQ-------------------------HDSLVFNIGDHE- 195 LPIK F T+HG RE +++ + + E Sbjct: 286 LPIKSFQKTKHGWRELVVEHTPSVDNTQLSLPEDASSKSTSENNDLLNTEFHLTLSPKET 345 Query: 196 ----IKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLA--CDGGTAIVIDYGYLQSRVG 249 I D + + E P + + ++ L G+ ++IDYG + Sbjct: 346 SSSVIPDLNPRFKDLPIDSRIEICPDAELYVLKMAQLLNNEKGMGSVLIIDYGISEGIPD 405 Query: 250 DTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQ 309 +TL+ + H +VSP +NPG+ DLS VDF L ++ G QG +L LGI Sbjct: 406 NTLRGIYKHKFVSPFINPGEVDLSVDVDFTNLKNVTEKM-CKSFGPVEQGDWLHELGIGY 464 Query: 310 RAFSLMK----QTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM-PF 360 R L+K +D + ++ +RL D++SMG+++K L ++ + + F Sbjct: 465 RTDQLIKANDGNVNAQDKIYNAYQRLT--GKDERSMGKIYKFLCLTPHESKSPVGF 518 >gi|50740671|ref|XP_419525.1| PREDICTED: hypothetical protein [Gallus gallus] Length = 448 Score = 250 bits (639), Expect = 2e-64, Method: Composition-based stats. Identities = 118/372 (31%), Positives = 179/372 (48%), Gaps = 33/372 (8%) Query: 6 IRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAI 65 +R ++ ++ G +TV +Y + +P GYY+ G DF+T+PEISQIFGE++ I Sbjct: 54 LRHLLLKLRATGPVTVAEYMREALTNPGQGYYTRRGGVGE--DFITSPEISQIFGELIGI 111 Query: 66 FLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV-LSIYMVETSERLTL 124 + I W G + +LVELGPG G + DILRV +L +SI++VE S +L+ Sbjct: 112 WYISEWMAMGKQNAFQLVELGPGMGTLTGDILRVFNQLASLLSKCDVSIHLVEVSPKLSA 171 Query: 125 IQKKQLA-------------------SYGDKINWYTSLADVPLGFTFLVANEFFDSLPIK 165 IQ + L G I WY + DVP G++F +A+EF D+LPI Sbjct: 172 IQAEMLTGGKVQSNPENKSAYMKGISKTGIPIYWYRDIQDVPQGYSFYLAHEFLDALPIH 231 Query: 166 QFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQ 223 +F TE G E ++DID D L F + + E P +Q Sbjct: 232 KFQRTEKGWHEVLVDIDPEVPDQLRFVLSPSRTPATENFIQPEETRDHVEVCPEAGVLIQ 291 Query: 224 SISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSS 283 ++ R+ DGG A++ DYG+ ++ DT + + H L PG ADL++ VDF L Sbjct: 292 RLACRIEKDGGAALIADYGHDGTKT-DTFRGFRNHKLHDVLKAPGTADLTADVDFSYLRK 350 Query: 284 IAILYKLYINGLTTQGKFLEGLGIWQRAFSL---MKQTARKDILLDSVKRLVSTSADKKS 340 +A + G Q +FL+ +GI R L + +A + LL S L+ + K Sbjct: 351 MAE-GRTATLGPIKQREFLKNMGIDLRLQVLLQHSRNSATHEQLLHSFDMLM----NPKK 405 Query: 341 MGELFKILVVSH 352 MG+ F + Sbjct: 406 MGDCFHFFALLP 417 >gi|85375189|ref|YP_459251.1| hypothetical protein ELI_11810 [Erythrobacter litoralis HTCC2594] gi|84788272|gb|ABC64454.1| hypothetical protein ELI_11810 [Erythrobacter litoralis HTCC2594] Length = 351 Score = 250 bits (638), Expect = 2e-64, Method: Composition-based stats. Identities = 124/359 (34%), Positives = 174/359 (48%), Gaps = 19/359 (5%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 M L LI+ G ++V Q+ YY + +P G+ GDF+TAPEISQ+FG Sbjct: 1 MAETLAEIFRRLIRNTGPISVSQFMGES----NARYYDSRDPLGSAGDFITAPEISQMFG 56 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 E++ ++L W G V+ VELGPGRG + D LR K V ++ VE S Sbjct: 57 ELIGLWLADMWINAGRDEYVQYVELGPGRGTLAKDALRAARKYG----FVPPVHFVEGSA 112 Query: 121 RLTLIQKKQLASYGDKINWYTSLAD-VPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMI 179 L Q K A ++ L+ VANEF D+LP++Q V T G RERM+ Sbjct: 113 TLREEQAKAFAE----AQFHNDLSTLPVDVPLVFVANEFLDALPVRQLVRTGQGWRERMV 168 Query: 180 DIDQHDSLVFNIGDHEIKSNFL-TCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 + + + VF GD + S D G I E P + ++ RL GGTA+ Sbjct: 169 ALGEDERFVFVAGDRPMDSAVPADWRDADPGTILETCPGAAATLYEVAGRLVEQGGTALF 228 Query: 239 IDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQ 298 IDYGY G TLQAV+ H V PL PG ADL++ VDF L+ +A + G Q Sbjct: 229 IDYGYETLEAGSTLQAVRAHDKVDPLAEPGSADLTALVDFGTLARVAQSREARHIGTVEQ 288 Query: 299 GKFLEGLGIWQRAFSLMKQTAR-KDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE 356 G +L LGI RA +L + L ++ +RL++ MG LFK++ ++ Sbjct: 289 GAWLSALGIEARAQALAAKAPHYAAELEEAKQRLIA----PDQMGSLFKVMGLAGPGWN 343 >gi|237756343|ref|ZP_04584893.1| ATP synthase beta subunit/transription termination factor [Sulfurihydrogenibium yellowstonense SS-5] gi|237691494|gb|EEP60552.1| ATP synthase beta subunit/transription termination factor [Sulfurihydrogenibium yellowstonense SS-5] Length = 384 Score = 250 bits (638), Expect = 2e-64, Method: Composition-based stats. Identities = 99/364 (27%), Positives = 154/364 (42%), Gaps = 12/364 (3%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFG 60 + +LI I I++ G ++ + + + P GYY++ G +GDF T+ E+ FG Sbjct: 6 KEELINIIKQKIQQEGAISFKDFMEMALYYPNLGYYTSEKEKIGGLGDFYTSSELDPAFG 65 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 +LA +E + ++VE+G G+G + D+L I P+ + +L VE S Sbjct: 66 NLLAKQFNEIYENYFKNQKFQIVEIGSGKGYLAYDVLSYIKNNYPNLYKILEFISVEKSP 125 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 QKK L + D I+++ L ++ + +NE FD+LP+ I E I Sbjct: 126 YHRDYQKKLLKDF-DNISFHEDLTEINNINGIIYSNELFDALPVHLIRKINGKIFEVYIT 184 Query: 181 IDQHD-SLVFNIGDHEIKSNFLTCS-DYFLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 ++ D V +I + D G E + +Q I ++L I Sbjct: 185 LEGDDIKEVLKEPQKDILQYLKDLNIDISEGMTTEINLYAKDLIQEIGNKLEKGFVFTID 244 Query: 239 IDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSIAILYKLYIN 293 Y Y R+ TL HTY + N G D++SHV+F L KLY Sbjct: 245 YGYPSKELYKPYRMRGTLLCYYKHTYNENFYQNVGLQDITSHVNFSALVYYGKKSKLYFL 304 Query: 294 GLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHE 353 G T Q FL LG+ L ++ K + + RL T K MGE FKIL+ S Sbjct: 305 GFTDQAHFLISLGLMDIFQELQEKGDYKS--YERLNRL-KTLILPKGMGEKFKILIQSKN 361 Query: 354 KVEL 357 Sbjct: 362 IQNP 365 >gi|88607733|ref|YP_505157.1| hypothetical protein APH_0566 [Anaplasma phagocytophilum HZ] gi|88598796|gb|ABD44266.1| conserved hypothetical protein [Anaplasma phagocytophilum HZ] Length = 326 Score = 250 bits (638), Expect = 3e-64, Method: Composition-based stats. Identities = 119/336 (35%), Positives = 175/336 (52%), Gaps = 17/336 (5%) Query: 19 MTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPS 78 M +D++ + D GYY T PFG GDF+T+P+ISQ+FGE +AI+L+ E Sbjct: 1 MPIDKFMREALYDRTCGYYMTHVPFGLSGDFITSPDISQLFGETIAIWLLQYLEYVKLSE 60 Query: 79 CVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLAS--YGDK 136 LVELGPGRG +M DILR++ P + S+ +++VE S L IQK+ L K Sbjct: 61 RCILVELGPGRGTLMSDILRILSCF-PQYDSLFEVHLVEISPLLRNIQKETLKEAMLRKK 119 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 I W+ S+ D+P T L+ANEFFD+LPIKQFV + E + +I + Sbjct: 120 IFWHDSVYDLPECTTILIANEFFDALPIKQFVFHDGMWFENYVRSCAEG---LDIIPVKS 176 Query: 197 KSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVK 256 ++ G + E +++I L GGTA+++DYGY+ T+QAV+ Sbjct: 177 TDFIFPDNNVPDGGVIEICEAATDIIRNIEGVLLKHGGTALIVDYGYMHPVYKSTIQAVR 236 Query: 257 GHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMK 316 H Y S L + G++D+S+ VDF L K TQ +FL GI +R LM+ Sbjct: 237 NHQYCSFLDHIGESDISASVDFVMLQKSLKEIKCEA---MTQREFLYRFGIRERLEFLMQ 293 Query: 317 --QTARKDILLDSVKRLVSTSADKKSMGELFKILVV 350 Q + + L RL ++MG LFK+L++ Sbjct: 294 RAQAKQAEDLKCGFLRLT------ENMGTLFKVLLL 323 >gi|329851271|ref|ZP_08266028.1| hypothetical protein ABI_41120 [Asticcacaulis biprosthecum C19] gi|328840117|gb|EGF89689.1| hypothetical protein ABI_41120 [Asticcacaulis biprosthecum C19] Length = 348 Score = 250 bits (637), Expect = 3e-64, Method: Composition-based stats. Identities = 108/348 (31%), Positives = 168/348 (48%), Gaps = 13/348 (3%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L +++ I G M V Y A C+ DP+ GYY+ GA GDF+TAP +SQ+FGEM+ Sbjct: 2 SLRDRLIEQITLEGPMNVADYMARCLFDPQDGYYTCHVRIGADGDFLTAPMVSQMFGEMI 61 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 +++ W G P RLVE+G G G +M DILRV ++ P + MVE S RL Sbjct: 62 GVWVAQMWLALGSPPAFRLVEIGGGDGTLMSDILRVAKRV-PGLSDAAQVTMVEPSPRLR 120 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 Q++ ++ + D+ ++ANE D LP +QFV T++G E+ + + Sbjct: 121 ASQEQTISQAVFVPDVNALATDL---PVIVIANEVLDCLPARQFVRTDNGWAEKCVGV-I 176 Query: 184 HDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY 243 L F + E + D G E S + ++ L G A+++DYG Sbjct: 177 DGHLAFGLVPTE----YTPQLDAEPGQTIEISAAQQHFAAQLTSLLKASTGAALLVDYGR 232 Query: 244 LQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLE 303 Q GDTLQA+ H PL PG DL+ DF ++ I + + + TQ FL+ Sbjct: 233 DQPEAGDTLQALHNHRKTDPLAAPGDHDLTVWADFPAIAQICNTS-VKFSTIKTQSAFLQ 291 Query: 304 GLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVS 351 LG+ R +L + + + ++R MG+LFK++ ++ Sbjct: 292 ALGMAARFNALCEAHPTE---AEKLQRQYDRLTAPDQMGDLFKVVGMA 336 >gi|300313681|ref|YP_003777773.1| hypothetical protein Hsero_4398 [Herbaspirillum seropedicae SmR1] gi|300076466|gb|ADJ65865.1| conserved hypothetical protein [Herbaspirillum seropedicae SmR1] Length = 394 Score = 249 bits (636), Expect = 4e-64, Method: Composition-based stats. Identities = 100/363 (27%), Positives = 156/363 (42%), Gaps = 23/363 (6%) Query: 4 KLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGE 61 L +I I G ++ +Y L + P+ GYYS G GDF TAPEIS ++G Sbjct: 26 TLQHQIAGEIAAAGGWISFQRYMELALYAPQVGYYSGGSAKLGKEGDFTTAPEISPLYGA 85 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 LA L L+E G G G + DIL + ++VE S + Sbjct: 86 TLAS-LAAEVIAASPSVDNVLLEFGAGTGKLAHDILTELQARAALPQR---YFIVEISAQ 141 Query: 122 LTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 L Q++ LA++ + W +L G +V NE D++P++ V E G +ER + + Sbjct: 142 LRARQQQTLAAFAPLVQWLDALPATFSG--VVVGNEVLDAMPVRLAVKAEQGWQERGVAL 199 Query: 182 DQHDSLVFNIGDHEIKS--NFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGT-AIV 238 D L F + + G + E +P M ++ LA G AI+ Sbjct: 200 DAEGRLRFEDRSVDDLPLAQIPDAHELPPGYLTEVAPVAIGFMHTLGRMLASGPGALAIL 259 Query: 239 IDY------GYLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLY 291 DY YL R T + + H + P PG D+++HVDF ++ A+ Sbjct: 260 PDYGFPAAEYYLHDRDQGTLMCHYRHHAHTDPFYWPGLQDITAHVDFTAMAVAAVEEGAE 319 Query: 292 INGLTTQGKFLEGLGIWQRAFSLM-KQTARKDILLDSVKRLVSTSADKKSMGELFKILVV 350 + T+QG FL GI + + +AR L + +++L+S MGELFK+L + Sbjct: 320 VLAYTSQGAFLLNAGIGELLLRTSPEDSARYLPLANGMQKLIS----PAEMGELFKVLAI 375 Query: 351 SHE 353 Sbjct: 376 GKN 378 >gi|297265809|ref|XP_002799255.1| PREDICTED: protein midA homolog, mitochondrial-like [Macaca mulatta] Length = 399 Score = 249 bits (636), Expect = 5e-64, Method: Composition-based stats. Identities = 124/374 (33%), Positives = 184/374 (49%), Gaps = 33/374 (8%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLA 64 ++R ++ IK G +TV +Y + +P GYY + G GDF+T+PEISQIFGE+L Sbjct: 1 MLRHLMYKIKSTGPITVAEYMKEVLTNPAKGYYVYRDMLGKQGDFITSPEISQIFGELLG 60 Query: 65 IFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV-LSIYMVETSERLT 123 I+ I W G + +LVELGPGRG ++ DILRV +L + +S+++VE S++L+ Sbjct: 61 IWFISEWMATGKSTAFQLVELGPGRGTLVGDILRVFTQLGSVLKNCDISVHLVEVSQKLS 120 Query: 124 LIQ--------------------KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLP 163 IQ K + G I+WY + DVP G++F +A+EFFD LP Sbjct: 121 EIQALTLTEEKVPLERNAGSPVYMKGVTKSGIPISWYRHVHDVPKGYSFYLAHEFFDVLP 180 Query: 164 IKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDRE 221 + +F T G RE IDID D L F + + D E P Sbjct: 181 VHKFQKTPQGWREVFIDIDPQVSDKLRFVLAPSATPAEAFIQHD-ETRDHVEVCPDAGVI 239 Query: 222 MQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRL 281 ++ +S R+A GG A+V DYG+ ++ + GH L+ PG ADL++ VDF L Sbjct: 240 IEELSQRIALTGGAALVADYGHDGTKT-FMFKGFCGHKLHDVLIAPGTADLTADVDFSYL 298 Query: 282 SSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR---KDILLDSVKRLVSTSADK 338 +A K+ G Q FL+ +GI R L+ ++ + LL L+ + Sbjct: 299 RRMA-QGKVASLGPIKQHTFLKNMGIDVRLKVLLDKSNEPSVRQQLLQGYDMLM----NP 353 Query: 339 KSMGELFKILVVSH 352 K MGE F + Sbjct: 354 KKMGERFNFFALLP 367 >gi|326915451|ref|XP_003204031.1| PREDICTED: protein midA homolog, mitochondrial-like, partial [Meleagris gallopavo] Length = 389 Score = 249 bits (635), Expect = 6e-64, Method: Composition-based stats. Identities = 116/366 (31%), Positives = 176/366 (48%), Gaps = 33/366 (9%) Query: 12 LIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAW 71 ++ G +TV +Y + +P GYY+ G DF+T+PEISQIFGE++ I+ + W Sbjct: 1 KLRATGPVTVAEYMREALTNPGQGYYTRRGGVGE--DFITSPEISQIFGELIGIWYVSEW 58 Query: 72 EQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV-LSIYMVETSERLTLIQ---- 126 G + +LVELGPG G + DILRV +L +SI++VE S +L+ IQ Sbjct: 59 MAMGKQNAFQLVELGPGTGTLTDDILRVFNQLASLLSKCDVSIHLVEVSPKLSAIQAEVL 118 Query: 127 ---------------KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 K ++ G I WY + DVP G++F +A+EF D+LPI +F TE Sbjct: 119 TGGKVQSNPENRSAYMKGISKSGIPIYWYRDIQDVPQGYSFYLAHEFLDALPIHKFQRTE 178 Query: 172 HGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRL 229 G E ++DID D L F + + E P +Q ++ R+ Sbjct: 179 KGWHEVLVDIDPEVPDQLRFVLSPSRTPATENFIQPEETRDHVEVCPEAGILIQRLACRI 238 Query: 230 ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYK 289 DGG A++ DYG+ ++ DT + + H L PG ADL++ VDF L +A + Sbjct: 239 EKDGGAALIADYGHDGTKT-DTFRGFRNHKLHDVLKAPGTADLTADVDFSYLRKMAE-GR 296 Query: 290 LYINGLTTQGKFLEGLGIWQRAFSL---MKQTARKDILLDSVKRLVSTSADKKSMGELFK 346 G Q +FL+ +GI R L + +A + LL S L+ + K MG+ F Sbjct: 297 TATLGPIKQREFLKNMGIDLRLQVLLQNSRNSATHEQLLHSFDMLM----NPKKMGDCFH 352 Query: 347 ILVVSH 352 + Sbjct: 353 FFALLP 358 >gi|242005675|ref|XP_002423688.1| conserved hypothetical protein [Pediculus humanus corporis] gi|212506864|gb|EEB10950.1| conserved hypothetical protein [Pediculus humanus corporis] Length = 428 Score = 248 bits (634), Expect = 7e-64, Method: Composition-based stats. Identities = 122/377 (32%), Positives = 183/377 (48%), Gaps = 39/377 (10%) Query: 7 RKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIF 66 + + I NG +TV Y C+A+P GYY + G GDF+T+PEISQ+FGE++ + Sbjct: 44 KALHLKITANGPITVADYMRQCLANPSLGYYMQKDMIGEKGDFITSPEISQMFGEIIGTW 103 Query: 67 LICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQ 126 + + + G P +++ELGPG+G +M D+L+ + K LS+++VE S L +Q Sbjct: 104 IFHEFRKIGSPKPWQIIELGPGKGTLMKDVLKTLNTFKAT--DDLSVHLVEISPGLASLQ 161 Query: 127 KKQLAS------------------------YGDKINWYTSLADVPLGFTFLVANEFFDSL 162 L+S Y I WY L VP GF+ ++A+EFFD+L Sbjct: 162 ATTLSSDVINIGVVSFEDKSNSHYKTCSSLYKVPIYWYDKLEKVPKGFSVILAHEFFDAL 221 Query: 163 PIKQFVMTEHGIRERMIDIDQH----DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCR 218 PI +F T G RE +ID + + + I E + L + E S Sbjct: 222 PIHKFQKTSSGWREVLIDSTMNENGTNKFNYVISPKETVMSKLLIAKDEKRDHVEISHEA 281 Query: 219 DREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDF 278 Q I+ RL GG ++ IDYG+ + DT +A K H V PL NPG +DL++ V+F Sbjct: 282 GLVAQEIAQRLEEFGGFSLFIDYGHFGEK-QDTFRAFKNHQLVDPLTNPGLSDLTADVNF 340 Query: 279 QRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLM---KQTARKDILLDSVKRLVSTS 335 L+ I K++I+G QGKFL +G+ R L+ K + IL+ + L Sbjct: 341 NYLTEIMKD-KMFISGPIEQGKFLNNMGMNLRLKMLLKSCKNEQDEKILVSAYNMLT--- 396 Query: 336 ADKKSMGELFKILVVSH 352 D MG +K L Sbjct: 397 -DDDKMGSRYKCLAAFP 412 >gi|254456091|ref|ZP_05069520.1| ATP synthase beta subunit/transription termination factor rho [Candidatus Pelagibacter sp. HTCC7211] gi|207083093|gb|EDZ60519.1| ATP synthase beta subunit/transription termination factor rho [Candidatus Pelagibacter sp. HTCC7211] Length = 324 Score = 248 bits (634), Expect = 8e-64, Method: Composition-based stats. Identities = 108/324 (33%), Positives = 170/324 (52%), Gaps = 11/324 (3%) Query: 38 STCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDIL 97 NPFG GDF+TAP I+++F EM+AI+++ W+ G P L ELG G G MM I+ Sbjct: 1 MKKNPFGKEGDFITAPNITRLFSEMIAIWIVTFWKSIGSPKKFNLFELGAGNGEMMKVIV 60 Query: 98 RVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANE 157 + P+ F Y+ E S+ LT Q+ L+S + I W ++ + T +ANE Sbjct: 61 ETLKNF-PECFENCKFYIHEKSKLLTKQQQSNLSS--ENIEWVDNIKKINSNPTIFLANE 117 Query: 158 FFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFN--IGDHEIKSNFLTCSDYFLGAIFENS 215 FFD+LPIKQF + G ER ++ + VF I + E L I E S Sbjct: 118 FFDALPIKQFFKKKEGWFERYVNFKEIKKAVFKDQIINIEEIEKKLKFKISKDQDIIEYS 177 Query: 216 PCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSH 275 P +Q+I + + + G ++IDYGYL S++ +TLQAVK H Y L + G +D++ + Sbjct: 178 PSSFNYLQNICEIININNGGMLIIDYGYLDSKMHETLQAVKNHKYSDILKDIGNSDITYN 237 Query: 276 VDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQT--ARKDILLDSVKRLVS 333 ++F + + ++ Q KFL +GI QRA + + ++K L V+RL+ Sbjct: 238 INFNLFKQFIDQFDDLNSIISNQKKFLTSMGILQRAEIISENIPFSKKTDLFYRVRRLI- 296 Query: 334 TSADKKSMGELFKILVVSHEKVEL 357 D+K MG LFK++++ + + Sbjct: 297 ---DEKQMGNLFKVMLIKKTENKF 317 >gi|88608244|ref|YP_506481.1| hypothetical protein NSE_0601 [Neorickettsia sennetsu str. Miyayama] gi|88600413|gb|ABD45881.1| conserved hypothetical protein [Neorickettsia sennetsu str. Miyayama] Length = 323 Score = 248 bits (634), Expect = 8e-64, Method: Composition-based stats. Identities = 119/346 (34%), Positives = 180/346 (52%), Gaps = 23/346 (6%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLA 64 + I N I+ NG ++ ++ L + P GYY T NP G GD++TAPEIS +FG+ +A Sbjct: 1 MRSYIENFIRGNGSISFSKFIELSMYHPSKGYYMTRNPIGKSGDYITAPEISSLFGKTIA 60 Query: 65 IFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTL 124 ++++ WE+ G P + L ELGPG G+MM DIL I ++P + ++++MVE S L Sbjct: 61 VWILEQWEKLGKPGEIVLAELGPGSGMMMFDILNTIRNIEPF-YDAVTVHMVEISPFLRG 119 Query: 125 IQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQH 184 +Q + L + K W S+ ++P G ++ANEFFD+LPI QF+ ER Sbjct: 120 VQMENLRPHSCKTRWCKSVDELPNGKLLVLANEFFDALPIDQFIFWGGNFFER------- 172 Query: 185 DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYL 244 + + E K+ ++ G I E S + SI R+A DGG ++IDYG+ Sbjct: 173 -KITEDFQVEEEKTQKKFSGEFKDGDIVEISLLGKQIASSILARIAKDGGGGLIIDYGHA 231 Query: 245 QSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEG 304 T+QAVKGH ++ + G++D++ +DF L A L TQG FL Sbjct: 232 TRTRRSTVQAVKGHRFIDIFESIGESDITHEIDFSYLFPRAK--------LMTQGDFLSL 283 Query: 305 LGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVV 350 GI++ A IL ++ RLV DK MG LFK ++ Sbjct: 284 YGIFEFAKRFSATDTG--ILEQTLSRLV----DKGKMGRLFKCAII 323 >gi|119177909|ref|XP_001240685.1| hypothetical protein CIMG_07848 [Coccidioides immitis RS] Length = 487 Score = 248 bits (634), Expect = 8e-64, Method: Composition-based stats. Identities = 121/434 (27%), Positives = 186/434 (42%), Gaps = 84/434 (19%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-----NPFGAVGDFVTAPEIS 56 L + I ++I G +++ Y C+ PE GYY++ FG GDFVT+PEIS Sbjct: 40 STPLAKTIADVINTAGPISIAAYMRQCLTSPEGGYYTSRGSTGVEVFGRKGDFVTSPEIS 99 Query: 57 QIFGEMLAIFLICAWEQHGFPSC-VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 Q+FGE+L ++++ W G S V+L+E+GPGRG +M D+LR + K S+ ++Y+ Sbjct: 100 QMFGELLGVWMVTEWMAQGRRSRGVQLIEVGPGRGTLMADMLRSVRNFKSFSSSIEAVYL 159 Query: 116 VETSERLTLIQKKQLASYGD--------------------KINWYTSLADVPLGFTFLVA 155 VE S L IQK+ L SL F++A Sbjct: 160 VEASPTLRDIQKQMLCGDAPMEEIEVGYRSTSKHLGVPVVWTEHIRSLPQGDNDVPFIIA 219 Query: 156 NEFFDSLPIKQFVM------------------------TEHGIRERMIDI------DQHD 185 +EFFD+LPI F + RE ++ + + Sbjct: 220 HEFFDALPIHAFQCVASPPSETIITPTGPTTLRQPLSSSPTQWRELVVSVNPASQMHAEN 279 Query: 186 SLVFNIGDHEIKS----------NFLTCSDYFLGAIFENSPCRDREMQSISDRL------ 229 L F + + + G+ E SP +Q + R+ Sbjct: 280 RLEFRLSLAKTSTPASMVMPEMSERYKALKSTRGSTIEISPESQGYVQEFARRIGGHSNS 339 Query: 230 -----ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSI 284 G A+++DYG S ++L+ +K H VSP +PGQ DLS+ VDF L+ Sbjct: 340 KIPTTRKPAGAALILDYGPSHSIPVNSLRGIKDHKLVSPFTSPGQVDLSADVDFIALADS 399 Query: 285 AILY--KLYINGLTTQGKFLEGLGIWQRAFSLMKQTA---RKDILLDSVKRLVSTSADKK 339 AI + ++G T QG FL LGI +RA LMK+ ++ + KRLV Sbjct: 400 AISASPGVEVHGPTEQGSFLHSLGISERAAQLMKRAEDETKRKNIEAGWKRLVERGG--G 457 Query: 340 SMGELFKILVVSHE 353 MG ++K + + E Sbjct: 458 GMGRIYKAMAIIPE 471 >gi|82701471|ref|YP_411037.1| hypothetical protein Nmul_A0337 [Nitrosospira multiformis ATCC 25196] gi|82409536|gb|ABB73645.1| Protein of unknown function DUF185 [Nitrosospira multiformis ATCC 25196] Length = 424 Score = 248 bits (634), Expect = 8e-64, Method: Composition-based stats. Identities = 96/380 (25%), Positives = 152/380 (40%), Gaps = 35/380 (9%) Query: 5 LIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGEM 62 L + I I G ++ + Y L + P GYYS FG GDFVTAPEIS +FG Sbjct: 47 LTKLIHEKISAAGGWISFEHYMRLALYAPGMGYYSGGPAKFGQEGDFVTAPEISPLFGRT 106 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 +A E SC ++E G G G + LD+L + KL +++E S L Sbjct: 107 VARQARQILELADEGSC--ILEFGAGTGKLALDLLVELEKLDCLPQ---QYFILEVSAEL 161 Query: 123 TLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERM 178 Q++ L ++ W L + G ++ANE D++P+ + ER Sbjct: 162 RQRQRQLLEQFAPHLASRVFWLKHLPEQFNG--LILANEVLDAMPVHLIAWRGTTVYERG 219 Query: 179 IDIDQHDSLVFN--------IGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLA 230 + H+ + + ++ + + E + S+ L Sbjct: 220 VSSAGHEFIWSERLLAEGVLFEAAQELADRIRLGRNEGEYVSEICLQARGFIASLGKMLQ 279 Query: 231 CDGGTAIVI-----DYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIA 285 I +Y + Q R G + + HT+ +P PG D++SHVDF +S Sbjct: 280 RGAILLIDYGFGRDEYYHPQRRQGTLMCHYRHHTHDNPFYLPGLQDITSHVDFSSAASSG 339 Query: 286 ILYKLYINGLTTQGKFLEGLGIWQ-RAFSLMKQTARKDILLDSVKRLVSTSADKKSMGEL 344 + L + G TTQ FL GI + A + L + V++LVS MGEL Sbjct: 340 LEAGLQLLGYTTQAHFLINCGITEILAETPAANAKDYLPLANQVQKLVS----PAEMGEL 395 Query: 345 FKILVVSH---EKVELM-PF 360 FK++++ F Sbjct: 396 FKVMILGKGIGNNHPPPVGF 415 >gi|300865239|ref|ZP_07110053.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506] gi|300336712|emb|CBN55203.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506] Length = 406 Score = 248 bits (633), Expect = 1e-63, Method: Composition-based stats. Identities = 100/378 (26%), Positives = 162/378 (42%), Gaps = 32/378 (8%) Query: 4 KLIRKIVNLIKKNGQ--MTVDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEISQIFG 60 +L I I + Q +T +Y L + P+ GYY+T G GDF T+P + FG Sbjct: 14 ELCDLIFQRIATSSQQQITFAEYMDLALYHPQHGYYTTNEVNIGKHGDFFTSPHLGADFG 73 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 E+LA + W+ G P+ +VE+G G+GI+ DIL + DF +L ++E S Sbjct: 74 EVLAEQFVQMWDILGKPNSFIIVEMGAGQGILAADILAYLQLQYLDFSQILEYVIIEKSA 133 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 L Q+++L + +NE D+LP+ Q ++ + I+E + Sbjct: 134 VLKAEQQQRLTTIKSVRWCNWDEIPPNSIAGCFFSNELVDALPLHQIIIDKGQIKEVYVT 193 Query: 181 ----IDQHDSLVFNIGD-------HEIKSNFL------TCSDYFLGAIFENSPCRDREMQ 223 + + + +I F + S Y G E + + Sbjct: 194 AESQVQEDGKTARKFAEVIGEVSTPKISEYFNLVGINLSASGYTDGYRTEVNLAALDWIT 253 Query: 224 SISDRLACDGGTAIVIDYGYLQSRVGD------TLQAVKGHTYV-SPLVNPGQADLSSHV 276 +++++L G + IDYGY R + TLQ H + +P + G+ DL++HV Sbjct: 254 TVAEKLQR--GYLLTIDYGYPAHRYYNQNRREGTLQCYYHHQHHNNPYIYVGKQDLTAHV 311 Query: 277 DFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKR--LVST 334 DF L L L + GLT QG FL LG+ R +L A+ L ++R + Sbjct: 312 DFTALEKQGELCGLELVGLTQQGLFLMALGLGDRIAALSSNDAQVLDLAAFLRRREALHQ 371 Query: 335 SADKKSMGELFKILVVSH 352 D +G F +LV S Sbjct: 372 LIDPMGLGG-FGVLVQSK 388 >gi|303315715|ref|XP_003067862.1| hypothetical protein CPC735_041610 [Coccidioides posadasii C735 delta SOWgp] gi|240107538|gb|EER25717.1| hypothetical protein CPC735_041610 [Coccidioides posadasii C735 delta SOWgp] gi|320031581|gb|EFW13542.1| hypothetical protein CPSG_09889 [Coccidioides posadasii str. Silveira] Length = 487 Score = 248 bits (632), Expect = 1e-63, Method: Composition-based stats. Identities = 121/434 (27%), Positives = 186/434 (42%), Gaps = 84/434 (19%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-----NPFGAVGDFVTAPEIS 56 L + I ++I G +++ Y C+ PE GYY++ FG GDFVT+PEIS Sbjct: 40 STPLAKTIADVINTAGPISIAAYMRQCLTSPEGGYYTSRGSTGVEVFGRRGDFVTSPEIS 99 Query: 57 QIFGEMLAIFLICAWEQHGFPSC-VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 Q+FGE+L ++++ W G S V+L+E+GPGRG +M D+LR + K S+ ++Y+ Sbjct: 100 QMFGELLGVWMVTEWMAQGRRSRGVQLIEVGPGRGTLMADMLRSVRNFKSFSSSIEAVYL 159 Query: 116 VETSERLTLIQKKQLASYGD--------------------KINWYTSLADVPLGFTFLVA 155 VE S L IQK+ L SL F++A Sbjct: 160 VEASPTLRDIQKQMLCGDAPMEEIEVGYRSTSKHLGVPVVWTEHIRSLPQGDNDVPFIIA 219 Query: 156 NEFFDSLPIKQFVM------------------------TEHGIRERMIDI------DQHD 185 +EFFD+LPI F + RE ++ + + Sbjct: 220 HEFFDALPIHAFQCVASPPSETIITPTGPTTLRQPLSSSPTQWRELVVSVNPASQMHAEN 279 Query: 186 SLVFNIGDHEIKS----------NFLTCSDYFLGAIFENSPCRDREMQSISDRL------ 229 L F + + + G+ E SP +Q + R+ Sbjct: 280 RLEFRLSLAKTSTPASMVMPEMSERYKALKSTRGSTIEISPESQGYVQEFARRIGGHSNS 339 Query: 230 -----ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSI 284 G A+++DYG S ++L+ +K H VSP +PGQ DLS+ VDF L+ Sbjct: 340 KIPTTRKPAGAALILDYGPSHSIPVNSLRGIKDHKLVSPFTSPGQVDLSADVDFIALADS 399 Query: 285 AILY--KLYINGLTTQGKFLEGLGIWQRAFSLMKQTA---RKDILLDSVKRLVSTSADKK 339 AI + ++G T QG FL LGI +RA LMK+ ++ + KRLV Sbjct: 400 AISASPGVEVHGPTEQGSFLHSLGISERAAQLMKRAEDETKRKNIEAGWKRLVERGG--G 457 Query: 340 SMGELFKILVVSHE 353 MG ++K + + E Sbjct: 458 GMGRIYKAMAIIPE 471 >gi|159027143|emb|CAO86774.1| unnamed protein product [Microcystis aeruginosa PCC 7806] Length = 375 Score = 248 bits (632), Expect = 1e-63, Method: Composition-based stats. Identities = 96/367 (26%), Positives = 160/367 (43%), Gaps = 23/367 (6%) Query: 4 KLIRKIVNLIKKN--GQMTVDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEISQIFG 60 L I+ IK++ G+++ D++ L + P++GYY++ G+ GDF T+ + FG Sbjct: 2 NLEAIILEEIKQSAAGRISFDRWMDLALYHPDYGYYTSGKVEIGSKGDFFTSSSLGADFG 61 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 ++LA + E G LVE+G G GI+ DIL + DF+ LS ++E S+ Sbjct: 62 QLLAEQFVEMAEFLGNSRGFTLVEVGAGSGILAKDILDYLSDSYADFYQNLSYIIIEQSQ 121 Query: 121 RLTLIQKKQLASYGDK-INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMI 179 +L Q+ LA Y + +LAD L +NE D+ P+ + V+ +RE + Sbjct: 122 KLRERQRATLAGYSPVSWQSWPNLADNSLVGCV-FSNELIDAFPVHRVVIESGELREIYL 180 Query: 180 DIDQHDSLVF-NIGDHEIKSNFL------TCSDYFLGAIFENSPCRDREMQSISDRLACD 232 + + + ++ IK F S Y G E + +++++ +L Sbjct: 181 GLGEPFQEIIADLSTDRIKDYFDLVGINIPSSLYREGYQTEVNLLALDWLETVNRKLDR- 239 Query: 233 GGTAIVIDYGYL------QSRVGDTLQAVKGHTYVS-PLVNPGQADLSSHVDFQRLSSIA 285 G + IDYGY R TLQ + H P + G+ D+++HVDF L Sbjct: 240 -GYILTIDYGYTAEKYYHPQRSQGTLQCYRQHQRHDHPYLWVGEQDITTHVDFTALQRQG 298 Query: 286 ILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELF 345 L G T QG FL LG+ R L + + + L + +G F Sbjct: 299 EKLGLKNLGFTQQGLFLMALGLGDRLNQLSQGKSDILTIFQRRDAL-HQLINPTGLGG-F 356 Query: 346 KILVVSH 352 +L+ Sbjct: 357 GVLIQGK 363 >gi|302877437|ref|YP_003846001.1| hypothetical protein Galf_0192 [Gallionella capsiferriformans ES-2] gi|302580226|gb|ADL54237.1| protein of unknown function DUF185 [Gallionella capsiferriformans ES-2] Length = 388 Score = 247 bits (631), Expect = 2e-63, Method: Composition-based stats. Identities = 95/375 (25%), Positives = 156/375 (41%), Gaps = 31/375 (8%) Query: 2 ENKLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIF 59 +LI I I + G ++ +Y L + P GYY+ + FG GDF+TAPE+S +F Sbjct: 20 SARLIEAIHREIADQGGWISFARYMELALYAPGLGYYTAGAHKFGEAGDFITAPELSPLF 79 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 G LA + E+ ++ELG G G + +D+L + +L S ++E S Sbjct: 80 GRTLARQVAQIMEES----APHILELGAGSGKLAVDMLGELERLGRLPDSYC---ILEVS 132 Query: 120 ERLTLIQK----KQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIR 175 L Q+ + L +++W +L + G +VANE D+LP+ + + Sbjct: 133 ADLRARQQALIGQCLPHLLGRVHWLDALPEQVKG--AVVANEVLDALPVHLVRWQDSALS 190 Query: 176 ERMIDIDQHDS-LVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGG 234 E + +D+ V + E + S++ RL G Sbjct: 191 EIGVALDESGFVRVERAIADAQLLQAAQQIKVPDNYVSEICLAARGLVTSLACRLTQ--G 248 Query: 235 TAIVIDYG------YLQSRVGDTLQAVKGHTYVSP-LVNPGQADLSSHVDFQRLSSIAIL 287 T + IDYG Y R+ TL H PG D+++HV+F ++ I Sbjct: 249 TLLFIDYGFGAREFYHPQRLNGTLMCHYRHRAHDDAFFLPGLQDITAHVNFTDIAETGID 308 Query: 288 YKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARK-DILLDSVKRLVSTSADKKSMGELFK 346 L ++G T+Q FL GI + R L +++L S MGELFK Sbjct: 309 AGLELSGYTSQAFFLINNGIADLMVETSPEDLRAYLPLSAQLQKLTS----PAEMGELFK 364 Query: 347 ILVVSHEKVELM-PF 360 ++ +S + + F Sbjct: 365 VIALSKNRANPLSGF 379 >gi|91774586|ref|YP_544342.1| hypothetical protein Mfla_0230 [Methylobacillus flagellatus KT] gi|91708573|gb|ABE48501.1| protein of unknown function DUF185 [Methylobacillus flagellatus KT] Length = 386 Score = 247 bits (630), Expect = 2e-63, Method: Composition-based stats. Identities = 98/374 (26%), Positives = 158/374 (42%), Gaps = 26/374 (6%) Query: 2 ENKLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIF 59 +LI I + G ++ +Y L + P GYYS FG GDFVTAPE++ +F Sbjct: 17 SAQLIALIRQEVVDAGGWVSFARYMELALYAPGLGYYSAGAQKFGVAGDFVTAPEMTPLF 76 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLK--PDFFSVLSIYMVE 117 G+ LA ++ Q G ++ELG GRG + +L + P+ + +L + Sbjct: 77 GQTLARQVMAVLTQTGGS----ILELGAGRGKLAAVMLEELQLANALPERYEILEVSAGL 132 Query: 118 TSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 S + +Q + ++ W SL + G ++ANE D++P+ E G +E Sbjct: 133 RSVQQQYLQSVLAPALYARVAWLDSLPEAFTG--VVLANEVLDAVPVHLVRKEEAGWQEL 190 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRLACDGGT 235 + ++Q +L+ E SP + S++ L G Sbjct: 191 GVALNQEGNLMLASRPLVDAGLVSGIEVLALPEYYQTETSPAARALVASLAQCLQE--GV 248 Query: 236 AIVIDY------GYLQSRVGDTLQAVKGH-TYVSPLVNPGQADLSSHVDFQRLSSIAILY 288 + IDY Y R TL H + PL+ PG DL++HVDF ++ I Sbjct: 249 LLFIDYGFPRAEYYHPQRHQGTLMCHYRHYAHQDPLLYPGLQDLTAHVDFSAVAEAGIGN 308 Query: 289 KLYINGLTTQGKFLEGLGIWQRAFSLMK-QTARKDILLDSVKRLVSTSADKKSMGELFKI 347 L++ G TQ +FL GI + + R L + ++L+S MGELFK+ Sbjct: 309 GLHLLGYCTQAQFLINCGITELMSRVPAHDLMRYAPLASAAQKLLS----PAEMGELFKV 364 Query: 348 LVVSHEKVELMPFV 361 + + L FV Sbjct: 365 IALGRHVQPLSGFV 378 >gi|119486211|ref|ZP_01620271.1| hypothetical protein L8106_17747 [Lyngbya sp. PCC 8106] gi|119456702|gb|EAW37831.1| hypothetical protein L8106_17747 [Lyngbya sp. PCC 8106] Length = 409 Score = 247 bits (630), Expect = 2e-63, Method: Composition-based stats. Identities = 99/390 (25%), Positives = 165/390 (42%), Gaps = 36/390 (9%) Query: 2 ENKLIRKIVNLIKK--NGQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQI 58 + L I I + N +++ +Y + P+ GYY+T GA GDF TAP + Sbjct: 13 NSILREFITQQINESPNQRISFAEYMNWVLYHPQQGYYATPQTRIGASGDFFTAPHLGID 72 Query: 59 FGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVET 118 FGE+LA L+ WE P LVE+G G+GI+ DIL+ I + P F V+ ++E Sbjct: 73 FGELLAEQLVEMWEILHQPQPFTLVEMGAGQGILAADILQYIQRRYPHCFKVVDYIIIEK 132 Query: 119 SERLTLIQKKQLASYGDKINWYTSLADVPLGFT-----FLVANEFFDSLPIKQFVMTEHG 173 S L Q+++L + +NE D+LP+ Q ++ H Sbjct: 133 SAALKAEQQQKLNDQIGSSVSVR-WCEWDDIPNDSITGCFFSNELVDALPVHQVIVRNHQ 191 Query: 174 IRERMIDIDQH--DSLVFNIGDHEIKSNFLTC---------------SDYFLGAIFENSP 216 +RE + ++ H + N EI+++ T Y G E + Sbjct: 192 LREIYVALNTHSEGNNSINAYFTEIEADLSTPQLQTYFQSLKIDLLSEIYSDGYRTEINL 251 Query: 217 CRDREMQSISDRLACDGGTAIVIDYGYLQSRVG------DTLQAVKGHTYV-SPLVNPGQ 269 + +++++L G + IDYGY R TLQ H + +P ++ GQ Sbjct: 252 AALDWITTVTNKL--QQGFVLTIDYGYSAERYYSPTRASGTLQCYYQHRHHNNPYIHIGQ 309 Query: 270 ADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVK 329 D+++HVDF L L L + G T Q FL LG+ R ++ + + + + Sbjct: 310 QDITAHVDFTALEKQGELLGLEVIGFTQQALFLMALGLGDRIAAISQTQGQNLSEVLRRR 369 Query: 330 RLVSTSADKKSMGELFKILVVSHEKVELMP 359 + + + +G F +L+ S P Sbjct: 370 EALHSLINPMGLGN-FGVLIQSKGLATKNP 398 >gi|30249153|ref|NP_841223.1| hypothetical protein NE1166 [Nitrosomonas europaea ATCC 19718] gi|30180472|emb|CAD85077.1| DUF185 [Nitrosomonas europaea ATCC 19718] Length = 390 Score = 247 bits (630), Expect = 2e-63, Method: Composition-based stats. Identities = 92/371 (24%), Positives = 146/371 (39%), Gaps = 32/371 (8%) Query: 2 ENKLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIF 59 + L + I G ++ Y + PE GYYS FG GDFVTAPEIS +F Sbjct: 14 SDTLKTMLHERIAHSGGWISFADYMETVLYTPETGYYSGGAAKFGTAGDFVTAPEISPLF 73 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 G+ LA + ++E G G G + +D+L + +L Y+++ S Sbjct: 74 GQALARQIAPILSAVN---QGSILEFGAGSGKLAVDLLCALEELNNLPQ---HYYILDLS 127 Query: 120 ERLTLIQK----KQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIR 175 L Q+ + + +++W ++L + G ++ANE D++P+ I Sbjct: 128 ADLQQRQRAMIEQHIPHLASRVSWLSALPEQFEG--LILANEVLDAMPVHLVAWQNGNIA 185 Query: 176 ERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLAC---- 231 ER I + V+ F + + Sbjct: 186 ERG-VIWKDQGPVWQDQPLAAGELLDVARQLPPADQFSYPLYISEISLTNRHFICSLAML 244 Query: 232 -DGGTAIVIDY------GYLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHVDFQRLSS 283 G +++DY Y R T + + H + P PG D++SHVDF ++ Sbjct: 245 LQRGAILLVDYGFGQNEYYHPQRHQGTLMCHYRHHAHDDPFFLPGLQDITSHVDFSTIAR 304 Query: 284 IAILYKLYINGLTTQGKFLEGLGIWQRAFSL-MKQTARKDILLDSVKRLVSTSADKKSMG 342 A+ L + G TTQ FL GI Q L+ V+RLVS MG Sbjct: 305 TALDSGLQLAGYTTQAHFLINCGITDLLARTPADQPGSYLPLVSQVQRLVS----PAEMG 360 Query: 343 ELFKILVVSHE 353 ELFK++V+S + Sbjct: 361 ELFKVMVLSRD 371 >gi|113478172|ref|YP_724233.1| hypothetical protein Tery_4813 [Trichodesmium erythraeum IMS101] gi|110169220|gb|ABG53760.1| protein of unknown function DUF185 [Trichodesmium erythraeum IMS101] Length = 397 Score = 246 bits (629), Expect = 3e-63, Method: Composition-based stats. Identities = 95/370 (25%), Positives = 155/370 (41%), Gaps = 23/370 (6%) Query: 2 ENKLIRKIVNLIK--KNGQMTVDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEISQI 58 L I I +N ++T +Y L + P++GYY+T G GDF+T+P Sbjct: 12 NENLCTIIYKSISESQNKRITFAEYMDLVLYHPQYGYYATHPVNIGKQGDFLTSPHWGSD 71 Query: 59 FGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVET 118 FGE+LA + W P+ +VE+G G+GI+ IL + + DFF + +VE Sbjct: 72 FGELLAEQFLQMWHILQRPNNFTIVEMGAGQGILAEQILGYLKQKHLDFFQTVEYLIVEK 131 Query: 119 SERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERM 178 SE L + QK+ L SY + + + ++ + F +NE D+ P+ +F + E I+E Sbjct: 132 SEVLKVQQKQILQSYQVRWSDWDKISHSSITGCF-FSNELVDAFPVHKFRIEEREIKEIY 190 Query: 179 IDIDQHDSLVFNIGDHEIKSNFLTCSDYF---------LGAIFENSPCRDREMQSISDRL 229 + + V + G E + ++ +S++L Sbjct: 191 VSSNSQGKFVEITDKISTPEIAEYFNLVDIDLLSFVDVEGYQSEVNLQALDWIKIVSNKL 250 Query: 230 ACDGGTAIVIDYGYLQSRVGDTLQ-------AVKGHTYVSPLVNPGQADLSSHVDFQRLS 282 G + IDYGY R + ++ + H P N G+ D+++HVDF L Sbjct: 251 LK--GYLLTIDYGYQAVRYYNPVRKEGTLQCYYQHHRNNDPYWNVGRQDITAHVDFTALE 308 Query: 283 SIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMG 342 L L G T QG FL LG+ R L + + + + D +G Sbjct: 309 KQGNLLDLETLGFTKQGLFLMALGLGDRLNELSNNQGFSVEEVFRRREALHSLIDPIGLG 368 Query: 343 ELFKILVVSH 352 F +LV S Sbjct: 369 N-FGVLVQSK 377 >gi|91204830|ref|YP_537185.1| hypothetical protein RBE_0015 [Rickettsia bellii RML369-C] gi|157826399|ref|YP_001495463.1| hypothetical protein A1I_00070 [Rickettsia bellii OSU 85-389] gi|91068374|gb|ABE04096.1| unknown [Rickettsia bellii RML369-C] gi|157801703|gb|ABV78426.1| hypothetical protein A1I_00070 [Rickettsia bellii OSU 85-389] Length = 366 Score = 246 bits (629), Expect = 3e-63, Method: Composition-based stats. Identities = 119/366 (32%), Positives = 190/366 (51%), Gaps = 20/366 (5%) Query: 8 KIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFL 67 KI +I+++G +T D+ + YY GDF TAPE+SQ+FGE++ ++ Sbjct: 6 KIREIIEQSGYITCDRLMQEVLHVSPTSYYRQTKSLAEEGDFTTAPEVSQLFGEIIGLWC 65 Query: 68 ICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQK 127 I W++ G P + +VELGPGRG++M D+LR L P+F++ LSI +++ +E + QK Sbjct: 66 IKEWQRIGSPKNLSIVELGPGRGLLMRDLLRTAK-LVPEFYNALSINLIDINENFIVQQK 124 Query: 128 KQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ-HDS 186 L ++ INWY S+ D+P ++ANEFFD++PIKQ++ + ER+ + Sbjct: 125 SNLQNFDLPINWYASIEDIPKKPALIIANEFFDAMPIKQYIKVKESWYERIFVVQPVDGK 184 Query: 187 LVFNIGDHEIKSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY 243 + ++ + D GA+ E S M+ IS+ + GG+ ++IDYGY Sbjct: 185 IKYDKIAVSKQLQEYLQKTHLDAKDGAVLEESYKSIEIMKFISEHIKELGGSGLIIDYGY 244 Query: 244 -------LQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLT 296 + + TLQA+K H Y + N G+ DLS+HVDF L ++A K+ + Sbjct: 245 DINPNIRTRYQYNSTLQAIKNHKYCPIIENLGEEDLSAHVDFYVLKTVAQNSKINVIDTI 304 Query: 297 TQGKFLEGLGIWQRAFSLMK--QTARKDILLDSVKRLVSTSADKKSMGELFKILVV--SH 352 +Q FL GI R +L + +++ V RL+S K MG LFK+L V + Sbjct: 305 SQRDFLIENGILLRKQTLQNKLNPEQAELIERQVNRLISL----KEMGGLFKVLQVMKTP 360 Query: 353 EKVELM 358 V Sbjct: 361 PTVSPP 366 >gi|291612639|ref|YP_003522796.1| hypothetical protein Slit_0167 [Sideroxydans lithotrophicus ES-1] gi|291582751|gb|ADE10409.1| protein of unknown function DUF185 [Sideroxydans lithotrophicus ES-1] Length = 384 Score = 246 bits (629), Expect = 3e-63, Method: Composition-based stats. Identities = 97/376 (25%), Positives = 151/376 (40%), Gaps = 31/376 (8%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIF 59 KL I I +G + ++ L + P GYY+ FG GDF+TAPE+S +F Sbjct: 16 SAKLCELIRGDIAAQSGWIPFSRFMELALYAPGLGYYTAGALKFGEAGDFITAPELSSLF 75 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 G LA L+ + ELG G G + +DIL + +L S ++E S Sbjct: 76 GHTLARQLVEVMHAS----APHIFELGAGSGKLAVDILGELERLGELPES---YSILEVS 128 Query: 120 ERLTLIQK----KQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIR 175 L Q+ K L +++ W +L + G ++ NE D+LP+ + I Sbjct: 129 ADLRERQQALLGKHLPHLVERVRWLDTLPEKISG--AVIGNEVLDALPVHLLYWSNRRIL 186 Query: 176 ERMIDID-QHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGG 234 ER + V D +F + E S + S+ +R+ G Sbjct: 187 ERGVTSKATRFLWVDRELDVPALLDFAKNLKVPDDYLSEVSLTTRGLIASLCERMDK--G 244 Query: 235 TAIVIDY------GYLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAIL 287 I IDY Y R T + + H++ P PG D++SHVDF ++ AI Sbjct: 245 ALIFIDYGFGAGEYYHPQRSRGTLMCHYRHHSHDDPFYLPGLQDITSHVDFTAVAEAAID 304 Query: 288 YKLYINGLTTQGKFLEGLGIWQRAFSLM-KQTARKDILLDSVKRLVSTSADKKSMGELFK 346 + G T+Q FL GI + + L +++L S MGELFK Sbjct: 305 HGASFLGYTSQAHFLFNNGITDHLGKVSPEDVKAYAPLSAQLQKLTS----PAEMGELFK 360 Query: 347 ILVVSHE-KVELMPFV 361 ++ + L F+ Sbjct: 361 VIALGKGIDQPLAGFL 376 >gi|85710056|ref|ZP_01041121.1| hypothetical protein NAP1_14263 [Erythrobacter sp. NAP1] gi|85688766|gb|EAQ28770.1| hypothetical protein NAP1_14263 [Erythrobacter sp. NAP1] Length = 324 Score = 246 bits (628), Expect = 4e-63, Method: Composition-based stats. Identities = 113/328 (34%), Positives = 167/328 (50%), Gaps = 15/328 (4%) Query: 36 YYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLD 95 YY++ +P G DF+TAPE+SQ+FGE++ ++L W + G + VELGPGRG + D Sbjct: 8 YYTSRDPLGEDADFITAPEVSQMFGELIGLWLADLWVRMGSRKRIHYVELGPGRGTLASD 67 Query: 96 ILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLADVPLG-FTFLV 154 LR + + ++ VETS L IQ + + L+ +P +V Sbjct: 68 ALRTAARYE----FAPQVHFVETSPALRKIQLEAFPD----AQHHDDLSTLPDDAPLLIV 119 Query: 155 ANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFEN 214 ANEFFD+LPI Q V + +G ER++ ++ + + + G + E Sbjct: 120 ANEFFDALPIHQLVRSANGWHERLVGLEDDEFVFVAGDKPMDSIVPRSWKSASQGTMIET 179 Query: 215 SPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSS 274 S MQ I+ RL GG A++IDYG + R G TLQA++ H V +PGQADL++ Sbjct: 180 SAAASAVMQEIAGRLKEQGGAALIIDYGAFELRAGSTLQAIRSHEKVDVFAHPGQADLTA 239 Query: 275 HVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLM-KQTARKDILLDSVKRLVS 333 HVDF+ L +A + GL QG++L +GI R +L K A KD L RLV Sbjct: 240 HVDFEMLKDVAEKNGADVMGLQMQGEWLRQMGIETRLEALQRKNPAEKDKLKRQFDRLV- 298 Query: 334 TSADKKSMGELFKILVVSHEKVEL-MPF 360 D MG LFK+L + + + + F Sbjct: 299 ---DDGQMGLLFKVLGICGRRWPIGVGF 323 >gi|149185006|ref|ZP_01863323.1| hypothetical protein ED21_18172 [Erythrobacter sp. SD-21] gi|148831117|gb|EDL49551.1| hypothetical protein ED21_18172 [Erythrobacter sp. SD-21] Length = 351 Score = 246 bits (628), Expect = 4e-63, Method: Composition-based stats. Identities = 124/357 (34%), Positives = 180/357 (50%), Gaps = 20/357 (5%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGE 61 L LI+++G M V +Y YY++ +P GA GDF TAPEISQ+FGE Sbjct: 7 NTDLAASFRRLIERHGPMPVSRYMGES----NARYYTSRDPLGAGGDFTTAPEISQMFGE 62 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 M+ ++L W + G P ELGPGRG + D LR + +++VE S Sbjct: 63 MVGLWLADLWSRSGHPQA-IYAELGPGRGTLARDALRAMASQGLRP----PVHLVEGSAA 117 Query: 122 LTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 L +Q LA +L D ++VANEF D+LPI+Q VMT G RER++ + Sbjct: 118 LREVQADALAGAQFH-ESIDTLPD--DRPLYIVANEFLDALPIRQLVMTTRGWRERLVAL 174 Query: 182 DQHDSLVFNIGDHEIKSNFLTCSDYFL-GAIFENSPCRDREMQSISDRLACDGGTAIVID 240 D VF G + + L G + E P ++ ++ R+ GG A+ ID Sbjct: 175 -DGDRFVFAAGPNPMDDAVLEERRAQDVGTVIETCPGAAAVIEDLARRIDRQGGAALFID 233 Query: 241 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 YGYL+SR G+TLQAVK H V PG+ DL++ VDF L+ IA L + TQG Sbjct: 234 YGYLESRTGETLQAVKAHGKVGVFDAPGEMDLTALVDFAELAQIARQEGLAVE-TATQGA 292 Query: 301 FLEGLGIWQRAFSLMK-QTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE 356 +L+ +G+ RA +L + A + + + RL T MGELFK++ ++ + Sbjct: 293 WLDAMGLGLRAKALSERSPAHAEEIARACNRLAGTG----EMGELFKVMAITPGQAP 345 >gi|256071528|ref|XP_002572092.1| hypothetical protein [Schistosoma mansoni] gi|238657243|emb|CAZ28322.1| conserved hypothetical protein [Schistosoma mansoni] Length = 360 Score = 246 bits (627), Expect = 6e-63, Method: Composition-based stats. Identities = 108/330 (32%), Positives = 173/330 (52%), Gaps = 15/330 (4%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 + L ++++ I G +TV +Y C+++P +GYY+T + FG GDF T+PEI QIFG Sbjct: 33 LSETLKKQLLERINTFGPLTVSEYMKECLSNPLYGYYNTHSVFGKSGDFTTSPEICQIFG 92 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 E++ ++L+ W++ P ++LVELGPGRG + DILRV K P+ +S LSI++VE S+ Sbjct: 93 ELIGVWLLEEWKRQNNPKHLQLVELGPGRGTLCSDILRVFSKF-PEIYSTLSIHLVEISQ 151 Query: 121 RLTLIQKK-------QLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHG 173 + QK+ L + I W+T L VP F+F + +EFFD LP+ +F E Sbjct: 152 SMRQTQKQTIEKTLSHLNNKPPLIFWHTDLRQVPENFSFFIGHEFFDVLPVHRFQKHEGK 211 Query: 174 IRERMIDIDQH-DSLVFNIGDHEIKSNFLTC---SDYFLGAIFENSPCRDREMQSISDRL 229 E ++ + +SL F + ++ + + E P Q + R+ Sbjct: 212 WHEVLVSSMMNSNSLCFVRSGTKTPASIVYLPLVPNLSNRDSVEVCPDMICVAQLLCKRI 271 Query: 230 ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYK 289 GG+A++IDYG+ + GDT + H+ PL+NPG DL+ VDF L Sbjct: 272 NKTGGSALLIDYGHEGEK-GDTFRGFHKHSVCDPLINPGHTDLTCDVDFSILCHAVKNSD 330 Query: 290 LYI--NGLTTQGKFLEGLGIWQRAFSLMKQ 317 + +G TQ FL +G++ R + + Sbjct: 331 AKVRLHGPVTQAYFLINMGLFTRLKVTIDK 360 >gi|126658233|ref|ZP_01729383.1| hypothetical protein CY0110_12577 [Cyanothece sp. CCY0110] gi|126620382|gb|EAZ91101.1| hypothetical protein CY0110_12577 [Cyanothece sp. CCY0110] Length = 378 Score = 245 bits (626), Expect = 6e-63, Method: Composition-based stats. Identities = 87/365 (23%), Positives = 143/365 (39%), Gaps = 18/365 (4%) Query: 5 LIRKIVNLIKK--NGQMTVDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEISQIFGE 61 +++ I++ IK N ++T Y L + + GYYS+ G+ GDF TA + FGE Sbjct: 1 MLKIIIDTIKNSPNQRITFADYMDLVLYHTQHGYYSSGKVNIGSEGDFFTASSLGSDFGE 60 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 +LA + L+E+G G G + DIL + + D + ++E S+ Sbjct: 61 LLAEQFKEMSQLLNCSDSFTLIEVGAGTGNLAADILNYLKEKYSDCYDQFDYIIIEESQE 120 Query: 122 LTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 L QK +L + + +NE D+ P+ Q + ++E + Sbjct: 121 LIKEQKNKLEKFDKITWKSWEDIPNNSINGCIFSNELIDAFPVHQVIKKNKQLKEIYVTW 180 Query: 182 DQHD--------SLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDG 233 + S + E+ +T +Y E + ++++S +L Sbjct: 181 EDEQLKEKLEDISTPKLLDYFELIHIDITKDNYPENYRTEVNLKALDWLKTVSKKLNKGY 240 Query: 234 GTAIVIDY----GYLQSRVGDTLQAVKGH-TYVSPLVNPGQADLSSHVDFQRLSSIAILY 288 I Y Y R TL H + +P VN GQ D+++HVDF L L Sbjct: 241 LLTIDYGYDASKYYHPQRYQGTLNCYYKHRHHHNPYVNLGQQDMTAHVDFTALEKQGNLL 300 Query: 289 KLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKIL 348 L GLT QG FL LG+ R L + L + +G FK+L Sbjct: 301 GLETVGLTQQGLFLMALGLGDRLAELSNGNYSLPEIFQRRDAL-HQLINPTGLGG-FKVL 358 Query: 349 VVSHE 353 + S + Sbjct: 359 IQSKK 363 >gi|284053133|ref|ZP_06383343.1| hypothetical protein AplaP_16842 [Arthrospira platensis str. Paraca] gi|291568912|dbj|BAI91184.1| hypothetical protein [Arthrospira platensis NIES-39] Length = 389 Score = 245 bits (626), Expect = 6e-63, Method: Composition-based stats. Identities = 90/376 (23%), Positives = 153/376 (40%), Gaps = 29/376 (7%) Query: 1 MENKLIRKIVNLIKKN--GQMTVDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEISQ 57 + L+ +I I + +T +Y + + DP+ GYY+ +P GA GDF T+P + Sbjct: 2 VSVTLVDRISQRIGNHPQNPITFAEYMEMVLYDPQSGYYNHNSPQIGAQGDFFTSPHLGS 61 Query: 58 IFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVE 117 FGE+LA L+ WE G P LVE+G G+GI+ DI+ + + P VL + E Sbjct: 62 DFGELLAEQLVEMWEVLGKPEPFTLVEMGAGQGILAADIIGYLQRQYPQVVRVLDYAIAE 121 Query: 118 TSERLTLIQKKQLASYGDKINWYT----SLADVPLGFTFLVANEFFDSLPIKQFVMTEHG 173 S RL Q+++L G+ +NE D+ P+ ++ Sbjct: 122 KSRRLKTEQQQRLQQLGEPFTQIRWCNLDDIANHSITGCFFSNELIDAFPVHLVTRQDNQ 181 Query: 174 IRERMIDIDQHDSLVFNIGDHEIKSNFLTCSD-------------YFLGAIFENSPCRDR 220 ++E + S F + + + + +D Y G E + Sbjct: 182 LQEIYVT-TTGKSPNFQLAEVVGELSTPQLADYFRLVGIDLLSDAYPEGYRTEVNLAALE 240 Query: 221 EMQSISDRLACDGG----TAIVIDYGYLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSH 275 M++++ +L D Y +R TLQ H + P + GQ D+++H Sbjct: 241 WMETVARKLQRGFVLTIDYGYSADRLYSPTRREGTLQCYYQHRHHNDPYIYIGQQDITAH 300 Query: 276 VDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTS 335 VDF L L G T Q FL LG+ R ++ + + K + + + + Sbjct: 301 VDFTALQQKGRSLGLQTIGFTQQALFLMALGLGDRIATVSE--SPKISQVLRRREALHSL 358 Query: 336 ADKKSMGELFKILVVS 351 D +G F +L+ Sbjct: 359 IDPMGLGN-FGVLIQG 373 >gi|77163584|ref|YP_342109.1| hypothetical protein Noc_0037 [Nitrosococcus oceani ATCC 19707] gi|254435577|ref|ZP_05049084.1| conserved hypothetical protein [Nitrosococcus oceani AFC27] gi|76881898|gb|ABA56579.1| Protein of unknown function DUF185 [Nitrosococcus oceani ATCC 19707] gi|207088688|gb|EDZ65960.1| conserved hypothetical protein [Nitrosococcus oceani AFC27] Length = 393 Score = 245 bits (626), Expect = 7e-63, Method: Composition-based stats. Identities = 92/382 (24%), Positives = 147/382 (38%), Gaps = 40/382 (10%) Query: 2 ENKLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIF 59 KL I I + GQ+ ++ L + P GYY + G GDF+TAPE+S +F Sbjct: 20 SQKLENVIQTTIEQAGGQIPFARFMELALYTPGLGYYMAGLHKLGTFGDFITAPELSPLF 79 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 ++ +E G ++E G G G + D+L + +++E S Sbjct: 80 ARCISRQCQQIFELLG---TGDILEFGAGSGRLAADLLSELNLSG---NLPERYFILELS 133 Query: 120 ERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIR 175 L Q++ L +++W L D G F++ANE D++P F + Sbjct: 134 ADLRHRQQETLYQRVPLLASRVSWLDRLPDRIDG--FILANEVCDAMPTHCFQLENGYDW 191 Query: 176 ERMIDIDQHDSLVFNIGDHEIKS--------NFLTCSDYFLGAIFENSPCRDREMQSISD 227 ER + + F + + E + + I+ Sbjct: 192 ERYVGY---EKGKFVWKKGPLSHPLLKDRIAKIRLLLKHVNSYESEINLAMEGWTTEIAH 248 Query: 228 RLACDGGTAIVIDY------GYLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQR 280 RL G ++IDY Y R+ TL H +PL+ G D+++HVDF Sbjct: 249 RLRK--GMLLIIDYGFPRHEYYHPERMMGTLMCHYRHQAHPNPLIMAGLQDITTHVDFTA 306 Query: 281 LSSIAILYKLYINGLTTQGKFLEGLGIWQRAF-SLMKQTARKDILLDSVKRLVSTSADKK 339 L+ L + G TQ FL G+ + A + + +KRLV Sbjct: 307 LAEAGHSSGLRVAGYCTQADFLLACGLDKLAATEIAAGEKQALETSQQIKRLVL----PS 362 Query: 340 SMGELFKILVVSHE-KVELMPF 360 MGELFK L ++ E L+ F Sbjct: 363 EMGELFKALALTREINQPLLGF 384 >gi|224827081|ref|ZP_03700178.1| protein of unknown function DUF185 [Lutiella nitroferrum 2002] gi|224600747|gb|EEG06933.1| protein of unknown function DUF185 [Lutiella nitroferrum 2002] Length = 383 Score = 245 bits (625), Expect = 8e-63, Method: Composition-based stats. Identities = 98/367 (26%), Positives = 152/367 (41%), Gaps = 30/367 (8%) Query: 1 MENKLIRKIVNLIKKN-GQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQI 58 + +L R I I + G + +Y L + P GYYS FGA GDFVTAPE+S Sbjct: 14 VSQELSRHIAAEIATHDGWIPFSRYMELALYAPSLGYYSAGSRKFGAAGDFVTAPELSPY 73 Query: 59 FGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLK--PDFFSVLSIYMV 116 FG LA L Q G L E G G G + +DIL + L P+ ++ ++ Sbjct: 74 FGRTLARQLAELLPQTGGT----LYEFGAGTGRLAVDILTELEALGQLPERYA-----II 124 Query: 117 ETSERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEH 172 + S L Q++ L ++ W + L + G ++ NE D++P + T Sbjct: 125 DLSADLVERQRQTLAEALPHLAGRVEWLSELPEQFDG--VIIGNEVLDAMPCELLHWTPT 182 Query: 173 GIRERMIDIDQHDSLVFN-IGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLAC 231 + + D + I D + + G + E S + +++ RL Sbjct: 183 PQQRGVTVRDGAFAWEDRPIADPRLAAVAAALPPEAAGYLSEVSLANRAFIATLAARLVR 242 Query: 232 DGGTAIVI-----DYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAI 286 I +Y + Q +G + + HT P PG DL+SHVDF ++ Sbjct: 243 GAILLIDYGFPEREYYHPQRHMGTLIGHYRHHTVDDPFYLPGLMDLTSHVDFTAVALAGT 302 Query: 287 LYKLYINGLTTQGKFLEGLGIWQRAFSLMKQT-ARKDILLDSVKRLVSTSADKKSMGELF 345 L + G TTQ +FL GI L AR + +V++L+S MGELF Sbjct: 303 DAGLDLIGYTTQAQFLVNAGITALLQQLDPDDVARYAPRVAAVQKLLS----PNEMGELF 358 Query: 346 KILVVSH 352 K++ Sbjct: 359 KVIGFGK 365 >gi|153206142|ref|ZP_01945405.1| conserved hypothetical protein [Coxiella burnetii 'MSU Goat Q177'] gi|120577272|gb|EAX33896.1| conserved hypothetical protein [Coxiella burnetii 'MSU Goat Q177'] Length = 388 Score = 245 bits (624), Expect = 1e-62, Method: Composition-based stats. Identities = 94/373 (25%), Positives = 162/373 (43%), Gaps = 26/373 (6%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFG 60 +L IV I +NG +T D+Y L + P GYYS FGA GDFVTAPEIS +F Sbjct: 17 SEQLRLHIVREIAENGPLTFDRYMQLALYAPGLGYYSAGSRKFGAAGDFVTAPEISSLFS 76 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 + +A ++ELG G G M DILR + + +++E S Sbjct: 77 QCVARQCQQILIDLNGGD---ILELGAGSGRMAADILRELQHTGCLPHN---YFILEISA 130 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTF---LVANEFFDSLPIKQFVMTEHGIRER 177 L Q+K + + +++ + F ++ NE D++P+ +F ++GI+E Sbjct: 131 DLRDRQEKFIKNEIPELSHRVKWLNRLPSPHFKGVILGNEVIDAMPVHKF-KIDNGIKEV 189 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGG 234 ++ +++ V+ IG+ + + + G E + + S++D L Sbjct: 190 YVN-WKNEQFVWEIGEPSAALSDYIKNLTIHFPEGYESEVNLLLKGWIASLADILQEGLI 248 Query: 235 TAIVI-----DYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYK 289 I +Y + G + H++ PL+ G D+++HVDF ++ A Sbjct: 249 LLIDYGFPRHEYYHTDRDRGTIACHYRHHSHFDPLILTGIQDITAHVDFTAIAEAAAKQG 308 Query: 290 LYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILV 349 L + G T Q FL GI ++ A + +K+L MGELFK + Sbjct: 309 LAVEGFTHQAGFLLNCGIATLMPQ-VEDVAEHYRIAQEIKKLTL----PGEMGELFKAIA 363 Query: 350 VSHE-KVELMPFV 361 ++ + L+ F+ Sbjct: 364 LTRNYRQSLLGFI 376 >gi|83591981|ref|YP_425733.1| hypothetical protein Rru_A0642 [Rhodospirillum rubrum ATCC 11170] gi|83574895|gb|ABC21446.1| Protein of unknown function DUF185 [Rhodospirillum rubrum ATCC 11170] Length = 364 Score = 245 bits (624), Expect = 1e-62, Method: Composition-based stats. Identities = 125/363 (34%), Positives = 196/363 (53%), Gaps = 19/363 (5%) Query: 6 IRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAI 65 + ++ ++ +G + V+ + A C+ + YY+ + GA GDF+TAPE +QIFGE+L + Sbjct: 13 VEALIARLR-SGPLPVEDWMAACLGE----YYARGDVLGAGGDFITAPECTQIFGELLGL 67 Query: 66 FLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLI 125 + W+ G P + LVELGPGRG +M D LR + + P F LS+++VETS L Sbjct: 68 WSAVVWQAMGSPERINLVELGPGRGTLMADALRALAPV-PAFRRALSVHLVETSPGLRAR 126 Query: 126 QKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHD 185 QK++L + G + W+ L VP G ++ANEF D+LPI+Q++ G RER++ + Sbjct: 127 QKQKLRASGVTVFWHERLDTVPSGPMIVLANEFLDALPIRQYLRDAEGWRERLVGLAGEG 186 Query: 186 SLVFNIGDHEIKSNFLTCS-----DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 + + + +G E P + ++ RLA DGG + +D Sbjct: 187 -PALTFTQGPLLGDPPPLAQPAHLQARVGEEIEVCPQAQAVVAQVAARLAADGGAGLFLD 245 Query: 241 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 YG S GD+LQA++ H + + L NPG DL++HVDFQ ++ +A ++G+ QG Sbjct: 246 YGPAHSAPGDSLQALRRHRFAAVLENPGAVDLTAHVDFQAMARVAAAAGAAVDGIVQQGP 305 Query: 301 FLEGLGIWQRAFSLMKQTA--RKDILLDSVKRLVSTSADKKSMGELFKILVV-SHEKVEL 357 FL+ LG+ RA LM A +K + +V+RL+ D MG LFK+L + S L Sbjct: 306 FLQSLGLEARAGVLMANAATDQKREIRFAVRRLI----DSSEMGTLFKVLGLRSPSMARL 361 Query: 358 MPF 360 F Sbjct: 362 PGF 364 >gi|170579548|ref|XP_001894878.1| hypothetical protein [Brugia malayi] gi|158598369|gb|EDP36276.1| conserved hypothetical protein [Brugia malayi] Length = 427 Score = 245 bits (624), Expect = 1e-62, Method: Composition-based stats. Identities = 117/383 (30%), Positives = 187/383 (48%), Gaps = 36/383 (9%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC--NPFGAVGDFVTAPEISQIF 59 ++L+ I I NG M+V +Y L + P GYYS FG GDF+TAPE++Q+F Sbjct: 38 SDQLLHFIKQKINLNGPMSVAEYMRLTASSPIGGYYSRHGSKIFGEKGDFITAPELTQMF 97 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 GE++ I+ G +LVE GPG G +M DI R + +LK S+ ++VETS Sbjct: 98 GELIGIWCYYELINTGHSEEWQLVENGPGTGQLMSDITRTLRRLKVTKGSI---HLVETS 154 Query: 120 ERLTLIQKKQLASYGDK------------------INWYTSLADVPLGFTFLVANEFFDS 161 + L Q+ L + + I WY ++ D+P F+ ++NEF D+ Sbjct: 155 DALLDQQESLLCEHPSQFIDGKSYVRCNVTKNGFPIYWYRNVDDIPAQFSIFISNEFLDA 214 Query: 162 LPIKQFVMTEH-GIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAI----FENSP 216 LP+ QF + E +++++ D L F + E F + +E S Sbjct: 215 LPVNQFKRDDEGKWHEVYVNLNKDDKLCFMLSKSENLHTFGLLPKKIREDLSIKEWEISI 274 Query: 217 CRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHV 276 + ++D + GG +++DYG+ +R +L+A KGH V PL NPG+ D+++ V Sbjct: 275 DAGTYVNQVTDSITKFGGFVLIVDYGHNGTRKDLSLRAYKGHQIVHPLENPGEHDITADV 334 Query: 277 DFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLM---KQTARKDILLDSVKRLVS 333 +F L S+ L + G Q +F +GI R L+ K K LL S + L+S Sbjct: 335 NFGYLKSLVEDRTL-VFGPIEQREFFAQMGIGLRLQRLLTCCKTEEDKQNLLKSCEILLS 393 Query: 334 TSADKKSMGELFKILVVSHEKVE 356 +K MGE FK++ + + +E Sbjct: 394 ----EKGMGERFKVMSIFPKTLE 412 >gi|212217802|ref|YP_002304589.1| hypothetical protein CbuK_0124 [Coxiella burnetii CbuK_Q154] gi|212012064|gb|ACJ19444.1| hypothetical protein CbuK_0124 [Coxiella burnetii CbuK_Q154] Length = 417 Score = 244 bits (623), Expect = 1e-62, Method: Composition-based stats. Identities = 94/373 (25%), Positives = 162/373 (43%), Gaps = 26/373 (6%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFG 60 +L IV I +NG +T D+Y L + P GYYS FGA GDFVTAPEIS +F Sbjct: 46 SEQLRLHIVREIAENGPLTFDRYMQLALYAPGLGYYSAGSRKFGAAGDFVTAPEISSLFS 105 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 + +A ++ELG G G M DILR + + +++E S Sbjct: 106 QCVARQCQQILIDLNGGD---ILELGAGSGRMAADILRELQHTGCLPHN---YFILEISA 159 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTF---LVANEFFDSLPIKQFVMTEHGIRER 177 L Q+K + + +++ + F ++ NE D++P+ +F ++GI+E Sbjct: 160 DLRDRQEKFIKNEIPELSHRVKWLNRLPSPHFKGVILGNEVIDAMPVHKF-KIDNGIKEV 218 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGG 234 ++ +++ V+ IG+ + + + G E + + S++D L Sbjct: 219 YVN-WKNEQFVWEIGEPSAALSDYIKNLTIHFPEGYESEVNLLLKGWIASLADILQEGLI 277 Query: 235 TAIVI-----DYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYK 289 I +Y + G + H++ PL+ G D+++HVDF ++ A Sbjct: 278 LLIDYGFPRHEYYHTDRDRGTIACHYRHHSHFDPLILTGIQDITAHVDFTAIAEAAAKQG 337 Query: 290 LYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILV 349 L + G T Q FL GI ++ A + +K+L MGELFK + Sbjct: 338 LAVEGFTHQAGFLLNCGIATLMPQ-VEDVAEHYRIAQEIKKLTL----PGEMGELFKAIA 392 Query: 350 VSHE-KVELMPFV 361 ++ + L+ F+ Sbjct: 393 LTRNYRQSLLGFI 405 >gi|193211224|ref|NP_499246.2| hypothetical protein ZK1128.1 [Caenorhabditis elegans] gi|166231760|sp|Q09644|MIDA_CAEEL RecName: Full=Protein midA homolog, mitochondrial gi|154147255|emb|CAA87427.3| C. elegans protein ZK1128.1, confirmed by transcript evidence [Caenorhabditis elegans] Length = 426 Score = 244 bits (623), Expect = 1e-62, Method: Composition-based stats. Identities = 122/386 (31%), Positives = 183/386 (47%), Gaps = 39/386 (10%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYST----CNPFGAVGDFVTAPEISQ 57 N L + +V+ I+ +G +TV +Y CV+ P GYY FGA GDF+T+PE++Q Sbjct: 32 TNHLKKFLVDKIRVSGPITVAEYMKTCVSAPLVGYYGQFSKDQKVFGAKGDFITSPELTQ 91 Query: 58 IFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVE 117 +FGEM+ +++ G +LVELGPGR +M D+L + K SV ++VE Sbjct: 92 LFGEMIGVWVFHELANTGHKGSWQLVELGPGRAQLMNDVLNALAKFNDKDVSV---HLVE 148 Query: 118 TSERLTLIQKKQLASYGD------------------KINWYTSLADVPLGFTFLVANEFF 159 TS+ L Q+K L Y I WY S+ D+P GFT + NEF Sbjct: 149 TSDALIDEQEKSLCIYTSKNSIDTPFIRKNKTRTGVNIYWYKSIDDIPDGFTVFIGNEFL 208 Query: 160 DSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGA----IFENS 215 D+LPI QF + E +++ + L F E +E S Sbjct: 209 DALPIHQFHKSGDSWNEVYVNLTKDGDLCFMKSKGENLHTKGLIPSAIRDDSSRVTWECS 268 Query: 216 PCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSH 275 P + I DR+ GG ++++DYG+ SR + +A K H V L NPG ADL++ Sbjct: 269 PESGTVVNQIVDRITTFGGFSLLVDYGHDGSRNTHSFRAYKNHKQVDTLENPGLADLTAD 328 Query: 276 VDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLM---KQTARKDILLDSVKRLV 332 VDF L S + ++ I G Q +FL LGI R L+ K +++ L++S L+ Sbjct: 329 VDFGYL-STLVKDRVVIYGPNEQREFLAQLGIEHRLRRLLQVCKDRKQQEQLIESYNMLM 387 Query: 333 STSADKKSMGELFKILVVSHEKVELM 358 MG FK + + +E + Sbjct: 388 ------GDMGLKFKAWALFPKTLEFI 407 >gi|117926895|ref|YP_867512.1| hypothetical protein Mmc1_3621 [Magnetococcus sp. MC-1] gi|117610651|gb|ABK46106.1| protein of unknown function DUF185 [Magnetococcus sp. MC-1] Length = 393 Score = 244 bits (623), Expect = 2e-62, Method: Composition-based stats. Identities = 103/381 (27%), Positives = 175/381 (45%), Gaps = 25/381 (6%) Query: 2 ENKLIRKIVNLIKKNGQM-TVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIF 59 L +++ K++G + + ++ + + P +GYY + G GDF TAPE++ +F Sbjct: 9 SEALQSELIEWAKEHGGILSFRKFMEMALYHPSYGYYMRKWSRLGVEGDFTTAPEMTSLF 68 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 GE+L + ++ W++ G P+ ++E+G G G + D+LR K PDF+ LS+ ++E S Sbjct: 69 GELLTLQMMEVWQRMGSPAFFAVMEVGAGSGKLAGDVLRTAKKF-PDFYDALSLIILEKS 127 Query: 120 ERLTLIQKKQLASY---GDKINWYTSLADVPLGFTF---LVANEFFDSLPIKQFVMTEHG 173 +Q + L K+ W L F + NE D+ P+ TE G Sbjct: 128 PDFRRVQAEFLQKKGVDIHKVRWVYDLDAWEGEGAFQGVVYGNEVLDAFPVHWVEQTEQG 187 Query: 174 IRERMIDID---QHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLA 230 ++E + D + LV + + + G E S + + IS + Sbjct: 188 LKEVVAQWDGRSWCEQLVEPESALQGDYFKVRGIELETGWRTEFSLDAQQWLGRISANME 247 Query: 231 CDGGTAIVIDYGYLQSRV------GDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSS 283 G ++IDYGY+ TL A + H + P + PG DL++HVDF + Sbjct: 248 Q--GAVLMIDYGYVAQDYYQGGLPHGTLMAHQRHQRIKEPWLWPGDMDLTAHVDFSAMQQ 305 Query: 284 I-AILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMG 342 + + L + G TTQG FL G+GI QR +K ++ + +++ VS +MG Sbjct: 306 VSCGQHGLDLLGFTTQGWFLLGMGILQRLEQAIKLDEDRERVAL-LRQTVSRLIMPDAMG 364 Query: 343 ELFKILVVSH--EKVELMPFV 361 E FK+L V + L F+ Sbjct: 365 ERFKVLAVGRGLGRERLAGFL 385 >gi|254251231|ref|ZP_04944549.1| hypothetical protein BDAG_00409 [Burkholderia dolosa AUO158] gi|124893840|gb|EAY67720.1| hypothetical protein BDAG_00409 [Burkholderia dolosa AUO158] Length = 412 Score = 244 bits (623), Expect = 2e-62, Method: Composition-based stats. Identities = 86/371 (23%), Positives = 156/371 (42%), Gaps = 34/371 (9%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGAV----GDFVTAPEI 55 L ++ + I G + D++ + P GYYS FG DFVTAPE+ Sbjct: 38 SETLTAQLRDEIAAAGGWLPFDRFMERALYAPGLGYYSGGARKFGRRADDGSDFVTAPEL 97 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F + LA + A E G R++E G G G + + + L ++ + Sbjct: 98 SPLFAQTLAQPVAQALEASG---TRRVMEFGAGTGKLAAGL---LASLDALGAALDEYLI 151 Query: 116 VETSERLTLIQKKQLASYGDK----INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 V+ S L Q+ +A+ + W +L + G ++ NE D++P++ F Sbjct: 152 VDLSGELRERQRDTIAAAAPAQAAKVRWLDALPERFEG--VVIGNEVLDAMPVRLFAKAG 209 Query: 172 HGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRL 229 RER + +D + VF+ + + + D G + E ++++ L Sbjct: 210 GTWRERGVALDARHAFVFDDRETGPDALPPALAGLDVDDGYVTETHEAALAFVRTVCTML 269 Query: 230 ACDGGTAIVIDY------GYLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHVDFQRLS 282 A G +++DY Y R T + + H + P + PG D+++HV+F + Sbjct: 270 AR--GAVLLVDYGFPAHEYYHPQRDRGTLMCHYRHHAHDDPFLYPGLQDITAHVEFTGIY 327 Query: 283 SIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTA-RKDILLDSVKRLVSTSADKKSM 341 + G T+Q +FL GI ++ + ++V++L+S + M Sbjct: 328 EAGTAAGADLLGYTSQARFLLNAGITDALAAIDPSETMQFLPAANAVQKLIS----EAEM 383 Query: 342 GELFKILVVSH 352 GELFK++ S Sbjct: 384 GELFKVIAFSR 394 >gi|91785574|ref|YP_560780.1| hypothetical protein Bxe_A0203 [Burkholderia xenovorans LB400] gi|91689528|gb|ABE32728.1| Conserved hypothetical protein [Burkholderia xenovorans LB400] Length = 396 Score = 244 bits (622), Expect = 2e-62, Method: Composition-based stats. Identities = 85/370 (22%), Positives = 148/370 (40%), Gaps = 32/370 (8%) Query: 2 ENKLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGD----FVTAPEI 55 L+ +I + G + D+Y + P GYYS FG GD FVTAPE+ Sbjct: 22 SEALVAQIRAELEAAGGWLPFDRYMERALYAPGLGYYSGGARKFGLRGDDGSDFVTAPEL 81 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F LA + A + G ++E G G G + +L + +F S + Sbjct: 82 SPLFAATLARPIAEALQASG---TRNVMEFGAGTGKLAAGLLHALDASGAEFDS---YSI 135 Query: 116 VETSERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 V+ S L Q + + + K+ W +L + G ++ NE D++P++ F T Sbjct: 136 VDLSGELRERQSETIGAAVPALAAKVRWLDALPERFEG--VVIGNEVLDAMPVRLFAFTG 193 Query: 172 HGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYF---LGAIFENSPCRDREMQSISDR 228 ER + + ++ F+ ++ S+ + E ++I Sbjct: 194 GAWHERG-VVWRDEAFAFDGQPVSAAADLALLSEIETAGEDYVTETHEAARAFTRTICTM 252 Query: 229 LACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSS 283 L I + Y R TL H P + PG D+++HV+F ++ Sbjct: 253 LVRGAAFFIDYGFPRHEYYHAQRAQGTLMCHYRHRAHVDPFLYPGLQDITAHVEFTGIAE 312 Query: 284 IAILYKLYINGLTTQGKFLEGLGIWQ-RAFSLMKQTARKDILLDSVKRLVSTSADKKSMG 342 + + G T+Q +FL GI + A + ++V++L+S + MG Sbjct: 313 AGVETGADLLGFTSQARFLLNAGITEALAEIDPADPKQYLPAANAVQKLLS----EAEMG 368 Query: 343 ELFKILVVSH 352 ELFK++ S Sbjct: 369 ELFKVIAFSR 378 >gi|209363658|ref|YP_001423492.2| hypothetical protein CBUD_0057 [Coxiella burnetii Dugway 5J108-111] gi|207081591|gb|ABS77815.2| hypothetical protein CBUD_0057 [Coxiella burnetii Dugway 5J108-111] Length = 417 Score = 244 bits (622), Expect = 2e-62, Method: Composition-based stats. Identities = 94/373 (25%), Positives = 161/373 (43%), Gaps = 26/373 (6%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFG 60 +L IV I +NG +T +Y L + P GYYS FGA GDFVTAPEIS +F Sbjct: 46 SEQLRLHIVREIAENGPLTFARYMQLALYAPGLGYYSAGSRKFGAAGDFVTAPEISSLFS 105 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 + +A +VELG G G M DILR + + +++E S Sbjct: 106 QCVARQCQQILIDLNGGD---IVELGAGSGRMAADILRELQHTGCLPHN---YFILEISA 159 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTF---LVANEFFDSLPIKQFVMTEHGIRER 177 L Q+K + + +++ + F ++ NE D++P+ +F ++GI+E Sbjct: 160 DLRDRQEKFIKNEIPELSHRVKWLNRLPSPHFKGVILGNEVIDAMPVHKF-KIDNGIKEV 218 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGG 234 ++ +++ V+ IG+ + + + G E + + S++D L Sbjct: 219 YVN-WKNEQFVWEIGEPSAALSDYIKNLTIHFPEGYESEVNLLLKGWIASLADILQEGLI 277 Query: 235 TAIVI-----DYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYK 289 I +Y + G + H++ PL+ G D+++HVDF ++ A Sbjct: 278 LLIDYGFPRHEYYHTDRDRGTIACHYRHHSHFDPLILTGIQDITAHVDFTAIAEAAAKQG 337 Query: 290 LYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILV 349 L + G T Q FL GI ++ A + +K+L MGELFK + Sbjct: 338 LAVEGFTHQAGFLLNCGIATLMPQ-VEDVAEHYRIAQEIKKLTL----PGEMGELFKAIA 392 Query: 350 VSHE-KVELMPFV 361 ++ + L+ F+ Sbjct: 393 LTRNYRQSLLGFI 405 >gi|118602453|ref|YP_903668.1| hypothetical protein Rmag_0438 [Candidatus Ruthia magnifica str. Cm (Calyptogena magnifica)] gi|118567392|gb|ABL02197.1| protein of unknown function DUF185 [Candidatus Ruthia magnifica str. Cm (Calyptogena magnifica)] Length = 364 Score = 244 bits (622), Expect = 2e-62, Method: Composition-based stats. Identities = 95/373 (25%), Positives = 158/373 (42%), Gaps = 33/373 (8%) Query: 4 KLIRKIVN-LIKKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGE 61 L + I N +I+ ++ D++ L + P GYY + FG GDF+TAPE S +FG Sbjct: 2 SLEQIIKNTIIQNANPISFDEFMDLALYHPTLGYYRSGLEKFGERGDFITAPETSDLFGF 61 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 LA ++E G G G++ IL + +L Y++E S Sbjct: 62 CLARQCAQVL-----NGTNDILEFGAGSGVLATQILFKLGRLNSLPKK---YYILELSAE 113 Query: 122 LTLIQ----KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 L Q K L D++ W L G ++ANE D++P K+ V + E Sbjct: 114 LKHRQAQAINKILPELMDRVVWLDELPADFSG--VVIANEVLDAMPAKRIVYKNNQFYEL 171 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAI 237 +D ++ L ++ E + + S+ +A + + Sbjct: 172 GVDYCGNEFCWKIFDLPYQSDKTLLPNNMVEDYKTEVNLRAMAWINSLY--MATNEVLVL 229 Query: 238 VIDY------GYLQSRVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKL 290 +IDY + R+ TL+ H +P VN G+ D+++ V+F ++ A + Sbjct: 230 LIDYGMGRNEYFHPQRLNGTLRCYYQHKASENPFVNIGEQDITTSVNFSDIADQASVSGF 289 Query: 291 YINGLTTQGKFLEGLGIWQ-RAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILV 349 ++G TQ FL LGI + L + + L VK+LV SA MGE FK+L Sbjct: 290 KVSGYATQALFLISLGIDEYLFEQLDEN--KYINLAQQVKQLVLPSA----MGESFKVLA 343 Query: 350 VSHE-KVELMPFV 361 +S + ++L F+ Sbjct: 344 LSKKLSIKLTGFI 356 >gi|165918264|ref|ZP_02218350.1| conserved hypothetical protein [Coxiella burnetii RSA 334] gi|165918124|gb|EDR36728.1| conserved hypothetical protein [Coxiella burnetii RSA 334] Length = 388 Score = 243 bits (621), Expect = 2e-62, Method: Composition-based stats. Identities = 93/373 (24%), Positives = 161/373 (43%), Gaps = 26/373 (6%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFG 60 +L IV I +NG +T +Y L + P GYYS FGA GDFVTAPEIS +F Sbjct: 17 SEQLRLHIVREIAENGPLTFARYMQLALYAPGLGYYSAGSRKFGAAGDFVTAPEISSLFS 76 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 + +A ++ELG G G M DILR + + +++E S Sbjct: 77 QCVARQCQQILIDLNGGD---ILELGAGSGRMAADILRELQHTGCLPHN---YFILEISA 130 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTF---LVANEFFDSLPIKQFVMTEHGIRER 177 L Q+K + + +++ + F ++ NE D++P+ +F ++GI+E Sbjct: 131 DLRDRQEKFIKNEIPELSHRVKWLNRLPSPHFKGVILGNEVIDAMPVHKF-KIDNGIKEV 189 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGG 234 ++ +++ V+ IG+ + + + G E + + S++D L Sbjct: 190 YVN-WKNEQFVWEIGEPSAALSDYIKNLTIHFPEGYESEVNLLLKGWIASLADILQEGLI 248 Query: 235 TAIVI-----DYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYK 289 I +Y + G + H++ PL+ G D+++HVDF ++ A Sbjct: 249 LLIDYGFPRHEYYHTDRDRGTIACHYRHHSHFDPLILTGIQDITAHVDFTAIAEAAAKQG 308 Query: 290 LYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILV 349 L + G T Q FL GI ++ A + +K+L MGELFK + Sbjct: 309 LAVEGFTHQAGFLLNCGIATLMPQ-VEDVAEHYRIAQEIKKLTL----PGEMGELFKAIA 363 Query: 350 VSHE-KVELMPFV 361 ++ + L+ F+ Sbjct: 364 LTRNYRQSLLGFI 376 >gi|215919266|ref|NP_820807.2| hypothetical protein CBU_1828 [Coxiella burnetii RSA 493] gi|206584150|gb|AAO91321.2| hypothetical protein CBU_1828 [Coxiella burnetii RSA 493] Length = 417 Score = 243 bits (621), Expect = 2e-62, Method: Composition-based stats. Identities = 93/373 (24%), Positives = 161/373 (43%), Gaps = 26/373 (6%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFG 60 +L IV I +NG +T +Y L + P GYYS FGA GDFVTAPEIS +F Sbjct: 46 SEQLRLHIVREIAENGPLTFARYMQLALYAPGLGYYSAGSRKFGAAGDFVTAPEISSLFS 105 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 + +A ++ELG G G M DILR + + +++E S Sbjct: 106 QCVARQCQQILIDLNGGD---ILELGAGSGRMAADILRELQHTGCLPHN---YFILEISA 159 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTF---LVANEFFDSLPIKQFVMTEHGIRER 177 L Q+K + + +++ + F ++ NE D++P+ +F ++GI+E Sbjct: 160 DLRDRQEKFIKNEIPELSHRVKWLNRLPSPHFKGVILGNEVIDAMPVHKF-KIDNGIKEV 218 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGG 234 ++ +++ V+ IG+ + + + G E + + S++D L Sbjct: 219 YVN-WKNEQFVWEIGEPSAALSDYIKNLTIHFPEGYESEVNLLLKGWIASLADILQEGLI 277 Query: 235 TAIVI-----DYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYK 289 I +Y + G + H++ PL+ G D+++HVDF ++ A Sbjct: 278 LLIDYGFPRHEYYHTDRDRGTIACHYRHHSHFDPLILTGIQDITAHVDFTAIAEAAAKQG 337 Query: 290 LYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILV 349 L + G T Q FL GI ++ A + +K+L MGELFK + Sbjct: 338 LAVEGFTHQAGFLLNCGIATLMPQ-VEDVAEHYRIAQEIKKLTL----PGEMGELFKAIA 392 Query: 350 VSHE-KVELMPFV 361 ++ + L+ F+ Sbjct: 393 LTRNYRQSLLGFI 405 >gi|212211857|ref|YP_002302793.1| hypothetical protein CbuG_0201 [Coxiella burnetii CbuG_Q212] gi|212010267|gb|ACJ17648.1| hypothetical protein CbuG_0201 [Coxiella burnetii CbuG_Q212] Length = 417 Score = 243 bits (621), Expect = 2e-62, Method: Composition-based stats. Identities = 93/373 (24%), Positives = 161/373 (43%), Gaps = 26/373 (6%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFG 60 +L IV I +NG +T +Y L + P GYYS FGA GDFVTAPEIS +F Sbjct: 46 SEQLRLHIVREIAENGPLTFARYMQLALYAPGLGYYSAGSRKFGAAGDFVTAPEISSLFS 105 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 + +A ++ELG G G M DILR + + +++E S Sbjct: 106 QCVARQCQQILIDLNGGD---ILELGAGSGRMAADILRELQHTGCLPHN---YFILEISA 159 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTF---LVANEFFDSLPIKQFVMTEHGIRER 177 L Q+K + + +++ + F ++ NE D++P+ +F ++GI+E Sbjct: 160 DLRDRQEKFIKNEIPELSHRVKWLNRLPSPHFKGVILGNEVIDAMPVHKF-KIDNGIKEV 218 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGG 234 ++ +++ V+ IG+ + + + G E + + S++D L Sbjct: 219 YVN-WKNEQFVWEIGEPSAALSDYIKNLTIHFPEGYESEVNLLLKGWIASLADILQEGLI 277 Query: 235 TAIVI-----DYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYK 289 I +Y + G + H++ PL+ G D+++HVDF ++ A Sbjct: 278 LLIDYGFPRHEYYHTDRDRGTIACHYRHHSHFDPLILTGIQDITAHVDFTAIAEAAAKQG 337 Query: 290 LYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILV 349 L + G T Q FL GI ++ A + +K+L MGELFK + Sbjct: 338 LAVEGFTHQAGFLLNCGIATLMPQ-VEDVAEHYRIAQEIKKLTL----PGEMGELFKAIA 392 Query: 350 VSHE-KVELMPFV 361 ++ + L+ F+ Sbjct: 393 LTRNYRQSLLGFI 405 >gi|312794997|ref|YP_004027919.1| hypothetical protein RBRH_01965 [Burkholderia rhizoxinica HKI 454] gi|312166772|emb|CBW73775.1| Hypothetical protein RBRH_01965 [Burkholderia rhizoxinica HKI 454] Length = 396 Score = 243 bits (621), Expect = 3e-62, Method: Composition-based stats. Identities = 87/369 (23%), Positives = 142/369 (38%), Gaps = 34/369 (9%) Query: 4 KLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAV----GDFVTAPEISQ 57 L++ I I G + D+Y L + P GYYS + FG DF+TAPE+S Sbjct: 24 ALVQHIAKQICAAGGWLPFDRYMELALYTPGLGYYSGGSVKFGRGPDDGSDFITAPELSP 83 Query: 58 IFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVE 117 +F + A + G +VE G G G + + L +VE Sbjct: 84 LFAQTFAKPVADVLGATG---TRHVVEFGAGTGKFAAGL---LRTLHALGVGCTRYTIVE 137 Query: 118 TSERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHG 173 S L Q++ + + + W +L + G +V NE D++P++ F + Sbjct: 138 LSGELRARQRECIAQTAPQFASCVEWIDALPERVDG--VIVGNEVLDAMPVRLFARQDGC 195 Query: 174 IRERMIDIDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRLAC 231 ER + + VF + T + + E + ++ L Sbjct: 196 WHERGVTLADASRFVFADRPLAATAVPATLACVPGRHDYVTETHEAAAAFVHTVCSALGR 255 Query: 232 DGGTAIVIDY------GYLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSI 284 G A+ +DY Y R TL H P + PG D+++HV F + Sbjct: 256 --GAALFVDYGFPAAEYYHPQRTEGTLMCHYRHRAHGDPFLYPGLQDITAHVQFSAIGQA 313 Query: 285 AILYKLYINGLTTQGKFLEGLGIWQRAFSLM-KQTARKDILLDSVKRLVSTSADKKSMGE 343 A ++ G T+Q +FL GI L AR ++V++L+S + MGE Sbjct: 314 ARDAGAHLLGYTSQARFLMNAGITDSLAQLDPADPARFLPAANAVQKLLS----EAEMGE 369 Query: 344 LFKILVVSH 352 LFK++ Sbjct: 370 LFKVIAFCR 378 >gi|224141343|ref|XP_002324033.1| predicted protein [Populus trichocarpa] gi|222867035|gb|EEF04166.1| predicted protein [Populus trichocarpa] Length = 377 Score = 243 bits (621), Expect = 3e-62, Method: Composition-based stats. Identities = 119/378 (31%), Positives = 185/378 (48%), Gaps = 44/378 (11%) Query: 25 FALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVE 84 + +P+FG+Y + + FG GDF+T+PE+SQ+FGEM+ ++ +C WEQ G P V LVE Sbjct: 1 MEEVLTNPKFGFYISRDVFGTEGDFITSPEVSQMFGEMVGVWAMCLWEQMGRPKQVNLVE 60 Query: 85 LGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK-------- 136 LGPGRG +M D+LR K K L +++VE S L +Q L + Sbjct: 61 LGPGRGTLMADLLRGASKFKSFT-ESLHVHLVECSPTLQKLQHHNLKCLDEDDNGDGVEK 119 Query: 137 ----------INWYTSLADVPLG-FTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHD 185 ++W+ L VP G + ++A+EF+D+LP+ QF G E+M+D+ + Sbjct: 120 RTISTLAGTLVSWHALLEQVPSGLPSIIIAHEFYDALPVHQFQRASRGWCEKMVDVSEDS 179 Query: 186 SLVFNIGDHEIK--------SNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAI 237 F + + + + E P +I+DR++CDGG A+ Sbjct: 180 MFRFVLSPQPTPATLYLMKRCKWAAPEEIEKLSHIEVCPKAMDLTHAIADRISCDGGGAL 239 Query: 238 VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAIL--YKLYINGL 295 +IDYG L V D+LQA++ H +++ L NPG ADLS++VDF + A + ++G Sbjct: 240 IIDYG-LNGVVSDSLQAIRKHKFINILDNPGSADLSAYVDFASIRHSAEEVSADISVHGP 298 Query: 296 TTQGKFLEGLGIWQRAFSLMKQ--TARKDILLDSVKRLVSTSAD----------KKSMGE 343 TQ +FL LGI R SL++ + D L RLV MG Sbjct: 299 ITQSQFLGALGINFRVESLLQNCTDEQADSLRTGYWRLVGEGEAPFWEGPDEQVPIGMGT 358 Query: 344 LFKILVVSHEKVELM-PF 360 + + + + K + PF Sbjct: 359 RYLAMAIVNTKQGVPVPF 376 >gi|170734330|ref|YP_001766277.1| hypothetical protein Bcenmc03_2997 [Burkholderia cenocepacia MC0-3] gi|169817572|gb|ACA92155.1| protein of unknown function DUF185 [Burkholderia cenocepacia MC0-3] Length = 396 Score = 243 bits (621), Expect = 3e-62, Method: Composition-based stats. Identities = 92/371 (24%), Positives = 157/371 (42%), Gaps = 34/371 (9%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGAV----GDFVTAPEI 55 L ++ + I G ++ D++ + P GYYS FG DFVTAPE+ Sbjct: 22 SETLAAQLRDEIAAAGGWLSFDRFMERALYAPGLGYYSGGARKFGRRADDGSDFVTAPEL 81 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F + LA + A G R++E G G G + +L + L + L + Sbjct: 82 SPLFAQTLAQPVAEALAASG---TRRVMEFGAGTGKLAAGLLATLDALGAELDEYL---I 135 Query: 116 VETSERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 V+ S L Q+ + + K+ W +L + G +V NE D++P++ F + Sbjct: 136 VDLSGELRERQRDTIEAAVPALAAKVRWLDALPERFDG--VVVGNEVLDAMPVRLFAKVD 193 Query: 172 HGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYF--LGAIFENSPCRDREMQSISDRL 229 RER + +D + VF+ + G + E +++ L Sbjct: 194 GAWRERGVALDARHAFVFDDRPVGAAGLPAVLAPLDVGDGYVTETHEAALAFTRTVCTML 253 Query: 230 ACDGGTAIVIDY------GYLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHVDFQRLS 282 G A++IDY Y R T + + H + P V PG DL++HV+F + Sbjct: 254 GR--GAALLIDYGFPAHEYYHPQRDRGTLMCHYRHHAHDDPFVYPGLQDLTAHVEFTGIY 311 Query: 283 SIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARK-DILLDSVKRLVSTSADKKSM 341 A+ + + G T+Q +FL GI ++ R ++V++L+S + M Sbjct: 312 EAAVATGVDLLGYTSQARFLLNAGITDALAAIDPSDIRAFLPAANAVQKLIS----EAEM 367 Query: 342 GELFKILVVSH 352 GELFK++ S Sbjct: 368 GELFKVIAFSR 378 >gi|255727811|ref|XP_002548831.1| conserved hypothetical protein [Candida tropicalis MYA-3404] gi|240133147|gb|EER32703.1| conserved hypothetical protein [Candida tropicalis MYA-3404] Length = 512 Score = 243 bits (620), Expect = 3e-62, Method: Composition-based stats. Identities = 113/414 (27%), Positives = 176/414 (42%), Gaps = 62/414 (14%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFG-AVGDFVTAPEISQIFGE 61 N L IK G + + Y C+ PEFGYY+T NP GDF+T+PEIS +FGE Sbjct: 102 NNLTDLFQQTIKLTGPLPLSAYMRQCLTHPEFGYYTTRNPLSLRTGDFITSPEISSVFGE 161 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFS---VLSIYMVET 118 M+ I+ W+Q +P +R VE GPG+G ++ D+L+ K + I ++E Sbjct: 162 MIGIWYFSIWQQQNYPKHIRFVEFGPGKGTLIFDVLKTFNKFVEKLSKEKPTIEISLIEA 221 Query: 119 SERLTLIQKKQLA---------------------SYGDKINWYTSLADVPLGFTFLVANE 157 S+ L Q K + + ++ + F++A+E Sbjct: 222 SKVLRKEQWKLMCDPEQPFETTEEGYNRSVTKWGNEISWLDTEKDIKHDNEIANFIIAHE 281 Query: 158 FFDSLPIKQFVMTEHGIRERMIDIDQ----------------------HDSLVFNIGDHE 195 FFD+LPIK F+ E G RE M++ + I E Sbjct: 282 FFDALPIKGFIREEKGWRELMVEHTPSVNNTQLKLESKETPQEEDESLNTEFHLTIAPKE 341 Query: 196 IKSNFLT-----CSDYFLGAIFENSPCRDREMQSISDRLAC-DGGTAIVIDYGYLQSRVG 249 S+ + D +G E P + + + L + G +VIDYG Sbjct: 342 TPSSMIPQISKRYRDLPVGTRIEICPDAELYIMKMVQLLNNSNKGAILVIDYGTANEIPS 401 Query: 250 DTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQ 309 ++L+ + H +VSP NPG+ DLS VDF+ L + + G T+QG +L +GI Sbjct: 402 NSLRGIHQHKFVSPFWNPGEVDLSIDVDFENLKLLTNDI-VESFGPTSQGDWLHNIGIGY 460 Query: 310 RAFSLMKQTAR----KDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMP 359 R L+K +D + + +RL DK MG ++K L + + P Sbjct: 461 RVDQLIKMNNEAPHVQDKIYGAYRRLT----DKDQMGNIYKFLALLPKGSSAPP 510 >gi|325265583|ref|ZP_08132274.1| protein of hypothetical function DUF185 [Kingella denitrificans ATCC 33394] gi|324982931|gb|EGC18552.1| protein of hypothetical function DUF185 [Kingella denitrificans ATCC 33394] Length = 384 Score = 243 bits (620), Expect = 3e-62, Method: Composition-based stats. Identities = 95/376 (25%), Positives = 157/376 (41%), Gaps = 32/376 (8%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIFG 60 L + I N I++ G + ++ L + P GYY+ G GDF+TAP +S +F Sbjct: 17 SQALCQLIQNEIREKGDIPFSRFMELALYAPNLGYYANGRLKIGQAGDFITAPTLSPLFA 76 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 + LA L + G + E G G G++ D+L + S+ + Y+VE S Sbjct: 77 QTLAQQLKPLLPEVGG----VIYEFGAGTGVLAADLLNALES------SLHTYYIVEVSA 126 Query: 121 RLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRE 176 +L Q++ + + DK+ W L + G L+ NE D++PI++ E + Sbjct: 127 QLAKQQRQHIAEYAPEFLDKVEWLAQLPEQLEG--ILIGNEVLDAMPIERIRHNESNKWQ 184 Query: 177 RMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG---AIFENSPCRDREMQSISDRLACDG 233 R + D + + E ++ YF E + +Q+++ +L Sbjct: 185 RACVSQEGDKFILKYQETEDENIVQAALQYFPDNTPYTSELHLTQYAFIQTLAQKLTR-- 242 Query: 234 GTAIVIDY------GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAI 286 G I IDY Y R TL + H+ P G DL+SHV+F ++ A Sbjct: 243 GAMIWIDYGFDAAQYYHPQRNDGTLIGHHRHHSIHDPFYRVGLTDLTSHVNFSDIADAAC 302 Query: 287 LYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFK 346 L + G TTQ FL LGI +R + + + + V + MGELFK Sbjct: 303 TNGLDLIGYTTQANFLFNLGILERLGQQYPEADSVEYIRAAA--AVHQLTAQHEMGELFK 360 Query: 347 ILVVSHE-KVELMPFV 361 ++ V+ F+ Sbjct: 361 VIAFGKNISVDWQGFL 376 >gi|296157397|ref|ZP_06840232.1| protein of unknown function DUF185 [Burkholderia sp. Ch1-1] gi|295892169|gb|EFG71952.1| protein of unknown function DUF185 [Burkholderia sp. Ch1-1] Length = 396 Score = 243 bits (620), Expect = 3e-62, Method: Composition-based stats. Identities = 86/370 (23%), Positives = 151/370 (40%), Gaps = 32/370 (8%) Query: 2 ENKLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGD----FVTAPEI 55 L+ ++ + G + D+Y + P GYYS FG GD FVTAPE+ Sbjct: 22 SEALVAQLRAELEAAGGWLPFDRYMERALYAPGLGYYSGGARKFGLRGDDGSDFVTAPEL 81 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F LA + A + G ++E G G G + +L + L +F S + Sbjct: 82 SPLFAATLARPIAEALQASG---TRNVMEFGAGTGKLAAGLLDALDALGAEFDS---YSI 135 Query: 116 VETSERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 V+ S L Q++ + + K+ W +L + G ++ NE D++P++ F T Sbjct: 136 VDLSGELRERQREAIEAAVPALAAKVRWLDALPERFEG--VVIGNEVLDAMPVRLFAFTG 193 Query: 172 HGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYF---LGAIFENSPCRDREMQSISDR 228 ER + + ++ F+ ++ S+ + E ++I Sbjct: 194 GAWHERG-VVWRDEAFAFDDRPVSAAADLALLSEIDTAGEDYVTETHEAASAFTRTICTM 252 Query: 229 LACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSS 283 L I + Y R TL H P + PG D+++HV+F ++ Sbjct: 253 LVRGAAFFIDYGFPRHEYYHAQRAQGTLMCHYRHRAHGDPFLYPGLQDITAHVEFTGIAE 312 Query: 284 IAILYKLYINGLTTQGKFLEGLGIWQ-RAFSLMKQTARKDILLDSVKRLVSTSADKKSMG 342 + + G T+Q +FL GI + A T + ++V++L+S + MG Sbjct: 313 AGVETGADLLGFTSQARFLLNAGITEALAEIDPADTKQYLPAANAVQKLLS----EAEMG 368 Query: 343 ELFKILVVSH 352 ELFK++ S Sbjct: 369 ELFKVIAFSR 378 >gi|292490198|ref|YP_003525637.1| hypothetical protein Nhal_0029 [Nitrosococcus halophilus Nc4] gi|291578793|gb|ADE13250.1| protein of unknown function DUF185 [Nitrosococcus halophilus Nc4] Length = 395 Score = 243 bits (620), Expect = 4e-62, Method: Composition-based stats. Identities = 98/378 (25%), Positives = 159/378 (42%), Gaps = 32/378 (8%) Query: 2 ENKLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIF 59 KL I I++NG Q++ ++ L + P GYY + + GA GDF+TAPE+S +F Sbjct: 22 SQKLQSIIREDIEQNGGQISFARFMELALYKPGLGYYMSGLHKLGAAGDFITAPELSPLF 81 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVI--CKLKPDFFSVLSIYMVE 117 L+ E G ++E G G G M D+L + P+ + +L + E Sbjct: 82 ARCLSRQCQQVLELLGSGE---ILEFGAGSGRMAADLLAELDRRGQLPERYFILELSA-E 137 Query: 118 TSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 +R ++Q+ K++W L D G ++ANE D++P+ F + + ER Sbjct: 138 LRQRQQQTLQQQVPHLAPKVSWLDRLPDNIQG--LVLANEVCDAMPVHCFQLEDEHSWER 195 Query: 178 MIDIDQHDSLVFNIGD------HEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLAC 231 + + DS V+ G E + E + + + I+ RL Sbjct: 196 YVGYE-GDSFVWKKGPLSHARLKERITEIRLLLKQVNRYESEVNLAMENWVAEIAHRL-- 252 Query: 232 DGGTAIVIDY------GYLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSI 284 G ++IDY Y R TL H PL+ PG D+++HVDF L+ Sbjct: 253 QQGMLLIIDYGFPRQEYYHPDRTTGTLMCHYRHRAHPDPLILPGLQDITAHVDFTALAEA 312 Query: 285 AILYKLYINGLTTQGKFLEGLGIWQRAFS-LMKQTARKDILLDSVKRLVSTSADKKSMGE 343 L + G +Q FL G+ + A + + + + +KRL MGE Sbjct: 313 GHNSGLRVAGYCSQTDFLLACGLDELAAAEIAAGGYHALEISNQIKRLTL----PSEMGE 368 Query: 344 LFKILVVSHEKVEL-MPF 360 LFKIL ++ + F Sbjct: 369 LFKILALTRGIDPPLLGF 386 >gi|166368128|ref|YP_001660401.1| hypothetical protein MAE_53870 [Microcystis aeruginosa NIES-843] gi|166090501|dbj|BAG05209.1| hypothetical protein MAE_53870 [Microcystis aeruginosa NIES-843] Length = 375 Score = 243 bits (619), Expect = 4e-62, Method: Composition-based stats. Identities = 93/367 (25%), Positives = 157/367 (42%), Gaps = 23/367 (6%) Query: 4 KLIRKIVNLIKKN--GQMTVDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEISQIFG 60 L I+ IK++ G+++ +++ L + P++GYY++ G+ GDF T+ + FG Sbjct: 2 NLEAIILEEIKQSVAGRISFERWMDLALYHPDYGYYTSGKVEIGSKGDFFTSSSLGADFG 61 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 ++LA + E G LVE+G G GI+ DIL + DF+ LS ++E S+ Sbjct: 62 QLLAEQFVEMAEFLGNSPGFTLVEVGAGSGILAKDILDYLSDSYADFYQNLSYIIIEQSQ 121 Query: 121 RLTLIQKKQLASYGDK-INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMI 179 +L Q+ LA Y + LAD L +NE D+ P+ + V+ +RE + Sbjct: 122 KLRERQQATLAGYSPVSWQSWPDLADNSLVGCV-FSNELIDAFPVHRVVIESGELREIYL 180 Query: 180 DI-DQHDSLVFNIGDHEIKSNFL------TCSDYFLGAIFENSPCRDREMQSISDRLACD 232 + + ++ ++ IK F Y G E + +++++ +L Sbjct: 181 GLGEPFQEIIGDLSTDRIKDYFDLVGINIPSPLYPEGYQTEVNLLALDWLETVNRKLDR- 239 Query: 233 GGTAIVIDYGYL------QSRVGDTLQAVKGHTYVS-PLVNPGQADLSSHVDFQRLSSIA 285 G + IDYGY R TLQ + H + G+ D+++HVDF L Sbjct: 240 -GYILTIDYGYTAEKYYHPQRSQGTLQCYRQHQRHDHAYLWVGEQDITTHVDFTALQRQG 298 Query: 286 ILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELF 345 L G T QG FL LG+ R L + + L + +G F Sbjct: 299 EKLGLKNLGFTQQGLFLMALGLGDRLKELSQGKIDILTIFQRRDAL-HQLINPTGLGG-F 356 Query: 346 KILVVSH 352 +L+ Sbjct: 357 GVLIQGK 363 >gi|324512121|gb|ADY45030.1| Protein midA [Ascaris suum] Length = 414 Score = 243 bits (619), Expect = 4e-62, Method: Composition-based stats. Identities = 115/382 (30%), Positives = 191/382 (50%), Gaps = 34/382 (8%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYS------TCNPFGAVGDFVTAPEISQ 57 L+R + I+ G M V Y V+ GYYS C+ FG GDF+TAPE+SQ Sbjct: 23 ALLRFLKRKIRLRGPMPVADYMRTVVSSSSVGYYSQFSRNENCDIFGEKGDFITAPELSQ 82 Query: 58 IFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVE 117 +FGEM+ ++ G +LVE GPG G +M DI+ V+ + + D S+ +++E Sbjct: 83 MFGEMIGVWCYYELANTGHKGHWQLVESGPGTGQLMKDIVGVMERFEEDKMSI---HLIE 139 Query: 118 TSERLTLIQKKQLASYGDK------------------INWYTSLADVPLGFTFLVANEFF 159 TS+ L L Q+K L S + I WY SL++VP F+ +ANEF Sbjct: 140 TSDPLILEQEKTLCSRPSQFIENNAHVRYNTTKGGIPIYWYRSLSEVPEKFSVFIANEFL 199 Query: 160 DSLPIKQFVMTEH-GIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAI----FEN 214 D+LP+ QF E E + ++ ++ L F + E + +E Sbjct: 200 DALPVHQFKKDETGNWHEIYVAMNDNEDLCFMLSRMETLFTVGLMPESIRNETDRTEWEV 259 Query: 215 SPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSS 274 SP + +++R+ GG ++IDYG+ SR +L+A + H V+PL NPG+ D+++ Sbjct: 260 SPDAGTYVNEVAERVIQFGGFGLMIDYGHDGSRKDLSLRAYRRHQLVNPLENPGEHDITA 319 Query: 275 HVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVST 334 V+F L S+ L I G Q +FL +GI R L+++ ++++ ++ +K + Sbjct: 320 DVNFGYLKSLIEDRAL-IFGPIDQREFLAQMGIGVRLRRLVEKCSKREDQVNLIKS-YNM 377 Query: 335 SADKKSMGELFKILVVSHEKVE 356 ++ MG FK++ V + ++ Sbjct: 378 LMSEEGMGTRFKVMSVFPKTLK 399 >gi|37521953|ref|NP_925330.1| hypothetical protein gll2384 [Gloeobacter violaceus PCC 7421] gi|35212952|dbj|BAC90325.1| gll2384 [Gloeobacter violaceus PCC 7421] Length = 396 Score = 243 bits (619), Expect = 4e-62, Method: Composition-based stats. Identities = 98/388 (25%), Positives = 156/388 (40%), Gaps = 33/388 (8%) Query: 1 MENK--------LIRKIVNLIKK--NGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDF 49 ME L I + G++ Q+ L + PE GYY+T G GD+ Sbjct: 1 METPPVDDSNPALRTLIAQRVAASPGGRLNFAQFMDLALYHPELGYYATHPGRIGGWGDY 60 Query: 50 VTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFS 109 +TA + FGE+LA+ W G P VE+G G+G+ D LR PDF + Sbjct: 61 ITAAHLGSDFGELLAVQAAQLWRHLGKPEGFDFVEMGAGQGLFAADFLRHAHANLPDFAA 120 Query: 110 VLSIYMVETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVM 169 L +VE S Q++ LA + +A + F ANE D+LP+ QFV+ Sbjct: 121 ALDYRIVERSAAQLAEQRRVLAGLPVRWCDLEQIAPDSVAGCF-FANELVDALPVHQFVV 179 Query: 170 TEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTC-------SDYFLGAIFENSPCRDREM 222 + E + ++ V + G E + + Sbjct: 180 HRGELLEIYVALEGERDFVEVAAAPSTPRLAAFLDRCGIATAPLGEGYRSEVNLAALDWL 239 Query: 223 QSISDRLACDGGTAIVIDYGYL-------QSRVGDTLQAVKGHTYVSPLVNPGQADLSSH 275 ++++ RLA G + +DYGY Q R G + + +P ++ G D+++H Sbjct: 240 EAVARRLAR--GYVLTVDYGYTARQYYAPQHRSGTLACYHRHRVHDNPYLHIGNQDITAH 297 Query: 276 VDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMK--QTARKDILLDSVKR--L 331 V+F L + Y L G T Q FL LG+ +R +L L +++R Sbjct: 298 VNFTALQTHGAAYALRSLGFTRQSFFLLALGLGERLAALGAPGAIENNAQLNAALRRREA 357 Query: 332 VSTSADKKSMGELFKILVVSHEKVELMP 359 + T D SMG+ F +L+ +P Sbjct: 358 LRTLVDPGSMGD-FGVLIQGKNTDPDLP 384 >gi|167644641|ref|YP_001682304.1| hypothetical protein Caul_0673 [Caulobacter sp. K31] gi|167347071|gb|ABZ69806.1| protein of unknown function DUF185 [Caulobacter sp. K31] Length = 402 Score = 243 bits (619), Expect = 4e-62, Method: Composition-based stats. Identities = 131/402 (32%), Positives = 189/402 (47%), Gaps = 50/402 (12%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPF-----GAVGDFVTAPEISQI 58 L+ ++ I ++G ++V ++F C+ DP GYY+T G GDF+TAP +SQ+ Sbjct: 2 SLLDRLKAQIAQDGPISVAEFFTRCLHDPRDGYYATRPALAGMKGGEDGDFLTAPGVSQM 61 Query: 59 FGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVET 118 FGE++ ++++ W + G PS VR+VE+GPG G ++ D+LR L P+F + +++VE Sbjct: 62 FGELIGLWILETWTRMGRPSPVRMVEMGPGDGTLISDVLRAARLL-PEFLNAADLWLVEV 120 Query: 119 SERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERM 178 S L Q +LA + LVANE D LP QFV TE G ER+ Sbjct: 121 SPPLRAAQAVKLAPLTPSWADRLEVV-PAGAPLILVANEVLDCLPAHQFVRTEGGWAERV 179 Query: 179 IDIDQHDSLVFNIG-----------------------------------DHEIKSNFLTC 203 + +D +L F + S Sbjct: 180 VGLDDSGNLAFGLKALQSPLPLDGGGAGVGGVRGVGRRANLGTSGSAFTPIPNPSPIERE 239 Query: 204 SDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSP 263 Y G I E+SP + I R+A DGG A++IDYG GDTLQA+K HT VSP Sbjct: 240 GSYPPGTIVESSPAQAALGSEIGHRIARDGGAALLIDYGRDAPGPGDTLQALKAHTKVSP 299 Query: 264 LVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDI 323 L PGQADL+ DF + + A + TQG FL+GLGI RA +L + Sbjct: 300 LAQPGQADLTVWADFPAVLAAAAEAGAATGPILTQGAFLQGLGIEARAQALAAARPDQAD 359 Query: 324 LLDSVKRLVSTSADKKSMGELFKILVVSHEKV-----ELMPF 360 + R + + MGELFK++ +S + E PF Sbjct: 360 ---KLTRQLDRLTGRAQMGELFKVVCLSAPGLAPPLFEPPPF 398 >gi|161830113|ref|YP_001597650.1| hypothetical protein COXBURSA331_A2027 [Coxiella burnetii RSA 331] gi|161761980|gb|ABX77622.1| conserved hypothetical protein [Coxiella burnetii RSA 331] Length = 388 Score = 243 bits (619), Expect = 4e-62, Method: Composition-based stats. Identities = 93/373 (24%), Positives = 162/373 (43%), Gaps = 26/373 (6%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFG 60 +L IV I +NG +T+ +Y L + P GYYS FGA GDFVTAPEIS +F Sbjct: 17 SEQLRLHIVREIAENGPLTLARYMQLALYAPGLGYYSAGSRKFGAAGDFVTAPEISSLFS 76 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 + +A ++ELG G G M DILR + + +++E S Sbjct: 77 QCVARQCQQILIDLNGGD---ILELGAGSGRMAADILRELQHTGCLPHN---YFILEISA 130 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTF---LVANEFFDSLPIKQFVMTEHGIRER 177 L Q+K + + +++ + F ++ NE D++P+ +F ++GI+E Sbjct: 131 DLRDRQEKFIKNEIPELSHRVKWLNRLPSPHFKGVILGNEVIDAMPVHKF-KIDNGIKEV 189 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGG 234 ++ +++ V+ IG+ + + + G E + + S++D L Sbjct: 190 YVN-WKNEQFVWEIGEPSAALSDYIKNLTIHFPEGYESEVNLLLKGWIASLADILQEGLI 248 Query: 235 TAIVI-----DYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYK 289 I +Y + G + H++ PL+ G D+++HVDF ++ A Sbjct: 249 LLIDYGFPRHEYYHTDRDRGTIACHYRHHSHFDPLILTGIQDITAHVDFTAIAEAAAKQG 308 Query: 290 LYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILV 349 L + G T Q FL GI ++ A + +K+L MGELFK + Sbjct: 309 LAVEGFTHQAGFLLNCGIATLMPQ-VEDVAEHYRIAQEIKKLTL----PGEMGELFKAIA 363 Query: 350 VSHE-KVELMPFV 361 ++ + L+ F+ Sbjct: 364 LTRNYRQSLLGFI 376 >gi|282896985|ref|ZP_06304987.1| Protein of unknown function DUF185 [Raphidiopsis brookii D9] gi|281197637|gb|EFA72531.1| Protein of unknown function DUF185 [Raphidiopsis brookii D9] Length = 402 Score = 243 bits (619), Expect = 4e-62, Method: Composition-based stats. Identities = 101/399 (25%), Positives = 162/399 (40%), Gaps = 44/399 (11%) Query: 4 KLIRKIVNLIKKN--GQMTVDQYFALCVADPEFGYYSTCN-PFGAVG-DFVTAPEISQIF 59 L + I + I+ + ++T +Y L + E+GYYS+ + G G DF T+P +S+ F Sbjct: 2 SLAQIITDCIQSSEQKRITFAEYMDLVLYHREYGYYSSHSCQIGFEGSDFFTSPSLSEDF 61 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 GE+LA + WE P +LVE+G G+G++ IL + PDFF ++ +VE S Sbjct: 62 GELLAEQFLQMWENLDRPRPFQLVEMGAGKGVLAAQILTYLKSHHPDFFEIIEYIIVEKS 121 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMI 179 +L Q+++L + + L + F +NE D+ P+ QF++ ++E + Sbjct: 122 PQLRGEQQQRLEIFSIQWLDLQELHPGSIIGCF-FSNELVDAFPVHQFILQNGKLQEIYV 180 Query: 180 DID--------QHDSLVFNIGDHEIKSNFLTCSD------------------YFLGAIFE 213 QH S +I E S Y E Sbjct: 181 TFRTAPPNLKTQHSSQSLDIEILEFIEVIGEPSTPKLEEYLQLVGIDLSPNVYPEDYRSE 240 Query: 214 NSPCRDREMQSISDRLACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYVS-PLVNPG 268 + + +++ L I Y Y R TLQ H Y P + G Sbjct: 241 INLAALDWLSIVANCLQRGYVLTIDYGYPATRYYHPRRSQGTLQCYYQHRYHDNPYIKVG 300 Query: 269 QADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSV 328 + D+++HVDF L + L G T QG FL LG+ R +L Q LL Sbjct: 301 EQDITTHVDFTALENWGKKCGLNPVGWTQQGLFLMALGLGDRIAALSYQQQSVSQLLKRR 360 Query: 329 KRLVSTSADKKSMGELFKILVVSH------EKVELMPFV 361 + L + +G F +LV S ++ L F+ Sbjct: 361 EAL-HQLISPEGLGN-FGVLVQSKGLTNTQSQLPLQGFI 397 >gi|172037853|ref|YP_001804354.1| hypothetical protein cce_2940 [Cyanothece sp. ATCC 51142] gi|171699307|gb|ACB52288.1| DUF185-containing protein [Cyanothece sp. ATCC 51142] Length = 377 Score = 242 bits (618), Expect = 5e-62, Method: Composition-based stats. Identities = 93/372 (25%), Positives = 150/372 (40%), Gaps = 20/372 (5%) Query: 5 LIRKIVNLIKK--NGQMTVDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEISQIFGE 61 ++ I++ IK + ++T Y L + PE GYYS+ G+ GDF TA + FGE Sbjct: 1 MLEIIIDRIKNSSHHRITFADYMDLVLYHPEQGYYSSGKVNIGSEGDFFTASSLGSDFGE 60 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 +LA E LVE+G G G + DIL + + +F+ L+ ++E S+ Sbjct: 61 LLAEQFREMSEFLNNSDSFTLVEVGAGTGNLASDILNYLKQKHSNFYDQLNYIIIEESQA 120 Query: 122 LTLIQKKQLASYGDK-INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 L Q+ +L + + + D + +NE D+ PI ++E + Sbjct: 121 LIKKQQDKLKGFDKITWTSWQDIPDNSITGCI-FSNELIDAFPIHLVTKHNKQLKEIYVT 179 Query: 181 IDQH--DSLVFNIGDHEIKSNF------LTCSDYFLGAIFENSPCRDREMQSISDRLACD 232 + I ++ F +T +Y E + ++ +S +L Sbjct: 180 WQDDQLKEKIEEISTTKLSDYFKLIDIDITKDNYPEDYRTEVNLKALGWLKLVSAKLKKG 239 Query: 233 GGTAIVIDY----GYLQSRVGDTLQAVKGH-TYVSPLVNPGQADLSSHVDFQRLSSIAIL 287 I Y Y R TL H + +P VN GQ D+++HVDF L L Sbjct: 240 YLLTIDYGYSSAKYYHPQRYQGTLNCYHQHRHHHNPYVNLGQQDITAHVDFTALEKQGNL 299 Query: 288 YKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKI 347 L GLT QG FL LG+ R L +L+ L + +G FK+ Sbjct: 300 LGLETVGLTQQGLFLMALGLGDRLAELSNGNYSLPEILNRRDAL-HQLINPTGLGG-FKV 357 Query: 348 LVVSHEKVELMP 359 L+ S + + P Sbjct: 358 LIQSKKIKKNQP 369 >gi|296813001|ref|XP_002846838.1| DUF185 domain-containing protein [Arthroderma otae CBS 113480] gi|238842094|gb|EEQ31756.1| DUF185 domain-containing protein [Arthroderma otae CBS 113480] Length = 490 Score = 242 bits (618), Expect = 6e-62, Method: Composition-based stats. Identities = 117/450 (26%), Positives = 185/450 (41%), Gaps = 93/450 (20%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-----NPFGAVGDFVTAPEIS 56 L ++I + I G +++ + C+ E GYY++ + FG GDFVT+PEIS Sbjct: 37 STPLAKRITDAINTTGPISIAAFMRQCLTSDEGGYYTSRGAPGSDVFGKEGDFVTSPEIS 96 Query: 57 QIFGEMLAIFLICAWEQHGFPSC-VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 Q+FGE+L I+++ W G S V+L+E+GPG+G +M DILR + K S+ IY+ Sbjct: 97 QMFGELLGIWIVTEWLSQGRRSSGVQLMEVGPGKGTLMADILRSVRNFKGFASSIEGIYL 156 Query: 116 VETSERLTLIQKKQLASYGD--------------------KINWYTSLADVPLGFTFLVA 155 +E S L IQK++L L +V F+VA Sbjct: 157 IEASPTLRDIQKQKLCGEAPMEECEIGHKSTSTHLGVPVYWTEHIRLLPEVENKAPFIVA 216 Query: 156 NEFFDSLPIKQFVMTE-------------------------HGIRERMIDIDQHDSLVFN 190 +EFFD+LPI F RE ++ + + + Sbjct: 217 HEFFDALPIHAFQSVHSPPPETINTPTGPATLRQPPLPLNGTQWRELVVATNPEPTRESD 276 Query: 191 IGDHEIKSNFLTCS-------------------DYFLGAIFENSPCRDREMQSISDRL-- 229 D +++ G+ E SP Q + + Sbjct: 277 KNDKKLEFRLALAKSPTPASLVMPEMSPRYKALKSTRGSTIEISPESHTYAQEFARLIGG 336 Query: 230 -----------ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDF 278 G A+++DYG + ++L+ +K H VSP PGQ DLS+ VDF Sbjct: 337 ANPTGKDDSPTRTPAGAALILDYGPSSTIPVNSLRGIKNHQIVSPFATPGQVDLSADVDF 396 Query: 279 QRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLM---KQTARKDILLDSVKRLVS 333 L+ A+ + + G QG FL LGI +RA L+ K ++ + S +RLV Sbjct: 397 TGLAESALNASPGVEVYGPNEQGSFLRSLGIAERAAQLLRNVKDETKRKQIESSWQRLVD 456 Query: 334 TSADKKSMGELFKILVVSHE---KVELMPF 360 MG+++K + + E K + F Sbjct: 457 RGG--GGMGKIYKAMAIVPESGGKRRPVGF 484 >gi|209543261|ref|YP_002275490.1| hypothetical protein Gdia_1092 [Gluconacetobacter diazotrophicus PAl 5] gi|209530938|gb|ACI50875.1| protein of unknown function DUF185 [Gluconacetobacter diazotrophicus PAl 5] Length = 339 Score = 242 bits (618), Expect = 6e-62, Method: Composition-based stats. Identities = 115/342 (33%), Positives = 169/342 (49%), Gaps = 19/342 (5%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCV 80 +D++ A YY+ +PF DF+TAPEISQ+FGE+L ++ W+ G P Sbjct: 9 LDRFMARA----NAAYYAGRDPF---ADFITAPEISQMFGEILGAWVAVTWQGMGRPVPF 61 Query: 81 RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYG-DKINW 139 LVE GPGRG +M D++R++ ++ PD +++VE S RL +Q+ LA + W Sbjct: 62 ALVEAGPGRGTLMADMMRLLARVAPDCHDAARVHLVELSPRLRDVQQAALAGRTAHPVTW 121 Query: 140 YTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSN 199 + + DVP G L+ANEF D+L I+QFV T G ER + F Sbjct: 122 HDRIEDVPEGAVILLANEFLDALAIRQFVRTADGWAERFV-----QGPAFVTQPASDLPP 176 Query: 200 FLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHT 259 G I E P + ++ RL GTA+ +DYGY + GDTLQA++ Sbjct: 177 GPFDRPVPCGEILECCPDALAVARHVAARLCRAPGTALFVDYGYDGAVWGDTLQALRDGQ 236 Query: 260 YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQ-- 317 PL +PG ADL++HVDF ++ +G TQG L LG++ RA L + Sbjct: 237 PAWPLADPGLADLTAHVDFAAFAAAVRDGGAVCHGSVTQGALLGALGLFARAEQLARNRA 296 Query: 318 TARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMP 359 + D+ +RL A MG LFK L ++ + + P Sbjct: 297 PGEAYAIRDAAQRL----AAPDRMGRLFKALAITSPGLPVPP 334 >gi|75909412|ref|YP_323708.1| hypothetical protein Ava_3205 [Anabaena variabilis ATCC 29413] gi|75703137|gb|ABA22813.1| Protein of unknown function DUF185 [Anabaena variabilis ATCC 29413] Length = 404 Score = 242 bits (618), Expect = 6e-62, Method: Composition-based stats. Identities = 96/382 (25%), Positives = 156/382 (40%), Gaps = 35/382 (9%) Query: 3 NKLIRKIVNLIKKNG--QMTVDQYFALCVADPEFGYYSTCN-PFG-AVGDFVTAPEISQI 58 + L I + I + ++T +Y + + PE+GYYS+ G GDF T+ + Sbjct: 5 SALQTAITHRIANSPEARITFAEYMDMALYHPEYGYYSSNTVKIGFKGGDFFTSVNLGAD 64 Query: 59 FGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVET 118 G++LA + WE G P+ LVE+G G+G++ L IL+ I P+ F+ L +VE Sbjct: 65 LGDLLAEQFVQMWEILGKPTPFYLVEMGAGQGLLALHILKYIQVKYPNLFTALQYLIVEK 124 Query: 119 SERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERM 178 S L Q+++L + + + ++ + F +NE D+LP+ QF++ +RE Sbjct: 125 SPGLKQEQQERLQGFSVRWCSWEEISPNSITGCF-FSNELVDALPVHQFILEGGELREIY 183 Query: 179 IDIDQHDSLVFNIGDHEIKSNFLTC-----------------------SDYFLGAIFENS 215 + + + LT Y G E + Sbjct: 184 LTVQGEKEAQEAKNLSPSPNYELTEVAAAPSTPRLAEYFDLVGINLAQGGYEDGYRSEIN 243 Query: 216 PCRDREMQSISDRLACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQA 270 + ++DRL I Y Y R TLQ H + +P +N GQ Sbjct: 244 LAALDWLSIVADRLQRGYVITIDYGYPASRYYNPRRSQGTLQCYYQHRHHNNPYINIGQQ 303 Query: 271 DLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKR 330 D+++HVDF L L G T Q FL LG+ R +L Q LL ++ Sbjct: 304 DITAHVDFTALERWGDRCGLEKLGFTQQALFLMALGLGDRIAALSYQQIPVSELLHR-RQ 362 Query: 331 LVSTSADKKSMGELFKILVVSH 352 + D +G F +L+ S Sbjct: 363 ALHQLIDPTGLGG-FGVLIQSK 383 >gi|170077325|ref|YP_001733963.1| hypothetical protein SYNPCC7002_A0702 [Synechococcus sp. PCC 7002] gi|169884994|gb|ACA98707.1| Protein of unknown function COG1565 [Synechococcus sp. PCC 7002] Length = 389 Score = 242 bits (618), Expect = 6e-62, Method: Composition-based stats. Identities = 92/373 (24%), Positives = 157/373 (42%), Gaps = 25/373 (6%) Query: 1 MENKLIRKIVNLIKK--NGQMTVDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEISQ 57 M + L+ + ++ + ++T ++ L + P GYYS+ GA GDF TA + Sbjct: 3 MASPLLAILQEKLQTAPHQRLTFAEFMELALYHPTVGYYSSGKVAIGAQGDFFTATSLGP 62 Query: 58 IFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVE 117 FGE+LA L+ + G S ++VE+G G+G + DIL + + P + Y++E Sbjct: 63 DFGELLAEQLLQMKQILGR-SPFQIVEMGAGKGDLAKDILFYLQQHYPKELEEIEYYIIE 121 Query: 118 TSERLTLIQKKQLASYGDKINWYTSLADVPLGFT--FLVANEFFDSLPIKQFVMTEHGIR 175 S L Q+++LA + + +++ ++NE D+ P+ + ++ Sbjct: 122 KSPALRQQQQEKLAGFSYCKIQWCDWSELLENSIQGCFISNELLDAFPVHLVTVNAGKLQ 181 Query: 176 ERMIDIDQHDSLVFNIGD----HEIKSNFLTCS----DYFLGAIFENSPCRDREMQSISD 227 E + + + + + D EI++ F + +Y G E + ++ + Sbjct: 182 EVYVQYNPAQNALEEVNDSLSTPEIQAYFDHLNIDFSNYPDGYRTEVNLGMLSWLEQLQT 241 Query: 228 RLACDGGTAIVIDY------GYLQSRVGDTLQAVKGHTYVS-PLVNPGQADLSSHVDFQR 280 +L G + IDY Y R TLQ H P N G+ DL++HVDF Sbjct: 242 KL--KTGYILTIDYGHPAAKYYHPQRSQGTLQCYFQHRRHDNPYTNLGEQDLTAHVDFTS 299 Query: 281 LSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKS 340 L + L GLT QG FL LG+ R +L + L L D Sbjct: 300 LQNHGESLGLATLGLTQQGLFLMALGLGDRLNALSQNPTDVMTLFRRRDAL-HQLIDPMG 358 Query: 341 MGELFKILVVSHE 353 +G F +LV Sbjct: 359 LGG-FYVLVQGKN 370 >gi|332709754|ref|ZP_08429713.1| hypothetical protein LYNGBM3L_43420 [Lyngbya majuscula 3L] gi|332351581|gb|EGJ31162.1| hypothetical protein LYNGBM3L_43420 [Lyngbya majuscula 3L] Length = 421 Score = 242 bits (617), Expect = 6e-62, Method: Composition-based stats. Identities = 95/399 (23%), Positives = 159/399 (39%), Gaps = 49/399 (12%) Query: 2 ENKLIRKIVNLIKK--NGQMTVDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEISQI 58 L + I N I N +T QY L + P+ GYY++ G+ GDF T+P + + Sbjct: 12 TQPLSKVIANHIATAPNQGITFAQYMDLALYHPQLGYYASGAVNIGSSGDFFTSPHLGKD 71 Query: 59 FGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVET 118 FGE+LA W+ G P+ L+E+G G+G++ D+L + + PD + ++ +VE Sbjct: 72 FGELLAEQFAQMWDILGQPNPFTLMEVGAGQGLLAADVLVYLHQHYPDCYGAVNYIIVEK 131 Query: 119 SERLT------------LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQ 166 + + S + + + D L +NE D+LP+ Q Sbjct: 132 ATAMIAQQKQLLLKLNLPRLDNHQPSLPVRWCSWEEIQDNSLTGCC-FSNELVDALPVHQ 190 Query: 167 FVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSD-------------YFLGAIFE 213 V+ ++E I D + + + Y G E Sbjct: 191 VVLQAGDLKEIYIATVDQDDDDIKFVEVLDTPSTPQLREYFDLVGIDLFSGSYPEGYRCE 250 Query: 214 NSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRV------GDTLQAVKGHTYV-SPLVN 266 + +++++ RL + G + IDYGY + TLQ H + +P Sbjct: 251 VNLAALDWIKTVAKRL--NQGFVLTIDYGYPAQKYYLPARDQGTLQCYYRHRHHNNPYSY 308 Query: 267 PGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMK-------QTA 319 G+ D+++HVDF L L L GLT QG FL LG+ R +L + Sbjct: 309 IGEQDITAHVDFTALEQQGELCGLGKVGLTKQGLFLMALGLGDRIAALSEPREGGNANAT 368 Query: 320 RKD--ILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE 356 +D ++ +RL D +G F +L+ S Sbjct: 369 AQDVIKIMQRRQRL-HQLIDPTGLGG-FGVLIQSKGLTP 405 >gi|91789925|ref|YP_550877.1| hypothetical protein Bpro_4086 [Polaromonas sp. JS666] gi|91699150|gb|ABE45979.1| protein of unknown function DUF185 [Polaromonas sp. JS666] Length = 394 Score = 241 bits (616), Expect = 9e-62, Method: Composition-based stats. Identities = 96/376 (25%), Positives = 160/376 (42%), Gaps = 36/376 (9%) Query: 1 MENKLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYST-CNPFG------AVG--DFV 50 + L I I G + DQ+ AL + P GYY+ FG G DFV Sbjct: 30 LTTGLQAHIAKAIAEAGGWIGFDQFMALALYTPGLGYYANDSRKFGLMPGGVKDGGSDFV 89 Query: 51 TAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV 110 TAPE+S FG+ LA L A + G + E G G G + L +L + L V Sbjct: 90 TAPELSPRFGQALARQLAQALQATG---TSEVWEFGAGSGALALQLLDTLGPL------V 140 Query: 111 LSIYMVETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMT 170 +V+ S L Q+++LA++ K++W + L G +V NE D++P+K Sbjct: 141 ARYTIVDVSGSLMQRQRERLAAHAGKVHWASELPAEMRG--VVVGNEVLDAMPVKLLSRL 198 Query: 171 EHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLA 230 E ER + + Q F D + + E P + +++++DRL Sbjct: 199 EKVWHERGVVVHQE---RFTWADQRTELRPPLEVAGAHDYLTEIHPQAEGFIRTLADRLE 255 Query: 231 CDGGTAIVIDYG----YLQSRVGDTLQAVKGH-TYVSPLVNPGQADLSSHVDFQRLSSIA 285 + + Y R T+ +GH + L++ G D+++HV+F ++ Sbjct: 256 AGAVFLLDYGFPEHEYYHPQRSMGTVMCHRGHLSDTDALLDVGSKDITAHVNFTGIALAG 315 Query: 286 ILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELF 345 L + G T+QG+FL G+ + + + + V++L++ + MGELF Sbjct: 316 QDAGLQVLGYTSQGRFLLNCGLLEGMEGGVDSANLAERAM--VQKLIA----EHEMGELF 369 Query: 346 KILVVSH-EKVELMPF 360 K++ S E M F Sbjct: 370 KVIAFSKGAPWEPMGF 385 >gi|258514907|ref|YP_003191129.1| hypothetical protein Dtox_1649 [Desulfotomaculum acetoxidans DSM 771] gi|257778612|gb|ACV62506.1| protein of unknown function DUF185 [Desulfotomaculum acetoxidans DSM 771] Length = 386 Score = 241 bits (616), Expect = 1e-61, Method: Composition-based stats. Identities = 75/373 (20%), Positives = 146/373 (39%), Gaps = 20/373 (5%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEM 62 L I + I+ G +T ++ + + PE GYY++ G GD+ T+ ++ +F M Sbjct: 6 SLTGIIKSFIELEGPVTFARFMEMALYYPELGYYASVREKIGRKGDYYTSSDVHALFAGM 65 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 +A W G P + +E G G+G + D L + + PD ++ L+ ++++ S Sbjct: 66 IARQAAQMWAILGHPPVWQFIEYGAGKGKLAYDFLNQLQQQYPDCYAALTYWIIDVSPDF 125 Query: 123 TLIQKKQLASYGDKINWYTSLA--------DVPLGFTFLVANEFFDSLPIKQFVMTEHGI 174 Q+ L+ + + +NE D+ P+ + M E G+ Sbjct: 126 REKQQAILSGLNLPPGKVSWADSPAQILELQGNRITGCIFSNELIDAFPVHRVRMREDGL 185 Query: 175 RERMIDIDQHDSLVFNIGDHEIKSNFLTCSD---YFLGAIFENSPCRDREMQSISDRLAC 231 +E +D + + E G E + + +++ ++ L Sbjct: 186 KEIYVDYRDNRFVEVEGLLSEKLLQDYFAKQRVALKTGQTAEVNLAAIKWLKNQAECLEK 245 Query: 232 DGG----TAIVIDYGYLQSRVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAI 286 + D Y ++R TL+ + HT P G+ D++++V+F L Sbjct: 246 GYIITIDYGLTSDNLYNRARFDGTLRCFRRHTLNDDPYQYIGEQDITANVNFSALEIWGK 305 Query: 287 LYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFK 346 L + GL TQ FL GI + + + L ++ + + MG FK Sbjct: 306 EAGLNMAGLVTQSDFLLNAGILDILKTSDDYSFNEKKLHTTL--AIKQLIMPEGMGRYFK 363 Query: 347 ILVVSHEKVELMP 359 +L+ H+ + P Sbjct: 364 VLIQ-HKGLPAEP 375 >gi|9294283|dbj|BAB02185.1| unnamed protein product [Arabidopsis thaliana] Length = 378 Score = 241 bits (616), Expect = 1e-61, Method: Composition-based stats. Identities = 115/379 (30%), Positives = 186/379 (49%), Gaps = 45/379 (11%) Query: 25 FALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVE 84 + +P+ G+Y + FGA GDF+T+PE+SQ+FGEM+ ++ +C WEQ G P V LVE Sbjct: 1 MEEVLTNPKAGFYMNRDVFGAQGDFITSPEVSQMFGEMIGVWTVCLWEQMGRPERVNLVE 60 Query: 85 LGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGD--------- 135 LGPGRG +M D+LR K K L I++VE S L +Q + L + Sbjct: 61 LGPGRGTLMADLLRGTSKFKNFT-ESLHIHLVECSPALQKLQHQNLKCTDESSSEKKAVS 119 Query: 136 -----KINWYTSLADVPLG-FTFLVANEFFDSLPIKQF-----VMTEHGIRERMIDIDQH 184 ++W+ +L +VP G T ++A+EF+D+LP+ QF + G E+M+D+ + Sbjct: 120 SLAGTPVHWHATLQEVPSGVPTLIIAHEFYDALPVHQFQTQYLQKSTRGWCEKMVDVGED 179 Query: 185 DSLVFNIGDHEIK--------SNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTA 236 F + + T + E SP Q ++ R+ DGG A Sbjct: 180 SKFRFVLSPQPTPAALYLMKRCTWATPEEREKMEHVEISPKSMDLTQEMAKRIGSDGGGA 239 Query: 237 IVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILY--KLYING 294 ++IDYG + + D+LQA++ H +V+ L +PG ADLS++VDF + A + ++G Sbjct: 240 LIIDYGMN-AIISDSLQAIRKHKFVNILDDPGSADLSAYVDFPSIKHSAEEASENVSVHG 298 Query: 295 LTTQGKFLEGLGIWQRAFSLMK--QTARKDILLDSVKRLVSTSAD----------KKSMG 342 TQ +FL LGI R +L++ + + L +LV MG Sbjct: 299 PMTQSQFLGSLGINFRVDALLQNCNDEQAESLRAGYWQLVGDGEAPFWEGPNEQTPIGMG 358 Query: 343 ELFKILVVSHEKVELM-PF 360 + + + ++ + PF Sbjct: 359 TRYLAMSIVNKNQGIPAPF 377 >gi|241661669|ref|YP_002980029.1| hypothetical protein Rpic12D_0045 [Ralstonia pickettii 12D] gi|240863696|gb|ACS61357.1| protein of unknown function DUF185 [Ralstonia pickettii 12D] Length = 397 Score = 241 bits (616), Expect = 1e-61, Method: Composition-based stats. Identities = 83/377 (22%), Positives = 146/377 (38%), Gaps = 27/377 (7%) Query: 5 LIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTCN-PFGAV----GDFVTAPEISQI 58 L I + I G + D+Y L + P GYYS FG GDFVTAPE++ Sbjct: 21 LFAVIADAIAGAGGWLPFDRYMELALYAPGLGYYSGGAAKFGRRVEDGGDFVTAPELTPF 80 Query: 59 FGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVET 118 FG +A + + ++E G G G + DIL + L S +VE Sbjct: 81 FGRTVAHQIAQVLQAL-PEGQRHVLEFGAGTGKLAADILTELDALGARPDS---YGIVEL 136 Query: 119 SERLTLIQKKQLASYGDKINWYTSLADVPLGFT--FLVANEFFDSLPIKQFVMTEHGIRE 176 S L Q+++L + G ++ D ++ NE D++P+ + + Sbjct: 137 SGELRQRQQERLTALGPQLAALARWHDTLPAPFTGVMIGNEVLDAMPVSLWARRGGVWHQ 196 Query: 177 RMIDIDQHDSLVFNIGDHEIKSNFLTCSDYF--LGAIFENSPCRDREMQSISDRLACDGG 234 R + +D + L + + + E+ + ++S L Sbjct: 197 RGVMLDADNGLQWEDRLVSPSEVPAKLAALPGTDDFVTESHEAAEGFIRSAGAALERGLL 256 Query: 235 TAI-----VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYK 289 I +Y + G + + H + P PG D+++HVDF ++ Sbjct: 257 LLIDYGFPAAEYYHAHRANGTLMCHYRQHAHDDPFWLPGLQDITAHVDFSGIAQAGQEAG 316 Query: 290 LYINGLTTQGKFLEGLGIWQRAFSLMKQTA-RKDILLDSVKRLVSTSADKKSMGELFKIL 348 L + G T+Q +FL G+ + L + ++V++L+S + MGELFK + Sbjct: 317 LELLGYTSQARFLLSAGVGELLMRLDPSDPMQFLPAANAVQKLLS----EAEMGELFKAI 372 Query: 349 VVSH---EKVELMPFVN 362 + + L F + Sbjct: 373 ALGKGIDAALPLAGFAD 389 >gi|126138720|ref|XP_001385883.1| hypothetical protein PICST_68115 [Scheffersomyces stipitis CBS 6054] gi|126093161|gb|ABN67854.1| predicted protein [Scheffersomyces stipitis CBS 6054] Length = 515 Score = 241 bits (616), Expect = 1e-61, Method: Composition-based stats. Identities = 112/415 (26%), Positives = 186/415 (44%), Gaps = 60/415 (14%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFG-AVGDFVTAPEISQIFGE 61 N L IK G +++ Y C+ P+FGYY+T +P GDF+T+PEIS +FGE Sbjct: 101 NNLTDLFTQTIKTTGPISLSAYMRQCLTHPDFGYYTTRDPLDHRSGDFITSPEISSVFGE 160 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM--VETS 119 M+ I+L W+ FP +R++E GPG+G +M D+L K + I + +E S Sbjct: 161 MIGIWLYTVWQNQNFPGKIRIIEFGPGKGTLMHDVLNTFNKFVFKSRKAVKIEINLIEAS 220 Query: 120 ERLTLIQKKQLAS------------------YGDKINWYTSLADVPLGFTF---LVANEF 158 + L Q K L + + I W + D+ ++A+EF Sbjct: 221 KVLRQEQWKLLCGEKNEFQTDNEGFNLSKTIWSNDIKWLDTEKDIIQDPDVANYVLAHEF 280 Query: 159 FDSLPIKQFVMTEHGIRERMIDIDQ---------------------HDSLVFNIGDHEIK 197 FD+LPIK F TEHG RE +++ + + + E Sbjct: 281 FDALPIKGFERTEHGWRELLVEHTESVVNTQPKLPGTSIDEDDSLLNTEFHLTLSKKETP 340 Query: 198 SNFLT-----CSDYFLGAIFENSPCRDREMQSISDRLA--CDGGTAIVIDYGYLQSRVGD 250 S+ + D +G+ E + + ++ + G +VIDYG + + Sbjct: 341 SSIIPTLHRRFKDLPVGSRIEICSEAELYIMKMAQLVNNEKKLGAVLVIDYGLVNQIPEN 400 Query: 251 TLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQR 310 +L+ + H +VSP + PG DLS VDF L ++ + + G QG +L +G+ R Sbjct: 401 SLRGIYQHKFVSPFIKPGDVDLSVDVDFDNLVNLTSTH-CSVFGPIDQGDWLHNIGVGYR 459 Query: 311 AFSLMKQT----ARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM-PF 360 L+KQ +++ + +S +RL D+KSMG+++K L + + F Sbjct: 460 VDQLIKQNDHDHEKQEKIYNSYRRLTDK--DEKSMGKIYKFLCLGPHDSPIPAGF 512 >gi|115395892|ref|XP_001213585.1| conserved hypothetical protein [Aspergillus terreus NIH2624] gi|114193154|gb|EAU34854.1| conserved hypothetical protein [Aspergillus terreus NIH2624] Length = 809 Score = 241 bits (615), Expect = 1e-61, Method: Composition-based stats. Identities = 112/469 (23%), Positives = 179/469 (38%), Gaps = 112/469 (23%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-----NPFGAVGDFVTAPEIS 56 L + + N IK G + + + + P+ GYY+T FG GDFVT+PEIS Sbjct: 337 STPLAKTLANAIKVTGPIPIAAFMRQVLTSPDGGYYTTRPKGDGEVFGKKGDFVTSPEIS 396 Query: 57 QIFGEMLAIFLICAWEQHGFPSC-VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 Q+FGE++ I+ I W G V+L+E+GPG+G +M D+LR K S+ IY+ Sbjct: 397 QVFGELVGIWTIAEWMAQGRKRSGVQLMEVGPGKGTLMDDMLRTFRNFKTFTSSIEGIYL 456 Query: 116 VETSERLTLIQKKQLASY--------------------GDKINWYTSLADVPLGFTFLVA 155 VE S L +QK+ L + L F+ A Sbjct: 457 VEASPTLREVQKQLLCGEAAMEETDIGHRSVCKYFDVPVVWVEDIRLLPHEQDKTPFIFA 516 Query: 156 NEFFDSLPIKQFVMTE-----------------------------HGIRERMIDIDQH-- 184 +EFFD+LPI F RE M+ ++ Sbjct: 517 HEFFDALPIHAFESVPPSPENHPPDEIMTPTGPTKLHTPPKPTNTPQWRELMVTLNPKAV 576 Query: 185 -----DSLVFNIGDHEIKSN----------FLTCSDYFLGAIFENSPCRDREMQSISDRL 229 D F + + + GA E SP + R+ Sbjct: 577 DENQPDEPEFTLTRAKASTPSSLVIPEISARYRALKSQPGATIEVSPESRIYASDFARRI 636 Query: 230 ------------------------------ACDGGTAIVIDYGYLQSRVGDTLQAVKGHT 259 + G A+++DYG + + ++L+ ++ H Sbjct: 637 GGASQPPRTATRRDQLSSADATTGPAAAAKSVPSGAALIMDYGTMSTIPINSLRGIQNHR 696 Query: 260 YVSPLVNPGQADLSSHVDFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLM-- 315 V PL PGQ D+S+ VDF L+ AI + ++G QG FL+ +GI +R L+ Sbjct: 697 NVPPLSAPGQVDVSADVDFVALAEAAIEASEGVEVHGPVEQGDFLQAMGITERMQQLLRG 756 Query: 316 -KQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE---LMPF 360 + ++ L +RLV MG+++K++ + E + F Sbjct: 757 VQDEEKRKTLESGWQRLVEKGG--GGMGKIYKVMAIIPENNGQRRPVGF 803 >gi|157803232|ref|YP_001491781.1| succinate dehydrogenase iron-sulfur subunit [Rickettsia canadensis str. McKiel] gi|157784495|gb|ABV72996.1| succinate dehydrogenase iron-sulfur subunit [Rickettsia canadensis str. McKiel] Length = 358 Score = 241 bits (615), Expect = 1e-61, Method: Composition-based stats. Identities = 119/356 (33%), Positives = 181/356 (50%), Gaps = 18/356 (5%) Query: 8 KIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFL 67 KI LI +NG +T D + YY + GDF+TAPE+SQ+FGE++ ++ Sbjct: 6 KIRQLINQNGYITCDVLIQEILYSNPASYYRQTKSLASEGDFITAPEVSQLFGEIIGLWC 65 Query: 68 ICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQK 127 I W++ G P + LVELGPGRG++M D+LR L P+F+ LSI ++E ++ QK Sbjct: 66 IKEWQRIGCPKSLSLVELGPGRGLLMRDLLRTAK-LVPEFYKALSIELIEINQNFIAHQK 124 Query: 128 KQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ-HDS 186 L I + + D+P T +VANEFFD++PIKQ++ + ER+ + Sbjct: 125 SNLQDINLPIKHLSFIEDIPKKPTIIVANEFFDAMPIKQYIKVKELWYERIFVVQPVDGR 184 Query: 187 LVFNIGDHEIKSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY 243 + ++ + + GA+ E S ++ I+ L G+ ++IDYGY Sbjct: 185 IKYDKISVNKQLQEYLLQTHIEAKDGAVLEESYKSIEIIKFIAQHLKKLSGSCLIIDYGY 244 Query: 244 -------LQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLT 296 + + TLQAVK H Y L N G+ADLS+HVDF L ++A K+ + Sbjct: 245 DIALSNRNRYQYNPTLQAVKNHKYCPILENCGKADLSAHVDFYTLKTVAKNSKINVINTI 304 Query: 297 TQGKFLEGLGIWQRAFSLMK--QTARKDILLDSVKRLVSTSADKKSMGELFKILVV 350 Q FL GI R+ +L + I+ V+RL+S K MG LFK+L + Sbjct: 305 LQRDFLIENGILLRSKTLQDKLNNEQAQIIEKQVERLIS----PKQMGVLFKVLQI 356 >gi|317157165|ref|XP_001826262.2| hypothetical protein AOR_1_1144054 [Aspergillus oryzae RIB40] Length = 765 Score = 241 bits (614), Expect = 2e-61, Method: Composition-based stats. Identities = 111/456 (24%), Positives = 178/456 (39%), Gaps = 105/456 (23%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-----NPFGAVGDFVTAPEIS 56 L R + + IK G + + + + PE GYY+T FG GDFVT+PEIS Sbjct: 297 STPLARTLADAIKVTGPIPIAAFMRQVLTSPEGGYYTTRPAGDGEVFGKKGDFVTSPEIS 356 Query: 57 QIFGEMLAIFLICAWEQHGFPSC-VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 Q+FGE++ I+ I W G S V+L+E+GPG+G +M D+LR K S+ +IY+ Sbjct: 357 QVFGELVGIWTIAEWMAQGRKSSGVQLMEVGPGKGTLMDDMLRTFRNFKSFTSSIEAIYL 416 Query: 116 VETSERLTLIQKKQLAS--------------------YGDKINWYTSLADVPLGFTFLVA 155 VE S L +QK++L + L F++A Sbjct: 417 VEASPTLREVQKQRLCGDATMEETEIGHTSTCKYFNVPVIWVEDIRLLPHEEDKSPFIIA 476 Query: 156 NEFFDSLPIKQFVMTE--------------------------------HGIRERMIDIDQ 183 +EFFD+LPI F RE M+ ++ Sbjct: 477 HEFFDALPIHAFESVPPSPENQPPQSQDTIMTPTGPTKLHKPLKPANTPQWRELMVTLNP 536 Query: 184 H-------DSLVFNIGDHEIKS----------NFLTCSDYFLGAIFENSPCRDREMQSIS 226 + F + + + G+ E SP + Sbjct: 537 KAIDENLPNEPEFKLTHAKASTPSSLVIPEISPRYRALKSQPGSTIEISPESRIYASDFA 596 Query: 227 DRL-----------------------ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSP 263 R+ G A+++DYG + + ++L+ ++ H V P Sbjct: 597 RRIGGASQPPRTKARNASTQPAAPAKRVPSGAALIMDYGTMDTIPVNSLRGIQHHRKVPP 656 Query: 264 LVNPGQADLSSHVDFQRLSSIAIL--YKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARK 321 L PGQ D+S+ VDF L+ A+ + ++G QG FL +GI +R L+K + Sbjct: 657 LSAPGQVDVSADVDFTALAEAALEGSEGVEVHGPVEQGDFLRTMGIAERMQQLLKHEKDE 716 Query: 322 DI---LLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 + L +RLV MG+++K + + E Sbjct: 717 EKRKTLESGWQRLVEKGG--GGMGKIYKFMAIVPEN 750 >gi|288576015|ref|ZP_05978005.2| putative peptidoglycan synthetase FtsI [Neisseria mucosa ATCC 25996] gi|288566555|gb|EFC88115.1| putative peptidoglycan synthetase FtsI [Neisseria mucosa ATCC 25996] Length = 385 Score = 241 bits (614), Expect = 2e-61, Method: Composition-based stats. Identities = 84/375 (22%), Positives = 146/375 (38%), Gaps = 28/375 (7%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIF 59 +KL I IK N + ++ L + PE+GYY+ + G GDF+TAP ++ +F Sbjct: 16 SSKLFELITQEIKAQNNWIPFSRFMELALYAPEYGYYTGGSHKIGTDGDFITAPTLTPLF 75 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 G+ LA L Q L E G G G + +L+ + + Y++E S Sbjct: 76 GQTLARQLAELLPQT----AGNLYEFGAGTGHLAATLLKSLSD------DLKHYYIIELS 125 Query: 120 ERLTLIQKKQLASYGDK--INWYTSLADVPLGFT-FLVANEFFDSLPIKQFVMTEHGIRE 176 L Q++ +A + L ++P F ++ NE D++P++ T++ + Sbjct: 126 PELAERQRQFIAEHTTPQLAQKVIHLTELPKSFDGIIIGNEVLDAMPVEIIRRTQNTFQH 185 Query: 177 RMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG----AIFENSPCRDREMQSISDRLACD 232 I I + + + YF E P + + +++ ++ Sbjct: 186 IGISITPDGQFEQSPQPLKQPDLLRLAATYFPETEHPYTSELHPAQYAFILTLAQKITRG 245 Query: 233 GGTAI-----VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAIL 287 G I Y + Q G + + HT P + G DL++HV+F ++ Sbjct: 246 GMIFIDYGFDATQYYHPQRDEGTLIAHYRHHTVHDPFFHIGLTDLTAHVNFTDIAQAGTD 305 Query: 288 YKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKI 347 L + G Q FL LGI L + V V D+ MGELFK+ Sbjct: 306 GGLDLIGYLPQSHFLFNLGITDL---LAQTAPPGTADYLRVSTAVQKLTDQHEMGELFKV 362 Query: 348 LVVSHE-KVELMPFV 361 + ++ F+ Sbjct: 363 IAFGKNIDIDWTGFL 377 >gi|238493427|ref|XP_002377950.1| DUF185 domain protein [Aspergillus flavus NRRL3357] gi|220696444|gb|EED52786.1| DUF185 domain protein [Aspergillus flavus NRRL3357] Length = 507 Score = 241 bits (614), Expect = 2e-61, Method: Composition-based stats. Identities = 111/456 (24%), Positives = 179/456 (39%), Gaps = 105/456 (23%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-----NPFGAVGDFVTAPEIS 56 L R + + IK G + + + + PE GYY+T FG GDFVT+PEIS Sbjct: 39 STPLARTLADAIKVTGPIPIAAFMRQVLTSPEGGYYTTRPAGDGEVFGKKGDFVTSPEIS 98 Query: 57 QIFGEMLAIFLICAWEQHGFPSC-VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 Q+FGE++ I+ I W G S V+L+E+GPG+G +M D+LR K S+ +IY+ Sbjct: 99 QVFGELVGIWTIAEWMAQGRKSSGVQLMEVGPGKGTLMDDMLRTFRNFKSFTSSIEAIYL 158 Query: 116 VETSERLTLIQKKQLAS--------------------YGDKINWYTSLADVPLGFTFLVA 155 VE S L +QK++L + L F++A Sbjct: 159 VEASPTLREVQKQRLCGDATMEETEIGHKSTCKYFNVPVIWVEDIRLLPHEEDKSPFIIA 218 Query: 156 NEFFDSLPIKQFVMTE--------------------------------HGIRERMIDIDQ 183 +EFFD+LPI F RE M+ ++ Sbjct: 219 HEFFDALPIHAFESVPPSPENQPPQSQDTIMTPTGPTKLHKPLKPANTPQWRELMVTLNP 278 Query: 184 H-------DSLVFNIGDHEIKS----------NFLTCSDYFLGAIFENSPCRDREMQSIS 226 + F + + + G+ E SP + Sbjct: 279 KAIDENLPNEPEFKLTHAKASTPSSLVIPEISPRYRALKSQPGSTIEISPESRIYASDFA 338 Query: 227 DRL-----------------------ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSP 263 R+ G A+++DYG + + ++L+ ++ H V P Sbjct: 339 RRIGGASQPPRTKARNASTQPAAPAKRVPSGAALIMDYGTMDTIPVNSLRGIQHHRKVPP 398 Query: 264 LVNPGQADLSSHVDFQRLSSIAIL--YKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARK 321 L +PGQ D+S+ VDF L+ A+ + ++G QG FL +GI +R L+K + Sbjct: 399 LSSPGQVDVSADVDFTALAEAALEGSEGVEVHGPVEQGDFLRTMGIAERMQQLLKHEKDE 458 Query: 322 DI---LLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 + L +RLV MG+++K + + E Sbjct: 459 EKRKTLESGWQRLVEKGG--GGMGKIYKFMAIVPEN 492 >gi|86606641|ref|YP_475404.1| hypothetical protein CYA_1998 [Synechococcus sp. JA-3-3Ab] gi|86555183|gb|ABD00141.1| conserved hypothetical protein [Synechococcus sp. JA-3-3Ab] Length = 408 Score = 241 bits (614), Expect = 2e-61, Method: Composition-based stats. Identities = 94/384 (24%), Positives = 160/384 (41%), Gaps = 29/384 (7%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIFG 60 + +L I + I+ G +T Q+ + +P GYY + P G D+ T+P ++ F Sbjct: 24 DRRLPELIGSRIRARGPVTFAQFMEWALYEPGLGYYEREHLPLGW--DYRTSPHLAADFA 81 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 ++LA + W G P ++E+G G G + D L +C PDF+ L ++E S Sbjct: 82 QLLAEQIFQFWHILGSPPHFAVIEMGAGSGRLAEDWLAYVCSNWPDFWQALEYGILERSA 141 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLG-FTFLVANEFFDSLPIKQFVMTEHGIRERMI 179 L +Q+++LA YG+K+ W +NE D+ P+ + + +RE + Sbjct: 142 FLRRLQQERLAGYGEKVRWLEWDEIPDGSVTGCFFSNELVDAFPVHRVQVQAGALREIYV 201 Query: 180 DIDQHDSLVFNIGDHEIKSNF-------LTCSDYFLGAIFENSPCRDREMQSISDRLACD 232 D + + +GD + + + Y G E + +Q ++ +L Sbjct: 202 DWAEGEGFREVLGDLSTPALEAYFARLGIPIATYPSGYQTEVNLKALEWLQLLARKLRR- 260 Query: 233 GGTAIVIDYGYLQSRV------GDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSIA 285 G + +DYG+ R TL A + H P V G DL++HVDF L + Sbjct: 261 -GYVLTLDYGHTAQRYYSPQRFQGTLLAYRQHGTYADPYVWVGSQDLTAHVDFTTLQQVG 319 Query: 286 ILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKR--LVSTSADKKSMGE 343 L G T Q FL LG+ +R +L + L ++R + D +G Sbjct: 320 ESLGLKCLGFTQQSCFLVNLGLAERLAALGQVGDEGGDLGQVLQRRHALHALLDPLGLGS 379 Query: 344 LFKILVVSH------EKVELMPFV 361 F +L+ + L FV Sbjct: 380 -FGVLLQAKGLSEAEAAQPLQGFV 402 >gi|197119166|ref|YP_002139593.1| hypothetical protein Gbem_2793 [Geobacter bemidjiensis Bem] gi|197088526|gb|ACH39797.1| protein of unknown function DUF185 [Geobacter bemidjiensis Bem] Length = 386 Score = 240 bits (613), Expect = 2e-61, Method: Composition-based stats. Identities = 88/370 (23%), Positives = 157/370 (42%), Gaps = 26/370 (7%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFG 60 KL I+N I+ +G +T + + +P+ GYY++ GA GDF T+ + FG Sbjct: 7 TTKLAEIILNRIRTSGDITFASFMESALYEPDLGYYTSAGRKVGAEGDFYTSMNVHSAFG 66 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 ++A + WEQ P+ + E G G G + DIL I + P F++ L+ ++E Sbjct: 67 RLIAQEICRFWEQLDSPASFTIAEAGAGGGQLAQDILDAISEDNPRFYNGLTYRLIEKEP 126 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGF----TFLVANEFFDSLPIKQFVMTEHGIRE 176 L Q +L+ + D++ ++S ++ G +++NE FD++P+ +TE G+RE Sbjct: 127 SLQQAQAARLSRHADRL-AWSSPDELASGTLSFTGCIISNELFDAMPVHIVELTEAGLRE 185 Query: 177 RMIDIDQHDSLVFNIGDHEIKSNFLTCSDYF----LGAIFENSPCRDREMQSISDRLACD 232 + + V + Y G E + + + L Sbjct: 186 VYVSANADG-FVERLLPPSTPELEKYLRKYEVRLLPGQRAEINLAASGWIAQAAATLTR- 243 Query: 233 GGTAIVIDYG------YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSIA 285 G + IDYG Y R TL H+ +P G+ D+++H++F +L Sbjct: 244 -GFVLTIDYGFLSGELYTPQRRNGTLLCYYKHSTNENPYQLVGEQDITTHINFSQLIVDG 302 Query: 286 ILYKLYINGLTTQGKFLEGLGIWQ---RAFSLMKQTARKDILLDSVKRLVSTSADKKSMG 342 L Q +FL G+ + R + K ++K+L+ + MG Sbjct: 303 EEAGLKKAWYGEQYRFLLSAGLMEELIRLEAQAKDEQESLKHRLALKKLMLP---EGGMG 359 Query: 343 ELFKILVVSH 352 + FK+L+ S Sbjct: 360 DTFKVLIQSK 369 >gi|326431506|gb|EGD77076.1| hypothetical protein PTSG_07416 [Salpingoeca sp. ATCC 50818] Length = 403 Score = 240 bits (613), Expect = 2e-61, Method: Composition-based stats. Identities = 109/385 (28%), Positives = 176/385 (45%), Gaps = 39/385 (10%) Query: 9 IVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPF-GAVGDFVTAPEISQIFGEMLAIFL 67 + ++I G M+V Y + P GYY P GDFVT+P++SQ+FGEM+ + Sbjct: 14 LHSVITTTGPMSVASYMKHVLTHPLHGYYVRQKPLDNDRGDFVTSPQLSQMFGEMVGAWT 73 Query: 68 ICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSI-YMVETSERLTLIQ 126 + W+ G P+ V VELGPG G++M DI++ L +V++ ++VE S ++ Q Sbjct: 74 VKEWQLSGKPTSVNFVELGPGTGLLMHDIIQSFTSLTKQEANVVTDVHLVEASPVMSQQQ 133 Query: 127 KKQL-------------------------ASYGDKINWYTSLADVPLGFTFLVANEFFDS 161 + L A + GFTFL A+EFFD+ Sbjct: 134 YETLGCGQAPDLSNVDLTRNEEYLTGRGHAGINFHWYRHLWALPKLPGFTFLYAHEFFDT 193 Query: 162 LPIKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKS-----NFLTCSDYFLGAIFEN 214 +P QF +TE G RERM+D+ L F + E + +G + E Sbjct: 194 MPTHQFQLTEDGWRERMVDVCDEAPHQLRFVLSPKETVQSKLYASLPLVRTARVGDVGEV 253 Query: 215 SPCRDREMQSISDRLACDGG--TAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADL 272 S + + ++ LA A++ DYG ++ DT +A + H V PG ADL Sbjct: 254 SFEAMEQAELVAHTLATSTMGGRALIFDYGDT-TKFDDTFRAFRDHQQVHVFDQPGSADL 312 Query: 273 SSHVDFQ-RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRL 331 ++ V+F L + A +++ +G TQ +FL G+ QRA +L+ T S+++ Sbjct: 313 TTDVNFDHLLVAAARSHQVVSHGPVTQRQFLLRCGLQQRADALLASTTD-AKARASIEKH 371 Query: 332 VSTSADKKSMGELFKILVVSHEKVE 356 D MGE FK++ ++ + Sbjct: 372 TRMLTDPDEMGERFKVMAMTDANAD 396 >gi|289742045|gb|ADD19770.1| ATP synthase beta subunit/transcription termination factor rho-like protein [Glossina morsitans morsitans] Length = 431 Score = 240 bits (613), Expect = 2e-61, Method: Composition-based stats. Identities = 119/390 (30%), Positives = 181/390 (46%), Gaps = 42/390 (10%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLA 64 L +++ I G +TV Y + P+ GYY + FG GDF+T+PEISQIF E++ Sbjct: 49 LTKQLKAKILSTGPITVADYMREVLTHPQGGYYMCKDVFGREGDFITSPEISQIFAELVG 108 Query: 65 IFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTL 124 I+ + W + G +LVELGPGRG ++ D+LRV+ K D I++VE S L Sbjct: 109 IWFLTEWYKLGS-LEFQLVELGPGRGTLIRDLLRVLTHFKVDPQFS--IHLVEISPYLGG 165 Query: 125 IQKKQLAS-------------------YGDKINWYTSLADVPLGFTFLVANEFFDSLPIK 165 +Q +++ G K+ WY DVP F+ +VA+EFFD+LPI Sbjct: 166 LQAERICHGSTLIDDNNSRFYRKGETPSGIKVFWYKHFEDVPRNFSLVVAHEFFDALPIH 225 Query: 166 QFVMTEHGIRERMIDIDQH----DSLVFNIGDHEIKS-NFLTCSDYFLGAIFENSPCRDR 220 + + + +E +IDID + F + + + E S DR Sbjct: 226 KLQLDNNLWKEVLIDIDPNVSEKSEFRFVLSKEQTPVSKIYQPLENDTRLSLEYSLESDR 285 Query: 221 EMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQR 280 + ++ RL DGG +VIDYG+ + DT +A + H PLV+PG ADL++ VDF+R Sbjct: 286 LISMLAKRLQEDGGIGLVIDYGHFGDKT-DTFRAFRKHALHDPLVDPGSADLTADVDFRR 344 Query: 281 LSSIAILYK-LYINGLTTQGKFLEGLGIWQRAFSLMKQ--TARKDILLDSVKRLVSTSAD 337 + A + G QG FL+ + R L+K + +L + L D Sbjct: 345 MKHTAEKNNEVVTFGPIKQGIFLQQMQGDVRLERLLKNALPENQSLLKSGYEMLT----D 400 Query: 338 KKSMGELFKILVVSH-------EKVELMPF 360 K MG +K + K + F Sbjct: 401 PKQMGGRYKFFSIFPAVLKEHLNKFPINGF 430 >gi|238880132|gb|EEQ43770.1| conserved hypothetical protein [Candida albicans WO-1] Length = 527 Score = 240 bits (613), Expect = 2e-61, Method: Composition-based stats. Identities = 115/426 (26%), Positives = 180/426 (42%), Gaps = 73/426 (17%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFG-AVGDFVTAPEISQIFGE 61 L IK G + + Y C+ PEFGYY+T +P GDF+T+PEIS +FGE Sbjct: 107 ENLSDFFRQTIKLTGPIPLSTYMRQCLTHPEFGYYTTRDPLNLRTGDFITSPEISSVFGE 166 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSI------YM 115 M+ I+ W+Q +P +R +E GPG+G ++ DI++ K + Sbjct: 167 MIGIWYFSIWQQQKYPESIRFIEFGPGKGTLIHDIMKTFNKFVEKLLPSDQKRPKIEIAL 226 Query: 116 VETSERLTLIQKKQLAS------------------YGDKINWYTSLADVPLGF----TFL 153 +E S L Q K L + +G+ I W + D+ G F+ Sbjct: 227 IEASHVLRKEQWKLLCNPEDPMETTGEGYNRSATKWGNDIIWLDTEKDIQQGDKNVANFI 286 Query: 154 VANEFFDSLPIKQFVMTEHGIRERMIDIDQ---------------------------HDS 186 VA+EFFD+LPIK F+ E G RE +++ Sbjct: 287 VAHEFFDALPIKSFIREEKGWRELVVEHTPSVNNTQPKLEESASARGANKFETDDSLDTE 346 Query: 187 LVFNIGDHEIKSNFLT-----CSDYFLGAIFENSPCRDREMQSISDRL--ACDGGTAIVI 239 I E S+ + D +G E P + + ++ L + D G +VI Sbjct: 347 FHLTISPKETPSSMIPQISKRYRDLPVGTRIEICPDAELYIMKMAKLLSDSNDKGAILVI 406 Query: 240 DYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQG 299 DYG ++L+ + H +VSP NPG+ DLS VDFQ L + + I G QG Sbjct: 407 DYGTENEIPENSLRGIYQHKFVSPFWNPGEVDLSIDVDFQALKQLTEGI-VDIYGPLKQG 465 Query: 300 KFLEGLGIWQRAFSLMKQTAR----KDILLDSVKRLVSTSADKKSMGELFKILVVSHEKV 355 +L +GI R L+K+ + +D + + +RL D + MG ++K + + + Sbjct: 466 DWLHNIGIGYRIDQLLKKNQQDPEVQDKIYGAYRRLT----DGEQMGSIYKFMALLPKGS 521 Query: 356 E-LMPF 360 F Sbjct: 522 SNPPGF 527 >gi|95929343|ref|ZP_01312086.1| protein of unknown function DUF185 [Desulfuromonas acetoxidans DSM 684] gi|95134459|gb|EAT16115.1| protein of unknown function DUF185 [Desulfuromonas acetoxidans DSM 684] Length = 388 Score = 240 bits (613), Expect = 2e-61, Method: Composition-based stats. Identities = 84/359 (23%), Positives = 140/359 (38%), Gaps = 15/359 (4%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEML 63 L + I + I + G ++ Y C+ P+ GYY GA GDF T+ + +FG ++ Sbjct: 13 LEQIIADDISQRGGLSFCDYMQHCLYHPQHGYYMAARQRVGAKGDFFTSSSVHSLFGALI 72 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 A L W+ G +VE G G G + LDIL + P ++V+ +++ S Sbjct: 73 ARQLHQMWQLLGS-GSFTVVEQGAGDGFLALDILETLQADYPQMYAVIRYVLIDVSADNR 131 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 Q++ L + ++I + + ++ ++NE D+ + + ++E + Sbjct: 132 RRQQEHLHRHAEQI-TWQNFDELGSFTGCFLSNELLDAFAVHVVEKHDGQLQEVYVVQGG 190 Query: 184 HD---SLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 L G + G E ++S++++L G + ID Sbjct: 191 QGLEEELRTVDGPQFAHHFERVGAGVMEGCRGEVCLAVQPWVESVAEKLEQ--GFVLTID 248 Query: 241 YGYLQS------RVGDTLQAVKGHTYVS-PLVNPGQADLSSHVDFQRLSSIAILYKLYIN 293 YGY R TL HT P N G D++SHVDF L L Sbjct: 249 YGYPAQELYAPFRRQGTLLCYYQHTANDNPYQNIGCQDITSHVDFTLLQRCGEHVGLETL 308 Query: 294 GLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 T Q +FL GLG + L + + L + MGE FK+L+ Sbjct: 309 HFTEQYRFLIGLGFVEELVRLQAEETDPQKAMALRMTLKNLILPDGGMGETFKVLIQGK 367 >gi|119511132|ref|ZP_01630250.1| hypothetical protein N9414_16971 [Nodularia spumigena CCY9414] gi|119464227|gb|EAW45146.1| hypothetical protein N9414_16971 [Nodularia spumigena CCY9414] Length = 390 Score = 240 bits (613), Expect = 2e-61, Method: Composition-based stats. Identities = 97/375 (25%), Positives = 154/375 (41%), Gaps = 26/375 (6%) Query: 1 MENK--LIRKIVNLIKKN--GQMTVDQYFALCVADPEFGYYSTCN-PFGAVG-DFVTAPE 54 M + L I I + ++T ++ L + PE GYYS+ G G DF T+P Sbjct: 1 MTSNPALCNAIAYHISTSPQRRITFAEFMDLALYHPEHGYYSSHAVKIGFQGSDFFTSPH 60 Query: 55 ISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIY 114 + FGE+LA W+ P LVE+G G+G++ + IL PDFF+ L Sbjct: 61 LGADFGELLAEQFWQMWDILARPVPFSLVEMGAGQGLLAMHILNYSGLHYPDFFAALDYV 120 Query: 115 MVETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGI 174 +VE S Q+++L + + + + F +NE D+LP+ QF++T+ + Sbjct: 121 IVEKSPGFQQEQQQRLQDFSVRWCSLEDIPTDSITGCF-FSNELVDALPVHQFILTDGKM 179 Query: 175 RERMIDIDQHDSLVF------NIGDHEIKSNFL------TCSDYFLGAIFENSPCRDREM 222 E + + DS + E++ T Y G E + + Sbjct: 180 HEVYVTTGKDDSEPLFLEVTGELSTPELQKYLDLVEIDLTARGYEDGYRSEINLAAGEWL 239 Query: 223 QSISDRLACDGG----TAIVIDYGYLQSRVGDTLQAVKGHTYVS-PLVNPGQADLSSHVD 277 ++DRL D Y R +LQ H + P +N G D+++HVD Sbjct: 240 SIVADRLHRGYVLTIDYGYPADRYYNPRRSQGSLQCYYNHRHHDNPYINVGMQDITAHVD 299 Query: 278 FQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSAD 337 F L +KL G QG FL LG+ +R ++ Q LL + L D Sbjct: 300 FTALERWGDRFKLEKIGFIQQGLFLMALGLGERLSAISHQELPLSQLLQRREAL-HQLID 358 Query: 338 KKSMGELFKILVVSH 352 +G F +L+ S Sbjct: 359 PTELGN-FGVLIQSK 372 >gi|254423748|ref|ZP_05037466.1| conserved hypothetical protein [Synechococcus sp. PCC 7335] gi|196191237|gb|EDX86201.1| conserved hypothetical protein [Synechococcus sp. PCC 7335] Length = 397 Score = 240 bits (613), Expect = 2e-61, Method: Composition-based stats. Identities = 92/377 (24%), Positives = 169/377 (44%), Gaps = 29/377 (7%) Query: 2 ENKLIRKIVNLIKKN--GQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQI 58 L + I I+++ ++T Q+ L + P+ GYY+T + G+ GDFVT+P +S+ Sbjct: 4 SEYLQQVITQKIEQSPDHRITFAQFMDLALYHPQIGYYATPSSSLGSQGDFVTSPHMSRD 63 Query: 59 FGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVET 118 FGE++A + WE+ G P LVE+G G+G++ D + + PD F+ LS +VE Sbjct: 64 FGEVVAEQFVDMWEKLGRPDPFDLVEMGAGQGLVAEDAIAYLQSHHPDCFATLSYTIVEK 123 Query: 119 SERLTLIQKKQLASYGDKINWYTSLADVPLGFT----FLVANEFFDSLPIKQFVMTEHGI 174 S+ L Q+++L + ++ + + +NE D+ P+ + + + Sbjct: 124 SDSLKAEQQQRLRHWNEQGISIRWQNFDAIAPSSITGCAFSNELVDAFPVHWVELRDQKL 183 Query: 175 RERMIDIDQHD--SLVFNIGDHEIKSNFLT----CSDYFLGAIFENSPCRDREMQSISDR 228 +E + + + + ++ + S F S+Y G E + + I+D+ Sbjct: 184 QEIYVIHTEEGFLAQLGDLSTPSLASYFRRIGIDLSNYPEGYRTEVNLSALEWLAQIADK 243 Query: 229 LACDGGTAIVIDYGYLQSRVG------DTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRL 281 + G + IDYGY R TLQ H + P ++ G+ D+++HVDF + Sbjct: 244 IDR--GYLLTIDYGYSAQRYYSPARTEGTLQCYYQHNHHSDPFIHIGEQDITAHVDFTAI 301 Query: 282 SSIAILYKLYINGLTTQGKFLEGLGIWQRAFSL----MKQTARKDILLDSVKRL--VSTS 335 + L G QG FL LG+ R +L +TA + ++R + Sbjct: 302 ETQGNEVGLQSLGFIQQGLFLMALGLGDRLQALYEINSNRTADPTDIQAIMQRREVLQQL 361 Query: 336 ADKKSMGELFKILVVSH 352 + G F +L+ + Sbjct: 362 INPMGFGN-FGVLIQAK 377 >gi|217969131|ref|YP_002354365.1| hypothetical protein Tmz1t_0697 [Thauera sp. MZ1T] gi|217506458|gb|ACK53469.1| protein of unknown function DUF185 [Thauera sp. MZ1T] Length = 391 Score = 240 bits (613), Expect = 2e-61, Method: Composition-based stats. Identities = 95/382 (24%), Positives = 157/382 (41%), Gaps = 37/382 (9%) Query: 2 ENKLIRKIVNLIKKN-GQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIF 59 +L+ I + G + +Y L + P GYYS FG GDF+TAPE++ +F Sbjct: 15 SARLLEHIEAELAAAAGWIPFARYMELALYAPGLGYYSGGARKFGPGGDFITAPELTPLF 74 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVI--CKLKPDFFSVLSIYMVE 117 G+ LA + EQ S L+E+G G G++ D+L + P+ + ++E Sbjct: 75 GQALAAQV----EQVMRASTPALIEVGAGTGLLAADLLLELERRGCLPERYG-----ILE 125 Query: 118 TSERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHG 173 S L Q L +++W +L + G +VANE D +P+ V G Sbjct: 126 LSGELRERQFDTLAAKVPHLAARVHWLDALPERFSG--AVVANEVLDVMPVHLLVSRAEG 183 Query: 174 IRERMIDIDQHDSLVFNIGDHEIK---------SNFLTCSDYFLGAIFENSPCRDREMQS 224 + ER + I + + + ++ + E + + + Sbjct: 184 LFERGVAIATDAAGIRRLCWADVPAAGAVAEGARALALPVPQSGEYVTELNLAGKAWVAA 243 Query: 225 ISDRLACDGGTAIVIDYG----YLQSRVGDTL-QAVKGHTYVSPLVNPGQADLSSHVDFQ 279 ++RL I Y YL SR G TL + H + P + PG D+++ VDF Sbjct: 244 WAERLHAGALLLIDYGYPRAEYYLPSRSGGTLLCYYRHHAHGDPFLWPGLNDITAFVDFT 303 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKK 339 ++ L + G TTQ +FL G+ L ++ AR+ R V + Sbjct: 304 AVAEAGFEAGLDVQGYTTQAQFLFNCGV---LECLERRGARESADYIRAARAVQRLTAPQ 360 Query: 340 SMGELFKILVVSHEKV-ELMPF 360 MGELFK++ +S L+ F Sbjct: 361 EMGELFKVIALSRAIDGPLLGF 382 >gi|58697059|ref|ZP_00372516.1| DUF185 [Wolbachia endosymbiont of Drosophila simulans] gi|58536669|gb|EAL59966.1| DUF185 [Wolbachia endosymbiont of Drosophila simulans] Length = 324 Score = 240 bits (612), Expect = 3e-61, Method: Composition-based stats. Identities = 111/334 (33%), Positives = 179/334 (53%), Gaps = 25/334 (7%) Query: 30 ADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGR 89 ++GYY++ P G GDF TAPEISQ+FGE++A++++ WE+ G PS LVELGPG+ Sbjct: 2 YHEKYGYYTSKLPLGKDGDFTTAPEISQLFGEVIAVWIMHTWEKLGKPSKFSLVELGPGK 61 Query: 90 GIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLADVPLG 149 G ++ DI+RV K FF+ + I++VE S L IQK++L S +NW+ ++ ++P Sbjct: 62 GTLIHDIIRVTKK-YSSFFNSMLIHLVEISPTLRKIQKEKLKSL--DVNWHKNIDNLPEQ 118 Query: 150 FTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHD-----------SLVFNIGDHEIKS 198 T +ANEFFD+LPI QFV + G E M+ + + Sbjct: 119 PTIFLANEFFDALPIDQFVYHDEGWYENMVTKQDDGSLLVSCQCVTLESRKKESWIPVSA 178 Query: 199 NFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGH 258 +T +F GA+ E ++ + ++ + G A+++DYGY+ TLQ++K H Sbjct: 179 TQMTNGKFFNGAVVEICSVGVEILKKLEKKIYNNKGAALIVDYGYVYPAYKSTLQSIKQH 238 Query: 259 TYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQT 318 Y + L N G +D+++ V+FQ L + TQ +FL GI +R +LMK Sbjct: 239 KYANFLENVGNSDITALVNFQALRDSLKHVDCE---ILTQREFLYLFGIKERTQALMKSA 295 Query: 319 A--RKDILLDSVKRLVSTSADKKSMGELFKILVV 350 + +K+ + RL ++MG LFK +++ Sbjct: 296 SDEQKNRIFSEFLRLT------ENMGTLFKAMLL 323 >gi|315041064|ref|XP_003169909.1| hypothetical protein MGYG_08083 [Arthroderma gypseum CBS 118893] gi|311345871|gb|EFR05074.1| hypothetical protein MGYG_08083 [Arthroderma gypseum CBS 118893] Length = 501 Score = 240 bits (612), Expect = 3e-61, Method: Composition-based stats. Identities = 116/456 (25%), Positives = 179/456 (39%), Gaps = 99/456 (21%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-----NPFGAVGDFVTAPEIS 56 L ++I + I G +++ + C+ E GYY++ + FG GDFVT+PEIS Sbjct: 42 STPLAKRITDAINTTGPISIAAFMRQCLTSDEGGYYTSRGTPGSDVFGKEGDFVTSPEIS 101 Query: 57 QIFGEMLAIFLICAWEQHGFPSC-VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 Q+FGE+L I+++ W G S V+L+E GPG+G +M D+LR + K S+ +YM Sbjct: 102 QMFGELLGIWIVTEWLSQGRRSSGVQLMEFGPGKGTLMADVLRSVRNFKAFASSIEGVYM 161 Query: 116 VETSERLTLIQKKQLASYGD--------------------KINWYTSLADVPLGFTFLVA 155 VE S L IQKK L L + F++A Sbjct: 162 VEASPTLREIQKKALCGDAPMEECEIGYKSVSTHLGVPVYWTEHIRILPESEDKAPFIIA 221 Query: 156 NEFFDSLPIKQFVMTEHGIRE-----------RMIDIDQHDSLVFNIGDHEIKSNFLTCS 204 +EFFD+LPI F E R + + + + Sbjct: 222 HEFFDALPIHAFQAVHSPPPETINTPTGPTTLRQPSLPLNGTQWRELVVATNPEAAREPD 281 Query: 205 DYF---------------------------------------LGAIFENSPCRDREMQSI 225 D G+ E SP Q I Sbjct: 282 DIDSSDKNDKKREFRLALAKSHTPASLVMPEMSPRYKALKSTRGSTIEISPESHTYAQEI 341 Query: 226 SDRL-------------ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADL 272 + + G A+++DYG + ++L+ +K H VSP PG+ DL Sbjct: 342 ARLIGGPNPTDKNPSPARTPAGAALILDYGPSSTIPVNSLRGIKNHQVVSPFATPGEVDL 401 Query: 273 SSHVDFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLM---KQTARKDILLDS 327 S+ VDF L+ A+ + + G QG FL LGI +RA L+ K ++ + S Sbjct: 402 SADVDFTGLAESALNASPGVEVYGPNEQGSFLRSLGIAERAAQLLRNVKDEEKRKQIESS 461 Query: 328 VKRLVSTSADKKSMGELFKILVVSHE---KVELMPF 360 +RLV MG+++K + + E K + F Sbjct: 462 WQRLVERGG--GGMGKIYKAMAIVPESGGKRRPVGF 495 >gi|293337173|ref|NP_001169078.1| hypothetical protein LOC100382919 [Zea mays] gi|223974807|gb|ACN31591.1| unknown [Zea mays] Length = 512 Score = 240 bits (612), Expect = 3e-61, Method: Composition-based stats. Identities = 116/463 (25%), Positives = 181/463 (39%), Gaps = 106/463 (22%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-----NPFGAVGDFVTAPEIS 56 L + + N IK G + + + + +PE GYY+T FG GDFVT+PEIS Sbjct: 46 STPLAQTLANAIKVTGPVPIAAFMRQVLTNPEGGYYTTRPEGHGEVFGKKGDFVTSPEIS 105 Query: 57 QIFGEMLAIFLICAWEQHGFPSC-VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 Q+FGE++ I+ I W G V+L+E+GPG+G +M D+LR K S+ +IY+ Sbjct: 106 QVFGELVGIWTIAEWMAQGRKRSGVQLMEVGPGKGTLMDDMLRTFRNFKMFSSSIEAIYL 165 Query: 116 VETSERLTLIQKKQLAS--------------------YGDKINWYTSLADVPLGFTFLVA 155 VE S L +QKK L + L F+ A Sbjct: 166 VEASATLREVQKKLLCGDAVMEETDIGHKSTCKYFDVPIVWVEDIRLLPHEEEKTPFIFA 225 Query: 156 NEFFDSLPIKQFVMTE------------------------------HGIRERMIDIDQ-- 183 +EFFD+LPI F RE ++ ++ Sbjct: 226 HEFFDALPIHAFESIPPSPENQPEQKEIMTPTGPAKLHQPLKPANTPQWRELLVTLNPKA 285 Query: 184 -----HDSLVFNIG----------DHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDR 228 F + S G+ E SP + R Sbjct: 286 VEENIEGEPEFKLTLAKASTPSSLVIPEISPRYRALKSQPGSTIEVSPESRIYAADFARR 345 Query: 229 L-----------------------ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLV 265 + G A+++DYG L + ++L+ ++ H V PL Sbjct: 346 IGGASEPPRTVTKGAAASAPAPAKRTSSGAALIMDYGTLNTIPINSLRGIQEHKNVPPLS 405 Query: 266 NPGQADLSSHVDFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLMKQ---TAR 320 +PGQ D+S+ VDF L+ AI + ++G QG FL+ +GI +R L+K+ + Sbjct: 406 SPGQVDVSADVDFTALAEAAIEASEGVEVHGPVEQGDFLQAMGIEERMQQLLKKVDDEEK 465 Query: 321 KDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE---LMPF 360 + L KRLV SMG+++K++ + E + F Sbjct: 466 RKTLETGWKRLVEKGG--GSMGKIYKVMAIVPENDGKRRPIGF 506 >gi|254796952|ref|YP_003081789.1| ATP synthase beta subunit/transription termination factor rho [Neorickettsia risticii str. Illinois] gi|254590199|gb|ACT69561.1| ATP synthase beta subunit/transription termination factor rho [Neorickettsia risticii str. Illinois] Length = 323 Score = 240 bits (611), Expect = 3e-61, Method: Composition-based stats. Identities = 116/346 (33%), Positives = 179/346 (51%), Gaps = 23/346 (6%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLA 64 + I N I++NG + ++ L + P GYY T NP G GD++TAPEIS +FG +A Sbjct: 1 MRSYIKNFIRENGSIAFSKFIELSMYHPSKGYYITRNPIGKTGDYITAPEISSLFGRTVA 60 Query: 65 IFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTL 124 ++++ WE+ P + +VELGPG G+MM DIL I ++ + +++YM+E S L Sbjct: 61 VWILEQWERLEKPREIAIVELGPGSGMMMFDILNTIRNVESF-YDSVTVYMIEISPFLRG 119 Query: 125 IQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQH 184 +Q + L + KI W S+ ++P G ++ANEFFD+LPI QF+ ER Sbjct: 120 VQMENLRPHSCKIRWCNSIDELPNGKVIVLANEFFDALPIDQFIFWGGTFFER------- 172 Query: 185 DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYL 244 + + E ++ + G I E S + SI R+A DGG+ +++DYG+ Sbjct: 173 -KITEDFQIEEEETRKRFSGKFKDGDIVEISLLGKQIASSILTRIAKDGGSGLIVDYGHA 231 Query: 245 QSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEG 304 T+QAVKGH ++ + G++D++ +DF L A L TQG FL Sbjct: 232 TCTRRSTIQAVKGHRFIDIFESIGESDITHEIDFSYLFPRAK--------LMTQGDFLSL 283 Query: 305 LGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVV 350 GI++ A IL ++ RLV DK MG LFK ++ Sbjct: 284 YGIFELAKR--SSATDTGILEQTLFRLV----DKGKMGRLFKCAII 323 >gi|221064985|ref|ZP_03541090.1| protein of unknown function DUF185 [Comamonas testosteroni KF-1] gi|220710008|gb|EED65376.1| protein of unknown function DUF185 [Comamonas testosteroni KF-1] Length = 375 Score = 240 bits (611), Expect = 3e-61, Method: Composition-based stats. Identities = 95/377 (25%), Positives = 147/377 (38%), Gaps = 36/377 (9%) Query: 1 MENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFG----AVGDFVTAPE 54 + + L +I I G + D++ L + P GYY+ FG + DFVTAPE Sbjct: 11 LTSALQSRIAKEIAAVGGWLPFDRFMELALYAPGLGYYANETAKFGTMPESGSDFVTAPE 70 Query: 55 ISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIY 114 IS IFG+++A L A ++ + E G G G + L IL + Sbjct: 71 ISPIFGQLVASQLREALQKTN---TREIWEFGAGTGALALQILDELAAQGALP---ERYT 124 Query: 115 MVETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGI 174 +V+ S L QK L Y + W +L + G ++ NE D++P++ T Sbjct: 125 IVDLSGTLRARQKLALTRYEHLVRWVDALPEAMEG--VIIGNEVLDAMPVQLLQRTAGQW 182 Query: 175 RERMIDIDQH-DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDG 233 ER + + D F D + + E + + ++ +RL Sbjct: 183 HERGVVLGAGGDEAAFAWEDRATELRPPLDIGGPHDFLTEIHRQGEAFIHTLGERLTR-- 240 Query: 234 GTAIVIDY------GYLQSRVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAI 286 G A IDY Y + R TL H PLV G D+++HV+F + A Sbjct: 241 GAAFFIDYGFGESEYYHEQRHMGTLVCHYQHQVDNDPLVLVGLKDITAHVNFTGTAVAAQ 300 Query: 287 LYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFK 346 + G T Q FL G+ R D + + S + MGELFK Sbjct: 301 DAGFDVLGYTNQAHFLINCGLAPRL----------DAAGIKARSMASKLVMEHEMGELFK 350 Query: 347 ILVVSH--EKVELMPFV 361 ++ +S E + FV Sbjct: 351 VVALSKGVEPWTPLGFV 367 >gi|241959024|ref|XP_002422231.1| conserved hypothetical protein [Candida dubliniensis CD36] gi|223645576|emb|CAX40235.1| conserved hypothetical protein [Candida dubliniensis CD36] Length = 527 Score = 240 bits (611), Expect = 4e-61, Method: Composition-based stats. Identities = 111/426 (26%), Positives = 175/426 (41%), Gaps = 73/426 (17%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFG-AVGDFVTAPEISQIFGE 61 L IK G + + Y C+ PEFGYY+T NP GDF+T+PEIS +FGE Sbjct: 107 ENLSDFFRQTIKLTGPIPLSTYMRQCLTHPEFGYYTTRNPLSLRTGDFITSPEISSVFGE 166 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSI------YM 115 M+ I+ W+Q +P +R +E GPG+G ++ DI++ K + Sbjct: 167 MIGIWYFSIWQQQKYPKSIRFIEFGPGKGTLIHDIMKTFNKFVEKLLPSDQKRPKIEIAL 226 Query: 116 VETSERLTLIQKKQLASYGDK-----------INWYTSLADVPLGF-----------TFL 153 +E S L Q K L + D I+ + + F+ Sbjct: 227 IEASRVLRKEQWKLLCNPQDPMDTTEEGYNRSISKWGNDIIWLDTEKDIKQGDKNVANFI 286 Query: 154 VANEFFDSLPIKQFVMTEHGIRERMIDIDQ---------------------------HDS 186 VA+EFFD+LPIK F+ E G RE +++ Sbjct: 287 VAHEFFDALPIKSFIREEKGWRELVVEHTPSVNNTQPKLEESKSSTRADKVETDNSLDTE 346 Query: 187 LVFNIGDHEIKSNFLT-----CSDYFLGAIFENSPCRDREMQSISDRL--ACDGGTAIVI 239 I E S+ + D +G E P + + ++ L + D G +VI Sbjct: 347 FHLTISPKETPSSMIPQISKRYRDLPVGTRIEICPDAELYIMKMAKLLSDSNDKGAILVI 406 Query: 240 DYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQG 299 DYG ++L+ + H +VSP +PG+ DLS VDFQ L + + G QG Sbjct: 407 DYGTENEIPENSLRGIYQHKFVSPFWSPGEVDLSIDVDFQALKQLTEGI-VNAYGPVKQG 465 Query: 300 KFLEGLGIWQRAFSLMKQTAR----KDILLDSVKRLVSTSADKKSMGELFKILVVSHEK- 354 +L +GI R L+K+ + +D + + +RL D + MG ++K + + + Sbjct: 466 DWLHNIGIGYRIDQLLKKNQQDPNVQDKIYGAYRRLT----DSEQMGSIYKFMALLPKGS 521 Query: 355 VELMPF 360 F Sbjct: 522 NNPPGF 527 >gi|56477341|ref|YP_158930.1| hypothetical protein ebA3370 [Aromatoleum aromaticum EbN1] gi|56313384|emb|CAI08029.1| conserved hypothetical protein [Aromatoleum aromaticum EbN1] Length = 386 Score = 240 bits (611), Expect = 4e-61, Method: Composition-based stats. Identities = 94/378 (24%), Positives = 149/378 (39%), Gaps = 33/378 (8%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIF 59 +L+R I I G + +Y L + P GYYS FG GDF+TAPE++ +F Sbjct: 14 SARLVRSITASIAAAGGWIPFSRYMELALYSPGLGYYSGGARKFGPGGDFITAPELTPLF 73 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVI--CKLKPDFFSVLSIYMVE 117 G+ LA + EQ S ++E G G G++ D+L + + P+ + ++E Sbjct: 74 GQALAAQV----EQVMRASAAHVIEAGAGTGLLAADLLLELERRECLPETYG-----ILE 124 Query: 118 TSERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHG 173 S L Q L ++ W +L + G LVANE D +P+ V G Sbjct: 125 LSGELRERQFDLLAEKAPRLASRVRWLDALPERFSG--ALVANEVLDVMPVHLVVSRPEG 182 Query: 174 IRERMIDIDQHDSLVFNIGDH-----EIKSNFLTCSDYFLGAIFENSPCRDREMQSISDR 228 + ER + +D +L + + + E + + + R Sbjct: 183 LFERGVAVDPAGTLRWADSPASGAVADAARALDLPLPESGEYVTELNLAARAWVGEWAQR 242 Query: 229 LACDGGTAIVIDYG----YLQSRVGDTL-QAVKGHTYVSPLVNPGQADLSSHVDFQRLSS 283 L C + Y YL SR TL + H + P + PG D+++ VDF ++ Sbjct: 243 LDCGVLLLVDYGYPRAEYYLPSRSNGTLLCYYRHHAHADPFLWPGLNDITAFVDFTSVAE 302 Query: 284 IAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGE 343 A L + G T Q FL G+ L ++ R + MGE Sbjct: 303 AAFDAGLDVLGYTNQAAFLFNCGL---LECLARRGPETSADYIRAARAAQRLTTPQEMGE 359 Query: 344 LFKILVVSHE-KVELMPF 360 LFK+L + L+ F Sbjct: 360 LFKVLALGKGIPEPLLGF 377 >gi|68487619|ref|XP_712358.1| hypothetical protein CaO19.6152 [Candida albicans SC5314] gi|46433739|gb|EAK93170.1| conserved hypothetical protein [Candida albicans SC5314] Length = 527 Score = 240 bits (611), Expect = 4e-61, Method: Composition-based stats. Identities = 115/426 (26%), Positives = 181/426 (42%), Gaps = 73/426 (17%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFG-AVGDFVTAPEISQIFGE 61 L IK G + + Y C+ PEFGYY+T +P GDF+T+PEIS +FGE Sbjct: 107 ENLSDFFRQTIKLTGPIPLSTYMRQCLTHPEFGYYTTRDPLNLRTGDFITSPEISSVFGE 166 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSI------YM 115 M+ I+ W+Q +P +R +E GPG+G ++ DI++ K + Sbjct: 167 MIGIWYFSIWQQQKYPESIRFIEFGPGKGTLIHDIMKTFNKFVEKLLPSDQKRPKIEIAL 226 Query: 116 VETSERLTLIQKKQLAS------------------YGDKINWYTSLADVPLG----FTFL 153 +E S L Q K L + +G+ I W + D+ G F+ Sbjct: 227 IEASHVLRKEQWKLLCNPEDPMETTGEGYNRSATKWGNDIIWLDTEKDIQQGDKNVANFI 286 Query: 154 VANEFFDSLPIKQFVMTEHGIRERMIDIDQ---------------------------HDS 186 VA+EFFD+LPIK F+ E G RE +++ Sbjct: 287 VAHEFFDALPIKSFIREEKGWRELVVEHTPSVNNTQPKLEESASARGANKFETDDSLDTE 346 Query: 187 LVFNIGDHEIKSNFLT-----CSDYFLGAIFENSPCRDREMQSISDRL--ACDGGTAIVI 239 I E S+ + D +G EN P + + ++ L + D G +VI Sbjct: 347 FHLTISPKETPSSMIPQISKRYRDLPVGTRIENCPDAELYIMKMAKLLSDSNDKGAILVI 406 Query: 240 DYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQG 299 DYG ++L+ + H +VSP NPG+ DLS VDFQ L + + I G QG Sbjct: 407 DYGTENEIPENSLRGIYQHKFVSPFWNPGEVDLSIDVDFQALKQLTEGI-VDIYGPLKQG 465 Query: 300 KFLEGLGIWQRAFSLMKQTAR----KDILLDSVKRLVSTSADKKSMGELFKILVVSHEKV 355 +L +GI R ++K+ + +D + + +RL D + MG ++K + + + Sbjct: 466 DWLHNIGIGYRIDQVLKKNEQDPEVQDKIYGAYRRLT----DGEQMGSIYKFMALLPKGS 521 Query: 356 E-LMPF 360 F Sbjct: 522 SNPPGF 527 >gi|302035992|ref|YP_003796314.1| hypothetical protein NIDE0616 [Candidatus Nitrospira defluvii] gi|300604056|emb|CBK40388.1| conserved protein of unknown function [Candidatus Nitrospira defluvii] Length = 420 Score = 239 bits (610), Expect = 4e-61, Method: Composition-based stats. Identities = 77/385 (20%), Positives = 141/385 (36%), Gaps = 39/385 (10%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYST------CNPFGAVGDFVTAPEISQ 57 +L+ +I I G + ++ + + P++GYY G GDF T+ ++ Sbjct: 20 QLLAEIRAEIAATGPIPFARFMDVALYHPQYGYYVRPVDDPAKERIGWSGDFYTSSDVHP 79 Query: 58 IFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVE 117 I G+ +A G P +VE+G G+G++ D L + + L ++E Sbjct: 80 ILGQAVARQAQQLDALLGHPDPFTVVEMGAGKGLLARDFLTACRNAPANLGNRLRYILIE 139 Query: 118 TSERLTLIQKKQLASYGDKINWYTSLADVPLGFT-----FLVANEFFDSLPIKQFVMTEH 172 S + Q+ LA + + L + +NE D+ P+ + + + Sbjct: 140 RSAAMRTQQQHNLAPWVGEAGRVAWLDRLDDLPPNSVTGLFFSNELVDAFPVHRLAVVDG 199 Query: 173 GIRERMIDIDQHDSLVFNIGDHEIKSNFLTCS---DYFLGAIFENSPCRDREMQSISDRL 229 +E +D ++ D + + + G E + R M ++ + Sbjct: 200 RPQEIYVD-NRDDRFCEVYRPLSNELSAYLREGGINLPDGYRAEINLDAVRWMTQVAQVM 258 Query: 230 ACDGGTAIVIDYGYL------QSRVGDTLQAVKGHTY-VSPLVNPGQADLSSHVDFQRLS 282 G + IDYG+ R T T G+ D+++HVDF L+ Sbjct: 259 VR--GAVLTIDYGHTAEDLYGPDRKNGTFLCYYHQTTSEDAYDRVGEQDMTAHVDFTTLA 316 Query: 283 SIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMG 342 L + G T Q FL GLG A L++ + + L+ MG Sbjct: 317 QTGRRAGLDVTGFTNQMSFLIGLG----AEQLLESLQPESPEFYAAIHLLR----PDGMG 368 Query: 343 ELFKILVVSH-------EKVELMPF 360 FK+L+ + ++ PF Sbjct: 369 RTFKVLIQHKGMAKPELDGLKFQPF 393 >gi|317036685|ref|XP_001397854.2| hypothetical protein ANI_1_1788144 [Aspergillus niger CBS 513.88] Length = 767 Score = 239 bits (610), Expect = 4e-61, Method: Composition-based stats. Identities = 117/463 (25%), Positives = 181/463 (39%), Gaps = 106/463 (22%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-----NPFGAVGDFVTAPEIS 56 L + + N IK G + + + + +PE GYY+T FG GDFVT+PEIS Sbjct: 301 STPLAQTLANAIKVTGPVPIAAFMRQVLTNPEGGYYTTRPEGHGAVFGKKGDFVTSPEIS 360 Query: 57 QIFGEMLAIFLICAWEQHGFPSC-VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 Q+FGE++ I+ I W G V+L+E+GPG+G +M D+LR K S+ +IY+ Sbjct: 361 QVFGELVGIWTIAEWMAQGRKRSGVQLMEVGPGKGTLMDDMLRTFRNFKMFSSSMEAIYL 420 Query: 116 VETSERLTLIQKKQLAS--------------------YGDKINWYTSLADVPLGFTFLVA 155 VE S L +QKK L + L F+ A Sbjct: 421 VEASATLREVQKKLLCGDAVMEATDIGHKSTCKYFDVPIVWVEDIRLLPHEEEKTPFIFA 480 Query: 156 NEFFDSLPIKQFVMTE------------------------------HGIRERMIDIDQ-- 183 +EFFD+LPI F RE M+ ++ Sbjct: 481 HEFFDALPIHAFESIPPSPENQPEQKEIMTPTGPAKLHQPLKPANTPQWREIMVTLNPKA 540 Query: 184 -----HDSLVFNIG----------DHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDR 228 F + S G+ E SP + R Sbjct: 541 VEENIEGEPEFKLTLAKASTPSSLVIPEISPRYRALKSQPGSTIEVSPESRIYAADFARR 600 Query: 229 L-----------------------ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLV 265 + G A+++DYG L + ++L+ ++ H V PL Sbjct: 601 IGGASEPPRTATKGAAASAPAPAKRVSSGAALIMDYGTLNTIPINSLRGIQEHKNVPPLS 660 Query: 266 NPGQADLSSHVDFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDI 323 +PGQ D+S+ VDF L+ AI + ++G QG FL+ +GI +R L+K+ ++ Sbjct: 661 SPGQVDVSADVDFTALAEAAIEASEGVEVHGPVEQGDFLQAMGIEERMQQLLKKVEDEEK 720 Query: 324 ---LLDSVKRLVSTSADKKSMGELFKILVVSHEKVE---LMPF 360 L KRLV SMG+++K++ + E + F Sbjct: 721 RKTLETGWKRLVEKGG--GSMGKIYKVMAIVPENDGKRRPIGF 761 >gi|134083408|emb|CAK46886.1| unnamed protein product [Aspergillus niger] Length = 512 Score = 239 bits (610), Expect = 4e-61, Method: Composition-based stats. Identities = 117/463 (25%), Positives = 181/463 (39%), Gaps = 106/463 (22%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-----NPFGAVGDFVTAPEIS 56 L + + N IK G + + + + +PE GYY+T FG GDFVT+PEIS Sbjct: 46 STPLAQTLANAIKVTGPVPIAAFMRQVLTNPEGGYYTTRPEGHGAVFGKKGDFVTSPEIS 105 Query: 57 QIFGEMLAIFLICAWEQHGFPSC-VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 Q+FGE++ I+ I W G V+L+E+GPG+G +M D+LR K S+ +IY+ Sbjct: 106 QVFGELVGIWTIAEWMAQGRKRSGVQLMEVGPGKGTLMDDMLRTFRNFKMFSSSMEAIYL 165 Query: 116 VETSERLTLIQKKQLAS--------------------YGDKINWYTSLADVPLGFTFLVA 155 VE S L +QKK L + L F+ A Sbjct: 166 VEASATLREVQKKLLCGDAVMEATDIGHKSTCKYFDVPIVWVEDIRLLPHEEEKTPFIFA 225 Query: 156 NEFFDSLPIKQFVMTE------------------------------HGIRERMIDIDQ-- 183 +EFFD+LPI F RE M+ ++ Sbjct: 226 HEFFDALPIHAFESIPPSPENQPEQKEIMTPTGPAKLHQPLKPANTPQWREIMVTLNPKA 285 Query: 184 -----HDSLVFNIG----------DHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDR 228 F + S G+ E SP + R Sbjct: 286 VEENIEGEPEFKLTLAKASTPSSLVIPEISPRYRALKSQPGSTIEVSPESRIYAADFARR 345 Query: 229 L-----------------------ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLV 265 + G A+++DYG L + ++L+ ++ H V PL Sbjct: 346 IGGASEPPRTATKGAAASAPAPAKRVSSGAALIMDYGTLNTIPINSLRGIQEHKNVPPLS 405 Query: 266 NPGQADLSSHVDFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDI 323 +PGQ D+S+ VDF L+ AI + ++G QG FL+ +GI +R L+K+ ++ Sbjct: 406 SPGQVDVSADVDFTALAEAAIEASEGVEVHGPVEQGDFLQAMGIEERMQQLLKKVEDEEK 465 Query: 324 ---LLDSVKRLVSTSADKKSMGELFKILVVSHEKVE---LMPF 360 L KRLV SMG+++K++ + E + F Sbjct: 466 RKTLETGWKRLVEKGG--GSMGKIYKVMAIVPENDGKRRPIGF 506 >gi|319761315|ref|YP_004125252.1| hypothetical protein Alide_0595 [Alicycliphilus denitrificans BC] gi|317115876|gb|ADU98364.1| protein of unknown function DUF185 [Alicycliphilus denitrificans BC] Length = 361 Score = 239 bits (610), Expect = 4e-61, Method: Composition-based stats. Identities = 93/377 (24%), Positives = 163/377 (43%), Gaps = 42/377 (11%) Query: 1 MENK---LIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGA----VGDFVT 51 M L + I + G + D++ L + P GYY+ G+ DFVT Sbjct: 1 MTTPTDALFQHIRQDLAAAGGWIGFDRFMQLALYTPGLGYYAGGLRKLGSMPEDGSDFVT 60 Query: 52 APEISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVL 111 APE+S +FG++LA + A + G + E G G G + +L + + V Sbjct: 61 APELSPVFGKVLAAQVREALDATG---TDEVWEFGAGSGALAGQLLEALGE------RVR 111 Query: 112 SIYMVETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 +V+ S L Q+++LA++ +++W L G +V NE D++P++ + Sbjct: 112 RYTIVDLSGSLRARQQERLAAHAGRVHWAERLPAAIEG--VVVGNELLDAMPVQLLHRVQ 169 Query: 172 HGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLAC 231 ER + +D +L ++ ++ + + E P + M+++ +RLA Sbjct: 170 GAWHERGVALDAGGALAWSDRPTALRPPVEI--EGGHDYLTEIHPQGEAFMRTLGERLAR 227 Query: 232 DGGTAIVIDY------GYLQSRVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSI 284 G A++IDY Y R T+ + H PL + G+ D+++HV+F ++ Sbjct: 228 --GAALLIDYGFGEDEYYHPQRHMGTVMCHRAHRADDDPLADVGEKDITAHVNFTAMALA 285 Query: 285 AILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGEL 344 A L + G T+Q +FL G+ Q L + + L + + MGEL Sbjct: 286 AQDAGLNVLGYTSQARFLLNCGLLQEMERL----------TLAQRALAAKLMLEHEMGEL 335 Query: 345 FKILVVSH-EKVELMPF 360 FK+L V E M F Sbjct: 336 FKVLAVGPGEPWTPMGF 352 >gi|307731309|ref|YP_003908533.1| hypothetical protein BC1003_3296 [Burkholderia sp. CCGE1003] gi|307585844|gb|ADN59242.1| protein of unknown function DUF185 [Burkholderia sp. CCGE1003] Length = 396 Score = 239 bits (610), Expect = 4e-61, Method: Composition-based stats. Identities = 91/370 (24%), Positives = 148/370 (40%), Gaps = 32/370 (8%) Query: 2 ENKLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGD----FVTAPEI 55 L+ +I ++ G M D+Y + P GYYS FG GD FVTAPE+ Sbjct: 22 SEALVARIRAELQEAGGWMPFDRYMERALYAPGLGYYSGGARKFGLRGDDGSDFVTAPEL 81 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F LA + A + G ++E G G G + +L+ + L F S + Sbjct: 82 SPLFAATLARPVAEALQASG---TREVMEFGAGTGKLAAGVLKALAALGVAFDS---YSI 135 Query: 116 VETSERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 V+ S L Q++ + K+ W +L + G ++ NE D++P++ F Sbjct: 136 VDLSGELRERQRETIEAATPELAAKVRWLDALPERFEG--VVIGNEVLDAMPVRLFAWAN 193 Query: 172 HGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFL---GAIFENSPCRDREMQSISDR 228 ER + + F ++ S+ I E ++I Sbjct: 194 GAWHERG-VVWRDGRFAFEDRPVTAPADLARLSEIDTAGADYIAEMHDAACAFTRTICTM 252 Query: 229 LACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSS 283 LA I + Y R TL H P V PG D+++HV+F ++ Sbjct: 253 LARGAAFFIDYGFPRHEYYHAQRAEGTLMCHYRHRAHGDPFVYPGLQDITAHVEFTGIAE 312 Query: 284 IAILYKLYINGLTTQGKFLEGLGIWQRAFSLM-KQTARKDILLDSVKRLVSTSADKKSMG 342 + + G T+Q +FL GI + TAR ++V++L+S + MG Sbjct: 313 AGVETGADLLGFTSQARFLLNAGITDALSEIDPADTARFLRAANAVQKLLS----EAEMG 368 Query: 343 ELFKILVVSH 352 ELFK++ S Sbjct: 369 ELFKVIAFSR 378 >gi|222109726|ref|YP_002551990.1| hypothetical protein Dtpsy_0510 [Acidovorax ebreus TPSY] gi|221729170|gb|ACM31990.1| protein of unknown function DUF185 [Acidovorax ebreus TPSY] Length = 362 Score = 239 bits (610), Expect = 4e-61, Method: Composition-based stats. Identities = 96/372 (25%), Positives = 147/372 (39%), Gaps = 34/372 (9%) Query: 1 MENKLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTC-NPFGA----VGDFVTAPE 54 + L I I + G + D++ AL + P GYY+ FGA DFVTAPE Sbjct: 4 LTPSLFEHIRQRITAQGGWIGFDRFMALALYTPGLGYYAGDLPKFGAMPASGSDFVTAPE 63 Query: 55 ISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIY 114 +S +FG +LA + A + + + E G G L V Sbjct: 64 LSPVFGSVLARQVREALDAT---ATDEVWEFGAGS------GALAAQLLGALGDRVRRYT 114 Query: 115 MVETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGI 174 +V+ S L Q+ LA++GD+++W L D G +V NE D++P++ Sbjct: 115 IVDLSGSLRARQQATLAAWGDRVHWVDRLPDQIQG--VVVGNEVLDAMPVQLLQRRAGLW 172 Query: 175 RERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGG 234 ER + +D+ + F D D + E P + M++++ L Sbjct: 173 HERGVALDESGNG-FVWQDRPTPLRPPVEIDGPHDYLTEIHPQGEAFMRTLAQHLVRGAA 231 Query: 235 TAIVI----DYGYLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSIAILYK 289 I D Y R TL +GH PL + G D+++HV+F ++ A Sbjct: 232 FLIDYGFGEDEYYHPQRHMGTLVCHRGHQVDSDPLADVGLKDITAHVNFTAMALAAQDAG 291 Query: 290 LYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILV 349 L + G TTQ FL G+ L + + K + MGELFK+L Sbjct: 292 LQVLGYTTQAHFLINCGLLSELERL-----TQAQRAQAAK-----LMMEHEMGELFKVLA 341 Query: 350 VSH-EKVELMPF 360 V E M F Sbjct: 342 VGAGEPWRPMGF 353 >gi|264680402|ref|YP_003280312.1| hypothetical protein CtCNB1_4270 [Comamonas testosteroni CNB-2] gi|262210918|gb|ACY35016.1| hypothetical conserved protein [Comamonas testosteroni CNB-2] Length = 372 Score = 239 bits (610), Expect = 5e-61, Method: Composition-based stats. Identities = 96/377 (25%), Positives = 149/377 (39%), Gaps = 36/377 (9%) Query: 1 MENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGA----VGDFVTAPE 54 + + L +I I G + D++ L + P GYY+ FGA DFVTAPE Sbjct: 8 LTSALQSRIAKEIAAVGGWLPFDRFMELALYAPGLGYYANETAKFGAMPESGSDFVTAPE 67 Query: 55 ISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIY 114 IS IFG+++A L A ++ + E G G G + L IL + Sbjct: 68 ISPIFGQLVASQLREALQKTN---TREIWEFGAGTGALALQILDELAAQGALP---ERYT 121 Query: 115 MVETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGI 174 +V+ S L QK LA Y + W +L + G ++ NE D++P++ T Sbjct: 122 IVDLSGTLRARQKLALAKYEHLVRWVDALPEAMEG--VIIGNEVLDAMPVQLLQRTRGQW 179 Query: 175 RERMIDIDQH-DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDG 233 ER + + D F D + + E + + ++ +RL Sbjct: 180 HERGVVLGAGGDEAAFAWEDRPTELRPPVDIGGPHDFLTEIHRQGEAFIHTLGERLVR-- 237 Query: 234 GTAIVIDY------GYLQSRVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAI 286 G A IDY Y + R TL H PLV G D+++HV+F + A Sbjct: 238 GAAFFIDYGFGESEYYHEQRHMGTLVCHYQHQVDNDPLVLVGLKDITAHVNFTGTAVAAQ 297 Query: 287 LYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFK 346 + G T Q FL G+ + D + + + S + MGELFK Sbjct: 298 DAGFDVLGYTNQAHFLINCGL----------APKLDAVGIKARSMASKLVMEHEMGELFK 347 Query: 347 ILVVSH--EKVELMPFV 361 ++ +S E + FV Sbjct: 348 VVALSKGVEPWTPLGFV 364 >gi|330818521|ref|YP_004362226.1| hypothetical protein bgla_1g36670 [Burkholderia gladioli BSR3] gi|327370914|gb|AEA62270.1| hypothetical protein bgla_1g36670 [Burkholderia gladioli BSR3] Length = 400 Score = 239 bits (610), Expect = 5e-61, Method: Composition-based stats. Identities = 95/375 (25%), Positives = 147/375 (39%), Gaps = 38/375 (10%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGD----FVTAPEI 55 L + I G + ++ + P GYYS FG GD FVTAPE+ Sbjct: 22 SETLSASLRAEIAASGGWVPFSRFMERALYAPGLGYYSGGARKFGRRGDDGSDFVTAPEL 81 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRG--IMMLDILRVICKLKPDFFSVLSI 113 S +F + LA L A E G RL+E G G G L P+ + Sbjct: 82 SPLFAQTLARPLAEALEASG---TRRLMEFGAGTGKLAAGLLAALDALGAPPERYE---- 134 Query: 114 YMVETSERLTLIQKKQLA-----SYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFV 168 +VE S L Q+ LA +++W +L + G +V NE D++P++ + Sbjct: 135 -IVELSGELRERQRATLAAELPAPLASRVHWLDALPERFEG--VVVGNEVLDAMPVRLVL 191 Query: 169 MTEHGIRERMIDIDQHDSLVFNIGDHEIK-----SNFLTCSDYFLGAIFENSPCRDREMQ 223 E G RER + +D + VF L + G + E Sbjct: 192 RGETGWRERGVAVDAARAFVFEDRPLPESAGESALAALDALELPEGYLTEIHEAARAFTG 251 Query: 224 SISDRLACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDF 278 ++ LA I + Y R TL H P V PG D+++HV+F Sbjct: 252 TVCRMLARGAAFFIDYGFPAGEYYHPQRAEGTLMCHYRHRAHGDPFVWPGLQDITAHVEF 311 Query: 279 QRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLM-KQTARKDILLDSVKRLVSTSAD 337 + A+ + G T+QG+FL GI + + AR ++V++L++ Sbjct: 312 SGIHEAAVAAGAELLGYTSQGRFLLNAGITEVLADIDPADGARFLPAANAVQKLIA---- 367 Query: 338 KKSMGELFKILVVSH 352 + MGELFK++ Sbjct: 368 ESEMGELFKVIAFGR 382 >gi|255068187|ref|ZP_05320042.1| putative peptidoglycan synthetase FtsI [Neisseria sicca ATCC 29256] gi|255047529|gb|EET42993.1| putative peptidoglycan synthetase FtsI [Neisseria sicca ATCC 29256] Length = 385 Score = 239 bits (610), Expect = 5e-61, Method: Composition-based stats. Identities = 81/368 (22%), Positives = 146/368 (39%), Gaps = 31/368 (8%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIF 59 +KL I IK N + ++ L + PE+GYY+ + G GDF+TAP ++ +F Sbjct: 16 SSKLFEIIKQEIKAQNNWIPFSRFMELALYTPEYGYYTGGSHKIGTDGDFITAPTLTPLF 75 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 G+ LA L Q + E G G G + +L+ + + Y++E S Sbjct: 76 GQTLARQLTELLPQT----AGNIYEFGAGTGHLAATLLKSLSD------DLKHYYIIELS 125 Query: 120 ERLTLIQKKQLASYGDK-----INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGI 174 L Q++ +A + + T L + G ++ NE D++P++ T++ Sbjct: 126 SELAERQRQFIAEHTTPQLAQKVIHLTKLPESFDG--IIIGNEVLDAMPVETIRRTQNTF 183 Query: 175 RERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG----AIFENSPCRDREMQSISDRLA 230 + + ++ L + + + + YF E P + + +++ ++ Sbjct: 184 QHIGVSVNPDGQLEQSPQPLKQPNLLRLAATYFPETEHPYTSELHPAQYAFILTLAQKIR 243 Query: 231 CDGGTAI-----VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIA 285 G I Y + Q G + + HT P + G DL++HV+F ++ Sbjct: 244 RGGMIFIDYGFDAAQYYHPQRDEGTLIAHYRHHTVHDPFFHIGLTDLTAHVNFTDIAQAG 303 Query: 286 ILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELF 345 L + G +Q FL LGI L + V V D+ MGELF Sbjct: 304 TDGGLDLIGYLSQSHFLFNLGITDL---LAQTAPPGTADYLRVSTAVQKLTDQHEMGELF 360 Query: 346 KILVVSHE 353 K++ Sbjct: 361 KVIAFGKN 368 >gi|68487546|ref|XP_712394.1| hypothetical protein CaO19.13571 [Candida albicans SC5314] gi|46433778|gb|EAK93208.1| conserved hypothetical protein [Candida albicans SC5314] Length = 527 Score = 239 bits (610), Expect = 5e-61, Method: Composition-based stats. Identities = 113/426 (26%), Positives = 180/426 (42%), Gaps = 73/426 (17%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFG-AVGDFVTAPEISQIFGE 61 L IK G + + Y C+ PEFGYY+T +P GDF+T+PEIS +FGE Sbjct: 107 ENLSDFFRQTIKLTGPIPLSTYMRQCLTHPEFGYYTTRDPLNLRTGDFITSPEISSVFGE 166 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSI------YM 115 M+ I+ W+Q +P +R +E GPG+G ++ DI++ K + Sbjct: 167 MIGIWYFSIWQQQKYPESIRFIEFGPGKGTLIHDIMKTFNKFVEKLLPSDQKRPKIEIAL 226 Query: 116 VETSERLTLIQKKQLAS------------------YGDKINWYTSLADVPLGF----TFL 153 +E S L Q K L + + + I W+ + D+ G F+ Sbjct: 227 IEASHVLRKEQWKLLCNPEDPMETTGEGYNRSATKWNNDIIWFDTEKDIQQGDKNVANFI 286 Query: 154 VANEFFDSLPIKQFVMTEHGIRERMIDIDQ---------------------------HDS 186 VA+EFFD+LPIK F+ E G RE +++ Sbjct: 287 VAHEFFDALPIKSFIREEKGWRELVVEHTPSVNNTQPKLEESTSARGANKFETDDSLDTE 346 Query: 187 LVFNIGDHEIKSNFLT-----CSDYFLGAIFENSPCRDREMQSISDRL--ACDGGTAIVI 239 I E S+ + D +G E P + + ++ L + D G +VI Sbjct: 347 FHLTIAPKETPSSMIPQISKRYRDLPVGTRIEICPDAELYIMKMAKLLSDSNDKGAILVI 406 Query: 240 DYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQG 299 DYG ++L+ + H +VSP NPG+ DLS VDFQ L + + I G QG Sbjct: 407 DYGTENEIPENSLRGIYQHKFVSPFWNPGEVDLSIDVDFQALKQLTEGI-VDIYGPLKQG 465 Query: 300 KFLEGLGIWQRAFSLMKQTAR----KDILLDSVKRLVSTSADKKSMGELFKILVVSHEKV 355 +L +GI R ++K+ + +D + + +RL D + MG ++K + + + Sbjct: 466 DWLHNIGIGYRIDQVLKKNEQDPEVQDKIYGAYRRLT----DGEQMGSIYKFMALLPKGS 521 Query: 356 E-LMPF 360 F Sbjct: 522 SNPPGF 527 >gi|162146752|ref|YP_001601211.1| hypothetical protein GDI_0930 [Gluconacetobacter diazotrophicus PAl 5] gi|161785327|emb|CAP54873.1| conserved hypothetical protein [Gluconacetobacter diazotrophicus PAl 5] Length = 339 Score = 239 bits (610), Expect = 5e-61, Method: Composition-based stats. Identities = 114/342 (33%), Positives = 168/342 (49%), Gaps = 19/342 (5%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCV 80 +D++ A YY+ +PF DF+TAPEISQ+FGE+L ++ W+ G Sbjct: 9 LDRFMARA----NAAYYAGRDPF---ADFITAPEISQMFGEILGAWVAVTWQGMGRRVPF 61 Query: 81 RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYG-DKINW 139 LVE GPGRG +M D++R++ ++ PD +++VE S RL +Q+ LA + W Sbjct: 62 ALVEAGPGRGTLMADMMRLLARVAPDCHDAARVHLVELSPRLRDVQQAALAGRTAHPVTW 121 Query: 140 YTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSN 199 + + DVP G L+ANEF D+L I+QFV T G ER + F Sbjct: 122 HDRIEDVPEGAVILLANEFLDALAIRQFVRTADGWAERFV-----QGPAFVTQPASDLPP 176 Query: 200 FLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHT 259 G I E P + ++ RL GTA+ +DYGY + GDTLQA++ Sbjct: 177 GPFDRSVPCGEILECCPDALAVARHVAARLCRAPGTALFVDYGYDGAVWGDTLQALRDGQ 236 Query: 260 YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQ-- 317 PL +PG ADL++HVDF ++ +G TQG L LG++ RA L + Sbjct: 237 PAWPLADPGLADLTAHVDFAAFAAAVRDGGAVCHGSVTQGALLGALGLFARAEQLARNRA 296 Query: 318 TARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMP 359 + D+ +RL A MG LFK L ++ + + P Sbjct: 297 PGEAYAIRDAAQRL----AAPDRMGRLFKALAITSPGLPVPP 334 >gi|221109290|ref|XP_002159876.1| PREDICTED: similar to CG17726 CG17726-PA, partial [Hydra magnipapillata] Length = 700 Score = 239 bits (609), Expect = 6e-61, Method: Composition-based stats. Identities = 118/385 (30%), Positives = 177/385 (45%), Gaps = 49/385 (12%) Query: 17 GQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 G MTV Y + +P++GYY + FGA GDF T+PEISQ+FGE++ I+ + W Q G Sbjct: 320 GPMTVANYMKEALTNPKWGYYMKNDVFGAKGDFTTSPEISQMFGELIGIWFVAQWIQIGK 379 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGD- 135 P V+LVELGPGRG +M DILRV+ + P+ S + VE SE++ +QK+ L + Sbjct: 380 PCGVQLVELGPGRGTLMADILRVMKQF-PETLSNFEVNFVEVSEKMISLQKQNLDISHEK 438 Query: 136 ----------KINWYTSLADVPLGFTFLVANEFFDSLPIKQFVM---------------- 169 K++W+T + DVP G TF +A+EFFD+LP+ F + Sbjct: 439 KDFYITPSGTKVSWFTHVQDVPKGLTFYLAHEFFDALPVHLFKLLDLVLSPGLTNIRNVN 498 Query: 170 -------TEHGIRERMI-----DIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPC 217 + + L + E P Sbjct: 499 PFAQIPPDTMMDFGKFCLFIIGYCQSENELQLVTAPGPSPVAKTFLNSETNADECEVCPE 558 Query: 218 RDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVS-PLVNPGQADLSSHV 276 M IS+ ++ GG A++IDYG S+ +L+ K H V NPG D++++V Sbjct: 559 AAVVMSYISENISMYGGCAMIIDYGESDSQ-RFSLRGYKNHVLVDNIFKNPGSCDITANV 617 Query: 277 DFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLM--KQTARKDILLDSVKRLVST 334 DF L + G TQ FL +GI QR L+ + + L+ S+ L+S Sbjct: 618 DFGFLKRCIRGE-VRHYGPITQRSFLLQMGIQQRLKVLVSTATSEQARNLISSLDYLIS- 675 Query: 335 SADKKSMGELFKILVVSHEKVELMP 359 + MGE FK V+++ + P Sbjct: 676 ---PEKMGEKFKCFVLTNLSSPVPP 697 >gi|186685066|ref|YP_001868262.1| hypothetical protein Npun_F4975 [Nostoc punctiforme PCC 73102] gi|186467518|gb|ACC83319.1| protein of unknown function DUF185 [Nostoc punctiforme PCC 73102] Length = 395 Score = 239 bits (609), Expect = 6e-61, Method: Composition-based stats. Identities = 101/380 (26%), Positives = 155/380 (40%), Gaps = 31/380 (8%) Query: 1 MENK--LIRKIVNLIKKN--GQMTVDQYFALCVADPEFGYYST-CNPFG-AVGDFVTAPE 54 M++ L I N I ++T ++ L + PE+GYYS+ G DF T+P Sbjct: 1 MDSNPALCAAIANHITNTPQQRITFAEFMDLALYHPEYGYYSSDAVKIGFKDSDFFTSPN 60 Query: 55 ISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIY 114 + FGE+LA + WE G P LVE+G G+G++ L IL+ PDFF+ L Sbjct: 61 LCSDFGELLAEQFLQMWEILGKPVPFSLVEMGAGQGLLALHILKYHQLHYPDFFTALEYI 120 Query: 115 MVETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGI 174 +VE S L Q+++L + + + + F +NE D+ P+ QF++ + Sbjct: 121 IVEKSPILRQEQQQRLQDFPVRWCNLEEIPPNAIAGCF-FSNELVDAFPVHQFILETGEL 179 Query: 175 RERMIDIDQHDSLVFNIGDH-----------------EIKSNFLTCSDYFLGAIFENSPC 217 RE + D+++ ++ T S Y G E + Sbjct: 180 REIYVTTDKNEKETNAPYPSFAEVIGEPSTPQLAEYLDLVEMNFTQSAYPDGYRSEINLA 239 Query: 218 RDREMQSISDRLACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYVS-PLVNPGQADL 272 + ++DRL I Y Y R TLQ H + P +N G+ D+ Sbjct: 240 ALDWLSIVADRLQRGYVLTIDYGYPASRYYNPRRSQGTLQCYYHHRFHDNPYINIGRQDI 299 Query: 273 SSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLV 332 ++HVDF L L G QG FL LG+ R +L Q LL L Sbjct: 300 TAHVDFTALERWGQKCNLKNVGFIQQGLFLMALGLGDRIAALSYQKQPLLELLQRRDAL- 358 Query: 333 STSADKKSMGELFKILVVSH 352 D +G F +L+ S Sbjct: 359 HQLIDPTGLGG-FGVLIQSK 377 >gi|330823189|ref|YP_004386492.1| hypothetical protein Alide2_0560 [Alicycliphilus denitrificans K601] gi|329308561|gb|AEB82976.1| protein of unknown function DUF185 [Alicycliphilus denitrificans K601] Length = 361 Score = 239 bits (609), Expect = 6e-61, Method: Composition-based stats. Identities = 92/377 (24%), Positives = 163/377 (43%), Gaps = 42/377 (11%) Query: 1 MENK---LIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGA----VGDFVT 51 M L + I + G + D++ L + P GYY+ G+ DFVT Sbjct: 1 MTTPTDALFQHIRQDLAAAGGWIGFDRFMQLALYTPGLGYYAGGLRKLGSMPEDGSDFVT 60 Query: 52 APEISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVL 111 APE+S +FG++LA + A + G + E G G G + +L + + V Sbjct: 61 APELSPVFGKVLAAQVREALDATG---TDEVWEFGAGSGALAGQLLEALGE------RVR 111 Query: 112 SIYMVETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 +V+ S L Q+++LA++ +++W L G +V NE D++P++ + Sbjct: 112 RYTIVDLSGSLRARQQERLAAHAGRVHWAERLPAAIEG--VVVGNELLDAMPVQLLHRVQ 169 Query: 172 HGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLAC 231 ER + +D +L ++ ++ + + E P + M+++ +RLA Sbjct: 170 GAWHERGVALDAGGALAWSDRPTALRPPVEI--EGGHDYLTEIHPQGEAFMRTLGERLAR 227 Query: 232 DGGTAIVIDY------GYLQSRVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSI 284 G A++IDY Y R T+ + H PL + G+ D+++HV+F ++ Sbjct: 228 --GAALLIDYGFGEDEYYHPQRHMGTVMCHRAHRADDDPLADVGEKDITAHVNFTAMALA 285 Query: 285 AILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGEL 344 A L + G T+Q +FL G+ Q L + + L + + MGEL Sbjct: 286 AQDAGLNVLGYTSQARFLLNCGLLQEMERL----------TLAQRALAAKLMLEHEMGEL 335 Query: 345 FKILVVSH-EKVELMPF 360 FK+L V E + F Sbjct: 336 FKVLAVGPGEPWTPVGF 352 >gi|148284461|ref|YP_001248551.1| hypothetical protein OTBS_0811 [Orientia tsutsugamushi str. Boryong] gi|146739900|emb|CAM79877.1| conserved hypothetical protein [Orientia tsutsugamushi str. Boryong] Length = 386 Score = 239 bits (609), Expect = 7e-61, Method: Composition-based stats. Identities = 125/379 (32%), Positives = 194/379 (51%), Gaps = 19/379 (5%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 M ++ + I +I+ +TV+ ++ + YY P G GDF+TAPEISQ+FG Sbjct: 1 MLMEIEQHIRQIIRSENNITVENLMSIVMESRYNSYYRIQQPLGKAGDFITAPEISQMFG 60 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 EM+ I+ I W + P + L+ELGPG G ++ DIL +K ++ S+ +VE + Sbjct: 61 EMIGIWCIDLWHKLNCPQKIDLIELGPGNGTLLHDILNATRHIKKFSTAI-SLILVEINC 119 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLG-FTFLVANEFFDSLPIKQFVMT------EHG 173 L IQ+ L S+ I W S+ + T ++ANEFFD+LPIKQ++ + Sbjct: 120 TLKKIQRDTLLSFNVPIKWVKSVNHIVSSYPTIILANEFFDALPIKQYIKKINQQSGQIN 179 Query: 174 IRERMIDIDQHDSLVFNIGDHEIKSNFLTC--SDYFLGAIFENSPCRDREMQSISDRLAC 231 ER++ ID ++ L F+ D +I N ++ G + E SP + + +Q++S+ L Sbjct: 180 WLERVVKIDNNNKLYFDTIDADINENKFLKLHNNAPNGGVLEISPAQHQTIQAVSNLLKK 239 Query: 232 DGGTAIVIDYGYLQSR-------VGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSI 284 +GG ++IDYGY S +LQAVK H Y L N G ADLS+HVDF L +I Sbjct: 240 NGGGGLIIDYGYDISPEQRKNYQYNSSLQAVKHHQYHPLLENLGCADLSAHVDFWSLKNI 299 Query: 285 AILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGEL 344 A+ + G +Q L LGI R L KQ + L + + ++MGEL Sbjct: 300 AVAEGIVSFGSISQNVLLHKLGIKTRLNML-KQINSDENLASKLDLQYNRLTSVRAMGEL 358 Query: 345 FKILVVSH-EKVELMPFVN 362 FK + ++ + + F N Sbjct: 359 FKAIAITSAPSIIPLGFYN 377 >gi|238022307|ref|ZP_04602733.1| hypothetical protein GCWU000324_02214 [Kingella oralis ATCC 51147] gi|237866921|gb|EEP67963.1| hypothetical protein GCWU000324_02214 [Kingella oralis ATCC 51147] Length = 390 Score = 238 bits (608), Expect = 7e-61, Method: Composition-based stats. Identities = 85/377 (22%), Positives = 146/377 (38%), Gaps = 30/377 (7%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFG 60 + L + I I + G + ++ L + P++GYY+ + GA GDFVTAP +S +FG Sbjct: 17 SDALTQIIAAEIARQGAIPFSRFMELALYAPQYGYYTGGAHKIGAAGDFVTAPTLSPLFG 76 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 + LA L Q + E G G G + +L + + + Y+VE S Sbjct: 77 QTLARQLTPLLPQT----AGNVYEFGAGTGALAATLLDALSGC------LNTYYIVELSP 126 Query: 121 RLTLIQKKQLASY-GDKINWYTSLADVPLGFT-FLVANEFFDSLPIKQFVMTEHGIRERM 178 L Q++ + + D + L +P F ++ NE D++P ++ E+G R+ Sbjct: 127 ELAARQREYIQQHAPDLAHKVKHLTALPCSFDGIIIGNEVLDAMPSERVCRHENGDFSRV 186 Query: 179 IDIDQHDSLVFNIGDHEIKSNFLTCSDYFL---------GAIFENSPCRDREMQSISDRL 229 + F +YF E P + +++++ +L Sbjct: 187 CVGYEAGRFALRYQSLLDPDLFQAALNYFPEPECLQHAGDYTSELHPTQHAFIRTLAQKL 246 Query: 230 ACDGGTAIVIDY----GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSI 284 + + Y R TL + HT P G DL+ HV+F ++ Sbjct: 247 TRGAMIWLDYGFDAAQYYHPERSDGTLIGHHRHHTIHDPFYRVGLTDLTVHVNFSDIAEA 306 Query: 285 AILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGEL 344 L + G T Q FL LGI + + + V+T + MGEL Sbjct: 307 GCAAGLDLIGYTNQATFLYNLGILDLL--VAQFPETDTPSYFQAAQAVNTLTAQHEMGEL 364 Query: 345 FKILVVSHE-KVELMPF 360 FK++ +V+ F Sbjct: 365 FKVMAFGRGVEVDWPGF 381 >gi|34499588|ref|NP_903803.1| hypothetical protein CV_4133 [Chromobacterium violaceum ATCC 12472] gi|34105439|gb|AAQ61794.1| conserved hypothetical protein [Chromobacterium violaceum ATCC 12472] Length = 408 Score = 238 bits (608), Expect = 8e-61, Method: Composition-based stats. Identities = 86/375 (22%), Positives = 144/375 (38%), Gaps = 29/375 (7%) Query: 2 ENKLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIF 59 L +I I G + Y L + P GYY+ G GDFVTAPE++ +F Sbjct: 38 SQALCGRISQAIAEAGGWIPFSHYMELALYAPGMGYYAAGSRKLGEDGDFVTAPELTPLF 97 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 G+ LA L Q + E G G G + +D+L + L +++ S Sbjct: 98 GQSLARQLAELLPQT----AGAVYEFGAGTGKLAIDLLAELKALNQLPAK---YVILDLS 150 Query: 120 ERLTLIQKKQ----LASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIR 175 L Q++ L D++ W +SL D G ++ NE D++P + E Sbjct: 151 PDLIDRQRQNIRAALPHLADRVEWASSLPDAIDG--IVIGNELLDAMPCEMVFRDEDRRL 208 Query: 176 ERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG---AIFENSPCRDREMQSISDRLACD 232 + + +L + D + E S + +++ L Sbjct: 209 RQRGVTVRDGALCYEDRDFADSRLAELAAQLLPDIPRYETEISLANRAFIATVAGALTRG 268 Query: 233 GGTAI-----VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAIL 287 I +Y + + +G L + HT P PG DL+ HVDF ++ I Sbjct: 269 AILMIDYGHAASEYYHPERSMGTLLGHYRHHTVQDPFYLPGLMDLTCHVDFTAVAQSGID 328 Query: 288 YKLYINGLTTQGKFLEGLGIWQRAFSL-MKQTARKDILLDSVKRLVSTSADKKSMGELFK 346 L + G T+Q +FL G+ +R +L + + +RL+S + MGE FK Sbjct: 329 AGLDLIGYTSQAQFLVNCGLIERLQTLDADDVKAYLPIAKAAQRLLS----PQEMGETFK 384 Query: 347 ILVVSHE-KVELMPF 360 + ++ + F Sbjct: 385 AIGFGKGVSIDWLGF 399 >gi|67539742|ref|XP_663645.1| hypothetical protein AN6041.2 [Aspergillus nidulans FGSC A4] gi|40738826|gb|EAA58016.1| hypothetical protein AN6041.2 [Aspergillus nidulans FGSC A4] gi|259479775|tpe|CBF70305.1| TPA: DUF185 domain protein (AFU_orthologue; AFUA_2G09740) [Aspergillus nidulans FGSC A4] Length = 504 Score = 238 bits (608), Expect = 9e-61, Method: Composition-based stats. Identities = 116/454 (25%), Positives = 181/454 (39%), Gaps = 103/454 (22%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-----NPFGAVGDFVTAPEIS 56 L R + N+IK G + + + + PE GYY+T FG GDFVT+PEIS Sbjct: 38 STPLARTLANVIKTTGPVPIAAFMRQVLTSPEGGYYTTKPGGGGEVFGKKGDFVTSPEIS 97 Query: 57 QIFGEMLAIFLICAWEQH-GFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 Q+FGE++ I+ I W G S V+L+E+GPG+G +M D+LR KP S+ +IY+ Sbjct: 98 QVFGELVGIWTIAEWMAQGGKKSGVQLMEIGPGKGTLMDDMLRTFRNFKPFTSSLEAIYL 157 Query: 116 VETSERLTLIQKKQLASY--------------------GDKINWYTSLADVPLGFTFLVA 155 VE S L +QK+ L + L F+ A Sbjct: 158 VEASPTLREVQKQLLCGNAVMEETDIGHRCTSKYFNVPVIWVEDIRLLPHEEDKTPFIFA 217 Query: 156 NEFFDSLPIKQFVMTE------------------------------HGIRERMIDIDQ-- 183 +EFFD+LPI F RE M+ ++ Sbjct: 218 HEFFDALPIHAFESVPPSPENEQQEQEIMTPTGRTKLQRPPKAANTPQWRELMVTLNPKA 277 Query: 184 -----HDSLVFNIG----------DHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDR 228 D F + S G+ E SP I+ R Sbjct: 278 VDENIKDEPEFKLTLAKASTPSSLVIPEISERYRALKSQPGSTIEVSPESRIYASDIARR 337 Query: 229 L-----------------------ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLV 265 + G A+++DYG + + ++L+ ++ H V L Sbjct: 338 IGGSSQPPRTAAGRNASAPSAIAKRIPSGAALIMDYGTMSTVPINSLRGIQNHKIVPALS 397 Query: 266 NPGQADLSSHVDFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLM---KQTAR 320 +PG+ D+S+ VDF L+ A+ + ++G QG FL+ +GI +R L+ K + Sbjct: 398 SPGRVDVSADVDFTSLAEAALEASEGVEVHGPVEQGHFLQAMGIAERMQQLLSTVKDEKK 457 Query: 321 KDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 + IL +RLV MG+L+K++ + E Sbjct: 458 RKILETGWQRLVERGG--GGMGKLYKVMTIIPEN 489 >gi|107023909|ref|YP_622236.1| hypothetical protein Bcen_2363 [Burkholderia cenocepacia AU 1054] gi|116690995|ref|YP_836618.1| hypothetical protein Bcen2424_2977 [Burkholderia cenocepacia HI2424] gi|105894098|gb|ABF77263.1| protein of unknown function DUF185 [Burkholderia cenocepacia AU 1054] gi|116649084|gb|ABK09725.1| protein of unknown function DUF185 [Burkholderia cenocepacia HI2424] Length = 396 Score = 238 bits (608), Expect = 9e-61, Method: Composition-based stats. Identities = 91/371 (24%), Positives = 156/371 (42%), Gaps = 34/371 (9%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGAV----GDFVTAPEI 55 L ++ + I G ++ D++ + P GYYS FG DFVTAPE+ Sbjct: 22 SETLAAQLRDEIAAAGGWLSFDRFMERALYAPGLGYYSGGARKFGRRADDGSDFVTAPEL 81 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F + LA + A G R++E G G G + +L + L + L + Sbjct: 82 SPLFAQTLAQPVAEALAASG---TRRVMEFGAGTGKLAAGLLAALDALGAELDEYL---I 135 Query: 116 VETSERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 V+ S L Q+ + + K+ W +L + G +V NE D++P++ F + Sbjct: 136 VDLSGELRERQRDTIEAAVPALAAKVRWLDALPERFDG--VVVGNEVLDAMPVRLFAKAD 193 Query: 172 HGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYF--LGAIFENSPCRDREMQSISDRL 229 RER + +D + VF+ + G + E +++ L Sbjct: 194 GAWRERGVALDARHAFVFDDRPVGAAGLPAVLAPLDVGDGYVTETHEAALAFTRTVCTML 253 Query: 230 ACDGGTAIVIDY------GYLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHVDFQRLS 282 G ++IDY Y R T + + H + P V PG DL++HV+F + Sbjct: 254 GR--GAVLLIDYGFPAHEYYHPQRDRGTLMCHYRHHAHDDPFVYPGLQDLTAHVEFTGIY 311 Query: 283 SIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARK-DILLDSVKRLVSTSADKKSM 341 A+ + + G T+Q +FL GI ++ R ++V++L+S + M Sbjct: 312 EAAVATGVDLLGYTSQARFLLNAGITDALAAIDPSDVRAFLPAANAVQKLIS----EAEM 367 Query: 342 GELFKILVVSH 352 GELFK++ S Sbjct: 368 GELFKVIAFSR 378 >gi|253700046|ref|YP_003021235.1| hypothetical protein GM21_1418 [Geobacter sp. M21] gi|251774896|gb|ACT17477.1| protein of unknown function DUF185 [Geobacter sp. M21] Length = 385 Score = 238 bits (607), Expect = 9e-61, Method: Composition-based stats. Identities = 87/369 (23%), Positives = 152/369 (41%), Gaps = 24/369 (6%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFG 60 KL I+N I+ +G +T + + +P+ GYY++ GA GDF T+ + FG Sbjct: 6 TTKLAEIILNRIRTSGDITFASFMDAALYEPDLGYYTSAGRKVGAEGDFYTSMNVHSAFG 65 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 ++A + WE P+ + E G G G + DIL I + P F+S L+ ++E Sbjct: 66 RLIAQEICRFWEVLDSPASFTIAEAGAGGGQLAQDILDAISEDNPAFYSGLTYRLIEKEP 125 Query: 121 RLTLIQKKQLASYGDKINWYTSLA---DVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 L Q +L+ + D++ W + +++NE FD++P+ +TE G+RE Sbjct: 126 SLQQAQAARLSRHADRLAWSSPDELAAGTLSFTGCIISNELFDAMPVHIVELTEAGLREV 185 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCSDYF----LGAIFENSPCRDREMQSISDRLACDG 233 + + V + Y G E + + + L Sbjct: 186 YVS-ADDNGFVERLLPPSTPELEQYLRKYEVRLLPGQRAEINLAASGWIAQAAATLTR-- 242 Query: 234 GTAIVIDYG------YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSIAI 286 G + IDYG Y R TL H+ +P G+ D+++H++F +L Sbjct: 243 GFVLTIDYGYLSGELYTPQRKNGTLLCYYKHSTNENPYQLVGEQDITTHINFSQLIVDGE 302 Query: 287 LYKLYINGLTTQGKFLEGLGIWQ---RAFSLMKQTARKDILLDSVKRLVSTSADKKSMGE 343 L Q +FL G+ + R + K ++K+L+ + MG+ Sbjct: 303 ESGLKKAWYGEQYRFLLAAGLMEELIRLEAQAKDEQESLKHRLALKKLMLP---EGGMGD 359 Query: 344 LFKILVVSH 352 FK+L+ S Sbjct: 360 TFKVLIQSK 368 >gi|17230849|ref|NP_487397.1| hypothetical protein alr3357 [Nostoc sp. PCC 7120] gi|17132452|dbj|BAB75056.1| alr3357 [Nostoc sp. PCC 7120] Length = 404 Score = 238 bits (607), Expect = 9e-61, Method: Composition-based stats. Identities = 95/382 (24%), Positives = 154/382 (40%), Gaps = 35/382 (9%) Query: 3 NKLIRKIVNLIKKNG--QMTVDQYFALCVADPEFGYYSTCNP-FG-AVGDFVTAPEISQI 58 + L + I + I + ++T +Y + + PE GYYS+ G GDF T+ + Sbjct: 5 SALQKAIAHRIATSPKARITFAEYMDMALYHPEHGYYSSNAVNIGFKGGDFFTSVNLGAD 64 Query: 59 FGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVET 118 G++LA + W G P+ LVE+G G+G++ L IL+ I P+ + L +VE Sbjct: 65 LGDLLAEQFVQMWGIFGQPTPFYLVEMGAGQGLLALHILKYIQVQYPNLYKALKYLIVEK 124 Query: 119 SERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERM 178 S L Q+++L + + + ++ + F +NE D+LP+ QF++ +RE Sbjct: 125 SPGLKQEQQERLQGFPVRWCSWEEISPNSITGCF-FSNELVDALPVHQFILEGGELREIY 183 Query: 179 IDIDQHDSLVFNIGDHEIKSNFLTC-----------------------SDYFLGAIFENS 215 + + + + LT Y G E + Sbjct: 184 LTMQEDQEAQEAKNLSPSPNYELTEVAAAPSTPKLAEYFDLIGINLAQGGYEDGYRSEIN 243 Query: 216 PCRDREMQSISDRLACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQA 270 + ++DRL I Y Y R TLQ H + +P +N GQ Sbjct: 244 LAALDWLSIVADRLQRGYVITIDYGYPASRYYNPRRSQGTLQCYYQHRHHNNPYINIGQQ 303 Query: 271 DLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKR 330 D+++HVDF L L G T Q FL LG+ R +L Q LL + Sbjct: 304 DITAHVDFTALERWGDRCGLEKLGFTQQALFLMALGLGSRIAALSYQEIPVSELLHRRQT 363 Query: 331 LVSTSADKKSMGELFKILVVSH 352 L D +G F +L+ S Sbjct: 364 L-HQLIDPTGLGG-FGVLIQSK 383 >gi|312082569|ref|XP_003143498.1| hypothetical protein LOAG_07918 [Loa loa] gi|307761340|gb|EFO20574.1| hypothetical protein LOAG_07918 [Loa loa] Length = 431 Score = 238 bits (607), Expect = 9e-61, Method: Composition-based stats. Identities = 119/382 (31%), Positives = 184/382 (48%), Gaps = 36/382 (9%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC--NPFGAVGDFVTAPEISQIF 59 ++L+ I I NG M+V Y L + P GYYS FG GDF+TAPE++QIF Sbjct: 42 SDQLLHFIKQKINLNGPMSVADYMRLTASSPIGGYYSHHGSKIFGEKGDFITAPELTQIF 101 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 GE++ I+ G +LVE GPG G +M DI R + +LK S+ ++VETS Sbjct: 102 GELVGIWCYYELSYTGHSGEWQLVENGPGTGQLMYDITRTLIQLKATEGSI---HLVETS 158 Query: 120 ERLTLIQKKQLASYGDK------------------INWYTSLADVPLGFTFLVANEFFDS 161 + L Q+ L + + + WY S+ D+P F+ ++NEF D+ Sbjct: 159 DALLSQQESLLCEHTSQFVDGEPYVRCNVTKNGFPVYWYRSVDDIPAKFSVFISNEFLDA 218 Query: 162 LPIKQFVMTEH-GIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAI----FENSP 216 LP+ QF + E +++D+ D L F + E + +E S Sbjct: 219 LPVNQFKRDDKGKWHEVYVNLDKDDRLCFMLSRSENLHTLGLLPKNIREDLSIKEWEISI 278 Query: 217 CRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHV 276 + ++D + GG +++DYG+ +R +L+A KGH V PL NPGQ D+++ V Sbjct: 279 DAGTYINQVADSITKFGGFVLIVDYGHNGTRKDLSLRAYKGHQIVHPLENPGQHDITADV 338 Query: 277 DFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLM---KQTARKDILLDSVKRLVS 333 +F L S+ L + G Q +FL +GI R L+ K K+ L S + L+S Sbjct: 339 NFGYLKSLVKDRTL-VFGPIEQREFLAQMGIGLRLQKLLACCKTKEEKENLFKSYEILLS 397 Query: 334 TSADKKSMGELFKILVVSHEKV 355 K MGE FK++ V + + Sbjct: 398 A----KGMGERFKMMSVFPKTL 415 >gi|78486156|ref|YP_392081.1| hypothetical protein Tcr_1815 [Thiomicrospira crunogena XCL-2] gi|78364442|gb|ABB42407.1| conserved hypothetical protein containing DUF185 [Thiomicrospira crunogena XCL-2] Length = 394 Score = 238 bits (607), Expect = 1e-60, Method: Composition-based stats. Identities = 95/378 (25%), Positives = 159/378 (42%), Gaps = 33/378 (8%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGEM 62 L +KI ++K++G M ++ + + P GYY++ G GDF+TAPE+S IF Sbjct: 20 SLQQKIRQMLKRHGMMPFPRFMEMALYTPGLGYYASGLPKIGQQGDFITAPEVSPIFSRC 79 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 LA E P ++E G G+G M DIL + L+ + Y+VE S L Sbjct: 80 LARQAAQVLETLETP---NVIEFGAGKGTMAKDILLELDALEQP---IEQYYIVELSADL 133 Query: 123 TLIQKKQLASYGDKI-NWYTSLADVPLGF--TFLVANEFFDSLPIKQFVMTEHGIRERMI 179 Q++ L + + + N L +P ++ANE D++P+++ + E + Sbjct: 134 RARQQETLQALPETLFNKVVWLDQLPKDPLQAVVLANEVLDAMPVERLRLEEEQSLRGYV 193 Query: 180 DIDQHDSLVFNIGDHEIKSNFLTCSDYFL---------GAIFENSPCRDREMQSISDRLA 230 ++ + S+ L G E + +QSI+D L+ Sbjct: 194 IFNEDKQRFGWDYHPITDATLQKASNAILNLIGTPSARGYETEINLNIHPWLQSIADFLS 253 Query: 231 CDGGTAIVIDYGYL------QSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSS 283 G ++IDYGY SR TL+ H P PG D+++HVDF ++ Sbjct: 254 Q--GAVLLIDYGYNRKEYYQPSRHMGTLRCHYQHRAHGDPFFFPGLQDITAHVDFTSVAE 311 Query: 284 IAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGE 343 + G TTQ FL G G+ + + + +K L MGE Sbjct: 312 SGFDTGFKVAGYTTQAHFLMGSGLLEMSGDPTADITESLKIAQQIKTLTL----PDEMGE 367 Query: 344 LFKILVVSHE-KVELMPF 360 FK++ ++ + + L+ F Sbjct: 368 SFKVIALTKDVDLSLIGF 385 >gi|15603924|ref|NP_220439.1| hypothetical protein RP045 [Rickettsia prowazekii str. Madrid E] gi|3860615|emb|CAA14516.1| unknown [Rickettsia prowazekii] gi|292571641|gb|ADE29556.1| hypothetical protein rpr22_CDS043 [Rickettsia prowazekii Rp22] Length = 358 Score = 238 bits (607), Expect = 1e-60, Method: Composition-based stats. Identities = 116/354 (32%), Positives = 184/354 (51%), Gaps = 14/354 (3%) Query: 8 KIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFL 67 KI LI ++G +T D ++ YY + GDFVTAPEISQ+FGE++ ++ Sbjct: 6 KIRQLIYQHGYITCDVLMQEVLSSNPNSYYKQVKSLASEGDFVTAPEISQLFGEIIGLWC 65 Query: 68 ICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQK 127 I W++ G P + LVELGPGRG++M D+LR L P+F+ LSI ++E ++ QK Sbjct: 66 IREWQRIGNPKSLSLVELGPGRGLLMRDLLRTAK-LVPEFYKALSIKLIEINKNFIAHQK 124 Query: 128 KQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSL 187 L I + ++P T ++ NEFFD++PIKQ++ + ER+ + D Sbjct: 125 SNLQDINLPIKHLAFIEEIPQKPTIIITNEFFDTMPIKQYIKVKELWYERIFLVQPVDGR 184 Query: 188 V----FNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY 243 + +I + T + GAI E S ++ I++ L G+ ++IDYGY Sbjct: 185 IKYDKISINKRLQEYLLRTHIEAKDGAILEESYKSIEIIKFIAEHLKKVRGSCLIIDYGY 244 Query: 244 -------LQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLT 296 + + TLQAV+ H Y L N G+ADLS+HVDF L ++A K+ + Sbjct: 245 DIAPYDRTRYQYNPTLQAVRKHKYCPILENLGEADLSAHVDFYSLKTVAKNSKINVIDTI 304 Query: 297 TQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVV 350 +Q FL GI R +L + K +L+ +++ + + MG+LFK+L + Sbjct: 305 SQRDFLIQNGILLRKQTLKDKLNDKQVLI--IEKQMERLILPEQMGKLFKVLQI 356 >gi|319945213|ref|ZP_08019475.1| protein of hypothetical function DUF185 [Lautropia mirabilis ATCC 51599] gi|319741783|gb|EFV94208.1| protein of hypothetical function DUF185 [Lautropia mirabilis ATCC 51599] Length = 406 Score = 238 bits (607), Expect = 1e-60, Method: Composition-based stats. Identities = 83/388 (21%), Positives = 150/388 (38%), Gaps = 37/388 (9%) Query: 1 MENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQI 58 + +L +I I + G ++ +Y + + +P GYYS FGA GDFVTAPE++ + Sbjct: 22 VSQELSTRIAAEIARHGGWLSFARYMEMALYEPGLGYYSNPGQVFGAAGDFVTAPELTPL 81 Query: 59 FGEMLAIFLICAWEQHGFPSCVRLV-ELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVE 117 FG LA + + ++V E+G G G++ +L + + + ++E Sbjct: 82 FGATLARQVSPWLKDPALAGSGQVVLEVGGGSGMLAAQLLNALDNVGHHE---VRYLILE 138 Query: 118 TSERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHG 173 S Q++ L D++ W + + G +VANE D++P++ F Sbjct: 139 LSAERREHQRQTLKSLAPGLMDRVGWLETFPESFAG--VVVANELLDAMPVQLFEWQADA 196 Query: 174 IRERMIDIDQHDSLVFNIGDHEIKSNFLTC------------SDYFLGAIFENSPCRDRE 221 E F + + + E P + Sbjct: 197 GAELQEMGVTWVDGQFAWAPRPADAVLTETVTALRNRLGPEGAQWHSPYRSEICPAQQAW 256 Query: 222 MQSISDRLACDGGTAI----VIDYGYLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHV 276 M++++D + + Y R TL H P + PG +D+++HV Sbjct: 257 MRTLADCMTAGVVMLLDYGFAAPEYYHPQRDQGTLMCHYRHRSHADPFLWPGLSDITAHV 316 Query: 277 DFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARK-DILLDSVKRLVSTS 335 DF L+ A + G T+ FL G+ L ++ +V++L+S Sbjct: 317 DFTALARAATAEGFSLVGYTSMAAFLLNAGLLDELADLPREPESFWFAQAQAVQQLIS-- 374 Query: 336 ADKKSMGELFKILVVSH---EKVELMPF 360 + MGELFK++ E ++ F Sbjct: 375 --EAEMGELFKVIAFEKNLREAASVLGF 400 >gi|325198908|gb|ADY94364.1| conserved hypothetical protein [Neisseria meningitidis G2136] Length = 382 Score = 238 bits (606), Expect = 1e-60, Method: Composition-based stats. Identities = 93/376 (24%), Positives = 153/376 (40%), Gaps = 37/376 (9%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGE 61 KL I I K+G + ++ L + P++GYY+ + G GDF+TAP ++ +F + Sbjct: 16 KLQTLIAEKIGKHGNWIPFSRFMELVLYAPQYGYYTGGSHKIGNTGDFITAPTLTSLFAQ 75 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 LA L Q + E G G G + D+L I + Y++E S Sbjct: 76 TLARQLQELLSQT----AGNIYEFGAGTGQLAADLLGSISD------GISRYYIIEISPE 125 Query: 122 LTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 L QK + K+ T+L + G ++ NE D++P++ E G E Sbjct: 126 LAARQKNLIQARAPEASQKVVHLTTLPEAFDG--IIIGNEVLDAMPVEIVRKDEGGSFEH 183 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG----AIFENSPCRDREMQSISDRLACDG 233 + + ++ S + S YF E P + +++++ RL Sbjct: 184 VGVCTDNGRFAYSARPLHDPSLSTSASLYFPQTDYPYTSELHPQQYAFIRTLASRLE--H 241 Query: 234 GTAIVIDY------GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAI 286 G I IDY Y R TL + H +P G ADL++HV+F ++ Sbjct: 242 GCMIFIDYGFDAAQYYHPQRNQGTLIGHYRHHVIHNPFDFIGLADLTAHVNFTDIAQAGT 301 Query: 287 LYKLYINGLTTQGKFLEGLGIWQRAFSLMK-QTARKDILLDSVKRLVSTSADKKSMGELF 345 L + G Q FL LGI + K +A +V++L+ D+ MGELF Sbjct: 302 DAGLDLIGYLPQSHFLLNLGITELLAQTGKTNSAAYIREAAAVQKLI----DQHEMGELF 357 Query: 346 KILVVSHE-KVELMPF 360 K++ ++ F Sbjct: 358 KVIAFGKNIGIDWAGF 373 >gi|119480871|ref|XP_001260464.1| hypothetical protein NFIA_085200 [Neosartorya fischeri NRRL 181] gi|119408618|gb|EAW18567.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181] Length = 503 Score = 238 bits (606), Expect = 1e-60, Method: Composition-based stats. Identities = 112/453 (24%), Positives = 177/453 (39%), Gaps = 103/453 (22%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-----NPFGAVGDFVTAPEIS 56 L + + N IK G + + + + PE GYY+T FG GDFVT+PEIS Sbjct: 39 STPLAKTLANAIKVTGPIPIAAFMRQVLTSPEGGYYTTRPEGGGEVFGKKGDFVTSPEIS 98 Query: 57 QIFGEMLAIFLICAWEQHGFPSC-VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 Q+FGE++ I+ I W G V+L+E+GPG+G +M D+LR K S+ +IY+ Sbjct: 99 QVFGELVGIWTITEWMAQGLKRSGVQLIEVGPGKGTLMDDMLRTFRNFKSFASSLEAIYL 158 Query: 116 VETSERLTLIQKKQLAS--------------------YGDKINWYTSLADVPLGFTFLVA 155 VE S L +QK++L + L F+ A Sbjct: 159 VEASPTLREVQKQRLCGDAAMEETDIGHKSISKYFNVPVIWVEDIRLLPHEEDKTPFIFA 218 Query: 156 NEFFDSLPIKQFVMTE------------------------------HGIRERMIDIDQ-- 183 +EFFD+LPI F RE M+ ++ Sbjct: 219 HEFFDALPIHAFESIPPAPENQSEQKEIMTPTGPAKLHQPMKPANTPQWREIMVTLNPKA 278 Query: 184 -----HDSLVFNIG-----------DHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISD 227 F + EI + G+ E SP + Sbjct: 279 VEENIEGEPEFKLTLAKASTPSSLVIPEISERYRKLKSTP-GSTIEVSPESRIYASDFAR 337 Query: 228 RL---------------------ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVN 266 R+ G A+++DYG + + ++L+ ++ H V L + Sbjct: 338 RIGGSSQPPRTVGSRNAPAAQPKKVPSGAALIMDYGTMSTIPINSLRGIQHHRTVPALSS 397 Query: 267 PGQADLSSHVDFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLMK---QTARK 321 PGQ D+S+ VDF L+ AI + ++G QG FL+ +GI +R L+K ++ Sbjct: 398 PGQVDVSADVDFMALAEAAIEASEGVEVHGPVEQGDFLQVMGIAERMQQLLKGVQDEEKR 457 Query: 322 DILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 L KRL+ MG+++K + + E Sbjct: 458 KTLESGWKRLIERGG--GGMGKIYKFMAIIPEN 488 >gi|284799413|ref|ZP_05983926.2| putative peptidoglycan synthetase FtsI [Neisseria subflava NJ9703] gi|284797793|gb|EFC53140.1| putative peptidoglycan synthetase FtsI [Neisseria subflava NJ9703] Length = 383 Score = 238 bits (606), Expect = 1e-60, Method: Composition-based stats. Identities = 80/372 (21%), Positives = 154/372 (41%), Gaps = 32/372 (8%) Query: 5 LIRKIVNLIKKN-GQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGEM 62 L + I N I+++ + ++ L + P++GYYS + G GDF+TAP +S +FG+ Sbjct: 19 LTKLIKNEIEQHQNWIPFSRFMELALYTPQYGYYSGGSHKIGTDGDFITAPTLSPLFGQT 78 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 LA L Q + E G G G + +L+ + + Y++E S L Sbjct: 79 LAKQLAELLPQT----AGNIYEFGAGTGHLAATLLQNLSD------GLNHYYIIELSAEL 128 Query: 123 TLIQKKQLASYGDK-----INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 Q++ + + + T+L + G ++ NE D++P+++ + + G ++ Sbjct: 129 AERQRQHILEHTSPEAATKVIHLTTLPEHFDG--IIIGNEVLDAMPVERLIYQDEGFQQI 186 Query: 178 MIDIDQHD--SLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGT 235 + ++ + + + E+ F E P + +Q+++ +L G Sbjct: 187 GVSLENDELIEAIRPLTQAELIQTASLYLPPFHSYTSELHPAQYAFIQTLAAKLQRGGII 246 Query: 236 AI-----VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKL 290 I Y + Q + G + + HT P N G DL++HV+F ++ L Sbjct: 247 FIDYGFDATQYYHPQRKEGTFIGHYRHHTIHDPFFNIGLTDLTAHVNFTDIARAGTEAGL 306 Query: 291 YINGLTTQGKFLEGLGIWQRAFSL-MKQTARKDILLDSVKRLVSTSADKKSMGELFKILV 349 + G Q FL LGI + + +V++L+ MGELFK++ Sbjct: 307 DLIGYLPQSYFLLNLGITDLLAQIGSPDSIEYIQAAAAVQKLIHQ----HEMGELFKVIA 362 Query: 350 VSHE-KVELMPF 360 ++ F Sbjct: 363 FGKNIDIDWAGF 374 >gi|51473296|ref|YP_067053.1| hypothetical protein RT0085 [Rickettsia typhi str. Wilmington] gi|51459608|gb|AAU03571.1| conserved hypothetical protein [Rickettsia typhi str. Wilmington] Length = 358 Score = 238 bits (606), Expect = 1e-60, Method: Composition-based stats. Identities = 116/357 (32%), Positives = 186/357 (52%), Gaps = 16/357 (4%) Query: 8 KIVNLIKKNGQMTVDQYFALCV-ADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIF 66 KI LI ++G +T D + P YY + GDFVTAPEISQ+FGE++ ++ Sbjct: 6 KIRQLIDQHGYITCDVLMQEVLSLHPNA-YYKQVKSLASEGDFVTAPEISQLFGEIIGLW 64 Query: 67 LICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQ 126 I W++ G P + LVELGPGRG++M D+LR L P+F++ LSI ++E ++ Q Sbjct: 65 CIREWQRIGCPKSLSLVELGPGRGLLMRDLLRTAK-LVPEFYNSLSITLIEINKNFIAHQ 123 Query: 127 KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ-HD 185 K L I + ++P T ++ NEFFD++PIKQ++ + ER+ + Sbjct: 124 KSNLQDINLPIKHLEFIEEIPQKPTIIITNEFFDTMPIKQYIKVKELWYERIFSVQPVDG 183 Query: 186 SLVFNIGDHEIKSNFLTCSDY---FLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 S+ ++ + + GA+ E S ++ I++ L G+ ++IDYG Sbjct: 184 SIKYDKISINKRLQEYLLRTHIAAKDGAVLEESYKSIEIIKFIAEHLKKVSGSCLIIDYG 243 Query: 243 Y-------LQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGL 295 Y + + TLQAV+ H Y L N G+ADLS+HVDF L ++A K+ + Sbjct: 244 YDIAPYNRTRYQYNPTLQAVRKHKYCPILENLGEADLSAHVDFYSLKTVAKNSKINVIDT 303 Query: 296 TTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 +Q FL GI R +L ++ + +L+ +++ V K MG+LFK+L + H Sbjct: 304 ISQRNFLIQNGILLRKQALKEKLNDEQVLI--IEKQVERLILPKQMGKLFKVLQIMH 358 >gi|298492340|ref|YP_003722517.1| hypothetical protein Aazo_3886 ['Nostoc azollae' 0708] gi|298234258|gb|ADI65394.1| protein of unknown function DUF185 ['Nostoc azollae' 0708] Length = 394 Score = 237 bits (605), Expect = 2e-60, Method: Composition-based stats. Identities = 97/381 (25%), Positives = 157/381 (41%), Gaps = 34/381 (8%) Query: 1 MENK--LIRKIVNLIKKN--GQMTVDQYFALCVADPEFGYYST-CNPFG-AVGDFVTAPE 54 M++ L + I I + ++T +Y + + P+ GYYS+ G GDF T+ Sbjct: 1 MDSNPALCQAIAKCINTSPQHRITFAEYMDMALYHPKHGYYSSDAVKIGFRGGDFFTSSS 60 Query: 55 ISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIY 114 + FGE+LA+ W+ PS LVE+G G+G + IL + PDF + L Sbjct: 61 LGNDFGELLAVQFFQMWQILEQPSPFHLVEMGAGQGTLASHILNYLKLQHPDFLTALEYI 120 Query: 115 MVETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGI 174 +VE S L Q+++L + + + + F +NE D+ P+ QF++ + Sbjct: 121 IVEKSPSLRTQQQQRLQDFSVRWCNLEEITPNSIVGCF-FSNELVDAFPVHQFILEAGEL 179 Query: 175 RERMIDIDQHDSLVFNIG----------------DHEIKSNFLTCSDYFLGAIFENSPCR 218 RE + ++ + ++ L + Y G E + Sbjct: 180 REIYVTTLENGENTASDFSFIEVVAEVSTPKLVAHFDLVGIDLNPNVYEDGYRSEINLAA 239 Query: 219 DREMQSISDRLACDGGTAIVIDY------GYLQSRVGDTLQAVKGHTYVS-PLVNPGQAD 271 + ++D L G + IDY Y R TLQ H Y P +N G D Sbjct: 240 LDWLGIVADCL--QQGYVLTIDYGYPASRYYHPRRSQGTLQCYYHHRYHDHPYINIGGQD 297 Query: 272 LSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRL 331 +++HVDF L + L G T QG FL LG+ +R +L Q LL+ + L Sbjct: 298 ITAHVDFTALETWGKKCGLDAVGWTQQGLFLMALGLGERIAALSYQEQPISQLLNRREAL 357 Query: 332 VSTSADKKSMGELFKILVVSH 352 + +G F +LV S Sbjct: 358 -HQLIEPTGLGN-FGVLVQSK 376 >gi|170099455|ref|XP_001880946.1| predicted protein [Laccaria bicolor S238N-H82] gi|164644471|gb|EDR08721.1| predicted protein [Laccaria bicolor S238N-H82] Length = 441 Score = 237 bits (605), Expect = 2e-60, Method: Composition-based stats. Identities = 124/433 (28%), Positives = 194/433 (44%), Gaps = 81/433 (18%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYS--TCNPFGAVGDFVTAPEISQIFG 60 +K+ + I++ IK G + Y LC++ P GYY + FG GDF+T+PEISQ+FG Sbjct: 10 SKVEKIILDGIKATGPIPFSTYMQLCLSHPTHGYYMNPSHPVFGTRGDFITSPEISQVFG 69 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 E++ ++L+ WE G P+ VRLVELGPGRG +M D+LRVI ++ P S +++++VETS Sbjct: 70 ELVGVWLLSQWENAGRPAAVRLVELGPGRGTLMDDVLRVISRIIP-GNSSINVHLVETSS 128 Query: 121 RLTLIQKKQLASY-------GDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHG 173 + +Q+ +L S + + + +T A+EFFD+LPI TE G Sbjct: 129 SMRSLQEAKLCSSSRQAKFDIHWHHSVSDIPPSVSEYTMFFAHEFFDALPIHTLQKTETG 188 Query: 174 IRERMIDIDQHDSLVFNIGDHEIKSNF------------------------------LTC 203 E +ID + + + K N Sbjct: 189 WHEVLIDANPDYNATGDCQTDVEKPNVLKTTTNSNSRLRRVLSRSPSATSTLLGQSSPRF 248 Query: 204 SDYFLGAIFENSPCRDREMQSISDRLACDG------------GTAIVIDYGYLQSRVGDT 251 S+ +G+ E SP R + + + L + G ++IDYG GD+ Sbjct: 249 SEIPIGSSIEVSPTSFRTARQVGELLLGNVKNQEDESSRSPGGCGLIIDYG-DDHVFGDS 307 Query: 252 LQA-----------------VKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYING 294 + H V PG DL+S+VDF L + ++G Sbjct: 308 FRVSRMRKVNLHFFTAAAKAFSNHKLVDVFDQPGDCDLTSNVDFTYLREAVEDL-VTVHG 366 Query: 295 LTTQGKFLEGLGIWQRAFSL---MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVS 351 Q FLE +G+ R SL K ARK + D+ +RLVS K MG +K++ ++ Sbjct: 367 PIPQYMFLEKMGLQLRLDSLVRAAKAEARKTAIRDAAERLVS----KSGMGTEYKVMGLT 422 Query: 352 ---HEKVELMPFV 361 + ++ PFV Sbjct: 423 TGVKDGGKVWPFV 435 >gi|206559015|ref|YP_002229775.1| hypothetical protein BCAL0616 [Burkholderia cenocepacia J2315] gi|198035052|emb|CAR50925.1| conserved hypothetical protein [Burkholderia cenocepacia J2315] Length = 396 Score = 237 bits (605), Expect = 2e-60, Method: Composition-based stats. Identities = 87/371 (23%), Positives = 150/371 (40%), Gaps = 34/371 (9%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGAV----GDFVTAPEI 55 L ++ + I G + D++ + P GYYS FG DFVTAPE+ Sbjct: 22 SETLAAQLRDEIAAAGGWLPFDRFMERALYAPGLGYYSGGARKFGRRADDGSDFVTAPEL 81 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F + LA + A G R++E G G G ++ L + + Sbjct: 82 SPLFAQTLAQPVAEALAASG---TRRVMEFGAGTG---KLAAGLLAALDAPGVELDEYLI 135 Query: 116 VETSERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 V+ S L Q+ + + K+ W +L + G +V NE D++P++ F + Sbjct: 136 VDLSGELRERQRDTIEAAVPALAAKVRWLDALPERFDG--VVVGNEVLDAMPVRLFAKAD 193 Query: 172 HGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRL 229 RER + +D + VF+ + D G + E +++ L Sbjct: 194 GAWRERGVALDARHAFVFDDRPVGAAGLPAVLAALDVGDGYVTETHEAALAFTRTVCTML 253 Query: 230 ACDGGTAIVIDY------GYLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHVDFQRLS 282 G +++DY Y R T + + H + P PG D+++HV+F + Sbjct: 254 GR--GAVLLVDYGFPAHEYYHPQRDRGTLMCHYRHHAHDDPFAYPGLQDITAHVEFTGIY 311 Query: 283 SIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARK-DILLDSVKRLVSTSADKKSM 341 AI + G T+Q +FL GI ++ ++V++L+S + M Sbjct: 312 EAAIATGAELLGYTSQARFLLNAGITDALAAIDPSDIHAFLPAANAVQKLIS----EAEM 367 Query: 342 GELFKILVVSH 352 GELFK++ S Sbjct: 368 GELFKVIAFSR 378 >gi|209965197|ref|YP_002298112.1| hypothetical protein RC1_1903 [Rhodospirillum centenum SW] gi|209958663|gb|ACI99299.1| conserved hypothetical protein [Rhodospirillum centenum SW] Length = 331 Score = 237 bits (605), Expect = 2e-60, Method: Composition-based stats. Identities = 125/311 (40%), Positives = 167/311 (53%), Gaps = 8/311 (2%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGE 61 E L I I+ G + V Y ALC+ P+ GYY+T +P GA GDF TAPEISQ+FGE Sbjct: 14 ETPLAALIARQIRLTGPLPVSAYMALCLGHPQHGYYTTRDPLGAGGDFTTAPEISQMFGE 73 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 ML ++ W G P+ V LVELGPGRG +M D LR ++ P F + L ++VETS Sbjct: 74 MLGLWAAHCWLAMGSPAGVALVELGPGRGTLMADALRATARV-PGFHAALRPHLVETSPV 132 Query: 122 LTLIQKKQLASYGDKIN--WYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIR---- 175 L Q LA+ W+ L +VP G L+ANEFFD+LPI+Q V EH R Sbjct: 133 LRQRQAAALAALPAPPAPVWHDRLEEVPEGPLLLLANEFFDALPIRQLVRQEHRGRLLWA 192 Query: 176 ERMIDIDQHDSLVFNIGDHEIKSNFLT-CSDYFLGAIFENSPCRDREMQSISDRLACDGG 234 ER + +D L + + ++ G++ E +P Q I RLA G Sbjct: 193 ERKVGLDADGRLAWVLDPAAGEALVPPALRGSPPGSVVELAPAAAAVAQEIGGRLARAPG 252 Query: 235 TAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYING 294 A+++DYG+ VGDTLQAV+ H + PL PG+ADL++HVDF L++ G Sbjct: 253 AALMVDYGHAGPAVGDTLQAVRRHRFADPLEAPGEADLTAHVDFAALAAALRAGGAAAWG 312 Query: 295 LTTQGKFLEGL 305 TQG L L Sbjct: 313 PVTQGDLLSAL 323 >gi|121592928|ref|YP_984824.1| hypothetical protein Ajs_0497 [Acidovorax sp. JS42] gi|120605008|gb|ABM40748.1| protein of unknown function DUF185 [Acidovorax sp. JS42] Length = 366 Score = 237 bits (605), Expect = 2e-60, Method: Composition-based stats. Identities = 93/370 (25%), Positives = 145/370 (39%), Gaps = 34/370 (9%) Query: 3 NKLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTC-NPFGA----VGDFVTAPEIS 56 + L + I G + D++ AL + P GYY+ FGA DFVTAPE+S Sbjct: 10 SALQTHVAKAIAEAGGWIGFDRFMALALYTPGLGYYAGDLPKFGAMPASGSDFVTAPELS 69 Query: 57 QIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMV 116 +FG +LA + A + + + E G G L V +V Sbjct: 70 PVFGRVLARQVREALDAT---ATDEVWEFGAGS------GALAAQLLGALDDRVRRYTIV 120 Query: 117 ETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRE 176 + S L Q+ LA++GD+++W L D G +V NE D++P++ E Sbjct: 121 DLSGSLRARQQATLAAWGDRVHWVDRLPDQMQG--VVVGNEVLDAMPVQLLQRRAGLWHE 178 Query: 177 RMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTA 236 R + +D+ + F D D + E P + M++++ L Sbjct: 179 RGVALDESGNG-FVWQDRPTPLRPPVEIDGPHDYLTEIHPQGEAFMRTLAQHLVRGAAFL 237 Query: 237 IVI----DYGYLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSIAILYKLY 291 I D Y R TL +GH PL + G D+++HV+F ++ A L Sbjct: 238 IDYGFGEDEYYHPQRHMGTLVCHRGHQVDSDPLADVGLKDITAHVNFTAMALAAQDAGLQ 297 Query: 292 INGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVS 351 + G TTQ FL G+ L + + K + MGELFK+L V Sbjct: 298 VLGYTTQAHFLINCGLLSELERL-----TQAQRAQAAK-----LMMEHEMGELFKVLAVG 347 Query: 352 H-EKVELMPF 360 + F Sbjct: 348 AGAPWRPLGF 357 >gi|296114641|ref|ZP_06833294.1| hypothetical protein GXY_02651 [Gluconacetobacter hansenii ATCC 23769] gi|295978997|gb|EFG85722.1| hypothetical protein GXY_02651 [Gluconacetobacter hansenii ATCC 23769] Length = 367 Score = 237 bits (605), Expect = 2e-60, Method: Composition-based stats. Identities = 116/350 (33%), Positives = 169/350 (48%), Gaps = 23/350 (6%) Query: 18 QMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFP 77 + +D++ A YYS +PF DF+TAPEISQ+FGE+L ++ W+ G P Sbjct: 7 PVRLDRFMARA----NAAYYSGRDPF---ADFITAPEISQMFGELLGAWVAVTWQAMGTP 59 Query: 78 SCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASY--GD 135 + L E GPGRG +M D LR++ ++ P ++ ++MVETS RL +Q LA + Sbjct: 60 TPFVLAEAGPGRGTLMADALRLLARVAPACYAAARVHMVETSPRLRQVQATSLAPHVGVC 119 Query: 136 KINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHE 195 W+ ++ +P G L+ANEF D+LPI+QFV T G ER + VF Sbjct: 120 APVWHDAVTGLPPGAMILLANEFLDALPIRQFVRTARGWDERSVV-----GEVFVTAPAT 174 Query: 196 IKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQ 253 + D G + E + + RL G A+ IDYGY GDTLQ Sbjct: 175 DLPADGPLATRDVPEGEVLETCEGALGIARQLGRRLMDGMGAALFIDYGYDGPGWGDTLQ 234 Query: 254 AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFS 313 A+ PL PG ADL++HVDF A G TQG FL LG++ RA Sbjct: 235 ALCDGHPAWPLARPGMADLTAHVDFAAFGMAARAGGCTTWGSVTQGAFLSTLGLFPRAQQ 294 Query: 314 LMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE-LMPF 360 L + + + ++ +RL A MG LF+++ ++ + L F Sbjct: 295 LARNQPHEVARQIGEAAQRL----AAPDRMGGLFRVMALTSPGITGLAGF 340 >gi|22297960|ref|NP_681207.1| hypothetical protein tlr0417 [Thermosynechococcus elongatus BP-1] gi|22294138|dbj|BAC07969.1| tlr0417 [Thermosynechococcus elongatus BP-1] Length = 385 Score = 237 bits (605), Expect = 2e-60, Method: Composition-based stats. Identities = 78/366 (21%), Positives = 144/366 (39%), Gaps = 17/366 (4%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIFGE 61 +L+ + I+ G ++ ++ AL + P++GYY+ G GDF+T+ +++ F E Sbjct: 5 TQLLAALQERIRAAGAISFCEFMALALYAPQWGYYNRPQLQIGRRGDFITSSSLTRDFAE 64 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 +L + W P L+E+G G G +L PDFF+ L + E S Sbjct: 65 LLTEAFVQMWHALERPQRFTLLEMGAGEGQFAEGVLGYSQATYPDFFAALEYQIQEPSPS 124 Query: 122 LTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 L Q+++LA +GD++ W + +NE D+ P+ + +E + + Sbjct: 125 LRERQRQRLAPWGDRLRWRDLDTACEPIVGCIFSNELVDAFPVHRLQWQGDNWQEIYVSL 184 Query: 182 DQHDSLVFNIGDHEIKSNFLTCSD---------YFLGAIFENSPCRDREMQSISDRLACD 232 + + +G + Y G E + ++ +S+ L Sbjct: 185 NAQGAFQEVLGPLSDDRIHEYFATVGIDPQQQGYSDGYRTEVNLNLIPWLKDLSEHLKRG 244 Query: 233 GGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSIAIL 287 I Y Y +R TLQ H +P G+ D+++HVD L+ Sbjct: 245 FVLTIDYGYPAQQYYHPARCEGTLQCYYQHRCHNNPYCFVGEQDITAHVDVTALTCYGEQ 304 Query: 288 YKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKI 347 + L +T Q FL LG+ R + + L+ + L D +G F + Sbjct: 305 FGLATLYVTRQSLFLMALGLGDRLVAQQQSNGNLLQALNRHQAL-HQLIDPLGLGG-FYV 362 Query: 348 LVVSHE 353 ++ + Sbjct: 363 VLQGKQ 368 >gi|187925703|ref|YP_001897345.1| hypothetical protein Bphyt_3733 [Burkholderia phytofirmans PsJN] gi|187716897|gb|ACD18121.1| protein of unknown function DUF185 [Burkholderia phytofirmans PsJN] Length = 396 Score = 237 bits (605), Expect = 2e-60, Method: Composition-based stats. Identities = 88/370 (23%), Positives = 151/370 (40%), Gaps = 32/370 (8%) Query: 2 ENKLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGD----FVTAPEI 55 L+ +I + G + D+Y + P GYYS FG GD FVTAPE+ Sbjct: 22 SEALVAQIRAELDTAGGWLPFDRYMERALYAPGLGYYSGGARKFGLRGDDGSDFVTAPEL 81 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F LA + A + G ++E G G G + +L + L +F S + Sbjct: 82 SPLFAATLARPVAEALQASG---TRDVMEFGAGTGKLAAGVLNALDGLGAEFDS---YSI 135 Query: 116 VETSERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 V+ S L Q++ + + K+ W +L + G ++ NE D++P++ F Sbjct: 136 VDLSGELRERQRETIEAAAPTLAAKVRWLDALPERFEG--VVIGNEVLDAMPVRLFAFNG 193 Query: 172 HGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYF---LGAIFENSPCRDREMQSISDR 228 ER + ++ + F+ ++ S+ + E ++I Sbjct: 194 GAWLERG-VVWRNGAFAFDDRPVSAAADLALLSEIETAGEAYVTETHEAASAFTRTICTM 252 Query: 229 LACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSS 283 LA I + Y R TL H P + PG D+++HV+F ++ Sbjct: 253 LARGAAFFIDYGFPRHEYYHAQRAQGTLMCHYRHRAHGDPFLYPGLQDITAHVEFTGIAE 312 Query: 284 IAILYKLYINGLTTQGKFLEGLGIWQRAFSLM-KQTARKDILLDSVKRLVSTSADKKSMG 342 + + G T+Q +FL GI + TAR ++V++L+S + MG Sbjct: 313 AGVETGADLLGFTSQARFLLNAGITDVLSEIDPADTARYLPAANAVQKLLS----EAEMG 368 Query: 343 ELFKILVVSH 352 ELFK++ S Sbjct: 369 ELFKVIAFSR 378 >gi|258543360|ref|YP_003188793.1| hypothetical protein APA01_23000 [Acetobacter pasteurianus IFO 3283-01] gi|256634438|dbj|BAI00414.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-01] gi|256637496|dbj|BAI03465.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-03] gi|256640548|dbj|BAI06510.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-07] gi|256643605|dbj|BAI09560.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-22] gi|256646660|dbj|BAI12608.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-26] gi|256649713|dbj|BAI15654.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-32] gi|256652701|dbj|BAI18635.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-01-42C] gi|256655757|dbj|BAI21684.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-12] Length = 342 Score = 237 bits (604), Expect = 2e-60, Method: Composition-based stats. Identities = 109/342 (31%), Positives = 167/342 (48%), Gaps = 17/342 (4%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCV 80 +D + A YY++ DF+TAPEISQ+FGE+L + W+ G P+ V Sbjct: 12 LDAFMARA----NARYYASKPLL---SDFITAPEISQVFGELLGAWAATVWQNMGCPAQV 64 Query: 81 RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWY 140 L E GPGRG +M D LR+I + PDF L++++VETS + QK+ LA Y + W+ Sbjct: 65 ILAEAGPGRGTLMADALRLITRCAPDFAHALNVHLVETSPLMRQAQKQALAPY-ARPTWH 123 Query: 141 TSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNF 200 + D+P G L+ NEF D+LPI+QFV T G ER + ++ D + Sbjct: 124 DRIEDLPSGPLILLGNEFLDALPIRQFVQTHSGWHERYV---IDEAFHLVPCDAPV---M 177 Query: 201 LTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTY 260 I E + + R A G A+ IDYG+ + G +LQA++ Sbjct: 178 PDGRKLEPDTIVELCEPALEVARYLGQRFAAQPGVALFIDYGHHSTLTGASLQALRHARP 237 Query: 261 VSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR 320 PL G+ADL++HVDF + A +I G TQG FL LG+ +R L + Sbjct: 238 AHPLEAAGEADLTAHVDFTAFGATAQQAGGHIYGAETQGTFLRALGLIERTEQLATRATT 297 Query: 321 KDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE-LMPFV 361 +D + ++ A + MG LF+++ ++ + F+ Sbjct: 298 EDAAI--LRSAAHRLAAPEKMGHLFRVMALASPGLPSPPGFM 337 >gi|308388611|gb|ADO30931.1| hypothetical protein NMBB_0451 [Neisseria meningitidis alpha710] Length = 405 Score = 237 bits (604), Expect = 2e-60, Method: Composition-based stats. Identities = 93/376 (24%), Positives = 153/376 (40%), Gaps = 37/376 (9%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGE 61 KL I I K+G + ++ L + P++GYY+ + G GDF+TAP ++ +F + Sbjct: 39 KLQTLIAEKIGKHGNWIPFSRFMELVLYAPQYGYYTGGSHKIGNTGDFITAPTLTSLFAQ 98 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 LA L Q + E G G G + D+L I + Y++E S Sbjct: 99 TLARQLQELLSQT----AGNIYEFGAGTGQLAADLLGSISD------GISRYYIIEISPE 148 Query: 122 LTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 L QK + K+ T+L + G ++ NE D++P++ E G E Sbjct: 149 LAARQKNLIQARAPEASQKVVHLTALPEAFDG--IIIGNEVLDAMPVEIVRKDEGGSFEH 206 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG----AIFENSPCRDREMQSISDRLACDG 233 + + ++ S + S YF E P + +++++ RL Sbjct: 207 VGVCTDNGRFAYSARPLHDPSLSTSASLYFPQTDYPYTSELHPQQYAFIRTLASRLE--H 264 Query: 234 GTAIVIDY------GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAI 286 G I IDY Y R TL + H +P G ADL++HV+F ++ Sbjct: 265 GCMIFIDYGFDAAQYYHPQRNQGTLIGHYRHHVIHNPFDFIGLADLTAHVNFTDIAQAGT 324 Query: 287 LYKLYINGLTTQGKFLEGLGIWQRAFSLMK-QTARKDILLDSVKRLVSTSADKKSMGELF 345 L + G Q FL LGI + K +A +V++L+ D+ MGELF Sbjct: 325 DAGLDLIGYLPQSHFLLNLGITELLAQTGKTNSAAYIREAAAVQKLI----DQHEMGELF 380 Query: 346 KILVVSHE-KVELMPF 360 K++ ++ F Sbjct: 381 KVIAFGKNIGIDWAGF 396 >gi|189184598|ref|YP_001938383.1| hypothetical protein OTT_1691 [Orientia tsutsugamushi str. Ikeda] gi|189181369|dbj|BAG41149.1| hypothetical protein OTT_1691 [Orientia tsutsugamushi str. Ikeda] Length = 384 Score = 237 bits (604), Expect = 3e-60, Method: Composition-based stats. Identities = 123/376 (32%), Positives = 195/376 (51%), Gaps = 19/376 (5%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 ++ + I +I+ +TV+ ++ + YY P G GDF+TAPEISQ+FGEM+ Sbjct: 2 EIEQHIRQIIRSENNITVENLMSIVMESRYNSYYRIQQPLGKAGDFITAPEISQMFGEMI 61 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 I+ I W + P + L+ELGPG+G ++ DIL +K ++ S+ +VE + L Sbjct: 62 GIWCIDLWHKLNRPQKIDLIELGPGKGTLLCDILNATRHIKKFSTAI-SLTLVEINCSLK 120 Query: 124 LIQKKQLASYGDKINWYTSLADVPLG-FTFLVANEFFDSLPIKQFVMT------EHGIRE 176 IQ+ L S+ I W S+ + T ++ANEFFD+LPIKQ++ + E Sbjct: 121 KIQQDNLLSFNVPIKWVKSVNHIVSSYPTIILANEFFDALPIKQYIKKINQQSGQINWLE 180 Query: 177 RMIDIDQHDSLVFNIGDHEIKSNFLTC--SDYFLGAIFENSPCRDREMQSISDRLACDGG 234 R++ ID ++ L F+ D +I + ++ G + E SP + + +Q++S+ L +GG Sbjct: 181 RVVKIDNNNKLYFDTIDADINEHKFLKLHNNAPNGGVLEISPAQHQTIQAVSNLLKKNGG 240 Query: 235 TAIVIDYGYLQSR-------VGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAIL 287 +++IDYGY S +LQAVK H Y L N G ADLS+HVDF L +IA+ Sbjct: 241 GSLIIDYGYDISPEQRKNYQYNSSLQAVKHHQYHPLLENLGCADLSAHVDFWSLKNIAVA 300 Query: 288 YKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKI 347 + G +Q L LGI R L KQ + L + + ++MGELFK Sbjct: 301 EGIASFGSISQNVLLHKLGIKTRLNML-KQINSDENLASKLDLQYNRLTYVRAMGELFKA 359 Query: 348 LVVSH-EKVELMPFVN 362 + ++ + + F N Sbjct: 360 IAITSAPSIIPLGFCN 375 >gi|172061927|ref|YP_001809579.1| hypothetical protein BamMC406_2887 [Burkholderia ambifaria MC40-6] gi|171994444|gb|ACB65363.1| protein of unknown function DUF185 [Burkholderia ambifaria MC40-6] Length = 396 Score = 237 bits (604), Expect = 3e-60, Method: Composition-based stats. Identities = 83/371 (22%), Positives = 149/371 (40%), Gaps = 34/371 (9%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGAV----GDFVTAPEI 55 L ++ + I G + D++ + P GYYS FG DFVTAPE+ Sbjct: 22 SEMLAAQLRDEIAAAGGWLPFDRFMERALYAPGLGYYSGGARKFGRRADDGSDFVTAPEL 81 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F + LA + A G R++E G G G + + + + + Sbjct: 82 SPLFAQTLANPVADALAASG---TRRVMEFGAGTGKLAAGL---LAAFDALDVELDEYLI 135 Query: 116 VETSERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 V+ S L Q+ + + K+ W +L + G +V NE D++P++ F + Sbjct: 136 VDLSGELRERQRDTIAASAPALAGKVRWLDALPERFDG--VVVGNEVLDAMPVRLFAKSG 193 Query: 172 HGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRL 229 ER + +D + VF + D G + E +++ L Sbjct: 194 GAWLERGVTLDARHAFVFGDRPVGAAGLPPVLATLDVDDGYVTETHEAALAFTRTVCTML 253 Query: 230 ACDGGTAIVIDY------GYLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHVDFQRLS 282 A G +++DY Y R T + + H + + PG D+++HV+F + Sbjct: 254 AR--GAVLLVDYGFPAHEYYHPQRDRGTLMCHYRHHAHDDAFLYPGLQDITAHVEFTGIY 311 Query: 283 SIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARK-DILLDSVKRLVSTSADKKSM 341 + + G T+Q +FL GI ++ + ++V++L+S + M Sbjct: 312 DAGVGTGADLLGYTSQARFLLNAGITDALAAIDPSDIHQFLPAANAVQKLIS----EAEM 367 Query: 342 GELFKILVVSH 352 GELFK++ S Sbjct: 368 GELFKVIAFSR 378 >gi|78067795|ref|YP_370564.1| hypothetical protein Bcep18194_A6326 [Burkholderia sp. 383] gi|77968540|gb|ABB09920.1| protein of unknown function DUF185 [Burkholderia sp. 383] Length = 397 Score = 236 bits (603), Expect = 3e-60, Method: Composition-based stats. Identities = 88/372 (23%), Positives = 152/372 (40%), Gaps = 35/372 (9%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGAV----GDFVTAPEI 55 L ++ + I G + D++ + P GYYS FG DFVTAPE+ Sbjct: 22 SETLATQLRDEIAAAGGWLPFDRFMERALYAPGLGYYSGGARKFGRRADDGSDFVTAPEL 81 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F + LA + A G ++E G G G + +L + L + L + Sbjct: 82 SPLFAQTLAQPVAEALAASG---TRGVMEFGAGTGKLAAGLLAALDALGAELDEYL---I 135 Query: 116 VETSERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 V+ S L Q+ + K+ W +L + G +V NE D++P++ F Sbjct: 136 VDLSGELRERQRDTITAAAPVLAAKVRWLDALPERFDG--VVVGNEVLDAMPVRLFAKAG 193 Query: 172 HGIRERMIDIDQHDSLVFNIG---DHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDR 228 G +ER + +D + VF + D G + E +++ Sbjct: 194 GGWQERGVALDAQQAFVFEDRAAAPAGLPPVLAGLDDAADGYVTETHEAALAFTRTVCTM 253 Query: 229 LACDGGTAIVIDY------GYLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHVDFQRL 281 L G +++DY Y R T + + H + P + PG D+++HV+F + Sbjct: 254 LGR--GAVLLVDYGFPAHEYYHPQRDRGTLMCHYRHHAHDDPFLYPGLQDITAHVEFTGI 311 Query: 282 SSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARK-DILLDSVKRLVSTSADKKS 340 A+ + G T+Q +FL GI ++ R ++V++L+S + Sbjct: 312 YDAAVATGADLLGYTSQARFLLNAGITDALAAIDPSDIRAFLPAANAVQKLIS----EAE 367 Query: 341 MGELFKILVVSH 352 MGELFK++ S Sbjct: 368 MGELFKVIAFSR 379 >gi|325130835|gb|EGC53568.1| hypothetical protein NMBOX9930304_0386 [Neisseria meningitidis OX99.30304] gi|325136976|gb|EGC59573.1| hypothetical protein NMBM0579_0378 [Neisseria meningitidis M0579] Length = 382 Score = 236 bits (603), Expect = 3e-60, Method: Composition-based stats. Identities = 93/376 (24%), Positives = 153/376 (40%), Gaps = 37/376 (9%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGE 61 KL I I K+G + ++ L + P++GYY+ + G GDF+TAP ++ +F + Sbjct: 16 KLQTLIAEKIGKHGNWIPFSRFMELVLYAPQYGYYTGGSHKIGNTGDFITAPTLTSLFAQ 75 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 LA L Q + E G G G + D+L I + Y++E S Sbjct: 76 TLARQLQELLSQT----AGNIYEFGAGTGQLAADLLGSISD------GISRYYIIEISPE 125 Query: 122 LTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 L QK + K+ T+L + G ++ NE D++P++ E G E Sbjct: 126 LAARQKNLIQARAPEASQKVVHLTALPEAFDG--IIIGNEVLDAMPVEIVRKDEGGSFEH 183 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG----AIFENSPCRDREMQSISDRLACDG 233 + + ++ S + S YF E P + +++++ RL Sbjct: 184 VGVCTDNGRFAYSARPLHDPSLSTSASLYFPQTDYPYTSELHPQQYAFIRTLASRLE--H 241 Query: 234 GTAIVIDY------GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAI 286 G I IDY Y R TL + H +P G ADL++HV+F ++ Sbjct: 242 GCMIFIDYGFDAAQYYHPQRNQGTLIGHYRHHVIHNPFDFIGLADLTAHVNFTDIAQAGT 301 Query: 287 LYKLYINGLTTQGKFLEGLGIWQRAFSLMK-QTARKDILLDSVKRLVSTSADKKSMGELF 345 L + G Q FL LGI + K +A +V++L+ D+ MGELF Sbjct: 302 DAGLDLIGYLPQSHFLLNLGITELLAQTGKTNSAAYIREAAAVQKLI----DQHEMGELF 357 Query: 346 KILVVSHE-KVELMPF 360 K++ ++ F Sbjct: 358 KVIAFGKNIGIDWAGF 373 >gi|296137278|ref|YP_003644520.1| protein of unknown function DUF185 [Thiomonas intermedia K12] gi|295797400|gb|ADG32190.1| protein of unknown function DUF185 [Thiomonas intermedia K12] Length = 410 Score = 236 bits (603), Expect = 3e-60, Method: Composition-based stats. Identities = 101/384 (26%), Positives = 155/384 (40%), Gaps = 44/384 (11%) Query: 2 ENKLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTCN----PFGAVGDFVTAPEIS 56 +L+ +I + G + D Y + P GYY+ FG+ DFVTAPE+S Sbjct: 19 SAQLLAQIRAALHAGGGWLPFDAYMQQALYAPGLGYYTGQAGQFGDFGSDSDFVTAPELS 78 Query: 57 QIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMV 116 +FG LA + +Q +VE G G G + IL + L + +V Sbjct: 79 PLFGRTLAAQVAQVLQQSDL---HTVVEFGAGSGRLAAQILGELDHLGC---APRHYAIV 132 Query: 117 ETSERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEH 172 E S L Q + L D++ W+T+L + ++ NE D++P+K E Sbjct: 133 EVSGALKHRQMQTLRSAVPHLFDRVQWWTALPETFE--AVVIGNEVLDAMPVKLLHRHED 190 Query: 173 GIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACD 232 G ER + + D LVF + +GA+ E P +++++DRLA Sbjct: 191 GWMERG-VVQEGDDLVFADRATALLPPAPEAHALPIGAVTEIHPQALAFVRTLADRLAR- 248 Query: 233 GGTAIVIDY------GYLQSRVGDTLQAVKGHTY-VSPLVNPGQADLSSHVDFQRLSSIA 285 G A+ IDY Y R TLQA H L+ PG AD++SH+DF ++ A Sbjct: 249 -GAALFIDYGFPQREYYHPQRHMGTLQAHYRHRVLDDVLLWPGLADITSHIDFTAIALAA 307 Query: 286 ILYKLYINGLTTQGKFLEGLGIWQR-AFSLMKQTARKD----------------ILLDSV 328 L + G T+Q FL G+ A L Sbjct: 308 QDAGLDVLGYTSQASFLMNCGLLDLVAAELAAGPPPVAPSSAAALCAATTAPGGSHYLRQ 367 Query: 329 KRLVSTSADKKSMGELFKILVVSH 352 V+ ++ MGELFK++ + Sbjct: 368 TAAVNKLLNESEMGELFKVIALGR 391 >gi|167579551|ref|ZP_02372425.1| Uncharacterized ACR, COG1565 superfamily protein [Burkholderia thailandensis TXDOH] Length = 396 Score = 236 bits (603), Expect = 3e-60, Method: Composition-based stats. Identities = 87/369 (23%), Positives = 143/369 (38%), Gaps = 30/369 (8%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGAV----GDFVTAPEI 55 L + I G + +Y + P GYYS FG DFVTAPE+ Sbjct: 22 SEALAASLRAEIAAAGGWIPFSRYMERVLYAPGMGYYSGGAQKFGRRADDGSDFVTAPEL 81 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F + LA + A + G R++E G G G ++ L + + Sbjct: 82 SPLFAQTLARPVAQALDASG---TRRVMEFGAGTG---KLAAGLLTALAALGVELDEYAI 135 Query: 116 VETSERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 V+ S L Q++ L ++ W +L + G +V NE D++P++ Sbjct: 136 VDLSGELRARQRETLGAQAPGLAARVRWLDALPERFEG--VVVGNEVLDAMPVRLVAKQA 193 Query: 172 HGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRL 229 G ER + ID + VF + D G + E ++++ L Sbjct: 194 RGWCERGVSIDDAGAFVFADRPFARAEEAARLAGIDADEGYVTETHDAAAAFVRTVCTML 253 Query: 230 ACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSI 284 A I + Y + R TL H P V PG D+++HV+F + Sbjct: 254 ARGAAFFIDYGFPSHEYYHRQRAQGTLMCHYRHRAHGDPFVYPGLQDITAHVEFSAIHEA 313 Query: 285 AILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARK-DILLDSVKRLVSTSADKKSMGE 343 + + G T+Q +FL GI + A+ ++V++L+S + MGE Sbjct: 314 GVGAGADLLGYTSQARFLLNAGITDVLAEIDPSDAQHFLPAANAVQKLIS----EAEMGE 369 Query: 344 LFKILVVSH 352 LFK++ S Sbjct: 370 LFKVIAFSR 378 >gi|146322884|ref|XP_755307.2| DUF185 domain protein [Aspergillus fumigatus Af293] gi|129558508|gb|EAL93269.2| DUF185 domain protein [Aspergillus fumigatus Af293] gi|159129388|gb|EDP54502.1| DUF185 domain protein [Aspergillus fumigatus A1163] Length = 502 Score = 236 bits (603), Expect = 3e-60, Method: Composition-based stats. Identities = 111/453 (24%), Positives = 177/453 (39%), Gaps = 103/453 (22%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-----NPFGAVGDFVTAPEIS 56 L + + N IK G + + + + PE GYY+T FG GDFVT+PEIS Sbjct: 38 STPLAKTLANAIKVTGPIPIAAFMRQVLTSPEGGYYTTRPEGGGEVFGKKGDFVTSPEIS 97 Query: 57 QIFGEMLAIFLICAWEQHGFPSC-VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 Q+FGE++ I+ I W G V+L+E+GPG+G +M D+LR K S+ +IY+ Sbjct: 98 QVFGELVGIWTITEWMAQGSKRSGVQLIEVGPGKGTLMDDMLRTFRNFKSFASSLEAIYL 157 Query: 116 VETSERLTLIQKKQLASYGD--------------------KINWYTSLADVPLGFTFLVA 155 VE S L +QK++L + L F+ A Sbjct: 158 VEASPTLREVQKQRLCGDAAMEETDIGHKSISKYFNVPVLWVEDIRLLPHEEDKTPFIFA 217 Query: 156 NEFFDSLPIKQFVMTE------------------------------HGIRERMIDIDQ-- 183 +EFFD+LPI F RE M+ ++ Sbjct: 218 HEFFDALPIHAFESIPPAPENSPEQKEIITPTGPAKLHQPMKPANTPQWREIMVTLNPKA 277 Query: 184 -----HDSLVFNIG-----------DHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISD 227 F + EI + G+ E SP + Sbjct: 278 VEDNIEGEPEFKLTLAKASTPSSLVIPEISERYRKLKSTP-GSTIEVSPESRIYASDFAR 336 Query: 228 RL---------------------ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVN 266 R+ G A+++DYG + + ++L+ ++ H V L + Sbjct: 337 RIGGSSQPPRTVGSRNSPAAQPKKIPSGAALIMDYGTMSTIPINSLRGIQHHRTVPALSS 396 Query: 267 PGQADLSSHVDFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLM---KQTARK 321 PGQ D+S+ VDF L+ AI + ++G QG FL+ +GI +R L+ + ++ Sbjct: 397 PGQVDVSADVDFMALAEAAIEASEGVEVHGPVEQGDFLQVMGIAERMQQLLRGVQDEEKR 456 Query: 322 DILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 L KRL+ MG+++K + + E Sbjct: 457 KTLESGWKRLIERGG--GGMGKIYKFMAIIPEN 487 >gi|254569706|ref|XP_002491963.1| hypothetical protein [Pichia pastoris GS115] gi|238031760|emb|CAY69683.1| Hypothetical protein PAS_chr2-2_0233 [Pichia pastoris GS115] Length = 457 Score = 236 bits (603), Expect = 3e-60, Method: Composition-based stats. Identities = 113/419 (26%), Positives = 180/419 (42%), Gaps = 67/419 (15%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNP---FGAVGDFVTAPEISQIF 59 L + + IK G + V Y C+ PEFGYY+T +P DFVT+PEISQ F Sbjct: 37 TNLSQILEFAIKTTGPIPVSSYMKQCLVHPEFGYYTTRDPLSPISETSDFVTSPEISQTF 96 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 GEM+ I+ W G P VR +E GPG+G ++ D +R +L + + +VE S Sbjct: 97 GEMIGIYHYTTWLLQGKPKEVRFIEFGPGKGTLIFDCIRTFERLSKGTV-LYEVILVEAS 155 Query: 120 ERLTLIQKKQLA--------------------SYGDKINWYTSLADVPLGFTFLVANEFF 159 L Q+K+L + + G +++A+EFF Sbjct: 156 PILREEQRKKLCGDTSLNVLEDGTWEAETLAGKRCHWVETELDIKKT--GTNYIIAHEFF 213 Query: 160 DSLPIKQFVMTEHGIRERMIDIDQHD-----------------------------SLVFN 190 D+LP++Q+ T+ G RE M+D + + + Sbjct: 214 DALPVQQYEKTKDGWREYMVDFSEKNVIRAKTDPLALPNRTTITSKELENPALRFNFHSV 273 Query: 191 IGDHEIKSNFLTCSD-----YFLGAIFENSPCRDREMQSISDRLA--CDGGTAIVIDYGY 243 + HE +++ ++ +G+ E P + I+ + C G ++IDYG Sbjct: 274 LSPHETPGSYIPKNNPRYEALPVGSRIEICPEAHTYSKHIAALINSGCGAGGCLIIDYGP 333 Query: 244 LQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLE 303 + +T++ +K H VSP +PG DLS+ VDF +L I ++G QG +L Sbjct: 334 ADTVPINTIRGIKNHKIVSPFDDPGNVDLSADVDFGQLKQIFENQSCQVHGPVAQGDWLH 393 Query: 304 GLGIWQRAFSL---MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMP 359 LG+ R L K K ++ + KRLV S MG ++K L V ++ P Sbjct: 394 ELGLGFRTDQLVHIAKSENDKQKIIKAYKRLVGKSHGD--MGSIYKFLAVLPNGSQIQP 450 >gi|330938441|ref|XP_003305738.1| hypothetical protein PTT_18657 [Pyrenophora teres f. teres 0-1] gi|311317121|gb|EFQ86168.1| hypothetical protein PTT_18657 [Pyrenophora teres f. teres 0-1] Length = 519 Score = 236 bits (602), Expect = 4e-60, Method: Composition-based stats. Identities = 117/468 (25%), Positives = 179/468 (38%), Gaps = 111/468 (23%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-----NPFGAVGDFVTAPEIS 56 L + + I G ++V Y C+ PE GYY+ + FG GDFVT+PEIS Sbjct: 48 STPLAKTLAEAITTTGPISVAAYMRQCLTHPEGGYYTRQTSSGQDQFGTKGDFVTSPEIS 107 Query: 57 QIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMV 116 Q+FGE++ I+L W G V+++E+GPGRG +M D+LR I K S+ +IY+V Sbjct: 108 QVFGELVGIWLYAEWLAQGRREKVQIIEVGPGRGTLMDDVLRTISSFKAFTKSIEAIYLV 167 Query: 117 ETSERLTLIQKKQLASYGD---------------------KINWYTSLADVPLGFTFLVA 155 E S L Q K L+ D + F++A Sbjct: 168 EASPYLQKQQAKLLSGTEDLKKNDIGFTAPCKYISGCQIQWCEDIRLVPKEDTAAPFILA 227 Query: 156 NEFFDSLPIKQFVMT---------------------------EHGIRERM---------I 179 +EFFD+LPI F ++ E + Sbjct: 228 HEFFDALPIHVFQNIANSSLPASSTIITPTGPIKPKHGVTTPKNTWHELVVSPTSPYKEP 287 Query: 180 DIDQHDSLVFNIG--DHEIKSNFLTCS--------DYFLGAIFENSPCRDREMQSISDRL 229 + + L F + + S + AI E SP + + R+ Sbjct: 288 EKPGQEKLDFELTVSKTPTPHSLYLPSLSDRYKKLENTPDAIIEISPESLAYIADFAVRI 347 Query: 230 ---------------------------ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVS 262 + G A+++DYG + +TL+ ++ HT VS Sbjct: 348 GGSNAEVSSSPTSPPSAVSMIPEPFTKSQPSGAALILDYGPSSTIPANTLRGIRSHTTVS 407 Query: 263 PLVNPGQADLSSHVDFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSL---MKQ 317 P +PG DLS+ VDF L+ A+ + ++G Q FL +GI +RA L K Sbjct: 408 PFASPGLVDLSADVDFLALADTALSASPGVEVHGPVEQSFFLSTMGIKERADRLLSAAKD 467 Query: 318 TARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE-----LMPF 360 + L KRLV MG+ +K L + K + + F Sbjct: 468 EETRSRLETGWKRLVDRG--PNGMGKTYKALAIVPYKEKGPVRRPIGF 513 >gi|67921448|ref|ZP_00514966.1| Protein of unknown function DUF185 [Crocosphaera watsonii WH 8501] gi|67856560|gb|EAM51801.1| Protein of unknown function DUF185 [Crocosphaera watsonii WH 8501] Length = 377 Score = 236 bits (602), Expect = 4e-60, Method: Composition-based stats. Identities = 96/375 (25%), Positives = 156/375 (41%), Gaps = 26/375 (6%) Query: 5 LIRKIVNLIKK--NGQMTVDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEISQIFGE 61 ++ I++ I+ N +T Y L + P+ GYYS+ G+ GDF T+ + FGE Sbjct: 1 MLEIIIDSIETSPNNCITFADYMDLVLYHPQKGYYSSAQIDIGSQGDFFTSSSLGSDFGE 60 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 +LA LVE+G G G + DIL PDF+ + +VE S+ Sbjct: 61 LLAEQFKEMSAVLNSSDSFTLVEVGAGTGSLAADILHYFKIQYPDFYQNIKYIIVEESQG 120 Query: 122 LTLIQKKQLASYGDKINWYTSLADVPLGFTF--LVANEFFDSLPIKQFVMTEHGIRERMI 179 L QK +L + +I + S D+ + +NE D+ P+ Q V+ + ++E + Sbjct: 121 LIAEQKHKLQEF--EIVTWKSWQDITDNSIVGCIFSNELIDAFPVHQIVVEDQTVKEIYL 178 Query: 180 DIDQH--DSLVFNIGDHEIKSNFL------TCSDYFLGAIFENSPCRDREMQSISDRLAC 231 + + ++ N + F T DY E + +Q +++++ Sbjct: 179 TWENNQIKEIIDNTSSPRLLEYFQLVDIDLTRDDYPENYRTEVNLKALDWLQVVTNKIKK 238 Query: 232 DGGTAIVIDY------GYLQSRVGDTLQAVKGH-TYVSPLVNPGQADLSSHVDFQRLSSI 284 G + IDY Y R TL H + +P VN G+ D+++HV+F L Sbjct: 239 --GYLLTIDYGYSASKYYHPQRYQGTLNCYYQHRHHHNPYVNLGEQDITTHVNFTALEKQ 296 Query: 285 AILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGEL 344 L L GLT QG FL LG+ R L +L L + +G Sbjct: 297 GNLLGLETVGLTQQGLFLMALGLGDRLSELSNGNYTLPEILKRRDAL-HQLINPTGLGG- 354 Query: 345 FKILVVSHEKVELMP 359 FK+L+ E + P Sbjct: 355 FKVLIQGKEIDKNKP 369 >gi|225848099|ref|YP_002728262.1| hypothetical protein SULAZ_0267 [Sulfurihydrogenibium azorense Az-Fu1] gi|225643810|gb|ACN98860.1| hypothetical protein SULAZ_0267 [Sulfurihydrogenibium azorense Az-Fu1] Length = 384 Score = 236 bits (602), Expect = 4e-60, Method: Composition-based stats. Identities = 96/365 (26%), Positives = 151/365 (41%), Gaps = 13/365 (3%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYS-TCNPFGAVGDFVTAPEISQIFG 60 + +LI ++N IKK G ++ + + P GYY+ G GDF T+ E+ +FG Sbjct: 6 KKELIDIVLNDIKKRGGISFKDFMDYALYYPSLGYYTCDKEKIGGYGDFFTSSELDPVFG 65 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 ++LA + + ++LVELG G+G++ DIL I P+F+ L VE S Sbjct: 66 QLLAKQFNEIYLNYFKGKKIKLVELGSGKGVLAFDILNEIKTNYPEFYENLEFISVEKSP 125 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 Q+K L + + W S+ D+ + +NE FD+LP+ I E ++ Sbjct: 126 FHIQHQQKVLNGF--NVKWLESIEDLEDIEGIVYSNELFDALPVHLIKKKNGKIYEIYLN 183 Query: 181 IDQHD--SLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 + + I + + D G E + +Q+I +L + Sbjct: 184 EKDGEIVEELREISEDVLTYIKELKIDIPEGMTTEVNLLAKDLIQTIGQKLKKGFVFTVD 243 Query: 239 IDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSIAILYKLYIN 293 Y Y R+ TL HTY + N G D++SHV+F L L Sbjct: 244 YGYPSKELYKPYRMKGTLLCYYKHTYNENFYENIGFQDITSHVNFSALVYYGKKAGLEFT 303 Query: 294 GLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHE 353 G T Q FL LG+ L ++ K + + RL T K MGE FKIL+ Sbjct: 304 GFTDQAHFLINLGLGDIMIQLQEKGDSKS--FERINRL-KTLILPKGMGEKFKILIQHKN 360 Query: 354 KVELM 358 + Sbjct: 361 IKNPI 365 >gi|167568476|ref|ZP_02361350.1| Uncharacterized ACR, COG1565 superfamily protein [Burkholderia oklahomensis C6786] Length = 396 Score = 236 bits (602), Expect = 4e-60, Method: Composition-based stats. Identities = 88/369 (23%), Positives = 144/369 (39%), Gaps = 30/369 (8%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGAV----GDFVTAPEI 55 L + I G + +Y + P GYYS FG DFVTAPE+ Sbjct: 22 SESLAASLRAEIAAAGGWIPFSRYMERALYAPGAGYYSGGAQKFGRRAEDGSDFVTAPEL 81 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F + LA + A E G ++E G G G ++ L F + + Sbjct: 82 SPLFAQTLARPVAQALEASG---TRCVMEFGAGTG---KLAAGLLNALAALGFELDEYAI 135 Query: 116 VETSERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 V+ S L Q++ L ++ W +L + G +V NE D++P++ V Sbjct: 136 VDLSGELRARQRETLEAEAPGLAARVRWLDALPERFEG--VVVGNEVLDAMPVRLVVKHA 193 Query: 172 HGIRERMIDIDQHDSLVFNIGDHEIKSNFLTC--SDYFLGAIFENSPCRDREMQSISDRL 229 G RER + ID + F + D G + E ++++ L Sbjct: 194 DGWRERGVAIDDAGAFAFADRPLARAEDAARLVEIDADEGYVTETHDAAAAFVRTVCTML 253 Query: 230 ACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSI 284 A I + Y + R TL H P + PG D++SHV+F + Sbjct: 254 ARGAAFFIDYGFPSHEYYHRQRAQGTLMCHYRHRAHGDPFLYPGLQDITSHVEFSGIYEA 313 Query: 285 AILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARK-DILLDSVKRLVSTSADKKSMGE 343 + + G T+Q +FL G+ + A+ ++V++L+S + MGE Sbjct: 314 GVGAGADLLGYTSQARFLLNAGVTDVLAEIDPSDAQHFLPAANAVQKLIS----EAEMGE 369 Query: 344 LFKILVVSH 352 LFK++ S Sbjct: 370 LFKVIAFSR 378 >gi|221199785|ref|ZP_03572828.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] gi|221207546|ref|ZP_03580555.1| conserved hypothetical protein [Burkholderia multivorans CGD2] gi|221172749|gb|EEE05187.1| conserved hypothetical protein [Burkholderia multivorans CGD2] gi|221180024|gb|EEE12428.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] Length = 396 Score = 236 bits (602), Expect = 4e-60, Method: Composition-based stats. Identities = 88/371 (23%), Positives = 151/371 (40%), Gaps = 34/371 (9%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGAV----GDFVTAPEI 55 L ++ I G + ++ + P GYYS FG DFVTAPE+ Sbjct: 22 SETLAAQLRAEIAAAGGWLPFSRFMERALYAPGLGYYSGGARKFGRRADDGSDFVTAPEL 81 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F + LA + A G R++E G G G + +L + L L + Sbjct: 82 SPLFAQTLAQPVADALAASG---TRRVMEFGAGTGKLAAGLLAALDALGATLDEYL---I 135 Query: 116 VETSERLTLIQKKQLASYGDK----INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 V+ S L Q+ +A+ + W +L + G ++ NE D++P++ F Sbjct: 136 VDLSGELRARQRDTIAAAAPALAAKVRWLDALPERFEG--VVIGNEVLDAMPVRLFAKAG 193 Query: 172 HGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRL 229 RER + +D + VF+ + D G + E +++ L Sbjct: 194 GAWRERGVALDAQHAFVFDDRAVAPADVPPALAGLDVDDGYVTETHEAALAFTRTVCTML 253 Query: 230 ACDGGTAIVIDY------GYLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHVDFQRLS 282 A G ++IDY Y R T + + H + P + PG D+++HV+F + Sbjct: 254 AR--GAVLLIDYGFPAHEYYHPQRDRGTLMCHYRHHAHDDPFLYPGLQDITAHVEFTGIY 311 Query: 283 SIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQT-ARKDILLDSVKRLVSTSADKKSM 341 I + G T+Q +FL GI ++ + ++V++L+S + M Sbjct: 312 EAGIAAGADLLGYTSQARFLLNAGITDALAAIDPSDVTQFLPAANAVQKLIS----EAEM 367 Query: 342 GELFKILVVSH 352 GELFK++ S Sbjct: 368 GELFKVIAFSR 378 >gi|121715340|ref|XP_001275279.1| DUF185 domain protein [Aspergillus clavatus NRRL 1] gi|119403436|gb|EAW13853.1| DUF185 domain protein [Aspergillus clavatus NRRL 1] Length = 501 Score = 236 bits (602), Expect = 4e-60, Method: Composition-based stats. Identities = 114/452 (25%), Positives = 178/452 (39%), Gaps = 101/452 (22%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-----NPFGAVGDFVTAPEIS 56 L + + N IK G + + + + PE GYY+T FG GDFVT+PEIS Sbjct: 37 STPLAKTLANAIKITGPIPISAFMRQVLTSPEGGYYTTRPEGGGEVFGKKGDFVTSPEIS 96 Query: 57 QIFGEMLAIFLICAWEQHGFPSC-VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 Q+FGE++A++ I W G V+L+E+GPG+G +M D+LR K S+ +IY+ Sbjct: 97 QVFGELVAVWTITEWMAQGRKRSGVQLIEVGPGKGTLMDDMLRTFQNFKSFSSSIEAIYL 156 Query: 116 VETSERLTLIQKKQLASYGD--------------------KINWYTSLADVPLGFTFLVA 155 VE S L +QK++L + L F+ A Sbjct: 157 VEASPTLREVQKQRLCGDAPMEETDIGHRSTSKYFNVPVIWVEDIRLLPHEEGTTPFIFA 216 Query: 156 NEFFDSLPIKQFVMTE------------------------------HGIRERMIDIDQHD 185 +EFFD+LPI F RE M+ ++ Sbjct: 217 HEFFDALPIHAFESVPPAPESQTEQSEIMTPTGPAKLHQPMKPANTPQWREIMVTLNPEA 276 Query: 186 SLVFNIGDHEIKSNFLTCS-----------------DYFLGAIFENSPCRDREMQSISDR 228 G+ E K S G+ E SP + R Sbjct: 277 VEENKEGEPEFKLTLAKASTPSSLVIPEISERYRKLKSQPGSTIEISPESRVYASDFARR 336 Query: 229 L---------------------ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNP 267 + G A+++DYG + + ++L+ ++ H V L +P Sbjct: 337 IGGSSQPPRTVNRQDAGPVQPKRVPSGAALIMDYGTMSTIPVNSLRGIQNHRNVPALSSP 396 Query: 268 GQADLSSHVDFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLM---KQTARKD 322 GQ D+S+ VDF L+ AI + ++G QG FL+ +GI +R L+ K ++ Sbjct: 397 GQVDVSADVDFIALAEAAIDASEGVEVHGPVEQGDFLQVMGIAERMQQLLKGIKDEEKRK 456 Query: 323 ILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 L KRLV MG+++K + + E Sbjct: 457 TLESGWKRLVERGG--GGMGKIYKFMAIIPEN 486 >gi|167617645|ref|ZP_02386276.1| Uncharacterized ACR, COG1565 superfamily protein [Burkholderia thailandensis Bt4] gi|257140492|ref|ZP_05588754.1| hypothetical protein BthaA_14990 [Burkholderia thailandensis E264] Length = 396 Score = 236 bits (602), Expect = 4e-60, Method: Composition-based stats. Identities = 87/369 (23%), Positives = 143/369 (38%), Gaps = 30/369 (8%) Query: 2 ENKLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTC-NPFGAV----GDFVTAPEI 55 L + I G + +Y + P GYYS FG DFVTAPE+ Sbjct: 22 SEALAASLRAEIASAGGWIPFSRYMERVLYAPGMGYYSGGAQKFGRRADDGSDFVTAPEL 81 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F + LA + A + G R++E G G G ++ L + + Sbjct: 82 SPLFAQTLARPVAQALDASG---TRRVMEFGAGTG---KLAAGLLTALAALGVELDEYAI 135 Query: 116 VETSERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 V+ S L Q++ L ++ W +L + G +V NE D++P++ Sbjct: 136 VDLSGELRARQRETLGAQAPGLAARVRWLDALPERFEG--VVVGNEVLDAMPVRLVAKQA 193 Query: 172 HGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRL 229 G ER + ID + VF + D G + E ++++ L Sbjct: 194 RGWCERGVSIDDAGAFVFADRPFARAEEAARLAGIDADEGYVTETHDAAVAFVRTVCAML 253 Query: 230 ACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSI 284 A I + Y + R TL H P V PG D+++HV+F + Sbjct: 254 ARGAAFFIDYGFPSHEYYHRQRAQGTLMCHYRHRAHGDPFVYPGLQDITAHVEFSAIHEA 313 Query: 285 AILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARK-DILLDSVKRLVSTSADKKSMGE 343 + + G T+Q +FL GI + A+ ++V++L+S + MGE Sbjct: 314 GVGAGADLLGYTSQARFLLNAGITDVLAEIDPSDAQHFLPAANAVQKLIS----EAEMGE 369 Query: 344 LFKILVVSH 352 LFK++ S Sbjct: 370 LFKVIAFSR 378 >gi|83719366|ref|YP_440852.1| hypothetical protein BTH_I0294 [Burkholderia thailandensis E264] gi|83653191|gb|ABC37254.1| Uncharacterized ACR, COG1565 superfamily [Burkholderia thailandensis E264] Length = 410 Score = 236 bits (602), Expect = 4e-60, Method: Composition-based stats. Identities = 87/369 (23%), Positives = 143/369 (38%), Gaps = 30/369 (8%) Query: 2 ENKLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTC-NPFGAV----GDFVTAPEI 55 L + I G + +Y + P GYYS FG DFVTAPE+ Sbjct: 36 SEALAASLRAEIASAGGWIPFSRYMERVLYAPGMGYYSGGAQKFGRRADDGSDFVTAPEL 95 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F + LA + A + G R++E G G G ++ L + + Sbjct: 96 SPLFAQTLARPVAQALDASG---TRRVMEFGAGTG---KLAAGLLTALAALGVELDEYAI 149 Query: 116 VETSERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 V+ S L Q++ L ++ W +L + G +V NE D++P++ Sbjct: 150 VDLSGELRARQRETLGAQAPGLAARVRWLDALPERFEG--VVVGNEVLDAMPVRLVAKQA 207 Query: 172 HGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRL 229 G ER + ID + VF + D G + E ++++ L Sbjct: 208 RGWCERGVSIDDAGAFVFADRPFARAEEAARLAGIDADEGYVTETHDAAVAFVRTVCAML 267 Query: 230 ACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSI 284 A I + Y + R TL H P V PG D+++HV+F + Sbjct: 268 ARGAAFFIDYGFPSHEYYHRQRAQGTLMCHYRHRAHGDPFVYPGLQDITAHVEFSAIHEA 327 Query: 285 AILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARK-DILLDSVKRLVSTSADKKSMGE 343 + + G T+Q +FL GI + A+ ++V++L+S + MGE Sbjct: 328 GVGAGADLLGYTSQARFLLNAGITDVLAEIDPSDAQHFLPAANAVQKLIS----EAEMGE 383 Query: 344 LFKILVVSH 352 LFK++ S Sbjct: 384 LFKVIAFSR 392 >gi|221211001|ref|ZP_03583980.1| conserved hypothetical protein [Burkholderia multivorans CGD1] gi|221168362|gb|EEE00830.1| conserved hypothetical protein [Burkholderia multivorans CGD1] Length = 396 Score = 236 bits (602), Expect = 4e-60, Method: Composition-based stats. Identities = 87/371 (23%), Positives = 151/371 (40%), Gaps = 34/371 (9%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGAV----GDFVTAPEI 55 L ++ I G + ++ + P GYYS FG DFVTAPE+ Sbjct: 22 SETLAAQLRAEIAAAGGWLPFSRFMERALYAPGLGYYSGGARKFGRRADDGSDFVTAPEL 81 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F + LA + A G R++E G G G + +L + L L + Sbjct: 82 SPLFAQTLAQPVADALAASG---TRRVMEFGAGTGKLAAGLLAALDALGAALDEYL---I 135 Query: 116 VETSERLTLIQKKQLASYGDK----INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 V+ S L Q+ +A+ + W +L + G ++ NE D++P++ F Sbjct: 136 VDLSGELRARQRDTIAAAAPALAAKVRWLDALPERFEG--VVIGNEVLDAMPVRLFAKAG 193 Query: 172 HGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRL 229 RER + +D + VF+ + D G + E +++ L Sbjct: 194 DAWRERGVALDAQQAFVFDDRAVAPADVPPALAGLDVDDGYVTETHEAALAFTRTVCTML 253 Query: 230 ACDGGTAIVIDY------GYLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHVDFQRLS 282 A G ++IDY Y R T + + H + P + PG D+++HV+F + Sbjct: 254 AR--GAVLLIDYGFPAHEYYHPQRDRGTLMCHYRHHAHDDPFLYPGLQDITAHVEFTGIY 311 Query: 283 SIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQT-ARKDILLDSVKRLVSTSADKKSM 341 + + G T+Q +FL GI ++ + ++V++L+S + M Sbjct: 312 EAGVAAGADLLGYTSQARFLLNAGITDALAAIDPSDVTQFLPAANAVQKLIS----EAEM 367 Query: 342 GELFKILVVSH 352 GELFK++ S Sbjct: 368 GELFKVIAFSR 378 >gi|332976197|gb|EGK13061.1| hypothetical protein HMPREF9374_1053 [Desmospora sp. 8437] Length = 374 Score = 236 bits (601), Expect = 5e-60, Method: Composition-based stats. Identities = 86/369 (23%), Positives = 141/369 (38%), Gaps = 26/369 (7%) Query: 1 MENKLIRKIVNLIKKN--GQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQ 57 M++ L+ IV I+ + +++ +Y + P +GYY G GDF T+P + Sbjct: 1 MDDPLLAVIVEEIQAHPEKRISFRRYMEQALYHPRWGYYRREGLKIGKRGDFYTSPHLGD 60 Query: 58 IFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVE 117 +FGE L + LVE G G G + +IL + + K S +++VE Sbjct: 61 VFGETLGRVISGMVSSFSPGCPWTLVEAGGGDGRLAGNILSSLEEGKRLPQS---LWLVE 117 Query: 118 TSERLTLIQKKQLASYGDKINWYTSLAD-VPLGFTFLVANEFFDSLPIKQFVMTEHGIRE 176 TS +Q ++L ++W ++ + P L +NE D+ P+ + E + E Sbjct: 118 TSPFHRELQSERLRDAPVPVHWAEAVTEIPPDSPCILFSNELLDAFPVHRVTRKEGELLE 177 Query: 177 RMIDIDQHDSLVFNIGDHEIKSN------FLTCSDYFLGAIFENSPCRDREMQSISDRLA 230 + D + D G E ++ I + L Sbjct: 178 IHVA-WDEDRERLVECSRPLSHPSLAAYFHRLDWDLPEGWTAEVPLDALAWLEEIGEWL- 235 Query: 231 CDGGTAIVIDYGYL------QSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSS 283 G I IDYG R TL+ + H +PG +DL+SHV F L Sbjct: 236 -KNGYLITIDYGGTTEELSLPQRKDGTLRCFRNHQLHVDCYSHPGDSDLTSHVHFSALMD 294 Query: 284 IAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGE 343 + + L TTQ +FL+ GI R + R + A MG+ Sbjct: 295 LGEVMGLQNLVYTTQTRFLQAAGILDRLAVAPADPFSPE---ARRNRAIRQLALAGGMGD 351 Query: 344 LFKILVVSH 352 F++L+ S Sbjct: 352 SFRVLIQSK 360 >gi|86609776|ref|YP_478538.1| hypothetical protein CYB_2336 [Synechococcus sp. JA-2-3B'a(2-13)] gi|86558318|gb|ABD03275.1| conserved hypothetical protein [Synechococcus sp. JA-2-3B'a(2-13)] Length = 416 Score = 236 bits (601), Expect = 5e-60, Method: Composition-based stats. Identities = 92/386 (23%), Positives = 158/386 (40%), Gaps = 36/386 (9%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGE 61 KL IV+ I+ G +T Q+ + +P GYY +P G D++T+P ++ F + Sbjct: 21 TQKLRELIVSRIRAQGPVTFAQFMEWALYEPGLGYYEQGSPIGP--DYLTSPHLAADFAQ 78 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKL--------KPDFFSVLSI 113 +LA ++ W G P +++E+G G G + D L + +P F+ L Sbjct: 79 LLAEQILQFWHILGSPPDFKVIEMGAGSGRLAQDWLTYVRSARPAARLREQPSFWQALDY 138 Query: 114 YMVETSERLTLIQKKQLASYGDKINWYTSLADVPLG-FTFLVANEFFDSLPIKQFVMTEH 172 ++E S L +Q+++LA +G+K+ W +NE D+ P+ + + + Sbjct: 139 GILERSAHLRRLQQERLAPFGEKVRWLDWDGIPDESVTGCFFSNELVDAFPVHRVQVQDG 198 Query: 173 GIRERMIDIDQHDSLVF-----NIGDHEIKSNFLTC----SDYFLGAIFENSPCRDREMQ 223 +RE +D + F ++ E++ F Y G E + +Q Sbjct: 199 ALREIYVDCSEAAEADFREVLGDLSTPELREYFARLGIPVETYPSGYQTEVNLKALEWLQ 258 Query: 224 SISDRLACDGGTAIVIDYGYLQSRV------GDTLQAVKGHTYV-SPLVNPGQADLSSHV 276 ++ +L G + IDYG+ R TL A + H P V G DL++HV Sbjct: 259 LLARKLRR--GYVLTIDYGHTAQRYYSPHRAQGTLLAYRQHGSYTDPYVRVGSQDLTAHV 316 Query: 277 DFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILL------DSVKR 330 DF L + L G T Q FL LG+ +R +L + D + Sbjct: 317 DFTTLEQVGESLGLRCLGFTRQSSFLVNLGLTERLAALSQMGNPVDGEAVDVGQVLRRRS 376 Query: 331 LVSTSADKKSMGELFKILVVSHEKVE 356 + D +G F +LV + Sbjct: 377 ALHALLDPMGLGG-FGVLVQAKGLTP 401 >gi|300692832|ref|YP_003753827.1| hypothetical protein RPSI07_3218 [Ralstonia solanacearum PSI07] gi|299079892|emb|CBJ52570.1| conserved protein of unknown function [Ralstonia solanacearum PSI07] Length = 397 Score = 236 bits (601), Expect = 5e-60, Method: Composition-based stats. Identities = 90/380 (23%), Positives = 149/380 (39%), Gaps = 27/380 (7%) Query: 2 ENKLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAV----GDFVTAPEI 55 ++L IV I G M ++Y L + P GYYS FG GDF+TAPE+ Sbjct: 18 SDRLFSTIVRAIEAAGGWMPFERYMELALYAPGLGYYSGGAAKFGRRVEDGGDFITAPEL 77 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 + FG +A + P +VE G G G + DIL + L S + Sbjct: 78 TPFFGRTVAHQIAQVLRTL-PPGQRHVVEFGAGTGKLAADILTELETLGMRPDS---YGI 133 Query: 116 VETSERLTLIQKKQLASYGDKINWYTSLAD--VPLGFTFLVANEFFDSLPIKQFVMTEHG 173 +E S L Q++ LA+ G + D +V NE D++P+ + Sbjct: 134 IELSGELRQRQQQTLAALGPDLAGLARWHDTLPARFTGVMVGNEVLDAMPVSLWARRGGV 193 Query: 174 IRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYF--LGAIFENSPCRDREMQSISDRLAC 231 R + D + L ++ + + I E+ + ++S L Sbjct: 194 WHRRGVAFDANQGLRWSERAADPAEVPPKLAALPGRDDFITESHEAAEGFIRSTGAALER 253 Query: 232 DGGTAI-----VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAI 286 I +Y + G + + H + P PG D+++HVDF ++ A Sbjct: 254 GLLLLIDYGFPAAEYYHAHRANGTLMCHYRQHAHDDPFWLPGLQDITAHVDFSGIAQAAR 313 Query: 287 LYKLYINGLTTQGKFLEGLGIWQRAFSLM-KQTARKDILLDSVKRLVSTSADKKSMGELF 345 L + G +Q +FL G G+ Q +L R ++V++L+S + MGELF Sbjct: 314 EAGLEVLGYASQARFLLGAGVGQLLMTLDPADPVRFLPAANAVQKLLS----EAEMGELF 369 Query: 346 KILVVSH---EKVELMPFVN 362 K + + + L F + Sbjct: 370 KAIALGRGIDAALPLAGFAD 389 >gi|254805559|ref|YP_003083780.1| hypothetical protein NMO_1631 [Neisseria meningitidis alpha14] gi|254669101|emb|CBA07679.1| conserved hypothetical protein [Neisseria meningitidis alpha14] Length = 405 Score = 236 bits (601), Expect = 5e-60, Method: Composition-based stats. Identities = 95/376 (25%), Positives = 154/376 (40%), Gaps = 37/376 (9%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGE 61 KL I I K+G + ++ L + P++GYY+ + G GDF+TAP ++ +F + Sbjct: 39 KLQTLIAEKIGKHGNWIPFSRFMELVLYAPQYGYYTGGSHKIGNTGDFITAPTLTSLFAQ 98 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 LA L Q + E G G G + D+L I + Y++E S Sbjct: 99 TLARQLQELLSQT----AGNIYEFGAGTGQLAADLLGSISD------GISRYYIIEISPE 148 Query: 122 LTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 L QK + K+ T+L + G ++ NE D++P++ E G E Sbjct: 149 LAARQKNLIQARAPEASQKVVHLTALPEAFDG--IIIGNEVLDAIPVEIVRKDEGGSFEH 206 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG----AIFENSPCRDREMQSISDRLACDG 233 + + ++ S + S YF E P + +++++ RL Sbjct: 207 VGVCTDNGRFAYSARPLHDPSLSTSASLYFPQTDYPYTSELHPQQYAFIRTLASRLE--H 264 Query: 234 GTAIVIDY------GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAI 286 G I IDY Y R TL + H +P G ADL++HV+F ++ Sbjct: 265 GCMIFIDYGFDAAQYYHPQRNQGTLIGHYRHHVIHNPFDFIGLADLTAHVNFTDIAQAGT 324 Query: 287 LYKLYINGLTTQGKFLEGLGIWQRAFSLMK-QTARKDILLDSVKRLVSTSADKKSMGELF 345 L + G Q FL LGI + K +A +V++L+ D+ MGELF Sbjct: 325 DAGLDLIGYLPQSHFLLNLGITELLAQTGKTNSAAYIREAAAVQKLI----DQHEMGELF 380 Query: 346 KILVVSHE-KVELMPF 360 K++V V+ F Sbjct: 381 KVIVFGKNIGVDWAGF 396 >gi|226939216|ref|YP_002794287.1| hypothetical protein LHK_00283 [Laribacter hongkongensis HLHK9] gi|226714140|gb|ACO73278.1| Uncharacterized conserved protein [Laribacter hongkongensis HLHK9] Length = 383 Score = 236 bits (601), Expect = 5e-60, Method: Composition-based stats. Identities = 80/367 (21%), Positives = 132/367 (35%), Gaps = 28/367 (7%) Query: 1 MENKLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQI 58 + +L I I ++ G + +Y L + P GYY+ + G GDFVTAPE++ + Sbjct: 12 ISGRLADLIRAEIGEQQGFIPFSRYMELALYAPGLGYYTAGSHKLGEGGDFVTAPELTPL 71 Query: 59 FGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLK--PDFFSVLSIYMV 116 F LA L G + E G G GI+ D+L + +L P + ++ Sbjct: 72 FARTLARQLAQLLPLTGGT----VYEFGAGSGILAADLLDALRQLDVLPVRYRIM----- 122 Query: 117 ETSERLTLIQKKQLASYGDKINWYTSLADVP--LGFTFLVANEFFDSLPIKQFVMTEHGI 174 E S L Q+ LA+ + + ++ NE D++P + T G Sbjct: 123 ELSPDLRARQQALLAARHPDLLERIDWLEQWPEQFDGVVLGNEVLDAMPCELVTRTAEGE 182 Query: 175 RERMIDIDQHDSLVFNIGDHEIKSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLAC 231 + + + E + + ++ L Sbjct: 183 LMQTGVGCAGPDWQWQQRPVTEPVLAQAAAGRLPARGPYTSEIGLAAEAFVATLGRHLQR 242 Query: 232 DGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSIAI 286 + + Y R TL H P PG D++ HVDF ++ + Sbjct: 243 GAVILLDYGFPAHEFYHPQRAQGTLMCHYQHLAHTDPFQWPGLTDITCHVDFSAIAEAGL 302 Query: 287 LYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARK-DILLDSVKRLVSTSADKKSMGELF 345 L + G TTQ FL G+ L R + +V++LVS MGELF Sbjct: 303 NAGLELAGYTTQANFLLNCGLTDCLAELDPDDVRTYLPQVHAVQKLVS----PAEMGELF 358 Query: 346 KILVVSH 352 K++ + Sbjct: 359 KVIGFAK 365 >gi|237747069|ref|ZP_04577549.1| conserved hypothetical protein [Oxalobacter formigenes HOxBLS] gi|229378420|gb|EEO28511.1| conserved hypothetical protein [Oxalobacter formigenes HOxBLS] Length = 384 Score = 236 bits (601), Expect = 5e-60, Method: Composition-based stats. Identities = 97/369 (26%), Positives = 159/369 (43%), Gaps = 27/369 (7%) Query: 4 KLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEISQIFGE 61 L +IV I+ +G ++ Y + +PE+GYYS FG GDFVTAPE S ++G Sbjct: 16 ALKNRIVARIESRSGWISFADYMQQVLYEPEYGYYSGGAANFGGQGDFVTAPETSPLYGR 75 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 +A LI EQ +++E+G G G + DIL + SV ++E S Sbjct: 76 AMAHALIPLIEQT----RPQILEIGAGTGRLAHDILAELA---SKGISVDCYDILELSSE 128 Query: 122 LTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 L Q+ L++ +NW ++L + G ++ANE D++P++ V + G +E + Sbjct: 129 LRERQQTSLSA-CPHVNWLSALPERFDG--VVIANEVLDAMPVQLVVKRKTGWQELGVCH 185 Query: 182 DQHD-----SLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLAC-DGGT 235 D I G + E ++++++ LA G Sbjct: 186 LNGDFVLSERPCDTFLADAITRQIPDSDSLPEGYVTEIHTHACGFVRTLAEMLASGSGAA 245 Query: 236 AIVIDY------GYLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSIAILY 288 A+ +DY Y + R TL H P PG D+++HVDF L+ IA Sbjct: 246 AVFVDYGFPAHEYYHRDRSSGTLMCHYRHRLHTDPFFLPGLQDMTAHVDFTALARIAEAG 305 Query: 289 KLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKIL 348 L + +Q FL G G+ + + + + L S + V MGELFK++ Sbjct: 306 GLDLLCYASQANFLIGAGLMELMQGVSAEMDARQYSLQS--QAVQKLLSPAEMGELFKVM 363 Query: 349 VVSHEKVEL 357 ++ H + Sbjct: 364 ILGHNVIPP 372 >gi|189204400|ref|XP_001938535.1| conserved hypothetical protein [Pyrenophora tritici-repentis Pt-1C-BFP] gi|187985634|gb|EDU51122.1| conserved hypothetical protein [Pyrenophora tritici-repentis Pt-1C-BFP] Length = 519 Score = 236 bits (601), Expect = 5e-60, Method: Composition-based stats. Identities = 113/468 (24%), Positives = 174/468 (37%), Gaps = 111/468 (23%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-----NPFGAVGDFVTAPEIS 56 L + + I G ++V Y C+ PE GYY+ + FG GDFVT+PEIS Sbjct: 48 STPLAKTLAEAITTTGPISVAAYMRQCLTHPEGGYYTRQTSSGQDQFGTKGDFVTSPEIS 107 Query: 57 QIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMV 116 Q+FGE++ I+L W G V+++E+GPGRG +M D+LR I K S+ +IY++ Sbjct: 108 QVFGELVGIWLYAEWLAQGRREKVQIIEVGPGRGTLMDDVLRTISSFKGFTKSIEAIYLI 167 Query: 117 ETSERLTLIQKKQLASYGD---------------------KINWYTSLADVPLGFTFLVA 155 E S L Q K L+ D + F++A Sbjct: 168 EASPYLQKQQAKLLSGTEDLKKNDIGFAAPCKYIPGCQIQWCEDIRLVLKENTASPFILA 227 Query: 156 NEFFDSLPIKQFVMT---------------------------EHGIRERMIDIDQHDSLV 188 +EFFD+LPI F ++ E ++ Sbjct: 228 HEFFDALPIHVFQNIANSSLPASSTIITPTGPIKPKHGVTTPKNTWHELVVSPTSPYKQP 287 Query: 189 FNI-----------GDHEIKSNFLTCS--------DYFLGAIFENSPCRDREMQSISDRL 229 + + + A+ E SP + + R+ Sbjct: 288 EKPGQEKLDFELTVSKTPTPHSLYLPNLSDRYKKLENTPDAVIEISPESLAYIADFAVRI 347 Query: 230 ---------------------------ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVS 262 + G A+++DYG + +TL+ ++ HT VS Sbjct: 348 GGSNAKLSSSAASLPPSVSMIPEPFTKSQPSGAALILDYGPSSTIPANTLRGIRSHTTVS 407 Query: 263 PLVNPGQADLSSHVDFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSL---MKQ 317 P PG DLS+ VDF L+ A+ + ++G Q FL +GI +RA L K Sbjct: 408 PFALPGLVDLSADVDFLALADTALSASPGVEVHGPVEQSFFLSTMGIKERADRLLSAAKD 467 Query: 318 TARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE-----LMPF 360 K L KRLV MG+ +K L + K + + F Sbjct: 468 EETKRRLETGWKRLVDRG--PNGMGKTYKALAIVPYKEKGPVRRPIGF 513 >gi|329118758|ref|ZP_08247456.1| protein of hypothetical function DUF185 [Neisseria bacilliformis ATCC BAA-1200] gi|327465105|gb|EGF11392.1| protein of hypothetical function DUF185 [Neisseria bacilliformis ATCC BAA-1200] Length = 384 Score = 236 bits (601), Expect = 6e-60, Method: Composition-based stats. Identities = 84/364 (23%), Positives = 143/364 (39%), Gaps = 25/364 (6%) Query: 2 ENKLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIF 59 KL + I N IK + + ++ L + P++GYY+ + GA GDFVTAP ++ +F Sbjct: 17 SQKLEQIIQNEIKASSAPIPFSRFMELALYAPQYGYYTGGAHKIGAAGDFVTAPALTPLF 76 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 G+ LA + Q + E G G G + +L + D + +++E S Sbjct: 77 GQTLARQIGELLPQT----AGGICEFGAGTGELAATLLDSLRGTALDAY-----FIIEVS 127 Query: 120 ERLTLIQKKQLASY-GDKINWYTSLADVPLGFT-FLVANEFFDSLPIKQFVMTEHGIRER 177 L Q+ +A+ D + LA +P F ++ NE D++P + Sbjct: 128 PELAERQRAHIAARVPDMAHKVRHLAALPAAFDGIIIGNEVLDAMPCELVRREGGRFWRI 187 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG---AIFENSPCRDREMQSISDRLACDGG 234 + ++ + V YF E P + +++++D L Sbjct: 188 CVG-TENGAFVQIPRSLADPQLLRLAQQYFPDTEPYTAELHPVQYAFVRTLADHLQRGAI 246 Query: 235 TAI-----VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYK 289 I Y + Q +G + + HT P G DL++HV+F + Sbjct: 247 ILIDYGFDAAQYYHPQRHMGTLIGHYRHHTVHDPFFRVGLTDLTAHVNFTDTAQAGTDGG 306 Query: 290 LYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILV 349 L + G TTQ FL LGI + A + + + I S + MGELFK++ Sbjct: 307 LDLIGYTTQADFLLNLGITELAGNGLPHDSPAYIQTAS---ALHKLLMPHEMGELFKVIT 363 Query: 350 VSHE 353 Sbjct: 364 FGRN 367 >gi|326485066|gb|EGE09076.1| DUF185 domain-containing protein [Trichophyton equinum CBS 127.97] Length = 501 Score = 236 bits (601), Expect = 6e-60, Method: Composition-based stats. Identities = 115/446 (25%), Positives = 177/446 (39%), Gaps = 96/446 (21%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-----NPFGAVGDFVTAPEIS 56 L ++I + I G +++ + C+ E GYY++ + FG GDFVT+PEIS Sbjct: 42 STPLAKRITDAINTTGPISIAAFMRQCLTSDEGGYYTSRGTPGSDVFGKEGDFVTSPEIS 101 Query: 57 QIFGEMLAIFLICAWEQHGFPSC-VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 Q+FGE+L I+++ W G S V+L+E GPG+G +M DILR + K SV +YM Sbjct: 102 QMFGELLGIWIVTEWLSQGRRSSGVQLMEFGPGKGTLMADILRSVRNFKGFASSVEGVYM 161 Query: 116 VETSERLTLIQKKQLASYGD--------------------KINWYTSLADVPLGFTFLVA 155 +E S L IQKK L L F++A Sbjct: 162 IEASPTLREIQKKALCGDAPMEECDIGYKSISSHLGVPVYWTEHIRILPQTEDKAPFIIA 221 Query: 156 NEFFDSLPIKQFVMTEHGIRE-----------RMIDIDQHDSLVFNI------------- 191 +EFFD+LPI F E R + + + + Sbjct: 222 HEFFDALPIHAFQAVHSPPPETINTPTGPAELRQPSLPLNGTQWRELVVATNPEAEREPD 281 Query: 192 -------GDHEIKSNFLTCS-------------------DYFLGAIFENSPCRDREMQSI 225 D +++ G+ E SP Q I Sbjct: 282 GDGNSDKNDKKLEFRLALAKSPTPASLVMPEMSPRYKALKSTRGSTIEISPESHTYAQEI 341 Query: 226 SDRL-------------ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADL 272 + + G A+++DYG + ++L+ +K H VSP PG+ DL Sbjct: 342 ARLIGGPNPIDKNPSPTRTPAGAALILDYGPSSTIPVNSLRGIKNHEVVSPFATPGEVDL 401 Query: 273 SSHVDFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLM---KQTARKDILLDS 327 S+ VDF L+ A+ + + G QG FL LGI +RA L+ K ++ + S Sbjct: 402 SADVDFTGLAESALDASPGVEVYGPNEQGSFLRSLGIAERAAQLLRNVKDEEKRKQIESS 461 Query: 328 VKRLVSTSADKKSMGELFKILVVSHE 353 +RLV MG ++K + + E Sbjct: 462 WQRLVERGG--GGMGRIYKAMAIVPE 485 >gi|167561257|ref|ZP_02354173.1| Uncharacterized ACR, COG1565 superfamily protein [Burkholderia oklahomensis EO147] Length = 506 Score = 236 bits (601), Expect = 6e-60, Method: Composition-based stats. Identities = 88/369 (23%), Positives = 144/369 (39%), Gaps = 30/369 (8%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGAV----GDFVTAPEI 55 L + I G + +Y + P GYYS FG DFVTAPE+ Sbjct: 132 SESLAASLRAEIAAAGGWIPFSRYMERALYAPGAGYYSGGAQKFGRRAEDGSDFVTAPEL 191 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F + LA + A E G R++E G G G ++ L + + Sbjct: 192 SPLFAQTLARPVAQALEASG---TRRVMEFGAGTG---KLAAGLLNALAALGVELDEYAI 245 Query: 116 VETSERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 V+ S L Q++ L ++ W +L + G +V NE D++P++ V Sbjct: 246 VDLSGELRARQRETLEAEAPGLAARVRWLDALPERFEG--VVVGNEVLDAMPVRLVVKHA 303 Query: 172 HGIRERMIDIDQHDSLVFNIGDHEIKSNFLTC--SDYFLGAIFENSPCRDREMQSISDRL 229 G RER + ID + F + D G + E ++++ L Sbjct: 304 DGWRERGVAIDDAGAFAFADRPLARAEDAARLVEIDADEGYVTETHDAAAAFVRTVCTML 363 Query: 230 ACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSI 284 A I + Y + R TL H P + PG D++SHV+F + Sbjct: 364 ARGAAFFIDYGFPSHEYYHRQRAQGTLMCHYRHRAHGDPFLYPGLQDITSHVEFSAIYEA 423 Query: 285 AILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARK-DILLDSVKRLVSTSADKKSMGE 343 + + G T+Q +FL G+ + A+ ++V++L+S + MGE Sbjct: 424 GVGAGSDLLGYTSQARFLLNAGVTDVLAEIDPSDAQHFLPAANAVQKLIS----EAEMGE 479 Query: 344 LFKILVVSH 352 LFK++ S Sbjct: 480 LFKVIAFSR 488 >gi|327302796|ref|XP_003236090.1| hypothetical protein TERG_03140 [Trichophyton rubrum CBS 118892] gi|326461432|gb|EGD86885.1| hypothetical protein TERG_03140 [Trichophyton rubrum CBS 118892] Length = 501 Score = 235 bits (600), Expect = 6e-60, Method: Composition-based stats. Identities = 116/456 (25%), Positives = 181/456 (39%), Gaps = 99/456 (21%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-----NPFGAVGDFVTAPEIS 56 L ++I + I G +++ + C+ E GYY++ + FG GDFVT+PEIS Sbjct: 42 STPLAKRITDAINTTGPISIAAFMRQCLTSDEGGYYTSRGTPGRDVFGKEGDFVTSPEIS 101 Query: 57 QIFGEMLAIFLICAWEQHGFPSC-VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 Q+FGE+L I+++ W G S V+L+E GPG+G +M DILR + K SV +YM Sbjct: 102 QMFGELLGIWIVTEWLSQGRRSSGVQLMEFGPGKGTLMADILRSVRNFKGFASSVEGVYM 161 Query: 116 VETSERLTLIQKKQLASYGD--------------------KINWYTSLADVPLGFTFLVA 155 +E S L IQKK L L F++A Sbjct: 162 IEASPTLRDIQKKALCGDAPMEECDIGYKSISIHLGVPVYWTEHIRILTQTEDKAPFIIA 221 Query: 156 NEFFDSLPIKQFVMTEHGIRE-----------RMIDIDQHDSLVFNIG-------DHEIK 197 +EFFD+LPI F E R + + + + + E Sbjct: 222 HEFFDALPIHAFQAVHSPPPETINTPTGPAELRQPSLPLNGTQWRELVVATNPEAEREPD 281 Query: 198 SNFLTCSDYFL--------------------------------GAIFENSPCRDREMQSI 225 ++ + G+ E SP Q I Sbjct: 282 CDYGNDKNDKKLEFRLALAKSPTPASLVMPEMSPRYKALKSTRGSTIEISPESHTYAQEI 341 Query: 226 SDRL-------------ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADL 272 + + G A+++DYG + ++L+ +K H VSP PG+ DL Sbjct: 342 ARLIGGPNPTDKKPLPTRTPAGAALILDYGPSSTIPVNSLRGIKNHQVVSPFATPGEVDL 401 Query: 273 SSHVDFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLMKQ---TARKDILLDS 327 S+ VDF L+ A+ + + G QG FL LGI +RA L++ ++ + S Sbjct: 402 SADVDFTGLAESALNASPGVEVYGPNEQGSFLRSLGIAERAAQLLRNVNDEEKRKQIESS 461 Query: 328 VKRLVSTSADKKSMGELFKILVVSHE---KVELMPF 360 +RLV MG ++K + + E K + F Sbjct: 462 WQRLVERGG--GGMGRIYKAMAIVPESGGKRRPVGF 495 >gi|269839615|ref|YP_003324307.1| hypothetical protein Tter_2597 [Thermobaculum terrenum ATCC BAA-798] gi|269791345|gb|ACZ43485.1| protein of unknown function DUF185 [Thermobaculum terrenum ATCC BAA-798] Length = 384 Score = 235 bits (600), Expect = 6e-60, Method: Composition-based stats. Identities = 91/368 (24%), Positives = 148/368 (40%), Gaps = 22/368 (5%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEISQIFGEM 62 +L+R I I + G++T +++ L + P GYYS P G GD+ T+ ++S +FG Sbjct: 13 QLLRIIRREIAERGRITFERFMDLALYHPAHGYYSAGGPRIGPEGDYYTSTDVSPLFGAT 72 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L ++ WE G P +VE G G+G++ D+L P+F+ + +VE S Sbjct: 73 LGRQVVEMWELLGRPDPFHVVEHGAGKGLLAADLLGWAGAAHPEFYRAVRYLIVEVSPAA 132 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 Q++ L L + + ++NE D+LP + MT GI E + Sbjct: 133 RERQREHLWRLPVSWADDADLEEDSII-GVCLSNELADALPFHRIRMTGAGIDELWVV-- 189 Query: 183 QHDSLVFNIGDHEIKSNFLTC------SDYFLGAIFENSPCRDREMQSISDRLACDGGTA 236 D L + E L G E + + + L G Sbjct: 190 -DDGLALGLAPGEPSDRRLEAYVERWGRALRPGQQAEANLRAIDWARRVVRSLRR--GFF 246 Query: 237 IVIDYG------YLQSRVGDTLQAVKGH-TYVSPLVNPGQADLSSHVDFQRLSSIAILYK 289 + IDYG + TL H PL G+ D+++HV+F L+ Sbjct: 247 LTIDYGGRAEEVHGPDHPDGTLTCYYRHTQNRDPLRRVGEQDITAHVNFSALAETVREAG 306 Query: 290 LYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILV 349 + G TTQG FL LGI + +++ A+ + V+ V +G FK+LV Sbjct: 307 GEVTGYTTQGYFLAALGIGEAVEWALRR-AKSTREFEQVRAQVEELVRPDGLGG-FKVLV 364 Query: 350 VSHEKVEL 357 + Sbjct: 365 AHKGLMSP 372 >gi|148244557|ref|YP_001219251.1| hypothetical protein COSY_0408 [Candidatus Vesicomyosocius okutanii HA] gi|146326384|dbj|BAF61527.1| conserved hypothetical protein [Candidatus Vesicomyosocius okutanii HA] Length = 373 Score = 235 bits (600), Expect = 6e-60, Method: Composition-based stats. Identities = 95/369 (25%), Positives = 151/369 (40%), Gaps = 27/369 (7%) Query: 5 LIRKIVN-LIKKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGEM 62 L + I N +I+ G + D++ L + P GYY + F GDF+TAPE S +FG Sbjct: 12 LEQIIKNTIIQNAGPIGFDEFMNLALYYPALGYYRSGLEKFSKNGDFITAPETSDLFGFC 71 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 LA ++E G G GI+ IL + +LK Y++E S L Sbjct: 72 LANQCAQVL-----NGTNDILEFGSGSGILATQILFELGRLKKLPQK---YYILELSGEL 123 Query: 123 TLIQ----KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERM 178 Q K L D+I W L G ++ANE D++P K+ + E Sbjct: 124 KHRQAETISKVLPELIDRIVWLDELPSDFSG--VVIANEVLDAMPAKRVIYKNKQFYELG 181 Query: 179 IDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTA-- 236 ID +++ L ++ E + + S+ + Sbjct: 182 IDHHENEFFWRKFDSPYQNDKILLPNNVIEDYRTEINLHAIAWIDSLYNATNKALVLLID 241 Query: 237 --IVIDYGYLQSRVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYIN 293 + D + R+ TL+ H +P +N G+ D+++ V+F ++ A + +N Sbjct: 242 YGMSRDEYFHPQRLDGTLRCYYHHKASENPFLNIGKQDITTSVNFSDIADQASISGFKVN 301 Query: 294 GLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHE 353 G TQ FL LGI K ++ L +K+LV S MGE FK+L +S + Sbjct: 302 GYATQALFLISLGISDYLLK-QKDKNKRMNLAQQIKQLVLPSV----MGESFKVLALSKK 356 Query: 354 -KVELMPFV 361 V L+ FV Sbjct: 357 LSVRLIGFV 365 >gi|297537480|ref|YP_003673249.1| hypothetical protein M301_0285 [Methylotenera sp. 301] gi|297256827|gb|ADI28672.1| protein of unknown function DUF185 [Methylotenera sp. 301] Length = 388 Score = 235 bits (600), Expect = 6e-60, Method: Composition-based stats. Identities = 94/380 (24%), Positives = 165/380 (43%), Gaps = 32/380 (8%) Query: 2 ENKLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTC-NPFGA----VGDFVTAPEI 55 +LI I I + G ++ ++ L + P GYYS FG GDFVTAP+I Sbjct: 15 SQQLITLIQKTINAQKGWISFAEFMHLALYAPGLGYYSAGSQKFGDSKKGGGDFVTAPQI 74 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F + ++ + + ++ELG G G + DIL + +L ++ Sbjct: 75 SPLFAQTISNQIAQVLDIT----HGNILELGAGTGKLAADILLTMAELGSVPAK---YFI 127 Query: 116 VETSERLTLIQKKQLASYGDK--INWYTSLADVPLGFT-FLVANEFFDSLPIKQFVMTEH 172 +E S+ L +Q + L S + + L ++P F ++ NE D++P+ + Sbjct: 128 LEVSDHLRQVQLETLQSKLPQNLVQRVEWLTELPSNFNGVIIGNEVLDAIPVHMVNVKND 187 Query: 173 GIRERMIDIDQHDSLVFNIGDHEIKSNFLTCS-DYFLGAIFENSPCRDREMQSISDRLAC 231 GI E I ++ + + + E + G + E P + S++ L Sbjct: 188 GIYEHGIVVEDGEFVWQDKISSEPSMLDAVSKLNLPEGYVTEICPAASGLITSLAHSL-- 245 Query: 232 DGGTAIVIDY------GYLQSRVGDTLQAVKGH-TYVSPLVNPGQADLSSHVDFQRLSSI 284 G ++IDY Y R TL H + PL+N G D+++HVDF ++ Sbjct: 246 QQGIILMIDYGFAAREYYHPQRNQGTLMCHYQHYAHSDPLINIGLQDITAHVDFTSIAHA 305 Query: 285 AILYKLYINGLTTQGKFLEGLGIWQRAFSLMK-QTARKDILLDSVKRLVSTSADKKSMGE 343 + + L ++G +Q +FL GI + + AR L + ++L+S MG+ Sbjct: 306 GVNHGLALSGFCSQAQFLMNCGILELMSQVSPHDMARYAPLAAAAQKLLS----PAEMGD 361 Query: 344 LFKILVVSHE-KVELMPFVN 362 LFK++ S L+ FV+ Sbjct: 362 LFKVVAFSKNIDEPLIGFVS 381 >gi|40063245|gb|AAR38072.1| conserved hypothetical protein [uncultured marine bacterium 577] Length = 398 Score = 235 bits (600), Expect = 6e-60, Method: Composition-based stats. Identities = 91/379 (24%), Positives = 154/379 (40%), Gaps = 28/379 (7%) Query: 2 ENKLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYY-STCNPFGAVGDFVTAPEISQIF 59 + + I + I G + ++Y L + P GYY S FG GDFVTAPE+S +F Sbjct: 15 SSAVKNMICSEITTAGGWIPFERYMELAIYSPGMGYYCSGTTKFGCAGDFVTAPEVSSLF 74 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 G +A + E G ++E G G G + LDIL + L ++E S Sbjct: 75 GRAIAQQAVQIIESVG-EDSSDILEFGAGTGKLALDILLELENLNRLP---RHYLILEVS 130 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADV--PLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 L Q K A + + + ++ANE D++P+ ++ + ER Sbjct: 131 GELQEKQNKLFAKFAPHLMSRVQWLEQLPTKFKGLILANEVLDAMPVHLVARRDNDLFER 190 Query: 178 MIDIDQHDSLV---------FNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDR 228 + + EI + I E + ++SI++ Sbjct: 191 GVVWNGKRFEWSDRLLVEGELFRIAEEIIPLASPDNKNIEIYISEINLSARGFIRSIANI 250 Query: 229 LACDGGTAIVI----DYGYLQSRVGDTL-QAVKGHTYVSPLVNPGQADLSSHVDFQRLSS 283 L I D Y + R T+ + H + P PG D++SHVDF ++ Sbjct: 251 LEKGAVVLIDYGFGRDEYYHEQRNRGTMMCHYRHHAHDDPFYFPGLQDITSHVDFTAITD 310 Query: 284 IAILYKLYINGLTTQGKFLEGLGIWQRAFSL-MKQTARKDILLDSVKRLVSTSADKKSMG 342 +A L + G T+Q +FL GI + + ++ T+ + + +++LVS MG Sbjct: 311 VAAGEGLELLGYTSQAQFLINCGITEILSRIPVENTSDYLPMANQMQKLVS----PAEMG 366 Query: 343 ELFKILVVSHEK-VELMPF 360 ELFK++ + + L+ F Sbjct: 367 ELFKVIALGKDNQQPLIGF 385 >gi|91200015|emb|CAJ73057.1| conserved hypothetical protein [Candidatus Kuenenia stuttgartiensis] Length = 377 Score = 235 bits (600), Expect = 6e-60, Method: Composition-based stats. Identities = 86/368 (23%), Positives = 159/368 (43%), Gaps = 28/368 (7%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEISQIFGE 61 L I+ IK G +T ++ + + PE+GYY++ G GDF T+P + ++FGE Sbjct: 2 TILSDLIIERIKNKGNITFAEFMQMALYYPEYGYYNSNAVSIGKSGDFYTSPAVHRMFGE 61 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 ++A+ L W G + +VE+G G + DI+R I P F+ + ++VE++ Sbjct: 62 LIAVQLEEMWRILGR-ASFTVVEMGANAGWLCYDIMRYIKNEYPWFYKKVQYFIVESNPH 120 Query: 122 LTLIQKKQLASYGDKINWYTSLADVPLGFTF------LVANEFFDSLPIKQFVMTEHGIR 175 L Q++ + + GF+F ++NEF D+LP+ + ++ E ++ Sbjct: 121 LREKQQELFCGNPVFDEKLSWHSYGEDGFSFDAVQGCFLSNEFIDALPVHRLLVKEGEVK 180 Query: 176 ERMIDIDQHD--SLVFNIGDHEIKSNFLTCS-DYFLGAIFENSPCRDREMQSISDRLACD 232 E + + +D +V ++ +H +K + + E + + ++ +L C Sbjct: 181 EIYVGYNGNDFYEIVGDVCNHALKDYCINAEMPLRESRVLEINLAARDYLNHVAQKLNC- 239 Query: 233 GGTAIVIDYGYLQSR------VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIA 285 G + IDYG R G TL+ H G+ D+++ VDF L + Sbjct: 240 -GFVLTIDYGDTAQRRYRDNTTGGTLRCYYRHNVNHDYYERLGEQDITASVDFSFLMDVG 298 Query: 286 ILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELF 345 L + GL Q +L LG+ ++ ++ V + MGE+F Sbjct: 299 KGAGLEVTGLVKQSHYLIALGVLEKLNNIRNNLETVLK--------VKNLFLPEGMGEVF 350 Query: 346 KILVVSHE 353 K L+ Sbjct: 351 KALIQHRN 358 >gi|209527409|ref|ZP_03275915.1| protein of unknown function DUF185 [Arthrospira maxima CS-328] gi|209492144|gb|EDZ92493.1| protein of unknown function DUF185 [Arthrospira maxima CS-328] Length = 389 Score = 235 bits (600), Expect = 6e-60, Method: Composition-based stats. Identities = 85/375 (22%), Positives = 147/375 (39%), Gaps = 27/375 (7%) Query: 1 MENKLIRKIVNLIKKN--GQMTVDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEISQ 57 + L+ +I I + ++T +Y + + DP+ GYY+ +P GA GDF T+P + Sbjct: 2 VSVTLVDRISQRIGNHPQNRITFAEYMEMALYDPKQGYYNHNSPQIGAQGDFFTSPHLGS 61 Query: 58 IFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVE 117 FGE+LA L+ WE G P LVE+G G+GI+ DI+ + P VL + E Sbjct: 62 DFGELLAEQLVEMWEILGKPEPFTLVEMGAGQGILAADIIGYLQGQYPQVVGVLDYAIAE 121 Query: 118 TSERLTLIQKKQLASYGDKINWYT----SLADVPLGFTFLVANEFFDSLPIKQFVMTEHG 173 S RL Q+++ G +NE D+ P+ + Sbjct: 122 KSTRLKTEQQRRFQQLGAPFTQIRWCDLDEIANHSITGCFFSNELIDAFPVHLVTRQNNQ 181 Query: 174 IRERMIDIDQHDSLVF------NIGDHEIKSNFL------TCSDYFLGAIFENSPCRDRE 221 ++E + S + ++ F Y G E + Sbjct: 182 LQEIYLTTTGSKSDYQLAEVVGELSTPQLADYFRLVGIDLLSEAYPEGYRTEVNLAALGW 241 Query: 222 MQSISDRLACDGG----TAIVIDYGYLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHV 276 +++++ +L D Y +R TLQ H + +P + G+ D+++HV Sbjct: 242 VETVARKLRRGFVLTIDYGYSADRLYSPTRREGTLQCYYQHRHHNNPYIYIGEQDITAHV 301 Query: 277 DFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSA 336 DF L L G T Q F+ LG+ R ++ + K + + + + Sbjct: 302 DFTALQQKGRSLGLQTIGFTQQALFMMALGLGDRIATVSE--GPKISQVLRRREALHSLI 359 Query: 337 DKKSMGELFKILVVS 351 D +G F +L+ Sbjct: 360 DPMGLGN-FGVLIQG 373 >gi|242795147|ref|XP_002482520.1| DUF185 domain protein [Talaromyces stipitatus ATCC 10500] gi|218719108|gb|EED18528.1| DUF185 domain protein [Talaromyces stipitatus ATCC 10500] Length = 526 Score = 235 bits (600), Expect = 7e-60, Method: Composition-based stats. Identities = 116/475 (24%), Positives = 183/475 (38%), Gaps = 124/475 (26%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYY------STCNPFGAVGDFVTAPEI 55 L + + + I+ G +++ Y + +P+ GYY S FG GDF+T+PEI Sbjct: 39 STPLAKILADAIRTTGPISIAAYMRQVLTNPDAGYYTTPSSQSKTEVFGKKGDFITSPEI 98 Query: 56 SQIFGEMLAIFLICAWEQHGFPSC-VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIY 114 +QIFGE++ I+ + W G P V L+E+GPG+G +M DILR + K S+ +IY Sbjct: 99 TQIFGELVGIWTVTEWMAQGMPKEGVELIEVGPGKGTLMDDILRTLRNFKTFSKSIENIY 158 Query: 115 MVETSERLTLIQKKQL------------------ASYGDKINWYTSLA--DVPLGFTFLV 154 +VE S L +QK L G I W + F+ Sbjct: 159 LVEASAPLREVQKNLLCGPDAVLEEIDIGYRGINKHTGAPIVWVEDIRLLPYNDKMPFIF 218 Query: 155 ANEFFDSLPIKQFVM------------------------------------TEHGIRERM 178 A+EFFD+LPI F T RE M Sbjct: 219 AHEFFDALPIHAFECIQPTESEEKQQPKQIMTPTGPLDLDHTNQRNTKNRPTGPQWRELM 278 Query: 179 IDI-------DQHDSLVFNIGDHEIKS----------NFLTCSDYFLGAIFENSPCRDRE 221 + + + D F + +I + G++ E SP Sbjct: 279 VALNSKSVVENIKDEPEFQLSRAKISTPNSLLLPEISERYKALKSQPGSVIEVSPESRIY 338 Query: 222 MQSISDRL-----------------------------------ACDGGTAIVIDYGYLQS 246 + + R+ G A+++DYG + Sbjct: 339 VADFARRIGGYAPPSEPRLPKRKPGEVAKQVSPGDIPTAPIQKKHPSGAALILDYGTTST 398 Query: 247 RVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILY--KLYINGLTTQGKFLEG 304 ++L+ ++ H SP PGQ D+S+ VDF L+ A+ + ++G Q +FL Sbjct: 399 VPINSLRGIRQHATTSPFAYPGQVDVSADVDFTSLAEAALEASEGVEVHGPVDQAEFLHS 458 Query: 305 LGIWQRAFSL-----MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 LGI +RA L + ++ +L + KRLV MG+L+K LV+ E Sbjct: 459 LGIAERAEQLLSKLPADKEEKRKMLQTAWKRLVDKG--PNGMGKLYKALVIVPEN 511 >gi|186477643|ref|YP_001859113.1| hypothetical protein Bphy_2895 [Burkholderia phymatum STM815] gi|184194102|gb|ACC72067.1| protein of unknown function DUF185 [Burkholderia phymatum STM815] Length = 397 Score = 235 bits (600), Expect = 7e-60, Method: Composition-based stats. Identities = 86/371 (23%), Positives = 145/371 (39%), Gaps = 33/371 (8%) Query: 2 ENKLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTCN-PFGAV----GDFVTAPEI 55 L+ +I I NG M D+Y + P GYYS FG DFVTAPE+ Sbjct: 22 SEALVSRIRAEIDANGGWMPFDRYMERALYAPGLGYYSGGAVKFGRRAEDGSDFVTAPEL 81 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F LA + A E G ++E G G G + +L + +L F + + Sbjct: 82 SPLFAGTLARPVAQALEMSG---TRHVMEFGAGTGKLASGLLNALFELGAPFD---TYSI 135 Query: 116 VETSERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 V+ S L Q++ + + ++ W +L G +V NE D++P++ F Sbjct: 136 VDLSGELRERQRETIDALAPALAPRVRWLDALPKAFEG--VVVGNEVLDAMPVRLFARAS 193 Query: 172 HGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFL----GAIFENSPCRDREMQSISD 227 ER + + L F + D + + E +++ Sbjct: 194 GTWHERGVA-AEGGMLRFEDRPLPSTHDAAFLRDLDIEGDADYVTETHEAALAFTRTVCT 252 Query: 228 RLACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLS 282 L I + Y R TL H P + PG D+++HV+F ++ Sbjct: 253 MLTRGAVFLIDYGFPRHEYYHAQRAQGTLMCHYRHRAHGDPFLYPGLQDITAHVEFTGIA 312 Query: 283 SIAILYKLYINGLTTQGKFLEGLGIWQRAFSLM-KQTARKDILLDSVKRLVSTSADKKSM 341 + + G T+Q +FL GI + ++ ++V++L+S + M Sbjct: 313 EAGVDAGADLLGYTSQARFLMNAGITEALSAIDPSDIPNFLPAANAVQKLLS----EAEM 368 Query: 342 GELFKILVVSH 352 GELFK++ S Sbjct: 369 GELFKVIAFSR 379 >gi|326471185|gb|EGD95194.1| hypothetical protein TESG_02686 [Trichophyton tonsurans CBS 112818] Length = 501 Score = 235 bits (600), Expect = 8e-60, Method: Composition-based stats. Identities = 114/446 (25%), Positives = 173/446 (38%), Gaps = 96/446 (21%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-----NPFGAVGDFVTAPEIS 56 L ++I + I G +++ + C+ E GYY++ + FG GDFVT+PEIS Sbjct: 42 STPLAKRITDAINTTGPISIAAFMRQCLTSDEGGYYTSRGTPGSDVFGKEGDFVTSPEIS 101 Query: 57 QIFGEMLAIFLICAWEQHGFPSC-VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 Q+FGE+L I+++ W G S V+L+E GPG+G +M DILR + K SV +YM Sbjct: 102 QMFGELLGIWIVTEWLSQGRRSSGVQLMEFGPGKGTLMADILRSVRNFKGFASSVEGVYM 161 Query: 116 VETSERLTLIQKKQLASYGD--------------------KINWYTSLADVPLGFTFLVA 155 +E S L IQKK L L F++A Sbjct: 162 IEASPTLREIQKKALCGDAPMEECDIGYKSISSHLGVPVYWTEHIRILPQTEDKAPFIIA 221 Query: 156 NEFFDSLPIKQFVMTEHGIRE-----------RMIDIDQHDSLVFNIGDHEIKSNFLTCS 204 +EFFD+LPI F E R + + + + Sbjct: 222 HEFFDALPIHAFQAVHCPPPETINTPTGPAELRQPSLPLNGTQWRELVVATNPEAEHEPD 281 Query: 205 ---------------------------------------DYFLGAIFENSPCRDREMQSI 225 G+ E SP Q I Sbjct: 282 GDGNSDKNDKKLEFRLALAKSPTPASLVMPEMSPRYKALKSTRGSTIEISPESHTYAQEI 341 Query: 226 SDRL-------------ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADL 272 + + G A+++DYG + ++L+ +K H VSP PG+ DL Sbjct: 342 ARLIGGPNPIDKNPSPTRTPAGAALILDYGPSSTIPVNSLRGIKNHEVVSPFATPGEVDL 401 Query: 273 SSHVDFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLM---KQTARKDILLDS 327 S+ VDF L+ A+ + + G QG FL LGI +RA L+ K ++ + S Sbjct: 402 SADVDFTGLAESALDASPGVEVYGPNEQGSFLRSLGIAERAAQLLRNVKDEEKRKQIESS 461 Query: 328 VKRLVSTSADKKSMGELFKILVVSHE 353 +RLV MG ++K + + E Sbjct: 462 WQRLVERGG--GGMGRIYKAMAIVPE 485 >gi|194099361|ref|YP_002002461.1| hypothetical protein NGK_1836 [Neisseria gonorrhoeae NCCP11945] gi|239999617|ref|ZP_04719541.1| hypothetical protein Ngon3_09063 [Neisseria gonorrhoeae 35/02] gi|240014792|ref|ZP_04721705.1| hypothetical protein NgonD_09145 [Neisseria gonorrhoeae DGI18] gi|240017240|ref|ZP_04723780.1| hypothetical protein NgonFA_08763 [Neisseria gonorrhoeae FA6140] gi|240081123|ref|ZP_04725666.1| hypothetical protein NgonF_07399 [Neisseria gonorrhoeae FA19] gi|240116318|ref|ZP_04730380.1| hypothetical protein NgonPID1_08797 [Neisseria gonorrhoeae PID18] gi|240118605|ref|ZP_04732667.1| hypothetical protein NgonPID_09129 [Neisseria gonorrhoeae PID1] gi|240121315|ref|ZP_04734277.1| hypothetical protein NgonPI_06020 [Neisseria gonorrhoeae PID24-1] gi|240124148|ref|ZP_04737104.1| hypothetical protein NgonP_09455 [Neisseria gonorrhoeae PID332] gi|240126236|ref|ZP_04739122.1| hypothetical protein NgonSK_08493 [Neisseria gonorrhoeae SK-92-679] gi|240128818|ref|ZP_04741479.1| hypothetical protein NgonS_09374 [Neisseria gonorrhoeae SK-93-1035] gi|254494332|ref|ZP_05107503.1| conserved hypothetical protein [Neisseria gonorrhoeae 1291] gi|260439865|ref|ZP_05793681.1| hypothetical protein NgonDG_02034 [Neisseria gonorrhoeae DGI2] gi|268601985|ref|ZP_06136152.1| conserved hypothetical protein [Neisseria gonorrhoeae PID18] gi|268604317|ref|ZP_06138484.1| conserved hypothetical protein [Neisseria gonorrhoeae PID1] gi|268682773|ref|ZP_06149635.1| conserved hypothetical protein [Neisseria gonorrhoeae PID332] gi|268684817|ref|ZP_06151679.1| conserved hypothetical protein [Neisseria gonorrhoeae SK-92-679] gi|268687200|ref|ZP_06154062.1| conserved hypothetical protein [Neisseria gonorrhoeae SK-93-1035] gi|193934651|gb|ACF30475.1| Conserved hypothetical protein [Neisseria gonorrhoeae NCCP11945] gi|226513372|gb|EEH62717.1| conserved hypothetical protein [Neisseria gonorrhoeae 1291] gi|268586116|gb|EEZ50792.1| conserved hypothetical protein [Neisseria gonorrhoeae PID18] gi|268588448|gb|EEZ53124.1| conserved hypothetical protein [Neisseria gonorrhoeae PID1] gi|268623057|gb|EEZ55457.1| conserved hypothetical protein [Neisseria gonorrhoeae PID332] gi|268625101|gb|EEZ57501.1| conserved hypothetical protein [Neisseria gonorrhoeae SK-92-679] gi|268627484|gb|EEZ59884.1| conserved hypothetical protein [Neisseria gonorrhoeae SK-93-1035] gi|317164868|gb|ADV08409.1| hypothetical protein NGTW08_1448 [Neisseria gonorrhoeae TCDC-NG08107] Length = 392 Score = 235 bits (599), Expect = 8e-60, Method: Composition-based stats. Identities = 89/374 (23%), Positives = 151/374 (40%), Gaps = 33/374 (8%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGE 61 L I I K+G + ++ L + P++GYY+ + G GDF+TAP ++ +F + Sbjct: 14 NLQTLIAEEIGKHGNWIPFSRFMELVLYAPQYGYYTGGSHKIGNTGDFITAPTLTPLFAQ 73 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 LA L Q + E G G G + D+ L S+ Y++E S Sbjct: 74 TLARQLQELLPQT----AGNIYEFGAGTGQLAADL------LGSVSDSINCYYIIEISPE 123 Query: 122 LTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 L QK + K+ T+L + G ++ NE D++P++ E G+ E Sbjct: 124 LAARQKNLIQARAPEASQKVVHLTALPEAFDG--IIIGNELLDAIPVEIVRKNEGGLLEH 181 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG----AIFENSPCRDREMQSISDRLACDG 233 + + ++ S + S YF E P + +++++ RL G Sbjct: 182 IGVCTDNGRFAYSARPLHDPSLSTSASLYFPQTDYPYTSELHPQQYAFIRTLASRLERGG 241 Query: 234 GTAIVIDY----GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILY 288 I + Y R TL + H +P G ADL++HV+F ++ Sbjct: 242 MIFIDYGFDAAQYYHPQRNQGTLIGHYRHHVIHNPFDFIGLADLTAHVNFTDIAQAGTDA 301 Query: 289 KLYINGLTTQGKFLEGLGIWQRAFSLMK-QTARKDILLDSVKRLVSTSADKKSMGELFKI 347 L + G Q FL LGI + K +A +V++L+ D+ MGELFK+ Sbjct: 302 GLDLTGYLPQSHFLLNLGITELLAQTGKTDSAAYIREAAAVQKLI----DQHEMGELFKV 357 Query: 348 LVVSHE-KVELMPF 360 + ++ F Sbjct: 358 IAFGKNIGIDWAGF 371 >gi|240113335|ref|ZP_04727825.1| hypothetical protein NgonM_07133 [Neisseria gonorrhoeae MS11] gi|268599409|ref|ZP_06133576.1| conserved hypothetical protein [Neisseria gonorrhoeae MS11] gi|268583540|gb|EEZ48216.1| conserved hypothetical protein [Neisseria gonorrhoeae MS11] Length = 392 Score = 235 bits (599), Expect = 8e-60, Method: Composition-based stats. Identities = 89/374 (23%), Positives = 151/374 (40%), Gaps = 33/374 (8%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGE 61 L I I K+G + ++ L + P++GYY+ + G GDF+TAP ++ +F + Sbjct: 14 NLQTLIAEEIGKHGNWIPFSRFMELVLYAPQYGYYTGGSHKIGNTGDFITAPTLTPLFAQ 73 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 LA L Q + E G G G + D+ L S+ Y++E S Sbjct: 74 TLARQLQELLPQT----AGNIYEFGAGTGQLAADL------LGSVSDSINCYYIIEISPE 123 Query: 122 LTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 L QK + K+ T+L + G ++ NE D++P++ E G+ E Sbjct: 124 LAARQKNLIQARAPEASQKVVHLTALPEAFDG--IIIGNELLDAIPVEIVRKNEGGLLEH 181 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG----AIFENSPCRDREMQSISDRLACDG 233 + + ++ S + S YF E P + +++++ RL G Sbjct: 182 IGVCTDNGRFTYSARPLHDPSLSTSASLYFPQTDYPYTSELHPQQYAFIRTLASRLERGG 241 Query: 234 GTAIVIDY----GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILY 288 I + Y R TL + H +P G ADL++HV+F ++ Sbjct: 242 MIFIDYGFDAAQYYHPQRNQGTLIGHYRHHVIHNPFDFIGLADLTAHVNFTDIAQAGTDA 301 Query: 289 KLYINGLTTQGKFLEGLGIWQRAFSLMK-QTARKDILLDSVKRLVSTSADKKSMGELFKI 347 L + G Q FL LGI + K +A +V++L+ D+ MGELFK+ Sbjct: 302 GLDLTGYLPQSHFLLNLGITELLAQTGKTDSAAYIREAAAVQKLI----DQHEMGELFKV 357 Query: 348 LVVSHE-KVELMPF 360 + ++ F Sbjct: 358 IAFGKNIGIDWAGF 371 >gi|190348924|gb|EDK41478.2| hypothetical protein PGUG_05576 [Meyerozyma guilliermondii ATCC 6260] Length = 520 Score = 235 bits (599), Expect = 8e-60, Method: Composition-based stats. Identities = 107/416 (25%), Positives = 178/416 (42%), Gaps = 61/416 (14%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGA-VGDFVTAPEISQIFGE 61 L + +IK NG +++ Y C+ P++GYY+T NP GDF+T+PEIS +FGE Sbjct: 105 ESLSDLLAEIIKTNGPLSLSAYMRQCLTHPDYGYYTTTNPLDKYTGDFITSPEISSVFGE 164 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLS--IYMVETS 119 M+ I+L W P +R++E GPG+G +M D++R KL I ++E S Sbjct: 165 MIGIWLFSTWTSQDNPQNIRIIEFGPGKGTLMFDVVRTFNKLAKSRIRSDQIEICLIEAS 224 Query: 120 ERLTLIQKKQLASYG---------------------DKINWYTSLADVPLGFTFLVANEF 158 L Q + L + ++D +++A+EF Sbjct: 225 PILRDEQAELLCGSKLNSADIKDSFYTKSSIWGNTVKWLETEKDISDDVQYANYILAHEF 284 Query: 159 FDSLPIKQFVMTEHGIRERMI--------------------DIDQHDSLVFNIGDHE--- 195 FD+LPIK F ++ G RE ++ + F++ Sbjct: 285 FDALPIKSFQKSDSGWRELLVEHSPSVLNTQGALPSGGSSSEFSPDLETDFHLTVSPKDT 344 Query: 196 ----IKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGT--AIVIDYGYLQSRVG 249 I + G+ E + ++ + + G A++IDYG Sbjct: 345 PSSLIPELSSRFNALPTGSRIEICTDAELYALKMASLINNEQGNGAALIIDYGLKSGIPS 404 Query: 250 DTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQ 309 ++L+ + H +VSP +PG+ DLS+ VDF+ L++I G QG +L +GI Sbjct: 405 NSLRGIYKHKFVSPFFSPGKVDLSADVDFENLAAITAKA-CSSFGPVDQGDWLHEMGIGY 463 Query: 310 RAFSLMK----QTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM-PF 360 R L+K A +D + S +RL ++ SMG +KIL + ++ F Sbjct: 464 RIDQLLKSNEGNPAEQDKVYASYRRLTDK--NENSMGGAYKILCLVPHSAQMPIGF 517 >gi|78224276|ref|YP_386023.1| hypothetical protein Gmet_3084 [Geobacter metallireducens GS-15] gi|78195531|gb|ABB33298.1| protein of unknown function DUF185 [Geobacter metallireducens GS-15] Length = 385 Score = 235 bits (599), Expect = 9e-60, Method: Composition-based stats. Identities = 98/368 (26%), Positives = 171/368 (46%), Gaps = 22/368 (5%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFG 60 +N+L I+N I++ G++ + A+C+ +P GYY++ GA GDF T+ + +FG Sbjct: 6 DNRLRGIILNRIREQGRIPFADFMAMCLYEPGLGYYTSPGRKVGAEGDFYTSINVHSVFG 65 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 ++A + WE+ G P+ +VE G G G + D++ I +L P + + + ++E Sbjct: 66 RLIAREICRMWEEMGRPASFDIVEAGAGHGRLATDVIDAIQELNPTLYDGIRLTLIEAEP 125 Query: 121 RLTLIQKKQLASYGDKINWYT--SLADVPLGFT-FLVANEFFDSLPIKQFVMTEHGIRER 177 L +Q + LA + K++W T LA+ L FT L +NE DS P M G+RE Sbjct: 126 SLAAVQGELLAPHLPKVSWSTPADLAEGRLRFTGCLYSNELIDSFPPHLVEMGPEGLREV 185 Query: 178 MIDIDQHDSLVFNIGD--HEIKSNFLTCSDYF-LGAIFENSPCRDREMQSISDRLACDGG 234 + + E+++ F +G E + R ++S++ L G Sbjct: 186 FVAAEGDQFSEILDLPSTPELEAYLSRLGITFAVGQRAEINLNAVRWLESVATALER--G 243 Query: 235 TAIVIDYGYLQS------RVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAIL 287 + IDYGYL R+ TL HT +P + G D+++HVDF L++ Sbjct: 244 FVLTIDYGYLAPELYGPMRLNGTLLCYYRHTIEENPYIRIGLQDMTTHVDFTTLAARGEE 303 Query: 288 YKLYINGLTTQGKFLEGLGIWQRAFSLMK---QTARKDILLDSVKRLVSTSADKKSMGEL 344 L Q +FL G+ + +L + + ++K+L+ + MG+ Sbjct: 304 LGLRKVWYGEQYRFLVATGMMEELMALEAAATSEKEQIAIRLALKKLILP---EGGMGDT 360 Query: 345 FKILVVSH 352 FK+LV + Sbjct: 361 FKVLVQAK 368 >gi|152981198|ref|YP_001355125.1| hypothetical protein mma_3435 [Janthinobacterium sp. Marseille] gi|151281275|gb|ABR89685.1| Uncharacterized conserved protein [Janthinobacterium sp. Marseille] Length = 386 Score = 235 bits (599), Expect = 9e-60, Method: Composition-based stats. Identities = 101/369 (27%), Positives = 160/369 (43%), Gaps = 33/369 (8%) Query: 2 ENKLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIF 59 N L I + I ++ G ++ +Y L + P+ GYYS G GDF TAPEI+ +F Sbjct: 16 SNTLQNLIADEINRRAGWISFARYMELALYAPDVGYYSGGAAKLGKDGDFTTAPEITSLF 75 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 GE LA Q +++E G G G + LDIL + +VE S Sbjct: 76 GETLAHAAGELMAQS----APQILEFGAGTGKLALDILTECAA---AGIPLERYSIVELS 128 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMI 179 L Q++ LA + +++W G ++ NE D++P+ V E ER + Sbjct: 129 GELRARQQQTLAGF-PQVSWLDDFPPAFSG--VVLGNEVLDAMPVSLVVKGEAEWLERGV 185 Query: 180 DIDQHDSLVFN--IGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLAC------ 231 I +F D E+ + ++ +G + E P M +++ L Sbjct: 186 SIA-DGRFIFVDRPCDQELITQIPDAAELPVGYLTEVHPIAAGFMHTLATMLTAGFEQSG 244 Query: 232 DGGTAIVIDY------GYLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSI 284 GG AI+IDY YL R T + + H++ P PG D+++HVDF ++ Sbjct: 245 KGGAAILIDYGFPASEYYLDQRAEGTLMCHYRHHSHPDPFYLPGLQDITAHVDFTSMAYA 304 Query: 285 AILYKLYINGLTTQGKFLEGLGIWQRAFSLM-KQTARKDILLDSVKRLVSTSADKKSMGE 343 A+ L + G +Q FL G+ R + + ++V++L S MGE Sbjct: 305 AVRNGLEMVGYMSQAGFLLTAGLGDRLLATAPEDHMAYLPKANAVQKLTS----PAEMGE 360 Query: 344 LFKILVVSH 352 LFK+LVV Sbjct: 361 LFKVLVVGK 369 >gi|313667804|ref|YP_004048088.1| hypothetical protein NLA_4590 [Neisseria lactamica ST-640] gi|313005266|emb|CBN86699.1| conserved hypothetical protein [Neisseria lactamica 020-06] Length = 382 Score = 235 bits (599), Expect = 9e-60, Method: Composition-based stats. Identities = 87/366 (23%), Positives = 149/366 (40%), Gaps = 32/366 (8%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGE 61 L + IKK+G + ++ L + P++GYY+ + G GDF+TAP ++ +F Sbjct: 16 NLQTLLAEEIKKHGNWIPFSRFMELVLYAPQYGYYTGGSHKIGNGGDFITAPTLTPLFAR 75 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 LA L Q + E G G G + D+L + + Y++E S Sbjct: 76 TLARQLQELLPQT----AGNIYEFGAGTGQLAADLLNNLSD------GINRYYIIEISPE 125 Query: 122 LTLIQKKQLASYGDK----INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 L QK + + + I ++L + G ++ NE D++P++ E G E Sbjct: 126 LAARQKDLIHTLVPQAAQKIVHLSALPETFDG--IIIGNEVLDAMPVEIIRKDEGGSFEH 183 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIF----ENSPCRDREMQSISDRLACDG 233 + + ++ S + S YF F E P + +++++ RL G Sbjct: 184 VGVCLDNGRFAYSARPLNDPSLSASASLYFPQTDFPYTGELHPQQYAFIRTLASRLERGG 243 Query: 234 GTAIVIDY----GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILY 288 I + Y R TL + H +P G ADL++HV+F ++ Sbjct: 244 MIFIDYGFDAAQYYHPQRSQGTLIGHYRHHVIHNPFDFIGLADLTAHVNFTDIAQAGTDA 303 Query: 289 KLYINGLTTQGKFLEGLGIWQRAFSLMK-QTARKDILLDSVKRLVSTSADKKSMGELFKI 347 L + G Q FL LGI + K +A +V++L+ D+ MGELFK+ Sbjct: 304 GLDLIGYLPQSHFLLNLGITELLAQTGKTDSAAYIREAAAVQKLI----DQHEMGELFKV 359 Query: 348 LVVSHE 353 + Sbjct: 360 IAFGKN 365 >gi|134096343|ref|YP_001101418.1| hypothetical protein HEAR3190 [Herminiimonas arsenicoxydans] gi|133740246|emb|CAL63297.1| Conserved hypothetical protein [Herminiimonas arsenicoxydans] Length = 386 Score = 235 bits (599), Expect = 9e-60, Method: Composition-based stats. Identities = 100/365 (27%), Positives = 156/365 (42%), Gaps = 31/365 (8%) Query: 4 KLIRKIVNLIKKN-GQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIFGE 61 KL I I+++ G ++ +Y L + P+ GYYS G GDF TAPEI+ +FGE Sbjct: 18 KLQNLIAEEIRRHDGWISFARYMELALYAPDLGYYSGGAAKLGKDGDFTTAPEITSLFGE 77 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 LA Q ++E G G G + LDIL + +VE S Sbjct: 78 TLAHAAGDLMAQS----APEILEFGAGTGKLALDILTECAA---AGIHLEHYAIVELSGE 130 Query: 122 LTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 L Q++ LA + +++W G ++ NE D++P+ V EH ER + Sbjct: 131 LRARQQQLLAGF-PQVSWLDDFPPAFSG--VVLGNEVLDAMPVSLVVKGEHAWLERGVAY 187 Query: 182 DQHDSLVFN-IGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLAC------DGG 234 + + D + + + +G + E P M ++++ A GG Sbjct: 188 TDGKFVYADRPCDAALVAQIPDEENLPVGYLTEVHPVAAGFMGTLAEMFAAGLEQSGKGG 247 Query: 235 TAIVIDY------GYLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAIL 287 AI+ DY YL R T + + H + P PG D+++HVDF ++ A+ Sbjct: 248 AAILFDYGFPAGEYYLDQRSEGTLMCHYRHHAHPDPFYLPGLQDITAHVDFTAMAYAAVR 307 Query: 288 YKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR-KDILLDSVKRLVSTSADKKSMGELFK 346 L + G +Q FL G+ R AR ++V++L S MGELFK Sbjct: 308 SGLEMVGYMSQAAFLLAAGLGDRLLQTSPADARVYLPKANAVQKLTS----PAEMGELFK 363 Query: 347 ILVVS 351 +LVV Sbjct: 364 VLVVG 368 >gi|309379055|emb|CBX22357.1| unnamed protein product [Neisseria lactamica Y92-1009] Length = 382 Score = 235 bits (599), Expect = 9e-60, Method: Composition-based stats. Identities = 90/368 (24%), Positives = 151/368 (41%), Gaps = 36/368 (9%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGE 61 L + IKK+G + ++ L + P++GYY+ + G GDF+TAP ++ +F Sbjct: 16 NLQTLLAEEIKKHGNWIPFSRFMELVLYAPQYGYYTGGSHKIGNGGDFITAPTLTPLFAR 75 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 LA L Q + E G G G + D+L + + Y++E S Sbjct: 76 TLARQLQELLPQT----AGNIYEFGAGTGQLAADLLNNLSD------GINRYYIIEISPE 125 Query: 122 LTLIQKKQLASYGDK----INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 L QK + + + I ++L + G ++ NE D++P++ E G E Sbjct: 126 LAARQKDLIHTLVPQAAQKIVHLSALPETFDG--IIIGNEVLDAMPVEIIRKDEGGSFEH 183 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIF----ENSPCRDREMQSISDRLACDG 233 + + ++ S + S YF F E P + +++++ RL Sbjct: 184 VGVCLDNGRFAYSARPLNDPSLSASASLYFPQTDFPYTGELHPQQYAFIRTLASRLE--H 241 Query: 234 GTAIVIDY------GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAI 286 G I IDY Y R TL + H +P G ADL++HV+F ++ Sbjct: 242 GCMIFIDYGFDAAQYYHPQRSQGTLIGHYRHHVIHNPFDFIGLADLTAHVNFTDIAQAGT 301 Query: 287 LYKLYINGLTTQGKFLEGLGIWQRAFSLMK-QTARKDILLDSVKRLVSTSADKKSMGELF 345 L + G Q FL LGI + K +A +V++L+ D+ MGELF Sbjct: 302 DAGLDLIGYLPQSHFLLNLGITELLAQTGKTDSAAYIREAAAVQKLI----DQHEMGELF 357 Query: 346 KILVVSHE 353 K++ Sbjct: 358 KVIAFGKN 365 >gi|134279822|ref|ZP_01766534.1| conserved hypothetical protein [Burkholderia pseudomallei 305] gi|134249022|gb|EBA49104.1| conserved hypothetical protein [Burkholderia pseudomallei 305] Length = 396 Score = 235 bits (599), Expect = 9e-60, Method: Composition-based stats. Identities = 93/371 (25%), Positives = 151/371 (40%), Gaps = 34/371 (9%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTCNP-FGAVGD----FVTAPEI 55 + L + I G + +Y + P GYYS P FG GD FVTAPE+ Sbjct: 22 SDALAASLRAEIAAAGGWIPFSRYMERVLYAPGLGYYSGGAPKFGRRGDDGSDFVTAPEL 81 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F + LA + A G R++E G G G + +L + L + + Sbjct: 82 SPLFAQTLARPVAQALAASG---TRRVMEFGAGTGQLAAGLLNALAALGVELDE---YAI 135 Query: 116 VETSERLTLIQKKQLASYGD----KINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 V+ S L Q++ L ++ W +L + G +V NE D++P++ Sbjct: 136 VDLSGELRARQRETLDEQASGAAARVRWLDALPERFEG--VIVGNEVLDAMPVQLVAKHA 193 Query: 172 HGIRERMIDIDQHDSLVFNIGD--HEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRL 229 HG ER + + + F + L D G + E + ++ L Sbjct: 194 HGWCERGVSLGDAGAFAFADRPLARAEDAARLAALDADEGYVTETHDAAAAFVGTVCAML 253 Query: 230 ACDGGTAIVIDY------GYLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLS 282 A G A+ IDY Y + R TL H P V PG D+++HV+F + Sbjct: 254 AR--GAALFIDYGFPRHEYYHRQRAQGTLMCHYRHRAHGDPFVYPGLQDITAHVEFSAVY 311 Query: 283 SIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARK-DILLDSVKRLVSTSADKKSM 341 + + G T+Q +FL GI + A++ ++V++L+S + M Sbjct: 312 EAGVGAGAELLGYTSQARFLLNAGITDVLAEIDPSDAQRFLPAANAVQKLIS----EAEM 367 Query: 342 GELFKILVVSH 352 GELFK++ S Sbjct: 368 GELFKVIAFSR 378 >gi|310798388|gb|EFQ33281.1| hypothetical protein GLRG_08425 [Glomerella graminicola M1.001] Length = 512 Score = 235 bits (599), Expect = 9e-60, Method: Composition-based stats. Identities = 112/451 (24%), Positives = 179/451 (39%), Gaps = 94/451 (20%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYS-----TCNPFGAVGDFVTAPEIS 56 L +++ I G + + Y +C+ GYY+ + FG GDFVT+PEIS Sbjct: 60 STPLAKQLAEAISMTGPVPLASYMRMCLTGDIDGYYTGLAEENRDQFGLKGDFVTSPEIS 119 Query: 57 QIFGEMLAIFLICAWEQHGFPSC-VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 QIFGE++ ++ + W G P V L+E+GPGRG +M D+LR I K S+ +IYM Sbjct: 120 QIFGELIGVWFVAEWLSQGKPKQGVELIEVGPGRGTLMDDMLRTIQNFKGLAQSIDAIYM 179 Query: 116 VETSERLTLIQKKQLASYGD---------------------KINWYTSLADVPLGFTFLV 154 VE S +L QK L S+ P F+V Sbjct: 180 VEASPQLREAQKNLLCGPDAPMTESKVGYHSVCKYTNLPIVWTETIKSIPQSPNKMPFIV 239 Query: 155 ANEFFDSLPIKQFV-------------------MTEHGIRERMIDIDQHDSLVFNIGD-- 193 A+EFFD+LPI F + RE ++ D+ + Sbjct: 240 AHEFFDALPIHVFQAVNVPHPSLPKNPTPGPPIPPKVEWREMLVSPTPPDATHATMKIPK 299 Query: 194 -------------------------HEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDR 228 E S + A+ E P + R Sbjct: 300 SEQGDPIPDFQMTLSPGTTRHSRFLPESSSRYRRLKASVPDAVVEICPDASLYASDFASR 359 Query: 229 L--------ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQR 280 + + G A+++DYG + ++L+ ++ H VSP PG DLS+ VDF Sbjct: 360 IGGSKQHPKSRPTGAALILDYGTSDTVPINSLRGIRRHRRVSPFSEPGLVDLSADVDFTA 419 Query: 281 LSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLM------KQTARKDILLDSVKRLV 332 ++ A + ++G QG FLE +GI QRA L+ ++ + + + + RL+ Sbjct: 420 IAEAATRASQGVEVHGPIEQGAFLELMGIRQRAQVLINQVRGKEEFVKAEDIAKACGRLI 479 Query: 333 STSADKKSMGELFKILVVSHEKVE---LMPF 360 MG+++K++ + E + F Sbjct: 480 DRG--PGGMGKVYKVMAILPENDGRRRPVGF 508 >gi|126441994|ref|YP_001057383.1| hypothetical protein BURPS668_0330 [Burkholderia pseudomallei 668] gi|126221487|gb|ABN84993.1| conserved hypothetical protein [Burkholderia pseudomallei 668] Length = 396 Score = 235 bits (599), Expect = 1e-59, Method: Composition-based stats. Identities = 92/371 (24%), Positives = 150/371 (40%), Gaps = 34/371 (9%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGD----FVTAPEI 55 + L + I G + +Y + P GYYS FG GD FVTAPE+ Sbjct: 22 SDALAASLRAEIAAAGGWIPFSRYMERVLYAPGLGYYSGGAQKFGRRGDDGSDFVTAPEL 81 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F + LA + A G R++E G G G + +L + L + + Sbjct: 82 SPLFAQTLARPVAQALAASG---TRRVMEFGAGTGQLAAGLLNALAALGVELDE---YAI 135 Query: 116 VETSERLTLIQKKQLASYGD----KINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 V+ S L Q++ L ++ W +L + G +V NE D++P++ Sbjct: 136 VDLSGELRARQRETLDEQASGAAARVRWLDALPERFEG--VIVGNEVLDAMPVQLVAKHA 193 Query: 172 HGIRERMIDIDQHDSLVFNIGD--HEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRL 229 HG ER + + + F + L D G + E + ++ L Sbjct: 194 HGWCERGVSLGDAGAFAFADRPLARAEDAARLAALDADEGYVTETHDAAAAFVGTVCAML 253 Query: 230 ACDGGTAIVIDY------GYLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLS 282 A G A+ IDY Y + R TL H P V PG D+++HV+F + Sbjct: 254 AR--GAALFIDYGFPRHEYYHRQRAQGTLMCHYRHRAHGDPFVYPGLQDITAHVEFSAVY 311 Query: 283 SIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARK-DILLDSVKRLVSTSADKKSM 341 + + G T+Q +FL GI + A++ ++V++L+S + M Sbjct: 312 EAGVGAGAELLGYTSQARFLLNAGITDVLAEIDPSDAQRFLPAANAVQKLIS----EAEM 367 Query: 342 GELFKILVVSH 352 GELFK++ S Sbjct: 368 GELFKVIAFSR 378 >gi|300705454|ref|YP_003747057.1| hypothetical protein RCFBP_21301 [Ralstonia solanacearum CFBP2957] gi|299073118|emb|CBJ44476.1| conserved protein of unknown function [Ralstonia solanacearum CFBP2957] Length = 420 Score = 235 bits (599), Expect = 1e-59, Method: Composition-based stats. Identities = 87/378 (23%), Positives = 148/378 (39%), Gaps = 27/378 (7%) Query: 2 ENKLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAV----GDFVTAPEI 55 ++L IV+ I G + ++Y L + P GYYS FG GDF+TAPE+ Sbjct: 41 SDRLFSTIVHAIEAAGGWIPFERYMELALYAPGLGYYSGGAAKFGRRVEDGGDFITAPEL 100 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 + FG +A + + P ++E G G G + DIL + L S + Sbjct: 101 TPFFGRTVAHQIAQVLQAL-PPGQRHVLEFGAGTGRLAADILTELETLGMRPDS---YGI 156 Query: 116 VETSERLTLIQKKQLASYGDKINWYTSLAD--VPLGFTFLVANEFFDSLPIKQFVMTEHG 173 VE S L Q++ LA+ G + D ++ NE D++P+ + Sbjct: 157 VELSGELRQRQQQALAALGPDLAGLARWHDRLPARFTGAMIGNEVLDAMPVSLWARRGGA 216 Query: 174 IRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYF--LGAIFENSPCRDREMQSISDRLAC 231 R + D L ++ + + E+ + ++S L Sbjct: 217 WHRRGVAFDAEHGLRWSERAAAPAEVPPKLAALPGHEDFVTESHEAAEGFIRSTGAALER 276 Query: 232 DGGTAI-----VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAI 286 I +Y + G + + H + P PG D+++HVDF ++ A Sbjct: 277 GLLLLIDYGFPAAEYYHAHRANGTLMCHYRQHAHDDPFWLPGLQDITAHVDFSGIARAAH 336 Query: 287 LYKLYINGLTTQGKFLEGLGIWQRAFSLM-KQTARKDILLDSVKRLVSTSADKKSMGELF 345 L + G +Q +FL G G+ Q +L R ++V++L+S + MGELF Sbjct: 337 EAGLEVLGYASQARFLLGAGVGQLLMTLDPADPVRFLPAANAVQKLLS----EAEMGELF 392 Query: 346 KILVVSH---EKVELMPF 360 K + + + L F Sbjct: 393 KAIALGRGIDAALPLAGF 410 >gi|254295863|ref|ZP_04963320.1| conserved hypothetical protein [Burkholderia pseudomallei 406e] gi|157805726|gb|EDO82896.1| conserved hypothetical protein [Burkholderia pseudomallei 406e] Length = 446 Score = 235 bits (599), Expect = 1e-59, Method: Composition-based stats. Identities = 92/371 (24%), Positives = 150/371 (40%), Gaps = 34/371 (9%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGD----FVTAPEI 55 + L + I G + +Y + P GYYS FG GD FVTAPE+ Sbjct: 72 SDALAASLRAEIAAAGGWIPFSRYMERVLYAPGLGYYSGGAQKFGRRGDDGSDFVTAPEL 131 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F + LA + A G R++E G G G + +L + L + + Sbjct: 132 SPLFAQTLARPVAQALAASG---TRRVMEFGAGTGQLAAGLLNALAALGVELDE---YAI 185 Query: 116 VETSERLTLIQKKQLASYGD----KINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 V+ S L Q++ L ++ W +L + G +V NE D++P++ Sbjct: 186 VDLSGELRARQRETLDEQASGAAARVRWLDALPERFEG--VIVGNEVLDAMPVQLVAKHA 243 Query: 172 HGIRERMIDIDQHDSLVFNIGD--HEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRL 229 HG ER + + + F + L D G + E + ++ L Sbjct: 244 HGWCERGVSLGDAGAFAFADRPLARAEDAARLAALDADEGYVTETHDAAAAFVGTVCAML 303 Query: 230 ACDGGTAIVIDY------GYLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLS 282 A G A+ IDY Y + R TL H P V PG D+++HV+F + Sbjct: 304 AR--GAALFIDYGFPRHEYYHRQRAQGTLMCHYRHRAHGDPFVYPGLQDITAHVEFSAVY 361 Query: 283 SIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARK-DILLDSVKRLVSTSADKKSM 341 + + G T+Q +FL GI + A++ ++V++L+S + M Sbjct: 362 EAGVGAGAELLGYTSQARFLLNAGITDVLAEIDPSDAQRFLPAANAVQKLIS----EAEM 417 Query: 342 GELFKILVVSH 352 GELFK++ S Sbjct: 418 GELFKVIAFSR 428 >gi|89899622|ref|YP_522093.1| hypothetical protein Rfer_0812 [Rhodoferax ferrireducens T118] gi|89344359|gb|ABD68562.1| protein of unknown function DUF185 [Rhodoferax ferrireducens T118] Length = 375 Score = 235 bits (599), Expect = 1e-59, Method: Composition-based stats. Identities = 83/380 (21%), Positives = 154/380 (40%), Gaps = 40/380 (10%) Query: 1 MENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGA------------V 46 + + L I I G + D++ AL + P GYY+ FGA Sbjct: 7 LTSALGAHIGQTITHHGGWIGFDEFMALALYTPGLGYYANNSAKFGALPYARQAGVTVAG 66 Query: 47 GDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPD 106 DFVTAPE++ +FG+ LA + A + + E G G G + L +L + Sbjct: 67 SDFVTAPEMTPLFGQTLAAQVAQALQ---VTQTHEVWEFGAGSGALALQVLEALAA---Q 120 Query: 107 FFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQ 166 ++ +V+ S L Q+ L +GD + W L D G ++ NE D++P+K Sbjct: 121 GQALTRYSIVDLSGSLRERQQATLEKFGDTVQWVNELPDTLQG--VVIGNEVLDAMPVKL 178 Query: 167 FVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSIS 226 + ER + + L++ +++ D + E P + +++++ Sbjct: 179 LARVKGVWFERGVALGAASELIWQDRPTDLRPPLAI--DGEHDYLTEIHPQGESFIRTLA 236 Query: 227 DRLACDGGTAIVI-----DYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRL 281 D+L+ I +Y + Q +G + + PL + G D+++HV+F + Sbjct: 237 DKLSAGAAFFIDYGFPEHEYYHAQRHMGTVMCHRSHQSDTDPLTDVGLKDITAHVNFTGI 296 Query: 282 SSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSM 341 + + + G TQ +FL G+ R + ++L+ + M Sbjct: 297 ALACQMEDWGVLGYCTQARFLINCGLVSRMEN------ATLQERVRAQKLIM----EHEM 346 Query: 342 GELFKILVVSH-EKVELMPF 360 GELFK++ E + + F Sbjct: 347 GELFKVIGFYKGEPWQALGF 366 >gi|53717958|ref|YP_106944.1| hypothetical protein BPSL0317 [Burkholderia pseudomallei K96243] gi|121601017|ref|YP_994351.1| hypothetical protein BMASAVP1_A3057 [Burkholderia mallei SAVP1] gi|124384864|ref|YP_001028000.1| hypothetical protein BMA10229_A2036 [Burkholderia mallei NCTC 10229] gi|126449724|ref|YP_001081777.1| hypothetical protein BMA10247_2248 [Burkholderia mallei NCTC 10247] gi|126451987|ref|YP_001064626.1| hypothetical protein BURPS1106A_0343 [Burkholderia pseudomallei 1106a] gi|167001806|ref|ZP_02267598.1| conserved hypothetical protein [Burkholderia mallei PRL-20] gi|167813865|ref|ZP_02445545.1| hypothetical protein Bpse9_01921 [Burkholderia pseudomallei 91] gi|167822382|ref|ZP_02453853.1| hypothetical protein Bpseu9_01809 [Burkholderia pseudomallei 9] gi|167843970|ref|ZP_02469478.1| hypothetical protein BpseB_01677 [Burkholderia pseudomallei B7210] gi|167892474|ref|ZP_02479876.1| hypothetical protein Bpse7_01844 [Burkholderia pseudomallei 7894] gi|167909189|ref|ZP_02496280.1| hypothetical protein Bpse112_01762 [Burkholderia pseudomallei 112] gi|167917223|ref|ZP_02504314.1| hypothetical protein BpseBC_01654 [Burkholderia pseudomallei BCC215] gi|217425662|ref|ZP_03457153.1| conserved hypothetical protein [Burkholderia pseudomallei 576] gi|226200278|ref|ZP_03795822.1| conserved hypothetical protein [Burkholderia pseudomallei Pakistan 9] gi|237810525|ref|YP_002894976.1| hypothetical protein GBP346_A0249 [Burkholderia pseudomallei MSHR346] gi|238563697|ref|ZP_00438429.2| conserved hypothetical protein [Burkholderia mallei GB8 horse 4] gi|242315791|ref|ZP_04814807.1| conserved hypothetical protein [Burkholderia pseudomallei 1106b] gi|254176899|ref|ZP_04883556.1| conserved hypothetical protein [Burkholderia mallei ATCC 10399] gi|254182088|ref|ZP_04888685.1| conserved hypothetical protein [Burkholderia pseudomallei 1655] gi|254188018|ref|ZP_04894530.1| conserved hypothetical protein [Burkholderia pseudomallei Pasteur 52237] gi|254196631|ref|ZP_04903055.1| conserved hypothetical protein [Burkholderia pseudomallei S13] gi|254201948|ref|ZP_04908312.1| conserved hypothetical protein [Burkholderia mallei FMH] gi|254258011|ref|ZP_04949065.1| conserved hypothetical protein [Burkholderia pseudomallei 1710a] gi|52208372|emb|CAH34306.1| conserved hypothetical protein [Burkholderia pseudomallei K96243] gi|121229827|gb|ABM52345.1| conserved hypothetical protein [Burkholderia mallei SAVP1] gi|124292884|gb|ABN02153.1| conserved hypothetical protein [Burkholderia mallei NCTC 10229] gi|126225629|gb|ABN89169.1| conserved hypothetical protein [Burkholderia pseudomallei 1106a] gi|126242594|gb|ABO05687.1| conserved hypothetical protein [Burkholderia mallei NCTC 10247] gi|147747842|gb|EDK54918.1| conserved hypothetical protein [Burkholderia mallei FMH] gi|157935698|gb|EDO91368.1| conserved hypothetical protein [Burkholderia pseudomallei Pasteur 52237] gi|160697940|gb|EDP87910.1| conserved hypothetical protein [Burkholderia mallei ATCC 10399] gi|169653374|gb|EDS86067.1| conserved hypothetical protein [Burkholderia pseudomallei S13] gi|184212626|gb|EDU09669.1| conserved hypothetical protein [Burkholderia pseudomallei 1655] gi|217391338|gb|EEC31369.1| conserved hypothetical protein [Burkholderia pseudomallei 576] gi|225927600|gb|EEH23643.1| conserved hypothetical protein [Burkholderia pseudomallei Pakistan 9] gi|237503009|gb|ACQ95327.1| conserved hypothetical protein [Burkholderia pseudomallei MSHR346] gi|238520150|gb|EEP83612.1| conserved hypothetical protein [Burkholderia mallei GB8 horse 4] gi|242139030|gb|EES25432.1| conserved hypothetical protein [Burkholderia pseudomallei 1106b] gi|243062401|gb|EES44587.1| conserved hypothetical protein [Burkholderia mallei PRL-20] gi|254216700|gb|EET06084.1| conserved hypothetical protein [Burkholderia pseudomallei 1710a] Length = 396 Score = 235 bits (599), Expect = 1e-59, Method: Composition-based stats. Identities = 92/371 (24%), Positives = 150/371 (40%), Gaps = 34/371 (9%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGD----FVTAPEI 55 + L + I G + +Y + P GYYS FG GD FVTAPE+ Sbjct: 22 SDALAASLRAEIAAAGGWIPFSRYMERVLYAPGLGYYSGGAQKFGRRGDDGSDFVTAPEL 81 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F + LA + A G R++E G G G + +L + L + + Sbjct: 82 SPLFAQTLARPVAQALAASG---TRRVMEFGAGTGQLAAGLLNALAALGVELDE---YAI 135 Query: 116 VETSERLTLIQKKQLASYGD----KINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 V+ S L Q++ L ++ W +L + G +V NE D++P++ Sbjct: 136 VDLSGELRARQRETLDEQASGAAARVRWLDALPERFEG--VIVGNEVLDAMPVQLVAKHA 193 Query: 172 HGIRERMIDIDQHDSLVFNIGD--HEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRL 229 HG ER + + + F + L D G + E + ++ L Sbjct: 194 HGWCERGVSLGDAGAFAFADRPLARAEDAARLAALDADEGYVTETHDAAAAFVGTVCAML 253 Query: 230 ACDGGTAIVIDY------GYLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLS 282 A G A+ IDY Y + R TL H P V PG D+++HV+F + Sbjct: 254 AR--GAALFIDYGFPRHEYYHRQRAQGTLMCHYRHRAHGDPFVYPGLQDITAHVEFSAVY 311 Query: 283 SIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARK-DILLDSVKRLVSTSADKKSM 341 + + G T+Q +FL GI + A++ ++V++L+S + M Sbjct: 312 EAGVGAGAELLGYTSQARFLLNAGITDVLAEIDPSDAQRFLPAANAVQKLIS----EAEM 367 Query: 342 GELFKILVVSH 352 GELFK++ S Sbjct: 368 GELFKVIAFSR 378 >gi|53724887|ref|YP_104842.1| hypothetical protein BMA3384 [Burkholderia mallei ATCC 23344] gi|76808817|ref|YP_331937.1| hypothetical protein BURPS1710b_0523 [Burkholderia pseudomallei 1710b] gi|52428310|gb|AAU48903.1| conserved hypothetical protein [Burkholderia mallei ATCC 23344] gi|76578270|gb|ABA47745.1| conserved hypothetical protein [Burkholderia pseudomallei 1710b] Length = 410 Score = 235 bits (599), Expect = 1e-59, Method: Composition-based stats. Identities = 92/371 (24%), Positives = 150/371 (40%), Gaps = 34/371 (9%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGD----FVTAPEI 55 + L + I G + +Y + P GYYS FG GD FVTAPE+ Sbjct: 36 SDALAASLRAEIAAAGGWIPFSRYMERVLYAPGLGYYSGGAQKFGRRGDDGSDFVTAPEL 95 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F + LA + A G R++E G G G + +L + L + + Sbjct: 96 SPLFAQTLARPVAQALAASG---TRRVMEFGAGTGQLAAGLLNALAALGVELDE---YAI 149 Query: 116 VETSERLTLIQKKQLASYGD----KINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 V+ S L Q++ L ++ W +L + G +V NE D++P++ Sbjct: 150 VDLSGELRARQRETLDEQASGAAARVRWLDALPERFEG--VIVGNEVLDAMPVQLVAKHA 207 Query: 172 HGIRERMIDIDQHDSLVFNIGD--HEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRL 229 HG ER + + + F + L D G + E + ++ L Sbjct: 208 HGWCERGVSLGDAGAFAFADRPLARAEDAARLAALDADEGYVTETHDAAAAFVGTVCAML 267 Query: 230 ACDGGTAIVIDY------GYLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLS 282 A G A+ IDY Y + R TL H P V PG D+++HV+F + Sbjct: 268 AR--GAALFIDYGFPRHEYYHRQRAQGTLMCHYRHRAHGDPFVYPGLQDITAHVEFSAVY 325 Query: 283 SIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARK-DILLDSVKRLVSTSADKKSM 341 + + G T+Q +FL GI + A++ ++V++L+S + M Sbjct: 326 EAGVGAGAELLGYTSQARFLLNAGITDVLAEIDPSDAQRFLPAANAVQKLIS----EAEM 381 Query: 342 GELFKILVVSH 352 GELFK++ S Sbjct: 382 GELFKVIAFSR 392 >gi|254207281|ref|ZP_04913632.1| conserved hypothetical protein [Burkholderia mallei JHU] gi|147752823|gb|EDK59889.1| conserved hypothetical protein [Burkholderia mallei JHU] Length = 497 Score = 235 bits (598), Expect = 1e-59, Method: Composition-based stats. Identities = 92/371 (24%), Positives = 150/371 (40%), Gaps = 34/371 (9%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGD----FVTAPEI 55 + L + I G + +Y + P GYYS FG GD FVTAPE+ Sbjct: 123 SDALAASLRAEIAAAGGWIPFSRYMERVLYAPGLGYYSGGAQKFGRRGDDGSDFVTAPEL 182 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F + LA + A G R++E G G G + +L + L + + Sbjct: 183 SPLFAQTLARPVAQALAASG---TRRVMEFGAGTGQLAAGLLNALAALGVELDE---YAI 236 Query: 116 VETSERLTLIQKKQLASYGD----KINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 V+ S L Q++ L ++ W +L + G +V NE D++P++ Sbjct: 237 VDLSGELRARQRETLDEQASGAAARVRWLDALPERFEG--VIVGNEVLDAMPVQLVAKHA 294 Query: 172 HGIRERMIDIDQHDSLVFNIGD--HEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRL 229 HG ER + + + F + L D G + E + ++ L Sbjct: 295 HGWCERGVSLGDAGAFAFADRPLARAEDAARLAALDADEGYVTETHDAAAAFVGTVCAML 354 Query: 230 ACDGGTAIVIDY------GYLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLS 282 A G A+ IDY Y + R TL H P V PG D+++HV+F + Sbjct: 355 AR--GAALFIDYGFPRHEYYHRQRAQGTLMCHYRHRAHGDPFVYPGLQDITAHVEFSAVY 412 Query: 283 SIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARK-DILLDSVKRLVSTSADKKSM 341 + + G T+Q +FL GI + A++ ++V++L+S + M Sbjct: 413 EAGVGAGAELLGYTSQARFLLNAGITDVLAEIDPSDAQRFLPAANAVQKLIS----EAEM 468 Query: 342 GELFKILVVSH 352 GELFK++ S Sbjct: 469 GELFKVIAFSR 479 >gi|59801882|ref|YP_208594.1| hypothetical protein NGO1546 [Neisseria gonorrhoeae FA 1090] gi|59718777|gb|AAW90182.1| conserved hypothetical protein [Neisseria gonorrhoeae FA 1090] Length = 415 Score = 235 bits (598), Expect = 1e-59, Method: Composition-based stats. Identities = 89/374 (23%), Positives = 151/374 (40%), Gaps = 33/374 (8%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGE 61 L I I K+G + ++ L + P++GYY+ + G GDF+TAP ++ +F + Sbjct: 37 NLQTLIAEEIGKHGNWIPFSRFMELVLYAPQYGYYTGGSHKIGNTGDFITAPTLTPLFAQ 96 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 LA L Q + E G G G + D+ L S+ Y++E S Sbjct: 97 TLARQLQELLPQT----AGNIYEFGAGTGQLAADL------LGSVSDSINCYYIIEISPE 146 Query: 122 LTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 L QK + K+ T+L + G ++ NE D++P++ E G+ E Sbjct: 147 LAARQKNLIQARAPEASQKVVHLTALPEAFDG--IIIGNELLDAIPVEIVRKNEGGLLEH 204 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG----AIFENSPCRDREMQSISDRLACDG 233 + + ++ S + S YF E P + +++++ RL G Sbjct: 205 IGVCTDNGRFAYSARPLHDPSLSTSASLYFPQTDYPYTSELHPQQYAFIRTLASRLERGG 264 Query: 234 GTAIVIDY----GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILY 288 I + Y R TL + H +P G ADL++HV+F ++ Sbjct: 265 MIFIDYGFDAAQYYHPQRNQGTLIGHYRHHVIHNPFDFIGLADLTAHVNFTDIAQAGTDA 324 Query: 289 KLYINGLTTQGKFLEGLGIWQRAFSLMK-QTARKDILLDSVKRLVSTSADKKSMGELFKI 347 L + G Q FL LGI + K +A +V++L+ D+ MGELFK+ Sbjct: 325 GLDLTGYLPQSHFLLNLGITELLAQTGKTDSAAYIREAAAVQKLI----DQHEMGELFKV 380 Query: 348 LVVSHE-KVELMPF 360 + ++ F Sbjct: 381 IAFGKNIGIDWAGF 394 >gi|187927158|ref|YP_001897645.1| hypothetical protein Rpic_0048 [Ralstonia pickettii 12J] gi|309780141|ref|ZP_07674892.1| hypothetical protein HMPREF1004_01491 [Ralstonia sp. 5_7_47FAA] gi|187724048|gb|ACD25213.1| protein of unknown function DUF185 [Ralstonia pickettii 12J] gi|308920844|gb|EFP66490.1| hypothetical protein HMPREF1004_01491 [Ralstonia sp. 5_7_47FAA] Length = 397 Score = 235 bits (598), Expect = 1e-59, Method: Composition-based stats. Identities = 80/365 (21%), Positives = 143/365 (39%), Gaps = 26/365 (7%) Query: 16 NGQMTVDQYFALCVADPEFGYYSTCN-PFGAV----GDFVTAPEISQIFGEMLAIFLICA 70 G ++ D+Y L + P GYYS FG GDF+TAPE++ FG +A L Sbjct: 33 GGWLSFDRYMELALYAPGLGYYSGGAAKFGRRVEDGGDFITAPELTPFFGRTVAHQLAQV 92 Query: 71 WEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQL 130 + ++E G G G + DIL + L S +VE S L Q+++L Sbjct: 93 LQAL-PEGQRHVLEFGAGTGKLAADILIELDALSVRPDS---YGIVELSGELRQRQQERL 148 Query: 131 ASYGDKINWYTSLADVPLGFT--FLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLV 188 + G + D ++ NE D++P+ + +R + +D L Sbjct: 149 TALGPDLGALAQWHDTLPAPFTGVMIGNEVLDAMPVSLWARRGGMWHQRGVMLDAEHGLQ 208 Query: 189 FNIGDHEIKSNFLTCSDYF--LGAIFENSPCRDREMQSISDRLACDGGTAI-----VIDY 241 + + + + E+ + ++S L I +Y Sbjct: 209 WEDRLVDPSEVPAKLAALPGTDDFVTESHEAAEGFIRSTGAALERGLLLLIDYGFPAAEY 268 Query: 242 GYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKF 301 + G + + H + P PG D+++HVDF ++ L + G T+Q +F Sbjct: 269 YHAHRANGTLMCHYRQHAHDDPFWLPGLQDITAHVDFSGIAQAGQEAGLELLGYTSQARF 328 Query: 302 LEGLGIWQRAFSLMKQTA-RKDILLDSVKRLVSTSADKKSMGELFKILVVSH---EKVEL 357 L G+ + +L + ++V++L+S + MGELFK + + + L Sbjct: 329 LLSAGVGELLMTLDPSDPMQFLPAANAVQKLLS----EAEMGELFKAIALGKGIDAALPL 384 Query: 358 MPFVN 362 F + Sbjct: 385 TGFAD 389 >gi|254671184|emb|CBA08314.1| conserved hypothetical protein [Neisseria meningitidis alpha153] Length = 405 Score = 235 bits (598), Expect = 1e-59, Method: Composition-based stats. Identities = 93/376 (24%), Positives = 152/376 (40%), Gaps = 37/376 (9%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGE 61 KL I I K+G + ++ L + P++GYY+ + G GDF+TAP ++ +F + Sbjct: 39 KLQTLIAEKIGKHGNWIPFSRFMELVLYAPQYGYYTGGSHKIGNTGDFITAPTLTSLFAQ 98 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 LA L Q + E G G G + D+L I + Y++E S Sbjct: 99 TLARQLQELLSQT----AGNIYEFGAGTGQLAADLLGSISD------GISRYYIIEISPE 148 Query: 122 LTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 L QK + K+ T+L + G ++ NE D++P++ E G E Sbjct: 149 LAARQKNLIQARAPEASQKVVHLTALPEAFDG--IIIGNEVLDAMPVEIVRKNEGGSFEH 206 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG----AIFENSPCRDREMQSISDRLACDG 233 + +D ++ S YF E P + +++++ RL Sbjct: 207 VGVCLDNDRFTYSARPLHDLQLSALASLYFPQTDYPYTSELHPQQYAFIRTLASRLE--H 264 Query: 234 GTAIVIDY------GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAI 286 G I IDY Y R TL + H +P G ADL++HV+F ++ Sbjct: 265 GCMIFIDYGFDAAQYYHPQRNQGTLIGHYRHHVIHNPFDFIGLADLTAHVNFTDIAQAGT 324 Query: 287 LYKLYINGLTTQGKFLEGLGIWQRAFSLMK-QTARKDILLDSVKRLVSTSADKKSMGELF 345 L + G Q FL LGI + K +A +V++L+ D+ MGELF Sbjct: 325 DAGLDLIGYLPQSHFLLNLGITELLAQTGKTNSAAYIREAAAVQKLI----DQHEMGELF 380 Query: 346 KILVVSHE-KVELMPF 360 K++ ++ F Sbjct: 381 KVIAFGKNIGIDWAGF 396 >gi|207742028|ref|YP_002258420.1| hypothetical protein RSIPO_04965 [Ralstonia solanacearum IPO1609] gi|206593414|emb|CAQ60341.1| conserved hypothetical protein [Ralstonia solanacearum IPO1609] Length = 397 Score = 235 bits (598), Expect = 1e-59, Method: Composition-based stats. Identities = 89/378 (23%), Positives = 149/378 (39%), Gaps = 27/378 (7%) Query: 2 ENKLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAV----GDFVTAPEI 55 ++L IV+ I +G + ++Y L + P GYYS FG GDF+TAPE+ Sbjct: 18 SDRLFSTIVHAIEAADGWIPFERYMELALYAPGLGYYSGGAAKFGRRVEDGGDFITAPEL 77 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 + FG +A + + P ++E G G G + DIL + L S + Sbjct: 78 TPFFGRTVAHQIAQVLQAL-PPGQRHVLEFGAGTGRLAADILTELETLGMRPDS---YGI 133 Query: 116 VETSERLTLIQKKQLASYGDKINWYTSLAD--VPLGFTFLVANEFFDSLPIKQFVMTEHG 173 VE S L Q++ LA+ G + D +V NE D++P+ + Sbjct: 134 VELSGELRQRQQQALAALGPDLAGLARWHDRLPARFTGAMVGNEVLDAMPVSLWARRGGA 193 Query: 174 IRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYF--LGAIFENSPCRDREMQSISDRLAC 231 R + D L ++ + I E+ + ++S L Sbjct: 194 WHRRGVAFDAEHGLRWSERAAAPAEVPPKLAALPGREDFITESHEAAEGFIRSTGAALER 253 Query: 232 DGGTAI-----VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAI 286 I +Y + G + + H + P PG D+++HVDF ++ A Sbjct: 254 GLLLLIDYGFPAAEYYHAHRANGTLMCHYRQHAHDDPFWLPGLQDITAHVDFSGIARAAH 313 Query: 287 LYKLYINGLTTQGKFLEGLGIWQRAFSLM-KQTARKDILLDSVKRLVSTSADKKSMGELF 345 L + G +Q +FL G G+ Q +L R ++V++L+S + MGELF Sbjct: 314 EAGLEVLGYASQARFLLGAGVGQLLMTLDPADPVRFLPAANAVQKLLS----EAEMGELF 369 Query: 346 KILVVSH---EKVELMPF 360 K + + + L F Sbjct: 370 KAIALGRGIDAALPLAGF 387 >gi|83745953|ref|ZP_00943009.1| Hypothetical cytosolic protein [Ralstonia solanacearum UW551] gi|83727347|gb|EAP74469.1| Hypothetical cytosolic protein [Ralstonia solanacearum UW551] Length = 491 Score = 235 bits (598), Expect = 1e-59, Method: Composition-based stats. Identities = 89/378 (23%), Positives = 149/378 (39%), Gaps = 27/378 (7%) Query: 2 ENKLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAV----GDFVTAPEI 55 ++L IV+ I +G + ++Y L + P GYYS FG GDF+TAPE+ Sbjct: 112 SDRLFSTIVHAIEAADGWIPFERYMELALYAPGLGYYSGGAAKFGRRVEDGGDFITAPEL 171 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 + FG +A + + P ++E G G G + DIL + L S + Sbjct: 172 TPFFGRTVAHQIAQVLQAL-PPGQRHVLEFGAGTGRLAADILTELETLGMRPDS---YGI 227 Query: 116 VETSERLTLIQKKQLASYGDKINWYTSLAD--VPLGFTFLVANEFFDSLPIKQFVMTEHG 173 VE S L Q++ LA+ G + D +V NE D++P+ + Sbjct: 228 VELSGELRQRQQQALAALGPDLAGLARWHDRLPARFTGAMVGNEVLDAMPVSLWARRGGA 287 Query: 174 IRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYF--LGAIFENSPCRDREMQSISDRLAC 231 R + D L ++ + I E+ + ++S L Sbjct: 288 WHRRGVAFDAEHGLRWSERAAAPAEVPPKLAALPGREDFITESHEAAEGFIRSTGAALER 347 Query: 232 DGGTAI-----VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAI 286 I +Y + G + + H + P PG D+++HVDF ++ A Sbjct: 348 GLLLLIDYGFPAAEYYHAHRANGTLMCHYRQHAHDDPFWLPGLQDITAHVDFSGIARAAH 407 Query: 287 LYKLYINGLTTQGKFLEGLGIWQRAFSLM-KQTARKDILLDSVKRLVSTSADKKSMGELF 345 L + G +Q +FL G G+ Q +L R ++V++L+S + MGELF Sbjct: 408 EAGLEVLGYASQARFLLGAGVGQLLMTLDPADPVRFLPAANAVQKLLS----EAEMGELF 463 Query: 346 KILVVSH---EKVELMPF 360 K + + + L F Sbjct: 464 KAIALGRGIDAALPLAGF 481 >gi|319639026|ref|ZP_07993784.1| hypothetical protein HMPREF0604_01408 [Neisseria mucosa C102] gi|317399930|gb|EFV80593.1| hypothetical protein HMPREF0604_01408 [Neisseria mucosa C102] Length = 383 Score = 235 bits (598), Expect = 1e-59, Method: Composition-based stats. Identities = 80/373 (21%), Positives = 153/373 (41%), Gaps = 32/373 (8%) Query: 5 LIRKIVNLIKKN-GQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGEM 62 L + I N I+++ + ++ L + P++GYYS + G GDF+TAP +S +FG+ Sbjct: 19 LTKLIKNEIEQHRNWIPFSRFMELALYTPQYGYYSGGSHKIGTDGDFITAPTLSPLFGQT 78 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 LA L Q + E G G G + +L+ + + Y++E S L Sbjct: 79 LARQLTELLLQT----AGNIYEFGAGTGHLAATLLQNLSD------GLNHYYIIELSAEL 128 Query: 123 TLIQKKQLASYGDK-----INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 Q++ + + + T+L + G ++ NE D++P+++ + + G ++ Sbjct: 129 AERQRQHILEHTSPEAAAKVIHLTALPEHFDG--IIIGNEVLDAMPVERLIYQDEGFQQI 186 Query: 178 MIDIDQHD--SLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGT 235 + ++ V + E+ E + +Q+++ +L G Sbjct: 187 GVSLENDKLIEAVRPLAQTELIQTASLYFPPLPSYTSELHSAQYAFIQTLAAKLQRGGMI 246 Query: 236 AI-----VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKL 290 I Y + Q + G + + HT P N G DL++HV+F ++ L Sbjct: 247 FIDYGFDAAQYYHPQRKEGTFIGHYRHHTIHDPFFNIGLTDLTAHVNFTDIARAGTESGL 306 Query: 291 YINGLTTQGKFLEGLGIWQRAFSL-MKQTARKDILLDSVKRLVSTSADKKSMGELFKILV 349 + G Q FL LGI + + +V++L+ MGELFK++ Sbjct: 307 DLIGYLPQSYFLLNLGITDLLAQIGSPDSIEYIQAAAAVQKLIHQ----HEMGELFKVIA 362 Query: 350 VSHE-KVELMPFV 361 + ++ FV Sbjct: 363 FGKDIDIDWTGFV 375 >gi|261391923|emb|CAX49385.1| conserved hypothetical protein [Neisseria meningitidis 8013] Length = 405 Score = 235 bits (598), Expect = 1e-59, Method: Composition-based stats. Identities = 90/374 (24%), Positives = 150/374 (40%), Gaps = 33/374 (8%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGE 61 KL I I K+G + ++ L + P++GYY+ + G GDF+TAP ++ +F + Sbjct: 39 KLQTLIAEKIGKHGNWIPFSRFMELVLYAPQYGYYTGGSHKIGNTGDFITAPTLTSLFAQ 98 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 LA L Q + E G G G + D+L I + Y++E S Sbjct: 99 TLARQLQELLSQT----AGNIYEFGAGTGQLAADLLGSISD------GISRYYIIEISPE 148 Query: 122 LTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 L QK + K+ T+L + G ++ NE D++P++ E G E Sbjct: 149 LAARQKNLIQARAPEASQKVVHLTALPEAFDG--IIIGNEVLDAMPVEIVRKNEGGSFEH 206 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG----AIFENSPCRDREMQSISDRLACDG 233 + +D ++ S YF E P + +++++ RL G Sbjct: 207 VGVCLDNDRFTYSARPLHDLQLSALASLYFPQTDYPYTSELHPQQYAFIRTLASRLERGG 266 Query: 234 GTAIVIDY----GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILY 288 I + Y R TL + H +P G ADL++HV+F ++ Sbjct: 267 MIFIDYGFDAAQYYHPQRNQGTLIGHYRHHVIHNPFDFIGLADLTAHVNFTDIAQAGTDA 326 Query: 289 KLYINGLTTQGKFLEGLGIWQRAFSLMK-QTARKDILLDSVKRLVSTSADKKSMGELFKI 347 L + G Q FL LGI + K +A +V++L+ D+ MGELFK+ Sbjct: 327 GLDLIGYLPQSHFLLNLGITELLAQTGKTNSAAYIREAAAVQKLI----DQHEMGELFKV 382 Query: 348 LVVSHE-KVELMPF 360 + ++ F Sbjct: 383 IAFGKNIGIDWAGF 396 >gi|148706527|gb|EDL38474.1| RIKEN cDNA 2410091C18, isoform CRA_a [Mus musculus] Length = 393 Score = 235 bits (598), Expect = 1e-59, Method: Composition-based stats. Identities = 115/333 (34%), Positives = 167/333 (50%), Gaps = 26/333 (7%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P GYY + G GDF+T+PEISQIFGE+ Sbjct: 46 TPMLRHLMYKIKSTGPITVAEYMKEVLTNPAKGYYVHQDMLGEKGDFITSPEISQIFGEL 105 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFF-SVLSIYMVETSER 121 L ++ + W G +LVELGPGRG + DILRV +L +SI++VE S++ Sbjct: 106 LGVWFVSEWIASGKSPAFQLVELGPGRGTLTADILRVFSQLGSVLKTCAISIHLVEVSQK 165 Query: 122 LTLIQ--------------------KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDS 161 L+ IQ K + G I+WY L DVP G++ +A+EFFD Sbjct: 166 LSEIQALTLAEEKVPLERDAESLVYMKGVTKSGIPISWYRDLKDVPEGYSLYLAHEFFDV 225 Query: 162 LPIKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRD 219 LP+ +F T G RE +D+D D L F + + D E P Sbjct: 226 LPVHKFQKTPRGWREVFVDVDPQASDKLRFVLAPCATPAEAFIQRD-ERREHVEVCPDAG 284 Query: 220 REMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 +Q +S R+A GG A++ DYG+ ++ DTL+ GH L+ PG ADL++ VDF Sbjct: 285 VIIQELSQRIASTGGAALIADYGHDGTKT-DTLRGFYGHQLHDVLIAPGTADLTADVDFS 343 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAF 312 L +A K+ G Q FL+ +GI R Sbjct: 344 YLHRMA-QGKVASLGPVEQRTFLKNMGIDVRLK 375 >gi|26354925|dbj|BAC41089.1| unnamed protein product [Mus musculus] gi|82568951|gb|AAI08351.1| 2410091C18Rik protein [Mus musculus] Length = 383 Score = 235 bits (598), Expect = 1e-59, Method: Composition-based stats. Identities = 115/333 (34%), Positives = 167/333 (50%), Gaps = 26/333 (7%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P GYY + G GDF+T+PEISQIFGE+ Sbjct: 36 TPMLRHLMYKIKSTGPITVAEYMKEVLTNPAKGYYVHQDMLGEKGDFITSPEISQIFGEL 95 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFF-SVLSIYMVETSER 121 L ++ + W G +LVELGPGRG + DILRV +L +SI++VE S++ Sbjct: 96 LGVWFVSEWIASGKSPAFQLVELGPGRGTLTADILRVFSQLGSVLKTCAISIHLVEVSQK 155 Query: 122 LTLIQ--------------------KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDS 161 L+ IQ K + G I+WY L DVP G++ +A+EFFD Sbjct: 156 LSEIQALTLAEEKVPLERDAESLVYMKGVTKSGIPISWYRDLKDVPEGYSLYLAHEFFDV 215 Query: 162 LPIKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRD 219 LP+ +F T G RE +D+D D L F + + D E P Sbjct: 216 LPVHKFQKTPRGWREVFVDVDPQASDKLRFVLAPCATPAEAFIQRD-ERREHVEVCPDAG 274 Query: 220 REMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 +Q +S R+A GG A++ DYG+ ++ DTL+ GH L+ PG ADL++ VDF Sbjct: 275 VIIQELSQRIASTGGAALIADYGHDGTKT-DTLRGFYGHQLHDVLIAPGTADLTADVDFS 333 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAF 312 L +A K+ G Q FL+ +GI R Sbjct: 334 YLHRMA-QGKVASLGPVEQRTFLKNMGIDVRLK 365 >gi|253997960|ref|YP_003050023.1| hypothetical protein Msip34_0247 [Methylovorus sp. SIP3-4] gi|253984639|gb|ACT49496.1| protein of unknown function DUF185 [Methylovorus sp. SIP3-4] Length = 385 Score = 235 bits (598), Expect = 1e-59, Method: Composition-based stats. Identities = 94/376 (25%), Positives = 149/376 (39%), Gaps = 32/376 (8%) Query: 2 ENKLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIF 59 +L I I G + QY L + P GYYS FG GDFVTAPE+S +F Sbjct: 16 SQQLKLHIGRHIAEAGGWLDFAQYMDLVLYAPSLGYYSAGAKKFGPAGDFVTAPELSPLF 75 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 LA ++ELG G G + D+L + +L+ ++E S Sbjct: 76 ARTLATQAADIISAT----AGDVLELGAGSGRLAADLLLELDRLQQLPS---QYRILEIS 128 Query: 120 ERLTLIQKKQL-----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGI 174 L +QK L ++ W SL +V G ++ NE D+LP+ G+ Sbjct: 129 AYLRQVQKDYLQKVLPPHLMQRVEWLDSLPEVFSG--LVLGNEVLDALPVHILHQQADGL 186 Query: 175 RERMIDIDQHDSLVFNIGDH-EIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDG 233 +R + + + + + E + S++ L Sbjct: 187 LQRGVGLAPDGEFQWVDQPADPLIQAAFRETRLPEPYTTEICMAAGGLIASLASMLQR-- 244 Query: 234 GTAIVIDY------GYLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAI 286 G ++IDY Y R T + + H + P + PG D+++HVDF R++ A+ Sbjct: 245 GVVLLIDYGFPRHEYYHPQRQQGTLMCHYRHHAHTDPFLYPGLQDITAHVDFTRIAESAM 304 Query: 287 LYKLYINGLTTQGKFLEGLGIWQRAFSLMK-QTARKDILLDSVKRLVSTSADKKSMGELF 345 L + G +Q +FL GI + + A L S ++L+S MGELF Sbjct: 305 QQGLSVMGYASQAQFLINCGITECLAEVSPHDIAVYAPLASSAQKLLS----PAEMGELF 360 Query: 346 KILVVSHEKVELM-PF 360 K++ V E + F Sbjct: 361 KVIAVGRGVDEPLRGF 376 >gi|21064487|gb|AAM29473.1| RE41779p [Drosophila melanogaster] Length = 366 Score = 235 bits (598), Expect = 1e-59, Method: Composition-based stats. Identities = 111/372 (29%), Positives = 182/372 (48%), Gaps = 43/372 (11%) Query: 25 FALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVE 84 + +P+ GYY + FG GDF+T+PEISQIFGE++ I+L+ W + G PS +LVE Sbjct: 1 MREVLTNPQAGYYMNRDVFGREGDFITSPEISQIFGELVGIWLVSEWRKMGSPSPFQLVE 60 Query: 85 LGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYG---------- 134 LGPGRG + D+L+V+ K + S++MVE S L+ Q ++ Sbjct: 61 LGPGRGTLARDVLKVLTKF--KQDAEFSMHMVEVSPFLSKAQAQRFCYSHQTLPEDAQLP 118 Query: 135 ----------DKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQH 184 K W+ L DVP GF+ ++A+EFFD+LP+ + + + +E +ID+ Sbjct: 119 HYQEGTTASGTKAFWHRRLEDVPQGFSLVLAHEFFDALPVHKLQLVDGKWQEVLIDVASS 178 Query: 185 DSLVFNIGDHEIKSNFLTCSDYFLG------AIFENSPCRDREMQSISDRLACDGGTAIV 238 D + + + S + + E+S +R++ +++R+ DGG A++ Sbjct: 179 DGAQEASFRYVLSRSQTPVSSLYRPLPGETRSCLEHSLETERQVGLLAERIERDGGIALI 238 Query: 239 IDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYK-LYINGLTT 297 +DYG+ + DT +A K H PLV PG ADL++ VDF+ + IA ++ G Sbjct: 239 MDYGHFGEKT-DTFRAFKQHKLHDPLVEPGSADLTADVDFKLVRHIAETRGNVHCCGPVE 297 Query: 298 QGKFLEGLGIWQRAFSLMKQ--TARKDILLDSVKRLVSTSADKKSMGELFKILVVSH--- 352 QG FL+ + R L+ ++I+ + L D MG FK L + Sbjct: 298 QGLFLQRMQGEARLEQLLAHALPENQEIIRSGYEMLT----DPAQMGTRFKFLAMFPGVL 353 Query: 353 ----EKVELMPF 360 +K ++ F Sbjct: 354 AAHLDKYPVVGF 365 >gi|299532632|ref|ZP_07046020.1| hypothetical protein CTS44_17603 [Comamonas testosteroni S44] gi|298719267|gb|EFI60236.1| hypothetical protein CTS44_17603 [Comamonas testosteroni S44] Length = 361 Score = 234 bits (597), Expect = 1e-59, Method: Composition-based stats. Identities = 95/373 (25%), Positives = 147/373 (39%), Gaps = 36/373 (9%) Query: 5 LIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGA----VGDFVTAPEISQI 58 + +I I G + D++ L + P GYY+ FGA DFVTAPEIS I Sbjct: 1 MQSRIAKEIAAVGGWLPFDRFMELALYAPGLGYYANETAKFGAMPESGSDFVTAPEISPI 60 Query: 59 FGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVET 118 FG+++A L A ++ + E G G G + L IL + +V+ Sbjct: 61 FGQLVASQLREALQKTN---TREIWEFGAGTGALALQILDELAAQGALP---ERYTIVDL 114 Query: 119 SERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERM 178 S L QK LA Y + W +L + G ++ NE D++P++ T ER Sbjct: 115 SGTLRARQKLALAKYEHLVRWVDALPEAMEG--VIIGNEVLDAMPVQLLQRTAGQWHERG 172 Query: 179 IDIDQH-DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAI 237 + + D F D + + E + + ++ +RL G A Sbjct: 173 VVLGAGGDEAAFAWEDRPTELRPPVDIGGPHDFLTEIHRQGEAFIHTLGERLIR--GAAF 230 Query: 238 VIDY------GYLQSRVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKL 290 IDY Y + R TL H PLV G D+++HV+F + A Sbjct: 231 FIDYGFGESEYYHEQRHMGTLVCHYQHQVDNDPLVLVGLKDITAHVNFTGTAVAAQDAGF 290 Query: 291 YINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVV 350 + G T Q FL G+ + D + + + S + MGELFK++ + Sbjct: 291 DVLGYTNQAHFLINCGL----------APKLDAVGIKARSMASKLVMEHEMGELFKVVAL 340 Query: 351 SH--EKVELMPFV 361 S E + FV Sbjct: 341 SKGVEPWTPLGFV 353 >gi|161526141|ref|YP_001581153.1| hypothetical protein Bmul_2972 [Burkholderia multivorans ATCC 17616] gi|189349144|ref|YP_001944772.1| hypothetical protein BMULJ_00261 [Burkholderia multivorans ATCC 17616] gi|160343570|gb|ABX16656.1| protein of unknown function DUF185 [Burkholderia multivorans ATCC 17616] gi|189333166|dbj|BAG42236.1| conserved hypothetical protein [Burkholderia multivorans ATCC 17616] Length = 396 Score = 234 bits (597), Expect = 1e-59, Method: Composition-based stats. Identities = 87/371 (23%), Positives = 150/371 (40%), Gaps = 34/371 (9%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGAV----GDFVTAPEI 55 L ++ I G + ++ + P GYYS FG DFVTAPE+ Sbjct: 22 SETLAAQLRAEIAAAGGWLPFSRFMERALYAPGLGYYSGGARKFGRRADDGSDFVTAPEL 81 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F + LA + A G ++E G G G + +L + L L + Sbjct: 82 SPLFAQTLAQPVADALAASG---TRSVMEFGAGTGKLAAGLLAALDALGAALDEYL---I 135 Query: 116 VETSERLTLIQKKQLASYGDK----INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 V+ S L Q+ +A+ + W +L + G ++ NE D++P++ F Sbjct: 136 VDLSGELRARQRDTIAAAAPTLAAKVRWLDALPERFEG--VVIGNEVLDAMPVRLFAKAG 193 Query: 172 HGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRL 229 RER + +D + VF+ + D G + E +++ L Sbjct: 194 DAWRERGVALDAQQAFVFDDRAVAPADVPPALAGLDVDDGYVTETHEAALAFTRTVCTML 253 Query: 230 ACDGGTAIVIDY------GYLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHVDFQRLS 282 A G ++IDY Y R T + + H + P + PG D+++HV+F + Sbjct: 254 AR--GAVLLIDYGFPAHEYYHPQRDRGTLMCHYRHHAHDDPFLYPGLQDITAHVEFTGIY 311 Query: 283 SIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQT-ARKDILLDSVKRLVSTSADKKSM 341 I + G T+Q +FL GI ++ + ++V++L+S + M Sbjct: 312 EAGIAAGADLLGYTSQARFLLNAGITDALAAIDPSDVTQFLPAANAVQKLIS----EAEM 367 Query: 342 GELFKILVVSH 352 GELFK++ S Sbjct: 368 GELFKVIAFSR 378 >gi|254673391|emb|CBA08695.1| conserved hypothetical protein [Neisseria meningitidis alpha275] gi|319411062|emb|CBY91462.1| conserved hypothetical protein [Neisseria meningitidis WUE 2594] Length = 405 Score = 234 bits (597), Expect = 1e-59, Method: Composition-based stats. Identities = 90/374 (24%), Positives = 150/374 (40%), Gaps = 33/374 (8%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGE 61 KL I I K+G + ++ L + P++GYY+ + G GDF+TAP ++ +F + Sbjct: 39 KLQTLIAEKIGKHGNWIPFSRFMELVLYAPQYGYYTGGSHKIGNTGDFITAPTLTSLFAQ 98 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 LA L Q + E G G G + D+L I + Y++E S Sbjct: 99 TLARQLQELLSQT----AGNIYEFGAGTGQLAADLLGSISD------GISRYYIIEISPE 148 Query: 122 LTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 L QK + K+ T+L + G ++ NE D++P++ E G E Sbjct: 149 LAARQKNLIQARAPEASQKVVHLTALPEAFDG--IIIGNEVLDAMPVEIVRKNEGGSFEH 206 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG----AIFENSPCRDREMQSISDRLACDG 233 + +D ++ S YF E P + +++++ RL G Sbjct: 207 VGVCLDNDRFTYSARPLHDLQLSALASLYFPQTDYPYTSELHPQQYAFIRTLASRLERGG 266 Query: 234 GTAIVIDY----GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILY 288 I + Y R TL + H +P G ADL++HV+F ++ Sbjct: 267 MIFIDYGFDAAQYYHPQRNQGTLIGHYRHHVIHNPFDFIGLADLTAHVNFTDIAQAGTDA 326 Query: 289 KLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKD-ILLDSVKRLVSTSADKKSMGELFKI 347 L + G Q FL LGI + + K + +V++L+ D+ MGELFK+ Sbjct: 327 GLDLTGYLPQSHFLLNLGITELLAQVGKTDSPAYIREAAAVQKLI----DQHEMGELFKV 382 Query: 348 LVVSHE-KVELMPF 360 + V+ F Sbjct: 383 IAFGKNIGVDWAGF 396 >gi|148263016|ref|YP_001229722.1| hypothetical protein Gura_0943 [Geobacter uraniireducens Rf4] gi|146396516|gb|ABQ25149.1| protein of unknown function DUF185 [Geobacter uraniireducens Rf4] Length = 385 Score = 234 bits (597), Expect = 1e-59, Method: Composition-based stats. Identities = 91/362 (25%), Positives = 153/362 (42%), Gaps = 12/362 (3%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGE 61 L +++ I G++T + A C+ +P GYY++ GA GDF T+ + +FG Sbjct: 7 TPLKTILLDRILSKGRITFADFMAACLYEPGLGYYTSPGRKVGAEGDFYTSMNVHLVFGR 66 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 ++A + WE G P ++E G G G + DIL I + F+ L+ +VE Sbjct: 67 LVAREICRMWESMGSPGRFDIIEAGAGAGQLAKDILDTIAGINLPFYDTLTYCLVEKEPT 126 Query: 122 LTLIQKKQLASYGDKINWYTSLADVPLGFTF---LVANEFFDSLPIKQFVMTEHGIRERM 178 L QK +LA + +++W+T G TF L++NE D++P+ MT G+ E Sbjct: 127 LKEAQKAKLADHLARLDWHTPEELAEGGSTFSGCLLSNELIDAMPVHLVEMTPAGLMEVY 186 Query: 179 IDI--DQHDSLVFNIGDHEIKSNFLTCS-DYFLGAIFENSPCRDREMQSISDRLACDGGT 235 + + ++ E+ + F G E S +++ + + Sbjct: 187 VTALDGEFGEMLDEPSTPELAAYLKRLGISLFAGQRAEISLAAIGWLEAAAKAVEKGFVL 246 Query: 236 AIVIDYG----YLQSRVGDTLQAVKGH-TYVSPLVNPGQADLSSHVDFQRLSSIAILYKL 290 I Y Y R TL H T SP + G D++SHV+F L L Sbjct: 247 TIDYGYPADELYAPMRKNGTLLCYYQHTTEESPYIRVGLQDITSHVNFTALMERGEELGL 306 Query: 291 YINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVV 350 + Q +FL G G+ + +L K + L+ + L MG+ FK+L+ Sbjct: 307 HTVWFGEQYRFLLGAGLMEEMLALEKSGVSETELIKNRLALKKLMLPDGGMGDTFKVLIQ 366 Query: 351 SH 352 + Sbjct: 367 AK 368 >gi|294670596|ref|ZP_06735475.1| hypothetical protein NEIELOOT_02321 [Neisseria elongata subsp. glycolytica ATCC 29315] gi|291307721|gb|EFE48964.1| hypothetical protein NEIELOOT_02321 [Neisseria elongata subsp. glycolytica ATCC 29315] Length = 435 Score = 234 bits (597), Expect = 2e-59, Method: Composition-based stats. Identities = 83/365 (22%), Positives = 141/365 (38%), Gaps = 27/365 (7%) Query: 2 ENKLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIF 59 KL I N IK G + ++ L + P +GYY+ + GA GDFVTAP IS +F Sbjct: 68 SEKLEHIIHNEIKTSGGWLPFSRFMELALYVPRYGYYTGGAHKIGAEGDFVTAPVISPLF 127 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 G+ LA + Q + E G G G + +L + + ++E S Sbjct: 128 GQALARQIDFLLPQT----AGIVYEFGAGTGELAATLLNSLSNDNLKTY-----CIIELS 178 Query: 120 ERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIR 175 L Q + K+ ++L D G ++ NE D++P + +E Sbjct: 179 AELAERQLAHIKQTAPEAAHKVRHLSALPDSFDG--IIIGNEVLDAMPCELVRRSEGRFL 236 Query: 176 ERMIDIDQHDS--LVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDG 233 + + I+ + + + D + + E P + +++++D+L Sbjct: 237 QIGVGIENGELSLIPHPLSDTSLIDAAHSYFPDTEPYTSELHPAQYAFIRTLADKLQRGA 296 Query: 234 GTAI-----VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILY 288 I +Y + Q +G + + HT P N G DL++HV+F ++ Sbjct: 297 VIMIDYGFDAAEYYHPQRHMGTLIGHYRHHTVHDPFFNIGLTDLTAHVNFTDIAQAGTDG 356 Query: 289 KLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKIL 348 L + G TTQ FL LGI + + A + MGELFK++ Sbjct: 357 GLNLIGYTTQADFLLNLGITEL---IPPDIAHNSPAYIQTASALHKLLMPHEMGELFKVI 413 Query: 349 VVSHE 353 Sbjct: 414 AFGRN 418 >gi|296313571|ref|ZP_06863512.1| conserved hypothetical protein [Neisseria polysaccharea ATCC 43768] gi|296839872|gb|EFH23810.1| conserved hypothetical protein [Neisseria polysaccharea ATCC 43768] Length = 382 Score = 234 bits (597), Expect = 2e-59, Method: Composition-based stats. Identities = 86/376 (22%), Positives = 151/376 (40%), Gaps = 33/376 (8%) Query: 2 ENKLIRKIVNLIKKN-GQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIF 59 L + IKK+ + ++ L + P++GYY+ + G GDF+TAP ++ +F Sbjct: 14 SANLQTLLAKEIKKHRNWIPFSRFMELVLYTPQYGYYTGGSHKIGNNGDFITAPTLTPLF 73 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 + LA L Q + E G G G + D+L I + Y++E S Sbjct: 74 AQTLARQLQELLPQT----AGNIYEFGAGTGQLAADLLGSISD------GINRYYIIEIS 123 Query: 120 ERLTLIQKKQLASYGDK----INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIR 175 L QK + + + I ++L + G ++ NE D++P++ E G Sbjct: 124 PELAARQKDLIQTLAPQAAQKIVHLSALPETFDG--IIIGNEVLDAMPVEIIRKNEGGSF 181 Query: 176 ERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG----AIFENSPCRDREMQSISDRLAC 231 E + + ++ S + S YF E P + +++++ RL Sbjct: 182 EHVGVCLDNGRFAYSAKPLHDPSLSTSASLYFPQTDYPYTSELHPQQYAFIRTLASRLKR 241 Query: 232 DGGTAIVIDY----GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAI 286 G I + Y R TL + H +P G ADL++HV+F ++ Sbjct: 242 GGMIFIDYGFDVAQYYHPQRNQGTLIGHYRHHVIHNPFDFIGLADLTAHVNFTDIAQAGT 301 Query: 287 LYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKD-ILLDSVKRLVSTSADKKSMGELF 345 L + G Q FL LGI + + K + +V++L+ D+ MGELF Sbjct: 302 DAGLDLTGYLPQSHFLLNLGIIELLTQVGKTDSPAYIREAAAVQKLI----DQHEMGELF 357 Query: 346 KILVVSHE-KVELMPF 360 K++ ++ F Sbjct: 358 KVIAFGKNIGIDWAGF 373 >gi|268595428|ref|ZP_06129595.1| conserved hypothetical protein [Neisseria gonorrhoeae 35/02] gi|268597234|ref|ZP_06131401.1| conserved hypothetical protein [Neisseria gonorrhoeae FA19] gi|291043141|ref|ZP_06568864.1| conserved hypothetical protein [Neisseria gonorrhoeae DGI2] gi|293398477|ref|ZP_06642655.1| hypothetical protein NGNG_01133 [Neisseria gonorrhoeae F62] gi|268548817|gb|EEZ44235.1| conserved hypothetical protein [Neisseria gonorrhoeae 35/02] gi|268551022|gb|EEZ46041.1| conserved hypothetical protein [Neisseria gonorrhoeae FA19] gi|291012747|gb|EFE04730.1| conserved hypothetical protein [Neisseria gonorrhoeae DGI2] gi|291610948|gb|EFF40045.1| hypothetical protein NGNG_01133 [Neisseria gonorrhoeae F62] Length = 406 Score = 234 bits (597), Expect = 2e-59, Method: Composition-based stats. Identities = 89/374 (23%), Positives = 151/374 (40%), Gaps = 33/374 (8%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGE 61 L I I K+G + ++ L + P++GYY+ + G GDF+TAP ++ +F + Sbjct: 28 NLQTLIAEEIGKHGNWIPFSRFMELVLYAPQYGYYTGGSHKIGNTGDFITAPTLTPLFAQ 87 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 LA L Q + E G G G + D+ L S+ Y++E S Sbjct: 88 TLARQLQELLPQT----AGNIYEFGAGTGQLAADL------LGSVSDSINCYYIIEISPE 137 Query: 122 LTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 L QK + K+ T+L + G ++ NE D++P++ E G+ E Sbjct: 138 LAARQKNLIQARAPEASQKVVHLTALPEAFDG--IIIGNELLDAIPVEIVRKNEGGLLEH 195 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG----AIFENSPCRDREMQSISDRLACDG 233 + + ++ S + S YF E P + +++++ RL G Sbjct: 196 IGVCTDNGRFAYSARPLHDPSLSTSASLYFPQTDYPYTSELHPQQYAFIRTLASRLERGG 255 Query: 234 GTAIVIDY----GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILY 288 I + Y R TL + H +P G ADL++HV+F ++ Sbjct: 256 MIFIDYGFDAAQYYHPQRNQGTLIGHYRHHVIHNPFDFIGLADLTAHVNFTDIAQAGTDA 315 Query: 289 KLYINGLTTQGKFLEGLGIWQRAFSLMK-QTARKDILLDSVKRLVSTSADKKSMGELFKI 347 L + G Q FL LGI + K +A +V++L+ D+ MGELFK+ Sbjct: 316 GLDLTGYLPQSHFLLNLGITELLAQTGKTDSAAYIREAAAVQKLI----DQHEMGELFKV 371 Query: 348 LVVSHE-KVELMPF 360 + ++ F Sbjct: 372 IAFGKNIGIDWAGF 385 >gi|254410642|ref|ZP_05024421.1| conserved hypothetical protein [Microcoleus chthonoplastes PCC 7420] gi|196182848|gb|EDX77833.1| conserved hypothetical protein [Microcoleus chthonoplastes PCC 7420] Length = 409 Score = 234 bits (596), Expect = 2e-59, Method: Composition-based stats. Identities = 97/387 (25%), Positives = 163/387 (42%), Gaps = 39/387 (10%) Query: 2 ENKLIRKIVNLIKK--NGQMTVDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEISQI 58 + L I + I N +++ +Y L + P+ GYY+T G+ GDF T+ + Sbjct: 7 DKSLRHLIADTIVATPNRRISFAEYMDLVLYHPQQGYYATGAVNIGSEGDFFTSSHLGHD 66 Query: 59 FGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVET 118 FGE+LA WE +P LVE+G G+G++ D+L + + PD F+ L +VE Sbjct: 67 FGELLAEQFAQIWEILEYPEPFTLVEMGAGQGLVAADVLNYLYRQYPDCFAALDYIIVEK 126 Query: 119 SERLTLIQKKQLASYGDK--INWYTSLADVPLGFTF--LVANEFFDSLPIKQFVMTEHGI 174 + L Q++ L + + ++P +NE D+ P+ Q V+ + + Sbjct: 127 AAGLISKQQQVLTQLNLPGLPLRWCTFDEIPDNSIIGCCFSNELVDAFPVHQVVLEKGKL 186 Query: 175 RERMIDIDQHDSLVFNIGDHEIKSNFLTCSD-------------YFLGAIFENSPCRDRE 221 RE + + + G+ S+ ++ Y G E + Sbjct: 187 REVYVTTVTTEGDEIHFGEVTNDSSTPQLNEYFQWIGIDLYSGAYPDGYRTEVNLAALDW 246 Query: 222 MQSISDRLACDGGTAIVIDYGYLQSRV------GDTLQAVKGHTYVS-PLVNPGQADLSS 274 M++I+++L G + IDYGY R TL+ H + P VN GQ D+++ Sbjct: 247 METITNKLQR--GFLLTIDYGYSADRYYLPVRHQGTLKCYYRHRHHDNPYVNVGQQDITA 304 Query: 275 HVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLM-------KQTARKDILLDS 327 HV+F L L L LT QG FL LG+ R +L + A ++ Sbjct: 305 HVNFTALERQGELSGLQTVELTKQGLFLMALGLGDRISALSGLGSHQGQAMAGAQDVIRI 364 Query: 328 VKR--LVSTSADKKSMGELFKILVVSH 352 ++R + D +G F +LV Sbjct: 365 MQRRDALHQLIDPTGLGG-FTVLVQCK 390 >gi|323527656|ref|YP_004229809.1| hypothetical protein BC1001_3335 [Burkholderia sp. CCGE1001] gi|323384658|gb|ADX56749.1| protein of unknown function DUF185 [Burkholderia sp. CCGE1001] Length = 396 Score = 234 bits (596), Expect = 2e-59, Method: Composition-based stats. Identities = 95/370 (25%), Positives = 151/370 (40%), Gaps = 32/370 (8%) Query: 2 ENKLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGD----FVTAPEI 55 L+ +I + +G M D+Y + P GYYS FG GD FVTAPE+ Sbjct: 22 SEALVARIRAELDDADGWMPFDRYMERALYAPGLGYYSGGARKFGLRGDDGSDFVTAPEL 81 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F LA + A + G ++E G G G + +L + L +F S + Sbjct: 82 SPLFAATLARPIAEALQASG---TRDVMEFGAGTGKLAAGLLNALAALGAEFDS---YSI 135 Query: 116 VETSERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 V+ S L Q++ + + K+ W +L + G ++ NE D++P++ F T+ Sbjct: 136 VDLSGELRERQRETIDAAAPALAGKVRWLDALPERFEG--VVIGNEVLDAMPVRLFACTD 193 Query: 172 HGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFL---GAIFENSPCRDREMQSISDR 228 RER I + F + SD I E ++I Sbjct: 194 GAWRERG-VIWRDGRFAFEDRPVAASAELALLSDIDTAGGDYIAETHDAARAFTRTICTM 252 Query: 229 LACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSS 283 LA I + Y R TL H P V PG D+++HV+F ++ Sbjct: 253 LARGAAFFIDYGFPRHEYYHPQRTQGTLMCHYRHRAHGDPFVYPGLQDITAHVEFTGIAE 312 Query: 284 IAILYKLYINGLTTQGKFLEGLGIWQ-RAFSLMKQTARKDILLDSVKRLVSTSADKKSMG 342 + + G T+Q +FL G+ A TAR ++V++L+S + MG Sbjct: 313 AGVETGADLLGFTSQARFLLNAGMTDALAEIDPADTARFLPAANAVQKLLS----EAEMG 368 Query: 343 ELFKILVVSH 352 ELFK++ S Sbjct: 369 ELFKVIAFSR 378 >gi|294341578|emb|CAZ89995.1| conserved hypothetical protein [Thiomonas sp. 3As] Length = 397 Score = 234 bits (596), Expect = 2e-59, Method: Composition-based stats. Identities = 100/384 (26%), Positives = 154/384 (40%), Gaps = 44/384 (11%) Query: 2 ENKLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTCN----PFGAVGDFVTAPEIS 56 +L+ +I + G + D Y + P GYY+ FG+ DFVTAPE+S Sbjct: 6 SAQLLAQIRAALHAGGGWLPFDAYMQQALYAPGLGYYTGQAGQFGDFGSDSDFVTAPELS 65 Query: 57 QIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMV 116 +FG LA + +Q +VE G G G + IL + L + +V Sbjct: 66 PLFGRTLAAQVAQVLQQGDL---HTVVEFGAGSGRLAAQILGELDHLGC---APRHYAIV 119 Query: 117 ETSERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEH 172 E S L Q + L D++ W+T+L + ++NE D++P+K E Sbjct: 120 EVSGALKHRQMQTLRSAVPHLFDRVQWWTALPETFE--AVAISNEVLDAMPVKLLHRHEG 177 Query: 173 GIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACD 232 G ER + + D LVF + +G + E P +++++DRLA Sbjct: 178 GWMERGVAQEGDD-LVFADRATALLPPAPEAHALPIGTVTEIHPQALAFVRTLADRLAR- 235 Query: 233 GGTAIVIDY------GYLQSRVGDTLQAVKGHTY-VSPLVNPGQADLSSHVDFQRLSSIA 285 G A+ IDY Y R TLQA H L+ PG AD+++HVDF ++ A Sbjct: 236 -GAALFIDYGFPQREYYHPQRHMGTLQAHYRHRVLDDVLLWPGLADITAHVDFTAIALAA 294 Query: 286 ILYKLYINGLTTQGKFLEGLGIWQR-AFSLMKQTARKD----------------ILLDSV 328 L + G T+Q FL G+ A L Sbjct: 295 QDAGLDVLGYTSQASFLMNCGLLDLVAAELAAGPPPVAPSSAAALCAATTAPGGSHYLRQ 354 Query: 329 KRLVSTSADKKSMGELFKILVVSH 352 V+ ++ MGELFK++ + Sbjct: 355 TAAVNKLLNESEMGELFKVIALGR 378 >gi|170701979|ref|ZP_02892900.1| protein of unknown function DUF185 [Burkholderia ambifaria IOP40-10] gi|170133100|gb|EDT01507.1| protein of unknown function DUF185 [Burkholderia ambifaria IOP40-10] Length = 412 Score = 234 bits (596), Expect = 2e-59, Method: Composition-based stats. Identities = 87/371 (23%), Positives = 154/371 (41%), Gaps = 34/371 (9%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGAV----GDFVTAPEI 55 L ++ + I G + D++ + P GYYS FG DFVTAPE+ Sbjct: 38 SETLAAQLRDEIAAVGGWLPFDRFMERALYAPGLGYYSGGARKFGRRADDGSDFVTAPEL 97 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F + LA + A G R++E G G G + +L + L + L + Sbjct: 98 SPLFAQTLANPVADALVASG---TRRVMEFGAGTGKLAAGLLAALAALDVELDEYL---I 151 Query: 116 VETSERLTLIQKKQLASYGDK----INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 V+ S L Q+ +A+ + W +L + G +V NE D++P++ F + Sbjct: 152 VDLSGELRERQRDTIAAAAPALAGKVRWLDALPERFDG--VVVGNEVLDAMPVRLFAKSG 209 Query: 172 HGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRL 229 ER + +D + VF+ + D G + E +++ L Sbjct: 210 GAWLERGVALDARHAFVFDDRPVGAPGLPPVLATLDVDDGYVTETHEAALAFTRTVCTML 269 Query: 230 ACDGGTAIVIDY------GYLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHVDFQRLS 282 A G +++DY Y R G T + + H + + PG D+++HV+F + Sbjct: 270 AR--GAILLVDYGFPAHEYYHPQRGGGTLMCHYRHHAHDDAFLYPGLQDITAHVEFTGIY 327 Query: 283 SIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARK-DILLDSVKRLVSTSADKKSM 341 + + G T+Q +FL GI ++ + ++V++L+S + M Sbjct: 328 DAGVGTGADLLGYTSQARFLLNAGITDALAAIDPSDIHQFLPAANAVQKLIS----EAEM 383 Query: 342 GELFKILVVSH 352 GELFK++ S Sbjct: 384 GELFKVIAFSR 394 >gi|115353075|ref|YP_774914.1| hypothetical protein Bamb_3024 [Burkholderia ambifaria AMMD] gi|115283063|gb|ABI88580.1| protein of unknown function DUF185 [Burkholderia ambifaria AMMD] Length = 396 Score = 234 bits (596), Expect = 2e-59, Method: Composition-based stats. Identities = 86/371 (23%), Positives = 153/371 (41%), Gaps = 34/371 (9%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGAV----GDFVTAPEI 55 L ++ + I G + D++ + P GYYS FG DFVTAPE+ Sbjct: 22 SETLAAQLRDEIAAAGGWLPFDRFMERALYAPGLGYYSGGARKFGRRADDGSDFVTAPEL 81 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F + LA + A G R++E G G G + +L + L + L + Sbjct: 82 SPLFAQTLANPVADALAASG---TRRVMEFGAGTGKLAAGLLAALDALDVELDEYL---I 135 Query: 116 VETSERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 V+ S L Q+ + + K+ W +L + G +V NE D++P++ F + Sbjct: 136 VDLSGELRERQRDTIATAVPALAGKVRWLDALPERFDG--VVVGNEVLDAMPVRLFAKSG 193 Query: 172 HGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRL 229 ER + +D + VF+ + D G + E +++ L Sbjct: 194 GAWLERGVALDARHAFVFDDRPVGAAGLPAVLATLDVDDGYVTETHEAALAFTRTVCTML 253 Query: 230 ACDGGTAIVIDY------GYLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHVDFQRLS 282 A G +++DY Y R T + + H + + PG D+++HV+F + Sbjct: 254 AR--GAVLLVDYGFPAHEYYHPQRDRGTLMCHYRHHAHDDAFLYPGLQDITAHVEFTGIY 311 Query: 283 SIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARK-DILLDSVKRLVSTSADKKSM 341 + + G T+Q +FL GI ++ + ++V++L+S + M Sbjct: 312 DAGVGTGADLLGYTSQARFLLNAGITDALAAIDPSDVHQFLPAANAVQKLIS----EAEM 367 Query: 342 GELFKILVVSH 352 GELFK++ S Sbjct: 368 GELFKVIAFSR 378 >gi|325134876|gb|EGC57509.1| hypothetical protein NMBM13399_0410 [Neisseria meningitidis M13399] Length = 382 Score = 233 bits (595), Expect = 2e-59, Method: Composition-based stats. Identities = 93/376 (24%), Positives = 152/376 (40%), Gaps = 37/376 (9%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGE 61 KL I I K+G + ++ L + P++GYY+ + G GDF+TAP ++ +F + Sbjct: 16 KLQTLIAEKIGKHGNWIPFSRFMELVLYAPQYGYYTGGSHKIGNTGDFITAPTLTSLFAQ 75 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 LA L Q + E G G G + D+L I + Y++E S Sbjct: 76 TLARQLQELLSQT----AGNIYEFGAGTGQLAADLLGSISD------GISRYYIIEISPE 125 Query: 122 LTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 L QK + K+ T+L + G ++ NE D++P++ E G E Sbjct: 126 LAARQKNLIQARAPEASQKVVHLTALPEAFDG--IIIGNEVLDAMPVEIVRKNEGGSFEH 183 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG----AIFENSPCRDREMQSISDRLACDG 233 + +D ++ S YF E P + +++++ RL Sbjct: 184 VGVCLDNDRFTYSARPLHDLQLSALASLYFPQTDYPYTSELHPQQYAFIRTLASRLE--H 241 Query: 234 GTAIVIDY------GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAI 286 G I IDY Y R TL + H +P G ADL++HV+F ++ Sbjct: 242 GCMIFIDYGFDAAQYYHPQRNQGTLIGHYRHHVIHNPFDFIGLADLTAHVNFTDIAQAGT 301 Query: 287 LYKLYINGLTTQGKFLEGLGIWQRAFSLMK-QTARKDILLDSVKRLVSTSADKKSMGELF 345 L + G Q FL LGI + K +A +V++L+ D+ MGELF Sbjct: 302 DAGLDLIGYLPQSHFLLNLGITELLAQTGKTNSAAYIREAAAVQKLI----DQHEMGELF 357 Query: 346 KILVVSHE-KVELMPF 360 K++ ++ F Sbjct: 358 KVIAFGKNIGIDWAGF 373 >gi|325142964|gb|EGC65321.1| hypothetical protein NMB9615945_0454 [Neisseria meningitidis 961-5945] gi|325208775|gb|ADZ04227.1| conserved hypothetical protein [Neisseria meningitidis NZ-05/33] Length = 382 Score = 233 bits (595), Expect = 3e-59, Method: Composition-based stats. Identities = 90/374 (24%), Positives = 150/374 (40%), Gaps = 33/374 (8%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGE 61 KL I I K+G + ++ L + P++GYY+ + G GDF+TAP ++ +F + Sbjct: 16 KLQTLIAEKIGKHGNWIPFSRFMELVLYAPQYGYYTGGSHKIGNTGDFITAPTLTSLFAQ 75 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 LA L Q + E G G G + D+L I + Y++E S Sbjct: 76 TLARQLQELLSQT----AGNIYEFGAGTGQLAADLLGSISD------GISRYYIIEISPE 125 Query: 122 LTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 L QK + K+ T+L + G ++ NE D++P++ E G E Sbjct: 126 LAARQKNLIQARAPEASQKVVHLTALPEAFDG--IIIGNEVLDAMPVEIVRKNEGGSFEH 183 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG----AIFENSPCRDREMQSISDRLACDG 233 + +D ++ S YF E P + +++++ RL G Sbjct: 184 VGVCLDNDRFTYSARPLHDLQLSALASLYFPQTDYPYTSELHPQQYAFIRTLASRLERGG 243 Query: 234 GTAIVIDY----GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILY 288 I + Y R TL + H +P G ADL++HV+F ++ Sbjct: 244 MIFIDYGFDAAQYYHPQRNQGTLIGHYRHHVIHNPFDFIGLADLTAHVNFTDIAQAGTDA 303 Query: 289 KLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKD-ILLDSVKRLVSTSADKKSMGELFKI 347 L + G Q FL LGI + + K + +V++L+ D+ MGELFK+ Sbjct: 304 GLDLTGYLPQSHFLLNLGITELLAQVGKTDSPAYIREAAAVQKLI----DQHEMGELFKV 359 Query: 348 LVVSHE-KVELMPF 360 + V+ F Sbjct: 360 IAFGKNIGVDWAGF 373 >gi|239615053|gb|EEQ92040.1| DUF185 domain-containing protein [Ajellomyces dermatitidis ER-3] Length = 512 Score = 233 bits (595), Expect = 3e-59, Method: Composition-based stats. Identities = 118/461 (25%), Positives = 181/461 (39%), Gaps = 104/461 (22%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNP-------FGAVGDFVTAPE 54 L + + I G +++ Y C+ P+ GYY++ FG GDFVT+PE Sbjct: 48 STPLAKSLGEAISVTGPVSIAAYMRQCLTSPDGGYYTSRGQEAEDTELFGTKGDFVTSPE 107 Query: 55 ISQIFGEMLAIFLICAWEQHGFPS-CVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSI 113 ISQIFGE+L I+ + W G S V+++E GPG+G +M D+LR K ++ ++ Sbjct: 108 ISQIFGELLGIWTVAEWMGQGRKSGGVQIIEFGPGKGTLMGDMLRCFRNFKSFASTIEAV 167 Query: 114 YMVETSERLTLIQKKQLASYGD--------------------KINWYTSLADVPLGFTFL 153 Y+VE S L +Q+K L + L D P F+ Sbjct: 168 YLVEASPVLREVQRKLLCGDAPMEEVEAGYKSKSIHLGVPIVWAEHISFLPDEPDKTPFI 227 Query: 154 VANEFFDSLPIKQFV--------------------------MTEHGIRERMIDIDQHDSL 187 A+EFFD+LPI F + + R + + + Sbjct: 228 FAHEFFDALPIHAFQSVEVPSQPQTINSPTGPITLHQSSAPSSSTTTQWRELVVSPNPET 287 Query: 188 VFNIGDHEIKSNFLTCS-------------------DYFLGAIFENSPCRDREMQSISDR 228 E + G+ E SP +Q I+ R Sbjct: 288 PEVKSSKEPEFRLSLAKASTPSSLILPEMSPRYKALKSTPGSTIEISPESQTCVQDIAKR 347 Query: 229 L------------------ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQA 270 + G A+++DYG + ++L+ ++ H VSP PGQ Sbjct: 348 IGGAFTSPSSPAATDAKKNKVPSGAALILDYGTTSTIPINSLRGIRKHQLVSPFAAPGQV 407 Query: 271 DLSSHVDFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLMKQ------TARKD 322 D+S+ VDF L+ AI + + G T QG FLE LGI +RA L+K+ ++ Sbjct: 408 DVSADVDFTALAEAAIDASPGVEVYGPTEQGAFLEALGISERAAQLLKKVEGEGDEEKRK 467 Query: 323 ILLDSVKRLVSTSADKKSMGELFKILVVSHE---KVELMPF 360 + KRLV MG L+K L + E K + F Sbjct: 468 RIESGWKRLVERGG--GGMGRLYKALAIVPESGGKRRPVGF 506 >gi|261192102|ref|XP_002622458.1| DUF185 domain-containing protein [Ajellomyces dermatitidis SLH14081] gi|239589333|gb|EEQ71976.1| DUF185 domain-containing protein [Ajellomyces dermatitidis SLH14081] Length = 512 Score = 233 bits (595), Expect = 3e-59, Method: Composition-based stats. Identities = 118/461 (25%), Positives = 181/461 (39%), Gaps = 104/461 (22%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNP-------FGAVGDFVTAPE 54 L + + I G +++ Y C+ P+ GYY++ FG GDFVT+PE Sbjct: 48 STPLAKSLGEAISVTGPVSIAAYMRQCLTSPDGGYYTSRGQEAEDTELFGTKGDFVTSPE 107 Query: 55 ISQIFGEMLAIFLICAWEQHGFPS-CVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSI 113 ISQIFGE+L I+ + W G S V+++E GPG+G +M D+LR K ++ ++ Sbjct: 108 ISQIFGELLGIWTVAEWMGQGRKSGGVQIIEFGPGKGTLMGDMLRCFRNFKSFASTIEAV 167 Query: 114 YMVETSERLTLIQKKQLASYGD--------------------KINWYTSLADVPLGFTFL 153 Y+VE S L +Q+K L + L D P F+ Sbjct: 168 YLVEASPVLREVQRKLLCGDAPMEEVEAGYKSKSIHLGVPIVWAEHISFLPDEPDKTPFI 227 Query: 154 VANEFFDSLPIKQFV--------------------------MTEHGIRERMIDIDQHDSL 187 A+EFFD+LPI F + + R + + + Sbjct: 228 FAHEFFDALPIHAFQSVEVPSQPQTINSPTGPITLHQSSAPSSSTTTQWRELVVSPNPET 287 Query: 188 VFNIGDHEIKSNFLTCS-------------------DYFLGAIFENSPCRDREMQSISDR 228 E + G+ E SP +Q I+ R Sbjct: 288 PEVKSSKEPEFRLSLAKASTPSSLILPEMSPRYKALKSTPGSTIEISPESQTCVQDIAKR 347 Query: 229 L------------------ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQA 270 + G A+++DYG + ++L+ ++ H VSP PGQ Sbjct: 348 IGGAFTSPSSPAATDAKKNKVPSGAALILDYGTTSTIPINSLRGIRKHQLVSPFAAPGQV 407 Query: 271 DLSSHVDFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLMKQ------TARKD 322 D+S+ VDF L+ AI + + G T QG FLE LGI +RA L+K+ ++ Sbjct: 408 DVSADVDFTALAEAAIDASPGVEVYGPTEQGAFLEALGISERAAQLLKKVEGEGDEEKRK 467 Query: 323 ILLDSVKRLVSTSADKKSMGELFKILVVSHE---KVELMPF 360 + KRLV MG L+K L + E K + F Sbjct: 468 RIESGWKRLVERGG--GGMGRLYKALAIVPESGGKRRPVGF 506 >gi|325128826|gb|EGC51685.1| hypothetical protein NMXN1568_0372 [Neisseria meningitidis N1568] Length = 382 Score = 233 bits (594), Expect = 3e-59, Method: Composition-based stats. Identities = 93/376 (24%), Positives = 152/376 (40%), Gaps = 37/376 (9%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGE 61 KL I I K+G + ++ L + P++GYY+ + G GDF+TAP ++ +F + Sbjct: 16 KLQTLIAEKIGKHGNWIPFSRFMELVLYAPQYGYYTGGSHKIGNTGDFITAPTLTSLFAQ 75 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 LA L Q + E G G G + D+L I + Y++E S Sbjct: 76 TLARQLQELLPQT----AGNIYEFGAGTGQLAADLLGSISD------GISRYYIIEISPE 125 Query: 122 LTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 L QK + K+ T+L + G ++ NE D++P++ E G E Sbjct: 126 LAARQKNLIQARAPEASQKVVHLTALPEAFDG--IIIGNEVLDAMPVEIVRKNEGGSFEH 183 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG----AIFENSPCRDREMQSISDRLACDG 233 + +D ++ S YF E P + +++++ RL Sbjct: 184 VGVCLDNDRFTYSARPLHDLQLSALASLYFPKITSPYTSELHPQQYAFIRTLASRLE--H 241 Query: 234 GTAIVIDY------GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAI 286 G I IDY Y R TL + H +P G ADL++HV+F ++ Sbjct: 242 GCMIFIDYGFDAAQYYHPQRNQGTLIGHYRHHVIHNPFDFIGLADLTAHVNFTDIAQAGT 301 Query: 287 LYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKD-ILLDSVKRLVSTSADKKSMGELF 345 L + G Q FL LGI + + K + +V++L+ D+ MGELF Sbjct: 302 DAGLDLTGYLPQSHFLLNLGITELLAQVGKTDSPAYIREAAAVQKLI----DQHEMGELF 357 Query: 346 KILVVSHE-KVELMPF 360 K++ V+ F Sbjct: 358 KVIAFGKNIGVDWAGF 373 >gi|226313453|ref|YP_002773347.1| hypothetical protein BBR47_38660 [Brevibacillus brevis NBRC 100599] gi|226096401|dbj|BAH44843.1| conserved hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 363 Score = 233 bits (594), Expect = 3e-59, Method: Composition-based stats. Identities = 83/363 (22%), Positives = 147/363 (40%), Gaps = 19/363 (5%) Query: 5 LIRKIVNLIKK--NGQMTVDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEISQIFGE 61 +I I I+ +T ++ L + GYY P G GDF T+ + +F E Sbjct: 1 MIELIREEIESQPGKAITFARFMELALYHDTHGYYMVEQPKVGKAGDFYTSASVHPVFAE 60 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 +A ++ WE+ S V LVE+G G G + +L I KP+ + L++ ++E S Sbjct: 61 TIADAVLALWEEADITSPV-LVEIGGGTGAVCRHMLERIRACKPEIYKELTVILIEASPY 119 Query: 122 LTLIQKKQLASYGDKINWYTSLADVPLGFTF---LVANEFFDSLPIKQFVMTEHGIRERM 178 +Q++ L + WY+S+ + + +NE+ D+ P+ + G +E Sbjct: 120 HRKMQQEALQWHEGPKRWYSSVNEAAKQEKIEGVIFSNEWLDAFPVHIVEKNKSGWQEVW 179 Query: 179 IDIDQHD--SLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTA 236 + + + + + + +G E + ++ Q +S L G Sbjct: 180 VRVGEDGLEECLGEMTPALGEYLRGLNLKLPIGMRIEINLAMEQAAQDVSCLLKK--GFV 237 Query: 237 IVIDYG------YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSIAILYK 289 I IDYG Y SR TL H +P +N G+ DL++HV+F Sbjct: 238 ITIDYGDLQEELYHPSRKNGTLMCYHRHQAHTNPYLNIGEQDLTTHVNFSAWKEYGEKAG 297 Query: 290 LYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILV 349 L G Q +FL G+ +A + M + + + R + D +G F ++V Sbjct: 298 LQEIGYMRQDRFLMRNGLLHKAVAHMDRDPFTSVAMKR-NRAIQQLIDPAGLGGRFWVMV 356 Query: 350 VSH 352 Sbjct: 357 QGK 359 >gi|313200027|ref|YP_004038685.1| hypothetical protein MPQ_0260 [Methylovorus sp. MP688] gi|312439343|gb|ADQ83449.1| conserved hypothetical protein [Methylovorus sp. MP688] Length = 385 Score = 233 bits (594), Expect = 3e-59, Method: Composition-based stats. Identities = 94/376 (25%), Positives = 151/376 (40%), Gaps = 32/376 (8%) Query: 2 ENKLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIF 59 +L I I G + QY L + P GYYS FG GDFVTAPE+S +F Sbjct: 16 SQQLKLHIGRHIAEAGGWLDFAQYMDLVLYAPSLGYYSAGAKKFGPAGDFVTAPELSPLF 75 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 LA+ ++ELG G G + D+L + +L+ ++E S Sbjct: 76 ARTLAMQAADILSAT----AGDVLELGAGSGRLAADLLLELDRLQQLPS---QYRILEIS 128 Query: 120 ERLTLIQKKQL-----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGI 174 L +QK L ++ W SL + G ++ NE D+LP+ G+ Sbjct: 129 AYLRQVQKDYLQKVLPPHLMQRVEWLDSLPEAFSG--LVLGNEVLDALPVHIVHQQADGL 186 Query: 175 RERMIDIDQHDSLVFNIGDHE-IKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDG 233 +R + + L + + + + E + S++ L Sbjct: 187 LQRGVGLAPDAELQWVDQPADALIQAAFRETRLPESYTTEICMAAGGLIASLASMLQR-- 244 Query: 234 GTAIVIDY------GYLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAI 286 G ++IDY Y R T + + H + P + PG D+++HVDF R++ A+ Sbjct: 245 GVVLLIDYGFPRHEYYHPQRQQGTLMCHYRHHAHTDPFLYPGLQDITAHVDFTRIAESAM 304 Query: 287 LYKLYINGLTTQGKFLEGLGIWQRAFSLMKQT-ARKDILLDSVKRLVSTSADKKSMGELF 345 L + G +Q +FL GI + + A L S ++L+S MGELF Sbjct: 305 QQGLAVMGYASQAQFLINCGITECLVEVSPHDVAAYAPLASSAQKLLS----PAEMGELF 360 Query: 346 KILVVSHEKVELM-PF 360 K++ V E + F Sbjct: 361 KVIAVGRGIDEPLRGF 376 >gi|325144948|gb|EGC67231.1| hypothetical protein NMBM01240013_0430 [Neisseria meningitidis M01-240013] Length = 382 Score = 233 bits (594), Expect = 3e-59, Method: Composition-based stats. Identities = 89/374 (23%), Positives = 150/374 (40%), Gaps = 33/374 (8%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGE 61 KL I I K+G + ++ L + P++GYY+ + G GDF+TAP ++ +F + Sbjct: 16 KLQTLIAEEIGKHGNWIPFSRFMELVLYAPQYGYYTGGSHKIGNTGDFITAPTLTSLFAQ 75 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 LA L Q + E G G G + D+L I + Y++E S Sbjct: 76 TLARQLQELLSQT----AGNIYEFGAGTGQLAADLLGSISD------GISRYYIIEISPE 125 Query: 122 LTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 L QK + K+ T+L + G ++ NE D++P++ E G E Sbjct: 126 LAARQKNLIQARAPEASQKVVHLTALPEAFDG--IIIGNEVLDAMPVEIVRKNEGGSFEH 183 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG----AIFENSPCRDREMQSISDRLACDG 233 + +D ++ S YF E P + +++++ RL G Sbjct: 184 VGVCLDNDRFTYSARPLHDLQLSALASLYFPQTDYPYTSELHPQQYAFIRTLASRLERGG 243 Query: 234 GTAIVIDY----GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILY 288 I + Y R TL + H +P G ADL++HV+F ++ Sbjct: 244 MIFIDYGFDAAQYYHPQRNQGTLIGHYRHHVIHNPFDFIGLADLTAHVNFTDIAQAGTDA 303 Query: 289 KLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKD-ILLDSVKRLVSTSADKKSMGELFKI 347 L + G Q FL LGI + + K + +V++L+ D+ MGELFK+ Sbjct: 304 GLDLTGYLPQSHFLLNLGITELLAQVGKTDSPAYIREAAAVQKLI----DQHEMGELFKV 359 Query: 348 LVVSHE-KVELMPF 360 + ++ F Sbjct: 360 IAFGKNIGIDWAGF 373 >gi|325981510|ref|YP_004293912.1| hypothetical protein NAL212_0818 [Nitrosomonas sp. AL212] gi|325531029|gb|ADZ25750.1| protein of unknown function DUF185 [Nitrosomonas sp. AL212] Length = 395 Score = 233 bits (594), Expect = 3e-59, Method: Composition-based stats. Identities = 100/381 (26%), Positives = 160/381 (41%), Gaps = 34/381 (8%) Query: 2 ENKLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIF 59 ++ I I +G ++ +Q+ L + P GYY++ G+ GDFVTAPEIS +F Sbjct: 18 SQSVLTLIKEQILASDGWISFEQFMNLALYAPGMGYYNSGATKLGSAGDFVTAPEISSLF 77 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 G L + L P ++E G G G + LDIL + K +++E S Sbjct: 78 GRTLVLQLSQISHCLQHPY---ILEFGAGSGRLALDILIELEKTAELP---EKYFIMEVS 131 Query: 120 ERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIR 175 L Q L ++ W L D G ++ANE D++P+ ++ I Sbjct: 132 AELRQRQLTLLASKAPHLIHRVEWLERLPDQFNG--IILANEVLDAMPVHLVAWHDNAIF 189 Query: 176 ERMIDID-----QHDSLVFNIGDHEIK----SNFLTCSDYFLGAIFENSPCRDREMQSIS 226 ER + D + +I H I S SD + E + M+S++ Sbjct: 190 ERGVSWHNDQLTWQDRPLQDIYLHSIAGQMTSQINPNSDSHFEYVSEFNLSAIGFMRSLA 249 Query: 227 DRLACDGGTAIVI----DYGYLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHVDFQRL 281 L I D Y R T + + H + +P PG D++SHVDF + Sbjct: 250 KLLRQGVILLIDYGFGRDEYYHPQRNQGTLMCHYRHHAHDNPFYLPGLQDITSHVDFSAV 309 Query: 282 SSIAILYKLYINGLTTQGKFLEGLGIWQRAFSL-MKQTARKDILLDSVKRLVSTSADKKS 340 + A+ L + G TTQ FL GI + + ++ TAR + +++L+ + Sbjct: 310 TQAAVDSNLTLLGYTTQAFFLINSGITKILAQIPVEDTARYLPQSNQLQKLI----NPAE 365 Query: 341 MGELFKILVVSHE-KVELMPF 360 MGELFK++ E L+ F Sbjct: 366 MGELFKVIAFGKEFTEPLIGF 386 >gi|167900972|ref|ZP_02488177.1| hypothetical protein BpseN_01759 [Burkholderia pseudomallei NCTC 13177] Length = 396 Score = 233 bits (594), Expect = 3e-59, Method: Composition-based stats. Identities = 91/371 (24%), Positives = 149/371 (40%), Gaps = 34/371 (9%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGD----FVTAPEI 55 + L + I G + +Y + P GYYS FG GD FVTAPE+ Sbjct: 22 SDALAASLRAEIAAAGGWIPFSRYMERVLYAPGLGYYSGGAQKFGWRGDDGSDFVTAPEL 81 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F + LA + G R++E G G G + +L + L + + Sbjct: 82 SPLFAQTLARPVAQVLAASG---TRRVMEFGAGTGQLAAGLLNALAALGVELDE---YAI 135 Query: 116 VETSERLTLIQKKQLASYGD----KINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 V+ S L Q++ L ++ W +L + G +V NE D++P++ Sbjct: 136 VDLSGELRARQRETLDEQASGAAARVRWLDALPERFEG--VIVGNEVLDAMPVQLVAKHA 193 Query: 172 HGIRERMIDIDQHDSLVFNIGD--HEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRL 229 HG ER + + + F + L D G + E + ++ L Sbjct: 194 HGWCERGVSLGDAGAFAFADRPLARAEDAARLAALDADEGYVTETHDAAAAFVGTVCAML 253 Query: 230 ACDGGTAIVIDY------GYLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLS 282 A G A+ IDY Y + R TL H P V PG D+++HV+F + Sbjct: 254 AR--GAALFIDYGFPRHEYYHRQRAQGTLMCHYRHRAHGDPFVYPGLQDITAHVEFSAVY 311 Query: 283 SIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARK-DILLDSVKRLVSTSADKKSM 341 + + G T+Q +FL GI + A++ ++V++L+S + M Sbjct: 312 EAGVGAGAELLGYTSQARFLLNAGITDVLAEIDPSDAQRFLPAANAVQKLIS----EAEM 367 Query: 342 GELFKILVVSH 352 GELFK++ S Sbjct: 368 GELFKVIAFSR 378 >gi|121635457|ref|YP_975702.1| hypothetical protein NMC1757 [Neisseria meningitidis FAM18] gi|120867163|emb|CAM10930.1| conserved hypothetical protein [Neisseria meningitidis FAM18] Length = 405 Score = 233 bits (594), Expect = 4e-59, Method: Composition-based stats. Identities = 90/374 (24%), Positives = 150/374 (40%), Gaps = 33/374 (8%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGE 61 KL I I K+G + ++ L + P++GYY+ + G GDF+TAP ++ +F + Sbjct: 39 KLQTLIAEEIGKHGNWIPFSRFMELVLYAPQYGYYTGGSHKIGNTGDFITAPTLTSLFAQ 98 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 LA L Q + E G G G + D+L I + Y++E S Sbjct: 99 TLARQLQELLSQT----AGNIYEFGAGTGQLAADLLGSISD------GISRYYIIEISPE 148 Query: 122 LTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 L QK + K+ T+L + G ++ NE D++P++ E G E Sbjct: 149 LAARQKNLIQARAPEASQKVVHLTALPEAFDG--IIIGNEVLDAMPVEIVRKNEGGSFEH 206 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG----AIFENSPCRDREMQSISDRLACDG 233 + +D ++ S YF E P + +++++ RL G Sbjct: 207 VGVCLDNDRFTYSARPLHDLQLSALASLYFPQTDYPYTSELHPQQYAFIRTLASRLERGG 266 Query: 234 GTAIVIDY----GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILY 288 I + Y R TL + H +P G ADL++HV+F ++ Sbjct: 267 MIFIDYGFDAAQYYHPQRNQGTLIGHYRHHVIHNPFDFIGLADLTAHVNFTDIAQAGTDA 326 Query: 289 KLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKD-ILLDSVKRLVSTSADKKSMGELFKI 347 L + G Q FL LGI + + K + +V++L+ D+ MGELFK+ Sbjct: 327 GLDLTGYLPQSHFLLNLGITELLAQVGKTDSPAYIREAAAVQKLI----DQHEMGELFKV 382 Query: 348 LVVSHE-KVELMPF 360 + V+ F Sbjct: 383 IAFGKNIGVDWAGF 396 >gi|161870665|ref|YP_001599838.1| hypothetical protein NMCC_1735 [Neisseria meningitidis 053442] gi|161596218|gb|ABX73878.1| conserved hypothetical protein [Neisseria meningitidis 053442] Length = 382 Score = 233 bits (594), Expect = 4e-59, Method: Composition-based stats. Identities = 90/374 (24%), Positives = 150/374 (40%), Gaps = 33/374 (8%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGE 61 KL I I K+G + ++ L + P++GYY+ + G GDF+TAP ++ +F + Sbjct: 16 KLQTLIAEKIGKHGNWIPFSRFMELVLYAPQYGYYTGGSHKIGNTGDFITAPTLTSLFAQ 75 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 LA L Q + E G G G + D+L I + Y++E S Sbjct: 76 TLARQLQELLSQT----AGNIYEFGAGTGQLAADLLGSISD------GISRYYIIEISPE 125 Query: 122 LTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 L QK + K+ T+L + G ++ NE D++P++ E G E Sbjct: 126 LAARQKNLIQARAPEASQKVVHLTALPEAFDG--IIIGNEVLDAMPVEIVRKNEGGSFEH 183 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG----AIFENSPCRDREMQSISDRLACDG 233 + +D ++ S YF E P + +++++ RL G Sbjct: 184 VGVCLDNDRFTYSARPLHDLQLSALASLYFPQTDYPYTSELHPQQYAFIRTLASRLERGG 243 Query: 234 GTAIVIDY----GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILY 288 I + Y R TL + H +P G ADL++HV+F ++ Sbjct: 244 MIFIDYGFDAAQYYHPQRNQGTLIGHYRHHVIHNPFDFIGLADLTAHVNFTDIAQAGTDA 303 Query: 289 KLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDI-LLDSVKRLVSTSADKKSMGELFKI 347 L + G Q FL LGI + K + I +V++L+ D+ MGELFK+ Sbjct: 304 GLDLIGYLPQSHFLLNLGITELLAQTGKTDSAAYICEAAAVQKLI----DQHEMGELFKV 359 Query: 348 LVVSHE-KVELMPF 360 + ++ F Sbjct: 360 IAFGKNIGIDWAGF 373 >gi|325205479|gb|ADZ00932.1| conserved hypothetical protein [Neisseria meningitidis M04-240196] Length = 405 Score = 233 bits (594), Expect = 4e-59, Method: Composition-based stats. Identities = 90/374 (24%), Positives = 150/374 (40%), Gaps = 33/374 (8%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGE 61 KL I I K+G + ++ L + P++GYY+ + G GDF+TAP ++ +F + Sbjct: 39 KLQTLIAEKIGKHGNWIPFSRFMELVLYAPQYGYYTGGSHKIGNTGDFITAPTLTSLFAQ 98 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 LA L Q + E G G G + D+L I + Y++E S Sbjct: 99 TLARQLQELLSQT----AGNIYEFGAGTGQLAADLLGSISD------GISRYYIIEISPE 148 Query: 122 LTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 L QK + K+ T+L + G ++ NE D++P++ E G E Sbjct: 149 LAARQKNLIQARAPEASQKVVHLTALPEAFDG--IIIGNEVLDAMPVEIVRKNEGGSFEH 206 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG----AIFENSPCRDREMQSISDRLACDG 233 + +D ++ S YF E P + +++++ RL G Sbjct: 207 VGVCLDNDRFTYSARPLHDLQLSALASLYFPQTDYPYTSELHPQQYAFIRTLASRLERGG 266 Query: 234 GTAIVIDY----GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILY 288 I + Y R TL + H +P G ADL++HV+F ++ Sbjct: 267 MIFIDYGFDAAQYYHPQRNQGTLIGHYRHHVIHNPFDFIGLADLTAHVNFTDIAQAGTDA 326 Query: 289 KLYINGLTTQGKFLEGLGIWQRAFSLMK-QTARKDILLDSVKRLVSTSADKKSMGELFKI 347 L + G Q FL LGI + K +A +V++L+ D+ MGELFK+ Sbjct: 327 GLDLIGYLPQSHFLLNLGITELLAQTGKTDSAAYIREAAAVQKLI----DQHEMGELFKV 382 Query: 348 LVVSHE-KVELMPF 360 + ++ F Sbjct: 383 IAFGKNIGIDWAGF 396 >gi|238028810|ref|YP_002913041.1| hypothetical protein bglu_1g32740 [Burkholderia glumae BGR1] gi|237878004|gb|ACR30337.1| Hypothetical protein bglu_1g32740 [Burkholderia glumae BGR1] Length = 402 Score = 233 bits (593), Expect = 4e-59, Method: Composition-based stats. Identities = 93/385 (24%), Positives = 151/385 (39%), Gaps = 39/385 (10%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGD----FVTAPEI 55 L + I G + ++ + P GYYS FG GD FVTAPE+ Sbjct: 22 SETLAASLRAEIAAAGGWLPFSRFMERALYAPGLGYYSGGARKFGRRGDDGSDFVTAPEL 81 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F LA + E G R++E G G G + +L + L + Sbjct: 82 SPLFAHTLARPVA---EALGASGTRRVMEFGAGTGRLAAGLLAALEALGAAP---EHYQI 135 Query: 116 VETSERLTLIQKKQLASYGD-----KINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMT 170 VE S L Q+ LA+ ++ W +L + G ++ NE D++P++ + Sbjct: 136 VELSGELRERQRATLAAALPAALAARVQWLDALPERFEG--VVIGNEVLDAMPVRLVLRA 193 Query: 171 E---HGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSD------YFLGAIFENSPCRDRE 221 G RER + +D S F D + ++ D G + E Sbjct: 194 AGAGAGWRERGVAVDAA-SRAFVFEDRPLAAHAPELIDTLAALDLPAGYLTETHEAARAF 252 Query: 222 MQSISDRLACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHV 276 +++ LA I + Y R TL H P V PG D+++HV Sbjct: 253 TRTVCTMLARGAAFFIDYGFPAAEYYHPQRAEGTLMCHYRHRAHGDPFVWPGLQDITAHV 312 Query: 277 DFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLM-KQTARKDILLDSVKRLVSTS 335 +F + + + + G T+QG+FL GI + ++ AR ++V++L+S Sbjct: 313 EFSGIHAAGVAAGAELLGYTSQGRFLLNAGITEVLAAIDPSDPARFLPAANAVQKLIS-- 370 Query: 336 ADKKSMGELFKILVVSHEKVELMPF 360 + MGELFK++ L F Sbjct: 371 --EAEMGELFKVIAFGRGIDGLAAF 393 >gi|167585225|ref|ZP_02377613.1| hypothetical protein BuboB_07807 [Burkholderia ubonensis Bu] Length = 401 Score = 233 bits (593), Expect = 4e-59, Method: Composition-based stats. Identities = 87/376 (23%), Positives = 147/376 (39%), Gaps = 39/376 (10%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGAV----GDFVTAPEI 55 L ++ I G + D++ + P GYYS FG DFVTAPE+ Sbjct: 22 SETLAAQLRGEIAAAGGWLPFDRFMERALYAPGLGYYSGGARKFGRRADDGSDFVTAPEL 81 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F + LA + A G R++E G G G ++ L + + Sbjct: 82 SPLFAQTLARPVAQALAASG---TRRVMEFGAGTG---KLAAGLLAALDALAAELDEYLI 135 Query: 116 VETSERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 V+ S L Q+ + K+ W +L + G +V NE D++P++ F + Sbjct: 136 VDLSGELRERQRDTIAAAAPGLASKVRWLDALPERFEG--VVVGNEVLDAMPVRLFAKAD 193 Query: 172 HGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYF-------LGAIFENSPCRDREMQS 224 RER + +D + VF ++ G + E ++ Sbjct: 194 GAWRERGVALDARQAFVFEDRPAAPAASADLPPALAALDADAGDGYVTETHEAALAFTRT 253 Query: 225 ISDRLACDGGTAIVIDY------GYLQSRVGDTLQAVKGHTYVS-PLVNPGQADLSSHVD 277 + LA G ++IDY Y R TL H P + PG D+++HV+ Sbjct: 254 VCTMLAR--GAVLLIDYGFPAHEYYHPQRDRGTLMCHYRHRAHDDPFLYPGLQDITAHVE 311 Query: 278 FQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARK-DILLDSVKRLVSTSA 336 F + + + + G T+Q +FL GI ++ ++V++L+S Sbjct: 312 FSGIYAAGVATGADLLGYTSQARFLLNAGITDVLAAIDPSDVHAFLPAANAVQKLIS--- 368 Query: 337 DKKSMGELFKILVVSH 352 + MGELFK++ S Sbjct: 369 -EAEMGELFKVIAFSR 383 >gi|325132955|gb|EGC55632.1| hypothetical protein NMBM6190_0339 [Neisseria meningitidis M6190] gi|325138943|gb|EGC61493.1| hypothetical protein NMBES14902_0391 [Neisseria meningitidis ES14902] Length = 382 Score = 233 bits (593), Expect = 4e-59, Method: Composition-based stats. Identities = 90/374 (24%), Positives = 150/374 (40%), Gaps = 33/374 (8%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGE 61 KL I I K+G + ++ L + P++GYY+ + G GDF+TAP ++ +F + Sbjct: 16 KLQTLIAEEIGKHGNWIPFSRFMELVLYAPQYGYYTGGSHKIGNTGDFITAPTLTSLFAQ 75 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 LA L Q + E G G G + D+L I + Y++E S Sbjct: 76 TLARQLQELLSQT----AGNIYEFGAGTGQLAADLLGSISD------GISRYYIIEISPE 125 Query: 122 LTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 L QK + K+ T+L + G ++ NE D++P++ E G E Sbjct: 126 LAARQKNLIQARAPEASQKVVHLTALPEAFDG--IIIGNEVLDAMPVEIVRKNEGGSFEH 183 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG----AIFENSPCRDREMQSISDRLACDG 233 + +D ++ S YF E P + +++++ RL G Sbjct: 184 VGVCLDNDRFTYSARPLHDLQLSALASLYFPQTDYPYTSELHPQQYAFIRTLASRLERGG 243 Query: 234 GTAIVIDY----GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILY 288 I + Y R TL + H +P G ADL++HV+F ++ Sbjct: 244 MIFIDYGFDAAQYYHPQRNQGTLIGHYRHHVIHNPFDFIGLADLTAHVNFTDIAQAGTDA 303 Query: 289 KLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKD-ILLDSVKRLVSTSADKKSMGELFKI 347 L + G Q FL LGI + + K + +V++L+ D+ MGELFK+ Sbjct: 304 GLDLTGYLPQSHFLLNLGITELLAQVGKTDSPAYIREAAAVQKLI----DQHEMGELFKV 359 Query: 348 LVVSHE-KVELMPF 360 + V+ F Sbjct: 360 IAFGKNIGVDWAGF 373 >gi|325203516|gb|ADY98969.1| conserved hypothetical protein [Neisseria meningitidis M01-240355] Length = 382 Score = 233 bits (593), Expect = 4e-59, Method: Composition-based stats. Identities = 91/374 (24%), Positives = 149/374 (39%), Gaps = 33/374 (8%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGE 61 KL I I K+G + ++ L + P++GYY+ + G GDF+TAP ++ +F + Sbjct: 16 KLQTLIAEKIGKHGNWIPFSRFMELVLYAPQYGYYTGGSHKIGNTGDFITAPTLTSLFAQ 75 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 LA L Q + E G G G + D+L I + Y++E S Sbjct: 76 TLARQLQELLSQT----AGNIYEFGAGTGQLAADLLGSISD------GISRYYIIEISPE 125 Query: 122 LTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 L QK + K+ T+L + G ++ NE D++P++ E G E Sbjct: 126 LAARQKNLIQARAPEASQKVVHLTALPEAFDG--IIIGNEVLDAMPVEIVRKNEGGSFEH 183 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG----AIFENSPCRDREMQSISDRLACDG 233 + +D ++ S YF E P + +++++ RL G Sbjct: 184 VGVCLDNDRFTYSARPLHDLQLSALASLYFPQTDYPYTSELHPQQYAFIRTLASRLERGG 243 Query: 234 GTAIVIDY----GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILY 288 I + Y R TL + H +P G ADL+ HV+F ++ Sbjct: 244 MIFIDYGFDAAQYYHPQRNQGTLIGHYRHHVIHNPFDFIGLADLTVHVNFTDIAQAGTDA 303 Query: 289 KLYINGLTTQGKFLEGLGIWQRAFSLMK-QTARKDILLDSVKRLVSTSADKKSMGELFKI 347 L + G Q FL LGI + K +A +V++L+ D+ MGELFK+ Sbjct: 304 GLDLIGYLPQSHFLLNLGITELLAQTGKTDSAAYIREAAAVQKLI----DQHEMGELFKV 359 Query: 348 LVVSHE-KVELMPF 360 + V+ F Sbjct: 360 IAFGKNIGVDWAGF 373 >gi|326318891|ref|YP_004236563.1| hypothetical protein Acav_4106 [Acidovorax avenae subsp. avenae ATCC 19860] gi|323375727|gb|ADX47996.1| protein of unknown function DUF185 [Acidovorax avenae subsp. avenae ATCC 19860] Length = 382 Score = 233 bits (593), Expect = 4e-59, Method: Composition-based stats. Identities = 91/379 (24%), Positives = 150/379 (39%), Gaps = 40/379 (10%) Query: 1 MENKLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTC-NPFG----AVGDFVTAPE 54 + + L R+I + I G + D++ + P GYY+ FG + DFVTAPE Sbjct: 16 LSSALARRIADGIAEAGGWIGFDRFMEWALYTPGLGYYANALPKFGTLPQSGSDFVTAPE 75 Query: 55 ISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIY 114 +S +FG+ LA + A + G + E G G G + +L + V Sbjct: 76 LSPVFGQALARQVQDALDATG---TDDVWEFGAGSGALAAQLLEALGD------RVRRYT 126 Query: 115 MVETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMT---- 170 +V+ S L Q+++LA +GD++ W +L + G +V NE D++P++ Sbjct: 127 IVDLSGSLRERQRERLAPWGDRVQWAQALPERIEG--VVVGNEVLDAMPVQLLARHGGAE 184 Query: 171 EHGIRERMIDIDQHD--SLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDR 228 ER + ++ D F D + + E + ++ + R Sbjct: 185 SSTWHERGVVVEPGDGGEPRFAWRDRPTPLRPPVEPEGPQDYLTEIHAQGEGFLRMLGQR 244 Query: 229 LACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSS 283 LA I + Y R TL + H PLV+ G D+++HV+F ++ Sbjct: 245 LARGAAFLIDYGFPEAEYYHPQRHMGTLVCHRAHQVDSDPLVDVGAKDITAHVNFTAMAV 304 Query: 284 IAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGE 343 A L + G TTQ FL G+ L + + K + MGE Sbjct: 305 AAQEAGLDVLGYTTQAHFLINCGLLALLEPL-----PQAQRAQAAK-----LMMEHEMGE 354 Query: 344 LFKILVVSH--EKVELMPF 360 LFK+L V + F Sbjct: 355 LFKVLAVGAGVPAWTPIGF 373 >gi|225077128|ref|ZP_03720327.1| hypothetical protein NEIFLAOT_02183 [Neisseria flavescens NRL30031/H210] gi|224951539|gb|EEG32748.1| hypothetical protein NEIFLAOT_02183 [Neisseria flavescens NRL30031/H210] Length = 383 Score = 233 bits (593), Expect = 4e-59, Method: Composition-based stats. Identities = 80/364 (21%), Positives = 150/364 (41%), Gaps = 31/364 (8%) Query: 5 LIRKIVNLIKKN-GQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGEM 62 L + I N I+++ + ++ L + P++GYYS + G GDF+TAP +S +FG+ Sbjct: 19 LTKLIKNEIEQHQNWIPFSRFMELALYTPQYGYYSGGSHKIGTDGDFITAPTLSPLFGQT 78 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 LA L Q + E G G G + +L+ + + Y++E S L Sbjct: 79 LAKQLAELLPQT----AGNIYEFGAGTGHLAATLLQNLSD------GLNHYYIIELSAEL 128 Query: 123 TLIQKKQLASYGD-----KINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 Q++ + KI T+L + G ++ NE D++P+++ + + G ++ Sbjct: 129 AERQRQYILENTSLEVAAKIIHLTTLPEHFDG--IIIGNEVLDAMPVERLIYQDEGFQQI 186 Query: 178 MIDIDQHD--SLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGT 235 + ++ + + E+ E P + +Q+++ +L G Sbjct: 187 GVSLENDKLIEAIRPLAQAELTQTAALYFPPLPSYTSELHPAQYAFIQTLAAKLQRGGMI 246 Query: 236 AI-----VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKL 290 I Y + Q + G + + HT P N G DL++HV+F ++ L Sbjct: 247 FIDYGFDAAQYYHPQRKEGTFIGHYRHHTIHDPFFNIGLTDLTAHVNFTDIARAGTESGL 306 Query: 291 YINGLTTQGKFLEGLGIWQRAFSL-MKQTARKDILLDSVKRLVSTSADKKSMGELFKILV 349 + G Q FL LGI + + +V++L+ MGELFK++ Sbjct: 307 DLIGYLPQSYFLLNLGIIDLLAQIGSPDSVEYIQTAATVQKLIHQ----HEMGELFKVIA 362 Query: 350 VSHE 353 + Sbjct: 363 FGKD 366 >gi|299749961|ref|XP_001836447.2| hypothetical protein CC1G_07094 [Coprinopsis cinerea okayama7#130] gi|298408676|gb|EAU85400.2| hypothetical protein CC1G_07094 [Coprinopsis cinerea okayama7#130] Length = 457 Score = 233 bits (593), Expect = 4e-59, Method: Composition-based stats. Identities = 119/387 (30%), Positives = 185/387 (47%), Gaps = 59/387 (15%) Query: 15 KNGQMTVDQYFALCVADPEFGYYSTCN--PFGAVGDFVTAPEISQIFGEMLAIFLICAWE 72 G ++ +Y LC++ P GYY N FG GDF+T+PEISQ+FGE++ ++L+ W Sbjct: 60 ATGPISFAKYMQLCLSHPTHGYYMNPNNAVFGTSGDFITSPEISQVFGELVGVWLVSQWA 119 Query: 73 QHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLAS 132 G P +RLVELGPGRG +M DILR++ K P+ ++ +++VETSE L +QK +L Sbjct: 120 DAGTPPAIRLVELGPGRGTLMDDILRIVKKFLPE-KALTGVHLVETSEALRSVQKAKLGE 178 Query: 133 YGDKINWYTSLADVPLGFTF---LVANEFFDSLPIKQFVMTEHGIRERMIDIDQHD---- 185 D ++++ + ++P + LVA+EFFD+LP+ TE G E MI + Sbjct: 179 KCD-LHFHNGIHEIPRNPSVYTMLVAHEFFDALPVHVVQKTEAGWNEVMIASNDSLSSSE 237 Query: 186 --------------SLVFNIGDHEIKSNFLTCS----DYFLGAIFENSPCRDREMQSISD 227 V N + S + +G+ E SP R I Sbjct: 238 SSPSQPQTQKQGVLRRVLNPLPSPASTLLGNSSLRFRNLPIGSTIEVSPTSFRIAHQIGR 297 Query: 228 RLACDGG-------------------TAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPG 268 L+ G +VIDYG GD+ +A K H V PG Sbjct: 298 LLSARGLEEKPLDLVEASQQETGVGGCGLVIDYG-ADHAFGDSFRAFKEHKIVDVFHRPG 356 Query: 269 QADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTA---RKDILL 325 + D++++VDF L + +G TQ FLE + + R +L++ + RK ++L Sbjct: 357 ECDITANVDFAYLKEAMT---VEPHGPITQADFLERMALQTRVEALVRNASSEERKKVIL 413 Query: 326 DSVKRLVSTSADKKSMGELFKILVVSH 352 D+ RLV D+ MG +K+L ++ Sbjct: 414 DAANRLV----DRSGMGTQYKVLGITS 436 >gi|307150081|ref|YP_003885465.1| hypothetical protein Cyan7822_0139 [Cyanothece sp. PCC 7822] gi|306980309|gb|ADN12190.1| protein of unknown function DUF185 [Cyanothece sp. PCC 7822] Length = 386 Score = 233 bits (593), Expect = 4e-59, Method: Composition-based stats. Identities = 92/376 (24%), Positives = 149/376 (39%), Gaps = 23/376 (6%) Query: 1 MENKLIRKIVNLIKKN--GQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQ 57 +E+ LI+ I I ++ +++ +Y L + P+FGYYS+ G GD+ T+ + Sbjct: 2 IESTLIKTITERIHQSPLHRISFSEYMQLVLYHPQFGYYSSEKAKIGKSGDYFTSSSLGP 61 Query: 58 IFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVE 117 FGE+LA I WE G PS L+E+G G G++ DIL K PD L ++E Sbjct: 62 DFGELLAKQFIEMWEILGQPSHFILLEMGAGLGLLASDILNYFRKTAPDLLEKLEYQIIE 121 Query: 118 TSERLTLIQKKQL-----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEH 172 S L QK+QL + + +AD + +NE D+ P+ + + Sbjct: 122 QSLDLIARQKEQLQSELAQGIKIEWKTWQDIADESIIGC-AFSNELVDAFPVHRLAIQGG 180 Query: 173 GIRERMIDIDQHDSLVFNIGDHEIKSNFL-------TCSDYFLGAIFENSPCRDREMQSI 225 ++E + + + + + Y G E + ++++ Sbjct: 181 ELKEIYVTYSDNQFQEIIDKPTQDFNQYFQLVGVELPSDAYREGYQTEVNTAALSWLETL 240 Query: 226 SDRLACDGGTAIVIDYG----YLQSRVGDTL-QAVKGHTYVSPLVNPGQADLSSHVDFQR 280 S +L I Y Y R TL K H + P +N G D+++H+DF Sbjct: 241 SKKLKRGYLLTIDYGYPAHKYYHPQRYRGTLNCYYKHHHHHDPYINIGLQDITTHIDFTA 300 Query: 281 LSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKS 340 L L L G T QG FL LG+ R L ++ L D Sbjct: 301 LERQGELCGLEKLGSTKQGMFLMSLGLGDRLAELSSGQYNFGEVIQRRDAL-HQLIDPSG 359 Query: 341 MGELFKILVVSHEKVE 356 +G F +L+ Sbjct: 360 LGG-FGVLIQCKGLTP 374 >gi|220933515|ref|YP_002512414.1| protein of unknown function DUF185 [Thioalkalivibrio sp. HL-EbGR7] gi|219994825|gb|ACL71427.1| protein of unknown function DUF185 [Thioalkalivibrio sp. HL-EbGR7] Length = 393 Score = 233 bits (593), Expect = 5e-59, Method: Composition-based stats. Identities = 91/380 (23%), Positives = 154/380 (40%), Gaps = 34/380 (8%) Query: 2 ENKLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIF 59 L ++ + I G + +Y L + P GYY+ + GA GDF TAPE S +F Sbjct: 18 SEALTARLRDEIEAAGGFLPFRRYMELALYAPGLGYYAAGSHKLGAGGDFTTAPETSPLF 77 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLK--PDFFSVLSIYMVE 117 LA + E+ G +++E G G G + +++LR + L P+ + ++E Sbjct: 78 ARCLARQVAQVLEELGG---GQVLEFGAGTGALAVEMLRALAALDRLPEQY-----LILE 129 Query: 118 TSERLTLIQKKQLASYGDKIN---WYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGI 174 S L Q+ +A + + + ++ NE D++P++ F T + Sbjct: 130 LSPDLRERQQAAVADLPEALAARVCWLDALPPRGFRGVMLGNEVLDAMPVEVFTWTGESV 189 Query: 175 RERMIDIDQHDSLVFNIGDHEI-------KSNFLTCSDYFLGAIFENSPCRDREMQSISD 227 +R + + V+ E + T S + G E SP + S+++ Sbjct: 190 LQRGVA-WEGGRFVWAERPAEPGLGAAVQRLQAETGSAWPPGYTSEYSPGLAPWVASLAE 248 Query: 228 RLACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTY-VSPLVNPGQADLSSHVDFQRLS 282 LA I Y Y R TL H P PG ADL++HVDF ++ Sbjct: 249 VLAAGLILLIDYGYPRAEYYSPERSRGTLMGYYRHRALDDPFFLPGLADLTAHVDFTAVA 308 Query: 283 SIAILYKLYINGLTTQGKFLEGLGIWQRA-FSLMKQTARKDILLDSVKRLVSTSADKKSM 341 + L + G TTQ FL G G+ Q + + + L V++L M Sbjct: 309 EAGVGAGLDVLGYTTQAWFLIGAGLDQVFQEAASEDPREQLALAGQVRQLTL----PGEM 364 Query: 342 GELFKILVVSHE-KVELMPF 360 GE F+++ + + L+ F Sbjct: 365 GERFQVIGLGRGIEGPLVGF 384 >gi|126031971|gb|ABN71547.1| uncharacterized conserved protein [uncultured bacterium] Length = 327 Score = 233 bits (593), Expect = 5e-59, Method: Composition-based stats. Identities = 121/359 (33%), Positives = 175/359 (48%), Gaps = 40/359 (11%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 + + + IK G M+V+ Y C YY+T +P G GDF TAPEISQ++GE+ Sbjct: 2 SPFEKALAERIKAEGPMSVEAYMEACNTY----YYATRDPLGVRGDFTTAPEISQMYGEL 57 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + L W + G P VR ELGPGRG + D + ++ S+ VETS L Sbjct: 58 IGAALADCWNRAGQPEGVRYAELGPGRGTLASD---ALRVMRAAGCEPASVEFVETSPVL 114 Query: 123 TLIQKKQLAS--YGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 Q + + D+I + G +VA+EFFD+LP++Q+V E + Sbjct: 115 REAQANAVTDAIFHDEIAGLAN----SDGPLLVVASEFFDALPVQQWVDAIERRVEWIGG 170 Query: 181 IDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 D GAI E SP R+ M ++ LA GG I ID Sbjct: 171 HFAFDR---------------------DGAIIETSPAREEAMAELARVLAAKGGIGIFID 209 Query: 241 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 YGY S G++LQAV GH + SPL PG+ DL++HVDF L A ++ LT+QG Sbjct: 210 YGYA-SGTGESLQAVGGHKFASPLEKPGEQDLTAHVDFASLGRAAREAGASVSRLTSQGT 268 Query: 301 FLEGLGIWQRAFSLM-KQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM 358 +LE LGI RA +L + + + + + +RL D+++MG LFK++ V+ L Sbjct: 269 WLETLGIGGRALALAGRNPEQTEAIGAARRRL----CDEEAMGRLFKVMGVAAPHWPLP 323 >gi|325202779|gb|ADY98233.1| conserved hypothetical protein [Neisseria meningitidis M01-240149] Length = 382 Score = 233 bits (593), Expect = 5e-59, Method: Composition-based stats. Identities = 91/374 (24%), Positives = 150/374 (40%), Gaps = 33/374 (8%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGE 61 KL I I K+G + ++ L + P++GYY+ + G GDF+TAP ++ +F + Sbjct: 16 KLQTLIAEKIGKHGNWIPFSRFMELVLYAPQYGYYTGGSHKIGNTGDFITAPTLTSLFAQ 75 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 LA L Q + E G G G + D+L I + Y++E S Sbjct: 76 TLARQLQELLSQT----AGNIYEFGAGTGQLAADLLGSISD------GISRYYIIEISPE 125 Query: 122 LTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 L QK + K+ T+L + G ++ NE D++P++ E G E Sbjct: 126 LAARQKNLIQARAPEASQKVVHLTALPEAFDG--IIIGNEVLDAMPVEIVRKNEGGSFEH 183 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG----AIFENSPCRDREMQSISDRLACDG 233 + +D ++ S YF E P + +++++ RL G Sbjct: 184 VGVCLDNDRFTYSARPLHDLQLSALASLYFPQTDYPYTSELHPQQYAFIRTLASRLERGG 243 Query: 234 GTAIVIDY----GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILY 288 I + Y R TL + H +P G ADL++HV+F ++ Sbjct: 244 MIFIDYGFDAAQYYHPQRNQGTLIGHYRHHVIHNPFDFIGLADLTAHVNFTDIAQAGTDA 303 Query: 289 KLYINGLTTQGKFLEGLGIWQRAFSLMK-QTARKDILLDSVKRLVSTSADKKSMGELFKI 347 L + G Q FL LGI + K +A +V++L+ D+ MGELFK+ Sbjct: 304 GLDLIGYLPQSHFLLNLGITELLAQTGKTDSAAYIREAAAVQKLI----DQHEMGELFKV 359 Query: 348 LVVSHE-KVELMPF 360 + V+ F Sbjct: 360 IAFGKNIGVDWAGF 373 >gi|322709436|gb|EFZ01012.1| DUF185 domain-containing protein [Metarhizium anisopliae ARSEF 23] Length = 518 Score = 232 bits (592), Expect = 5e-59, Method: Composition-based stats. Identities = 113/458 (24%), Positives = 177/458 (38%), Gaps = 101/458 (22%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYST-----CNPFGAVGDFVTAPEIS 56 L +++ I G + + Y +C+ GYY+ + FG GDFVT+PEIS Sbjct: 56 STPLAKQLFEAISTTGPVPLASYMRMCLTGDLGGYYTGAIGQDRDQFGVKGDFVTSPEIS 115 Query: 57 QIFGEMLAIFLICAWEQHGFPSC-VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 QIFGE++ ++ I W G P V+L+E+GPGRG +M D+LR I + S+ S++M Sbjct: 116 QIFGELVGVWFIAEWISQGQPKEGVQLIEVGPGRGTLMDDMLRTIKRFPAMVDSIESVFM 175 Query: 116 VETSERLTLIQK---------------------KQLASYGDKINWYTSLADVPLGFTFLV 154 VE S L QK K L S+ P F+V Sbjct: 176 VEASPELREKQKTLLCGSDAPSEDCAAGFRSTGKHLGKPVVWAESLKSIPIEPNKVPFIV 235 Query: 155 ANEFFDSLPIKQFVMTE----------------------------HGIRERMIDIDQH-- 184 A+EFFD+LPI F + RE M+ Sbjct: 236 AHEFFDALPIHCFQSAPAPASTPKTASTSTSTVKPSSTEANSSPAYEWREMMVSPTHPAE 295 Query: 185 -----------------DSLVFNIGDHEIKSNFLTCSDYF--------LGAIFENSPCRD 219 + + + G++ E P Sbjct: 296 VASDQAKAKAAGREASAAEFQLILSSKPTRHSRYLPESSPRYRHLKQSPGSVVEICPDAS 355 Query: 220 REMQSISDRL--------ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQAD 271 + R+ + G A+++DYG + ++L+ ++ H VSP PG D Sbjct: 356 LYAADFAARIGGSDKVKKSQPCGAALILDYGTSDTIPINSLRGIRHHKLVSPFSAPGLVD 415 Query: 272 LSSHVDFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLMK----QTARKDILL 325 LS+ VDF ++ A L + ++G Q FLE +GI +RA L+K + + + Sbjct: 416 LSADVDFTAIAEAATLASDGVEVHGPVPQADFLELMGIRERAEMLIKAAGTDESTAERIR 475 Query: 326 DSVKRLVSTSADKKSMGELFKILVVSHEKVE---LMPF 360 S +RLV MG+++K L + E + F Sbjct: 476 KSWRRLVDRG--PSGMGKVYKALAILPENDGRRRPVGF 511 >gi|157825199|ref|YP_001492919.1| hypothetical protein A1C_00390 [Rickettsia akari str. Hartford] gi|157799157|gb|ABV74411.1| hypothetical protein A1C_00390 [Rickettsia akari str. Hartford] Length = 374 Score = 232 bits (592), Expect = 5e-59, Method: Composition-based stats. Identities = 120/370 (32%), Positives = 179/370 (48%), Gaps = 26/370 (7%) Query: 8 KIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFL 67 KI LIK+NG +T D + YY GDFVTAPEISQ+FGE++ ++ Sbjct: 6 KIRQLIKQNGYITCDVLMQEVLNLNPTSYYKKVKSLAGEGDFVTAPEISQLFGEIIGLWC 65 Query: 68 ICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQK 127 I W++ G P + LVELGPGRG++M D+LR L P+F+ LSI ++E ++ QK Sbjct: 66 IKEWQRIGCPKSLSLVELGPGRGLLMRDLLRTAK-LVPEFYKALSIELIEINQNFIAHQK 124 Query: 128 KQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSL 187 L I+ + + D+P T ++ANEFFD++ IKQ++ + ER+ + D Sbjct: 125 ANLQDINLPISHRSFVEDIPKKPTIIIANEFFDAMTIKQYIKVKELWYERIFVVQPVDGR 184 Query: 188 V----FNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY 243 + +I + T + GA+ E S ++ I+ L G+ ++IDYGY Sbjct: 185 IKYDKISINKPLQEYLLRTHIEAKDGAVLEESYKSIEIIKFIAQHLKTLSGSCLIIDYGY 244 Query: 244 -------LQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLT 296 + TLQAVK H Y L N G+ADLS+HVDF L ++A K+ + Sbjct: 245 DIAPNDRTGYQYNPTLQAVKNHQYCPILENLGEADLSAHVDFYTLKTVAKNNKINVIDTI 304 Query: 297 TQGKFLEGLGIWQRAFSLMKQTARK--------------DILLDSVKRLVSTSADKKSMG 342 Q FL GI R +L + + +++ V K MG Sbjct: 305 AQRDFLIENGILLRKQTLQNKLNNRHLSKLPLEVEFEKVSKQAGIIEKQVERLISPKQMG 364 Query: 343 ELFKILVVSH 352 LFK+L + H Sbjct: 365 TLFKVLQIMH 374 >gi|294788559|ref|ZP_06753801.1| hypothetical protein HMPREF9021_00950 [Simonsiella muelleri ATCC 29453] gi|294483436|gb|EFG31121.1| hypothetical protein HMPREF9021_00950 [Simonsiella muelleri ATCC 29453] Length = 386 Score = 232 bits (592), Expect = 6e-59, Method: Composition-based stats. Identities = 87/377 (23%), Positives = 161/377 (42%), Gaps = 34/377 (9%) Query: 2 ENKLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIF 59 +L I I++NG + Q+ L + P+ GYY+ + G GDF+TAP ++ +F Sbjct: 17 SAQLTEFISEKIRENGGSIPFSQFMQLALYAPKRGYYTGGAHKIGVSGDFMTAPMLTPLF 76 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 + LA + Q + E G G G++ D+L + S+ + Y++E S Sbjct: 77 AQTLANQIKPLLMQT----AANIYEFGAGTGVLAADLLNTLSG------SLKNYYIIELS 126 Query: 120 ERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIR 175 L Q+ + ++ ++ W +L + G L+ NE D++P+++ +G Sbjct: 127 SELAERQQNYIQQYAPNFAHQVTWLDTLPEQFDG--VLIGNEVLDAMPVERVRCAGNGQF 184 Query: 176 ERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFL----GAIFENSPCRDREMQSISDRLAC 231 ER+ +++ ++ F Y E + +++++++L Sbjct: 185 ERVCVAVENEQFIWQFKPLLDDDLFQAACKYLPKNVANYTSELHLTQYAFVRTLAEKLVR 244 Query: 232 DGGTAIVIDY------GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSI 284 G I IDY Y + R TL + H+ P G DL++HV+F ++ Sbjct: 245 --GAMIWIDYGFDYKQYYHEQRNDGTLIGHHRHHSIHDPFFRVGLTDLTAHVNFTDIAEA 302 Query: 285 AILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGEL 344 + +L + G TTQ FL LGI + QT + + + V + MGEL Sbjct: 303 GVQAELDLIGYTTQANFLFNLGILDLLAAQFPQTDTPEYVKAAH--AVQQLTAQHEMGEL 360 Query: 345 FKILVVSHE-KVELMPF 360 FK++ + V+ + F Sbjct: 361 FKVIAFGRDVDVDWLGF 377 >gi|77462119|ref|YP_351623.1| hypothetical protein RSP_1579 [Rhodobacter sphaeroides 2.4.1] gi|77386537|gb|ABA77722.1| conserved hypothetical protein [Rhodobacter sphaeroides 2.4.1] Length = 351 Score = 232 bits (592), Expect = 6e-59, Method: Composition-based stats. Identities = 125/298 (41%), Positives = 164/298 (55%), Gaps = 7/298 (2%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L I I G +TV Y A C+ PE GYYST PFGA GDF TAPEISQ+FGE+ Sbjct: 2 TALAVLIARRIGATGPVTVADYMAECLLHPEHGYYSTREPFGAAGDFTTAPEISQMFGEL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L + L AW G PS V L ELGPGRG +M D+LR + P F +++VE S RL Sbjct: 62 LGLCLAQAWLDQGQPSPVTLAELGPGRGTLMADLLRATRGV-PGFHEAARVHLVEASPRL 120 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 +Q++ L + W AD+P G FLVANEFFD+LPI+QFV G RERM+ + Sbjct: 121 RALQRETLGGH--PAAWLDRAADLPEGPLFLVANEFFDALPIRQFVRGPEGWRERMVGLT 178 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 + + + + D G + E P M I+ R+A GG A+ +DYG Sbjct: 179 EGRLTWGLGPETSLAALAYRLEDTAPGDVVELCPAAGPIMAEIARRIAAAGGLALAVDYG 238 Query: 243 YLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 +S GDTLQA++ H + PL PG+ADL++HVDF+ L+ A L QG+ Sbjct: 239 GWRSH-GDTLQALRAHRFDDPLAAPGEADLTAHVDFEALAQAAAPCG---TALVPQGR 292 >gi|121608437|ref|YP_996244.1| hypothetical protein Veis_1466 [Verminephrobacter eiseniae EF01-2] gi|121553077|gb|ABM57226.1| protein of unknown function DUF185 [Verminephrobacter eiseniae EF01-2] Length = 369 Score = 232 bits (592), Expect = 6e-59, Method: Composition-based stats. Identities = 90/377 (23%), Positives = 154/377 (40%), Gaps = 39/377 (10%) Query: 1 MENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYST-CNPFGA----VGDFVTAPE 54 + L + I I G + D++ L + P GYY+ FG+ DFVTAPE Sbjct: 8 LTTALQQHIRQAIAAAGGWIGFDRFMELALYTPGLGYYANDSAKFGSSPASGSDFVTAPE 67 Query: 55 ISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIY 114 ++ +FG+ LA+ L A + G +L E G G + L V Sbjct: 68 LTPLFGQTLAVQLEQALQATG---TQQLWEFGAGS------GALALQLLDALGARVQRYT 118 Query: 115 MVETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHG- 173 +V+ S L Q+ +LA++ K+ W +L + G ++ANE D++P++ Sbjct: 119 IVDLSGHLRARQQTRLAAHAHKLRWVDALPEKFSG--VVLANEVLDAMPVQLLARHGGEQ 176 Query: 174 ---IRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLA 230 ER + + +L + ++ + E + +++++DRL Sbjct: 177 GGVWHERGVALGADGALAWADRPTGLRPPVGIAG--PQDYLTEIHTQGEGFIRTLADRLE 234 Query: 231 CDGGTAIVIDY----GYLQSRVGDTLQAVKGH-TYVSPLVNPGQADLSSHVDFQRLSSIA 285 + + Y R T+ +GH PL+ G+ D+++HV+F L+ A Sbjct: 235 RGAVFLLDYGFGASEYYHPQRHMGTVMCHQGHLVDSDPLLALGRKDITAHVNFSALALTA 294 Query: 286 ILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELF 345 L++ G TTQ FL G+ + L Q R L + MGELF Sbjct: 295 QEAGLHVLGYTTQAHFLLNCGLLAKME-LRPQAERAPAAL---------LVLEHEMGELF 344 Query: 346 KILVVSH-EKVELMPFV 361 K+L + + E + FV Sbjct: 345 KVLALGAGQPWEPLGFV 361 >gi|225023751|ref|ZP_03712943.1| hypothetical protein EIKCOROL_00615 [Eikenella corrodens ATCC 23834] gi|224943633|gb|EEG24842.1| hypothetical protein EIKCOROL_00615 [Eikenella corrodens ATCC 23834] Length = 383 Score = 232 bits (591), Expect = 7e-59, Method: Composition-based stats. Identities = 86/375 (22%), Positives = 140/375 (37%), Gaps = 33/375 (8%) Query: 2 ENKLIRKIVNLIKKN-GQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIF 59 L I I+ N G + + L + P++GYY+ + GA GDF+TAP ++ +F Sbjct: 17 SQALCGLIQQRIQANHGFLPFADFMQLALYQPQYGYYTGGAHKIGAAGDFITAPALTPLF 76 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 G+ LAI L Q + E G G G + ++ + S+ Y++E S Sbjct: 77 GQTLAIQLQSLLPQT----AGNIYEFGAGTGELAAQLIGKLSG------SLRHYYIIEVS 126 Query: 120 ERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIR 175 L Q++ L + +I W T L G ++ NE D++P + Sbjct: 127 PDLAERQRRHLAAALPQHQHQITWLTELPAEFDG--IVIGNEVLDAMPCDIVRYQSGQWQ 184 Query: 176 ERMIDIDQHDSLVFN--IGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDG 233 + +D + + E+ G E + +++++ RL Sbjct: 185 LMGVGLDANQQFQWQSAPLPAELLPAAQALLPAIDGYTSELHLRQQAFIRTLAQRLTR-- 242 Query: 234 GTAIVIDY------GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAI 286 G + IDY Y R G TL + H +P + G DL+ HV+F ++ A Sbjct: 243 GALLFIDYGFDAAQYYHPQRSGGTLIGHYRHHAVHNPFEHVGLTDLTCHVNFTAIAEAAC 302 Query: 287 LYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFK 346 L + G TTQ FL L Q + T + MGELFK Sbjct: 303 QAGLDLIGYTTQAAFLLN---LGLTDLLAAQGEPESQAYIRAATACQTLLAPQEMGELFK 359 Query: 347 ILVVSHEKVEL-MPF 360 ++ F Sbjct: 360 VIAFGRNIDPDWPGF 374 >gi|304386663|ref|ZP_07368945.1| protein of hypothetical function DUF185 [Neisseria meningitidis ATCC 13091] gi|304339248|gb|EFM05326.1| protein of hypothetical function DUF185 [Neisseria meningitidis ATCC 13091] Length = 396 Score = 232 bits (591), Expect = 7e-59, Method: Composition-based stats. Identities = 90/374 (24%), Positives = 150/374 (40%), Gaps = 33/374 (8%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGE 61 KL I I K+G + ++ L + P++GYY+ + G GDF+TAP ++ +F + Sbjct: 30 KLQTLIAEKIGKHGNWIPFSRFMELVLYAPQYGYYTGGSHKIGNTGDFITAPTLTSLFAQ 89 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 LA L Q + E G G G + D+L I + Y++E S Sbjct: 90 TLARQLQELLSQT----AGNIYEFGAGTGQLAADLLGSISD------GISRYYIIEISPE 139 Query: 122 LTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 L QK + K+ T+L + G ++ NE D++P++ E G E Sbjct: 140 LAARQKNLIQARAPEASQKVVHLTALPEAFDG--IIIGNEVLDAMPVEIVRKNEGGSFEH 197 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG----AIFENSPCRDREMQSISDRLACDG 233 + +D ++ S YF E P + +++++ RL G Sbjct: 198 VGVCLDNDRFTYSARPLHDLQLSALASLYFPQTDYPYTSELHPQQYAFIRTLASRLERGG 257 Query: 234 GTAIVIDY----GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILY 288 I + Y R TL + H +P G ADL++HV+F ++ Sbjct: 258 MIFIDYGFDAAQYYHPQRNQGTLIGHYRHHVIHNPFDFIGLADLTAHVNFTDIAQAGTDA 317 Query: 289 KLYINGLTTQGKFLEGLGIWQRAFSLMK-QTARKDILLDSVKRLVSTSADKKSMGELFKI 347 L + G Q FL LGI + K +A +V++L+ D+ MGELFK+ Sbjct: 318 GLDLIGYLPQSHFLLNLGITELLAQTGKTDSAAYIREAAAVQKLI----DQHEMGELFKV 373 Query: 348 LVVSHE-KVELMPF 360 + ++ F Sbjct: 374 IAFGKNIGIDWAGF 387 >gi|218768822|ref|YP_002343334.1| hypothetical protein NMA2076 [Neisseria meningitidis Z2491] gi|121052830|emb|CAM09178.1| conserved hypothetical protein [Neisseria meningitidis Z2491] Length = 405 Score = 232 bits (591), Expect = 7e-59, Method: Composition-based stats. Identities = 90/374 (24%), Positives = 150/374 (40%), Gaps = 33/374 (8%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGE 61 KL I I K+G + ++ L + P++GYY+ + G GDF+TAP ++ +F + Sbjct: 39 KLQTLIAEEIGKHGNWIPFSRFMELVLYAPQYGYYTGGSHKIGNTGDFITAPTLTSLFAQ 98 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 LA L Q + E G G G + D+L I + Y++E S Sbjct: 99 TLARQLQELLSQT----AGNIYEFGAGTGQLAADLLGSISD------GISRYYIIEISPE 148 Query: 122 LTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 L QK + K+ T+L + G ++ NE D++P++ E G E Sbjct: 149 LAARQKNLIQARAPEASQKVVHLTALPEAFDG--IIIGNEVLDAMPVEIVRKNEGGSFEH 206 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG----AIFENSPCRDREMQSISDRLACDG 233 + +D ++ S YF E P + +++++ RL G Sbjct: 207 VGVCLDNDRFTYSARPLHDLQLSALASLYFPQTDYPYTSELHPQQYAFIRTLASRLERGG 266 Query: 234 GTAIVIDY----GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILY 288 I + Y R TL + H +P G ADL++HV+F ++ Sbjct: 267 MIFIDYGFDAAQYYHPQRNQGTLIGHYRHHVIHNPFDFIGLADLTAHVNFTDIAQAGTDA 326 Query: 289 KLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDI-LLDSVKRLVSTSADKKSMGELFKI 347 L + G Q FL LGI + K + I +V++L+ D+ MGELFK+ Sbjct: 327 GLDLIGYLPQSHFLLNLGITELLAQTGKTDSAAYICEAAAVQKLI----DQHEMGELFKV 382 Query: 348 LVVSHE-KVELMPF 360 + ++ F Sbjct: 383 IAFGKNIGIDWAGF 396 >gi|189423454|ref|YP_001950631.1| hypothetical protein Glov_0383 [Geobacter lovleyi SZ] gi|189419713|gb|ACD94111.1| protein of unknown function DUF185 [Geobacter lovleyi SZ] Length = 378 Score = 232 bits (591), Expect = 7e-59, Method: Composition-based stats. Identities = 88/364 (24%), Positives = 149/364 (40%), Gaps = 20/364 (5%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEM 62 L I + I+ G++T Y A C+ +P GYY++ G GDF T+ + FG + Sbjct: 5 TLHDLIADRIRAAGRITFADYMAACLYEPGLGYYTSPGRKVGTEGDFYTSITVHATFGRV 64 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 +A + W P+ LVE G G G + DI+ + + +PD ++ + +VE L Sbjct: 65 IAREIAAMWRSMDCPADFTLVEAGAGHGRLACDIMDFLAEQQPDCYAATKLVLVEQEPTL 124 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFT-FLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 Q LA++ +++W + F+ L +NE D++P+ + +M+ G++E + + Sbjct: 125 AEAQAALLANHAARLSWLSPAELPGFRFSGVLYSNELLDAMPVHRVLMSPQGLKEIYVTL 184 Query: 182 ---DQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 D G E S + +++ L G + Sbjct: 185 DGDQFQDQSDLPSTPALAAYLDRFGIPLHPGQEAEVSLAGLAWFEDVAECL--QQGFILT 242 Query: 239 IDYG------YLQSRVGDTLQAVKGHTYVS-PLVNPGQADLSSHVDFQRLSSIAILYKLY 291 IDYG Y R TL HT P GQ D+++H++F L L Sbjct: 243 IDYGWAKAELYSPQRNLGTLLCYYKHTVEDNPYQRLGQQDITTHINFSALIERGEELGLK 302 Query: 292 INGLTTQGKFLEGLGIWQRAFSLMK-QTARKDILLDS--VKRLVSTSADKKSMGELFKIL 348 Q +FL G+ + + KD L +KRL+ + MG+ F++L Sbjct: 303 PLWFGEQSRFLLSAGVIEELEQIEASDLPEKDKLRLRLIIKRLIMP---EGGMGDTFRVL 359 Query: 349 VVSH 352 V S Sbjct: 360 VQSK 363 >gi|257091912|ref|YP_003165553.1| hypothetical protein CAP2UW1_0267 [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] gi|257044436|gb|ACV33624.1| protein of unknown function DUF185 [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] Length = 394 Score = 232 bits (591), Expect = 7e-59, Method: Composition-based stats. Identities = 94/384 (24%), Positives = 145/384 (37%), Gaps = 46/384 (11%) Query: 2 ENKLIRKIVNLIKKN-GQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIF 59 L + I I G + D++ +L + P GYYS + FG GDFVTAPEI F Sbjct: 23 SRALSQHISTEIAAGDGWIGFDRFMSLALYAPGMGYYSGGAHKFGPAGDFVTAPEICPAF 82 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 + L R++E+G G G D+L + + S +++ S Sbjct: 83 AQTLGAQAAQILVLS----APRIIEVGAGTGEFAADLLLELERRGALPDS---YAILDLS 135 Query: 120 ERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIR 175 L Q+ L ++ W L + G ++ANE D++P + I Sbjct: 136 GELRARQQATLARRTPHLLPRVRWLERLPERFDG--LVLANEVLDAMPAHLVLWGPTHIA 193 Query: 176 ERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYF-------LGAIFENSPCRDREMQSISDR 228 ER + + D F D L + G + E S + + Sbjct: 194 ERGVGV---DEGRFVWRDRPASGPLLGRAKVLAGECEIAPGYLSEISLTAPAWITEWAGI 250 Query: 229 LACDGGTAIVIDY------GYLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRL 281 L G ++IDY Y R TL H P PG D+++HVDF + Sbjct: 251 LGQ--GALLLIDYGFPRHEYYHPQRSAGTLMCHYRHRSHTDPFYLPGLQDITTHVDFTGI 308 Query: 282 SSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMK----QTARKDILLDSVKRLVSTSAD 337 L + G TTQ FL G+ + L + R L ++V++LVS Sbjct: 309 VEAGCQAGLELLGYTTQSSFLFNCGLTE---ILSRTPVGDALRYLPLANAVQKLVS---- 361 Query: 338 KKSMGELFKILVVSHE-KVELMPF 360 MGE+FK++ + L+ F Sbjct: 362 PAEMGEIFKVIALGKGISQPLLGF 385 >gi|303280333|ref|XP_003059459.1| predicted protein [Micromonas pusilla CCMP1545] gi|226459295|gb|EEH56591.1| predicted protein [Micromonas pusilla CCMP1545] Length = 386 Score = 232 bits (591), Expect = 8e-59, Method: Composition-based stats. Identities = 118/388 (30%), Positives = 176/388 (45%), Gaps = 54/388 (13%) Query: 25 FALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVE 84 + PE+GYY + FGA GDFVT+PEISQ+FGE++ ++ WE G PS LVE Sbjct: 1 MQEALTHPEYGYYMHRDVFGARGDFVTSPEISQVFGELVGVWCASTWEALGKPSKFSLVE 60 Query: 85 LGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGD--------- 135 LGPGRG +M D+LR K K + + ++MVE S +L +Q+++L G Sbjct: 61 LGPGRGTLMSDLLRATSKFKAFT-AAMDVHMVEVSPKLREMQREKLRCSGGGGGGGGGDA 119 Query: 136 ------------KINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 + W+ + VP G ++A+EFFD++P+ QF TE G E++ + Sbjct: 120 GAAAATSELNGRPVRWHDTFDAVPEGPIIVIAHEFFDAMPVHQFTRTERGWCEKLTEAAA 179 Query: 184 HDSL---VFNIGDHEIKSNFLTCSDYFLG---------AIFENSPCRDREMQSISDRLAC 231 + + L G E SP + ++ RL Sbjct: 180 AADDGALELVLSPGLTPAGALMIPRRLDGLPAERKESIRQLEISPRSVAIWERVAGRLRV 239 Query: 232 DGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSI------A 285 G A+ +DYG + G+TLQA+K H +V L +PG ADLS++VDF + + Sbjct: 240 HPGAALAVDYG-SEGPTGNTLQAIKDHKFVDLLADPGTADLSAYVDFGAMRKVIEDGPSF 298 Query: 286 ILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQ---TARKDILLDSVKRLV--------ST 334 + G TTQ L L I R L+K+ D L++ RL+ Sbjct: 299 ARDGVRCYGPTTQRDLLGSLQIGARLERLVKECGSEEEADRLIEGCTRLMAGDEGEGEGG 358 Query: 335 SADKKSMGELFKILVVSHEKVELM-PFV 361 SAD +G +K L + + V FV Sbjct: 359 SADP-GLGIRYKALAMVSKGVGAPVGFV 385 >gi|212536208|ref|XP_002148260.1| DUF185 domain protein [Penicillium marneffei ATCC 18224] gi|210070659|gb|EEA24749.1| DUF185 domain protein [Penicillium marneffei ATCC 18224] Length = 525 Score = 232 bits (591), Expect = 9e-59, Method: Composition-based stats. Identities = 116/474 (24%), Positives = 181/474 (38%), Gaps = 123/474 (25%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYY------STCNPFGAVGDFVTAPEI 55 L + + + I+ G +++ Y + +P+ GYY S FG GDF+T+PEI Sbjct: 39 STPLAKILADAIRTTGPISIAAYMRQVLTNPDAGYYTTPSSQSKTEVFGKKGDFITSPEI 98 Query: 56 SQIFGEMLAIFLICAWEQHGFPSC-VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIY 114 +QIFGE++ I+ + W G P V L+E+GPG+G +M DILR + K S+ +IY Sbjct: 99 TQIFGELVGIWAVTEWMAQGMPKEGVELIEVGPGKGTLMDDILRTVQNFKQFSKSIENIY 158 Query: 115 MVETSERLTLIQKKQL------------------ASYGDKINWYTSLA--DVPLGFTFLV 154 +VE S L IQK L G I W + F+ Sbjct: 159 LVEASAPLREIQKNLLCGPDAVLEEIDIGYRGINKHTGAPIVWVEDIRLLPYNDKMPFIF 218 Query: 155 ANEFFDSLPIKQF-------------------------------------VMTEHGIRER 177 A+EFFD+LPI F T RE Sbjct: 219 AHEFFDALPIHAFESVQPSPESEEQPKQQIMTPTGPITLDSPTQTNANNKQPTGPQWREL 278 Query: 178 MIDID-------QHDSLVFNIGDHEIKS----------NFLTCSDYFLGAIFENSPCRDR 220 M+ ++ D F + +I + G++ E SP Sbjct: 279 MVALNSKSVLEDVKDEPEFQLSRAKISTPNSLLLAEISERYRALKSQPGSVIEVSPESRI 338 Query: 221 EMQSISDRL----------------------------------ACDGGTAIVIDYGYLQS 246 + + R+ G A+++DYG + Sbjct: 339 YVADFARRIGGYTPPEPRPPKRKPGEVAKPVTPIDNSAALKKKERPSGAALILDYGTSST 398 Query: 247 RVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILY--KLYINGLTTQGKFLEG 304 ++L+ ++ H +SP PGQ D+S+ VDF L+ A+ + ++G Q ++L Sbjct: 399 IPVNSLRGIRQHKAISPFAYPGQVDVSADVDFISLAEAALEASEGVEVHGPVDQAEYLHS 458 Query: 305 LGIWQRAFSLMKQTARKDI----LLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 LGI +RA L+K L + KRLV MG+L+K L + E Sbjct: 459 LGIAERAEQLLKHMPEDSEKHKTLQTAWKRLVDKG--PNGMGKLYKALAIVPEN 510 >gi|15676322|ref|NP_273458.1| hypothetical protein NMB0409 [Neisseria meningitidis MC58] gi|7225632|gb|AAF40848.1| conserved hypothetical protein [Neisseria meningitidis MC58] Length = 403 Score = 232 bits (591), Expect = 9e-59, Method: Composition-based stats. Identities = 93/376 (24%), Positives = 152/376 (40%), Gaps = 37/376 (9%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGE 61 KL I I K+G + ++ L + P++GYY+ + G GDF+TAP ++ +F + Sbjct: 37 KLQTLIAEKIGKHGNWIPFSRFMELVLYAPQYGYYTGGSHKIGNTGDFITAPTLTSLFAQ 96 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 LA L Q + E G G G + D+L I + Y++E S Sbjct: 97 TLARQLQELLSQT----AGNIYEFGAGTGQLAADLLGSISD------GISRYYIIEISPE 146 Query: 122 LTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 L QK + K+ T+L + G ++ NE D++P++ E G E Sbjct: 147 LAARQKNLIQARAPEASQKVVHLTALPEAFDG--IIIGNEVLDAMPVEIVRKNEGGSFEH 204 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG----AIFENSPCRDREMQSISDRLACDG 233 + +D ++ S YF E P + +++++ RL Sbjct: 205 VGVCLDNDRFTYSARPLHDLQLSALASLYFPQTDYPYTSELHPQQYAFIRTLASRLE--H 262 Query: 234 GTAIVIDY------GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAI 286 G I IDY Y R TL + H +P G ADL++HV+F ++ Sbjct: 263 GCMIFIDYGFDAAQYYHPQRNQGTLIGHYRHHIIHNPFDFIGLADLTAHVNFTDIAQAGT 322 Query: 287 LYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDI-LLDSVKRLVSTSADKKSMGELF 345 L + G Q FL LGI + K + I +V++L+ D+ MGELF Sbjct: 323 DAGLDLIGYLPQSHFLLNLGITELLAQTGKTDSAAYICEAAAVQKLI----DQHEMGELF 378 Query: 346 KILVVSHE-KVELMPF 360 K++ ++ F Sbjct: 379 KVIAFGKNIGIDWAGF 394 >gi|269214876|ref|ZP_05987318.2| conserved hypothetical protein [Neisseria lactamica ATCC 23970] gi|269208874|gb|EEZ75329.1| conserved hypothetical protein [Neisseria lactamica ATCC 23970] Length = 396 Score = 231 bits (590), Expect = 9e-59, Method: Composition-based stats. Identities = 88/368 (23%), Positives = 153/368 (41%), Gaps = 36/368 (9%) Query: 4 KLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGE 61 L + I K + + ++ L + P++GYY+ + G GDF+TAP ++ +F Sbjct: 30 NLQTLLAEEIGKHDNWIPFSRFMELVLYAPQYGYYTGGSHKIGNDGDFITAPTLTPLFAR 89 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 LA L Q + E G G G + D+L + + Y++E S Sbjct: 90 TLARQLQELLPQT----AGNIYEFGAGTGQLAADLLNNLSD------GINRYYIIEISPE 139 Query: 122 LTLIQKKQLASY----GDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 L + QK + ++ K+ ++L ++ G ++ NE D++P++ E G+ E Sbjct: 140 LAVRQKNLIQAHAPEASHKVIHLSALPEIFDG--IIIGNEVLDAMPVEIIRKDEGGLFEH 197 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIF----ENSPCRDREMQSISDRLACDG 233 + + ++ S + S YF F E P + +++++ RL Sbjct: 198 VGVCLDNGRFAYSARPLNDPSLSASASLYFPQTDFPYTGELHPQQYAFIRTLASRLE--H 255 Query: 234 GTAIVIDY------GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAI 286 G I IDY Y R TL + H +P G ADL++HV+F ++ Sbjct: 256 GCMIFIDYGFDAAQYYHPQRSQGTLIGHYRHHVIHNPFDFIGLADLTAHVNFTDIAQAGT 315 Query: 287 LYKLYINGLTTQGKFLEGLGIWQRAFSLMK-QTARKDILLDSVKRLVSTSADKKSMGELF 345 L + G Q FL LGI + K +A +V++L+ D+ MGELF Sbjct: 316 DAGLDLIGYLPQSHFLLNLGITELLAQTGKTDSAAYIREAAAVQKLI----DQHEMGELF 371 Query: 346 KILVVSHE 353 K++ Sbjct: 372 KVIAFGKN 379 >gi|319791727|ref|YP_004153367.1| hypothetical protein Varpa_1038 [Variovorax paradoxus EPS] gi|315594190|gb|ADU35256.1| protein of unknown function DUF185 [Variovorax paradoxus EPS] Length = 370 Score = 231 bits (590), Expect = 9e-59, Method: Composition-based stats. Identities = 86/358 (24%), Positives = 147/358 (41%), Gaps = 30/358 (8%) Query: 14 KKNGQMTVDQYFALCVADPEFGYY-STCNPFG----AVGDFVTAPEISQIFGEMLAIFLI 68 + G + D++ AL + P GYY +T FG + DFVTAPE++ +FG+ LA + Sbjct: 23 RAGGWIGFDRFMALALYAPGLGYYANTSAKFGHMPSSGSDFVTAPELTPMFGQTLAAQVA 82 Query: 69 CAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKK 128 A E+ G + E G G G + + +L + S + +V+ S L Q++ Sbjct: 83 EALEKTG---TDTVWEFGAGSGALAVQLLHAL---DEMGRSDVRYRIVDLSGTLRERQQQ 136 Query: 129 QLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLV 188 L Y ++ W L + G +V NE D++P++ ER + + Sbjct: 137 ALVRYAGRVEWLGELPESMQG--VVVGNEVLDAMPVQLLARVNGQWFERG-VVRNAGNDG 193 Query: 189 FNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG----YL 244 + D + + E P + +++DRL I + Y Sbjct: 194 WTWADRPTELRPPVEVPGEHDYLTEIHPQAQAFIATLADRLEKGAAFLIDYGFPESEYYH 253 Query: 245 QSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLE 303 R T+ + H PL + G D+++HVDF ++ L + G T+Q +FL Sbjct: 254 PQRHMGTVMCHRAHQADGDPLADVGYKDITAHVDFTGIAVAGQEAGLEVLGYTSQARFLL 313 Query: 304 GLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVS-HEKVELMPF 360 G+ R A + + RL+ + MGELFK++ + E + M F Sbjct: 314 NCGLLARMEQ--GTVAERAMAA----RLIH----EHEMGELFKVVGFAVGEAWDAMGF 361 >gi|316984915|gb|EFV63871.1| conserved hypothetical protein [Neisseria meningitidis H44/76] gi|325140924|gb|EGC63431.1| hypothetical protein NMBCU385_0366 [Neisseria meningitidis CU385] gi|325199598|gb|ADY95053.1| conserved hypothetical protein [Neisseria meningitidis H44/76] Length = 380 Score = 231 bits (590), Expect = 1e-58, Method: Composition-based stats. Identities = 93/376 (24%), Positives = 152/376 (40%), Gaps = 37/376 (9%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGE 61 KL I I K+G + ++ L + P++GYY+ + G GDF+TAP ++ +F + Sbjct: 14 KLQTLIAEKIGKHGNWIPFSRFMELVLYAPQYGYYTGGSHKIGNTGDFITAPTLTSLFAQ 73 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 LA L Q + E G G G + D+L I + Y++E S Sbjct: 74 TLARQLQELLSQT----AGNIYEFGAGTGQLAADLLGSISD------GISRYYIIEISPE 123 Query: 122 LTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 L QK + K+ T+L + G ++ NE D++P++ E G E Sbjct: 124 LAARQKNLIQARAPEASQKVVHLTALPEAFDG--IIIGNEVLDAMPVEIVRKNEGGSFEH 181 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG----AIFENSPCRDREMQSISDRLACDG 233 + +D ++ S YF E P + +++++ RL Sbjct: 182 VGVCLDNDRFTYSARPLHDLQLSALASLYFPQTDYPYTSELHPQQYAFIRTLASRLE--H 239 Query: 234 GTAIVIDY------GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAI 286 G I IDY Y R TL + H +P G ADL++HV+F ++ Sbjct: 240 GCMIFIDYGFDAAQYYHPQRNQGTLIGHYRHHIIHNPFDFIGLADLTAHVNFTDIAQAGT 299 Query: 287 LYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDI-LLDSVKRLVSTSADKKSMGELF 345 L + G Q FL LGI + K + I +V++L+ D+ MGELF Sbjct: 300 DAGLDLIGYLPQSHFLLNLGITELLAQTGKTDSAAYICEAAAVQKLI----DQHEMGELF 355 Query: 346 KILVVSHE-KVELMPF 360 K++ ++ F Sbjct: 356 KVIAFGKNIGIDWAGF 371 >gi|146413170|ref|XP_001482556.1| hypothetical protein PGUG_05576 [Meyerozyma guilliermondii ATCC 6260] Length = 520 Score = 231 bits (590), Expect = 1e-58, Method: Composition-based stats. Identities = 108/416 (25%), Positives = 178/416 (42%), Gaps = 61/416 (14%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGA-VGDFVTAPEISQIFGE 61 L + +IK NG +++ Y C+ P++GYY+T NP GDF+T+PEIS +FGE Sbjct: 105 ESLSDLLAEIIKTNGPLSLLAYMRQCLTHPDYGYYTTTNPLDKYTGDFITSPEISSVFGE 164 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLS--IYMVETS 119 M+ I+L W P +R++E GPG+G +M D++R KL I ++E S Sbjct: 165 MIGIWLFSTWTSQDNPQNIRIIEFGPGKGTLMFDVVRTFNKLAKSRIRSDQIEICLIEAS 224 Query: 120 ERLTLIQKKQLASYG---------------------DKINWYTSLADVPLGFTFLVANEF 158 L Q + L + + D +++A+EF Sbjct: 225 PILRDEQAELLCGSKLNSADIKDSFYTKSSIWGNTVKWLETEKDILDDVQYANYILAHEF 284 Query: 159 FDSLPIKQFVMTEHGIRERMI--------------------DIDQHDSLVFNIGDHE--- 195 FD+LPIK F ++ G RE ++ + F++ Sbjct: 285 FDALPIKSFQKSDSGWRELLVEHSPSVLNTQGALPSGGSSSEFSPDLETDFHLTVSPKDT 344 Query: 196 ----IKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGT--AIVIDYGYLQSRVG 249 I + G+ E + ++ + + G A++IDYG Sbjct: 345 PSSLIPELSSRFNALPTGSRIEICTDAELYALKMASLINNEQGNGAALIIDYGLKSGIPS 404 Query: 250 DTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQ 309 ++L+ + H +VSP +PG+ DLS+ VDF+ L++I L G QG +L +GI Sbjct: 405 NSLRGIYKHKFVSPFFSPGKVDLSADVDFENLAAITAKACLSF-GPVDQGDWLHEMGIGY 463 Query: 310 RAFSLMK----QTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM-PF 360 R L+K A +D + S +RL ++ SMG +KIL + ++ F Sbjct: 464 RIDQLLKSNEGNPAEQDKVYASYRRLTDK--NENSMGGAYKILCLVPHSAQMPIGF 517 >gi|270158226|ref|ZP_06186883.1| conserved hypothetical protein [Legionella longbeachae D-4968] gi|289163517|ref|YP_003453655.1| hypothetical protein LLO_0173 [Legionella longbeachae NSW150] gi|269990251|gb|EEZ96505.1| conserved hypothetical protein [Legionella longbeachae D-4968] gi|288856690|emb|CBJ10501.1| putative conserved hypothetical protein [Legionella longbeachae NSW150] Length = 370 Score = 231 bits (590), Expect = 1e-58, Method: Composition-based stats. Identities = 82/371 (22%), Positives = 152/371 (40%), Gaps = 25/371 (6%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGEM 62 +++ + + + + ++ L + P GYYS+ G GDF+TAPE++ +FG+ Sbjct: 2 SILQTLQEQLAQRQAIPFVEFMQLALYAPGEGYYSSGLQKLGKQGDFITAPELTPLFGKT 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 LA + ++ PS + E G G G + + IL + + + Y++E S L Sbjct: 62 LANQCLQVFDVLESPS---IFEFGAGSGALCVSILEYLAEYNSLP---EAYYILEVSANL 115 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTF---LVANEFFDSLPIKQFVMTEHGIRERMI 179 Q++ +A + + D F ++ANE D++P+ +F+ T GI E + Sbjct: 116 CHRQREMVAQKIPHLAHLVTWLDRWPETPFNGVVLANEVLDAMPVHRFMNTNQGIMESYV 175 Query: 180 DIDQHDSLVFNIGDHEIK----SNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGT 235 +D+ +V + K S + + E + D + + L Sbjct: 176 RLDEQQQVVEIFKPCQNKRLQHYINHKISSLGVPYLSEVNLFIDDWILNTYRMLKQGVVF 235 Query: 236 AIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSIAILYKL 290 I + Y R TL H +P +N G D+++HVDF ++ Sbjct: 236 LIDYGFPRHEYYHPDRNQGTLMCHYQHHSHSNPFLNLGAQDITAHVDFTHVAEAGQQAGF 295 Query: 291 YINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVV 350 ++ G T Q FL G+ ++ ++ K+ + MGELFK++ + Sbjct: 296 HVAGYTNQASFLLANGLLSFI-----NSSDSELEQMRAKQAIKQLTQPSEMGELFKVIAL 350 Query: 351 SHE-KVELMPF 360 S E + L F Sbjct: 351 SKEIDIALHGF 361 >gi|239813984|ref|YP_002942894.1| hypothetical protein Vapar_0977 [Variovorax paradoxus S110] gi|239800561|gb|ACS17628.1| protein of unknown function DUF185 [Variovorax paradoxus S110] Length = 360 Score = 231 bits (590), Expect = 1e-58, Method: Composition-based stats. Identities = 91/361 (25%), Positives = 160/361 (44%), Gaps = 36/361 (9%) Query: 14 KKNGQMTVDQYFALCVADPEFGYY-STCNPFG----AVGDFVTAPEISQIFGEMLAIFLI 68 + G + D++ AL + P GYY ++ FG + DFVTAPE++ +FG+ LA + Sbjct: 13 RAGGWIGFDRFMALALYAPGLGYYANSSAKFGHMPSSGSDFVTAPELTPMFGQALAAQVA 72 Query: 69 CAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKK 128 A E+ G + E G G G + + +L + ++ + +V+ S L Q++ Sbjct: 73 EALEKTG---TDTVWEFGAGTGALAVQLLHALDEIGRGD---VRYRIVDLSGTLRERQQQ 126 Query: 129 QLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQH-DSL 187 LA Y D++ W L + G +V NE D++P++ ER + + D Sbjct: 127 ALARYTDRVEWLGELPEAMQG--VVVGNEVLDAMPVQLLARVGGQWFERGVVRNVRADGW 184 Query: 188 VFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDY------ 241 + + ++ F + + E P + + +++DRL G A IDY Sbjct: 185 AWADRETALRPPFEVPGE--HDYLTEIHPQAEAFVATLADRL--KSGAAFFIDYGFPERE 240 Query: 242 GYLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 Y R T+ + H PL + G D+++HV+F ++ L + G T+Q + Sbjct: 241 YYHPQRHMGTVMCHRAHQADGDPLSDVGYKDITAHVNFTGIALAGQEAGLEVLGYTSQAR 300 Query: 301 FLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVS-HEKVELMP 359 FL G+ +R + + + + RLV + MGELFK++ + E E M Sbjct: 301 FLLNCGLLERMGQ----GSTAERGMAA--RLVH----EHEMGELFKVVGFAVGEGWEAMG 350 Query: 360 F 360 F Sbjct: 351 F 351 >gi|269213858|ref|ZP_05983006.2| conserved hypothetical protein [Neisseria cinerea ATCC 14685] gi|269145211|gb|EEZ71629.1| conserved hypothetical protein [Neisseria cinerea ATCC 14685] Length = 396 Score = 231 bits (589), Expect = 1e-58, Method: Composition-based stats. Identities = 88/378 (23%), Positives = 152/378 (40%), Gaps = 37/378 (9%) Query: 2 ENKLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIF 59 L + I+K+G + ++ L + P++GYY+ + G GDF+TAP ++ +F Sbjct: 28 SANLQTLLAEEIRKHGNWIPFSRFMELVLYTPQYGYYTGGSHKIGNNGDFITAPTLTPLF 87 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 LA L Q + E G G G + D+L + + Y++E S Sbjct: 88 ARTLARQLQELLPQT----AGNIYEFGAGTGQLAADLLNNLSD------GINRYYIIEIS 137 Query: 120 ERLTLIQKKQLASYGDK----INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIR 175 L QK + + + I ++L + G ++ NE D++P++ E G Sbjct: 138 SELAARQKDLIQTLAPQAAQKIVHLSALPETFNG--IIIGNEVLDAMPVEIIRKDEGGSF 195 Query: 176 ERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG----AIFENSPCRDREMQSISDRLAC 231 E + +D ++ S YF E P + +++++ RL Sbjct: 196 EHVGVCLDNDRFTYSARPLHDLQLSALASLYFSKISSPYTSELHPQQYAFIRTLASRLE- 254 Query: 232 DGGTAIVIDY------GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSI 284 G I IDY Y R TL + H +P G +DL++HV+F ++ Sbjct: 255 -HGCMIFIDYGFDAAQYYHPQRNQGTLIGHYRHHVIHNPFDFIGLSDLTAHVNFTDIAQA 313 Query: 285 AILYKLYINGLTTQGKFLEGLGIWQRAFSLMK-QTARKDILLDSVKRLVSTSADKKSMGE 343 L + G Q FL LGI + K +A +V++L+ D+ MGE Sbjct: 314 GTDAGLDLIGYLPQSHFLLNLGITELLAQTGKTDSAAYIREAAAVQKLI----DQHEMGE 369 Query: 344 LFKILVVSHE-KVELMPF 360 LFK++ ++ F Sbjct: 370 LFKVIAFGKNINIDWTGF 387 >gi|282901111|ref|ZP_06309043.1| protein of unknown function DUF185 [Cylindrospermopsis raciborskii CS-505] gi|281194010|gb|EFA68975.1| protein of unknown function DUF185 [Cylindrospermopsis raciborskii CS-505] Length = 396 Score = 231 bits (589), Expect = 1e-58, Method: Composition-based stats. Identities = 92/385 (23%), Positives = 163/385 (42%), Gaps = 29/385 (7%) Query: 3 NKLIRKIVNLIKKNG--QMTVDQYFALCVADPEFGYYSTCN-PFG-AVGDFVTAPEISQI 58 N L + I + I + ++T +Y L + E+GYYS+ + G A GDF T+P + Sbjct: 10 NPLYQIIADYIDTSDQRRITFAEYMDLVLYHSEYGYYSSHSGQIGFAGGDFFTSPSLGDD 69 Query: 59 FGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVET 118 FGE+LA + WE P LVE+G G G++ IL+ + PDF+ ++ ++E Sbjct: 70 FGELLAKQFLQMWENLDQPRPFHLVEMGGGTGVLAFQILKFLKNHHPDFWEIIEYIIIEK 129 Query: 119 SERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERM 178 S +L Q+++L + + + + F +NE D+ P+ QF++ + ++E Sbjct: 130 SPKLKWEQQQRLEGFSVQWLDLPEILPGSMVGCF-FSNELVDAFPVHQFILEKGKLQEIY 188 Query: 179 IDIDQHDSLVFNIGDHEIKSNF-----------LTCSDYFLGAIFENSPCRDREMQSISD 227 + + + F E + ++ + Y E + + +++ Sbjct: 189 VTYRASNPIEFMEVVGEPSTPKLAEYLQLVEIDISQNAYPENYRSEINLAALDWLSIVAN 248 Query: 228 RLACDGGTAIVIDYG----YLQSRVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLS 282 L I Y Y R TLQ H + +P + G+ D+++++DF L Sbjct: 249 CLQRGYVLTIDYGYPATRYYHPRRSQGTLQCYYQHRYHHNPYIKVGEQDITTYIDFTALE 308 Query: 283 SIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMG 342 + L G T QG FL LG+ R +L Q LL + L + +G Sbjct: 309 NWGKRCGLNPVGWTQQGLFLMALGLGDRISALSYQQHPLSQLLKRREAL-HQLISPEGLG 367 Query: 343 ELFKILVVSH------EKVELMPFV 361 F +LV S ++ L F+ Sbjct: 368 N-FGVLVQSKGLTTTQSQLPLQGFI 391 >gi|171322816|ref|ZP_02911539.1| protein of unknown function DUF185 [Burkholderia ambifaria MEX-5] gi|171091827|gb|EDT37326.1| protein of unknown function DUF185 [Burkholderia ambifaria MEX-5] Length = 412 Score = 231 bits (589), Expect = 1e-58, Method: Composition-based stats. Identities = 85/371 (22%), Positives = 150/371 (40%), Gaps = 34/371 (9%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGAV----GDFVTAPEI 55 L ++ + I G + D++ + P GYYS FG DFVTAPE+ Sbjct: 38 SETLAAQLRDEIAAAGGWLPFDRFMERALYAPGLGYYSGGARKFGRRADDGSDFVTAPEL 97 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F LA + A G R++E G G G + +L + L + L + Sbjct: 98 SPLFARTLANPVADALAASG---TRRVMEFGAGTGKLAAGLLAALDALGVELDEYL---I 151 Query: 116 VETSERLTLIQKKQLASYGDK----INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 V+ S L Q+ +A+ + W +L + G +V NE D++P++ F Sbjct: 152 VDLSGELRERQRDTIAAAAPALAGKVRWLDALPERFDG--VVVGNEVLDAMPVRLFAKAG 209 Query: 172 HGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRL 229 ER + +D + F+ + D G + E +++ L Sbjct: 210 GAWLERGVALDARHAFAFDDRPAGAGGLPTVLATLDVDDGYVTETHEAALAFTRTVCTML 269 Query: 230 ACDGGTAIVIDY------GYLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHVDFQRLS 282 A G +++DY Y R T + + H + + PG D+++HV+F + Sbjct: 270 AR--GAVLLVDYGFPAHEYYHPQRDRGTLMCHYRHHAHDDAFLYPGLQDITAHVEFTGIY 327 Query: 283 SIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARK-DILLDSVKRLVSTSADKKSM 341 + + G T+Q +FL GI ++ + ++V++L+S + M Sbjct: 328 DAGVGTGADLLGYTSQARFLLNAGITDALAAIDPSDVHQFLPAANAVQKLIS----EAEM 383 Query: 342 GELFKILVVSH 352 GELFK++ S Sbjct: 384 GELFKVIAFSR 394 >gi|226288654|gb|EEH44166.1| DUF185 domain-containing protein [Paracoccidioides brasiliensis Pb18] Length = 509 Score = 231 bits (588), Expect = 1e-58, Method: Composition-based stats. Identities = 119/448 (26%), Positives = 182/448 (40%), Gaps = 98/448 (21%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-------NPFGAVGDFVTAPE 54 L + I I G +++ Y C+ P+ GYY++ FGA GDFVT+PE Sbjct: 48 STPLAKTIAEAISVTGPISIAAYMRQCLTSPDGGYYTSRGQEAEGTEVFGAKGDFVTSPE 107 Query: 55 ISQIFGEMLAIFLICAWEQHGFPS-CVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSI 113 ISQIFGE+L I+ + W G V+++ELGPG+G +M D+LR I K ++ ++ Sbjct: 108 ISQIFGELLGIWTVAEWMGQGRRKGGVQIIELGPGKGTLMADMLRSIRNFKTFASAIEAV 167 Query: 114 YMVETSERLTLIQKKQLASYGD--------------------KINWYTSLADVPLGFTFL 153 Y+VE S L +Q K L L D P F+ Sbjct: 168 YLVEASTVLREVQHKLLCGDAPTEEIEVGYKSTSVHLGVPVIWTEHIKLLPDEPDKTPFI 227 Query: 154 VANEFFDSLPIKQFVMTE----------------------------HGIRERM------- 178 A+EFFD+LPI F E RE + Sbjct: 228 FAHEFFDALPIHAFQSIETPPRSQTINTPTGPATLHNPPATSSSPATQWRELVVSPNPEI 287 Query: 179 IDIDQHDSLVFNIG--DHEIKSNFLTCS--------DYFLGAIFENSPCRDREMQSISDR 228 ++ + F++ S+ + G+ E SP Q I+ R Sbjct: 288 PELKSGNEPEFHLSLAKSPTPSSLVLPEMSPRYKAMKSTPGSTIEISPEGQTCAQDIARR 347 Query: 229 L---------------ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLS 273 + G A+++DYG + ++L+ ++ H VSP PGQ D+S Sbjct: 348 IGGSFSSSSSEQSNKKRVPSGAALILDYGTTSTIPINSLRGIRKHQLVSPFAVPGQVDIS 407 Query: 274 SHVDFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLMKQ------TARKDILL 325 ++VDF L+ AI + + G Q +FLE LGI +RA L+ + ++ + Sbjct: 408 ANVDFTALAEAAIDASPGVEVYGPVEQCQFLEALGISKRASQLLTKVEGEGGEEKRKRIE 467 Query: 326 DSVKRLVSTSADKKSMGELFKILVVSHE 353 KRLV MG+L+K L + E Sbjct: 468 SGWKRLVERGG--GGMGKLYKALAIVPE 493 >gi|298369648|ref|ZP_06980965.1| hypothetical protein HMPREF9016_02089 [Neisseria sp. oral taxon 014 str. F0314] gi|298282205|gb|EFI23693.1| hypothetical protein HMPREF9016_02089 [Neisseria sp. oral taxon 014 str. F0314] Length = 383 Score = 231 bits (588), Expect = 2e-58, Method: Composition-based stats. Identities = 89/366 (24%), Positives = 148/366 (40%), Gaps = 33/366 (9%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGE 61 +L R I + I NG + ++ L + P+FGYY+ + GA GDF+TAP +S +FG+ Sbjct: 18 QLCRLISDEISDNGNWIPFSRFMELALYAPDFGYYTGGSHKIGAGGDFITAPVLSPLFGK 77 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 L L Q + E G G G + + +L+ ++ +VE S Sbjct: 78 TLFAQLSVLLPQT----AGNIYEFGAGTGDLAVSLLQNFSDGLSHYY------IVELSPE 127 Query: 122 LTLIQKKQL-----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRE 176 L Q+ + KI +L D G ++ NE D++P++ ++ Sbjct: 128 LAERQRAMISNSLPPETARKIIHLDTLPDEFDG--IVIGNEVLDAMPVELVRKEGGNFQQ 185 Query: 177 RMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG---AIFENSPCRDREMQSISDRLACDG 233 + I + V + + +YF E P + + +++ +L G Sbjct: 186 IGVSIK-NGEFVQVPKTLATPTLLRSAENYFPDAEPYTSELHPAQHAFVHTVASKLRRGG 244 Query: 234 GTAI-----VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILY 288 I Y + Q +G + + HT P PG DL++HV+F ++ A Sbjct: 245 MIFIDYGFDAAQYYHPQRHMGTLIGHYRHHTVHDPFFLPGLTDLTAHVNFTDIAQAATDA 304 Query: 289 KLYINGLTTQGKFLEGLGIWQRAFSL-MKQTARKDILLDSVKRLVSTSADKKSMGELFKI 347 L + G TTQ FL LGI TA +V++LV + MGELFK+ Sbjct: 305 GLDLIGYTTQANFLLNLGITDLLAQTGHPDTAAYLTAAAAVQQLV----NPHEMGELFKV 360 Query: 348 LVVSHE 353 + Sbjct: 361 IAFGKN 366 >gi|222054854|ref|YP_002537216.1| protein of unknown function DUF185 [Geobacter sp. FRC-32] gi|221564143|gb|ACM20115.1| protein of unknown function DUF185 [Geobacter sp. FRC-32] Length = 382 Score = 230 bits (587), Expect = 2e-58, Method: Composition-based stats. Identities = 95/374 (25%), Positives = 161/374 (43%), Gaps = 20/374 (5%) Query: 1 MENK--LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQ 57 ME+ L ++ I K G++T + A C+ +P GYY++ GA GDF T+ + Sbjct: 1 MESTPALKDILLERIWKAGRLTFADFMAACLYEPGLGYYTSPGRKVGAEGDFYTSMNVHL 60 Query: 58 IFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVE 117 +FG ++A + WE G P + E G G G + DIL I + F+ VL+ ++E Sbjct: 61 MFGRLIAREISRMWEILGSPESFTIAEAGAGGGQLARDILDTIAETNRSFYDVLTYRLIE 120 Query: 118 TSERLTLIQKKQLASYGDKINW--YTSLADVPLGFT-FLVANEFFDSLPIKQFVMTEHGI 174 L Q+++L + +++W LA L F+ +++NE D++P+ MT G+ Sbjct: 121 KEPTLKEAQQEKLTRHLARLSWSAPEDLAAGRLHFSGCVLSNELIDAMPVHLVEMTPAGL 180 Query: 175 RERMIDIDQHDSLVFNIGDHEIKSNFLTCSD----YFLGAIFENSPCRDREMQSISDRLA 230 E + + + + + G E + ++S++D L Sbjct: 181 MEVYVT-AIDGEFGEMLDEPSTPALADYLKENGVTLLAGQRGEINLAATGWLRSVADTLE 239 Query: 231 CDGGTAIVIDYG------YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSS 283 G + IDYG Y R TL HT P G D++SHV+F L Sbjct: 240 K--GFVLTIDYGYEADELYAPMRKNGTLLCYYQHTTCEDPYTRVGAQDITSHVNFTALIR 297 Query: 284 IAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGE 343 + L+ Q +FL G G+ + +L K A + LL + L MG+ Sbjct: 298 EGVKCGLHRAWFGEQYRFLLGAGLMEEMLALEKSGATETELLKTRLALKKLMLPDGGMGD 357 Query: 344 LFKILVVSHEKVEL 357 F++L+ + E + Sbjct: 358 TFRVLIQAKEVEDP 371 >gi|171687044|ref|XP_001908463.1| hypothetical protein [Podospora anserina S mat+] gi|170943483|emb|CAP69136.1| unnamed protein product [Podospora anserina S mat+] Length = 529 Score = 230 bits (587), Expect = 2e-58, Method: Composition-based stats. Identities = 110/448 (24%), Positives = 176/448 (39%), Gaps = 97/448 (21%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYST-----CNPFGAVGDFVTAPEIS 56 L +++ I+ G + + + +C+ GYY+ + FG GDFVT+PEIS Sbjct: 69 STPLAKQLAAAIELTGPIPLASFMRMCLTSDIGGYYTGAIEKDRDQFGLKGDFVTSPEIS 128 Query: 57 QIFGEMLAIFLICAWEQHGFPSC-VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 Q+FGE++ ++ + W G S V L+E+GPGRG +M D+LR I S+ +IYM Sbjct: 129 QVFGELIGVWFLTEWLAQGRQSRGVELIEVGPGRGTLMDDVLRTIQSFPAMANSIDAIYM 188 Query: 116 VETSERLTLIQKKQLASYGD---------------------KINWYTSLADVPLGFTFLV 154 VE S L + QK L S+ P F+V Sbjct: 189 VEASPELRMAQKNLLCGEDAPMTESKVGYHSVCKYNALPIVWTETIKSIPIAPQKMPFIV 248 Query: 155 ANEFFDSLPIKQFVMTEH-------------------------GIRERMIDIDQHDSLVF 189 A+EFFD+LPI F + RE ++ S Sbjct: 249 AHEFFDALPIHAFELVSVPASKSEAPSSTDNSSPSSKTTTPTLQWREMLVSPTPPGSTHE 308 Query: 190 NIGDHEIKSNFLTCSDYFL-----------------------------GAIFENSPCRDR 220 ++ +S D+ L GA+ E P Sbjct: 309 SLKTPATQSRETPPPDFQLTRATSSTRHSLYLPESSPRFRALKSSVGPGALLEVCPDASL 368 Query: 221 EMQSISDRL--------ACDGGTAIVIDY-GYLQSRVGDTLQAVKGHTYVSPLVNPGQAD 271 + R+ G A+++DY + ++L+ ++ H VSP PG D Sbjct: 369 YASDFAARIGGSPQHPKPKPSGAALILDYGPGDGTIPTNSLRGIRKHRRVSPFAEPGLTD 428 Query: 272 LSSHVDFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDI---LLD 326 LS VDF ++ A + ++G QG FLE +GI +RA L K+ ++ + Sbjct: 429 LSVDVDFAGIAESATRASEGVEVHGPVAQGDFLELMGIRERAEVLAKRAGEEEKGRAVEK 488 Query: 327 SVKRLVSTSADKKSMGELFKILVVSHEK 354 + +RLV MG++++ L + E Sbjct: 489 AWRRLVDRG--PGGMGKVYQALAIVPEN 514 >gi|114328975|ref|YP_746132.1| hypothetical protein GbCGDNIH1_2311 [Granulibacter bethesdensis CGDNIH1] gi|114317149|gb|ABI63209.1| hypothetical protein GbCGDNIH1_2311 [Granulibacter bethesdensis CGDNIH1] Length = 366 Score = 230 bits (587), Expect = 2e-58, Method: Composition-based stats. Identities = 112/344 (32%), Positives = 164/344 (47%), Gaps = 20/344 (5%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCV 80 +D++ A YY+T +PF DF TAPEISQ FGE+L ++ W+Q G P+ + Sbjct: 30 LDRFMARA----NAAYYATHDPF---ADFTTAPEISQAFGEVLGLWAAMGWQQIGAPARI 82 Query: 81 RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWY 140 L E GPGRG +M D L I + P F +I +ETS RL IQ +++ + Sbjct: 83 VLAEAGPGRGTLMADALSAIRRAIPAFADAATIMFIETSPRLRAIQAQRVPE----ARFA 138 Query: 141 TSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNF 200 L ++P L+ NEF D+LPI++FV + RE+ +++ S E Sbjct: 139 ADLDEIPDAPLILLGNEFLDALPIRRFVRQKAEWREQFVEVQPDGSATLTTDPMEPPFP- 197 Query: 201 LTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTY 260 G I E S + ++ RL G A+ +DYG QS GD+LQA+ Sbjct: 198 CLPETVAEGTIREWSEASHDIITRLAGRLTRQSGMALFLDYGPAQSGFGDSLQALAQGVP 257 Query: 261 VSPLVNP-GQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLM--KQ 317 V P P G ADL++HVDF L A G QG+FL LG+ QR L K Sbjct: 258 VDPFSLPAGAADLTAHVDFSALLDTARSAGANGYGPVAQGRFLLSLGLMQRCQRLASGKP 317 Query: 318 TARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM-PF 360 A ++++ +RL + MG LFK + ++ + + F Sbjct: 318 AAMARQIMNAGQRLTQA----EYMGTLFKAMALTSPGLPVPAGF 357 >gi|323455216|gb|EGB11085.1| hypothetical protein AURANDRAFT_11471 [Aureococcus anophagefferens] Length = 386 Score = 230 bits (587), Expect = 2e-58, Method: Composition-based stats. Identities = 113/390 (28%), Positives = 185/390 (47%), Gaps = 48/390 (12%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYST---CNPFGAVGDFVTAPEISQIFGE 61 L ++ + I G ++V +Y C+ P GYY+ FGA GDFVTAPE+SQ+FGE Sbjct: 1 LGLELGSQILARGPLSVYEYMRQCLLHPRHGYYARSGAERNFGAGGDFVTAPELSQLFGE 60 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 ++ ++ + W + G P VRLVE+GPGRG ++ D+LR + + +VE S Sbjct: 61 LVGVWFVSEWVRLGEPEKVRLVEVGPGRGTLLGDVLRATAAWPKFRRAASDVRLVEPSAS 120 Query: 122 LTLIQKKQL------------------------ASYGDKINWYTSLADVPLG--FTFLVA 155 L L Q++ L +G + W+ SL ++ T LV Sbjct: 121 LRLAQRRTLGARPATHAQKPDAETPVTWALDGFGDFGTRATWHESLDEIDDDGVPTLLVG 180 Query: 156 NEFFDSLPIKQFVMTEHGIRERMIDIDQ--HDSLVFNIGDHEIKSNFLTCSDYFLG---- 209 E D+ P QFV T++G RE+++D+ D F + ++ + Sbjct: 181 QEVLDAFPAYQFVKTDNGWREKLVDLADAGPDRFRFVLAPSPTPASRALAAADAPSLATK 240 Query: 210 ----AIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLV 265 E +P + +++ ++A GG A+ +DYGY + + GDTL+ + H VSPL Sbjct: 241 DGEAETLEVAPGALAFVDAVAAKVAKAGGAALFVDYGYARGQRGDTLRGFRRHAQVSPLR 300 Query: 266 NPGQADLSSHVDFQRL-SSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMK----QTAR 320 PG DL+ DF A + G QG++L+ +GI R +L+ ++ Sbjct: 301 EPGLVDLTVDADFGACGDRAAAVDGAAAFGAVDQGEWLQRMGIVPRLEALLNLDDITESQ 360 Query: 321 KDILLDSVKRLVSTSADKKSMGELFKILVV 350 + L+ + +RL+ D + MGE +K+L V Sbjct: 361 VEDLISACERLL----DPEQMGERYKVLAV 386 >gi|295662458|ref|XP_002791783.1| DUF185 domain-containing protein [Paracoccidioides brasiliensis Pb01] gi|226279909|gb|EEH35475.1| DUF185 domain-containing protein [Paracoccidioides brasiliensis Pb01] Length = 508 Score = 230 bits (587), Expect = 2e-58, Method: Composition-based stats. Identities = 115/447 (25%), Positives = 178/447 (39%), Gaps = 97/447 (21%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNP-------FGAVGDFVTAPE 54 L + I I G +++ Y C+ P+ GYY++ FG GDFVT+PE Sbjct: 48 STPLAKTIAEAISVTGPISIAAYMRQCLTSPDGGYYTSRGQEAEGTELFGPKGDFVTSPE 107 Query: 55 ISQIFGEMLAIFLICAWEQHGFPS-CVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSI 113 ISQIFGE+L I+ + W G V+++ELGPG+G +M D+LR I K ++ +I Sbjct: 108 ISQIFGELLGIWTVAEWMGQGRKKGGVQIIELGPGKGTLMADMLRSIRNFKTFASAIEAI 167 Query: 114 YMVETSERLTLIQKKQLASYGD--------------------KINWYTSLADVPLGFTFL 153 Y+VE S L +Q K L L + P F+ Sbjct: 168 YLVEASTVLREVQHKLLCGDAPTEEMEVGYKSTSVHLGVPVIWTEHIKLLTEEPDKTPFI 227 Query: 154 VANEFFDSLPIKQFVMTEHGI--------------------------RERMIDIDQHDSL 187 A+EFFD+LPI F E + R + + + Sbjct: 228 FAHEFFDALPIHAFQSIETPPPSQTINTPTGPATLHNPPATSSSPATQWRELVVSPNPET 287 Query: 188 VFNIGDHEIKSNFLTCS-------------------DYFLGAIFENSPCRDREMQSISDR 228 E + + G+ E SP Q I+ R Sbjct: 288 PELKSGKEPEFHLSLAKSHTPSSLVLPEMSPRYKAMKSMPGSTIEISPEGQTCAQDIARR 347 Query: 229 LACDGGT--------------AIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSS 274 + + A+++DYG + ++L+ ++ H VSP PGQ D+S+ Sbjct: 348 IGGSFYSSSEQSNKKRVPSGAALILDYGTTSTIPINSLRGIRKHQLVSPFAVPGQVDISA 407 Query: 275 HVDFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLMKQ------TARKDILLD 326 +VDF L+ AI + + G Q +FLE LGI +RA L+++ ++ + Sbjct: 408 NVDFTALAEAAIDASPGVEVYGPVEQCQFLEALGISKRASQLLRKVEGEGGEEKRKRIES 467 Query: 327 SVKRLVSTSADKKSMGELFKILVVSHE 353 KRLV MG+L+K L + E Sbjct: 468 GWKRLVERGG--GGMGKLYKALAIVPE 492 >gi|241760239|ref|ZP_04758335.1| conserved hypothetical protein [Neisseria flavescens SK114] gi|241319350|gb|EER55815.1| conserved hypothetical protein [Neisseria flavescens SK114] Length = 411 Score = 230 bits (586), Expect = 3e-58, Method: Composition-based stats. Identities = 76/364 (20%), Positives = 150/364 (41%), Gaps = 31/364 (8%) Query: 5 LIRKIVNLIKKN-GQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGEM 62 L + I N I+++ ++ ++ L + P++GYYS + G GDF+T P ++ +FG+ Sbjct: 47 LTKLIKNEIEQHQNWISFSRFMELALYTPQYGYYSGGSHKIGTDGDFITGPTLTPLFGQT 106 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 LA L Q + E G G G + +L+ + + Y++E S L Sbjct: 107 LAKQLAELLPQT----AGNIYEFGAGTGHLAATLLQNLSD------GLNHYYIIELSAEL 156 Query: 123 TLIQKKQLASYGDK-----INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 Q++ + + + T+L + G ++ NE D++P+++ + E ++ Sbjct: 157 AERQRQHIIEHTSPEAAAKVIHLTALPEHFDG--IIIGNEVLDAMPVERLIYQEERFQQI 214 Query: 178 MIDIDQHD--SLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGT 235 + ++ + + E+ E P + +Q+++ +L G Sbjct: 215 GVSLENDGLIEAIRPLAQAELTQTAALYFPPLPSYTSELHPAQYAFIQTLAAKLQRGGMI 274 Query: 236 AI-----VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKL 290 I Y + Q + G + + HT P N G DL++HV+F ++ + L Sbjct: 275 FIDYGFDAAQYYHPQRKEGTFIGHYRHHTIHDPFFNIGLTDLTAHVNFTDIARASTESGL 334 Query: 291 YINGLTTQGKFLEGLGIWQRAFSL-MKQTARKDILLDSVKRLVSTSADKKSMGELFKILV 349 + G Q FL LGI + + +V++L+ MGELFK++ Sbjct: 335 DLIGYLPQSYFLLNLGITDLLAQIGSPDSVEYIQAASAVQKLIHQ----HEMGELFKVIA 390 Query: 350 VSHE 353 + Sbjct: 391 FGKD 394 >gi|295677986|ref|YP_003606510.1| protein of unknown function DUF185 [Burkholderia sp. CCGE1002] gi|295437829|gb|ADG16999.1| protein of unknown function DUF185 [Burkholderia sp. CCGE1002] Length = 400 Score = 230 bits (586), Expect = 3e-58, Method: Composition-based stats. Identities = 88/376 (23%), Positives = 154/376 (40%), Gaps = 40/376 (10%) Query: 2 ENKLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAV----GDFVTAPEI 55 + L+ +I + G + D+Y + P GYYS FG DFVTAPE+ Sbjct: 22 SDALVAEIRAQLDAAGGWLPFDRYMERALYAPGLGYYSGGARKFGLRADDGSDFVTAPEL 81 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F LA + A E G ++E G G G + +L + L +F S + Sbjct: 82 SPLFAATLARPVAEALEASG---TRDVMEFGAGTGKLAAGLLNALDTLGAEFDS---YSI 135 Query: 116 VETSERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 V+ S L Q++ + + K+ W +L + G ++ NE D++P++ F Sbjct: 136 VDLSGELRERQRETIAGAAPALLAKVRWLDALPERFEG--VVIGNEVLDAMPVRLFAHNG 193 Query: 172 HGIRERMIDIDQHDSLVFNIGDHEIKSNFLTC-------SDYFLGAIFENSPCRDREMQS 224 ER + + + F+ ++ + G + E ++ Sbjct: 194 GVWHERG-VVWRDGAFGFDDRPVAAAADQALLTEIDTDRENAGNGYVTETHEAARAFTRT 252 Query: 225 ISDRLACDGGTAIVIDY------GYLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVD 277 I LA G ++IDY Y R TL H P + PG D+++HV+ Sbjct: 253 ICTMLAR--GAILLIDYGFPRHEYYHDQRAQGTLMCHYRHRAHGDPFLYPGLQDITAHVE 310 Query: 278 FQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARK-DILLDSVKRLVSTSA 336 F ++ + + G T+Q +FL G+ + + R+ ++V++L+S Sbjct: 311 FTGIAEAGVETGADLLGFTSQARFLLNAGVTEVLGEIDPADTRRFLPAANAVQKLLS--- 367 Query: 337 DKKSMGELFKILVVSH 352 + MGELFK++ S Sbjct: 368 -EAEMGELFKVIAFSR 382 >gi|306837807|ref|ZP_07470670.1| Hypothetical protein BROD_0614 [Brucella sp. NF 2653] gi|306407103|gb|EFM63319.1| Hypothetical protein BROD_0614 [Brucella sp. NF 2653] Length = 307 Score = 230 bits (586), Expect = 3e-58, Method: Composition-based stats. Identities = 122/311 (39%), Positives = 165/311 (53%), Gaps = 13/311 (4%) Query: 58 IFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVE 117 +FGE++ I+ + W+ P+ L E+GPGRG +M D+LR I +L P I MVE Sbjct: 1 MFGELIGIWCLREWDALARPANFVLCEIGPGRGTLMSDMLRTIGRLAPQMLGGARIAMVE 60 Query: 118 TSERLTLIQKKQLASYGDKINWYTSLADVP----LGFTFLVANEFFDSLPIKQFVMTEHG 173 TS RL Q+++LA I W+ AD+P G LV NE FD++P +QFV + Sbjct: 61 TSPRLAEKQRQKLAGTKAHIEWFERFADIPADTVHGPLILVTNELFDAIPFRQFVKADGR 120 Query: 174 IRERMIDIDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRLAC 231 ERM+ +++ D F G I L GAIFE +P R MQ I+ R+A Sbjct: 121 FVERMVALNEQDEFHFVSGAGGIDPALLPKDHVKAEEGAIFEAAPARTALMQEIASRIAA 180 Query: 232 DGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLY 291 G A+ IDYG+L+S GDTLQA+ H Y +PG ADL+SHVDF L A Sbjct: 181 TRGAALNIDYGHLESGFGDTLQAMLKHAYDDVFAHPGAADLTSHVDFDILQKTAKACGCK 240 Query: 292 INGLTTQGKFLEGLGIWQRAFSL--MKQTARKDILLDSVKRLVSTSADKKSMGELFKILV 349 G TQG+FL +G+ RA L K A ++ + V+RL A MG LFK+L Sbjct: 241 T-GTMTQGEFLLAMGLVDRAGQLGAGKDAAFQEKIRQDVERL----AAPDQMGTLFKVLA 295 Query: 350 VSHEKVELMPF 360 S E+ L+PF Sbjct: 296 FSDEQTRLLPF 306 >gi|17544874|ref|NP_518276.1| hypothetical protein RSc0155 [Ralstonia solanacearum GMI1000] gi|17427163|emb|CAD13683.1| hypothetical protein of unknown function duf185 [Ralstonia solanacearum GMI1000] Length = 397 Score = 229 bits (585), Expect = 4e-58, Method: Composition-based stats. Identities = 87/380 (22%), Positives = 145/380 (38%), Gaps = 27/380 (7%) Query: 2 ENKLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAV----GDFVTAPEI 55 ++L IV + G + ++Y L + P GYYS FG GDF+TAPE+ Sbjct: 18 SDRLFSTIVRAVEAAGGWLPFERYMELALYAPGLGYYSGGAAKFGRRVEDGGDFITAPEL 77 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 + FG +A + + P ++E G G G + DIL + L S + Sbjct: 78 TPFFGRTVAHQIAQVLQTL-PPGQRHVLEFGAGTGRLAADILTELETLGMRPDS---YGI 133 Query: 116 VETSERLTLIQKKQLASYGDKINWYTSLAD--VPLGFTFLVANEFFDSLPIKQFVMTEHG 173 VE S L Q++ LA+ G + D +V NE D++P+ + Sbjct: 134 VELSGELRQRQQQALAALGPDLAGLARWHDALPARFTGVMVGNEVLDAMPVSLWARRGGV 193 Query: 174 IRERMIDIDQHD--SLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLAC 231 R + D + D L + E + ++S L Sbjct: 194 WHRRGVAFDADHGLRWSERVADPADVPPKLAALPGRDDFVTEAHEAAEGFIRSAGAALER 253 Query: 232 -----DGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAI 286 +Y + G + + H + P PG D+++HVDF ++ A Sbjct: 254 GLLLLLDYGFPAGEYYHAHRANGTLMCHYRQHAHDDPFWLPGLQDITAHVDFSGIALAAR 313 Query: 287 LYKLYINGLTTQGKFLEGLGIWQRAFSLM-KQTARKDILLDSVKRLVSTSADKKSMGELF 345 L + G +Q +FL G G+ Q +L R ++V++L+S + MGELF Sbjct: 314 EAGLEVLGYASQARFLLGAGVGQLLMTLDPADPVRFLPAANAVQKLLS----EAEMGELF 369 Query: 346 KILVVSH---EKVELMPFVN 362 K + + + L F + Sbjct: 370 KAIALGRGLDAALPLAGFAD 389 >gi|326402667|ref|YP_004282748.1| hypothetical protein ACMV_05190 [Acidiphilium multivorum AIU301] gi|325049528|dbj|BAJ79866.1| hypothetical protein ACMV_05190 [Acidiphilium multivorum AIU301] Length = 324 Score = 229 bits (585), Expect = 4e-58, Method: Composition-based stats. Identities = 112/341 (32%), Positives = 168/341 (49%), Gaps = 24/341 (7%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCV 80 +D + A YY++ +PF DFVTAPEISQ+FGE+L +++ AW G P+ Sbjct: 5 LDAFMARA----NAAYYASRDPF---VDFVTAPEISQVFGELLGLWVAMAWSMLGSPAPF 57 Query: 81 RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWY 140 LVE GPGRG +M D LR + P F I+++ETS RL + K +L ++ Sbjct: 58 ALVEAGPGRGHLMTDALRAAKRAMPAFVEAADIHLIETSPRLIGLLKDRLPQAM----FH 113 Query: 141 TSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNF 200 L+ +P L+ANEF D+LPI+QFV RER + F E Sbjct: 114 ADLSTIPDQSIILLANEFLDALPIRQFVRRGPAWRERYVA-----DGAFVEASAETA--- 165 Query: 201 LTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTY 260 D G + E + + + +++ R++ GG+A++IDYG+ G++LQA+ Sbjct: 166 --LPDAPEGTVIERNETAEAFVAALASRISRRGGSALLIDYGHEGGATGESLQAIMDGQP 223 Query: 261 VSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR 320 PL PG+ DL++HVDF RL+ A I G QG FL LGI +R L + Sbjct: 224 ADPLAEPGRRDLTAHVDFSRLAMAARNAGAEIRGPAAQGAFLAALGIHERTAQLGHHAGQ 283 Query: 321 KDILLDSVKRLVSTSADKKSMGELFKILVVSHEKV-ELMPF 360 D L + ++MG LF+++ + + L F Sbjct: 284 DDAL--RLLAATRRLTSPEAMGSLFRVMAICPRGMEPLPGF 322 >gi|113866288|ref|YP_724777.1| hypothetical protein H16_A0257 [Ralstonia eutropha H16] gi|113525064|emb|CAJ91409.1| uncharacterized conserved protein [Ralstonia eutropha H16] Length = 399 Score = 229 bits (585), Expect = 4e-58, Method: Composition-based stats. Identities = 90/386 (23%), Positives = 157/386 (40%), Gaps = 40/386 (10%) Query: 2 ENKLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAV----GDFVTAPEI 55 + L +I I G + D+Y AL + P GYYS FG DF+TAPE+ Sbjct: 18 SDTLTARIGESIDAAGGWIGFDRYMALALYAPGLGYYSGGSAKFGRDARDGSDFITAPEL 77 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S F LA G P R++E G G G + D+ + L+ + + + Sbjct: 78 SPFFARTLARQFAP-LMAQGLP---RMLEFGAGTGRLAADL---LLGLEQEGQLPDTYAI 130 Query: 116 VETSERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 VE S L Q+ L D++ W +L G +V NE D++P++ + + Sbjct: 131 VELSGELRARQQATLAQRAPHLADRVTWLDTLPASFEG--VIVGNEVLDAMPVQLYARSG 188 Query: 172 HGIRERMIDID----QHDSLVFNIGDHEIKSNFLTCS----DYFLGAIFENSPCRDREMQ 223 ER + + F D + + + + E + ++ Sbjct: 189 GRWHERGVVHSVARSDDAAPAFRFEDRPLADADMPEALRAIPGDHDLVTETHAEAEGFIR 248 Query: 224 SISDRLACDGGTAIVIDYG----YLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHVDF 278 ++ LA I + Y R G T + + H + P + PG D+++HV+F Sbjct: 249 AVGAMLARGAAFFIDYGFPAGEYYHPQRAGGTLMCHYRHHAHPDPFLYPGLQDITAHVNF 308 Query: 279 QRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARK-DILLDSVKRLVSTSAD 337 ++ A+ L + G +Q +FL GI + +L AR ++V++L+S Sbjct: 309 SGIALAAVDAGLTVAGFASQARFLMNAGITELLMALDPSDARAFLPQANAVQKLLS---- 364 Query: 338 KKSMGELFKILVVSH---EKVELMPF 360 + MGELFK++ ++ + + F Sbjct: 365 EAEMGELFKVIALTRGLDDSEPMDGF 390 >gi|148259441|ref|YP_001233568.1| hypothetical protein Acry_0424 [Acidiphilium cryptum JF-5] gi|146401122|gb|ABQ29649.1| protein of unknown function DUF185 [Acidiphilium cryptum JF-5] Length = 324 Score = 229 bits (585), Expect = 4e-58, Method: Composition-based stats. Identities = 112/341 (32%), Positives = 167/341 (48%), Gaps = 24/341 (7%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCV 80 +D + A YY++ +PF DFVTAPEISQ+FGE+L ++ AW G P+ Sbjct: 5 LDAFMARA----NAAYYASRDPF---VDFVTAPEISQVFGELLGLWAAMAWSMLGSPAPF 57 Query: 81 RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWY 140 LVE GPGRG +M D LR + P F I+++ETS RL + K +L ++ Sbjct: 58 ALVEAGPGRGHLMTDALRAAKRAMPAFVEAADIHLIETSPRLIGLLKDRLPQAM----FH 113 Query: 141 TSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNF 200 L+ +P L+ANEF D+LPI+QFV RER + F E Sbjct: 114 ADLSTIPDQSIILLANEFLDALPIRQFVRRGPAWRERYVA-----DGAFVEASAETA--- 165 Query: 201 LTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTY 260 D G + E + + + +++ R++ GG+A++IDYG+ G++LQA+ Sbjct: 166 --LPDAPEGTVIERNETAEAFVAALASRISRRGGSALLIDYGHDGGATGESLQAIMDGQP 223 Query: 261 VSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR 320 PL PG+ DL++HVDF RL+ A I G QG FL LGI +R L + Sbjct: 224 ADPLAEPGRRDLTAHVDFSRLAMAARNAGAEIRGPAAQGAFLAALGIHERTAQLGHHAGQ 283 Query: 321 KDILLDSVKRLVSTSADKKSMGELFKILVVSHEKV-ELMPF 360 D L + ++MG LF+++ + + L F Sbjct: 284 DDAL--RLLAATRRLTSPEAMGSLFRVMAICPRGMEPLPGF 322 >gi|56750467|ref|YP_171168.1| hypothetical protein syc0458_c [Synechococcus elongatus PCC 6301] gi|81299900|ref|YP_400108.1| hypothetical protein Synpcc7942_1091 [Synechococcus elongatus PCC 7942] gi|56685426|dbj|BAD78648.1| hypothetical protein [Synechococcus elongatus PCC 6301] gi|81168781|gb|ABB57121.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942] Length = 387 Score = 229 bits (584), Expect = 4e-58, Method: Composition-based stats. Identities = 92/373 (24%), Positives = 154/373 (41%), Gaps = 29/373 (7%) Query: 1 MENKLIRKIVNLIKKN--GQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQ 57 M + +++ ++ ++T+ Q+ + + DPE GYYS+ FG GDFVT+P +S Sbjct: 1 MTQAIPDRLLEWLESQPQRRLTMAQFMSWALYDPESGYYSSRTGQFGDRGDFVTSPSLSA 60 Query: 58 IFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVE 117 F E+LA+ W+ G P VE+G G G D L I + + + L ++E Sbjct: 61 DFAELLAVQAAEFWQVLGRPDRFVWVEMGAGAGQFAGDFLAAIAGTELE--ATLHYRIIE 118 Query: 118 TSERLTLIQKKQLASYGDKINWYTSLADVPLGFT--FLVANEFFDSLPIKQFVMTEHGIR 175 S +L Q+++L + D+++W+ + D T +NE D+LP+ + + Sbjct: 119 RSPQLRRQQQERLEPWRDRVDWW-TWEDWATQPTVGVAFSNELVDALPVHRIQWQGGEWQ 177 Query: 176 ERMIDIDQHDSLVFNIGDHEIKSNFLTCSD---------YFLGAIFENSPCRDREMQSIS 226 E + + L +G +D G E P + + + Sbjct: 178 EIYVT-ENAGVLQEVLGPLSSPDLVDVFADLGLAAAIARLPEGYRTEVHPAAKQWLAQVV 236 Query: 227 DRLACDGGTAIVIDYGYLQSRV------GDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQ 279 L G + IDYGY R TLQA + PGQ DL++HV+F Sbjct: 237 QGLQR--GYLLTIDYGYSGDRYYAAGRTDGTLQAYWQQRFHNDLYARPGQQDLTAHVNFS 294 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMK-QTARKDILLDSVKRLVSTSADK 338 L L T Q FL LG+ R +L + Q+ L + ++ D Sbjct: 295 ALEVWGEALGLKRLAFTEQALFLMALGLGDRLMALSQPQSGSDLSQLLRRREVLHRLLDP 354 Query: 339 KSMGELFKILVVS 351 ++G F +L+ Sbjct: 355 TALGN-FGVLLQG 366 >gi|158293643|ref|XP_315000.4| AGAP004909-PA [Anopheles gambiae str. PEST] gi|157016546|gb|EAA10494.4| AGAP004909-PA [Anopheles gambiae str. PEST] Length = 467 Score = 229 bits (584), Expect = 5e-58, Method: Composition-based stats. Identities = 122/393 (31%), Positives = 181/393 (46%), Gaps = 46/393 (11%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGEM 62 L + I+ G MTV Y + +P GYYST N FG GDF+TAPEI QIFGE+ Sbjct: 65 TLAEALHGRIRATGPMTVATYMREVLLNPAAGYYSTKENVFGTTGDFITAPEIGQIFGEL 124 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 +AI+ I ++ + ++L+ELGPG+G +M D+LRV + +S+++VE S L Sbjct: 125 VAIWCINELQKFNYDGHIQLIELGPGKGTLMHDVLRVFERFG-LSKDRVSVHLVEMSSNL 183 Query: 123 TLIQKKQLASYGDK---------------------INWYTSLADVPLGFTFLVANEFFDS 161 +Q +L + I WYT + +VP GF+ ++ANEFFD+ Sbjct: 184 QRLQADKLCNGMAHRTPADQSEPHVQEGTASSGINIRWYTDVVEVPKGFSIILANEFFDA 243 Query: 162 LPIKQFVMTEHG----IRERMIDIDQ---HDSLVFNIGDHEIKSNFL-------TCSDYF 207 LP+ F +E ++DI+ S F + + + S Sbjct: 244 LPVHVFCKEASEGGASWKEMLVDINPELKEPSFRFIQSNRATPYSVVFGKRFDGKESLLR 303 Query: 208 LGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNP 267 E S ++ Q I+ R+ GG ++IDYG+ + DTL++ K H PL +P Sbjct: 304 DRNRVEVSFETEQIAQDIARRIDGHGGFGLIIDYGHEGDKT-DTLRSFKSHQLHDPLQDP 362 Query: 268 GQADLSSHVDFQRLSSIAIL-YKLYINGLTTQGKFLEGLGIWQRAFSL---MKQTARKDI 323 G ADL+ VDF L K G +QG FLE + R SL K + + Sbjct: 363 GSADLTVDVDFGFLKHFLEQDDKAITLGPVSQGTFLEAMEGSARLKSLLSAAKDEKYRKM 422 Query: 324 LLDSVKRLVSTSADKKSMGELFKILVVSHEKVE 356 L D L + MGE FK+L V ++ Sbjct: 423 LSDGYDELT----NPSKMGERFKLLSVFPSTLK 451 >gi|289667563|ref|ZP_06488638.1| hypothetical protein XcampmN_03402 [Xanthomonas campestris pv. musacearum NCPPB4381] Length = 394 Score = 229 bits (584), Expect = 5e-58, Method: Composition-based stats. Identities = 79/383 (20%), Positives = 136/383 (35%), Gaps = 33/383 (8%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIF 59 ++L + I+ G + ++ L + P GYYS FG GDFVT+PE+ +F Sbjct: 16 SDRLAAHLRAEIQAAGGAIPFSRFMELALYAPGLGYYSAGASKFGEAGDFVTSPELGPLF 75 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 ++ L +Q G R++E+G G G + +L ++E S Sbjct: 76 AATVSGALAPVLQQLG--PQARVLEVGGGSGAFAEV---TLKRLLELDALPERYAILEPS 130 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPLGFT------FLVANEFFDSLPIKQFVMTEHG 173 L Q+++L L + L ANE D+LP +F + + Sbjct: 131 ADLRERQRERLGRSLIPP--VFDLVEWLDAPFPDDWDGVLFANEVIDALPTPRFALRDGE 188 Query: 174 IRERMIDIDQHDSLVFNIGDHEI-------KSNFLTCSDYFLGAIFENSPCRDREMQSIS 226 + E + +D + + G E P +Q+++ Sbjct: 189 VYEETVVLDAQQQFARGEQPADALLSAAVRHLERYLEQPFADGYRSELLPQLPYWIQAVA 248 Query: 227 DRLACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRL 281 L + Y Y R TL+A H PG DL++ VDF L Sbjct: 249 GGLKRGAMLFVDYGYPRGEFYRAQREDGTLRAFYRHRMHEDLYRWPGLQDLTASVDFTAL 308 Query: 282 SSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDIL--LDSVKRLVSTSADKK 339 + + G TQ FL G G+ +L+ Q + ++ + Sbjct: 309 AEAGTGAGFELAGYCTQASFLLGNGLD----ALLAQADTRTDEVGRMRLREQIKRLTLPS 364 Query: 340 SMGELFKILVVSHEKVELMPFVN 362 MGE F+++ S + F++ Sbjct: 365 EMGERFQVMGFSRDVDFAPAFLS 387 >gi|296421134|ref|XP_002840121.1| hypothetical protein [Tuber melanosporum Mel28] gi|295636333|emb|CAZ84312.1| unnamed protein product [Tuber melanosporum] Length = 490 Score = 229 bits (584), Expect = 5e-58, Method: Composition-based stats. Identities = 112/433 (25%), Positives = 181/433 (41%), Gaps = 76/433 (17%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYST------CNPFGAVGDFVTAPEI 55 L R + I+ G +T+ Y C+ GYY+T + FG GDF+T+PEI Sbjct: 51 STPLARYLAEAIEATGPITLAAYMRQCLVSDLGGYYTTERGVGTGDQFGRKGDFITSPEI 110 Query: 56 SQIFGEMLAIFLICAWEQHGFP-SCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIY 114 SQ+FGE++ I+++ W + ++++ELGPGRG +M D+ RV+ K SV +IY Sbjct: 111 SQVFGELIGIWVVSEWMRQRRQGEKIQIIELGPGRGTLMSDLWRVVLIFKTMADSVEAIY 170 Query: 115 MVETSERLTLIQKKQLASYGDKINWYTS----------------------LADVPLGFTF 152 +VE S L QK L K+ + + + Sbjct: 171 LVEASSSLREAQKVLLCGPKTKMRQIENGFACQSKASPKTDIIWYEDFSFIPRDKDRTPY 230 Query: 153 LVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFN-------------------IGD 193 ++A+EFFD+LPI F T++G RE +++ + + Sbjct: 231 IIAHEFFDALPIHAFTNTDNGWREMLVNTSPGKGQFITKDGVSELETRSNTPEFSLALAN 290 Query: 194 HEIKSNFLTCS--------DYFLGAIFENSPCRDREMQSISDRLAC-DGGTAIVIDYGYL 244 + L + A E SP M+ IS R+ G A+++DYG Sbjct: 291 APTPHSILLPPLSQRYKDLEKLKNATIEISPESLGLMEKISQRIGEAGAGAALIMDYGTS 350 Query: 245 QSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAIL--YKLYINGLTTQGKFL 302 + ++L+ + H SP PG+ D+S+ VDF L+ AI K+ ++G Q FL Sbjct: 351 DTVPANSLRGILKHEICSPFSRPGEVDVSADVDFTALAEKAINSSKKVEVHGPVKQRAFL 410 Query: 303 EGLGIWQRAFSLMK-------------QTARKDILLDSVKRLVSTSADKKSMGELFKILV 349 LGI +R L++ ++ L RLV MG ++K L Sbjct: 411 GMLGIKERTEFLIRELKEEPSYGWDDGNEEKEKALRLGHDRLV--GEHAGGMGAIYKALA 468 Query: 350 V--SHEKVELMPF 360 V + + + F Sbjct: 469 VIPARKGRRPVGF 481 >gi|170696057|ref|ZP_02887194.1| protein of unknown function DUF185 [Burkholderia graminis C4D1M] gi|170139049|gb|EDT07240.1| protein of unknown function DUF185 [Burkholderia graminis C4D1M] Length = 396 Score = 229 bits (584), Expect = 5e-58, Method: Composition-based stats. Identities = 93/370 (25%), Positives = 153/370 (41%), Gaps = 32/370 (8%) Query: 2 ENKLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGD----FVTAPEI 55 L+ +I I G + D+Y A + P GYYS FG GD FVTAPE+ Sbjct: 22 SEALVARIRADIDDAGGWLPFDRYMARALYAPGLGYYSGGARKFGLRGDDGSDFVTAPEL 81 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F LA + A + G ++E G G G + +L + L +F S + Sbjct: 82 SPLFAATLARPIAEALQASG---TRDVMEFGAGTGKLAAGLLNALAALGAEFDS---YSI 135 Query: 116 VETSERLTLIQKKQLASYGDK----INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 V+ S L Q++ +A+ + W +L + G ++ NE D++P++ F + Sbjct: 136 VDLSGELRERQRETIAAAAPALAAKVRWLDALPERFEG--VVIGNEVLDAMPVRLFASVD 193 Query: 172 HGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG---AIFENSPCRDREMQSISDR 228 RER + + F + ++ S+ G + E ++I Sbjct: 194 GAWRERG-VVWRDGWFAFEDREVSASADTALLSEIDTGGGDYVAETHDAARAFTRTICTM 252 Query: 229 LACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSS 283 LA I + Y R TL H P V PG D+++HV+F ++ Sbjct: 253 LARGAAFFIDYGFPRHEYYHAQRAQGTLMCHYRHRAHGDPFVYPGLQDITAHVEFTGIAE 312 Query: 284 IAILYKLYINGLTTQGKFLEGLGIWQRAFSLM-KQTARKDILLDSVKRLVSTSADKKSMG 342 + + G T+Q +FL GI + TAR ++V++L+S + MG Sbjct: 313 AGVETGADLLGFTSQARFLLNAGITDALSEIDPADTARFLPAANAVQKLLS----EAEMG 368 Query: 343 ELFKILVVSH 352 ELFK++ S Sbjct: 369 ELFKVIAFSR 378 >gi|302409170|ref|XP_003002419.1| DUF185 domain-containing protein [Verticillium albo-atrum VaMs.102] gi|261358452|gb|EEY20880.1| DUF185 domain-containing protein [Verticillium albo-atrum VaMs.102] Length = 515 Score = 229 bits (584), Expect = 5e-58, Method: Composition-based stats. Identities = 113/463 (24%), Positives = 177/463 (38%), Gaps = 106/463 (22%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYST-----CNPFGAVGDFVTAPEIS 56 L +++ + I G + + Y +C+ GYY+ + FG GDFVT+PEIS Sbjct: 51 TTPLAKQLADAISITGPVPLASYMRMCMTGDIGGYYTGLIEQGRDQFGTKGDFVTSPEIS 110 Query: 57 QIFGEMLAIFLICAWEQHGFPSC-VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 QIFGE++ I+ + W G P+ V L+ELGPGRG +M DILR + K S+ +IYM Sbjct: 111 QIFGELVGIWFVTEWMSQGKPNKGVELIELGPGRGTLMDDILRTVQNFKEFASSIDAIYM 170 Query: 116 VETSERLTLIQKKQLASY----------GDKINWYTSLADVPLG-----------FTFLV 154 VE S L QK L I+ Y ++ V G +V Sbjct: 171 VEASPTLREAQKSLLCGPDAAMTESKIGHHSISKYGNIPIVWTGVVKAIPEGPEKMPLMV 230 Query: 155 ANEFFDSLPIKQFVMTE----------------------------------HGIRERMID 180 A+EFFD+LPI F + RE ++ Sbjct: 231 AHEFFDALPIHAFQSAKKAPPPIPPRTPTPSPPNLPTQPEPSVVEAAPLPVFEWRELVVS 290 Query: 181 IDQHD------------------SLVFNIGDHEIKSNFLTCS--------DYFLGAIFEN 214 D + K G++ E Sbjct: 291 PTPPDATHATLNTPTSEQKEMVPEFQLTLSPAPTKHALYVPESSQRYRALKSVPGSVIEV 350 Query: 215 SPCRDREMQSISDRL--------ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVN 266 P ++ R+ G A+++DYG + ++L+ ++ H V+P Sbjct: 351 CPDASMFAGDLAARIGGTAEAPRKRPAGAALILDYGTEDTIPINSLRGIRRHRRVNPFTE 410 Query: 267 PGQADLSSHVDFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLMKQT----AR 320 PG DLS+ VDF L+ + ++G Q FL +GI +RA L + R Sbjct: 411 PGLVDLSADVDFAALAEAVTEASDGVEVHGPVEQAAFLGQMGIKERAEMLSNRAGLPKER 470 Query: 321 KDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE---LMPF 360 + + S +RLV MG+++K++ + E + F Sbjct: 471 AEDIEKSWRRLVDRG--PGGMGKVYKVMAILPENDGQRRPVGF 511 >gi|33594922|ref|NP_882565.1| hypothetical protein BPP0204 [Bordetella parapertussis 12822] gi|33564998|emb|CAE39945.1| conserved hypothetical protein [Bordetella parapertussis] Length = 419 Score = 229 bits (584), Expect = 5e-58, Method: Composition-based stats. Identities = 95/379 (25%), Positives = 160/379 (42%), Gaps = 41/379 (10%) Query: 7 RKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTCN-----------PFGAVGDFVTAPE 54 R + I G + Q+ + + P GYY+ P GDFVTAPE Sbjct: 48 RHLRAAIAAAGGWLPFSQWMSAALYAPGLGYYTAGATKLASPADAQGPALPAGDFVTAPE 107 Query: 55 ISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKL-KPDFFSVLSI 113 ++ +F +A + ++E G G G + +LR + L P + Sbjct: 108 LTPLFAATVARQIAQVLRAT---DTASVLEFGAGTGALAEGVLRALAGLDCPARY----- 159 Query: 114 YMVETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEH- 172 +VE S L Q+ +LA +GD++ W L D G ++ NE D++P F +E Sbjct: 160 LIVEVSADLRQRQQSRLAPFGDRVQWLDQLPDAFAGC--VLGNEVLDAMPATLFRWSETG 217 Query: 173 GIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFL--GAIFENSPCRDREMQSISDRLA 230 ++ER + +D + + D + G + E + + ++S+ L Sbjct: 218 VVQERGVTVDANGEFAWQDRDADAPLAQAVAQRMPPLPGYVSEINLQAEAWVRSMG--LW 275 Query: 231 CDGGTAIVIDY------GYLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHVDFQRLSS 283 G A+++DY Y R G T + ++ H + P PG D++SHVDF ++ Sbjct: 276 LQRGAALLLDYGFPRSEYYHPQRAGGTLMCHLRHHAHADPFAAPGLQDITSHVDFTAMAD 335 Query: 284 IAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDS-VKRLVSTSADKKSMG 342 A+ L + G +Q +FL G+ L AR + V++L+S + MG Sbjct: 336 AALAGGLQVLGYLSQARFLVNAGLLDALSQLDPADARAYAQAVAPVQKLLS----EAEMG 391 Query: 343 ELFKILVVSHE-KVELMPF 360 ELFK+L V + L+ F Sbjct: 392 ELFKVLAVGRDMPEPLLGF 410 >gi|157964110|ref|YP_001498934.1| hypothetical protein RMA_0077 [Rickettsia massiliae MTU5] gi|157843886|gb|ABV84387.1| hypothetical protein RMA_0077 [Rickettsia massiliae MTU5] Length = 418 Score = 229 bits (583), Expect = 6e-58, Method: Composition-based stats. Identities = 121/400 (30%), Positives = 182/400 (45%), Gaps = 58/400 (14%) Query: 8 KIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFL 67 KI LI +NG +T D + YY + GDFVTAPEISQ+FGE++ ++ Sbjct: 18 KIRQLIDQNGYITCDVLMQEVLNLNPTSYYKQVKSLASEGDFVTAPEISQLFGEIIGLWC 77 Query: 68 ICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQK 127 I W++ G P + LVELGPGRG++M D+LR L P+F+ LSI ++E ++ QK Sbjct: 78 IREWQRIGCPKSLSLVELGPGRGLLMRDLLRTAK-LVPEFYKALSIALIEINKNFIAHQK 136 Query: 128 KQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSL 187 L I+ + + D+P T ++ANEFFD++PIKQ++ + ER+ + D Sbjct: 137 ANLQDINLPISHQSFVEDIPQKPTIIIANEFFDAIPIKQYIKVKELWYERIFVVQPVDER 196 Query: 188 V----FNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY 243 + +I + T + GA+ E S ++ I+ L G+ ++IDYGY Sbjct: 197 IKYDKISINKQLQEYLLRTHIEAKDGAVLEESYKSIEIIKFIAQHLKRLSGSGLIIDYGY 256 Query: 244 -------LQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLT 296 + + TLQAVK H Y L N G+ADLS+HVDF L ++A K+ + Sbjct: 257 DIAPNGRTRYQYNQTLQAVKNHKYCPILENLGEADLSAHVDFYTLKTVAKNSKINVIDTI 316 Query: 297 TQGKFLEGLGIWQRAFSLMKQTARK----------------------------------- 321 +Q FL GI R +L + + Sbjct: 317 SQRDFLIENGILLRKQTLQDKLNDRHLSKFAYREEFKGDTKRSTAACTLVREDASIGSTY 376 Query: 322 -----------DILLDSVKRLVSTSADKKSMGELFKILVV 350 ++R V K MGELFK+L + Sbjct: 377 KLPLEVEFGKMSEQAQIIERQVERLISPKQMGELFKVLQI 416 >gi|209517338|ref|ZP_03266181.1| protein of unknown function DUF185 [Burkholderia sp. H160] gi|209502221|gb|EEA02234.1| protein of unknown function DUF185 [Burkholderia sp. H160] Length = 400 Score = 229 bits (583), Expect = 7e-58, Method: Composition-based stats. Identities = 87/374 (23%), Positives = 151/374 (40%), Gaps = 36/374 (9%) Query: 2 ENKLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGD----FVTAPEI 55 + L+ I + G + D+Y + P GYYS FG GD FVTAPE+ Sbjct: 22 SDALVATIRAQLDATGGWLPFDRYMERALYAPGLGYYSGGARKFGLRGDDGSDFVTAPEL 81 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F LA + A E G ++E G G G + +L + L +F S + Sbjct: 82 SPLFAATLARPVAEALEASG---TRDVIEFGAGTGKLAAGLLNALDTLGAEFDS---YSI 135 Query: 116 VETSERLTLIQKKQLASYGDK----INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 V+ S L Q++ +A+ + W +L + G ++ NE D++P++ F + Sbjct: 136 VDLSGELRERQRETIAAAAPALLAKVRWLDALPERFEG--VVIGNEVLDAMPVRLFAHSG 193 Query: 172 HGIRERMIDIDQHDSLVFNIGDHEIKSNFLTC-------SDYFLGAIFENSPCRDREMQS 224 ER + + + F+ ++ + + E ++ Sbjct: 194 GVWHERG-VVWRDGAFAFDDRPVAAAADLALLTEIDTARENAGDDYVTETHEAARAFTRT 252 Query: 225 ISDRLACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQ 279 I LA I + Y R TL H P + PG D+++HV+F Sbjct: 253 ICTMLARGAIFLIDYGFPRHEYYHDQRAQGTLMCHYRHRAHGDPFLYPGLQDITAHVEFT 312 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLM-KQTARKDILLDSVKRLVSTSADK 338 ++ + + G T+Q +FL G+ + + TAR ++V++L+S + Sbjct: 313 GIAEAGVETGADLLGFTSQARFLLNAGVTEALGEIDPADTARFLPAANAVQKLLS----E 368 Query: 339 KSMGELFKILVVSH 352 MGELFK++ S Sbjct: 369 AEMGELFKVIAFSR 382 >gi|187476685|ref|YP_784708.1| hypothetical protein BAV0170 [Bordetella avium 197N] gi|115421271|emb|CAJ47776.1| conserved hypothetical protein [Bordetella avium 197N] Length = 400 Score = 228 bits (582), Expect = 7e-58, Method: Composition-based stats. Identities = 91/371 (24%), Positives = 161/371 (43%), Gaps = 34/371 (9%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTCNP--------FG-AVGDFVT 51 LI + + I +G ++ +Q+ A + P GYY+ G A GDF+T Sbjct: 28 SAALIEHLRDRIAAADGWLSFEQWMAQALYAPGLGYYTAGAVKLASDPGQAGLAAGDFIT 87 Query: 52 APEISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVL 111 APE+S +F LA + R++E G G G + ++ + +L + Sbjct: 88 APELSPLFAHTLARQAAQWLQAT---QTHRVLEFGAGTGALAEGVMAELGRLG----LSV 140 Query: 112 SIYMVETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 +VE S L Q+ +LA G ++ W L G ++ANE D++P++ F E Sbjct: 141 EYAIVEVSADLRARQQARLAPLGSRVQWLDHLPQAFEG--VVLANEVLDAMPVRLFRYDE 198 Query: 172 H-GIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLA 230 + ++ER + DQ D + G + E + + ++ + L Sbjct: 199 NGQVQERGVSWDQGFVWRDRPADATLAEAVHARLPALPGYMSEINLQAEAWVREMGRWLK 258 Query: 231 CDGGTAIVIDY------GYLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHVDFQRLSS 283 G A++IDY Y R G + + ++ H + P V PG D+++HVDF ++ Sbjct: 259 R--GAALLIDYGFPRREYYHPQRAGGSLMCHLRHHAHADPFVAPGIQDITAHVDFTAIAD 316 Query: 284 IAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDS-VKRLVSTSADKKSMG 342 A+ L + G T+Q +FL G+ L + + V++L+S + MG Sbjct: 317 AALEGGLDVLGYTSQARFLMNAGLLDLLGQLDPSDVKTYAQATAPVQKLLS----EAEMG 372 Query: 343 ELFKILVVSHE 353 ELFK++ + + Sbjct: 373 ELFKVIAIGRD 383 >gi|154272489|ref|XP_001537097.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1] gi|150409084|gb|EDN04540.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1] Length = 505 Score = 228 bits (582), Expect = 8e-58, Method: Composition-based stats. Identities = 118/463 (25%), Positives = 183/463 (39%), Gaps = 106/463 (22%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNP-------FGAVGDFVTAPE 54 L + I I G +++ Y C+ P+ GYY++ FGA GDFVT+PE Sbjct: 39 STPLAKSIAEAINVTGPVSIAAYMRQCLTSPDGGYYTSRGQEDEDTALFGAKGDFVTSPE 98 Query: 55 ISQIFGEMLAIFLICAWEQHGFPS-CVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSI 113 ISQIFGE+L ++ + W G S V+++E GPG+G +M D+LR K ++ ++ Sbjct: 99 ISQIFGELLGVWTVTEWMGQGRKSGGVQIIEFGPGKGTLMGDMLRSFRNFKNFASAIEAV 158 Query: 114 YMVETSERLTLIQKKQLASYGD--------------------KINWYTSLADVPLGFTFL 153 Y+VETS L +Q+K L L + F Sbjct: 159 YLVETSPVLREVQRKLLCGDTPLEEVEIGYKSTSIHLGVPVIWTEHIKLLPNESDKTPFF 218 Query: 154 VANEFFDSLPIKQF--------------------------VMTEHGIRERMIDIDQHDSL 187 +A+EFFD+LPI F + + H R + + + Sbjct: 219 LAHEFFDALPIHAFQSIQTPAPSQTTINTPTGPTTLHQPPISSSHTTEWRELVVSPNPET 278 Query: 188 VFNIGDHEIKSNFLTCS-------------------DYFLGAIFENSPCRDREMQSISDR 228 E + G+ E SP +Q I+ R Sbjct: 279 PEVKSGQEPEFRLSLAKASTPSSLVLPEMSSRYKALKSTPGSTIEISPESQACVQDIARR 338 Query: 229 LAC--------------------DGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPG 268 + G A+++DYG + ++L+ ++ H VSPLV PG Sbjct: 339 IGGGGGLVSAPSPGVTDPPKNKVPSGAALILDYGTTSTIPINSLRGIRKHRLVSPLVAPG 398 Query: 269 QADLSSHVDFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLMK------QTAR 320 + D+S+ VDF L+ AI + + G QG FLE LGI +RA L++ + Sbjct: 399 EVDISADVDFTALAEAAIDASPGVEVYGPMEQGPFLEALGISERAAQLLRRTEGEGDEEK 458 Query: 321 KDILLDSVKRLVSTSADKKSMGELFKILVVSHE---KVELMPF 360 + + KRLV MG+L+K L + E K + F Sbjct: 459 RKRIESGWKRLVERGG--GGMGKLYKALAIVPESGGKRRPVGF 499 >gi|221633566|ref|YP_002522792.1| ACR protein [Thermomicrobium roseum DSM 5159] gi|221157203|gb|ACM06330.1| Uncharacterized ACR [Thermomicrobium roseum DSM 5159] Length = 381 Score = 228 bits (582), Expect = 8e-58, Method: Composition-based stats. Identities = 85/372 (22%), Positives = 140/372 (37%), Gaps = 15/372 (4%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 M + L I I + G +T ++ L + P+ GYY T G GDF+TAPE IFG Sbjct: 1 MGSPLADLIQREIAERGPITFARFMELALYHPQHGYYMTSVRAGRAGDFLTAPETHPIFG 60 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 ++A + W+ P LVE GPG G ++L ILR + + +P L +E S Sbjct: 61 WVIARQVAECWDLLDRPEPFHLVEYGPGSGTLVLAILRYLSQNEPHLLERLRYCPIEPSA 120 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 ++L + + A ++ANE D+LP+ + +RE + Sbjct: 121 PARAELLRRLGT--AGWAHLVTDAPPAAASGLVLANEVVDALPVHRVRQEGGELRELFVG 178 Query: 181 IDQHD--SLVFNIGDHEIKSNFLTCSDYF-LGAIFENSPCRDREMQSISDRLACDGGTAI 237 D ++ + E+ G E + ++ RL + Sbjct: 179 WDGQRFVAVPGPLSTPELADWLARLGVQPVEGQETEVCLAMLGWLDEVARRLERGYLLVL 238 Query: 238 -----VIDYGYLQSRVGDTLQAVKGHTY-VSPLVNPGQADLSSHVDFQRLSSIAILYKLY 291 + T++ H PL +PG+ D+++HVDF LS A L Sbjct: 239 DYGAPAPERADPTRFPRGTIRTYAAHARGSDPLADPGERDITAHVDFTMLSLAAAERGLV 298 Query: 292 INGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVS 351 LTTQ +FL G+ + +L + + + D MG F++ + Sbjct: 299 PLALTTQAEFLAQAGLGELLVALQAEPGMTAAQYLAARAAALHLLDPGGMG-AFRVALFG 357 Query: 352 HEKVE---LMPF 360 F Sbjct: 358 KAIDPAALPSGF 369 >gi|15891994|ref|NP_359708.1| hypothetical protein RC0071 [Rickettsia conorii str. Malish 7] gi|15619108|gb|AAL02609.1| unknown [Rickettsia conorii str. Malish 7] Length = 406 Score = 228 bits (582), Expect = 8e-58, Method: Composition-based stats. Identities = 120/400 (30%), Positives = 181/400 (45%), Gaps = 58/400 (14%) Query: 8 KIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFL 67 KI LI +NG +T D + YY GDFVTAPEISQ+FGE++ ++ Sbjct: 6 KIRQLIDQNGYITCDVLMQEVLNLNPTSYYKQVKSLANEGDFVTAPEISQLFGEIIGLWC 65 Query: 68 ICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQK 127 I W++ G P + LVELGPGRG++M D+LR L P+F+ LSI ++E ++ QK Sbjct: 66 IREWQRIGCPKSLSLVELGPGRGLLMRDLLRTAK-LVPEFYKALSIELIEINKNFIAYQK 124 Query: 128 KQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSL 187 L I+ + + D+P T ++ANEFFD++PIKQ++ + ER+ + D Sbjct: 125 ANLQDINLPISHQSFVEDIPKKPTIIIANEFFDAIPIKQYIKVKELWYERIFVVQPVDER 184 Query: 188 V----FNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY 243 + ++ + T + GA+ E S ++ I+ L G+ ++IDYGY Sbjct: 185 IKYDKISVNKQLQEYLLCTHIEAKDGAVLEESYKSIEIIKFIAQHLKRLSGSGLIIDYGY 244 Query: 244 -------LQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLT 296 + + TLQAVK H Y L N G+ADLS+HVDF L ++A K+ + Sbjct: 245 DIAPNGRTRYQYNQTLQAVKNHKYCPILENLGEADLSAHVDFYALKTVAKNSKINVIDTI 304 Query: 297 TQGKFLEGLGIWQRAFSLMKQTARK----------------------------------- 321 +Q FL GI R +L + + Sbjct: 305 SQRDFLIENGILLRKQTLQDKLNDRHLAKFAYREEFKGDTKRSTAAYTLVREDASIGSTY 364 Query: 322 -----------DILLDSVKRLVSTSADKKSMGELFKILVV 350 ++R V K MGELFK+L + Sbjct: 365 KLPLEVEFGKMSEQAQIIERQVERLISPKQMGELFKVLQI 404 >gi|260222285|emb|CBA31696.1| hypothetical protein Csp_D28410 [Curvibacter putative symbiont of Hydra magnipapillata] Length = 383 Score = 228 bits (581), Expect = 1e-57, Method: Composition-based stats. Identities = 92/379 (24%), Positives = 156/379 (41%), Gaps = 46/379 (12%) Query: 4 KLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTC-NPFG--------------AVG 47 L R I + I K G + D + AL + P +GYY+ FG A Sbjct: 20 NLTRIIADAIEKAGGWIGFDTFMALALYAPGWGYYANGSTKFGQMPQGLVGEAGVEGAGS 79 Query: 48 DFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDF 107 DFVTAPEIS +FG+ LA + E G L E G G G + L +L + Sbjct: 80 DFVTAPEISPLFGQALARQVAQVLEATG---TDELWEFGAGTGALALQLLDALGD----- 131 Query: 108 FSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQF 167 V S +V+ S+ L Q+++LA + K+ W ++L G +V NE D++P++ Sbjct: 132 -RVRSYTIVDLSDSLKARQRERLAGHAGKVQWVSALPAHMRG--VVVGNEVLDAMPVQLL 188 Query: 168 VMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISD 227 ER + + LV+ ++ D + E + +++++D Sbjct: 189 ARVAGAWHERGVALA-QGQLVWADKPSTLRPPVDI--DGEHDYLTEIHAQGEAFVRTLAD 245 Query: 228 RLACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLS 282 L + + Y R G T+ + H PL + G D+++HV+F ++ Sbjct: 246 HLQLGAAFLLDYGFPESEYYHVQRSGGTVMCHRAHRADADPLSDVGLKDITAHVNFTGVA 305 Query: 283 SIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMG 342 A L + G T Q FL G+ + ++ + ++L+ + MG Sbjct: 306 VAAQDAGLDVLGYTNQAHFLMNCGLVEAMQ------SQPLQARVAAQKLLM----EHEMG 355 Query: 343 ELFKILVVSHE-KVELMPF 360 ELFK++ + L+ F Sbjct: 356 ELFKVVALGRGVDAPLLGF 374 >gi|269838435|ref|YP_003320663.1| hypothetical protein Sthe_2425 [Sphaerobacter thermophilus DSM 20745] gi|269787698|gb|ACZ39841.1| protein of unknown function DUF185 [Sphaerobacter thermophilus DSM 20745] Length = 375 Score = 228 bits (581), Expect = 1e-57, Method: Composition-based stats. Identities = 88/362 (24%), Positives = 134/362 (37%), Gaps = 13/362 (3%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFG 60 L+ I + I G++T + L + P++GYY T G GDF+TAPE IFG Sbjct: 16 NEALVAIIRDRIADEGRITFAAFMELALYHPQYGYYRTDAVRAGRAGDFITAPEAHAIFG 75 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 +A L W Q P L E G G G + L IL + D + L VE + Sbjct: 76 HAIARRLAAMWRQLDRPEPFTLREYGAGAGTLALAILDGLRTDGDDLLTALRYEPVEINP 135 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 ++L + ++ANEF D+ P+ + M +RE Sbjct: 136 VREAELAERL-DAAGFADVLHQPVPGEQITGCVLANEFVDAFPVHRVEMHGGELREIY-V 193 Query: 181 IDQHDSLVFNIGDHEIKSNFLTCS----DYFLGAIFENSPCRDREMQSISDRLACDGGTA 236 + + G + G E + + ++ L Sbjct: 194 VWRDGWFADEPGPLSTPEISDRLAREGITLAEGQRAEIALGPTGWIAEVAAALERGYVLV 253 Query: 237 IVIDYG----YLQSRVGDTLQAVKGHTYVS-PLVNPGQADLSSHVDFQRLSSIAILYKLY 291 I Y Y R TL+A HT P G+ DL++HVDF L A + L Sbjct: 254 IDYGYPAAELYGPERRDGTLKAYTRHTVHDDPYRAVGEQDLTAHVDFTALMDAARDHGLT 313 Query: 292 INGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVS 351 + LTTQ FL GI + ++ ++ + + V D MG F++L + Sbjct: 314 VLDLTTQADFLADAGIGEILVAMQRRPGITAEDYLAARAAVMHLIDPGGMG-RFRVLTLG 372 Query: 352 HE 353 E Sbjct: 373 RE 374 >gi|268573202|ref|XP_002641578.1| Hypothetical protein CBG09880 [Caenorhabditis briggsae] Length = 382 Score = 228 bits (581), Expect = 1e-57, Method: Composition-based stats. Identities = 108/353 (30%), Positives = 167/353 (47%), Gaps = 33/353 (9%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYST----CNPFGAVGDFVTAPEISQI 58 N L + +++ I+ +G +TV +Y V+ P GYY FG GDF+T+PE++Q+ Sbjct: 33 NHLKKFLIDKIRTSGPITVAEYMKTSVSAPVVGYYGQFSRDQKVFGEKGDFITSPELTQL 92 Query: 59 FGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVET 118 FGEM+ +++ G +LVELGPGR +M D+L + K + +S+++VE Sbjct: 93 FGEMIGVWVFHELANTGHKGSWQLVELGPGRAQLMNDVLNALAKF---NDNDVSVHLVEM 149 Query: 119 SERLTLIQKKQLASYGD------------------KINWYTSLADVPLGFTFLVANEFFD 160 S+ L Q+ L Y I WY S+ D+P GFT +ANEF D Sbjct: 150 SDALIDEQENFLCIYNSENTKGTPHVRKNKTRTGVNIYWYKSIDDIPDGFTVFIANEFLD 209 Query: 161 SLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGA----IFENSP 216 +LP+ QF T +E +++ + L F E +E SP Sbjct: 210 ALPVHQFKKTGDLWKEVYVNLTKEGDLRFMTSKGENLHTKGLIPAAIRNENSRLTWECSP 269 Query: 217 CRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHV 276 + I DR+ GG ++++DYG+ SR + +A K H V PL NPG DL++ V Sbjct: 270 ESGTVVNQIVDRITTFGGFSLLVDYGHDGSRNTHSFRAYKNHEQVDPLSNPGVVDLTADV 329 Query: 277 DFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLM---KQTARKDILLD 326 DF L+ + + + G Q FL LGI R L+ K ++ L+ Sbjct: 330 DFGYLT-TLVEDRALVYGPIEQRVFLTQLGIEHRLRRLLQICKNREEQEQLIS 381 >gi|166710565|ref|ZP_02241772.1| hypothetical protein Xoryp_03635 [Xanthomonas oryzae pv. oryzicola BLS256] Length = 394 Score = 228 bits (581), Expect = 1e-57, Method: Composition-based stats. Identities = 82/383 (21%), Positives = 139/383 (36%), Gaps = 33/383 (8%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIF 59 N+L + I+ G + ++ L + P GYYS FG GDFVTAPE+ +F Sbjct: 16 SNRLAAHVRAEIQAAGGAIPFSRFMELALYAPGLGYYSAGASKFGEAGDFVTAPEVGPLF 75 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 ++ L +Q G R++E+G G G +L+ + +L ++E S Sbjct: 76 AATVSGALAPVLQQLG--PQARVLEVGGGSGAFAEVMLKRLLELDALP---ERYAILEPS 130 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPLGFT------FLVANEFFDSLPIKQFVMTEHG 173 L Q+++L L + L ANE D+LP +F + + Sbjct: 131 ADLRERQRERLGRSLIPP--VFDLVEWLDAPFPDDWDGVLFANEVIDALPTPRFALRDGE 188 Query: 174 IRERMIDIDQHDSLVFNIGDHEI-------KSNFLTCSDYFLGAIFENSPCRDREMQSIS 226 + E + +D + + G E P +Q+++ Sbjct: 189 VYEETVVLDAQQQFARGEQPADALLSAAVRHLERYLEQPFADGYRSELLPQLPYWIQAVA 248 Query: 227 DRLACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRL 281 L + Y Y R TL+A H PG DL++ VDF L Sbjct: 249 GGLKRGAMLFVDYGYPRGEFYRAQREDGTLRAFYRHRMHEDLYRWPGLQDLTASVDFTAL 308 Query: 282 SSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDIL--LDSVKRLVSTSADKK 339 + + G TQ FL G G+ +L+ Q + ++ + Sbjct: 309 AEAGTGAGFELAGYCTQASFLLGNGLD----TLLAQADTRTDEVGRMRLREQIKRLTLPS 364 Query: 340 SMGELFKILVVSHEKVELMPFVN 362 MGE F+++ S + F++ Sbjct: 365 EMGERFQVMGFSRDVDFAPAFLS 387 >gi|21106799|gb|AAM35577.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri str. 306] Length = 417 Score = 228 bits (581), Expect = 1e-57, Method: Composition-based stats. Identities = 82/385 (21%), Positives = 138/385 (35%), Gaps = 37/385 (9%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIF 59 ++L + I+ G + ++ L + P GYYS FG GDFVTAPE+ +F Sbjct: 39 SDRLAAHVRAEIQAAGGAIPFSRFMELALYAPGLGYYSAGASKFGEAGDFVTAPELGPLF 98 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 ++ L +Q G R++E+G G G + +L ++E S Sbjct: 99 AATVSGALAPVLQQLG--PQARVLEVGGGSGAFAEV---TLKRLLELDALPERYAILEPS 153 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPLGFT------FLVANEFFDSLPIKQFVMTEHG 173 L Q+++L L + L ANE D+LP +F + + Sbjct: 154 ADLRERQRERLGRSLIPP--VFDLVEWLDAPFPDDWDGVLFANEVIDALPTPRFALRDGQ 211 Query: 174 IRERMIDIDQHDSLVFNIGDHEIKSNF---------LTCSDYFLGAIFENSPCRDREMQS 224 + E + D F G+ + + G E P +Q+ Sbjct: 212 VYE--ETVVLDDQQHFARGEQPADALLSAAVRHLERYLEQPFADGYRSELLPQLPYWIQA 269 Query: 225 ISDRLACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQ 279 ++ L + Y Y R TL+A H PG DL++ VDF Sbjct: 270 VAGGLKRGAMLFVDYGYPRGEFYRAQRQDGTLRAFYRHRMHEDLYRWPGLQDLTASVDFT 329 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDIL--LDSVKRLVSTSAD 337 L+ + G TQ FL G G+ +L+ Q + ++ + Sbjct: 330 ALAEAGTGAGFELAGYCTQASFLLGNGLD----ALLAQADTRTDEVGRMRLREQIKRLTL 385 Query: 338 KKSMGELFKILVVSHEKVELMPFVN 362 MGE F+++ S + F++ Sbjct: 386 PSEMGERFQVMGFSRDVDFAPAFLS 410 >gi|77748546|ref|NP_641041.2| hypothetical protein XAC0687 [Xanthomonas axonopodis pv. citri str. 306] Length = 394 Score = 228 bits (581), Expect = 1e-57, Method: Composition-based stats. Identities = 82/385 (21%), Positives = 138/385 (35%), Gaps = 37/385 (9%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIF 59 ++L + I+ G + ++ L + P GYYS FG GDFVTAPE+ +F Sbjct: 16 SDRLAAHVRAEIQAAGGAIPFSRFMELALYAPGLGYYSAGASKFGEAGDFVTAPELGPLF 75 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 ++ L +Q G R++E+G G G + +L ++E S Sbjct: 76 AATVSGALAPVLQQLG--PQARVLEVGGGSGAFAEV---TLKRLLELDALPERYAILEPS 130 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPLGFT------FLVANEFFDSLPIKQFVMTEHG 173 L Q+++L L + L ANE D+LP +F + + Sbjct: 131 ADLRERQRERLGRSLIPP--VFDLVEWLDAPFPDDWDGVLFANEVIDALPTPRFALRDGQ 188 Query: 174 IRERMIDIDQHDSLVFNIGDHEIKSNF---------LTCSDYFLGAIFENSPCRDREMQS 224 + E + D F G+ + + G E P +Q+ Sbjct: 189 VYE--ETVVLDDQQHFARGEQPADALLSAAVRHLERYLEQPFADGYRSELLPQLPYWIQA 246 Query: 225 ISDRLACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQ 279 ++ L + Y Y R TL+A H PG DL++ VDF Sbjct: 247 VAGGLKRGAMLFVDYGYPRGEFYRAQRQDGTLRAFYRHRMHEDLYRWPGLQDLTASVDFT 306 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDIL--LDSVKRLVSTSAD 337 L+ + G TQ FL G G+ +L+ Q + ++ + Sbjct: 307 ALAEAGTGAGFELAGYCTQASFLLGNGLD----ALLAQADTRTDEVGRMRLREQIKRLTL 362 Query: 338 KKSMGELFKILVVSHEKVELMPFVN 362 MGE F+++ S + F++ Sbjct: 363 PSEMGERFQVMGFSRDVDFAPAFLS 387 >gi|158336911|ref|YP_001518086.1| hypothetical protein AM1_3782 [Acaryochloris marina MBIC11017] gi|158307152|gb|ABW28769.1| conserved hypothetical protein [Acaryochloris marina MBIC11017] Length = 405 Score = 228 bits (580), Expect = 1e-57, Method: Composition-based stats. Identities = 93/382 (24%), Positives = 164/382 (42%), Gaps = 38/382 (9%) Query: 5 LIRKIVNLIKKN--GQMTVDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEISQIFGE 61 L ++I N I + ++T ++ L + DPE GYY+T G GDF T+P + FGE Sbjct: 11 LFQRIANRIHETPHQRITFAEFMELALYDPEQGYYATNQVQIGVAGDFFTSPHLCPDFGE 70 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKL---------KPDFFSVLS 112 +LA + W G P LVE+G G+G++ D+L+ + F++ L Sbjct: 71 LLAEQFLDMWRVMGQPEPFTLVEMGAGQGLVAADVLKYLATRKQSAEASDDYAAFWTALR 130 Query: 113 IYMVETSERLTLIQKKQLASYGDKINWYTSLADVPLGFT----FLVANEFFDSLPIKQFV 168 +VE +E L Q++ L + + + L T +NE D+LP+ QFV Sbjct: 131 YLIVEKAEGLIAAQQRLLQPFQFSPDKVQWMGFEQLPKTGIVGCFFSNELVDALPVHQFV 190 Query: 169 MTEHGIRERMIDIDQHDSLVFNIGDHE---------IKSNFLTCSDYFLGAIFENSPCRD 219 + + ++E + +D + + ++ + Y G E + Sbjct: 191 VQDGALQEVFVTVDAESRSFTEVISNPSTGRLAEYLVEQGIDIGAGYEDGYRSEINLAAL 250 Query: 220 REMQSISDRLACDGGTAIVIDY------GYLQSRVGDTLQAVKGH-TYVSPLVNPGQADL 272 ++++S +L G + IDY Y R+ TLQ + H + P G D+ Sbjct: 251 DWLETVSQKLDR--GYVLTIDYGYLAPQYYSPQRLQGTLQCFRRHGAHQDPYAYLGHQDI 308 Query: 273 SSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRL- 331 ++HV+F L + L G T QG FL LG+ R + T+ L + ++R Sbjct: 309 TAHVNFTTLKKQGLALGLTSMGYTQQGLFLMALGLGDRLVA-NNNTSDISQLNEVIRRRE 367 Query: 332 -VSTSADKKSMGELFKILVVSH 352 + + + +G F++L+ Sbjct: 368 TLQSLINPLGLGG-FQVLIQGK 388 >gi|239946947|ref|ZP_04698700.1| succinate dehydrogenase iron-sulfur subunit [Rickettsia endosymbiont of Ixodes scapularis] gi|239921223|gb|EER21247.1| succinate dehydrogenase iron-sulfur subunit [Rickettsia endosymbiont of Ixodes scapularis] Length = 406 Score = 228 bits (580), Expect = 1e-57, Method: Composition-based stats. Identities = 121/400 (30%), Positives = 180/400 (45%), Gaps = 58/400 (14%) Query: 8 KIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFL 67 KI LI +NG +T D + YY + GDFVTAPEISQ+FGE++ ++ Sbjct: 6 KIRQLIDQNGYITCDVLMQQVLQSNPNSYYKQVKSLASEGDFVTAPEISQLFGEIIGLWC 65 Query: 68 ICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQK 127 I W++ G P + LVELGPGRG++M D+L L P+F+ LSI ++E ++ + QK Sbjct: 66 IKEWQRIGCPKSLSLVELGPGRGLLMRDLLSTAK-LVPEFYKALSIELIEINQNFIVHQK 124 Query: 128 KQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSL 187 L I + + D+P T ++ANEFFD++PIKQ+V + ER+ + D Sbjct: 125 ANLQDIDLPIKHLSFIEDIPKKPTIIIANEFFDAMPIKQYVKVKELWYERIFVVQPVDGR 184 Query: 188 V----FNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY 243 + +I + T + GA+ E S ++ I++ L G+ + IDYGY Sbjct: 185 IKYDKISINKPLQEYLLRTHIEAKDGAVLEESYKSIEIIKFIAEHLKTQSGSCLTIDYGY 244 Query: 244 -------LQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLT 296 + + TLQAVK H Y L N G+ADLS+HVDF L +A K+ + Sbjct: 245 DIAPNGRTRYQYNPTLQAVKNHKYCPILENLGEADLSAHVDFYTLKIVAKNSKINVIDTI 304 Query: 297 TQGKFLEGLGIWQRAFSLMKQTARK--DILLDS--------------------------- 327 +Q FL GI R +L + + L Sbjct: 305 SQRDFLIENGILLRKQTLQDKLNYRHLSKLAYREEFKGDTKRSTAAYTLVREDASIGSTY 364 Query: 328 -----------------VKRLVSTSADKKSMGELFKILVV 350 ++R V K MG LFK+L + Sbjct: 365 KLPLEVEFGKMPEQAQIIERQVERLISPKQMGTLFKVLQI 404 >gi|306844530|ref|ZP_07477119.1| Hypothetical protein BIBO1_1205 [Brucella sp. BO1] gi|306275141|gb|EFM56897.1| Hypothetical protein BIBO1_1205 [Brucella sp. BO1] Length = 307 Score = 228 bits (580), Expect = 1e-57, Method: Composition-based stats. Identities = 123/311 (39%), Positives = 164/311 (52%), Gaps = 13/311 (4%) Query: 58 IFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVE 117 +FGE++ I+ + W+ P+ L E+GPGRG +M D+LR I +L P I MVE Sbjct: 1 MFGELIGIWCLSEWDALARPANFVLCEIGPGRGTLMSDMLRTIGRLAPQMLGGARIAMVE 60 Query: 118 TSERLTLIQKKQLASYGDKINWYTSLADVP----LGFTFLVANEFFDSLPIKQFVMTEHG 173 TS RL QK++LA I W+ AD+P G LV NE FD++P +QFV + Sbjct: 61 TSPRLAEKQKQKLAGTKAHIEWFERFADIPADTVHGPLILVTNELFDAIPFRQFVKADGR 120 Query: 174 IRERMIDIDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRLAC 231 ERMI +++ D F G I L GAIFE +P R MQ I+ R+A Sbjct: 121 FVERMIALNEQDEFQFVSGAGGIDPALLPKDHVKAEEGAIFEAAPARTALMQEIAGRIAA 180 Query: 232 DGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLY 291 G A+ IDYG+L+S GDTLQA+ H Y +PG ADL+SHVDF L A Sbjct: 181 TRGAALNIDYGHLESGFGDTLQAMLKHAYDDVFAHPGAADLTSHVDFDILQKTAKACGCK 240 Query: 292 INGLTTQGKFLEGLGIWQRAFSL--MKQTARKDILLDSVKRLVSTSADKKSMGELFKILV 349 G TQG+FL +G+ RA L K A ++ + V+RL A MG LFK+L Sbjct: 241 T-GTMTQGEFLLAMGLVDRAGRLGAGKDAAFQEKIRQDVERL----AAPDQMGTLFKVLA 295 Query: 350 VSHEKVELMPF 360 S + L+PF Sbjct: 296 FSDGQTRLLPF 306 >gi|78046305|ref|YP_362480.1| hypothetical protein XCV0749 [Xanthomonas campestris pv. vesicatoria str. 85-10] gi|78034735|emb|CAJ22380.1| conserved hypothetical protein [Xanthomonas campestris pv. vesicatoria str. 85-10] Length = 394 Score = 228 bits (580), Expect = 1e-57, Method: Composition-based stats. Identities = 84/383 (21%), Positives = 141/383 (36%), Gaps = 33/383 (8%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIF 59 ++L + I+ G + ++ L + P GYYS FG GDFVTAPE+ +F Sbjct: 16 SDRLAAHVRAEIQAAGGAIPFSRFMELALYAPGLGYYSAGASKFGEAGDFVTAPELGPLF 75 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 ++ L +Q G R++E+G G G + +L ++E S Sbjct: 76 AATVSGALAPVLQQLG--PQARVLEVGGGSGAFAEV---TLKRLLELDALPERYAILEPS 130 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPLGFT------FLVANEFFDSLPIKQFVMTEHG 173 L Q+++L L + L ANE D+LP +F + + Sbjct: 131 ADLRERQRERLGRSLIPP--VFDLVEWLDAPFQDDWDGVLFANEVIDALPTPRFALRDGQ 188 Query: 174 IRE--RMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFL-----GAIFENSPCRDREMQSIS 226 + E ++D QH + D + + Y G E P +Q+++ Sbjct: 189 VYEETVVLDAQQHFARGEQPADALLSAAVRHLERYLEQPFADGYRSELLPQLPYWIQAVA 248 Query: 227 DRLACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRL 281 L + Y Y R TL+A H PG DL++ VDF L Sbjct: 249 GGLKRGAMLFVDYGYPRGEFYRAQREDGTLRAFYRHRMHEDLYRWPGLQDLTASVDFTAL 308 Query: 282 SSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDIL--LDSVKRLVSTSADKK 339 + + G TQ FL G G+ +L+ Q + ++ + Sbjct: 309 AEAGTGAGFELAGYCTQASFLLGNGLD----ALLAQADTRTDEVGRMRLREQIKRLTLPS 364 Query: 340 SMGELFKILVVSHEKVELMPFVN 362 MGE F+++ S + F++ Sbjct: 365 EMGERFQVMGFSRDVDFAPAFLS 387 >gi|189024643|ref|YP_001935411.1| hypothetical protein BAbS19_I14440 [Brucella abortus S19] gi|189020215|gb|ACD72937.1| Protein of unknown function DUF185 [Brucella abortus S19] Length = 307 Score = 228 bits (580), Expect = 1e-57, Method: Composition-based stats. Identities = 122/311 (39%), Positives = 164/311 (52%), Gaps = 13/311 (4%) Query: 58 IFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVE 117 +FGE++ I+ + W+ P+ L E+GPGRG +M D+LR I +L P I MVE Sbjct: 1 MFGELIGIWCLSEWDALARPANFVLCEIGPGRGTLMSDMLRTIGRLAPQMLGGAQIAMVE 60 Query: 118 TSERLTLIQKKQLASYGDKINWYTSLADVP----LGFTFLVANEFFDSLPIKQFVMTEHG 173 TS RL QK++LA + W+ AD+P G LV NE FD++P +QFV + Sbjct: 61 TSPRLAEKQKQKLAGTKAHVEWFERFADIPADTVHGPLILVTNELFDAIPFRQFVKADGR 120 Query: 174 IRERMIDIDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRLAC 231 ERMI +++ D F G I L GAIFE +P R MQ I+ R+A Sbjct: 121 FVERMIALNEQDEFQFVSGAGGIDPALLPKDHVKAEEGAIFEAAPARTALMQEIASRIAA 180 Query: 232 DGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLY 291 G A+ IDYG+L+S GDTLQA+ Y +PG ADL+SHVDF L A Sbjct: 181 TRGAALNIDYGHLESGFGDTLQAMLKQAYDDVFAHPGVADLTSHVDFDILQKTAKACGCK 240 Query: 292 INGLTTQGKFLEGLGIWQRAFSL--MKQTARKDILLDSVKRLVSTSADKKSMGELFKILV 349 G TQG+FL +G+ RA L K A ++ + V+RL A MG LFK+L Sbjct: 241 T-GTMTQGEFLLAMGLVDRAGRLGAGKDAAFQEKIRQDVERL----AAPDQMGTLFKVLA 295 Query: 350 VSHEKVELMPF 360 S E+ L+PF Sbjct: 296 FSDEQTRLLPF 306 >gi|325925804|ref|ZP_08187176.1| hypothetical protein XPE_1130 [Xanthomonas perforans 91-118] gi|325929102|ref|ZP_08190251.1| hypothetical protein XPE_4355 [Xanthomonas perforans 91-118] gi|325540520|gb|EGD12113.1| hypothetical protein XPE_4355 [Xanthomonas perforans 91-118] gi|325543790|gb|EGD15201.1| hypothetical protein XPE_1130 [Xanthomonas perforans 91-118] Length = 417 Score = 228 bits (580), Expect = 1e-57, Method: Composition-based stats. Identities = 78/381 (20%), Positives = 133/381 (34%), Gaps = 29/381 (7%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIF 59 ++L + I+ G + ++ L + P GYYS FG GDFVTAPE+ +F Sbjct: 39 SDRLAAHVRAEIQAAGGAIPFSRFMELALYAPGLGYYSAGASKFGEAGDFVTAPELGPLF 98 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 ++ L +Q G R++E+G G G + +L ++E S Sbjct: 99 AATVSGALAPVLQQLG--PQARVLEVGGGSGAFAEV---TLKRLLELDALPERYAILEPS 153 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPLGFT------FLVANEFFDSLPIKQFVMTEHG 173 L Q+++L L + L ANE D+LP +F + + Sbjct: 154 ADLRERQRERLGRSLIPP--VFDLVEWLDAPFPDDWDGVLFANEVIDALPTPRFALRDGQ 211 Query: 174 IRERMIDIDQHDSLVFNIGDHEI-------KSNFLTCSDYFLGAIFENSPCRDREMQSIS 226 + E + +D + + G E P +Q+++ Sbjct: 212 VYEETVVLDAQQQFARGEQPADALLSAAVRHLERYLEQPFADGYRSELLPQLPYWIQAVA 271 Query: 227 DRLACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRL 281 L + Y Y R TL+A H PG DL++ VDF L Sbjct: 272 GGLKRGAMLFVDYGYPRGEFYRAQREDGTLRAFYRHRMHEDLYRWPGLQDLTASVDFTAL 331 Query: 282 SSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSM 341 + + G TQ FL G G+ ++ ++ + M Sbjct: 332 AEAGTGAGFELAGYCTQASFLLGNGLDALLAQ--ADPRTDEVGRMRLREQIKRLTLPSEM 389 Query: 342 GELFKILVVSHEKVELMPFVN 362 GE F+++ S + F++ Sbjct: 390 GERFQVMGFSRDVDFAPAFLS 410 >gi|188990161|ref|YP_001902171.1| hypothetical protein xccb100_0766 [Xanthomonas campestris pv. campestris str. B100] gi|167731921|emb|CAP50105.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris] Length = 394 Score = 228 bits (580), Expect = 1e-57, Method: Composition-based stats. Identities = 80/380 (21%), Positives = 134/380 (35%), Gaps = 29/380 (7%) Query: 2 ENKLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIF 59 ++L + I G + ++ L + P GYYS FG GDFVTAPE+ +F Sbjct: 16 SDQLAAHLRAEITAAGGAIPFSRFMELALYAPGLGYYSAGASKFGESGDFVTAPELGPLF 75 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 ++ L +Q G R++E+G G G + +L ++E S Sbjct: 76 AATVSGALAPVLQQLG--PQARVLEVGGGSGAFAEV---TLKRLLELDALPERYAILEPS 130 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPLGFT------FLVANEFFDSLPIKQFVMTEHG 173 L Q+++L L + G L ANE D+LP +F + + Sbjct: 131 ADLRERQRERLGRSLIPP--VFDLVEWLDGPFPDDWDGVLFANEVIDALPTPRFTLRDGE 188 Query: 174 IRERMIDIDQHDSLVFNIGDHEI-------KSNFLTCSDYFLGAIFENSPCRDREMQSIS 226 + E + +D + + G E P +Q+++ Sbjct: 189 VYEETVVLDARQHFARGEQPADALLGAAVRHLERYLEQPFADGYRSELLPQLPHWIQAVA 248 Query: 227 DRLACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRL 281 L + Y Y R TL+A H PG DL++ VDF L Sbjct: 249 GGLKRGAMLFVDYGYPRGEFYRAQREDGTLRAFYRHRMHTDLYRWPGLQDLTASVDFTAL 308 Query: 282 SSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSM 341 + + G TQ FL G G+ +T + +++ V M Sbjct: 309 AEAGTGAGFELAGYCTQANFLLGNGLDALLTQADARTDEVGRM--RLRQQVKQLTLPSEM 366 Query: 342 GELFKILVVSHEKVELMPFV 361 GE F+++ + + F+ Sbjct: 367 GERFQVMGFARDVDFAPAFL 386 >gi|50550583|ref|XP_502764.1| YALI0D12859p [Yarrowia lipolytica] gi|49648632|emb|CAG80952.1| YALI0D12859p [Yarrowia lipolytica] Length = 495 Score = 228 bits (580), Expect = 2e-57, Method: Composition-based stats. Identities = 111/421 (26%), Positives = 171/421 (40%), Gaps = 68/421 (16%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLA 64 L + I++ G M+V + C+ +P GYY +P GA GDF T+PEISQ+FGE++ Sbjct: 49 LSTTLAMAIEQQGPMSVATFMKHCLTNPSGGYYIDKDPLGAKGDFTTSPEISQMFGELVG 108 Query: 65 IFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVI--CKLKPDFFSVLSIYMVETSERL 122 ++L W +G R++E GPGRG +M D LR + K ++ + +VE S L Sbjct: 109 LWLAAQWLYYGQKQPFRVIEYGPGRGTLMDDSLRALVSAKSTGAKEALKEVLLVEASPVL 168 Query: 123 TLIQKKQLA------------------SYGDKINWYTSLADVPLG-------FTFLVANE 157 Q+K+L YG I WY + ++VA+E Sbjct: 169 RDAQRKKLCGAESQFKTEEDGSITCVTKYGVPIRWYEDSKMLDKLASSNDPLHNYIVAHE 228 Query: 158 FFDSLPIKQFVMTEHGIRERMIDIDQHDSLV----------------------------- 188 FFD+LPI QF T+ G RE M++ + Sbjct: 229 FFDALPIYQFEKTDKGWRELMVNYGVENKTKESSILLPGQTHIKSSDLDKDKKKTFHLVT 288 Query: 189 ---FNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQ 245 + + I + D + E P + RL GG A ++DY Sbjct: 289 APTWTVASKVIPQSHKRYRDLPEWSKIEVCPDAWDVANQMG-RLVAKGGAAFIVDYAVKP 347 Query: 246 SRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRL---SSIAILYKLYINGLTTQGKFL 302 +TL+ ++ H SP PG+ DLS+ VDF + S + G Q +L Sbjct: 348 GVPVNTLRGIRDHKICSPFEEPGKVDLSADVDFTAIGIASRSKNKENVSAFGPINQATWL 407 Query: 303 EGLGIWQRAFSLMKQTAR--KDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM-P 359 + +GI R LM+ K + KRLV + MG+++K ++H Sbjct: 408 KNMGIEMRTEKLMEGKEEYIKKRIESQYKRLVDIGIN--GMGKIYKAFFLTHSSHGYPVG 465 Query: 360 F 360 F Sbjct: 466 F 466 >gi|33599197|ref|NP_886757.1| hypothetical protein BB0208 [Bordetella bronchiseptica RB50] gi|33575243|emb|CAE30706.1| conserved hypothetical protein [Bordetella bronchiseptica RB50] Length = 419 Score = 228 bits (580), Expect = 2e-57, Method: Composition-based stats. Identities = 95/379 (25%), Positives = 159/379 (41%), Gaps = 41/379 (10%) Query: 7 RKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTCN-----------PFGAVGDFVTAPE 54 R + I G + Q+ + + P GYY+ P GDFVTAPE Sbjct: 48 RHLRAAIAAAGGWLPFSQWMSAALYAPGLGYYTAGATKLASPADAQGPALPAGDFVTAPE 107 Query: 55 ISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKL-KPDFFSVLSI 113 ++ +F +A + ++E G G G + +LR + L P + Sbjct: 108 LTPLFAATVARQIAQVLRAT---DTASVLEFGAGTGALAEGVLRALAGLDCPARY----- 159 Query: 114 YMVETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEH- 172 +VE S L Q+ +LA +GD++ W L D G ++ NE D++P F +E Sbjct: 160 LIVEVSADLRQRQQSRLAPFGDRVQWLDQLPDAFAGC--VLGNEVLDAMPATLFRWSETG 217 Query: 173 GIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFL--GAIFENSPCRDREMQSISDRLA 230 ++ER + +D + D + G + E + + ++ + L Sbjct: 218 VVQERGVTVDADGEFAWQDRDADAPLAQAVAQRMPPLPGYVSEINLQAEAWVRGMG--LW 275 Query: 231 CDGGTAIVIDY------GYLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHVDFQRLSS 283 G A+++DY Y R G T + ++ H + P V PG D++SHVDF ++ Sbjct: 276 LQRGAALLLDYGFPRSEYYHPQRAGGTLMCHLRHHAHADPFVAPGLQDITSHVDFTAMAD 335 Query: 284 IAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDS-VKRLVSTSADKKSMG 342 A+ L + G +Q +FL G+ L AR + V++L+S + MG Sbjct: 336 AALAGGLQVLGYLSQARFLVNAGLLDALSQLDPADARAYAQAVAPVQKLLS----EAEMG 391 Query: 343 ELFKILVVSHE-KVELMPF 360 ELFK+L V + L+ F Sbjct: 392 ELFKVLAVGRDMPEPLLGF 410 >gi|229586282|ref|YP_002844783.1| hypothetical protein RAF_ORF0065 [Rickettsia africae ESF-5] gi|228021332|gb|ACP53040.1| Unknown [Rickettsia africae ESF-5] Length = 406 Score = 228 bits (580), Expect = 2e-57, Method: Composition-based stats. Identities = 120/400 (30%), Positives = 181/400 (45%), Gaps = 58/400 (14%) Query: 8 KIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFL 67 KI LI +NG +T D + YY GDFVTAPEISQ+FGE++ ++ Sbjct: 6 KIRQLIDQNGYITCDVLMQEVLNLNPTSYYKQVKSLANEGDFVTAPEISQLFGEIIGLWC 65 Query: 68 ICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQK 127 I W++ G P + LVELGPGRG++M D+LR L P+F+ LSI ++E ++ QK Sbjct: 66 IREWQRIGCPKSLSLVELGPGRGLLMRDLLRTAK-LVPEFYKALSIELIEINKNFIAYQK 124 Query: 128 KQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSL 187 L I+ + + D+P T ++ANEFFD++PIKQ++ + ER+ + D Sbjct: 125 ANLQDINLPISHQSFVEDIPKKPTIIIANEFFDAIPIKQYIKVKELWYERIFVVQPVDER 184 Query: 188 V----FNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY 243 + ++ + T + GA+ E S ++ I+ L G+ ++IDYGY Sbjct: 185 IKYDKISVNKQLQEYLLCTHIEAKDGAVLEESYKSIEIIKFIAQHLKRLSGSGLIIDYGY 244 Query: 244 -------LQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLT 296 + + TLQAVK H Y L N G+ADLS+HVDF L ++A K+ + Sbjct: 245 DIAPNGRTRYQYNQTLQAVKNHKYCPILDNLGEADLSAHVDFYALKTVAKNSKINVIDTI 304 Query: 297 TQGKFLEGLGIWQRAFSLMKQTARK----------------------------------- 321 +Q FL GI R +L + + Sbjct: 305 SQRDFLIENGILLRKQTLQDKLNDRHHAKFAYREEFKGDTKHSTAAYTLVREDASIGSTY 364 Query: 322 -----------DILLDSVKRLVSTSADKKSMGELFKILVV 350 ++R V K MGELFK+L + Sbjct: 365 KLPLEVEFGKMSEQAQIIERQVERLISPKQMGELFKVLQI 404 >gi|21114689|gb|AAM42701.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. ATCC 33913] Length = 417 Score = 228 bits (580), Expect = 2e-57, Method: Composition-based stats. Identities = 80/380 (21%), Positives = 134/380 (35%), Gaps = 29/380 (7%) Query: 2 ENKLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIF 59 ++L + I G + ++ L + P GYYS FG GDFVTAPE+ +F Sbjct: 39 SDQLAAHLRAEITAAGGAIPFSRFMELALYAPGLGYYSAGASKFGESGDFVTAPELGPLF 98 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 ++ L +Q G R++E+G G G + +L ++E S Sbjct: 99 AATVSGALAPVLQQLG--PQARVLEVGAGSGAFAEV---TLKRLLELDALPERYAILEPS 153 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPLGFT------FLVANEFFDSLPIKQFVMTEHG 173 L Q+++L L + G L ANE D+LP +F + + Sbjct: 154 ADLRERQRERLGRSLIPP--VFDLVEWLDGPFPDDWDGVLFANEVIDALPTPRFTLRDGE 211 Query: 174 IRERMIDIDQHDSLVFNIGDHEI-------KSNFLTCSDYFLGAIFENSPCRDREMQSIS 226 + E + +D + + G E P +Q+++ Sbjct: 212 VYEETVVLDARQHFARGEQPADALLGAAVRHLERYLEQPFADGYRSELLPQLPHWIQAVA 271 Query: 227 DRLACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRL 281 L + Y Y R TL+A H PG DL++ VDF L Sbjct: 272 GGLKRGAMLFVDYGYPRGEFYRAQREDGTLRAFYRHRMHTDLYRWPGLQDLTASVDFTAL 331 Query: 282 SSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSM 341 + + G TQ FL G G+ +T + +++ V M Sbjct: 332 AEAGTGAGFELAGYCTQANFLLGNGLDALLTQADARTDEVGRM--RLRQQVKQLTLPSEM 389 Query: 342 GELFKILVVSHEKVELMPFV 361 GE F+++ + + F+ Sbjct: 390 GERFQVMGFARDVDFAPAFL 409 >gi|77747936|ref|NP_638777.2| hypothetical protein XCC3432 [Xanthomonas campestris pv. campestris str. ATCC 33913] Length = 394 Score = 228 bits (580), Expect = 2e-57, Method: Composition-based stats. Identities = 80/380 (21%), Positives = 134/380 (35%), Gaps = 29/380 (7%) Query: 2 ENKLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIF 59 ++L + I G + ++ L + P GYYS FG GDFVTAPE+ +F Sbjct: 16 SDQLAAHLRAEITAAGGAIPFSRFMELALYAPGLGYYSAGASKFGESGDFVTAPELGPLF 75 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 ++ L +Q G R++E+G G G + +L ++E S Sbjct: 76 AATVSGALAPVLQQLG--PQARVLEVGAGSGAFAEV---TLKRLLELDALPERYAILEPS 130 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPLGFT------FLVANEFFDSLPIKQFVMTEHG 173 L Q+++L L + G L ANE D+LP +F + + Sbjct: 131 ADLRERQRERLGRSLIPP--VFDLVEWLDGPFPDDWDGVLFANEVIDALPTPRFTLRDGE 188 Query: 174 IRERMIDIDQHDSLVFNIGDHEI-------KSNFLTCSDYFLGAIFENSPCRDREMQSIS 226 + E + +D + + G E P +Q+++ Sbjct: 189 VYEETVVLDARQHFARGEQPADALLGAAVRHLERYLEQPFADGYRSELLPQLPHWIQAVA 248 Query: 227 DRLACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRL 281 L + Y Y R TL+A H PG DL++ VDF L Sbjct: 249 GGLKRGAMLFVDYGYPRGEFYRAQREDGTLRAFYRHRMHTDLYRWPGLQDLTASVDFTAL 308 Query: 282 SSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSM 341 + + G TQ FL G G+ +T + +++ V M Sbjct: 309 AEAGTGAGFELAGYCTQANFLLGNGLDALLTQADARTDEVGRM--RLRQQVKQLTLPSEM 366 Query: 342 GELFKILVVSHEKVELMPFV 361 GE F+++ + + F+ Sbjct: 367 GERFQVMGFARDVDFAPAFL 386 >gi|34580971|ref|ZP_00142451.1| hypothetical protein [Rickettsia sibirica 246] gi|28262356|gb|EAA25860.1| unknown [Rickettsia sibirica 246] Length = 406 Score = 228 bits (580), Expect = 2e-57, Method: Composition-based stats. Identities = 120/400 (30%), Positives = 181/400 (45%), Gaps = 58/400 (14%) Query: 8 KIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFL 67 KI LI +NG +T D + YY GDFVTAPEISQ+FGE++ ++ Sbjct: 6 KIRQLIDQNGYITCDVLMQEVLNLNPTSYYKQVKSLANEGDFVTAPEISQLFGEIIGLWC 65 Query: 68 ICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQK 127 I W++ G P + LVELGPGRG++M D+LR L P+F+ LSI ++E ++ QK Sbjct: 66 IREWQRIGCPKSLSLVELGPGRGLLMRDLLRTAK-LVPEFYKALSIELIEINKNFIAYQK 124 Query: 128 KQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSL 187 L I+ + + D+P T ++ANEFFD++PIKQ++ + ER+ + D Sbjct: 125 ANLQDINLPISHQSFVEDIPKKPTIIIANEFFDAIPIKQYIKVKELWYERIFVVQPVDER 184 Query: 188 V----FNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY 243 + ++ + T + GA+ E S ++ I+ L G+ ++IDYGY Sbjct: 185 IKYDKISVNKQLQEYLLRTHIEAKDGAVLEESYKSIEIIKFIAQHLKRLSGSGLIIDYGY 244 Query: 244 -------LQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLT 296 + + TLQAVK H Y L N G+ADLS+HVDF L ++A K+ + Sbjct: 245 DIAPNGRTRYQYNQTLQAVKNHKYCPILENLGEADLSAHVDFYALKTVAKNSKINVIDTI 304 Query: 297 TQGKFLEGLGIWQRAFSLMKQTARK----------------------------------- 321 +Q FL GI R +L + + Sbjct: 305 SQRDFLIENGILLRKQTLQDKLNDRHLAKFAYREEFKGDTKRNTAAYTLVREDASIGSTY 364 Query: 322 -----------DILLDSVKRLVSTSADKKSMGELFKILVV 350 ++R V K MGELFK+L + Sbjct: 365 KLPLEVEFGKMSEQAQIIERQVERLISPKQMGELFKVLQI 404 >gi|66572401|gb|AAY47811.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. 8004] Length = 417 Score = 227 bits (579), Expect = 2e-57, Method: Composition-based stats. Identities = 80/380 (21%), Positives = 134/380 (35%), Gaps = 29/380 (7%) Query: 2 ENKLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIF 59 ++L + I G + ++ L + P GYYS FG GDFVTAPE+ +F Sbjct: 39 SDQLAAHLRAEITAAGGAIPFSRFMELALYAPGLGYYSAGASKFGESGDFVTAPELGPLF 98 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 ++ L +Q G R++E+G G G + +L ++E S Sbjct: 99 AATVSGALAPVLQQLG--PQARVLEVGGGSGAFAEV---TLKRLLELDALPERYAILEPS 153 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPLGFT------FLVANEFFDSLPIKQFVMTEHG 173 L Q+++L L + G L ANE D+LP +F + + Sbjct: 154 ADLRERQRERLGRSLIPP--VFDLVEWLDGPFPDDWDGVLFANEVIDALPTPRFTLRDGE 211 Query: 174 IRERMIDIDQHDSLVFNIGDHEI-------KSNFLTCSDYFLGAIFENSPCRDREMQSIS 226 + E + +D + + G E P +Q+++ Sbjct: 212 VYEETVVLDARQHFARGEQPADALLGAAVRHLERYLEQPFADGYRSELLPQLPYWIQAVA 271 Query: 227 DRLACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRL 281 L + Y Y R TL+A H PG DL++ VDF L Sbjct: 272 GGLKRGAMLFVDYGYPRGEFYRAQREDGTLRAFYRHRMHTDLYRWPGLQDLTASVDFTAL 331 Query: 282 SSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSM 341 + + G TQ FL G G+ +T + +++ V M Sbjct: 332 AEAGTGAGFELAGYCTQANFLLGNGLDALLTQADARTDEVGRM--RLRQQVKQLTLPSEM 389 Query: 342 GELFKILVVSHEKVELMPFV 361 GE F+++ + + F+ Sbjct: 390 GERFQVMGFARDVDFAPAFL 409 >gi|77761110|ref|YP_241831.2| hypothetical protein XC_0732 [Xanthomonas campestris pv. campestris str. 8004] Length = 394 Score = 227 bits (579), Expect = 2e-57, Method: Composition-based stats. Identities = 80/380 (21%), Positives = 134/380 (35%), Gaps = 29/380 (7%) Query: 2 ENKLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIF 59 ++L + I G + ++ L + P GYYS FG GDFVTAPE+ +F Sbjct: 16 SDQLAAHLRAEITAAGGAIPFSRFMELALYAPGLGYYSAGASKFGESGDFVTAPELGPLF 75 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 ++ L +Q G R++E+G G G + +L ++E S Sbjct: 76 AATVSGALAPVLQQLG--PQARVLEVGGGSGAFAEV---TLKRLLELDALPERYAILEPS 130 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPLGFT------FLVANEFFDSLPIKQFVMTEHG 173 L Q+++L L + G L ANE D+LP +F + + Sbjct: 131 ADLRERQRERLGRSLIPP--VFDLVEWLDGPFPDDWDGVLFANEVIDALPTPRFTLRDGE 188 Query: 174 IRERMIDIDQHDSLVFNIGDHEI-------KSNFLTCSDYFLGAIFENSPCRDREMQSIS 226 + E + +D + + G E P +Q+++ Sbjct: 189 VYEETVVLDARQHFARGEQPADALLGAAVRHLERYLEQPFADGYRSELLPQLPYWIQAVA 248 Query: 227 DRLACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRL 281 L + Y Y R TL+A H PG DL++ VDF L Sbjct: 249 GGLKRGAMLFVDYGYPRGEFYRAQREDGTLRAFYRHRMHTDLYRWPGLQDLTASVDFTAL 308 Query: 282 SSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSM 341 + + G TQ FL G G+ +T + +++ V M Sbjct: 309 AEAGTGAGFELAGYCTQANFLLGNGLDALLTQADARTDEVGRM--RLRQQVKQLTLPSEM 366 Query: 342 GELFKILVVSHEKVELMPFV 361 GE F+++ + + F+ Sbjct: 367 GERFQVMGFARDVDFAPAFL 386 >gi|299068282|emb|CBJ39503.1| conserved protein of unknown function [Ralstonia solanacearum CMR15] Length = 397 Score = 227 bits (579), Expect = 2e-57, Method: Composition-based stats. Identities = 85/380 (22%), Positives = 147/380 (38%), Gaps = 27/380 (7%) Query: 2 ENKLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTCN-PFGAV----GDFVTAPEI 55 ++L IV ++ G + ++Y L + P GYYS FG GDF+TAPE+ Sbjct: 18 SDRLFSTIVRAVESAGGWLPFERYMELALYAPGLGYYSGGAAKFGRRVEDGGDFITAPEL 77 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 + FG +A + + P ++E G G G + DIL + L S + Sbjct: 78 TPFFGRTVAHQIAQVLQTL-PPGQRHVLEFGAGTGRLAADILTELETLGMRPDS---YGI 133 Query: 116 VETSERLTLIQKKQLASYGDKINWYTSLAD--VPLGFTFLVANEFFDSLPIKQFVMTEHG 173 VE S L Q++ LA+ G + D +V NE D++P+ + Sbjct: 134 VELSGELRQRQQQALAALGPDLTGLARWHDALPARFTGVMVGNEVLDAMPVSLWARRGGV 193 Query: 174 IRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYF--LGAIFENSPCRDREMQSISDRLAC 231 R + D L ++ + + + E + ++S L Sbjct: 194 WHRRGVAFDADQGLRWSERAADPAEVPPKLAALPGRDDFVTEAHEAAEGFIRSAGAALER 253 Query: 232 DGGTAI-----VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAI 286 I +Y + G + + H + P PG D+++HVDF ++ A Sbjct: 254 GLLLLIDYGFPAGEYYHAHRANGTLMCHYRQHAHDDPFWLPGLQDITAHVDFSGIALAAR 313 Query: 287 LYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR-KDILLDSVKRLVSTSADKKSMGELF 345 L + G +Q +FL G+ Q +L ++V++L+S + MGELF Sbjct: 314 ETGLEVLGYASQARFLLSAGVGQLLMTLDPADPVCFLPAANAVQKLLS----EAEMGELF 369 Query: 346 KILVVSH---EKVELMPFVN 362 K + + + L F + Sbjct: 370 KAIALGRGLDAALPLAGFAD 389 >gi|119896778|ref|YP_931991.1| hypothetical protein azo0487 [Azoarcus sp. BH72] gi|119669191|emb|CAL93104.1| conserved hypothetical protein [Azoarcus sp. BH72] Length = 390 Score = 227 bits (579), Expect = 2e-57, Method: Composition-based stats. Identities = 97/379 (25%), Positives = 155/379 (40%), Gaps = 32/379 (8%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIF 59 +L+ + I G ++ +Y + + P GYYS FG GDF+TAPE++ +F Sbjct: 15 SARLVALLHAEIAAAGGWLSFARYMEITLYAPGLGYYSGGARKFGPGGDFITAPELTPLF 74 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 G+ LA + EQ S ++E+G G G++ D+L + + S ++E S Sbjct: 75 GQALASQV----EQVMRASAPAVIEVGAGTGLLATDLLLELERRGCLPDS---YGILELS 127 Query: 120 ERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIR 175 L Q L ++ W SL + G +VANE D +P+ V G+ Sbjct: 128 GELRERQFDTLASQAPHLAGRVRWLESLPESFSG--AVVANEVLDVMPVHLVVARAEGLF 185 Query: 176 ERMI-DIDQHDSLVFNIGD-------HEIKSNFLTCSDYFLGAIFENSPCRDREMQSISD 227 ER + + + D D E + + E + + + ++ Sbjct: 186 ERGVAVVQREDGPALQWADVPAAGAVREAALALQLPTPSSGEYVTEINLAGGAWVAAWAE 245 Query: 228 RLACDGGTAIVIDYG----YLQSRVGDTL-QAVKGHTYVSPLVNPGQADLSSHVDFQRLS 282 RL I Y YL SR G TL + H + P + PG D+++ VDF ++ Sbjct: 246 RLRQGAMLLIDYGYPRAEYYLPSRSGGTLLCYYRHHAHGDPFLWPGLNDITAFVDFTAVA 305 Query: 283 SIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMG 342 A L + G TTQ +FL G+ L ++ + R V + MG Sbjct: 306 EAAFGAGLDVTGYTTQAQFLFNCGV---LECLARRGPEERPEYIRAARAVQRLTAPQEMG 362 Query: 343 ELFKILVVSHE-KVELMPF 360 ELFK+L VS L+ F Sbjct: 363 ELFKVLAVSRGLSEPLLGF 381 >gi|325915059|ref|ZP_08177388.1| hypothetical protein XVE_1272 [Xanthomonas vesicatoria ATCC 35937] gi|325538757|gb|EGD10424.1| hypothetical protein XVE_1272 [Xanthomonas vesicatoria ATCC 35937] Length = 394 Score = 227 bits (579), Expect = 2e-57, Method: Composition-based stats. Identities = 84/382 (21%), Positives = 142/382 (37%), Gaps = 33/382 (8%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIF 59 ++L + I+ G + ++ L + P GYYS + FG GDFVTAPE+ +F Sbjct: 16 SDRLAAHMRAEIQAAGGAIPFSRFMELALYAPGLGYYSAGSSKFGEAGDFVTAPELGPLF 75 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 ++ L +Q G R++E+G G G + +L ++E S Sbjct: 76 AATVSGALAPVLQQLG--PQARMLEVGGGSGAFAEV---TLKRLLELDALPERYAILEPS 130 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPLGFT------FLVANEFFDSLPIKQFVMTEHG 173 L Q+++L L + G L ANE D+LP +F + + Sbjct: 131 ADLRERQRERLGRSLIPP--VFDLVEWLDGPFPDDWDGVLFANEVIDALPTPRFAIRDGE 188 Query: 174 IRE--RMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFL-----GAIFENSPCRDREMQSIS 226 + E ++D QH + D + + Y G E P +Q+++ Sbjct: 189 VYEETVVLDAQQHFARGEQPADALLSAAVRHLERYLEQPFADGYRSELLPQLPYWIQAVA 248 Query: 227 DRLACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRL 281 L + Y Y R TL+A H PG DL++ VDF L Sbjct: 249 GGLKRGAMVFVDYGYPRGEFYRAQREDGTLRAFYRHRMHEDLYRWPGLQDLTASVDFTAL 308 Query: 282 SSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDIL--LDSVKRLVSTSADKK 339 + + G TQ FL G G+ +L+ Q + ++ + Sbjct: 309 AEAGTGAGFELAGYCTQASFLLGNGLD----ALLTQADTRTDEVGRMRLREQIKRLTLPS 364 Query: 340 SMGELFKILVVSHEKVELMPFV 361 MGE F+++ + + F+ Sbjct: 365 EMGERFQVMGFARDVDIAPAFL 386 >gi|163859074|ref|YP_001633372.1| hypothetical protein Bpet4753 [Bordetella petrii DSM 12804] gi|163262802|emb|CAP45105.1| conserved hypothetical protein [Bordetella petrii] Length = 398 Score = 227 bits (579), Expect = 2e-57, Method: Composition-based stats. Identities = 95/385 (24%), Positives = 157/385 (40%), Gaps = 40/385 (10%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTCN-----------PFG---AV 46 ++ + I + +G + DQ+ A + P GYY+ N P G A Sbjct: 19 SARMAAHLRQAIAERDGWLPFDQWMAQALYAPGLGYYAAGNVKLASAEAARTPDGLPVAP 78 Query: 47 GDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPD 106 GDFVTAPE++ +FG +A G ++E G G G + +L + Sbjct: 79 GDFVTAPELTPLFGHTVARQAADILAATG---TDTVLEFGAGTGALADSVLAALDAQGI- 134 Query: 107 FFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQ 166 ++E S L Q+ +LA YG ++ W +L + G ++ANE D++P+ Sbjct: 135 ---SAQYRIIEVSADLRARQQARLAGYGARVQWLDALPERFAGC--VLANEVLDAMPVSL 189 Query: 167 FVM-TEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFL--GAIFENSPCRDREMQ 223 F + + ER + +D ++ + G + E + + MQ Sbjct: 190 FRWNDDGVLLERGVALDAAGQFIWQDRAAGEPLARAVAARMPPLPGYVSEINLQAEAWMQ 249 Query: 224 SISDRLACDGGTAIVIDY------GYLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHV 276 ++ L G A++ DY Y R G T + ++ H + PL PG D+++HV Sbjct: 250 AMGGWLER--GAALLFDYGFPRGEYYHPQRAGGTLMCHLRHHAHADPLAAPGVQDITAHV 307 Query: 277 DFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSA 336 DF ++ A L + G T+Q +FL G+ A L + V V Sbjct: 308 DFTAMADAAQGAGLQVLGYTSQARFLMNAGL---AELLARHDPADARNYAQVVAPVQKLL 364 Query: 337 DKKSMGELFKILVVSHE-KVELMPF 360 + MGELFK+L V + L F Sbjct: 365 SEAEMGELFKVLAVGRDMPAPLRGF 389 >gi|289663938|ref|ZP_06485519.1| hypothetical protein XcampvN_12920 [Xanthomonas campestris pv. vasculorum NCPPB702] Length = 394 Score = 227 bits (578), Expect = 2e-57, Method: Composition-based stats. Identities = 79/383 (20%), Positives = 137/383 (35%), Gaps = 33/383 (8%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIF 59 ++L + I+ G + ++ L + P GYYS FG GDFVT+PE+ +F Sbjct: 16 SDRLAAHVRAEIQAAGGAIPFSRFMELALYAPGLGYYSAGASKFGEAGDFVTSPELGPLF 75 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 ++ L +Q G R++E+G G G + +L ++E S Sbjct: 76 AATVSGALAPVLQQLG--PQARVLEVGGGSGAFAEV---TLKRLLELDALPERYAILEPS 130 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPLGFT------FLVANEFFDSLPIKQFVMTEHG 173 L Q+++L L + L ANE D+LP +F + + Sbjct: 131 ADLRERQRERLGRSLIPP--VFDLVEWLDAPFPDDWDGVLFANEVIDALPTPRFALRDGE 188 Query: 174 IRERMIDIDQHDSLVFNIGDHEI-------KSNFLTCSDYFLGAIFENSPCRDREMQSIS 226 + E + +D + + G E+ P +Q+++ Sbjct: 189 VYEETVVLDAQQQFARGEQPADALLSAAVRHLERYLEQPFADGYRSESLPQLPYWIQAVA 248 Query: 227 DRLACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRL 281 L + Y Y R TL+A H PG DL++ VDF L Sbjct: 249 GGLKRGAMLFVDYGYPRGEFYRAQREDGTLRAFYRHRMHEDLYRWPGLQDLTASVDFTAL 308 Query: 282 SSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDIL--LDSVKRLVSTSADKK 339 + + G TQ FL G G+ +L+ Q + ++ + Sbjct: 309 AEAGTGAGFELAGYCTQASFLLGNGLD----ALLAQADTRTDEVGRMRLREQIKRLTLPS 364 Query: 340 SMGELFKILVVSHEKVELMPFVN 362 MGE F+++ S + F++ Sbjct: 365 EMGERFQVMGFSRDVDFAPAFLS 387 >gi|157827945|ref|YP_001494187.1| hypothetical protein A1G_00455 [Rickettsia rickettsii str. 'Sheila Smith'] gi|165932633|ref|YP_001649422.1| hypothetical protein RrIowa_0094 [Rickettsia rickettsii str. Iowa] gi|157800426|gb|ABV75679.1| hypothetical protein A1G_00455 [Rickettsia rickettsii str. 'Sheila Smith'] gi|165907720|gb|ABY72016.1| hypothetical cytosolic protein [Rickettsia rickettsii str. Iowa] Length = 406 Score = 227 bits (578), Expect = 2e-57, Method: Composition-based stats. Identities = 121/400 (30%), Positives = 181/400 (45%), Gaps = 58/400 (14%) Query: 8 KIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFL 67 KI LI +NG +T D + YY GDFVTAPEISQ+FGE++ ++ Sbjct: 6 KIRQLIDQNGYITCDVLMQEVLNLNPTSYYKQVKSLANEGDFVTAPEISQLFGEIIGLWC 65 Query: 68 ICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQK 127 I W++ G P + LVELGPGRG++M D+LR L P+F+ LSI ++E ++ QK Sbjct: 66 IREWQRIGCPKSLSLVELGPGRGLLMRDLLRTAK-LVPEFYKALSIELIEINKNFIAYQK 124 Query: 128 KQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSL 187 L I+ + + D+P T ++ANEFFD++PIKQ++ + ER+ + D Sbjct: 125 ANLQDINLPISHQSFVEDIPKKPTIIIANEFFDAIPIKQYIKVKELWYERIFVMQPVDER 184 Query: 188 V----FNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY 243 + ++ + T + GAI E S ++ I+ L G+ ++IDYGY Sbjct: 185 IKYDKISVNKQLQEYLLCTHIEAKDGAILEESYKSIEIIKFIAQHLKRLSGSGLIIDYGY 244 Query: 244 -------LQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLT 296 + + TLQAVK H Y L N G+ADLS+HVDF L ++A K+ + Sbjct: 245 DIAPNGRTRYQYNQTLQAVKNHKYCPILENLGEADLSAHVDFYALKTVAKNSKINVIDTI 304 Query: 297 TQGKFLEGLGIWQRAFSLMKQTARK----------------------------------- 321 +Q FL GI R +L + + Sbjct: 305 SQRDFLIENGILLRKQTLQDKLNDRHLAKFAYREEFKGDTQRSNAAYTLVREDASIGSTY 364 Query: 322 -----------DILLDSVKRLVSTSADKKSMGELFKILVV 350 ++R V K MGELFK+L + Sbjct: 365 KLPLEVEFGKMSEQAQIIERQVERLISPKQMGELFKVLQI 404 >gi|196231309|ref|ZP_03130168.1| protein of unknown function DUF185 [Chthoniobacter flavus Ellin428] gi|196224645|gb|EDY19156.1| protein of unknown function DUF185 [Chthoniobacter flavus Ellin428] Length = 379 Score = 227 bits (578), Expect = 2e-57, Method: Composition-based stats. Identities = 94/371 (25%), Positives = 147/371 (39%), Gaps = 18/371 (4%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYY-STCNPFGAVGDFVTAPEISQIFG 60 EN + I++ G + Y A + DP++GYY ++ G GDF T + +FG Sbjct: 5 ENTAKNALRAEIQQRGPIPFRDYMARVLYDPDYGYYGASKAQVGRAGDFFTNVSVGPLFG 64 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 +LA W + G P+ +VE G G D+L + + ++ S+++VE Sbjct: 65 RLLARQFAEMWRRLGEPADFAIVEQGANSGDFAGDVLSALREFDAACYAATSLWLVEPLA 124 Query: 121 RLTLIQKKQLASY-GDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMI 179 +L L Q ++L + KI W S +P +NE D+ P+ + ER + Sbjct: 125 KLRLAQTERLRDFGSSKIRWVDSPTALPSFQGVHFSNELLDAFPVHRVCRRGDRWMERSV 184 Query: 180 DIDQHDSLVFN---IGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTA 236 D Q VF I E+ ++ G E + + ++ +L+ A Sbjct: 185 DF-QEGRFVFVDAEIASPELNAHLGHLPPVPEGYETEVNLAIAPWLAEVASKLSAGFVLA 243 Query: 237 IVIDYG----YLQSRVGDTLQAVKGHTY-VSPLVNPGQADLSSHVDFQRLSSIAILYKLY 291 I Y Y R TL A H PL PG+ DL++HVDF L+ A L L Sbjct: 244 IDYGYPRGDYYRPERSSGTLSAYAAHRREPDPLQRPGEIDLTAHVDFTILAETAPLLGLQ 303 Query: 292 INGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVS 351 + G T Q F+ GL R A + R T MG+ F + ++ Sbjct: 304 LAGFTDQHHFMVGL---SRLH-FSDDYAMTTAGQQEL-RAFRTLMHPTLMGQSFHAICLA 358 Query: 352 HE--KVELMPF 360 L F Sbjct: 359 KGVTNTALSGF 369 >gi|238650512|ref|YP_002916364.1| hypothetical protein RPR_02415 [Rickettsia peacockii str. Rustic] gi|238624610|gb|ACR47316.1| hypothetical protein RPR_02415 [Rickettsia peacockii str. Rustic] Length = 406 Score = 227 bits (578), Expect = 2e-57, Method: Composition-based stats. Identities = 120/400 (30%), Positives = 182/400 (45%), Gaps = 58/400 (14%) Query: 8 KIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFL 67 KI LI +NG +T D + YY GDFVTAPEISQ+FGE++ ++ Sbjct: 6 KIRQLIDQNGYITCDVLMQEVLNLNPTSYYKQVKSLANEGDFVTAPEISQLFGEIIGLWC 65 Query: 68 ICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQK 127 I W++ G P + LVELGPGRG++M D+LR L P+F+ LSI ++E ++ QK Sbjct: 66 IREWQRIGCPKSLSLVELGPGRGLLMRDLLRTAK-LVPEFYKALSIELIEINKNFIAYQK 124 Query: 128 KQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSL 187 L I+ + + D+P T ++ANEFFD++PIKQ++ + ER+ + D Sbjct: 125 ANLQDINLPISHQSFVEDIPKKPTIIIANEFFDAIPIKQYIKVKELWYERIFVVQPVDER 184 Query: 188 V----FNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY 243 + ++ + T + GA+ E S ++ I+ L G+ ++IDYGY Sbjct: 185 IKYDKISVNKQLQEYLLCTHIEAKDGAVLEESYKSIEIIKFIAQHLKRLSGSGLIIDYGY 244 Query: 244 -------LQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLT 296 + + TLQAVK H Y L N G+ADLS+HVDF L ++A K+ + Sbjct: 245 DIAPNGRTRYQYNQTLQAVKNHKYCPTLENLGEADLSAHVDFYALKTVAKNSKINVIDTI 304 Query: 297 TQGKFLEGLGIWQRAFSLMKQTARKDILLDSVK--------------------------- 329 +Q FL GI R +L + + + + + Sbjct: 305 SQRDFLIENGILLRKQTLQDKLNDRHLAKFAYREEFKGDTKRSTAAYTLVREDASIGSTY 364 Query: 330 -------------------RLVSTSADKKSMGELFKILVV 350 R V K MGELFK+L + Sbjct: 365 KLPLEVEFGKMYEQAQIIERQVERLISPKQMGELFKVLQI 404 >gi|294666175|ref|ZP_06731430.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 10535] gi|292604040|gb|EFF47436.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 10535] Length = 394 Score = 227 bits (578), Expect = 2e-57, Method: Composition-based stats. Identities = 85/385 (22%), Positives = 139/385 (36%), Gaps = 37/385 (9%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIF 59 ++L + I+ G + ++ L + P GYYS FG GDFVTAPE+ +F Sbjct: 16 SDRLAAHVRAEIQAAGGAIPFSRFMELALYAPALGYYSAGASKFGEAGDFVTAPELGPLF 75 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 ++ L +Q G R++E+G G G + +L ++E S Sbjct: 76 AATVSGALAPVLQQLG--PQARVLEVGGGSGAFAGV---TLKRLLELDALPERYAILEPS 130 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPLGFT------FLVANEFFDSLPIKQFVMTEHG 173 L Q+++L L + L ANE D+LP +F + + Sbjct: 131 ADLRERQRERLGRSLIPP--VFDLVEWLDAPFPDDWDGVLFANEVIDALPTPRFALRDGQ 188 Query: 174 IRERMIDIDQHDSLVFNIGDHEIKSNF---------LTCSDYFLGAIFENSPCRDREMQS 224 + E + D F G+ + + G E P +Q+ Sbjct: 189 VYE--ETVVLDDQQHFARGEQPADALLSAAVRHLERYLEQPFADGYRSELLPQLPYWIQA 246 Query: 225 ISDRLACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQ 279 ++ L + Y Y R TL+A H PG DL++ VDF Sbjct: 247 VAGGLKRGAMLFVDYGYPRGEFYRAQREDGTLRAFYRHRMHEDLYRWPGLQDLTASVDFT 306 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR--KDILLDSVKRLVSTSAD 337 L+ + G TQ FL G G+ +T + L + +KRL Sbjct: 307 ALAEAGTGAGFELAGYCTQASFLLGNGLDALLAQADTRTDEVGRARLREQIKRLTL---- 362 Query: 338 KKSMGELFKILVVSHEKVELMPFVN 362 MGE F+++ S + F++ Sbjct: 363 PSEMGERFQVMGFSRDVDFAPAFLS 387 >gi|9106480|gb|AAF84267.1|AE003976_1 conserved hypothetical protein [Xylella fastidiosa 9a5c] Length = 398 Score = 226 bits (577), Expect = 3e-57, Method: Composition-based stats. Identities = 85/373 (22%), Positives = 137/373 (36%), Gaps = 31/373 (8%) Query: 2 ENKLIRKIVNL-IKKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIF 59 +L I I+ G + ++ L + P GYYS FG GDF+TAPE+ +F Sbjct: 20 SEQLAAYIRQQMIQSGGAIPFSRFMELALYAPGLGYYSAGASKFGEAGDFITAPELGSLF 79 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 +A L +Q G +C ++ELG G G + + +L ++E S Sbjct: 80 ATTVANALAPVLQQLGALAC--VLELGGGSGAFAEML---LKRLMELHRLPQRYAILEPS 134 Query: 120 ERLTLIQK----KQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIR 175 L Q+ + L + + + ANE D+LP +F+M + + Sbjct: 135 AELRQRQQLHLKRTLPPSLFALVEWVDAPFSEQWDGVVFANEVIDALPASRFIMRDREVY 194 Query: 176 ERMIDIDQHDSLVFNIGDHEI-------KSNFLTCSDYFLGAIFENSPCRDREMQSISDR 228 E + +D V + + + G E P +Q+++ Sbjct: 195 EATVVLDAQQRFVSAQHPADALLQQAVRHIERDLSARFADGYCSEVLPQLPYWVQAVAGG 254 Query: 229 LACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYVSPL-VNPGQADLSSHVDFQRLSS 283 L I Y Y R TL+A H PG D+++ VDF L+ Sbjct: 255 LKRGVLLFIDYGYPRTEYYRSERDTGTLRAFYRHRVHDDWYRWPGLQDVTASVDFTALAE 314 Query: 284 IAILYKLYINGLTTQGKFLEGLGIWQRAFSLMK---QTARKDILLDSVKRLVSTSADKKS 340 + G TQ FL G+ R + + K L + VKRL Sbjct: 315 AGTAAGFDMAGYCTQASFLLSHGL-DRLLAHAEEGVDEVAKLWLRNQVKRLTL----PTE 369 Query: 341 MGELFKILVVSHE 353 MGE F+++ + E Sbjct: 370 MGERFQVMGFARE 382 >gi|77747547|ref|NP_298747.2| hypothetical protein XF1458 [Xylella fastidiosa 9a5c] Length = 394 Score = 226 bits (577), Expect = 3e-57, Method: Composition-based stats. Identities = 85/373 (22%), Positives = 137/373 (36%), Gaps = 31/373 (8%) Query: 2 ENKLIRKIVNL-IKKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIF 59 +L I I+ G + ++ L + P GYYS FG GDF+TAPE+ +F Sbjct: 16 SEQLAAYIRQQMIQSGGAIPFSRFMELALYAPGLGYYSAGASKFGEAGDFITAPELGSLF 75 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 +A L +Q G +C ++ELG G G + + +L ++E S Sbjct: 76 ATTVANALAPVLQQLGALAC--VLELGGGSGAFAEML---LKRLMELHRLPQRYAILEPS 130 Query: 120 ERLTLIQK----KQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIR 175 L Q+ + L + + + ANE D+LP +F+M + + Sbjct: 131 AELRQRQQLHLKRTLPPSLFALVEWVDAPFSEQWDGVVFANEVIDALPASRFIMRDREVY 190 Query: 176 ERMIDIDQHDSLVFNIGDHEI-------KSNFLTCSDYFLGAIFENSPCRDREMQSISDR 228 E + +D V + + + G E P +Q+++ Sbjct: 191 EATVVLDAQQRFVSAQHPADALLQQAVRHIERDLSARFADGYCSEVLPQLPYWVQAVAGG 250 Query: 229 LACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYVSPL-VNPGQADLSSHVDFQRLSS 283 L I Y Y R TL+A H PG D+++ VDF L+ Sbjct: 251 LKRGVLLFIDYGYPRTEYYRSERDTGTLRAFYRHRVHDDWYRWPGLQDVTASVDFTALAE 310 Query: 284 IAILYKLYINGLTTQGKFLEGLGIWQRAFSLMK---QTARKDILLDSVKRLVSTSADKKS 340 + G TQ FL G+ R + + K L + VKRL Sbjct: 311 AGTAAGFDMAGYCTQASFLLSHGL-DRLLAHAEEGVDEVAKLWLRNQVKRLTL----PTE 365 Query: 341 MGELFKILVVSHE 353 MGE F+++ + E Sbjct: 366 MGERFQVMGFARE 378 >gi|188590994|ref|YP_001795594.1| hypothetical protein RALTA_A0199 [Cupriavidus taiwanensis LMG 19424] gi|170937888|emb|CAP62872.1| conserved hypothetical protein, DUF185; putative exported protein [Cupriavidus taiwanensis LMG 19424] Length = 403 Score = 226 bits (577), Expect = 3e-57, Method: Composition-based stats. Identities = 91/390 (23%), Positives = 156/390 (40%), Gaps = 44/390 (11%) Query: 2 ENKLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAV----GDFVTAPEI 55 + L +I I G + D+Y AL + P GYYS FG DF+TAPE+ Sbjct: 18 SDTLTARIGESIDAAGGWIGFDRYMALALYAPGLGYYSGGSAKFGRDARDGSDFITAPEL 77 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S F LA G P RL+E G G G + D+ + L+ + + + Sbjct: 78 SPFFARTLARQFAPLL-AQGLP---RLLEFGAGTGRLAADL---LLGLEQEGQLPDTYAI 130 Query: 116 VETSERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 VE S L Q+ L +++ W +L G +V NE D++P++ + Sbjct: 131 VELSGELRARQQDTLARRAPHLAERVTWLDTLPAAFEG--VIVGNEVLDAMPVQLYARRG 188 Query: 172 HGIRERMIDI----DQHDSLVFNIGDHEIKSNFLTCS----DYFLGAIFENSPCRDREMQ 223 ER + + + F D + + + + + E + + Sbjct: 189 GSWHERGVARAAVPPEGGAPAFRFEDRALAAADVPEALRAIPGEHDLVTETHAEAEGFTR 248 Query: 224 SISDRLACDGGTAIVIDYG----YLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHVDF 278 ++ LA I + Y R G T + + H + P + PG D+++HV+F Sbjct: 249 AVGAMLARGAAFFIDYGFPGGEYYHPQRAGGTLMCHYRHHAHPDPFLYPGLQDITAHVNF 308 Query: 279 QRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARK-DILLDSVKRLVSTSAD 337 ++ A+ L + G +Q +FL GI +L AR ++V++L+S Sbjct: 309 SGIAHAAVESGLTVAGFASQARFLMNAGITDLLMALDPSDARTFLPQANAVQKLLS---- 364 Query: 338 KKSMGELFKILVVSH-------EKVELMPF 360 + MGELFK++ ++ + L F Sbjct: 365 EAEMGELFKVIALTRGFDRGLEDSEPLAGF 394 >gi|73539930|ref|YP_294450.1| hypothetical protein Reut_A0224 [Ralstonia eutropha JMP134] gi|72117343|gb|AAZ59606.1| Protein of unknown function DUF185 [Ralstonia eutropha JMP134] Length = 400 Score = 226 bits (577), Expect = 3e-57, Method: Composition-based stats. Identities = 90/381 (23%), Positives = 153/381 (40%), Gaps = 41/381 (10%) Query: 8 KIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAV----GDFVTAPEISQIFGE 61 I I G + D+Y +L + P GYYS FG DF+TAPE+S F Sbjct: 24 LIGETIDAAGGWIGFDRYMSLALYAPGLGYYSGGAAKFGRDVRDGSDFITAPELSPFFAR 83 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 LA G P R++E G G G + D+L + + + + +VE S Sbjct: 84 TLARQFAPLL-AQGLP---RMLEFGAGTGRLAADLLLALEQ---EGQLPDTYGIVELSGE 136 Query: 122 LTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 L Q+ L ++ W +L + G +V NE D++P++ F T ER Sbjct: 137 LRARQQDTLAQRAPQLAPRVQWLDTLPEHFEG--IVVGNEVLDAMPVRLFARTAGRWHER 194 Query: 178 MIDIDQHD-----SLVFNIGDHEIKSNFLT----CSDYFLGAIFENSPCRDREMQSISDR 228 + + F D ++ + + + E D +++ Sbjct: 195 GVARSVAGDDSTAAHAFTFEDRQLPAQAIPEVLHAIPGDHDIVTETHAEADGFARAVGAM 254 Query: 229 LACDGGTAIVIDYG----YLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHVDFQRLSS 283 LA I + Y R G T + + H++ P PG D+++HV+F ++ Sbjct: 255 LARGAAFFIDYGFPAGEYYHPQRTGGTLMCHYRHHSHPDPFFYPGLQDITAHVNFSSIAH 314 Query: 284 IAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARK-DILLDSVKRLVSTSADKKSMG 342 A+ L ++G +Q +FL GI +L R ++V++L+S + MG Sbjct: 315 AAVDAGLSVSGFASQARFLMNAGITDLLMTLDPSDTRAFLPAANAVQKLLS----EAEMG 370 Query: 343 ELFKILVVSH---EKVELMPF 360 ELFK++ ++ + L F Sbjct: 371 ELFKVVALTRGLDDSEPLAGF 391 >gi|94969782|ref|YP_591830.1| hypothetical protein Acid345_2755 [Candidatus Koribacter versatilis Ellin345] gi|94551832|gb|ABF41756.1| protein of unknown function DUF185 [Candidatus Koribacter versatilis Ellin345] Length = 370 Score = 226 bits (577), Expect = 3e-57, Method: Composition-based stats. Identities = 97/365 (26%), Positives = 156/365 (42%), Gaps = 26/365 (7%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEM 62 L I+++I+++G + +Y LC+ PE GYYS FG GDF T+ ++ +FG + Sbjct: 2 SLREVIIDIIRRDGPIPFSRYMELCLYHPELGYYSRPREKFGKAGDFYTSSDVHAVFGRL 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L W G P + LVELGPGRG+ D+L K P+F L ++VE+S L Sbjct: 62 LCRQFEEMWRLLGSPGQMDLVELGPGRGLFGQDVLDWAGKKFPEFAKALRYWLVESSPSL 121 Query: 123 TLIQKKQLAS--YGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 +++ A + A + NEFFD++P++ + Sbjct: 122 RARLRERFAGDSRVSVYEGLEAAASNCGDSLVMFGNEFFDAIPVELLSRAGE------LY 175 Query: 181 IDQHDSLVFNIGDHEIKSNFLTCSDYFLG----AIFENSPCRDREMQSISDRLACDGGTA 236 I + + + DY + E + M+ I+ A G A Sbjct: 176 IAEDKGHFIDRWVQPPHDHVQYLRDYSVPPESRGRVEVAMQSQEWMEKIAHAFAERRGFA 235 Query: 237 IVIDYGYLQS-----RVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKL 290 + IDYGY + R DTL + H +P PG+ D+++HV+F L +A + Sbjct: 236 LFIDYGYTREQQLAGRHLDTLMTFREHQASANPYEAPGEQDITTHVNFTALQGVAEKNGM 295 Query: 291 YINGLTTQGKFLEGLGIWQRAFSLMKQ---TARKDILLDSVKRLVSTSADKKSMGELFKI 347 GL TQ +FL G+G + + + +K L+S + MGE F Sbjct: 296 TSLGLVTQSQFLLGVGQQTEFADAFESCVLPQERAKVAMQLKHLIS----PEEMGERFHA 351 Query: 348 LVVSH 352 LV++ Sbjct: 352 LVLAR 356 >gi|253995665|ref|YP_003047729.1| hypothetical protein Mmol_0292 [Methylotenera mobilis JLW8] gi|253982344|gb|ACT47202.1| protein of unknown function DUF185 [Methylotenera mobilis JLW8] Length = 387 Score = 226 bits (577), Expect = 3e-57, Method: Composition-based stats. Identities = 94/367 (25%), Positives = 153/367 (41%), Gaps = 34/367 (9%) Query: 1 MENKLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQI 58 + +L I + I +NG ++ Y + + P GYYS FG GDFVTAPEIS + Sbjct: 14 LSQQLALLIQDKISQNGGWLSFADYMHMALYTPGLGYYSGGAKKFGMGGDFVTAPEISPL 73 Query: 59 FGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVET 118 F + +A + Q ++ELG G G + +D+L + L +++E Sbjct: 74 FAQAMANQVAEVLVQT----QGDVLELGAGSGRLAVDLLLALQALNQVPS---HYFILEV 126 Query: 119 SERLTLIQKKQLASYGD-----KINWYTSLADVPLGFTFLVANEFFDSLPIKQFVM---- 169 S L +Q++ + + + W SL D +G ++ NE D+LP+ Sbjct: 127 STYLRQVQRETIQQHLPVALAECVVWLGSLPDNFVG--VMLGNEVLDALPVHLLYKPAAT 184 Query: 170 TEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRL 229 + ER + + + D + E SP + S++D L Sbjct: 185 EAPALCERGVAFNGEFYWQDQPLPAGNLLDLAATYDLPDDYLTEVSPAATGLIASLADAL 244 Query: 230 ACDGGTAIVIDY------GYLQSRVGDTLQAVKGH-TYVSPLVNPGQADLSSHVDFQRLS 282 G I++DY Y R TL H +V PLV G D+++HVDF ++ Sbjct: 245 --KHGAIIMVDYGFSAREYYHPQRNLGTLMCHYQHYAHVDPLVYVGLQDITAHVDFSSVA 302 Query: 283 SIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMK-QTARKDILLDSVKRLVSTSADKKSM 341 S L + G +Q +FL GI + AR L + ++L+S M Sbjct: 303 SAGEHSGLAVMGFCSQAQFLMNCGILDIMSQISPHDMARYAPLAAAAQKLLS----PAEM 358 Query: 342 GELFKIL 348 G+LFK++ Sbjct: 359 GDLFKVI 365 >gi|225554781|gb|EEH03076.1| DUF185 domain-containing protein [Ajellomyces capsulatus G186AR] Length = 543 Score = 226 bits (577), Expect = 3e-57, Method: Composition-based stats. Identities = 117/454 (25%), Positives = 180/454 (39%), Gaps = 104/454 (22%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNP-------FGAVGDFVTAPE 54 L + I I G +++ Y C+ P+ GYY++ FGA GDFVT+PE Sbjct: 48 STPLAKSIAEAINVTGPVSIAAYMRQCLTLPDGGYYTSRGQEDEDTALFGAKGDFVTSPE 107 Query: 55 ISQIFGEMLAIFLICAWEQHGFPS-CVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSI 113 ISQIFGE+L ++ + W G S V+++E GPG+G +M D+LR K ++ ++ Sbjct: 108 ISQIFGELLGVWTVTEWMGQGRKSGGVQVIEFGPGKGTLMGDMLRSFRNFKSFASAIEAV 167 Query: 114 YMVETSERLTLIQKKQLASYGD--------------------KINWYTSLADVPLGFTFL 153 Y+VETS L +Q+K L L + F+ Sbjct: 168 YLVETSPVLREVQRKLLCGDTPMEEVEVGYKSTSIHLGVPVIWTEHIKLLPNESDKTPFI 227 Query: 154 VANEFFDSLPIKQF--------------------------VMTEHGIRERMIDIDQHDSL 187 A+EFFD+LPI F + + H R + + + Sbjct: 228 FAHEFFDALPIHAFQSIQTPAPSQTTINTPTGPTTLHQPPISSPHTTEWRELVVSPNPET 287 Query: 188 VFNIGDHEIKSNFLTCS-------------------DYFLGAIFENSPCRDREMQSISDR 228 E + G+ E SP +Q I+ R Sbjct: 288 PEVKSGQEPEFRLSLAKASTPSSLVLPEMSSRYKALKSTPGSTIEISPESQACVQDIARR 347 Query: 229 LAC---------------------DGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNP 267 + G A+++DYG + ++L+ ++ H VSPLV P Sbjct: 348 IGGGGGGLVSAPSPGVTDTLKNKVPSGAALILDYGTTSTIPINSLRGIRKHRLVSPLVAP 407 Query: 268 GQADLSSHVDFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLMK------QTA 319 G+ D+S+ VDF L+ AI + + G QG FLE LGI +RA L++ Sbjct: 408 GEVDISADVDFTALAEAAIDASPGVEVYGPMEQGPFLEALGISERAAQLLRRMEGEGDEE 467 Query: 320 RKDILLDSVKRLVSTSADKKSMGELFKILVVSHE 353 ++ + KRLV MG+L+K L V E Sbjct: 468 KRKRIESGWKRLVERGG--GGMGKLYKALAVVPE 499 >gi|256016559|emb|CAR63575.1| hypothetical protein [Angiostrongylus cantonensis] Length = 445 Score = 226 bits (577), Expect = 3e-57, Method: Composition-based stats. Identities = 111/388 (28%), Positives = 175/388 (45%), Gaps = 42/388 (10%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYS----TCNPFGAVGDFVTAPEISQIF 59 L R I++ I+ G +TV +Y V+ P GYY + FG GDFVTAPE++Q+F Sbjct: 40 ALKRFIIDKIRATGPITVAEYMKTVVSAPRIGYYGGFSESRKIFGKEGDFVTAPELTQLF 99 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 GE++ ++ G LVE GPG G +M DI + + +S+++VE S Sbjct: 100 GELVGVWCYYELANTGHHGPWLLVECGPGTGQLMSDI---LRVMVNFQEKNVSVHLVECS 156 Query: 120 ERLTLIQKKQLASYGDK--------------------------INWYTSLADVPLGFTFL 153 + L Q++ L I WY ++ D+P F+ Sbjct: 157 DALIEQQERLLCGRCGFLPSSESQKSDDSASYVKKSVSKSGVPIYWYKTIDDIPDQFSVF 216 Query: 154 VANEFFDSLPIKQFVMTEHGIR-ERMIDIDQHDSLVFNIGDHEIKSN--FLTCSDYFLGA 210 V+NEF DSLP+ QF +G E ++ID+ + L F E + + + Sbjct: 217 VSNEFLDSLPVHQFSRDSNGTWNEVYVNIDKANELCFMRSRGENLHTRGLIPVNIRYDDQ 276 Query: 211 IF--ENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPG 268 E SP + ++R+ C+GG I+IDYG+ SR + + K H V PL PG Sbjct: 277 RIHWECSPEAGTFINQTTNRIICNGGFGIIIDYGHDGSRNDLSFRGYKKHEQVHPLSQPG 336 Query: 269 QADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSV 328 DL++ V+F L S + + + G TQ +FL +GI R L+K ++ + Sbjct: 337 AIDLTADVNFGYLKS-LVADRAAVYGPNTQREFLGQMGIELRLRKLLKSCNEREKQESLI 395 Query: 329 KRLVSTSADKKSMGELFKILVVSHEKVE 356 K S + MGE F + + + + Sbjct: 396 K---SYNFLMGDMGERFLTISIFPKTLS 420 >gi|171056795|ref|YP_001789144.1| hypothetical protein Lcho_0104 [Leptothrix cholodnii SP-6] gi|170774240|gb|ACB32379.1| protein of unknown function DUF185 [Leptothrix cholodnii SP-6] Length = 404 Score = 226 bits (577), Expect = 3e-57, Method: Composition-based stats. Identities = 91/380 (23%), Positives = 158/380 (41%), Gaps = 46/380 (12%) Query: 4 KLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTCNP-FG----AVGDFVTAPEISQ 57 L+ +I I+ G ++ ++Y AL + P GYYS + FG + DFVTAPE+S Sbjct: 39 ALLVRIAAAIEDAGGWISFERYMALALYTPGLGYYSRGDRQFGLMPASGSDFVTAPELSP 98 Query: 58 IFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVE 117 +FG LA + A + G + E G G G + +L + + +V+ Sbjct: 99 LFGRALARQVAQALQATG---TQEVWEFGAGSGALAAQLLGELGD------RITRYTIVD 149 Query: 118 TSERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHG 173 S L Q++++ + K+ W L + G +VANE D++P+ H Sbjct: 150 LSGTLRARQRERIEAAHPALAHKVRWLAELPERFEG--VVVANELLDAMPVTLLHWDGHH 207 Query: 174 IRERMIDIDQH----DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRL 229 +R + ++ + DH + +GA+ E + ++++RL Sbjct: 208 WHDRGVALEPGSAAAGAPRLRFADHPTHLAPPVDQHWPVGAVVELPRIAVAYILTLAERL 267 Query: 230 ACDGGTAIVIDYG----YLQSRVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSI 284 A I + Y R TL + H PL +PG D+++HVDF ++ Sbjct: 268 ARGAAFFIDYGFPEHEFYHPQRSAGTLMCHRAHRADPDPLSDPGDKDITAHVDFTAIAVA 327 Query: 285 AILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDS--VKRLVSTSADKKSMG 342 A + + G T+Q +FL G+ L + ++L++ + MG Sbjct: 328 AQDAGMGVLGYTSQARFLMNCGLIGDLE--------HADLRERAMAQKLIT----EHEMG 375 Query: 343 ELFKILVVSHEKVEL--MPF 360 ELFK+L V+ E + F Sbjct: 376 ELFKVLGVASAGAEFDAIGF 395 >gi|290984613|ref|XP_002675021.1| hypothetical protein NAEGRDRAFT_80425 [Naegleria gruberi] gi|284088615|gb|EFC42277.1| hypothetical protein NAEGRDRAFT_80425 [Naegleria gruberi] Length = 442 Score = 226 bits (577), Expect = 3e-57, Method: Composition-based stats. Identities = 109/394 (27%), Positives = 178/394 (45%), Gaps = 43/394 (10%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYY----------STCNPFGAVGDFVTAPE 54 +++ + N IK G ++V + + +P +GYY S N G GDFVT+PE Sbjct: 1 MVKHLKNKIKGAGPISVSTFIQETLLNPIYGYYYTAKKTSLDNSKQNVIGREGDFVTSPE 60 Query: 55 ISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICK------LKPDFF 108 IS +F EM+ ++ + W + G P + LVELGPG+G +M D+L + + + Sbjct: 61 ISSVFSEMIGLWCVDMWTKLGKPKQIELVELGPGKGTLMHDLLDSLVQSKNSPTIDAFRQ 120 Query: 109 SVLSIYMVETSERLTLIQKKQLASY-----GDKINWYTSLADVP-LGFTFLVANEFFDSL 162 SV + M E SE L +QK +L S+ + I+ + +D ++A+EFFD+L Sbjct: 121 SVKKVTMCEASEALKEVQKDKLKSFTETHEFNWIDRFDKYSDFDSNMPVLIIAHEFFDAL 180 Query: 163 PIKQFVMTEHGIRERMIDIDQ--HDSLVFNIGDHEIKSNFLTCSDYFLGA-----IFENS 215 P+ F TE G E ++DID F + + G I E Sbjct: 181 PVYHFEYTERGWMEVLVDIDDAKDSPHHFKFVLSPGPTMATAFVNLVEGKKDVGSIREVC 240 Query: 216 PCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVS-PLVNPGQADLSS 274 ++ I + L G +++IDYG +G TLQ + H + PL PG+ DLS+ Sbjct: 241 AMGIGYVEKIGEILNNRKGGSLIIDYGNDYP-MGFTLQGIYQHRFTETPLEKPGEVDLST 299 Query: 275 HVDFQRLSSIAILYK-LYINGLTTQGKFLEGLGIWQRAFSLMKQ----TARKDILLDSVK 329 VDF L +K + + G Q FL +G+ R L++ + ++ + + Sbjct: 300 FVDFSSLRKAVEKFKNVKVYGPQYQADFLHQMGMDARFAKLLQNPKLSPEQVTSMITAYE 359 Query: 330 RLVSTSADKKSMGELFKILVVSH---EKVELMPF 360 RL MG +K + ++H K+ F Sbjct: 360 RLTH----PTEMGHHYKAMALAHLPSNKLTATGF 389 >gi|294627641|ref|ZP_06706223.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 11122] gi|292597993|gb|EFF42148.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 11122] Length = 394 Score = 226 bits (576), Expect = 4e-57, Method: Composition-based stats. Identities = 82/385 (21%), Positives = 138/385 (35%), Gaps = 37/385 (9%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIF 59 ++L + I+ G + ++ L + P GYYS FG GDFVTAPE+ +F Sbjct: 16 SDRLAAHVRAEIQAAGGAIPFSRFMELALYAPALGYYSAGASKFGEAGDFVTAPELGPLF 75 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 ++ L +Q G R++E+G G G + +L ++E S Sbjct: 76 AATVSGALAPVLQQLG--PQARVLEVGGGSGAFAEV---TLKRLLELDALPERYAILEPS 130 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPLGFT------FLVANEFFDSLPIKQFVMTEHG 173 L Q+++L L + L ANE D+LP +F + + Sbjct: 131 ADLRERQRERLGRSLIPP--VFDLVEWLDAPFPDDWDGVLFANEVIDALPTPRFALRDGQ 188 Query: 174 IRERMIDIDQHDSLVFNIGDHEIKSNF---------LTCSDYFLGAIFENSPCRDREMQS 224 + E + D F G+ + + G E P +Q+ Sbjct: 189 VYE--ETVVLDDQQHFARGEQPADALLSAAVRHLERYLEQPFADGYRSELLPQLPYWIQA 246 Query: 225 ISDRLACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQ 279 ++ L + Y Y R TL+A H PG DL++ VDF Sbjct: 247 VAGGLKRGAMLFVDYGYPRGEFYRAQREDGTLRAFYRHRMHEDLYRWPGLQDLTASVDFT 306 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDIL--LDSVKRLVSTSAD 337 L+ + G TQ FL G G+ +L+ Q + ++ + Sbjct: 307 ALAEAGTGAGFELAGYCTQASFLLGNGLD----ALLAQADTRTDEVGRMRLREQIKRLTL 362 Query: 338 KKSMGELFKILVVSHEKVELMPFVN 362 MGE F+++ S + F++ Sbjct: 363 PSEMGERFQVMGFSRDVDFAPAFLS 387 >gi|33593985|ref|NP_881629.1| hypothetical protein BP3058 [Bordetella pertussis Tohama I] gi|33564059|emb|CAE43327.1| conserved hypothetical protein [Bordetella pertussis Tohama I] gi|332383402|gb|AEE68249.1| hypothetical protein BPTD_3022 [Bordetella pertussis CS] Length = 419 Score = 226 bits (576), Expect = 4e-57, Method: Composition-based stats. Identities = 93/378 (24%), Positives = 157/378 (41%), Gaps = 39/378 (10%) Query: 7 RKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTCN-----------PFGAVGDFVTAPE 54 R + I G + Q+ + + P GYY+ P GDFVTAPE Sbjct: 48 RHLRAAIAAAGGWLPFSQWMSAALYAPGLGYYTAGATKLASPADAQGPALPAGDFVTAPE 107 Query: 55 ISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIY 114 ++ +F +A + ++E G G G + +LR + Sbjct: 108 LTPLFAATVARQVAQVLRAT---DTASVLEFGAGTGALAEGVLRALAG----MDCPARYL 160 Query: 115 MVETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEH-G 173 +VE S L Q+ +LA +GD++ W L D G ++ NE D++P F +E Sbjct: 161 IVEVSADLRQRQQSRLAPFGDRVQWLDQLPDAFAGC--VLGNEVLDAMPATLFRWSETGT 218 Query: 174 IRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFL--GAIFENSPCRDREMQSISDRLAC 231 ++ER + +D + + D + G + E + + ++ + L Sbjct: 219 VQERGVTVDADGAFAWQDRDADAPLAQAVAQRMPPLPGYVSEINLQAEAWVRGMG--LWL 276 Query: 232 DGGTAIVIDY------GYLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSI 284 G A+++DY Y R G T + ++ H + P V PG D++SHVDF ++ Sbjct: 277 QRGAALLLDYGFPRSEYYHPQRAGGTLMCHLRHHAHADPFVAPGLQDITSHVDFTAMADA 336 Query: 285 AILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDS-VKRLVSTSADKKSMGE 343 A+ L + G +Q +FL G+ L AR + V++L+S + MGE Sbjct: 337 ALAGGLQVLGYLSQARFLVNAGLLDALSQLDPADARAYAQAVAPVQKLLS----EAEMGE 392 Query: 344 LFKILVVSHE-KVELMPF 360 LFK+L V + L+ F Sbjct: 393 LFKVLAVGRDMPEPLLGF 410 >gi|83775006|dbj|BAE65129.1| unnamed protein product [Aspergillus oryzae] Length = 535 Score = 226 bits (576), Expect = 4e-57, Method: Composition-based stats. Identities = 109/484 (22%), Positives = 177/484 (36%), Gaps = 133/484 (27%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-----NPFGAVGDFVTAPEIS 56 L R + + IK G + + + + PE GYY+T FG GDFVT+PEIS Sbjct: 39 STPLARTLADAIKVTGPIPIAAFMRQVLTSPEGGYYTTRPAGDGEVFGKKGDFVTSPEIS 98 Query: 57 QIFGEMLAIFLICAWEQHGFPSC-VRLVELGPGRGIMMLDI------------------- 96 Q+FGE++ I+ I W G S V+L+E+GPG+G +M D+ Sbjct: 99 QVFGELVGIWTIAEWMAQGRKSSGVQLMEVGPGKGTLMDDMLRVSLPHLLFSPARRREVF 158 Query: 97 ---------LRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLAS--------------- 132 + K S+ +IY+VE S L +QK++L Sbjct: 159 DTILTTGVISQTFRNFKSFTSSIEAIYLVEASPTLREVQKQRLCGDATMEETEIGHTSTC 218 Query: 133 -----YGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE---------------- 171 + L F++A+EFFD+LPI F Sbjct: 219 KYFNVPVIWVEDIRLLPHEEDKSPFIIAHEFFDALPIHAFESVPPSPENQPPQSQDTIMT 278 Query: 172 ----------------HGIRERMIDIDQH-------DSLVFNIGDHEIKS---------- 198 RE M+ ++ + F + + + Sbjct: 279 PTGPTKLHKPLKPANTPQWRELMVTLNPKAIDENLPNEPEFKLTHAKASTPSSLVIPEIS 338 Query: 199 NFLTCSDYFLGAIFENSPCRDREMQSISDRL-----------------------ACDGGT 235 G+ E SP + R+ G Sbjct: 339 PRYRALKSQPGSTIEISPESRIYASDFARRIGGASQPPRTKARNASTQPAAPAKRVPSGA 398 Query: 236 AIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAIL--YKLYIN 293 A+++DYG + + ++L+ ++ H V PL PGQ D+S+ VDF L+ A+ + ++ Sbjct: 399 ALIMDYGTMDTIPVNSLRGIQHHRKVPPLSAPGQVDVSADVDFTALAEAALEGSEGVEVH 458 Query: 294 GLTTQGKFLEGLGIWQRAFSLMKQTARKDI---LLDSVKRLVSTSADKKSMGELFKILVV 350 G QG FL +GI +R L+K ++ L +RLV MG+++K + + Sbjct: 459 GPVEQGDFLRTMGIAERMQQLLKHEKDEEKRKTLESGWQRLVEKGG--GGMGKIYKFMAI 516 Query: 351 SHEK 354 E Sbjct: 517 VPEN 520 >gi|122879301|ref|YP_202570.6| hypothetical protein XOO3931 [Xanthomonas oryzae pv. oryzae KACC10331] gi|188575189|ref|YP_001912118.1| hypothetical protein PXO_04309 [Xanthomonas oryzae pv. oryzae PXO99A] gi|188519641|gb|ACD57586.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae PXO99A] Length = 394 Score = 226 bits (576), Expect = 4e-57, Method: Composition-based stats. Identities = 81/383 (21%), Positives = 139/383 (36%), Gaps = 33/383 (8%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIF 59 ++L + I+ G + ++ L + P GYYS FG GDFVTAPE+ +F Sbjct: 16 SDRLAAHVRAEIQAAGGAIPFSRFMELALYAPGLGYYSAGASKFGEAGDFVTAPEVGPLF 75 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 ++ L +Q G R++E+G G G +L+ + +L ++E S Sbjct: 76 AATVSGALAPVLQQLG--PQARVLEVGGGSGAFAEVMLKRLLELDALP---ERYAILEPS 130 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPLGFT------FLVANEFFDSLPIKQFVMTEHG 173 L Q+++L L + L ANE D+LP +F + + Sbjct: 131 ADLRERQRERLGRSLIPP--VFDLVEWLDAPFPDDWDGVLFANEVIDALPTPRFALRDGE 188 Query: 174 IRERMIDIDQHDSLVFNIGDHEI-------KSNFLTCSDYFLGAIFENSPCRDREMQSIS 226 + E + +D + + G E P +Q+++ Sbjct: 189 VYEETVVLDAQQQFARGEQPADALLSAAVRHLERYLEQPFADGYRSELLPQLPYWIQAVA 248 Query: 227 DRLACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRL 281 L + Y Y R TL+A H PG DL++ VDF L Sbjct: 249 GGLKRGAMLFVDYGYPRGEFYRAQREDGTLRAFYRHRMHEDLYRWPGLQDLTASVDFTAL 308 Query: 282 SSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDIL--LDSVKRLVSTSADKK 339 + + G TQ FL G G+ +L+ Q + ++ + Sbjct: 309 AEAGTGAGFELAGYCTQASFLLGNGLD----TLLAQADTRTDEVGRMRLREQIKRLTLPS 364 Query: 340 SMGELFKILVVSHEKVELMPFVN 362 MGE F+++ S + F++ Sbjct: 365 EMGERFQVMGFSRDVDFAPAFLS 387 >gi|58428148|gb|AAW77185.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC10331] Length = 439 Score = 226 bits (576), Expect = 4e-57, Method: Composition-based stats. Identities = 81/383 (21%), Positives = 139/383 (36%), Gaps = 33/383 (8%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIF 59 ++L + I+ G + ++ L + P GYYS FG GDFVTAPE+ +F Sbjct: 61 SDRLAAHVRAEIQAAGGAIPFSRFMELALYAPGLGYYSAGASKFGEAGDFVTAPEVGPLF 120 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 ++ L +Q G R++E+G G G +L+ + +L ++E S Sbjct: 121 AATVSGALAPVLQQLG--PQARVLEVGGGSGAFAEVMLKRLLELDALP---ERYAILEPS 175 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPLGFT------FLVANEFFDSLPIKQFVMTEHG 173 L Q+++L L + L ANE D+LP +F + + Sbjct: 176 ADLRERQRERLGRSLIPP--VFDLVEWLDAPFPDDWDGVLFANEVIDALPTPRFALRDGE 233 Query: 174 IRERMIDIDQHDSLVFNIGDHEI-------KSNFLTCSDYFLGAIFENSPCRDREMQSIS 226 + E + +D + + G E P +Q+++ Sbjct: 234 VYEETVVLDAQQQFARGEQPADALLSAAVRHLERYLEQPFADGYRSELLPQLPYWIQAVA 293 Query: 227 DRLACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRL 281 L + Y Y R TL+A H PG DL++ VDF L Sbjct: 294 GGLKRGAMLFVDYGYPRGEFYRAQREDGTLRAFYRHRMHEDLYRWPGLQDLTASVDFTAL 353 Query: 282 SSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDIL--LDSVKRLVSTSADKK 339 + + G TQ FL G G+ +L+ Q + ++ + Sbjct: 354 AEAGTGAGFELAGYCTQASFLLGNGLD----TLLAQADTRTDEVGRMRLREQIKRLTLPS 409 Query: 340 SMGELFKILVVSHEKVELMPFVN 362 MGE F+++ S + F++ Sbjct: 410 EMGERFQVMGFSRDVDFAPAFLS 432 >gi|84625366|ref|YP_452738.1| hypothetical protein XOO_3710 [Xanthomonas oryzae pv. oryzae MAFF 311018] gi|84369306|dbj|BAE70464.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF 311018] Length = 417 Score = 226 bits (576), Expect = 4e-57, Method: Composition-based stats. Identities = 81/383 (21%), Positives = 139/383 (36%), Gaps = 33/383 (8%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIF 59 ++L + I+ G + ++ L + P GYYS FG GDFVTAPE+ +F Sbjct: 39 SDRLAAHVRAEIQAAGGAIPFSRFMELALYAPGLGYYSAGASKFGEAGDFVTAPEVGPLF 98 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 ++ L +Q G R++E+G G G +L+ + +L ++E S Sbjct: 99 AATVSGALAPVLQQLG--PQARVLEVGGGSGAFAEVMLKRLLELDALP---ERYAILEPS 153 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPLGFT------FLVANEFFDSLPIKQFVMTEHG 173 L Q+++L L + L ANE D+LP +F + + Sbjct: 154 ADLRERQRERLGRSLIPP--VFDLVEWLDAPFPDDWDGVLFANEVIDALPTPRFALRDGE 211 Query: 174 IRERMIDIDQHDSLVFNIGDHEI-------KSNFLTCSDYFLGAIFENSPCRDREMQSIS 226 + E + +D + + G E P +Q+++ Sbjct: 212 VYEETVVLDAQQQFARGEQPADALLSAAVRHLERYLEQPFADGYRSELLPQLPYWIQAVA 271 Query: 227 DRLACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRL 281 L + Y Y R TL+A H PG DL++ VDF L Sbjct: 272 GGLKRGAMLFVDYGYPRGEFYRAQREDGTLRAFYRHRMHEDLYRWPGLQDLTASVDFTAL 331 Query: 282 SSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDIL--LDSVKRLVSTSADKK 339 + + G TQ FL G G+ +L+ Q + ++ + Sbjct: 332 AEAGTGAGFELAGYCTQASFLLGNGLD----TLLAQADTRTDEVGRMRLREQIKRLTLPS 387 Query: 340 SMGELFKILVVSHEKVELMPFVN 362 MGE F+++ S + F++ Sbjct: 388 EMGERFQVMGFSRDVDFAPAFLS 410 >gi|171464271|ref|YP_001798384.1| protein of unknown function DUF185 [Polynucleobacter necessarius subsp. necessarius STIR1] gi|171193809|gb|ACB44770.1| protein of unknown function DUF185 [Polynucleobacter necessarius subsp. necessarius STIR1] Length = 395 Score = 226 bits (575), Expect = 5e-57, Method: Composition-based stats. Identities = 89/373 (23%), Positives = 158/373 (42%), Gaps = 27/373 (7%) Query: 5 LIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGEM 62 L KI + I + G + +Y + + +P GYYS + GA GDF TAPE+S +FG Sbjct: 16 LKAKIASQIASQGGWLPFSRYMEMALYEPGMGYYSAGAHKLGAGGDFTTAPELSPLFGAT 75 Query: 63 LAIFLICAWEQHG-FPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 + L+ E +++E G G G + IL + L F + ++E S Sbjct: 76 ICSTLLPVLEGLKEKGLSTQILEFGAGTGKLATSILTRLNDLG---FVLDRYDIIEISPD 132 Query: 122 LTLIQKKQLASYGDKIN---WYTSLADVPLGFT-FLVANEFFDSLPIKQFVMTEHGIRER 177 L Q++++ + +++N L ++P F ++ANE D++P + Sbjct: 133 LAQRQQEKIKNTVEQLNLKTQCNWLTELPKNFNGVILANEVIDAIPCDVVIFKNGFWYWH 192 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTC--SDYFLGAIFENSPCRDREMQSISDRLACDGGT 235 + I+ + E K + S++ G + E + MQ ++ L D G Sbjct: 193 GVAIENDNLTWKAGSPVEQKLLPQSLLSSNFSEGYVTELHTPANAWMQQVAKHL--DAGL 250 Query: 236 AIVIDY------GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILY 288 + DY Y R+ TL + H P PG D+++HV++ +++ + Sbjct: 251 FLTFDYGFPENEYYHPQRLEGTLMAHHRHHAIQDPFHFPGLCDVTTHVEWSQIARSTLTE 310 Query: 289 KLYINGLTTQGKFLEGLGIWQRAFSLMK--QTARKDILLDSVKRLVSTSADKKSMGELFK 346 + LT Q +L GI A + + +S+++L+S + MGELFK Sbjct: 311 NVDDVYLTNQAAYLLDAGIGDIALEIGDPSDPETFLPISNSLQKLLS----EAEMGELFK 366 Query: 347 ILVVSHEKVELMP 359 + S L+P Sbjct: 367 VFTFSKSLDSLLP 379 >gi|220907759|ref|YP_002483070.1| hypothetical protein Cyan7425_2351 [Cyanothece sp. PCC 7425] gi|219864370|gb|ACL44709.1| protein of unknown function DUF185 [Cyanothece sp. PCC 7425] Length = 410 Score = 226 bits (575), Expect = 5e-57, Method: Composition-based stats. Identities = 97/381 (25%), Positives = 160/381 (41%), Gaps = 37/381 (9%) Query: 5 LIRKIVNLIKKN--GQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIFGE 61 L+ I I+ + ++ +Y L + + GYY++ G GDF T+P + F E Sbjct: 12 LLALISAEIQASPDQRIPFARYMDLVLYQSQQGYYASNAVKIGQGGDFFTSPHLGSDFAE 71 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 +L + W+ G PS LVE+G G+GI+ D+L+ + + PDFF+ L+ +VE + Sbjct: 72 LLGEQFLQMWQVMGQPSAFNLVEMGAGQGIIANDLLKYLQRQYPDFFASLNYVIVEKAAG 131 Query: 122 LTLIQKKQLASYGDKINW--YTSLADVPLGFT--FLVANEFFDSLPIKQFVMTEHGIRER 177 L QK L + + + L ++ G L +NE D+ P+ Q V+ + +++ Sbjct: 132 LIAEQKYHLKPWLETWGRLKWLGLEEIADGTIAGCLFSNELLDAFPVDQVVIQQGQLQQV 191 Query: 178 MIDIDQHD------SLVFNIGDHEIKSN-----------FLTCSDYFLGAIFENSPCRDR 220 + ++ F E L SDY G E + Sbjct: 192 FVGLNPDHPGEGQRQHPFQEILAEPTMAGLAEYFQWLGIQLPGSDYPEGYRTEVNLAALD 251 Query: 221 EMQSISDRLACDGGTAIVIDYGYLQS------RVGDTLQAVKGH-TYVSPLVNPGQADLS 273 + ++S +L G + IDYGY R TLQ H T +P N G DL+ Sbjct: 252 WIATVSRKLQR--GYVLTIDYGYPARQYYQPARREGTLQCYYHHATNTNPYFNVGHQDLT 309 Query: 274 SHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKR--L 331 +HV+F L L G T QG FL LG+ R + A + L ++R Sbjct: 310 AHVNFTALEQRGKECGLVNLGFTQQGLFLMSLGLGDRLVA-NNSGAFSNDLSTVIRRREA 368 Query: 332 VSTSADKKSMGELFKILVVSH 352 + + +G F +L+ + Sbjct: 369 LHQLMNPLGLGG-FGVLIQAK 388 >gi|308502085|ref|XP_003113227.1| hypothetical protein CRE_25213 [Caenorhabditis remanei] gi|308265528|gb|EFP09481.1| hypothetical protein CRE_25213 [Caenorhabditis remanei] Length = 430 Score = 226 bits (575), Expect = 5e-57, Method: Composition-based stats. Identities = 113/350 (32%), Positives = 168/350 (48%), Gaps = 30/350 (8%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYST----CNPFGAVGDFVTAPEISQI 58 N L + I + IK +G +TV +Y V+ P GYY FG GDF+T+PE+SQ+ Sbjct: 63 NHLKKFIADKIKTSGPITVAEYMKTSVSAPLVGYYGQFSDDQKVFGEKGDFITSPELSQL 122 Query: 59 FGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVET 118 FGEM+ +++ G +LVELGPGR +M D+L + K SV ++VET Sbjct: 123 FGEMIGVWVFHELANTGHKGSWQLVELGPGRAQLMNDVLNALSKFHDKDVSV---HLVET 179 Query: 119 SERLTLIQKKQLASYGD------------------KINWYTSLADVPLGFTFLVANEFFD 160 S+ L Q+K L Y + WY ++ D+P GFT +ANEF D Sbjct: 180 SDALIDEQEKALCIYKSNNTEDTPHVRKNKSRTGVNVYWYKAIDDIPDGFTVFIANEFLD 239 Query: 161 SLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGA----IFENSP 216 +LP+ QF T E I++ + D+L F E + +E SP Sbjct: 240 ALPVHQFQKTGDTWNEIYINLTKDDNLCFMKSKGENLNTKGLIPTAIRSDSKRVTWECSP 299 Query: 217 CRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHV 276 + I DR+ GG +++IDYG+ +R + +A K H V PL NPG DL++ V Sbjct: 300 ESGTVVNQIVDRITTFGGFSLLIDYGHDGTRNTHSFRAYKNHKQVDPLANPGMVDLTADV 359 Query: 277 DFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLD 326 DF L S + ++ + G Q +FL LGI R L++ ++ Sbjct: 360 DFGYL-SSLVKDRVLVYGAKEQREFLAQLGIEHRLRRLLQICKDREQQEQ 408 >gi|320169715|gb|EFW46614.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864] Length = 471 Score = 225 bits (574), Expect = 7e-57, Method: Composition-based stats. Identities = 114/378 (30%), Positives = 173/378 (45%), Gaps = 45/378 (11%) Query: 17 GQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 G+++ ++ + P GYY + FGA GDF T+PEIS +FGEM+ +L+ W+ Sbjct: 77 GKVSTAEFMRTVLTAPSGGYYMRTDVFGAGGDFTTSPEISPLFGEMIGAWLLNHWQVSNS 136 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQL------ 130 P V L+E GPGRG +M D+LR I + ++ VE S L +Q + L Sbjct: 137 PKKVNLIEFGPGRGTLMHDVLRKIFARF-KAVDAVHVHFVEKSPALLRVQAEMLGVPLGA 195 Query: 131 ------------ASYGDKINWYTSLAD-VPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 +G ++ W+ S+ F+ ++A+EFFD+LP F TE G RE Sbjct: 196 DLVQTEPVHGKSERFGIQVTWHQSVDTVPDDDFSLILAHEFFDALPALSFTRTERGWREV 255 Query: 178 MIDIDQHDSLVF-----------------NIGDHEIKSNFLTCSDYFLGAIFENSPCRDR 220 ++D+D+ F + + SN S SP Sbjct: 256 LVDLDEKGPYPFRLVTANAHTVASRSLLVDPNEAGSSSNLGRVSPPAHVRSLTLSPDSFI 315 Query: 221 EMQSISDRLACDGGTAIVIDYGYLQS-RVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 +++S R+ GG A++IDYGY TL+A +GH V G +DL+ VDF+ Sbjct: 316 LTENLSKRIINRGGAALIIDYGYATPAPKEMTLRAFRGHKEVHLFDKIGMSDLTVDVDFE 375 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLM---KQTARKDILLDSVKRLVSTSA 336 L++ A + Y T QG+FLE LGI QR L+ +Q L V+RL S Sbjct: 376 YLAAAAQAHGAYCTAATPQGEFLEALGIGQRLARLLGDPRQAEHHQTLKSGVERLTS--- 432 Query: 337 DKKSMGELFKILVVSHEK 354 MG+ FK + + + Sbjct: 433 -PTDMGQRFKAMAIVPQN 449 >gi|71909749|ref|YP_287336.1| hypothetical protein Daro_4140 [Dechloromonas aromatica RCB] gi|71849370|gb|AAZ48866.1| Protein of unknown function DUF185 [Dechloromonas aromatica RCB] Length = 386 Score = 225 bits (574), Expect = 7e-57, Method: Composition-based stats. Identities = 90/364 (24%), Positives = 151/364 (41%), Gaps = 33/364 (9%) Query: 17 GQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHG 75 G ++ ++ L + P GYY+ FGA GDFVT+PE++ +FG++L + + Sbjct: 31 GWVSFARFMELVLYAPGLGYYTAGARKFGAAGDFVTSPEMTPLFGQVLTRQVAQVMAES- 89 Query: 76 FPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQL----A 131 ++E+G G G + D+L + ++ ++++ S L Q++ + Sbjct: 90 ---APVVLEVGAGSGRLAADLLLALERMGELP---EHYFILDLSADLRQRQRQTIAEAAP 143 Query: 132 SYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNI 191 ++ W L + G +VANE D++P E+GI +R + +D+ S ++N Sbjct: 144 HLLSRVEWLDRLPETFSG--VVVANELLDAMPANIVAWRENGIFDRGVVVDEAGSFMWNE 201 Query: 192 GDH-----EIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG---- 242 G E S + RL G ++IDYG Sbjct: 202 RPATGTLLAAAEEIGAQCSLPPGFESEISLTVRAWLSEWGRRLEK--GALLLIDYGFPRR 259 Query: 243 --YLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQG 299 Y Q R T + + H + P PG D++ HVDF + + A L + G T QG Sbjct: 260 EFYHQQRGRGTLMCHYRHHAHPDPFYLPGLQDVTVHVDFTAVIAAAHAAGLDLLGYTNQG 319 Query: 300 KFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM- 358 +FL GI + + T +V L+ MGELFK++ V E + Sbjct: 320 QFLLNCGILDQLAEIPNGTPEYIRAAGAVNMLLM----PHEMGELFKVIAVGRGIDEPLC 375 Query: 359 PFVN 362 F N Sbjct: 376 GFAN 379 >gi|319778597|ref|YP_004129510.1| hypothetical protein TEQUI_0423 [Taylorella equigenitalis MCE9] gi|317108621|gb|ADU91367.1| Uncharacterized conserved protein [Taylorella equigenitalis MCE9] Length = 373 Score = 225 bits (574), Expect = 7e-57, Method: Composition-based stats. Identities = 102/371 (27%), Positives = 157/371 (42%), Gaps = 31/371 (8%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEISQIF 59 + KL + I + D++ + P GYY+ P FG+ GDFVTAPEIS F Sbjct: 14 ISQKLDKYIRQKFANRNVIEFDKWLNEVLYAPSLGYYNNALPIFGSKGDFVTAPEISHFF 73 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 G+ L + EQ ++E G G G M IL+ +F ++E S Sbjct: 74 GKCLGNQIRQILEQ---CDSKHILEFGAGSGAMAKQILKASTDAHIKYF------ILELS 124 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMI 179 L +Q+ L+ Y D+I W SL + G ++ANE DS+P K F ++ + Sbjct: 125 ADLRALQQNTLSEYADRIVWLDSLPEKFQGC--ILANEVLDSIPPKIFEFSDSEGHIEIG 182 Query: 180 DIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVI 239 F L G E + + ++S++ L G ++I Sbjct: 183 VRATEVGYEFAKVGIATHKEVLERIPNINGYRSEINFQAEAWIRSLAGLLT--NGGILLI 240 Query: 240 DY------GYLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYI 292 DY Y R T + K HT+ +PL+N G D++SHVDF ++ AI L + Sbjct: 241 DYGFARSEYYHPQRNEGTIMCHFKHHTHSNPLINIGIQDITSHVDFTAVADSAIDAGLEL 300 Query: 293 NGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 G TTQ FL G+ + + L S K ++T + MGELFK+++ S Sbjct: 301 WGYTTQASFLISCGLEHELL-------KTNDLSLSQKTGINTLISEAEMGELFKVMLFSK 353 Query: 353 E---KVELMPF 360 + + F Sbjct: 354 NLDWPEDPLGF 364 >gi|255950224|ref|XP_002565879.1| Pc22g19770 [Penicillium chrysogenum Wisconsin 54-1255] gi|211592896|emb|CAP99265.1| Pc22g19770 [Penicillium chrysogenum Wisconsin 54-1255] Length = 507 Score = 225 bits (573), Expect = 8e-57, Method: Composition-based stats. Identities = 114/457 (24%), Positives = 180/457 (39%), Gaps = 106/457 (23%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCN----PFGAVGDFVTAPEISQ 57 L + + N++K G + + + + P+ GYY+T FG GDFVT+PEISQ Sbjct: 38 STPLAKTLANVMKVTGPVPIAAFMRQVLTSPDGGYYTTRGEGGGVFGKHGDFVTSPEISQ 97 Query: 58 IFGEMLAIFLICAWEQHGFPSC-VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMV 116 +FGE++ I+ I W G V+L+E+GPG+G +M D+LR K SV +IY+V Sbjct: 98 VFGELIGIWTIAEWMAQGRARSGVQLMEVGPGKGTLMDDMLRTFRNFKSFSSSVEAIYLV 157 Query: 117 ETSERLTLIQKKQLASY---------------------GDKINWYTSLADVPLGFTFLVA 155 E S L +QK+ L + L F+ A Sbjct: 158 EASGTLREVQKRLLCGEEAVMEETDIGHRSVCKYFDVPIVWVEDIRLLPHEEGKTPFIFA 217 Query: 156 NEFFDSLPIKQFVMTE--------------------------------HGIRERMIDIDQ 183 +EFFD+LPI F RE M+ ++ Sbjct: 218 HEFFDALPIHAFESVPPSPENEQANAPRTIMTPTGPAELHNPPKPANTPQWRELMVTLNP 277 Query: 184 -------HDSLVFNIGDHEIKS----------NFLTCSDYFLGAIFENSPCRDREMQSIS 226 F + + + G+ E SP + Sbjct: 278 KAVEENIKGEPEFQLTKAKASTPSSLVIPEISQRYRALKSRPGSTIEISPESRIYAADFA 337 Query: 227 DRL------------------------ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVS 262 R+ G A+++DYG L + ++L+ +K H V+ Sbjct: 338 RRIGGDSASALAAKKSSASSPPPSSEKKTPSGAALIMDYGTLSTIPINSLRGIKSHEKVA 397 Query: 263 PLVNPGQADLSSHVDFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLMKQ--- 317 PL PG+ D+S+ VDF L+ AI + ++G QG FL +GI +R L+++ Sbjct: 398 PLSEPGRVDVSADVDFTSLAEAAIEGSDGVEVHGPVEQGDFLGAMGIEERMRQLLRKVPD 457 Query: 318 TARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 K L + KRLV D SMG+++K++ + E Sbjct: 458 EEHKKTLETAWKRLVEK--DGGSMGQIYKVMAIVPEN 492 >gi|120612860|ref|YP_972538.1| hypothetical protein Aave_4224 [Acidovorax citrulli AAC00-1] gi|120591324|gb|ABM34764.1| protein of unknown function DUF185 [Acidovorax citrulli AAC00-1] Length = 376 Score = 225 bits (573), Expect = 1e-56, Method: Composition-based stats. Identities = 91/379 (24%), Positives = 148/379 (39%), Gaps = 40/379 (10%) Query: 1 MENKLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTC-NPFGA----VGDFVTAPE 54 + + L R+I + I G D++ + P GYY+ FGA DFVTAPE Sbjct: 10 LSSALARRIADGIAEAGGWTGFDRFMEWALYTPGLGYYANALPKFGALPQSGSDFVTAPE 69 Query: 55 ISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIY 114 +S +FG+ LA + A + G + E G G G + +L + V Sbjct: 70 MSPVFGQALARQVREALDATG---THDIWEFGAGSGALAAQLLEALGD------RVRRYT 120 Query: 115 MVETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEH-- 172 +V+ S L Q+++LA +GD+++W +L + G +V NE D++P++ Sbjct: 121 IVDLSGSLRERQRERLAPWGDRVHWAQALPERIEG--VVVGNEVLDAMPVRLLARHGGAE 178 Query: 173 --GIRERMIDIDQHDS--LVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDR 228 ER + + D F D + + E + ++ + R Sbjct: 179 GGTWHERGVVVAPADGGVPRFAWEDRPTALRPPVEPEGPQDYVTEIHAQGEGFLRMLGQR 238 Query: 229 LACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSS 283 LA I + Y R TL + H PL + G D+++HV+F ++ Sbjct: 239 LARGAAFLIDYGFPEAEYYHPQRHMGTLVCHRAHQVDSDPLSDVGAKDITAHVNFTAMAV 298 Query: 284 IAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGE 343 A L + G TTQ FL G+ L + + K + MGE Sbjct: 299 AAQEAGLGVLGYTTQAHFLINCGLLSLLEPL-----PQAQRAQAAK-----LMMEHEMGE 348 Query: 344 LFKILVVSH--EKVELMPF 360 LFK+L V + F Sbjct: 349 LFKVLAVGAGVPAWTPVGF 367 >gi|237749222|ref|ZP_04579702.1| DUF185 domain-containing protein [Oxalobacter formigenes OXCC13] gi|229380584|gb|EEO30675.1| DUF185 domain-containing protein [Oxalobacter formigenes OXCC13] Length = 383 Score = 225 bits (573), Expect = 1e-56, Method: Composition-based stats. Identities = 95/377 (25%), Positives = 166/377 (44%), Gaps = 31/377 (8%) Query: 1 MENKLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQI 58 + L ++I+ I +K+G ++ Y + P GYYS G GDF TAPE++ + Sbjct: 13 LSASLEKRIMADIGEKSGWISFADYMQQVLYTPLLGYYSGSLVKLGEAGDFTTAPEMTDL 72 Query: 59 FGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVET 118 +G LA +I EQ G ++ELG G G + D+L + + ++E Sbjct: 73 YGRTLAQAMIPLLEQTGA----NILELGAGTGKLAFDVLTALAG---AGIRIGKYRILEL 125 Query: 119 SERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERM 178 S L Q+ L + D + W T L + G ++ANE D++P+ G E Sbjct: 126 SAELRQRQQVSLKGF-DNVEWLTVLPERFDG--VVLANEVLDAMPVHLVKKYGSGWYETG 182 Query: 179 IDIDQHDSLVFNIGDHE-----IKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDG 233 + + + + +I E I+ + + +G E ++++S+ LA Sbjct: 183 VSVHDNRLVFADIPCEESLVDTIRQSIPDHVNLPVGYQTEVHIHARGFIKTLSEMLAKSA 242 Query: 234 GTA-IVIDY------GYLQSRVGDTLQAVKGHTYVS-PLVNPGQADLSSHVDFQRLSSIA 285 A I+IDY Y R TL H P PG D+++HV+F ++ +A Sbjct: 243 CAAAILIDYGFPAHEYYHPDRSAGTLMCHFRHRGHDDPFFLPGLQDITAHVNFTGMAQMA 302 Query: 286 ILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQT-ARKDILLDSVKRLVSTSADKKSMGEL 344 + L + +Q FL G+ + ++ + L +V++L+S MGEL Sbjct: 303 SDHGLDVICYASQASFLLASGLPDLLSNGSAESGNSRAGQLQAVQKLLS----PAEMGEL 358 Query: 345 FKILVVSHEKVELMPFV 361 FK++V+ H+ +E F+ Sbjct: 359 FKVMVLGHK-IEPPAFM 374 >gi|78043067|ref|YP_359248.1| hypothetical protein CHY_0386 [Carboxydothermus hydrogenoformans Z-2901] gi|77995182|gb|ABB14081.1| conserved hypothetical protein [Carboxydothermus hydrogenoformans Z-2901] Length = 345 Score = 224 bits (572), Expect = 1e-56, Method: Composition-based stats. Identities = 84/359 (23%), Positives = 145/359 (40%), Gaps = 25/359 (6%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLA 64 L ++ IKK +T + L + P++GYY+ G GDF TAP +S+ FG L Sbjct: 2 LKEVLIEKIKKM-PLTFRDFMELALYHPDYGYYTRNVTLGKEGDFYTAPILSKSFGYTLG 60 Query: 65 IFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTL 124 ++ ++ +G P + G G+ + + L P+ +++E S L Sbjct: 61 KYIFNLYQTYGMPLTLLEFGAGTGKMAQDILVWFAAQGLFPE------YHILEISAHLRE 114 Query: 125 IQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQH 184 +Q+K L ++I G ++ANE D+ P+ + + + +E + +D + Sbjct: 115 VQRKNLECQQNQIKHLPEFPQNFSG--IIIANEVVDAFPVHRVIYQKGIFQEIYVGVDDN 172 Query: 185 DSLVFNIGDHEIKSNFLTCSDYF----LGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 L + G IF + + ++ I +L C I Sbjct: 173 GKFFTYPSRLSSPEIELYLKEANISPVEGQIFNVNLEAGKWLEEIYQKLNCGAFIIIDYG 232 Query: 241 YG----YLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLT 296 + Y +R TL + H PL NPG+ D+++HVDF L A Sbjct: 233 FDTAELYHPARSTGTLTSFSRHRQKDPLQNPGEQDITAHVDFGMLRKKAKSLGFKEELFL 292 Query: 297 TQGKFLEGLGIWQRAFS--LMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHE 353 +QG+FL GI + L + L +K+L+ MGE+FK+LV+ Sbjct: 293 SQGEFLMKAGIMEFVKRDDLTRPLESYQETL-KIKKLILPP-----MGEVFKVLVLIKN 345 >gi|54295688|ref|YP_128103.1| hypothetical protein lpl2776 [Legionella pneumophila str. Lens] gi|53755520|emb|CAH17019.1| hypothetical protein lpl2776 [Legionella pneumophila str. Lens] Length = 370 Score = 224 bits (572), Expect = 1e-56, Method: Composition-based stats. Identities = 84/372 (22%), Positives = 157/372 (42%), Gaps = 25/372 (6%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGEM 62 L + ++N +K ++ + + P GYY++ G GDF+TAPE++ ++G+ Sbjct: 2 NLKQTLINQLKLQKEIPFIDFMQQALYAPYEGYYTSGLQKLGKHGDFITAPELTSLYGKT 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 LA P L E G G G + +D L + +L+ S +++E S L Sbjct: 62 LANQFQQILPLLNSPV---LFEFGAGTGKLCIDTLTHLEQLQCLPES---YFILEVSADL 115 Query: 123 TLIQKKQLASYGDKINWYTSLADVPL---GFTFLVANEFFDSLPIKQFVMTEHGIRERMI 179 Q++ + ++ D ++ANE D++P+ +F+ +E I E + Sbjct: 116 RHRQQELIQQKIPQLAGLIHWLDKWPEAPFNGVIIANEVLDAMPVYRFMQSETEILESYV 175 Query: 180 DIDQHDSLVFNIGDHEIKSNF----LTCSDYFLGAIFENSPCRDREMQSISDRLACDGGT 235 +D+HD L + + + E + D + + S L Sbjct: 176 TLDEHDQLAEIFKPVQNQRLLAYIKNRLPPLDYPYLTEANLFLDDWLLNCSHMLKKGALF 235 Query: 236 AIVIDYG----YLQSRVGDTLQAVKGH-TYVSPLVNPGQADLSSHVDFQRLSSIAILYKL 290 I + Y R TL H ++ +PL+NPG+ D+++HVDF ++ Sbjct: 236 IIDYGFPRHEYYHPDRNQGTLMCHYQHYSHPNPLLNPGEQDITAHVDFTHVAEAGHQAGF 295 Query: 291 YINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVV 350 +I G T Q FL G+ +L + L + K+ + MGELFK++ + Sbjct: 296 HIAGYTNQASFLLANGLLSLIHTL-----DDEQELFAAKQAIKQLTQPSEMGELFKVIAL 350 Query: 351 SHE-KVELMPFV 361 + + +++L F+ Sbjct: 351 TKDLEIDLNGFL 362 >gi|218439985|ref|YP_002378314.1| hypothetical protein PCC7424_3044 [Cyanothece sp. PCC 7424] gi|218172713|gb|ACK71446.1| protein of unknown function DUF185 [Cyanothece sp. PCC 7424] Length = 387 Score = 224 bits (572), Expect = 1e-56, Method: Composition-based stats. Identities = 94/378 (24%), Positives = 152/378 (40%), Gaps = 25/378 (6%) Query: 1 MEN-KLIRKIVNLIKKNGQ--MTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEIS 56 M N LI+ ++ I ++ Q +T +Y L + PE GYY + G GD+ T+ + Sbjct: 1 MSNLNLIKILIEQINQSSQQGITFAEYMHLVLYHPELGYYCSHLPKIGTQGDYFTSSSLG 60 Query: 57 QIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMV 116 FGE+LA + WE G PS +VE+G G G++ DIL + F L+ +++ Sbjct: 61 ADFGELLAKQFLEMWEILGQPSPFIIVEMGAGLGLLAQDILNYFEQNNTHFLDSLNYWLI 120 Query: 117 ETSERLTLIQKKQLASYGD-----KINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 E S L QK QL Y + + +AD + +NE D+ P+ + + + Sbjct: 121 EQSSTLIKAQKNQLTPYLEKGVKLDWKTWEDIADESIIGCV-FSNELVDAFPVHRVGLEK 179 Query: 172 HGIRERMIDIDQH--DSLVFNIGDHEIKSNF------LTCSDYFLGAIFENSPCRDREMQ 223 ++E + ++ ++ + E+ F Y G E + ++ Sbjct: 180 GELKEIYVTYTENTFKEILADPSSEELNHYFKFVGVEFPSDAYPEGFQTEVNLSALSWLK 239 Query: 224 SISDRLACDGGTAIVIDYG----YLQSRVGDTLQAVKGH-TYVSPLVNPGQADLSSHVDF 278 ++S +L I Y Y R TL H + P +N G DL++HVDF Sbjct: 240 TLSQKLKRGYILTIDYGYPAHKYYHPQRYRGTLNCYYQHRHHHDPYINIGYQDLTAHVDF 299 Query: 279 QRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADK 338 L L G T QG FL LG+ R L +++ L D Sbjct: 300 TALERQGQKCGLEKLGFTQQGLFLMSLGLGDRLNDLSSGRYNLIEVMNRRDAL-HQLIDP 358 Query: 339 KSMGELFKILVVSHEKVE 356 +G F +LV Sbjct: 359 TGLGG-FGVLVQCKGLSP 375 >gi|52843059|ref|YP_096858.1| hypothetical protein lpg2864 [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] gi|52630170|gb|AAU28911.1| hypothetical protein lpg2864 [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] Length = 372 Score = 224 bits (571), Expect = 1e-56, Method: Composition-based stats. Identities = 85/375 (22%), Positives = 158/375 (42%), Gaps = 25/375 (6%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIF 59 M L + ++N +K ++ + + P GYY++ G GDF+TAPE++ ++ Sbjct: 1 MIMNLKQTLINQLKLQKEIPFIDFMQQALYAPYEGYYTSGLQKLGKHGDFITAPELTSLY 60 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 G+ LA P L E G G G + +D L + +L+ S +++E S Sbjct: 61 GKTLANQFQQILPLLDSPV---LFEFGAGTGKLCIDTLTHLEQLQCLPES---YFILEVS 114 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPL---GFTFLVANEFFDSLPIKQFVMTEHGIRE 176 L Q++ + ++ D ++ANE D++P+ +F+ +E I E Sbjct: 115 ANLRHRQQELIQQKIPQLAGLIHWLDKWPEAPFNGVIIANEVLDAMPVYRFMQSETEILE 174 Query: 177 RMIDIDQHDSLVFNIGDHEIKSNF----LTCSDYFLGAIFENSPCRDREMQSISDRLACD 232 + +D+HD L + + + E + D + + S L Sbjct: 175 SYVTLDEHDQLTEIFKPVQNQRLLAYIKNRLPPLDYPYLTEANLFLDDWLLNCSHMLKKG 234 Query: 233 GGTAIVIDYG----YLQSRVGDTLQAVKGH-TYVSPLVNPGQADLSSHVDFQRLSSIAIL 287 I + Y R TL H ++ +PL+NPG+ D+++HVDF ++ Sbjct: 235 ALFIIDYGFPRHEYYHPDRNQGTLMCHYQHYSHPNPLLNPGEQDITAHVDFTHVAEAGHQ 294 Query: 288 YKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKI 347 +I G T Q FL G+ +L + L + K+ + MGELFK+ Sbjct: 295 AGFHIAGYTNQASFLLANGLLSLIHTL-----DDEQELFAAKQAIKQLTQPSEMGELFKV 349 Query: 348 LVVSHE-KVELMPFV 361 + ++ + +++L F+ Sbjct: 350 IALTKDLEIDLNGFL 364 >gi|302853902|ref|XP_002958463.1| hypothetical protein VOLCADRAFT_69474 [Volvox carteri f. nagariensis] gi|300256191|gb|EFJ40463.1| hypothetical protein VOLCADRAFT_69474 [Volvox carteri f. nagariensis] Length = 374 Score = 224 bits (571), Expect = 2e-56, Method: Composition-based stats. Identities = 116/375 (30%), Positives = 178/375 (47%), Gaps = 44/375 (11%) Query: 28 CVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVELGP 87 C+ P G+Y T + FG+ GDFVT+PEISQ+FGEM+ ++ + W G P + LVELGP Sbjct: 2 CLTSPHGGFYMTRDVFGSSGDFVTSPEISQLFGEMVGVWCVHTWLSLGRPPRLLLVELGP 61 Query: 88 GRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGD------------ 135 GRG +M D+LR + +F + L +++VE S L +Q L + Sbjct: 62 GRGTLMADLLRGTAAFR-EFSASLELHLVEISPALRAVQWAALRCTNNSSSSGGSDSSGR 120 Query: 136 --------KINWYTSLADVPLG--FTFLVANEFFDSLPIKQFVMTEHGIR-ERMIDIDQH 184 +++W+TSL VP G +A+EFFD+LP+ QFV E+M+D+ Sbjct: 121 GIHTFDRTQVSWHTSLDAVPDGPAPALYIAHEFFDALPVHQFVRDPKRGWLEKMVDVRTG 180 Query: 185 DSLVFNIGDHEIKSNFLT-----CSDYFLGAI--FENSPCRDREMQSISDRLACDGGTAI 237 +V + G + + + G + E S + ++ R+ GG A+ Sbjct: 181 LRMVLSPGPTPASALLVPRRLGGLAPGSAGELGALEVSAVGMAVAEKLAQRIVRHGGAAL 240 Query: 238 VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLS--------SIAILYK 289 V+DYG + GD+L A++GH V L PG ADLS+ VDF L Sbjct: 241 VVDYGREEPPYGDSLMAIRGHRGVGILDAPGSADLSAWVDFGALKLAPPSPAAVAPPAAS 300 Query: 290 LYINGLTTQGKFLEGLGIWQRAFSLMK---QTARKDILLDSVKRLVSTSADKKSMGELFK 346 + +G +Q FL LGI R LM+ + L RL+ S + MGE +K Sbjct: 301 VTTSGPVSQSSFLRALGIEMRLQWLMRGATDPGAAESLAAGCARLLG-STESGGMGESYK 359 Query: 347 ILVVSHEKV-ELMPF 360 ++ + + +L F Sbjct: 360 VMAIHRGDLRDLAGF 374 >gi|255074399|ref|XP_002500874.1| predicted protein [Micromonas sp. RCC299] gi|226516137|gb|ACO62132.1| predicted protein [Micromonas sp. RCC299] Length = 378 Score = 224 bits (571), Expect = 2e-56, Method: Composition-based stats. Identities = 121/380 (31%), Positives = 177/380 (46%), Gaps = 46/380 (12%) Query: 25 FALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVE 84 C+ PEFGYY + FG GDFVT+PE+SQ FGE++ + WE G PS VR+VE Sbjct: 1 MQECLTHPEFGYYMHRDVFGEAGDFVTSPEVSQAFGELMGAWAAWTWESMGKPSTVRIVE 60 Query: 85 LGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQL-------------- 130 LGPGRG +M D+LR LK F ++++MV+ S Q++ L Sbjct: 61 LGPGRGTLMADLLRGTKNLK-GFADAVTVHMVDVSPANRKAQREALKCGPKTDDAENGDD 119 Query: 131 -----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHD 185 + W+ ++ VP G T ++A+EFFD++P+ QF TE G ER++ I Sbjct: 120 NTGKNPTRHGLGKWHETMDAVPPGPTIVIAHEFFDAMPVHQFTRTERGWCERLVAISGDM 179 Query: 186 SLVFNIGDHEIKSNFLTCSDYFLGAI-----FENSPCRDREMQSISDRLACDGGTAIVID 240 L + E SP + I+ RL GG AI ID Sbjct: 180 VLSPGLTPAGALMVPRRLEGVEASRRDGLRQLEISPRSLAIWERIAARLEEHGGAAIAID 239 Query: 241 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLS---SIAILYKLYINGLTT 297 YG + +GDTLQA++ H +V L +PG+ADLS++VDF + + +G T Sbjct: 240 YG-EEGPLGDTLQAIRDHEFVDVLTDPGRADLSAYVDFGAMRRVIETRKNSGVECHGPVT 298 Query: 298 QGKFLEGLGIWQRAFSLMKQTARK---DILLDSVKRLVSTSADK-------------KSM 341 Q L GLGI Q ++++ A + D L+ +RLVS M Sbjct: 299 QRDLLFGLGIGQWLEKMVEKCATEKEVDKLIAGCERLVSGEQGALGEGGRTEGAGKGAGM 358 Query: 342 GELFKILVVSHEKV-ELMPF 360 G +K L + + + + F Sbjct: 359 GFRYKALAMVSKGLGKPAGF 378 >gi|284929791|ref|YP_003422313.1| hypothetical protein UCYN_12660 [cyanobacterium UCYN-A] gi|284810235|gb|ADB95932.1| uncharacterized conserved protein [cyanobacterium UCYN-A] Length = 403 Score = 224 bits (570), Expect = 2e-56, Method: Composition-based stats. Identities = 84/350 (24%), Positives = 149/350 (42%), Gaps = 20/350 (5%) Query: 19 MTVDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFP 77 +T Y L + D + GYY + N G+ GDF T+ + FGE+LA E G Sbjct: 35 ITFFDYMNLVLYDSQQGYYGSGNVNIGSEGDFFTSSSLGPDFGELLAEQFKEMAETLGCS 94 Query: 78 SCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKI 137 + L+E+G G ++ DIL+ + K P+F+ +L ++E SE L QK+ L + Sbjct: 95 NKFTLIEVGAGYAVLASDILKYLEKKYPEFYQILDYIIIEESEALIKKQKEHLKHFSKI- 153 Query: 138 NWYTSLADVPLGFTF--LVANEFFDSLPIKQFVMTEHGIRERMIDIDQHD--------SL 187 ++S D+ + +NE D+ P+ Q ++ + ++E + + + + S Sbjct: 154 -KWSSWEDISNNSVVGCIFSNELIDAFPVHQIIVQDKELKEVYVTVREGNIEETIQELST 212 Query: 188 VFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAI----VIDYGY 243 + ++ + LT Y E + ++++S ++ I + Y Sbjct: 213 SQLLEYFQLINIDLTSQHYPENYRTEVNLKALNWLETVSHKIKKGYLVTIDYGHIASKYY 272 Query: 244 LQSRVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 R TL H + +P +N G D+++HVDF + L L + GLT Q F Sbjct: 273 HPQRYQGTLSCYYQHRYHCNPYINLGSQDITAHVDFSAMEIQGNLLGLELIGLTQQELFF 332 Query: 303 EGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 LG+ +R L + +L L D +G FK+L+ Sbjct: 333 INLGLGERLAELSSNKYKLSDVLARRDSL-RQLIDPGGLG-RFKVLIQGK 380 >gi|148361175|ref|YP_001252382.1| hypothetical protein LPC_3149 [Legionella pneumophila str. Corby] gi|148282948|gb|ABQ57036.1| conserved hypothetical protein; DUF185 [Legionella pneumophila str. Corby] Length = 372 Score = 224 bits (570), Expect = 2e-56, Method: Composition-based stats. Identities = 87/375 (23%), Positives = 160/375 (42%), Gaps = 25/375 (6%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIF 59 M L + ++N +K ++ + + P GYY++ G GDF+TAPE++ ++ Sbjct: 1 MIMNLKQTLINQLKLQKEIPFIDFMQQALYAPYEGYYTSGLQKLGKHGDFITAPELTSLY 60 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 G+ LA P L E G G G + +D L + +L+ S L ++E S Sbjct: 61 GKTLANQFQQILPLLDSPV---LFEFGAGTGKLCIDTLTHLEQLQCLPESYL---ILEVS 114 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPL---GFTFLVANEFFDSLPIKQFVMTEHGIRE 176 L Q++ + ++ D ++ANE D++P+ +F+ +E I E Sbjct: 115 ANLRHRQQELIQQKIPQLAGLIHWLDKWPEAPFNGVIIANEVLDAMPVYRFMQSETEILE 174 Query: 177 RMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG----AIFENSPCRDREMQSISDRLACD 232 + +D+HD L + + ++ + E + D + + S L Sbjct: 175 SYVTLDEHDQLAEIFKPVQNQRLLAYIKNHLPPLGYPYLTEANLFLDDWLLNCSHMLKKG 234 Query: 233 GGTAIVIDYG----YLQSRVGDTLQAVKGH-TYVSPLVNPGQADLSSHVDFQRLSSIAIL 287 I + Y R TL H ++ +PL+NPG+ DL++HVDF ++ Sbjct: 235 ALFIIDYGFPRHEYYHPDRNQGTLMCHYQHYSHPNPLLNPGEQDLTAHVDFTHVAEAGHQ 294 Query: 288 YKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKI 347 +I G T Q FL G+ +L + L + K+ + MGELFK+ Sbjct: 295 AGFHIAGYTNQASFLLANGLLSLIHTL-----DDEQELFAAKQAIKQLTQPSEMGELFKV 349 Query: 348 LVVSHE-KVELMPFV 361 + ++ + +++L F+ Sbjct: 350 IALTKDLEIDLNGFL 364 >gi|307611736|emb|CBX01441.1| hypothetical protein LPW_31301 [Legionella pneumophila 130b] Length = 370 Score = 224 bits (570), Expect = 2e-56, Method: Composition-based stats. Identities = 84/372 (22%), Positives = 156/372 (41%), Gaps = 25/372 (6%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGEM 62 L + ++N +K ++ + + P GYY++ G GDF+TAPE++ ++G+ Sbjct: 2 NLKQTLINQLKLQKEIPFIDFMQQALYAPYEGYYTSGLQKLGKHGDFITAPELTSLYGKT 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 LA P L E G G G + +D L + +L+ S +++E S L Sbjct: 62 LANQFQQILPLLDSPV---LFEFGAGTGKLCIDTLTHLEQLQCLPES---YFILEVSANL 115 Query: 123 TLIQKKQLASYGDKINWYTSLADVPL---GFTFLVANEFFDSLPIKQFVMTEHGIRERMI 179 Q++ + ++ D ++ANE D++P+ +F+ +E I E + Sbjct: 116 RHRQQELIQQKIPQLAGLIHWLDKWPEAPFNGVIIANEVLDAMPVYRFMQSEIEILESYV 175 Query: 180 DIDQHDSLVFNIGDHEIKSNF----LTCSDYFLGAIFENSPCRDREMQSISDRLACDGGT 235 +D+HD L + + + E + D + + S L Sbjct: 176 TLDEHDQLAEIFKPVQNQRLLAYIKNRLPPLDYPYLTEANLFLDDWLLNCSHMLKKGALF 235 Query: 236 AIVIDYG----YLQSRVGDTLQAVKGH-TYVSPLVNPGQADLSSHVDFQRLSSIAILYKL 290 I + Y R TL H ++ +PL+NPG+ D+++HVDF ++ Sbjct: 236 IIDYGFPRHEYYHPDRNQGTLMCHYQHYSHPNPLLNPGEQDITAHVDFTHVAEAGHQAGF 295 Query: 291 YINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVV 350 +I G T Q FL G+ +L L + K+ + MGELFK++ + Sbjct: 296 HIAGYTNQASFLLANGLLSLIHTL-----DDAQELFAAKQAIKQLTQPSEMGELFKVIAL 350 Query: 351 SHE-KVELMPFV 361 + + +++L F+ Sbjct: 351 TKDLEIDLNGFL 362 >gi|94309126|ref|YP_582336.1| hypothetical protein Rmet_0181 [Cupriavidus metallidurans CH34] gi|93352978|gb|ABF07067.1| conserved hypothetical protein; putative exported protein [Cupriavidus metallidurans CH34] Length = 398 Score = 224 bits (570), Expect = 2e-56, Method: Composition-based stats. Identities = 88/376 (23%), Positives = 150/376 (39%), Gaps = 40/376 (10%) Query: 9 IVNLI-KKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAV----GDFVTAPEISQIFGEM 62 I I G + D++ +L + P GYYS FG DF+TAPE++ F Sbjct: 25 IAGAIDAAGGWIGFDRFMSLALYAPRLGYYSGGAAKFGRDVNDGSDFITAPELTPFFART 84 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRG--IMMLDILRVICKLKPDFFSVLSIYMVETSE 120 LA + P R++E G G G L + PD + +VE S Sbjct: 85 LARQFAP-LVRMNLP---RVMEFGAGTGRLAADLLLALETEDALPDTYQ-----IVELSG 135 Query: 121 RLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRE 176 L Q+ L ++ W +L D G +V NE D++P++ + T E Sbjct: 136 ELRARQQATLDQRAPHLAGRVTWLDALPDRFEG--VVVGNEVLDAMPVRLYARTGGRWHE 193 Query: 177 RMIDIDQHDS---LVFNIGDHEIKSNFLTC----SDYFLGAIFENSPCRDREMQSISDRL 229 R + + F D ++ + + E + +++ L Sbjct: 194 RGVVCAKDGKAGAEAFAFEDRPLEDGAIPAVLLGIPGDHDIVTETHTEAEGFTRAVGALL 253 Query: 230 ACDGGTAIVIDYG----YLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSI 284 A I + Y R G T + + H + P + PG D+++HV+F ++ Sbjct: 254 ARGAAFFIDYGFPASEYYHPHRTGGTLMCHYRHHAHPDPFLYPGLQDITAHVNFSGIAQA 313 Query: 285 AILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARK-DILLDSVKRLVSTSADKKSMGE 343 + L ++G +Q +FL GI + SL AR ++V++L+S + MGE Sbjct: 314 GVEAGLQVSGYASQARFLMNAGITELMMSLDPSDARAFLPQANAVQKLLS----EAEMGE 369 Query: 344 LFKILVVSHEKVELMP 359 LFK++ ++ + MP Sbjct: 370 LFKVIALTRGIDDAMP 385 >gi|54298855|ref|YP_125224.1| hypothetical protein lpp2922 [Legionella pneumophila str. Paris] gi|53752640|emb|CAH14075.1| hypothetical protein lpp2922 [Legionella pneumophila str. Paris] Length = 370 Score = 224 bits (570), Expect = 2e-56, Method: Composition-based stats. Identities = 85/372 (22%), Positives = 157/372 (42%), Gaps = 25/372 (6%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGEM 62 L + ++N +K ++ + + P GYY++ G GDF+TAPE++ ++G+ Sbjct: 2 NLKQTLINQLKLQKEIPFIDFMQQALYAPYEGYYTSGLQKLGKHGDFITAPELTSLYGKT 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 LA P L E G G G + +D L + +L+ S +++E S L Sbjct: 62 LANQFQQILPLLDSPV---LFEFGAGTGKLCIDTLTHLEQLQCLPES---YFILEVSANL 115 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTF---LVANEFFDSLPIKQFVMTEHGIRERMI 179 Q++ + ++ D F ++ANE D++P+ +F+ +E I E + Sbjct: 116 RHRQQELIQQKIPQLAGLIHWLDKWPETPFNGVIIANEVLDAMPVYRFMQSETEILESYV 175 Query: 180 DIDQHDSLVFNIGDHEIKSNF----LTCSDYFLGAIFENSPCRDREMQSISDRLACDGGT 235 +D+HD L + + + E + D + + S L Sbjct: 176 ALDEHDQLAEIFKPVQNQRLLAYIKNRLPPLDYPYLTEANLFLDDWLLNCSHMLKKGALF 235 Query: 236 AIVIDYG----YLQSRVGDTLQAVKGH-TYVSPLVNPGQADLSSHVDFQRLSSIAILYKL 290 I + Y R TL H ++ +PL+NPG+ D++SHVDF ++ Sbjct: 236 IIDYGFPRHEYYHPDRNQGTLMCHYQHYSHPNPLLNPGEQDITSHVDFTHVAEAGHQAGF 295 Query: 291 YINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVV 350 ++ G T Q FL G+ +L + L K+ + MGELFK++ + Sbjct: 296 HVAGYTNQASFLLANGLLSLIHTL-----DDEQELFVAKQAIKQLTQPSEMGELFKVIAL 350 Query: 351 SHE-KVELMPFV 361 + + +++L F+ Sbjct: 351 TKDLEIDLNGFL 362 >gi|296108510|ref|YP_003620211.1| hypothetical protein lpa_04164 [Legionella pneumophila 2300/99 Alcoy] gi|295650412|gb|ADG26259.1| Hypothetical protein lpa_04164 [Legionella pneumophila 2300/99 Alcoy] Length = 370 Score = 223 bits (569), Expect = 2e-56, Method: Composition-based stats. Identities = 86/372 (23%), Positives = 159/372 (42%), Gaps = 25/372 (6%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGEM 62 L + ++N +K ++ + + P GYY++ G GDF+TAPE++ ++G+ Sbjct: 2 NLKQTLINQLKLQKEIPFIDFMQQALYAPYEGYYTSGLQKLGKHGDFITAPELTSLYGKT 61 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 LA P L E G G G + +D L + +L+ S L ++E S L Sbjct: 62 LANQFQQILPLLDSPV---LFEFGAGTGKLCIDTLTHLEQLQCLPESYL---ILEVSANL 115 Query: 123 TLIQKKQLASYGDKINWYTSLADVPL---GFTFLVANEFFDSLPIKQFVMTEHGIRERMI 179 Q++ + ++ D ++ANE D++P+ +F+ +E I E + Sbjct: 116 RHRQQELIQQKIPQLAGLIHWLDKWPEAPFNGVIIANEVLDAMPVYRFMQSETEILESYV 175 Query: 180 DIDQHDSLVFNIGDHEIKSNFLTCSDYFLG----AIFENSPCRDREMQSISDRLACDGGT 235 +D+HD L + + ++ + E + D + + S L Sbjct: 176 TLDEHDQLAEIFKPVQNQRLLAYIKNHLPPLGYPYLTEANLFLDDWLLNCSHMLKKGALF 235 Query: 236 AIVIDYG----YLQSRVGDTLQAVKGH-TYVSPLVNPGQADLSSHVDFQRLSSIAILYKL 290 I + Y R TL H ++ +PL+NPG+ DL++HVDF ++ Sbjct: 236 IIDYGFPRHEYYHPDRNQGTLMCHYQHYSHPNPLLNPGEQDLTAHVDFTHVAEAGHQAGF 295 Query: 291 YINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVV 350 +I G T Q FL G+ +L + L + K+ + MGELFK++ + Sbjct: 296 HIAGYTNQASFLLANGLLSLIHTL-----DDEQELFAAKQAIKQLTQPSEMGELFKVIAL 350 Query: 351 SHE-KVELMPFV 361 + + +++L F+ Sbjct: 351 TKDLEIDLNGFL 362 >gi|284106845|ref|ZP_06386288.1| succinate dehydrogenase iron-sulfur subunit [Candidatus Poribacteria sp. WGA-A3] gi|283830024|gb|EFC34300.1| succinate dehydrogenase iron-sulfur subunit [Candidatus Poribacteria sp. WGA-A3] Length = 382 Score = 223 bits (568), Expect = 4e-56, Method: Composition-based stats. Identities = 92/370 (24%), Positives = 164/370 (44%), Gaps = 35/370 (9%) Query: 12 LIKKNGQMTVDQYFALCVADPEFGYYSTC--------NPFGAVG-DFVTAPEISQIFGEM 62 LI++ G +T ++ L + D E GYY+T +P G G DF TAP +S + + Sbjct: 4 LIRETGPLTFARFMELALYDDEHGYYTTGGGRSASVTSPIGREGGDFFTAPSLSPVLAKC 63 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L L ++ G P LVE+GPG G ++ D+L+ + +P S L+ +VE S Sbjct: 64 LVRQLAEIDDRLGHPPVFDLVEMGPGDGTLLRDMLQECREQQPSLLSRLACILVERSPAF 123 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFT-----FLVANEFFDSLPIKQFVMTEHGIRER 177 Q++ LA++ ++ + ++ L++NE D+ P+ + M + G++E Sbjct: 124 RRRQQETLAAWKEQGTEIQWVDELQAISEASLTGTLLSNELVDAFPVHRVRMGQDGLQEL 183 Query: 178 MIDIDQH--DSLVFNIGDHEIKSNFLTCS-DYFLGAIFENSPCRDREMQSISDRLACDGG 234 + H ++ S + G E S + ++ ++ L G Sbjct: 184 YVTSRDHALQEQWGEPSTPDLSSFLSELDVELPAGFTTEISLEAVKWIKQVARVLDR--G 241 Query: 235 TAIVIDYGYLQS------RVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAIL 287 I IDYG+ R TL A HT +P G+ DL++HV+F L+ Sbjct: 242 VVITIDYGHTARDYVALERKNGTLMAYYRHTVSTNPYQRVGEQDLTAHVNFSSLAHTGEQ 301 Query: 288 YKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKI 347 L GLT FL LGI + L++ ++ ++ + +L+ MG FK Sbjct: 302 VGLTTTGLTNLQHFLMSLGIEE----LVRGCDQESAVVRAAAQLLR----PHGMGTTFK- 352 Query: 348 LVVSHEKVEL 357 +++ H+ +++ Sbjct: 353 ILMQHKGLDI 362 >gi|328770714|gb|EGF80755.1| hypothetical protein BATDEDRAFT_11090 [Batrachochytrium dendrobatidis JAM81] Length = 408 Score = 223 bits (568), Expect = 4e-56, Method: Composition-based stats. Identities = 113/396 (28%), Positives = 188/396 (47%), Gaps = 57/396 (14%) Query: 16 NGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHG 75 NG ++ Q+ + P GYY FG GDF+T+PEI+Q+FGE++AI+ + W+ + Sbjct: 2 NGPISTAQFMRQALIHPLGGYYMKGKVFGPHGDFITSPEINQMFGELMAIWFMNHWQTYQ 61 Query: 76 F------PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLS-IYMVETSERLTLIQKK 128 ++ELGPGRG +M D+L + +LK + L+ +++VE S L Q + Sbjct: 62 SIPSTAPQKPFNIIELGPGRGTLMADMLTTLTQLKTSTINPLNAVHLVEASPELRKTQAQ 121 Query: 129 QLASYGDKIN----------------------WYTSLADVPLGFT----FLVANEFFDSL 162 L D + ++ D F++A+EFFD++ Sbjct: 122 MLNCTMDPLENSTYRPGDGTTVTGTTSQGVKVYWHDTFDCIPVDETVASFIIAHEFFDAM 181 Query: 163 PIKQFVMTEHGIRERMIDIDQ----HDSLVFNIGDHEIKSNF--------LTCSDYFLGA 210 P+ +F TE G RE M+D+D+ + F K++ ++ +G Sbjct: 182 PVYKFQKTEQGWREIMVDLDETPTLPYNFRFICAPSATKASVALMQPTVTDRFANVQIGE 241 Query: 211 IFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQA 270 E +P ++I++R+A DGG A+ +DYG + D+L+ ++ H V PL PG Sbjct: 242 RIELAPDVVGVSKAIAERVANDGGIALFVDYGRDH-VIDDSLRGIRNHKIVHPLSMPGDC 300 Query: 271 DLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMK---QTARKDILLDS 327 DLS+ VDF + A+ G QG FLE +GI RA +L++ K ++ + Sbjct: 301 DLSADVDFSAI-QHAVKGIANAYGPVPQGLFLEKMGIGARAMALIQGATDNKTKAAVVAA 359 Query: 328 VKRLVSTSADKKSMGELFKILVVSHEKV---ELMPF 360 +RLV D +MG+ +KI+ ++ L PF Sbjct: 360 YERLV----DPDAMGQAYKIMAITSSTTVPYALEPF 391 >gi|317401374|gb|EFV82009.1| hypothetical protein HMPREF0005_01017 [Achromobacter xylosoxidans C54] Length = 393 Score = 223 bits (567), Expect = 4e-56, Method: Composition-based stats. Identities = 90/364 (24%), Positives = 156/364 (42%), Gaps = 33/364 (9%) Query: 16 NGQMTVDQYFALCVADPEFGYYSTCNP-------FGAVGDFVTAPEISQIFGEMLAIFLI 68 +G + D + A + P GYY+ N GDFVTAP+++ +F LA + Sbjct: 35 SGWLPFDHWMAEALYAPGLGYYAAGNVKLAEADARAPAGDFVTAPQLTPLFARTLARQVA 94 Query: 69 CAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKK 128 Q ++E G G G + V+ +L ++E S L Q + Sbjct: 95 QVLRQT---DTQTVLEFGAGTGALAEG---VLRELDAQGLQETQYLILEISADLRARQAE 148 Query: 129 QLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVM-TEHGIRERMIDIDQHDSL 187 +LA +G+++ W +L + G ++ANE D++P+ F + + ER + +D Sbjct: 149 RLAPFGERVRWLDALPERFRGC--VLANEVLDAMPVSLFRWSDDGVVLERGVTLDPAQGF 206 Query: 188 VFNIGDHEIKSNFLTCSDYF--LGAIFENSPCRDREMQSISDRLACDGGTAIVIDY---- 241 V+ + + G + E + + + +S L G A+++DY Sbjct: 207 VWEDRPAPRRLAEAVAARMPALPGYVSEINLQAEAWIDGMSGWLEQ--GAALLVDYGFPQ 264 Query: 242 --GYLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQ 298 Y R G T + ++ H + P PG D+++HVDF ++ A L + G T+Q Sbjct: 265 GEYYHPQRAGGTLMCHLRHHAHGDPFTAPGLQDITAHVDFTAMADAAQAAGLQVLGYTSQ 324 Query: 299 GKFLEGLGIWQRAFSLMKQTARKDILLDS-VKRLVSTSADKKSMGELFKILVVSHE-KVE 356 +FL G+ + L AR + V++L+S + MGELFK+L V Sbjct: 325 ARFLMNAGLMELLAQLDPTDARTYAQAVAPVQKLLS----EAEMGELFKVLAVGRGITQP 380 Query: 357 LMPF 360 L+ F Sbjct: 381 LIGF 384 >gi|311109371|ref|YP_003982224.1| hypothetical protein AXYL_06216 [Achromobacter xylosoxidans A8] gi|310764060|gb|ADP19509.1| hypothetical protein AXYL_06216 [Achromobacter xylosoxidans A8] Length = 395 Score = 223 bits (567), Expect = 5e-56, Method: Composition-based stats. Identities = 90/368 (24%), Positives = 155/368 (42%), Gaps = 35/368 (9%) Query: 14 KKNGQMTVDQYFALCVADPEFGYYSTCNP---------FGAVGDFVTAPEISQIFGEMLA 64 G ++ D + A + P GYY+ N GDFVTAP++S +F LA Sbjct: 33 AHGGWLSFDHWMAQALYAPGLGYYAAGNVKLADADGDVRAPAGDFVTAPQLSPLFARTLA 92 Query: 65 IFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTL 124 Q ++E G G G + V+ +L + ++E S L Sbjct: 93 RQAAQVLRQT---QTETVLEFGAGTGALAEG---VLRELDAMGLAQTRYLILEVSADLRA 146 Query: 125 IQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEH-GIRERMIDIDQ 183 Q ++LA++GD++ W +L G ++ANE D++P+ F +E + ER + +D Sbjct: 147 RQAERLAAFGDRVQWLDALPQSFAGC--VLANEVLDAMPVSIFHWSEAGEVLERGVAVDA 204 Query: 184 HDSLVFNIGDHEIKSNFLTCSDYF--LGAIFENSPCRDREMQSISDRLACDGGTAIVIDY 241 V+ + G + E + + + ++ L G A+++DY Sbjct: 205 SQDFVWQDRPAPPALAAAVAARMPALPGYVSEINLQGEAWIAAMGGWLER--GAALLVDY 262 Query: 242 ------GYLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYING 294 Y R G T + ++ H + P PG D+++HVDF ++ A+ L + G Sbjct: 263 GFPRSEYYHPQRAGGTLMCHLRHHAHGDPFTAPGLQDITAHVDFTAMADAALEAGLQVLG 322 Query: 295 LTTQGKFLEGLGIWQRAFSLMKQTARKDILLDS-VKRLVSTSADKKSMGELFKILVVSHE 353 T+Q +FL G+ L ++ + V++L+S + MGELFK+L V Sbjct: 323 YTSQARFLMNAGLMDLLAQLDPSDVQQYAQAVAPVQKLLS----ESEMGELFKVLAVGRG 378 Query: 354 -KVELMPF 360 L F Sbjct: 379 ITEPLAGF 386 >gi|322499230|emb|CBZ34302.1| unnamed protein product [Leishmania donovani BPK282A1] Length = 447 Score = 222 bits (566), Expect = 5e-56, Method: Composition-based stats. Identities = 112/383 (29%), Positives = 180/383 (46%), Gaps = 30/383 (7%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYS-TCNPFG-AVGDFVTAPEISQIF 59 + L ++VN I G + Q+ C+ P++GYY+ N G DF+TA EI F Sbjct: 56 KTALCNELVNKITAQGYYPMSQFVKDCLTHPQYGYYAAKKNVIGSEKADFITAAEI-PFF 114 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 G++LA +++ AW++ G P + LVE+GPGRG +M +L+ I P L I++VE Sbjct: 115 GDVLAAWVMDAWQKMGTPRVLHLVEMGPGRGTLMRTMLKQIQYSNPHLLHFLQIHLVEVG 174 Query: 120 ERLTLIQKKQLASYGD---KINWYTSLADVPLG--FTFLVANEFFDSLPIKQFVMTEHGI 174 QK+ L + KI W+ L +P T +ANE+FD+LP+ QF TE G Sbjct: 175 AARREEQKRALTEFQTAQGKIKWWMDLESLPFSLEPTVFIANEYFDALPVAQFRYTERGW 234 Query: 175 RERMIDIDQHDSLVFNIGDHEIKSNFLTCS--------DYFLGAIFENSPCRDREMQSIS 226 E +++D + S ++ G E + + M+++ Sbjct: 235 VETCVEVDTDPGTEAHFRLVHAPSGSMSAYLIPDDIRTKGKTGDCVEVNTVGMQTMETLM 294 Query: 227 DR-LACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIA 285 + + C ++IDYG + TL+ ++GH +V PL++PG+ DLS V F++L Sbjct: 295 KKMIDCQKAACLLIDYG-KDDHMHSTLRGIRGHRFVDPLLSPGEVDLSCWVSFRQLRWAL 353 Query: 286 ILY-----KLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDIL--LDSVKRLVSTSADK 338 L + TQ +FL GI R ++K K L L + +RL+ DK Sbjct: 354 ERLEVARRHLKWFPVMTQHEFLLENGIDIRLAHVIKDEETKTALKVLQNYRRLM----DK 409 Query: 339 KSMGELFKILVVSHEKVE-LMPF 360 + MGE +K+ + PF Sbjct: 410 EEMGESYKVFAFQTRNFPNVSPF 432 >gi|146087293|ref|XP_001465782.1| hypothetical protein [Leishmania infantum JPCM5] gi|134069882|emb|CAM68210.1| conserved hypothetical protein [Leishmania infantum JPCM5] Length = 447 Score = 222 bits (566), Expect = 6e-56, Method: Composition-based stats. Identities = 112/383 (29%), Positives = 179/383 (46%), Gaps = 30/383 (7%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYS-TCNPFG-AVGDFVTAPEISQIF 59 + L ++VN I G + Q+ C+ P++GYY+ N G DF+TA EI F Sbjct: 56 KTALCNELVNKITAQGYYPMSQFVKDCLTHPQYGYYAAKKNVIGSEKADFITAAEI-PFF 114 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 G++LA +++ AW++ G P + LVE+GPGRG +M +L+ I P L I++VE Sbjct: 115 GDVLAAWVMDAWQKMGTPRVLHLVEMGPGRGTLMRTMLKQIQYSNPHLLHFLQIHLVEVG 174 Query: 120 ERLTLIQKKQLASYGD---KINWYTSLADVPLG--FTFLVANEFFDSLPIKQFVMTEHGI 174 QK+ L + KI W+ L +P T +ANE+FD+LP+ QF TE G Sbjct: 175 AARREEQKRALTEFQTAQGKIKWWMDLESLPFSLEPTVFIANEYFDALPVAQFRYTERGW 234 Query: 175 RERMIDIDQHDSLVFNIGDHEIKSNFLTCS--------DYFLGAIFENSPCRDREMQSIS 226 E +++D + S ++ G E + + M+++ Sbjct: 235 VETCVEVDTDPGTEAHFRLVHAPSGSMSAYLIPDDIRTKGKTGDCVEVNTVGMQTMETLM 294 Query: 227 DR-LACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIA 285 + + C ++IDYG + TL+ ++GH +V PL++PG+ DLS V F++L Sbjct: 295 KKMIDCQKAACLLIDYG-KDDHMHSTLRGIRGHRFVDPLLSPGEVDLSCWVSFRQLRWAL 353 Query: 286 ILY-----KLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDIL--LDSVKRLVSTSADK 338 L + TQ FL GI R ++K K L L + +RL+ DK Sbjct: 354 ERLEVARRHLKWFPVMTQHDFLLENGIDIRLAHVIKDEETKTALKVLQNYRRLM----DK 409 Query: 339 KSMGELFKILVVSHEKVE-LMPF 360 + MGE +K+ + PF Sbjct: 410 EEMGESYKVFAFQTRNFPNVSPF 432 >gi|296536483|ref|ZP_06898576.1| protein of hypothetical function DUF185 [Roseomonas cervicalis ATCC 49957] gi|296263195|gb|EFH09727.1| protein of hypothetical function DUF185 [Roseomonas cervicalis ATCC 49957] Length = 335 Score = 222 bits (565), Expect = 7e-56, Method: Composition-based stats. Identities = 123/342 (35%), Positives = 176/342 (51%), Gaps = 19/342 (5%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCV 80 +D++ A A G +PFGA GDF+TAPEISQ FGE L ++ AW+ G P+ V Sbjct: 8 LDRFMARAAAAYYAG----RDPFGARGDFITAPEISQAFGECLGLWAAIAWQAMGRPAPV 63 Query: 81 RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWY 140 LVELGPGRG +M D LR I ++ PDF + L +++VE S L Q + LA W+ Sbjct: 64 LLVELGPGRGTLMADALRAIAQVVPDFRAALRLHLVEQSPALRARQAELLAG--ADPAWH 121 Query: 141 TSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNF 200 + D+P G ++ANEF D+LPI+QF ER ++ N Sbjct: 122 DRVEDLPPGPALVLANEFLDALPIRQFERRGGAWLERHVE-------DGNFVLLPAADAP 174 Query: 201 LTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTY 260 ++ GAI E + RLA GG A+ +DYG +S +GD+LQA+ H Sbjct: 175 PLPAEAPEGAIQEVCEPARAIAAQLGARLAAQGGAALFVDYGPARSGLGDSLQALSAHGA 234 Query: 261 VSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMK-QTA 319 PL PG AD+++HVDFQ ++ + G QG FL+ LG+ RA L + + A Sbjct: 235 ADPLGTPGAADITAHVDFQAVAEAGMAAGAAAQGPVPQGIFLQSLGLVTRAAMLARARPA 294 Query: 320 RKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE-LMPF 360 + L + +RL++ + MG LFK L + H + L F Sbjct: 295 SAGMQLSAAQRLIA----PEGMGRLFKALALCHPALPILPGF 332 >gi|322491965|emb|CBZ27238.1| conserved hypothetical protein [Leishmania mexicana MHOM/GT/2001/U1103] Length = 470 Score = 222 bits (565), Expect = 8e-56, Method: Composition-based stats. Identities = 111/383 (28%), Positives = 180/383 (46%), Gaps = 30/383 (7%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYS-TCNPFG-AVGDFVTAPEISQIF 59 + L ++VN I G + Q+ C+ P++GYY+ N G DF+TA EI F Sbjct: 79 KTALCNELVNKITAQGYYPMSQFVKDCLTHPQYGYYAAKKNVIGSEKADFITAAEI-PFF 137 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 G++LA +++ AW++ G P + LVE+GPGRG +M +L+ I P L I++VE Sbjct: 138 GDVLAAWVMDAWQKMGTPRVLHLVEMGPGRGTLMRTMLKQIQYSNPHLLHFLQIHLVEVG 197 Query: 120 ERLTLIQKKQLASYGD---KINWYTSLADVPLG--FTFLVANEFFDSLPIKQFVMTEHGI 174 QK+ LA + KI W+ L +P T +ANE+FD+LP+ QF TE G Sbjct: 198 AARREEQKRALAEFQTAQGKIKWWMDLESLPFSLEPTVFIANEYFDALPVAQFRYTERGW 257 Query: 175 RERMIDIDQHDSLVFNIGDHEIKSNFLTCS--------DYFLGAIFENSPCRDREMQSIS 226 E +++D + S ++ G E + + M+++ Sbjct: 258 VETCVEVDTDPGTEAHFRLVHAPSGSMSAYLIPDDIRTKGKKGDCVEVNTVGMQTMETLM 317 Query: 227 DR-LACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIA 285 + + C ++IDYG + TL+ ++GH +V PL++PG+ DLS V F++L Sbjct: 318 KKMIDCQKAACLLIDYG-KDDHMHSTLRGIRGHRFVDPLLSPGEVDLSCWVSFRQLRWAL 376 Query: 286 ILY-----KLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDIL--LDSVKRLVSTSADK 338 L + TQ FL GI R ++K K + L + +RL+ DK Sbjct: 377 ERLEVARRHLKWFPVMTQHDFLLENGIDIRLAHVIKDEETKAAMKVLQNYRRLM----DK 432 Query: 339 KSMGELFKILVVSHEKVE-LMPF 360 + MG+ +K+ + PF Sbjct: 433 EEMGDSYKVFAFQTRNFPNVSPF 455 >gi|67458526|ref|YP_246150.1| hypothetical protein RF_0134 [Rickettsia felis URRWXCal2] gi|67004059|gb|AAY60985.1| Uncharacterized conserved protein [Rickettsia felis URRWXCal2] Length = 407 Score = 222 bits (565), Expect = 8e-56, Method: Composition-based stats. Identities = 115/401 (28%), Positives = 181/401 (45%), Gaps = 59/401 (14%) Query: 8 KIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFL 67 KI I++NG +T D + YY + GDFVTAPEISQ+FGE++ ++ Sbjct: 6 KIRQSIEQNGYITCDVLMQEVLQSNPTSYYKQVKSLASEGDFVTAPEISQLFGEIIGLWC 65 Query: 68 ICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQK 127 I W++ G P + LVELGPGRG++M D+LR L P+F+ LSI ++E ++ QK Sbjct: 66 IKEWQRIGCPKSLSLVELGPGRGLLMRDLLRTAK-LVPEFYKALSIELIEINQNFIAHQK 124 Query: 128 KQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ-HDS 186 L I+ + + D+P T ++ANEFFD++PIKQ++ + ER+ + Sbjct: 125 ANLQDINLPISHRSFVEDIPKKPTIIIANEFFDAMPIKQYIKVKELWYERIFVVQPVDGR 184 Query: 187 LVFNIGDHEIKSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY 243 + ++ + + GA+ E S ++ I+ L G+ ++IDYGY Sbjct: 185 IKYDKISVNKQLQEYLLRTHIEAKDGAVLEESYTSIEIIKFIAQHLKKLSGSCLIIDYGY 244 Query: 244 -------LQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLT 296 + + TLQAVK H Y L N G+ADLS+HVDF L ++A K+ + Sbjct: 245 DLAPGNRTRYQYNPTLQAVKNHKYCPILENLGEADLSAHVDFYTLKTVAKNSKINVIDTI 304 Query: 297 TQGKFLEGLGIWQRAFSLMKQTARKDILLDSVK--------------------------- 329 +Q FL GI R +L + + + + + Sbjct: 305 SQRDFLIENGILLRKQTLQDKLNNRHLFKFAYREEFKGDTETLATAAYTLVREDTSLGST 364 Query: 330 --------------------RLVSTSADKKSMGELFKILVV 350 + V K MG+LFK+L + Sbjct: 365 SKLPLEVEFEKMSEQAGIIEKQVERLISSKQMGKLFKVLQI 405 >gi|285017629|ref|YP_003375340.1| hypothetical protein XALc_0834 [Xanthomonas albilineans GPE PC73] gi|283472847|emb|CBA15352.1| conserved hypothetical protein [Xanthomonas albilineans] Length = 394 Score = 221 bits (564), Expect = 9e-56, Method: Composition-based stats. Identities = 84/378 (22%), Positives = 139/378 (36%), Gaps = 25/378 (6%) Query: 2 ENKLIRKIVNLIKKN-GQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIF 59 N+L+ + I + G + ++ L + P GYYS + FG GDFVTAPE+ +F Sbjct: 16 SNRLVAHLRAEIAASDGAIAFSRFMELALYAPGLGYYSAGSSKFGDAGDFVTAPELGALF 75 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 ++ L Q G R++E+G G G + + +L ++E S Sbjct: 76 AATVSNALAPVLRQLG--PQARMLEVGGGSGAFAEVM---LKRLSALDALPQRYAILEPS 130 Query: 120 ERLTLIQKKQLAS-YGDKINWYTSLADVPLGFTF---LVANEFFDSLPIKQFVMTEHGIR 175 L Q+++LA + D P + L ANE D+LP +FV+ + + Sbjct: 131 ADLRARQRERLARVLIPPVFELLEWLDRPFDDDWNGVLFANEVIDALPTPRFVLRDGEVF 190 Query: 176 ERMIDIDQHDSLVFNIGDHEI-------KSNFLTCSDYFLGAIFENSPCRDREMQSISDR 228 E + +D + + G E P +Q+++ Sbjct: 191 EETVTLDGDGRFQRGEQPADALLAAAVRHVERDLETSLPDGYRSELLPQLPYWIQAVAGG 250 Query: 229 LACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSS 283 L + Y YL R TL+A H PG DL++ VDF L+ Sbjct: 251 LRSGAMLFVDYGYPRREFYLPERNDGTLRAFYRHRSHADVYRWPGLQDLTASVDFTALAE 310 Query: 284 IAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGE 343 + G Q FL G G+ +T L +++ V MGE Sbjct: 311 AGTGAGFDLAGYCNQASFLLGNGLDTLLAEAEARTDDAGRL--RLRQQVKRLTLPSEMGE 368 Query: 344 LFKILVVSHEKVELMPFV 361 F+++ + F+ Sbjct: 369 RFQMMGFERDVALAPAFL 386 >gi|77747653|ref|NP_778900.2| hypothetical protein PD0678 [Xylella fastidiosa Temecula1] gi|182681267|ref|YP_001829427.1| hypothetical protein XfasM23_0712 [Xylella fastidiosa M23] gi|182631377|gb|ACB92153.1| protein of unknown function DUF185 [Xylella fastidiosa M23] gi|307579718|gb|ADN63687.1| hypothetical protein XFLM_08975 [Xylella fastidiosa subsp. fastidiosa GB514] Length = 394 Score = 221 bits (562), Expect = 2e-55, Method: Composition-based stats. Identities = 83/373 (22%), Positives = 136/373 (36%), Gaps = 31/373 (8%) Query: 2 ENKLIRKIVN-LIKKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIF 59 +L I +I+ G + ++ L + P GYYS FG GDF+TAPE+ +F Sbjct: 16 SEQLAAYIRQQIIQSGGAIPFSRFMELALYAPGLGYYSAGASKFGEAGDFITAPELGSLF 75 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 +A L +Q G +C ++ELG G G + + +L ++E S Sbjct: 76 ATTVANALAPVLQQLGALAC--VLELGGGSGAFAEML---LKRLMELHRLPQRYAILEPS 130 Query: 120 ERLTLIQK----KQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIR 175 L Q+ + L + + + ANE D+LP +F + + + Sbjct: 131 AELRQRQQLHLKRTLPPSLFALVEWVDAPFSEQWDGVVFANEVIDALPASRFTIRDGQVY 190 Query: 176 ERMIDIDQHDSLVFNIGDHEI-------KSNFLTCSDYFLGAIFENSPCRDREMQSISDR 228 E + +D V + + G E P +Q++ Sbjct: 191 EATVLLDAQQRFVSGQQPADALLHQAVRHIERDLSVRFADGYCSEVLPQLPYWVQAVVGG 250 Query: 229 LACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYVSPL-VNPGQADLSSHVDFQRLSS 283 L + Y Y R TL+A H PG D+++ VDF L+ Sbjct: 251 LKRGVLLFVDYGYPRAEYYRPERDTGTLRAFYRHRVHDDWYRWPGLQDVTASVDFTALAE 310 Query: 284 IAILYKLYINGLTTQGKFLEGLGIWQRAFSLMK---QTARKDILLDSVKRLVSTSADKKS 340 + G TQ FL G+ R + + K +L + VKRL Sbjct: 311 AGTAAGFDMAGYCTQANFLLSHGL-DRLLAHAEEGVDEVAKLLLRNQVKRLTL----PSE 365 Query: 341 MGELFKILVVSHE 353 MGE F+++ + E Sbjct: 366 MGERFQVMGFARE 378 >gi|28056670|gb|AAO28549.1| conserved hypothetical protein [Xylella fastidiosa Temecula1] Length = 401 Score = 221 bits (562), Expect = 2e-55, Method: Composition-based stats. Identities = 83/373 (22%), Positives = 136/373 (36%), Gaps = 31/373 (8%) Query: 2 ENKLIRKIVN-LIKKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIF 59 +L I +I+ G + ++ L + P GYYS FG GDF+TAPE+ +F Sbjct: 23 SEQLAAYIRQQIIQSGGAIPFSRFMELALYAPGLGYYSAGASKFGEAGDFITAPELGSLF 82 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 +A L +Q G +C ++ELG G G + + +L ++E S Sbjct: 83 ATTVANALAPVLQQLGALAC--VLELGGGSGAFAEML---LKRLMELHRLPQRYAILEPS 137 Query: 120 ERLTLIQK----KQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIR 175 L Q+ + L + + + ANE D+LP +F + + + Sbjct: 138 AELRQRQQLHLKRTLPPSLFALVEWVDAPFSEQWDGVVFANEVIDALPASRFTIRDGQVY 197 Query: 176 ERMIDIDQHDSLVFNIGDHEI-------KSNFLTCSDYFLGAIFENSPCRDREMQSISDR 228 E + +D V + + G E P +Q++ Sbjct: 198 EATVLLDAQQRFVSGQQPADALLHQAVRHIERDLSVRFADGYCSEVLPQLPYWVQAVVGG 257 Query: 229 LACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYVSPL-VNPGQADLSSHVDFQRLSS 283 L + Y Y R TL+A H PG D+++ VDF L+ Sbjct: 258 LKRGVLLFVDYGYPRAEYYRPERDTGTLRAFYRHRVHDDWYRWPGLQDVTASVDFTALAE 317 Query: 284 IAILYKLYINGLTTQGKFLEGLGIWQRAFSLMK---QTARKDILLDSVKRLVSTSADKKS 340 + G TQ FL G+ R + + K +L + VKRL Sbjct: 318 AGTAAGFDMAGYCTQANFLLSHGL-DRLLAHAEEGVDEVAKLLLRNQVKRLTL----PSE 372 Query: 341 MGELFKILVVSHE 353 MGE F+++ + E Sbjct: 373 MGERFQVMGFARE 385 >gi|332969042|gb|EGK08082.1| protein of hypothetical function DUF185 [Kingella kingae ATCC 23330] Length = 408 Score = 220 bits (561), Expect = 2e-55, Method: Composition-based stats. Identities = 89/375 (23%), Positives = 145/375 (38%), Gaps = 29/375 (7%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIF 59 L + I I NG + ++ L + P GYY+ + GA GDF+TAP +S +F Sbjct: 40 SEALHQVIAQEIAAQNGSIPFSRFMELALYAPRLGYYTGGAHKIGAGGDFITAPTLSPLF 99 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 + L+ + Q L E G G G++ +L S+ Y+VE S Sbjct: 100 SKTLSKQIQTLLPQT----AGNLYEFGAGTGVLAAQLLAHCQS------SLQHYYIVELS 149 Query: 120 ERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIR 175 L Q+ + + K+ W L D G L+ NE D++PI++ E+G Sbjct: 150 PDLVARQRDYIAQHAPDFLHKVRWIGELPDTLDG--VLLGNEVLDAMPIERIRRAENGQW 207 Query: 176 ERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG---AIFENSPCRDREMQSISDRLACD 232 +R Q++ + + + + YF E + + +++ +L Sbjct: 208 QRAYVQHQNNEFLLEYKELDKQDMVQAALQYFPEIAPYTSELHLTQHAFITTLAQKLTRG 267 Query: 233 GGTA----IVIDYGYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAIL 287 Y R TL K H+ P G DL++HV+F ++ Sbjct: 268 AMIWIDYGFNAAQYYHPQRQDGTLIGHHKHHSIHDPFYCVGLTDLTAHVNFSDIADAGCS 327 Query: 288 YKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKI 347 L + G T Q FL LGI QT K+ + + V + MGELFK+ Sbjct: 328 AGLDLIGYTIQANFLFNLGILDYLAQDYPQTDSKEYVQAAF--AVQQLTAQHEMGELFKV 385 Query: 348 LVVSHE-KVELMPFV 361 + + V+ FV Sbjct: 386 MAFGKQVDVDWQGFV 400 >gi|307111042|gb|EFN59277.1| hypothetical protein CHLNCDRAFT_8390 [Chlorella variabilis] Length = 400 Score = 220 bits (561), Expect = 2e-55, Method: Composition-based stats. Identities = 111/366 (30%), Positives = 170/366 (46%), Gaps = 40/366 (10%) Query: 2 ENKLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 E L+R + LI+ + G +++ ++ + + +P+ GYYS + FGA GDFVT+PEI Q+ G Sbjct: 31 ETPLVRHLKALIQFRGGPLSLAEFMSEALTNPQHGYYSQRDVFGATGDFVTSPEICQMMG 90 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 EM+ I+ + AW+Q G P+ + LVELGPGRG +M D+LR + F L + MVE S Sbjct: 91 EMVGIWCVAAWQQLGCPATLHLVELGPGRGTLMADLLRGTAAFQ-QFSQALRVSMVEVSP 149 Query: 121 RLTLIQKKQLAS-------------------YGDKINWYTSLADV--PLGFTFLVANEFF 159 L + G ++W+ SL +V G + +A+EF Sbjct: 150 HLRGMHGDDPGPDTNSSSSSGSGGVSGVSGWSGVPVSWHRSLEEVAAEGGPSLYIAHEFL 209 Query: 160 DSLPIKQFVMTEHGIRERMIDIDQHDSL---VFNIGDHEIKSNFLTCSDYFLG------- 209 D+LP+ QF TE G ER++D DS + + + Sbjct: 210 DALPVHQFQKTERGWCERLVDCASPDSPLHLRMVLSPGPTPAARVLLPRRLRQLPQAEAS 269 Query: 210 --AIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNP 267 A E P + ++ R+A GG A++IDYG +LQA++ H +V L P Sbjct: 270 EAAALEVCPQAMALAEGLARRVAQHGGAALIIDYGQDAP-YEASLQAIRQHQFVGLLEGP 328 Query: 268 GQADLSSHVDFQRLSSIAILYK--LYINGLTTQGKFLEGLGIWQRAFSLM--KQTARKDI 323 G AD+S+ VD+ L + G Q FL GLGI R L+ + + Sbjct: 329 GGADISNRVDYSALRATVEESGAAADCLGPIPQALFLLGLGIEARLEQLLVGATPQQAEA 388 Query: 324 LLDSVK 329 L + Sbjct: 389 LQTGYR 394 >gi|254496625|ref|ZP_05109490.1| conserved hypothetical protein [Legionella drancourtii LLAP12] gi|254354147|gb|EET12817.1| conserved hypothetical protein [Legionella drancourtii LLAP12] Length = 369 Score = 220 bits (561), Expect = 2e-55, Method: Composition-based stats. Identities = 83/356 (23%), Positives = 146/356 (41%), Gaps = 25/356 (7%) Query: 19 MTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFP 77 ++ ++ L + P GYYS+ G GDF+TAPE++ +FG+ LA P Sbjct: 16 LSFVEFMQLALYAPGEGYYSSGLQKLGKQGDFITAPELTPLFGKTLANQCQQIMMTLDAP 75 Query: 78 SCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKI 137 L+E G G G + + IL + L + Y++E S L Q + + + Sbjct: 76 V---LLEFGAGTGALCVAILEHLALLNCLP---EAYYIMEVSANLRHRQAEFIRQKIPHL 129 Query: 138 NWYTSLADVPLGFTF---LVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDH 194 + F ++ANE D++P+ +F+ TEHG+ E I +D+ L Sbjct: 130 ADKVIWLERWPETPFNGVILANEVLDAMPVHRFMRTEHGVMESYITLDEQQQLKEVFKPC 189 Query: 195 EIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLAC--DGGTAIVIDY------GYLQS 246 + + + + + + G ++IDY Y Sbjct: 190 QNLRLLDYINQRLHLETVPYLSEANLFIDDWILNIYRILNQGVVLLIDYGFPRHEYYHPD 249 Query: 247 RVGDTLQAVKGHTYVS-PLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGL 305 R TL H PL++PG+ D+++HVDF ++ ++ G T Q FL Sbjct: 250 RQQGTLMCHYQHHSHPEPLLHPGEQDITAHVDFTHVAEAGQQAGFHVAGYTNQASFLLAN 309 Query: 306 GIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHE-KVELMPF 360 G+ SL + + ++K+L MGELFK++ ++ +++L F Sbjct: 310 GLLSFVNSL-ENEVEQLKAKQAIKQLTQ----PSEMGELFKVIALNKNMEIDLNGF 360 >gi|154337892|ref|XP_001565172.1| hypothetical protein [Leishmania braziliensis MHOM/BR/75/M2904] gi|134062219|emb|CAM36607.1| conserved hypothetical protein [Leishmania braziliensis MHOM/BR/75/M2904] Length = 416 Score = 220 bits (561), Expect = 2e-55, Method: Composition-based stats. Identities = 110/383 (28%), Positives = 182/383 (47%), Gaps = 30/383 (7%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYS-TCNPFG-AVGDFVTAPEISQIF 59 + L ++VN I G + Q+ C+ P++GYY+ + G DF+TA EI F Sbjct: 25 KTALCNELVNKITAQGYYPMSQFIKDCLTHPQYGYYAAKKHVIGGEKADFITAAEI-PFF 83 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 G++LA +++ AW++ G P + LVE+GPGRG +M +L+ I P L I+++E Sbjct: 84 GDVLAAWVMDAWQKMGTPRVLHLVEMGPGRGTLMKTMLKQIQYSNPHLVHFLQIHLIEVG 143 Query: 120 ERLTLIQKKQLASYGD---KINWYTSLADVPLG--FTFLVANEFFDSLPIKQFVMTEHGI 174 QK+ LA++ KI W+ L +P T +ANE+FD+LP+ QF TE G Sbjct: 144 AARREEQKRALAAFQTAQGKIKWWMDLESLPFSLEPTVFIANEYFDALPVAQFRYTERGW 203 Query: 175 RERMIDIDQHDSLVFNIGDHEIKSNFLTCS--------DYFLGAIFENSPCRDREMQSIS 226 E +++D + S ++ G E + + M+++ Sbjct: 204 VETCVEVDTDPGTEAHFRLVHAPSGSMSAYLIPDDVRTKGKSGDCVEVNTVGMQTMETLM 263 Query: 227 DRLA-CDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIA 285 ++ C ++IDYG + TL+ ++GH +V PL++PG+ DLS V F++L + Sbjct: 264 KKMMDCQKAACLLIDYG-KDDHMHSTLRGIRGHRFVDPLLSPGEVDLSCWVSFRQLRWVL 322 Query: 286 ILY-----KLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDIL--LDSVKRLVSTSADK 338 L + TQ FL GI R ++K K + L + +RL+ DK Sbjct: 323 ERLEVARRHLKWFPVMTQHDFLLENGIDVRLAHVIKDEETKTAMKVLQNYRRLM----DK 378 Query: 339 KSMGELFKILVVSHEKVE-LMPF 360 + MGE +K+ + PF Sbjct: 379 EEMGESYKVFAFQTRNFPNVSPF 401 >gi|124265744|ref|YP_001019748.1| hypothetical protein Mpe_A0551 [Methylibium petroleiphilum PM1] gi|124258519|gb|ABM93513.1| conserved hypothetical protein [Methylibium petroleiphilum PM1] Length = 358 Score = 220 bits (561), Expect = 2e-55, Method: Composition-based stats. Identities = 83/373 (22%), Positives = 144/373 (38%), Gaps = 41/373 (10%) Query: 5 LIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTCNPF-----GAVGDFVTAPEISQI 58 + +I I + G + ++Y L + P GYY + G+ DFVTAPE+S + Sbjct: 1 MKARIATEIERAGGWLGFERYMELVLYAPGLGYYVRGDRQFGLMPGSGSDFVTAPELSPL 60 Query: 59 FGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVET 118 FG LA + A + G + E G G G + +L + + + +V+ Sbjct: 61 FGRALARQMGQALDATG---TREVWEFGAGSGALATQLLEALGE------RIDRYVIVDL 111 Query: 119 SERLTLIQKKQL-----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHG 173 S L Q+ + + K W L + G LV NE D++P+ H Sbjct: 112 SGALRERQRTTIAARVPPQHAAKAVWLDRLPERIDG--VLVGNEVLDAMPVALLHFDGHQ 169 Query: 174 IRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDG 233 ER + F D +D G E + +++++ L Sbjct: 170 WLERGVVFSDG---RFMYADRPTSLRPPLAADRVPGTTVELQRQGEAFIRTLAATLRRGA 226 Query: 234 GTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSIAILY 288 I + + R+G TL + H PL + G D+++HVDF ++ Sbjct: 227 AFFIDYGFPEAEFHHPQRIGGTLMCHRAHQADGDPLSDVGDKDITAHVDFTGIALAGQNA 286 Query: 289 KLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKIL 348 L + G TTQ +FL G+ + + ++L++ + MGELFK++ Sbjct: 287 GLAVLGYTTQARFLMNCGLLDDLQ------GADLRAIAAAQKLLT----EHEMGELFKVI 336 Query: 349 VVSH-EKVELMPF 360 + E + + F Sbjct: 337 GFAAGEPFDAIGF 349 >gi|241766474|ref|ZP_04764344.1| protein of unknown function DUF185 [Acidovorax delafieldii 2AN] gi|241363326|gb|EER58856.1| protein of unknown function DUF185 [Acidovorax delafieldii 2AN] Length = 376 Score = 220 bits (561), Expect = 2e-55, Method: Composition-based stats. Identities = 85/358 (23%), Positives = 148/358 (41%), Gaps = 38/358 (10%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFG----AVGDFVTAPEISQIFGEMLAIFLICAWE 72 + D++ AL + P GYY+ FG + DFVTAPE++ +FG+ LA + A Sbjct: 33 WIGFDRFMALALYTPGLGYYANDSTKFGTMPESGSDFVTAPELTPLFGQTLAEQVAEALA 92 Query: 73 QHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLAS 132 Q G + E G G G + L +L + V +V+ S L Q+ LA Sbjct: 93 QTG---THEVWEFGAGSGALALQLLDALGD------RVHRYTIVDLSGSLRARQQALLAR 143 Query: 133 YGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHG----IRERMIDIDQHDSLV 188 + K+ W +L G +V NE D++P++ ER + + + V Sbjct: 144 HAHKLQWVDALPAQFEG--VVVGNEVLDAMPVQLLARQGGQEGGAWHERGVALGDGGAFV 201 Query: 189 FNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDY----GYL 244 + ++ + + E + +++++DRL + + Y Sbjct: 202 WADRPTSLRPPVEI--EGPHDYLTEIHAQGEAFIRTLADRLVRGAIFLLDYGFGESEYYH 259 Query: 245 QSRVGDTLQAVKGHTYVS-PLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLE 303 R T+ + H PLV+ G D+++HV+F ++ A L + G TTQ FL Sbjct: 260 PQRAMGTVMCHQAHQADDNPLVSVGLKDITAHVNFTAMALAAQEAGLEVLGYTTQAHFLI 319 Query: 304 GLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSH-EKVELMPF 360 G+ Q+ L + ++VK + MGELFK+L ++ ++ F Sbjct: 320 NCGLLQKMEQL-----PQAARANAVK-----LIMEHEMGELFKVLALAKGASWNVLGF 367 >gi|145543013|ref|XP_001457193.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124425008|emb|CAK89796.1| unnamed protein product [Paramecium tetraurelia] Length = 429 Score = 220 bits (561), Expect = 2e-55, Method: Composition-based stats. Identities = 99/408 (24%), Positives = 175/408 (42%), Gaps = 57/408 (13%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 LI + + I + G ++ +++ + +FGYY + F GDFVT+ EISQ+F E++ Sbjct: 21 SLINIVRSEIAEKGAISFPRFWHQALLHEKFGYYMKQDVFNKDGDFVTSVEISQMFNELI 80 Query: 64 AIFLICAWEQHG--------FPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 I+L+ ++ G + ++E GPG G + ++LRV + + L Sbjct: 81 GIWLLNTFQHIGVLDNNFRPTNQKMHILEFGPGLGTLSSNVLRVFAQF--NLLENLQYSY 138 Query: 116 VETSERLTLIQKKQLASYGDKINWYTSLAD------------------------------ 145 VE S+ + Q++ + K N Y Sbjct: 139 VEYSDYMRKKQQEAVLKQLQKSNIYPKFEYNNQKVEKFESDEVNLKWFKMYEHFLYEETR 198 Query: 146 --VPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSL-VFNIGDHEIKSNFLT 202 ++A+EFFD+LP+ F + ER++ +D F + D ++ F + Sbjct: 199 EGQHCDPVLILAHEFFDALPVNVFEYSNGQWCERLVGLDNDGKTLKFVLSDGPNQNYFSS 258 Query: 203 CSDY--FLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTY 260 G E SP S+++ ++ GG A++IDYG + D+++A++ H Sbjct: 259 EIKKLIKEGDTIEISPQSGVIANSLAELISKVGGAALIIDYG-EKRAFSDSVRAIQRHKL 317 Query: 261 VS---PLVNPGQADLSSHVDFQRLSSIAIL-YKLYINGLTTQGKFLEGLGIWQRAFSLMK 316 + L PG DLS++V+F L A+ + + + TQG +LE +GI R L K Sbjct: 318 MDKKDILNKPGDCDLSAYVNFMALEQAALKVDGIKVPEIMTQGNWLEQMGIQARLQILCK 377 Query: 317 QTAR--KDILLDSVKRLVSTSADKKSMGELFKILVVSH-EKVELMPFV 361 + + L +RLV + MG FK + + + L PF+ Sbjct: 378 NANKATEKRLQSEYERLVHS----NQMGSNFKFMCIHRTKNNNLFPFI 421 >gi|325095113|gb|EGC48423.1| DUF185 domain-containing protein [Ajellomyces capsulatus H88] Length = 498 Score = 220 bits (561), Expect = 2e-55, Method: Composition-based stats. Identities = 117/463 (25%), Positives = 183/463 (39%), Gaps = 112/463 (24%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNP-------FGAVGDFVTAPE 54 L + I I G +++ Y C+ P+ GYY++ FGA GDFVT+PE Sbjct: 38 STPLAKSIAEAINVTGPVSIATYIRQCLTSPDGGYYTSRGQEDEDTALFGAKGDFVTSPE 97 Query: 55 ISQIFGEMLAIFLICAWEQHGFPS-CVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSI 113 ISQIFGE+L ++ + W G S V+++E GPG+G +M D+LR ++ ++ Sbjct: 98 ISQIFGELLGVWTVTEWMGQGRKSGGVQIIEFGPGKGTLMGDMLRSFAS------AIEAV 151 Query: 114 YMVETSERLTLIQKKQLASYGD--------------------KINWYTSLADVPLGFTFL 153 Y+VETS L +Q+K L L + F+ Sbjct: 152 YLVETSPVLREVQRKLLCGDTPMEEVEVGYKSTSVHLGVPVIWTEHIKLLPNESDKTPFI 211 Query: 154 VANEFFDSLPIKQF--------------------------VMTEHGIRERMIDIDQHDSL 187 A+EFFD+LPI F + + H R + + + Sbjct: 212 FAHEFFDALPIHAFQSIQTPAPSQTTINTPTGPTTLHQPPISSPHTTEWRELVVSPNPET 271 Query: 188 VFNIGDHEIKSNFLTCS-------------------DYFLGAIFENSPCRDREMQSISDR 228 E + G+ E SP +Q I+ R Sbjct: 272 PEVKSGQEPEFRLSLAKASTPSSLVLPEMSSRYKALKSTPGSTIEISPESQACVQDIARR 331 Query: 229 LAC--------------------DGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPG 268 + G A+++DYG + ++L+ ++ H VSPLV PG Sbjct: 332 IGGGGGLVSATSPGVTDTLKNKVPSGAALILDYGTTSTIPINSLRGIRNHRLVSPLVAPG 391 Query: 269 QADLSSHVDFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLMK------QTAR 320 + D+S+ VDF L+ AI + + G QG FLE LGI +RA L++ + Sbjct: 392 EVDISADVDFTALAEAAIDASPGVEVYGPMEQGPFLEALGISERAAQLLRRMEGEGDEEK 451 Query: 321 KDILLDSVKRLVSTSADKKSMGELFKILVVSHE---KVELMPF 360 + ++ KRLV MG+L+K L + E K + F Sbjct: 452 RKLIESGWKRLVERGG--GGMGKLYKALAIVPESGGKRRPVGF 492 >gi|71274768|ref|ZP_00651056.1| Protein of unknown function DUF185 [Xylella fastidiosa Dixon] gi|71901414|ref|ZP_00683505.1| Protein of unknown function DUF185 [Xylella fastidiosa Ann-1] gi|170729987|ref|YP_001775420.1| hypothetical protein Xfasm12_0801 [Xylella fastidiosa M12] gi|71164500|gb|EAO14214.1| Protein of unknown function DUF185 [Xylella fastidiosa Dixon] gi|71728819|gb|EAO30959.1| Protein of unknown function DUF185 [Xylella fastidiosa Ann-1] gi|167964780|gb|ACA11790.1| conserved hypothetical protein [Xylella fastidiosa M12] Length = 394 Score = 220 bits (560), Expect = 3e-55, Method: Composition-based stats. Identities = 83/373 (22%), Positives = 135/373 (36%), Gaps = 31/373 (8%) Query: 2 ENKLIRKIVNL-IKKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIF 59 +L I I+ G + ++ L + P GYYS FG GDF+TAPE+ +F Sbjct: 16 SEQLAAYIRQQMIQSGGAIPFSRFMELALYAPGLGYYSAGASKFGEAGDFITAPELGSLF 75 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 +A L +Q G +C ++ELG G G + + +L ++E S Sbjct: 76 ATTVANALAPVLQQLGALAC--VLELGGGSGAFAEML---LKRLMELHRLPQRYAILEPS 130 Query: 120 ERLTLIQK----KQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIR 175 L Q+ + L + + + ANE D+LP +F + + + Sbjct: 131 AELRQRQQLHLKRTLPPSLFALVEWVDAPFSEQWDGVVFANEVIDALPASRFTIRDGQVY 190 Query: 176 ERMIDIDQHDSLVFNIGDHEI-------KSNFLTCSDYFLGAIFENSPCRDREMQSISDR 228 E + +D V + + G E P +Q++ Sbjct: 191 EATVLLDAQQRFVSGQQPADALLHQAVRNIERDLSVRFADGYCSEVLPQLPYWVQAVVGG 250 Query: 229 LACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYVSPL-VNPGQADLSSHVDFQRLSS 283 L + Y Y R TL+A H PG D+++ VDF L+ Sbjct: 251 LKRGVLLFVDYGYPRAEYYRPERDTGTLRAFYRHRVHDDWYRWPGLQDVTASVDFTALAE 310 Query: 284 IAILYKLYINGLTTQGKFLEGLGIWQRAFSLMK---QTARKDILLDSVKRLVSTSADKKS 340 + G TQ FL G+ R + + K +L + VKRL Sbjct: 311 AGTAAGFDMAGYCTQANFLLSHGL-DRLLAHAEEGVDEVAKLLLRNQVKRLTL----PTE 365 Query: 341 MGELFKILVVSHE 353 MGE F+++ + E Sbjct: 366 MGERFQVMGFARE 378 >gi|149238706|ref|XP_001525229.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL YB-4239] gi|146450722|gb|EDK44978.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL YB-4239] Length = 612 Score = 220 bits (560), Expect = 3e-55, Method: Composition-based stats. Identities = 107/481 (22%), Positives = 178/481 (37%), Gaps = 129/481 (26%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFG-AVGDFVTAPEISQIFGEM 62 L I+ G + + Y C+ P+FGYY+T +P GDF+T+PEIS +FGEM Sbjct: 137 SLSDFFRQTIRLTGPIPLSAYMRQCLTHPDFGYYTTRDPLNLKTGDFITSPEISSVFGEM 196 Query: 63 LAIFLICAWEQHGF---PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV---LSIYMV 116 + I+ W+ P +R +E GPG+G ++ D+L K + + I M+ Sbjct: 197 IGIWFFNIWQTSKGKTPPKNIRFIEFGPGKGTLIHDVLHTFNKFVTTVSDIKPKIEIVMI 256 Query: 117 ETSERLTLIQKKQLAS-----------YGDKINWYTSLADVPLGF--------------- 150 E S L Q+ L + + + + Sbjct: 257 EASPFLRKEQQNLLCDTTKEFKKNDAGFDTSVTKWGNDIIWVDTEKSIPKGGSGKDGNGN 316 Query: 151 -----------TFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ---------------- 183 +++A+EFFD+LPIK F+ E G RE +++ Sbjct: 317 NINAEDDNRLVNYIIAHEFFDALPIKSFIRQEDGWRELVVEHTPSVDTSSQLKLETRQDP 376 Query: 184 -------------------------------------HDSLVFNIGDHEIKSNFLT---- 202 +I E S+ + Sbjct: 377 QTHFISTSGGLSRQEQEQKQKQKQEQEQEQEQLQEQLDTEFHLSISPKETPSSMIPKISK 436 Query: 203 -CSDYFLGAIFENSPCRDREMQSISDRLA-----------------CDGGTAIVIDYGYL 244 D +G E P + + I+ ++ D G A+V+DYG Sbjct: 437 RYKDLPVGTRIEICPDAELFIMKIAQLVSGNVDGSGGGGDAASFGGQDEGAALVVDYGPS 496 Query: 245 QSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEG 304 +TL+ + H +VSP PG+ DLS VDFQ L ++ + G QG +L Sbjct: 497 SEIPENTLRGIYQHKFVSPFYKPGEVDLSVDVDFQNLKNLTEAV-CNVYGPIQQGDWLHN 555 Query: 305 LGIWQRAFSLMKQTAR----KDILLDSVKRLVSTSADKKSMGELFKILVVSHEKV-ELMP 359 +GI R L+K+ A +D + + +RLV D MG+++K + + + + + Sbjct: 556 IGIGYRVDQLLKKNANDEEMQDKIYGAYRRLV----DDDQMGKVYKFMALLPKGAQKPIG 611 Query: 360 F 360 F Sbjct: 612 F 612 >gi|157869746|ref|XP_001683424.1| hypothetical protein [Leishmania major strain Friedlin] gi|68126489|emb|CAJ04517.1| conserved hypothetical protein [Leishmania major strain Friedlin] Length = 447 Score = 219 bits (559), Expect = 3e-55, Method: Composition-based stats. Identities = 111/383 (28%), Positives = 181/383 (47%), Gaps = 30/383 (7%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYS-TCNPFG-AVGDFVTAPEISQIF 59 + L ++VN I G + Q+ C+ P++GYY+ N G DF+TA EI F Sbjct: 56 KTALCNELVNKITAQGYYPMSQFVKDCLTHPQYGYYAAKKNVIGSEKADFITAAEI-PFF 114 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 G++LA +++ AW++ G P + LVE+GPGRG +M +L+ I P L I++VE Sbjct: 115 GDVLAAWVMDAWQKMGTPRVLHLVEMGPGRGTLMRTMLKQIQYSNPHLLHFLQIHLVEVG 174 Query: 120 ERLTLIQKKQLASYGD---KINWYTSLADVPLG--FTFLVANEFFDSLPIKQFVMTEHGI 174 QK+ LA + KI W+ +L +P T +ANE+FD+LP+ QF TE G Sbjct: 175 AARREEQKRALAEFQTAQGKIKWWMNLESLPFSLEPTVFMANEYFDALPVAQFRYTERGW 234 Query: 175 RERMIDIDQHDSLVFNIGDHEIKSNFLTCS--------DYFLGAIFENSPCRDREMQSIS 226 E +++D + S ++ G E + + ++++ Sbjct: 235 VETCVEVDTDPGTEAHFRLVHAPSGSMSAYLIPDDIRTKGKTGDCVEVNTVGMQTIETLM 294 Query: 227 DRLA-CDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIA 285 ++ C ++IDYG + TL+ ++GH +V PL++PG+ DLS V F++L Sbjct: 295 KKMMDCQKAACLLIDYG-KDDHMHSTLRGIRGHRFVDPLLSPGEVDLSCWVSFRQLRWAL 353 Query: 286 ILY-----KLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDIL--LDSVKRLVSTSADK 338 L + TQ FL GI R ++K K + L + +RL+ DK Sbjct: 354 ERLEVARQHLKWFPVMTQHDFLLENGIDIRLAHVIKDEETKTAMKVLQNYRRLM----DK 409 Query: 339 KSMGELFKILVVSHEKVE-LMPF 360 + MGE +K+ + PF Sbjct: 410 EEMGESYKVFAFQTRNFPNVSPF 432 >gi|193643457|ref|XP_001944630.1| PREDICTED: protein midA homolog, mitochondrial-like [Acyrthosiphon pisum] Length = 421 Score = 219 bits (559), Expect = 4e-55, Method: Composition-based stats. Identities = 123/392 (31%), Positives = 192/392 (48%), Gaps = 48/392 (12%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 ++ L + + I+ NG +T+ +Y + YY++ N FG+ GDF+T+PEISQ++G Sbjct: 25 VQQNLTKYFQDKIRINGPITLAEYMRESLKT----YYNSGNVFGSDGDFITSPEISQLYG 80 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 EM+ ++L+ WE+ G PS V L+ELGPG G+MM D+LR++ + + LSI+MVETS+ Sbjct: 81 EMVMLWLLSLWEKAGCPSPVNLIELGPGTGVMMTDMLRLLKQTQYSSLD-LSIHMVETSK 139 Query: 121 RLTLIQKKQLASY---------------------GDKINWYTSLAD-VPLGFTFLVANEF 158 + +L Q +L +I WY S+ D F+ +VA EF Sbjct: 140 KCSLEQADKLGCSDLRTDSGKCYYQHGITSEKYGMKEIFWYESIDDIPRNTFSLIVAQEF 199 Query: 159 FDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSN---------FLTCSDYFLG 209 FD+LP+ +F RE +IDI+ S F + + + CS+ Sbjct: 200 FDALPVHKFRKINEKWREIVIDIENDISGTFRYVLSKTSTTTSNLISTIDYYKCSNLKNV 259 Query: 210 AIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQ 269 E MQ +S+R+ DGG + IDYG + + DT +A GH V+PL PG Sbjct: 260 NEIEVGLDAAIVMQKLSERIILDGGIFLCIDYGQDKPII-DTFRAFSGHQQVNPLHKPGT 318 Query: 270 ADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLM---KQTARKDILLD 326 DL++ VDF RL ++A + G Q FLE + I R +L+ K L Sbjct: 319 VDLTADVDFNRLKNVAGT-DVLAFGPIGQNIFLENMMINLRYQNLLTLTKNEKEATSLTF 377 Query: 327 SVKRLVSTSADKKSMGELFKILVVSHEKVELM 358 +L+ MG+ FKI+ + + + Sbjct: 378 GYNKLM-------EMGKKFKIMGMVPATMAPL 402 >gi|71897625|ref|ZP_00679870.1| Protein of unknown function DUF185 [Xylella fastidiosa Ann-1] gi|71732528|gb|EAO34581.1| Protein of unknown function DUF185 [Xylella fastidiosa Ann-1] Length = 391 Score = 219 bits (559), Expect = 4e-55, Method: Composition-based stats. Identities = 82/373 (21%), Positives = 136/373 (36%), Gaps = 31/373 (8%) Query: 2 ENKLIRKIVNL-IKKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIF 59 +L I I+ G + ++ L + P GYYS FG GDF+TAPE+ +F Sbjct: 13 SEQLAAYIRQQMIQSGGAIPFSRFMELALYAPGLGYYSAGASKFGEAGDFITAPELGSLF 72 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 +A L +Q G +C ++ELG G G + + +L ++E S Sbjct: 73 ATTVANALAPVLQQLGALAC--VLELGGGSGAFAEML---LKRLMELHRLPQRYAILEPS 127 Query: 120 ERLTLIQK----KQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIR 175 L Q+ + L + + + ANE D+LP +F +++ + Sbjct: 128 AELRQRQQLHLKRTLPPSLFALVEWVDAPFSEQWDGVVFANEVIDALPASRFTISDGQVY 187 Query: 176 ERMIDIDQHDSLVFNIGDHEI-------KSNFLTCSDYFLGAIFENSPCRDREMQSISDR 228 E + +D V + + G E P ++++ Sbjct: 188 EATVLLDAQQRFVSGQQPADALLHQAVRHIERDLSVRFADGYCSEVLPQLPYWVKAVVGG 247 Query: 229 LACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYVSPL-VNPGQADLSSHVDFQRLSS 283 L + Y Y R TL+A H PG D+++ VDF L+ Sbjct: 248 LKRGVLLFVDYGYPRAEYYRPERDTGTLRAFYRHRVHDDWYRWPGLQDVTASVDFTALAE 307 Query: 284 IAILYKLYINGLTTQGKFLEGLGIWQRAFSLMK---QTARKDILLDSVKRLVSTSADKKS 340 + G TQ FL G+ R + + K +L + VKRL Sbjct: 308 AGTAAGFDMAGYCTQANFLLSHGL-DRLLAHAEEGVDEVAKLLLRNQVKRLTL----PSE 362 Query: 341 MGELFKILVVSHE 353 MGE F+++ + E Sbjct: 363 MGERFQVMGFARE 375 >gi|16331364|ref|NP_442092.1| hypothetical protein slr0351 [Synechocystis sp. PCC 6803] gi|1001535|dbj|BAA10162.1| slr0351 [Synechocystis sp. PCC 6803] gi|1256598|gb|AAB72122.1| ORF416 [Synechocystis sp. PCC 6803] Length = 416 Score = 219 bits (558), Expect = 5e-55, Method: Composition-based stats. Identities = 89/374 (23%), Positives = 164/374 (43%), Gaps = 25/374 (6%) Query: 1 MENKLIRKIVNLIKK--NGQMTVDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEISQ 57 M ++ +++ I++ + ++T ++ + P++GYYS+ GDFVTA + Sbjct: 19 MTQNILPALLDHIRQSPHQRLTFAEFMEWVLYQPDYGYYSSGQVDIAIRGDFVTAIALGA 78 Query: 58 IFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVE 117 FGE+LA + W++ G P L+ELG G G +L +L P+FF+ L +++E Sbjct: 79 DFGELLAEQFLEMWQRLGEPDRFDLLELGAGTGAFAQTVLAQTQRLYPEFFAALHYHIIE 138 Query: 118 TSERLTLIQKKQLASYGDKIN-WYTSLADVPLGFTF--LVANEFFDSLPIKQFVMTEHGI 174 S L Q L + + + + ++ +NEFFD+LP+ + + + + Sbjct: 139 ESAALRRRQANLLEPWREMGKIQWQTWPELANDSLVGCCFSNEFFDALPVHRVGIKQGVL 198 Query: 175 RERMIDIDQHD--SLVFNIGDHEIKSNFLTC------SDYFLGAIFENSPCRDREMQSIS 226 +E+ I+ D S+ ++ ++ F + Y E + + ++ I+ Sbjct: 199 KEQYIEAGSTDLTSVWDDLSTDKLVDYFASFGLQLTSPPYGENYETEVNLAAQQALEEIA 258 Query: 227 DRLACDGGTAIVIDY------GYLQSRVGDTLQAVKGHTYVS-PLVNPGQADLSSHVDFQ 279 LA G + +DY Y R G TLQ + P VN GQ D+++HV+F Sbjct: 259 RVLAQ--GWVLTVDYGHPAGKYYHPQRTGGTLQCYFQQRHHDNPFVNLGQQDVTAHVNFT 316 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKK 339 L Y L + QG FL LG+ R L + + + ++ D Sbjct: 317 ALEWWGECYGLNRLDFSQQGPFLMNLGLGDRLADLSSGQYDLNEIFSR-RAILHQLIDPN 375 Query: 340 SMGELFKILVVSHE 353 +G+ F +L+ + Sbjct: 376 GLGK-FGVLLQGKK 388 >gi|121603432|ref|YP_980761.1| hypothetical protein Pnap_0519 [Polaromonas naphthalenivorans CJ2] gi|120592401|gb|ABM35840.1| protein of unknown function DUF185 [Polaromonas naphthalenivorans CJ2] Length = 368 Score = 219 bits (557), Expect = 7e-55, Method: Composition-based stats. Identities = 89/379 (23%), Positives = 153/379 (40%), Gaps = 45/379 (11%) Query: 1 MENKLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTCN-PFG------AVG-DFVT 51 + + L I I+ G + D + AL + P GYY+ + FG G DFVT Sbjct: 7 LTSPLQADIAKAIESAGGWLGFDVFMALALYQPGLGYYARDSLKFGLMPGGVKGGSDFVT 66 Query: 52 APEISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVL 111 APE+S FG+ LA + A + G + E G G G + +L + + V Sbjct: 67 APELSPRFGQTLARQVAQALQASG---TTEVWEFGAGSGALAQQVLDTLGE------QVT 117 Query: 112 SIYMVETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 +V+ S L Q+++LA++ K+ W T L G ++ NE D++P+K Sbjct: 118 RYTIVDLSSSLRERQRERLAAHAGKVQWATELPARMRG--VVLGNEVLDAMPVKLLARLG 175 Query: 172 HGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLAC 231 ER + + F D + E P + +++++DRL Sbjct: 176 GVWHERGVVLH---EGRFTYADCPTDLRPPLEVAGRHDYVTEIHPQAEGFVRTLADRLEA 232 Query: 232 DGGTAIVIDY------GYLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSI 284 G A+++DY Y R T+ + H L + G+ D+++HV+F ++ Sbjct: 233 --GAALLLDYGFPEHEYYHPQRDMGTVMCHRAHLLDADALADVGEKDITAHVNFTGIALA 290 Query: 285 AILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGEL 344 L + G T+QG+F + + + +V ++ MGEL Sbjct: 291 GQEAGLQVLGYTSQGRF----------LLNCGLLGGLEDASLAERAMVQKLVNEHEMGEL 340 Query: 345 FKILVVSHEK---VELMPF 360 FK++ + E M F Sbjct: 341 FKVIAFGAKNSPAWEPMGF 359 >gi|254491150|ref|ZP_05104331.1| conserved hypothetical protein [Methylophaga thiooxidans DMS010] gi|224463663|gb|EEF79931.1| conserved hypothetical protein [Methylophaga thiooxydans DMS010] Length = 379 Score = 218 bits (556), Expect = 9e-55, Method: Composition-based stats. Identities = 98/376 (26%), Positives = 156/376 (41%), Gaps = 32/376 (8%) Query: 2 ENKLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIF 59 ++L I I ++ G ++ Y C+ P FGYYS + G GDF TAPEIS +F Sbjct: 7 SDQLKALITAEIEQQGGFISFADYMQRCLYQPGFGYYSAGSHKLGQGGDFTTAPEISPLF 66 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 G +A + A +Q + ++E G G G + + +L + L +++E S Sbjct: 67 GYAVANQVHDALQQ---CTSKHILEFGAGSGQLAIAMLTQLESLNALP---ERYFILEIS 120 Query: 120 ERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIR 175 L Q + + D++ W +L DV G ++ANE D++P+ +T G+ Sbjct: 121 ADLQARQLQLIEQQRPDLADRVEWIQTLPDVFNG--VMLANEVCDAMPVHLLRLTTSGMF 178 Query: 176 ERMIDIDQHDSLVFNIGDHEIKS-------NFLTCSDYFLGAIFENSPCRDREMQSISDR 228 ER + I ++ F D ++ + + E + MQ+ + Sbjct: 179 ERGVSI---ENGHFIWQDKKLAHPRLIAFAERINEPSQAEDYLTEVNLNAVDWMQTAAAS 235 Query: 229 LACDGGTAIVIDYG----YLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSS 283 L I Y Y RV TL+ + PL PG D+S+HVDF L+ Sbjct: 236 LQQGAIFIIDYGYPFNDYYAAERVQGTLRSYYRQQAIDDPLQLPGLQDISAHVDFTTLAE 295 Query: 284 IAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGE 343 A+ ++ G QG FL I Q A L + L S + MG Sbjct: 296 TALEGGCHVAGFHEQGDFLVAANITQIAAQLEAKKDALSWLQHSA--ALKQLLMPHVMGY 353 Query: 344 LFKILVVSHEKVELMP 359 FK+L +S +EL+P Sbjct: 354 QFKVLSLSKA-IELLP 368 >gi|289207339|ref|YP_003459405.1| hypothetical protein TK90_0153 [Thioalkalivibrio sp. K90mix] gi|288942970|gb|ADC70669.1| protein of unknown function DUF185 [Thioalkalivibrio sp. K90mix] Length = 398 Score = 218 bits (555), Expect = 1e-54, Method: Composition-based stats. Identities = 91/371 (24%), Positives = 150/371 (40%), Gaps = 30/371 (8%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIF 59 M + L + I G + Y + P GYY GA GDFVTAPE+S +F Sbjct: 22 MHDSLQQSIA---AHGGFLPFVDYMHQALYAPGLGYYVNGARKLGAGGDFVTAPELSSLF 78 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 GE LA +L +L+E G G G + D+L + +L + ++E S Sbjct: 79 GETLASWLAPLLRDE-LAGGGQLLEFGAGSGRLAGDVLVTLRELGVGWQ---CYKIIEVS 134 Query: 120 ERLTLIQKKQL-----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGI 174 L +Q++ L ++ W +L + P+ ++ANE D+LP++ F E Sbjct: 135 PDLRAVQQEHLAERLEPEEYARVEWLDALPEEPIR-GVVLANEVLDALPVELFRWREGQP 193 Query: 175 RERMIDIDQHDSLVFNIGDHEIKSNFLTCS------DYFLGAIFENSPCRDREMQSISDR 228 + + +D LV + + G E P + + S++ Sbjct: 194 WQMGVTVDGDGRLVLAEQSAPEPLADVVRELQAVHGPWPDGYTSEWRPAQAAWVASVAAC 253 Query: 229 LACDGGTAIVIDY------GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRL 281 L G A+++DY Y R TL + + PL PG DL++ VDF + Sbjct: 254 LEQ--GVALLVDYGFPRAAYYAAERHQGTLVGYYRQQMMLDPLAQPGLMDLTASVDFTAV 311 Query: 282 SSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSM 341 + A L + G QG+FL G G+ QR + + + + + V M Sbjct: 312 AEAADAAGLDVLGYAAQGEFLLGAGLAQRFEARVGSGEDARKTMQAAQ-AVRMLTLPGEM 370 Query: 342 GELFKILVVSH 352 GE F+++ + Sbjct: 371 GERFQVMTLGR 381 >gi|223992673|ref|XP_002286020.1| predicted protein [Thalassiosira pseudonana CCMP1335] gi|220977335|gb|EED95661.1| predicted protein [Thalassiosira pseudonana CCMP1335] Length = 431 Score = 218 bits (555), Expect = 1e-54, Method: Composition-based stats. Identities = 104/434 (23%), Positives = 180/434 (41%), Gaps = 81/434 (18%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-----NPFGAVGDFVTAPEISQIF 59 + +++ I G +TV ++ + D +GYY++ GA GDF+TAPE+SQ+F Sbjct: 1 MTDELIAYIGVRGPITVAEFMRRVLRDGRYGYYTSKGSRREQVIGAAGDFITAPEVSQLF 60 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 GE L ++L+ ++ G P+ ++L+E+GPG+G ++ DI+R K + +++VE + Sbjct: 61 GESLLVWLMTQYQSLGSPAKIQLIEIGPGKGTLICDIVRSGEKRHKVC---VGVHLVEVT 117 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPL--------------GFTFLVANEFFDSLPIK 165 + QK+ + + + + + TF++ E D+LPI Sbjct: 118 NGMRSRQKESIRNLQKETSVNVISFEWHDVLSSVPIHDDSGDPIPTFVICQELVDALPIH 177 Query: 166 QFVMTEHG-IRERMIDIDQHDS----------LVFNIGDHEIKSNFLTCSDYFLGAIFEN 214 F E RER++D+ D + + + + +G+I E Sbjct: 178 SFQKIEGNLWRERLVDVAIRDDSEASDAAKDVHLAVAPPSDDNHASTSLNSLPVGSIIEA 237 Query: 215 SPCRDREMQSISDRLA-CDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLS 273 P +Q I+DR+ C+GG A++IDYG + GDTL+ HT V PL PG+ D++ Sbjct: 238 CPEGLILVQDIADRIQNCNGGAALIIDYGENGASGGDTLRGFWRHTQVHPLSRPGEVDVT 297 Query: 274 SHVDFQRL------------------SSIAILYKLYI----------------------- 292 + VDF L A K Sbjct: 298 ADVDFGALREAVNRRVTFEDSFERKRKENAYKRKGKTLEEDANENDKPDNKSTPPITRQY 357 Query: 293 --NGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVV 350 G TQG+FL +GI +R ++ D + + MGE +K+L + Sbjct: 358 EAYGPITQGQFLASMGIVERVERYIEDDDTTDEQAYELFSALERLVGSDEMGERYKVLAI 417 Query: 351 ---SHEKV-ELMPF 360 + + F Sbjct: 418 AAAKKDGLFPPPGF 431 >gi|46122055|ref|XP_385581.1| hypothetical protein FG05405.1 [Gibberella zeae PH-1] Length = 465 Score = 218 bits (555), Expect = 1e-54, Method: Composition-based stats. Identities = 107/432 (24%), Positives = 167/432 (38%), Gaps = 94/432 (21%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYST-----CNPFGAVGDFVTAPEIS 56 L +++ I G + + Y +C+ GYY+ + FG GDFVT+PEIS Sbjct: 27 STPLAKQLFAAISTTGPVPLASYMRMCLTGDIGGYYTGAIGEGRDQFGTKGDFVTSPEIS 86 Query: 57 QIFGEMLAIFLICAWEQHGFPSC-VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 QIFGE++ I+ I W G P V+++E+GPGRG +M D+LR I + S+ ++YM Sbjct: 87 QIFGELIGIWFIAEWMSQGRPKQGVQIIEVGPGRGTLMDDMLRTIQRFPAMANSIDAVYM 146 Query: 116 VETSERLTLIQKKQLA---------------------SYGDKINWYTSLADVPLGFTFLV 154 VE S L QK+ L + S+ F++ Sbjct: 147 VEASRELRSAQKELLCGPDASSSESESGFHSASKYNGKQIVWTDNIKSIPYESDKMPFII 206 Query: 155 ANEFFDSLPIKQFV-------------------------MTEHGIRERMIDIDQHDSLVF 189 A+EFFD+LPI F RE M+ Sbjct: 207 AHEFFDALPIHSFQSAPAPPPQPKPSSSTSAPQPLPQNTKPTMEWREMMVSPTLPGVTHA 266 Query: 190 NIGDHEIKSNFLTCS--------------------------DYFLGAIFENSPCRDREMQ 223 +G + + + + E P Sbjct: 267 QLGTPKSEQHEPPPEFQLTLSSTPTRHSRFLPETSTRYRKLKSMPNSNIEICPDASIFAT 326 Query: 224 SISDRL--------ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSH 275 + R+ G A+++DYG + ++L+ ++ H VSP PG DLS+ Sbjct: 327 DFATRVGGSDEHPKPKPSGAALILDYGTSDTVPINSLRGIRHHRRVSPFSAPGLVDLSAD 386 Query: 276 VDFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLMKQTARK----DILLDSVK 329 VDF ++ +A+L + ++G TQG FL +GI +RA L K + D + + K Sbjct: 387 VDFTAIAEVAMLASEGVEVHGPVTQGDFLGVMGIRERAEQLTKAPGVEKDTVDKIDGAWK 446 Query: 330 RLVSTSADKKSM 341 RLV M Sbjct: 447 RLVDKG--PDGM 456 >gi|153873329|ref|ZP_02001946.1| protein containing DUF185 [Beggiatoa sp. PS] gi|152070205|gb|EDN68054.1| protein containing DUF185 [Beggiatoa sp. PS] Length = 406 Score = 218 bits (554), Expect = 1e-54, Method: Composition-based stats. Identities = 91/399 (22%), Positives = 162/399 (40%), Gaps = 53/399 (13%) Query: 2 ENKLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIF 59 +LI KI L+ NG + ++ + P GYYS + G GDF+TAPEIS +F Sbjct: 15 SQQLINKI--LLAMNGEAIPFVKFMEQALYAPGLGYYSAGIHKLGEKGDFITAPEISPLF 72 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 +A ++E G G G M +IL+ + +L +++E S Sbjct: 73 SHCVAKQCQQILASF---ESGVILEFGAGSGKMAAEILKELERLDCLP---TEYFILEVS 126 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPLGFTF---LVANEFFDSLPIKQFVMTEHGIRE 176 L Q++ L + + + ++ANE D++P+++F + ++ I E Sbjct: 127 ADLQKYQQETLQAQVPHLFNRIQWLERLPSQPISGVILANEVLDAMPVRRFRLDDNEISE 186 Query: 177 RMI-------------DIDQHDSLVFNIGDHEIKSNFLTCSD-------------YFLGA 210 + ++ + + F + L D +G Sbjct: 187 FFVGSEDSKINFGLQEELSKDSKINFGLQ-SPFSWQILPSRDESLRMLVEKLRPTLPIGY 245 Query: 211 IFENSPCRDREMQSISDRLACDGGTAIVIDY------GYLQSRVGDT-LQAVKGHTYVSP 263 + E + +QS+SD LA G ++IDY Y R T + + H + P Sbjct: 246 VSEINLALPAWIQSVSDILAA--GMVLLIDYGFPRREYYHPQRTQGTLMCHYRHHAHDDP 303 Query: 264 LVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDI 323 L+ G D+++HVDF ++ A+ L + G TTQ FL G+ + +L + Sbjct: 304 LILVGLQDITAHVDFTAVAEAAVAADLEVAGYTTQANFLLASGLPEFFSTLDPDDTKT-- 361 Query: 324 LLDSVKRLVSTSADKKSMGELFKILVVSHE-KVELMPFV 361 + V MGELFK++ ++ + + L+ F+ Sbjct: 362 -YLQYTQQVKKLTLPSEMGELFKVMALTRDYEKPLLGFI 399 >gi|145347903|ref|XP_001418399.1| predicted protein [Ostreococcus lucimarinus CCE9901] gi|144578628|gb|ABO96692.1| predicted protein [Ostreococcus lucimarinus CCE9901] Length = 461 Score = 218 bits (554), Expect = 1e-54, Method: Composition-based stats. Identities = 118/389 (30%), Positives = 184/389 (47%), Gaps = 55/389 (14%) Query: 5 LIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEM 62 L+ + +K G + V +Y C+ PE+GYY + FG GDFVT+PEISQ+FGE+ Sbjct: 95 LLGHLERAMKFAGGSIPVSEYVRECLTHPEYGYYMRDADVFGKKGDFVTSPEISQVFGEL 154 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + ++ +E G P +R+VE GPGRG +M D+LR K +S++++E S L Sbjct: 155 IGVWAALQYEALGSPDTLRIVEFGPGRGTLMADLLRGTRKFAKFR-DAVSVHLIEVSPAL 213 Query: 123 TLIQKKQLAS------------------YGDKINWYTSLADVPLGFTFLVANEFFDSLPI 164 Q K L ++ W+ L VP G T ++ +EFFD+LP+ Sbjct: 214 RKTQAKTLRCGELETTAAEGNARFVSEINDAEVFWHDGLESVPRGPTLVICHEFFDALPV 273 Query: 165 KQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQS 224 +QF TE G E++I ID + + E SP + Sbjct: 274 RQFQRTERGWCEKLITIDSEKADSLRL--------------------IELSPPSMTLWDA 313 Query: 225 ISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSI 284 + DR+ + G + IDYG + +G+TL+A+K H +V L +PG+ADLS++VDF L I Sbjct: 314 LVDRIEKNSGAVLAIDYG-EEGPLGNTLEAIKDHKFVHVLDSPGEADLSAYVDFGALRQI 372 Query: 285 AI---LYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTA---RKDILLDSVKRLVSTSA-D 337 + G TQ + L LG+ R L++ + + D L+ +RLV A D Sbjct: 373 VEEKPQSGVKCYGPVTQQQLLLSLGLVPRLEKLVENASSEAQADELVQGCERLVGDGAGD 432 Query: 338 KK-----SMGELFKILVVSHEKVELM-PF 360 + MG +K + + + F Sbjct: 433 PESGVAPGMGSRYKAIAMVSRGLPKPVGF 461 >gi|145590174|ref|YP_001156771.1| hypothetical protein Pnuc_1996 [Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1] gi|145048580|gb|ABP35207.1| protein of unknown function DUF185 [Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1] Length = 395 Score = 218 bits (554), Expect = 1e-54, Method: Composition-based stats. Identities = 86/379 (22%), Positives = 156/379 (41%), Gaps = 33/379 (8%) Query: 2 ENKLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIF 59 L KI + I + G + ++ + + +P GYYS + G+ GDF TAPE+S +F Sbjct: 13 SELLRAKISSQINSEGGWIPFSRFMEMALYEPAMGYYSAGAHKLGSGGDFTTAPELSPLF 72 Query: 60 GEMLAIFLICAWEQHGFPS-CVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVET 118 G + L+ E +++E G G G + I + +L FS+ ++E Sbjct: 73 GAAICSTLLPVLEGFKAQGLPTQILEFGAGTGKLASSI---LTRLHDLDFSLDRYDILEI 129 Query: 119 SERLTLIQKKQLASYGDK------INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEH 172 S L QK+ ++ D+ +W +L G ++ANE D++P V Sbjct: 130 SPDLAQRQKEHISKTVDQLNSSTQCDWLKALPQNFKG--VILANEVIDAIPCDAIVYQNG 187 Query: 173 GIRERMIDIDQHDSLVFN---IGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRL 229 + ++ L++ + ++ L + + + E + M+ ++ L Sbjct: 188 FWYWYGVALN-DGKLIWKTGSPVEQDLLPESLLSASFSESYVTELHAPANDWMRQVARNL 246 Query: 230 ACDGGTAIVIDY------GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLS 282 D G + DY Y R+ TL + H P PG D+++HV++ +++ Sbjct: 247 --DSGLFLTFDYGFPEGEYYHPQRLEGTLMAHHRHHAIQDPFYLPGLCDITTHVEWSQIA 304 Query: 283 SIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMK--QTARKDILLDSVKRLVSTSADKKS 340 A+ LT Q +L GI A + + +S+++L+S + Sbjct: 305 RSALTENADDVYLTNQAAYLLDAGIGDIALEIGDPSNPETFLPISNSLQKLLS----EAE 360 Query: 341 MGELFKILVVSHEKVELMP 359 MGELFK S L+P Sbjct: 361 MGELFKAFAFSKNLDSLLP 379 >gi|319785821|ref|YP_004145296.1| hypothetical protein Psesu_0203 [Pseudoxanthomonas suwonensis 11-1] gi|317464333|gb|ADV26065.1| protein of unknown function DUF185 [Pseudoxanthomonas suwonensis 11-1] Length = 396 Score = 217 bits (553), Expect = 2e-54, Method: Composition-based stats. Identities = 86/369 (23%), Positives = 137/369 (37%), Gaps = 29/369 (7%) Query: 5 LIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGEM 62 L I I+ G + + ++ LC+ P GYYS FG GDFVTAPE ++F Sbjct: 21 LADHIRGEIRHSGGAIPMSRFMELCLYAPGLGYYSAGSTKFGPAGDFVTAPESGRLFAAT 80 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 ++ L Q G + R++E+G G G + ++ ++E S L Sbjct: 81 VSGVLADGLRQLGGHA--RVLEVGGGSGAFAEV---ALKRMLELDALPDRYAILEPSADL 135 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFT------FLVANEFFDSLPIKQFVMTEHGIRE 176 Q++ L + L + L ANE D+LP +FV+ E + E Sbjct: 136 RERQREHLE--RTLVPTVFELVEWLDRPFQDEWDGLLFANEVIDALPTPRFVLREGRVYE 193 Query: 177 RMIDIDQHDSLVFNIGDHEI-------KSNFLTCSDYFLGAIFENSPCRDREMQSISDRL 229 + +D +D V + + G E P +Q++ L Sbjct: 194 EHVALDGNDGFVRVDRPADALLEAAVRHIEDYLEQPFADGYRSELLPQLPYWLQAVGGGL 253 Query: 230 ACDGGTAIVIDY----GYLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSI 284 G Y Y R TL+A L PG DL++ VDF L+ Sbjct: 254 RSGGMLFFDYGYARREYYQPQRSDGTLRAFYRQRMHEDVLRWPGLQDLTASVDFTALAEA 313 Query: 285 AILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGEL 344 L + G +Q FL G GI Q +T L +++ + + MGE Sbjct: 314 GTGAGLDLAGYCSQASFLLGNGIDQLLAEAESRTDEAGAL--RLRQELKRLTLPEQMGER 371 Query: 345 FKILVVSHE 353 F+ + + + Sbjct: 372 FQAMAFTRD 380 >gi|288941549|ref|YP_003443789.1| hypothetical protein Alvin_1832 [Allochromatium vinosum DSM 180] gi|288896921|gb|ADC62757.1| protein of unknown function DUF185 [Allochromatium vinosum DSM 180] Length = 385 Score = 217 bits (552), Expect = 2e-54, Method: Composition-based stats. Identities = 90/372 (24%), Positives = 150/372 (40%), Gaps = 36/372 (9%) Query: 1 MENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEISQI 58 + +L ++ I+ G + D++ L + P GYY P FG GDFVTAPE+S + Sbjct: 12 ISRRLEDRVRAEIRARGGVLPFDRFMELALYAPGLGYYVAGAPKFGPGGDFVTAPELSPL 71 Query: 59 FGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLK--PDFFSVLSIYMV 116 FG LA+ E+ ++E G G G + + IL + L+ P+ + ++ Sbjct: 72 FGRCLAVQCAEVLERLDSGE---ILEFGAGSGALAVQILLELASLERLPERYR-----IL 123 Query: 117 ETSERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEH 172 E S L Q+ + + +W T+L + G ++ANE D++P+ +F + + Sbjct: 124 EPSPDLQERQRAAIQTAAPHLLARCDWLTALPERFDG--VVIANEVLDAMPVHRFRLGDA 181 Query: 173 GIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFL-------GAIFENSPCRDREMQSI 225 G + ++ LV S G E + + ++ Sbjct: 182 GEILEIGVGERDGRLVEVAIPPVSTGLVEAVSALHEAGLANTPGYESEINLRLSPWLSAL 241 Query: 226 SDRLACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQR 280 S + I Y Y R TL+ H +P V+ G D+++HVDF Sbjct: 242 SRVMERGLALLIDYGYPRAEYYQPERHMGTLRCHHRHQAHSNPYVHLGLQDITAHVDFTA 301 Query: 281 LSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKS 340 + + + G TTQ FL G GI R L K+L+ +A Sbjct: 302 AAEAGVAAGFELAGFTTQAHFLIGCGI-DRLMQASA-PETAFDLALGAKQLLLPTA---- 355 Query: 341 MGELFKILVVSH 352 MGE FK+L ++ Sbjct: 356 MGERFKVLGLAK 367 >gi|312219814|emb|CBX99756.1| similar to DUF185 domain-containing protein [Leptosphaeria maculans] Length = 478 Score = 216 bits (549), Expect = 6e-54, Method: Composition-based stats. Identities = 110/449 (24%), Positives = 175/449 (38%), Gaps = 115/449 (25%) Query: 16 NGQMTVDQYFALCVADPEFGYYSTC-----NPFGAVGDFVTAPEISQIFGEMLAIFLICA 70 N + V Y C+ PE GYY+ + FGA GDFVT+PEISQIFGE++ ++L Sbjct: 14 NSPIPVAAYMRQCLTHPEGGYYTRQTTSGQDQFGAKGDFVTSPEISQIFGELVGVWLYAE 73 Query: 71 WEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQL 130 W G V+++E+GPGRG +M D+LR + L+ S+ ++Y++E S L Q K L Sbjct: 74 WHAQGRKDKVQIIEVGPGRGTLMDDVLRTVTSLRGFAQSIETVYLIEASPYLQKQQGKLL 133 Query: 131 ASYGD---------------------KINWYTSLADVPLGFTFLVANEFFDSLPIKQFVM 169 + D L F++A+EFFD+LPI F Sbjct: 134 SGTEDLQKSDIGLTAMCKYIPGCKIEWCEDIRLLPKEVTPTPFIIAHEFFDALPIHVFQN 193 Query: 170 ---------------------------TEHGIRERMIDIDQHD-----------SLVFNI 191 +++ E ++ D + Sbjct: 194 VAQSSIPASSTIITPTGPIRPKNGSTSSQNIWHELVVSPTSPDMNSATTGQEKLEFELTV 253 Query: 192 GDHEIKSNFLTCSDYF--------LGAIFENSPCRDREMQSISDRL-------------- 229 + AI E SP ++ ++R+ Sbjct: 254 SKSPTPHSLYLPKTSKRYRALENTPDAIIEISPESMSYIEDFAERIGGGNPKPPATPAAS 313 Query: 230 ---------------------ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPG 268 G A+++DYG L++ ++L+ ++ HT VSP PG Sbjct: 314 SSSPTLHPSTVKTKLDTPYQKPTPSGAALILDYGSLETIPANSLRGIRNHTTVSPFAAPG 373 Query: 269 QADLSSHVDFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDI--- 323 DLS+ VDF L+ A+ + ++G QG FL +GI +RA LMK+ ++ Sbjct: 374 LVDLSADVDFVALADSALRASPGVEVHGPVEQGFFLGTMGIKERAERLMKEAKDEEARQR 433 Query: 324 LLDSVKRLVSTSADKKSMGELFKILVVSH 352 L +RLV MG+ +K + + Sbjct: 434 LATGWQRLVDRHV---GMGKTYKAMAIVP 459 >gi|327349837|gb|EGE78694.1| DUF185 domain-containing protein [Ajellomyces dermatitidis ATCC 18188] Length = 510 Score = 216 bits (549), Expect = 6e-54, Method: Composition-based stats. Identities = 110/435 (25%), Positives = 170/435 (39%), Gaps = 99/435 (22%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNP-------FGAVGDFVTAPE 54 L + + I G +++ Y C+ P+ GYY++ FG GDFVT+PE Sbjct: 48 STPLAKSLGEAISVTGPVSIAAYMRQCLTSPDGGYYTSRGQEAEDTELFGTKGDFVTSPE 107 Query: 55 ISQIFGEMLAIFLICAWEQHGFPS-CVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSI 113 ISQIFGE+L I+ + W G S V+++E GPG+G +M D+LR K ++ ++ Sbjct: 108 ISQIFGELLGIWTVAEWMGQGRKSGGVQIIEFGPGKGTLMGDMLRCFRNFKSFASTIEAV 167 Query: 114 YMVETSERLTLIQKKQLASYGD--------------------KINWYTSLADVPLGFTFL 153 Y+VE S L +Q+K L + L D P F+ Sbjct: 168 YLVEASPVLREVQRKLLCGDAPMEEVEAGYKSKSIHLGVPIVWAEHISFLPDEPDKTPFI 227 Query: 154 VANEFFDSLPIKQFV--------------------------MTEHGIRERMIDIDQHDSL 187 A+EFFD+LPI F + + R + + + Sbjct: 228 FAHEFFDALPIHAFQSVEVPSQPQTINSPTGPITLHQSSAPSSSTTTQWRELVVSPNPET 287 Query: 188 VFNIGDHEIKSNFLTCS-------------------DYFLGAIFENSPCRDREMQSISDR 228 E + G+ E SP +Q I+ R Sbjct: 288 PEVKSSKEPEFRLSLAKASTPSSLILPEMSPRYKALKSTPGSTIEISPESQTCVQDIAKR 347 Query: 229 L------------------ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQA 270 + G A+++DYG + ++L+ ++ H VSP PGQ Sbjct: 348 IGGAFTSPSSPAATDAKKNKVPSGAALILDYGTTSTIPINSLRGIRKHQLVSPFAAPGQV 407 Query: 271 DLSSHVDFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLMKQ------TARKD 322 D+S+ VDF L+ AI + + G T QG FLE LGI +RA L+K+ ++ Sbjct: 408 DVSADVDFTALAEAAIDASPGVEVYGPTEQGAFLEALGISERAAQLLKKVEGEGDEEKRK 467 Query: 323 ILLDSVKRLVSTSAD 337 + KRLV Sbjct: 468 RIESGWKRLVERGGG 482 >gi|328858640|gb|EGG07752.1| hypothetical protein MELLADRAFT_85499 [Melampsora larici-populina 98AG31] Length = 367 Score = 215 bits (548), Expect = 7e-54, Method: Composition-based stats. Identities = 108/359 (30%), Positives = 169/359 (47%), Gaps = 46/359 (12%) Query: 25 FALCVADPEFGYYST--------CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 LC+ GYYS +PFG +GDF+T+PEISQIFGE++ ++ + W G Sbjct: 1 MNLCLNHTSLGYYSKPESNRTNLSDPFGKLGDFITSPEISQIFGELIGVWFLSRWIDFGS 60 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLAS---- 132 P +R++ELGPGRG ++ DILR +K + I++VE S L IQ+++L+ Sbjct: 61 PDSIRIIELGPGRGTLISDILRTFKSIKSCNPKIKEIHLVENSPFLRKIQEEKLSKDLSN 120 Query: 133 ---YGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVF 189 + + + ++ ++A+EFFD+LPI F T+ G RE ++DID + Sbjct: 121 GNTKLYWYDRIEEIQESSDHWSMIIAHEFFDALPIHVFQKTDKGFREILVDIDDKNKTTE 180 Query: 190 NIGDHE------------------IKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLAC 231 IG + + +G+ E SP +S L Sbjct: 181 PIGLIKSIKPIRFVLASKPTIGSQTLIQEKDYEKFSVGSKIEISPLSLSVAFHLSKLLKL 240 Query: 232 DGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLY 291 G+ +++DYG GD+L+ H V PL PG DL+++V+F +L I + Sbjct: 241 GNGSGLIVDYG-DDHHFGDSLRGFYKHRVVDPLHKPGLTDLTANVNFSKLKEIMNPF-CK 298 Query: 292 INGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVV 350 G +Q +FL +GI R L K T + SV RL+S + MG +K L + Sbjct: 299 SYGPISQREFLLKMGIEVRKQKL-KNTDQ------SVDRLIS----PRGMGTQYKFLGI 346 >gi|257061621|ref|YP_003139509.1| hypothetical protein Cyan8802_3870 [Cyanothece sp. PCC 8802] gi|256591787|gb|ACV02674.1| protein of unknown function DUF185 [Cyanothece sp. PCC 8802] Length = 377 Score = 214 bits (546), Expect = 1e-53, Method: Composition-based stats. Identities = 98/381 (25%), Positives = 159/381 (41%), Gaps = 30/381 (7%) Query: 6 IRKIVNLIKKN--GQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIFGEM 62 + I+ I+++ ++T Y L + P+ GYYS+ N G GDF TA + FGE+ Sbjct: 1 MEFILEAIQQSPEHKITFADYMNLALYHPQKGYYSSGNAKIGTQGDFFTASSLGADFGEL 60 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 LA + W G+P+ LVE+G G G DIL + P F+ + ++E ++ L Sbjct: 61 LAEQFLEMWSILGYPNRFSLVEVGAGSGFFAADILDYLSNQHPHFYEAIEYLIIEEAKGL 120 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 QK QL + + + +NE D+ P+ + + ++E + I Sbjct: 121 IEQQKAQLKNSDKVSWKSWDEIENCSIIGCIFSNELIDAFPVHLVTLEQGKLQEIYVTI- 179 Query: 183 QHDSLVFNIGD---------HEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDG 233 + I D E+ LT DY E + +++++++L Sbjct: 180 SEGKITEAIADLSTSQLRDYFELVEVNLTAKDYPENYRTEVNLAALDCLKTVANKLKK-- 237 Query: 234 GTAIVIDYGYL------QSRVGDTLQAVKGHTYVS-PLVNPGQADLSSHVDFQRLSSIAI 286 G + IDYGY R TL+ H + P VN G+ D+++HV+F L + Sbjct: 238 GYLLTIDYGYTAQKYYHPQRYQGTLKCYYKHRHHDNPYVNIGEQDITTHVNFTALENHGE 297 Query: 287 LYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFK 346 L L G T QG FL GLG+ R L ++ L D +G F Sbjct: 298 LLGLDKLGFTQQGLFLMGLGLGDRLSDLSNGNYSFLEVIQRRDSL-HQLIDPMGLGG-FG 355 Query: 347 ILVVS------HEKVELMPFV 361 +L+ S +K L F+ Sbjct: 356 VLLQSQGLTPEQKKRSLKGFL 376 >gi|330993030|ref|ZP_08316968.1| Protein midA-like protein [Gluconacetobacter sp. SXCC-1] gi|329759800|gb|EGG76306.1| Protein midA-like protein [Gluconacetobacter sp. SXCC-1] Length = 350 Score = 214 bits (546), Expect = 1e-53, Method: Composition-based stats. Identities = 113/342 (33%), Positives = 171/342 (50%), Gaps = 18/342 (5%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCV 80 +D + A YY+ C+PF DF+T+PEISQ+FGE+L ++ AW Q G P Sbjct: 8 LDHFMARA----NAAYYAGCDPF---ADFITSPEISQMFGELLGAWVAVAWGQLGSPDPF 60 Query: 81 RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLA-SYGDKINW 139 LVE GPGRG +M D LR++ ++ P + ++++ETS RL +Q L W Sbjct: 61 MLVEAGPGRGTLMADALRLVRRVAPACHRAVRLHLIETSPRLRGVQAHALEGGTLLPPCW 120 Query: 140 YTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSN 199 + SLA VP G L+ANEF D+LPI+QF G ER + H + V Sbjct: 121 HDSLATVPDGPMILLANEFLDALPIRQFTRRAGGWEERFV---HHGAFVGAPASFPATHP 177 Query: 200 FLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHT 259 + GA+ E Q+++ RL G A++IDYGY GD+LQA++ Sbjct: 178 ARGRA-VPEGAVLETCQPALDFTQALAARLRRWSGAALLIDYGYDAPAWGDSLQALRDGR 236 Query: 260 YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTA 319 L +PG ADL++HVDF+ ++ A + + + G QG+FL LG+ R L + Sbjct: 237 PADLLRDPGTADLTAHVDFRSIAQAA--HGVDVWGSVPQGRFLSALGLAARCARLERAAP 294 Query: 320 RKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKV-ELMPF 360 + + A + MG+LF++L ++ +L F Sbjct: 295 DQAAATRDAA---ARLAAPERMGQLFRVLGLAAGLAGDLPGF 333 >gi|114321671|ref|YP_743354.1| hypothetical protein Mlg_2524 [Alkalilimnicola ehrlichii MLHE-1] gi|114228065|gb|ABI57864.1| protein of unknown function DUF185 [Alkalilimnicola ehrlichii MLHE-1] Length = 403 Score = 214 bits (546), Expect = 1e-53, Method: Composition-based stats. Identities = 94/383 (24%), Positives = 147/383 (38%), Gaps = 32/383 (8%) Query: 2 ENKLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEISQIF 59 L +I + I+ G + D+Y + + +P GYYS P FG GDF TAP IS +F Sbjct: 22 SEALQARIRDAIRSAGGWLPFDRYMGMALYEPGLGYYSAGAPRFGEGGDFTTAPLISPLF 81 Query: 60 GEMLAIFLICAWEQHGF-PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVET 118 LA + A + ++ELG G G M DIL + +L L ++E Sbjct: 82 SRTLAHTVQRALQALELATGQGEVLELGAGSGRMAADILLELERLGQLPARYL---ILEV 138 Query: 119 SERLTLIQKKQLASYGDKINWYTSLADVPLGFTF---LVANEFFDSLPIKQFVMTEHGIR 175 S L Q + L + + + L+ANE D+LP + F I Sbjct: 139 SAALRQEQHRTLGEHAPHLLDRVEWLEQLPEHPITGALLANEVLDALPFRCFERGRDDIL 198 Query: 176 ERMIDIDQHDSLVFNIGDHEIK-------SNFLTCSDYFLGAIFENSPCRDREMQSISDR 228 ER + +D D + + T G E P ++ + Sbjct: 199 ERGVALDDDDHPQWATRPADEPLAGHVRHIEAETGRRLPPGYRSECLPQLADWLRDTTRC 258 Query: 229 LACDGGTAIVIDY------GYLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRL 281 LA G + IDY YL R TL H P + PG D+++ VDF + Sbjct: 259 LAR--GLVLYIDYGYPRREYYLPDRHMGTLLCHYRHRAHEDPFLWPGLQDITAFVDFTAV 316 Query: 282 SSIAILYKLYINGLTTQGKF---LEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADK 338 + A+ L + G T+Q ++ + A + + + V+RL Sbjct: 317 AEAALAADLDVLGFTSQAQYLLAAGLAHLADEAMAQHDDDMHRLQIAQQVRRLTL----P 372 Query: 339 KSMGELFKILVVSHEKVELMPFV 361 +GE FK+L + + L F+ Sbjct: 373 SELGERFKVLPLGRDLAPLPEFI 395 >gi|297667839|ref|XP_002812172.1| PREDICTED: protein midA homolog, mitochondrial-like isoform 3 [Pongo abelii] Length = 370 Score = 214 bits (546), Expect = 1e-53, Method: Composition-based stats. Identities = 103/358 (28%), Positives = 155/358 (43%), Gaps = 64/358 (17%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P GYY + G GDF+T+PEISQIFGE+ Sbjct: 41 TPMLRHLIYKIKSTGPITVAEYMKEVLTNPAKGYYVYRDMLGEKGDFITSPEISQIFGEL 100 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV-LSIYMVETSER 121 L I+ I W G + +LVELGPGRG ++ DILRV +L + +S+++VE + + Sbjct: 101 LGIWFISEWMATGKSTAFQLVELGPGRGTLVGDILRVFTQLGSVLKNCDISVHLVEKTPQ 160 Query: 122 LTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 G RE +DI Sbjct: 161 ---------------------------------------------------GWREVFVDI 169 Query: 182 DQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVI 239 D D L F + + D E P ++ +S+R+A GG A+V Sbjct: 170 DPQVSDKLRFVLAPSATPAEAFIQHD-ETRDHVEVCPDAGVIIEELSERIALTGGAALVA 228 Query: 240 DYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQG 299 DYG+ +R DT + GH L+ PG ADL++ VDF L +A K+ G Q Sbjct: 229 DYGHDGTRT-DTFRGFCGHKLHDVLIAPGTADLTADVDFSYLRRMA-QGKVASVGPIKQH 286 Query: 300 KFLEGLGIWQRAFSLMKQTAR---KDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 FL+ +GI R L+ ++ + LL L+ + K MGE F + + Sbjct: 287 TFLKNMGIDVRLKVLLDKSNEPSVRQQLLQGYDMLM----NPKKMGERFNFFALLPHQ 340 >gi|297171519|gb|ADI22518.1| uncharacterized conserved protein [uncultured verrucomicrobium HF0500_08N17] Length = 378 Score = 214 bits (544), Expect = 2e-53, Method: Composition-based stats. Identities = 91/368 (24%), Positives = 139/368 (37%), Gaps = 30/368 (8%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIFGE 61 N L I I G M ++ L + PEFGYY G GDFVT+ + FG+ Sbjct: 23 NPLAGMIRAEIDDGGPMPFARFMELALYHPEFGYYEKEAGQVGPQGDFVTSVSVGAAFGQ 82 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 +LA E P +VE G G + DIL + F+ L + E S + Sbjct: 83 LLAYRFAEWLEAIDGPVQ--VVEAGAHDGTLAGDILEWLAANDEPLFARLRYVIAEPSAK 140 Query: 122 LTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQ--FVMTEHGIRERMI 179 Q K+LA + ++ W SLA P + NE D+ P+ + + E G E + Sbjct: 141 RRSWQAKRLAKFDTQLEWRDSLAGAPKIPGIIFCNELLDAFPVNRIGWSKAERGWFEWAV 200 Query: 180 DIDQHDSLVFNI------GDHEIKSNFLTCSD-YFLGAIFENSPCRDREMQSISDRLACD 232 D I + + D E SP + ++ +D+LA Sbjct: 201 DWRDGAFCWEPIDGGGAAWRSLLPAWPSALLDVLPDQYTTELSPQANAWWRAAADQLA-- 258 Query: 233 GGTAIVIDYGY------LQSRVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIA 285 G I +DYG+ ++ T + + L +PG+ DL++H +F Sbjct: 259 VGKLIALDYGHGPDDWPAANQPDGTARGYRCQRLVDDVLTDPGKQDLTAHANFGLAKQSG 318 Query: 286 ILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVK-RLVSTSADKKSMGEL 344 L T+Q +FL G A L A L ++ R + MG Sbjct: 319 EAAGLQTELFTSQERFL--NG--TFAEMLQSAPA----LGQAMDVRQLQALTHPAHMGRP 370 Query: 345 FKILVVSH 352 F++LV S Sbjct: 371 FRVLVQSR 378 >gi|322695717|gb|EFY87521.1| DUF185 domain-containing protein [Metarhizium acridum CQMa 102] Length = 460 Score = 214 bits (544), Expect = 2e-53, Method: Composition-based stats. Identities = 111/441 (25%), Positives = 169/441 (38%), Gaps = 101/441 (22%) Query: 19 MTVDQYFALCVADPEFGYYS-----TCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQ 73 + + Y +C+ GYY+ + FG GDFVT+PEISQIFGE++ ++ I W Sbjct: 15 VPLASYMRMCLTGDLGGYYTGAIGQNRDQFGVKGDFVTSPEISQIFGELVGVWFIAEWIS 74 Query: 74 HGFPSC-VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQK----- 127 G P V+L+E+GPGRG +M D+LR I + S+ S++MVE S L QK Sbjct: 75 QGQPKQGVQLIEVGPGRGTLMDDMLRTIKRFPAMVDSIESVFMVEASPELREKQKTLLCG 134 Query: 128 ----------------KQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 K L S+ P F+VA+EFFD+LPI F Sbjct: 135 SDAPSEDCAAGSRSTGKHLGKPVVWAESLKSIPIEPNKVPFIVAHEFFDALPIHCFQSAP 194 Query: 172 ----------------------------HGIRERMIDIDQH------------------- 184 + RE M+ Sbjct: 195 APASTPETASTRTSTVKPSPTEANSSPAYEWREMMVSPTHPAEVASDQAKAKAAGRETSA 254 Query: 185 DSLVFNIGDHEIKSNFLTCSDYF--------LGAIFENSPCRDREMQSISDRL------- 229 + + + G++ E P + R+ Sbjct: 255 AEFQLILSSKPTRHSRYLPESSPRYRQLKQSPGSVVEICPDASLYAADFAARIGGSDKVK 314 Query: 230 -ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILY 288 G A+++DYG + ++L+ ++ H VSP PG DLS+ VDF ++ A L Sbjct: 315 KPQPCGAALILDYGTSDTIPINSLRGIRHHKLVSPFSAPGLVDLSADVDFTAIAEAATLA 374 Query: 289 --KLYINGLTTQGKFLEGLGIWQRAFSLMK----QTARKDILLDSVKRLVSTSADKKSMG 342 + ++G Q FLE +GI +RA L+K + + S KRLV MG Sbjct: 375 SDGVEVHGPVPQADFLELMGIRERAEMLIKAAGTDESTAQRIRKSWKRLVDRG--PSGMG 432 Query: 343 ELFKILVVSHEKVE---LMPF 360 +++K L + E + F Sbjct: 433 KIYKALAILPENDGRRRPVGF 453 >gi|297181970|gb|ADI18146.1| uncharacterized conserved protein [uncultured Verrucomicrobiales bacterium HF0200_39L05] Length = 378 Score = 214 bits (544), Expect = 2e-53, Method: Composition-based stats. Identities = 91/368 (24%), Positives = 142/368 (38%), Gaps = 30/368 (8%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGE 61 N L I I +G M ++ L + PEFGYY +P G GDFVT+ + FG+ Sbjct: 23 NPLAGMIRAKIDDDGPMPFARFMELALYHPEFGYYEKEASPVGRQGDFVTSVSVGAAFGQ 82 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 +LA E P +VE G G + DIL + F+ L + E S + Sbjct: 83 LLAYRFAKWLEAIDGPVQ--VVEAGAHDGTLAGDILEWLAANDEPLFARLRYVITEPSAK 140 Query: 122 LTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQ--FVMTEHGIRERMI 179 Q K+LA + ++ W SL + + NE D+ P+ + + E G E + Sbjct: 141 RQSWQAKRLAKFDAQLEWRDSLTGITEIRGVIFCNELLDAFPVNRIGWSKAERGWFEWAV 200 Query: 180 DIDQHDSLVFNI------GDHEIKSNFLTCSD-YFLGAIFENSPCRDREMQSISDRLACD 232 D I + + D E SP + ++ +D+LA Sbjct: 201 DWRDGAFCWEPIDGGGAAWRSLLPAWPSALLDVLPDQYTTELSPQANEWWRAAADQLA-- 258 Query: 233 GGTAIVIDYGY------LQSRVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIA 285 G I +DYGY ++ T++ + L +PG+ DL++H +F Sbjct: 259 VGKLIALDYGYGPDDWPTANQPDGTVRGYRRQKLVDDVLTDPGEQDLTAHANFGLAKQSG 318 Query: 286 ILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVK-RLVSTSADKKSMGEL 344 L T+Q +FL G A L A L ++ R + MG Sbjct: 319 EAAGLQTELFTSQERFLNGA----FAEMLQSAPA----LGQALDVRQLQALTHPAHMGRP 370 Query: 345 FKILVVSH 352 F++LV S Sbjct: 371 FRVLVQSR 378 >gi|293602449|ref|ZP_06684895.1| conserved hypothetical protein [Achromobacter piechaudii ATCC 43553] gi|292819211|gb|EFF78246.1| conserved hypothetical protein [Achromobacter piechaudii ATCC 43553] Length = 402 Score = 213 bits (543), Expect = 2e-53, Method: Composition-based stats. Identities = 88/367 (23%), Positives = 157/367 (42%), Gaps = 35/367 (9%) Query: 15 KNGQMTVDQYFALCVADPEFGYYSTCNP---------FGAVGDFVTAPEISQIFGEMLAI 65 G + D + A + P GYY+ N GDFVTAP+++ +F +A Sbjct: 41 NGGWLPFDHWMAQALYAPGLGYYAAGNVKLADADDDAKAPAGDFVTAPQLTPLFARTIAR 100 Query: 66 FLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLI 125 Q ++E G G G + V+ +L + ++E S L Sbjct: 101 QAAQVLRQT---QTHAVLEFGAGTGALAEG---VLRELDALGLTDTQYLILEVSADLRAR 154 Query: 126 QKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEH-GIRERMIDIDQH 184 Q ++LA++G ++ W +L + G ++ANE D++P+ F +E + ER + +D Sbjct: 155 QAERLAAFGARVQWLDALPNAFAGC--VLANEVLDAMPVSLFCWSEDGTVMERGVSLDAQ 212 Query: 185 DSLVFNIGDHEIKSNFLTCSDYF--LGAIFENSPCRDREMQSISDRLACDGGTAIVIDY- 241 V++ + G + E + + + ++ L G A+++DY Sbjct: 213 QDFVWSDRPAPPALAQAVAARMPALPGYVSEINLQGEAWIAAMGSWLER--GAALLVDYG 270 Query: 242 -----GYLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGL 295 Y R G T + ++ H + PL PG D+++HVDF ++ A+ L + G Sbjct: 271 FPRNEYYHPQRAGGTLMCHLRHHAHGDPLTAPGLQDITAHVDFTAMADAALEAGLQVLGY 330 Query: 296 TTQGKFLEGLGIWQRAFSLMKQTARKDILLDS-VKRLVSTSADKKSMGELFKILVVSHEK 354 T+Q +FL G+ L ++ + V++L+S + MGELFK+L V Sbjct: 331 TSQARFLMNAGLMDLLAQLDPSDVQQYAQAVAPVQKLLS----EAEMGELFKVLAVGRAM 386 Query: 355 V-ELMPF 360 L+ F Sbjct: 387 TEPLIGF 393 >gi|160871820|ref|ZP_02061952.1| protein of unknown function [Rickettsiella grylli] gi|159120619|gb|EDP45957.1| protein of unknown function [Rickettsiella grylli] Length = 392 Score = 213 bits (543), Expect = 3e-53, Method: Composition-based stats. Identities = 79/368 (21%), Positives = 140/368 (38%), Gaps = 28/368 (7%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFG 60 + L + I I + ++ ++ L + +P +GYY+ G GDF+TAPEIS +F Sbjct: 19 SDALCQLIREEINRAASLSFSRFMTLALYEPTWGYYTAGLEKLGKRGDFITAPEISPLFS 78 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 + + G ++E+G G G M +D+L + + + + + E S Sbjct: 79 QCVGRQYQQIVMHLG---QSNILEIGAGTGQMAVDLLLFLEQQHSLPDTYM---IFEMSP 132 Query: 121 RLTLIQKKQLASYGDKIN-WYTSLADVPLGFT--FLVANEFFDSLPIKQFVMTEHGIRER 177 L QK++L + + L D P ++ NE D+LP+ +FV I E+ Sbjct: 133 TLKQRQKERLKQAIPHLFPFIQWLDDWPKKPIKGVILVNEVVDALPVDRFVWHGECIYEQ 192 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIF-------ENSPCRDREMQSISDRLA 230 + + + + S + E ++ + +SD L Sbjct: 193 RVSY-KKGRFCYQLSPVRNGSLLNQLTQLKKNYFLTTPIYQSEVCQQLEQWVNKLSDALE 251 Query: 231 CDGGTAIVIDY----GYLQSRVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIA 285 + Y Y +V TL+ H + PL G D+++ VDF RL+ A Sbjct: 252 QGVAFIMDYGYTAKEYYHADKVNGTLRCYYQHRVHQDPLTLIGLQDITASVDFSRLAHAA 311 Query: 286 ILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELF 345 + + + G Q FL + + D V R V MGEL Sbjct: 312 VHTQFKVAGYIPQAAFLLNNDLLPLVEKCYDGLSSLD-----VNRQVHLLTSPSEMGELV 366 Query: 346 KILVVSHE 353 K++ ++ Sbjct: 367 KVMGLTRN 374 >gi|256823378|ref|YP_003147341.1| hypothetical protein Kkor_2163 [Kangiella koreensis DSM 16069] gi|256796917|gb|ACV27573.1| protein of unknown function DUF185 [Kangiella koreensis DSM 16069] Length = 404 Score = 213 bits (542), Expect = 3e-53, Method: Composition-based stats. Identities = 79/369 (21%), Positives = 138/369 (37%), Gaps = 16/369 (4%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIF 59 + KL + I I+K G + Q+ + +P GYYS + G GDFVTAPE S +F Sbjct: 34 VSYKLSQTIAGEIEKAGAIPFSQFMHHALYEPGLGYYSAGSHKLGEGGDFVTAPEFSPLF 93 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 + A I +EQ ++ELG G G +++++ + Y++E S Sbjct: 94 AKTFAQSFISIFEQS----AANVLELGAGTGTFAVELVKELEV---QGNLPEQYYILEVS 146 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADV--PLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 L Q++ + + D + ANE D+LPI + +++ Sbjct: 147 ADLKQRQRQAIELKIPHLANRFKWLDHLPNEFSGVIFANEVADALPIDLVRKQKSSLQKA 206 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTA- 236 + +D + + + G E + + + + L Sbjct: 207 QVKLDDSGFRLRWQDTGPMLETNPYVRQWPHGYTTEQHLQTEFWLGGLVECLTKGAIILV 266 Query: 237 ---IVIDYGYLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYI 292 D Y R T + H + L+ PG D+S+ V+F +++ A I Sbjct: 267 DYGYSADEYYAPQRTQGTLQCYYRQHKHNDALLLPGLQDISASVNFSQIAYAAHKLGAEI 326 Query: 293 NGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 G Q FL GI + A L + + + + + MGE K+L+ + Sbjct: 327 VGYCEQALFLMSSGIEREAAKLAETLGSDTMAQVKISQQLKKLLMPDEMGENCKVLIAAK 386 Query: 353 E-KVELMPF 360 V+L F Sbjct: 387 NCPVQLQGF 395 >gi|114331493|ref|YP_747715.1| hypothetical protein Neut_1505 [Nitrosomonas eutropha C91] gi|114308507|gb|ABI59750.1| protein of unknown function DUF185 [Nitrosomonas eutropha C91] Length = 391 Score = 213 bits (542), Expect = 4e-53, Method: Composition-based stats. Identities = 90/369 (24%), Positives = 152/369 (41%), Gaps = 35/369 (9%) Query: 5 LIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTCNPFGAVG-DFVTAPEISQIFGEM 62 L + + I G ++ Y + + PE GYYS DFVT+PEIS +FG+ Sbjct: 17 LKTILRDRISLSGGWISFADYMDIVLYTPEAGYYSGGAAKFGAAGDFVTSPEISPLFGQT 76 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 LA + ++E G G G + ++ L+ +++ S L Sbjct: 77 LARQVAQVLRSVNH---GSILEFGAGSG---KLAVDLLLALEALESLPEYYNILDLSADL 130 Query: 123 TLIQK----KQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERM 178 Q+ +++ ++ W T+L D G + ANE D++P+ V I ER Sbjct: 131 QQRQQAIIKQRIPHLAARVQWLTALPDQFKG--LIFANEVLDAMPVHLVVWKNGNIAERG 188 Query: 179 IDIDQHDSLVFN--IGDHEIKSNFLTCSD-----YFLGAIFENSPCRDREMQSISDRLAC 231 + ++ + + + E+ + + I E S + S++ L Sbjct: 189 VIWKDYELAWQDQPLTEGELLNVARQLPPSDQPSFPHPYISEISLANRHFIHSLASIL-- 246 Query: 232 DGGTAIVIDY------GYLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSI 284 G +++DY Y R T + + H + PL PG D++SHVDF +S I Sbjct: 247 QQGAILLVDYGFGQSEYYHPQRHQGTLMCHYRHHAHDDPLFLPGLQDITSHVDFSAISRI 306 Query: 285 AILYKLYINGLTTQGKFLEGLGIWQ-RAFSLMKQTARKDILLDSVKRLVSTSADKKSMGE 343 A+ L + G TTQ FL GI A + Q L+ + L+S MGE Sbjct: 307 ALDSGLQLAGYTTQAHFLINCGITDLLAHTPADQPGIYLPLVSQAQCLIS----PAEMGE 362 Query: 344 LFKILVVSH 352 LFK+++++ Sbjct: 363 LFKVMILNK 371 >gi|194386570|dbj|BAG61095.1| unnamed protein product [Homo sapiens] Length = 370 Score = 213 bits (542), Expect = 4e-53, Method: Composition-based stats. Identities = 101/358 (28%), Positives = 153/358 (42%), Gaps = 64/358 (17%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P GYY + G GDF+T+PEISQIFGE+ Sbjct: 41 TPMLRHLMYKIKSTGPITVAEYMKEVLTNPAKGYYVYRDMLGEKGDFITSPEISQIFGEL 100 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV-LSIYMVETSER 121 L I+ I W G + +LVELGPGRG ++ DILRV +L + +S+++VE + + Sbjct: 101 LGIWFISEWMATGKSTAFQLVELGPGRGTLVGDILRVFTQLGSVLKNCDISVHLVEKTPQ 160 Query: 122 LTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 G RE +DI Sbjct: 161 ---------------------------------------------------GWREVFVDI 169 Query: 182 DQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVI 239 D D L F + + D E P ++ +S R+A GG A+V Sbjct: 170 DPQVSDKLRFVLAPSATPAEAFIQHD-ETRDHVEVCPDAGVIIEELSQRIALTGGAALVA 228 Query: 240 DYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQG 299 DYG+ ++ DT + H L+ PG ADL++ VDF L +A K+ G Q Sbjct: 229 DYGHDGTKT-DTFRGFCDHKLHDVLIAPGTADLTADVDFSYLRRMA-QGKVASLGPIKQH 286 Query: 300 KFLEGLGIWQRAFSLMKQTAR---KDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 FL+ +GI R L+ ++ + LL L+ + K MGE F + + Sbjct: 287 TFLKNMGIDVRLKVLLDKSNEPSVRQQLLQGYDMLM----NPKKMGERFNFFALLPHQ 340 >gi|114576972|ref|XP_001167175.1| PREDICTED: protein midA homolog, mitochondrial-like isoform 4 [Pan troglodytes] Length = 370 Score = 213 bits (542), Expect = 4e-53, Method: Composition-based stats. Identities = 101/358 (28%), Positives = 153/358 (42%), Gaps = 64/358 (17%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P GYY + G GDF+T+PEISQIFGE+ Sbjct: 41 TPMLRHLMYKIKSTGPITVAEYMKEVLTNPAKGYYVYRDMLGEKGDFITSPEISQIFGEL 100 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV-LSIYMVETSER 121 L I+ I W G + +LVELGPGRG ++ DILRV +L + +S+++VE + + Sbjct: 101 LGIWFISEWMATGKSTAFQLVELGPGRGTLVGDILRVFTQLGSVLKNCDISVHLVEKTPQ 160 Query: 122 LTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 G RE +DI Sbjct: 161 ---------------------------------------------------GWREVFVDI 169 Query: 182 DQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVI 239 D D L F + + D E P ++ +S R+A GG A+V Sbjct: 170 DPQVSDKLRFVLAPSATPAEAFIQHD-ETRDHVEVCPDAGVIIEELSQRIALTGGAALVA 228 Query: 240 DYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQG 299 DYG+ ++ DT + H L+ PG ADL++ VDF L +A K+ G Q Sbjct: 229 DYGHDGTKT-DTFRGFCDHKLHDVLIAPGTADLTADVDFSYLRRMA-QGKVASLGPIKQH 286 Query: 300 KFLEGLGIWQRAFSLMKQTAR---KDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 FL+ +GI R L+ ++ + LL L+ + K MGE F + + Sbjct: 287 TFLKNMGIDVRLKVLLDKSNEPSVRQQLLQGYDMLM----NPKKMGERFNFFALLPHQ 340 >gi|302811396|ref|XP_002987387.1| hypothetical protein SELMODRAFT_126162 [Selaginella moellendorffii] gi|300144793|gb|EFJ11474.1| hypothetical protein SELMODRAFT_126162 [Selaginella moellendorffii] Length = 392 Score = 213 bits (541), Expect = 4e-53, Method: Composition-based stats. Identities = 122/389 (31%), Positives = 183/389 (47%), Gaps = 57/389 (14%) Query: 25 FALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVE 84 + +P GYY FGA G F+T+P++SQ+FGEM+ I+ + WE+ G P ++LVE Sbjct: 1 MEEVLTNPSAGYYLHQEVFGAAGSFITSPDVSQMFGEMIGIWSVSLWEKMGKPRKLQLVE 60 Query: 85 LGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLAS------------ 132 LGPGRG +M D+LR K DF LSI+ VE S L Q++ L Sbjct: 61 LGPGRGTLMQDLLRSTLTFK-DFSKALSIHFVECSPALRKQQRRALQCPGEEKKHEGGDR 119 Query: 133 ----------YGDKINWYTSLADVPLG-FTFLVANEFFDSLPIKQF-----VMTEHGIRE 176 + ++WY L DVP G T ++A EFFD+LPI QF T G E Sbjct: 120 PAVENSRSQRFETNVSWYLDLKDVPRGVPTIIIAQEFFDALPIHQFQHRLSQKTPVGWCE 179 Query: 177 RMIDIDQHDSLVFNIG-----------DHEIKSNFLTCSDYFLGAIFENSPCRDREMQSI 225 ++ID+D + F + + N++T + E P + Q I Sbjct: 180 KLIDVDSRQANPFRFVLSSQPTAATLLYLKKRLNWITAVEEESIHHIEVCPKALQVSQEI 239 Query: 226 SDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIA 285 + R+A D G I+IDYG + V D+ QA++ H +V+ L PG ADLS+HVDF + + Sbjct: 240 AKRVAEDSGGGIIIDYGKDEP-VTDSFQAIRNHEFVNVLDKPGTADLSAHVDFAAIKRMV 298 Query: 286 ILYK---LYINGLTTQGKFLEGLGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADK-- 338 + G Q +FL LGI R +L+K + + + L RLV Sbjct: 299 AETASPTVSTYGPIFQQEFLAMLGINVRLEALVKDASDEQGEKLQLGYWRLVGEGPPPWL 358 Query: 339 ---------KSMGELFKILVVSHEKVELM 358 + MG+ +++L ++ K+ Sbjct: 359 SEGDDGYKIQGMGKHYRVLAIADSKLGAP 387 >gi|309361569|emb|CAP29424.2| hypothetical protein CBG_09880 [Caenorhabditis briggsae AF16] Length = 390 Score = 213 bits (541), Expect = 4e-53, Method: Composition-based stats. Identities = 106/371 (28%), Positives = 166/371 (44%), Gaps = 41/371 (11%) Query: 25 FALCVADPEFGYYST----CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCV 80 V+ P GYY FG GDF+T+PE++Q+FGEM+ +++ G Sbjct: 1 MKTSVSAPVVGYYGQFSRDQKVFGEKGDFITSPELTQLFGEMIGVWVFHELANTGHKGSW 60 Query: 81 RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGD----- 135 +LVELGPGR +M D+L + K + +S+++VE S+ L Q+ L Y Sbjct: 61 QLVELGPGRAQLMNDVLNALAKF---NDNDVSVHLVEMSDALIDEQENFLCIYNSENTKG 117 Query: 136 -------------KINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 I WY S+ D+P GFT +ANEF D+LP+ QF T +E +++ Sbjct: 118 TPHVRKNKTRTGVNIYWYKSIDDIPDGFTVFIANEFLDALPVHQFKKTGDLWKEVYVNLT 177 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDYFLGA----IFENSPCRDREMQSISDRLACDGGTAIV 238 + L F E +E SP + I DR+ GG +++ Sbjct: 178 KEGDLRFMTSKGENLHTKGLIPAAIRNENSRLTWECSPESGTVVNQIVDRITTFGGFSLL 237 Query: 239 IDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQ 298 +DYG+ SR + +A K H V PL NPG DL++ VDF L+ + + + G Q Sbjct: 238 VDYGHDGSRNTHSFRAYKNHEQVDPLSNPGVVDLTADVDFGYLT-TLVEDRALVYGPIEQ 296 Query: 299 GKFLEGLGIWQRAFSLM---KQTARKDILLDSV--KRLVSTSADK------KSMGELFKI 347 FL LGI R L+ K ++ L+ + L++ + MG FK Sbjct: 297 RVFLTQLGIEHRLRRLLQICKNREEQEQLISEHIFQILLNPFILESYNMLIGDMGTKFKA 356 Query: 348 LVVSHEKVELM 358 + + ++ + Sbjct: 357 WALFPKTLKFI 367 >gi|302796294|ref|XP_002979909.1| hypothetical protein SELMODRAFT_111843 [Selaginella moellendorffii] gi|300152136|gb|EFJ18779.1| hypothetical protein SELMODRAFT_111843 [Selaginella moellendorffii] Length = 392 Score = 213 bits (541), Expect = 4e-53, Method: Composition-based stats. Identities = 123/389 (31%), Positives = 182/389 (46%), Gaps = 57/389 (14%) Query: 25 FALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVE 84 + +P GYY FGA G F+T+P++SQ+FGEM+ I+ + WEQ G P ++LVE Sbjct: 1 MEEVLTNPSAGYYLHQEVFGAAGSFITSPDVSQMFGEMIGIWSVSLWEQMGKPRKLQLVE 60 Query: 85 LGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLAS------------ 132 LGPGRG +M D+LR K DF LSI+ VE S L Q++ L Sbjct: 61 LGPGRGTLMQDLLRSTLTFK-DFSKALSIHFVECSPALRKQQRRALQCPSEEKKHEGGDR 119 Query: 133 ----------YGDKINWYTSLADVPLG-FTFLVANEFFDSLPIKQF-----VMTEHGIRE 176 + + WY L DVP G T ++A EFFD+LPI QF T G E Sbjct: 120 PAVENSRSQRFETNVAWYLDLKDVPRGVPTIIIAQEFFDALPIHQFQHRLSQKTPVGWCE 179 Query: 177 RMIDIDQHDSLVFNIG-----------DHEIKSNFLTCSDYFLGAIFENSPCRDREMQSI 225 ++ID+D + F + + N++T + E P + Q I Sbjct: 180 KLIDVDSRQANPFRFVLSSQPTAATLLYLKKRLNWITAVEEESIHHIEVCPKALQVSQEI 239 Query: 226 SDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIA 285 + R+A D G I+IDYG + V D+ QA++ H +V+ L PG ADLS+HVDF + + Sbjct: 240 AKRVAEDSGGGIIIDYGKDEP-VTDSFQAIRNHEFVNVLDKPGTADLSAHVDFAAIKRMV 298 Query: 286 ILYK---LYINGLTTQGKFLEGLGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADK-- 338 + G Q +FL LGI R +L+K + + + L RLV Sbjct: 299 AETASPTVSTYGPIFQQEFLAMLGINVRLEALVKDASDEQGEKLQLGYWRLVGEGPPPWL 358 Query: 339 ---------KSMGELFKILVVSHEKVELM 358 + MG+ +++L ++ K+ Sbjct: 359 SEGDDGYKIQGMGKHYRVLAIADSKLGAP 387 >gi|39995584|ref|NP_951535.1| hypothetical protein GSU0476 [Geobacter sulfurreducens PCA] gi|39982347|gb|AAR33808.1| conserved hypothetical protein [Geobacter sulfurreducens PCA] gi|298504603|gb|ADI83326.1| protein of unknown function DUF185 [Geobacter sulfurreducens KN400] Length = 386 Score = 213 bits (541), Expect = 5e-53, Method: Composition-based stats. Identities = 94/370 (25%), Positives = 160/370 (43%), Gaps = 25/370 (6%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIF 59 +++L I + IK+ G++ + A C+ +P GYY++ GA GDF T+ + ++F Sbjct: 6 DSRLRGIIHDRIKERGGRIPFADFMAACLYEPGLGYYTSPGRKVGAEGDFYTSINVHRVF 65 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 G ++ + WE G P+ LVE G G G + D+L + +L P+ ++ L++ +VE Sbjct: 66 GRLIGREICRMWEVMGCPAPFTLVEAGAGHGRLAADVLDAVRELNPELYASLTLRLVEAE 125 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPLG----FTFLVANEFFDSLPIKQFVMTEHGIR 175 L Q++ LA + D++ + A++ G L +NE DS P MT G+R Sbjct: 126 PSLAEAQRQVLAEHLDRV-AWNDPAELMGGTLTFTGCLYSNELIDSFPTHVVEMTPAGLR 184 Query: 176 ERMIDIDQHD-SLVFNIGDHEIKSNFLTCSD--YFLGAIFENSPCRDREMQSISDRLACD 232 E + D + ++ +++ D G E + R ++ ++ L Sbjct: 185 EVFVTADGDGFAEQLDLPSTPDLADYFRRIDVNLQPGQRTEINLNACRWLEGVARCLER- 243 Query: 233 GGTAIVIDYG------YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSIA 285 G + +DYG Y R TL HT P G D++SHVDF L Sbjct: 244 -GFVLTVDYGFLSPELYGPMRQNGTLLCYFRHTIQEDPYQRVGHQDITSHVDFTTLILRG 302 Query: 286 ILYKLYINGLTTQGKFLEGLG---IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMG 342 L+ Q +FL G + + + +K+LV + MG Sbjct: 303 EELGLHKAWFGEQYRFLMAAGLMEELMALEAAAATEEERIKIRLVLKKLVLP---EGGMG 359 Query: 343 ELFKILVVSH 352 + FKILV + Sbjct: 360 DTFKILVQAK 369 >gi|325923724|ref|ZP_08185343.1| hypothetical protein XGA_4389 [Xanthomonas gardneri ATCC 19865] gi|325545810|gb|EGD17045.1| hypothetical protein XGA_4389 [Xanthomonas gardneri ATCC 19865] Length = 394 Score = 212 bits (540), Expect = 6e-53, Method: Composition-based stats. Identities = 77/363 (21%), Positives = 129/363 (35%), Gaps = 32/363 (8%) Query: 20 TVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPS 78 ++ L + P GYYS + FG GDFVTAPE+ +F ++ L +Q G Sbjct: 35 PFSRFMELALYAPGLGYYSAGSSKFGEAGDFVTAPELGPLFAATVSGALAPVLQQLG--P 92 Query: 79 CVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKIN 138 R++E+G G G + +L ++E S L Q+++L I Sbjct: 93 TARVLEVGGGSGAFAEV---TLKRLLELDALPERYAILEPSADLRERQRERLG--RTLIP 147 Query: 139 WYTSLADVPLGFT------FLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIG 192 L + L ANE D+LP +F + + + E + +D Sbjct: 148 PVFDLVEWLDAPFPDDWDGVLFANEVIDALPTPRFALRDGQVYEETVVLDAQQQFARGEQ 207 Query: 193 DHEI-------KSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG--- 242 + + G E P +Q+++ L + Y Sbjct: 208 PADALLGAAVRHLERYLQQPFADGYRSELLPQLPYWIQAVAGGLKRGAMLFVDYGYPRGE 267 Query: 243 -YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 Y R TL+A H PG DL++ VDF L+ + G TQ Sbjct: 268 FYRAQREDGTLRAFYRHRMHEDLYRWPGLQDLTASVDFTALAEAGTGAGFELAGYCTQSS 327 Query: 301 FLEGLGIWQRAFSLMKQTARKDIL--LDSVKRLVSTSADKKSMGELFKILVVSHEKVELM 358 FL G G+ +L+ Q + ++ + MGE F+++ + + Sbjct: 328 FLLGNGLD----ALLAQADTRTDEVGRMRLREQIKRLTLPSEMGERFQVMGFARDVDFAP 383 Query: 359 PFV 361 F+ Sbjct: 384 AFL 386 >gi|116074786|ref|ZP_01472047.1| hypothetical protein RS9916_29669 [Synechococcus sp. RS9916] gi|116068008|gb|EAU73761.1| hypothetical protein RS9916_29669 [Synechococcus sp. RS9916] Length = 404 Score = 212 bits (539), Expect = 7e-53, Method: Composition-based stats. Identities = 79/368 (21%), Positives = 149/368 (40%), Gaps = 32/368 (8%) Query: 14 KKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIFGEMLAIFLICAWE 72 + G+M+ ++ L + DP G Y + G GDFVT+P + F +LA L + Sbjct: 18 AEGGRMSFRRFMELALHDPVDGAYGSGRLRVGTKGDFVTSPSMGSDFAALLATQLAEWLD 77 Query: 73 QHGFPSCVR---LVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQ 129 Q + LVE+GPG G + D+ + +L P + L + +VE + + Q+++ Sbjct: 78 QIHAEAPASPLSLVEVGPGEGDLAADVWAELHRLNPAWIGQLELVLVERNPGMESRQRER 137 Query: 130 LASYGDKINWYTSLADVPLGFT--FLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHD-- 185 LA+ +T+L + LVA+E D+ P+++ + + +R+ + + D Sbjct: 138 LAASAPGQVRWTTLDKLAADPIRGVLVAHELLDAFPVERLIWRDGAMRQMGVVLTSDDAG 197 Query: 186 SLVFNIGDHEIKSNFL--------------TCSDYFLGAIFENSPCRDREMQSISDRLAC 231 V + D + + + G E ++ ++ +A Sbjct: 198 QKVLHWDDQMLPDSIQQQLDWADRHCGISVPPAQVPEGWTTEWHGEVAPWLEQVAKAVA- 256 Query: 232 DGGTAIVIDY------GYLQSRVGDTLQAVKGHTYVS-PLVNPGQADLSSHVDFQRLSSI 284 G +++DY Y R TL A L + G DL++H+ L + Sbjct: 257 -SGVMLIVDYAHDAERYYSARRFAGTLLAYHQQQASDELLADAGCRDLTAHLCIDTLLAQ 315 Query: 285 AILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGEL 344 A + G QG+ L LG+ +R +L + A + + + D +GE Sbjct: 316 ARDQGWTVLGQCRQGEALLALGLGERLHNLQRLPATELPQALQRREALLRLVDPAGLGE- 374 Query: 345 FKILVVSH 352 F+ + +S Sbjct: 375 FRWIALSR 382 >gi|296224080|ref|XP_002757898.1| PREDICTED: protein midA homolog, mitochondrial-like isoform 3 [Callithrix jacchus] Length = 378 Score = 212 bits (539), Expect = 7e-53, Method: Composition-based stats. Identities = 107/365 (29%), Positives = 155/365 (42%), Gaps = 71/365 (19%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P GYY + G GDF+T+PEISQIFGE+ Sbjct: 42 TPMLRHLMYKIKSTGPITVAEYMKEVLTNPAKGYYVHRDMLGEKGDFITSPEISQIFGEL 101 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV-LSIYMVETSER 121 L I+ I W G + +LVELGPGRG ++ DILRV +L + +S+++VE + + Sbjct: 102 LGIWFISEWMATGKSTAFQLVELGPGRGTLVGDILRVFSQLGSVLKNCDISVHLVEKTPQ 161 Query: 122 LTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 G RE IDI Sbjct: 162 ---------------------------------------------------GWREVFIDI 170 Query: 182 DQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVI 239 D D L F + + D I E P ++ +S R+A GG A+V Sbjct: 171 DPQVSDKLRFVLAPCATPAEVFIQHDETRDHI-EVCPDAGVIIEELSRRIALTGGAALVA 229 Query: 240 DYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQG 299 DYG+ ++ DT + GH L+ PG ADL++ VDF L +A K+ G TQ Sbjct: 230 DYGHDGTKT-DTFRGFCGHKLHDVLIAPGTADLTADVDFSFLRRMA-QGKVASLGPITQH 287 Query: 300 KFLEGLGIWQRAFS--------LMKQTAR--KDILLDSVKRLVSTSADKKSMGELFKILV 349 FL+ +GI R L K + K LL L+ + K MGE F Sbjct: 288 TFLKNMGIDVRLKVRIFFFPVLLDKSNEQSVKQQLLQGYDMLM----NPKKMGERFNFFA 343 Query: 350 VSHEK 354 + + Sbjct: 344 LLPHQ 348 >gi|121999108|ref|YP_001003895.1| hypothetical protein Hhal_2330 [Halorhodospira halophila SL1] gi|121590513|gb|ABM63093.1| protein of unknown function DUF185 [Halorhodospira halophila SL1] Length = 394 Score = 211 bits (538), Expect = 9e-53, Method: Composition-based stats. Identities = 83/376 (22%), Positives = 138/376 (36%), Gaps = 28/376 (7%) Query: 2 ENKLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIF 59 +L ++I + I + G +T + Y + +P GYY FG GDF TAPE+S +F Sbjct: 18 SQRLSKRIRSAIDEAGGALTFEAYMDRALYEPGLGYYRGGAARFGVGGDFATAPELSSLF 77 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 L G ++ELGPG G + L + L M+E S Sbjct: 78 SRTLGRQAAEILGHLGGGD---VIELGPGTGRLAAAALAELEHLDRLP---RRWRMLEVS 131 Query: 120 ERLTLIQKKQLASYGDKINWYTSLAD--VPLGFT-FLVANEFFDSLPIKQFVMTEHGIRE 176 L Q++ LA+ + + +ANE D+LP+++FV G+RE Sbjct: 132 AALRQEQEQTLAARVPHLLDRVEWLEALPADSPQAVWIANEVLDALPVRRFVKRGDGVRE 191 Query: 177 RMIDIDQHDSLVFNI------GDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLA 230 + + + G + E ++ +++ L Sbjct: 192 LGVVAADDGFAWTELAADTALQQAVADIEARLDAPLPDGYVSEVCTRVAPFLRGLAEALP 251 Query: 231 CDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYVS-PLVNPGQADLSSHVDFQRLSSIA 285 + Y YL R TL H P PG D+++ VDF ++ A Sbjct: 252 QGVMLWLDYGYPRREYYLAERHRGTLLCHFRHRAHDDPFFYPGLQDITAFVDFTAVADAA 311 Query: 286 ILYKLYINGLTTQGKFLEGLGIWQ--RAFSLMKQTARKDILLDSVKRLVSTSADKKSMGE 343 + L + G QG FL G G+ A A + + ++R+ MGE Sbjct: 312 LACGLDVLGYAPQGPFLMGAGLAACTEAELAGADAAGHMAVSNEIQRITH----PGDMGE 367 Query: 344 LFKILVVSHEKVELMP 359 FK++ + + + Sbjct: 368 RFKVMALGRDYDGPLG 383 >gi|328351542|emb|CCA37941.1| Protein midA homolog, mitochondrial [Pichia pastoris CBS 7435] Length = 399 Score = 211 bits (537), Expect = 1e-52, Method: Composition-based stats. Identities = 107/397 (26%), Positives = 171/397 (43%), Gaps = 67/397 (16%) Query: 25 FALCVADPEFGYYSTCNP---FGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVR 81 C+ PEFGYY+T +P DFVT+PEISQ FGEM+ I+ W G P VR Sbjct: 1 MKQCLVHPEFGYYTTRDPLSPISETSDFVTSPEISQTFGEMIGIYHYTTWLLQGKPKEVR 60 Query: 82 LVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLA---------- 131 +E GPG+G ++ D +R +L + + +VE S L Q+K+L Sbjct: 61 FIEFGPGKGTLIFDCIRTFERLSKGTV-LYEVILVEASPILREEQRKKLCGDTSLNVLED 119 Query: 132 ----------SYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 + + G +++A+EFFD+LP++Q+ T+ G RE M+D Sbjct: 120 GTWEAETLAGKRCHWVETELDIKKT--GTNYIIAHEFFDALPVQQYEKTKDGWREYMVDF 177 Query: 182 DQHD-----------------------------SLVFNIGDHEIKSNFLTCSD-----YF 207 + + + + HE +++ ++ Sbjct: 178 SEKNVIRAKTDPLALPNRTTITSKELENPALRFNFHSVLSPHETPGSYIPKNNPRYEALP 237 Query: 208 LGAIFENSPCRDREMQSISDRLA--CDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLV 265 +G+ E P + I+ + C G ++IDYG + +T++ +K H VSP Sbjct: 238 VGSRIEICPEAHTYSKHIAALINSGCGAGGCLIIDYGPADTVPINTIRGIKNHKIVSPFD 297 Query: 266 NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSL---MKQTARKD 322 +PG DLS+ VDF +L I ++G QG +L LG+ R L K K Sbjct: 298 DPGNVDLSADVDFGQLKQIFENQSCQVHGPVAQGDWLHELGLGFRTDQLVHIAKSENDKQ 357 Query: 323 ILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMP 359 ++ + KRLV S MG ++K L V ++ P Sbjct: 358 KIIKAYKRLVGKSHGD--MGSIYKFLAVLPNGSQIQP 392 >gi|223938952|ref|ZP_03630838.1| protein of unknown function DUF185 [bacterium Ellin514] gi|223892379|gb|EEF58854.1| protein of unknown function DUF185 [bacterium Ellin514] Length = 363 Score = 211 bits (537), Expect = 1e-52, Method: Composition-based stats. Identities = 75/364 (20%), Positives = 131/364 (35%), Gaps = 23/364 (6%) Query: 7 RKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPF-GAVGDFVTAPEISQIFGEMLAI 65 I+ + G + ++ L + P+FGYY + G GDF T+ + +FGE+LA Sbjct: 6 EIILKEVLVKGIIPFSRFMELALYCPKFGYYERQDVSPGRKGDFYTSVSVGALFGELLAF 65 Query: 66 FLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLI 125 W + ++VE G G + DIL+ I L+P L +++E SE Sbjct: 66 QF-SEWLYALSVAKCQIVEAGAHDGRLARDILQEIKVLQPQLSDNLEYWIIEPSEARQGW 124 Query: 126 QKKQLASYGDKINWYTSLADVPLG--FTFLVANEFFDSLPIKQFVMT-EHGIRERMIDID 182 Q L + WY S + P + +NE D++P+ + I Sbjct: 125 QADTLGELARSVRWYRSWQETPETGVNGVIFSNELLDAMPVHRMGWDASERKWFEWGVIL 184 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDYF--------LGAIFENSPCRDREMQSISDRLACDGG 234 + ++ + E P + + +L Sbjct: 185 EGGRFAWSRMPQTNPEIGGALLELPQEFTAVLPDEFTTEVCPVALNWWRQAAGKLKSGKL 244 Query: 235 TA----IVIDYGYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYK 289 A + D +R TL+ + H L + G D+++HV+F + Sbjct: 245 LAIDYGLTADQFLTPARRNGTLRSYYRHHQSDDLLADAGNQDITAHVNFSAVQKTGESQG 304 Query: 290 LYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILV 349 L GL +Q +FL + A + ++ R T + +GE F++LV Sbjct: 305 LKTEGLWSQAQFLTRI-----AGRIFERKGHHAAWNSGKVRQFQTLTHPEHLGESFRVLV 359 Query: 350 VSHE 353 S E Sbjct: 360 QSAE 363 >gi|74318720|ref|YP_316460.1| hypothetical protein Tbd_2702 [Thiobacillus denitrificans ATCC 25259] gi|74058215|gb|AAZ98655.1| conserved hypothetical protein [Thiobacillus denitrificans ATCC 25259] Length = 384 Score = 211 bits (537), Expect = 2e-52, Method: Composition-based stats. Identities = 88/362 (24%), Positives = 145/362 (40%), Gaps = 30/362 (8%) Query: 14 KKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVG-DFVTAPEISQIFGEMLAIFLICAWE 72 G + ++ + P GYY+ DFVTAPE++ +FG LA + Sbjct: 29 AAGGWIPFSRFMQAVLYAPGLGYYAAGAAKFGAAGDFVTAPEMTPLFGRTLAHAIAPVLR 88 Query: 73 QHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKK---- 128 + G ++ELG G G + D+L + L ++E S L Q++ Sbjct: 89 ETGGD----VLELGGGSGRLAADLLAELDTLGALP---PRYRILEVSADLRARQQQTVAT 141 Query: 129 QLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLV 188 L ++ W ++ + G ++ NE D+LP + T G R R + Sbjct: 142 DLPRLASRVEWLDAVPERFSG--VILGNEVLDALPAELVHWTADGPRLRGVVRKGDGFAW 199 Query: 189 FN-IGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDY------ 241 + + T G + E +P + + S++ L G ++IDY Sbjct: 200 EDGPIADDALRERATALALAPGYVSEINPAAEALVASLAQCLVR--GLILMIDYGFGARE 257 Query: 242 GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 Y R TL+ + H P PG DL++HVDF ++ + L + G T+Q Sbjct: 258 YYHPQRSMGTLRVHYRHHALDDPFYLPGLCDLTAHVDFSAIARAGVAAGLELAGYTSQAS 317 Query: 301 FLEGLGIWQRAFSL-MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKV-ELM 358 FL G+ + + A ++V+RLVS MGELFK++ + V EL Sbjct: 318 FLLSGGLAELLMQTPPEDAATYLPQANAVQRLVS----PAEMGELFKVIGFTRGAVGELA 373 Query: 359 PF 360 F Sbjct: 374 GF 375 >gi|298245847|ref|ZP_06969653.1| protein of unknown function DUF185 [Ktedonobacter racemifer DSM 44963] gi|297553328|gb|EFH87193.1| protein of unknown function DUF185 [Ktedonobacter racemifer DSM 44963] Length = 371 Score = 210 bits (534), Expect = 3e-52, Method: Composition-based stats. Identities = 77/386 (19%), Positives = 135/386 (34%), Gaps = 53/386 (13%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYY-STCNPFGAVGDFVTAPEISQIFG 60 L I++ I+++G + +Y + + +P GYY S G GDF T+ ++S+IF Sbjct: 4 TTPLQAHILDTIRQHGPIPFAEYMRMALYEPGQGYYVSGKTRVGWEGDFFTSSDVSEIFA 63 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 L+ WE P+ ++E G RG++ I + P F L + Sbjct: 64 HCTGRQLLQMWEMLKQPAHFLVLEQGANRGLLGEGIRAWASQEHPAFHQALHYVSSDIGA 123 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 + T L++NE D+ P+ + E + Sbjct: 124 GEDAL-----------------SVASEQAPTVLLSNELVDAFPVHLVEKRGSELYEVYVG 166 Query: 181 IDQHDSLVFNIGDHE------IKSNFLTCSDYFLGAIFENSPCRDREMQSISDRL----- 229 + + Y G E + M+ + L Sbjct: 167 EQNGRFGELLQAPSSEEVASYLDDYKIPWRTYPDGWRAEINLDARHWMRRSAQLLLGSNP 226 Query: 230 -ACDGGTAIVIDYG------YLQSRVGDTLQAVKGHTYVS-PLVNPGQADLSSHVDFQRL 281 G + IDYG Y+ R TL H PL+ PG+ D+++HV+F L Sbjct: 227 KRKRRGFLLAIDYGDLARQHYIPERFTGTLACYYQHQLTERPLLRPGEQDITAHVNFSTL 286 Query: 282 SSIAILYKLYINGLTTQGKFLEGLGIWQRAFSL-MKQTARKDILLDS------------- 327 L ++ TTQ +L +GI++ + + A D + + Sbjct: 287 IEEGRRQGLRLHKFTTQRAWLTDMGIYEELERIRQRDFAALDDIQNRASDQGQVQLLQWY 346 Query: 328 -VKRLVSTSADKKSMGELFKILVVSH 352 V++ VS +G FK+L++ Sbjct: 347 NVRQRVSALTSPGGLGG-FKVLILKR 371 >gi|218248555|ref|YP_002373926.1| hypothetical protein PCC8801_3820 [Cyanothece sp. PCC 8801] gi|218169033|gb|ACK67770.1| protein of unknown function DUF185 [Cyanothece sp. PCC 8801] Length = 377 Score = 210 bits (534), Expect = 3e-52, Method: Composition-based stats. Identities = 95/365 (26%), Positives = 154/365 (42%), Gaps = 24/365 (6%) Query: 6 IRKIVNLIKKN--GQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIFGEM 62 + I+ I+++ ++T Y L + P+ GYYS+ N G GDF TA + FGE+ Sbjct: 1 MEFILEAIQQSPEHKITFAHYMNLALYHPQKGYYSSGNAKIGTQGDFFTASSLGADFGEL 60 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 LA + W G+P+ LVE+G G G DIL + P F+ + ++E ++ L Sbjct: 61 LAEQFLEMWSILGYPNRFSLVEVGAGAGFFAADILDYLSNQHPHFYEAIEYLIIEEAKGL 120 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 QK QL + + + +NE D+ P+ + + ++E + I Sbjct: 121 IEQQKAQLKNRDKVSWKSWDEIENCSIIGCIFSNELIDAFPVHLVTLEQGKLQEIYVTI- 179 Query: 183 QHDSLVFNIGD---------HEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDG 233 + I D E+ LT DY E + +++++++L Sbjct: 180 SEGKITEAIADLSTSQLRDYFELVEVNLTAKDYPENYRTEVNLAALDCLKTVANKLKK-- 237 Query: 234 GTAIVIDYGYL------QSRVGDTLQAVKGHTYVS-PLVNPGQADLSSHVDFQRLSSIAI 286 G + IDYGY R TL+ H + P VN G+ D+++HV+F L + Sbjct: 238 GYLLTIDYGYTAQKYYHPQRYQGTLKCYYKHRHHDNPYVNIGEQDITTHVNFTALENHGE 297 Query: 287 LYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFK 346 L L G T QG FL GLG+ R L ++ L D +G F Sbjct: 298 LLGLDKLGFTQQGLFLMGLGLGDRLSDLSNGNYSFFEVIQRRDSL-HQLIDPMGLGG-FG 355 Query: 347 ILVVS 351 +L+ S Sbjct: 356 VLLQS 360 >gi|261330287|emb|CBH13271.1| hypothetical protein, conserved [Trypanosoma brucei gambiense DAL972] Length = 428 Score = 210 bits (534), Expect = 3e-52, Method: Composition-based stats. Identities = 106/383 (27%), Positives = 183/383 (47%), Gaps = 28/383 (7%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYS-TCNPFG-AVGDFVTAPEISQI 58 ++ L ++++ + G + Q+ C+ P+ GYYS N G DF+TA EI Sbjct: 41 LKTPLCVELISKMSSQGYFPMSQFVKECLTHPQHGYYSTKKNVIGSEKADFITAAEI-PF 99 Query: 59 FGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVET 118 F ++++ +++ W++ G P + LVELGPGRG +M +IL+ I P L I++VE Sbjct: 100 FADIVSAWIMDVWQKMGTPRVLHLVELGPGRGTLMKNILKQIKYSNPHLLHFLQIHLVEV 159 Query: 119 SERLTLIQKKQLASY---GDKINWYTSLADVP--LGFTFLVANEFFDSLPIKQFVMTEHG 173 T Q+ L+ + KI W+ L +P L T +ANE+FD+LP+ QF TE G Sbjct: 160 GAARTDEQRSALSEFQTAQKKIKWWMGLESIPLTLEPTVYIANEYFDALPVAQFRYTERG 219 Query: 174 IRERMIDIDQHDSLVFNIGDHEIKSNFLTC--------SDYFLGAIFENSPCRDREMQSI 225 E +++D+ + + S + ++ +G E + + M+ I Sbjct: 220 WVETCLEVDEDPAHEAHFRMVHAPSGSFSAYLIPNDVRANGKIGDCIEINAVGMQTMELI 279 Query: 226 SDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIA 285 ++ +A +I + TL+ ++GH +V PL++PG+ DLSS V F++L Sbjct: 280 MKKMVECQKSACLIVDYGKDEHMHSTLRGIRGHRFVDPLLSPGEVDLSSWVSFKQLRWSM 339 Query: 286 ILY-----KLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKD--ILLDSVKRLVSTSADK 338 L + +Q +FL+ GI R ++K K +L + +RL+ DK Sbjct: 340 ERLETARRHLKWFPVISQSEFLQWGGIDVRLAHVIKDEETKSAMKILQNYRRLM----DK 395 Query: 339 KSMGELFKILVVSHEKVE-LMPF 360 MGE +K+ + + PF Sbjct: 396 NEMGESYKVFALQTRNFPNVSPF 418 >gi|72392599|ref|XP_847100.1| hypothetical protein [Trypanosoma brucei TREU927] gi|62175613|gb|AAX69746.1| hypothetical protein, conserved [Trypanosoma brucei] gi|70803130|gb|AAZ13034.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain 927/4 GUTat10.1] Length = 428 Score = 210 bits (534), Expect = 3e-52, Method: Composition-based stats. Identities = 106/383 (27%), Positives = 183/383 (47%), Gaps = 28/383 (7%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYS-TCNPFG-AVGDFVTAPEISQI 58 ++ L ++++ + G + Q+ C+ P+ GYYS N G DF+TA EI Sbjct: 41 LKTPLCVELISKMSSQGYFPMSQFVKECLTHPQHGYYSTKKNVIGSEKADFITAAEI-PF 99 Query: 59 FGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVET 118 F ++++ +++ W++ G P + LVELGPGRG +M +IL+ I P L I++VE Sbjct: 100 FADIVSAWIMDVWQKMGTPRVLHLVELGPGRGTLMKNILKQIKYSNPHLLHFLQIHLVEV 159 Query: 119 SERLTLIQKKQLASY---GDKINWYTSLADVP--LGFTFLVANEFFDSLPIKQFVMTEHG 173 T Q+ L+ + KI W+ L +P L T +ANE+FD+LP+ QF TE G Sbjct: 160 GAARTDEQRSALSEFQTAQKKIKWWMGLESIPLTLEPTVYIANEYFDALPVAQFRYTERG 219 Query: 174 IRERMIDIDQHDSLVFNIGDHEIKSNFLTC--------SDYFLGAIFENSPCRDREMQSI 225 E +++D+ + + S + ++ +G E + + M+ I Sbjct: 220 WVETCLEVDEDPAHEAHFRMVHAPSGSFSAYLIPNDVRANGKIGDCIEINAVGMQTMELI 279 Query: 226 SDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIA 285 ++ +A +I + TL+ ++GH +V PL++PG+ DLSS V F++L Sbjct: 280 MKKMVECQKSACLIVDYGKDEHMHSTLRGIRGHRFVDPLLSPGEVDLSSWVSFKQLRWSM 339 Query: 286 ILY-----KLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKD--ILLDSVKRLVSTSADK 338 L + +Q +FL+ GI R ++K K +L + +RL+ DK Sbjct: 340 ERLETARRHLKWFPVISQSEFLQWGGIDVRLAHVIKDEETKSAMKILQNYRRLM----DK 395 Query: 339 KSMGELFKILVVSHEKVE-LMPF 360 MGE +K+ + + PF Sbjct: 396 NEMGESYKVFALQTRNFPNVSPF 418 >gi|114576980|ref|XP_001167082.1| PREDICTED: hypothetical protein isoform 1 [Pan troglodytes] Length = 362 Score = 209 bits (533), Expect = 4e-52, Method: Composition-based stats. Identities = 114/338 (33%), Positives = 166/338 (49%), Gaps = 33/338 (9%) Query: 43 FGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICK 102 G GDF+T+PEISQIFGE+L I+ I W G + +LVELGPGRG ++ DILRV + Sbjct: 2 LGEKGDFITSPEISQIFGELLGIWFISEWMATGKSTAFQLVELGPGRGTLVGDILRVFTQ 61 Query: 103 LKPDFFSV-LSIYMVETSERLTLIQ--------------------KKQLASYGDKINWYT 141 L + +S+++VE S++L+ IQ K + G I+WY Sbjct: 62 LGSVLKNCDISVHLVEVSQKLSEIQALTLTEEKVPLERNAGSPVYMKGVTKSGIPISWYR 121 Query: 142 SLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSN 199 L DVP G++F +A+EFFD LP+ +F T G RE +DID D L F + + Sbjct: 122 DLHDVPKGYSFYLAHEFFDVLPVHKFQKTPQGWREVFVDIDPQVSDKLRFVLAPSATPAE 181 Query: 200 FLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHT 259 D E P ++ +S R+A GG A+V DYG+ ++ DT + H Sbjct: 182 AFIQHD-ETRDHVEVCPDAGVIIEELSQRIALTGGAALVADYGHDGTKT-DTFRGFCDHK 239 Query: 260 YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTA 319 L+ PG ADL++ VDF L +A K+ G Q FL+ +GI R L+ ++ Sbjct: 240 LHDVLIAPGTADLTADVDFSYLRRMA-QGKVASLGPIKQHTFLKNMGIDVRLKVLLDKSN 298 Query: 320 R---KDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 + LL L+ + K MGE F + + Sbjct: 299 EPSVRQQLLQGYDMLM----NPKKMGERFNFFALLPHQ 332 >gi|297265815|ref|XP_002799257.1| PREDICTED: protein midA homolog, mitochondrial-like [Macaca mulatta] Length = 371 Score = 209 bits (532), Expect = 5e-52, Method: Composition-based stats. Identities = 101/356 (28%), Positives = 151/356 (42%), Gaps = 64/356 (17%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P GYY + G GDF+T+PEISQIFGE+ Sbjct: 42 TPMLRHLMYKIKSTGPITVAEYMKEVLTNPAKGYYVYRDMLGKQGDFITSPEISQIFGEL 101 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV-LSIYMVETSER 121 L I+ I W G + +LVELGPGRG ++ DILRV +L + +S+++VE + + Sbjct: 102 LGIWFISEWMATGKSTAFQLVELGPGRGTLVGDILRVFTQLGSVLKNCDISVHLVEKTPQ 161 Query: 122 LTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 G RE IDI Sbjct: 162 ---------------------------------------------------GWREVFIDI 170 Query: 182 DQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVI 239 D D L F + + D E P ++ +S R+A GG A+V Sbjct: 171 DPQVSDKLRFVLAPSATPAEAFIQHD-ETRDHVEVCPDAGVIIEELSQRIALTGGAALVA 229 Query: 240 DYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQG 299 DYG+ ++ + GH L+ PG ADL++ VDF L +A K+ G Q Sbjct: 230 DYGHDGTKTXM-FKGFCGHKLHDVLIAPGTADLTADVDFSYLRRMA-QGKVASLGPIKQH 287 Query: 300 KFLEGLGIWQRAFSLMKQTAR---KDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 FL+ +GI R L+ ++ + LL L+ + K MGE F + Sbjct: 288 TFLKNMGIDVRLKVLLDKSNEPSVRQQLLQGYDMLM----NPKKMGERFNFFALLP 339 >gi|149925911|ref|ZP_01914174.1| hypothetical protein LMED105_02645 [Limnobacter sp. MED105] gi|149825199|gb|EDM84410.1| hypothetical protein LMED105_02645 [Limnobacter sp. MED105] Length = 365 Score = 209 bits (531), Expect = 8e-52, Method: Composition-based stats. Identities = 86/351 (24%), Positives = 147/351 (41%), Gaps = 37/351 (10%) Query: 21 VDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 D++ + DP+ GYY+ FG GDF+TAPE+S +FG+ L L + Sbjct: 11 FDEFMRFALYDPQHGYYARGEQIFGRQGDFITAPELSPLFGQTLGKALREVLPR----CN 66 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYG----- 134 + E G G G + DIL+ L + +V+ S L +Q +L + Sbjct: 67 GVVYEFGAGTGQLACDILQTAGDL------ITQYNIVDVSAGLKPVQLAKLKALHGPQVE 120 Query: 135 DKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRE-RMIDIDQHDSLVFNIGD 193 K+ W L G ++ NE D+ P+++F + +E + ++ V+ D Sbjct: 121 QKVRWLGQLPTELGG--VVLGNEVLDATPVRRFKWQKDNPQEAWVQHLNGELRWVWKPAD 178 Query: 194 HEIKSNFLTC----SDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDY------GY 243 + + E + +++I++RL G A++IDY Y Sbjct: 179 PVFATTISNLQAAHGPWPEDYESEIAEQSTAWVKTITERL---NGLALMIDYGFHEALYY 235 Query: 244 LQSRVGDTLQAVKGHTYVSPL-VNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 +R TL+A HT V G+ DL++HV+F + + G + QG+FL Sbjct: 236 HPTRNKGTLRATSRHTAHDDFLVKVGEQDLTAHVNFSAIYDAMTDCGGDLEGYSHQGEFL 295 Query: 303 EGLGIWQRAFSLMKQTARKDILLDSVKRL-VSTSADKKSMGELFKILVVSH 352 GI + A KQ ++ R ++T ++ MGE FK++ S Sbjct: 296 LAHGILELAQ---KQPEFTHPTRGALMRQNLNTLLNEADMGEAFKVICWSK 343 >gi|160896698|ref|YP_001562280.1| hypothetical protein Daci_1251 [Delftia acidovorans SPH-1] gi|160362282|gb|ABX33895.1| protein of unknown function DUF185 [Delftia acidovorans SPH-1] Length = 337 Score = 208 bits (530), Expect = 9e-52, Method: Composition-based stats. Identities = 84/351 (23%), Positives = 148/351 (42%), Gaps = 36/351 (10%) Query: 25 FALCVADPEFGYYST-CNPFG----AVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 AL + +P GYY+ FG + DFVTAPE+S +FG++LA + A ++ Sbjct: 1 MALALYEPGLGYYANDTAKFGLMPSSGSDFVTAPEMSPVFGQLLAAQVAEALQRT---HT 57 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINW 139 + E G G G + L +L + L +V+ S L Q+ +L Y ++W Sbjct: 58 REVWEFGAGTGALALQVLDELAALG---VRPDRYTIVDLSGTLRARQQLRLVKYEGLVHW 114 Query: 140 YTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSN 199 +L + G ++ NE D++P++ V ER + + SL + +++ Sbjct: 115 ADALPERLEG--VVIGNEVLDAMPVQLLVRKAGVWHERGVVLQPDGSLGWEDRPTQLRPP 172 Query: 200 FLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDY------GYLQSRVGDTLQ 253 + + E + ++++ +R+A G A IDY + R TL Sbjct: 173 MEIEGE--HDYLTEIHLQGEAFIRTLGERMAR--GAAFFIDYGFGESEYFHPQRHMGTLV 228 Query: 254 AVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAF 312 + H PL G D+++HV+F + A + G T+Q FL G+ Sbjct: 229 CHRLHKVDDDPLAEVGLKDITAHVNFTGTAVAAQEAGFEVLGYTSQAHFLINCGLG---- 284 Query: 313 SLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSH--EKVELMPFV 361 + D L + + + + MGELFK++ ++ E + M FV Sbjct: 285 ------PKLDALAQGPRAMATKLMMEHEMGELFKVIGLAKGVEPWDAMGFV 329 >gi|312383401|gb|EFR28503.1| hypothetical protein AND_03479 [Anopheles darlingi] Length = 483 Score = 208 bits (529), Expect = 1e-51, Method: Composition-based stats. Identities = 114/395 (28%), Positives = 179/395 (45%), Gaps = 59/395 (14%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIFGEM 62 L ++ I+ G +TV Y + +P GYYST + G+ GDFVTAPEI QIFGE Sbjct: 84 PLADQLQARIRATGPITVASYMKEVLLNPSAGYYSTKDTVLGSGGDFVTAPEIGQIFGE- 142 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + + ++L+ELGPGRG +M D+LRV + + +++VE S +L Sbjct: 143 ----------KFNYDGHIQLIELGPGRGTLMQDVLRVCEQFG-FTKDRVGVHLVEMSAQL 191 Query: 123 TLIQKKQLAS-------------------YGDKINWYTSLADVPLGFTFLVANEFFDSLP 163 Q ++L + G ++ WY+ +A+VP GF ++ANEFFD+LP Sbjct: 192 QHTQAERLCNGRVERGIPSDCYVQRGTTASGIEVRWYSDVAEVPKGFAVVIANEFFDALP 251 Query: 164 IKQFVMTEHG-------IRERMIDID----QHDSLVFNIGDHEIKSNFLTCSDYFL---- 208 F +E +IDI+ F + + + + Sbjct: 252 AHVFCKETTEGSAGGASWKEVLIDINPAAANGPGFRFIQSNKATPYSVVFGKRFNEMDRL 311 Query: 209 ---GAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLV 265 E S ++ Q+++ R + GG ++IDYG+ ++ DT ++ K H PLV Sbjct: 312 LQGRNRVEVSFEMEQIAQNLAQRFSEHGGFGLIIDYGHEGDKM-DTFRSFKDHKLHDPLV 370 Query: 266 NPGQADLSSHVDFQRLSSIAIL-YKLYINGLTTQGKFLEGLGIWQRAFSLMK---QTARK 321 +PG ADL+ VDF L K G +QG FL+ + R +L+K R+ Sbjct: 371 SPGVADLTVDVDFGFLKHFLQQDDKAIALGPVSQGAFLKAMQGAARLENLLKATTDEDRR 430 Query: 322 DILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE 356 IL+ L + MGE FK+L V ++ Sbjct: 431 KILVSGYDELT----NPTKMGERFKMLSVFPAALK 461 >gi|322420164|ref|YP_004199387.1| hypothetical protein GM18_2661 [Geobacter sp. M18] gi|320126551|gb|ADW14111.1| protein of unknown function DUF185 [Geobacter sp. M18] Length = 386 Score = 208 bits (528), Expect = 2e-51, Method: Composition-based stats. Identities = 88/367 (23%), Positives = 155/367 (42%), Gaps = 22/367 (5%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGE 61 KL I+ I+ G +T + + +P+ GYY++ GA GDF T+ + FG Sbjct: 8 TKLAEIILKRIRSRGDITFASFMESALYEPDLGYYTSPGRKVGAEGDFYTSMNVHSAFGR 67 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 +++ + WE P+ + E G G G + DIL I + P+ + L+ ++E Sbjct: 68 LISREIGRFWELLDSPASFTIAEAGAGGGQLAQDILDAIAQENPNLYGTLTYRLIEKEPT 127 Query: 122 LTLIQKKQLASYGDKINW--YTSLADVPLGFT-FLVANEFFDSLPIKQFVMTEHGIRERM 178 L Q +L + +++ W LAD L FT +++NE FD++P+ MT+ G++E Sbjct: 128 LQQAQAARLERHAERLAWSSPQELADAELSFTGCIISNELFDAMPVHLVEMTDEGLKEVF 187 Query: 179 IDIDQHD--SLVFNIGDHEIKSNFLTCS-DYFLGAIFENSPCRDREMQSISDRLACDGGT 235 + D + E+ + G E + + S + L G Sbjct: 188 VSADANGFRERFLPPSSPELAAYLQKFEVRLMPGQRAEINLAAPAWIASAARALER--GF 245 Query: 236 AIVIDYG------YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSIAILY 288 + +DYG Y R TL HT P G+ D+++H++F L Sbjct: 246 VLTVDYGYLTEELYTPQRRNGTLLCYHKHTTNEDPYQLVGEQDITTHINFSALIEAGNEQ 305 Query: 289 KLYINGLTTQGKFLEGLGIWQ---RAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELF 345 L Q +FL G+G+ + R + K ++K+L+ + MG+ F Sbjct: 306 GLEKVWYGEQYRFLLGVGLLEELMRMEAQAKDEQESLKHRLAIKKLMLP---EGGMGDTF 362 Query: 346 KILVVSH 352 K+L+ + Sbjct: 363 KVLIQAK 369 >gi|190575866|ref|YP_001973711.1| hypothetical protein Smlt4029 [Stenotrophomonas maltophilia K279a] gi|190013788|emb|CAQ47424.1| conserved hypothetical protein [Stenotrophomonas maltophilia K279a] Length = 394 Score = 207 bits (527), Expect = 2e-51, Method: Composition-based stats. Identities = 83/371 (22%), Positives = 152/371 (40%), Gaps = 29/371 (7%) Query: 2 ENKLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIF 59 ++L + I + G M ++ LC+ P +GYYS FG GDF TAPE+ +F Sbjct: 16 SDQLAAALRAEILAQGGAMPFSRFMELCLYAPGWGYYSAGASKFGGSGDFTTAPELGSLF 75 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLK--PDFFSVLSIYMVE 117 +A L + Q G + R++ELG G G +L + +L P +++L Sbjct: 76 AGSVANALAPVFAQLG--AQARMLELGGGTGAFAEAVLLRLAELDALPSRYAILEPSADL 133 Query: 118 TSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 + +Q+ A +++W + + ANE D+LP +F++ + + E Sbjct: 134 RERQQQRLQQNLPAELAARVDWVDRPFEEDWE-GVVFANEVIDALPTPRFLIRDGEVYEE 192 Query: 178 MIDIDQHDSLVFNIGDHEI-------KSNFLTCSDYFLGAIFENSPCRDREMQSISDRLA 230 +++D + + +I + G E P +Q+++ + Sbjct: 193 TVELDAEGNFIRGAQPADILLNGAVRHIERYLEKPFAEGYRSEVLPQLPYWLQAVAGGMQ 252 Query: 231 CDGGTAIVIDYGYL------QSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSS 283 G + +DYGY R T++ + H + PG D+++ VDF ++ Sbjct: 253 R--GAMLFVDYGYSRSEFYQHDRDDGTVRAFYRHHVHNDVHRWPGLQDITASVDFTAMAE 310 Query: 284 IAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR--KDILLDSVKRLVSTSADKKSM 341 + + G +Q FL G G+ Q ++T + L D VK+L M Sbjct: 311 AGMHGGFELAGYCSQASFLLGNGLDQVLLLAEERTDEVGRIQLRDQVKKLTL----PTEM 366 Query: 342 GELFKILVVSH 352 GE F+ + + Sbjct: 367 GERFQAIGLQR 377 >gi|254523404|ref|ZP_05135459.1| hypothetical protein SSKA14_2537 [Stenotrophomonas sp. SKA14] gi|219720995|gb|EED39520.1| hypothetical protein SSKA14_2537 [Stenotrophomonas sp. SKA14] Length = 394 Score = 207 bits (526), Expect = 3e-51, Method: Composition-based stats. Identities = 84/371 (22%), Positives = 151/371 (40%), Gaps = 29/371 (7%) Query: 2 ENKLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIF 59 ++L + I + G M ++ LC+ P +GYYS FG GDF TAPE+ +F Sbjct: 16 SDELAAALRAEILAQGGAMPFSRFMELCLYTPGWGYYSAGASKFGGSGDFTTAPELGSLF 75 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLK--PDFFSVLSIYMVE 117 +A L + Q G + R++ELG G G +L + +L P +++L Sbjct: 76 AGSVANALAPVFAQLG--AQARMLELGGGTGAFAEAVLLRLAELDALPSRYAILEPSADL 133 Query: 118 TSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 + +Q+ A +++W + + ANE D+LP +F++ + + E Sbjct: 134 RERQQQRLQQNLPAELAARVDWIDRPFEEDWE-GVVFANEVIDALPTPRFLIRDGEVYEE 192 Query: 178 MIDIDQHDSLVFNIGDHEI-------KSNFLTCSDYFLGAIFENSPCRDREMQSISDRLA 230 +++D + +I + G E P +Q+++ L Sbjct: 193 TVELDGDGHFIRGAQPADILLNGAVRHIERYLEKPFAEGYRSEVLPQLPYWLQAVAGGLQ 252 Query: 231 CDGGTAIVIDYGYL------QSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSS 283 G + +DYGY R T++ + H + PG D+++ VDF ++ Sbjct: 253 R--GAMLFVDYGYNRGEFYQHDRDDGTVRAFYRHHVHNDVHRWPGLQDITASVDFTAMAE 310 Query: 284 IAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR--KDILLDSVKRLVSTSADKKSM 341 + + G +Q FL G G+ Q ++T + L D VK+L M Sbjct: 311 AGLHGGFELAGYCSQASFLLGNGLDQVLLLAEERTDEVGRIQLRDQVKKLTL----PTEM 366 Query: 342 GELFKILVVSH 352 GE F+ + + Sbjct: 367 GERFQAIGLQR 377 >gi|167465572|ref|ZP_02330661.1| hypothetical protein Plarl_23936 [Paenibacillus larvae subsp. larvae BRL-230010] gi|322383699|ref|ZP_08057450.1| hypothetical protein PL1_1619 [Paenibacillus larvae subsp. larvae B-3650] gi|321151911|gb|EFX44854.1| hypothetical protein PL1_1619 [Paenibacillus larvae subsp. larvae B-3650] Length = 376 Score = 206 bits (525), Expect = 3e-51, Method: Composition-based stats. Identities = 78/369 (21%), Positives = 145/369 (39%), Gaps = 21/369 (5%) Query: 2 ENKLIRKIVNLIKKN--GQMTVDQYFALCVADPEFGYY-STCNPFGAVGDFVTAPEISQI 58 + L + ++I+K + + Y +LC+ +GYY ++ G GDF T+ + Sbjct: 10 TSPLTGILRDMIEKTVERAIRFETYMSLCLYHETYGYYRTSTKKIGREGDFYTSSYVETS 69 Query: 59 FGEMLAIFLICAWEQHGFPS--CVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMV 116 GE LA +++ W + S + +VE G G G + L + + + + M+ Sbjct: 70 MGECLAAYMLSYWSECSGSSAGPLHVVEWGGGEGKLAAHTLDALMRADKQIYHRVRYTMI 129 Query: 117 ETSERLTLIQKKQLASYGDKINWYTS---LADVPLGFTFLVANEFFDSLPIKQFVMTEHG 173 ETS IQK LA++ +++ + T LA P + ++ANE D+ P+ + + Sbjct: 130 ETSGYHRSIQKSMLAAHENRLRFMTEEEWLAAPPHPGSIVLANELLDAFPVYRLRCDQGT 189 Query: 174 IRERMIDIDQH----DSLVFNIGDHEIKSNFLTCS-DYFLGAIFENSPCRDREMQSISDR 228 ++E + D+ + D + + G + E + ++ I+ Sbjct: 190 LQEGWVTWDRQAQAFAERWMPLCDPRLHAYLQQAGVKLAEGQVAEVNLAGPAWLERIAGA 249 Query: 229 LACDGGTAI----VIDYGYLQSRVGDTLQAVKGHTYVSPLV-NPGQADLSSHVDFQRLSS 283 L I + Y R+ +L + H PG+ D++SHVDF Sbjct: 250 LLRGRMVLIDYGDTAEELYAPHRMRGSLMCYRRHQAHDNFFLFPGEQDITSHVDFSACLR 309 Query: 284 IAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGE 343 L TQ +F+ GI R + + + R + MGE Sbjct: 310 TLEEAGYRT-RLMTQREFMVEHGILDRLQNHTYRDPFSP--VARQNRAIRQLLLSNGMGE 366 Query: 344 LFKILVVSH 352 LFK++ + Sbjct: 367 LFKVITAAK 375 >gi|262276847|ref|ZP_06054640.1| putative cyclopropane-fatty-acyl-phospholipid synthase [alpha proteobacterium HIMB114] gi|262223950|gb|EEY74409.1| putative cyclopropane-fatty-acyl-phospholipid synthase [alpha proteobacterium HIMB114] Length = 346 Score = 206 bits (525), Expect = 4e-51, Method: Composition-based stats. Identities = 92/340 (27%), Positives = 150/340 (44%), Gaps = 12/340 (3%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 +D Y C+ + YY FG GDF+T+P IS IFGE++A+++ + + Sbjct: 14 PLDDYINTCLYKYKSSYYEKKKIFGPRGDFITSPYISSIFGEIIAVYITNYFLEKKL-YS 72 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINW 139 ++E+G G G+M DI+ I + S +++E SE L K++ K+NW Sbjct: 73 FSILEIGAGEGVMARDIINTINTINKFKNINFSYFILEKSENLKK--KQKKNLSKLKVNW 130 Query: 140 YTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSN 199 SL D+ F+V+NE D+ PIK ++ E+ + D + + + N Sbjct: 131 IKSLDDLECKNLFIVSNELLDAFPIKHLKKIKNDWYEKYVYYDNKKNKIISEYAKLKNKN 190 Query: 200 FLTCSD-YFLGAIFENSPCRDREMQSISDRLACD-GGTAIVIDYGYLQSRVGDTLQAVKG 257 E SP + I L + + DYGY + +TLQ +K Sbjct: 191 QKIFKLVSKNNDFIEFSPLVIEFLNQIIKVLKKNKNNCFLTFDYGYQGNNFKNTLQGLKN 250 Query: 258 HTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFS---L 314 H VS +PG D++ ++F + I + N + +Q +FL GI +R + Sbjct: 251 HKKVSIFEDPGNVDITYLINFNLIKKIFNNKSNFNNIIMSQSEFLTRAGIIERLRQATNI 310 Query: 315 MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 + K L SV RL+ K MG LFK L+V++ Sbjct: 311 LTSEKDKLKLEMSVDRLIH----PKKMGSLFKCLIVTNAN 346 >gi|78212981|ref|YP_381760.1| hypothetical protein Syncc9605_1451 [Synechococcus sp. CC9605] gi|78197440|gb|ABB35205.1| conserved hypothetical protein [Synechococcus sp. CC9605] Length = 410 Score = 206 bits (524), Expect = 5e-51, Method: Composition-based stats. Identities = 72/367 (19%), Positives = 139/367 (37%), Gaps = 27/367 (7%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYY-STCNPFGAVGDFVTAPEISQIFGEML 63 L + G + ++ L + +PE GYY S GA GDFVT+P + F +L Sbjct: 23 LATHLHQ---AGGAVPFSRFMDLALNEPEHGYYGSGRARIGAQGDFVTSPALGSDFAVLL 79 Query: 64 AIFLICAWEQHGFPSCVR---LVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 A ++ + +VE+GPG G + D++ + P+ + + + +VE + Sbjct: 80 APQILAWLTSIPRSDPDQRLSIVEIGPGEGHLARDLVAALHGADPELLARIELVLVEANP 139 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFT--FLVANEFFDSLPIKQFVMTEHGIRERM 178 + Q+ L D + SL ++ ++A+E D+LP+++ + E ++++ Sbjct: 140 GMRRRQQALLQEADDLPLRWCSLDELRRAPVQGVVIAHELLDALPVERLIWREGSLQQQW 199 Query: 179 IDIDQHDSLVFNIGDHEI------------KSNFLTCSDYFLGAIFENSPCRDREMQSIS 226 +++ L L D G E + + + Sbjct: 200 VELAPKGDLRTTHRPLPDGLHQEIRRVCGQSGIQLPPPDAEEGWTTEWNSAMLDWFAAAA 259 Query: 227 DRLACDGG----TAIVIDYGYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRL 281 + A+ + Y R TL +SPL PG+ DL++H+ + + Sbjct: 260 AAVDAGVLLVIDYALEAERYYTARRSDGTLMAVCAQQAGLSPLDQPGEQDLTAHLCIEVV 319 Query: 282 SSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSM 341 A + QG+ L LG+ QR L + ++ + + D + Sbjct: 320 DEAAQRNGWLVGDQIKQGEALLALGLAQRLHGLQQLPGQQLAEALQRREALLRLVDPAGL 379 Query: 342 GELFKIL 348 G F+ L Sbjct: 380 G-AFRWL 385 >gi|206889280|ref|YP_002248408.1| hypothetical protein THEYE_A0565 [Thermodesulfovibrio yellowstonii DSM 11347] gi|206741218|gb|ACI20275.1| conserved hypothetical protein [Thermodesulfovibrio yellowstonii DSM 11347] Length = 348 Score = 206 bits (523), Expect = 5e-51, Method: Composition-based stats. Identities = 90/361 (24%), Positives = 157/361 (43%), Gaps = 26/361 (7%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIFGEML 63 L I+ IKK+G + D++ + + PE GYY+ + G GDF TA + +FG L Sbjct: 2 LKEIIIKKIKKHGAIPFDEFMEMALYYPELGYYTKPDAKIGRQGDFFTASHLGSVFGFFL 61 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 A + +++ P + E+GPG G + DIL I ++ +VE + Sbjct: 62 ARQIEIFYQKLNCPKNFTVTEIGPGMGFLAKDILDNIDSSASIKYN-----LVEINPAFK 116 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQF-VMTEHGIRERMIDID 182 +Q+++L + DKI WY+S+ + ++ NE FD+LP++ F V I E +DI Sbjct: 117 KVQRERLKEHEDKIFWYSSIEQLESFSGLIICNEVFDALPVRIFEVNDSGQIMEVYVDIG 176 Query: 183 QHDSLVFNIGDHEIK-----SNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAI 237 + + L+ F E + ++S+S +L Sbjct: 177 EQEQLIEIFLPCRTDTLEYLQEFAPWVLKMKKYRSEVNLAMKSLIESLSKKLKKGYILIF 236 Query: 238 VIDY----GYLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSIAILYKLYI 292 Y Y R TL H +P +N G D+++HV+F + +A + L Sbjct: 237 DYGYTSEEYYHPDRNKGTLLCYYKHNINENPYINIGHQDMTAHVNFTAVEKLATMAGLKF 296 Query: 293 NGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 G +QG +L L + ++ +K L + KRLV + MGE +++++S Sbjct: 297 EGYFSQGSYLISL----CDEKIFQKIYQK-NLKEHFKRLVL----PQGMGESHRVMILSK 347 Query: 353 E 353 + Sbjct: 348 D 348 >gi|318041547|ref|ZP_07973503.1| hypothetical protein SCB01_07540 [Synechococcus sp. CB0101] Length = 399 Score = 206 bits (523), Expect = 6e-51, Method: Composition-based stats. Identities = 80/371 (21%), Positives = 142/371 (38%), Gaps = 30/371 (8%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIFGEML 63 L ++ G + Y + DPE G Y + G GDF T+P + F +L Sbjct: 15 LAERLRE---AGGAVPFRTYMQWALHDPEHGAYGSGRLQVGPRGDFATSPSLGPDFAALL 71 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 A + EQ + LVE GPG G + L + + + P+ + ++ ++E + + Sbjct: 72 APQIAQWLEQQPADQPLALVEAGPGEGDLALQLAQELAAGWPELAARTALVLIEPNAGMA 131 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFT--FLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 Q+ +L + S++++ L+A+E D+L +++ V R + + + Sbjct: 132 ERQRARLRE-CPLPCRWKSISELAAQPVRGVLLAHEVLDALAVERIVWDGTLWRRQQLAL 190 Query: 182 DQ-------------HDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDR 228 + + E S G E P + +Q+ + Sbjct: 191 HEAPDAQPSLRLEPGEPLEPQELAQLETLGLLQPGSQRPPGWCTELHPEQAPWLQAAAAA 250 Query: 229 LACDGGTAIVIDY------GYLQSRVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRL 281 L G +VIDY Y R TL A + PL PG DL++H+ + L Sbjct: 251 LG--SGVLLVIDYAHEAWRYYAPQRSNGTLMAYRQQQASPDPLQEPGHWDLTAHLCLETL 308 Query: 282 SSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSM 341 A+ G +QG+ L LG+ QR L Q+ L + + + D ++ Sbjct: 309 EQAALATGWQPLGQRSQGEALLALGLAQRLHGLQHQSGAGLDALLARREALLRLVDPHTL 368 Query: 342 GELFKILVVSH 352 G+ F+ S Sbjct: 369 GD-FRWAAFSR 378 >gi|254467893|ref|ZP_05081299.1| conserved hypothetical protein [beta proteobacterium KB13] gi|207086703|gb|EDZ63986.1| conserved hypothetical protein [beta proteobacterium KB13] Length = 363 Score = 205 bits (521), Expect = 9e-51, Method: Composition-based stats. Identities = 100/380 (26%), Positives = 161/380 (42%), Gaps = 42/380 (11%) Query: 1 MENKLIRKIVNLIKK--NGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQ 57 M++ L KI I+ +G ++ ++ +C+ DP++GYYS+ N FG GDF TAP + Q Sbjct: 1 MKSALKNKICETIQNEMDGSISFSKFMDMCLYDPDYGYYSSQFNQFGEHGDFYTAPMLGQ 60 Query: 58 IFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVE 117 F L + EQ ++E+G G + +D VI L + + Y++E Sbjct: 61 TFAITLTKQI----EQCFSEVRGNILEIGAGNAQLAVD---VILNLYERNIYLDNYYILE 113 Query: 118 TSERLTLIQKKQLA-----SYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEH 172 S L Q + L KI W D G ++ANE FD++P F E+ Sbjct: 114 KSHELKQYQLQALQDKLPIELFKKITWVEDFIDQFNG--VIIANELFDAIPTDVFSSFEN 171 Query: 173 GIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAI-FENSPCRDREMQSISDRLAC 231 I E+ + + D+ F+ E K F G FE SP + + Sbjct: 172 EIMEKKVRV---DNQNFSWALSENKKQFDYQLSLGDGTFDFEYSPGYQKIFTKFAR---A 225 Query: 232 DGGTAIVIDYGY------LQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIA 285 + + DYG SR T++ K + N G+ D++ HV+F L+ + Sbjct: 226 EQMVCFIFDYGMDERQLFNSSRPHGTVRGFKKNLLTEIFENIGEQDITYHVNFTHLAFLT 285 Query: 286 ILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRL--VSTSADKKSMGE 343 + L I G T Q FL LG+ + + + D +K L ++ MGE Sbjct: 286 QKFNLNILGYTHQSHFLNNLGL---------EIEDTNEMKDHLKLLSDINLLTSPAEMGE 336 Query: 344 LFKILVVSHE-KVELMPFVN 362 L K++ +S + L F+N Sbjct: 337 LIKVMAISKSCQASLNGFIN 356 >gi|194367201|ref|YP_002029811.1| hypothetical protein Smal_3429 [Stenotrophomonas maltophilia R551-3] gi|194350005|gb|ACF53128.1| protein of unknown function DUF185 [Stenotrophomonas maltophilia R551-3] Length = 394 Score = 205 bits (521), Expect = 9e-51, Method: Composition-based stats. Identities = 84/379 (22%), Positives = 155/379 (40%), Gaps = 29/379 (7%) Query: 2 ENKLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIF 59 ++L + I + G M ++ LC+ P +GYYS FG GDF TAPE+ +F Sbjct: 16 SDQLAAALRAEILAQGGAMPFSRFMELCLYAPGWGYYSAGASKFGGSGDFTTAPELGSLF 75 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLK--PDFFSVLSIYMVE 117 +A L + Q G + R++ELG G G +L + +L P +++L Sbjct: 76 AGSVANALAPVFAQLG--AQARMLELGGGTGAFAEAVLLRLAELDALPARYAILEPSADL 133 Query: 118 TSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 + +Q+ A +++W + + ANE D+LP +F++ + + E Sbjct: 134 RERQQQRLQQNLPAELAARVDWVDRPFEEDWE-GVVFANEVIDALPTPRFLIRDGEVYEE 192 Query: 178 MIDIDQHDSLVFNIGDHEI-------KSNFLTCSDYFLGAIFENSPCRDREMQSISDRLA 230 +++D + + ++ + G E P +Q+++ L Sbjct: 193 TVELDGDGNFIRGAQPADVLLNGAVRHLERYLEKPFAEGYRSEVLPQLPYWLQAVAGGLQ 252 Query: 231 CDGGTAIVIDYGYLQ------SRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSS 283 G + +DYGY + R T++ + H + PG D+++ VDF ++ Sbjct: 253 R--GAMLFVDYGYNRGEFYMEDRDDGTVRAFYRQHVHNEIYRWPGLQDITASVDFTAMAE 310 Query: 284 IAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR--KDILLDSVKRLVSTSADKKSM 341 + + G +Q FL G G+ Q ++T + L D VK+L M Sbjct: 311 AGMHAGFELAGYCSQASFLLGNGLDQVLLLAEERTDEVGRIQLRDQVKKLTL----PTEM 366 Query: 342 GELFKILVVSHEKVELMPF 360 GE F+ + + + F Sbjct: 367 GERFQAIGLQRDVDFEPAF 385 >gi|206601564|gb|EDZ38047.1| Conserved protein of unknown function [Leptospirillum sp. Group II '5-way CG'] Length = 359 Score = 204 bits (520), Expect = 1e-50, Method: Composition-based stats. Identities = 88/351 (25%), Positives = 141/351 (40%), Gaps = 27/351 (7%) Query: 19 MTVDQYFALCVADPEFGYYSTCNPFG-AVGDFVTAPEISQIFGEMLAIFLICAWEQHGFP 77 MT Y A ++DP GYY+ G + GDF TAPE+S F +L+ ++ G P Sbjct: 1 MTFRDYMARALSDPTGGYYTRNARIGFSRGDFYTAPELSPAFALLLSRQIVEIDAVLGHP 60 Query: 78 SCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLA----SY 133 L+E GPG G +M D+L + P + + E S L QK++L+ + Sbjct: 61 EQFYLMETGPGNGTLMRDLLVSLRLSAPQLARRVRPILYEISPVLVKKQKEKLSSIPLDH 120 Query: 134 GDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGD 193 + L+ ++ NEF D+LP+ + E I+ D + G+ Sbjct: 121 PPEWIRPGELSGRDPIDGVILGNEFLDALPVHRLRRKGDSFSEIYIEKDGSGKDIEVEGE 180 Query: 194 HEIKSN----FLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG------Y 243 S T DY G +E ++ + L G + IDYG + Sbjct: 181 LSTPSLTEGVHSTAWDYPEGFEWEVQADLCTVVEDLYRFLG--NGFMLWIDYGDTARERF 238 Query: 244 LQSRVGDTLQAVKGHTYVSP--LVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKF 301 R +L + H V +PG D++ HVDF L+ A + + + G + Q + Sbjct: 239 SPKREKGSLMGYRKHALVEDVTQADPGSIDMTVHVDFPLLARKATMLGMRLEGFSDQMHY 298 Query: 302 LEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 L LGI + K ++ T MGE FK++++S Sbjct: 299 LMNLGIEEWLGDEQFSPEEKAAMV--------TLIHPLRMGEAFKVMLLSK 341 >gi|124515260|gb|EAY56770.1| conserved protein of unknown function [Leptospirillum rubarum] Length = 359 Score = 204 bits (520), Expect = 1e-50, Method: Composition-based stats. Identities = 88/351 (25%), Positives = 140/351 (39%), Gaps = 27/351 (7%) Query: 19 MTVDQYFALCVADPEFGYYSTCNPFG-AVGDFVTAPEISQIFGEMLAIFLICAWEQHGFP 77 MT + A ++DP GYY+ G + GDF TAPE+S F +L+ ++ G P Sbjct: 1 MTFRDFMARALSDPTGGYYTRNARIGFSRGDFYTAPELSPAFALLLSRQIVEIDAVLGHP 60 Query: 78 SCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASY---- 133 + L+E GPG G +M D+L + P + + E S L QK++L+S Sbjct: 61 NEFYLMETGPGNGTLMRDLLVSLRLSAPQLALRVRPILYEISPVLVEKQKEKLSSLSFDR 120 Query: 134 GDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGD 193 + L+ ++ NEF D+LP + T E I+ D + G Sbjct: 121 PPEWIRPGELSGRDPIDGVILGNEFLDALPAHRLRRTRDSFSEIYIEEDGSGKFIEVEGK 180 Query: 194 HE----IKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYL----- 244 + T DY G +E ++ + L G + IDYG Sbjct: 181 LSSSSLTEGVHSTAWDYPEGFEWEVQADLCSILEELYHSLGK--GCMLWIDYGDTARERV 238 Query: 245 -QSRVGDTLQAVKGHTYVSP--LVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKF 301 R +L + H V +PG D++ HVDF L+ A + + + G + Q + Sbjct: 239 SPKREKGSLMGYRKHALVEDVTQADPGSVDMTVHVDFPLLARKATMLGMRLEGFSDQMHY 298 Query: 302 LEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 L LGI + K ++ T MGE FK++++S Sbjct: 299 LMNLGIEEWLGDEQFSPEEKAAMV--------TLIHPLRMGEAFKVMLLSK 341 >gi|291287017|ref|YP_003503833.1| hypothetical protein Dacet_1105 [Denitrovibrio acetiphilus DSM 12809] gi|290884177|gb|ADD67877.1| protein of unknown function DUF185 [Denitrovibrio acetiphilus DSM 12809] Length = 384 Score = 204 bits (518), Expect = 2e-50, Method: Composition-based stats. Identities = 83/363 (22%), Positives = 144/363 (39%), Gaps = 24/363 (6%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 ++N L + + LI + GQ+T ++ + + GYY NPFG G F T+ S+ FG Sbjct: 5 LDNTLEKYVEQLISEKGQITFAEFMDIALYHEGLGYYQKQNPFGQQGSFYTSVNASESFG 64 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 LA + E R E+G G G++ DIL + + +P F+ L ++E S Sbjct: 65 RTLARSFVYMTELLKLE--HRFCEMGAGSGMLANDILNFLKEREPKFYESLDYLIIEKSG 122 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 L QK+ L + ++ +NE D+ P+ + + + ++E + Sbjct: 123 YLIERQKELLDKAHTGKVKWIRFEELDDFKGVFYSNELVDAFPVHRVIRMDGELKELYVK 182 Query: 181 IDQHDSLVFN---IGDHEIKSNFLTCS-DYFLGAIFENSPCRDREMQSISDRLACDGGTA 236 +L FN + E++ T + I + + R +++++D++ G Sbjct: 183 -KIDGALRFNPGELSTPELQDFLDTINLKVTETQIVDINLDLRRFIEAMADKITK--GVM 239 Query: 237 IVIDYG------YLQSRVGDTLQAVKGHTYVSPLVN-PGQADLSSHVDFQRLSSIAILYK 289 + IDYG Y R T+ HT + + G D+++ VDF LS Sbjct: 240 LTIDYGFEAPMLYQSYRRDGTVTCYYNHTQNNDFFDRIGYQDITAFVDFTSLSLYGAEKG 299 Query: 290 LYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILV 349 L Q FL GI A D+ S+K L+ G F+ + Sbjct: 300 LEPMAYMPQWLFLVQSGILDEI-----NEAENDLSKASIKALIMP---DGGFGTNFQAFI 351 Query: 350 VSH 352 Sbjct: 352 QGK 354 >gi|322827230|gb|EFZ31501.1| hypothetical protein TCSYLVIO_2189 [Trypanosoma cruzi] Length = 427 Score = 203 bits (516), Expect = 4e-50, Method: Composition-based stats. Identities = 109/384 (28%), Positives = 180/384 (46%), Gaps = 30/384 (7%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYS-TCNPFG-AVGDFVTAPEISQI 58 ++ L ++++ + G + Q+ C+ P+ GYY+ + G DF+TA EI Sbjct: 41 LKTPLCIELISKMSSQGYFPMSQFVKECLTHPQHGYYTAKKHVIGSEKADFITAAEI-PF 99 Query: 59 FGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVET 118 F ++++ +++ AW++ G P L+ELGPGRG +M +IL+ P L I++VE Sbjct: 100 FADVISAWIMDAWQKMGTPKAFHLIELGPGRGTLMKNILKQTKYSNPHLLHFLQIHLVEV 159 Query: 119 SERLTLIQKKQLASYGD---KINWYTSLADVPLG--FTFLVANEFFDSLPIKQFVMTEHG 173 QK LA + KI W+ L +P T VANE+FD+LP+ QF TE G Sbjct: 160 GAARMEEQKSALAEFQTAQGKIKWWMDLESIPFSLEPTIFVANEYFDALPVAQFRYTERG 219 Query: 174 IRERMIDIDQHDSLVFNIGDHEIKSNFLTC--------SDYFLGAIFENSPCRDREMQSI 225 E +++D+ + + S + S LG E + + M+ I Sbjct: 220 WVETCLEVDEDPANESHFRMVHAPSGSFSAYLIPNDIRSRGKLGDCIEVNAVGMQTMELI 279 Query: 226 SDRL-ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSI 284 ++ C ++IDYG + TL+ ++GH +V PL++PG+ DLSS V F++L Sbjct: 280 MKKMVDCQKAACLIIDYG-KDEHMHSTLRGIRGHRFVDPLLSPGEVDLSSWVSFKQLRWA 338 Query: 285 AILY-----KLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKD--ILLDSVKRLVSTSAD 337 L L TQ +FL+ GI R ++K K +L + +RL+ D Sbjct: 339 LERLETARRHLKWFPLMTQREFLQWGGIDVRLAHVIKDEETKTAMKILQNYRRLM----D 394 Query: 338 KKSMGELFKIL-VVSHEKVELMPF 360 MG +K+ + + PF Sbjct: 395 VDEMGNSYKVFVAQTRSFPNVSPF 418 >gi|71412490|ref|XP_808427.1| hypothetical protein [Trypanosoma cruzi strain CL Brener] gi|70872631|gb|EAN86576.1| hypothetical protein, conserved [Trypanosoma cruzi] Length = 427 Score = 203 bits (515), Expect = 4e-50, Method: Composition-based stats. Identities = 108/384 (28%), Positives = 180/384 (46%), Gaps = 30/384 (7%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYS-TCNPFG-AVGDFVTAPEISQI 58 ++ L ++++ + G + Q+ C+ P+ GYY+ + G DF+TA EI Sbjct: 41 LKTPLCIELISKMSSQGYFPMSQFVKECLTHPQHGYYTAKKHVIGSEKADFITAAEI-PF 99 Query: 59 FGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVET 118 F ++++ +++ AW++ G P L+ELGPGRG +M +IL+ P L I++VE Sbjct: 100 FADVISAWIMDAWQKMGTPRAFHLIELGPGRGTLMKNILKQTKYSNPHLLHFLQIHLVEV 159 Query: 119 SERLTLIQKKQLASYGD---KINWYTSLADVPLG--FTFLVANEFFDSLPIKQFVMTEHG 173 QK LA + KI W+ L +P T +ANE+FD+LP+ QF TE G Sbjct: 160 GAARMEEQKSALAEFQTAQGKIKWWMDLESIPFSLEPTIFIANEYFDALPVAQFRYTERG 219 Query: 174 IRERMIDIDQHDSLVFNIGDHEIKSNFLTC--------SDYFLGAIFENSPCRDREMQSI 225 E +++D+ + + S + S LG E + + M+ I Sbjct: 220 WVETCLEVDEDPANESHFRMVHAPSGSFSAYLIPNDIRSRGKLGDCIEVNAVGMQTMELI 279 Query: 226 SDRL-ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSI 284 ++ C ++IDYG + TL+ ++GH +V PL++PG+ DLSS V F++L Sbjct: 280 MKKMVDCQKAACLIIDYG-KDEHMHSTLRGIRGHRFVDPLLSPGEVDLSSWVSFKQLRWA 338 Query: 285 AILY-----KLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKD--ILLDSVKRLVSTSAD 337 L L TQ +FL+ GI R ++K K +L + +RL+ D Sbjct: 339 LERLETARRHLKWFPLMTQREFLQWGGIDVRLAHVIKDEETKTAMKILQNYRRLM----D 394 Query: 338 KKSMGELFKIL-VVSHEKVELMPF 360 MG +K+ + + PF Sbjct: 395 VDEMGNSYKVFVAQTRSFPNVSPF 418 >gi|134297156|ref|YP_001120891.1| hypothetical protein Bcep1808_3065 [Burkholderia vietnamiensis G4] gi|134140313|gb|ABO56056.1| protein of unknown function DUF185 [Burkholderia vietnamiensis G4] Length = 351 Score = 203 bits (515), Expect = 5e-50, Method: Composition-based stats. Identities = 84/347 (24%), Positives = 143/347 (41%), Gaps = 33/347 (9%) Query: 25 FALCVADPEFGYYSTC-NPFGAV----GDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + P GYYS FG DFVTAPE+S +F + LA + A G Sbjct: 1 MERALYAPGLGYYSGGARKFGRRADDGSDFVTAPELSPLFAQTLANPVADALAASG---T 57 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK--- 136 R++E G G G + +L + L + L +V+ S L Q++ +A+ Sbjct: 58 RRVMEFGAGTGKLAAGLLAALDALGVELDEYL---IVDLSGELRERQRETIAAAAPALAG 114 Query: 137 -INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHE 195 + W +L + G +V NE D++P++ F ER + +D + VF Sbjct: 115 KVRWLDALPERFDG--VVVGNEVLDAMPVRLFAKAGDTWLERGVALDARHAFVFEDRPAG 172 Query: 196 IKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDY------GYLQSR 247 + D G + E +++ LA G +++DY Y R Sbjct: 173 AAGVPAVLATLDVGDGYVTETHEAALAFTRTVCTMLAR--GAVLLVDYGFPAHEYYHPQR 230 Query: 248 VGDT-LQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 T + + H + P + PG D+++HV+F + I + G T+Q +FL G Sbjct: 231 ERGTLMCHYRHHAHDDPFLYPGLQDITAHVEFTGIYEAGIATGADLLGYTSQARFLLNAG 290 Query: 307 IWQRAFSLMKQTARK-DILLDSVKRLVSTSADKKSMGELFKILVVSH 352 + ++ R ++V++L+S + MGELFK++ S Sbjct: 291 VTDALAAIDPSDIRAFLPAANAVQKLIS----EAEMGELFKVIGFSR 333 >gi|71020305|ref|XP_760383.1| hypothetical protein UM04236.1 [Ustilago maydis 521] gi|46100052|gb|EAK85285.1| hypothetical protein UM04236.1 [Ustilago maydis 521] Length = 1159 Score = 202 bits (514), Expect = 6e-50, Method: Composition-based stats. Identities = 116/455 (25%), Positives = 180/455 (39%), Gaps = 110/455 (24%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYST-------CNPFGAVGDFVTAPE 54 L +++ I+ +G M V Y C+ DP GYYS+ G+ GDF+T+PE Sbjct: 681 TKSLNEILLDSIRASGPMPVSTYMRTCLLDPMQGYYSSANSPSTSREVLGSRGDFITSPE 740 Query: 55 ISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIY 114 ISQ+FGE++AIF + W+ G PS R+VELGPG+G ++ D+LR KP ++ I+ Sbjct: 741 ISQVFGELVAIFYLARWQSVGAPSATRIVELGPGKGTLLADMLRTFATFKPFMATLKRIH 800 Query: 115 MVETSERLTLIQKKQLASY------------------GDKINWYTSLADVPLGF---TFL 153 +VETSE L +Q + G + W+ + VP+ T L Sbjct: 801 LVETSEGLMELQLNAIKEALGVVGKRVVSAEEDAGADGVVVEWFPGIDMVPVIPEELTIL 860 Query: 154 VANEFFDSLPIKQFVM-TEHGIRERMIDID------------------QHDSLVFNIGDH 194 A+EFFD+LP F + RE ++ I Q++ L F + Sbjct: 861 TAHEFFDALPTHIFEKGVDGKFREVLVGIKPTSSITVLKPGQDLQKQAQNEELGFVLSPT 920 Query: 195 EIKSNFL------TCSDYFLGAIFENSPCRDREMQSISDRLACDGGTA------------ 236 + G E SP + + + +A +A Sbjct: 921 PTPWAQMLVQNNPRFQHLEPGQRVEVSPEAWAVARRVGEIVAGRSASAPSSPKQEAPRSA 980 Query: 237 --------------------------------------IVIDYGYLQSRVGDTLQAVKGH 258 ++IDYG G +L+A K H Sbjct: 981 PEGSAEAKAEAALEAERLQAERRLETQRLSHATEGGIGLIIDYG-DDKAYGSSLRAFKNH 1039 Query: 259 TYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMK-- 316 V +PG DL+ +VDF L S G Q FL G+G+ R +L+K Sbjct: 1040 ALVRVFDSPGTVDLTVNVDFLHLKSAIHTTDARYLGPIDQADFLVGMGLQMRTEALVKGR 1099 Query: 317 QTARKDILLDSVKRLVSTSADKKSMGELFKILVVS 351 ++ + D+ RL+ D+ MG +K L ++ Sbjct: 1100 DAHDENRIKDAANRLI----DESGMGIQYKALAIT 1130 >gi|240276848|gb|EER40359.1| DUF185 domain-containing protein [Ajellomyces capsulatus H143] Length = 507 Score = 202 bits (513), Expect = 7e-50, Method: Composition-based stats. Identities = 111/462 (24%), Positives = 175/462 (37%), Gaps = 111/462 (24%) Query: 2 ENKLIRKIVNLI--------------------------KKNGQMTVDQYFALCVADPEFG 35 L + I I + G +++ Y C+ P+ G Sbjct: 48 STPLAKSIAEAINYNFIFYVFLMREEVVHFTLTGSENPQVTGPVSIATYIRQCLTSPDGG 107 Query: 36 YYSTCNP-------FGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPS-CVRLVELGP 87 YY++ FGA GDFVT+PEISQIFGE+L ++ + W G S V+++E GP Sbjct: 108 YYTSRGQEDEDTALFGAKGDFVTSPEISQIFGELLGVWTVTEWMGQGRKSGGVQIIEFGP 167 Query: 88 GRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGD------------ 135 G+G +M D+LR ++ ++Y+VETS L +Q+K L Sbjct: 168 GKGTLMGDMLRSFAS------AIEAVYLVETSPVLREVQRKLLCGDTPMEEVEVGYKSTS 221 Query: 136 --------KINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSL 187 L + F+ A+EFFD+LPI F + + Sbjct: 222 VHLGVPVIWTEHIKLLPNESDKTPFIFAHEFFDALPILAFQSIQTPAPSQTTINTPTGPT 281 Query: 188 VFNIGDHEIKS--NFLTCSDYFLGAIFEN----------------SPCRDREMQSISDRL 229 + + E +P + S R+ Sbjct: 282 TLHQPPISSPHTTEWRELVVSPNPETPEVKSGQEPEFRLSLAKASTPSSLVLPEMSSRRI 341 Query: 230 AC--------------------DGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQ 269 G A+++DYG + ++L+ ++ H VSPLV PG+ Sbjct: 342 GGGGGLVSATSPGVTDTLKNKVPSGAALILDYGTTSTIPINSLRGIRNHRLVSPLVAPGE 401 Query: 270 ADLSSHVDFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLMK------QTARK 321 D+S+ VDF L+ AI + + G QG FLE LGI +RA L++ ++ Sbjct: 402 VDISADVDFTALAEAAIDASPGVEVYGPMEQGPFLEALGISERAAQLLRRMEGEGDEEKR 461 Query: 322 DILLDSVKRLVSTSADKKSMGELFKILVVSHE---KVELMPF 360 ++ KRLV MG+L+K L + E K + F Sbjct: 462 KLIESGWKRLVERGG--GGMGKLYKALAIVPESGGKRRPVGF 501 >gi|33861238|ref|NP_892799.1| hypothetical protein PMM0681 [Prochlorococcus marinus subsp. pastoris str. CCMP1986] gi|33639970|emb|CAE19140.1| conserved hypothetical protein [Prochlorococcus marinus subsp. pastoris str. CCMP1986] Length = 396 Score = 202 bits (513), Expect = 8e-50, Method: Composition-based stats. Identities = 78/378 (20%), Positives = 146/378 (38%), Gaps = 32/378 (8%) Query: 8 KIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEISQIFGEMLAI- 65 I +IKK G ++ Y + + D GYY + G+ GDFVT+P +S F +L+ Sbjct: 12 LIKKIIKKGGTISFYDYMDIVLNDLNNGYYGSGKANLGSKGDFVTSPSMSDDFAFLLSKQ 71 Query: 66 ---FLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 +LI + + ++E G G G +M +L + ++E ++ + Sbjct: 72 IYEWLIQVKSKSNCDDKLSVIEFGAGDGSLMSGLLEYFFINDKKILKNVCFIIIEPNKGM 131 Query: 123 TLIQKKQLASY----GDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERM 178 Q+K+L Y D + + ++ANE D+LP+++ + + ++ + Sbjct: 132 IKKQQKKLEKYLKLGFDILWRCLEDLEDRSLNGVVLANEVLDALPVERIINLKGKMQRQG 191 Query: 179 IDIDQHDSLVF-------------NIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSI 225 + ID+ +F E + G E + + +I Sbjct: 192 VSIDKKSGRLFFEAISITKELEKSIASAQEKLDINIPPKYAPEGWTTEWHIDNKKWLMAI 251 Query: 226 SDRLACDGGTAIVIDY------GYLQSRVGDTLQAVKGHTY-VSPLVNPGQADLSSHVDF 278 ++ + G ++IDY Y TL + K + +PG DL+SHV Sbjct: 252 YAKI--NNGILLIIDYAKEAKRYYSLGNNNGTLISYKNQKIVENIFESPGDCDLTSHVCI 309 Query: 279 QRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADK 338 + L + G+ QG+ L LG+ +R F + + S + + D Sbjct: 310 ESLIYDSETLGFETIGIVKQGEALLSLGLAERLFEIQNELKDDISKALSRREALLRLVDP 369 Query: 339 KSMGELFKILVVSHEKVE 356 +G+ FK V S + Sbjct: 370 ICLGD-FKWFVFSKFNNK 386 >gi|124025448|ref|YP_001014564.1| hypothetical protein NATL1_07411 [Prochlorococcus marinus str. NATL1A] gi|123960516|gb|ABM75299.1| Uncharacterized conserved protein [Prochlorococcus marinus str. NATL1A] Length = 400 Score = 202 bits (513), Expect = 9e-50, Method: Composition-based stats. Identities = 76/378 (20%), Positives = 154/378 (40%), Gaps = 33/378 (8%) Query: 9 IVNLI-KKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIFGEMLAIF 66 +++ I G ++ +Y L + DP+ G+YST G GDF T+P +S F +LAI Sbjct: 12 LIDRIGDSGGSISFYRYMDLVLNDPDNGFYSTGKLNIGKNGDFCTSPSLSNDFARLLAIQ 71 Query: 67 LICAWEQHGFPSCVR----LVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 ++ L+E+GPG G + D++ I ++ P + + +VE + + Sbjct: 72 VVDWLLDLEKSGIDSKLLSLIEIGPGEGTLSRDLILAIAEIAPALICKIELVLVELNVGM 131 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFT--FLVANEFFDSLPIKQFVMTEHGIRERMID 180 Q+K + + ++S+ D+ L ++ANE D+ P+++ V +++ + + + Sbjct: 132 RRRQEKVVNNLEGINCRWSSIEDLILRPVNGVVIANEVLDAFPVERLVFSDNKVFRQGVG 191 Query: 181 IDQHDS---------------LVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSI 225 + + + + F + + D + E ++ Sbjct: 192 LKKINDENYLEFVDLKPTSKIIKFLKESNSLLKIEFPPKDICNRWVTEWHCDVPSWFGNL 251 Query: 226 SDRLACDGGTAIVIDY------GYLQSRVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDF 278 S L G +V+DY Y R TL + + H + L + G DL++H+ Sbjct: 252 SKVLI--DGALLVVDYAMESKRYYNAMRQEGTLISYRNHVANPNVLKDAGLCDLTAHLCI 309 Query: 279 QRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADK 338 + + A+ G T QG+ L LG+ +SL + + + + D Sbjct: 310 ESTINYALFNGWKFMGETRQGQALLALGLSNFLYSLQNNSNNDLSAALNRRESLLRLVDP 369 Query: 339 KSMGELFKILVVSHEKVE 356 +G+ F+ L + + Sbjct: 370 IGLGD-FRWLAFQKDNSD 386 >gi|312110234|ref|YP_003988550.1| hypothetical protein GY4MC1_1127 [Geobacillus sp. Y4.1MC1] gi|311215335|gb|ADP73939.1| protein of unknown function DUF185 [Geobacillus sp. Y4.1MC1] Length = 377 Score = 201 bits (512), Expect = 1e-49, Method: Composition-based stats. Identities = 77/380 (20%), Positives = 136/380 (35%), Gaps = 35/380 (9%) Query: 9 IVNLIKKN--GQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAI 65 I I+ + +++ Y L + D +GYY G GDF T+ +S +FG++ A Sbjct: 4 IYQAIQASAQRRLSYADYMELALYDERYGYYMGEKAKIGKGGDFFTSSHVSHVFGKLFAS 63 Query: 66 FLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLI 125 + E++ P + ELG G G +L K P + L+ ++ETS + Sbjct: 64 LFLRLVERNHVPP--HICELGGGDGKFARAVLNEWKKKSPATYKQLTYTVIETSPKQRER 121 Query: 126 QKKQLASYGDKINWYTSLADVPLG----FTFLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 Q + L +K+ Y + + + +NEFFD+ P+ + E + + Sbjct: 122 QLQTLGDASEKVKQYKDIQEFRQHAASFSGIVFSNEFFDAFPVHVITKENGMLYELFVAV 181 Query: 182 DQHDSLVFNIGDHEIKSNFLTCSD----YFLGAIFENSPCRDREMQSISDRLACDGGTAI 237 D + LV E + + G E + + + Sbjct: 182 DGN-KLVEEKHPLENERIVEYLRERQLSLTDGQRLEVPLALKTFLLETARFFR--HCVML 238 Query: 238 VIDYGYLQS------RVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSIAILYKL 290 IDYGY R +L+ H + +PL PG+ D+++H+ + L Sbjct: 239 TIDYGYTDEELQLPARRQGSLRGYYRHRLIANPLSYPGEMDITAHIQWDALRMFGEQAGW 298 Query: 291 YINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVV 350 L Q +FL GI Q + R V + + M F +++ Sbjct: 299 QCVSLLRQDRFLLAAGILQYLEEHDGANPFSEKAQQ--NRAVRSLIIDEGMSAAFHVMIQ 356 Query: 351 SH----------EKVELMPF 360 + E +PF Sbjct: 357 QKGVDVDWEHIWAQREFLPF 376 >gi|116070487|ref|ZP_01467756.1| hypothetical protein BL107_12615 [Synechococcus sp. BL107] gi|116065892|gb|EAU71649.1| hypothetical protein BL107_12615 [Synechococcus sp. BL107] Length = 379 Score = 201 bits (512), Expect = 1e-49, Method: Composition-based stats. Identities = 75/365 (20%), Positives = 140/365 (38%), Gaps = 22/365 (6%) Query: 14 KKNGQMTVDQYFALCVADPEFGYY-STCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWE 72 + G + +Y L + DP G+Y S GDFVT+P + F +LA ++ Sbjct: 5 QLGGVTSFRRYMDLALNDPNDGFYGSGRARVSRDGDFVTSPALGSDFAGLLASQVVRWLA 64 Query: 73 QHGFP-SCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLA 131 + + L+E+GPG G ++ D++ I P L + +VE + + Q+++L Sbjct: 65 ELPADLPTLSLIEIGPGEGDLLADLVDAIADQSPQMLHRLELVLVEANPGMKQRQQERLQ 124 Query: 132 SYGDKINWYTSLADVPLGF--TFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVF 189 + L ++ ++A+E D+LP+++ E + ++++++D +LVF Sbjct: 125 HQTKFPMRWCGLDELVAAPLRGVVLAHELLDALPVERLTYDEGVMWQQLVELDDDGALVF 184 Query: 190 NIGDHEIKSNFL------------TCSDYFLGAIFENSPCRDREMQSISDRLACDGG--- 234 + G + D G E + L Sbjct: 185 SKGPLPTQLADEIERVCKRCGLDLPPPDADPGWTTEWHSDSLGWFTQLGQVLDQGVLLVV 244 Query: 235 -TAIVIDYGYLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSIAILYKLYI 292 A+ + Y R TL AV+ SPL PG DL++H+ + + A+ Sbjct: 245 DYALEMHRYYSARRSDGTLMAVQAQRAGLSPLHKPGSQDLTAHICIETVEDAAVQAGWSC 304 Query: 293 NGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 G QG+ L LG+ +R + L + + D +G+ F+ L+ Sbjct: 305 LGQLRQGEALLALGLAERLYGLQSLPPGDLPQALQRREAMLRLVDPSGLGD-FRWLLFGK 363 Query: 353 EKVEL 357 Sbjct: 364 RVNPA 368 >gi|239827713|ref|YP_002950337.1| hypothetical protein GWCH70_2374 [Geobacillus sp. WCH70] gi|239808006|gb|ACS25071.1| protein of unknown function DUF185 [Geobacillus sp. WCH70] Length = 380 Score = 201 bits (512), Expect = 1e-49, Method: Composition-based stats. Identities = 74/361 (20%), Positives = 124/361 (34%), Gaps = 23/361 (6%) Query: 9 IVNLIKKN--GQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIFGEMLAI 65 I IK + +++ Y L + D GYY G GDF T +S +FG++ A Sbjct: 7 IYEAIKTSELRRLSYADYMQLALYDERCGYYMRKKTKIGKEGDFFTNSHVSDVFGKLFAS 66 Query: 66 FLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLI 125 F + E P + ELG G G +L P+ + L+ M+ETS + Sbjct: 67 FFLRLVECQNVPP--HICELGGGDGRFARAVLNEWKAKSPNTYRQLTYTMIETSPAHRVK 124 Query: 126 QKKQLASYGDKINWYTSLADVPLG----FTFLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 Q + L +K+ Y + + + +NEFFD+ P+ + E + + Sbjct: 125 QLETLGDVAEKVKQYKDIKEFQQHVSSFSGIVFSNEFFDAFPVHVITKENGTVYELFVTV 184 Query: 182 DQHDSLVFNI---GDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 D H + +H I+ G E + + Sbjct: 185 DDHKLVEEKYPLENEHIIQYLHERQLSLADGQRLEVPLALKTFLLETAPFFR--HCVMFT 242 Query: 239 IDYGYLQ-------SRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLY 291 IDYGY R G + H +PL+ PG+ DL++H+ + L Sbjct: 243 IDYGYTDEELQLPARRQGSLRGYYRHHLITNPLMYPGEMDLTAHIQWDALRMYGKQAGWQ 302 Query: 292 INGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVS 351 + Q +FL GI + R + + M F +++ Sbjct: 303 YVSMLRQDRFLLAAGILLYLEEH--NDSNPFSEKSRQNRAIRSLIMDGGMSTDFHVMIQQ 360 Query: 352 H 352 Sbjct: 361 K 361 >gi|254430282|ref|ZP_05043985.1| conserved hypothetical protein [Cyanobium sp. PCC 7001] gi|197624735|gb|EDY37294.1| conserved hypothetical protein [Cyanobium sp. PCC 7001] Length = 402 Score = 201 bits (511), Expect = 2e-49, Method: Composition-based stats. Identities = 83/384 (21%), Positives = 147/384 (38%), Gaps = 35/384 (9%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIFGEML 63 L++++ G + +Y + DP G Y G GDFVT+P + F +L Sbjct: 12 LVQRLRER---GGAVPFQRYMDWALHDPHHGAYGAGRLRIGRDGDFVTSPSLGPDFAALL 68 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 A L G + LVE GPG G + L + + + P+ ++++VE + + Sbjct: 69 APQLADWLAALGT-GALALVEAGPGEGSLALALAEALQRHWPELAGRTTLWLVEPNTGMA 127 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFT--FLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 Q+++L +TS ++ ++A+E D+L +++ V R + + + Sbjct: 128 ERQRRRLQGS-PLPCRWTSWQELAAAPVRGVVLAHEVLDALAVERIVWDGALWRRQQVRL 186 Query: 182 DQHD--SLVFNIGDHE-----------IKSNFLTCSDYFLGAIFENSPCRDREMQSISDR 228 + + E G E P +Q+ + Sbjct: 187 VEAGPTGAWLRLEPGEPLEPEALAQLAALGLEPPGDQRPPGWCSELHPGSGPWLQACAAA 246 Query: 229 LACDGGTAIVIDY------GYLQSRVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRL 281 LA G +VIDY Y R TL A +G PL+ PG DL++H+ L Sbjct: 247 LA--DGPLLVIDYALEAWRYYAPQRSAGTLLAYRGQRASPDPLLEPGAWDLTAHLCIDTL 304 Query: 282 SSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTA--RKDILLDSVKRL--VSTSAD 337 A + G QG+ L LG+ +R L + L + ++R + D Sbjct: 305 QHQARAAGWQVLGQCRQGEALLALGLAERLHGLQQAPEAGEASGLAELLERREGLLRLVD 364 Query: 338 KKSMGELFKILVVSHEKVELMPFV 361 ++G+ F+ L S + F+ Sbjct: 365 PHTLGD-FRWLAFSRGAMATPRFL 387 >gi|295696499|ref|YP_003589737.1| protein of unknown function DUF185 [Bacillus tusciae DSM 2912] gi|295412101|gb|ADG06593.1| protein of unknown function DUF185 [Bacillus tusciae DSM 2912] Length = 375 Score = 201 bits (511), Expect = 2e-49, Method: Composition-based stats. Identities = 80/361 (22%), Positives = 142/361 (39%), Gaps = 23/361 (6%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEM 62 L ++ I++ G +T + + DP GYY + FG GDF TAP++ ++G+ Sbjct: 20 PLKERLAERIRQKGPVTAFTFMEWALYDPAGGYYMREHDVFGRAGDFYTAPDVHPVYGKT 79 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 +A + ++G S VR+VE G G G + I L +VE S Sbjct: 80 IAAWAAQRARRYG-WSDVRIVEFGAGTGRLAEQIFAAW---PEVGIGSLRYSIVEISPAW 135 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 Q ++L + G ++W + + G ++A+E D++P G+ E +D+ Sbjct: 136 REHQARRLKNCGSAVDWPEKMPRLDRG--IVIAHELLDAMPAHLLRRGPEGLEEAWVDLG 193 Query: 183 QHDSL--VFNIGDHEIKSNFLTCSDYFLGAI-FENSPCRDREMQSISDRLACDGGTAIVI 239 + E + F FE +P ++ + +L G +++ Sbjct: 194 PDGRWLRKYGPASPEGRRAFEEWRPRVAPECGFEVAPGATAWIRRVLKQLRE--GLVLIV 251 Query: 240 DYG------YLQSRVGDTLQAVKGHTYVSPLV-NPGQADLSSHVDFQRLSSIAILYKLYI 292 DYG Y R TL+A H + PG+ D+++ V+F L +A + Sbjct: 252 DYGDDEDRLYGPHRPHGTLRAFFQHRCLDRWWEEPGERDITADVNFTILRRVAQAAGGEV 311 Query: 293 NGLTTQGKFLEGLGIWQRAFSLMK-QTARKDILLDSVKRLVSTSADKKSMGELFKILVVS 351 + G+FL GI A L + R + +GE F++L + Sbjct: 312 VFEGSLGEFLWDAGI---ARELSDCPADQPFDPRARRNRAIKQLLIPGGLGEAFRVLEIR 368 Query: 352 H 352 Sbjct: 369 K 369 >gi|229086830|ref|ZP_04218992.1| hypothetical protein bcere0022_34070 [Bacillus cereus Rock3-44] gi|228696474|gb|EEL49297.1| hypothetical protein bcere0022_34070 [Bacillus cereus Rock3-44] Length = 370 Score = 201 bits (510), Expect = 2e-49, Method: Composition-based stats. Identities = 79/346 (22%), Positives = 131/346 (37%), Gaps = 17/346 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + D + GYY G GDF T+ IS +F + A F I E Sbjct: 18 SISYSTYMNLTLYDEDCGYYMKEREKIGRNGDFFTSSNISSVFAKTFARFFIRIVENGEI 77 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 P + E+G G G +L+ + P+ F+ L +VE S +QK+QL + + Sbjct: 78 PP--NICEIGGGTGRFAYAVLQEWQQSSPETFAKLQYSIVEVSPFHRRLQKRQLDFFQNV 135 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + +NE FD+ P+K ++ + E I L + E Sbjct: 136 SQYKCYEELGESFTGLVFSNELFDAFPVKVVEKRDNYLYEVRITYTDEGKLGEVVRPLEE 195 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------R 247 + + G FE + +Q I+ L G I +DYGY + Sbjct: 196 RIKSYLQRHRIELSEGQRFEVPLAMEIYVQEIASWLKE--GLFITVDYGYTKEEWMHPAH 253 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL PG+ D+++H+ + L I LY T Q +FL G Sbjct: 254 REGSLRGYYNHRLIRNPLKYPGEMDITAHIHWDELKEIGEENALYTVWHTKQREFLLAAG 313 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 I ++ S R + + M + F ++V Sbjct: 314 ILEQLASHQDSDPFSAKQKQ--NRAIRSMILHGGMSDAFDVVVQKK 357 >gi|189218955|ref|YP_001939596.1| hypothetical protein Minf_0943 [Methylacidiphilum infernorum V4] gi|189185813|gb|ACD82998.1| Uncharacterized conserved protein [Methylacidiphilum infernorum V4] Length = 392 Score = 200 bits (509), Expect = 2e-49, Method: Composition-based stats. Identities = 86/376 (22%), Positives = 144/376 (38%), Gaps = 22/376 (5%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYS--TCNPFGAVGDFVTAPEISQI 58 +E+ L++++ LI + G + D Y ++ P+FGYYS T G GDF T+ + + Sbjct: 7 VESPLLKELFYLIDQKGPIPFDAYMSMHQGHPQFGYYSRGTQKKIGKNGDFFTSVSVGSL 66 Query: 59 FGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVET 118 FGE LA W+Q + ++E G G G + DI+ + K +P LS +E Sbjct: 67 FGECLARQCCEVWKQLEMNGSLWILEAGAGGGELACDIVDWLDKNEPGLSKNLSYLFLEP 126 Query: 119 SERLTLIQKKQLASYGDKINWYTSLADVPL-----GFTFLVANEFFDSLPIKQFVMTEHG 173 Q+K++ K + + L+ANEF DSLP K+ Sbjct: 127 FAENQEEQRKEIQKRMGKTHRFHWFLGWEDLPTLSSPVILIANEFLDSLPFKRITFRHGQ 186 Query: 174 IRERMIDIDQHDSLVFNIGDHEI-----KSNFLTCSDYFLGAIFENSPCRDREMQSISDR 228 E+ + ++ + L F G E + + + Sbjct: 187 WMEQYLYSNKENKLCFVDLPITKGSLLEDLVHKIALPEIDGYTTEIHTEAWKWIFQAGTK 246 Query: 229 LACDGGTA----IVIDYGYLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSS 283 L + + R TL+ K H PL+ PG +D+++H++F + Sbjct: 247 LDSSLFFIIDYGLSEGEYFAPWRSKGTLRCYKDHKVFSDPLLFPGISDITAHLNFSLVVK 306 Query: 284 IAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGE 343 A + G Q F +G + S+ KD +S R + MG+ Sbjct: 307 AAEESGMEAIGWLDQHHFF--MGWLDQMHSMDPLFLLKDPRRESWIRKFFMLSHPLFMGK 364 Query: 344 LFKILVVSHEKVELMP 359 FK L++S L P Sbjct: 365 NFKFLLLSKN---LPP 377 >gi|225681513|gb|EEH19797.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03] Length = 439 Score = 200 bits (509), Expect = 2e-49, Method: Composition-based stats. Identities = 113/425 (26%), Positives = 173/425 (40%), Gaps = 98/425 (23%) Query: 25 FALCVADPEFGYYSTC-------NPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFP 77 C+ P+ GYY++ FGA GDFVT+PEISQIFGE+L I+ + W G Sbjct: 1 MRQCLTSPDGGYYTSRGQEAEGTEIFGAKGDFVTSPEISQIFGELLGIWTVAEWMGQGRR 60 Query: 78 S-CVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGD- 135 V+++ELGPG+G +M D+LR I K ++ ++Y+VE S L +Q K L Sbjct: 61 KGGVQIIELGPGKGTLMADMLRSIRNFKTFASAIEAVYLVEASTVLREVQHKLLCGDAPT 120 Query: 136 -------------------KINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE----- 171 L D P F+ A+EFFD+LPI F E Sbjct: 121 EEMEVGYKSTSVHLGVPVIWTEHIKLLPDEPDKTPFIFAHEFFDALPIHAFQSIETPPRP 180 Query: 172 -----------------------HGIRERM-------IDIDQHDSLVFNIGDHEIKSN-- 199 RE + ++ + F++ + ++ Sbjct: 181 QTINTPTGPATLHNPPATSSSPATQWRELVVSPNPEIPELKSGNEPEFHLSLAKSPTSSS 240 Query: 200 --------FLTCSDYFLGAIFENSPCRDREMQSISDRL---------------ACDGGTA 236 G+ E SP Q I+ R+ G A Sbjct: 241 LVLPEMSPRYKAMKSTPGSTIEISPEGQTCAQDIARRIGGSFSSSSSEQSNKKRVPSGAA 300 Query: 237 IVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILY--KLYING 294 +++DYG + ++L+ ++ H VSP PGQ D+S++VDF L+ AI + + G Sbjct: 301 LILDYGTTSTIPINSLRGIRKHQLVSPFAVPGQVDISANVDFTALAEAAIDASPGVEVYG 360 Query: 295 LTTQGKFLEGLGIWQRAFSLMKQ------TARKDILLDSVKRLVSTSADKKSMGELFKIL 348 Q +FLE LGI +RA L+ + ++ + KRLV MG+L+K L Sbjct: 361 PVEQCQFLEALGISKRASQLLTKVEGEGGEEKRKRIESGWKRLVERGG--GGMGKLYKAL 418 Query: 349 VVSHE 353 + E Sbjct: 419 AIVPE 423 >gi|33862938|ref|NP_894498.1| hypothetical protein PMT0666 [Prochlorococcus marinus str. MIT 9313] gi|33634855|emb|CAE20841.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT 9313] Length = 405 Score = 200 bits (509), Expect = 2e-49, Method: Composition-based stats. Identities = 84/370 (22%), Positives = 154/370 (41%), Gaps = 29/370 (7%) Query: 9 IVNLI-KKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIFGEMLAIF 66 +VN I + G ++ QY + D +G Y++ G GDF T+P + F ++LAI Sbjct: 12 LVNRIVQAGGSISFHQYMDWALHDQVYGAYASGQLRIGRQGDFATSPSLGADFAQLLAIQ 71 Query: 67 LICAWEQH----GFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L+ ++Q + L+E+GPG G + D++ + L P L + +VE+++ + Sbjct: 72 LVDWFQQLQQRVDKGKSLSLIEVGPGEGDLSADLISALEDLCPALIPRLELVLVESNKAM 131 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTF--LVANEFFDSLPIKQFVMTEHGIRERMID 180 L Q+++L S + SL ++ ++A+E D+LP+++ V + + + + Sbjct: 132 ALRQRERLKSVTSVPIHWRSLNELAQAPAIGVMLAHEMLDALPVERLVWRDQRLWRQGVC 191 Query: 181 IDQHDS----LVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTA 236 ++ DS + + + LT + LG D L A Sbjct: 192 LENVDSVAHLRFTELFLTDALHSALTEARMCLGIQIPPPDAADGWCSEWHGELKSWLSQA 251 Query: 237 ---------IVIDY------GYLQSRVGDTLQAVKGHTYVS-PLVNPGQADLSSHVDFQR 280 +VIDY Y R TL A + L +PG+ DL++H+ + Sbjct: 252 ASALLCGPLLVIDYALEARRYYSAMRPCGTLMAYRQQRASGALLQDPGRWDLTAHLCLET 311 Query: 281 LSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKS 340 L A G + QG+ L LG+ +R +L + + + + D Sbjct: 312 LQLQAEQQGWTFLGESRQGQALLALGLAERLHALQSLPTSQLSAALNRREALLRLVDPVG 371 Query: 341 MGELFKILVV 350 +GE F+ L Sbjct: 372 LGE-FRWLAF 380 >gi|295399164|ref|ZP_06809146.1| protein of unknown function DUF185 [Geobacillus thermoglucosidasius C56-YS93] gi|294978630|gb|EFG54226.1| protein of unknown function DUF185 [Geobacillus thermoglucosidasius C56-YS93] Length = 377 Score = 200 bits (509), Expect = 3e-49, Method: Composition-based stats. Identities = 77/380 (20%), Positives = 136/380 (35%), Gaps = 35/380 (9%) Query: 9 IVNLIKKN--GQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAI 65 I I+ + +++ Y L + D +GYY G GDF T+ +S +FG++ A Sbjct: 4 IYQAIQASAQRRLSYADYMELALYDERYGYYMGEKAKIGKGGDFFTSSHVSHVFGKLFAS 63 Query: 66 FLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLI 125 + E++ P + ELG G G +L K P + L+ ++ETS + Sbjct: 64 LFLRLVERNHVPP--HICELGGGDGKFARAVLNEWKKKSPATYKQLTYTVIETSPKQRER 121 Query: 126 QKKQLASYGDKINWYTSLADVPLG----FTFLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 Q + L +K+ Y + + + +NEFFD+ P+ + E + + Sbjct: 122 QLQTLGDASEKVKQYKDIQEFRQHAASFSGIVFSNEFFDAFPVHVITKENGMLYELFVAV 181 Query: 182 DQHDSLVFNIGDHEIKSNFLTCSD----YFLGAIFENSPCRDREMQSISDRLACDGGTAI 237 D + LV E + + G E + + + Sbjct: 182 DGN-KLVEEKHRLENERIVEYLRERQLSLTDGQRLEVPLALKTFLLETARFFR--HCVML 238 Query: 238 VIDYGYLQS------RVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSIAILYKL 290 IDYGY R +L+ H + +PL PG+ D+++H+ + L Sbjct: 239 TIDYGYTDEELQLPARRQGSLRGYYRHRLIANPLSYPGEMDITAHIQWDALRMFGEQAGW 298 Query: 291 YINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVV 350 L Q +FL GI Q + R V + + M F +++ Sbjct: 299 QCVSLLRQDRFLLAAGILQYLEEHDGANPFSEKAQQ--NRAVRSLIIDEGMSAAFHVMIQ 356 Query: 351 SH----------EKVELMPF 360 + E +PF Sbjct: 357 QKGVDVDWEHIWAQREFLPF 376 >gi|88810999|ref|ZP_01126255.1| hypothetical protein NB231_09363 [Nitrococcus mobilis Nb-231] gi|88791538|gb|EAR22649.1| hypothetical protein NB231_09363 [Nitrococcus mobilis Nb-231] Length = 362 Score = 200 bits (508), Expect = 3e-49, Method: Composition-based stats. Identities = 83/354 (23%), Positives = 145/354 (40%), Gaps = 28/354 (7%) Query: 25 FALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLV 83 A+ + +P GYYS FG GDF TAP IS++F LA + ++ Sbjct: 1 MAIALYEPGLGYYSAGQRRFGPAGDFTTAPLISELFARTLAQQVAQILTAL---DGGVVL 57 Query: 84 ELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSL 143 ELG G G M D+L + +L+ ++E S L Q + +A K+ Sbjct: 58 ELGAGTGHMAADLLSELERLEHLP---ERYLILEVSAALRQEQAQTIARTVPKLRSRVEW 114 Query: 144 ADVPLGFT---FLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNF 200 D ++ANE D+LP+K+F + +G++E+++ + + +L + + + K + Sbjct: 115 LDRLPETPLRGVILANEVIDALPVKRFQINSNGVQEQVVTLGEDATLTWALAPADPKLDT 174 Query: 201 LTCS-------DYFLGAIFENSPCRDREMQSISDRLACDGGTAI-----VIDYGYLQSRV 248 + + E P + S++ + + +Y + Q + Sbjct: 175 AVANIEAELGRRLPPDYVSEWCPRLAPWIASLAGVMEAGAALFVDYGYPRAEYYHPQRHM 234 Query: 249 GDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIW 308 G L + + PLV PG D+++ VDF + I L + G TTQ FL G G+ Sbjct: 235 GTLLCHYRNRVHDDPLVLPGLQDITAFVDFTLAARAGIEAGLEVLGFTTQAHFLIGAGLP 294 Query: 309 QRAFSLMKQTARKDI-LLDSVKRLVSTSADKKSMGELFKILVVSHE-KVELMPF 360 + + + + L K L+ MGE +K+L + L F Sbjct: 295 HLLEAETARAPEQAVHLTQQAKALL----FPGQMGERYKVLALGRGIPTPLSGF 344 >gi|78184622|ref|YP_377057.1| hypothetical protein Syncc9902_1049 [Synechococcus sp. CC9902] gi|78168916|gb|ABB26013.1| conserved hypothetical protein [Synechococcus sp. CC9902] Length = 392 Score = 200 bits (508), Expect = 3e-49, Method: Composition-based stats. Identities = 78/374 (20%), Positives = 143/374 (38%), Gaps = 25/374 (6%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYY-STCNPFGAVGDFVTAPEISQIFGEML 63 L + L G + QY L + DP G+Y S GDFVT+ + F +L Sbjct: 12 LAMHLKQL---GGVTSFRQYMDLALNDPNHGFYGSGRAQISRDGDFVTSTALGTDFAGLL 68 Query: 64 AIFLICAWEQHGFP-SCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 A + + + L+E+GPG G ++ D++ + L P L + +VE + + Sbjct: 69 ATQVERWLAELPADLPTLSLIEIGPGEGDLLADLVDALTDLSPQILHRLELVLVEANPGM 128 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGF--TFLVANEFFDSLPIKQFVMTEHGIRERMID 180 Q+ +L + + SL ++ ++A+E D+LP+ + + + +++++ Sbjct: 129 KQRQQARLQHLTNIPMRWCSLDELLAAPLRGLVLAHELLDALPVDRLTFDDGVMWQQLVE 188 Query: 181 IDQHDSLVFNIGDHEIKSNFL------------TCSDYFLGAIFENSPCRDREMQSISDR 228 +D +LVF+ G + D G E + +S Sbjct: 189 LDDDGALVFSKGHVPPQLAAEIERVCKRCELVLPPPDAEPGWTTEWHSGSSNWFKQLSQA 248 Query: 229 LACDGG----TAIVIDYGYLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSS 283 L A+ + Y R TL AV+ SPL PG DL++H+ + + Sbjct: 249 LDQGVLLVVDYALEMHRYYSARRSDGTLMAVQAQRAGLSPLDKPGSQDLTAHICIETVED 308 Query: 284 IAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGE 343 A+ G QG+ L LG+ +R + L + + D +G+ Sbjct: 309 AAVQAGWTCMGQLRQGEALLALGLAERLYGLQSLPPGDLPQALQRREAMLRLVDPSGLGD 368 Query: 344 LFKILVVSHEKVEL 357 F+ L+ Sbjct: 369 -FRWLLFGKGVNPA 381 >gi|164658345|ref|XP_001730298.1| hypothetical protein MGL_2680 [Malassezia globosa CBS 7966] gi|159104193|gb|EDP43084.1| hypothetical protein MGL_2680 [Malassezia globosa CBS 7966] Length = 421 Score = 200 bits (508), Expect = 3e-49, Method: Composition-based stats. Identities = 103/407 (25%), Positives = 167/407 (41%), Gaps = 81/407 (19%) Query: 25 FALCVADPEFGYYSTC------NPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPS 78 C+ +P++GYY++ G GDF+T+PEISQ+FGE+LA+F I W+ G P Sbjct: 1 MQACLTNPDYGYYASKSQQENSRILGTRGDFITSPEISQVFGELLAVFFISRWQSAGAPK 60 Query: 79 CVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLAS------ 132 VR+VELGPGRG ++ D+LR ++ SI ++E+S Q+ L++ Sbjct: 61 NVRIVELGPGRGTLLCDMLRTFSAFPDMISAIRSIELIESSPLFIEQQEANLSATLSRFG 120 Query: 133 ------------------YGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGI 174 + Y + P +T +VA+EFFD+LPI F G Sbjct: 121 RSIANADTPVDQLAPNDLRVEWFASYEQVPTEPNAWTIVVAHEFFDALPIHIFEKHIDGW 180 Query: 175 RERMIDIDQHDSLV------------------FNIGDHEIKSNFLTCSD------YFLGA 210 RE M+D++ V + + L + G Sbjct: 181 REVMVDVNDESKPVNVIKASDLGKPRPEPGLRYVLSPGPTAWTQLLAAKSERFKALQPGQ 240 Query: 211 IFENSPCRDREMQSISDRLACDG------------------------GTAIVIDYGYLQS 246 E SP + I + ++ G +VIDYG Sbjct: 241 RVEISPASWTVARKIGEWVSGYPALRPDQAQAPPPDVIDKRSKPSLGGCGLVIDYG-GMR 299 Query: 247 RVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 D+ +A + H V PL PGQ+DL+++VDF L Y G +Q FL LG Sbjct: 300 FFSDSFRAFRAHKLVDPLEMPGQSDLTANVDFTFLMHAIHTTNAYTYGPLSQQDFLTALG 359 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHE 353 + R +L++ + + ++R + + MG ++++ +S Sbjct: 360 LSLRVKNLVE--SNRADRQAEIQRAANRLIETNGMGTQYQVMGISSP 404 >gi|332285798|ref|YP_004417709.1| hypothetical protein PT7_2545 [Pusillimonas sp. T7-7] gi|330429751|gb|AEC21085.1| hypothetical protein PT7_2545 [Pusillimonas sp. T7-7] Length = 392 Score = 200 bits (508), Expect = 3e-49, Method: Composition-based stats. Identities = 88/380 (23%), Positives = 162/380 (42%), Gaps = 47/380 (12%) Query: 7 RKIVNLIKK--NGQMTVDQYFALCVADPEFGYYSTCN-PFG---AVGDFVTAPEISQIFG 60 R + I+ +G + +Q+ + P GYY+ N FG GDF TAPE++ +F Sbjct: 27 RHLGQQIQACESGFLPFEQWMDQALYAPGLGYYAAGNTKFGSTLPTGDFTTAPELTPLFA 86 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 + LA + + + ++E G G G M ++ + +L ++E S Sbjct: 87 QTLARQVAQILQSS---ASCTVLEFGAGSGAMAAAMVPALRELGLQPI----YQILEVSG 139 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 L Q+++L +G+ + W T+L G ++ANE D++PI F E G + + Sbjct: 140 DLQQRQRQRLEQFGEAVQWLTALPQEFSGC--ILANEVLDAMPITLFSWDEQGKLQELGV 197 Query: 181 ID-------QHDSLVFNIGDHEIKSNFLTCS----DYFLGAIFENSPCRDREMQSISDRL 229 +++ F + + G E + + ++ + L Sbjct: 198 RMAQARTTGSNENTPFELAQRAASNQLNNILSQRMPPLPGYQSEINLRAEAWIRQMGAWL 257 Query: 230 ACDGGTAIVIDY------GYLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSSHVDFQRLS 282 G A++IDY Y R T + + H + PL+ G D+++HVDF ++ Sbjct: 258 KR--GAALLIDYGFPQREYYHPQRAEGTLMCHYRHHAHAQPLLYAGLQDITAHVDFTAMA 315 Query: 283 SIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMG 342 A+ L + G T+Q +FL G+ + + L +V++L+S + MG Sbjct: 316 DAALEGGLDVLGYTSQARFLMNAGLPELL-----NFDPQT--LSAVQKLLS----EAEMG 364 Query: 343 ELFKILVVSHE-KVELMPFV 361 ELFK+L + + L+ F+ Sbjct: 365 ELFKVLAIGRDIDQPLIGFI 384 >gi|196248907|ref|ZP_03147607.1| protein of unknown function DUF185 [Geobacillus sp. G11MC16] gi|196211783|gb|EDY06542.1| protein of unknown function DUF185 [Geobacillus sp. G11MC16] Length = 376 Score = 200 bits (508), Expect = 4e-49, Method: Composition-based stats. Identities = 82/365 (22%), Positives = 138/365 (37%), Gaps = 25/365 (6%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIF 59 M +L ++I +G+++ Y + + D FGYY+ G GDF T ++ F Sbjct: 1 MMERLYKEIAA--APDGRVSYADYMQMALYDERFGYYTREREKIGKEGDFFTNSSLAPAF 58 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 G+ LA FLI EQ G P + E G G G + L +L K P + LS +++ S Sbjct: 59 GKALASFLIRLVEQGGLPPA--VCEWGGGDGRLALAVLEEWKKKSPHTYKNLSYTIIDQS 116 Query: 120 ERLTLIQKKQLASYGDKINWY----TSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIR 175 Q++ L +++ Y L + + +NEFFD+ P+ + + Sbjct: 117 PFHRRRQQETLRPVAERVRQYDAISRWLDECGPFSGIVFSNEFFDAFPVHVITKEQGVLY 176 Query: 176 ERMIDIDQHDSLVFNIGDHEIKSNFLTCSD----YFLGAIFENSPCRDREMQSISDRLAC 231 E ++ LV + G E + IS Sbjct: 177 E-CFVAARNGRLVEEKAPLRNPDIIHYLDERGLALSEGQRLEVPLAMKQFWLEISPLFRQ 235 Query: 232 DGGTAIVIDYGYL------QSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSI 284 + IDYGY +R +L+ H + PL +PG+ DL+SHV + L Sbjct: 236 --AVMVTIDYGYTDEQLGAPTRRHGSLRGYFRHQLISDPLCHPGEMDLTSHVQWDALRLY 293 Query: 285 AILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGEL 344 A Q +FL G+ ++ ++A R++ T + Sbjct: 294 ARRAGWEEVAFVRQDRFLLAAGLLN--EWVVSESAELFSSASRQNRMIRTLITDDGISRF 351 Query: 345 FKILV 349 F +L+ Sbjct: 352 FDVLI 356 >gi|87124398|ref|ZP_01080247.1| hypothetical protein RS9917_12330 [Synechococcus sp. RS9917] gi|86167970|gb|EAQ69228.1| hypothetical protein RS9917_12330 [Synechococcus sp. RS9917] Length = 396 Score = 199 bits (507), Expect = 4e-49, Method: Composition-based stats. Identities = 79/373 (21%), Positives = 145/373 (38%), Gaps = 33/373 (8%) Query: 18 QMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 + Q+ L + PE G Y + G GDF T+P + F +LA + +Q Sbjct: 22 SIPFRQFMELALHHPEHGAYGSGRLQVGPRGDFATSPSLGPDFAALLAPQIAQWLQQQPI 81 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + LVE GPG G D+ + + P+ + ++ ++E + + Q+ +L + Sbjct: 82 DQPLALVEAGPGEGDCAWDLAQELTAGWPELAARTTLLLLEPNAGMAERQRARLR-HCPL 140 Query: 137 INWYTSLADVPLGFT--FLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ----------- 183 + S+A++ ++A+E D+L +++ V R + + + Q Sbjct: 141 PCRWPSVAELAAKPVRGVVLAHEVLDALAVERIVWDGALWRRQQVALQQVPGAEPSLRLE 200 Query: 184 --HDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDY 241 + E + + G E P + +++ + L D G +VIDY Sbjct: 201 PGEPLGPEQLAQLEPLGLLTPGAQHAPGWCTELHPEQGPWLRAAAAAL--DSGVLLVIDY 258 Query: 242 ------GYLQSRVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYING 294 Y R G TL A + PL PG DL++H+ + L A+ G Sbjct: 259 ALEAWRYYAPQRSGGTLMAYRQQRASPDPLQEPGCWDLTAHLCLETLEQAALAAGWQPLG 318 Query: 295 LTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVV---- 350 QG+ L LG+ QR L Q+ L + + + D +S+G+ F+ + Sbjct: 319 QRRQGEALLALGLAQRLHGLQHQSGAGLEALLARREALLRLVDPRSLGD-FRWIAFRRSS 377 Query: 351 --SHEKVELMPFV 361 + F+ Sbjct: 378 AGTSNPASPPLFL 390 >gi|72381955|ref|YP_291310.1| hypothetical protein PMN2A_0115 [Prochlorococcus marinus str. NATL2A] gi|72001805|gb|AAZ57607.1| conserved hypothetical protein [Prochlorococcus marinus str. NATL2A] Length = 400 Score = 199 bits (507), Expect = 4e-49, Method: Composition-based stats. Identities = 77/378 (20%), Positives = 149/378 (39%), Gaps = 33/378 (8%) Query: 9 IVNLI-KKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIFGEMLAIF 66 +++ I G ++ +Y L + DP+ G YST G GDF T+P +S F +LAI Sbjct: 12 LIDRIGDSGGSISFYRYMDLVLNDPDNGVYSTGKLNIGKNGDFCTSPSLSNDFARLLAIQ 71 Query: 67 LICAWEQHGFPSCVR----LVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 ++ LVE+GPG G + D++ I ++ P + + +VE + + Sbjct: 72 VVDWLLDLEKSGIDSKLLSLVEIGPGEGTLSRDLIVAIAEIAPALICKVELVLVELNVGM 131 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFT--FLVANEFFDSLPIKQFVMTEHGIRERMID 180 Q+K + + ++S+ D+ L ++ANE D+ P+++ V ++ + + + Sbjct: 132 RRRQEKVVNNLEGINCRWSSIEDLILRPVTGVVIANEVLDAFPVERLVFNDNKVFRQGVS 191 Query: 181 IDQHDSLV---------------FNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSI 225 + + + F + D + E ++ Sbjct: 192 LKKINDEYSLEFVDLKPTSKIIKFLKESKSLLKIEFPPKDICNRWVTEWHCDVPSWFGNL 251 Query: 226 SDRLACDGGTAIVIDY------GYLQSRVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDF 278 S L G +V+DY Y R TL + + H + L + G DL++H+ Sbjct: 252 SKVLI--DGALLVVDYAMESKRYYNAMRQDGTLISYRNHVANPNVLKDAGLCDLTTHLCI 309 Query: 279 QRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADK 338 + + A+ G T QG+ L LG+ +SL + + + D Sbjct: 310 ESTINYALFNGWKFMGETRQGQALLALGLSTFLYSLQNNINNDLSAALNRRESLLRLVDP 369 Query: 339 KSMGELFKILVVSHEKVE 356 +G+ F+ L + + Sbjct: 370 IGLGD-FRWLAFQKDNSD 386 >gi|301093177|ref|XP_002997437.1| conserved hypothetical protein [Phytophthora infestans T30-4] gi|262110693|gb|EEY68745.1| conserved hypothetical protein [Phytophthora infestans T30-4] Length = 419 Score = 199 bits (506), Expect = 5e-49, Method: Composition-based stats. Identities = 91/328 (27%), Positives = 152/328 (46%), Gaps = 71/328 (21%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGE 61 EN L+ + ++I+ G +TV ++ + ++ P+ GYY + FG+ GDF TAPEISQ+FGE Sbjct: 88 ENSLVHVLRSMIEVKGPLTVAEFMSRSLSHPDHGYYMKKDVFGSQGDFTTAPEISQMFGE 147 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 ++A++ + W+Q G PS +++VE+GPGRG +M D LR P + + I+MV+ S Sbjct: 148 LIAVWCVATWQQMGMPSHIKIVEMGPGRGSLMSDFLRAAKSFPPF-YDAIEIHMVDISPA 206 Query: 122 LTLIQKK-----------------QLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPI 164 + IQ++ +L G + W+ A+VP G + ++A E FD+LP+ Sbjct: 207 MQKIQQETLKCEPVEDKTAPENTMRLPDNGPTVRWHADFANVPHGPSLMIAQELFDALPV 266 Query: 165 KQFVMTEHGIRERMIDIDQH---DSLVFNIGDHEIKSNFLTC------------------ 203 QF T+ G RER++DID D F + + + Sbjct: 267 HQFEYTDRGWRERLVDIDFEDGGDHFRFVLSPGPTPATRVYIGREKLFDPSTALSHVAET 326 Query: 204 -------------------------------SDYFLGAIFENSPCRDREMQSISDRLACD 232 +G E SP +Q ++ R++ Sbjct: 327 HISGVEDLKKMQETVVQRLDVADVTGTPVHTPQAQVGDKIEISPVSIALVQDMAKRISQS 386 Query: 233 GGTAIVIDYGYLQSRVGDTLQAVKGHTY 260 GG A+++DYGY +L+ +K H + Sbjct: 387 GGGALIVDYGYDHP-SELSLRGIKNHEF 413 >gi|291279650|ref|YP_003496485.1| hypothetical protein DEFDS_1262 [Deferribacter desulfuricans SSM1] gi|290754352|dbj|BAI80729.1| conserved hypothetical protein [Deferribacter desulfuricans SSM1] Length = 375 Score = 199 bits (505), Expect = 6e-49, Method: Composition-based stats. Identities = 81/359 (22%), Positives = 145/359 (40%), Gaps = 24/359 (6%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 +L I+ +K N +MT ++ L + P++GYY NPFG G F T+ + S+ FG L Sbjct: 3 ELRELIIEKVK-NKKMTFAEFMNLALYHPQYGYYQKENPFGMQGSFYTSVDASESFGRSL 61 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 A + A + L E+G G G++ DIL +P F+ + ++E SE L Sbjct: 62 ARGFLKAINELDLEP--VLCEMGAGSGMLANDILNFYKDEEPQFYEKIEYIIIEKSEYLI 119 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 QK+ L ++ KI+W+ S ++ +NE D+ P+ + + ++E + Sbjct: 120 DKQKELLKAHEGKISWH-SFKELRDFDGVFFSNELVDAFPVHRIINISGELKELYVIYHD 178 Query: 184 HDSLVF--NIGDHEIKSNFLTCS-DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 + + E+K + I + + + ++ + ++ G + ID Sbjct: 179 DKLQFYPDDFSTEELKEYINRLNIKLVDKQIADINLDAVKWIRDLGKKINK--GLVVTID 236 Query: 241 YG------YLQSRVGDTLQAVKGH-TYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYIN 293 YG Y R+ T+ H G D+++ VDF L L + Sbjct: 237 YGFEAKQLYAPFRMDGTVTCYFKHTQNNDFFERVGYQDITAFVDFSALMEYGKESGLDVV 296 Query: 294 GLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 Q FL GI + A D+ ++ L+ + G F +L+ S Sbjct: 297 NFEPQWLFLLQSGILDEI-----KYAESDLHKARIRSLIIP---EGGFGTNFNVLIQSK 347 >gi|171909932|ref|ZP_02925402.1| hypothetical protein VspiD_02130 [Verrucomicrobium spinosum DSM 4136] Length = 390 Score = 199 bits (505), Expect = 7e-49, Method: Composition-based stats. Identities = 78/350 (22%), Positives = 138/350 (39%), Gaps = 22/350 (6%) Query: 20 TVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPS 78 + L + +P GYY G GDF T+ + ++GE+LA W G P Sbjct: 25 PWAEVMQLALYEPLVGYYRQGVRRIGRGGDFYTSVSVGPLYGELLAEHATGVWTAAGCPE 84 Query: 79 CVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQL-ASYGDKI 137 ++E G G + D++ + + P+ L ++E E L Q+ +L + + Sbjct: 85 RFAVLEQGAHDGTLARDLVEAVHRHHPELAVSLRYVIIEPDETLREAQQARLGPEFAAHL 144 Query: 138 NWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIK 197 + + A+VP L+ NE D+ + + T G +E + + F G Sbjct: 145 SHAATWAEVPEVQGLLICNELLDAFAVHRIEFTSEGWKELHVTTRGDGAFEFVQGPPSTP 204 Query: 198 SNFLTC----SDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYL------QSR 247 + +D+ +G + E + + +S + G ++ DYG+ R Sbjct: 205 GLQVELERLGNDFPIGFVTELNVAMLGWLAEVSQ--SAFTGEILLADYGHAAREYYIPER 262 Query: 248 VGDTLQAVKGHTYVS-PLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 G TL+ H L N G+ADL++HV+F RL+ A+ + + QG+FL Sbjct: 263 NGGTLRRYCQHRTDDRVLENLGEADLTAHVNFTRLAEQAVALGMTVMEFIEQGRFLT--- 319 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE 356 A L+++ + R T +G F IL + E Sbjct: 320 --HVAAHLLRRPGFSPD--PAWLRQFQTLTHPGHLGHAFHILALQKGGWE 365 >gi|229135084|ref|ZP_04263887.1| hypothetical protein bcere0014_39880 [Bacillus cereus BDRD-ST196] gi|228648372|gb|EEL04404.1| hypothetical protein bcere0014_39880 [Bacillus cereus BDRD-ST196] Length = 370 Score = 198 bits (504), Expect = 9e-49, Method: Composition-based stats. Identities = 72/356 (20%), Positives = 134/356 (37%), Gaps = 17/356 (4%) Query: 14 KKNGQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWE 72 +K ++ Y L + E GYY G GDF T+ +S +F + A F I E Sbjct: 14 EKGHSISYSTYMNLVLYAEEHGYYMREREKIGRQGDFFTSSNVSSVFAKTFAKFFIRLVE 73 Query: 73 QHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLAS 132 + + E+G G G D+L+ +L P+ F L+ ++E S +Q+++L Sbjct: 74 K--GEVFPNICEIGGGTGKFAYDVLQEWKQLSPETFINLNYSIIEVSPFHRRLQQERLCL 131 Query: 133 YGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIG 192 + + + + + + +NE FD+ P++ + E I +L Sbjct: 132 FDNVSYYTSYIEMGESFEGIIFSNELFDAFPVEIVEKRNGILYEVRITYTDEGNLTEVFR 191 Query: 193 DHEIKSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS--- 246 E + G FE + ++ I+ I +DYGY + Sbjct: 192 PIEKRVGRYLLKYNIHIAEGQRFEVPIAMEEYIEEIAKWFQKGIC--ITVDYGYTKEEWM 249 Query: 247 ---RVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 +L+ H +PL PG+ DL++HV + L L + Q +FL Sbjct: 250 HPAHREGSLRGYYKHKLIRNPLAYPGEMDLTTHVHWDELKMNFELQGINTIWHRKQSEFL 309 Query: 303 EGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM 358 GI + S + + R + + +G F +++ + + L+ Sbjct: 310 LAAGILDQLASHQDTNPFSET--QKLNRAIRSMILNGGLGNAFDVVIHTKDMKNLL 363 >gi|229169006|ref|ZP_04296722.1| hypothetical protein bcere0007_39580 [Bacillus cereus AH621] gi|228614415|gb|EEK71524.1| hypothetical protein bcere0007_39580 [Bacillus cereus AH621] Length = 370 Score = 198 bits (504), Expect = 9e-49, Method: Composition-based stats. Identities = 73/356 (20%), Positives = 135/356 (37%), Gaps = 17/356 (4%) Query: 14 KKNGQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWE 72 +K+ ++ Y L + E GYY G GDF T+ +S +F + A F I E Sbjct: 14 EKDHSISYSTYMNLVLYAEEHGYYMREREKIGRQGDFFTSSNVSSVFAKTFAKFFIRLVE 73 Query: 73 QHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLAS 132 + + E+G G G D+LR +L P+ F L+ ++E S +Q+++L Sbjct: 74 K--GEVSPNICEIGGGTGKFAYDVLREWKQLSPETFINLNYSIIEVSPFHRRLQQERLCL 131 Query: 133 YGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIG 192 + + + + + + +NE FD+ P++ + E I +L Sbjct: 132 FDNVSYYTSYIEMGESFEGIIFSNELFDAFPVEIVEKRNGILYEVRITYTDEGNLTEVFR 191 Query: 193 DHEIKSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS--- 246 E + G FE + ++ I+ I +DYGY + Sbjct: 192 PIEKRVGRYLLKYNIHIAEGQRFEVPIAMEEYIEEIAKWFQKGIC--ITVDYGYTKEEWM 249 Query: 247 ---RVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 +L+ H +PL PG+ DL++HV + L L + Q +FL Sbjct: 250 HPAHREGSLRGYYKHKLIRNPLAYPGEMDLTTHVHWDELKMNFELQGINTIWHRKQSEFL 309 Query: 303 EGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM 358 GI + S + + R + + +G F +++ + + L+ Sbjct: 310 LAAGILDQLTSHQDTNPFSET--QKLNRAIRSMILNGGLGNAFDVVIHTKDMKNLL 363 >gi|163942019|ref|YP_001646903.1| hypothetical protein BcerKBAB4_4113 [Bacillus weihenstephanensis KBAB4] gi|163864216|gb|ABY45275.1| protein of unknown function DUF185 [Bacillus weihenstephanensis KBAB4] Length = 370 Score = 198 bits (503), Expect = 1e-48, Method: Composition-based stats. Identities = 72/356 (20%), Positives = 135/356 (37%), Gaps = 17/356 (4%) Query: 14 KKNGQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWE 72 +K+ ++ Y L + E GYY G GDF T+ +S +F + A F I E Sbjct: 14 EKDHSISYSTYMNLVLYAEEHGYYMREREKIGRQGDFFTSSNVSSVFAKTFAKFFIRLVE 73 Query: 73 QHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLAS 132 + + E+G G G D+L+ +L P+ F L+ ++E S +Q+++L Sbjct: 74 K--GEVFPNICEIGGGTGKFAYDVLQEWKQLSPETFINLNYSIIEVSPFHRRLQQERLCL 131 Query: 133 YGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIG 192 + + + + + + +NE FD+ P++ + E I +L Sbjct: 132 FDNVSYYTSYIEMGESFEGIIFSNELFDAFPVEIVEKRNGILYEVRITYTDEGNLTEVFR 191 Query: 193 DHEIKSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS--- 246 E + G FE + ++ I+ I +DYGY + Sbjct: 192 PIEKRVGRYLLKYNIHIAEGQRFEVPIAMEEYIEEIAKWFQKGIC--ITVDYGYTKEEWM 249 Query: 247 ---RVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 +L+ H +PL PG+ DL++HV + L L + Q +FL Sbjct: 250 HPAHREGSLRGYYKHKLIRNPLAYPGEMDLTTHVHWDELKMNFELQGINTIWHRKQSEFL 309 Query: 303 EGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM 358 GI + S + + R + + +G F +++ + + L+ Sbjct: 310 LAAGILDQLASHQDTNPFSET--QKLNRAIRSMILNGGLGNAFDVVIHTKDMKNLL 363 >gi|124023260|ref|YP_001017567.1| hypothetical protein P9303_15581 [Prochlorococcus marinus str. MIT 9303] gi|123963546|gb|ABM78302.1| Uncharacterized conserved protein [Prochlorococcus marinus str. MIT 9303] Length = 405 Score = 198 bits (503), Expect = 1e-48, Method: Composition-based stats. Identities = 78/372 (20%), Positives = 149/372 (40%), Gaps = 33/372 (8%) Query: 9 IVNLI-KKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIFGEMLAIF 66 + N I + G ++ QY + D +G Y++ G GDF T+P + F ++LAI Sbjct: 12 LANRIVQAGGSISFHQYMDWALHDQVYGAYASGQLHIGRQGDFATSPSLGADFAQLLAIQ 71 Query: 67 LICAWEQHGFPSCV----RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L ++Q L+E+GPG G + D++ + L P L + +VE+++ + Sbjct: 72 LADWFQQLQQHVDKGRSLSLIEVGPGEGDLSADLISALEDLCPALIPRLELVLVESNKAM 131 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTF--LVANEFFDSLPIKQFVMTEHGIRERMID 180 Q+++L S + SL ++ ++A+E D+LP+++ V + + + + Sbjct: 132 VQRQRERLKSVTTVPIHWRSLDELAQAPAIGVMLAHEMLDALPVERLVWRDQRLWRQGVC 191 Query: 181 IDQHDSL-VFNIGDHEIKSNFL--------------TCSDYFLGAIFENSPCRDREMQSI 225 ++ DS+ + + D G E + Sbjct: 192 LENVDSVAHLRFTELSLTDALHSALTEARMFWGIQIPPPDADDGWCSEWHGELKSWLSQA 251 Query: 226 SDRLACDGGTAIVIDY------GYLQSRVGDTLQAVKGHTYVS-PLVNPGQADLSSHVDF 278 + L G ++IDY Y R TL A + L +PG+ DL++H+ Sbjct: 252 ASAL--LYGPLLIIDYALEARRYYSAMRPCGTLMAYRQQRASGALLQDPGRWDLTAHLCL 309 Query: 279 QRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADK 338 + L A G + QG+ L LG+ ++ +L + + + + D Sbjct: 310 ETLQHQAEQQGWSFLGESRQGQALLALGLAEKLHALQSLPTSQLSAALNRREALLRLVDP 369 Query: 339 KSMGELFKILVV 350 +GE F+ L Sbjct: 370 AGLGE-FRWLAF 380 >gi|229061954|ref|ZP_04199281.1| hypothetical protein bcere0026_40280 [Bacillus cereus AH603] gi|228717338|gb|EEL69010.1| hypothetical protein bcere0026_40280 [Bacillus cereus AH603] Length = 370 Score = 198 bits (502), Expect = 1e-48, Method: Composition-based stats. Identities = 72/356 (20%), Positives = 135/356 (37%), Gaps = 17/356 (4%) Query: 14 KKNGQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWE 72 +K+ ++ Y L + E GYY G GDF T+ +S +F + A F I E Sbjct: 14 EKDHSISYSTYMNLVLYAEEHGYYMREREKIGRQGDFFTSSNVSSVFAKTFAKFFIRLVE 73 Query: 73 QHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLAS 132 + + E+G G G D+L+ +L P+ F L+ ++E S +Q+++L Sbjct: 74 K--GEVSPNICEIGGGTGKFAYDVLQEWKQLSPETFINLNYSIIEVSPFHRRLQQERLCL 131 Query: 133 YGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIG 192 + + + + + + +NE FD+ P++ + E I +L Sbjct: 132 FDNVSYYTSYIEMGESFEGIIFSNELFDAFPVEIVEKRNGILYEARITYTDEGNLTEVFR 191 Query: 193 DHEIKSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS--- 246 E + G FE + ++ I+ I +DYGY + Sbjct: 192 PIEKRVGRYLLKYNIHIAEGQRFEVPIAMEEYIEEIAKWFQKGIC--ITVDYGYTKEEWM 249 Query: 247 ---RVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 +L+ H +PL PG+ DL++HV + L L + Q +FL Sbjct: 250 HPAHREGSLRGYYKHKLIRNPLAYPGEMDLTTHVHWDELKMNFELQGINTIWHRKQSEFL 309 Query: 303 EGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELM 358 GI + S + + R + + +G F +++ + + L+ Sbjct: 310 LAAGILDQLASHQDTNPFSET--QKLNRAIRSMILNGGLGNAFDVVIHTKDMKNLL 363 >gi|33865845|ref|NP_897404.1| hypothetical protein SYNW1311 [Synechococcus sp. WH 8102] gi|33633015|emb|CAE07826.1| conserved hypothetical protein [Synechococcus sp. WH 8102] Length = 409 Score = 197 bits (501), Expect = 2e-48, Method: Composition-based stats. Identities = 79/367 (21%), Positives = 144/367 (39%), Gaps = 25/367 (6%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEISQIFGEML 63 L + G + Q+ + PE GYY + G GDF T+P + F +L Sbjct: 28 LATLLHQ---AGGTVPFRQFMDWALHHPEHGYYGSGRVRIGPQGDFATSPSLGPDFATLL 84 Query: 64 AIFLICAWEQH-GFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 LI S + LVE+GPG G + D+L V+ + PD + +VE S L Sbjct: 85 GRQLIDLLRNLSDQASTLSLVEVGPGEGDLAADLLTVLARQAPDLIERCELVLVERSPSL 144 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFT--FLVANEFFDSLPIKQFVMTEHGIRERMID 180 Q+++L + + ++ L+A+E D+ P+ + V+ + + + + Sbjct: 145 RQRQQQRLEGISGCPVRWCGIEELQSSPIQGVLLAHELLDAFPVDRLVLKQGELALQGVR 204 Query: 181 IDQHDSLVFNIGDHE--------IKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACD 232 + Q+D L L G E +++ ++ Sbjct: 205 LQQNDQLTSVPLALPDTLQEQLQTSGLELPPPGSEDGWTTEWHSNLRPWFGTLASAVS-- 262 Query: 233 GGTAIVIDY------GYLQSRVGDTLQAVKGHTY-VSPLVNPGQADLSSHVDFQRLSSIA 285 G +VIDY Y R TL A + ++PL + G+ DL++H+ + L+ A Sbjct: 263 DGALLVIDYAHEASRYYTARRSEGTLMAYRDGMAGMNPLAHAGEQDLTAHLCIETLTQAA 322 Query: 286 ILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELF 345 + + QG+ L LG+ +L + A + + + D ++G+ F Sbjct: 323 AHHGWQLRDQRRQGEALLALGLANDLHALQQLPASELAEALRRREALLRLVDPAALGD-F 381 Query: 346 KILVVSH 352 + L+ S Sbjct: 382 RWLLFSR 388 >gi|241667927|ref|ZP_04755505.1| hypothetical protein FphipA2_04091 [Francisella philomiragia subsp. philomiragia ATCC 25015] gi|254876467|ref|ZP_05249177.1| conserved hypothetical protein [Francisella philomiragia subsp. philomiragia ATCC 25015] gi|254842488|gb|EET20902.1| conserved hypothetical protein [Francisella philomiragia subsp. philomiragia ATCC 25015] Length = 378 Score = 197 bits (500), Expect = 2e-48, Method: Composition-based stats. Identities = 80/365 (21%), Positives = 143/365 (39%), Gaps = 25/365 (6%) Query: 4 KLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYS-TCNPFGAVGDFVTAPEISQIFGE 61 + I+ IK N M + + + P+FGYYS + + GDF+TA + +F Sbjct: 2 SIENVILEKIKTTNKPMLFRDFMQMALYYPKFGYYSGSKEKISSNGDFITATSQTSLFAR 61 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 A + G S ++E G G G +D + + L +VE S Sbjct: 62 TFARQFALVISELGRDS--NVIEFGAGSGKFAIDCMHELNSLGSLPDK---YIIVELSND 116 Query: 122 LTLIQK----KQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 L Q+ K + + D+ W T L L + ANE D++P+ F + + ++ Sbjct: 117 LKSRQQENVRKNIPEHYDRFEWVTELPRFKLK-AVVFANEVLDAMPVDIFRAQNNQLIQQ 175 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDR--EMQSISDRLAC--DG 233 + + + ++ +++ + FE+ + ++ L + Sbjct: 176 GVGFVNNKWKLVDMIENDKLFEYEANRILLDNINFEDEYASEINTWIRPWIKSLYDILES 235 Query: 234 GTAIVIDYGYLQS------RVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAI 286 G + DYGY +S R TL H P +N G+ D+++HVDF ++ A Sbjct: 236 GMIFLCDYGYHRSLYYSKERSMGTLACYHQHKTNYDPFINIGEQDITAHVDFTTVAEAAT 295 Query: 287 LYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFK 346 ++G TQ FL+ GI + K KD L + + + E+FK Sbjct: 296 EAGFQLDGYMTQANFLKKAGIADVFDDVSKNLKSKDQLRYTND--IKELLLNDKLAEVFK 353 Query: 347 ILVVS 351 ++ S Sbjct: 354 VIGFS 358 >gi|138896014|ref|YP_001126467.1| hypothetical protein GTNG_2377 [Geobacillus thermodenitrificans NG80-2] gi|134267527|gb|ABO67722.1| Conserved hypothetical protein [Geobacillus thermodenitrificans NG80-2] Length = 375 Score = 196 bits (499), Expect = 4e-48, Method: Composition-based stats. Identities = 80/362 (22%), Positives = 136/362 (37%), Gaps = 25/362 (6%) Query: 6 IRKIVNLIKK--NGQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEM 62 + ++ I +G+++ Y + + D FGYY+ G GDF T ++ FG+ Sbjct: 1 MERLYKEIAAAPDGRVSYADYMQMALYDERFGYYTREREKIGKEGDFFTNSSLAPAFGKA 60 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 LA FLI EQ G P + E G G G + L +L K P + LS +++ S Sbjct: 61 LASFLIRLVEQGGLPPA--VCEWGGGDGRLALVVLEEWKKKSPHTYKNLSYTIIDQSPFH 118 Query: 123 TLIQKKQLASYGDKINWY----TSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERM 178 Q++ L +++ Y L + + +NEFFD+ P+ + + E Sbjct: 119 RRCQQETLRPVAERVRQYDAISRWLDECGPFSGIVFSNEFFDAFPVHVITKEQGVLYE-C 177 Query: 179 IDIDQHDSLVFNIGDHEIKSNFLTCSD----YFLGAIFENSPCRDREMQSISDRLACDGG 234 ++ LV + G E + IS Sbjct: 178 FVAARNGRLVEEKAPLRNPDIIHYLDERGLALSEGQRLEVPLAMKQFWLEISPLFRQ--A 235 Query: 235 TAIVIDYGYL------QSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSIAIL 287 + IDYGY +R +L+ H + PL +PG+ DL+SHV + L A Sbjct: 236 VMVTIDYGYTDEQLGAPTRRHGSLRGYFRHQLISDPLCHPGEMDLTSHVQWDALRLYARR 295 Query: 288 YKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKI 347 Q +FL G+ ++ ++A R++ T + F + Sbjct: 296 AGWEEVAFVRQDRFLLAAGLLN--EWVVSESAELFSSASRQNRMIRTLITDDGISRFFDV 353 Query: 348 LV 349 L+ Sbjct: 354 LI 355 >gi|167627370|ref|YP_001677870.1| hypothetical protein Fphi_1145 [Francisella philomiragia subsp. philomiragia ATCC 25017] gi|167597371|gb|ABZ87369.1| conserved hypothetical protein [Francisella philomiragia subsp. philomiragia ATCC 25017] Length = 378 Score = 196 bits (498), Expect = 4e-48, Method: Composition-based stats. Identities = 80/365 (21%), Positives = 142/365 (38%), Gaps = 25/365 (6%) Query: 4 KLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYS-TCNPFGAVGDFVTAPEISQIFGE 61 + I+ IK N M + + + P+FGYYS + + GDF+TA + +F Sbjct: 2 SIENVILEKIKTTNKPMLFRDFMQMALYYPKFGYYSGSKEKISSTGDFITATSQTSLFAR 61 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 A + G S ++E G G G +D + + L +VE S Sbjct: 62 TFARQFALVISELGRDS--NVIEFGAGSGKFAVDCMHELNGLGSLPDK---YIIVELSND 116 Query: 122 LTLIQK----KQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 L Q+ K + D+ W T L L + ANE D++P+ F + + ++ Sbjct: 117 LKSRQQESVRKNIPELYDRFEWVTELPKYKLK-AVVFANEVLDAMPVDIFRAQNNQLIQQ 175 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDR--EMQSISDRLAC--DG 233 + + + ++ +++ + FE+ + ++ L + Sbjct: 176 GVSFVNNKWNLVDMIENDKLFKYEANRILLDNINFEDEYASEINTWIRPWIKSLYDILES 235 Query: 234 GTAIVIDYGYLQS------RVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAI 286 G + DYGY +S R TL H P +N G+ D+++HVDF ++ A Sbjct: 236 GMIFLCDYGYHRSLYYSKERSMGTLACYHQHKTNYDPFINIGEQDITAHVDFTTVAEAAT 295 Query: 287 LYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFK 346 ++G TQ FL+ GI + K KD L + + + E+FK Sbjct: 296 EAGFQLDGYMTQANFLKKAGIADVFDDVSKNLKSKDQLRYTND--IKELLLNDKLAEVFK 353 Query: 347 ILVVS 351 ++ S Sbjct: 354 VIGFS 358 >gi|260436393|ref|ZP_05790363.1| conserved hypothetical protein [Synechococcus sp. WH 8109] gi|260414267|gb|EEX07563.1| conserved hypothetical protein [Synechococcus sp. WH 8109] Length = 410 Score = 196 bits (497), Expect = 6e-48, Method: Composition-based stats. Identities = 70/381 (18%), Positives = 143/381 (37%), Gaps = 32/381 (8%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYY-STCNPFGAVGDFVTAPEISQIFGEML 63 L + G + ++ L + + + GYY + GA GDFVT+P + F +L Sbjct: 23 LATHLHQ---AGGAVPFSRFMDLALNEQDHGYYGAGRARIGAQGDFVTSPSLGSDFAALL 79 Query: 64 AIFLICAWEQHGFPSCVR---LVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 A ++ + +VE+GPG G + D++ + + + + + +VE + Sbjct: 80 APQILAWLTSIPRIEPDQRLSIVEIGPGEGHLARDLVAALRGADSELLARIELVLVEANP 139 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFT--FLVANEFFDSLPIKQFVMTEHGIRERM 178 + Q+ L D + SL + ++A+E D+LP+++ + E ++++ Sbjct: 140 GMRRRQQALLQETDDLPLRWCSLEALRRAPVHGVVIAHELLDALPVERLIWREGSLQQQW 199 Query: 179 IDIDQHDSLVFNIGDHEIKSNFL------------TCSDYFLGAIFENSPCRDREMQSIS 226 + ++ L + D G E + + + Sbjct: 200 VKLNPSGGLQTTHRPLPDGLHQEIRRVCSQGSIQLPPPDVEEGWTTEWNSALPDWFAAAA 259 Query: 227 DRLACDGGTAIVIDY------GYLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQ 279 + D G +VIDY + R TL +SP +PG+ DL++H+ + Sbjct: 260 AAV--DAGVLLVIDYALEAQRYFTARRSDGTLMAVCAQQAGLSPFDHPGEQDLTAHLCIE 317 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKK 339 + A + QG+ L LG+ +R + L + ++ + + D Sbjct: 318 VVDEAAQRNGWMVGDQAKQGEALLALGLAERLYGLQQLPGQQLAEALQRREALLRLVDPA 377 Query: 340 SMGELFKILVVSHEKVELMPF 360 +G F+ L + F Sbjct: 378 GLGG-FRWLT-YRRGLPEDGF 396 >gi|198282441|ref|YP_002218762.1| hypothetical protein Lferr_0301 [Acidithiobacillus ferrooxidans ATCC 53993] gi|198246962|gb|ACH82555.1| protein of unknown function DUF185 [Acidithiobacillus ferrooxidans ATCC 53993] Length = 375 Score = 195 bits (495), Expect = 9e-48, Method: Composition-based stats. Identities = 76/368 (20%), Positives = 141/368 (38%), Gaps = 38/368 (10%) Query: 2 ENKLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIF 59 + L+ I I G + +Y L + P GYY FGA GDFVTAPE+ ++ Sbjct: 31 SSALLTHIRREIDAAGGMIPFRRYMELALYTPGLGYYMAGQTRFGAAGDFVTAPEMGRVL 90 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 +LA L G ++E G G G + I ++ ++E S Sbjct: 91 AAVLARTLQPDLGPDG------ILEFGGGSGALAGQIRDILPDTP--------YTLLEPS 136 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMI 179 L Q+ + I +L + G ++A+E D++P++ + G Sbjct: 137 PDLQARQRSVVTG----IQHLQTLPEHWRG--VMLAHEVLDAMPVQVLELDASGQLHECG 190 Query: 180 DIDQHDSLVFNIGDHEIKSNFLTCS-----DYFLGAIFENSPCRDREMQSISDRLACDGG 234 ++L + + + + + E + + ++ ++ RL Sbjct: 191 VRWNGEALDWVMLPPPVAAPLAARLAPYVTHWPRPYRTEINLTAESWLREVAARLDSGAI 250 Query: 235 TAI-----VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYK 289 I ++ + Q ++G + H P PG DL++HVD+ L + A+ Sbjct: 251 LLIDYGYEAAEFYHPQRQMGSLRAYYRHHWLDDPFYLPGLCDLTAHVDYTALMTAAVEAG 310 Query: 290 LYINGLTTQGKFLEGLGIWQRAFSLMKQTAR--KDILLDSVKRLVSTSADKKSMGELFKI 347 L + +FL G+ + +L Q + L + +KRL + MGE FK+ Sbjct: 311 LEVAFYGHLARFLVEHGLAEVYGTLCAQAGEDGRFALNNEIKRLTL----PQEMGESFKV 366 Query: 348 LVVSHEKV 355 L++ + Sbjct: 367 LILKKREN 374 >gi|152976693|ref|YP_001376210.1| hypothetical protein Bcer98_2988 [Bacillus cereus subsp. cytotoxis NVH 391-98] gi|152025445|gb|ABS23215.1| protein of unknown function DUF185 [Bacillus cytotoxicus NVH 391-98] Length = 370 Score = 195 bits (495), Expect = 9e-48, Method: Composition-based stats. Identities = 80/357 (22%), Positives = 144/357 (40%), Gaps = 23/357 (6%) Query: 18 QMTVDQYFALCVADPEFGYYS-TCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y + + + E GYY + N G GDF T +S + A F I + G Sbjct: 18 SISYRTYMEVALYNQEHGYYMNSRNKIGRKGDFFTTSNVSSAVAKTFARFFIRLVQ--GG 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 L E+G G G D+L+ +L P+ F+ L ++E S +QK+QL S+ + Sbjct: 76 EIEPNLCEIGAGTGRFAYDVLQEWQRLSPETFADLHYSIIELSPFHRKLQKQQLNSFSN- 134 Query: 137 INWYTSLADVPLGFT-FLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHE 195 I+ + S ++ F + +NE FD+ P++ + + E + L E Sbjct: 135 ISQHQSYQELGDSFAGIVFSNELFDAFPVEVIEKRKGVLYEVRVSYTNEGKLTEVFRPIE 194 Query: 196 IKSNFLTCSD---YFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------ 246 ++ G FE + ++ I+ + G I +DYGY + Sbjct: 195 NETVHYLRRHHIRLSEGQRFEVPIVAESYIEEIAAWIKE--GLLITVDYGYTKEEWMHPA 252 Query: 247 RVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGL 305 +L+ H SPL PG+ D+++HV + + LY T Q FL Sbjct: 253 HREGSLRGYYHHQLIRSPLAYPGEMDITAHVHWDEIKKAGETTGLYTVWHTKQSTFLLAA 312 Query: 306 GIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMPFVN 362 GI ++ + + + + R + + +G+ F IL+ +PF++ Sbjct: 313 GILEQLMNHQDRNPFSEN--QKINRSIRSMILDGGIGDAFDILIQKKG----LPFLD 363 >gi|261417698|ref|YP_003251380.1| hypothetical protein GYMC61_0199 [Geobacillus sp. Y412MC61] gi|297529393|ref|YP_003670668.1| hypothetical protein GC56T3_1049 [Geobacillus sp. C56-T3] gi|319767494|ref|YP_004132995.1| hypothetical protein GYMC52_2471 [Geobacillus sp. Y412MC52] gi|261374155|gb|ACX76898.1| protein of unknown function DUF185 [Geobacillus sp. Y412MC61] gi|297252645|gb|ADI26091.1| protein of unknown function DUF185 [Geobacillus sp. C56-T3] gi|317112360|gb|ADU94852.1| protein of unknown function DUF185 [Geobacillus sp. Y412MC52] Length = 377 Score = 195 bits (495), Expect = 1e-47, Method: Composition-based stats. Identities = 76/378 (20%), Positives = 128/378 (33%), Gaps = 30/378 (7%) Query: 8 KIVNLIKK--NGQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLA 64 ++ I G+++ Y + + D FGYY G GDF T + +FG+ LA Sbjct: 3 RLYEQIAAAPGGRVSYADYMQMALYDERFGYYMRERAKIGKEGDFFTNSSFAPVFGKALA 62 Query: 65 IFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTL 124 + EQ G P + E G G G + L +L K P + LS +++ S Sbjct: 63 SLWVRMVEQSGLPPA--VCEWGGGDGRLALSVLEEWKKKSPHTYDRLSYTIIDQSPFHRR 120 Query: 125 IQKKQLASYGDKINWYTS----LADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 Q++ L +K+ Y LA+ + +NEFFD+ P+ V + E + Sbjct: 121 RQRETLQPAAEKVEQYDDVSRWLAERGPFSGIVFSNEFFDAFPVHVIVKEGGVLHECFVA 180 Query: 181 IDQH---DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAI 237 + ++ D G E + + Sbjct: 181 ARDGRLVEEKAPLCRPAIVRYLHERGLDLAEGQRLEVPLAMKAFWLEVGTLF--HQAVMV 238 Query: 238 VIDYGYLQS------RVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSIAILYKL 290 IDYGY R +L+ H V PL +PG+ DL+SHV + A Sbjct: 239 TIDYGYTDEQLRAPARRHGSLRGYFRHQLVADPLRHPGEMDLTSHVQWDAFRLYARQAGW 298 Query: 291 YINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVV 350 Q +FL G+ + + R++ + F +L++ Sbjct: 299 EEVAFVRQDRFLLAAGLLH--EWNVSEAGDFFSPASRQNRMIRALIADDGVSRFFDVLIL 356 Query: 351 SH-------EKVELMPFV 361 + F+ Sbjct: 357 QKGMSLAARDFWPAPEFL 374 >gi|218665478|ref|YP_002424640.1| hypothetical protein AFE_0130 [Acidithiobacillus ferrooxidans ATCC 23270] gi|218517691|gb|ACK78277.1| conserved hypothetical protein [Acidithiobacillus ferrooxidans ATCC 23270] Length = 451 Score = 195 bits (495), Expect = 1e-47, Method: Composition-based stats. Identities = 76/368 (20%), Positives = 141/368 (38%), Gaps = 38/368 (10%) Query: 2 ENKLIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIF 59 + L+ I I G + +Y L + P GYY FGA GDFVTAPE+ ++ Sbjct: 107 SSALLTHIRREIDAAGGMIPFRRYMELALYTPGLGYYMAGQTRFGAAGDFVTAPEMGRVL 166 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 +LA L G ++E G G G + I ++ ++E S Sbjct: 167 AAVLARTLQPDLGPDG------ILEFGGGSGALAGQIRDILPDTP--------YTLLEPS 212 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMI 179 L Q+ + I +L + G ++A+E D++P++ + G Sbjct: 213 PDLQARQRSVVTG----IQHLQTLPEHWRG--VMLAHEVLDAMPVQVLELDASGQLHECG 266 Query: 180 DIDQHDSLVFNIGDHEIKSNFLTCS-----DYFLGAIFENSPCRDREMQSISDRLACDGG 234 ++L + + + + + E + + ++ ++ RL Sbjct: 267 VRWNGEALDWVMLPPPVAAPLAARLAPYVTHWPRPYRTEINLTAESWLREVAARLDSGAI 326 Query: 235 TAI-----VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYK 289 I ++ + Q ++G + H P PG DL++HVD+ L + A+ Sbjct: 327 LLIDYGYEAAEFYHPQRQMGSLRAYYRHHWLDDPFYLPGLCDLTAHVDYTALMTAAVEAG 386 Query: 290 LYINGLTTQGKFLEGLGIWQRAFSLMKQTAR--KDILLDSVKRLVSTSADKKSMGELFKI 347 L + +FL G+ + +L Q + L + +KRL + MGE FK+ Sbjct: 387 LEVAFYGHLARFLVEHGLAEVYGTLCAQAGEDGRFALNNEIKRLTL----PQEMGESFKV 442 Query: 348 LVVSHEKV 355 L++ + Sbjct: 443 LILKKREN 450 >gi|297667837|ref|XP_002812171.1| PREDICTED: protein midA homolog, mitochondrial-like isoform 2 [Pongo abelii] Length = 414 Score = 195 bits (495), Expect = 1e-47, Method: Composition-based stats. Identities = 109/378 (28%), Positives = 169/378 (44%), Gaps = 60/378 (15%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P ++ Sbjct: 41 TPMLRHLIYKIKSTGPITVAEYMKEVLTNP---------------------------AKL 73 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV-LSIYMVETSER 121 L I+ I W G + +LVELGPGRG ++ DILRV +L + +S+++VE S++ Sbjct: 74 LGIWFISEWMATGKSTAFQLVELGPGRGTLVGDILRVFTQLGSVLKNCDISVHLVEVSQK 133 Query: 122 LTLIQ--------------------KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDS 161 L+ IQ K + G ++WY L DVP G++F +A+EFFD Sbjct: 134 LSEIQALTLTEEKVPLERNAGSPVYMKGVTKSGIPVSWYRDLQDVPKGYSFYLAHEFFDV 193 Query: 162 LPIKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRD 219 LP+ +F T G RE +DID D L F + + D E P Sbjct: 194 LPVHKFQKTPQGWREVFVDIDPQVSDKLRFVLAPSATPAEAFIQHD-ETRDHVEVCPDAG 252 Query: 220 REMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 ++ +S+R+A GG A+V DYG+ +R DT + GH L+ PG ADL++ VDF Sbjct: 253 VIIEELSERIALTGGAALVADYGHDGTRT-DTFRGFCGHKLHDVLIAPGTADLTADVDFS 311 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR---KDILLDSVKRLVSTSA 336 L +A K+ G Q FL+ +GI R L+ ++ + LL L+ Sbjct: 312 YLRRMA-QGKVASVGPIKQHTFLKNMGIDVRLKVLLDKSNEPSVRQQLLQGYDMLM---- 366 Query: 337 DKKSMGELFKILVVSHEK 354 + K MGE F + + Sbjct: 367 NPKKMGERFNFFALLPHQ 384 >gi|254525590|ref|ZP_05137642.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT 9202] gi|221537014|gb|EEE39467.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT 9202] Length = 396 Score = 194 bits (493), Expect = 2e-47, Method: Composition-based stats. Identities = 80/377 (21%), Positives = 155/377 (41%), Gaps = 35/377 (9%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEISQIFGEML 63 L++KI IK G ++ + + DP GYY + G GDFVT+P +S F ++ Sbjct: 12 LVKKI---IKMGGTISFYDFMNFALNDPINGYYGSGKAELGVRGDFVTSPSLSDDFAFLV 68 Query: 64 AI----FLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 +LI + + E G G G M +++ K +F +S ++E + Sbjct: 69 GKQIEDWLIQFKSSFLSNQTLSITEFGAGDGSFMSGLIKYFLKNSKNFLEGVSFVIIEPN 128 Query: 120 ERLTLIQKKQLASYGDKINWYT----SLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIR 175 E + QK +L + + + ++A+E D+LP+++ + + Sbjct: 129 EGMVEKQKNKLEEFLNLGIDILWKSLDEVEENNINGIVLAHEVLDALPVERVTFLKGKLI 188 Query: 176 ERMIDIDQH-DSLVFNIGD--HEIKSNF----------LTCSDYFLGAIFENSPCRDREM 222 + + ID+ + L F+ E+K +F + D G E + + Sbjct: 189 RQAVSIDKKSNKLFFDKMPITRELKKSFELAKSELGITIPPEDALEGWTTEWHVDNSKWL 248 Query: 223 QSISDRLACDGGTAIVIDY------GYLQSRVGDTLQAVKGHTYVSPLVN-PGQADLSSH 275 ++I ++ + G ++IDY Y T+ + + + +++ PG DL+SH Sbjct: 249 EAIYGKI--NNGILLIIDYAKEAKKYYNSKNSDGTIVSYENQKMKNNVLDSPGNCDLTSH 306 Query: 276 VDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTS 335 V + L + A +G+T QG+ L LG+ +R + + K+ + + Sbjct: 307 VCIETLINDAESLGFNTDGITKQGEALLALGLAERLYGIQKEFKENLSNALLRREALLRL 366 Query: 336 ADKKSMGELFKILVVSH 352 D +G+ FK V Sbjct: 367 VDPVCLGD-FKWFVFKK 382 >gi|302661209|ref|XP_003022274.1| hypothetical protein TRV_03596 [Trichophyton verrucosum HKI 0517] gi|291186213|gb|EFE41656.1| hypothetical protein TRV_03596 [Trichophyton verrucosum HKI 0517] Length = 457 Score = 194 bits (493), Expect = 2e-47, Method: Composition-based stats. Identities = 110/453 (24%), Positives = 167/453 (36%), Gaps = 119/453 (26%) Query: 25 FALCVADPEFGYYSTC-----NPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 C+ E GYY++ + FG GDFVT+PEISQ+FGE+L I+++ W G S Sbjct: 1 MRQCLTSDEGGYYTSRGTPGSDVFGKEGDFVTSPEISQMFGELLGIWIVTEWLSQGRRSS 60 Query: 80 -VRLVELGPGRGIMML--------------------DILRVICKLKPDFFSVLSIYMVET 118 V+L+E GPG+G +M D + + K SV +YM+E Sbjct: 61 GVQLMEFGPGKGTLMADILRVSLSLLNEHGILQLMGDTYQSVRNFKGFASSVEGVYMIEA 120 Query: 119 SERLTLIQKKQLASYGD--------------------KINWYTSLADVPLGFTFLVANEF 158 S L IQKK L L F++A+EF Sbjct: 121 SPTLREIQKKALCGDAPMEECDIGYKSTSIHLGVPVYWTEHIRILPQTEDKAPFIIAHEF 180 Query: 159 FDSLPIKQFVMTEHGIRE-----------RMIDIDQHDSLVFNI---------------- 191 FD+LPI F E R + + + + Sbjct: 181 FDALPIHAFQAVHSPPPETINTPTGPAELRQPSLPLNGTQWRELVVATNPEAEREPDGDD 240 Query: 192 ----GDHEIKSNFLTCS-------------------DYFLGAIFENSPCRDREMQSISDR 228 D +++ G+ E SP Q I+ Sbjct: 241 SSVKNDKKLEFRLALAKSPTPASLVMPEMSSRYKALKSTRGSTIEISPESHTYAQEIARL 300 Query: 229 L-------------ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSH 275 + G A+++DYG + ++L+ +K H VSP PG+ DLS+ Sbjct: 301 IGGPNPTDKNPSPTRTPAGAALILDYGPSSTIPVNSLRGIKNHQVVSPFATPGEVDLSAD 360 Query: 276 VDFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLM---KQTARKDILLDSVKR 330 VDF L+ A+ + + G QG FL LGI +RA L+ K ++ + S +R Sbjct: 361 VDFTGLAESALDASPGVEVYGPNEQGSFLRSLGIAERAAQLLRNVKDEEKRKQIESSWQR 420 Query: 331 LVSTSADKKSMGELFKILVVSHE---KVELMPF 360 LV MG ++K + + E K + F Sbjct: 421 LVERGG--GGMGRIYKAMAIVPESGGKRRPVGF 451 >gi|114776546|ref|ZP_01451591.1| hypothetical protein SPV1_02462 [Mariprofundus ferrooxydans PV-1] gi|114553376|gb|EAU55774.1| hypothetical protein SPV1_02462 [Mariprofundus ferrooxydans PV-1] Length = 381 Score = 194 bits (492), Expect = 2e-47, Method: Composition-based stats. Identities = 89/364 (24%), Positives = 145/364 (39%), Gaps = 23/364 (6%) Query: 4 KLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 +L + I I G + D + + +PE GYY + FG GDFVTAPE+ Sbjct: 6 QLEQIICAHISDAGGFLPFDAFMQAALYEPELGYYESKTVFGEAGDFVTAPELGPWLSLG 65 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 A + W+Q G P+ L+E G G G ++ ++ ++ ++ P + VE S L Sbjct: 66 FADLIFNCWQQLGEPAQWTLLEQGSGSGKLLASVINILSQMMPVMPERILS--VERSHGL 123 Query: 123 TLIQKKQLASYGDKINWYTSLA-DVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 Q+ G + + +L + +NE D+ P++ F + ER + Sbjct: 124 RQRQQVLFDESGVDVALFGTLDEIEIPENIIIYSNELPDAFPLRCFRYKQGQYYERGVIH 183 Query: 182 DQHDSLVFNIGDHEIKSNFLTC----SDYFLGAIFENSPCRDREMQSISDRLACDGGTAI 237 + ++ S + G I E +P QS++ + G + Sbjct: 184 RAEGGFEWADAPLPMQHAPDIPQSLSSRWQDGYISEFNPGLPVWQQSLARVVQR--GFIL 241 Query: 238 VIDYGYLQS------RVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKL 290 +DYGY Q R TL A H L PG D+++HV+F L + Sbjct: 242 TLDYGYSQQEYYRPGRAEGTLLAHLEHKAIDDVLSEPGSRDITAHVNFSALVQAGRAVGM 301 Query: 291 YINGLTTQGKFLEGLG-IWQRAFSLM-KQTARKDILLDSVKRLVSTSADKKSMGELFKIL 348 +QG +L + SL + A+ LL KRL+ +GE+FK+L Sbjct: 302 EPLLWMSQGGWLAQSPSVQAFIQSLATQSDAQSLHLLTHAKRLLM----PFGLGEVFKLL 357 Query: 349 VVSH 352 V S Sbjct: 358 VQSK 361 >gi|332813010|ref|XP_003309028.1| PREDICTED: protein midA homolog, mitochondrial-like [Pan troglodytes] Length = 414 Score = 194 bits (492), Expect = 2e-47, Method: Composition-based stats. Identities = 108/378 (28%), Positives = 167/378 (44%), Gaps = 60/378 (15%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P ++ Sbjct: 41 TPMLRHLMYKIKSTGPITVAEYMKEVLTNP---------------------------AKL 73 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV-LSIYMVETSER 121 L I+ I W G + +LVELGPGRG ++ DILRV +L + +S+++VE S++ Sbjct: 74 LGIWFISEWMATGKSTAFQLVELGPGRGTLVGDILRVFTQLGSVLKNCDISVHLVEVSQK 133 Query: 122 LTLIQ--------------------KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDS 161 L+ IQ K + G I+WY L DVP G++F +A+EFFD Sbjct: 134 LSEIQALTLTEEKVPLERNAGSPVYMKGVTKSGIPISWYRDLHDVPKGYSFYLAHEFFDV 193 Query: 162 LPIKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRD 219 LP+ +F T G RE +DID D L F + + D E P Sbjct: 194 LPVHKFQKTPQGWREVFVDIDPQVSDKLRFVLAPSATPAEAFIQHD-ETRDHVEVCPDAG 252 Query: 220 REMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 ++ +S R+A GG A+V DYG+ ++ DT + H L+ PG ADL++ VDF Sbjct: 253 VIIEELSQRIALTGGAALVADYGHDGTKT-DTFRGFCDHKLHDVLIAPGTADLTADVDFS 311 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR---KDILLDSVKRLVSTSA 336 L +A K+ G Q FL+ +GI R L+ ++ + LL L+ Sbjct: 312 YLRRMA-QGKVASLGPIKQHTFLKNMGIDVRLKVLLDKSNEPSVRQQLLQGYDMLM---- 366 Query: 337 DKKSMGELFKILVVSHEK 354 + K MGE F + + Sbjct: 367 NPKKMGERFNFFALLPHQ 384 >gi|123968271|ref|YP_001009129.1| hypothetical protein A9601_07361 [Prochlorococcus marinus str. AS9601] gi|123198381|gb|ABM70022.1| Uncharacterized conserved protein [Prochlorococcus marinus str. AS9601] Length = 396 Score = 193 bits (491), Expect = 3e-47, Method: Composition-based stats. Identities = 76/377 (20%), Positives = 149/377 (39%), Gaps = 35/377 (9%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEISQIFGEML 63 L++KI IK G ++ + + DP GYY + G GDFVT+P +S F ++ Sbjct: 12 LVKKI---IKMGGTISFYDFMNFALNDPINGYYGSGKAELGVRGDFVTSPSLSDDFAFLV 68 Query: 64 AI----FLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 +LI + + E G G G M +++ + +F +S ++E + Sbjct: 69 GKQIEDWLIQFKSSFLSNETLSVTEFGAGDGSFMSGLIKYFLENSKNFLEGISFVIIEPN 128 Query: 120 ERLTLIQKKQLASYGDKINWYT----SLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIR 175 E + QK +L + + + ++ANE D+LP+++ ++ + Sbjct: 129 EGMVEKQKNKLEEFLNLGIDILWKGLDEVEENNINGIVLANEVLDALPVERITFSKGKLI 188 Query: 176 ERMIDIDQHDSLVFNIGDHEIKSNF-------------LTCSDYFLGAIFENSPCRDREM 222 + + ID+ +F + + D G E + + Sbjct: 189 RQAVSIDKKSHKLFFDKMPITRELEKSFELAKSELGITIPPEDALEGWTTEWHVDNSKWL 248 Query: 223 QSISDRLACDGGTAIVIDY------GYLQSRVGDTLQAVKGHTYVSPLVN-PGQADLSSH 275 ++I ++ + G ++IDY Y T+ + + + +++ PG DL+SH Sbjct: 249 EAIYGKI--NNGILLIIDYAKEAKKYYNSKNSDGTIVSYENQKMRNNVLDSPGNCDLTSH 306 Query: 276 VDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTS 335 V + L + A G+T QG+ L LG+ +R + + K+ + + Sbjct: 307 VCIETLINDAETLGFDTVGITKQGEALLALGLAERLYGIQKEFKENLSNALLRREALLRL 366 Query: 336 ADKKSMGELFKILVVSH 352 D +G+ FK V Sbjct: 367 VDPVCLGD-FKWFVFKK 382 >gi|254373415|ref|ZP_04988903.1| conserved hypothetical protein [Francisella tularensis subsp. novicida GA99-3549] gi|151571141|gb|EDN36795.1| conserved hypothetical protein [Francisella novicida GA99-3549] Length = 378 Score = 193 bits (490), Expect = 4e-47, Method: Composition-based stats. Identities = 71/364 (19%), Positives = 139/364 (38%), Gaps = 23/364 (6%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGE 61 L I+ IK + + + + + P+ GYYS + GDF+TA + +F Sbjct: 2 SLKNIILERIKSSKQPLLFRDFMQMALYYPQLGYYSGAKEKISSQGDFITATSQTSLFAR 61 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 A Q G + ++E G G G D + + L +VE S Sbjct: 62 TFARQFATIISQLG--NDCSVIEFGAGNGKFAADCVDELESLA---ILPKRYIIVELSND 116 Query: 122 LTLIQKKQLASYGDKINW---YTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERM 178 L L Q++ + + + + ANE D++P+ F + + ++ Sbjct: 117 LRLRQQQYIKENLPHLYDRFIWLDKLPAEKINAIVFANELLDAMPVDIFRSENNKLIQQG 176 Query: 179 IDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFE--NSPCRDREMQSISDRLACD--GG 234 + ++ ++++ + + G F+ + + ++ L G Sbjct: 177 VIRKGDTFEFSDMPKNDVRFEYESTKILNDGITFDDGYTSEINTWIRPWVKSLREFLSQG 236 Query: 235 TAIVIDYGYLQS------RVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAIL 287 + DYGY +S R TL H P +N G+ D+++HVDF ++ AI Sbjct: 237 IVFLCDYGYHRSLYYSKDRYMGTLACYHQHQVNFEPFINIGEQDITAHVDFTTVAEAAIE 296 Query: 288 YKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKI 347 ++G TQ FL+ I + ++ ++ + +L S + + E+FK+ Sbjct: 297 EGFQLDGFMTQANFLKRANIAEVFSNISQRLSTNQLLKYSND--IKDLLLNDKLAEVFKV 354 Query: 348 LVVS 351 + S Sbjct: 355 MAFS 358 >gi|157413101|ref|YP_001483967.1| hypothetical protein P9215_07661 [Prochlorococcus marinus str. MIT 9215] gi|157387676|gb|ABV50381.1| Conserved hypothetical protein [Prochlorococcus marinus str. MIT 9215] Length = 396 Score = 193 bits (489), Expect = 5e-47, Method: Composition-based stats. Identities = 78/377 (20%), Positives = 152/377 (40%), Gaps = 35/377 (9%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEISQIFGEML 63 L++KI IK G ++ + + DP GYY + G GDFVT+P +S F ++ Sbjct: 12 LVKKI---IKMGGTISFYDFMNFALNDPINGYYGSGKAELGVRGDFVTSPSLSDDFAFLV 68 Query: 64 AI----FLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 +LI + + E G G G M +++ K +F +S ++E + Sbjct: 69 GKQIEDWLIQFKSSFLSNQTLSITEFGAGDGSFMSGLIKYFLKKSKNFLEGVSFVIIEPN 128 Query: 120 ERLTLIQKKQLASYGDKINWYT----SLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIR 175 E + QK +L + + + ++ANE D+LP+++ + + Sbjct: 129 EGMVEKQKNKLEEFLNLGIDILWKGLDEVEENNINGIVLANEVLDALPVERITFLKGKLI 188 Query: 176 ERMIDIDQH-DSLVFNIGDHEIKSNF------------LTCSDYFLGAIFENSPCRDREM 222 + + ID+ + L F+ ++ + D G E + + Sbjct: 189 RQAVSIDKKSNKLFFDKMPITLELEKSFELAKSELGITIPPEDALEGWTTEWHVDNSKWL 248 Query: 223 QSISDRLACDGGTAIVIDY------GYLQSRVGDTLQAVKGHTYVSPLVN-PGQADLSSH 275 ++I ++ + G ++IDY Y T+ + + + +++ PG DL+SH Sbjct: 249 EAIYGKI--NNGILLIIDYAKEAKKYYNSKNSDGTIVSYENQKMKNNVLDSPGNCDLTSH 306 Query: 276 VDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTS 335 V + L + A +G+T QG+ L LG+ +R + + K+ + + Sbjct: 307 VCIETLINDAESLGFNTDGITKQGEALLALGLAERLYGIQKEFKENLSNALLRREALLRL 366 Query: 336 ADKKSMGELFKILVVSH 352 D +G+ FK V Sbjct: 367 VDPVCLGD-FKWFVFKK 382 >gi|254702246|ref|ZP_05164074.1| hypothetical protein Bsuib55_15501 [Brucella suis bv. 5 str. 513] Length = 191 Score = 193 bits (489), Expect = 5e-47, Method: Composition-based stats. Identities = 78/187 (41%), Positives = 109/187 (58%), Gaps = 4/187 (2%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L ++ LI G ++V Y A C+ D E GYY+T PFG GDF+TAPE+SQ+FGE++ Sbjct: 5 SLKERLKRLIATTGPISVADYMAACLGDREAGYYTTREPFGREGDFITAPEVSQMFGELI 64 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 I+ + W+ P+ L E+GPGRG +M D+LR I +L P I MVETS RL Sbjct: 65 GIWCLSEWDALARPANFVLCEIGPGRGTLMSDMLRTIGRLAPQMLGGARIAMVETSPRLA 124 Query: 124 LIQKKQLASYGDKINWYTSLADVP----LGFTFLVANEFFDSLPIKQFVMTEHGIRERMI 179 QK++LA + W+ AD+P G LV NE FD++P +QFV + ERMI Sbjct: 125 EKQKQKLAGTKAHVEWFERFADIPADTVHGPLILVTNELFDAIPFRQFVKADGRFVERMI 184 Query: 180 DIDQHDS 186 +++ D Sbjct: 185 ALNEQDE 191 >gi|229019481|ref|ZP_04176302.1| hypothetical protein bcere0030_39870 [Bacillus cereus AH1273] gi|229025724|ref|ZP_04182128.1| hypothetical protein bcere0029_40170 [Bacillus cereus AH1272] gi|228735599|gb|EEL86190.1| hypothetical protein bcere0029_40170 [Bacillus cereus AH1272] gi|228741836|gb|EEL92015.1| hypothetical protein bcere0030_39870 [Bacillus cereus AH1273] Length = 370 Score = 193 bits (489), Expect = 6e-47, Method: Composition-based stats. Identities = 74/339 (21%), Positives = 126/339 (37%), Gaps = 17/339 (5%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + GYY G GDF T+ +S +F + A F I E Sbjct: 18 SISYSTYMNLVLYAEGHGYYMREREKIGRQGDFFTSSNVSSVFAKTFAKFFIRLVE--NG 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P+ FS L+ +VE S +Q++ L S+ + Sbjct: 76 EVSSNICEVGGGTGRFAYDVLQEWKQLSPETFSSLNYSIVEVSPFHRRLQQESLCSFDNV 135 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + + + +NE FD+ P++ + E I + L E Sbjct: 136 SYYTSYIEMGESFEGIIFSNELFDAFPVEVVEKRNGILYEVRITYTEEGDLAEVFRPVEK 195 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------R 247 + G FE + ++ I++ I +DYGY + Sbjct: 196 RIGRYLLKYNIHIAEGQRFEVPIAMEEYIEEIAEWFQRGIC--ITVDYGYTKEEWMHPAH 253 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL PG+ DL++HV + L I L ++ Q +FL G Sbjct: 254 HEGSLRGYYKHKLIRNPLAYPGEMDLTTHVHWDELKMIFELQGIHTIWHRKQSEFLLAAG 313 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELF 345 I + S + R V + +G F Sbjct: 314 ILEHLTSHQDTNPFSET--QKRNRAVRSMILNGGLGSAF 350 >gi|229013480|ref|ZP_04170617.1| hypothetical protein bmyco0001_38920 [Bacillus mycoides DSM 2048] gi|228747892|gb|EEL97758.1| hypothetical protein bmyco0001_38920 [Bacillus mycoides DSM 2048] Length = 370 Score = 192 bits (488), Expect = 6e-47, Method: Composition-based stats. Identities = 73/351 (20%), Positives = 134/351 (38%), Gaps = 17/351 (4%) Query: 14 KKNGQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWE 72 +K+ ++ Y L + GYY G GDF T+ +S +F + A F I E Sbjct: 14 EKDHSISYSTYMNLALYAEGHGYYMREREKIGRRGDFFTSSNVSSVFAKTFAKFFIRLVE 73 Query: 73 QHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLAS 132 + + E+G G G D+L+ +L P+ F L+ ++E S +Q+++L S Sbjct: 74 K--GEVSPNICEIGGGTGKFAYDVLQEWKQLSPETFINLNYSIIEVSPFHRRLQQERLCS 131 Query: 133 YGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIG 192 + + + + + +NE FDS P++ + E I +L Sbjct: 132 VDNVSYYTSYIEMGESFEGIIFSNELFDSFPVEIVEKRNGILYEVRITYTDEGNLTEIFR 191 Query: 193 DHEIKSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS--- 246 E + G FE + ++ I+ I +DYGY + Sbjct: 192 PIEKRIGRYLLKYNIHIAEGQRFEVPIAMEEYIEEITKWFQKGIC--ITVDYGYTKEEWM 249 Query: 247 ---RVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 +L+ H +PL +PG+ DL++H+ + L I L + Q +FL Sbjct: 250 HPAHQEGSLRGYYQHELIRNPLEHPGEMDLTTHIHWDELKEIFSLQGMGAVWHKKQSEFL 309 Query: 303 EGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHE 353 GI ++ S + R V + +G F +++ + + Sbjct: 310 LAAGILEQLTSHQDTNPFSET--QKRNRAVRSMILNGGLGSAFDVVIHTKD 358 >gi|229163205|ref|ZP_04291160.1| hypothetical protein bcere0009_39730 [Bacillus cereus R309803] gi|228620268|gb|EEK77139.1| hypothetical protein bcere0009_39730 [Bacillus cereus R309803] Length = 371 Score = 192 bits (488), Expect = 6e-47, Method: Composition-based stats. Identities = 70/347 (20%), Positives = 130/347 (37%), Gaps = 17/347 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + GYY G GDF T+ +S +F ++ A F I E Sbjct: 19 SISYSTYMNLVLYTEGHGYYMKERKKIGRQGDFFTSSNVSSVFAKIFAKFFIRLVE--NG 76 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P+ F ++ ++E S +Q++ L S+ + Sbjct: 77 EVAPNICEIGGGTGKFAYDVLQEWKQLSPETFIDVNYSIIEVSPFHRKLQQENLCSFSNV 136 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + L +NE FD+ P++ + E I L E Sbjct: 137 SYYTSHSEMGDSFEGILFSNELFDAFPVEVIEKRNGILYEVRITYTGEGKLTEVCRPLEK 196 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------R 247 + G FE + ++ I+ I +DYGY + Sbjct: 197 RIGQYLLKYNIHIAEGQRFEVPIAMEGYIKEIAKWFQKGIC--ITVDYGYTKEEWMHPAH 254 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL PG+ DL++H+ + L I + ++ Q +FL G Sbjct: 255 HEGSLRGYYEHKLIRNPLAYPGEMDLTTHIHWDELKEIFSMQGMHAVWHKKQSEFLLAAG 314 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHE 353 I ++ S + + R V + +G F +++ + + Sbjct: 315 ILEQLTSHQDRDPFSEAQKQ--NRAVRSMILNGGLGSAFDVVIHTKD 359 >gi|56420973|ref|YP_148291.1| hypothetical protein GK2438 [Geobacillus kaustophilus HTA426] gi|56380815|dbj|BAD76723.1| hypothetical conserved protein [Geobacillus kaustophilus HTA426] Length = 374 Score = 192 bits (488), Expect = 6e-47, Method: Composition-based stats. Identities = 76/375 (20%), Positives = 127/375 (33%), Gaps = 30/375 (8%) Query: 11 NLIKK--NGQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFL 67 I G+++ Y + + D FGYY G GDF T + +FG+ LA Sbjct: 3 EQIAAAPGGRVSYADYMQMVLYDERFGYYMRERAKIGKEGDFFTNSSFAPVFGKALASLW 62 Query: 68 ICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQK 127 + EQ G P + E G G G + L +L K P + LS +++ S Q+ Sbjct: 63 VRMVEQSGLPPA--VCEWGGGDGRLALSVLEEWKKKSPHTYDRLSYTIIDQSPFHRRRQR 120 Query: 128 KQLASYGDKINWYTS----LADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 + L +K+ Y LA+ + +NEFFD+ P+ V + E + Sbjct: 121 ETLQPAAEKVEQYDDVSRWLAERGPFSGIVFSNEFFDAFPVHVIVKEGGVLHECFVAARD 180 Query: 184 H---DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 + ++ D G E + + +D Sbjct: 181 GRLVEEKAPLCRPAIVRYLHERGLDLAEGQRLEVPLAMKAFWLEVGPLF--HQAVMVTVD 238 Query: 241 YGYLQS------RVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSIAILYKLYIN 293 YGY R +L+ H V PL +PG+ DL+SHV + L A Sbjct: 239 YGYTDEQLRAPARRHGSLRGYFRHQLVADPLRHPGEMDLTSHVQWDALRLYARQAGWEEV 298 Query: 294 GLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSH- 352 Q +FL G+ + + R++ + F +L++ Sbjct: 299 AFVRQDRFLLAAGLLH--EWNVSEGGDFFSPASRQNRMIRALIADDGVSRFFDVLILQKG 356 Query: 353 ------EKVELMPFV 361 + F+ Sbjct: 357 MSLPARDFWPAPEFL 371 >gi|194378712|dbj|BAG63521.1| unnamed protein product [Homo sapiens] Length = 414 Score = 192 bits (488), Expect = 7e-47, Method: Composition-based stats. Identities = 107/378 (28%), Positives = 166/378 (43%), Gaps = 60/378 (15%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P ++ Sbjct: 41 TPMLRHLMYKIKSTGPITVAEYMKEVLTNP---------------------------AKL 73 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV-LSIYMVETSER 121 L I+ I W G + +LVELGPGRG ++ DILRV +L + +S+++VE S++ Sbjct: 74 LGIWFISEWMATGKSTAFQLVELGPGRGTLVGDILRVFTQLGSVLKNCDISVHLVEVSQK 133 Query: 122 LTLIQ--------------------KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDS 161 L+ IQ K + G I+WY L DVP G++F +A+EFFD Sbjct: 134 LSEIQALTLTKEKVPLERNAGSPVYMKGVTKSGIPISWYRDLHDVPKGYSFYLAHEFFDV 193 Query: 162 LPIKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRD 219 LP+ + T G RE +DID D L F + + D E P Sbjct: 194 LPVHKIQKTPQGWREVFVDIDPQVSDKLRFVLAPSATPAEAFIQHD-ETRDHVEVCPDAG 252 Query: 220 REMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 ++ +S R+A GG A+V DYG+ ++ DT + H L+ PG ADL++ VDF Sbjct: 253 VIIEELSQRIALTGGAALVADYGHDGTKT-DTFRGFCDHKLHDVLIAPGTADLTADVDFS 311 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR---KDILLDSVKRLVSTSA 336 L +A K+ G Q FL+ +GI R L+ ++ + LL L+ Sbjct: 312 YLRRMA-QGKVASLGPIKQHTFLKNMGIDVRLKVLLDKSNEPSVRQQLLQGYDMLM---- 366 Query: 337 DKKSMGELFKILVVSHEK 354 + K MGE F + + Sbjct: 367 NPKKMGERFNFFALLPHQ 384 >gi|56708525|ref|YP_170421.1| hypothetical protein FTT_1486c [Francisella tularensis subsp. tularensis SCHU S4] gi|110670996|ref|YP_667553.1| hypothetical protein FTF1486c [Francisella tularensis subsp. tularensis FSC198] gi|224457692|ref|ZP_03666165.1| hypothetical protein FtultM_08684 [Francisella tularensis subsp. tularensis MA00-2987] gi|254371151|ref|ZP_04987153.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis FSC033] gi|254875375|ref|ZP_05248085.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis MA00-2987] gi|54113363|gb|AAV29315.1| NT02FT1949 [synthetic construct] gi|56605017|emb|CAG46119.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis SCHU S4] gi|110321329|emb|CAL09502.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis FSC198] gi|151569391|gb|EDN35045.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis FSC033] gi|254841374|gb|EET19810.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis MA00-2987] gi|282159741|gb|ADA79132.1| hypothetical protein NE061598_08305 [Francisella tularensis subsp. tularensis NE061598] Length = 378 Score = 192 bits (488), Expect = 7e-47, Method: Composition-based stats. Identities = 71/364 (19%), Positives = 140/364 (38%), Gaps = 23/364 (6%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGE 61 L I+ IK + + + + + P+ GYYS + GDF+TA + +F Sbjct: 2 SLKNIILERIKSSKQPLLFRDFMQMALYYPQLGYYSRAKEKISSQGDFITATSQTSLFAR 61 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 A + Q G + ++E G G G D + + L +VE S Sbjct: 62 TFARQFVTIISQLG--NDCSVIEFGAGNGKFAADCVDELESLA---ILPKRYIIVELSND 116 Query: 122 LTLIQKKQLASYGDKINW---YTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERM 178 L L Q++ + + + + ANE D++P+ F + + ++ Sbjct: 117 LRLRQQQYIKENLSHLYDRFIWLDKLPAEKIKAIVFANELLDAMPVDIFRSENNKLIQQG 176 Query: 179 IDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFE--NSPCRDREMQSISDRLAC--DGG 234 + ++ ++++ + + G F + + ++ L G Sbjct: 177 VIRKGDTFEFSDMPKNDVRFEYESTKILNDGITFNDGYTSEINTWIRPWVKSLREVLSQG 236 Query: 235 TAIVIDYGYLQS------RVGDTL-QAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAIL 287 + DYGY +S R TL + H P +N G+ D+++HVDF ++ AI Sbjct: 237 IVFLCDYGYHRSLYYSKDRYMGTLACYHQHHVNFEPFINIGEQDITAHVDFTTVAEAAIE 296 Query: 288 YKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKI 347 ++G TQ FL+ I + ++ ++ + +L S + + E+FK+ Sbjct: 297 EGFQLDGFMTQANFLKRANIAEVFSNISQRLSTNQLLKYSND--IKDLLLNDKLAEVFKV 354 Query: 348 LVVS 351 + S Sbjct: 355 MAFS 358 >gi|187931237|ref|YP_001891221.1| hypothetical protein FTM_0412 [Francisella tularensis subsp. mediasiatica FSC147] gi|187712146|gb|ACD30443.1| conserved hypothetical protein [Francisella tularensis subsp. mediasiatica FSC147] Length = 378 Score = 192 bits (487), Expect = 8e-47, Method: Composition-based stats. Identities = 71/364 (19%), Positives = 139/364 (38%), Gaps = 23/364 (6%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGE 61 L I+ IK + + + + + P+ GYYS + GDF+TA + +F Sbjct: 2 SLKNIILERIKSSKQPLLFRDFMQMALYYPQLGYYSRAKEKISSQGDFITATSQTSLFAR 61 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 A Q G + ++E G G G D + + L +VE S Sbjct: 62 TFARQFATIISQLG--NDCSVIEFGAGNGKFAADCVDELESLA---ILPKRYIIVELSND 116 Query: 122 LTLIQKKQLASYGDKINW---YTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERM 178 L L Q++ + + + + ANE D++P+ F + + ++ Sbjct: 117 LRLRQQQYIKENLSHLYDRFIWLDKLPAEKIKAIVFANELLDAMPVDIFRSENNKLIQQG 176 Query: 179 IDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFE--NSPCRDREMQSISDRLAC--DGG 234 + ++ ++++ + + G F + + ++ L G Sbjct: 177 VIRKGDTFEFSDMPKNDVRFEYESTKILNDGITFNDGYTSEINTWIRPWVKSLREVLSQG 236 Query: 235 TAIVIDYGYLQS------RVGDTL-QAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAIL 287 + DYGY +S R TL + H P +N G+ D+++HVDF ++ AI Sbjct: 237 IVFLCDYGYHRSLYYSKDRYMGTLACYHQHHVNFEPFINIGEQDITAHVDFTTVAEAAIE 296 Query: 288 YKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKI 347 ++G TQ FL+ I + ++ ++ + +L S + + E+FK+ Sbjct: 297 EGFQLDGFMTQANFLKRANIAEVFSNISQRLSTNQLLKYSND--IKDLLLNDKLAEVFKV 354 Query: 348 LVVS 351 + S Sbjct: 355 MAFS 358 >gi|228993005|ref|ZP_04152928.1| hypothetical protein bpmyx0001_37420 [Bacillus pseudomycoides DSM 12442] gi|228766653|gb|EEM15293.1| hypothetical protein bpmyx0001_37420 [Bacillus pseudomycoides DSM 12442] Length = 371 Score = 192 bits (487), Expect = 9e-47, Method: Composition-based stats. Identities = 78/351 (22%), Positives = 135/351 (38%), Gaps = 19/351 (5%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + D E+GYY G GDF T+ IS +F A F I + Sbjct: 18 SISYSTYMNLALYDEEYGYYMKEREKIGRNGDFFTSSNISSVFARTFARFFIRLVQNGEI 77 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 P + E+G G G D+L+ +L P ++ L ++E S +QK+QL S+ + Sbjct: 78 PP--NVCEIGGGTGRFAYDVLQEWKQLSPVTYAELRYSIIEVSPFHRRLQKRQLGSFQNI 135 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + + +NE FD+ P++ + E I + L + E Sbjct: 136 SQYKSYKELGESFTGIVFSNELFDAFPVEVIEKRAGILYEVRITYTEQRELTEVLRPLEQ 195 Query: 197 KSNFLTCS----DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYL------QS 246 + + + G FE + +Q I L G I +DYGY + Sbjct: 196 EVIRCYLRRHKIELYEGQRFEVPIAMETYLQEIIGWLKE--GLFITVDYGYTKAEWMHPA 253 Query: 247 RVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGL 305 +L+ H +PL PG+ D+++H+ + L I L+ T Q FL Sbjct: 254 HHEGSLRGYYDHRLIQNPLHYPGEMDITAHIHWDELKKIGEESSLHTVWHTKQRGFLLAA 313 Query: 306 GIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE 356 GI ++ S + R V + + + F + V+ + + Sbjct: 314 GILEQLVSHQDSDPFSEKQKQ--NRAVRSMILHGGISDAFDV-VMQKKGMP 361 >gi|296224078|ref|XP_002757897.1| PREDICTED: protein midA homolog, mitochondrial-like isoform 2 [Callithrix jacchus] Length = 422 Score = 192 bits (487), Expect = 1e-46, Method: Composition-based stats. Identities = 112/385 (29%), Positives = 168/385 (43%), Gaps = 67/385 (17%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P ++ Sbjct: 42 TPMLRHLMYKIKSTGPITVAEYMKEVLTNP---------------------------AKL 74 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV-LSIYMVETSER 121 L I+ I W G + +LVELGPGRG ++ DILRV +L + +S+++VE S++ Sbjct: 75 LGIWFISEWMATGKSTAFQLVELGPGRGTLVGDILRVFSQLGSVLKNCDISVHLVEVSQK 134 Query: 122 LTLIQ--------------------KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDS 161 L+ +Q K + G ++WY L DVP G +F +A+EFFD Sbjct: 135 LSEVQALTLTEEKVPLERNAESPVYMKGVTKSGIPVSWYRDLHDVPKGHSFYLAHEFFDV 194 Query: 162 LPIKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRD 219 LP+ +F T G RE IDID D L F + + D I E P Sbjct: 195 LPVHKFQKTPQGWREVFIDIDPQVSDKLRFVLAPCATPAEVFIQHDETRDHI-EVCPDAG 253 Query: 220 REMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 ++ +S R+A GG A+V DYG+ ++ DT + GH L+ PG ADL++ VDF Sbjct: 254 VIIEELSRRIALTGGAALVADYGHDGTKT-DTFRGFCGHKLHDVLIAPGTADLTADVDFS 312 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFS--------LMKQTAR--KDILLDSVK 329 L +A K+ G TQ FL+ +GI R L K + K LL Sbjct: 313 FLRRMA-QGKVASLGPITQHTFLKNMGIDVRLKVRIFFFPVLLDKSNEQSVKQQLLQGYD 371 Query: 330 RLVSTSADKKSMGELFKILVVSHEK 354 L+ + K MGE F + + Sbjct: 372 MLM----NPKKMGERFNFFALLPHQ 392 >gi|228999055|ref|ZP_04158637.1| hypothetical protein bmyco0003_36120 [Bacillus mycoides Rock3-17] gi|229006603|ref|ZP_04164238.1| hypothetical protein bmyco0002_35050 [Bacillus mycoides Rock1-4] gi|228754652|gb|EEM04062.1| hypothetical protein bmyco0002_35050 [Bacillus mycoides Rock1-4] gi|228760672|gb|EEM09636.1| hypothetical protein bmyco0003_36120 [Bacillus mycoides Rock3-17] Length = 371 Score = 191 bits (486), Expect = 1e-46, Method: Composition-based stats. Identities = 78/351 (22%), Positives = 135/351 (38%), Gaps = 19/351 (5%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + D E+GYY G GDF T+ IS +F A F I + Sbjct: 18 SISYSTYMNLALYDEEYGYYMKEREKIGRNGDFFTSSNISSVFARTFARFFIRLVQNGEI 77 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 P + E+G G G D+L+ +L P ++ L ++E S +QK+QL S+ + Sbjct: 78 PP--NVCEIGGGTGRFAYDVLQEWKQLSPVTYTELRYSIIEVSPFHRRLQKRQLGSFQNI 135 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + + +NE FD+ P++ + E I + L + E Sbjct: 136 SQYKSYKELGESFTGIVFSNELFDAFPVEVIEKRAGILYEVRITYTEQRELTEVLRPLEQ 195 Query: 197 KSNFLTCS----DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYL------QS 246 + + + G FE + +Q I L G I +DYGY + Sbjct: 196 EVIRCYLRRHKIELYEGQRFEVPIAMETYLQEIIGWLKE--GLFITVDYGYTKAEWMHPA 253 Query: 247 RVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGL 305 +L+ H +PL PG+ D+++H+ + L I L+ T Q FL Sbjct: 254 HHEGSLRGYYDHRLIQNPLNYPGEMDITAHIHWDELKKIGEESSLHTVWHTKQRGFLLAA 313 Query: 306 GIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE 356 GI ++ S + R V + + + F + V+ + + Sbjct: 314 GILEQLVSHQDSDPFSEKQKQ--NRAVRSMILHGGISDAFDV-VMQKKGMP 361 >gi|313672603|ref|YP_004050714.1| hypothetical protein Calni_0639 [Calditerrivibrio nitroreducens DSM 19672] gi|312939359|gb|ADR18551.1| protein of unknown function DUF185 [Calditerrivibrio nitroreducens DSM 19672] Length = 382 Score = 191 bits (485), Expect = 2e-46, Method: Composition-based stats. Identities = 74/355 (20%), Positives = 137/355 (38%), Gaps = 23/355 (6%) Query: 17 GQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 G++T ++ + + P GYY NPFG G F T+ S+ FG +A + Q Sbjct: 17 GKITFAEFMDMALYYPGLGYYQKENPFGVTGSFYTSVNASETFGFSIARSNLNIIRQFDL 76 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 L E+G G G++ DIL +P+F+ ++ ++E SE L QK+ L ++ K Sbjct: 77 --QPNLCEMGAGSGLLANDILNYYRDNEPEFYDIVKYTIIEKSEYLINNQKEVLKNHVGK 134 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFN--IGDH 194 + W S + +NE D+ P+ + + ++E + + Sbjct: 135 VEWV-SFDEFSNFEGVFFSNELVDAFPVHRIINISGDLKEIYVIYKDDKFQFYPDLFSTD 193 Query: 195 EIKSNFLTCS-DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG------YLQSR 247 +I I + + + ++++ +++ G + IDYG Y R Sbjct: 194 QISDYLNRLKIKLVDKQIADINLDATKWIRTLGEKIKK--GIVVTIDYGWPAEKLYAPFR 251 Query: 248 VGDTLQAVKGH-TYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 + T+ H G D+++ VDF L L + Q +L G Sbjct: 252 MDGTVTCYFKHKQNNDFFERIGDQDITAFVDFSGLMEYGKDVGLEVVNFLPQTLYLVQSG 311 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMPFV 361 I + A+ D+ ++K L+ + G F +L+ S F+ Sbjct: 312 ILDYIAN-----AKTDLQRAAIKSLIIP---EGGFGTNFNVLIQSKNLNVPDSFI 358 >gi|126696072|ref|YP_001090958.1| hypothetical protein P9301_07341 [Prochlorococcus marinus str. MIT 9301] gi|126543115|gb|ABO17357.1| Uncharacterized conserved protein [Prochlorococcus marinus str. MIT 9301] Length = 396 Score = 191 bits (485), Expect = 2e-46, Method: Composition-based stats. Identities = 76/377 (20%), Positives = 151/377 (40%), Gaps = 35/377 (9%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEISQIFGEML 63 L++KI IK G ++ + + DP GYY + G GDFVT+P +S F ++ Sbjct: 12 LVKKI---IKMGGTISFYDFMNFALNDPINGYYGSGKAELGVRGDFVTSPALSDDFAFLV 68 Query: 64 AI----FLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 +LI + ++E G G G M +++ + +F +S ++E + Sbjct: 69 GKQIEDWLIQFKNSFLSNQKLAVIEFGAGDGSFMSGLIKYFLENNKNFLEGVSFVIIEPN 128 Query: 120 ERLTLIQKKQLASYGDKINWYT----SLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIR 175 E + QK +L + + + ++ANE D+LP+++ + + Sbjct: 129 EGMVEKQKNKLEEFLNLGIDILWKGLDEVEENNINGIVLANEVLDALPVERITFAKGKLI 188 Query: 176 ERMIDIDQHDSLVFNIGDHEIKSNF-------------LTCSDYFLGAIFENSPCRDREM 222 + + ID+ +F + + +D G E + + Sbjct: 189 RQAVSIDKKSHKLFFDKMPITRELEKSFELAKSELGITIPPADALEGWTTEWHVDNSKWL 248 Query: 223 QSISDRLACDGGTAIVIDY------GYLQSRVGDTLQAVKGHTYVSPLVN-PGQADLSSH 275 ++I ++ + G ++IDY Y T+ + + + +++ PG DL+SH Sbjct: 249 EAIYGKI--NNGILLIIDYAKEAKKYYNSKNSDGTIVSYENQKMKNNVLDSPGNCDLTSH 306 Query: 276 VDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTS 335 V + L + A +G+T QG+ L LG+ +R + + K+ + + Sbjct: 307 VCIETLINDAENLGFNTDGITKQGEALLALGLAERLYGIQKEFKEDLSNALLRREALLRL 366 Query: 336 ADKKSMGELFKILVVSH 352 D +G+ FK V Sbjct: 367 VDPVCLGD-FKWFVFKK 382 >gi|148239539|ref|YP_001224926.1| hypothetical protein SynWH7803_1203 [Synechococcus sp. WH 7803] gi|147848078|emb|CAK23629.1| Conserved hypothetical protein [Synechococcus sp. WH 7803] Length = 410 Score = 191 bits (485), Expect = 2e-46, Method: Composition-based stats. Identities = 71/375 (18%), Positives = 142/375 (37%), Gaps = 30/375 (8%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIFGEML 63 L+ ++ G++ + + DP G Y + G GDF T+P + + F ++L Sbjct: 11 LLDRLRQ---SGGEIPFSMFMDWALHDPVHGAYGAGHLTVGPDGDFATSPSLGEDFADLL 67 Query: 64 AIFLICAWEQHGFPSCV---RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 L+ G +V++GPG G + ++ ++ + P L +VE + Sbjct: 68 VDQLVDWLGDLGERHPDDRLSVVDVGPGEGTLTAQLIPLLRRKAPALAERLDCVLVECNP 127 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTF--LVANEFFDSLPIKQFVMTEHGIRERM 178 + QK++L + +TSL D+ +VA+E D+LP+++ V+ + +M Sbjct: 128 GMESRQKQRLGASPAIPCRWTSLEDLRRNPLIGVVVAHELLDALPVERLVLRAGTLHRQM 187 Query: 179 IDIDQHDSLV-----FNIGDHEIKSNFL----------TCSDYFLGAIFENSPCRDREMQ 223 + + + + E+++ F + G E M+ Sbjct: 188 VRLRVEGASAQIHLAEGSFEGELRAQFQEDCARSGMVIPPAGAEDGWTTEWHASVSPWMR 247 Query: 224 SISDRLACDGG----TAIVIDYGYLQSRVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDF 278 + + A D Y + R TL A + L N G D+++H+ Sbjct: 248 DAAAAVRQGVLLVVDYAYEADRYYTRHRSDGTLLAYREQVATHDVLRNAGTQDITAHLCV 307 Query: 279 QRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADK 338 + + A G QG+ L LG+ +R +L A + + + D Sbjct: 308 EAVVEAAERNGWMHEGQRRQGEALLALGLAERFTALQSLPAEQLSEALQRRETLLRLVDP 367 Query: 339 KSMGELFKILVVSHE 353 +G+L + +V + Sbjct: 368 ACLGDL-RWMVFHRQ 381 >gi|89099203|ref|ZP_01172081.1| hypothetical protein B14911_07970 [Bacillus sp. NRRL B-14911] gi|89086049|gb|EAR65172.1| hypothetical protein B14911_07970 [Bacillus sp. NRRL B-14911] Length = 354 Score = 191 bits (484), Expect = 2e-46, Method: Composition-based stats. Identities = 74/361 (20%), Positives = 135/361 (37%), Gaps = 25/361 (6%) Query: 5 LIRKIVNLIKKN--GQMTVDQYFALCVADPEFGYYS-TCNPFGAVGDFVTAPEISQIFGE 61 ++ + +I ++ G++T +Q+ + + D GYY G GDF+T IS I+G Sbjct: 1 MLELLTEIISRSPSGRITAEQFMDIALYDENKGYYMVPKPKIGRNGDFITTSNISDIYGR 60 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 +A + ++ G PS R+ E+G G G + KL D ++ ++ E S Sbjct: 61 AVAKWYFRQTQERGLPS--RVCEIGAGDGRFASAFIDEWNKLSGDP---VTYFVKEESPY 115 Query: 122 LTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 +Q + + ++ SL ++ + +NE FD+LP++ + + E M+ + Sbjct: 116 HKGLQSSLIGA---RVQQVDSLEELKPFCGLIFSNELFDALPVRVVEKQQGRMMEVMVAM 172 Query: 182 DQHDSLVFNIG---DHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 + + + + E + SIS+ L G + Sbjct: 173 ESGELKEEAVPLGDPSVLSYIRENGLKLKENQRLEIPLRMIGLISSISEMLQK--GIVLT 230 Query: 239 IDYGYLQS------RVGDTLQAVKGHTYVSPL-VNPGQADLSSHVDFQRLSSIAILYKLY 291 +DYGY R +L+ H + +PG+ D++SHV F L Sbjct: 231 VDYGYTDEELMDPARKRGSLRGYSNHRLIDDFLRSPGKMDITSHVHFDSFIREGEKRDLK 290 Query: 292 INGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVS 351 G Q +FL GI + L R + M F +V + Sbjct: 291 FEGKYRQDEFLLSCGILEDL--LNHDDPDPFSAASRRNRAIRQLILPSGMSAAFHSIVQT 348 Query: 352 H 352 Sbjct: 349 K 349 >gi|261752817|ref|ZP_05996526.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Brucella suis bv. 5 str. 513] gi|261742570|gb|EEY30496.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Brucella suis bv. 5 str. 513] Length = 189 Score = 191 bits (484), Expect = 2e-46, Method: Composition-based stats. Identities = 77/185 (41%), Positives = 108/185 (58%), Gaps = 4/185 (2%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L ++ LI G ++V Y A C+ D E GYY+T PFG GDF+TAPE+SQ+FGE++ Sbjct: 5 SLKERLKRLIATTGPISVADYMAACLGDREAGYYTTREPFGREGDFITAPEVSQMFGELI 64 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 I+ + W+ P+ L E+GPGRG +M D+LR I +L P I MVETS RL Sbjct: 65 GIWCLSEWDALARPANFVLCEIGPGRGTLMSDMLRTIGRLAPQMLGGARIAMVETSPRLA 124 Query: 124 LIQKKQLASYGDKINWYTSLADVP----LGFTFLVANEFFDSLPIKQFVMTEHGIRERMI 179 QK++LA + W+ AD+P G LV NE FD++P +QFV + ERMI Sbjct: 125 EKQKQKLAGTKAHVEWFERFADIPADTVHGPLILVTNELFDAIPFRQFVKADGRFVERMI 184 Query: 180 DIDQH 184 +++ Sbjct: 185 ALNEQ 189 >gi|30264332|ref|NP_846709.1| hypothetical protein BA_4484 [Bacillus anthracis str. Ames] gi|47529778|ref|YP_021127.1| hypothetical protein GBAA_4484 [Bacillus anthracis str. 'Ames Ancestor'] gi|65321638|ref|ZP_00394597.1| COG1565: Uncharacterized conserved protein [Bacillus anthracis str. A2012] gi|254684019|ref|ZP_05147879.1| hypothetical protein BantC_09220 [Bacillus anthracis str. CNEVA-9066] gi|254721854|ref|ZP_05183643.1| hypothetical protein BantA1_05215 [Bacillus anthracis str. A1055] gi|254736368|ref|ZP_05194074.1| hypothetical protein BantWNA_14496 [Bacillus anthracis str. Western North America USA6153] gi|254741405|ref|ZP_05199092.1| hypothetical protein BantKB_10407 [Bacillus anthracis str. Kruger B] gi|254753959|ref|ZP_05205994.1| hypothetical protein BantV_15890 [Bacillus anthracis str. Vollum] gi|254757830|ref|ZP_05209857.1| hypothetical protein BantA9_05916 [Bacillus anthracis str. Australia 94] gi|30258977|gb|AAP28195.1| conserved hypothetical protein [Bacillus anthracis str. Ames] gi|47504926|gb|AAT33602.1| conserved hypothetical protein [Bacillus anthracis str. 'Ames Ancestor'] Length = 368 Score = 191 bits (484), Expect = 2e-46, Method: Composition-based stats. Identities = 71/351 (20%), Positives = 129/351 (36%), Gaps = 17/351 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + E GYY G GDF T+ +S F + A F + E Sbjct: 16 SISYSTYMKLALYAEEHGYYMKEREKIGRQGDFFTSSNVSSAFAKTFAKFFVRLVE--NG 73 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P+ F L+ M+E S +Q+K L S+ + Sbjct: 74 EVAPNICEIGGGTGRFAYDVLQEWKQLSPETFINLNYSMIEMSPFHRKLQQKSLCSFSNV 133 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + L +NE FD+ P++ + E + + L Sbjct: 134 SYYTSHSEMGESFEGILFSNELFDAFPVEVIEKRNGILYEVRVTYTEEGKLAEVCRPLHK 193 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------R 247 + G FE + ++ I+ I +DYGY + Sbjct: 194 RIGRYLLKYNIHLAEGQRFEVPIAMEEFIKEIAKWFQRGVC--ITVDYGYTKEEWMHPAH 251 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL +PG+ DL++H+ + L + L + Q +FL G Sbjct: 252 QEGSLRGYYEHKLIRNPLAHPGEMDLTTHIHWDELKEMFSLQGMNTVWHKKQSEFLLAAG 311 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVEL 357 I + + + + R V + +G F +++ + +L Sbjct: 312 ILDQLTNHQDRNPFSETQKQ--NRAVRSMILNGGLGNAFDVVIHTKHMQQL 360 >gi|228916894|ref|ZP_04080456.1| hypothetical protein bthur0012_41070 [Bacillus thuringiensis serovar pulsiensis BGSC 4CC1] gi|228842718|gb|EEM87804.1| hypothetical protein bthur0012_41070 [Bacillus thuringiensis serovar pulsiensis BGSC 4CC1] Length = 370 Score = 191 bits (484), Expect = 2e-46, Method: Composition-based stats. Identities = 71/351 (20%), Positives = 129/351 (36%), Gaps = 17/351 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + E GYY G GDF T+ +S F + A F + E Sbjct: 18 SISYSTYMKLALYAEEHGYYMKEREKIGRQGDFFTSSNVSSAFAKTFAKFFVRLVE--NG 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P+ F L+ M+E S +Q+K L S+ + Sbjct: 76 EVAPNICEIGGGTGRFAYDVLQEWKQLSPETFINLNYSMIEMSPFHRKLQQKSLCSFSNV 135 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + L +NE FD+ P++ + E + + L Sbjct: 136 SYYTSHSEMGESFEGILFSNELFDAFPVEVIEKRNGILYEVRVTYTEEGKLAEVCRPLHK 195 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------R 247 + G FE + ++ I+ I +DYGY + Sbjct: 196 RIVRYLLKYNIHLAEGQRFEVPIAMEEFIKEIAKWFQRGVC--ITVDYGYTKEEWMHPAH 253 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL +PG+ DL++H+ + L + L + Q +FL G Sbjct: 254 QEGSLRGYYEHKLIRNPLAHPGEMDLTTHIHWDELKEMFSLQGMNTVWHKKQSEFLLAAG 313 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVEL 357 I + + + + R V + +G F +++ + +L Sbjct: 314 ILDQLTNHQDRNPFSETQKQ--NRAVRSMILNGGLGNAFDVVIHTKHMQQL 362 >gi|301055753|ref|YP_003793964.1| hypothetical protein BACI_c42290 [Bacillus anthracis CI] gi|300377922|gb|ADK06826.1| conserved hypothetical protein [Bacillus cereus biovar anthracis str. CI] Length = 370 Score = 191 bits (484), Expect = 2e-46, Method: Composition-based stats. Identities = 71/351 (20%), Positives = 129/351 (36%), Gaps = 17/351 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + E GYY G GDF T+ +S F + A F + E Sbjct: 18 SISYSTYMKLALYAEEHGYYMKEREKIGRQGDFFTSSNVSSAFAKTFAKFFVRLVE--NG 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P+ F L+ M+E S +Q+K L S+ + Sbjct: 76 EVAPNICEIGGGTGRFAYDVLQEWKQLSPETFINLNYSMIEMSPFHRKLQQKSLCSFSNV 135 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + L +NE FD+ P++ + E + + L Sbjct: 136 SYYMSHSEMGESFEGILFSNELFDAFPVEVIEKRNGILYEVRVTYTEEGKLAEVCRPLHK 195 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------R 247 + G FE + ++ I+ I +DYGY + Sbjct: 196 RIGRYLLKYNIHLAEGQRFEVPIAMEEFIKEIAKWFQRGVC--ITVDYGYTKEEWMHPAH 253 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL +PG+ DL++H+ + L + L + Q +FL G Sbjct: 254 QEGSLRGYYEHKLIRNPLAHPGEMDLTTHIHWDELKEMFSLQGMNTVWHKKQSEFLLAAG 313 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVEL 357 I + + + + R V + +G F +++ + +L Sbjct: 314 ILDQLTNHQDRNPFSETQKQ--NRAVRSMILNGGLGSAFDVVIHTKHMQQL 362 >gi|49187159|ref|YP_030411.1| hypothetical protein BAS4162 [Bacillus anthracis str. Sterne] gi|165872050|ref|ZP_02216690.1| conserved hypothetical protein [Bacillus anthracis str. A0488] gi|167634528|ref|ZP_02392848.1| conserved hypothetical protein [Bacillus anthracis str. A0442] gi|167638577|ref|ZP_02396853.1| conserved hypothetical protein [Bacillus anthracis str. A0193] gi|170687499|ref|ZP_02878716.1| conserved hypothetical protein [Bacillus anthracis str. A0465] gi|170707414|ref|ZP_02897868.1| conserved hypothetical protein [Bacillus anthracis str. A0389] gi|177655068|ref|ZP_02936734.1| conserved hypothetical protein [Bacillus anthracis str. A0174] gi|190566222|ref|ZP_03019141.1| conserved hypothetical protein [Bacillus anthracis Tsiankovskii-I] gi|227817035|ref|YP_002817044.1| hypothetical protein BAMEG_4520 [Bacillus anthracis str. CDC 684] gi|228929305|ref|ZP_04092330.1| hypothetical protein bthur0010_39920 [Bacillus thuringiensis serovar pondicheriensis BGSC 4BA1] gi|228935581|ref|ZP_04098397.1| hypothetical protein bthur0009_40270 [Bacillus thuringiensis serovar andalousiensis BGSC 4AW1] gi|229603284|ref|YP_002868550.1| hypothetical protein BAA_4503 [Bacillus anthracis str. A0248] gi|49181086|gb|AAT56462.1| conserved hypothetical protein [Bacillus anthracis str. Sterne] gi|164712181|gb|EDR17718.1| conserved hypothetical protein [Bacillus anthracis str. A0488] gi|167513425|gb|EDR88795.1| conserved hypothetical protein [Bacillus anthracis str. A0193] gi|167529980|gb|EDR92715.1| conserved hypothetical protein [Bacillus anthracis str. A0442] gi|170127658|gb|EDS96531.1| conserved hypothetical protein [Bacillus anthracis str. A0389] gi|170668694|gb|EDT19440.1| conserved hypothetical protein [Bacillus anthracis str. A0465] gi|172080329|gb|EDT65418.1| conserved hypothetical protein [Bacillus anthracis str. A0174] gi|190563141|gb|EDV17107.1| conserved hypothetical protein [Bacillus anthracis Tsiankovskii-I] gi|227006278|gb|ACP16021.1| conserved hypothetical protein [Bacillus anthracis str. CDC 684] gi|228824119|gb|EEM69935.1| hypothetical protein bthur0009_40270 [Bacillus thuringiensis serovar andalousiensis BGSC 4AW1] gi|228830319|gb|EEM75931.1| hypothetical protein bthur0010_39920 [Bacillus thuringiensis serovar pondicheriensis BGSC 4BA1] gi|229267692|gb|ACQ49329.1| conserved hypothetical protein [Bacillus anthracis str. A0248] Length = 370 Score = 191 bits (484), Expect = 2e-46, Method: Composition-based stats. Identities = 71/351 (20%), Positives = 129/351 (36%), Gaps = 17/351 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + E GYY G GDF T+ +S F + A F + E Sbjct: 18 SISYSTYMKLALYAEEHGYYMKEREKIGRQGDFFTSSNVSSAFAKTFAKFFVRLVE--NG 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P+ F L+ M+E S +Q+K L S+ + Sbjct: 76 EVAPNICEIGGGTGRFAYDVLQEWKQLSPETFINLNYSMIEMSPFHRKLQQKSLCSFSNV 135 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + L +NE FD+ P++ + E + + L Sbjct: 136 SYYTSHSEMGESFEGILFSNELFDAFPVEVIEKRNGILYEVRVTYTEEGKLAEVCRPLHK 195 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------R 247 + G FE + ++ I+ I +DYGY + Sbjct: 196 RIGRYLLKYNIHLAEGQRFEVPIAMEEFIKEIAKWFQRGVC--ITVDYGYTKEEWMHPAH 253 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL +PG+ DL++H+ + L + L + Q +FL G Sbjct: 254 QEGSLRGYYEHKLIRNPLAHPGEMDLTTHIHWDELKEMFSLQGMNTVWHKKQSEFLLAAG 313 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVEL 357 I + + + + R V + +G F +++ + +L Sbjct: 314 ILDQLTNHQDRNPFSETQKQ--NRAVRSMILNGGLGNAFDVVIHTKHMQQL 362 >gi|196034994|ref|ZP_03102401.1| conserved hypothetical protein [Bacillus cereus W] gi|228947974|ref|ZP_04110260.1| hypothetical protein bthur0007_41010 [Bacillus thuringiensis serovar monterrey BGSC 4AJ1] gi|229123799|ref|ZP_04252993.1| hypothetical protein bcere0016_40860 [Bacillus cereus 95/8201] gi|195992533|gb|EDX56494.1| conserved hypothetical protein [Bacillus cereus W] gi|228659620|gb|EEL15266.1| hypothetical protein bcere0016_40860 [Bacillus cereus 95/8201] gi|228811664|gb|EEM57999.1| hypothetical protein bthur0007_41010 [Bacillus thuringiensis serovar monterrey BGSC 4AJ1] Length = 370 Score = 190 bits (483), Expect = 2e-46, Method: Composition-based stats. Identities = 71/351 (20%), Positives = 129/351 (36%), Gaps = 17/351 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + E GYY G GDF T+ +S F + A F + E Sbjct: 18 SISYSTYMKLALYAEEHGYYMKEREKIGRQGDFFTSSNVSSAFAKTFAKFFVRLVE--NG 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P+ F L+ M+E S +Q+K L S+ + Sbjct: 76 EVAPNICEIGGGTGRFAYDVLQEWKQLSPETFINLNYSMIEMSPFHRKLQQKSLCSFSNV 135 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + L +NE FD+ P++ + E + + L Sbjct: 136 SYYTSHSEMGESFEGILFSNELFDAFPVEVIEKRNGILYEVRVTYTEEGKLAEVCRPLHK 195 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------R 247 + G FE + ++ I+ I +DYGY + Sbjct: 196 RIGRYLLKYNIHLAEGQRFEVPIAMEEFIKEIAKWFQRGVC--ITVDYGYTKEEWMHPAH 253 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL +PG+ DL++H+ + L + L + Q +FL G Sbjct: 254 QEGSLRGYYEHKLIRNPLAHPGEMDLTTHIHWDELKEMFSLQGMNTVWHKKQSEFLLAAG 313 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVEL 357 I + + + + R V + +G F +++ + +L Sbjct: 314 ILDQLTNHQDRNPFSETQKQ--NRAVRSMILNGGLGNAFDVVIHTKHMQQL 362 >gi|49478596|ref|YP_038320.1| hypothetical protein BT9727_4001 [Bacillus thuringiensis serovar konkukian str. 97-27] gi|49330152|gb|AAT60798.1| conserved hypothetical protein [Bacillus thuringiensis serovar konkukian str. 97-27] Length = 370 Score = 190 bits (483), Expect = 3e-46, Method: Composition-based stats. Identities = 71/351 (20%), Positives = 128/351 (36%), Gaps = 17/351 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + E GYY G GDF T+ +S F + A F + E Sbjct: 18 SISYSTYMKLALYAEEHGYYMKEREKIGRQGDFFTSSNVSSAFAKTFAKFFVRLVE--NG 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P+ F L+ M+E S +Q+K L S+ + Sbjct: 76 EVAPNICEIGGGTGRFAYDVLQEWKQLSPETFINLNYSMIEMSPFHRKLQQKSLCSFSNV 135 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + L +NE FD+ P++ + E + + L Sbjct: 136 SYYTSHSEMGESFEGILFSNELFDAFPVEVIEKRNGMLYEVRVTYTEEGKLAEVCRPLHK 195 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------R 247 + G FE + ++ I+ I +DYGY + Sbjct: 196 RIGRYLLKYNIHLAEGQRFEVPIAMEEFIKEIAKWFQRGVC--ITVDYGYTKEEWMHPAH 253 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL +PG+ DL++H+ + L + L + Q +FL G Sbjct: 254 QEGSLRGYYEHKLIRNPLAHPGEMDLTTHIHWDELKEMFSLQGMNTVWHKKQSEFLLAAG 313 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVEL 357 I + + + R V + +G F +++ + +L Sbjct: 314 ILDQLTDHQDRNPFSETQKQ--NRAVRSMILNGGLGSAFDVVIHTKHMQQL 362 >gi|134301831|ref|YP_001121799.1| hypothetical protein FTW_0807 [Francisella tularensis subsp. tularensis WY96-3418] gi|134049608|gb|ABO46679.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis WY96-3418] Length = 378 Score = 190 bits (482), Expect = 3e-46, Method: Composition-based stats. Identities = 71/364 (19%), Positives = 139/364 (38%), Gaps = 23/364 (6%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGE 61 L I+ IK + + + + + P+ GYYS + GDF+TA + +F Sbjct: 2 SLKNIILERIKSSKQPLLFRDFMQMALYYPQLGYYSRAKEKISSQGDFITATSQTSLFAR 61 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 A Q G + ++E G G G D + + L +VE S Sbjct: 62 TFARQFATIISQLG--NDCSVIEFGAGNGKFAADCVDELESLA---ILPKRYIIVELSND 116 Query: 122 LTLIQKKQLASYGDKINW---YTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERM 178 L L Q++ + + + + ANE D++P+ F + + ++ Sbjct: 117 LRLRQQQYIKENLSHLYDRFIWLDKLPAEKIKAIVFANELLDAMPVDIFRSENNKLIQQG 176 Query: 179 IDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFE--NSPCRDREMQSISDRLAC--DGG 234 + ++ ++++ + + G F + + ++ L G Sbjct: 177 VIRKGDTFEFSDMPKNDVRFEYESTKILNDGITFNDGYTSEINTWIRPWVKSLREVLSQG 236 Query: 235 TAIVIDYGYLQS------RVGDTL-QAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAIL 287 + DYGY +S R TL + H P +N G+ D+++HVDF ++ AI Sbjct: 237 IVFLCDYGYHRSLYYSKDRYMGTLACYHQHHVNFEPFINIGEQDITAHVDFTTVAEAAIE 296 Query: 288 YKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKI 347 ++G TQ FL+ I + ++ ++ + +L S + + E+FK+ Sbjct: 297 EGFQLDGFMTQANFLKRANIAEVFSNISQRLSTNQLLKYSND--IKDLLLNYKLAEVFKV 354 Query: 348 LVVS 351 + S Sbjct: 355 MAFS 358 >gi|208779524|ref|ZP_03246869.1| conserved hypothetical protein [Francisella novicida FTG] gi|208744485|gb|EDZ90784.1| conserved hypothetical protein [Francisella novicida FTG] Length = 395 Score = 190 bits (482), Expect = 3e-46, Method: Composition-based stats. Identities = 71/364 (19%), Positives = 139/364 (38%), Gaps = 23/364 (6%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGE 61 L I+ IK + + + + + P+ GYYS + GDF+TA + +F Sbjct: 19 SLKNIILERIKSSKQPLLFRDFMQMALYYPQLGYYSGAKEKISSQGDFITATSQTSLFAR 78 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 A Q G + ++E G G G D + + L +VE S Sbjct: 79 TFARQFATIISQLG--NDCSVIEFGAGNGKFAADCVDELESLA---ILPKRYIIVELSND 133 Query: 122 LTLIQKKQLASYGDKINW---YTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERM 178 L L Q++ + + + + ANE D++P+ F + + ++ Sbjct: 134 LRLRQQQYIKENLPHLYDRFIWLDKLPAEKIKAIVFANELLDAMPVDIFRSENNKLIQQG 193 Query: 179 IDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFE--NSPCRDREMQSISDRLAC--DGG 234 + ++ ++++ + + G F+ + + ++ L G Sbjct: 194 VIRKGDTFEFSDMPKNDVRFEYESTKILNDGITFDDGYTSEINTWIRPWVKSLREVLSQG 253 Query: 235 TAIVIDYGYLQS------RVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAIL 287 + DYGY +S R TL H P +N G+ D+++HVDF ++ AI Sbjct: 254 IVFLCDYGYHRSLYYSKDRYMGTLACYHQHQVNFEPFINIGEQDITAHVDFTTVAEAAIE 313 Query: 288 YKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKI 347 ++G TQ FL+ I + ++ ++ + +L S + + E+FK+ Sbjct: 314 EGFQLDGFMTQANFLKRANIAEVFSNISQRLSTNQLLKYSND--MKDLLLNDKLAEVFKV 371 Query: 348 LVVS 351 + S Sbjct: 372 MAFS 375 >gi|218905396|ref|YP_002453230.1| hypothetical protein BCAH820_4280 [Bacillus cereus AH820] gi|218536263|gb|ACK88661.1| conserved hypothetical protein [Bacillus cereus AH820] Length = 370 Score = 190 bits (482), Expect = 3e-46, Method: Composition-based stats. Identities = 71/351 (20%), Positives = 130/351 (37%), Gaps = 17/351 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + E GYY G GDF T+ +S F + A F + E Sbjct: 18 SISYSTYMKLALYAEEHGYYMKEREKIGRQGDFFTSSNVSSAFAKTFAKFFVRLVE--NG 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + + E+G G G D+L+ +L P+ F L+ M+E S +Q+K L S+ + Sbjct: 76 EVALNICEIGGGTGRFAYDVLQEWKQLSPETFINLNYSMIEMSPFHRKLQQKSLCSFSNV 135 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + L +NE FD+ P++ + E + + L Sbjct: 136 SYYTSHSEMGESFEGILFSNELFDAFPVEVIEKRNGILYEVRVTYTEEGKLAEVCRPLHK 195 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------R 247 + G FE + ++ I+ I +DYGY + Sbjct: 196 RIGRYLLKYNIHLAEGQRFEVPIAMEEFIKEIAKWFQRGVC--ITVDYGYTKEEWMHPAH 253 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL +PG+ DL++H+ + L + L + Q +FL G Sbjct: 254 QEGSLRGYYEHKLIRNPLAHPGEMDLTTHIHWDELKEMFSLQGMNTVWHKKQSEFLLAAG 313 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVEL 357 I + + + + R V + +G F +++ + +L Sbjct: 314 ILDQLTNHQDRNPFSETQKQ--NRAVRSMILNGGLGNAFDVVIHTKHMQQL 362 >gi|115314234|ref|YP_762957.1| hypothetical protein FTH_0309 [Francisella tularensis subsp. holarctica OSU18] gi|156501695|ref|YP_001427760.1| hypothetical protein FTA_0327 [Francisella tularensis subsp. holarctica FTNF002-00] gi|167009718|ref|ZP_02274649.1| hypothetical protein Ftulh_03099 [Francisella tularensis subsp. holarctica FSC200] gi|254367112|ref|ZP_04983146.1| hypothetical protein FTHG_00298 [Francisella tularensis subsp. holarctica 257] gi|254369016|ref|ZP_04985029.1| conserved hypothetical protein [Francisella tularensis subsp. holarctica FSC022] gi|290954031|ref|ZP_06558652.1| hypothetical protein FtulhU_07078 [Francisella tularensis subsp. holarctica URFT1] gi|295312589|ref|ZP_06803344.1| hypothetical protein FtulhU_07070 [Francisella tularensis subsp. holarctica URFT1] gi|115129133|gb|ABI82320.1| conserved hypothetical protein [Francisella tularensis subsp. holarctica OSU18] gi|134252936|gb|EBA52030.1| hypothetical protein FTHG_00298 [Francisella tularensis subsp. holarctica 257] gi|156252298|gb|ABU60804.1| conserved hypothetical protein [Francisella tularensis subsp. holarctica FTNF002-00] gi|157121937|gb|EDO66107.1| conserved hypothetical protein [Francisella tularensis subsp. holarctica FSC022] Length = 378 Score = 190 bits (482), Expect = 3e-46, Method: Composition-based stats. Identities = 71/364 (19%), Positives = 138/364 (37%), Gaps = 23/364 (6%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGE 61 L I+ IK + + + + + P+ GYYS + GDF+TA + +F Sbjct: 2 SLKNIILERIKSSKQPLLFRDFMQMALYYPQLGYYSRAKEKISSQGDFITATSQTSLFAR 61 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 A Q G + ++E G G G D + + L +VE S Sbjct: 62 TFARQFATIISQLG--NDCSVIEFGAGNGKFAADCVDELESLA---ILPKRYIIVELSND 116 Query: 122 LTLIQKKQLASYGDKINW---YTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERM 178 L L Q++ + + + + ANE D++P+ F + + ++ Sbjct: 117 LRLRQQQYIKENLSHLYDRFIWLDKLPAEKIKAIVFANELLDAMPVDIFRSENNKLIQQG 176 Query: 179 IDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFE--NSPCRDREMQSISDRLAC--DGG 234 + ++ ++++ + + G F + + ++ L G Sbjct: 177 VIRKGDTFEFSDMPKNDVRFEYESTKILNDGITFNDGYTSEINTWIRPWVKSLREVLSQG 236 Query: 235 TAIVIDYGYLQS------RVGDTL-QAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAIL 287 + DYGY +S R TL + H P +N G+ D+++HVDF + AI Sbjct: 237 IVFLCDYGYHRSLYYSKDRYMGTLACYHQHHVNFEPFINIGEQDITAHVDFTTVVEAAIE 296 Query: 288 YKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKI 347 ++G TQ FL+ I + ++ ++ + +L S + + E+FK+ Sbjct: 297 EGFQLDGFMTQANFLKRANIAEVFSNISQRLSTNQLLKYSND--IKDLLLNDKLAEVFKV 354 Query: 348 LVVS 351 + S Sbjct: 355 MAFS 358 >gi|47567821|ref|ZP_00238529.1| hypothetical protein cytosolic protein [Bacillus cereus G9241] gi|47555498|gb|EAL13841.1| hypothetical protein cytosolic protein [Bacillus cereus G9241] Length = 368 Score = 189 bits (481), Expect = 4e-46, Method: Composition-based stats. Identities = 69/351 (19%), Positives = 129/351 (36%), Gaps = 17/351 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + GYY G GDF T+ +S +F + A F I E Sbjct: 16 SISYSTYMNLVLYAEGHGYYMKEREKIGRQGDFFTSSNVSSVFAKTFAKFFIRLVE--NG 73 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P+ F L+ M+E S +Q++ L S+ + Sbjct: 74 EVAPNICEIGGGTGRFAYDVLQEWKQLSPETFIDLNYSMIEVSPFHRKLQQENLCSFSNV 133 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + L +NE FD+ P++ + E + + L Sbjct: 134 SYYRSHNEMGESFEGILFSNELFDAFPVEVIEKRNGILYEVRVTYTEEGKLAEVCRPLHK 193 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------R 247 G FE + ++ I+ I +DYGY + Sbjct: 194 TIGRYLLKYNIHLAEGQRFEVPIAMEEYIKEIAKWFQRGVC--ITVDYGYTKEEWMHPAH 251 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL +PG+ DL++H+ + + + L + Q +FL G Sbjct: 252 QEGSLRGYYEHKLIRNPLAHPGEMDLTAHIHWDEMKEMFSLQGMNTVWHKKQYEFLLAAG 311 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVEL 357 I ++ + + + R V + +G F +++ + +L Sbjct: 312 ILEQLTNHQDRNPFSETQKQ--NRAVRSMILSGGLGSAFDVVIHTKHMQQL 360 >gi|196041555|ref|ZP_03108847.1| conserved hypothetical protein [Bacillus cereus NVH0597-99] gi|196027543|gb|EDX66158.1| conserved hypothetical protein [Bacillus cereus NVH0597-99] Length = 370 Score = 189 bits (481), Expect = 4e-46, Method: Composition-based stats. Identities = 71/351 (20%), Positives = 129/351 (36%), Gaps = 17/351 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + E GYY G GDF T+ +S F + A F + E Sbjct: 18 SISYSTYMKLVLYAEEHGYYMKEREKIGRQGDFFTSSNVSSAFAKTFAKFFVRLVE--NG 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P+ F L+ M+E S +Q+K L S+ + Sbjct: 76 EVAPNICEIGGGTGRFAYDVLQEWKQLSPETFINLNYSMIEMSPFHRKLQQKSLCSFSNV 135 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + L +NE FD+ P++ + E + + L Sbjct: 136 SYYTSHSEMGESFEGILFSNELFDAFPVEVIEKRNGILYEVRVTYTEEGKLAEVCRPLHK 195 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------R 247 + G FE + ++ I+ I +DYGY + Sbjct: 196 RIGRYLLKYNIHLAEGQRFEVPIAMEEFIKEIAKWFQRGVC--ITVDYGYTKEEWMHPAH 253 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL +PG+ DL++H+ + L + L + Q +FL G Sbjct: 254 QEGSLRGYYEHKLIRNPLAHPGEMDLTTHIHWDELKEMFSLQGMNTVWHKKQSEFLLAAG 313 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVEL 357 I + + + + R V + +G F +++ + +L Sbjct: 314 ILDQLTNHQDRNPFSETQKQ--NRAVRSMILNGGLGNAFDVVIHTKHMQQL 362 >gi|332678783|gb|AEE87912.1| conserved hypothetical protein [Francisella cf. novicida Fx1] Length = 395 Score = 189 bits (481), Expect = 4e-46, Method: Composition-based stats. Identities = 71/364 (19%), Positives = 140/364 (38%), Gaps = 23/364 (6%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGE 61 L I+ IK + + + + + P+ GYYS + GDF+TA + +F Sbjct: 19 SLKNIILERIKSSKQPLLFRDFMQMVLYYPQLGYYSGAKEKISSQGDFITATSQTSLFAR 78 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 A Q G + ++E G G G D + + L +VE S Sbjct: 79 TFARQFATIISQLG--NDCSVIEFGAGNGKFAADCVDELESLA---ILPKRYIIVELSND 133 Query: 122 LTLIQKKQLASYGDKINW---YTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERM 178 L L Q++ + + + + ANE D++P+ F + + ++ Sbjct: 134 LRLRQQQYIKENVPHLYDRFIWLDKLPAEKIKAIVFANELLDAMPVDIFRSENNKLIQQG 193 Query: 179 IDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFE--NSPCRDREMQSISDRLACD--GG 234 + ++ ++++ + + G F+ + + ++ L G Sbjct: 194 VIRKGDTFEFSDMPKNDVRFEYESTKILNDGITFDDGYTSEINTWIRPWVKSLREFLSQG 253 Query: 235 TAIVIDYGYLQS------RVGDTL-QAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAIL 287 + DYGY +S R TL + H P +N G+ D+++HVDF ++ AI Sbjct: 254 IVFLCDYGYHRSLYYSKDRYMGTLACYHQHHVNFEPFINIGEQDITAHVDFTTVAEAAIE 313 Query: 288 YKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKI 347 ++G TQ FL+ I + ++ ++ + +L S + + E+FK+ Sbjct: 314 EGFQLDGFMTQANFLKRANIAEVFSNISQRLSTNQLLKYSND--IKDLLLNDKLAEVFKV 371 Query: 348 LVVS 351 + S Sbjct: 372 MSFS 375 >gi|229117757|ref|ZP_04247126.1| hypothetical protein bcere0017_40300 [Bacillus cereus Rock1-3] gi|228665734|gb|EEL21207.1| hypothetical protein bcere0017_40300 [Bacillus cereus Rock1-3] Length = 370 Score = 189 bits (481), Expect = 4e-46, Method: Composition-based stats. Identities = 72/351 (20%), Positives = 129/351 (36%), Gaps = 17/351 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + GYY G GDF T+ +S F + A F I E Sbjct: 18 SISYSTYMNLVLYAEGHGYYMKDREKIGRQGDFFTSSNVSSAFAKTFAKFFIRLVE--ND 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P+ F L+ M+E S +Q++ L S+ + Sbjct: 76 EVAPNICEIGGGTGKFAYDVLQEWKQLSPETFIDLNYSMIEVSPFHRKLQQENLCSFSNV 135 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + L +NE FD+ P++ + E I + +L + Sbjct: 136 SYYTSYSKMGESFEGILFSNELFDAFPVEVIEKRNGMLYEVRITYTEEGNLAEVCRPLDK 195 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYL------QSR 247 + G FE + ++ I+ I +DYGY + Sbjct: 196 RIGRYLLKYNIHIAEGQRFEVPIAMEDYVKGIAKWFQKGIC--ITVDYGYTKAEWTYPAH 253 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL PG+ DL++H+ + L I L + Q +FL G Sbjct: 254 REGSLRGYYKHKLIRNPLAYPGEMDLTTHIHWDELKEIFNLQGMSAVWHKKQSEFLLAAG 313 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVEL 357 I ++ S + R + + +G F +++ + + L Sbjct: 314 ILEQLTSHQDTNPFSETQKQ--NRAIRSMILNGGLGNAFDVVIHTKDIQNL 362 >gi|52141239|ref|YP_085591.1| hypothetical protein BCZK4011 [Bacillus cereus E33L] gi|51974708|gb|AAU16258.1| conserved hypothetical protein [Bacillus cereus E33L] Length = 370 Score = 189 bits (481), Expect = 4e-46, Method: Composition-based stats. Identities = 71/351 (20%), Positives = 128/351 (36%), Gaps = 17/351 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + E GYY G GDF T+ +S F + A F + E Sbjct: 18 SISYSTYMKLALYAEEHGYYMKEREKIGRQGDFFTSSNVSSAFAKTFAKFFVRLVE--ND 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P+ F L+ M+E S +Q+K L S+ + Sbjct: 76 EVAPNICEIGGGTGRFAYDVLQEWKQLSPETFINLNYSMIEMSPFHRKLQQKSLCSFSNV 135 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + L +NE FD+ P++ + E + + L Sbjct: 136 SYYTSHSEMGESFEGILFSNELFDAFPVEVIEKRNGILYEVRVTYTEEGKLGEVYRPLHK 195 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------R 247 + G FE + ++ I+ I +DYGY + Sbjct: 196 RIGRYLLKYNIHLAEGQRFEVPIAMEEFIKEIAKWFQRGVC--ITVDYGYTKEEWMHPAH 253 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL +PG+ DL++H+ + L + L + Q +FL G Sbjct: 254 QEGSLRGYYEHKLIRNPLAHPGEMDLTTHIHWDELKEMFSLQGMNTVWHKKQSEFLLAAG 313 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVEL 357 I + + + R V + +G F +++ + +L Sbjct: 314 ILDQLTDHQDRNPFSETQKQ--NRAVRSMILNGGLGNAFDVVIHTKHMQQL 362 >gi|229075965|ref|ZP_04208941.1| hypothetical protein bcere0024_39750 [Bacillus cereus Rock4-18] gi|228707280|gb|EEL59477.1| hypothetical protein bcere0024_39750 [Bacillus cereus Rock4-18] Length = 370 Score = 189 bits (481), Expect = 5e-46, Method: Composition-based stats. Identities = 72/351 (20%), Positives = 130/351 (37%), Gaps = 17/351 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + GYY G GDF T+ +S F ++ A F I E Sbjct: 18 SISYSTYMNLVLYAEGHGYYMKDREKIGRQGDFFTSSNVSSAFAKIFAKFFIRLVE--ND 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P+ F L+ M+E S +Q++ L S+ + Sbjct: 76 EVAPNICEIGGGTGKFAYDVLQEWKQLSPETFIDLNYSMIEVSPFHRKLQQENLCSFSNV 135 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + L +NE FD+ P++ + E I + +L + Sbjct: 136 SYYTSYSKMGESFEGILFSNELFDAFPVEVIEKRNGMLYEVRITYTEEGNLAEVCRPLDK 195 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYL------QSR 247 + G FE + ++ I+ I +DYGY + Sbjct: 196 RIGRYLLKYNIHIAEGQRFEVPIAMEDYIKGIAKWFQKGIC--ITVDYGYTKAEWTYPAH 253 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL PG+ DL++H+ + L I L + Q +FL G Sbjct: 254 REGSLRGYYKHKLIRNPLAYPGEMDLTTHIHWDELKEIFNLQGMSAVWHKKQSEFLLAAG 313 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVEL 357 I ++ S + R + + +G F +++ + + L Sbjct: 314 ILEQLTSHQDTNPFSETQKQ--NRAIRSMILNGGLGNAFDVVIHTKDIQNL 362 >gi|229093329|ref|ZP_04224438.1| hypothetical protein bcere0021_40560 [Bacillus cereus Rock3-42] gi|228690053|gb|EEL43852.1| hypothetical protein bcere0021_40560 [Bacillus cereus Rock3-42] Length = 370 Score = 189 bits (480), Expect = 5e-46, Method: Composition-based stats. Identities = 72/351 (20%), Positives = 127/351 (36%), Gaps = 17/351 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + E GYY G GDF T+ +S F + A F + E Sbjct: 18 SISYSTYMKLALYAEEHGYYMKEREKIGRQGDFFTSSNVSSAFAKTFAKFFVRLVE--NG 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P+ F L+ M+E S +QKK L S+ + Sbjct: 76 EVAPNICEIGGGTGRFAYDVLQEWKQLSPETFINLNYSMIEMSPFHRKLQKKSLCSFSNV 135 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + L +NE FD+ P++ + E + + L Sbjct: 136 SYYTLHSEMGESFEGILFSNELFDAFPVEVIEKRNGILYEVRVTYTEEGKLAEVCRPLHK 195 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------R 247 + G FE + ++ I+ I +DYGY + Sbjct: 196 RIGRYLLKYNIHLAEGQRFEVPIAMEEFIKEIAKWFQRGVC--ITVDYGYTKEEWMHPAH 253 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL +PG+ DL++H+ + L + L + Q +FL G Sbjct: 254 QEGSLRGYYEHKLIRNPLAHPGEMDLTTHIHWDELKEMFSLQGMNTVWHKKQSEFLLAAG 313 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVEL 357 I + + + R V + +G F +++ + +L Sbjct: 314 ILDQLTDHQDRNPFSETQKQ--NRAVRSMILNGGLGSSFDVVIHTKHMQQL 362 >gi|254374879|ref|ZP_04990360.1| conserved hypothetical protein [Francisella novicida GA99-3548] gi|151572598|gb|EDN38252.1| conserved hypothetical protein [Francisella novicida GA99-3548] Length = 378 Score = 189 bits (480), Expect = 6e-46, Method: Composition-based stats. Identities = 71/364 (19%), Positives = 138/364 (37%), Gaps = 23/364 (6%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGE 61 L I+ IK + + + + + P+ GYYS + GDF+TA + +F Sbjct: 2 SLKNIILERIKSSKQPLLFRDFMQMALYYPQLGYYSRAKEKISSQGDFITATSQTSLFAR 61 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 A Q G + ++E G G G D + + L +VE S Sbjct: 62 TFARQFATIISQLG--NDCSVIEFGAGNGKFAADCVDELESLA---ILPKRYIIVELSND 116 Query: 122 LTLIQKKQLASYGDKINW---YTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERM 178 L L Q++ + + + + ANE D++P+ F + + ++ Sbjct: 117 LRLRQQQYIKENLPHLYDRFIWLDKLPAEKIKAIVFANELLDAMPVDIFRSENNKLIQQG 176 Query: 179 IDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFE--NSPCRDREMQSISDRLAC--DGG 234 + ++ ++++ + + G F+ + + ++ L G Sbjct: 177 VIRKGDTFEFSDMPKNDVRFEYESTKILNDGITFDDGYTSEINTWIRPWVKSLREVLSQG 236 Query: 235 TAIVIDYGYLQS------RVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAIL 287 + DYGY +S R TL H P +N G+ D+++HVDF ++ AI Sbjct: 237 IVFLCDYGYHRSLYYSKDRYMGTLACYHQHQVNFEPFINIGEQDITAHVDFTTVAGAAIE 296 Query: 288 YKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKI 347 ++G TQ FL+ I + ++ + + +L S + + E+FK+ Sbjct: 297 EGFQLDGFMTQANFLKRANIAEVFSNISQSLSTNQLLKYSND--MKDLLLNDKLAEVFKV 354 Query: 348 LVVS 351 + S Sbjct: 355 MAFS 358 >gi|229104890|ref|ZP_04235549.1| hypothetical protein bcere0019_40300 [Bacillus cereus Rock3-28] gi|228678520|gb|EEL32738.1| hypothetical protein bcere0019_40300 [Bacillus cereus Rock3-28] Length = 370 Score = 189 bits (480), Expect = 6e-46, Method: Composition-based stats. Identities = 72/351 (20%), Positives = 129/351 (36%), Gaps = 17/351 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + GYY G GDF T+ +S F + A F I E Sbjct: 18 SISYSTYMNLVLYAEGHGYYMKDREKIGRQGDFFTSSNVSSAFAKTFAKFFIRLVE--ND 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P+ F L+ M+E S +Q++ L S+ + Sbjct: 76 EVAPNICEIGGGTGKFAYDVLQEWKQLSPETFIDLNYSMIEVSPFHRKLQQENLCSFSNI 135 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + L +NE FD+ P++ + E I + +L + Sbjct: 136 SYYTSYSKMGESFEGILFSNELFDAFPVEVIEKRNGMLYEVRITYTEEGNLAEVCRPLDK 195 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYL------QSR 247 + G FE + ++ I+ I +DYGY + Sbjct: 196 RIGRYLLKYNIHIAEGQRFEVPIAMEDYIKGIAKWFQKGIC--ITVDYGYTKAEWTYPAH 253 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL PG+ DL++H+ + L I L + Q +FL G Sbjct: 254 REGSLRGYYKHKLIRNPLAYPGEMDLTTHIHWDELKEIFNLQGMSAVWHKKQSEFLLAAG 313 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVEL 357 I ++ S + R + + +G F +++ + + L Sbjct: 314 ILEQLTSHQDTNPFSETQKQ--NRAIRSMILNGGLGNAFDVVIHTKDIQNL 362 >gi|222097705|ref|YP_002531762.1| hypothetical protein BCQ_4045 [Bacillus cereus Q1] gi|221241763|gb|ACM14473.1| conserved hypothetical protein [Bacillus cereus Q1] Length = 368 Score = 189 bits (479), Expect = 7e-46, Method: Composition-based stats. Identities = 71/351 (20%), Positives = 131/351 (37%), Gaps = 17/351 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + E+GYY G GDF T+ +S +F + A F + E Sbjct: 16 SISYSTYMNLALYAEEYGYYMKEREKIGRQGDFFTSSNVSSVFAKTFAKFFVRLVE--NG 73 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P+ F L+ ++E S +Q+K L S+ + Sbjct: 74 EVAPHICEIGGGTGRFAYDVLQEWKQLSPETFIDLNYSIIEMSPFHRKLQQKTLCSFSNV 133 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + L +NE FD+ P++ + E + + L Sbjct: 134 SYYTSHSEMGESFEGILFSNELFDAFPVEVIEKRNGILYEVRVTYTEEGKLAEVCRPLHK 193 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------R 247 + G FE + ++ I+ I +DYGY + Sbjct: 194 RIGRYLLKYNIHLAEGQRFEVPIAMEEYIKEIAKWFQRGVC--ITVDYGYTKEEWMHPAH 251 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL PG+ DL++H+ + L + L + + Q +FL G Sbjct: 252 QEGSLRGYYEHKLIRNPLAYPGEMDLTTHIHWDELKEMFSLQGMNMVWHKKQSEFLLAAG 311 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVEL 357 I + + Q +I R V + +G +++ + +L Sbjct: 312 ILELLTNHQDQDPFSEIQKQ--NRAVHSMILSGGLGSACDVVIHTKHMQQL 360 >gi|118479435|ref|YP_896586.1| hypothetical protein BALH_3855 [Bacillus thuringiensis str. Al Hakam] gi|196046395|ref|ZP_03113621.1| conserved hypothetical protein [Bacillus cereus 03BB108] gi|225866242|ref|YP_002751620.1| hypothetical protein BCA_4371 [Bacillus cereus 03BB102] gi|229186501|ref|ZP_04313663.1| hypothetical protein bcere0004_40450 [Bacillus cereus BGSC 6E1] gi|118418660|gb|ABK87079.1| conserved hypothetical protein [Bacillus thuringiensis str. Al Hakam] gi|196022865|gb|EDX61546.1| conserved hypothetical protein [Bacillus cereus 03BB108] gi|225788473|gb|ACO28690.1| conserved hypothetical protein [Bacillus cereus 03BB102] gi|228596932|gb|EEK54590.1| hypothetical protein bcere0004_40450 [Bacillus cereus BGSC 6E1] Length = 370 Score = 189 bits (479), Expect = 7e-46, Method: Composition-based stats. Identities = 71/351 (20%), Positives = 128/351 (36%), Gaps = 17/351 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + E GYY G GDF T+ +S F + A F + E Sbjct: 18 SISYSTYMKLVLYAEEHGYYMKEREKIGRQGDFFTSSNVSSAFAKTFAKFFVRLVE--NG 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P+ F L+ M+E S +Q+K L S+ + Sbjct: 76 EVAPNICEIGGGTGRFAYDVLQEWKQLSPETFINLNYSMIEMSPFHRKLQQKSLCSFSNV 135 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + L +NE FD+ P++ + E + + L Sbjct: 136 SYYTSHSEMGESFEGILFSNELFDAFPVEVIEKRNGILYEVRVTYTEEGKLAEVCRPLHK 195 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------R 247 + G FE + ++ I+ I +DYGY + Sbjct: 196 RIGRYLLKYNIHLAEGQRFEVPIAMEEFIKEIAKWFQRGVC--ITVDYGYTKEEWMHPAH 253 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL +PG+ DL++H+ + L + L + Q +FL G Sbjct: 254 QEGSLRGYYEHKLIRNPLAHPGEMDLTTHIHWDELKEMFSLQGMNTVWHKKQSEFLLAAG 313 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVEL 357 I + + + R V + +G F +++ + +L Sbjct: 314 ILDQLTDHQDRNPFSETQKQ--NRAVRSMILNGGLGSSFDVVIHTKHMQQL 362 >gi|206978016|ref|ZP_03238902.1| conserved hypothetical protein [Bacillus cereus H3081.97] gi|217961751|ref|YP_002340321.1| hypothetical protein BCAH187_A4393 [Bacillus cereus AH187] gi|229140996|ref|ZP_04269539.1| hypothetical protein bcere0013_40900 [Bacillus cereus BDRD-ST26] gi|206743816|gb|EDZ55237.1| conserved hypothetical protein [Bacillus cereus H3081.97] gi|217066789|gb|ACJ81039.1| conserved hypothetical protein [Bacillus cereus AH187] gi|228642429|gb|EEK98717.1| hypothetical protein bcere0013_40900 [Bacillus cereus BDRD-ST26] Length = 370 Score = 189 bits (479), Expect = 8e-46, Method: Composition-based stats. Identities = 71/351 (20%), Positives = 131/351 (37%), Gaps = 17/351 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + E+GYY G GDF T+ +S +F + A F + E Sbjct: 18 SISYSTYMNLALYAEEYGYYMKEREKIGRQGDFFTSSNVSSVFAKTFAKFFVRLVE--NG 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P+ F L+ ++E S +Q+K L S+ + Sbjct: 76 EVAPHICEIGGGTGRFAYDVLQEWKQLSPETFIDLNYSIIEMSPFHRKLQQKTLCSFSNV 135 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + L +NE FD+ P++ + E + + L Sbjct: 136 SYYTSHSEMGESFEGILFSNELFDAFPVEVIEKRNGILYEVRVTYTEEGKLAEVCRPLHK 195 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------R 247 + G FE + ++ I+ I +DYGY + Sbjct: 196 RIGRYLLKYNIHLAEGQRFEVPIAMEEYIKEIAKWFQRGVC--ITVDYGYTKEEWMHPAH 253 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL PG+ DL++H+ + L + L + + Q +FL G Sbjct: 254 QEGSLRGYYEHKLIRNPLAYPGEMDLTTHIHWDELKEMFSLQGMNMVWHKKQSEFLLAAG 313 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVEL 357 I + + Q +I R V + +G +++ + +L Sbjct: 314 ILELLTNHQDQDPFSEIQKQ--NRAVHSMILSGGLGSACDVVIHTKHMQQL 362 >gi|229198388|ref|ZP_04325094.1| hypothetical protein bcere0001_39180 [Bacillus cereus m1293] gi|228585088|gb|EEK43200.1| hypothetical protein bcere0001_39180 [Bacillus cereus m1293] Length = 370 Score = 188 bits (478), Expect = 8e-46, Method: Composition-based stats. Identities = 70/351 (19%), Positives = 130/351 (37%), Gaps = 17/351 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + E GYY G GDF T+ +S +F + A F I E Sbjct: 18 SISYSTYMNLALYAEEHGYYMKEREKIGRQGDFFTSSNVSSVFAKTFAKFFIRLVE--NG 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P+ F L+ ++E S +Q+K L S+ + Sbjct: 76 EVAPHICEIGGGTGRFAYDVLQEWKQLSPETFIDLNYSIIEMSPFHRKLQQKTLCSFSNV 135 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + L +NE FD+ P++ + E + + L Sbjct: 136 SYYTSHSEMGESFEGILFSNELFDAFPVEVIEKRNGILYEVRVTYTEEGKLAEVCRPLHK 195 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------R 247 + G FE + ++ I+ I +DYGY + Sbjct: 196 RIGRYLLKYNIHLAEGQRFEVPIAMEEYIKEIAKWFQRGVC--ITVDYGYTKEEWMHPAH 253 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL PG+ DL++H+ + L + L + + Q +FL G Sbjct: 254 QEGSLRGYYEHKLIRNPLAYPGEMDLTTHIHWDELKEMFSLQGMNMVWHKKQSEFLLAAG 313 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVEL 357 I + + + + + R V + +G +++ + +L Sbjct: 314 ILELLTNHQDRDPFSE--IQQQNRAVHSMILSGGLGSACDVVIHTKHMQQL 362 >gi|324328166|gb|ADY23426.1| hypothetical protein YBT020_20990 [Bacillus thuringiensis serovar finitimus YBT-020] Length = 368 Score = 188 bits (478), Expect = 9e-46, Method: Composition-based stats. Identities = 70/351 (19%), Positives = 130/351 (37%), Gaps = 17/351 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + E GYY G GDF T+ +S +F + A F I E Sbjct: 16 SISYSTYMNLALYAEEHGYYMKEREKIGRQGDFFTSSNVSSVFAKTFAKFFIRLVE--NG 73 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P+ F L+ ++E S +Q+K L S+ + Sbjct: 74 EVAPHICEIGGGTGRFAYDVLQEWKQLSPETFIDLNYSIIEMSPFHRKLQQKTLCSFSNV 133 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + L +NE FD+ P++ + E + + L Sbjct: 134 SYYTSHSEMGESFEGILFSNELFDAFPVEVIEKRNGILYEVRVTYTEEGKLAEVCRPLHK 193 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------R 247 + G FE + ++ ++ I +DYGY + Sbjct: 194 RIGRYLLKYNIHLAEGQRFEVPIAMEEYIKEMAKWFQRGVC--ITVDYGYTKEEWMHPAH 251 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL PG+ DL++H+ + L + L + + Q +FL G Sbjct: 252 QEGSLRGYYEHKLIRNPLAYPGEMDLTTHIHWDELKEMFSLQGMNMVWHKKQSEFLLAAG 311 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVEL 357 I + + + +I R V + +G +++ + +L Sbjct: 312 ILELLTNHQDRDPFSEIQKQ--NRAVHSMILSGGLGSACDVVIHTKHMQQL 360 >gi|229098732|ref|ZP_04229672.1| hypothetical protein bcere0020_39600 [Bacillus cereus Rock3-29] gi|228684811|gb|EEL38749.1| hypothetical protein bcere0020_39600 [Bacillus cereus Rock3-29] Length = 370 Score = 188 bits (478), Expect = 9e-46, Method: Composition-based stats. Identities = 71/351 (20%), Positives = 128/351 (36%), Gaps = 17/351 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + GYY G GDF T+ +S F + A F I E Sbjct: 18 SISYSTYMNLVLYAEGHGYYMKDREKIGRQGDFFTSSNVSSAFAKTFAKFFIRLVE--ND 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P+ F L+ M+E S +Q++ L S+ + Sbjct: 76 EVAPNICEIGGGTGKFAYDVLQEWKQLSPETFIDLNYSMIEVSPFHRKLQQENLCSFSNV 135 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + L +NE FD+ P++ + E I + +L + Sbjct: 136 SYYTSYSKMGESFEGILFSNELFDAFPVEVIEKRNGMLYEVRITYTEEGNLAEVCRPLDK 195 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYL------QSR 247 + G FE + ++ I+ I +DYGY + Sbjct: 196 RIGRYLLKYNIHIAEGQRFEVPIAMEDYIKGIAKWFQKGIC--ITVDYGYTKAEWTYPAH 253 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL PG+ DL++H+ + L I + Q +FL G Sbjct: 254 REGSLRGYYKHKLIRNPLAYPGEMDLTTHIHWDELKEIFNFQGMSAVWHKKQSEFLLAAG 313 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVEL 357 I ++ S + R + + +G F +++ + + L Sbjct: 314 ILEQLTSHQDTNPFSETQKQ--NRAIRSMILNGGLGNAFDVVIHTKDIQNL 362 >gi|229157872|ref|ZP_04285947.1| hypothetical protein bcere0010_40530 [Bacillus cereus ATCC 4342] gi|228625829|gb|EEK82581.1| hypothetical protein bcere0010_40530 [Bacillus cereus ATCC 4342] Length = 370 Score = 188 bits (478), Expect = 1e-45, Method: Composition-based stats. Identities = 69/351 (19%), Positives = 129/351 (36%), Gaps = 17/351 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + GYY G GDF T+ +S +F + A F I E Sbjct: 18 SISYSTYMNLVLYAEGHGYYMKEREKIGRQGDFFTSSNVSSVFAKTFAKFFIRLVE--NG 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P+ F L+ M+E S +Q++ L S+ + Sbjct: 76 EVAPNICEIGGGTGRFAYDVLQEWKQLSPETFIDLNYSMIEMSPFHRKLQQENLCSFSNV 135 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + L +NE FD+ P++ + E + + L Sbjct: 136 SYYRSHNEMGESFEGILFSNELFDAFPVEVIEKRNGILYEVRVTYTEEGKLAEVCRPLHK 195 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------R 247 G FE + ++ I+ I +DYGY + Sbjct: 196 TIGRYLLKYNIHLAEGQRFEVPIAMEEYIKEIAKWFQRGVC--ITVDYGYTKEEWMHPAH 253 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL +PG+ DL++H+ + + + L + Q +FL G Sbjct: 254 QEGSLRGYYEHKLIRNPLAHPGEMDLTAHIHWDEMKEMFSLQGMNTVWHKKQYEFLLAAG 313 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVEL 357 I ++ + + + R V + +G F +++ + +L Sbjct: 314 ILEQLTNHQDRNPFSETQKQ--NRAVRSMILSGGLGSAFDVVIHTKHMQQL 362 >gi|212638743|ref|YP_002315263.1| hypothetical protein Aflv_0900 [Anoxybacillus flavithermus WK1] gi|212560223|gb|ACJ33278.1| Uncharacterized conserved protein [Anoxybacillus flavithermus WK1] Length = 367 Score = 188 bits (478), Expect = 1e-45, Method: Composition-based stats. Identities = 76/371 (20%), Positives = 134/371 (36%), Gaps = 25/371 (6%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIFG 60 K++ K+ N +++ QY L + D ++GYY G GDF+T + I G Sbjct: 9 SEKMLEKMRNK-----RLSYAQYMELALYDEQYGYYMRRKEKIGREGDFITTSNVGHIIG 63 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 + A + E H P + E+G G G +L + P F M+E S Sbjct: 64 RVFADVFVRLIETHHVPPL--ICEIGGGTGRFAYTVLEQWKRQSPKTFPQGQYIMIEASP 121 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFT-FLVANEFFDSLPIKQFVMTEHGIRERMI 179 Q + LA + D + +S A++ F+ + +NE FD+ P+ E I E I Sbjct: 122 YHRQKQAETLAPFIDYVRIISSFAELSDSFSGIVFSNELFDAFPVHVIEKREGHIYECFI 181 Query: 180 DIDQH---DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTA 236 +++ + V + + G FE + + + + Sbjct: 182 EVENDQLVEKCVPLENEQIAQYIVERNIQLVDGQRFEIPLAMKKFIFQLDQVIDK--AII 239 Query: 237 IVIDYGYLQS------RVGDTLQAVKGHTYVSP-LVNPGQADLSSHVDFQRLSSIAILYK 289 +DYGY R +L+ H + L PG+ DL++H+ + L Sbjct: 240 FTVDYGYTDEEWKHPARKRGSLRGYYRHQLIDNALRYPGEMDLTTHIQWDALRFYGEQAG 299 Query: 290 LYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILV 349 L Q +FL GI + R + + +G F+++V Sbjct: 300 WTFTHLWRQDEFLLQAGILNHL--INHHDPNPFSEQQRHNRAIRSLVMSD-IGRAFRVMV 356 Query: 350 VSHEKVELMPF 360 + + F Sbjct: 357 QQK-NMNIPIF 366 >gi|297265813|ref|XP_002799256.1| PREDICTED: protein midA homolog, mitochondrial-like [Macaca mulatta] Length = 415 Score = 188 bits (478), Expect = 1e-45, Method: Composition-based stats. Identities = 107/376 (28%), Positives = 165/376 (43%), Gaps = 60/376 (15%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P ++ Sbjct: 42 TPMLRHLMYKIKSTGPITVAEYMKEVLTNP---------------------------AKL 74 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV-LSIYMVETSER 121 L I+ I W G + +LVELGPGRG ++ DILRV +L + +S+++VE S++ Sbjct: 75 LGIWFISEWMATGKSTAFQLVELGPGRGTLVGDILRVFTQLGSVLKNCDISVHLVEVSQK 134 Query: 122 LTLIQ--------------------KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDS 161 L+ IQ K + G I+WY + DVP G++F +A+EFFD Sbjct: 135 LSEIQALTLTEEKVPLERNAGSPVYMKGVTKSGIPISWYRHVHDVPKGYSFYLAHEFFDV 194 Query: 162 LPIKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRD 219 LP+ +F T G RE IDID D L F + + D E P Sbjct: 195 LPVHKFQKTPQGWREVFIDIDPQVSDKLRFVLAPSATPAEAFIQHD-ETRDHVEVCPDAG 253 Query: 220 REMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 ++ +S R+A GG A+V DYG+ ++ + GH L+ PG ADL++ VDF Sbjct: 254 VIIEELSQRIALTGGAALVADYGHDGTKTXM-FKGFCGHKLHDVLIAPGTADLTADVDFS 312 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR---KDILLDSVKRLVSTSA 336 L +A K+ G Q FL+ +GI R L+ ++ + LL L+ Sbjct: 313 YLRRMA-QGKVASLGPIKQHTFLKNMGIDVRLKVLLDKSNEPSVRQQLLQGYDMLM---- 367 Query: 337 DKKSMGELFKILVVSH 352 + K MGE F + Sbjct: 368 NPKKMGERFNFFALLP 383 >gi|42783386|ref|NP_980633.1| hypothetical protein BCE_4340 [Bacillus cereus ATCC 10987] gi|42739314|gb|AAS43241.1| conserved hypothetical protein [Bacillus cereus ATCC 10987] Length = 368 Score = 188 bits (477), Expect = 1e-45, Method: Composition-based stats. Identities = 71/351 (20%), Positives = 130/351 (37%), Gaps = 17/351 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + E GYY G GDF T+ +S +F + A F I E Sbjct: 16 SISYSTYMNLALYAEEHGYYMKEREKIGRQGDFFTSSNVSSVFAKTFAKFFIRLVE--NG 73 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P+ F L+ ++E S +Q+K L S+ + Sbjct: 74 EVVPHICEIGGGTGRFAYDVLQEWKQLSPETFIDLNYSIIEMSPFHRKLQQKTLCSFSNV 133 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + L +NE FD+ P++ + E + + L Sbjct: 134 SYYTSHSEMGESFEGILFSNELFDAFPVEVIEKRNGILYEVRVTYTEEGKLAEVCRPLHK 193 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------R 247 + G FE + ++ I+ I +DYGY + Sbjct: 194 RIGRYLLKYNIHLAEGQRFEVPIAMEEYIKEIAKWFQRGVC--ITVDYGYTKEEWMHPAH 251 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL PG+ DL++H+ + L + L + + Q +FL G Sbjct: 252 QEGSLRGYYEHKLIRNPLAYPGEMDLTTHIHWDELKEMFSLQGMNMVWHKKQSEFLLAAG 311 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVEL 357 I + + + +I R V + +G +++ + +L Sbjct: 312 ILELLTNHQDRDPFSEIQKQ--NRAVHSMILSGGLGSACDVVIHTKHMQQL 360 >gi|229031909|ref|ZP_04187896.1| hypothetical protein bcere0028_39560 [Bacillus cereus AH1271] gi|228729373|gb|EEL80363.1| hypothetical protein bcere0028_39560 [Bacillus cereus AH1271] Length = 370 Score = 188 bits (477), Expect = 1e-45, Method: Composition-based stats. Identities = 74/352 (21%), Positives = 135/352 (38%), Gaps = 19/352 (5%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + GYY G GDF T+ +S +F + A F I E Sbjct: 18 SISYSTYMNLVLYTEGHGYYMKEREKIGRQGDFFTSSNVSSVFAKTFAKFFIRLVES--G 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ ++ P+ F L+ M+E S +Q+K L S+ + Sbjct: 76 EVAPNICEIGGGTGRFAYDVLQEWKQVSPETFIDLNYSMIEMSPFHRSLQQKNLCSFSN- 134 Query: 137 INWYTSLADVPLGFT-FLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHE 195 +++YTS +++ F L +NE FD+ P++ + E + + L Sbjct: 135 VSYYTSYSEMGESFDGILFSNELFDAFPVEVIEKRNGMLYEVRVTYTEEGKLAEVWRPLH 194 Query: 196 IKSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------ 246 G FE + ++ I+ I +DYGY + Sbjct: 195 KSIGRYLLKYNIHLAEGQRFEVPIVMEEYIKDIAKWFQKGIC--ITVDYGYTKEEWMHPA 252 Query: 247 RVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGL 305 +L+ H +PL +PG+ DL++H+ + L + L + Q +FL Sbjct: 253 HQEGSLRGYYEHKLIRNPLAHPGEMDLTAHIHWDELKEMFSLQGMDTVWHKKQSEFLLAA 312 Query: 306 GIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVEL 357 GI + + + R V + +G F +++ + +L Sbjct: 313 GILDQLTGHQDRNPFSETQKQ--NRAVRSMILSGGLGSAFDVVIHTKHMQQL 362 >gi|319651535|ref|ZP_08005663.1| hypothetical protein HMPREF1013_02275 [Bacillus sp. 2_A_57_CT2] gi|317396850|gb|EFV77560.1| hypothetical protein HMPREF1013_02275 [Bacillus sp. 2_A_57_CT2] Length = 353 Score = 188 bits (476), Expect = 1e-45, Method: Composition-based stats. Identities = 71/364 (19%), Positives = 141/364 (38%), Gaps = 23/364 (6%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEISQIF 59 M N L+ I N ++ ++ +Y + E+GYY P G GD++T+ IS I+ Sbjct: 1 MRNFLMNFIENTPQQ--MISYAEYIQQALYHSEYGYYMKNTPKIGPAGDYITSSNISDIY 58 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 G ++ + +++ P ++ E+G G G + + + + ETS Sbjct: 59 GRTISKWFFQMAKEYKLP--FQVCEIGGGNGRFARAFIDEWKLIADEEIHYC---IFETS 113 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMI 179 +Q++Q+ + + I S ++ + +NE FD+LP+ + + E M+ Sbjct: 114 PYHRKLQEEQIV-FSESIRQIDSWNEITPFCGMIFSNELFDALPVHVVEKRKDQLHEIMV 172 Query: 180 DIDQHDSLVFNIGDHEIKSNFLTCSD---YFLGAIFENSPCRDREMQSISDRLACDGGTA 236 + + + + G E ++ ++S+S L G Sbjct: 173 TVKEGELAEIAVPLTNGDIYLFLEESGLSLSNGQRIEIPLQMEQMIKSLSAALDK--GIV 230 Query: 237 IVIDYGYLQS------RVGDTLQAVKGHTYVS-PLVNPGQADLSSHVDFQRLSSIAILYK 289 + DYGY R +L+ H+ ++ L +PG+ D++SH+ F L I Sbjct: 231 LTADYGYTDEEWQEPMRRDGSLRGYYKHSLMNNVLEHPGKMDITSHIHFDSLIRIGEKEG 290 Query: 290 LYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILV 349 L + Q +FL GI Q + R + + M F +++ Sbjct: 291 LNFHFKMRQDEFLLSAGILQELEDHYDSNPFSP--ISKRNRAIRSLIMPSGMSSAFHLVL 348 Query: 350 VSHE 353 + + Sbjct: 349 QAKK 352 >gi|229174935|ref|ZP_04302455.1| hypothetical protein bcere0006_40190 [Bacillus cereus MM3] gi|228608603|gb|EEK65905.1| hypothetical protein bcere0006_40190 [Bacillus cereus MM3] Length = 370 Score = 187 bits (475), Expect = 2e-45, Method: Composition-based stats. Identities = 75/370 (20%), Positives = 134/370 (36%), Gaps = 21/370 (5%) Query: 1 MENKLIRKIVNLIKKNG--QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQ 57 ME +L + ++K ++ Y L + GYY G GDF T+ +S Sbjct: 1 MEMEL--ILKEWMEKEKDYSISYSTYMNLVLYTEGHGYYMKEREKIGRQGDFFTSSNVSS 58 Query: 58 IFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVE 117 +F + A F I E + E+G G G D+L+ ++ P F L+ M+E Sbjct: 59 VFAKTFAKFFIRLVE--NGEVAPNICEIGGGTGRFAYDVLQEWKQVSPKTFIDLNYSMIE 116 Query: 118 TSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRER 177 S Q+K L S+ + + + L +NE FD+ P++ + E Sbjct: 117 MSPFHRNFQQKNLCSFSNVSYYTSHSEMGESFEGVLFSNELFDAFPVEVIEKRNGILYEV 176 Query: 178 MIDIDQHDSLVFNIGDHEIKSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGG 234 + + L + G FE + + I+ Sbjct: 177 RVTYTEEGKLAEVCRPLQKNIGRYLLKYNIHLAEGQRFEVPLAMEEYIVEIAKWFQRGVC 236 Query: 235 TAIVIDYGYLQS------RVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAIL 287 I +DYGY + +L+ H +PL +PG+ DL++H+ + L + L Sbjct: 237 --ITVDYGYTKEEWMHPAHQEGSLRGYYEHKLIRNPLEHPGEMDLTTHIHWDELKEMFSL 294 Query: 288 YKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKI 347 + I Q +FL GI + + + R V + +G F + Sbjct: 295 QGMNIVWHKKQSEFLLAAGILDQLTGHQDRNPFSETQKQ--NRAVRSMILSGGLGSAFDV 352 Query: 348 LVVSHEKVEL 357 ++ + +L Sbjct: 353 VIHTKHMQQL 362 >gi|229192470|ref|ZP_04319433.1| hypothetical protein bcere0002_41230 [Bacillus cereus ATCC 10876] gi|228591047|gb|EEK48903.1| hypothetical protein bcere0002_41230 [Bacillus cereus ATCC 10876] Length = 370 Score = 186 bits (472), Expect = 5e-45, Method: Composition-based stats. Identities = 70/347 (20%), Positives = 128/347 (36%), Gaps = 17/347 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + GYY G GDF T+ +S +F + A I E Sbjct: 18 SISYSTYMNLVLYTEGHGYYMKEREKIGRQGDFFTSSNVSSVFAKTFAKLFIRLVE--NG 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P F L+ M+E S +Q++ L S+ + Sbjct: 76 EVAPNICEIGGGTGKFAYDVLQEWKQLSPKTFIDLNYSMIEVSPFHRKLQQEHLGSFSNV 135 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + L +NE FD+ P++ + E I + SL + Sbjct: 136 SYYTSYSEMGDSFEGILFSNELFDAFPVEIIEKRNGMLYEVRITYTEEGSLSEVCRPLDK 195 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------R 247 + G FE + ++ I+ I +DYGY + Sbjct: 196 RIGRYLLKYNIHIAEGQRFEVPIVMEEYIKEIAKWFQKGIC--ITVDYGYTKEEWMHPAH 253 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL +PG+ DL++H+ + L + + + Q +FL G Sbjct: 254 REGSLRGYYQHKLIRNPLAHPGEMDLTTHIHWDELKEMFSMQGMNAVWHKKQSEFLLAAG 313 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHE 353 I ++ S + R V + +G F +++ + + Sbjct: 314 ILEQLTSHQDTNPFSETQKQ--NRAVRSMILNGGVGSAFDVVIHTKD 358 >gi|261855219|ref|YP_003262502.1| hypothetical protein Hneap_0602 [Halothiobacillus neapolitanus c2] gi|261835688|gb|ACX95455.1| protein of unknown function DUF185 [Halothiobacillus neapolitanus c2] Length = 398 Score = 186 bits (472), Expect = 5e-45, Method: Composition-based stats. Identities = 91/384 (23%), Positives = 144/384 (37%), Gaps = 52/384 (13%) Query: 2 ENKLIRKIV-NLIKKNGQMTVDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEISQIF 59 LI + +I + ++ Y A + P++GYY + FGA GDFVTAPE S F Sbjct: 20 SMTLIEHLQGRMIDE--PLSFSDYMAEVLYHPDYGYYGSAQVQFGAGGDFVTAPERSPFF 77 Query: 60 GEMLAIFLICAWEQHGFPSCVRL-VELGPGRGIMMLDILRVI--CKLKPDFFSVLSIYMV 116 A L+ W+Q S VR ELG G G + LD LR PD + + Sbjct: 78 ----AAGLVYEWQQIQRDSPVRQVCELGAGSGQLALDFLRTCDTRGCMPDQY-----LIW 128 Query: 117 ETSERLTLIQKKQL-----ASYGDKINWYTS----LADVPLGFTFLVANEFFDSLPIKQF 167 E S L Q+ +L ++ W ++ANE D++P +F Sbjct: 129 EISPGLRKRQQTRLKDELKPELWSRLTWVEDREAKDLADIWAGGMVIANEVVDAMPACRF 188 Query: 168 VMTEHGI----RERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQ 223 + ++ + + V + E+++ C+ + P Sbjct: 189 RWRPGQLDTLEELKVGWVGKRFGWVADTASPELRAALADCAGLWPLDDLAPEPVAAEINL 248 Query: 224 SISDRL---------ACDGGTAIVIDYG------YLQSRVGDTLQAVKGHTYV-SPLVNP 267 +S L + DYG Y RV TL+ H P V P Sbjct: 249 DLSRWLASIRTLFGHPEAASILYLFDYGGHTAEVYRPDRVDGTLRCHYRHRAHDDPFVYP 308 Query: 268 GQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQ---RAFSLMKQTARKDIL 324 G D+++ VDF+RL+ +A ++G +Q +L G + R + + + Sbjct: 309 GLQDITTWVDFERLARLAREGGFVVDGERSQAAWLLGTDVPDSFSRQMQAVTDRSASARI 368 Query: 325 LDSVKRLVSTSADKKSMGELFKIL 348 K LV MGE F++L Sbjct: 369 AQGFKELVM----PTEMGERFRVL 388 >gi|229146846|ref|ZP_04275211.1| hypothetical protein bcere0012_39860 [Bacillus cereus BDRD-ST24] gi|296504757|ref|YP_003666457.1| hypothetical protein BMB171_C3928 [Bacillus thuringiensis BMB171] gi|228636674|gb|EEK93139.1| hypothetical protein bcere0012_39860 [Bacillus cereus BDRD-ST24] gi|296325809|gb|ADH08737.1| putative cytoplasmic protein [Bacillus thuringiensis BMB171] Length = 370 Score = 186 bits (472), Expect = 5e-45, Method: Composition-based stats. Identities = 76/348 (21%), Positives = 138/348 (39%), Gaps = 19/348 (5%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + GYY G GDF T+ +S +F + A I E Sbjct: 18 SISYSTYMNLVLYTEGHGYYMKEREKIGRQGDFFTSSNVSSVFAKTFAKLFIRLVE--NG 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P+ F L+ M+E S +Q++QL S+ + Sbjct: 76 EVASNICEVGGGTGKFAYDVLQEWKQLSPNTFINLNYSMIEVSPFHRKLQQEQLGSFSN- 134 Query: 137 INWYTSLADVPLGFT-FLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHE 195 +++YTS +++ F L +NE FD+ P++ + E I + +L Sbjct: 135 VSYYTSYSEMGDSFEGILFSNELFDAFPVEVIEKRNGILYEVRITYTEEGNLSEVCRPLN 194 Query: 196 IKSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------ 246 + G FE + ++ IS I +DYGY + Sbjct: 195 KRIGRYLLKYNIHIAEGQRFEVPIVMEEYIKEISKWFQKGIC--ITVDYGYTKEEWMHPA 252 Query: 247 RVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGL 305 +L+ H + +PL +PG+ DL++HV + L + + + Q +FL Sbjct: 253 HREGSLRGYYQHKLMRNPLAHPGEMDLTTHVHWDELKEMFSMQGMSAVWHKKQSEFLLAA 312 Query: 306 GIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHE 353 GI ++ S + R V + +G F +++ + + Sbjct: 313 GILEQLTSHQDTNPFSETQKQ--NRAVRSMILNGGVGSSFDVVIHTKD 358 >gi|78779066|ref|YP_397178.1| hypothetical protein PMT9312_0681 [Prochlorococcus marinus str. MIT 9312] gi|78712565|gb|ABB49742.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT 9312] Length = 396 Score = 186 bits (472), Expect = 5e-45, Method: Composition-based stats. Identities = 76/377 (20%), Positives = 148/377 (39%), Gaps = 35/377 (9%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYY-STCNPFGAVGDFVTAPEISQIF---- 59 L++KI IK G ++ + + DP GYY S G GDFVT+ +S F Sbjct: 12 LVKKI---IKMGGTISFYDFMNFVLNDPINGYYGSGKAVLGVRGDFVTSTSLSDDFAFLA 68 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 G+ + +LI + ++E G G G M +++ + +F +S ++E + Sbjct: 69 GKQIEDWLIQFKSSFLSNQKLAVIEFGAGDGSFMSGLIKYFLENNKNFLEGVSFLIIEPN 128 Query: 120 ERLTLIQKKQLASYGDKINWYT----SLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIR 175 + + QK +L + + + ++ANE D+LP+++ ++ + Sbjct: 129 KGMVEKQKNKLEEFLNLGIDILWKGLEEVEENNINGIVLANEVLDALPVERITFSKGKLL 188 Query: 176 ERMIDIDQHDSLVFNIGDHEIKSNF-------------LTCSDYFLGAIFENSPCRDREM 222 + + ID+ +F + D G E + + Sbjct: 189 RQAVSIDKKSHNLFFDEMPITNELDKSIELAKSELGITIPPEDALEGWTTEWHIDNSKWL 248 Query: 223 QSISDRLACDGGTAIVIDY------GYLQSRVGDTLQAVKGHTYVSPLVN-PGQADLSSH 275 ++I ++ + G ++IDY Y T+ + + + +++ PG DL+SH Sbjct: 249 KAIYGKI--NNGILLIIDYAKEAKKYYTSKNSDGTIVSYENQKMTNNVLDSPGNCDLTSH 306 Query: 276 VDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTS 335 V + L A G+T QG+ L LG+ +R + + K+ + + Sbjct: 307 VCIETLIHDAETLGFNTVGITKQGEALLALGLAERLYGIQKEFKEDLSNALLRREALLRL 366 Query: 336 ADKKSMGELFKILVVSH 352 D +G+ FK V Sbjct: 367 VDPVCLGD-FKWFVFKK 382 >gi|30022340|ref|NP_833971.1| putative cytoplasmic protein [Bacillus cereus ATCC 14579] gi|218234031|ref|YP_002369065.1| hypothetical protein BCB4264_A4374 [Bacillus cereus B4264] gi|229129539|ref|ZP_04258510.1| hypothetical protein bcere0015_39820 [Bacillus cereus BDRD-Cer4] gi|29897897|gb|AAP11172.1| hypothetical Cytosolic Protein [Bacillus cereus ATCC 14579] gi|218161988|gb|ACK61980.1| conserved hypothetical protein [Bacillus cereus B4264] gi|228654144|gb|EEL10011.1| hypothetical protein bcere0015_39820 [Bacillus cereus BDRD-Cer4] Length = 370 Score = 186 bits (472), Expect = 5e-45, Method: Composition-based stats. Identities = 72/347 (20%), Positives = 130/347 (37%), Gaps = 17/347 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + GYY G GDF T+ +S +F + A I E Sbjct: 18 SISYSTYMNLVLYTEGHGYYMKEREKIGRQGDFFTSSNVSSVFAKTFAKLFIRLVE--NG 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P+ F L+ M+E S +Q++QL S+ + Sbjct: 76 EVASNICEVGGGTGKFAYDVLQEWKQLSPNTFINLNYSMIEVSPFHRKLQQEQLGSFSNV 135 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + L +NE FD+ P++ + E I + +L Sbjct: 136 SYYTSYSEMGDSFEGILFSNELFDAFPVEVIEKRNGILYEVRITYTEEGNLSEVCRPLNK 195 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------R 247 + G FE + ++ IS I +DYGY + Sbjct: 196 RIGRYLLKYNIHIAEGQRFEVPIVMEEYIKEISKWFQKGIC--ITVDYGYTKEEWMHPAH 253 Query: 248 VGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H + +PL +PG+ DL++HV + L + + + Q +FL G Sbjct: 254 REGSLRGYYQHKLMRNPLAHPGEMDLTTHVHWDELKEMFSMQGMSAVWHKKQSEFLLAAG 313 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHE 353 I ++ S + R V + +G F +++ + + Sbjct: 314 ILEQLTSHQDTNPFSETQKQ--NRAVRSMILNGGVGSSFDVVIHTKD 358 >gi|228960526|ref|ZP_04122175.1| hypothetical protein bthur0005_39920 [Bacillus thuringiensis serovar pakistani str. T13001] gi|228799126|gb|EEM46094.1| hypothetical protein bthur0005_39920 [Bacillus thuringiensis serovar pakistani str. T13001] Length = 370 Score = 186 bits (472), Expect = 5e-45, Method: Composition-based stats. Identities = 75/348 (21%), Positives = 138/348 (39%), Gaps = 19/348 (5%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + GYY G GDF T+ +S +F + A I E Sbjct: 18 SISYSTYMNLVLYTEGHGYYMKEREKIGRQGDFFTSSNVSSVFAKTFAKLFIRLVE--NG 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P+ F L+ M+E S +Q+++L S+ + Sbjct: 76 EVASNICEVGGGTGKFAYDVLQEWKQLSPNTFINLNYSMIEVSPFHRKLQQEKLGSFSN- 134 Query: 137 INWYTSLADVPLGFT-FLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHE 195 +++YTS +++ F L +NE FD+ P++ + E I + +L Sbjct: 135 VSYYTSYSEMGDSFEGILFSNELFDAFPVEVIEKRNGILYEVRITYTEEGNLSEVCRPLN 194 Query: 196 IKSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------ 246 + G FE + ++ IS I +DYGY + Sbjct: 195 KRIGRYLLKYNIHIAEGQRFEVPIVMEEYIKEISKWFQKGIC--ITVDYGYTKEEWMHPA 252 Query: 247 RVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGL 305 +L+ H + +PL +PG+ DL++HV + L + + + Q +FL Sbjct: 253 HREGSLRGYYQHKLMRNPLAHPGEMDLTTHVHWDELKEMFSMQGMSAVWHKKQSEFLLAA 312 Query: 306 GIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHE 353 GI ++ S + R V + +G F +++ + + Sbjct: 313 GILEQLTSHQDTNPFSETQKQ--NRAVRSMILNGGVGSSFDVVIHTKD 358 >gi|228987508|ref|ZP_04147627.1| hypothetical protein bthur0001_41800 [Bacillus thuringiensis serovar tochigiensis BGSC 4Y1] gi|228772240|gb|EEM20687.1| hypothetical protein bthur0001_41800 [Bacillus thuringiensis serovar tochigiensis BGSC 4Y1] Length = 370 Score = 186 bits (472), Expect = 5e-45, Method: Composition-based stats. Identities = 69/351 (19%), Positives = 128/351 (36%), Gaps = 17/351 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + GYY G GDF T+ +S +F + A F I E Sbjct: 18 SISYSTYMNLVLYAEGHGYYMKEREKIGRQGDFFTSSNVSSVFAKTFAKFFIRLVE--NG 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P+ F L+ M+E S +Q++ L S+ + Sbjct: 76 EVAPNICEIGGGTGRFAYDVLQEWEQLSPETFIDLNYSMIEMSPFHRKLQQENLCSFSNV 135 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + L +NE FD+ P++ + E + + L Sbjct: 136 SYYRSHNEMGESFEGILFSNELFDAFPVEVIEKRNGILYEVRVTYTEEGKLAEVCRPLHK 195 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------R 247 G FE + ++ I I +DYGY + Sbjct: 196 TIGRYLLKYNIHLAEGQRFEVPIAMEEYIKEIVKWFQRGVC--ITVDYGYTKEEWMHPAH 253 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL +PG+ DL++H+ + + + L + Q +FL G Sbjct: 254 QEGSLRGYYEHKLIRNPLAHPGEMDLTAHIHWDEMKEMFSLQGMNTVWHKKQYEFLLAAG 313 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVEL 357 I ++ + + + R V + +G F +++ + +L Sbjct: 314 ILEQLTNHQDRNPFSETQKQ--NRAVRSMILSGGLGSAFDVVIHTKHMQQL 362 >gi|229071763|ref|ZP_04204978.1| hypothetical protein bcere0025_39320 [Bacillus cereus F65185] gi|228711358|gb|EEL63318.1| hypothetical protein bcere0025_39320 [Bacillus cereus F65185] Length = 370 Score = 186 bits (472), Expect = 5e-45, Method: Composition-based stats. Identities = 70/347 (20%), Positives = 129/347 (37%), Gaps = 17/347 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + GYY G GDF T+ +S +F + A I E Sbjct: 18 SISYSTYMNLVLYTEGHGYYMKEREKIGRQGDFFTSSNVSSVFAKTFAKLFIRLVE--NG 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P F L+ M+E S +Q++ L S+ + Sbjct: 76 EVAPNICEIGGGTGKFAYDVLQEWKQLSPKTFIDLNYSMIEVSPFHRKLQQEHLGSFSNV 135 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + L +NE FD+LP++ + E I + +L + Sbjct: 136 SYYTSYSEMGDSFEGILFSNELFDALPVEIIEKRNGMLYEVRITYTEEGNLSEVCRPLDK 195 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------R 247 + G FE + ++ I+ I +DYGY + Sbjct: 196 RIGRYLLKYNIHIAEGQRFEVPIVMEEYIKEIAKWFQKGIC--ITVDYGYTKEEWMHPAH 253 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL +PG+ DL++H+ + L + + + Q +FL G Sbjct: 254 REGSLRGYYQHKLIRNPLAHPGEMDLTTHIHWDELKEMFSMQGMSAVWHKKQSEFLLAAG 313 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHE 353 I ++ S + R V + +G F +++ + + Sbjct: 314 ILEQLTSHQDTNPFSETQKQ--NRAVRSMILNGGVGSAFDVVIHTKD 358 >gi|229081519|ref|ZP_04214018.1| hypothetical protein bcere0023_41530 [Bacillus cereus Rock4-2] gi|228701826|gb|EEL54313.1| hypothetical protein bcere0023_41530 [Bacillus cereus Rock4-2] Length = 370 Score = 186 bits (471), Expect = 6e-45, Method: Composition-based stats. Identities = 69/347 (19%), Positives = 128/347 (36%), Gaps = 17/347 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + GYY G GDF T+ +S +F + A I E Sbjct: 18 SISYSTYMNLVLYTEGHGYYMKEREKIGRQGDFFTSSNVSSVFAKTFAKLFIRLVE--NG 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P F L+ M+E S +Q++ L S+ + Sbjct: 76 EVATNICEIGGGTGKFAYDVLQEWKQLSPKTFIDLNYSMIEVSPFHRKLQQEHLGSFSNV 135 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + L +NE FD+ P++ + E I + +L + Sbjct: 136 SYYTSYSEMGDSFEGILFSNELFDAFPVEIIEKRNGMLYEVRITYTEEGNLSEVCRPLDK 195 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------R 247 + G FE + ++ I+ I +DYGY + Sbjct: 196 RIGRYLLKYNIHIAEGQRFEVPIVMEEYIKEIAKWFQKGIC--ITVDYGYTKEEWMHPAH 253 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL +PG+ DL++H+ + L + + + Q +FL G Sbjct: 254 REGSLRGYYQHKLIRNPLAHPGEMDLTTHIHWDELKEMFSMQGMSAVWHKKQSEFLLAAG 313 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHE 353 I ++ S + R V + +G F +++ + + Sbjct: 314 ILEQLTSHQDTNPFSETQKQ--NRAVRSMILNGGVGSAFDVVIHTKD 358 >gi|218678243|ref|ZP_03526140.1| hypothetical protein RetlC8_04957 [Rhizobium etli CIAT 894] Length = 257 Score = 186 bits (471), Expect = 6e-45, Method: Composition-based stats. Identities = 116/255 (45%), Positives = 161/255 (63%), Gaps = 6/255 (2%) Query: 110 VLSIYMVETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVM 169 +S+++VETSERL +Q + L YG+KI W+ +VP GFT + ANE FD++PI+QFV Sbjct: 1 TMSVHLVETSERLRDVQSQTLEVYGEKIAWHDGFDEVPSGFTLIAANELFDAIPIRQFVR 60 Query: 170 TEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISD 227 T G RERM+ +D + L F G + L + LGA+FE SP R M +I + Sbjct: 61 TPTGFRERMVGLDANGELTFAAGVAGLDPALLPEPVQNLPLGALFEISPARQAVMMAICE 120 Query: 228 RLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAIL 287 RL GGTA+ IDYG+L + GDTLQAV+ H + PL +PG+ADL+SHVDFQ+L+ A+ Sbjct: 121 RLRAFGGTALAIDYGHLVTGFGDTLQAVRMHEFDPPLAHPGEADLTSHVDFQQLAETALA 180 Query: 288 YKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDI--LLDSVKRLVSTSADKKSMGELF 345 +++NG QG FL GLGI +RA +L + + + +V RL A + MGELF Sbjct: 181 AGVHLNGALHQGDFLTGLGILERAAALGRDREPQTQQVIQTAVDRL--AGAGEGRMGELF 238 Query: 346 KILVVSHEKVELMPF 360 K++ VSH V+LMPF Sbjct: 239 KVMAVSHPAVDLMPF 253 >gi|118498066|ref|YP_899116.1| hypothetical protein FTN_1495 [Francisella tularensis subsp. novicida U112] gi|194323291|ref|ZP_03057075.1| conserved hypothetical protein [Francisella tularensis subsp. novicida FTE] gi|118423972|gb|ABK90362.1| conserved protein of unknown function [Francisella novicida U112] gi|194322655|gb|EDX20135.1| conserved hypothetical protein [Francisella tularensis subsp. novicida FTE] Length = 395 Score = 186 bits (471), Expect = 7e-45, Method: Composition-based stats. Identities = 72/364 (19%), Positives = 140/364 (38%), Gaps = 23/364 (6%) Query: 4 KLIRKIVNLIKKNG-QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGE 61 L I+ IK + + + + + P+ GYYS + GDF+TA + +F Sbjct: 19 SLKNIILERIKSSKQPLLFRDFMQMVLYYPQLGYYSGAKEKISSQGDFITATSQTSLFAR 78 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 A Q G + ++E G G G D + + L +VE S Sbjct: 79 TFAGQFATIISQLG--NDCSVIEFGAGNGKFAADCVDELESLA---ILPKRYIIVELSND 133 Query: 122 LTLIQKKQLASYGDKINW---YTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERM 178 L L Q++ + + + + ANE D++P+ F + + ++ Sbjct: 134 LRLRQQQYIKENVPHLYDRFIWLDKLPAEKIKAIVFANELLDAMPVDIFRSENNKLIQQG 193 Query: 179 IDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFE--NSPCRDREMQSISDRLACD--GG 234 + I ++ ++++ + + G F+ + + ++ L G Sbjct: 194 VIIKGDTFEFSDMPKNDVRFEYESTKILNDGITFDDGYTSEINTWIRPWVKSLREFLSQG 253 Query: 235 TAIVIDYGYLQS------RVGDTL-QAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAIL 287 + DYGY +S R TL + H P +N G+ D+++HVDF ++ AI Sbjct: 254 IVFLCDYGYHRSLYYSKDRYMGTLACYHQHHVNFEPFINIGEQDITAHVDFTTVAEAAIE 313 Query: 288 YKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKI 347 ++G TQ FL+ I + ++ + + +L S + + E+FK+ Sbjct: 314 EGFQLDGFMTQANFLKRANIAEVFSNISQCLSTNQLLKYSND--IKDLLLNDKLAEVFKV 371 Query: 348 LVVS 351 + S Sbjct: 372 MSFS 375 >gi|229152460|ref|ZP_04280652.1| hypothetical protein bcere0011_39980 [Bacillus cereus m1550] gi|228631068|gb|EEK87705.1| hypothetical protein bcere0011_39980 [Bacillus cereus m1550] Length = 370 Score = 186 bits (471), Expect = 7e-45, Method: Composition-based stats. Identities = 76/348 (21%), Positives = 138/348 (39%), Gaps = 19/348 (5%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + GYY G GDF T+ +S +F + A I E Sbjct: 18 SISYSTYMNLVLYTEGHGYYMKEREKIGRQGDFFTSSNVSSVFAKTFAKLFIRLVE--NG 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P+ F L+ M+E S +Q++QL S+ + Sbjct: 76 EVASNICEVGGGTGKFAYDVLQEWKQLSPNTFINLNYSMIEVSPFHRKLQQEQLGSFSN- 134 Query: 137 INWYTSLADVPLGFT-FLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHE 195 +++YTS +++ F L +NE FD+ P++ + E I + +L Sbjct: 135 VSYYTSYSEMGDSFEGILFSNELFDAFPVEVIEKRNGILYEVRITYTEEGNLSEVCRPLN 194 Query: 196 IKSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------ 246 + G FE + ++ IS I +DYGY + Sbjct: 195 KRIGRYLLKYNIHIAEGQRFEVPIVMEEYIKEISKWFQKGIC--ITVDYGYTKEEWMHPA 252 Query: 247 RVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGL 305 +L+ H + +PL +PG+ DL++HV + L + + + Q +FL Sbjct: 253 HREGSLRGYYQHKLMRNPLAHPGEMDLTTHVHWDELKEMFSMQGMSAVWHKKQSEFLLAA 312 Query: 306 GIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHE 353 GI ++ S + R V + +G F +++ + + Sbjct: 313 GILEQLTSHQDTNPFSETQKQ--NRAVRSMILNGGVGGSFDVVIHTKD 358 >gi|88808614|ref|ZP_01124124.1| hypothetical protein WH7805_02952 [Synechococcus sp. WH 7805] gi|88787602|gb|EAR18759.1| hypothetical protein WH7805_02952 [Synechococcus sp. WH 7805] Length = 410 Score = 185 bits (470), Expect = 8e-45, Method: Composition-based stats. Identities = 74/379 (19%), Positives = 146/379 (38%), Gaps = 30/379 (7%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIFGEML 63 L+ ++ G++ + + DP+ G Y + G GDF T+P + + F E+L Sbjct: 11 LLDRLRQ---SGGEVPFSLFMHWALHDPDHGAYGSGRLAVGPEGDFTTSPSLGEDFAELL 67 Query: 64 AIFLICAWEQHGFPSCV---RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 L+ + +V++GPG G + ++ ++ P L +VE + Sbjct: 68 VDQLVDWLQALAEFHPDDRLSVVDVGPGEGTLTAQLIPLLLSKAPGLVDRLDCVLVECNP 127 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGF--TFLVANEFFDSLPIKQFVMTEHGIRERM 178 + L QK++L + ++SL D+ L +VA+E D+L +++ V+ ++ +M Sbjct: 128 GMELRQKQRLGASPAIPCRWSSLEDLRLNPLVGVVVAHELLDALSVERLVLRSGTLQRQM 187 Query: 179 IDIDQHDSLV-----FNIGDHEIKSNFLTCSD----------YFLGAIFENSPCRDREMQ 223 + + S D E+++ F + D G E M+ Sbjct: 188 VRLRDEGSSAQIHLAEGPFDGELRARFQSECDRSGMVIPPVGAEDGWTTEWHASVAPWMR 247 Query: 224 SISDRLACDGG----TAIVIDYGYLQSRVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDF 278 + + A D Y R TL A + L N G D+++H+ Sbjct: 248 DAAAAVKQGVLLVVDYAFEADRYYTCHRSDGTLLAYQQQVATNDVLRNAGTQDITAHLCV 307 Query: 279 QRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADK 338 + + A ++ G QG+ L LG+ +R +L A + + + D Sbjct: 308 DGVVAAAEMHGWMFEGHRRQGEALLALGLAERFSALQSLPAAQLGEALRRRETLLRLVDP 367 Query: 339 KSMGELFKILVVSHEKVEL 357 +G+L + +V + Sbjct: 368 SCLGDL-RWMVFHRQNERP 385 >gi|228910094|ref|ZP_04073914.1| hypothetical protein bthur0013_42430 [Bacillus thuringiensis IBL 200] gi|228849611|gb|EEM94445.1| hypothetical protein bthur0013_42430 [Bacillus thuringiensis IBL 200] Length = 370 Score = 185 bits (469), Expect = 1e-44, Method: Composition-based stats. Identities = 69/347 (19%), Positives = 129/347 (37%), Gaps = 17/347 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + GYY G GDF T+ +S +F + A I E Sbjct: 18 SISYSTYMNLVLYTEGHGYYMKEREKIGRQGDFFTSSNVSSVFAKTFAKLFIRLVE--NG 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P F L+ M+E S +Q+++L S+ + Sbjct: 76 EVASNICEVGGGTGKFAYDVLQEWKQLSPKTFIDLNYSMIEVSPFHRELQQEKLGSFSNV 135 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + L +NE FD+ P++ + E I + +L + Sbjct: 136 SYYTSYSEMGDSFEGILFSNELFDAFPVEIIEKRNGMLYEVRITYTEEGNLSEVCRPLDK 195 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------R 247 + G FE + ++ I+ I +DYGY + Sbjct: 196 RIGRYLLKYNIHIAEGQRFEVPIVMEEYIKEIAKWFQKGIC--ITVDYGYTKEEWMHPAH 253 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL +PG+ DL++H+ + L + + + Q +FL G Sbjct: 254 REGSLRGYYQHKLIRNPLAHPGEMDLTTHIHWDELKEMFSMQGMSAVWHKKQSEFLLAAG 313 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHE 353 I ++ S + R V + +G F +++ + + Sbjct: 314 ILEQLTSHQDTNPFSETQKQ--NRAVRSMILNGGVGSAFDVVIHTKD 358 >gi|228954544|ref|ZP_04116569.1| hypothetical protein bthur0006_39140 [Bacillus thuringiensis serovar kurstaki str. T03a001] gi|228805201|gb|EEM51795.1| hypothetical protein bthur0006_39140 [Bacillus thuringiensis serovar kurstaki str. T03a001] Length = 370 Score = 185 bits (469), Expect = 1e-44, Method: Composition-based stats. Identities = 69/347 (19%), Positives = 128/347 (36%), Gaps = 17/347 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + GYY G GDF T+ +S +F + A I E Sbjct: 18 SISYSTYMNLVLYTEGHGYYMKEREKIGRQGDFFTSSNVSSVFAKTFAKLFIRLVE--NG 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P F L+ M+E S +Q++ L S+ + Sbjct: 76 EVAPNICEIGGGTGKFAYDVLQEWKQLSPKTFIDLNYSMIEVSPFHRKLQQEHLGSFSNV 135 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + L +NE FD+ P++ + E I + +L + Sbjct: 136 SYYTSYSEMGDSFEGILFSNELFDAFPVEIIEKRNGMLYEVRITYTEEGNLSEVCRPLDK 195 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------R 247 + G FE + ++ I+ I +DYGY + Sbjct: 196 RIGRYLLKYNIHIAEGQRFEVPIVMEEYIKEIAKWFQKGIC--ITVDYGYTKEEWMYPAH 253 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL +PG+ DL++H+ + L + + + Q +FL G Sbjct: 254 REGSLRGYYQHKLIRNPLAHPGEMDLTTHIHWDELKEMFSMQGMSAVWHKKQSEFLLAAG 313 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHE 353 I ++ S + R V + +G F +++ + + Sbjct: 314 ILEQLTSHQDMNPFSETQKQ--NRAVRSMILNGGVGSAFDVVIHTKD 358 >gi|229180538|ref|ZP_04307880.1| hypothetical protein bcere0005_38830 [Bacillus cereus 172560W] gi|228602962|gb|EEK60441.1| hypothetical protein bcere0005_38830 [Bacillus cereus 172560W] Length = 370 Score = 184 bits (468), Expect = 1e-44, Method: Composition-based stats. Identities = 68/347 (19%), Positives = 128/347 (36%), Gaps = 17/347 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + GYY G GDF T+ +S +F + A I E Sbjct: 18 SISYSTYMNLVLYTEGHGYYMKEREKIGRQGDFFTSSNVSSVFAKTFAKLFIRLVE--NG 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ ++ P F L+ M+E S +Q++ L S+ + Sbjct: 76 EVAPNICEIGGGTGKFAYDVLQEWKQVSPKTFIDLNYSMIEVSPFHRKLQQEHLGSFSNV 135 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + L +NE FD+ P++ + E I + +L + Sbjct: 136 SYYTSYSEMGDSFEGILFSNELFDAFPVEIIEKRNGMLYEVRITYTEEGNLSEVCRPLDK 195 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------R 247 + G FE + ++ I+ I +DYGY + Sbjct: 196 RIGRYLLKYNIHIAEGQRFEVPIVMEEYIKEIAKWFQKGIC--ITVDYGYTKEEWMHPAH 253 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL +PG+ DL++H+ + L + + + Q +FL G Sbjct: 254 REGSLRGYYQHKLIRNPLAHPGEMDLTTHIHWDELKEMFSMQGMSAVWHKKQSEFLLAAG 313 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHE 353 I ++ S + R V + +G F +++ + + Sbjct: 314 ILEQLTSHQDMNPFSETQKQ--NRAVRSMILNGGVGSAFDVVIHTKD 358 >gi|228923012|ref|ZP_04086305.1| hypothetical protein bthur0011_39930 [Bacillus thuringiensis serovar huazhongensis BGSC 4BD1] gi|228836645|gb|EEM81993.1| hypothetical protein bthur0011_39930 [Bacillus thuringiensis serovar huazhongensis BGSC 4BD1] Length = 370 Score = 184 bits (468), Expect = 1e-44, Method: Composition-based stats. Identities = 69/347 (19%), Positives = 128/347 (36%), Gaps = 17/347 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + GYY G GDF T+ +S +F + A I E Sbjct: 18 SISYSTYMNLVLYTEGHGYYMKEREKIGRQGDFFTSSNVSSVFAKTFAKLFIRLVE--NG 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P F L+ M+E S +Q++ L S+ + Sbjct: 76 EVAPNICEIGGGTGKFAYDVLQEWKQLSPKTFIDLNYSMIEVSPFHRKLQQEHLGSFSNV 135 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + L +NE FD+ P++ + E I + +L + Sbjct: 136 SYYTSYSEMGDSFEGILFSNELFDAFPVEIIEKRNGMLYEVRITYTEEGNLSEVCRPLDK 195 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------R 247 + G FE + ++ I+ I +DYGY + Sbjct: 196 RIGRYLLKYNIHIAEGQRFEVPIVMEDYIKEIAKWFQKGIC--ITVDYGYTKEEWMHPAH 253 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL +PG+ DL++H+ + L + + + Q +FL G Sbjct: 254 REGSLRGYYQHKLIRNPLAHPGEMDLTTHIHWDELKEMFSMQGMSAVWHKKQSEFLLAAG 313 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHE 353 I ++ S + R V + +G F +++ + + Sbjct: 314 ILEQLTSHQDMNPFSETQKQ--NRAVRSMILNGGVGSSFDVVIHTKD 358 >gi|229047978|ref|ZP_04193554.1| hypothetical protein bcere0027_39530 [Bacillus cereus AH676] gi|229111733|ref|ZP_04241281.1| hypothetical protein bcere0018_39780 [Bacillus cereus Rock1-15] gi|228671727|gb|EEL27023.1| hypothetical protein bcere0018_39780 [Bacillus cereus Rock1-15] gi|228723435|gb|EEL74804.1| hypothetical protein bcere0027_39530 [Bacillus cereus AH676] Length = 370 Score = 184 bits (468), Expect = 1e-44, Method: Composition-based stats. Identities = 77/348 (22%), Positives = 138/348 (39%), Gaps = 19/348 (5%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + GYY G GDF T+ +S +F + A I E Sbjct: 18 SISYSTYMNLVLYTEGHGYYMKEREKIGRQGDFFTSSNVSSVFAKTFAKLFIRLVE--NG 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P+ F L+ M+E S +Q++QL S+ + Sbjct: 76 EVASNICEVGGGTGKFAYDVLQEWKQLSPNTFINLNYSMIEVSPFHRKLQQEQLGSFSN- 134 Query: 137 INWYTSLADVPLGFT-FLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHE 195 +++YTS +++ F L +NE FD+ P++ + E I + +L Sbjct: 135 VSYYTSYSEMGDSFEGILFSNELFDAFPVEVIEKRNGILYEVRITYTEEGNLSEVCRPLN 194 Query: 196 IKSNFLTCSDYF---LGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------ 246 + G FE + ++ IS I +DYGY + Sbjct: 195 KRIGRYLLKYNIYIAEGQRFEVPIVMEEYIKEISKWFQKGIC--ITVDYGYTKEEWMHPA 252 Query: 247 RVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGL 305 +L+ H + +PL +PG+ DL++HV + L I + + Q +FL Sbjct: 253 HREGSLRGYYQHKLMRNPLAHPGEMDLTTHVHWDELKEIFSMQGMSAVWHKKQSEFLLAA 312 Query: 306 GIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHE 353 GI ++ S + R V + +G F +++ + + Sbjct: 313 GILEQLTSHQDTNPFSETQKQ--NRAVRSMILNGGVGSSFDVVIHTKD 358 >gi|206971268|ref|ZP_03232219.1| conserved hypothetical protein [Bacillus cereus AH1134] gi|206734040|gb|EDZ51211.1| conserved hypothetical protein [Bacillus cereus AH1134] Length = 370 Score = 184 bits (468), Expect = 1e-44, Method: Composition-based stats. Identities = 68/347 (19%), Positives = 127/347 (36%), Gaps = 17/347 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + GYY G GDF T+ +S +F + A I E Sbjct: 18 SISYSTYMNLVLYTEGHGYYMKEREKIGRQGDFFTSSNVSSVFAKTFAKLFIRLVE--NG 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ ++ P F L+ M+E S +Q++ L S+ + Sbjct: 76 EVAPNICEIGGGTGKFAYDVLQEWKQVSPKTFIDLNYSMIEVSPFHRKLQQEHLGSFSNV 135 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + L +NE FD+ P++ + E I + +L + Sbjct: 136 SYYTSYSEMGDSFEGILFSNELFDAFPVEIIEKRNGMLYEVRITYTEEGNLSEVCRPLDK 195 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------R 247 + G FE + ++ I+ I +DYGY + Sbjct: 196 RIGRYLLKYNIHIAEGQRFEVPIVMEEYIKEIAKWFQKGIC--ITVDYGYTKEEWMHPAH 253 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL +PG+ DL++H+ + L + + + Q FL G Sbjct: 254 REGSLRGYYQHKLIRNPLAHPGEMDLTTHIHWDELKEMFSMQGMSAVWHKKQSDFLLAAG 313 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHE 353 I ++ S + R V + +G F +++ + + Sbjct: 314 ILEQLTSHQDMNPFSETQKQ--NRAVRSMILNGGVGSAFDVVIHTKD 358 >gi|228941424|ref|ZP_04103975.1| hypothetical protein bthur0008_40620 [Bacillus thuringiensis serovar berliner ATCC 10792] gi|228974355|ref|ZP_04134924.1| hypothetical protein bthur0003_41090 [Bacillus thuringiensis serovar thuringiensis str. T01001] gi|228980948|ref|ZP_04141251.1| hypothetical protein bthur0002_41110 [Bacillus thuringiensis Bt407] gi|228778739|gb|EEM27003.1| hypothetical protein bthur0002_41110 [Bacillus thuringiensis Bt407] gi|228785405|gb|EEM33415.1| hypothetical protein bthur0003_41090 [Bacillus thuringiensis serovar thuringiensis str. T01001] gi|228818205|gb|EEM64279.1| hypothetical protein bthur0008_40620 [Bacillus thuringiensis serovar berliner ATCC 10792] gi|326942042|gb|AEA17938.1| putative cytoplasmic protein [Bacillus thuringiensis serovar chinensis CT-43] Length = 370 Score = 184 bits (467), Expect = 2e-44, Method: Composition-based stats. Identities = 71/347 (20%), Positives = 128/347 (36%), Gaps = 17/347 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + GYY G GDF T+ +S +F + A I E Sbjct: 18 SISYSTYMNLVLYTEGHGYYMKEREKIGRQGDFFTSSNVSSVFAKTFAKLFIRLVE--NG 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P F L+ M+E S +Q++ L S+ + Sbjct: 76 EVASNICEVGGGTGKFAYDVLQEWKQLSPKTFIDLNYSMIEVSPFHRKLQQEHLGSFSNV 135 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + L +NE FD+ P++ + E I + SL + Sbjct: 136 SYYTSYSEMGDSFEGILFSNELFDAFPVEIIEKRNGMLYEVRITYTEEGSLSEICRPLDK 195 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------R 247 + G FE + ++ I+ I +DYGY + Sbjct: 196 RIGRYLLKYNIHIAEGQRFEVPLVMEEYVKEIAKWFQKGIC--ITVDYGYTKEEWMHPAH 253 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL +PG+ DL++H+ + L + L + Q +FL G Sbjct: 254 REGSLRGYYQHKLIRNPLAHPGEMDLTTHIHWDELKEMFSLQGMSAVWHKKQSEFLLAAG 313 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHE 353 I ++ S + R V + +G F +++ + + Sbjct: 314 ILEQLTSHQDMNPFSETQKQ--NRAVRSMILNGGVGSAFDVVIHTKD 358 >gi|228967324|ref|ZP_04128359.1| hypothetical protein bthur0004_41270 [Bacillus thuringiensis serovar sotto str. T04001] gi|228792359|gb|EEM39926.1| hypothetical protein bthur0004_41270 [Bacillus thuringiensis serovar sotto str. T04001] Length = 370 Score = 183 bits (464), Expect = 4e-44, Method: Composition-based stats. Identities = 71/347 (20%), Positives = 128/347 (36%), Gaps = 17/347 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + GYY G GDF T+ +S +F + A I E Sbjct: 18 SISYSTYMNLVLYTEGHGYYMKEREKIGRKGDFFTSSNVSSVFAKTFAKLFIRLVE--NG 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P F L+ M+E S +Q++QL S+ + Sbjct: 76 EVASNICEVGGGTGKFAYDVLQEWKQLSPKTFIDLNYSMIEVSPFHRKLQQEQLGSFSNV 135 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + L +NE FD+ P++ + E I + +L + Sbjct: 136 SYYTSYSEMGDSFEGILFSNELFDAFPVEIIEKRNGMLYEVRITYTEEGNLSEVCRPLDK 195 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------R 247 + G FE + ++ I+ I +DYGY + Sbjct: 196 RIGRYLLKYNIHIAEGQRFEVPIVMEEYIKEIAKWFQKGIC--ITVDYGYTKEEWMHPAH 253 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL PG+ DL++H+ + L + L + Q +FL G Sbjct: 254 REGSLRGYYQHKLIRNPLAYPGEMDLTTHIHWDELIEMFSLQGMSAVWHKKQSEFLLAAG 313 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHE 353 I ++ S + R V + +G F +++ + + Sbjct: 314 ILEQLTSHQDTNPFSET--QKKNRAVRSMILNGGVGSAFDVVIHTKD 358 >gi|304403993|ref|ZP_07385655.1| protein of unknown function DUF185 [Paenibacillus curdlanolyticus YK9] gi|304346971|gb|EFM12803.1| protein of unknown function DUF185 [Paenibacillus curdlanolyticus YK9] Length = 403 Score = 183 bits (463), Expect = 5e-44, Method: Composition-based stats. Identities = 81/389 (20%), Positives = 158/389 (40%), Gaps = 45/389 (11%) Query: 3 NKLIRKIVNLIKKNGQ-------MTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPE 54 + KI +I++ G ++ +Y +C+ D ++GYY + G GDF T+ Sbjct: 9 TAIAAKIAEVIQREGAAAAPSKAISFQRYMEICLYDEQWGYYRSGEIRTGVDGDFYTSAA 68 Query: 55 ISQIFGEMLAIFLI-CAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSI 113 I + GE+ A L A Q + + + E G G G M +L+ P++ S L Sbjct: 69 IGGLMGELWARGLRDQAIGQQREAASLHIGEWGAGAGAMAARMLQCWASESPEWLSKLHF 128 Query: 114 YMVETSERL------TLIQKKQLASYGDKINWYTSLADVPLG--------FTFLVANEFF 159 V+ + ++Q + ++ S + +VANE Sbjct: 129 LTVDDHPKHVAATRSRIVQAVTDTGHAPLSMFHFSSTEAFDYLKNVANDASVMIVANELL 188 Query: 160 DSLPIKQFVMTEHGIRERMIDIDQHDSLV-FNIGDHEIKSNFLTCS------DYFLGAIF 212 D++P+ + V E + E + + +++ + F + + L S G Sbjct: 189 DAMPVHRVVRHEGALMELGVCVQENEPGIGFGYAYMPLSTPALAASLEADGIPLAEGQET 248 Query: 213 ENSPCRDREMQSISDRLACDGGTAIVIDYGY------LQSRVGDTLQAVKGHT-YVSPLV 265 E + + + + + G +++DYG+ + R+ TL K H + +P + Sbjct: 249 EINLAAEAWLAQMGRVI--QSGMLLIMDYGHEADEYRAEHRMQGTLLCYKNHVAHNNPFL 306 Query: 266 NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILL 325 P + DL++HV+F + A + + TQ +FL GI + L+ + A +D Sbjct: 307 APSEQDLTAHVNFTAMQRAARRAGWEVAYMETQKQFLIDHGILE----LLAEDAGRDPFS 362 Query: 326 DSVK--RLVSTSADKKSMGELFKILVVSH 352 + K R + +M E FK+++V+ Sbjct: 363 ATAKRNRAIRQLLLSDNMSEAFKVMIVTK 391 >gi|75763950|ref|ZP_00743579.1| Hypothetical cytosolic protein [Bacillus thuringiensis serovar israelensis ATCC 35646] gi|218899426|ref|YP_002447837.1| hypothetical protein BCG9842_B0860 [Bacillus cereus G9842] gi|228902774|ref|ZP_04066920.1| hypothetical protein bthur0014_39460 [Bacillus thuringiensis IBL 4222] gi|74488562|gb|EAO52149.1| Hypothetical cytosolic protein [Bacillus thuringiensis serovar israelensis ATCC 35646] gi|218544755|gb|ACK97149.1| conserved hypothetical protein [Bacillus cereus G9842] gi|228856848|gb|EEN01362.1| hypothetical protein bthur0014_39460 [Bacillus thuringiensis IBL 4222] Length = 370 Score = 182 bits (462), Expect = 7e-44, Method: Composition-based stats. Identities = 71/347 (20%), Positives = 129/347 (37%), Gaps = 17/347 (4%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 ++ Y L + GYY G GDF T+ +S +F + A I E Sbjct: 18 SISYSTYMNLVLYTEGHGYYMKEREKIGRKGDFFTSSNVSSVFAKTFAKLFIRLVE--NG 75 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E+G G G D+L+ +L P F L+ M+E S +Q++QL S+ + Sbjct: 76 EVASNICEVGGGTGKFAYDVLQEWKQLSPKTFIDLNYSMIEVSPFHRKLQQEQLGSFSNV 135 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + L +NE FD+ P++ + E I + +L + Sbjct: 136 SYYTSYSEMGDSFEGILFSNELFDAFPVEIIEKRNGMLYEVRITYTEEGNLSEVCRPLDK 195 Query: 197 KSNFLTCS---DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS------R 247 + G FE + ++ I+ I +DYGY + Sbjct: 196 RIGRYLLKYNIHIAEGQRFEVPIVMEEYIKEIAKWFQKGIC--ITVDYGYTKEEWMHPAH 253 Query: 248 VGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +L+ H +PL PG+ DL++H+ + L+ + L + Q +FL G Sbjct: 254 REGSLRGYYQHKLIRNPLAYPGEMDLTTHIHWDELTEMFSLQGMSAVWHKKQSEFLLAAG 313 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHE 353 I ++ S + R V + +G F +++ + + Sbjct: 314 ILEQLTSHQDTNPFSET--QKKNRAVRSMILNGGVGSAFDVVIHTKD 358 >gi|159903337|ref|YP_001550681.1| hypothetical protein P9211_07961 [Prochlorococcus marinus str. MIT 9211] gi|159888513|gb|ABX08727.1| Conserved hypothetical protein [Prochlorococcus marinus str. MIT 9211] Length = 401 Score = 182 bits (461), Expect = 9e-44, Method: Composition-based stats. Identities = 83/377 (22%), Positives = 161/377 (42%), Gaps = 35/377 (9%) Query: 1 MENKLIR----KIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEI 55 M++KL + + + + G ++ +Y L + DP G YST G GDFVT+P + Sbjct: 1 MDSKLAQCPDWLVHHFTEAGGGLSFCKYMNLALNDPANGAYSTGKINIGIKGDFVTSPSL 60 Query: 56 SQIFGEMLAIFLICAWEQH----GFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVL 111 + FGE+L+ LI +Q F + ++++GPG G + D++ + K P + Sbjct: 61 TPDFGELLSFQLIEWLDQLMASTKFSEKLVVIDIGPGEGDLTFDLIAALQKFSPSMLRRI 120 Query: 112 SIYMVETSERLTLIQKKQLASYGDKINWYTSLAD--VPLGFTFLVANEFFDSLPIKQFVM 169 +VE +E + L QKK+L + + + SL + ++A+E D+LP+++ V Sbjct: 121 QFILVEINEGMKLRQKKKLEQFPSSLIRWASLEELSRTSQVGVIIAHEILDALPVERVVY 180 Query: 170 TEHGIRERMIDIDQH-DSLVFNIGDHEIKSNFL--------------TCSDYFLGAIFEN 214 + + ++ + + + + + D + + G E Sbjct: 181 KNNKLYQQGVKLIEDSGNYFLDYFDLPLPNKLNHFIRDLSDYCKVNIPPDKAAEGWSTEL 240 Query: 215 SPCRDREMQSISDRLACDGGTAIVIDY------GYLQSRVGDTLQAVKGHTY-VSPLVNP 267 + + IS L D G +VIDY Y +R ++ + K + L + Sbjct: 241 HTNLNSWFEKISKSL--DYGLVLVIDYALEAKRYYHVNRDLGSIVSYKNQCCTFNVLKDA 298 Query: 268 GQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDS 327 G D++SH+ + + A + L+ G+ QG+ L LG+ SL + + Sbjct: 299 GLCDITSHLCIESMQIYASKHNLFSKGIVRQGQALLALGLADILSSLAQADNVDLPTVLR 358 Query: 328 VKRLVSTSADKKSMGEL 344 + + D ++G+L Sbjct: 359 RREALLRLVDPIALGDL 375 >gi|317970048|ref|ZP_07971438.1| hypothetical protein SCB02_10951 [Synechococcus sp. CB0205] Length = 361 Score = 182 bits (461), Expect = 1e-43, Method: Composition-based stats. Identities = 72/348 (20%), Positives = 135/348 (38%), Gaps = 24/348 (6%) Query: 25 FALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLV 83 + DP G Y G GDF T+P + F E+L L+ ++Q + + L+ Sbjct: 1 MDWALHDPVHGAYGAGRLRVGPAGDFATSPSLGPDFAELLLPQLLQWFQQQPAEAPLALI 60 Query: 84 ELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSL 143 E GPG G + L + + I + P+ L + +VE + + Q+ LA + + S Sbjct: 61 ETGPGEGHLALQLAQGIAREAPELVGRLELVLVEPNPGMAERQRGLLADAPLRCR-WQSF 119 Query: 144 ADVPLGFT--FLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFL 201 ++ ++A+E D+L +++ + + + ++ + + Sbjct: 120 DELQASPRSGVMLAHEVLDALAVERVIWHNERWCLQGVSLETGADGAAALRLAAGPALED 179 Query: 202 TC----------SDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDY------GYLQ 245 G E ++ + L+ G +VIDY Y Sbjct: 180 PLRQELEALTPGPQRPDGWCTELHVGLRPWLEQAAQSLS--SGQLLVIDYAHEAWRYYAA 237 Query: 246 SRVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEG 304 R TL A + PL+ PGQ DL++H+ + L A+ G QG+ L Sbjct: 238 QRSRGTLMAYRNQQASDDPLLEPGQWDLTAHLCLETLERSALAAGWTPLGQRRQGEALLA 297 Query: 305 LGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 LG+ QR +L + + L S + + D ++G+ F+ + + Sbjct: 298 LGLAQRLHALQQGAPSELAELLSRREALLRFVDPAALGD-FRWMAFAR 344 >gi|308804846|ref|XP_003079735.1| ATP synthase beta subunit/transcription termination factor rho-like (ISS) [Ostreococcus tauri] gi|116058192|emb|CAL53381.1| ATP synthase beta subunit/transcription termination factor rho-like (ISS) [Ostreococcus tauri] Length = 457 Score = 181 bits (459), Expect = 1e-43, Method: Composition-based stats. Identities = 109/447 (24%), Positives = 178/447 (39%), Gaps = 91/447 (20%) Query: 3 NKLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGE 61 +I + + G + V +Y C+ +PE GYY + FG GDFVT+PEISQ+FGE Sbjct: 12 TGMIGHLKRAMAFAGGSIPVSEYVRECLTNPEHGYYMRGDVFGRDGDFVTSPEISQVFGE 71 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 +L ++ E G P +R+VE GPGRG +M D+LR K + S +S++++E S Sbjct: 72 VLGVWAALQHEALGSPGTLRVVEFGPGRGTLMADLLRGTSKFEKFR-SAVSVHLIEVSPA 130 Query: 122 LTLIQKKQLASYG------------DKINWYTSLADVPLGF------------------- 150 L +Q + L ++ + + G Sbjct: 131 LREVQARTLRCVDVETTSAAADDGGARVRVPKNALEAEEGEVDKRSAADGPSGEAHTRGT 190 Query: 151 --------TFLVANEFFDSLP-------------IKQFVMTEHGIRERMIDIDQH----- 184 + E + P ++QF T+ G E+++ ID Sbjct: 191 SEISGAKVFWHDGLESVPNGPTLVICHEFFDALPVRQFQRTDRGWCEKLVTIDAELASTA 250 Query: 185 ---------DSLVFNIGDHEIKSNFLTCSDYFLG---------AIFENSPCRDREMQSIS 226 L + ++ + G + E SP ++ Sbjct: 251 ETVEETTPRRELAMVLSPGPTPASHMLVPRRLKGLPKEQVDSLRLLELSPPSMTLWDKLA 310 Query: 227 DRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAI 286 DR+ + G + IDYG + +G+TL+A+K H +V L +PG+ADLS++VDF L I Sbjct: 311 DRIEKNSGAVLAIDYG-EEGPLGNTLEAIKDHKFVHVLDSPGEADLSAYVDFGALRQIVE 369 Query: 287 LY---KLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDI---LLDSVKRLVSTSAD--- 337 + G TQ + L LG+ R L++ A +D L+ +RLV A Sbjct: 370 EKPQRGVTCYGPVTQQQLLLSLGLVARLEQLVENAASEDQANALVKGCERLVGDGAGNAE 429 Query: 338 ---KKSMGELFKILVVSHEKVELM-PF 360 MG +K + + + F Sbjct: 430 TGEPPGMGVRYKAMCMVSRGLPKPVGF 456 >gi|254247006|ref|ZP_04940327.1| hypothetical protein BCPG_01782 [Burkholderia cenocepacia PC184] gi|124871782|gb|EAY63498.1| hypothetical protein BCPG_01782 [Burkholderia cenocepacia PC184] Length = 344 Score = 181 bits (459), Expect = 2e-43, Method: Composition-based stats. Identities = 82/319 (25%), Positives = 137/319 (42%), Gaps = 28/319 (8%) Query: 48 DFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDF 107 DFVTAPE+S +F + LA + A G R +E G G G + +L + L + Sbjct: 22 DFVTAPELSPLFAQTLAQPVAEALAASG---TRRAMEFGAGTGKLAAGLLAALDALGAEL 78 Query: 108 FSVLSIYMVETSERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLP 163 L +V+ S L Q+ + + K+ W +L + G +V NE D++P Sbjct: 79 DEYL---IVDLSGELRERQRDTIEAAVPALAAKVRWLDALPERFDG--VVVGNEVLDAMP 133 Query: 164 IKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDRE 221 ++ F + RER + +D + VF+ + D G + E Sbjct: 134 VRLFAKADGAWRERGVALDARHAFVFDDRPVGAAGLPAVLAALDVGDGYVTETHEAALAF 193 Query: 222 MQSISDRLACDGGTAIVIDY------GYLQSRVGDT-LQAVKGHTYVSPLVNPGQADLSS 274 +++ L G ++IDY Y R T + + H + P V PG DL++ Sbjct: 194 TRTVCTMLGR--GAVLLIDYGFPAHEYYHPQRDRGTLMCHYRHHAHDDPFVYPGLQDLTA 251 Query: 275 HVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARK-DILLDSVKRLVS 333 HV+F + AI + + G T+Q +FL GI ++ R ++V++L+S Sbjct: 252 HVEFTGIYEAAIATGVDLLGYTSQARFLLNAGITDALAAIDPSDVRAFLPAANAVQKLIS 311 Query: 334 TSADKKSMGELFKILVVSH 352 + MGELFK++ S Sbjct: 312 ----EAEMGELFKVIAFSR 326 >gi|302675134|ref|XP_003027251.1| hypothetical protein SCHCODRAFT_113587 [Schizophyllum commune H4-8] gi|300100937|gb|EFI92348.1| hypothetical protein SCHCODRAFT_113587 [Schizophyllum commune H4-8] Length = 431 Score = 180 bits (457), Expect = 3e-43, Method: Composition-based stats. Identities = 119/434 (27%), Positives = 190/434 (43%), Gaps = 100/434 (23%) Query: 25 FALCVADPEFGYYST--CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRL 82 +LC+ P GYY++ FG GDF+T+PEISQ+FGE++ I+ + + H P +RL Sbjct: 1 MSLCLGHPVHGYYTSSANPVFGKAGDFITSPEISQVFGELIGIWYLTRFSAHPKP-ALRL 59 Query: 83 VELGPGRGIMMLDILRVICKLKPD--FFSVLSIYMVETSERLTLIQKKQLASYGDKINWY 140 VELGPGRG +M DILRV +L +S+++VETS+ + +QK +L+++G ++WY Sbjct: 60 VELGPGRGTLMEDILRVFRQLLSKLPVPPEISVHLVETSQPMRKLQKSKLSAFGHDVHWY 119 Query: 141 TSLADVPLG-----FTFLVANEFFDSLPIKQFVMTEHGIR-ERMIDIDQHDSLVFNI--- 191 S+ DVP FT ++A+EFFD+LPI + + E+++ + S + Sbjct: 120 DSIDDVPQDVDGKTFTMVLAHEFFDALPIDIYQKMDEENFLEKLVTSTEDASGTERLRAV 179 Query: 192 -------------------------GDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSIS 226 + + + GA E SIS Sbjct: 180 PSTMTPKAALLNTAVQTYSAPKKTGFADPLDALARRLAALPAGASAEICWPAWDIAASIS 239 Query: 227 DRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAI 286 L GG +VIDYG + GD+ +A K H VSP PGQ DL+++VDF+ L Sbjct: 240 KLL-RGGGAGLVIDYG-GERMFGDSFRAFKQHKIVSPYETPGQCDLTANVDFKFLRHAFE 297 Query: 287 LYK-------------LYINGLTTQGKFLEGLGIWQRAFSLMKQT---------ARKDIL 324 + + L TQ FL+G+G+ R L+ ++ L Sbjct: 298 SVNHKSDSDNDRPSPPIRTHMLLTQAAFLQGMGVDVRLQKLLDAARREGGEAGKEKEKRL 357 Query: 325 LDSVKRLVSTS----------------------------ADKK--------SMGELFKIL 348 V+RL+ T + + MG+ +K+L Sbjct: 358 RQGVERLIGTGVERLIGTGVERPIGTSMEQPAGKNNVEATNPENGIEGRGSGMGKEYKVL 417 Query: 349 VV-SHEKVELMPFV 361 + + E ++ PF+ Sbjct: 418 GITTGEGEDVWPFI 431 >gi|332184605|gb|AEE26859.1| conserved hypothetical protein [Francisella cf. novicida 3523] Length = 355 Score = 180 bits (456), Expect = 3e-43, Method: Composition-based stats. Identities = 68/342 (19%), Positives = 130/342 (38%), Gaps = 22/342 (6%) Query: 25 FALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLV 83 + + P GYYS GDF+TA + +F A Q G + ++ Sbjct: 1 MQMALYYPLLGYYSGAREKISPQGDFITATSQTSLFARTFARQFATIISQLG--NDCSVI 58 Query: 84 ELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINW---Y 140 E G G G D + + L ++E S L L Q++ + + + Sbjct: 59 EFGAGNGKFAADCINELKSLSNLPK---HYIIIELSNDLRLRQQQYIKENIPHLYDKFIW 115 Query: 141 TSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNF 200 + ANE D++P+ F + + ++ + + + ++ ++++ + Sbjct: 116 LDKLPEQKLKAIVFANELLDAMPVDVFRFMNNKLVQQGVTSNGDNFKFSDMLQNDVRFEY 175 Query: 201 LTCSDYFLGAIFE--NSPCRDREMQSISDRLAC--DGGTAIVIDYGYLQS------RVGD 250 + G F+ + + ++ L G + DYGY +S R Sbjct: 176 ESGKILDDGVTFDDDYTSEINTWIRPWIKSLHEVLSQGIVFLCDYGYHRSLYYSKERYMG 235 Query: 251 TLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQ 309 TL H P +N G+ D+++HVDF ++ AI ++G TQ FL+ GI + Sbjct: 236 TLACYYQHQVNFEPFINIGEQDITAHVDFTTVAEAAIEVGFQLDGYMTQANFLKRAGISE 295 Query: 310 RAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVS 351 + + + K L S + + E+FK++ S Sbjct: 296 VFSDISQHLSAKQYLKYSND--IKDLLLNDKLAEVFKVIAFS 335 >gi|296274298|ref|YP_003656929.1| hypothetical protein Arnit_2774 [Arcobacter nitrofigilis DSM 7299] gi|296098472|gb|ADG94422.1| protein of unknown function DUF185 [Arcobacter nitrofigilis DSM 7299] Length = 336 Score = 179 bits (454), Expect = 6e-43, Method: Composition-based stats. Identities = 75/342 (21%), Positives = 130/342 (38%), Gaps = 21/342 (6%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCV 80 +YF + + GYY+ N G GDF TA S FG +A ++ + P Sbjct: 6 FSEYFNNWLYGED-GYYTKYNAIGKDGDFFTAVSTSIFFGGSIAKKIVDTILDNKLPKNT 64 Query: 81 RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYG---DKI 137 +VE+G G ++ DI++ I L P L+ +VE E L QK+ L S K+ Sbjct: 65 TIVEIGAHHGYLLADIIQFIYTLNPKLLENLNFAIVERFENLKNEQKEYLKSSFGDNIKV 124 Query: 138 NWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIK 197 +Y +++V L F+VANE +D+ + ++ ++ + F E Sbjct: 125 KFYDDISEVKLDHAFIVANEIYDAFACELLYTKNDNLQ---TAFIKNGKIEFEDCLDENI 181 Query: 198 SNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKG 257 G E + + ++I + + + DYG R + + Sbjct: 182 KTKCKKYKITKG---ELALGYEEFAKNICENI--KNFYFLSFDYGEKYPRNDFSCRIYSQ 236 Query: 258 HTYVSPL-------VNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQR 310 H +D++ V+F + + TQ K L GI Sbjct: 237 HQVFPIFEEKLELTKLYKNSDITYDVNFSHVIDSFNSFCNCEVEYQTQLKALVEFGILDL 296 Query: 311 AFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 L K + ++ L + K V T + MG+ FK+L++ Sbjct: 297 LEILQKNVSEENYLKEVQK--VKTLLEPTGMGDRFKMLLIKK 336 >gi|221091699|ref|XP_002170346.1| PREDICTED: similar to CG17726 CG17726-PA, partial [Hydra magnipapillata] Length = 312 Score = 179 bits (453), Expect = 7e-43, Method: Composition-based stats. Identities = 99/310 (31%), Positives = 145/310 (46%), Gaps = 42/310 (13%) Query: 17 GQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 G MTV Y + +P++GYY + FGA GDF T+PEISQ+FGE++ I+ + W Q G Sbjct: 3 GPMTVANYMKEALTNPKWGYYMKNDVFGAKGDFTTSPEISQMFGELIGIWFVAQWIQIGK 62 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQL------ 130 P V+LVELGPGRG +M DILRV+ + P+ S + VE SE++ +QK+ L Sbjct: 63 PCGVQLVELGPGRGTLMADILRVMKQF-PETLSNFEVNFVEVSEKMISLQKQNLDISHEK 121 Query: 131 -----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQF-------------VMTEH 172 G K++W+T + DVP G TF +A+EFFD+LP+ F + + Sbjct: 122 KDFYITPSGTKVSWFTHVQDVPKGLTFYLAHEFFDALPVHLFKLLDLVLSPGLTNIRNVN 181 Query: 173 GIRERMIDIDQH---------------DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPC 217 + D + L + E P Sbjct: 182 PFAQIPPDTMMDFGKFRLFIIGYCQSENELQLVTAPGPSPVAKTFLNSETNADECEVCPE 241 Query: 218 RDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVS-PLVNPGQADLSSHV 276 M IS+ ++ GG A++IDYG S+ +L+ K H V NPG D++++V Sbjct: 242 AAVVMSYISENISMYGGCAMIIDYGESDSQ-RFSLRGYKNHVLVDNIFKNPGSCDITANV 300 Query: 277 DFQRLSSIAI 286 DF L Sbjct: 301 DFGFLKRCIR 310 >gi|123965989|ref|YP_001011070.1| hypothetical protein P9515_07541 [Prochlorococcus marinus str. MIT 9515] gi|123200355|gb|ABM71963.1| Uncharacterized conserved protein [Prochlorococcus marinus str. MIT 9515] Length = 396 Score = 179 bits (453), Expect = 8e-43, Method: Composition-based stats. Identities = 83/371 (22%), Positives = 150/371 (40%), Gaps = 32/371 (8%) Query: 8 KIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEISQIFGEMLAIF 66 I +IKK G ++ Y L + DP GYY + G+ GDFVTAP +S F L+ Sbjct: 12 LIKKIIKKGGTISFYDYMNLVLNDPNNGYYGSGKANLGSKGDFVTAPSMSDDFAFFLSKQ 71 Query: 67 LICAWEQHGFP----SCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + Q + ++E G G G +M +L + FFS +S ++E ++ + Sbjct: 72 IYQWLIQVKSKSVSFDNLSVLEFGAGDGSLMSGLLHYLFIYNKQFFSNVSFIIIEPNKGM 131 Query: 123 TLIQKKQLASYGDKINWYT----SLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERM 178 QK++L Y + + ++ANE D+LP+++ + + I + Sbjct: 132 INKQKEKLEKYLNLGFNIMWRSLEELEDKSLNGVILANEVLDALPVERLINLKGKIYRQG 191 Query: 179 IDIDQHDSLVFNI-------------GDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSI 225 + +D+ +F E + ++ D G E ++++ Sbjct: 192 VSLDKETGRLFFKEIKISKELEKSIVFAKENLNIYIPPKDAPEGWTTEWHTDNKSWLKAV 251 Query: 226 SDRLACDGGTAIVIDY------GYLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDF 278 +++ + G +VIDY Y S TL + K + +PG DL+SH+ Sbjct: 252 YEKI--NNGILLVIDYAKEAKRYYSLSNNNGTLISYKNQKIIEDVFESPGNCDLTSHICI 309 Query: 279 QRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADK 338 + L + G+ QG+ L LG+ +R F + + S + + D Sbjct: 310 ESLIYDSETLGFETMGIVKQGEALLLLGLAERLFEIQNELKDDISKALSRREALLRLVDP 369 Query: 339 KSMGELFKILV 349 +G+ FK V Sbjct: 370 ICLGD-FKWFV 379 >gi|251771853|gb|EES52427.1| conserved protein of unknown function [Leptospirillum ferrodiazotrophum] Length = 376 Score = 179 bits (453), Expect = 9e-43, Method: Composition-based stats. Identities = 86/368 (23%), Positives = 144/368 (39%), Gaps = 33/368 (8%) Query: 19 MTVDQYFALCVADPEFGYYSTCNPFGAVG-DFVTAPEISQIFGEMLAIFLICAWEQHGFP 77 +T ++ A PE GYY+ G G DF+T+PE S F +LA+ + G P Sbjct: 2 VTFCEFMRQAAAGPEGGYYTRHAGIGREGGDFLTSPETSPAFATLLALQIGELDRALGSP 61 Query: 78 SCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYG-DK 136 L+E GPG G +M D+L + + P FF +S + E L Q+++L + Sbjct: 62 DPFYLIEAGPGNGTLMSDLLTIFRRADPAFFERVSPILCELPGVLEARQRERLREFELLH 121 Query: 137 INWYTSLADV-------------PLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 + +L + G ++ NEF D+LP+ + + E + Sbjct: 122 PPRWIALPENLASTSGRAPSDWPEPGQGLVLGNEFLDALPVHRLRIQGGRWEECYVRAGS 181 Query: 184 HDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRL-----ACDGGTAIV 238 GD + + F + Q + L G + Sbjct: 182 EGVWEEIWGDLSDRELSGLLLERFGSDLSGWEGQETEVCQDLPRVLSLLDQCLSSGFMLW 241 Query: 239 IDYGY------LQSRVGDTLQAVKGHTYVS-PLVNPGQADLSSHVDFQRLSSIAILYKLY 291 IDYG + R G TL+A +GH + +PG DL++ VDF +++ + Sbjct: 242 IDYGDIGSEIRSKRRRGGTLRAYRGHQVSDRLIESPGHTDLTAFVDFSQVAGDLAVRGYR 301 Query: 292 INGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVS 351 + G T Q +L GLG + + + + I + T MG +FK+L++S Sbjct: 302 LEGYTDQMSWLMGLGFEEWIRANSDRLSDAAIQEAA------TLVHPLRMGRIFKVLLMS 355 Query: 352 HEKVELMP 359 + P Sbjct: 356 KAMGPMAP 363 >gi|113953923|ref|YP_730636.1| hypothetical protein sync_1431 [Synechococcus sp. CC9311] gi|113881274|gb|ABI46232.1| Uncharacterized conserved protein [Synechococcus sp. CC9311] Length = 392 Score = 178 bits (452), Expect = 1e-42, Method: Composition-based stats. Identities = 76/362 (20%), Positives = 132/362 (36%), Gaps = 35/362 (9%) Query: 25 FALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCV--- 80 A + DPE G Y + G GDFVT+ + F +L L+ Sbjct: 1 MAWALHDPEHGAYGSGQLKIGKGGDFVTSATLGPDFSALLGCQLVQWVRTLALNYPTETL 60 Query: 81 RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGD----- 135 +VE+GPG G + D++ + + PD L + +VET+ + Q+ +L + Sbjct: 61 SIVEVGPGEGELSCDLIDHLAEHLPDLMHRLELVLVETNPGMEQRQRNRLKQHQVSQQAQ 120 Query: 136 --KINWYTSLADVPLGFTF--LVANEFFDSLPIKQFVMTEHGIRERMIDIDQH--DSLVF 189 +TSL+D+ L+A+E D+ P+++ + + +R + + Q Sbjct: 121 PLFPQRWTSLSDLKAKPVIGVLIAHELLDAFPVERLELIDGQLRRQTVQFQQEHVGGGDL 180 Query: 190 NIGDHEIKSNFL--------------TCSDYFLGAIFENSPCRDREMQSISDRLACDGG- 234 + G I + D G E S+ L Sbjct: 181 HWGTEPIPQSLQERMNATLSATQIALPPPDAEDGWTTEWHDACASWFAEASEALIAGHLL 240 Query: 235 ---TAIVIDYGYLQSRVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKL 290 + Y R TL A + S LV+ GQ DL++H+ + + A Sbjct: 241 VVDYVLEAHRYYSARRREGTLMAYRNQRASSSVLVDAGQQDLTAHLCLETMVHQATTNGW 300 Query: 291 YINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVV 350 + G QG+ L LG+ +R SL + + + + + D +GE F+ L Sbjct: 301 SLEGQCRQGEALLALGLAERFSSLQQLPGSQLAEVLQRREALLRLVDPACLGE-FRWLSF 359 Query: 351 SH 352 Sbjct: 360 LR 361 >gi|255020107|ref|ZP_05292178.1| Uncharacterized conserved protein [Acidithiobacillus caldus ATCC 51756] gi|254970469|gb|EET27960.1| Uncharacterized conserved protein [Acidithiobacillus caldus ATCC 51756] Length = 411 Score = 178 bits (452), Expect = 1e-42, Method: Composition-based stats. Identities = 77/362 (21%), Positives = 125/362 (34%), Gaps = 34/362 (9%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIF 59 L +I+ I G ++ Y + P GYY FG GDFVTAPE+ + Sbjct: 69 SAALRGRILARIAAAGGSISFADYLEAVLYTPGLGYYMAGQRRFGPDGDFVTAPEMGPVL 128 Query: 60 GEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 +LA +L ++E G G G + + ++ ++ S Sbjct: 129 ATVLARWLEDY-----RNLGDGILEFGGGSGALARQLRAIL--------PSTPYAFLDRS 175 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMI 179 L Q K L L + G +A+E D+LP + ER + Sbjct: 176 ADLVAQQAKALPEG----RVLQDLPEAWRG--VFLAHEVLDALPFLAVEWDGERLWERRV 229 Query: 180 DIDQHDSLV-FNIGDHEIKSNFLTCSDYFL-GAIFENSPCRDREMQSISDRLACDGGTAI 237 + D E + ++++ E P + + + + LA I Sbjct: 230 AMQGDDFCWTLAPLAAEHAATLRPYAEHWPAPYQTEIRPLAEAWLAAAAASLAEGALVLI 289 Query: 238 -----VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYI 292 +Y + Q R G + P PG DL++HVDF L A L Sbjct: 290 DYGQESSEYYHPQRRQGSLRAYYRHRVLDDPFFLPGLCDLTAHVDFGALRRAAARLGLRE 349 Query: 293 NGLTTQGKFLEGLGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVV 350 +FL G+ + L + L + +KRL + MGE FK+L++ Sbjct: 350 RWYGPLARFLVEGGLAELYPGLAAGRDGRGLLALNNEIKRLTL----PQEMGESFKVLLL 405 Query: 351 SH 352 Sbjct: 406 EK 407 >gi|297584587|ref|YP_003700367.1| hypothetical protein Bsel_2298 [Bacillus selenitireducens MLS10] gi|297143044|gb|ADH99801.1| protein of unknown function DUF185 [Bacillus selenitireducens MLS10] Length = 379 Score = 177 bits (449), Expect = 2e-42, Method: Composition-based stats. Identities = 69/357 (19%), Positives = 133/357 (37%), Gaps = 21/357 (5%) Query: 6 IRKIVNLIKKNGQMTVDQYFALCVADPEFGYYS-TCNPFGAVGDFVTAPEISQIFGEMLA 64 I+ + +G + + D E GYY+ G GDF T+ + +F ++ Sbjct: 5 IQVLKKKTASDGPWRFYDFMDTALYDDEVGYYTVEKTKLGKEGDFYTSNHVHPVFPQVTG 64 Query: 65 IFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTL 124 F + + + E G G G L++LR + P+ F +L+ ++E+S Sbjct: 65 RFFADVFTSTDVDTK--ITEAGAGDGRFALEVLRYFEEHHPNQFRLLTYCIIESSPSHCQ 122 Query: 125 IQKKQLASYGDKINWYTSLADVPLGFT----FLVANEFFDSLPIKQFVMTEHGIRERMID 180 +++ +Y DK+ Y+SL D L +NE FD++P+ E G E +++ Sbjct: 123 QIEEKTEAYADKVRLYSSLQDYFKQEGDIKGILYSNELFDAMPVHLVERREDGWDEILVE 182 Query: 181 IDQHDSLVFNIGDHEIKSNFLTC---SDYFLGAIFENSPCRDREMQSISDRLACDGGTAI 237 D + + + G E +P +Q+ + + Sbjct: 183 YDGDKLFEKKVPCSDTLLLNWLEAFGPELETGYRTEINPDMRHWLQNSLS--GNEPVFIM 240 Query: 238 VIDYGY------LQSRVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKL 290 +DYGY R +++ + H PL PG+ DL+SH+ + ++ Sbjct: 241 TVDYGYRNEEYRHPQRKDGSIRGYRKHELIPDPLETPGKMDLTSHIQWDAYDAVLKKLGF 300 Query: 291 YINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKI 347 Q +FL GI++ + D R + + + F++ Sbjct: 301 QHLAHHRQDQFLLKAGIFKFLRKPGEINPFSDEFKQ--NRAIQSLVTPDGISGSFQV 355 >gi|194694748|gb|ACF81458.1| unknown [Zea mays] Length = 249 Score = 176 bits (446), Expect = 5e-42, Method: Composition-based stats. Identities = 74/200 (37%), Positives = 116/200 (58%), Gaps = 18/200 (9%) Query: 2 ENKLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 E++L++ I ++IK ++G +++ +Y + +P+ G+Y + FG GDF+T+PE+SQ+FG Sbjct: 49 ESELVKHIKSIIKFRSGPISIAEYMEEVLTNPQSGFYINRDVFGESGDFITSPEVSQMFG 108 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 EM+ ++ +C WEQ G P+ V L+ELGPGRG ++ D+LR K LSI +VE S Sbjct: 109 EMIGVWAMCLWEQMGKPAKVNLIELGPGRGTLLADLLRGSAKFANFT-KALSINLVECSP 167 Query: 121 RLTLIQKKQLASYGDK---------------INWYTSLADVPLG-FTFLVANEFFDSLPI 164 L IQ L + + W+ SL VP G T ++A+EF+D+LPI Sbjct: 168 TLQKIQYNTLKCEDEHVGDGKRTVSKICGAPVCWHASLEQVPSGSPTIIIAHEFYDALPI 227 Query: 165 KQFVMTEHGIRERMIDIDQH 184 QF G E+M+DI + Sbjct: 228 HQFQKASRGWCEKMVDIAED 247 >gi|218507142|ref|ZP_03505020.1| hypothetical protein RetlB5_05735 [Rhizobium etli Brasil 5] Length = 271 Score = 176 bits (446), Expect = 5e-42, Method: Composition-based stats. Identities = 122/267 (45%), Positives = 172/267 (64%), Gaps = 4/267 (1%) Query: 72 EQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLA 131 ++HG P+ VRLVE+GPGRG M+ D+LRVI ++ P F +++++VETSERL +Q + L Sbjct: 4 QRHGTPADVRLVEIGPGRGTMVSDMLRVISRIAPPLFDTMTVHLVETSERLRDVQNQTLE 63 Query: 132 SYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNI 191 +YG+KI W+ +VP GFT + ANE FD++PI+QFV T+ G RERM+ +D L F Sbjct: 64 AYGEKIAWHDGFDEVPPGFTLIAANELFDAIPIRQFVRTQTGFRERMVGLDADGELTFAA 123 Query: 192 GDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVG 249 G + L + LG +FE SP R M +I +RL GGTA+VIDYG+L + G Sbjct: 124 GVAGLDPALLPEPVQNLPLGTLFEISPARQAVMMAICERLRAFGGTALVIDYGHLVTGFG 183 Query: 250 DTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQ 309 DTLQAV+ H + PL +PG+ADL+SHVDFQ+L+ A+ L++NG QG FL GLGI + Sbjct: 184 DTLQAVRMHEFDPPLAHPGEADLTSHVDFQQLAETALTSGLHLNGALHQGDFLTGLGILE 243 Query: 310 RAFSLMKQTARKDI--LLDSVKRLVST 334 RA +L + + + +V RL Sbjct: 244 RAAALGRDREPQTQQVIQTAVDRLARR 270 >gi|87303546|ref|ZP_01086329.1| hypothetical protein WH5701_09805 [Synechococcus sp. WH 5701] gi|87281959|gb|EAQ73922.1| hypothetical protein WH5701_09805 [Synechococcus sp. WH 5701] Length = 397 Score = 176 bits (446), Expect = 5e-42, Method: Composition-based stats. Identities = 82/371 (22%), Positives = 138/371 (37%), Gaps = 34/371 (9%) Query: 11 NLIKKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIFGEMLAIFLIC 69 L++ G + Y + DPE G Y G GDF TAP + F +LA LI Sbjct: 10 RLLQAGGSVPFLTYMEWVLNDPEHGAYGAGRLSIGPRGDFATAPSLGPEFAALLAP-LIA 68 Query: 70 AWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQ 129 W Q + LVE GPG G + + + P L + +VE + + Q+++ Sbjct: 69 QWLQDLPQERLSLVETGPGEGSLAAQLAEALAAGWPQLTERLELVLVEPNAGMAARQRQR 128 Query: 130 LASYGDKINWYTSLADVPLGF--TFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ---- 183 LA + ++S ++ ++A+E D+L +++ + R + + + Q Sbjct: 129 LAGSPLPLR-WSSFEEMAAAPLTGVVLAHEVLDALAVERIERSGDHWRRQQVTLQQGTLR 187 Query: 184 ----HDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVI 239 + E S G E P + + + +A G +VI Sbjct: 188 LEPGDPLEPEDEARLEPLGLLPLDSRRPEGWCSELHPGLTPWLAACAAAVAR--GRLLVI 245 Query: 240 DY------GYLQSRVGDTLQAVK-GHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYI 292 DY Y R TL A + PL+ PG DL++H+ + + A Sbjct: 246 DYALEAWRYYAPQRSNGTLMAYRAQRASSDPLLEPGHWDLTAHLCVESVLEAAEAAGWSC 305 Query: 293 NGLTTQGKFLEGLGIWQRAFSLMKQTARK-----------DILLDSVKRLVSTSADKKSM 341 G QG+ L LG+ QR L + L + + + D ++ Sbjct: 306 LGQRRQGEALLALGLAQRLHGLQQPVPSAVRASGSPGSDGLAALLARREALLRLVDPAAL 365 Query: 342 GELFKILVVSH 352 G+ F+ L S Sbjct: 366 GD-FRWLAFSR 375 >gi|319957361|ref|YP_004168624.1| hypothetical protein Nitsa_1627 [Nitratifractor salsuginis DSM 16511] gi|319419765|gb|ADV46875.1| protein of unknown function DUF185 [Nitratifractor salsuginis DSM 16511] Length = 376 Score = 175 bits (443), Expect = 1e-41, Method: Composition-based stats. Identities = 72/362 (19%), Positives = 129/362 (35%), Gaps = 26/362 (7%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGE 61 +N KI N +G + Y + E GYY T G GDF TA S FG Sbjct: 31 KNNPKSKIQNPKSPSGPVPFSTYMNEWLYG-EGGYYKTFRDIGKGGDFYTAVSTSAFFGA 89 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 +A ++ P L+E+G RG ++ D+++ + P + +VE E Sbjct: 90 AIANHFWKGIQEGRIPRDAWLIEIGAHRGYLLADMIQWLYSCDPSLLETMRFGIVERQED 149 Query: 122 LTLIQKKQLASYG---DKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERM 178 + IQ++ A + + SL +V + + V+NE FD+ P + + E + E Sbjct: 150 VRRIQREYFAERFGSGVALEQFASLDEVRVPYALFVSNEIFDAFPCELYKEGEQAVVE-- 207 Query: 179 IDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 + + D E K + + A + + Sbjct: 208 -------NHHISWVDAEDKLKDFAERHRLVKGEIAVGYESFARRVAE----AAEALEFVS 256 Query: 239 IDYGYLQSRVGDTLQAVKGHTYVSPL-------VNPGQADLSSHVDFQRLSSIAILYKLY 291 DYG R +++ + H ++D++ V+F + + Sbjct: 257 FDYGEKYVRNDFSIRVYRKHETFPLFDEELNLAQAYQESDITYDVNFGHVIEAFEGAGMR 316 Query: 292 INGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVS 351 + TQ + L G+ K A+ D L + + + T MG+ FK++ S Sbjct: 317 CDAYETQARALVRYGLVDILEEYAKIAAQADYLRQADR--IKTLIAPTVMGDRFKVVEFS 374 Query: 352 HE 353 Sbjct: 375 KR 376 >gi|332527839|ref|ZP_08403877.1| hypothetical protein RBXJA2T_17851 [Rubrivivax benzoatilyticus JA2] gi|332112234|gb|EGJ12210.1| hypothetical protein RBXJA2T_17851 [Rubrivivax benzoatilyticus JA2] Length = 314 Score = 174 bits (442), Expect = 1e-41, Method: Composition-based stats. Identities = 77/322 (23%), Positives = 129/322 (40%), Gaps = 36/322 (11%) Query: 48 DFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDF 107 DFVTAPE+S +FG A F + + E G G G + + L PD Sbjct: 11 DFVTAPELSPLFG---AAFARQVAQALAAAGAHEVWEFGAGSGAFASQL---LAALPPDT 64 Query: 108 FSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQF 167 ++V+ S L Q ++LA + ++ W+ +L + G +V NE D++P++ Sbjct: 65 ----CYHVVDLSGTLRARQAERLAPFAPRVQWHDTLPEAIEG--VVVGNEVLDAMPVQLL 118 Query: 168 VMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISD 227 ER + D F D + + + G E + +++ Sbjct: 119 HWDGGRWFERGVV---DDGGRFAWADRPTELRPPVETGFLPGTTIELPAQASAFVATLAT 175 Query: 228 RLACDGGTAIVIDY------GYLQSRVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQR 280 RL G A DY Y R G TL + H PLV G+ D+++HVDF Sbjct: 176 RLVR--GAAFFADYGFPEREYYHPQRRGGTLMCHRAHRADTDPLVEVGEKDITAHVDFTA 233 Query: 281 LSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKS 340 ++ L + G T+Q +FL G+++ + ++LV+ + Sbjct: 234 VALAGQDAGLDVIGYTSQARFLANCGLFELLE------GADARTIAHAQKLVT----EHE 283 Query: 341 MGELFKILVVSH--EKVELMPF 360 MGELFK++ + + F Sbjct: 284 MGELFKVIGFARGLPDFAPVGF 305 >gi|330813698|ref|YP_004357937.1| hypothetical protein SAR11G3_00723 [Candidatus Pelagibacter sp. IMCC9063] gi|327486793|gb|AEA81198.1| conserved hypothetical protein [Candidatus Pelagibacter sp. IMCC9063] Length = 300 Score = 174 bits (442), Expect = 1e-41, Method: Composition-based stats. Identities = 90/297 (30%), Positives = 148/297 (49%), Gaps = 11/297 (3%) Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 M+AI+++ W + P + ++ELGPG G M DI+ + K+ F S ++ Y +E S+ Sbjct: 1 MIAIWIVLFWNKIKKPKTLNILELGPGDGTMGKDIISSLGKIN-FFKSKVNYYFLEKSKS 59 Query: 122 LTLIQKKQLASYGDKINWYTSLADVPL-GFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 L IQKK L + I W +L D ++ NEFFD+LP+KQF + E+ + Sbjct: 60 LKKIQKKNLKN-EKNIYWIDNLKDFKKKDNLIILGNEFFDALPVKQFSKSGDSWFEKYVF 118 Query: 181 IDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 + L F ++KS + + E ++ ++ ISD L + Sbjct: 119 FNDKKQLSFVFKKAKLKSIKKIEKIYNLKINKFIEYPLILEKLIKRISDLLKNKNSIFLT 178 Query: 239 IDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQ 298 IDYG DT+QA+ + + L N G++D++ V+F L + KL++ TTQ Sbjct: 179 IDYGEDSRICNDTVQAIYKNNKSNILQNVGESDITYQVNFFHLIKLFKKNKLHLVEFTTQ 238 Query: 299 GKFLEGLGIWQRAFSLMKQTARKDILL--DSVKRLVSTSADKKSMGELFKILVVSHE 353 FL+ LGI +RA + K + LL ++KRL+ MG LFK+L++S++ Sbjct: 239 SNFLQKLGIKERAINAKKNLKKNQQLLLDTALKRLLHPL----EMGSLFKVLIISNQ 291 >gi|148242522|ref|YP_001227679.1| hypothetical protein SynRCC307_1423 [Synechococcus sp. RCC307] gi|147850832|emb|CAK28326.1| Conserved hypothetical protein [Synechococcus sp. RCC307] Length = 394 Score = 174 bits (441), Expect = 2e-41, Method: Composition-based stats. Identities = 83/358 (23%), Positives = 139/358 (38%), Gaps = 26/358 (7%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEISQIFG 60 + L++++ G + + DP +GYY + FG+ GDFVTAP +F Sbjct: 13 SSWLVQRLQ---ASAGPQSFVAVMDWLLNDPAYGYYGSGQVRFGSGGDFVTAPSQGPVFA 69 Query: 61 EMLAIFLICAWEQHGFPS-CVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 E+LA + S + L+E GPG G +M D++ I P + L + +VE+S Sbjct: 70 ELLARQFRPCLDALAAESGPLTLIEWGPGDGQLMRDLIAGIGAESPAWLDRLELVLVESS 129 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPLGFT--FLVANEFFDSLPIKQFVMTEHGIRER 177 L Q++ LA ++ + S + +VA+E D+LP+++F + E Sbjct: 130 PALQARQRQTLAGSAVPVH-WCSPQQLAAEPRRGLIVAHELLDALPVQRFGLQNGNWHEW 188 Query: 178 MIDIDQHDSLVF---------NIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDR 228 ++ +D + + E G E P ++ R Sbjct: 189 LVGLDGQQQPCWEVGAGLAPAVLEQLEQLGLPPGGGGRPDGWSSEWCPALGPWLEQA--R 246 Query: 229 LACDGGTAIVIDY------GYLQSRVGDTLQAVKGHTY-VSPLVNPGQADLSSHVDFQRL 281 G + IDY Y SR G TL A + L +PG+ DL++HV + Sbjct: 247 ACLRQGWLLAIDYAMPASRYYAPSRDGGTLLACREQRTSTELLRDPGRMDLTAHVCTTLV 306 Query: 282 SSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKK 339 +A G QG+ L LG+ Q +L + S + + D Sbjct: 307 EQLAAAAGWRWQGAALQGEVLLQLGLAQEITALSEPGPLALADRLSRREQLLRLVDPH 364 >gi|152990808|ref|YP_001356530.1| hypothetical protein NIS_1063 [Nitratiruptor sp. SB155-2] gi|151422669|dbj|BAF70173.1| conserved hypothetical protein [Nitratiruptor sp. SB155-2] Length = 330 Score = 173 bits (439), Expect = 3e-41, Method: Composition-based stats. Identities = 66/344 (19%), Positives = 121/344 (35%), Gaps = 25/344 (7%) Query: 19 MTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPS 78 M + + DP+ GYY+ G GDF TA ++ +FG +A + + Sbjct: 1 MRFSTFMNEWLYDPD-GYYANQLQIGKSGDFFTAVSVTPLFGGAIAKHIYKQIQTGKLSP 59 Query: 79 CVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYG---D 135 ++E+G +G ++ DI++ + P L +VE +L +QK+ + + Sbjct: 60 QATIMEIGAHQGYLLADIIQFLYTFDPALLKTLQFAIVEPIAKLRSLQKEYMRASFGDAI 119 Query: 136 KINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHE 195 Y SL +V F+VANE FD+ + F I + Sbjct: 120 HFLHYNSLDEVRSIEAFVVANEIFDAFGCEL----------IYQGKQAFVKEDFTIEWKD 169 Query: 196 IKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAV 255 + S F E + + + + + DYG L+ R +++ Sbjct: 170 ADPKIVELSRVFGQNKGEVAVGYEEFAKKMDKAFEK--VEFVTFDYGDLEVRNDFSIRVY 227 Query: 256 KGHTYV-------SPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIW 308 H ++D++ V+F L ++ TQ L GI Sbjct: 228 TKHQVFPFFDEKLDIKRAYKRSDITYDVNFSHLKKAFEETGFVMDSYQTQLAALVEFGIM 287 Query: 309 QRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 + ++++ + L V MGE FK++ S Sbjct: 288 ELLEEILQKKGYE--LYKQELEKVKILIHPSQMGERFKMIQFSK 329 >gi|89255739|ref|YP_513100.1| hypothetical protein FTL_0308 [Francisella tularensis subsp. holarctica LVS] gi|89143570|emb|CAJ78749.1| conserved hypothetical protein [Francisella tularensis subsp. holarctica LVS] Length = 355 Score = 171 bits (434), Expect = 1e-40, Method: Composition-based stats. Identities = 67/342 (19%), Positives = 130/342 (38%), Gaps = 22/342 (6%) Query: 25 FALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLV 83 + + P+ GYYS + GDF+TA + +F A Q G + ++ Sbjct: 1 MQMALYYPQLGYYSRAKEKISSQGDFITATSQTSLFARTFARQFATIISQLG--NDCSVI 58 Query: 84 ELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINW---Y 140 E G G G D + + L +VE S L L Q++ + + + Sbjct: 59 EFGAGNGKFAADCVDELESLA---ILPKRYIIVELSNDLRLRQQQYIKENLSHLYDRFIW 115 Query: 141 TSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNF 200 + ANE D++P+ F + + ++ + ++ ++++ + Sbjct: 116 LDKLPAEKIKAIVFANELLDAMPVDIFRSENNKLIQQGVIRKGDTFEFSDMPKNDVRFEY 175 Query: 201 LTCSDYFLGAIFE--NSPCRDREMQSISDRLAC--DGGTAIVIDYGYLQS------RVGD 250 + G F + + ++ L G + DYGY +S R Sbjct: 176 ESTKILNDGITFNDGYTSEINTWIRPWVKSLREVLSQGIVFLCDYGYHRSLYYSKDRYMG 235 Query: 251 TL-QAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQ 309 TL + H P +N G+ D+++HVDF + AI ++G TQ FL+ I + Sbjct: 236 TLACYHQHHVNFEPFINIGEQDITAHVDFTTVVEAAIEEGFQLDGFMTQANFLKRANIAE 295 Query: 310 RAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVS 351 ++ ++ + +L S + + E+FK++ S Sbjct: 296 VFSNISQRLSTNQLLKYSND--IKDLLLNDKLAEVFKVMAFS 335 >gi|311030842|ref|ZP_07708932.1| hypothetical protein Bm3-1_09906 [Bacillus sp. m3-13] Length = 380 Score = 171 bits (433), Expect = 2e-40, Method: Composition-based stats. Identities = 87/359 (24%), Positives = 146/359 (40%), Gaps = 27/359 (7%) Query: 9 IVNLIKK--NGQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAI 65 + I+K + ++ +Y + P GYY N G GDF+T+ + ++G++ A Sbjct: 11 LQEKIEKAPDRKLNFAEYMMTVLYHPSEGYYMKPKNKVGTKGDFITSSNVHTVYGKLFAK 70 Query: 66 FLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLI 125 L+ + + P ++E+G G G +L L P + LS MVETS + Sbjct: 71 LLVKYFNETDIPP--VIIEIGGGNGRFAKHLLEEFKLLDPLLYRRLSYVMVETSSYHIHL 128 Query: 126 QKKQLASYGDKINWYTSLADVPLGFT--FLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ 183 QK + +N++ SL DVP F L +NE FD+LP+ T ++E + I + Sbjct: 129 QKTTIPDSVP-VNYFISLEDVPDRFRKGVLFSNELFDALPVHVIEYTNGTLKEVFVTIGE 187 Query: 184 HDSLVFNIGDHEIKSNFLTCSDYF----LGAIFENSPCRDREMQSISDRLACDGGTAIVI 239 +L + + S G E + + L GT I + Sbjct: 188 DRNLSERSLPLDNEEIQAYISKNKLIFSEGQRMEIPLAMMHYANILGEWL--LAGTIITV 245 Query: 240 DYGY------LQSRVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYI 292 DYGY Q + +L+ H PL P + DL+SH+ L + +K Sbjct: 246 DYGYRFSELASQDLLEGSLRGYHQHQLVKDPLKYPAEMDLTSHIHLDALEASFDDWKFSH 305 Query: 293 NGLTTQGKFLEGLGIWQRAFSLMKQT--ARKDILLDSVKRLVSTSADKKSMGELFKILV 349 G QG+FL GI + + K +++ L+ S+ S F++L+ Sbjct: 306 VGTMRQGEFLVEAGILEYLQENQDPDPFSEKSKQNRAIRTLIMDSSWSHS----FQVLI 360 >gi|251797894|ref|YP_003012625.1| hypothetical protein Pjdr2_3909 [Paenibacillus sp. JDR-2] gi|247545520|gb|ACT02539.1| protein of unknown function DUF185 [Paenibacillus sp. JDR-2] Length = 385 Score = 171 bits (433), Expect = 2e-40, Method: Composition-based stats. Identities = 76/386 (19%), Positives = 137/386 (35%), Gaps = 40/386 (10%) Query: 2 ENKLIRKIVNLIK---KNGQ------------MTVDQYFALCVADPEFGYYSTCNP-FGA 45 + LI +I I G MT QY +LC+ D + GYY + G Sbjct: 5 KKSLIIRISEQIAGMPAVGWRAGSTGSADLHCMTFKQYMSLCLYDKQDGYYRSGPVRIGR 64 Query: 46 VGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKP 105 GDF T+ + + L ++ + H ++LVE G G G + + I Sbjct: 65 EGDFYTSSAVGAVMAHCLTNYVYD-YAVHSAGGHIQLVEWGAGTGRLTMQIQEAWRAKAE 123 Query: 106 DFFSV-LSIYMVETSERLTLIQKKQLASYGDKINWYTSLAD-------VPLGFTFLVANE 157 ++ +V+ K+ ++S + S + ++ANE Sbjct: 124 QSSALQCRSILVDDHPGHLEEAKRAISSNTSYTALFLSSDEAMQSGRHWQELPAVVLANE 183 Query: 158 FFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSD----YFLGAIFE 213 D+ P+ + + + + E + + D + G E Sbjct: 184 LLDAFPVNRVTVEQGKLVELGVAGNAEDGFYEVHMPLSDERIAAALQKSGIHLTEGQQTE 243 Query: 214 NSPCRDREMQSISDRLACDGGTAIVIDYGY------LQSRVGDTLQAVKGHTYVS-PLVN 266 + ++ + S+++ L G I++DYG+ + R+ TL GH P + Sbjct: 244 VNLDAEQWLASLAEVLEA--GRVIIVDYGHEAEEYTAKHRMKGTLLCYSGHIASDEPYLR 301 Query: 267 PGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLD 326 G+ D+++HV F L A I TQ +FL G ++ + + Sbjct: 302 IGEQDITAHVPFTSLRHAAEAGGWRIAYYDTQKQFLIDHGAFELLQNHAETDPFGQT--A 359 Query: 327 SVKRLVSTSADKKSMGELFKILVVSH 352 + R V M E FK+LV+ Sbjct: 360 RMNRAVRQLLLSDGMSETFKVLVLDK 385 >gi|317128382|ref|YP_004094664.1| hypothetical protein Bcell_1670 [Bacillus cellulosilyticus DSM 2522] gi|315473330|gb|ADU29933.1| protein of unknown function DUF185 [Bacillus cellulosilyticus DSM 2522] Length = 382 Score = 171 bits (432), Expect = 2e-40, Method: Composition-based stats. Identities = 60/350 (17%), Positives = 132/350 (37%), Gaps = 21/350 (6%) Query: 18 QMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 +T+++Y + + D E GYY G GDF T+ + +F + A F + ++ Sbjct: 14 PVTMEEYMRVSLYDEEIGYYMKDRVKLGKEGDFYTSNHVHPVFQKTFARFFLDVIKKEK- 72 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 + E G G G+ ++L + + ++E+S+ + L + ++ Sbjct: 73 -ISPYICEFGAGEGMFAKNVLDYFLHTDEKVYEKMQYIIIESSQYHRAMLLNILDMHKER 131 Query: 137 INWYTSLADVPLGFT----FLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHD-SLVFNI 191 + ++S+ + + + +NE D+ P++ + + E ++D+++ + V Sbjct: 132 VRIFSSMIEAKHCYPHLEGIIFSNELIDAFPVRVVEKHSNQLYEVLVDVNKTEVKEVIVP 191 Query: 192 GDHEIKSNFLTC--SDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQ---- 245 +++L D G FE + + +++ L G + +DYGY Sbjct: 192 CKDNKLTSWLNVYGPDLADGYRFEINLAMREWLIHVNEWLRK--GLVVTVDYGYTNEELN 249 Query: 246 --SRVGDTLQAVKGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 R +L+ H PL P + D++SH+ + I+ L Q +FL Sbjct: 250 REERRLGSLRGYYKHQLIDDPLKYPSEMDMTSHIQWDAFQQISRELNLEEITHEKQDRFL 309 Query: 303 EGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 G++ R + + + F++ V Sbjct: 310 LKAGLFTFLEK--ANHLDPFSESFKQNRAIQSLVYPGGISSSFQVNVQGK 357 >gi|194476603|ref|YP_002048782.1| hypothetical protein PCC_0120 [Paulinella chromatophora] gi|171191610|gb|ACB42572.1| hypothetical protein PCC_0120 [Paulinella chromatophora] Length = 391 Score = 171 bits (432), Expect = 2e-40, Method: Composition-based stats. Identities = 76/371 (20%), Positives = 149/371 (40%), Gaps = 27/371 (7%) Query: 11 NLIKKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIFGEMLAIFLIC 69 L+K G + QY + DP G Y + G GDFVT+P + F ++ + + Sbjct: 10 RLLKHGGNVPFHQYMNWVLHDPIHGAYGNGSLDIGIHGDFVTSPSLDINFAHLIIVQ-VE 68 Query: 70 AWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQ 129 W + ++E GPG G + L + +++ P + + I +VE + + Q Sbjct: 69 QWLASIGDGPLSIIETGPGEGHLALQLAKILYTNCPPLRNRIEIILVEPNRGMNRHQSSY 128 Query: 130 LASYGDKINWYTSLADVPLGF--TFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSL 187 L ++W S ++ ++++E D+L I++ V ++ + + + +++ Sbjct: 129 LTEVPLPLSWI-SFEELIQNPLQGIVISHEVLDALAIERIVYSQSYWYRQRVALSLNNTS 187 Query: 188 VFNI------GDHEIKSNFLTCS------DYFLGAIFENSPCRDREMQSISDRLACDGGT 235 I ++ K N + +Y G E + + ++ G Sbjct: 188 AMKIILMRGEFWNQNKPNLKSLGVNTLITNYQNGWCTELHTELNGWLDICKQGISK--GY 245 Query: 236 AIVIDY------GYLQSRVGDTLQAV-KGHTYVSPLVNPGQADLSSHVDFQRLSSIAILY 288 +VIDY Y R T+ + K PL NPG+ D+++H+ F+ L A Sbjct: 246 LLVIDYLIEAYRYYNIQRSQGTIMSYGKQIAQSDPLKNPGKLDITAHICFETLLGAAANC 305 Query: 289 KLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKIL 348 G QG+ L LG+ Q+ ++L + + + + + +G+ F+ + Sbjct: 306 GWSFTGQIRQGEALLALGLAQKYYALHSDKKKNITTTLASRETLLRLVNPLMLGD-FRWV 364 Query: 349 VVSHEKVELMP 359 SHE P Sbjct: 365 AFSHETATAQP 375 >gi|116626883|ref|YP_829039.1| hypothetical protein Acid_7859 [Candidatus Solibacter usitatus Ellin6076] gi|116230045|gb|ABJ88754.1| protein of unknown function DUF185 [Candidatus Solibacter usitatus Ellin6076] Length = 331 Score = 170 bits (431), Expect = 3e-40, Method: Composition-based stats. Identities = 74/355 (20%), Positives = 128/355 (36%), Gaps = 49/355 (13%) Query: 7 RKIVNLIKKNGQMTVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAI 65 + I+ G ++ ++ + + PE GYY +PFG GDF TA +I +FG ++A Sbjct: 6 ELLAAEIRAVGPVSFRRFMEVALYHPEHGYYRRAADPFGKEGDFFTAEQIQPVFGILMAA 65 Query: 66 FLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLI 125 + + + G P +VELG GR M Sbjct: 66 RIRALFREMGSPPGFSVVELGAGRREMA-------------------------------- 93 Query: 126 QKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHD 185 + A + + L +NEFFD+LP+ + + + ++ Sbjct: 94 --EAFAEWNYVPVDLATGEMPERFTGVLFSNEFFDALPVDAVIYMGEAFHLQHVTVEDGV 151 Query: 186 SLV---FNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 + + + + G +E + R ++ IS L G + IDYG Sbjct: 152 FRWQTGEAVSEEADEYLRRYYLEPEEGRWYEVNLEALRWLERISASLTK--GYVLTIDYG 209 Query: 243 YLQSRV----GDTLQAVKGHTY-VSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTT 297 Y ++ TL + HT L +PG D++SHV+F L L T Sbjct: 210 YTRAESVRFEAGTLMGYRRHTALDDVLTDPGVRDITSHVNFTALEERGAECGLVKERFET 269 Query: 298 QGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 + L G + + + + L +R + MGE F++L+ S Sbjct: 270 LAQSLLAAGEPDQFTAALGTPGTHEEL----RRRMQLKTLLFGMGETFRVLLQSK 320 >gi|313682727|ref|YP_004060465.1| hypothetical protein Sulku_1604 [Sulfuricurvum kujiense DSM 16994] gi|313155587|gb|ADR34265.1| protein of unknown function DUF185 [Sulfuricurvum kujiense DSM 16994] Length = 329 Score = 169 bits (429), Expect = 5e-40, Method: Composition-based stats. Identities = 70/345 (20%), Positives = 126/345 (36%), Gaps = 26/345 (7%) Query: 19 MTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPS 78 M Y + + GYY++ P G GDF TA S+ FG +A +I ++ S Sbjct: 1 MRFSDYMHDWLYGAD-GYYASYRPIGKKGDFYTAVSTSKFFGGTIAKHIISRIDEGFLRS 59 Query: 79 CVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK-- 136 + E+G G ++ DI+ I L+P L +VE E L Q+ Sbjct: 60 DSLICEIGAHHGYLLADIIEFIHTLRPQLLQTLRFGIVERFENLRTAQQSYFDESFGNAV 119 Query: 137 -INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHE 195 + Y+ L+++ F +ANE FD+ + F + E + + F+I D + Sbjct: 120 SLEHYSDLSELNEQSVFFIANEIFDAFGCELFYKGKTARVE-------NHQIHFDIDDPK 172 Query: 196 IKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAV 255 I ++ + E + + ++ + DYG L++R +++ Sbjct: 173 IS----LLAEKYKQDRGEIAVGYESFASAMQK--CSKRFEFMSFDYGELEARSDFSIRVY 226 Query: 256 KGHTYVSPLVN-------PGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIW 308 + H G+ D++ V+F + + TQ L +GI Sbjct: 227 QAHQTFPLFDEALERAEAYGKTDITYDVNFTHVKEAFEEAGIVCAQYATQLVALVEMGIL 286 Query: 309 QRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHE 353 L + + + V MGE FK++ E Sbjct: 287 DLLAILKEHAGEE--IYAQELEKVKILITPSLMGERFKMIRFVKE 329 >gi|254459212|ref|ZP_05072634.1| conserved hypothetical protein [Campylobacterales bacterium GD 1] gi|207084105|gb|EDZ61395.1| conserved hypothetical protein [Campylobacterales bacterium GD 1] Length = 331 Score = 167 bits (422), Expect = 3e-39, Method: Composition-based stats. Identities = 65/346 (18%), Positives = 126/346 (36%), Gaps = 30/346 (8%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCV 80 +Y + + GYY+T G GDF TA S+ FG +A +I ++ Sbjct: 3 FSEYMGEWLYGND-GYYATYKNIGKEGDFYTAVSTSKFFGGSVARHIISLLDEGFLEKDG 61 Query: 81 RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK---I 137 + E+G G + D++ I L+P + L ++E + L Q+ + Sbjct: 62 VICEIGAHHGYFLADVIEFIYTLRPKLLTTLKFVIIERFDDLQEQQRNYFEESFGDAVSL 121 Query: 138 NWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIK 197 Y SL+++ F +ANE FD+ P + + + G + D+ + Sbjct: 122 THYKSLSELKCKNAFFIANEIFDAFPCELYYKGKTGRVDGHNVEFDVDNDWVD------- 174 Query: 198 SNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKG 257 + + E + + + ++ + DYG + +R +++ Sbjct: 175 ----AKAKKYNKDRGEIAIGYEEFAKEMAKSCEK--FEFMSFDYGEMVARPDFSIRVYAK 228 Query: 258 H-------TYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQR 310 H ++ G++D++ V F+ + + GL Q L +GI Sbjct: 229 HKVIPFFEEDINRKELFGKSDITYDVTFEHVKDAFEEAGVEFIGLKAQMVALVDMGILNL 288 Query: 311 AFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 L + K + L K L+ +GE FK++ K Sbjct: 289 LEMLKENVDDKIYEQELQKAKMLIM----PNFLGERFKMIRFRKNK 330 >gi|225631051|ref|ZP_03787792.1| hypothetical protein WUni_002380 [Wolbachia endosymbiont of Muscidifurax uniraptor] gi|225591250|gb|EEH12391.1| hypothetical protein WUni_002380 [Wolbachia endosymbiont of Muscidifurax uniraptor] Length = 163 Score = 167 bits (422), Expect = 3e-39, Method: Composition-based stats. Identities = 68/166 (40%), Positives = 105/166 (63%), Gaps = 4/166 (2%) Query: 5 LIRKIVNLI-KKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 ++ I LI K G +++ + + ++GYY++ P G GDF TAPEISQ+FGE++ Sbjct: 1 MLTNIHELIDKSQGSISISDFMNAVLYHEKYGYYTSKLPLGKDGDFTTAPEISQLFGEVI 60 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 A++++ WE+ G PS LVELGPG+G ++ DI+RV + FF+ + I++VE S L Sbjct: 61 AVWIMHTWEKLGKPSKFSLVELGPGKGTLIHDIIRVTK-IYSSFFNSMLIHLVEISPTLR 119 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVM 169 IQK++L S +NW+ ++ +P T +A EFFD+LPI QFV Sbjct: 120 KIQKEKLKSL--DVNWHKNIDYLPEQPTIFLAYEFFDALPIDQFVY 163 >gi|145608734|ref|XP_369899.2| hypothetical protein MGG_06414 [Magnaporthe oryzae 70-15] gi|145016173|gb|EDK00663.1| hypothetical protein MGG_06414 [Magnaporthe oryzae 70-15] Length = 460 Score = 166 bits (420), Expect = 5e-39, Method: Composition-based stats. Identities = 94/422 (22%), Positives = 157/422 (37%), Gaps = 94/422 (22%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGE 61 L ++ I G + + + +SQ+FGE Sbjct: 64 STPLAEQLAAAILTTGPIPLASFM-----------------------------LSQVFGE 94 Query: 62 MLAIFLICAWEQHGFPSC-VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 ++AI+ + W G PS V L+ELGPGRG +M D+LR I + S+ +IYMVE S Sbjct: 95 LVAIWFVAEWMSQGRPSRGVELMELGPGRGTLMSDVLRTIKRFGDMSNSLDAIYMVEASP 154 Query: 121 RLTLIQKKQLASYGDK-----INWYTSLADVPLGFTFLVANEFFDSLPIKQ--------- 166 L QK L I +++ L + + +P ++ Sbjct: 155 ELRKAQKNLLCGEDAPLTESEIGYHSVCKQTQLPIVWTETVQSIPKIPSEKSAGTTNPSS 214 Query: 167 ----FVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFL-------------- 208 + + RE ++ S ++ + D+ L Sbjct: 215 APETTLKSTLEWRELVVSPTPPGSTHESLNTPATQRWDTPPPDFQLTLSSLATRHSRYLP 274 Query: 209 -------------GAIFENSPCRDREMQSISDRL--------ACDGGTAIVIDY-GYLQS 246 A E P I+ R+ G A+++DY + Sbjct: 275 DSSPRYRAMKKVADAQIEVCPDASLYAGDIASRIGGSVQNPKPRPSGAALILDYGPGDGT 334 Query: 247 RVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAI--LYKLYINGLTTQGKFLEG 304 ++L+ ++ H VSP V PG DLS+ VDF ++ A+ + ++G QG FLE Sbjct: 335 IPVNSLRGIRRHQRVSPFVEPGLTDLSADVDFAAVAEAAMRASEGVEVHGPVPQGYFLEA 394 Query: 305 LGIWQRAFSLMKQTARKDILLD---SVKRLVSTSADKKSMGELFKILVVSHEKVE---LM 358 +GI QRA +L+K D + KRLV ++ MG++++ L + E + Sbjct: 395 MGIKQRADNLIKNCKEPSKAADVDRAWKRLVDRGSN--GMGKIYQALAILPENDGKRRPV 452 Query: 359 PF 360 F Sbjct: 453 GF 454 >gi|15606056|ref|NP_213433.1| hypothetical protein aq_622 [Aquifex aeolicus VF5] gi|2983237|gb|AAC06834.1| hypothetical protein aq_622 [Aquifex aeolicus VF5] Length = 320 Score = 166 bits (420), Expect = 5e-39, Method: Composition-based stats. Identities = 93/340 (27%), Positives = 145/340 (42%), Gaps = 30/340 (8%) Query: 18 QMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFP 77 ++ + V D YYS+ DF TAPE+ + FGE LA F + P Sbjct: 3 FISFRDFMEKAVKD----YYSSQRAL---KDFFTAPELDRAFGEALAEFFSQHLSEFERP 55 Query: 78 SCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKI 137 + LVELG GRG++ DIL PD F+ L Y+ E S L Q++ L ++ Sbjct: 56 A---LVELGAGRGLLAYDILNYYRANYPDLFNRLKYYIYEFSPYLISKQREVLKNF---- 108 Query: 138 NWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIK 197 + ++P + +NEFFD+LP+ G +E +D ++ + ++ + ++K Sbjct: 109 KNVEWVEELPKVEGIVFSNEFFDALPVHIVK----GGKELYVD-EKGSEVWLSLENEKVK 163 Query: 198 SNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYL----QSRVGDTLQ 253 + L I E ++ IS+ L G VIDYGY T+ Sbjct: 164 EFLRRMNYENLNQIVEVCLDCIDMLKKISESLVE--GYHFVIDYGYTSEEITKYPEGTVV 221 Query: 254 AVKGHTY-VSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAF 312 A K H L G AD+++HV+F L + L +Q FL + I Sbjct: 222 AYKEHKVVKDFLKEAGNADITAHVNFSALMEYGRDFGLETILFQSQRDFL--MHIPTFLN 279 Query: 313 SLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 L K + + +SV+RL SMG+ FK+L+ Sbjct: 280 ELEKLSWEES--AESVERLSRLKTMLISMGDRFKVLLQRK 317 >gi|299470317|emb|CBN78367.1| conserved unknown protein [Ectocarpus siliculosus] Length = 494 Score = 166 bits (420), Expect = 6e-39, Method: Composition-based stats. Identities = 79/217 (36%), Positives = 118/217 (54%), Gaps = 34/217 (15%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGE 61 L +++ +I NG MTV +Y + P++GYY + G GDF+TAPEISQ FGE Sbjct: 16 TGLAKELEQMIVLNGPMTVPEYMIYALQHPKYGYYMRQEDKIGRGGDFITAPEISQTFGE 75 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 M+ I+ + +W++ G P RLVELGPG+G +M+DILR + PDF LS++MVETS+ Sbjct: 76 MIGIWCVASWKEMGSPEEFRLVELGPGKGTLMVDILRTVSSF-PDFRKALSLHMVETSDD 134 Query: 122 LTLIQKKQLASYGDK-------------------------------INWYTSLADVPLG- 149 L +Q K L + + W+T++ VP G Sbjct: 135 LRALQVKALGATFAPTASYSASRGGGGGGGAKEVGGSPMLLPGGGEVVWHTNIEQVPKGQ 194 Query: 150 FTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDS 186 + +A EF D+LP+ QF TE+G RER++D++ + Sbjct: 195 PSLFIAQEFLDALPVHQFQYTENGWRERLVDVNVPGA 231 Score = 102 bits (255), Expect = 6e-20, Method: Composition-based stats. Identities = 49/162 (30%), Positives = 86/162 (53%), Gaps = 12/162 (7%) Query: 208 LGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNP 267 +G E S ++ +++R+A D G A+ +DYG + +GD+L+ KGH V L +P Sbjct: 330 VGDRLEVSGESILLVKGVAERIAQDRGGALFVDYGEAHA-LGDSLRCFKGHEEVPVLSDP 388 Query: 268 GQADLSSHVDFQRLSSI-AILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQT----ARKD 322 G+AD+++ V+F L + A + +G QG+FL +GI R +L +Q + D Sbjct: 389 GEADMTADVNFGLLRRVVAGVEGARPHGPVGQGQFLREMGIGARLTALAEQPHVTAEQAD 448 Query: 323 ILLDSVKRLVSTSADKKSMGELFKILVVS--HEKVELMPFVN 362 +L+ RLV D MG +K+L +S +++ F++ Sbjct: 449 AMLEGYVRLV----DPAQMGVRYKVLGISEERQEMPPPGFMS 486 >gi|307721287|ref|YP_003892427.1| hypothetical protein Saut_1368 [Sulfurimonas autotrophica DSM 16294] gi|306979380|gb|ADN09415.1| protein of unknown function DUF185 [Sulfurimonas autotrophica DSM 16294] Length = 330 Score = 166 bits (419), Expect = 7e-39, Method: Composition-based stats. Identities = 65/344 (18%), Positives = 118/344 (34%), Gaps = 30/344 (8%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCV 80 +Y + + GYY+T G GDF TA S+ FG +A +I ++ Sbjct: 3 FSEYMTEWLYGED-GYYATYKNIGKSGDFYTAVSTSKFFGGTIAKHIISLVDEGFLQKDA 61 Query: 81 RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK---I 137 + E+G G + D+ I L+P+ S ++E + L Q++ + Sbjct: 62 VICEIGAHHGYFLADVCEFIYTLRPELLSTFQFVIIERFDDLQKYQREYFGESFGDAVAL 121 Query: 138 NWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIK 197 Y SL ++ F +ANE FD+ P + + E E + Sbjct: 122 THYKSLNELQCENAFFIANEIFDAFPCELYFKGESARVENHEVLFDVKDDW--------- 172 Query: 198 SNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKG 257 + + E + + + +++ I DYG +Q+R +L+ Sbjct: 173 --VQIKAKKYHKDRGEIAVGYEEFAKEMANAAKK--FEFISFDYGEMQARPDFSLRIYTK 228 Query: 258 HTYVSPLVN-------PGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQR 310 H G++D++ V F + + Q L +GI + Sbjct: 229 HEVHPFFEEGLNRAELFGKSDITYDVTFLHVKDAYEEAGVEFVEFKAQMVALVDMGILEL 288 Query: 311 AFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVSH 352 L K L+ K L+ +GE FK++ Sbjct: 289 LEMLKANADEKIYKQELEKAKMLIM----PNFLGERFKMIKFRK 328 >gi|268679511|ref|YP_003303942.1| hypothetical protein Sdel_0876 [Sulfurospirillum deleyianum DSM 6946] gi|268617542|gb|ACZ11907.1| conserved hypothetical protein [Sulfurospirillum deleyianum DSM 6946] Length = 341 Score = 165 bits (417), Expect = 1e-38, Method: Composition-based stats. Identities = 72/356 (20%), Positives = 129/356 (36%), Gaps = 40/356 (11%) Query: 19 MTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPS 78 M Y + E GYY+ G GDF TA S FG +A LI E+ Sbjct: 1 MRFSDYMHEWLYGKE-GYYTKERTIGKEGDFYTAVSTSMFFGGSIAKRLISTIEEGFLSP 59 Query: 79 CVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKIN 138 +VE+G +G ++ D+++ I L+P L+ +VE ++QK+ + Sbjct: 60 NTYVVEIGAHKGYLLADMIQFIYTLQPTLLKSLTFVIVEPFAANAMMQKRYFQESFGEAI 119 Query: 139 ---WYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHE 195 SL ++ + F VANE FD+ P + + + E ++ + F D Sbjct: 120 SLLHVKSLEELCVDEAFFVANEIFDAFPCEVIYNDKMLMVE-------NEKVSFETMDTF 172 Query: 196 IKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAV 255 ++ + + E + S++ + I DYG ++R +L+ Sbjct: 173 T----CKKAEAYGVSKGELCLEYEAFATSMAR--SAKRFEFISFDYGDKEARGDFSLRVY 226 Query: 256 KGHTYVSPL-----------------VNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQ 298 H ++D++ V F L + + ++G TQ Sbjct: 227 ANHQVYPFFGLSDLVEDPLREQKSFVEYFAKSDITYDVTFNHLFKAFEMADIRLHGYMTQ 286 Query: 299 GKFLEGLGIWQRAFSLMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVSH 352 K L G+ + K + ++ +K D MGE FK++ Sbjct: 287 MKALVDFGLIELLELFSKHVTSAVYEKEMNRIK----PLIDPSFMGERFKMVSFRK 338 >gi|255644540|gb|ACU22773.1| unknown [Glycine max] Length = 246 Score = 165 bits (417), Expect = 1e-38, Method: Composition-based stats. Identities = 66/185 (35%), Positives = 110/185 (59%), Gaps = 20/185 (10%) Query: 2 ENKLIRKIVNLIK-KNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 +++L++ + +IK + G +++ +Y + + +P+ GYY + FGA GDF+T+PE+SQ+FG Sbjct: 63 DSELVKHLKGIIKFRGGPISLGEYMSEVLTNPKAGYYINRDVFGAEGDFITSPEVSQMFG 122 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 EM+ ++++C WEQ G P V LVELGPGRG +M D+LR K K +F L +++VE S Sbjct: 123 EMVGVWVMCLWEQMGQPQGVNLVELGPGRGTLMADLLRGASKFK-NFIESLHVHLVECSP 181 Query: 121 RLTLIQKKQLAS------------------YGDKINWYTSLADVPLGFTFLVANEFFDSL 162 L +Q + L +G ++W+ +L ++A+EFFD+L Sbjct: 182 ALQKLQHQNLKCTDEENASQDTDTRTARSLFGTPVSWHATLEQFLQIANIIIAHEFFDAL 241 Query: 163 PIKQF 167 P+ QF Sbjct: 242 PVHQF 246 >gi|167835157|ref|ZP_02462040.1| Uncharacterized ACR, COG1565 superfamily protein [Burkholderia thailandensis MSMB43] Length = 315 Score = 164 bits (414), Expect = 2e-38, Method: Composition-based stats. Identities = 70/309 (22%), Positives = 122/309 (39%), Gaps = 24/309 (7%) Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F LA + G R++E G G G ++ L + + Sbjct: 1 SPLFARTLARPVAQ---ALGASGTRRVMEFGAGTG---KLAAGLLNALAALGVELDEYAI 54 Query: 116 VETSERLTLIQKKQLASYGD----KINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 V+ S L Q++ LA++ ++ W +L + G +V NE D++P++ Sbjct: 55 VDLSGELRARQRETLAAHAPGLAARVRWLDALPERFEG--VVVGNEVLDAMPVQLVAKQA 112 Query: 172 HGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCS--DYFLGAIFENSPCRDREMQSISDRL 229 HG RER + +D + VF + + D G + E +++ L Sbjct: 113 HGWRERGVSVDDAGAFVFADRPLARAEDAARLAELDADEGYVTETHEAAAAFVRTACAML 172 Query: 230 ACDGGTAIVIDYG----YLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSI 284 A I + Y + R TL H P V PG D+++HV+F + Sbjct: 173 ARGAAFFIDYGFPGHEYYHRQRAQGTLMCHYRHRAHGDPFVYPGLQDITAHVEFSAIYEA 232 Query: 285 AILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARK-DILLDSVKRLVSTSADKKSMGE 343 + + G T+Q +FL GI + A+ ++V++L+S + MGE Sbjct: 233 GVGAGADLLGYTSQARFLLNAGITDVLAEIDPSDAQHFLPAANAVQKLIS----EAEMGE 288 Query: 344 LFKILVVSH 352 LFK++ S Sbjct: 289 LFKVIAFSR 297 >gi|109947288|ref|YP_664516.1| hypothetical protein Hac_0715 [Helicobacter acinonychis str. Sheeba] gi|109714509|emb|CAJ99517.1| conserved hypothetical protein [Helicobacter acinonychis str. Sheeba] Length = 336 Score = 164 bits (414), Expect = 3e-38, Method: Composition-based stats. Identities = 68/345 (19%), Positives = 121/345 (35%), Gaps = 25/345 (7%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + Y + + GYY G GDF T+ +S+ FG +A ++I E+ Sbjct: 3 SFGNYMQEWLYSEK-GYY-RKAVIGQKGDFYTSVSLSKFFGGAIAFYIIKLLEKEKLFLP 60 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQ---KKQLASYGDK 136 +++VE+G G + DI + L E L IQ KQ Sbjct: 61 LKIVEIGSHHGHFLSDIANFLKALSVGVIEKCEFVSCEPLIELQNIQQITFKQATQLDLI 120 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 L F+V+NE FD+ + +M+ I V++ D Sbjct: 121 GCDLKELDFKENESAFVVSNELFDAFACEIIKDN------QMLFITHDHQSVWSHVDEPT 174 Query: 197 KSNFLTCSDYFLGAIFENSPCRDREMQSISDRLAC-DGGTAIVIDYGYLQSRVGDTLQAV 255 K T + G + ++ + ++L + DYG R L+A Sbjct: 175 KELLKTL-NLKEGC---APLFLNAFIKDLLEKLDKASSWVFLSFDYGDTIERKDMHLRAF 230 Query: 256 KGHTYVSPLV-------NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIW 308 K H + Q+DL+ V+F + S+ + + +Q L +G+ Sbjct: 231 KNHQVLDFKDILNHLASLYQQSDLTYDVNFSLVRSLFEKHHAEFSFFKSQANALLDMGLM 290 Query: 309 QRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHE 353 + + K + + L ++ K + +GE FK L + Sbjct: 291 ELLETFSKSVSYERYLKEAAK--IKPLISPGGLGERFKALEFVKK 333 >gi|118399106|ref|XP_001031879.1| hypothetical protein TTHERM_00721470 [Tetrahymena thermophila] gi|89286214|gb|EAR84216.1| hypothetical protein TTHERM_00721470 [Tetrahymena thermophila SB210] Length = 1651 Score = 164 bits (414), Expect = 3e-38, Method: Composition-based stats. Identities = 92/424 (21%), Positives = 163/424 (38%), Gaps = 82/424 (19%) Query: 10 VNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLIC 69 K+ G +++++Y+ + + D + G + A GDF+T+ EISQ+FGE+ AI L Sbjct: 1230 KKQTKEKGPISMNEYWNISLLDEKHG--RKNDVITAKGDFITSVEISQMFGEIKAIDLQS 1287 Query: 70 AWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQ 129 ++ L+E GPGRG +M DI+RV+ + D + I VE S + +Q+++ Sbjct: 1288 Q-KRDRSKKRFSLLEFGPGRGTLMSDIIRVLAQF--DLLDGIEINFVEFSPFMRKLQQEK 1344 Query: 130 L---------------------------ASYGDKINWYTSLADVPLG------------- 149 + D+ Sbjct: 1345 VVKELQNRGIYMTYNVDKAKRSQVEEFRCEDHDRFVCLRWFKMYENMLFEDFGDYALQQL 1404 Query: 150 ---------FTFLVANEFFDSLPIKQFVMTEHG-IRERMIDIDQH--------------- 184 + A+EFFD+LP FV + E++++I Sbjct: 1405 DKEHAKTLTPIVVFAHEFFDALPANVFVYQNNYGWCEKLVNICHDPSKMRNFEYITTDGP 1464 Query: 185 DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYL 244 + V I + + G E P S+++ ++ G + IDYG Sbjct: 1465 NENVKKILRPNVSFTEEQKKNIKHGDQIEVQPKSLVITNSLAELISKRNGAMLAIDYGEN 1524 Query: 245 QSRVGDTLQAVKGHTYV---SPLVNPGQADLSSHVDFQRLSSIAILY-KLYINGLTTQGK 300 D+++ ++ H ++ L PG+ DLS++V+F LS A + QG Sbjct: 1525 -QAFSDSIRGIRRHKFIKNEDILEYPGEIDLSAYVNFAHLSQAAKKVPGISTPNPIPQGL 1583 Query: 301 FLEGLGIWQRAFSLMKQTAR--KDILLDSVKRLVSTSADKKSMGELFKILVVS-HEKVEL 357 FLE +G+ R L +Q + + RLV+ + MG +K + + E+ Sbjct: 1584 FLESMGLNTRLEMLCRQVNQIKQKQFEQEYFRLVA----PEEMGGTYKAFYMGLEKNGEI 1639 Query: 358 MPFV 361 PF+ Sbjct: 1640 FPFI 1643 >gi|308062132|gb|ADO04020.1| hypothetical protein HPCU_04300 [Helicobacter pylori Cuz20] Length = 336 Score = 163 bits (413), Expect = 3e-38, Method: Composition-based stats. Identities = 68/346 (19%), Positives = 119/346 (34%), Gaps = 25/346 (7%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + Y + + GYY G GDF T+ +S+ FG +A +++ E+ Sbjct: 3 SFGNYMQEWLYGEK-GYY-KKALIGQKGDFYTSVSLSKFFGGAVAFYIVKLLEEEKLFLP 60 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQ---KKQLASYGDK 136 +++VE+G G + DI + L E + L +Q KQ Sbjct: 61 LKIVEIGAHHGHFLSDIANFLNALSVGVMEQCEFVSCEPLKELQKLQRTIFKQATQLDLM 120 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 I L F+V+NE FD+ + +M+ I V+ D Sbjct: 121 ICDLKDLDFKGNESAFVVSNELFDAFACEIIKDN------KMLFITHDHKGVWGAIDEPT 174 Query: 197 KSNFLTCSDYFLGAIFENSPCRDREMQSISDRLAC-DGGTAIVIDYGYLQSRVGDTLQAV 255 K D G + ++ + ++L + DYG R L+A Sbjct: 175 KELLKNL-DLKQGC---APLFLEAFIKDLLEKLDEASSWVFLSFDYGDEIERKDMHLRAF 230 Query: 256 KGHTYVSPLV-------NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIW 308 K H + ++DL+ V+F + + + + TQ L +G+ Sbjct: 231 KNHQALDFKDILNNLASLYQKSDLTYDVNFSLVRFLFEKHHAKFSFFKTQANALLDMGLM 290 Query: 309 QRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 + + K K L +S K + +GE FK L + Sbjct: 291 ELLETFSKSVGYKRYLKESAK--IKPLISPGGLGERFKALEFVKKN 334 >gi|188527343|ref|YP_001910030.1| hypothetical protein HPSH_02750 [Helicobacter pylori Shi470] gi|188143583|gb|ACD48000.1| hypothetical protein HPSH_02750 [Helicobacter pylori Shi470] Length = 336 Score = 163 bits (412), Expect = 5e-38, Method: Composition-based stats. Identities = 65/346 (18%), Positives = 120/346 (34%), Gaps = 25/346 (7%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + Y + + GYY G GDF T+ +S+ FG +A +++ E+ Sbjct: 3 SFGNYMQEWLYGEK-GYY-KKALIGQKGDFYTSVSLSKFFGGAVAFYIVKLLEEEKLFLP 60 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQ---KKQLASYGDK 136 +++VE+G G + DI + L E + L +Q KQ Sbjct: 61 LKIVEIGAHHGHFLSDIANFLNALSVGVMEQCEFVSCEPLKELQKLQRTIFKQATQLDLM 120 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 I L F+V+NE FD+ + +M+ I V+ D Sbjct: 121 ICDLKDLDFKGNESAFVVSNELFDAFACEIIKDN------KMLFITHDHKGVWGGIDEPT 174 Query: 197 KSNFLTCSDYFLGAIFENSPCRDREMQSISDRLAC-DGGTAIVIDYGYLQSRVGDTLQAV 255 K D G + ++ + ++L + DYG R L+A Sbjct: 175 KELLKNL-DLKQGC---APLFLEAFIKDLLEKLDEASSWVFLSFDYGDEIERKDMHLRAF 230 Query: 256 KGHTYVSPLV-------NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIW 308 K H + ++DL+ V+F + + + + +Q L +G+ Sbjct: 231 KNHQALDFKDILNNLASLYQKSDLTYDVNFSLVRFLFEKHHAKFSFFKSQANALLDMGLM 290 Query: 309 QRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 + + K + + L ++ K + +GE FK L + Sbjct: 291 ELLETFSKSASYERYLKEAAK--IKPLISPGGLGERFKALEFVKKN 334 >gi|152992188|ref|YP_001357909.1| hypothetical protein SUN_0593 [Sulfurovum sp. NBC37-1] gi|151424049|dbj|BAF71552.1| conserved hypothetical protein [Sulfurovum sp. NBC37-1] Length = 322 Score = 162 bits (411), Expect = 6e-38, Method: Composition-based stats. Identities = 56/338 (16%), Positives = 116/338 (34%), Gaps = 26/338 (7%) Query: 25 FALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVE 84 + + GYY G GDF TA S FG +A ++ L+E Sbjct: 1 MNEWLYGEK-GYYKNFKAIGKSGDFYTAVSTSSFFGASIANHFFKMLKEGKADRNGWLIE 59 Query: 85 LGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYG---DKINWYT 141 +G +G ++ D+++ + P L +VE + +Q + + + Sbjct: 60 VGAHQGYLLCDMIQWLYTCDPSLVQTLRFGIVERQPEVREVQSAYIKERFGDDVTVTHFE 119 Query: 142 SLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFL 201 L++V + F+VANE FD+ P + + + E + + Sbjct: 120 DLSEVNTAYAFVVANEIFDAFPCELLKDEQIAVVE---------DHTISWEPAPTEMLEW 170 Query: 202 TCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYV 261 Y E + + ++++ + + DYG R +++ H Sbjct: 171 AKGHYLKQG--EVAVGYEDFAKAMASGIKKCD--FVSFDYGEKYVRNDFSIRVYYAHETF 226 Query: 262 -------SPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSL 314 + + D++ V+F+ + TQ + L G+ + Sbjct: 227 PLFDEALDLSESFQKDDITYDVNFKHVLEAFEAAGFREESYETQARALIRFGLIEILEQF 286 Query: 315 MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 QT + + ++ K + T MG+ FK++ + Sbjct: 287 ASQTTQDRYVREADK--IKTLIAPTMMGDRFKMIHLHK 322 >gi|317178828|dbj|BAJ56616.1| hypothetical protein HPF30_0519 [Helicobacter pylori F30] Length = 336 Score = 162 bits (410), Expect = 7e-38, Method: Composition-based stats. Identities = 66/346 (19%), Positives = 121/346 (34%), Gaps = 25/346 (7%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + Y + + GYY G GDF T+ +S+ FG +A ++I E+ Sbjct: 3 SFGNYMQEWLYGEK-GYY-KKALIGQKGDFYTSVSLSKFFGGAVAFYIIKLLEEEKLFLP 60 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQ---KKQLASYGDK 136 +++VE+G G + DI + L E + L +Q KQ Sbjct: 61 LKIVEIGAHHGHFLSDIANFLNALSVGVMEKCEFVSCEPLKELQKLQRTIFKQATQLDLM 120 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 I L F+++NE FD+ + +M+ I V+ D Sbjct: 121 ICDLKDLDFKGHESAFIISNELFDAFACEIIKDN------KMLFITHDHKGVWGAIDGPT 174 Query: 197 KSNFLTCSDYFLGAIFENSPCRDREMQSISDRLAC-DGGTAIVIDYGYLQSRVGDTLQAV 255 K D G + ++ + +RL + DYG R L+A Sbjct: 175 KELLKNL-DLKQGC---APLFLEAFIKDLLERLDEASSWVFLSFDYGDEIERKDMHLRAF 230 Query: 256 KGHTYVSPLV-------NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIW 308 K H + ++DL+ V+F + + + + +Q L +G+ Sbjct: 231 KNHQALDFKDILNNLDSLYQKSDLTYDVNFSLVRFLFEKHHAQFSFFKSQANALLDMGLM 290 Query: 309 QRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 + + K + + L ++ K + +GE FK L + + Sbjct: 291 ELLETFSKSVSYERYLKEAAK--IKPLISPGGLGERFKALELVKKN 334 >gi|260950353|ref|XP_002619473.1| hypothetical protein CLUG_00632 [Clavispora lusitaniae ATCC 42720] gi|238847045|gb|EEQ36509.1| hypothetical protein CLUG_00632 [Clavispora lusitaniae ATCC 42720] Length = 524 Score = 162 bits (410), Expect = 8e-38, Method: Composition-based stats. Identities = 69/205 (33%), Positives = 106/205 (51%), Gaps = 26/205 (12%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L IK G +++ Y C+ P+FGYY+T +P A GDF+T+PEIS +FGEM Sbjct: 80 QNLSDFFSETIKTTGPVSLSAYMRQCLTHPDFGYYTTRDPLAAGGDFITSPEISSVFGEM 139 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + ++L W+ G P +++VE GPGRG ++ D + V + S++ ++E S L Sbjct: 140 IGMWLFSVWQAQGSPQKIQVVEFGPGRGTLIHDAMAVFNRFAKVSVSIV---LIEASPVL 196 Query: 123 TLIQKK-----------------------QLASYGDKINWYTSLADVPLGFTFLVANEFF 159 Q K L+ +G ++ W + DVP +++VA+EFF Sbjct: 197 RKEQAKLLCPGVEQFEKVPTPENPAGFDSCLSKWGHRVMWVDTEKDVPSEVSYVVAHEFF 256 Query: 160 DSLPIKQFVMTEHGIRERMIDIDQH 184 D+LPIK FV E G RE ++D H Sbjct: 257 DALPIKSFVRKEEGWRELLVDSADH 281 Score = 102 bits (254), Expect = 8e-20, Method: Composition-based stats. Identities = 52/172 (30%), Positives = 82/172 (47%), Gaps = 9/172 (5%) Query: 195 EIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLAC--DGGTAIVIDYGYLQSRVGDTL 252 I S D +G+ E P + + I + + + G A++IDYG ++L Sbjct: 353 AIPSVSPRFRDLPVGSRVEICPDAELYLSRIVELVKKGQNAGAALIIDYGLANDIPSNSL 412 Query: 253 QAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAF 312 + + H +VSP +PG DLS VDF+ L +A + + + G T QG FL LGI R Sbjct: 413 RGIYKHKFVSPFFSPGNVDLSVDVDFENLRLLAAPH-VDVFGPTEQGDFLHELGIGVRFD 471 Query: 313 SLMK---QTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE-LMPF 360 L++ A K+ L +S RL D+ SMG+++K L + + F Sbjct: 472 QLIQRANSMANKESLYESYVRLT--GKDENSMGKIYKFLALLPKNSTRPAGF 521 >gi|261837958|gb|ACX97724.1| hypothetical protein KHP_0516 [Helicobacter pylori 51] Length = 336 Score = 162 bits (409), Expect = 8e-38, Method: Composition-based stats. Identities = 65/346 (18%), Positives = 122/346 (35%), Gaps = 25/346 (7%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + Y + + GYY G GDF T+ +S+ FG +A ++I E+ Sbjct: 3 SFGNYMQEWLYGEK-GYY-KKALIGQKGDFYTSVSLSKFFGGAVAFYIIKLLEEEKLFLP 60 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQ---KKQLASYGDK 136 +++VE+G G + DI + L + E + L +Q KQ Sbjct: 61 LKIVEIGAHHGHFLSDIANFLNALSVGVMEKCAFVSCEPLKELQKLQQTIFKQATQLDLM 120 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 I L F+V+NE FD+ + +M+ I V+ + Sbjct: 121 ICDLKDLDFKGHESAFVVSNELFDAFACEIIKDN------KMLFITHDHKGVWGDINKPT 174 Query: 197 KSNFLTCSDYFLGAIFENSPCRDREMQSISDRLAC-DGGTAIVIDYGYLQSRVGDTLQAV 255 K D G + ++ + ++L + DYG R L+A Sbjct: 175 KELLKNL-DLKQGC---APLFLEAFIKDLLEKLDEASSWVFLSFDYGDETERKDMHLRAF 230 Query: 256 KGHTYVSPLV-------NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIW 308 K H + ++DL+ V+F + + + + +Q L +G+ Sbjct: 231 KNHQALDFKDILNNLDSLYQKSDLTYDVNFSLVRFLFEKHHAQFSFFKSQASALLDMGLM 290 Query: 309 QRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 + + K + + L ++ K + +GE FK L + + Sbjct: 291 ELLETFSKSVSYERYLKEAAK--IKPLISPGGLGERFKALELVKKN 334 >gi|254779215|ref|YP_003057320.1| hypothetical protein HELPY_0545 [Helicobacter pylori B38] gi|254001126|emb|CAX29081.1| Conserved hypothetical protein [Helicobacter pylori B38] Length = 336 Score = 162 bits (409), Expect = 1e-37, Method: Composition-based stats. Identities = 66/345 (19%), Positives = 116/345 (33%), Gaps = 23/345 (6%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + Y + + GYY G GDF T+ +S+ FG +A ++I E+ Sbjct: 3 SFGNYMQEWLYGEK-GYYRKAL-IGQKGDFYTSVSLSKFFGGAVAFYIIKLLEEEKLFLP 60 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQ---KKQLASYGDK 136 +++VE+G G + DI + L E + L +Q KQ Sbjct: 61 LKIVEIGSHHGHFLSDIASFLNALSVGVMEKCEFVSCEPLKELQKLQRTIFKQATQLDLM 120 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 I L F+++NE FD+ + +M+ I V+ D Sbjct: 121 ICDLKDLDFKGHESAFIISNELFDAFACEIIKDN------KMLFITHDHQGVWGGIDEPT 174 Query: 197 KSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVK 256 K D G + +++ + DYG R L+A K Sbjct: 175 KELLKNL-DLKQGCVPLFLEAFIKDLLE--KLNEASSWVFLSFDYGDETERKDLHLRAFK 231 Query: 257 GHTYVSPLV-------NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQ 309 H + Q+DL+ V+F + + + + TQ L +G+ + Sbjct: 232 NHQALDFKDILNNLASLYQQSDLTYDVNFSLVRFLFEKHHAQFSFFKTQANALLDMGLME 291 Query: 310 RAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 + K + L ++ K + +GE FK L + Sbjct: 292 LLETFSKSVGYERYLKEAAK--IKPLISPGGLGERFKALEFVKKN 334 >gi|163783176|ref|ZP_02178170.1| hypothetical protein HG1285_14169 [Hydrogenivirga sp. 128-5-R1-1] gi|159881510|gb|EDP75020.1| hypothetical protein HG1285_14169 [Hydrogenivirga sp. 128-5-R1-1] Length = 320 Score = 161 bits (407), Expect = 2e-37, Method: Composition-based stats. Identities = 80/338 (23%), Positives = 142/338 (42%), Gaps = 27/338 (7%) Query: 20 TVDQYFALCVADPEFGYYST-CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPS 78 + + YY++ F + GDF TAPE+ + FGE +A FL + P+ Sbjct: 3 SFRDFMEE----KVKEYYTSPKEKFSSQGDFFTAPELDRTFGEAIADFLYHRLREFESPT 58 Query: 79 CVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKIN 138 ++ELG GRG++ DIL + PD + L+ + E S L QKK L+ + D ++ Sbjct: 59 ---ILELGAGRGLLAKDILSYYKEKDPDLYQRLNYLIYEMSPPLIETQKKILSEF-DNVS 114 Query: 139 WYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKS 198 W L + +++NEF+D+LP+ +E I+ D+ + + ++ + +K Sbjct: 115 WVEDLPQME---GVVLSNEFYDALPVHVVK----EGKELYIN-DEGEEVWLSLENGRVKE 166 Query: 199 NFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSR----VGDTLQA 254 L E ++ +++ L G + IDYGY+ T+ Sbjct: 167 FLKRMGYEDLNQRVEVCLDCIDMLERVANALLK--GYHLAIDYGYVSQEIQKFPEGTVVG 224 Query: 255 VKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSL 314 K HT + D+S+ V+F L + L +Q FL + + L Sbjct: 225 YKKHTLEGDIYQKEDMDISAQVNFSALMEYGKDFGLETILYDSQRNFLASIPHFVSQLEL 284 Query: 315 MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 + + + + + RL + MG+ FK+L S Sbjct: 285 LSFEETPENI-ERLSRLKTMLIS---MGDRFKVLFQSK 318 >gi|261839373|gb|ACX99138.1| hypothetical protein HPKB_0538 [Helicobacter pylori 52] Length = 336 Score = 161 bits (407), Expect = 2e-37, Method: Composition-based stats. Identities = 66/346 (19%), Positives = 124/346 (35%), Gaps = 25/346 (7%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + Y + + GYY G GDF T+ +S+ FG +A ++I E+ Sbjct: 3 SFGNYMQEWLYGEK-GYY-KKALIGQKGDFYTSVSLSKFFGGAVAFYIIKLLEEEKLFLP 60 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQ---KKQLASYGDK 136 +++VE+G G + DI + L E + L +Q KQ + Sbjct: 61 LKIVEIGAHHGHFLSDIANFLNALSVGVMEKCEFVSCEPLKELQKLQRTIFKQATQWDLM 120 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 I L F+V+NE FD+ + +M+ I V++ D Sbjct: 121 ICDLKDLDFKGHESAFVVSNELFDAFACEIIKDN------KMLFITHDHKGVWSGIDEPT 174 Query: 197 KSNFLTCSDYFLGAIFENSPCRDREMQSISDRLAC-DGGTAIVIDYGYLQSRVGDTLQAV 255 K D G + + ++ + ++L + DYG R L+A Sbjct: 175 KELLKNL-DLKQGCV---PLFLEAFIKDLLEKLDEAPSWVFLSFDYGDEIERKDMHLRAF 230 Query: 256 KGHTYVSPLV-------NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIW 308 K H + ++DL+ V+F + + + + +Q L +G+ Sbjct: 231 KNHQALDFKDILNNLASLYQKSDLTYDVNFSLVCFLFEKHHAQFSFFKSQANALLDMGLM 290 Query: 309 QRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 + + K + + L ++ K + +GE FK L + + Sbjct: 291 ELLETFSKSVSYERYLKEAAK--IKPLISPGGLGERFKALELVKKN 334 >gi|315586523|gb|ADU40904.1| conserved hypothetical protein [Helicobacter pylori 35A] Length = 336 Score = 161 bits (406), Expect = 2e-37, Method: Composition-based stats. Identities = 63/345 (18%), Positives = 118/345 (34%), Gaps = 23/345 (6%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + Y + + GYY G GDF T+ +S+ FG +A ++I E+ Sbjct: 3 SFGNYMQEWLYGEK-GYY-KKALIGQKGDFYTSVSLSKFFGGAVAFYIIKLLEEEKLFLP 60 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQ---KKQLASYGDK 136 +++VE+G G + DI + L E + L +Q KQ + Sbjct: 61 LKIVEIGAHHGHFLSDIANFLNALSVGVMEKCEFVSCEPLKELQKLQQIIFKQATQWDLM 120 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 I L F+++NE FD+ + +M+ I V+ + Sbjct: 121 ICDLKDLDFKGHESAFVISNELFDAFACEIIKDN------KMLFITHDHKGVWGDINKPT 174 Query: 197 KSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVK 256 K D G +++ + DYG R L+A K Sbjct: 175 KELLKNL-DLKQGCAPLFLEAFIKDLLE--KLNEAPSWVFLSFDYGDEIERKDMHLRAFK 231 Query: 257 GHTYVSPLV-------NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQ 309 H + ++DL+ V+F + + + + +Q L +G+ + Sbjct: 232 NHQVLDFKDILNNLASLYQKSDLTYDVNFSLVRFLFEKHHTQFSFFKSQASALLDMGLME 291 Query: 310 RAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 + K + + L ++ K + +GE FK L + + Sbjct: 292 LLETFSKSVSYERYLKEAAK--IKPLISPGGLGERFKALELVKKN 334 >gi|78777450|ref|YP_393765.1| hypothetical protein Suden_1252 [Sulfurimonas denitrificans DSM 1251] gi|78497990|gb|ABB44530.1| Protein of unknown function DUF185 [Sulfurimonas denitrificans DSM 1251] Length = 323 Score = 159 bits (403), Expect = 4e-37, Method: Composition-based stats. Identities = 67/339 (19%), Positives = 128/339 (37%), Gaps = 26/339 (7%) Query: 25 FALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVE 84 + + + GYY+T G GDF T+ S+ FG +A +I + + E Sbjct: 1 MSEWLYA-DNGYYATYKSIGKDGDFYTSVSSSKFFGGTIAKHIISLVDDGFLKKDSVVCE 59 Query: 85 LGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYG---DKINWYT 141 +G G + DI+ I L+P+ + L ++E + L QK L K+ Y Sbjct: 60 IGAHHGYFLADIIEFIYTLRPELLNSLEFVIIEKFDALQEFQKSYLQESFGDAIKLTHYK 119 Query: 142 SLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFL 201 SL ++ +ANE FD+ + + + + ++F+I D + Sbjct: 120 SLKELKCENAIFIANEIFDAFACELYYKGK-------VARVHESEIIFDIDDDWVSKK-- 170 Query: 202 TCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYV 261 + + E + + + +++ CD I DYG +Q+R +++ H + Sbjct: 171 --AKKYHKDRGEIAIGYEEFAKEMAE--CCDKFEFISFDYGEMQARPDFSIRVYYKHDVI 226 Query: 262 SPLVN-------PGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSL 314 ++DL+ V F+ + + L Q L +GI + L Sbjct: 227 PFFDENIKRDELFAKSDLTYDVTFEHVKDAFTEAGVEFIELRAQMVALVDMGILELLEIL 286 Query: 315 MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHE 353 ++T K + V +GE FK++ + Sbjct: 287 KEKTDEK--IYKQELEKVKILIMPNFLGERFKMIKFRKK 323 >gi|58584709|ref|YP_198282.1| SAM-dependent methyltransferase [Wolbachia endosymbiont strain TRS of Brugia malayi] gi|58419025|gb|AAW71040.1| Predicted SAM-dependent methyltransferase [Wolbachia endosymbiont strain TRS of Brugia malayi] Length = 388 Score = 159 bits (403), Expect = 4e-37, Method: Composition-based stats. Identities = 66/162 (40%), Positives = 97/162 (59%), Gaps = 3/162 (1%) Query: 25 FALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVE 84 + E+GYY P G GDF+TAPEISQ+FGE++A++++ W++ G PS LVE Sbjct: 1 MNAALYHKEYGYYMNKLPLGNGGDFITAPEISQLFGEIIAVWVMHTWKKLGKPSKFSLVE 60 Query: 85 LGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLA 144 LGPGRG ++ DI+RV K FS ++I++VE S L IQK++L +I+W+ ++ Sbjct: 61 LGPGRGTLIHDIIRVTKKYGSF-FSSMAIHLVEISPTLQKIQKEKLKGL--EISWHENID 117 Query: 145 DVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDS 186 +P T +ANEFFD+LPI QFV E + + Sbjct: 118 SLPEQPTIFLANEFFDALPIDQFVYRNGKWHENRVTKQNDGA 159 Score = 100 bits (250), Expect = 2e-19, Method: Composition-based stats. Identities = 47/165 (28%), Positives = 84/165 (50%), Gaps = 11/165 (6%) Query: 190 NIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVG 249 + + I + ++ F GA+ E ++ + D++ +GG A++IDYGY+ Sbjct: 232 KLQKNWIPVSSNGMTEGFDGAVVEVCSAGIEVLKKLEDKIMNNGGAALIIDYGYVYPSHK 291 Query: 250 DTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQ 309 TLQ+V+ H Y + L N G +D+++ V+FQ L I+ TQ +FL GI + Sbjct: 292 STLQSVRQHKYANFLENVGNSDITALVNFQALRDSLRYVDCEIS---TQREFLYLFGIKE 348 Query: 310 RAFSLMKQTA--RKDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 R +LM+ + +K+ + RL ++MG LFK +++ H Sbjct: 349 RTQALMENASDEQKNRIFSEFLRLT------ENMGTLFKAILIHH 387 >gi|217034141|ref|ZP_03439561.1| hypothetical protein HP9810_868g34 [Helicobacter pylori 98-10] gi|216943425|gb|EEC22881.1| hypothetical protein HP9810_868g34 [Helicobacter pylori 98-10] Length = 333 Score = 159 bits (403), Expect = 5e-37, Method: Composition-based stats. Identities = 63/344 (18%), Positives = 118/344 (34%), Gaps = 22/344 (6%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + Y + + GYY G GDF T+ +S+ FG +A ++I E+ Sbjct: 3 SFGNYMQEWLYGEK-GYY-KKALIGQKGDFYTSVSLSKFFGGAVAFYIIKLLEEEKLFLP 60 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYG--DKI 137 +++VE+G G + DI + L E + L +Q+ D Sbjct: 61 LKIVEIGAHHGHFLSDIANFLNALSVGVMEQCEFVSCEPLKELQKLQRTIFKQATQLDLS 120 Query: 138 NWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIK 197 + D F+++NE FD+ + +M+ I V+ D K Sbjct: 121 SCSLEELDFKEKSAFVISNELFDAFACEIIKDN------KMLFITHDHKGVWGDIDEPTK 174 Query: 198 SNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKG 257 D G +++ + DYG R L+A K Sbjct: 175 ELLKNL-DLKQGCAPLFLEAFIKDLLE--KLNEAPSWVFLSFDYGDEIERKDMHLRAFKN 231 Query: 258 HTYVSPLV-------NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQR 310 H + ++DL+ V+F + + + + +Q L +G+ + Sbjct: 232 HQALDFKDILNHLASLYQKSDLTYDVNFSLVRFLFEKHHAQFSFFKSQANALLDMGLMEL 291 Query: 311 AFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 + K + + L ++ K + +GE FK L + +K Sbjct: 292 LETFSKSVSYERYLKEAAK--IKPLISPGGLGERFKALELVKKK 333 >gi|317180575|dbj|BAJ58361.1| hypothetical protein HPF32_0779 [Helicobacter pylori F32] Length = 335 Score = 159 bits (402), Expect = 6e-37, Method: Composition-based stats. Identities = 64/345 (18%), Positives = 121/345 (35%), Gaps = 24/345 (6%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + Y + + GYY G GDF T+ +S+ FG +A ++I E+ Sbjct: 3 SFGNYMQEWLYGEK-GYY-KKALIGQKGDFYTSVSLSKFFGGTVAFYIIKLLEEEKLFLP 60 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLAS--YGDKI 137 +++VE+G G + DI + L E + L +Q+ D Sbjct: 61 LKIVEIGAHHGHFLSDIANFLNALSVGVMEKCEFVSCEPLKELQKLQRTIFKQATQWDLS 120 Query: 138 NWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIK 197 + D F+++NE FD+ + +M+ I V++ D K Sbjct: 121 SCSLEELDFKEKSAFVISNELFDAFACEIIKDN------KMLFITHDHKGVWSGIDEPTK 174 Query: 198 SNFLTCSDYFLGAIFENSPCRDREMQSISDRLAC-DGGTAIVIDYGYLQSRVGDTLQAVK 256 D G + + ++ + ++L + DYG R L+A K Sbjct: 175 ELLKNL-DLKQGCV---PLFLEAFIKDLLEKLDEAPSWVFLSFDYGDEIERKDMHLRAFK 230 Query: 257 GHTYVSPLV-------NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQ 309 H + ++DL+ V+F + + Y + +Q L +G+ + Sbjct: 231 NHQALDFKDILNNLASLYQKSDLTYDVNFSLVRFLFEKYHAQFSFFKSQANALLDMGLME 290 Query: 310 RAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 + K + L ++ K + +GE FK L + Sbjct: 291 LLETFSKSVGYERYLKEAAK--IKPLISPGGLGERFKALEFVKKN 333 >gi|332673383|gb|AEE70200.1| conserved hypothetical protein [Helicobacter pylori 83] Length = 335 Score = 158 bits (400), Expect = 9e-37, Method: Composition-based stats. Identities = 63/345 (18%), Positives = 122/345 (35%), Gaps = 24/345 (6%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + Y + + GYY G GDF T+ +S+ FG +A ++I E+ Sbjct: 3 SFGNYMQEWLYGEK-GYY-RRALIGQKGDFYTSVSLSKFFGGAVAFYIIKLLEEEKLFLP 60 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYG--DKI 137 +++VE+G G + DI + L E + L +Q+ D Sbjct: 61 LKIVEIGAHHGHFLSDIANFLNALSVGVMEKCEFVSCEPLKELQKLQRTIFKQATQLDLN 120 Query: 138 NWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIK 197 + D F+++NE FD+ + +M+ I V+ D K Sbjct: 121 SCSLEELDFKEKSAFVISNELFDAFACEIIKDN------KMLFITHDHKGVWGGIDGPTK 174 Query: 198 SNFLTCSDYFLGAIFENSPCRDREMQSISDRLAC-DGGTAIVIDYGYLQSRVGDTLQAVK 256 D G + + ++ + ++L + DYG R L+A K Sbjct: 175 ELLKNL-DLKQGCV---PLFLEAFIKDLLEKLDEASSWVFLSFDYGDEIERKDMHLRAFK 230 Query: 257 GHTYVSPLV-------NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQ 309 H + ++DL+ V+F + + + + +Q L +G+ + Sbjct: 231 NHQALDFKDILNNLDSLYQKSDLTYDVNFSLVRFLFEKHHAQFSFFKSQANALLDMGLME 290 Query: 310 RAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 + K + + L ++ K + +GE FK L + + Sbjct: 291 LLETFSKSVSYERYLKEAAK--IKPLISPGGLGERFKALELVKKN 333 >gi|15611815|ref|NP_223466.1| hypothetical protein jhp0748 [Helicobacter pylori J99] gi|4155324|gb|AAD06339.1| putative [Helicobacter pylori J99] Length = 336 Score = 158 bits (399), Expect = 1e-36, Method: Composition-based stats. Identities = 66/345 (19%), Positives = 113/345 (32%), Gaps = 23/345 (6%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + Y + + GYY G GDF T+ +S+ FG +A ++I E+ Sbjct: 3 SFGNYMQEWLYGEK-GYYRKAL-IGPKGDFYTSVSLSKFFGGAIAFYIIRLLEEEKLFLP 60 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQ---KKQLASYGDK 136 +++VE+G G + DI + L E + L IQ KQ Sbjct: 61 LKIVEIGSHHGHFLSDIASFLNALSVGVMEKCEFVSCEPLKELQNIQRTIFKQATQLDLI 120 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 L F+V+NE FD+ + +M+ I V+ D Sbjct: 121 SCALEELDFKEKKSAFVVSNELFDAFACEIIKDN------QMLFITHDHQGVWGGIDEPT 174 Query: 197 KSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVK 256 K + G + + + DYG R L+A K Sbjct: 175 KELLKNL-NLKEGCAPLFLEAFIKNLLE--KLNEASSWVFLSFDYGDELERKDMHLRAFK 231 Query: 257 GHTYVSPLV-------NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQ 309 H + Q+DL+ V+F + + + + TQ L +G+ + Sbjct: 232 NHQALDFKDILNHLASLYQQSDLTYDVNFSLVRFLFEKHHAQFSFFKTQANALLDMGLME 291 Query: 310 RAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 + K + L ++ K + +GE FK L + Sbjct: 292 LLETFSKSVGYERYLKEAAK--IKPLISPGGLGERFKALEFVKKN 334 >gi|317182124|dbj|BAJ59908.1| hypothetical protein HPF57_0834 [Helicobacter pylori F57] Length = 335 Score = 157 bits (397), Expect = 2e-36, Method: Composition-based stats. Identities = 63/345 (18%), Positives = 121/345 (35%), Gaps = 24/345 (6%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + Y + + GYY G GDF T+ +S+ FG +A ++I E+ Sbjct: 3 SFGNYMQEWLYGEK-GYY-KKALIGQKGDFYTSVSLSKFFGGAVAFYIIKLLEEEKLFLP 60 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYG--DKI 137 +++VE+G G + DI + L E + L +Q+ D Sbjct: 61 LKIVEIGAHHGHFLSDIANFLNALSMGVMEQCEFVSCEPLKELQKLQRTIFKQATQLDLN 120 Query: 138 NWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIK 197 + D F+++NE FD+ + +M+ I V+ D K Sbjct: 121 SCSLEELDFKEKSAFVISNELFDAFACEIIKDN------KMLFITHDHKGVWGGIDGPTK 174 Query: 198 SNFLTCSDYFLGAIFENSPCRDREMQSISDRLAC-DGGTAIVIDYGYLQSRVGDTLQAVK 256 D G + ++ + ++L + DYG R L+A K Sbjct: 175 ELLKNL-DLKQGC---APLFLEAFIKDLLEKLDEASSWVFLSFDYGDEIERKDMHLRAFK 230 Query: 257 GHTYVSPLV-------NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQ 309 H + ++DL+ V+F + + + + +Q L +G+ + Sbjct: 231 NHQALDFKDILNHLASLYQKSDLTYDVNFSLVRFLFEKHHAQFSFFKSQASALLDMGLME 290 Query: 310 RAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 + K + + L ++ K + +GE FK L + + Sbjct: 291 LLETFSKSVSYERYLKEAAK--IKPLISPGGLGERFKALELVKKN 333 >gi|85079715|ref|XP_956406.1| hypothetical protein NCU00183 [Neurospora crassa OR74A] gi|28917469|gb|EAA27170.1| hypothetical protein NCU00183 [Neurospora crassa OR74A] Length = 568 Score = 157 bits (397), Expect = 2e-36, Method: Composition-based stats. Identities = 63/205 (30%), Positives = 95/205 (46%), Gaps = 30/205 (14%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYS--------TCNPFGAVGDFVTAP 53 L +++ I G + + + +C+ GYY+ + FGA GDFVT+P Sbjct: 64 STPLAKQLAEAITATGPVPLASFMRMCLTGDIGGYYTGAIEKSEQNRDQFGAKGDFVTSP 123 Query: 54 EISQIFGEMLAIFLICAWEQHGFPSC-VRLVELGPGRGIMMLDILRVICKLKPDFFSVLS 112 EISQ+FGE+ ++ + W G PS V L+E+GPGRG +M D+LR I S+ + Sbjct: 124 EISQVFGELCGLWYVTEWLAQGRPSKGVELIEVGPGRGTLMDDMLRTIQNFPEMAKSIDA 183 Query: 113 IYMVETSERLTLIQKKQLASYGD---------------------KINWYTSLADVPLGFT 151 +YMVE S +L + QK L S+ P Sbjct: 184 VYMVEASPQLRMAQKNLLCGKDAAMSESKVGYHSHCKYGDIPIVWTETIKSIPYDPEKTP 243 Query: 152 FLVANEFFDSLPIKQFVMTEHGIRE 176 F++A+EFFD+LPI F + + E Sbjct: 244 FIMAHEFFDALPIHAFQLVQVPPTE 268 Score = 92.5 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 47/190 (24%), Positives = 73/190 (38%), Gaps = 26/190 (13%) Query: 195 EIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRL--------ACDGGTAIVIDY-GYLQ 245 + L GA+ E P + R+ G A+++DY Sbjct: 375 QKPLAVLPDGTTQAGALIEICPDAFLFASDFATRIGGSPAHPKPSPRGAALILDYGPGDG 434 Query: 246 SRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILY--KLYINGLTTQGKFLE 303 S ++L+ ++ H VSP PG DLS+ VDF ++ A + ++G Q +LE Sbjct: 435 SVPVNSLRGIRKHHLVSPFAEPGLTDLSADVDFTAIAEAATNASEGVEVHGPVEQAWWLE 494 Query: 304 GLGIWQRAFSLMK------QTARKD----ILLDSVKRLVSTSADKKSMGELFKILVVSHE 353 G+G +R L K + KD L S RLV MG ++K L + E Sbjct: 495 GMGGRERVEQLAKRSKRGNEEEEKDKFVKDLRRSWDRLVDRG--PNGMGRIYKALAIVPE 552 Query: 354 KVE---LMPF 360 + F Sbjct: 553 NDGRRRPVGF 562 >gi|187736628|ref|YP_001878740.1| protein of unknown function DUF185 [Akkermansia muciniphila ATCC BAA-835] gi|187426680|gb|ACD05959.1| protein of unknown function DUF185 [Akkermansia muciniphila ATCC BAA-835] Length = 340 Score = 157 bits (396), Expect = 3e-36, Method: Composition-based stats. Identities = 78/369 (21%), Positives = 133/369 (36%), Gaps = 46/369 (12%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGEM 62 +L I G ++++ + L + P+ GYYS+ G GDF T P +S + Sbjct: 3 RLSDHIA---AAGGWLSLEAFMQLALHHPQEGYYSSSIENIGQRGDFSTTPTLSP----I 55 Query: 63 LAIFLICAWEQHGFPSCVR--LVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSE 120 LA ++ W++ R L+E+G G G + + I + +L +VE+S Sbjct: 56 LAKAIVAHWKEACSRCGRRLPLLEIGAGSGALAVKI---LEQLGFWNRLNTDYVIVESSP 112 Query: 121 RLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 RL Q L + G F+ +NE D+ P + F TE +E + Sbjct: 113 RLREFQHLLLGGRAKIYSTMEKALKHCGGKAFIFSNELVDAFPARVFEYTEQDWKEVGLV 172 Query: 181 IDQHDSLVFNIGDHEIKSNFLTCSDY--FLGAIFENSPCRDREMQSISDRLACDGGTAIV 238 + ++ ++ + + F +Y G E R S + G V Sbjct: 173 V-KNGAVREELRPVRQQPLFSHMLEYGSQPGQRVEIHDSYARWFTSWLPLW--NMGVMTV 229 Query: 239 IDYGY-----LQSRVGDTLQAVKGHTY---VSPLVNPGQADLSSHVDFQRLSSIAILYKL 290 IDYG R +L+ K H NPG DL+ V+F L ++ Sbjct: 230 IDYGDEMERLYYRRPRGSLRGYKSHQVLTGEELYRNPGLTDLTCDVNFTDLLELSRNCLG 289 Query: 291 YINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVV 350 TQ +L + T + L D+ GE F +L+ Sbjct: 290 DRVTFMTQRDYLLPH---------AENTPQDAFL-----------TDEYGAGEHFHVLIQ 329 Query: 351 SHEKVELMP 359 ++++ Sbjct: 330 ERQRLQPEG 338 >gi|308063400|gb|ADO05287.1| hypothetical protein HPSAT_02710 [Helicobacter pylori Sat464] Length = 336 Score = 156 bits (395), Expect = 4e-36, Method: Composition-based stats. Identities = 65/346 (18%), Positives = 118/346 (34%), Gaps = 25/346 (7%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + Y + + GYY G GDF T+ +S+ FG +A ++I E+ Sbjct: 3 SFGNYMQEWLYGEK-GYY-KKALIGQKGDFYTSVSLSKFFGGSVAFYIIKLLEEEKLFLP 60 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQ---KKQLASYGDK 136 +++VE+G G + DI + L E + L +Q KQ Sbjct: 61 LKIVEIGAHHGHFLSDIANFLNALSVGVMEKCEFVSCEPLKELQKLQRTIFKQATQLDLS 120 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 L F+V+NE FD+ + +M+ I V+ D Sbjct: 121 SCSLEELDFKEKKSAFVVSNELFDAFACEIVKDN------KMLFITHDYKGVWGGIDEPT 174 Query: 197 KSNFLTCSDYFLGAIFENSPCRDREMQSISDRLAC-DGGTAIVIDYGYLQSRVGDTLQAV 255 K D G + ++ + ++L + DYG R L+A Sbjct: 175 KELLKNL-DLKQGC---APLFLEAFIKDLLEKLDEASSWVFLSFDYGDEIERKDMHLRAF 230 Query: 256 KGHTYVSPLV-------NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIW 308 K H + ++DL+ V+F + + + + +Q L +G+ Sbjct: 231 KNHQALDFKDILNNLASLYQKSDLTYDVNFSLVRFLFEKHHAKFSFFKSQANALLDMGLM 290 Query: 309 QRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 + + K + + L + K + +GE FK L + Sbjct: 291 ELLETFSKSASYERYLKEVAK--IKPLISPGGLGERFKALEFVKKN 334 >gi|297380010|gb|ADI34897.1| Hypothetical protein HPV225_0826 [Helicobacter pylori v225d] Length = 336 Score = 156 bits (394), Expect = 5e-36, Method: Composition-based stats. Identities = 65/346 (18%), Positives = 118/346 (34%), Gaps = 25/346 (7%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + Y + + GYY G GDF T+ +S+ FG +A ++I E+ Sbjct: 3 SFGNYMQEWLYGEK-GYY-KKALIGQKGDFYTSVSLSKFFGGAVAFYIIKLLEEEKLFLP 60 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQ---KKQLASYGDK 136 +++VE+G G + DI + L E + L +Q KQ Sbjct: 61 LKIVEIGAHHGHFLSDIANFLNALSVGVMEKCEFVSCEPLKELQKLQRTIFKQATQLDLS 120 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 L F+V+NE FD+ + +M+ I V+ D Sbjct: 121 SCSLEELDFKEKKSAFVVSNELFDAFACEIIKDN------KMLFITHDHKGVWGAIDEST 174 Query: 197 KSNFLTCSDYFLGAIFENSPCRDREMQSISDRLAC-DGGTAIVIDYGYLQSRVGDTLQAV 255 K D G + ++ + ++L + DYG R L+A Sbjct: 175 KELLKNL-DLKQGC---APLFLEVFIKDLLEKLDEASSWVFLSFDYGDEIERKDMHLRAF 230 Query: 256 KGHTYVSPLV-------NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIW 308 K H + ++DL+ V+F + + + + +Q L +G+ Sbjct: 231 KNHQALDFKDILNNLASLYQKSDLTYDVNFSLVRFLFEKHHAQFSFFKSQANALLDMGLM 290 Query: 309 QRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 + + K + L ++ K + +GE FK L + Sbjct: 291 ELLETFSKSVDYERYLKEAAK--IKPLISPGGLGERFKALEFVKKN 334 >gi|317009178|gb|ADU79758.1| hypothetical protein HPIN_02540 [Helicobacter pylori India7] Length = 336 Score = 156 bits (393), Expect = 7e-36, Method: Composition-based stats. Identities = 64/345 (18%), Positives = 113/345 (32%), Gaps = 23/345 (6%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + Y + + GYY G GDF T+ +S+ FG +A ++I E+ Sbjct: 3 SFGNYMQEWLYGEK-GYYRKAL-IGQKGDFYTSVSLSKFFGGAMAFYIIKLLEEEKLFLP 60 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQ---KKQLASYGDK 136 +++VE+G G + DI + L E + L +Q K+ Sbjct: 61 LKIVEIGSHHGHFLSDIASFLNALSVGVMEKCEFVSCEPLKELQKLQRTIFKRATQLDLT 120 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 L F+V+NE FD+ + +M+ I V+ D Sbjct: 121 SCSLEELDFKEKKSAFIVSNELFDAFACEIIKDN------KMLFITHDHKGVWGGIDKPT 174 Query: 197 KSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVK 256 K D G +++ + DYG R L+A K Sbjct: 175 KELLKNL-DLKQGCAPLFLEAFIKDLLE--KLNEASSWVFLSFDYGDEIERKDMHLRAFK 231 Query: 257 GHTYVSPLV-------NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQ 309 H + ++DL+ V+F + + + + TQ L +G+ + Sbjct: 232 NHQVLDFKDILNNLASLYQKSDLTYDVNFSLVRFLFEKHHAKFSFFKTQASALLDMGLME 291 Query: 310 RAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 + K + L + K + +GE FK L + Sbjct: 292 LLETFSKSVGYERYLKEVAK--IKPLISPGGLGERFKALEFVKKN 334 >gi|320586267|gb|EFW98946.1| duf185 domain containing protein [Grosmannia clavigera kw1407] Length = 562 Score = 155 bits (392), Expect = 8e-36, Method: Composition-based stats. Identities = 66/195 (33%), Positives = 94/195 (48%), Gaps = 29/195 (14%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYST------CNPFGAVGDFVTAPEI 55 L +++ I G + V Y +C+ GYY+ +PFG GDFVT+PE+ Sbjct: 61 STPLAKQLAEAISATGPVPVASYMRMCLTSDLGGYYTGALDKTGRDPFGRAGDFVTSPEV 120 Query: 56 SQIFGEMLAIFLICAWEQHGFPSC-VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIY 114 SQ+FGE++ I+ + W G PS V L+E+GPGRG +M DILR + S+ S+Y Sbjct: 121 SQVFGELVGIWFVAEWMAQGRPSRGVELIEVGPGRGTLMDDILRTVRHFGLAQ-SIESVY 179 Query: 115 MVETSERLTLIQKKQLASYG---------------------DKINWYTSLADVPLGFTFL 153 MVE S +L L QK L + S+ +F+ Sbjct: 180 MVEASPQLRLAQKTLLCGHDVALTESSLGYHGVSKHGSLPIVWTETIQSIPQSLENMSFI 239 Query: 154 VANEFFDSLPIKQFV 168 VA+EFFD+LPI F Sbjct: 240 VAHEFFDALPIHVFQ 254 Score = 81.3 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 41/176 (23%), Positives = 71/176 (40%), Gaps = 25/176 (14%) Query: 208 LGAIFENSPCRDREMQSISDRL--------ACDGGTAIVIDY-GYLQSRVGDTLQAVKGH 258 G + E P + R+ G A+++DY + ++L+ ++ H Sbjct: 383 PGVVVEVCPDAALYATDFAVRIGGSAARPRPHPAGAALILDYGPGDGTVPINSLRGIRRH 442 Query: 259 TYVSPLVNPGQADLSSHVDFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQR------ 310 VSP PG DLS+ VDF + A+ ++ ++G Q FL +GI +R Sbjct: 443 RRVSPFAEPGLTDLSADVDFGAIVEAAVRASDRVELHGPVDQADFLLQMGIRERAQALAA 502 Query: 311 ---AFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE---LMPF 360 A ++ + S KRLV MG+++K+L + E + F Sbjct: 503 KAAASPAASSSSTPSDIDRSWKRLVDRG--PGGMGKVYKVLAILPENDGRRRPVGF 556 >gi|319789159|ref|YP_004150792.1| protein of unknown function DUF185 [Thermovibrio ammonificans HB-1] gi|317113661|gb|ADU96151.1| protein of unknown function DUF185 [Thermovibrio ammonificans HB-1] Length = 314 Score = 154 bits (389), Expect = 2e-35, Method: Composition-based stats. Identities = 83/347 (23%), Positives = 138/347 (39%), Gaps = 45/347 (12%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVG-DFVTAPEISQIFGEML 63 L IV LI++ G + D++ LC+ PEFGYY+ G DF TAPE++ +FG++L Sbjct: 2 LKELIVELIEREGPLRFDRFVELCLYHPEFGYYTRVRTLPVPGQDFFTAPELTPVFGKVL 61 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLT 123 A + + P R++ELG G+G + D+L + P+ + ++E Sbjct: 62 ARHIGEVARREKLPL--RILELGGGKGFLAKDLLEELR---PEEY-----LLLEKGPM-- 109 Query: 124 LIQKKQLASYGDKINWYTSLADVPLGFT-FLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 + TSL ++ GF F+V+NEFFD+ P ++ R + Sbjct: 110 ---------NLKGVKGLTSLEELEGGFEGFVVSNEFFDAFPFRRV-----LPRRGLEVFI 155 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG 242 + + + F G + ++ ++ +L+ Sbjct: 156 DGKESRLFETLLPFEGEAGSPCEGFEGE-YPLFSSWRPFLEELAGKLSRCYFVTFDYGGR 214 Query: 243 YLQSRVGDTLQAVKGHTYVSPL-VNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKF 301 + T +A GH PG+ DL++ VDF LS I L Q +F Sbjct: 215 CTELSGRQTFKAFSGHALADDWLERPGEVDLTALVDFDYLSKILGELGFENLELCPQSEF 274 Query: 302 LEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKIL 348 L GI +R S + +L++ D MG FK+L Sbjct: 275 LLSWGI-ERFAS-----PEHTV------QLLTLLVD---MGRRFKVL 306 >gi|33240460|ref|NP_875402.1| hypothetical protein Pro1010 [Prochlorococcus marinus subsp. marinus str. CCMP1375] gi|33237988|gb|AAQ00055.1| Uncharacterized conserved protein [Prochlorococcus marinus subsp. marinus str. CCMP1375] Length = 401 Score = 154 bits (388), Expect = 2e-35, Method: Composition-based stats. Identities = 67/375 (17%), Positives = 148/375 (39%), Gaps = 31/375 (8%) Query: 5 LIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIFGEML 63 + I + G ++ + L + D + G Y+T G GDFVT+P + F ++L Sbjct: 12 MAAHISDR---GGVISFFDFMDLALNDMKNGSYATGKLRIGPKGDFVTSPSLGPEFCDLL 68 Query: 64 AIFLICAWEQHGF----PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 A ++ E + ++++GPG G ++ ++ + K P F + + ++E + Sbjct: 69 ASQVVDWVEALLHTDVTSEVISIIDIGPGEGDLLFHLIEALQKKSPSLFKKIKLILIEIN 128 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPLGFT--FLVANEFFDSLPIKQFVMTEHGIRER 177 E + QK++LA + D + S+ ++ ++A+E D+LP+ + V + + + Sbjct: 129 EGMKDRQKRRLAPFKDIPICWMSMKELSDVPVKGIMIAHEILDALPVDRVVSNNNKLFMQ 188 Query: 178 MIDIDQ-HDSLVFNIGDHEIKSNF--------------LTCSDYFLGAIFENSPCRDREM 222 + + ++ + + + + + G E C + Sbjct: 189 GVKLSTLNNKHYIEFTNLPLSDSIKNSIIDISNKIDISIPPQNSEEGWSTEWHSCLNNWF 248 Query: 223 QSISDRLACDGG----TAIVIDYGYLQSRVGDTLQAVKGHTYV-SPLVNPGQADLSSHVD 277 + S L A+ + Y +SR T+ + + L G D++SH+ Sbjct: 249 KETSLCLTEGPLLIIDYALEANRYYSKSRSEGTIISYSSQVSNSNILEKIGTTDITSHLC 308 Query: 278 FQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSAD 337 + + +AI G QG L LG+ + L + K + + + D Sbjct: 309 LEIIYLLAIKNNWNFVGERKQGLSLLALGLADKLNDLKQIPNSKLGEALTKRENLLRLID 368 Query: 338 KKSMGELFKILVVSH 352 +G+ F+ ++ + Sbjct: 369 PSCLGD-FRWILFNK 382 >gi|237750762|ref|ZP_04581242.1| conserved hypothetical protein [Helicobacter bilis ATCC 43879] gi|229373852|gb|EEO24243.1| conserved hypothetical protein [Helicobacter bilis ATCC 43879] Length = 340 Score = 154 bits (388), Expect = 3e-35, Method: Composition-based stats. Identities = 65/353 (18%), Positives = 122/353 (34%), Gaps = 35/353 (9%) Query: 15 KNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQH 74 + ++ + + +P GYY+ + G GDF T+ +S+ FG ++ +++ ++ Sbjct: 2 TSNKLAFSEIMQEWLYNPNTGYYTQNHV-GKAGDFYTSVSVSKFFGGAISRYILRMLDEK 60 Query: 75 GFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYG 134 + +VE+G G ++ DI + F S ++E L QK + Sbjct: 61 RLTLPLHIVEIGSNNGDLIADIAEFLKAFSNIVFIQTSFCVIEPLVCLHTQQKATFQARI 120 Query: 135 --------DKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDS 186 + L D F ++NE FDS+P F +++ Sbjct: 121 TARFAKQLHIYSNLQMLKDTQPNNVFFISNELFDSMPCDIFHNNNMLYF-------HNNN 173 Query: 187 LVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS 246 ++ EI + + E + +Q++ + + DYG + Sbjct: 174 FTWDKPSSEISEFI----EKYDIKSAEIPLIWESFIQNLCNL--PCKWIFLTFDYGDFLA 227 Query: 247 RVGDTLQAVKGHTYVSPLVNPGQ---------ADLSSHVDFQRLSSIAILYKLYINGLTT 297 R + ++ H + +D++ +DF L I + T Sbjct: 228 REMN-IRMYMQHKVYNLYEELQNDRLAQFIGKSDITYDIDFSLLKKILENNNAKVLCNVT 286 Query: 298 QGKFLE-GLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILV 349 Q KFL I S + I L K + +MGE FK LV Sbjct: 287 QSKFLIESCEILDIFESFSAHFS--TIQLSKQKASLQGLIAPNAMGERFKALV 337 >gi|34556740|ref|NP_906555.1| hypothetical protein WS0304 [Wolinella succinogenes DSM 1740] gi|34482454|emb|CAE09455.1| conserved hypothetical protein [Wolinella succinogenes] Length = 337 Score = 154 bits (388), Expect = 3e-35, Method: Composition-based stats. Identities = 61/343 (17%), Positives = 114/343 (33%), Gaps = 32/343 (9%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + + E GYY G GDF T+ I + FG + L E+ Sbjct: 3 PFGEVMQEWLYG-ESGYY-RHAKVGRAGDFYTSVSIGRYFGSTIGNGLARFLEE----GE 56 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGD---K 136 VE+G RG ++ D + + P+ + ++E E L +Q++ Sbjct: 57 ASFVEIGASRGELISDAALFLRRFFPEKIPLCRWVIIEPLEELRALQRQHWEERIGGALP 116 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + SL F VANE D+ P + F + ++ +VF + Sbjct: 117 LEIFPSLEAFRCSRAFFVANELLDAFPCELF------WEGKQGFVNPSGEVVF------L 164 Query: 197 KSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVK 256 ++ + + E S + + + DYG R +++ + Sbjct: 165 EAEEWLKERALKAGVSKGELALGVEELVSSLFKSASSWSFLTFDYGQDFPRNDFSIRLYQ 224 Query: 257 GHTYV-------SPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQ 309 H Q+D++S V+F+ +S Q + L +G+ + Sbjct: 225 NHQPFNFADFKGDIRPFFAQSDITSDVNFEEVSRYFKEQGAQRIYFNRQNRALLEMGLVE 284 Query: 310 RAFSLMKQTARKDILLDSVKRL-VSTSADKKSMGELFKILVVS 351 K + + + L V D +GE FK + Sbjct: 285 VVE---KWNSELSGEAYAKEMLSVRPLLDPSLLGERFKCACFA 324 >gi|289620095|emb|CBI53539.1| unnamed protein product [Sordaria macrospora] Length = 571 Score = 153 bits (387), Expect = 3e-35, Method: Composition-based stats. Identities = 62/197 (31%), Positives = 92/197 (46%), Gaps = 30/197 (15%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYS--------TCNPFGAVGDFVTAP 53 L +++ I G + + + +C+ GYY+ + FGA GDFVT+P Sbjct: 66 STPLAKQLAEAITATGPVPLASFMRMCLTGDIGGYYTGAIEKSEQNRDQFGAAGDFVTSP 125 Query: 54 EISQIFGEMLAIFLICAWEQHGFPSC-VRLVELGPGRGIMMLDILRVICKLKPDFFSVLS 112 EISQ+FGE+ ++ + W G PS V L+E+GPGRG +M D+LR I S+ + Sbjct: 126 EISQVFGELCGLWYVTEWLAQGRPSKGVELIEVGPGRGTLMDDMLRTIQNFPEMAKSIDA 185 Query: 113 IYMVETSERLTLIQKKQLASYGD---------------------KINWYTSLADVPLGFT 151 +YMVE S +L + QK L S+ P Sbjct: 186 VYMVEASPQLRIAQKNLLCREDAAMSESKVGYHSHCKYGNIPIVWTETIKSIPYDPEKTP 245 Query: 152 FLVANEFFDSLPIKQFV 168 F++A+EFFD+LPI F Sbjct: 246 FIMAHEFFDALPIHAFQ 262 Score = 84.4 bits (207), Expect = 2e-14, Method: Composition-based stats. Identities = 48/178 (26%), Positives = 75/178 (42%), Gaps = 27/178 (15%) Query: 208 LGAIFENSPC--------RDREMQSISDRLACDGGTAIVIDY-GYLQSRVGDTLQAVKGH 258 G++ E SP R S S A G A+++DY S ++L+ ++ H Sbjct: 390 PGSLIEISPDTYLFATDFATRIGGSPSHPKAQPRGAALILDYGPGDGSVPVNSLRGIRKH 449 Query: 259 TYVSPLVNPGQADLSSHVDFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSL-- 314 VSP PG DLS+ VDF ++ A+ + ++G QG LEG+G +R SL Sbjct: 450 HLVSPFAEPGLTDLSADVDFSAIAEAAVNASEGVEVHGPVEQGWLLEGMGGRERVESLVR 509 Query: 315 ---------MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE---LMPF 360 ++ + L S +RLV MG ++K L + E + F Sbjct: 510 AGLQTKKGDEEKEKFVEDLRRSWERLVDRG--PNGMGRVYKALAIVPENDGRRRPVGF 565 >gi|317177358|dbj|BAJ55147.1| hypothetical protein HPF16_0550 [Helicobacter pylori F16] Length = 335 Score = 153 bits (387), Expect = 3e-35, Method: Composition-based stats. Identities = 62/345 (17%), Positives = 119/345 (34%), Gaps = 24/345 (6%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + Y + + GYY G GDF T+ +S+ FG +A ++I E+ Sbjct: 3 SFGNYMQEWLYGEK-GYY-KKALIGQKGDFYTSVSLSKFFGGAVAFYIIKLLEEEKLFLP 60 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYG--DKI 137 +++VE+G G + DI + L E + L +Q+ D Sbjct: 61 LKIVEIGTHHGHFLSDIANFLNALSVGVMEKCEFVSCEPLKELQKLQRTIFKQATQLDLN 120 Query: 138 NWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIK 197 + D F+++NE FD+ + +M+ I V+ D K Sbjct: 121 SCSLEELDFKEKSAFVISNELFDAFACEIIKDN------KMLFITHDHKGVWGGIDGPTK 174 Query: 198 SNFLTCSDYFLGAIFENSPCRDREMQSISDRLAC-DGGTAIVIDYGYLQSRVGDTLQAVK 256 A + ++ + ++L + DYG R L+A K Sbjct: 175 KLLKNLDLKQGCA----PLFLEAFIKDLLEKLDEASSWVFLSFDYGDEIERKDMHLRAFK 230 Query: 257 GHTYVSPLV-------NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQ 309 H + ++DL+ V+F + + + + +Q L +G+ + Sbjct: 231 NHQALDFKDILNHLASLYQKSDLTYDVNFSLVRFLFEKHHAQFSFFKSQASALLDMGLME 290 Query: 310 RAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 + K + L ++ K + +GE FK L + + Sbjct: 291 LLETFSKSVGYERYLKEAAK--IKPLISPGGLGERFKALELVKKN 333 >gi|313220377|emb|CBY31232.1| unnamed protein product [Oikopleura dioica] Length = 363 Score = 151 bits (382), Expect = 1e-34, Method: Composition-based stats. Identities = 87/346 (25%), Positives = 140/346 (40%), Gaps = 50/346 (14%) Query: 58 IFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM-- 115 +FGE + +++ W G P +ELGPGRG + D L I L + Sbjct: 1 MFGEHIGAWILQEWTIAGKPKSFEYLELGPGRGTLAKDALNAIQTLLDKVDDDKKTMINV 60 Query: 116 --VETSERLTLIQKKQLASYGDKINWYTSL-------------ADVPLGFT--------- 151 VE S L+ Q + L +++ + A+ F Sbjct: 61 RFVEVSPVLSKKQAETLDLKITEVSEVEADGCYMKASNGSNISAEWYRSFETLPESNDIS 120 Query: 152 FLVANEFFDSLPIKQFVMTEHGIRERMIDIDQH----DSLVFNIGDHEIKSNFLTCS--- 204 F + NEFFD+LPI QF E+ + R + +D + L F + + Sbjct: 121 FSICNEFFDALPIHQFDFNENTRQWREVIVDTDPVDKEKLRFVTAPGDTPAAKALLPMFD 180 Query: 205 --DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVS 262 D E SP Q + +R+ GG++++IDYG+ + DTL+ K H V Sbjct: 181 NDDLEGKRRVEVSPSSLIHCQWLCERIMKQGGSSLIIDYGHEGQK-EDTLRGFKNHKLVE 239 Query: 263 PLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMK---QTA 319 L NPG D+++ V+F L S+A + + TQ FL+ +GI R LM+ Sbjct: 240 VLDNPGDIDITADVNFSHLRSVAEAIGVDASESITQDAFLKNMGIETRLLMLMRATISAP 299 Query: 320 RKDILLDSVKRLVSTSADKKSMGELFKILVVSHE-----KVELMPF 360 + L + L+ K MG F+++ +SH ++ F Sbjct: 300 ARKNLSECFDYLM------KEMGPKFRVMALSHPARQKSGEQIPGF 339 >gi|73980162|ref|XP_863481.1| PREDICTED: similar to CG17726-PA isoform 2 [Canis familiaris] Length = 342 Score = 149 bits (377), Expect = 5e-34, Method: Composition-based stats. Identities = 84/356 (23%), Positives = 127/356 (35%), Gaps = 88/356 (24%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R +V IK G +TV +Y + +P ++ Sbjct: 41 TPMLRHLVYKIKATGPITVAEYMKEVLTNP---------------------------AKL 73 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L I+ I W G + +LVELGPG+G + DILRV +L + Sbjct: 74 LGIWFISEWMATGKNAAFQLVELGPGKGTLAGDILRVFSQLGSVLKNC------------ 121 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 + I T G RE IDID Sbjct: 122 --------------------------------------DISIHMVEKTPQGWREVFIDID 143 Query: 183 QH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 D L F + + D + E P +Q +S R+A GG A++ D Sbjct: 144 PQVSDKLRFVLAPCVTPAEVFIQRD-EIRDHVEVCPEAGVIIQELSQRIALTGGAALIAD 202 Query: 241 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 YG+ ++ DT + GH L PG ADL++ VDF L +A + G Q Sbjct: 203 YGHDGTKT-DTFRGFCGHKLHDVLTAPGTADLTADVDFSYLRRMAEGQ-VASLGPIKQQT 260 Query: 301 FLEGLGIWQRAFSLMKQTAR--KDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 FL+ +GI R L+ ++ + LL L+ + K MGE F + + Sbjct: 261 FLKNMGIDVRLKVLLDKSDEPARQQLLQGYDMLM----NPKKMGERFNFFALLPHQ 312 >gi|332227212|ref|XP_003262785.1| PREDICTED: protein midA homolog, mitochondrial-like isoform 2 [Nomascus leucogenys] Length = 343 Score = 148 bits (373), Expect = 1e-33, Method: Composition-based stats. Identities = 84/357 (23%), Positives = 130/357 (36%), Gaps = 89/357 (24%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P ++ Sbjct: 41 TPMLRHLIYKIKSTGPITVAEYMKEVLTNP---------------------------AKL 73 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L I+ I W G + +LVELGPGRG ++ DILRV +L + Sbjct: 74 LGIWFISEWMATGKSTAFQLVELGPGRGTLVGDILRVFTQLGSVLKNC------------ 121 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 + + T G RE +DID Sbjct: 122 --------------------------------------DISVHLVEKTPQGWREVFVDID 143 Query: 183 QH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 D L F + + D E P ++ +S R+A GG A+V D Sbjct: 144 PQVSDKLRFVLAPSATPAEAFIQHD-ETRDHVEVCPDAGVIIEELSQRIALTGGAALVAD 202 Query: 241 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 YG+ ++ DTL+ GH L+ PG ADL++ VDF L +A K+ G Q Sbjct: 203 YGHEGTKT-DTLRGFCGHKLHDVLIAPGTADLTADVDFSYLRRMA-QGKVASLGPIKQHT 260 Query: 301 FLEGLGIWQRAFSLMKQTAR---KDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 FL+ +GI R L+ ++ + LL L+ + K MGE F + + Sbjct: 261 FLKNMGIDVRLKVLLDKSNEPSVRQQLLQGYDMLM----NPKKMGERFNFFALLPHQ 313 >gi|114576974|ref|XP_001167138.1| PREDICTED: protein midA homolog, mitochondrial-like isoform 3 [Pan troglodytes] Length = 343 Score = 148 bits (373), Expect = 1e-33, Method: Composition-based stats. Identities = 82/357 (22%), Positives = 128/357 (35%), Gaps = 89/357 (24%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P ++ Sbjct: 41 TPMLRHLMYKIKSTGPITVAEYMKEVLTNP---------------------------AKL 73 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L I+ I W G + +LVELGPGRG ++ DILRV +L + Sbjct: 74 LGIWFISEWMATGKSTAFQLVELGPGRGTLVGDILRVFTQLGSVLKNC------------ 121 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 + + T G RE +DID Sbjct: 122 --------------------------------------DISVHLVEKTPQGWREVFVDID 143 Query: 183 QH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 D L F + + D E P ++ +S R+A GG A+V D Sbjct: 144 PQVSDKLRFVLAPSATPAEAFIQHD-ETRDHVEVCPDAGVIIEELSQRIALTGGAALVAD 202 Query: 241 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 YG+ ++ DT + H L+ PG ADL++ VDF L +A K+ G Q Sbjct: 203 YGHDGTKT-DTFRGFCDHKLHDVLIAPGTADLTADVDFSYLRRMA-QGKVASLGPIKQHT 260 Query: 301 FLEGLGIWQRAFSLMKQTAR---KDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 FL+ +GI R L+ ++ + LL L+ + K MGE F + + Sbjct: 261 FLKNMGIDVRLKVLLDKSNEPSVRQQLLQGYDMLM----NPKKMGERFNFFALLPHQ 313 >gi|145701028|ref|NP_001077415.1| protein midA homolog, mitochondrial isoform 3 [Homo sapiens] gi|31874135|emb|CAD97976.1| hypothetical protein [Homo sapiens] Length = 343 Score = 148 bits (373), Expect = 1e-33, Method: Composition-based stats. Identities = 82/357 (22%), Positives = 128/357 (35%), Gaps = 89/357 (24%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P ++ Sbjct: 41 TPMLRHLMYKIKSTGPITVAEYMKEVLTNP---------------------------AKL 73 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L I+ I W G + +LVELGPGRG ++ DILRV +L + Sbjct: 74 LGIWFISEWMATGKSTAFQLVELGPGRGTLVGDILRVFTQLGSVLKNC------------ 121 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 + + T G RE +DID Sbjct: 122 --------------------------------------DISVHLVEKTPQGWREVFVDID 143 Query: 183 QH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 D L F + + D E P ++ +S R+A GG A+V D Sbjct: 144 PQVSDKLRFVLAPSATPAEAFIQHD-ETRDHVEVCPDAGVIIEELSQRIALTGGAALVAD 202 Query: 241 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 YG+ ++ DT + H L+ PG ADL++ VDF L +A K+ G Q Sbjct: 203 YGHDGTKT-DTFRGFCDHKLHDVLIAPGTADLTADVDFSYLRRMA-QGKVASLGPIKQHT 260 Query: 301 FLEGLGIWQRAFSLMKQTAR---KDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 FL+ +GI R L+ ++ + LL L+ + K MGE F + + Sbjct: 261 FLKNMGIDVRLKVLLDKSNEPSVRQQLLQGYDMLM----NPKKMGERFNFFALLPHQ 313 >gi|116191777|ref|XP_001221701.1| hypothetical protein CHGG_05606 [Chaetomium globosum CBS 148.51] gi|88181519|gb|EAQ88987.1| hypothetical protein CHGG_05606 [Chaetomium globosum CBS 148.51] Length = 290 Score = 147 bits (370), Expect = 3e-33, Method: Composition-based stats. Identities = 63/210 (30%), Positives = 94/210 (44%), Gaps = 30/210 (14%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYST-----CNPFGAVGDFVTAPEIS 56 L +++ I+ G + + Y +C+ GYY+ + FG GDFVT+PEIS Sbjct: 78 STPLAKQLGEAIEATGPVPLASYMRMCLTADIGGYYTGALEEGRDQFGLKGDFVTSPEIS 137 Query: 57 QIFGEMLAIFLICAWEQHGFPSC-VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 Q+FGE+ AI+ + W G S V L+E+GPGRG +M D+ + + S+ +IYM Sbjct: 138 QVFGELCAIWYVTEWMAQGRRSKGVELIEVGPGRGTLMDDM---LRRFPAMANSIDAIYM 194 Query: 116 VETSERLTLIQKKQLASYGD---------------------KINWYTSLADVPLGFTFLV 154 VE S L + QK L S+ P F++ Sbjct: 195 VEASPELRVAQKNLLCGEDAPMTESKVGYHSVCKYNALPIVWTETIKSIPIAPEKMPFIM 254 Query: 155 ANEFFDSLPIKQFVMTEHGIRERMIDIDQH 184 A+EFFD+LPI F + G I+ Sbjct: 255 AHEFFDALPIHAFELISVGFLRAHINFLHD 284 >gi|296224082|ref|XP_002757899.1| PREDICTED: protein midA homolog, mitochondrial-like isoform 4 [Callithrix jacchus] Length = 351 Score = 146 bits (369), Expect = 4e-33, Method: Composition-based stats. Identities = 88/364 (24%), Positives = 130/364 (35%), Gaps = 96/364 (26%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P ++ Sbjct: 42 TPMLRHLMYKIKSTGPITVAEYMKEVLTNP---------------------------AKL 74 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L I+ I W G + +LVELGPGRG ++ DILRV +L + Sbjct: 75 LGIWFISEWMATGKSTAFQLVELGPGRGTLVGDILRVFSQLGSVLKNC------------ 122 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 + + T G RE IDID Sbjct: 123 --------------------------------------DISVHLVEKTPQGWREVFIDID 144 Query: 183 QH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 D L F + + D I E P ++ +S R+A GG A+V D Sbjct: 145 PQVSDKLRFVLAPCATPAEVFIQHDETRDHI-EVCPDAGVIIEELSRRIALTGGAALVAD 203 Query: 241 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 YG+ ++ DT + GH L+ PG ADL++ VDF L +A K+ G TQ Sbjct: 204 YGHDGTKT-DTFRGFCGHKLHDVLIAPGTADLTADVDFSFLRRMA-QGKVASLGPITQHT 261 Query: 301 FLEGLGIWQRAFS--------LMKQTAR--KDILLDSVKRLVSTSADKKSMGELFKILVV 350 FL+ +GI R L K + K LL L+ + K MGE F + Sbjct: 262 FLKNMGIDVRLKVRIFFFPVLLDKSNEQSVKQQLLQGYDMLM----NPKKMGERFNFFAL 317 Query: 351 SHEK 354 + Sbjct: 318 LPHQ 321 >gi|159484015|ref|XP_001700056.1| hypothetical protein CHLREDRAFT_141994 [Chlamydomonas reinhardtii] gi|158281998|gb|EDP07752.1| predicted protein [Chlamydomonas reinhardtii] Length = 375 Score = 146 bits (368), Expect = 5e-33, Method: Composition-based stats. Identities = 66/181 (36%), Positives = 99/181 (54%), Gaps = 18/181 (9%) Query: 25 FALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVE 84 C+ P+ G+Y + + FGA GDFVT+PEISQ+FGEM+ I+ + W G P + LVE Sbjct: 1 MQDCLTSPQGGFYMSRDVFGAAGDFVTSPEISQLFGEMVGIWCVHTWMALGRPPRLALVE 60 Query: 85 LGPGRGIMMLDILRVI--------CKLKPDFFSVLSIYMVETSERLTLIQKKQL-----A 131 LGPGRG ++ D+LR C F S L +++VE S L +Q + L Sbjct: 61 LGPGRGTLLADLLRGTAGEGGGGVCVSFKPFASTLELHLVEMSPALRAVQWRALGCAPDP 120 Query: 132 SYGDKINWYTSLADVPL--GFTFLVANEFFDSLPIKQFVMTEHGIR---ERMIDIDQHDS 186 + ++W+ +L VP G +A+EFFD+LP+ QFV G R E+++D+ + Sbjct: 121 AAQKCVHWHATLDAVPDGPGPALYIAHEFFDALPVHQFVRDPEGRRGWLEKLVDVQLDEQ 180 Query: 187 L 187 Sbjct: 181 P 181 >gi|289548367|ref|YP_003473355.1| hypothetical protein Thal_0594 [Thermocrinis albus DSM 14484] gi|289181984|gb|ADC89228.1| protein of unknown function DUF185 [Thermocrinis albus DSM 14484] Length = 317 Score = 146 bits (368), Expect = 5e-33, Method: Composition-based stats. Identities = 87/338 (25%), Positives = 128/338 (37%), Gaps = 30/338 (8%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + Q+ YY G DF TAPE+ +IFG LA +I E P Sbjct: 3 SFYQFMKE----KVELYYRERAGIGR--DFFTAPELDRIFGFALAEKIIPLLESISTP-- 54 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINW 139 +VELG GRG+M DIL+ + + KP + LS + ETS L Q K L Y DK+ W Sbjct: 55 -NVVELGAGRGLMAKDILQFVAERKPTLYERLSYRIYETSPLLREFQGKVLQEYRDKVMW 113 Query: 140 YTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSN 199 L +++NEFFD LP+ E + V+ D +K Sbjct: 114 LDRLEIPEE--AVVISNEFFDCLPVHVVK-------EGKELYLKDKEKVWLPCDERVKQF 164 Query: 200 FLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYL----QSRVGDTLQAV 255 + + E ++ I+D + G + IDYGY T+ Sbjct: 165 LRRMGYENIKTVVEVCLECIDLLKRIADSMKR--GYILTIDYGYTSQDLHRYPEGTVVGY 222 Query: 256 KGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG-IWQRAFSL 314 K + + DLS+ V+F L Y L L Q FL L Sbjct: 223 KEGRVYYDIFSEDLMDLSAMVNFSALVEWGEEYGLRTVFLKKQRHFLLESQSFVDELTGL 282 Query: 315 MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 + K ++ + RL + MG+ F +L+ + Sbjct: 283 S--MSEKPEDIERLSRLKTMLIS---MGDRFWVLLQAK 315 >gi|217032502|ref|ZP_03437993.1| hypothetical protein HPB128_180g1 [Helicobacter pylori B128] gi|298736515|ref|YP_003729041.1| hypothetical protein HPB8_1020 [Helicobacter pylori B8] gi|216945780|gb|EEC24403.1| hypothetical protein HPB128_180g1 [Helicobacter pylori B128] gi|298355705|emb|CBI66577.1| conserved hypothetical protein [Helicobacter pylori B8] Length = 344 Score = 146 bits (367), Expect = 7e-33, Method: Composition-based stats. Identities = 65/345 (18%), Positives = 113/345 (32%), Gaps = 23/345 (6%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + Y + + GYY G GDF T+ +S+ FG +A ++I E+ Sbjct: 11 SFGNYMQEWLYGKK-GYYRKAL-IGPKGDFYTSVSLSKFFGGAVAFYIIKLLEEEKLFLP 68 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQ---KKQLASYGDK 136 +++VE+G G + DI + L E + L +Q KQ Sbjct: 69 LKIVEIGSHHGHFLSDIASFLNALSVGVMEKCEFVSCEPLKELQKLQRTIFKQATQLDLM 128 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 I L F+V+NE FD+ + +M+ I V+ D Sbjct: 129 ICDLKDLDFKGHENAFVVSNELFDAFACEIIKDN------KMLFITHDHKGVWGGIDEPT 182 Query: 197 KSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVK 256 K + G +++ + DYG R L+A K Sbjct: 183 KELLKNL-NLKEGCTPLFLEAFIKDLLE--KLNEASSWVFLSFDYGDEVERKDMHLRAFK 239 Query: 257 GHTYVSPLV-------NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQ 309 H + ++DL+ V+F + + + + TQ L +G+ Sbjct: 240 NHQVLDFKDILNNLASLYQKSDLTYDVNFSLVRFLFEKHHAKFSFFKTQANALLDMGLMG 299 Query: 310 RAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 K + L ++ K + +GE FK L + Sbjct: 300 LLEMFSKSVGYERYLKEAAK--IKPLISPGGLGERFKALEFVKKN 342 >gi|58700280|ref|ZP_00374748.1| Uncharacterized ACR, COG1565 superfamily [Wolbachia endosymbiont of Drosophila ananassae] gi|58533203|gb|EAL57734.1| Uncharacterized ACR, COG1565 superfamily [Wolbachia endosymbiont of Drosophila ananassae] Length = 243 Score = 145 bits (366), Expect = 1e-32, Method: Composition-based stats. Identities = 72/253 (28%), Positives = 123/253 (48%), Gaps = 24/253 (9%) Query: 111 LSIYMVETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMT 170 + I++VE S L IQK++L S +NW+ ++ ++P T +ANEFFD+LPI QFV Sbjct: 1 MLIHLVEISPTLRKIQKEKLKSLD--VNWHKNIDNLPEQPTIFLANEFFDALPIDQFVYH 58 Query: 171 EHGIRERMIDIDQHD-----------SLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRD 219 + G E M+ + + +T +F GA+ E Sbjct: 59 DEGWYENMVTKQDDGSLLVSCQCVTLESRKKESWIPVSATQMTNGKFFNGAVVEICSVGV 118 Query: 220 REMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 ++ + ++ + G A+++DYGY+ TLQ++K H Y + L N G +D+++ V+FQ Sbjct: 119 EILKKLEKKIYNNKGAALIVDYGYVYPAYKSTLQSIKQHKYANFLENVGNSDITALVNFQ 178 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTA--RKDILLDSVKRLVSTSAD 337 L + TQ +FL GI +R +LMK + +K+ + RL Sbjct: 179 ALRDSLKHVDCE---ILTQREFLYLFGIKERTQALMKSASDEQKNRIFSEFLRLT----- 230 Query: 338 KKSMGELFKILVV 350 ++MG LFK +++ Sbjct: 231 -ENMGTLFKAMLL 242 >gi|171909965|ref|ZP_02925435.1| hypothetical protein VspiD_02295 [Verrucomicrobium spinosum DSM 4136] Length = 345 Score = 145 bits (365), Expect = 1e-32, Method: Composition-based stats. Identities = 69/355 (19%), Positives = 113/355 (31%), Gaps = 52/355 (14%) Query: 20 TVDQYFALCVADPEFGYYSTC-NPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPS 78 + + P+ GYYS G GDF T+ +S + GE +A ++ E+ Sbjct: 9 PFSTWMEQALFAPDTGYYSARIRTVGRRGDFATSATVSSLLGEGIARWISRELERQKGVR 68 Query: 79 CVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKIN 138 ++E+G G G + + + L+ +MVETS L Q+++L + Sbjct: 69 A--IIEVGGGDGSLSASVRSA---VGWWQRRKLAWHMVETSPILRDRQQERLKGAQVHWH 123 Query: 139 W-YTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDH--- 194 S G + NE D+ P+ + + + + Sbjct: 124 ETMESALQACGGRAIIFHNELVDAFPVTLLEWEATRGIWQEVWVVPDRAGWREELRALTL 183 Query: 195 ---------EIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY-- 243 + E MQ S R G + +DYG Sbjct: 184 PPGQRGAFSVLTQWHALNPPPQRRQRVELHRSFRDWMQGWSSRW--QQGAMLTVDYGDLF 241 Query: 244 ---LQSRVGDTLQAVKGHTY---VSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTT 297 R TL+ H N G+ D+++ V+F L + T Sbjct: 242 PGLYHRRAQGTLRGYLLHQRLSGPDLYQNMGRQDITADVNFSDLLHWGETLGWGDGQVET 301 Query: 298 QGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADK-----KSMGELFKI 347 Q FL K + D KRL S+ AD+ + GE FK+ Sbjct: 302 QAAFL------------------KTWVPDLEKRLKSSGADQFVAGLEGAGEAFKV 338 >gi|297265817|ref|XP_001108210.2| PREDICTED: protein midA homolog, mitochondrial-like isoform 1 [Macaca mulatta] Length = 344 Score = 144 bits (362), Expect = 3e-32, Method: Composition-based stats. Identities = 82/355 (23%), Positives = 126/355 (35%), Gaps = 89/355 (25%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P ++ Sbjct: 42 TPMLRHLMYKIKSTGPITVAEYMKEVLTNP---------------------------AKL 74 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L I+ I W G + +LVELGPGRG ++ DILRV +L + Sbjct: 75 LGIWFISEWMATGKSTAFQLVELGPGRGTLVGDILRVFTQLGSVLKNC------------ 122 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 + + T G RE IDID Sbjct: 123 --------------------------------------DISVHLVEKTPQGWREVFIDID 144 Query: 183 QH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVID 240 D L F + + D E P ++ +S R+A GG A+V D Sbjct: 145 PQVSDKLRFVLAPSATPAEAFIQHD-ETRDHVEVCPDAGVIIEELSQRIALTGGAALVAD 203 Query: 241 YGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 YG+ ++ + GH L+ PG ADL++ VDF L +A K+ G Q Sbjct: 204 YGHDGTKTXM-FKGFCGHKLHDVLIAPGTADLTADVDFSYLRRMA-QGKVASLGPIKQHT 261 Query: 301 FLEGLGIWQRAFSLMKQTAR---KDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 FL+ +GI R L+ ++ + LL L+ + K MGE F + Sbjct: 262 FLKNMGIDVRLKVLLDKSNEPSVRQQLLQGYDMLM----NPKKMGERFNFFALLP 312 >gi|15645431|ref|NP_207605.1| hypothetical protein HP0812 [Helicobacter pylori 26695] gi|2313954|gb|AAD07871.1| predicted coding region HP0812 [Helicobacter pylori 26695] Length = 336 Score = 144 bits (362), Expect = 3e-32, Method: Composition-based stats. Identities = 67/345 (19%), Positives = 115/345 (33%), Gaps = 23/345 (6%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + Y + + GYY G GDF T+ +S+ FG +A ++I E+ Sbjct: 3 SFGNYMQEWLYGEK-GYYRKAL-IGQKGDFYTSVSVSKFFGGAVAFYIIKLLEEEKLFLP 60 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQ---KKQLASYGDK 136 +++VE+G G + DI + L + E + L +Q KQ Sbjct: 61 LKIVEIGSHHGHFLSDIASFLNALSVGVMEQCAFVSCEPLKELQKLQRTIFKQATQLDLM 120 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 I L F+V+NE FD+ + +M+ I V+ D Sbjct: 121 ICDLKDLDFKGHENAFVVSNELFDAFACEIVKDN------KMLFIAHDHKGVWGAIDEPT 174 Query: 197 KSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVK 256 K D G +++ + DYG R L+A K Sbjct: 175 KELLKNL-DLKQGCAPLFLEAFIKDLLE--KLNEASSWVFLSFDYGDETERKDMHLRAFK 231 Query: 257 GHTYVSPLV-------NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQ 309 H + Q+DL+ V+F + + + + TQ L +G+ Sbjct: 232 NHQALDFKDILNNLASLYQQSDLTYDVNFSLVRFLFEKHHAQFSFFKTQANALLDMGLMG 291 Query: 310 RAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 + K + L ++ K + +GE FK L + Sbjct: 292 LLETFSKSVGYERYLKEAAK--IKPLISPGGLGERFKALEFVKKN 334 >gi|195329890|ref|XP_002031643.1| GM23933 [Drosophila sechellia] gi|194120586|gb|EDW42629.1| GM23933 [Drosophila sechellia] Length = 169 Score = 143 bits (361), Expect = 4e-32, Method: Composition-based stats. Identities = 44/104 (42%), Positives = 66/104 (63%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEML 63 L +++ I G + V +Y + +P+ GYY + FG GDF+T+PEISQIFGE++ Sbjct: 41 SLAKQLRAKILSTGPIPVAEYMREVLTNPQAGYYMNRDVFGREGDFITSPEISQIFGELV 100 Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDF 107 I+L+ W + G PS LVELGPGRG + D+L+V+ K K ++ Sbjct: 101 EIWLVSEWRKMGCPSPFLLVELGPGRGTLARDVLKVLTKFKQEY 144 >gi|313234207|emb|CBY10275.1| unnamed protein product [Oikopleura dioica] Length = 271 Score = 143 bits (360), Expect = 5e-32, Method: Composition-based stats. Identities = 81/270 (30%), Positives = 122/270 (45%), Gaps = 18/270 (6%) Query: 58 IFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM-- 115 +FGE++ +++ W G P +ELGPGRG + D L I L + Sbjct: 1 MFGELIGAWILQEWTIAGKPKSFEYLELGPGRGTLAKDALNAIQTLLNKVDDDKKTMINV 60 Query: 116 --VETSERLTLIQKKQLASYGDKIN--WYTSLAD--VPLGFTFLVANEFFDSLPIKQFVM 169 VE S L+ Q + LAS G I+ WY S +F + NEFFD+LPI QF Sbjct: 61 KFVEVSPVLSKKQAETLASNGSNISAEWYRSFETLPESEDISFSICNEFFDALPIHQFDF 120 Query: 170 TEHGIRERMIDIDQH----DSLVFNIGDHEIKSNFLTCS-----DYFLGAIFENSPCRDR 220 E+ + R + +D + L F + + D E SP Sbjct: 121 NENTRQWREVIVDTDPVDKEKLRFVTAPGDTPAAKALLPMFDNDDLEGKRRVEVSPSSLI 180 Query: 221 EMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQR 280 Q + +R+ GG++++IDYG+ + DTL+ K H V L NPG D+++ V+F Sbjct: 181 HCQWLCERIMKQGGSSLIIDYGHEGQK-EDTLRGFKNHKLVEILDNPGDIDITADVNFSH 239 Query: 281 LSSIAILYKLYINGLTTQGKFLEGLGIWQR 310 L S+A + + TQ FL+ +GI R Sbjct: 240 LRSVAEAIGVDASESITQDAFLKNMGIETR 269 >gi|194376780|dbj|BAG57536.1| unnamed protein product [Homo sapiens] Length = 214 Score = 142 bits (359), Expect = 5e-32, Method: Composition-based stats. Identities = 53/131 (40%), Positives = 80/131 (61%), Gaps = 1/131 (0%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P GYY + G GDF+T+PEISQIFGE+ Sbjct: 41 TPMLRHLMYKIKSTGPITVAEYMKEVLTNPAKGYYVYRDMLGEKGDFITSPEISQIFGEL 100 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV-LSIYMVETSER 121 L I+ I W G + ++LVELGPGRG ++ DILRV +L + +S+++VE S++ Sbjct: 101 LGIWFISEWMATGKSTALQLVELGPGRGTLVGDILRVFTQLGSVLKNCDISVHLVEVSQK 160 Query: 122 LTLIQKKQLAS 132 L+ IQ L Sbjct: 161 LSEIQALTLTK 171 >gi|288818826|ref|YP_003433174.1| hypothetical protein HTH_1524 [Hydrogenobacter thermophilus TK-6] gi|288788226|dbj|BAI69973.1| hypothetical protein HTH_1524 [Hydrogenobacter thermophilus TK-6] gi|308752413|gb|ADO45896.1| protein of unknown function DUF185 [Hydrogenobacter thermophilus TK-6] Length = 315 Score = 142 bits (359), Expect = 6e-32, Method: Composition-based stats. Identities = 78/339 (23%), Positives = 126/339 (37%), Gaps = 33/339 (9%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + + V + YY DF TAPE+ + FG L+ L P Sbjct: 3 SFRDFMEERVRE----YYLQRKV---GDDFFTAPELDRSFGRALSDHLYQFVRHADNPL- 54 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINW 139 ++ELG G G + DIL + + L+ Y+ E S L +QK +L + K+ W Sbjct: 55 --ILELGGGNGSLAYDILSFFREKDNKLYGKLTYYIYEESPTLVSLQKNRLREFEGKVFW 112 Query: 140 YTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSN 199 L +++NEFFD LP+ + + +D ++ +I D + Sbjct: 113 TQELITEAD---IVLSNEFFDCLPVHVIKGRKE------LYVDDGRAIWEDITDERTLTF 163 Query: 200 FLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSR----VGDTLQAV 255 LG + E ++ +S+ + G IVIDYGY T+ Sbjct: 164 LDRMGYSALGQVIEVCLDCIDLLRRLSNIV---RGYHIVIDYGYTSEEIAKYPNGTVVGY 220 Query: 256 KGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSL 314 K H + L D+S+ V+F L + L +FL L Sbjct: 221 KAHRVVMDVLKEAPPFDMSAMVNFSALVEYGKDFGLLSVSFQNMREFLLSS--STFLEEL 278 Query: 315 MK-QTARKDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 K + K ++ + RL + MGE FK+L+ Sbjct: 279 EKLSLSEKAEDIERLSRLKTMLIS---MGERFKVLIQKK 314 >gi|317012623|gb|ADU83231.1| hypothetical protein HPLT_04105 [Helicobacter pylori Lithuania75] Length = 336 Score = 142 bits (357), Expect = 9e-32, Method: Composition-based stats. Identities = 67/345 (19%), Positives = 114/345 (33%), Gaps = 23/345 (6%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + Y + + GYY G GDF T+ +S+ FG +A ++I E+ Sbjct: 3 SFGNYMQEWLYGEK-GYYRKAL-IGPKGDFYTSVSLSKFFGGAVAFYIIKLLEEEKLFLP 60 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQ---KKQLASYGDK 136 +++VE+G G + DI + L E + L +Q KQ Sbjct: 61 LKIVEIGSHHGHFLSDIASFLNALSVGVMEKCEFVSCEPLKELQKLQRTIFKQATQLDLM 120 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 I L F+V+NE FD+ + +M+ I V+ D Sbjct: 121 ICDLKDLDFKGHESAFVVSNELFDAFACEIIKGN------KMLFITHDHKGVWGGIDENT 174 Query: 197 KSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVK 256 K D G +++ + DYG R L+A K Sbjct: 175 KELLKNL-DLKQGCAPLFLEAFIKDLLE--KLNEASSWVFLSFDYGDETERKDMHLRAFK 231 Query: 257 GHTYVSPLV-------NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQ 309 H + Q+DL+ V+F + + + + TQ L +G+ Sbjct: 232 NHQALDFKDILNNLASLYQQSDLTYDVNFSLVRFLFEKHHAKFSFFKTQANALLDMGLMG 291 Query: 310 RAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 + K + L ++ K + +GE FK L + Sbjct: 292 LLETFSKSVGYERYLKEAAK--IKPLISPGGLGERFKALEFVKKN 334 >gi|308182968|ref|YP_003927095.1| hypothetical protein HPPC_04095 [Helicobacter pylori PeCan4] gi|308065153|gb|ADO07045.1| hypothetical protein HPPC_04095 [Helicobacter pylori PeCan4] Length = 336 Score = 141 bits (356), Expect = 1e-31, Method: Composition-based stats. Identities = 64/345 (18%), Positives = 115/345 (33%), Gaps = 23/345 (6%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + Y + + GYY G GDF T+ +S+ FG +A ++I E+ Sbjct: 3 SFGNYMQEWLYGEK-GYY-KKALIGPKGDFYTSVSLSKFFGGAIAFYIIKLLEEEKLFLP 60 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQ---KKQLASYGDK 136 +++VE+G G + DI + L E + L +Q KQ Sbjct: 61 LKIVEIGSHHGHFLSDIASFLNALSVGVMEKCEFISCEPLKELQKLQRTIFKQATQLDLM 120 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 I L F+V+NE FD+ + +M+ I V+ + Sbjct: 121 ICDLKDLDFKRHESAFVVSNELFDAFACEIVKDD------QMLFITHDHQGVWGGINEPT 174 Query: 197 KSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVK 256 K + G +++ + DYG R L+A K Sbjct: 175 KELLKNL-NLKQGCAPLFLEAFIKDLLE--KLNEASSWVFLSFDYGDEVERKDMHLRAFK 231 Query: 257 GHTYVSPLV-------NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQ 309 H + Q+DL+ V+F + + + + +Q L +G+ Sbjct: 232 NHQALDFKDILNHLASLYQQSDLTYDVNFSLVRFLFEKHHAQFSFFKSQANALLDMGLMG 291 Query: 310 RAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 + K + + L ++ K + +GE FK L + Sbjct: 292 LLETFSKSVSYERYLKEAAK--IKPLISPGGLGERFKALEFVKKN 334 >gi|317011029|gb|ADU84776.1| hypothetical protein HPSA_03920 [Helicobacter pylori SouthAfrica7] Length = 333 Score = 140 bits (352), Expect = 3e-31, Method: Composition-based stats. Identities = 65/344 (18%), Positives = 114/344 (33%), Gaps = 23/344 (6%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + Y + + GYY G GDF T+ +S+ FG +A ++I E+ Sbjct: 3 SFGNYMQEWLYGKK-GYY-RKAKIGQKGDFYTSVSLSKFFGGAMAFYIIKLLEEEKLFLP 60 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQ---KKQLASYGDK 136 +++VE+G G + DI + L E + L IQ KQ Sbjct: 61 LKIVEIGSHHGHFLSDIANFLNALSVGVMEKCEFVSCEPLKELQNIQRTIFKQATQLDLI 120 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 L F+V+NE FD+ + +M+ I V+ D Sbjct: 121 SCALEELDFKEKKSAFVVSNELFDAFACEIVKDD------QMLFITHDHQGVWGGIDEPT 174 Query: 197 KSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVK 256 K + G + + + + DYG R L+A K Sbjct: 175 KELLKNL-NLKEGCVPLFLEAFIKNLLE--KLNEASSWVFLSFDYGDEIERKDLHLRAFK 231 Query: 257 GHTYVSPLV-------NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQ 309 H + Q+DL+ V+F + + + + +Q L +G+ Sbjct: 232 NHQALDFKDILNHLASLYQQSDLTYDVNFSLVRFLFEKHHAQFSFFKSQANALLDMGLMG 291 Query: 310 RAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHE 353 + K + + L ++ K + +GE FK L + Sbjct: 292 LLEAFSKSVSYERYLKEAAK--IKPLISPGGLGERFKALEFVKK 333 >gi|210135011|ref|YP_002301450.1| hypothetical protein HPP12_0818 [Helicobacter pylori P12] gi|210132979|gb|ACJ07970.1| hypothetical protein HPP12_0818 [Helicobacter pylori P12] Length = 334 Score = 140 bits (352), Expect = 3e-31, Method: Composition-based stats. Identities = 66/345 (19%), Positives = 115/345 (33%), Gaps = 23/345 (6%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + Y + + GYY G GDF T+ +S+ FG +A ++I E+ Sbjct: 3 SFGNYMQEWLYGEK-GYY-KKALIGQKGDFYTSVSLSKFFGGAVAFYIIKLLEEEKLFLP 60 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQ---KKQLASYGDK 136 +++VE+G G + DI + L + E + L +Q KQ Sbjct: 61 LKIVEIGSHHGHFLSDIANFLNALSVGVMEKCAFISCEPLKELQKLQRTIFKQATQLDLV 120 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 I L F+++NE FD+ + +M+ I V+ D Sbjct: 121 ICDLKDLDFKGHENAFVISNELFDAFACEIIKGN------KMLFITHDYKGVWGGIDEPT 174 Query: 197 KSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVK 256 K D G + + + DYG R L+A K Sbjct: 175 KELLKNL-DLKQGCAPLFLEAFIKGLLE--KLNEASSWVFLSFDYGDELERKDMHLRAFK 231 Query: 257 GHTYVSPLV-------NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQ 309 H + Q+DL+ V+F + + + + +Q L +G+ Sbjct: 232 NHQALDFKDILNNLASLYQQSDLTYDVNFSLVRFLFEKHHAQFSFFKSQANALLDMGLMG 291 Query: 310 RAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 + K + L ++ K + +GE FK L +K Sbjct: 292 LLETFSKSVGYERYLKEAAK--IKPLISPGGLGERFKALEFVKKK 334 >gi|317014221|gb|ADU81657.1| hypothetical protein HPGAM_04190 [Helicobacter pylori Gambia94/24] Length = 333 Score = 139 bits (350), Expect = 6e-31, Method: Composition-based stats. Identities = 64/344 (18%), Positives = 112/344 (32%), Gaps = 23/344 (6%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + Y + + GYY G GDF T+ +S+ FG +A ++I E+ Sbjct: 3 SFGNYMQEWLYGEK-GYYRKAL-IGPKGDFYTSVSLSKFFGGAIAFYIIRLLEEEKLFLP 60 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQ---KKQLASYGDK 136 +++VE+G G + DI + L E + L IQ KQ Sbjct: 61 LKIVEIGSHHGHFLSDIASFLNALSVGVMEKCEFVSCEPLKELQNIQRTIFKQATQLDLI 120 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 L F+V+NE FD+ + +M+ I V+ + Sbjct: 121 SCALEELDFKEKKSAFVVSNELFDAFACEIIKDN------QMLFITHDHQGVWGDINEPT 174 Query: 197 KSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVK 256 K + L + + DYG R L+A K Sbjct: 175 K---ELLKNLNLKEGCAPLFLSAFIKNLLEKLNEASSWVFLSFDYGDETERKDMHLRAFK 231 Query: 257 GHTYVSPLV-------NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQ 309 H + Q+DL+ V+F + + + + +Q L +G+ Sbjct: 232 NHQALDFKDILNHLASLYQQSDLTYDVNFSLVRFLFEKHDARFSFFKSQANALLDMGLMG 291 Query: 310 RAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHE 353 + K + + L ++ K + +GE FK L + Sbjct: 292 LLETFSKSVSYERYLKEAAK--IKPLISPGGLGERFKALEFVKK 333 >gi|108563222|ref|YP_627538.1| hypothetical protein HPAG1_0797 [Helicobacter pylori HPAG1] gi|107836995|gb|ABF84864.1| hypothetical protein HPAG1_0797 [Helicobacter pylori HPAG1] Length = 336 Score = 139 bits (350), Expect = 7e-31, Method: Composition-based stats. Identities = 67/345 (19%), Positives = 117/345 (33%), Gaps = 23/345 (6%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + Y + + GYY G GDF T+ +S+ FG +A ++I E+ Sbjct: 3 SFGNYMQEWLYGEK-GYY-KKALIGQKGDFYTSVSLSKFFGGAMAFYIIKLLEEEKLFLP 60 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQ---KKQLASYGDK 136 +++VE+G G + DI + L + E + L +Q KQ Sbjct: 61 LKIVEIGSHHGHFLSDIANFLNALSVGVMEKCAFVSCEPLKELQKLQRTIFKQATQLDLT 120 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 I L TF+V+NE FD+ + +M+ I V+ D Sbjct: 121 ICDLRDLDFKGHESTFVVSNELFDAFACEIIKDN------QMLFITHDHQGVWGAIDEPT 174 Query: 197 KSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVK 256 K + G +++ + DYG R L+A K Sbjct: 175 KELLKNL-NLKQGCAPLFLEAFIKDLLE--KLNEASSWVFLSFDYGDELERKDMHLRAFK 231 Query: 257 GHTYVSPLV-------NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQ 309 H + Q+DL+ V+F + + + + TQ L +G+ Sbjct: 232 NHQALDFKDILNHLASLYQQSDLTYDVNFSLVRFLFEKHHAQFSFFKTQANALLDMGLMG 291 Query: 310 RAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 + K + L ++ K + + +GE FK L + Sbjct: 292 LLETFSKSVGYERYLKEAAK--IKPLISPEGLGERFKALEFVKKN 334 >gi|315452525|ref|YP_004072795.1| hypothetical protein HFELIS_01210 [Helicobacter felis ATCC 49179] gi|315131577|emb|CBY82205.1| putative uncharacterized protein [Helicobacter felis ATCC 49179] Length = 326 Score = 139 bits (349), Expect = 8e-31, Method: Composition-based stats. Identities = 74/341 (21%), Positives = 117/341 (34%), Gaps = 36/341 (10%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + + GYY G GDF T+ S+ FG LA +L+ E+ Sbjct: 4 PFSDLMHAWLYG-DGGYY-KKARIGTQGDFYTSVSASKFFGGTLAFYLLGLLEKGLLTLP 61 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINW 139 + +VE+G G ++ D+L + L + VE E L +Q+K+LA G + Sbjct: 62 LSVVEMGAHGGELLGDVLSFLRALSQGVLEQVEFVSVEPLEELRTLQQKRLAQMGSNLRC 121 Query: 140 YTSLADVPLGFT---FLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 S D+ + T F+ NE +DS P + F HE Sbjct: 122 VASPLDLNIPPTQSVFIYNNELWDSFPCELVR----------PGAQLFVDAHFKPFWHEA 171 Query: 197 KSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIV-IDYGYLQSRVGDTLQAV 255 C P + + ++ + L +V DYG +R L+ Sbjct: 172 SYTQEGC-----------MPHWEACISALLNALEKTKAWVLVSFDYGQYGARDAIDLRGY 220 Query: 256 KGHTYVS-------PLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIW 308 H + DL+ VDFQRL S+ + TQ L +G+ Sbjct: 221 FQHRVFDFEEILSNLNELYQKIDLTYDVDFQRLESLINQQGAHTLFYGTQSLALVRMGLP 280 Query: 309 QRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILV 349 Q ++ K D ++GE FK L+ Sbjct: 281 QLLELFGAHMPFSTYQKEAFKA--RALLDPSALGERFKALI 319 >gi|308184597|ref|YP_003928730.1| hypothetical protein HPSJM_04115 [Helicobacter pylori SJM180] gi|308060517|gb|ADO02413.1| hypothetical protein HPSJM_04115 [Helicobacter pylori SJM180] Length = 335 Score = 139 bits (349), Expect = 8e-31, Method: Composition-based stats. Identities = 64/345 (18%), Positives = 119/345 (34%), Gaps = 24/345 (6%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + Y + + GYY G GDF T+ +S+ FG +A ++I E+ Sbjct: 3 SFGNYMQEWLYGEK-GYYRKAL-IGPKGDFYTSVSLSKFFGGAVAFYIIRLLEEEKLFLP 60 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYG--DKI 137 +++VE+G G + DI + L E + L +Q+ D Sbjct: 61 LKIVEIGSHHGHFLSDIASFLNALSVGVMEQCEFVSCEPLKELQKLQRTIFKQATQLDLS 120 Query: 138 NWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIK 197 + D F+V+NE FD+ + +M+ I V+ D K Sbjct: 121 SCSLEELDFKEKSAFVVSNELFDAFACEIIKDN------KMLFITHDHKGVWGGIDEPTK 174 Query: 198 SNFLTCSDYFLGAIFENSPCRDREMQSISDRLAC-DGGTAIVIDYGYLQSRVGDTLQAVK 256 D G + ++ + ++L + DYG R L+A K Sbjct: 175 ELLKNL-DLKQGC---APLFLEAFIKDLLEKLDEASSWVFLSFDYGDEIERKDMHLRAFK 230 Query: 257 GHTYVSPLV-------NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQ 309 H + ++DL+ V+F + + + + +Q L +G+ Sbjct: 231 NHQALDFKDILNNLASLYQKSDLTYDVNFSLVRFLFEKHHAKFSFFKSQANALLDMGLMG 290 Query: 310 RAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 + K + + L ++ K + +GE FK L + Sbjct: 291 LLETFSKSVSYERYLKEAAK--IKPLISPGGLGERFKALEFVKKN 333 >gi|207092305|ref|ZP_03240092.1| hypothetical protein HpylHP_04941 [Helicobacter pylori HPKX_438_AG0C1] Length = 334 Score = 138 bits (348), Expect = 1e-30, Method: Composition-based stats. Identities = 66/339 (19%), Positives = 110/339 (32%), Gaps = 23/339 (6%) Query: 25 FALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVE 84 + + GYY G GDF T+ +S+ FG +A ++I E+ +++VE Sbjct: 1 MQEWLYGEK-GYY-KKALIGPKGDFYTSVSLSKFFGGAIAFYIIKLLEEEKLFLPLKIVE 58 Query: 85 LGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQ---KKQLASYGDKINWYT 141 +G G + DI + L E + L +Q KQ I Sbjct: 59 IGAHHGHFLSDIANFLNALSVGVMEQCEFISCEPLKELQKLQRIIFKQATQLDLMICDLK 118 Query: 142 SLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFL 201 L F+V+NE FD+ + +M+ I V+ D K Sbjct: 119 DLDFKGHESAFVVSNELFDAFACEIIKDN------QMLFITHDHQGVWGAIDEPTKELLK 172 Query: 202 TCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYV 261 D G +++ + DYG R L+A K H + Sbjct: 173 NL-DLKQGCAPLFLEAFIKDLLE--KLNEAYSWVFLSFDYGDETERKDMHLRAFKNHQAL 229 Query: 262 SPLV-------NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSL 314 Q+DL+ V+F + + + TQ L +G+ Sbjct: 230 DFKDILNHLASLYQQSDLTYDVNFSLVRFLFEKNHAKFSFFKTQANALLDMGLMGLLEIF 289 Query: 315 MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHE 353 K + L ++ K + +GE FK L + Sbjct: 290 SKSVGYERYLKEAAK--IKPLISPGGLGERFKALEFVKK 326 >gi|242308833|ref|ZP_04807988.1| conserved hypothetical protein [Helicobacter pullorum MIT 98-5489] gi|239524624|gb|EEQ64490.1| conserved hypothetical protein [Helicobacter pullorum MIT 98-5489] Length = 334 Score = 137 bits (345), Expect = 3e-30, Method: Composition-based stats. Identities = 70/343 (20%), Positives = 124/343 (36%), Gaps = 36/343 (10%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + E GYY + G GDF T+ +S FG +A F+ + + Sbjct: 3 PFSHLMQQWLYGKE-GYYQN-HKIGKDGDFYTSVSVSPFFGYCIANFIADFFTKLPPLQK 60 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQL----ASYGD 135 + +VE+G +G ++ DI + F+ LS + +E + L QK + Sbjct: 61 IAIVEIGADKGYLISDIASFLAHNP--LFAKLSFHTLEPLKNLQTTQKSTFYSKTSQTLH 118 Query: 136 KINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHE 195 ++ L FTF ++NE DS + + E + Q +SL+F E Sbjct: 119 TLDSPKDLQAQNYDFTFFISNELLDSFACELYYKGE-------MAYLQDNSLLFAPASKE 171 Query: 196 IKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAV 255 I + I E + + S++ + DYG L++R +L+ Sbjct: 172 II----EIAQAMELEIGEIPLHLESFLASLTQ--YTPSFAFLTFDYGDLKARNAFSLRFY 225 Query: 256 KGHTYVSPLVNPGQADL-------------SSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 + HT + +NP D + V F L ++ Q K L Sbjct: 226 QNHTTNNLFLNPTSKDYYPNFLESFGKSDITYEVHFDYLKNLFKAMNAKELFFGRQNKIL 285 Query: 303 EGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELF 345 +G+ + ++ + + S K + T D +GE F Sbjct: 286 VDMGLDKVGEWYIQNFGLESFMHHSPK--IRTLIDPAFLGERF 326 >gi|32266469|ref|NP_860501.1| hypothetical protein HH0970 [Helicobacter hepaticus ATCC 51449] gi|32262520|gb|AAP77567.1| conserved hypothetical protein [Helicobacter hepaticus ATCC 51449] Length = 365 Score = 137 bits (344), Expect = 4e-30, Method: Composition-based stats. Identities = 61/351 (17%), Positives = 111/351 (31%), Gaps = 30/351 (8%) Query: 18 QMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFP 77 + + + + GYY+ + G GDF T+ S+ FG +A +++ E+ Sbjct: 10 PIPFSTFMKKSLYG-QNGYYTNPHRVGKSGDFYTSVSASKFFGGAIASYILNLVEKGHLN 68 Query: 78 SCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKI 137 +R+VE+G G ++ D+ + L F +E L QK ++ Sbjct: 69 LPLRIVEIGANNGYLLGDVALFLDALSTHFLRDCEFATIEPLPSLREAQKSHFSTLRFST 128 Query: 138 NWYTSLADVPLGFT-----FLVANEFFDSLPIKQFV--MTEHGIRERMIDIDQHDSLVFN 190 N + + F+++NE FDS+ + +E + N Sbjct: 129 NISFNAFETLPLPEKSSDIFILSNELFDSIACDVMQDKKILYITQEEKNWQGLWQEIDTN 188 Query: 191 IGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACD-GGTAIVIDYG------- 242 + + + E P ++ +S + DYG Sbjct: 189 PLPSQSAVLLNLLKNNPMPYKQEILPHWIPLIEQLSYIAKAHKKSYFLTFDYGTKNLTNP 248 Query: 243 --YLQSRVGDTLQAVKG---HTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTT 297 Y ++ AD++ VD L+ + Y + Sbjct: 249 LSYNPRFYHSHAVMNLKDILGQNINFYSLYQNADITYDVDISLLNILFSHYGFELVFEDY 308 Query: 298 QGKFLE-GLGIWQRAFSLMKQTARKDIL--LDSVKRLVSTSADKKSMGELF 345 Q K L +GI S + L + VK L+ T MG F Sbjct: 309 QAKVLIEQMGILSLLESFERAQGFATYLKEIHKVKTLLHT------MGGRF 353 >gi|308229541|gb|ADO24188.1| hypothetical protein [Aquaspirillum serpens] Length = 287 Score = 136 bits (341), Expect = 7e-30, Method: Composition-based stats. Identities = 55/264 (20%), Positives = 103/264 (39%), Gaps = 21/264 (7%) Query: 112 SIYMVETSERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQF 167 Y+VE S Q+ L ++ W +SL + ++ NE D++P + Sbjct: 23 HYYIVEVSADXAARQRDYLTQQCPELLSRVVWLSSLPEQIE--AVVIGNEVLDAIPCELV 80 Query: 168 VMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG--AIFENSPCRDREMQSI 225 + D V+ + + G + E M ++ Sbjct: 81 YWDPTQQPWQRGVAWSEDGFVWADRPIQDPRLQAAVALLEPGPDYLSEVQLAAAGFMHTL 140 Query: 226 SDRLACDGGTAIVI-----DYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQR 280 ++RL C +Y + Q R G + + H + P + PG D+++H+DF Sbjct: 141 AERLQCGAILLFDYGFPRDEYYHPQRRQGTLMCHYRHHAHADPFLWPGLQDITTHIDFTT 200 Query: 281 LSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLM-KQTARKDILLDSVKRLVSTSADKK 339 ++ I + + L + G TTQ ++L GI + + ++V++L+ST Sbjct: 201 VAEIGLQHGLDLQGYTTQAQYLINAGILDQLAQFNPEDITHYLPHANAVQKLLST----A 256 Query: 340 SMGELFKILVVSHEKVEL--MPFV 361 MGELFK++ S + L F+ Sbjct: 257 EMGELFKVIGFSR-GLPLEWQGFL 279 >gi|219127089|ref|XP_002183776.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] gi|217405013|gb|EEC44958.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] Length = 349 Score = 136 bits (341), Expect = 7e-30, Method: Composition-based stats. Identities = 87/351 (24%), Positives = 133/351 (37%), Gaps = 65/351 (18%) Query: 41 NPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQH------GFPSCVRLVELGPGRGIM-- 92 N G GDFVTAPE+SQ+FGE L I+ ++ G + +E GPG+G + Sbjct: 1 NIIGPQGDFVTAPEMSQVFGECLGIWFYDQHKKLQKAKADGKRLDWQWLECGPGKGTLVS 60 Query: 93 -MLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQL--------------------- 130 +L + +++VE+S L +QK+ L Sbjct: 61 DLLRFACYGKIRHEFGATCKHVHLVESSPILRQVQKETLQRDLRDVAELEFVEESGIPEN 120 Query: 131 -ASYGDKINWYTSLADVP--------LGFTFLVANEFFDSLPIKQFVMTEH-GIRERMID 180 +++W+ S A T+ V EF D+LP QF T RER+ID Sbjct: 121 RNPNAVQVHWHDSFASFRAWQKQSTSRLTTYAVGQEFLDALPTYQFEKTADGTWRERLID 180 Query: 181 -----IDQHDSLVFNIGDHEIK--------------SNFLTCSDYFLGAIFENSPCRDRE 221 + I + + G++ E SP Sbjct: 181 VALKHLPNAKKPRLRIVLAPLVTVPLKTLLQVDGDGRMLNEPNFAQTGSVVEVSPEAILL 240 Query: 222 MQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRL 281 ++ ++ + GG A+ IDYG S DT++A H V L PGQ DL++ VDF L Sbjct: 241 VKDVATLVDEQGGAALFIDYGQEGS--ADTIRAFAKHEQVHFLSRPGQVDLTADVDFSAL 298 Query: 282 SSIAILYKLYI----NGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSV 328 + G QG FL +G R L+++ + D D + Sbjct: 299 KHAVNALQTRHQTRAFGPVGQGHFLMSMGASDRVLQLIERDSTTDKEADDL 349 >gi|86827512|gb|AAI12869.1| LOC504290 protein [Bos taurus] Length = 144 Score = 135 bits (340), Expect = 9e-30, Method: Composition-based stats. Identities = 37/91 (40%), Positives = 56/91 (61%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P GYY + G GDF+T+PEISQ+FGE+ Sbjct: 41 TPMLRHLIYKIKSTGPITVAEYMKEVLTNPAKGYYMNRDMLGEEGDFITSPEISQMFGEL 100 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMM 93 L I+ I W G + +LVELGPG+G ++ Sbjct: 101 LGIWFISEWIAAGKNAAFQLVELGPGKGTLL 131 >gi|307637502|gb|ADN79952.1| hypothetical conserved protein [Helicobacter pylori 908] gi|325996091|gb|ADZ51496.1| hypothetical protein hp2018_0794 [Helicobacter pylori 2018] gi|325997687|gb|ADZ49895.1| hypothetical protein hp2017_0793 [Helicobacter pylori 2017] Length = 333 Score = 135 bits (339), Expect = 1e-29, Method: Composition-based stats. Identities = 66/344 (19%), Positives = 113/344 (32%), Gaps = 23/344 (6%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + Y + + GYY G GDF T+ +S+ FG +A ++I E+ Sbjct: 3 SFGNYMQEWLYGEK-GYYRKAL-IGPKGDFYTSVSLSKFFGGAIAFYIIKLLEEEKLFLP 60 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQ---KKQLASYGDK 136 +++VE+G G + DI + L E + L IQ KQ Sbjct: 61 LKIVEIGSHHGHFLSDIASFLNALSVGVMEKCEFISCEPLKELQNIQRTIFKQATQLDLI 120 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 L F+V+NE FDS + +M+ I V+ D Sbjct: 121 SCALEELDFKEKKSAFVVSNELFDSFACEIIKDN------QMLFITHDHQGVWGGIDEPT 174 Query: 197 KSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVK 256 K + G +++ + DYG R L+A K Sbjct: 175 KELLKNL-NLKEGCAPLFLEAFIKDLLE--KLNEASSWVFLSFDYGDELERKDLHLRAFK 231 Query: 257 GHTYVSPLV-------NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQ 309 H + Q+DL+ V+F + + + + +Q L +G+ Sbjct: 232 NHQALDFKDILNHLASLYQQSDLTYDVNFSLVRFLFEKHHAQFSFFKSQANALLDMGLMG 291 Query: 310 RAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHE 353 K + + L ++ K + +GE FK L + Sbjct: 292 LLEVFSKSVSYERYLKEAAK--IKPLISPGGLGERFKALEFVKK 333 >gi|154175261|ref|YP_001408105.1| hypothetical protein CCV52592_1389 [Campylobacter curvus 525.92] gi|112803127|gb|EAU00471.1| conserved hypothetical protein [Campylobacter curvus 525.92] Length = 328 Score = 135 bits (339), Expect = 1e-29, Method: Composition-based stats. Identities = 66/342 (19%), Positives = 123/342 (35%), Gaps = 27/342 (7%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCV 80 ++F + + + YY G GDF T+ + +FG A + + + + Sbjct: 3 FSEFFEIWLHER---YYKNGANIGKKGDFYTSVSVGWLFGAAHANYFLKCLDAKELSAKC 59 Query: 81 RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYG---DKI 137 +VE+G G M+ D ++ + L+P+ L+ +VE E L IQ + + K+ Sbjct: 60 SIVEIGANSGDMLADFVQGVFTLRPEILGELNFAIVEPHEILREIQLQTFKARFGDEVKL 119 Query: 138 NWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIK 197 + + + L F+++NE FDS + E M+ I+++ ++ D EI Sbjct: 120 THFKNFDECALDEAFIISNELFDSFACEVIEG------ENMLFINENHKPFWSGADDEIL 173 Query: 198 SNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKG 257 T G E + IS + DYG + +L+ K Sbjct: 174 -KICTGLGLEKG---EICLKFTNFARQISKAFKR--VKFLSFDYGEWGVKDDFSLRLYKN 227 Query: 258 HTYV------SPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEG-LGIWQR 310 H G +D++ VDF LS Q L I + Sbjct: 228 HQVFNFFEISDLSEYFGVSDMTYDVDFSHLSIAFESAGFKTAKFKRQNLALVQDFRIDEI 287 Query: 311 AFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 ++ K + + + + + +GE FK + + Sbjct: 288 LALTLEMGGEKAYQNAA--KQMKFLSSAEFLGERFKFIEFTK 327 >gi|208434723|ref|YP_002266389.1| hypothetical protein HPG27_768 [Helicobacter pylori G27] gi|208432652|gb|ACI27523.1| hypothetical protein HPG27_768 [Helicobacter pylori G27] Length = 347 Score = 134 bits (338), Expect = 2e-29, Method: Composition-based stats. Identities = 67/346 (19%), Positives = 117/346 (33%), Gaps = 25/346 (7%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + Y + + GYY G GDF T+ +S+ FG +A ++I E+ Sbjct: 14 SFGNYMQEWLYGEK-GYYRKAL-IGQKGDFYTSVSLSKFFGGAMAFYIIKLLEEEKLFLP 71 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQ---KKQLASYGDK 136 +++VE+G G + DI + L E + L +Q KQ Sbjct: 72 LKIVEIGSHHGHFLSDIANFLNALSVGVMEKCEFVSCEPLKELQKLQRTIFKQATQLDLI 131 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 L F+V+NE FD+ + +M+ I V+ D Sbjct: 132 SCSLKELDFKERESAFVVSNELFDAFACEIIKDN------KMLFITHDHQGVWGGIDEPT 185 Query: 197 KSNFLTCSDYFLGAIFENSPCRDREMQSISDRLAC-DGGTAIVIDYGYLQSRVGDTLQAV 255 K + A + ++ + ++L + DYG R L+A Sbjct: 186 KELLKNLNLKQGCA----PLFLEAFIKDLLEKLDEASSWVFLSFDYGDKTERKDMHLRAF 241 Query: 256 KGHTYVSPLV-------NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIW 308 K H + Q+DL+ V+F + + + + TQ L +G+ Sbjct: 242 KNHQVLDFKDILNHLASLYQQSDLTYDVNFSLVRFLFEKHHANFSFFKTQANALLDMGLM 301 Query: 309 QRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 + K + L +S K + +GE FK L + Sbjct: 302 GLLETFSKSVGYERYLKESAK--IKPLISPGGLGERFKALEFVKKN 345 >gi|182415593|ref|YP_001820659.1| hypothetical protein Oter_3784 [Opitutus terrae PB90-1] gi|177842807|gb|ACB77059.1| protein of unknown function DUF185 [Opitutus terrae PB90-1] Length = 328 Score = 134 bits (337), Expect = 2e-29, Method: Composition-based stats. Identities = 62/354 (17%), Positives = 110/354 (31%), Gaps = 56/354 (15%) Query: 6 IRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNP---FGAVGDFVTAPEISQIFGEM 62 + + +M ++ AL + P GYY P FG DF TA +FGE+ Sbjct: 16 LALFRAQADADERMDFARFMALALYAPGVGYYRRGQPRIGFGQGTDFFTASSSGPVFGEL 75 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGR-GIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 +A + VE+G G ++ + ++ +E S Sbjct: 76 VACACAELLGAAAAQAT--FVEIGNETPGGILAGVPHPFAGVRTLPLG----EPIELS-- 127 Query: 122 LTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 G + +NE FD+ P ++F + RE + Sbjct: 128 ---------------------------GACVVFSNELFDAQPFRRFAFRDGAWRELGVAF 160 Query: 182 DQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDY 241 + E + G + + I+ G + DY Sbjct: 161 R---GGRLVETELETAPPDELPTLAPEGYVIDAPIAAAALAGQIAA--QPWTGLFVAFDY 215 Query: 242 GYLQS-----RVGDTLQAVKGHTY-VSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGL 295 G T +A H L PG+ DL+ HV + +++ + L Sbjct: 216 GKSWRELTEACPAGTARAYHAHQQSNDLLARPGEQDLTCHVCWDWIAAALTKHGFAAPAL 275 Query: 296 TTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRL-VSTSADKKSMGELFKIL 348 +Q FL A S + + + S K+L + +G+ F++L Sbjct: 276 DSQEAFLIRH-----AQSAIAAISTSEAAHYSRKKLSLLQLLHPSHLGQKFQVL 324 >gi|261752818|ref|ZP_05996527.1| conserved hypothetical protein [Brucella suis bv. 5 str. 513] gi|261742571|gb|EEY30497.1| conserved hypothetical protein [Brucella suis bv. 5 str. 513] Length = 155 Score = 133 bits (335), Expect = 3e-29, Method: Composition-based stats. Identities = 66/158 (41%), Positives = 85/158 (53%), Gaps = 7/158 (4%) Query: 205 DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPL 264 GAIFE +P R MQ I+ R+A G A+ IDYG+L+S GDTLQA+ Y Sbjct: 2 KAEEGAIFEAAPARTALMQEIASRIAATRGAALNIDYGHLESGFGDTLQAMLKQAYDDVF 61 Query: 265 VNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSL--MKQTARKD 322 +PG ADL+SHVDF L A G TQG+FL +G+ RA L K A ++ Sbjct: 62 AHPGVADLTSHVDFDILQKTAKACGCKT-GTMTQGEFLLAMGLVDRAGRLGAGKDAAFQE 120 Query: 323 ILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMPF 360 + V+RL A MG LFK+L S E+ L+PF Sbjct: 121 KIRQDVERL----AAPDQMGTLFKVLAFSDEQTRLLPF 154 >gi|254702247|ref|ZP_05164075.1| hypothetical protein Bsuib55_15506 [Brucella suis bv. 5 str. 513] Length = 173 Score = 133 bits (335), Expect = 3e-29, Method: Composition-based stats. Identities = 66/158 (41%), Positives = 85/158 (53%), Gaps = 7/158 (4%) Query: 205 DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPL 264 GAIFE +P R MQ I+ R+A G A+ IDYG+L+S GDTLQA+ Y Sbjct: 20 KAEEGAIFEAAPARTALMQEIASRIAATRGAALNIDYGHLESGFGDTLQAMLKQAYDDVF 79 Query: 265 VNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSL--MKQTARKD 322 +PG ADL+SHVDF L A G TQG+FL +G+ RA L K A ++ Sbjct: 80 AHPGVADLTSHVDFDILQKTAKACGCKT-GTMTQGEFLLAMGLVDRAGRLGAGKDAAFQE 138 Query: 323 ILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMPF 360 + V+RL A MG LFK+L S E+ L+PF Sbjct: 139 KIRQDVERL----AAPDQMGTLFKVLAFSDEQTRLLPF 172 >gi|322380648|ref|ZP_08054800.1| hypothetical protein HSUHS5_0936 [Helicobacter suis HS5] gi|321146970|gb|EFX41718.1| hypothetical protein HSUHS5_0936 [Helicobacter suis HS5] Length = 330 Score = 133 bits (335), Expect = 4e-29, Method: Composition-based stats. Identities = 71/341 (20%), Positives = 122/341 (35%), Gaps = 21/341 (6%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + Q + E GYY G GDF TA S FG LA +L+ E Sbjct: 3 SFSQCMQEWLYG-ENGYY-RHALIGMQGDFYTAVNSSAFFGGTLAFYLLSLLENGRLSLP 60 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINW 139 + +VE+G G G+++ D++ + L + VE +RL +QK++LA G + Sbjct: 61 LSVVEIGAGEGLLLSDVVGFLKDLSQGVLEHIRFISVEPLDRLVNLQKEKLAKRGVDLEC 120 Query: 140 YTSLAD-VPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKS 198 L + F+ NE +DS P + ++ H ++ E Sbjct: 121 VACLENLELSQSVFIYCNELWDSFPCEVIDNSKKL-------YISHSKPIWQNLSSEEMY 173 Query: 199 NFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGH 258 S + S+ ++L + DYG S G +L+ + H Sbjct: 174 QIKAFYPAIKSDCLPLS--WGDYITSLCNQLRGKKWIMVSFDYGQYGSYGGISLRGYRKH 231 Query: 259 TY-------VSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRA 311 + + DL+ VDF+ L ++ + + TQ L +G+ Sbjct: 232 QVLSFQEILDNLQGYYQKIDLTYDVDFKLLETLFLAQGAHTLFYGTQSTTLLKMGLASLL 291 Query: 312 FSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 +S+K + + GE FK L+V + Sbjct: 292 ELFHASAPFTTYQRESIKA--RALINPEGFGERFKGLIVGN 330 >gi|157164918|ref|YP_001466936.1| dihydrodipicolinate reductase (dhpr) [Campylobacter concisus 13826] gi|112800931|gb|EAT98275.1| conserved hypothetical protein [Campylobacter concisus 13826] Length = 328 Score = 133 bits (334), Expect = 5e-29, Method: Composition-based stats. Identities = 58/342 (16%), Positives = 121/342 (35%), Gaps = 27/342 (7%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCV 80 ++F + V + YY G GDF T + +FG LA + + + S Sbjct: 3 FSEFFDIWVNEN---YYKFGVDIGKKGDFYTNVSVGYLFGACLANYFLKLLKNGEISSSC 59 Query: 81 RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK---I 137 ++VE+G G M+ D + I L+P+ S L + ++E E L Q + + I Sbjct: 60 KVVEIGANSGDMLADFAQGIFTLEPEILSNLELIIIEPHEILRKKQLETFKNRFGNDIKI 119 Query: 138 NWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIK 197 Y +L + F+++NE D+ + + Sbjct: 120 KHYENLGECSFDEIFVISNELLDAFSCEVIDADNMLFV----------DSDLKFHWQKAD 169 Query: 198 SNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKG 257 N + + F E S + +++ + DYG + + +L+ K Sbjct: 170 QNLINLAKKFGIKKGEISTSYAKFALQLANAAKK--IRFLSFDYGEFEPKNEFSLRVFKD 227 Query: 258 HTYV------SPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLE-GLGIWQR 310 H + +DL+ + F+++ L + Q L G+ + Sbjct: 228 HQVFSLFEISNLAPYLKNSDLTYSLCFKQVKEAFSLAGFKMVKFKKQNDALVCDFGVDEI 287 Query: 311 AFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 ++++ +++ ++V + + +GE FK + Sbjct: 288 LSLVLEKGSKQA--YENVTKQAKFLLSPEFLGEKFKFIEFLK 327 >gi|119620806|gb|EAX00401.1| hypothetical protein PRO1853, isoform CRA_b [Homo sapiens] Length = 150 Score = 132 bits (332), Expect = 7e-29, Method: Composition-based stats. Identities = 43/98 (43%), Positives = 61/98 (62%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P GYY + G GDF+T+PEISQIFGE+ Sbjct: 41 TPMLRHLMYKIKSTGPITVAEYMKEVLTNPAKGYYVYRDMLGEKGDFITSPEISQIFGEL 100 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVI 100 L I+ I W G + +LVELGPGRG ++ DILR + Sbjct: 101 LGIWFISEWMATGKSTAFQLVELGPGRGTLVGDILRYL 138 >gi|254446013|ref|ZP_05059489.1| conserved hypothetical protein [Verrucomicrobiae bacterium DG1235] gi|198260321|gb|EDY84629.1| conserved hypothetical protein [Verrucomicrobiae bacterium DG1235] Length = 338 Score = 132 bits (332), Expect = 9e-29, Method: Composition-based stats. Identities = 65/367 (17%), Positives = 118/367 (32%), Gaps = 53/367 (14%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYST-CNPFGA--VGDFVTAPEISQIFG 60 +++ + G + + + + P+ GYYS G DF T+ + + F Sbjct: 10 EILTALAAKADSEGFIELPDFIETALYLPKHGYYSKEKQRVGRNAQSDFYTSVSLKEAFA 69 Query: 61 EMLAIFLICAWEQHGF-PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETS 119 E++ EQ GF P+ +E+G G + Sbjct: 70 EIVLEASCSLLEQAGFVPTQTHWLEIGAEPGGAL-------------------------- 103 Query: 120 ERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMI 179 L IQ ++ + D T L +NE FD+ +Q E + Sbjct: 104 --LDGIQNPFKSACALGFGQTIEIPDQ----TILFSNELFDAQTFRQIRFDGREWVEYGV 157 Query: 180 DIDQHDSLVFN---IGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTA 236 +D + + D K D G + + +S+ ++ G Sbjct: 158 RLDAKLPVWSERSTLSDEASKFLPDLPKDLPAGYTIDLPTGSNCLAKSLLEK--PWPGAF 215 Query: 237 IVIDYGYLQS-----RVGDTLQAVKGHTY-VSPLVNPGQADLSSHVDFQRLSSIAILYKL 290 I DYG + +A H + L +PG D++ H+ + L ++ + Sbjct: 216 IAFDYGKTWHGITQDTPQGSARAYFQHRQVPNILESPGSIDITHHICWDHLENLLRSARF 275 Query: 291 YINGLTTQGKFLEGLGIWQRA-FSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILV 349 L +Q F I RA L K LD +++ + MG+ F+ L Sbjct: 276 ESLSLQSQEAF-----IVHRAPKFLQKAFDPSRRALDPIRQKLKELMHPALMGQKFQALC 330 Query: 350 VSHEKVE 356 + Sbjct: 331 ATRGGSP 337 >gi|322378799|ref|ZP_08053228.1| hypothetical protein HSUHS1_0452 [Helicobacter suis HS1] gi|321148829|gb|EFX43300.1| hypothetical protein HSUHS1_0452 [Helicobacter suis HS1] Length = 330 Score = 131 bits (330), Expect = 1e-28, Method: Composition-based stats. Identities = 71/341 (20%), Positives = 122/341 (35%), Gaps = 21/341 (6%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + Q + E GYY G GDF TA S FG LA +L+ E Sbjct: 3 SFSQCMQEWLYG-ENGYY-RHALIGMQGDFYTAVNSSAFFGGTLAFYLLSLLENGWLSLP 60 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINW 139 + +VE+G G G+++ D++ + L + VE +RL +QK++LA G + Sbjct: 61 LSVVEIGAGEGLLLSDVVGFLKDLSQGVLEHIRFISVEPLDRLVNLQKEKLAKRGVDLEC 120 Query: 140 YTSLAD-VPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKS 198 L + F+ NE +DS P + ++ H ++ E Sbjct: 121 VACLENLELSQSVFIYCNELWDSFPCEVIDNSKKL-------YISHSKPIWQNLSSEEMY 173 Query: 199 NFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGH 258 S + S+ ++L + DYG S G +L+ + H Sbjct: 174 QIKAFYPAIKSDCLPLS--WGDYITSLCNQLRGKKWIMVSFDYGQYGSYGGISLRGYRKH 231 Query: 259 TY-------VSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRA 311 + + DL+ VDF+ L ++ + + TQ L +G+ Sbjct: 232 QVLSFQEILDNLQGYYQKIDLTYDVDFKLLETLFLAQGAHTLFYGTQSTTLLKMGLASLL 291 Query: 312 FSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 +S+K + + GE FK L+V + Sbjct: 292 ELFHASAPFTTYQRESIKA--RALINPEGFGERFKGLIVGN 330 >gi|220672996|emb|CAX14542.1| novel protein (zgc:153989) [Danio rerio] Length = 270 Score = 129 bits (325), Expect = 5e-28, Method: Composition-based stats. Identities = 39/96 (40%), Positives = 59/96 (61%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 + +++ + + I G ++V +Y + +P GYY + GA GDF+T+PEISQIFG Sbjct: 26 INKSILKHLASKIIATGPISVAEYMREALTNPVLGYYVKNDMLGAGGDFITSPEISQIFG 85 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDI 96 E+L ++ I W G S ++LVELGPGRG + DI Sbjct: 86 ELLGVWCISEWMAAGKSSALQLVELGPGRGSLTSDI 121 Score = 99.8 bits (247), Expect = 6e-19, Method: Composition-based stats. Identities = 44/143 (30%), Positives = 69/143 (48%), Gaps = 8/143 (5%) Query: 212 FENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQAD 271 E +Q ++ R+A DGG A+++DYG+ ++ DT + KGH L PG AD Sbjct: 131 VEVCAEAGVIVQKLASRIAEDGGAALIVDYGHDGTKT-DTFRGFKGHQIHDVLEAPGLAD 189 Query: 272 LSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSL--MKQTARKDILLDSVK 329 L++ VDF L +A + G TQ FL+ +GI R L + + L+ S Sbjct: 190 LTADVDFSYLRKMAGDQ-VICLGPITQRSFLKNMGIDSRMQVLLSSNDPSIRAQLIHSYD 248 Query: 330 RLVSTSADKKSMGELFKILVVSH 352 L+ + + MGE F+ V + Sbjct: 249 MLI----NPEKMGERFQFFSVLN 267 >gi|119620805|gb|EAX00400.1| hypothetical protein PRO1853, isoform CRA_a [Homo sapiens] Length = 196 Score = 129 bits (325), Expect = 6e-28, Method: Composition-based stats. Identities = 48/147 (32%), Positives = 69/147 (46%), Gaps = 6/147 (4%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P GYY + G GDF+T+PEISQIFGE+ Sbjct: 41 TPMLRHLMYKIKSTGPITVAEYMKEVLTNPAKGYYVYRDMLGEKGDFITSPEISQIFGEL 100 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L I+ I W G + +LVELGPGRG ++ DILRV +L + + S L Sbjct: 101 LGIWFISEWMATGKSTAFQLVELGPGRGTLVGDILRVFTQLGSVLKNC------DISVHL 154 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLG 149 L + Sbjct: 155 VEGTAFILHMNFLMFFLCINFRKHHRD 181 >gi|222823612|ref|YP_002575186.1| hypothetical protein Cla_0603 [Campylobacter lari RM2100] gi|222538834|gb|ACM63935.1| conserved hypothetical protein [Campylobacter lari RM2100] Length = 317 Score = 129 bits (324), Expect = 6e-28, Method: Composition-based stats. Identities = 63/340 (18%), Positives = 118/340 (34%), Gaps = 32/340 (9%) Query: 19 MTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPS 78 + ++F + D YYS G GDF TA + +FG +LA + + Sbjct: 2 IAFSEFFQNWI-DK---YYSQAVSVGKNGDFYTAVSVGNLFGVLLANHFLKLIDDKKLTL 57 Query: 79 CVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKIN 138 ++VE+G G +MLD ++ + L+ D + +++E E+L +QKK Y + Sbjct: 58 PAQVVEIGANEGHLMLDFIQALYTLRADVLEQIECFIIEPHEKLKCVQKKLFDKYDLDVK 117 Query: 139 WYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKS 198 Y SL + F ANE FD + M ID +++F + Sbjct: 118 IYNSLEECHFKNAFFYANELFDCFACELIKDKT------MAYIDDDLNIIFKPMSENLLK 171 Query: 199 NFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGH 258 + P + Q+ + + Y + +++ K H Sbjct: 172 ECEKYAITNSELCISYKPFLTKLKQACEKLIFAC--------FDYAKKEEKISIRMYKNH 223 Query: 259 TYVSPLVN-----PGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFS 313 + ++D++ +V+F + + Q + L G+ + Sbjct: 224 QVYNLFEENLKDFFAKSDITYNVNFNHFLQVLKEEGFGVLEYKNQNQALIDFGLEEIIEQ 283 Query: 314 LMKQTARK--DILLDSVKRLVSTSADKKSMGELFKILVVS 351 K T + + K L+ G+ FK L Sbjct: 284 -AKNTNPQIYKNFISQSKNLMFNF------GDKFKFLEFK 316 >gi|151555952|gb|AAI49743.1| LOC504290 protein [Bos taurus] Length = 256 Score = 129 bits (323), Expect = 8e-28, Method: Composition-based stats. Identities = 70/233 (30%), Positives = 108/233 (46%), Gaps = 12/233 (5%) Query: 127 KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQH-- 184 K + G ++WY L DVP ++F +A+EFFD LP+ +F T HG RE ++DID Sbjct: 1 MKGVTKSGIPVSWYRDLQDVPKEYSFYLAHEFFDVLPVHKFQKTPHGWREVLVDIDPQVS 60 Query: 185 DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYL 244 D L F + + +D E P +Q +S R++ GG A++ DYG+ Sbjct: 61 DKLRFVLAPCATPAGAFIQND-ETRDHVEVCPEAGVVIQELSQRISLTGGAALIADYGHD 119 Query: 245 QSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEG 304 ++ DT + G+ L PG ADL++ VDF L + K+ G Q FL Sbjct: 120 GTKT-DTFRGFCGYRLHDVLTAPGTADLTADVDFSYLRRM-SQGKVASLGPVEQQTFLRN 177 Query: 305 LGIWQRAFSLMKQ---TARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 +GI R L+ + + + LL L+ + MGE F L + + Sbjct: 178 MGIDVRLKILLDKTDDPSLRQQLLQGYNMLM----NPMKMGERFNFLALVPHQ 226 >gi|154149401|ref|YP_001406298.1| hypothetical protein CHAB381_0718 [Campylobacter hominis ATCC BAA-381] gi|153805410|gb|ABS52417.1| conserved hypothetical protein [Campylobacter hominis ATCC BAA-381] Length = 319 Score = 128 bits (321), Expect = 2e-27, Method: Composition-based stats. Identities = 61/322 (18%), Positives = 113/322 (35%), Gaps = 28/322 (8%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCV 80 YF + YY+ P G GDF TA + +FG +A ++ + F + Sbjct: 3 FSDYFEAWLNGN---YYAKATPIGKKGDFYTAVSVGSLFGICIAKRILKL--ANNFEGKI 57 Query: 81 RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYG---DKI 137 +VE+G G ++ DI++ I P+ + ++E E L +Q K KI Sbjct: 58 FIVEIGANEGYLLADIIQGIFTFSPERLANFEFAIIEPHENLRDLQNKNFKKRFGDEVKI 117 Query: 138 NWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIK 197 + ++S +ANE FD+ + + + E + F D EI Sbjct: 118 SHFSSFDKAKFKNAIFIANELFDAFKCEILDVDKILFVE-------NHKYEFKKADDEIL 170 Query: 198 SNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKG 257 G I +M+ ++ I DYG ++++ +L+ K Sbjct: 171 DFAQKFK-IKKGEIPLGYFDFANDMRKSAE-----NFAFIAFDYGQMRAKNDFSLRVYKK 224 Query: 258 HTYVSPL------VNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRA 311 H G +D++ V+F+ L + + Q L G + Sbjct: 225 HEVFDFFEIENLSEFYGVSDITYDVNFEILKAAFEENSSKMFDFKKQSTALVDFGAVEIL 284 Query: 312 FSLMKQTARK-DILLDSVKRLV 332 +K + + + L+ Sbjct: 285 EMFLKHSKTAYENAKLQLNHLL 306 >gi|294054717|ref|YP_003548375.1| protein of unknown function DUF185 [Coraliomargarita akajimensis DSM 45221] gi|293614050|gb|ADE54205.1| protein of unknown function DUF185 [Coraliomargarita akajimensis DSM 45221] Length = 324 Score = 127 bits (319), Expect = 3e-27, Method: Composition-based stats. Identities = 58/347 (16%), Positives = 96/347 (27%), Gaps = 51/347 (14%) Query: 15 KNGQMTVDQYFALCVADPEFGYYS-TCNPFGA--VGDFVTAPEISQIFGEMLAIFLICAW 71 NG ++ + + + + GYY+ G DF TA + ++F +++ Sbjct: 12 SNGPISYRDFIEMALYSKDGGYYTQQRERVGRSPERDFYTAESLGKVFAQLVTTAAADLL 71 Query: 72 EQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLA 131 VE+ G +LD Sbjct: 72 -GPKTARKSTFVEIAAEPGHSLLDN-------------------------------APGH 99 Query: 132 SYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDS---LV 188 + D G L ANE+ D+LP + V + RER + I + Sbjct: 100 PFTDSKVIRQGDPIQVDGPVVLFANEWLDALPFHRLVFRDGQWRERGVRIGARGQLEDCL 159 Query: 189 FNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS-- 246 N +++ +E E G + DYG Sbjct: 160 LNELSAAVQTEVHRLPSTIEDG-YELDWPLAAEAALAQLLQQDWQGLLLFFDYGKTWQSL 218 Query: 247 ---RVGDTLQAVKGHTYVS-PLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFL 302 T + H S L PGQ D++ V + L + L L +Q F Sbjct: 219 LVDCPSGTARTYTKHQLGSQLLDAPGQRDITCDVCWDPLQAQLEAAGLESITLESQEAFF 278 Query: 303 EGLGIWQRAFSLMKQTARKDILLDSVKRLVST-SADKKSMGELFKIL 348 RA + S + MG+ F++L Sbjct: 279 VN-----RAQRAAAAIVQASAWSFSPDKQTLMELIHPAHMGQRFQVL 320 >gi|195953241|ref|YP_002121531.1| protein of unknown function DUF185 [Hydrogenobaculum sp. Y04AAS1] gi|195932853|gb|ACG57553.1| protein of unknown function DUF185 [Hydrogenobaculum sp. Y04AAS1] Length = 302 Score = 127 bits (318), Expect = 3e-27, Method: Composition-based stats. Identities = 83/318 (26%), Positives = 130/318 (40%), Gaps = 32/318 (10%) Query: 36 YYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLD 95 YYST G GDF T+PE+ + FG+ +A F+ P ++ELG G GIM D Sbjct: 15 YYSTNPKIGK-GDFFTSPELDETFGKSIAYFIKDYISAFDNPK---ILELGAGNGIMAKD 70 Query: 96 ILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVA 155 IL V+ + + ETS+RLT IQK+ L + W L+ + +++ Sbjct: 71 ILDVL---------NIPYIIYETSQRLTNIQKQNLK--CKNVIWIDDLSMLEPFEGIVLS 119 Query: 156 NEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENS 215 NEFFD+L I + E ++ + K + A +E Sbjct: 120 NEFFDALGIAPIKDKKELYIE-------PPKEIWQEPHEDTKILIDILN--LDNAYYELP 170 Query: 216 PCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSH 275 Q IS L G + IDYGY S +T++ K V + + D++ Sbjct: 171 IDSFYIYQKISKLLRK--GYILSIDYGYKTSPHKNTIRGYKNSKIVQNIYSDEIFDITYM 228 Query: 276 VDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDIL-LDSVKRLVST 334 VDF L I + L Q +FL + I ++ + +D ++ RL + Sbjct: 229 VDFSMLQKIGEYFGFKNIFLKRQREFL--MDIPYFIKTIEEVCQEEDAYSIERCSRLKNL 286 Query: 335 SADKKSMGELFKILVVSH 352 MGE F +L+ + Sbjct: 287 ILS---MGESFYVLLQEN 301 >gi|167736762|ref|ZP_02409536.1| hypothetical protein Bpse14_01789 [Burkholderia pseudomallei 14] Length = 230 Score = 127 bits (318), Expect = 3e-27, Method: Composition-based stats. Identities = 45/179 (25%), Positives = 73/179 (40%), Gaps = 18/179 (10%) Query: 2 ENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYSTC-NPFGAVG----DFVTAPEI 55 + L + I G + +Y + P GYYS FG G DFVTAPE+ Sbjct: 59 SDALAASLRAEIAAAGGWIPFSRYMERVLYAPGLGYYSGGAQKFGRRGDDGSDFVTAPEL 118 Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F + LA + A G R++E G G G + +L + L + + Sbjct: 119 SPLFAQTLARPVAQALAASG---TRRVMEFGAGTGQLAAGLLNALAALGVELDE---YAI 172 Query: 116 VETSERLTLIQKKQLASYGD----KINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMT 170 V+ S L Q++ L ++ W +L + G +V NE D++P++ Sbjct: 173 VDLSGELRARQRETLDEQASGAAARVRWLDALPERFEG--VIVGNEVLDAMPVQLVAKH 229 >gi|315638430|ref|ZP_07893607.1| conserved hypothetical protein [Campylobacter upsaliensis JV21] gi|315481421|gb|EFU72048.1| conserved hypothetical protein [Campylobacter upsaliensis JV21] Length = 313 Score = 125 bits (313), Expect = 1e-26, Method: Composition-based stats. Identities = 65/316 (20%), Positives = 118/316 (37%), Gaps = 28/316 (8%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCV 80 ++F + + YY G GDF TA + ++FG +LA + ++ + Sbjct: 3 FSEFFQKWLYES---YYKNGVFVGKRGDFYTAVSVGELFGSLLAKHFLSLIDKQILTPPL 59 Query: 81 RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWY 140 ++VE+G G + D L + + +P F L +++E E+L ++QK+ L + + Sbjct: 60 QVVEIGANEGYLSRDFLSALVQFRPSIFEKLEFHIIEPHEKLQILQKQTLKG--VEFTHH 117 Query: 141 TSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNF 200 S + F+ NE FDS + E +F +K Sbjct: 118 HSFKETHFQNAFIFCNELFDSFACDLIDNDKMAFIE-------DFKFIFKPLSPALK-KQ 169 Query: 201 LTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTY 260 + G ++ + D DYG + +L+ + H Sbjct: 170 CELLNLQKGEFSFYLSNFFEDLNAAC-----DSFIFAGFDYGEFLPQ-RFSLRIYQNHQL 223 Query: 261 VSPLV-----NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLM 315 P G++DL+ +V+F L + Y + L Q K L G S++ Sbjct: 224 FDPFEVDLRAFFGKSDLTYNVNFAHLQYLIEKYHFKLLSLEKQSKALLEFG----LESII 279 Query: 316 KQTARKDILLDSVKRL 331 +Q+ K+ LL K L Sbjct: 280 EQSENKEKLLSQAKHL 295 >gi|211826135|gb|AAH12374.2| C2orf56 protein [Homo sapiens] Length = 237 Score = 125 bits (313), Expect = 1e-26, Method: Composition-based stats. Identities = 64/211 (30%), Positives = 97/211 (45%), Gaps = 12/211 (5%) Query: 149 GFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDY 206 G++F +A+EFFD LP+ +F T G RE +DID D L F + + D Sbjct: 4 GYSFYLAHEFFDVLPVHKFQKTPQGWREVFVDIDPQVSDKLRFVLAPSATPAEAFIQHD- 62 Query: 207 FLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVN 266 E P ++ +S R+A GG A+V DYG+ ++ DT + H L+ Sbjct: 63 ETRDHVEVCPDAGVIIEELSQRIALTGGAALVADYGHDGTKT-DTFRGFCDHKLHDVLIA 121 Query: 267 PGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR---KDI 323 PG ADL++ VDF L +A K+ G Q FL+ +GI R L+ ++ + Sbjct: 122 PGTADLTADVDFSYLRRMA-QGKVASLGPIKQHTFLKNMGIDVRLKVLLDKSNEPSVRQQ 180 Query: 324 LLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 LL L+ + K MGE F + + Sbjct: 181 LLQGYDMLM----NPKKMGERFNFFALLPHQ 207 >gi|194382818|dbj|BAG64579.1| unnamed protein product [Homo sapiens] Length = 137 Score = 125 bits (313), Expect = 1e-26, Method: Composition-based stats. Identities = 37/86 (43%), Positives = 52/86 (60%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 ++R ++ IK G +TV +Y + +P GYY + G GDF+T+PEISQIFGE+ Sbjct: 41 TPMLRHLMYKIKSTGPITVAEYMKEVLTNPAKGYYVYRDMLGEKGDFITSPEISQIFGEL 100 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPG 88 L I+ I W G + +LVELGPG Sbjct: 101 LGIWFISEWMATGKSTAFQLVELGPG 126 >gi|57242319|ref|ZP_00370258.1| conserved hypothetical protein [Campylobacter upsaliensis RM3195] gi|57016999|gb|EAL53781.1| conserved hypothetical protein [Campylobacter upsaliensis RM3195] Length = 313 Score = 124 bits (312), Expect = 2e-26, Method: Composition-based stats. Identities = 64/316 (20%), Positives = 120/316 (37%), Gaps = 28/316 (8%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCV 80 ++F + + YY G GDF TA + ++FG +LA + ++ + Sbjct: 3 FSEFFQKWLYES---YYKNGVFVGKRGDFYTAVSVGELFGSLLAKHFLSLIDKQILTPPL 59 Query: 81 RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWY 140 ++VE+G G + D L + + +P F L +++E E+L ++QK+ L + + Sbjct: 60 QVVEIGANEGYLSRDFLSALVQFRPSIFEKLEFHIIEPHEKLQILQKQTLKG--VEFTHH 117 Query: 141 TSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNF 200 S + F+ NE FDS + + ++ +F +K Sbjct: 118 HSFKETHFQNAFIFCNELFDSFACDLIDNGK-------MAFIKNFKFIFKPLSPTLK-KQ 169 Query: 201 LTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTY 260 + G ++ + D DYG + +L+ + H Sbjct: 170 CELLNLQKGEFSFYLSNFFEDLNAAC-----DSFIFAGFDYGDFLPQ-RFSLRIYQNHQL 223 Query: 261 VSPLV-----NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLM 315 P G++DL+ +V+F L + Y + L Q K L G S++ Sbjct: 224 YDPFEVDLRAFFGKSDLTYNVNFTHLQYLIKKYHFKLLSLEKQSKALLEFG----LESII 279 Query: 316 KQTARKDILLDSVKRL 331 +Q+ K+ LL K L Sbjct: 280 EQSENKEKLLSQAKHL 295 >gi|332528304|ref|ZP_08404306.1| hypothetical protein HGR_00350 [Hylemonella gracilis ATCC 19624] gi|332042249|gb|EGI78573.1| hypothetical protein HGR_00350 [Hylemonella gracilis ATCC 19624] Length = 190 Score = 124 bits (310), Expect = 3e-26, Method: Composition-based stats. Identities = 53/198 (26%), Positives = 89/198 (44%), Gaps = 24/198 (12%) Query: 14 KKNGQMTVDQYFALCVADPEFGYYSTC-NPFG----AVG---DFVTAPEISQIFGEMLAI 65 + G + D + + + P GYY+ G A G DFVTAPE+S +FG+ LA Sbjct: 3 ESGGWLGFDDFMSRALYTPGLGYYANDLRKLGLMPGASGGGSDFVTAPELSPLFGQALAA 62 Query: 66 FLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLI 125 + A E+ G ++ E G G G + L +LRV+ V +V+ S L Sbjct: 63 QIGEALERTG---TDQVWEFGAGSGALALQLLRVLRG------KVRRYTIVDVSGALRAR 113 Query: 126 QKKQLASYGDK----INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 Q++ L + D+ I+W TSL + G +V NE D++P++ + ER + Sbjct: 114 QQETLQEFADQADVRIDWATSLPEAMHG--VVVGNEVLDAMPVQLLLRQRGVWHERGVS- 170 Query: 182 DQHDSLVFNIGDHEIKSN 199 + V+ +++ Sbjct: 171 EVQGRFVWADRPTDLRPP 188 >gi|313143373|ref|ZP_07805566.1| conserved hypothetical protein [Helicobacter cinaedi CCUG 18818] gi|313128404|gb|EFR46021.1| conserved hypothetical protein [Helicobacter cinaedi CCUG 18818] Length = 366 Score = 123 bits (308), Expect = 4e-26, Method: Composition-based stats. Identities = 63/364 (17%), Positives = 116/364 (31%), Gaps = 45/364 (12%) Query: 19 MTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPS 78 +T ++ + + E GYY+ + GDF T+ +S+ FG +A +++ E + Sbjct: 2 LTFSEFMSQSLYG-ESGYYADSSRVSKTGDFYTSVSVSKFFGGSIASYILSLLESNALSL 60 Query: 79 CVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASY----- 133 +R+VE+G +G ++ DI + L + ++E L QK L Sbjct: 61 PLRIVEIGADKGYLLGDIALFLDAL-SEVLPQCEFIIIEPLSTLAQTQKAYLRGLKFSCV 119 Query: 134 --GDKINWYTSLADVPLGFTFLVANEFFDSLPIK-----QFVMTEHGIRERMIDIDQHDS 186 + + +L+ F+++NE FDS P + + + I + + Sbjct: 120 LDFKIVESFEALSQNKDSNLFIISNELFDSFPCDVLDSGKMLCVSQDSKWCGIWQNLNAK 179 Query: 187 LVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG---- 242 + + + L +D A + DYG Sbjct: 180 NLSTLLPRQSLEILDKLKCNPLKFCGVLPQWQDFICNLSRFAKAHKNSYFVSFDYGRENL 239 Query: 243 ---------YLQSRVGDTLQAVKGH---------TYVSPLVNPGQADLSSHVDFQRLSSI 284 Y + AD++ VDF L S+ Sbjct: 240 SQTQNPNAKYYNPLHHNPRFYKSHQVLSLKDFLEQGGDFHTLYQNADITYDVDFTLLDSL 299 Query: 285 AILYKLYINGLTTQGKFLE-GLGIWQRAFSLMKQTARKDIL--LDSVKRLVSTSADKKSM 341 TQ K L + I + + +Q L + +K L+ T Sbjct: 300 LCENGFQKIFCDTQAKVLIEKMQILELLQTFSQQCGYNTYLKEIHKLKTLLHTL------ 353 Query: 342 GELF 345 GE F Sbjct: 354 GERF 357 >gi|313903868|ref|ZP_07837257.1| protein of unknown function DUF185 [Thermaerobacter subterraneus DSM 13965] gi|313466056|gb|EFR61581.1| protein of unknown function DUF185 [Thermaerobacter subterraneus DSM 13965] Length = 475 Score = 122 bits (307), Expect = 6e-26, Method: Composition-based stats. Identities = 42/174 (24%), Positives = 69/174 (39%), Gaps = 10/174 (5%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEISQIFGEM 62 LIR ++N I++ G + +Y L + PE GYY+ P G GDF+TAP FG Sbjct: 6 ALIRLLLNEIRRQGAIPFARYMDLALHHPEHGYYAQGRPLIGREGDFLTAPSFHPAFGRT 65 Query: 63 LAIFLICAWEQHGFPS-------CVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 + + GF R++E+G G G + D+L L + Sbjct: 66 IWRQVREMLHILGFSPRRGLPARPARILEIGAGGGHLARDLLLAARSDGYGP-GHLEYVI 124 Query: 116 VETSERLTLIQKKQLASYGDKINWYTSLADVPLGFT-FLVANEFFDSLPIKQFV 168 V+ S L Q+ + + + G ++ NE + P+ + V Sbjct: 125 VDESSPLQERQRDLITAAWPEAPVRWVPRVEQAGPVHVVLMNELMSAFPVHRLV 178 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 45/210 (21%), Positives = 76/210 (36%), Gaps = 16/210 (7%) Query: 157 EFFDSLPIKQF--VMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSD----YFLGA 210 + D ++ RE + + V G + G Sbjct: 235 QAPDRWGPERSGDHRQPFEWRELYVTVQ-GGRFVQVEGPVSEPRAVAILREEGIEPQPGQ 293 Query: 211 IFENSPCRDREMQSISDRLACDGGTAIVIDYG------YLQSRVGDTLQ-AVKGHTYVSP 263 I + + +++I+ LA I IDYG Y R T++ + P Sbjct: 294 IVDVNVGAGDMLRAIAGVLAR-RAFVITIDYGGPAEVVYSPQRPRGTVRGYYRQRLLDDP 352 Query: 264 LVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGI-WQRAFSLMKQTARKD 322 PG+ D+++ +DF L + L GL QG FL LGI + A L ++ + D Sbjct: 353 FARPGEQDITADLDFTYLQRLGRRLGLRDLGLLPQGAFLLNLGIEEEEALPLARRAWQGD 412 Query: 323 ILLDSVKRLVSTSADKKSMGELFKILVVSH 352 + D + V + +GE F +LV + Sbjct: 413 LEADQALQRVYALYAPEGLGESFWVLVQAR 442 >gi|237753447|ref|ZP_04583927.1| conserved hypothetical protein [Helicobacter winghamensis ATCC BAA-430] gi|229375714|gb|EEO25805.1| conserved hypothetical protein [Helicobacter winghamensis ATCC BAA-430] Length = 345 Score = 120 bits (300), Expect = 4e-25, Method: Composition-based stats. Identities = 57/353 (16%), Positives = 113/353 (32%), Gaps = 42/353 (11%) Query: 16 NGQMTVDQYFALCVADP-----EFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICA 70 + ++T + + + GYY G DF T+ + FG L +L Sbjct: 4 SNKLTFGNFMQEWLYGNGAIFGKKGYY-QQVRVGKDLDFYTSVSTGKFFGYTLGFYLHSI 62 Query: 71 WEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQL 130 + + LVE+G +G ++ DI P L +E L +Q+ Sbjct: 63 LKSLK--GKIALVEIGSEKGDLIADIAEFFNAFNPKQ--TLDFATLEPLVSLQNLQQDTF 118 Query: 131 ASYGDKINWY-----TSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHD 185 + ++ +L D V+NE D+ + + D Sbjct: 119 KRRNPNLAFHTFCDFKTLKDAQYDCILFVSNELLDAFACELVWNDKMAFV---------D 169 Query: 186 SLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQ 245 + L + F E ++S+++ + DYG L+ Sbjct: 170 KDSLKLTFEPASKEALEIARAFHITKGEIPLHSFSFVESLANSAPK--WLFLSFDYGSLE 227 Query: 246 SRVGDTLQAVKGHTYVS-------------PLVNPGQADLSSHVDFQRLSSIAILYKLYI 292 R +L+ + HT + L N D++ V+F + + Sbjct: 228 PRNTFSLRFYQNHTTENLFLDSTTQNYNQEILKNFALMDITYDVNFTLWEQAFLRFG-KT 286 Query: 293 NGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELF 345 + Q + L +G+ + +++ + + S K + + S GE F Sbjct: 287 LFIHRQNRALVEMGLDKMCAWYIEKFGLETYMYQSGK--IRSLISPGSFGERF 337 >gi|156065097|ref|XP_001598470.1| hypothetical protein SS1G_00559 [Sclerotinia sclerotiorum 1980] gi|154691418|gb|EDN91156.1| hypothetical protein SS1G_00559 [Sclerotinia sclerotiorum 1980 UF-70] Length = 414 Score = 119 bits (298), Expect = 7e-25, Method: Composition-based stats. Identities = 77/382 (20%), Positives = 135/382 (35%), Gaps = 99/382 (25%) Query: 76 FPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGD 135 S + + + + I KP S+ ++YMVE S L QK+ L Sbjct: 29 RWSSTETRQWSTPLAKQLSEAITTIKNFKPMAESIEAVYMVEASPALRDAQKQLLCGDAP 88 Query: 136 KIN--------------------WYTSLADVPLGFTFLVANEFFDSLPIKQFVM------ 169 I S+ F+VA+EFFD+LPI F Sbjct: 89 MIETETGFKSTSKYAGIPIMWTENMRSVPYGADKTPFIVAHEFFDALPIHVFQSVAPNLD 148 Query: 170 --------------------------TEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTC 203 RE ++ +S ++ + N + Sbjct: 149 TLEPITIETPTGTHPLAPSTSKSSTAKTPQWREMVVSPTPPNSTQTDVSIPDSPQNQSSP 208 Query: 204 SDYFL---------------------------GAIFENSPCRDREMQSISDRL------- 229 ++ L ++ E SP + + R+ Sbjct: 209 PEFQLTLSKSSTPHSLYLPEILDRYRNLKSIPDSLIEISPESHAIVADFASRIGGSKTNP 268 Query: 230 -ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILY 288 G A+++DYG + + ++L+ +K H VSP PG DLS+ VDF L+ A+ Sbjct: 269 KTKPSGAALILDYGPISNIPTNSLRGIKAHQRVSPFSEPGIVDLSADVDFVALAEAAMNA 328 Query: 289 --KLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDI-----LLDSVKRLVSTSADKKSM 341 + I+G QG++L +GI +RA L++ KD + + +RLV + M Sbjct: 329 SEGVEIHGPMEQGEWLLSIGIKERAEMLVQSLKGKDDEARKRIEGAWRRLVDMG--ENGM 386 Query: 342 GELFKILVVSHEKV---ELMPF 360 G+++K++++ E + F Sbjct: 387 GKVYKVMIMMPENEGRRPPLGF 408 >gi|153952412|ref|YP_001397673.1| hypothetical protein JJD26997_0488 [Campylobacter jejuni subsp. doylei 269.97] gi|152939858|gb|ABS44599.1| conserved hypothetical protein [Campylobacter jejuni subsp. doylei 269.97] Length = 316 Score = 118 bits (296), Expect = 1e-24, Method: Composition-based stats. Identities = 65/318 (20%), Positives = 112/318 (35%), Gaps = 27/318 (8%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCV 80 +F + + YY G GDF TA + +FG +LA + ++ + Sbjct: 3 FSDFFYAWLHES---YYKNAVSIGKNGDFFTAVSVGNLFGTLLAKHFLNLIDKKILQPPL 59 Query: 81 RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWY 140 LVE+G G + D L + +L+P+ FS +S +++E E+L +QKK L + + Sbjct: 60 ELVEIGANEGYLSRDFLAALLELRPEIFSQISFFIIEPHEKLRTLQKKTLE--RVEFSHK 117 Query: 141 TSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNF 200 SL + F NE FDS + + E F + + N Sbjct: 118 NSLKECHFKNAFFFCNELFDSFTCELIDNDKMAFVE-----------NFKLIFKNMDENL 166 Query: 201 LTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY-LQSRVGDTLQAVKGHT 259 +T E S + + + I + Y + +L+ + H Sbjct: 167 ITKCKALNLIKGELSLELENFFKDLDQACER----FIFAGFDYGTLNPQNFSLRIYQKHE 222 Query: 260 YVSPLV-----NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAF-S 313 +P G++DL+ +V+F L + Y Q L G + Sbjct: 223 VFNPFEVSLKDFFGKSDLTYNVNFTHLQKLIKEYNFKPLTFKKQSLALMDFGFEDLLEYT 282 Query: 314 LMKQTARKDILLDSVKRL 331 K + L K L Sbjct: 283 KNKNIKTYESFLSQAKIL 300 >gi|305431718|ref|ZP_07400886.1| conserved hypothetical protein [Campylobacter coli JV20] gi|304445200|gb|EFM37845.1| conserved hypothetical protein [Campylobacter coli JV20] Length = 316 Score = 118 bits (295), Expect = 1e-24, Method: Composition-based stats. Identities = 64/317 (20%), Positives = 121/317 (38%), Gaps = 25/317 (7%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCV 80 ++F + + YY G GDF TA + +FG +LA + ++ + Sbjct: 3 FSEFFHAWLHES---YYKNAVSIGKNGDFFTAVSVGNLFGTLLAKHFLDLIDKKVLKLPL 59 Query: 81 RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWY 140 LVE+G G + D L + + +P+ FS LS +++E E+L +QKK L + Sbjct: 60 ELVEIGANEGYLSRDFLAALLEFRPEIFSKLSFFIIEPHEKLRNLQKKTLEG--VEFTHK 117 Query: 141 TSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNF 200 SL + F NE FDS + + + + LVF + ++K Sbjct: 118 RSLKECHFENAFFFCNELFDSFTCELIDDDK-------MAFIKDFKLVFEPMEAKLKEKC 170 Query: 201 LTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTY 260 + G + +++ S + DYG + +++ + H Sbjct: 171 EIL-NLKKGELSLELEEFFKDLDKASKK-----FIFASFDYGVFNPQ-QFSIRIYQKHEV 223 Query: 261 VSPLV-----NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSL- 314 +P G++DL+ +V+F ++ + + + L Q + L G + + Sbjct: 224 FNPFEITLKDFFGKSDLTYNVNFNQIQQLIKEHDFKLLALKKQNQALIDFGFEELLKYIK 283 Query: 315 MKQTARKDILLDSVKRL 331 K + L K L Sbjct: 284 DKNLKTYENFLSQSKIL 300 >gi|57168132|ref|ZP_00367271.1| conserved hypothetical protein [Campylobacter coli RM2228] gi|57020506|gb|EAL57175.1| conserved hypothetical protein [Campylobacter coli RM2228] Length = 316 Score = 118 bits (295), Expect = 1e-24, Method: Composition-based stats. Identities = 64/317 (20%), Positives = 121/317 (38%), Gaps = 25/317 (7%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCV 80 ++F + + YY G GDF TA + +FG +LA + ++ + Sbjct: 3 FSEFFHAWLHES---YYKNAVSIGKNGDFFTAVSVGNLFGTLLAKHFLDLIDKKVLKLPL 59 Query: 81 RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWY 140 LVE+G G + D L + + +P+ FS LS +++E E+L +QKK L + Sbjct: 60 ELVEIGANEGYLSRDFLAALLEFRPEIFSKLSFFIIEPHEKLRNLQKKTLEG--VEFTHK 117 Query: 141 TSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNF 200 SL + F NE FDS + + + + LVF + ++K Sbjct: 118 RSLKECHFENAFFFCNELFDSFTCELIDDDK-------MAFIKDFKLVFEPMEAKLKEKC 170 Query: 201 LTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTY 260 + G + +++ S + DYG + +++ + H Sbjct: 171 EIL-NLKKGELSLELEEFFKDLDKASKK-----FIFASFDYGVFNPQ-QFSIRIYQKHEV 223 Query: 261 VSPLV-----NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSL- 314 +P G++DL+ +V+F ++ + + + L Q + L G + + Sbjct: 224 FNPFEIALKDFFGKSDLTYNVNFNQIQQLIKEHDFKLLALKKQNQALIDFGFEELLKYIK 283 Query: 315 MKQTARKDILLDSVKRL 331 K + L K L Sbjct: 284 DKNLKTYENFLSQSKIL 300 >gi|154311515|ref|XP_001555087.1| hypothetical protein BC1G_06610 [Botryotinia fuckeliana B05.10] gi|150851007|gb|EDN26200.1| hypothetical protein BC1G_06610 [Botryotinia fuckeliana B05.10] Length = 432 Score = 117 bits (293), Expect = 2e-24, Method: Composition-based stats. Identities = 84/386 (21%), Positives = 136/386 (35%), Gaps = 104/386 (26%) Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 LA L A G + M I KP S+ ++YMVE S L Sbjct: 42 LAKQLSEAITATGPIPLASFMR--------MCLTSDTIRNFKPMAESIEAVYMVEASPAL 93 Query: 123 TLIQKKQLASYGDKI---------NWYTSLADVP-----------LGFTFLVANEFFDSL 162 QK+ L I + Y + + F+VA+EFFD+L Sbjct: 94 RDTQKQLLCGDAPMIETETGFKSTSKYAGIPIMWTENMRFVPSGADKTPFIVAHEFFDAL 153 Query: 163 PIKQFVM--------------------------------TEHGIRERMIDIDQHDSLVFN 190 PI F RE ++ +S + Sbjct: 154 PIHAFQSVPPNPNAPEPTTIQTPTGTHPLSPSTSKSSTAKTPQWREMVVSPTPPNSTHND 213 Query: 191 IGDHEIKSNFLTCSDYFL---------------------------GAIFENSPCRDREMQ 223 + + + + ++ L ++ E SP + Sbjct: 214 VHTPKSLQSQSSPPEFQLTLSKASTPHSLYLPEISTRYRALKSIPDSLIEISPESHAIVA 273 Query: 224 SISDRL--------ACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSH 275 + R+ G A+++DYG + ++L+ +K H VSPL PG DLS+ Sbjct: 274 DFASRIGGSETSPKPNPSGAALILDYGPSDTIPTNSLRGIKAHQRVSPLSEPGVVDLSAD 333 Query: 276 VDFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLD-----SV 328 VDF L+ A+ + ++G QG +LE +GI +RA L+K +KD + + Sbjct: 334 VDFIALAEAAMNASEGVEVHGPMEQGGWLESMGIKERAEMLVKSLGQKDDEVKKRFEGAW 393 Query: 329 KRLVSTSADKKSMGELFKILVVSHEK 354 KRLV MG+++K++ V E Sbjct: 394 KRLVDRGG--SGMGKVYKVMAVVPEN 417 Score = 47.0 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 4/29 (13%), Positives = 12/29 (41%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVA 30 L +++ I G + + + +C+ Sbjct: 39 STPLAKQLSEAITATGPIPLASFMRMCLT 67 >gi|148926234|ref|ZP_01809919.1| hypothetical protein Cj8486_1278 [Campylobacter jejuni subsp. jejuni CG8486] gi|145845405|gb|EDK22498.1| hypothetical protein Cj8486_1278 [Campylobacter jejuni subsp. jejuni CG8486] Length = 316 Score = 117 bits (293), Expect = 3e-24, Method: Composition-based stats. Identities = 68/317 (21%), Positives = 114/317 (35%), Gaps = 25/317 (7%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCV 80 +F + + YY G GDF TA + +FG +LA + ++ + Sbjct: 3 FSDFFHAWLHES---YYKNAVSIGKNGDFFTAVSVGNLFGTLLAKHFLNLIDKKILQLPL 59 Query: 81 RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWY 140 LVE+G G + D L + +L+P+ FS +S +++E E+L +QKK L + Sbjct: 60 ELVEIGANEGYLSRDFLAALLELRPEIFSQISFFVIEPHEKLRTLQKKTLEG--VEFTHK 117 Query: 141 TSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNF 200 SL + F NE FDS + + E F + + N Sbjct: 118 NSLKECHFKNAFFFCNELFDSFACELIDHDKMAFVE-----------NFKLIFKNMDENL 166 Query: 201 LTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTY 260 +T E S + + ++ C+ DYG + +L+ + H Sbjct: 167 ITKCKALNLKKGELSLELENFFKDLNQ--TCERFIFAGFDYG-TLNPQSFSLRIYQKHEV 223 Query: 261 VSPLV-----NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAF-SL 314 SP G++DL+ +V+F L + Y Q L G + Sbjct: 224 FSPFKVSLKDFFGKSDLTYNVNFTHLQKLIKEYDFKPLAFKKQSLALMDFGFEDLLEYTK 283 Query: 315 MKQTARKDILLDSVKRL 331 K + L K L Sbjct: 284 NKNIKTYESFLSQAKIL 300 >gi|57238108|ref|YP_179358.1| hypothetical protein CJE1371 [Campylobacter jejuni RM1221] gi|57166912|gb|AAW35691.1| conserved hypothetical protein [Campylobacter jejuni RM1221] gi|315058668|gb|ADT72997.1| Uncharacterized conserved protein [Campylobacter jejuni subsp. jejuni S3] Length = 316 Score = 117 bits (292), Expect = 3e-24, Method: Composition-based stats. Identities = 67/317 (21%), Positives = 113/317 (35%), Gaps = 25/317 (7%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCV 80 +F + + YY G GDF TA + +FG +LA + ++ + Sbjct: 3 FSDFFHAWLHES---YYKNAVSIGKNGDFFTAVSVGNLFGTLLAKHFLNLIDKKILQLPL 59 Query: 81 RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWY 140 LVE+G G + D L + +L+P+ FS +S +++E E+L +QKK L + Sbjct: 60 ELVEIGANEGYLSRDFLAALLELRPEIFSQISFFVIEPHEKLRTLQKKTLEG--VEFTHK 117 Query: 141 TSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNF 200 SL + F NE FDS + + E F + + N Sbjct: 118 NSLKECHFKNAFFFCNELFDSFACELIDHDKMAFVE-----------NFKLIFKNMDENL 166 Query: 201 LTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTY 260 +T E S + + ++ + DYG + +L+ + H Sbjct: 167 ITKCKALNLKKGELSLELENFFKDLNQTCERFIFAGL--DYG-TLNPQSFSLRIYQKHEV 223 Query: 261 VSPLV-----NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAF-SL 314 SP G++DL+ +V+F L + Y Q L G + Sbjct: 224 FSPFEVSLKDFFGKSDLTYNVNFTHLQKLIKEYDFKPLAFKKQSLALMDFGFEDLLEYTK 283 Query: 315 MKQTARKDILLDSVKRL 331 K + L K L Sbjct: 284 NKNIKTYESFLSQAKIL 300 >gi|283956630|ref|ZP_06374109.1| hypothetical protein C1336_000260080 [Campylobacter jejuni subsp. jejuni 1336] gi|283791879|gb|EFC30669.1| hypothetical protein C1336_000260080 [Campylobacter jejuni subsp. jejuni 1336] Length = 316 Score = 117 bits (292), Expect = 4e-24, Method: Composition-based stats. Identities = 65/318 (20%), Positives = 110/318 (34%), Gaps = 27/318 (8%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCV 80 +F + YY G GDF TA + +FG +LA + ++ + Sbjct: 3 FSDFFHAWLH---KSYYKNAVSIGKNGDFFTAVSVGNLFGTLLAKHFLNLIDKKILQPPL 59 Query: 81 RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWY 140 LVE+G G + D L + +L+P+ FS +S +++E E+L +QKK L + Sbjct: 60 ELVEIGANEGYLSRDFLAALLELRPEIFSQISFFIIEPHEKLKNLQKKTLEG--VEFTHK 117 Query: 141 TSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNF 200 SL + F NE FDS + + E F + + N Sbjct: 118 NSLKECHFKNAFFFCNELFDSFTCELIDHDKMAFVE-----------NFKLIFKNMDENL 166 Query: 201 LTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY-LQSRVGDTLQAVKGHT 259 +T E S + + + I + Y + +L+ + H Sbjct: 167 ITKCKTLNLTKGELSLELENFFKDLDQACER----FIFAGFDYGTFNVQNFSLRIYQKHE 222 Query: 260 YVSPLV-----NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAF-S 313 +P G++DL+ +V+F L + Y Q L G + Sbjct: 223 VFNPFEAPLKDFFGKSDLTYNVNFTHLQKLIKAYDFKPLAFKKQSLALMDFGFEDLLEYT 282 Query: 314 LMKQTARKDILLDSVKRL 331 K + L K L Sbjct: 283 KNKNIKTYESFLSQAKIL 300 >gi|205356255|ref|ZP_03223021.1| hypothetical protein Cj8421_1275 [Campylobacter jejuni subsp. jejuni CG8421] gi|205345860|gb|EDZ32497.1| hypothetical protein Cj8421_1275 [Campylobacter jejuni subsp. jejuni CG8421] Length = 316 Score = 116 bits (291), Expect = 5e-24, Method: Composition-based stats. Identities = 68/317 (21%), Positives = 114/317 (35%), Gaps = 25/317 (7%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCV 80 +F + + YY G GDF TA + +FG +LA + ++ + Sbjct: 3 FSDFFHAWLHES---YYKNAVSIGKNGDFFTAVSVGNLFGTLLAKHFLNLIDKKILQLPL 59 Query: 81 RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWY 140 LVE+G G + D L + +L+P+ FS +S +++E E+L +QKK L + Sbjct: 60 ELVEIGANEGYLSRDFLAALLELRPEIFSQISFFVIEPHEKLRTLQKKTLEG--VEFTHK 117 Query: 141 TSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNF 200 SL + F NE FDS + + E F + + N Sbjct: 118 NSLKECHFKNAFFFCNELFDSFACELIDHDKMAFVE-----------NFKLIFKNMDENL 166 Query: 201 LTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTY 260 +T E S + + ++ C+ DYG + +L+ + H Sbjct: 167 ITKCKALNLKKGELSLELENFFKDLNQ--TCERFIFAGFDYG-TLNPQSFSLRIYQKHEV 223 Query: 261 VSPLV-----NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAF-SL 314 SP G++DL+ +V+F L + Y Q L G + Sbjct: 224 FSPFEVSLKDFFGKSDLTYNVNFTHLQKLIKEYDFKPLAFKKQSLALMDFGFEDLLEYTK 283 Query: 315 MKQTARKDILLDSVKRL 331 K + L K L Sbjct: 284 NKNIKTYESFLSQAKIL 300 >gi|225159266|ref|ZP_03725567.1| protein of unknown function DUF185 [Opitutaceae bacterium TAV2] gi|224802163|gb|EEG20434.1| protein of unknown function DUF185 [Opitutaceae bacterium TAV2] Length = 332 Score = 115 bits (289), Expect = 7e-24, Method: Composition-based stats. Identities = 58/348 (16%), Positives = 99/348 (28%), Gaps = 47/348 (13%) Query: 25 FALCVADPEFGYYST-CNPFGAVG--DFVTAPEISQIFGEMLAIFLICAWEQHGFPS--- 78 + + DP GYY G DF TA +FGE++ ++HG Sbjct: 1 MEVALYDPGVGYYRAARQRVGRASGTDFYTATSSGALFGELVCAAAESLVQRHGAQHGVK 60 Query: 79 --CVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 VE+G G +L + ++ + D Sbjct: 61 LSDFTFVEIGAEPGGGILRDVAT--------------------HPFRSVRTLGVGDLLDL 100 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFN------ 190 + G + +NE FD+ P+++FV E + + +L Sbjct: 101 --NDADGSTETGGPCVVFSNELFDAQPVRRFVRHAGAWHELGVTLMPDGTLHETRLASVP 158 Query: 191 IGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS---- 246 SD R + G I DYG Sbjct: 159 TSAAPWLPAADPASDIAPADGDLFDAPRAAAELAARIAAQPWSGLFIAFDYGKTWRELAT 218 Query: 247 -RVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSIAILY----KLYINGLTTQGK 300 T +A + H L +PG DL++HV + L+ + +Q Sbjct: 219 EHPAGTARAYRAHRQHNDLLADPGGQDLTAHVCWDWLADALRAAVPPFDAGTVAVESQEA 278 Query: 301 FLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKIL 348 F + + + A K + +MG+ F++L Sbjct: 279 FFIHHAVAFLEKTFVANAAESSG-FSPRKSALMQLLHPANMGQKFQVL 325 >gi|218562848|ref|YP_002344627.1| hypothetical protein Cj1236 [Campylobacter jejuni subsp. jejuni NCTC 11168] gi|112360554|emb|CAL35351.1| conserved hypothetical protein Cj1236 [Campylobacter jejuni subsp. jejuni NCTC 11168] gi|315927141|gb|EFV06492.1| conserved hypothetical protein [Campylobacter jejuni subsp. jejuni DFVF1099] Length = 316 Score = 115 bits (288), Expect = 1e-23, Method: Composition-based stats. Identities = 67/317 (21%), Positives = 113/317 (35%), Gaps = 25/317 (7%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCV 80 +F + + YY G GDF TA + +FG +LA + ++ + Sbjct: 3 FSDFFHAWLHES---YYKNAVSIGKNGDFFTAVSVGNLFGTLLAKHFLNLIDEKILKPPL 59 Query: 81 RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWY 140 LVE+G G + D L + +L+P+ FS +S +++E E+L +QKK L + Sbjct: 60 ELVEIGANEGYLSRDFLAALLELRPEIFSQISFFIIEPHEKLRTLQKKTLEG--VEFTHK 117 Query: 141 TSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNF 200 SL + F NE FDS + + E F + + N Sbjct: 118 NSLKECHFKNAFFFCNELFDSFTCELIDHDKMAFVE-----------NFKLIFKNMDENL 166 Query: 201 LTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTY 260 +T E S + + ++ C+ DYG + +L+ + H Sbjct: 167 ITKCKALNLTKGELSLELENFFKDLNQ--TCERFIFAGFDYG-TLNPQSFSLRIYQKHEV 223 Query: 261 VSPLV-----NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAF-SL 314 SP G++DL+ +V+F L + Y Q G + Sbjct: 224 FSPFEVSLKDFFGKSDLTYNVNFTHLQKLIKEYDFKPLAFKKQSLAFMDFGFEDLLEYAK 283 Query: 315 MKQTARKDILLDSVKRL 331 K + L K L Sbjct: 284 NKNIKTYESFLSQAKIL 300 >gi|121612945|ref|YP_001000910.1| hypothetical protein CJJ81176_1250 [Campylobacter jejuni subsp. jejuni 81-176] gi|167005823|ref|ZP_02271581.1| hypothetical protein Cjejjejuni_06550 [Campylobacter jejuni subsp. jejuni 81-176] gi|87249554|gb|EAQ72513.1| conserved hypothetical protein [Campylobacter jejuni subsp. jejuni 81-176] Length = 316 Score = 115 bits (288), Expect = 1e-23, Method: Composition-based stats. Identities = 66/318 (20%), Positives = 112/318 (35%), Gaps = 27/318 (8%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCV 80 +F + + YY G GDF TA + +FG +LA + ++ + Sbjct: 3 FSDFFHAWLHES---YYKNAVSIGKNGDFFTAVSVGNLFGTLLAKHFLNLIDKKILQPPL 59 Query: 81 RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWY 140 LVE+G G + D L + +L+P+ FS +S +++E E+L +QKK L + Sbjct: 60 ELVEIGANEGYLSRDFLAALLELRPEIFSQISFFVIEPHEKLKNLQKKTLEG--VEFTHK 117 Query: 141 TSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNF 200 SL + F NE FDS + + E F + + N Sbjct: 118 NSLKECHFKNAFFFCNELFDSFTCELINHDKIAFVE-----------NFKLIFKNMDENL 166 Query: 201 LTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY-LQSRVGDTLQAVKGHT 259 +T E S ++ + + I + Y + +L+ + H Sbjct: 167 ITKCKALNLTKGELSLELEKFFKDLDQACER----FIFAGFDYGTFNAQNFSLRIYQKHE 222 Query: 260 YVSPLV-----NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAF-S 313 SP G++DL+ +V+F L + Y Q L G + Sbjct: 223 VFSPFEVSLKDFFGKSDLTYNVNFTHLQKLIKEYDFKPLAFKKQSLALMDFGFEDLLEYT 282 Query: 314 LMKQTARKDILLDSVKRL 331 K + L K L Sbjct: 283 KNKNIKTYESFLSQAKIL 300 >gi|86152711|ref|ZP_01070916.1| conserved hypothetical protein [Campylobacter jejuni subsp. jejuni HB93-13] gi|85843596|gb|EAQ60806.1| conserved hypothetical protein [Campylobacter jejuni subsp. jejuni HB93-13] Length = 316 Score = 115 bits (287), Expect = 1e-23, Method: Composition-based stats. Identities = 66/318 (20%), Positives = 112/318 (35%), Gaps = 27/318 (8%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCV 80 +F + + YY G GDF TA + +FG +LA + ++ + Sbjct: 3 FSDFFHSWLHES---YYKNAVSIGKNGDFFTAVSVGNLFGTLLAKHFLNLIDKKILQPPL 59 Query: 81 RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWY 140 LVE+G G + D L + +L+P+ FS +S +++E E+L +QKK L + Sbjct: 60 ELVEIGANEGYLSRDFLAALLELRPEIFSQISFFVIEPHEKLKNLQKKTLEG--VEFTHK 117 Query: 141 TSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNF 200 SL + F NE FDS + + E F + + N Sbjct: 118 NSLKECHFKNAFFFCNELFDSFTCELINHDKIAFVE-----------NFKLIFKNMDENL 166 Query: 201 LTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY-LQSRVGDTLQAVKGHT 259 +T E S ++ + + I + Y + +L+ + H Sbjct: 167 ITKCKALNLTKGELSLELEKFFKDLDQACER----FIFAGFDYGTFNAQNFSLRIYQKHE 222 Query: 260 YVSPLV-----NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAF-S 313 SP G++DL+ +V+F L + Y Q L G + Sbjct: 223 VFSPFEVSLKDFFGKSDLTYNVNFTHLQKLIKEYDFKPLAFKKQSLALMDFGFEDLLEYT 282 Query: 314 LMKQTARKDILLDSVKRL 331 K + L K L Sbjct: 283 KNKNIKTYESFLSQAKIL 300 >gi|86150573|ref|ZP_01068796.1| conserved hypothetical protein [Campylobacter jejuni subsp. jejuni CF93-6] gi|88596609|ref|ZP_01099846.1| conserved hypothetical protein [Campylobacter jejuni subsp. jejuni 84-25] gi|85838924|gb|EAQ56190.1| conserved hypothetical protein [Campylobacter jejuni subsp. jejuni CF93-6] gi|88191450|gb|EAQ95422.1| conserved hypothetical protein [Campylobacter jejuni subsp. jejuni 84-25] gi|284926460|gb|ADC28812.1| conserved hypothetical protein [Campylobacter jejuni subsp. jejuni IA3902] gi|315929351|gb|EFV08558.1| conserved hypothetical protein [Campylobacter jejuni subsp. jejuni 305] Length = 316 Score = 115 bits (287), Expect = 2e-23, Method: Composition-based stats. Identities = 67/317 (21%), Positives = 113/317 (35%), Gaps = 25/317 (7%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCV 80 +F + + YY G GDF TA + +FG +LA + ++ + Sbjct: 3 FSDFFHAWLHES---YYKNAVSIGKNGDFFTAVSVGNLFGTLLAKHFLNLIDEKILKPPL 59 Query: 81 RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWY 140 LVE+G G + D L + +L+P+ FS +S +++E E+L +QKK L + Sbjct: 60 ELVEIGANEGYLSRDFLAALLELRPEIFSQISFFIIEPHEKLRTLQKKTLEG--VEFTHK 117 Query: 141 TSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNF 200 SL + F NE FDS + + E F + + N Sbjct: 118 NSLKECHFKNAFFFCNELFDSFTCELIDHDKMAFVE-----------NFKLIFKNMDENL 166 Query: 201 LTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTY 260 +T E S + + ++ C+ DYG + +L+ + H Sbjct: 167 ITKCKALNLTKGELSLELENFFKDLNQ--TCERFIFAGFDYG-TLNPQSFSLRIYQKHEV 223 Query: 261 VSPLV-----NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAF-SL 314 SP G++DL+ +V+F L + Y Q G + Sbjct: 224 FSPFEVSLKDFFGKSDLTYNVNFTHLQKLIKEYDFKPLAFKKQSLAFMDFGFEDLLEYTK 283 Query: 315 MKQTARKDILLDSVKRL 331 K + L K L Sbjct: 284 NKNIKTYESFLSQAKIL 300 >gi|157415499|ref|YP_001482755.1| hypothetical protein C8J_1179 [Campylobacter jejuni subsp. jejuni 81116] gi|157386463|gb|ABV52778.1| hypothetical protein C8J_1179 [Campylobacter jejuni subsp. jejuni 81116] gi|307748141|gb|ADN91411.1| Putative uncharacterized protein [Campylobacter jejuni subsp. jejuni M1] gi|315932381|gb|EFV11324.1| conserved hypothetical protein [Campylobacter jejuni subsp. jejuni 327] Length = 316 Score = 114 bits (285), Expect = 2e-23, Method: Composition-based stats. Identities = 65/318 (20%), Positives = 110/318 (34%), Gaps = 27/318 (8%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCV 80 +F + + YY G GDF TA + +F +LA + ++ + Sbjct: 3 FSDFFHAWLHES---YYKNAVSIGKNGDFFTAVSVGNLFSTLLAKHFLNLIDKKILQPPL 59 Query: 81 RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWY 140 LVE+G G + D L + +L+P+ FS +S +++E E+L +QKK L + Sbjct: 60 ELVEIGANEGYLSRDFLAALLELRPEIFSQISFFVIEPHEKLKNLQKKTLEG--VEFTHK 117 Query: 141 TSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNF 200 SL + F NE FDS + + E F + + N Sbjct: 118 NSLKECHFKNAFFFCNELFDSFTCELIDHDKMAFVE-----------NFKLIFKNMDENL 166 Query: 201 LTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY-LQSRVGDTLQAVKGHT 259 +T E S + + + I + Y + +L+ + H Sbjct: 167 ITKCKALNLTKGELSLELENFFKDLDQACER----FIFAGFDYGTLNPQNFSLRIYQKHE 222 Query: 260 YVSPLV-----NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAF-S 313 SP G++DL+ +V+F L + Y Q L G + Sbjct: 223 VFSPFEVSLKDFFGKSDLTYNVNFTHLQKLIKEYDFKPLAFKKQSLALMDFGFEDLLEYT 282 Query: 314 LMKQTARKDILLDSVKRL 331 K + L K L Sbjct: 283 KNKNIKTYESFLSQAKIL 300 >gi|222874776|gb|EEF11907.1| predicted protein [Populus trichocarpa] Length = 280 Score = 114 bits (285), Expect = 2e-23, Method: Composition-based stats. Identities = 42/155 (27%), Positives = 70/155 (45%), Gaps = 12/155 (7%) Query: 1 MENKLIRKIVNLIKK-NGQMTVDQYFALCVADPEFGYYST-CNPFG----AVGDFVTAPE 54 + L ++ I G + D++ AL + +P GYY+ FG + DFVTAPE Sbjct: 94 LTAALQSRVAQEIASSGGWLAFDRFMALALYEPGLGYYANDTAKFGLMPSSGSDFVTAPE 153 Query: 55 ISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIY 114 +S +FG++LA + A ++ + E G G G + L +L + L Sbjct: 154 MSPVFGQLLAAQVAEALQRT---HTREVWEFGAGTGALALQVLDELAAL---GVRPDRYT 207 Query: 115 MVETSERLTLIQKKQLASYGDKINWYTSLADVPLG 149 +V+ S L Q+ +L Y ++W +L D G Sbjct: 208 IVDLSGTLRARQQLRLVKYEGLVHWADALPDRLEG 242 >gi|86150830|ref|ZP_01069046.1| conserved hypothetical protein [Campylobacter jejuni subsp. jejuni 260.94] gi|85842000|gb|EAQ59246.1| conserved hypothetical protein [Campylobacter jejuni subsp. jejuni 260.94] Length = 316 Score = 114 bits (285), Expect = 3e-23, Method: Composition-based stats. Identities = 67/317 (21%), Positives = 113/317 (35%), Gaps = 25/317 (7%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCV 80 +F + + YY G GDF TA + +FG +LA + ++ + Sbjct: 3 FSDFFHAWLHES---YYKNAVSIGKNGDFFTAVSVGNLFGTLLAKHFLNLIDKKILQLPL 59 Query: 81 RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWY 140 LVE+G G + D L + +L+P+ FS +S +++E E+L +QKK L + Sbjct: 60 ELVEIGANEGYLSRDFLSALLELRPEIFSQISFFVIEPHEKLRTLQKKTLEE--VEFTHK 117 Query: 141 TSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNF 200 SL + F NE FDS + + E F + + N Sbjct: 118 NSLKECHFKNAFFFCNELFDSFACELIDHDKMAFVE-----------NFKLIFKNMDENL 166 Query: 201 LTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTY 260 +T E S + + ++ C+ DYG + +L+ + H Sbjct: 167 ITKCKALNLKKGELSLELENFFKDLNQ--TCERFIFAGFDYG-TLNPQSFSLRIYQKHEV 223 Query: 261 VSPLV-----NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAF-SL 314 SP G++DL+ +V+F L + Y Q G + Sbjct: 224 FSPFEVSLKDFFGKSDLTYNVNFTHLQKLIKEYDFKPLAFKKQSLAFMDFGFEDLLEYAK 283 Query: 315 MKQTARKDILLDSVKRL 331 K + L K L Sbjct: 284 NKNIKTYESFLSQAKIL 300 >gi|224436895|ref|ZP_03657884.1| hypothetical protein HcinC1_02971 [Helicobacter cinaedi CCUG 18818] Length = 359 Score = 114 bits (284), Expect = 3e-23, Method: Composition-based stats. Identities = 62/358 (17%), Positives = 112/358 (31%), Gaps = 45/358 (12%) Query: 25 FALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVE 84 + + E GYY+ + GDF T+ +S+ FG +A +++ E + +R+VE Sbjct: 1 MSQSLYG-ESGYYADSSRVSKTGDFYTSVSVSKFFGGSIASYILSLLESNALSLPLRIVE 59 Query: 85 LGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASY-------GDKI 137 +G +G ++ DI + L + ++E L QK L + Sbjct: 60 IGADKGYLLGDIALFLDAL-SEVLPQCEFIIIEPLSTLAQTQKAYLRGLKFSCVLDFKIV 118 Query: 138 NWYTSLADVPLGFTFLVANEFFDSLPIK-----QFVMTEHGIRERMIDIDQHDSLVFNIG 192 + +L+ F+++NE FDS P + + + I + + + + Sbjct: 119 ESFEALSQNKDSNLFIISNELFDSFPCDVLDSGKMLCVSQDSKWCGIWQNLNAKNLSTLL 178 Query: 193 DHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG---------- 242 + L +D A + DYG Sbjct: 179 PRQSLEILDKLKCNPLKFCGVLPQWQDFICNLSRFAKAHKNSYFVSFDYGRENLSQTQNP 238 Query: 243 ---YLQSRVGDTLQAVKGH---------TYVSPLVNPGQADLSSHVDFQRLSSIAILYKL 290 Y + AD++ VDF L S+ Sbjct: 239 NAKYYNPLHHNPRFYKSHQVLSLKDFLEQGGDFHTLYQNADITYDVDFTLLDSLLCENGF 298 Query: 291 YINGLTTQGKFLE-GLGIWQRAFSLMKQTARKDIL--LDSVKRLVSTSADKKSMGELF 345 TQ K L + I + + +Q L + +K L+ T GE F Sbjct: 299 QKIFCDTQAKVLIEKMQILELLQTFSQQCGYNTYLKEIHKLKTLLHTL------GERF 350 >gi|283954796|ref|ZP_06372312.1| hypothetical protein C414_000260065 [Campylobacter jejuni subsp. jejuni 414] gi|283793636|gb|EFC32389.1| hypothetical protein C414_000260065 [Campylobacter jejuni subsp. jejuni 414] Length = 316 Score = 114 bits (284), Expect = 3e-23, Method: Composition-based stats. Identities = 65/317 (20%), Positives = 116/317 (36%), Gaps = 25/317 (7%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCV 80 +F + + YY G GDF TA + +FG +LA + ++ + Sbjct: 3 FSIFFHTWLYEN---YYKNAVNIGKNGDFFTAVSVGNLFGTLLAQHFLNLVDKKILKLPL 59 Query: 81 RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWY 140 LVE+G G + D L + +L+P FS +S +++E E+L +QKK L + Sbjct: 60 ELVEIGANEGYLSRDFLAALLELRPQIFSQISFFIIEPHEKLRTLQKKTLEG--VEFTHK 117 Query: 141 TSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNF 200 SL + F NE FDS + + E + LVF D + + Sbjct: 118 NSLKECHFKNAFFFCNELFDSFTCELIDDNKMAFVE-------NFKLVFKNMDENLIAKC 170 Query: 201 LTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTY 260 T + E Q+ + DYG + +L+ + H Sbjct: 171 KTLNLKKGELSLELEDFFKDLNQACDQFI------FAGFDYGIFNPQK-FSLRIYQKHEV 223 Query: 261 VSPLV-----NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAF-SL 314 +P G++DL+ +++F ++ ++ + + Q L G + + Sbjct: 224 FNPFEISLKDFFGKSDLTYNINFTQVQNLIKEFDFKLLAFKKQSFALMDFGFEKLLEHAK 283 Query: 315 MKQTARKDILLDSVKRL 331 K + L K L Sbjct: 284 KKNIKTYESFLSQAKIL 300 >gi|322790659|gb|EFZ15443.1| hypothetical protein SINV_14205 [Solenopsis invicta] Length = 205 Score = 113 bits (283), Expect = 3e-23, Method: Composition-based stats. Identities = 58/205 (28%), Positives = 88/205 (42%), Gaps = 14/205 (6%) Query: 169 MTEHGIRERMIDIDQH---DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSI 225 T+ G RE +IDI Q + + + + + S + E SP + Sbjct: 1 KTDKGWREILIDIVQESKEERFRYVLSQVPTAACKVYLSPHEKRDHVEVSPQCSVITDYM 60 Query: 226 SDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIA 285 S L GG A+VIDYG+ + + DT +A H PL+NPG ADL++ +DF + IA Sbjct: 61 SQFLWEHGGFALVIDYGHEREKT-DTFRAFCQHKLHDPLLNPGTADLTADIDFLSIKEIA 119 Query: 286 ILYK-LYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGEL 344 L G TQ KFL+ LGI R +++ + V+ D+ MG Sbjct: 120 QKDNRLITFGPVTQRKFLKALGIDVRLKMILRNATSTQK--EQVESGYHMITDEDKMGNC 177 Query: 345 FKILVVSH-------EKVELMPFVN 362 FK++ + K + F N Sbjct: 178 FKVMSLFPFVLKDHLTKWPVAGFEN 202 >gi|222869986|gb|EEF07117.1| predicted protein [Populus trichocarpa] Length = 245 Score = 112 bits (280), Expect = 8e-23, Method: Composition-based stats. Identities = 55/252 (21%), Positives = 87/252 (34%), Gaps = 33/252 (13%) Query: 25 FALCVADPEFGYYSTCN-PFGAV----GDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 +L + P GYYS FG DF+TAPE++ F LA + P Sbjct: 1 MSLALYAPRLGYYSGGAAKFGRDVNDGSDFITAPELTPFFARTLARQFAP-LVRMNLP-- 57 Query: 80 VRLVELGPGRG--IMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQL----ASY 133 R++E G G G L + PD + +VE S L Q+ L Sbjct: 58 -RVMEFGAGTGRLAADLLLALETEDALPDTYQ-----IVELSGELRARQQATLDQRAPHL 111 Query: 134 GDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDS---LVFN 190 ++ W +L D G +V NE D++P++ + T ER + + F Sbjct: 112 AGRVTWLDALPDRFEG--VVVGNEVLDAMPVRLYARTGGRWHERGVVCAKDGKAGAEAFA 169 Query: 191 IGDHEIKSNFLTC----SDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG---- 242 D ++ + + E + +++ LA I + Sbjct: 170 FEDRPLEDGAIPAVLLGIPGDHDIVTETHTEAEGFTRAVGALLARGAAFFIDYGFPASEY 229 Query: 243 YLQSRVGDTLQA 254 Y R G TL Sbjct: 230 YHPHRTGGTLMT 241 >gi|71412429|ref|XP_808399.1| hypothetical protein [Trypanosoma cruzi strain CL Brener] gi|70872598|gb|EAN86548.1| hypothetical protein, conserved [Trypanosoma cruzi] Length = 173 Score = 112 bits (280), Expect = 8e-23, Method: Composition-based stats. Identities = 40/134 (29%), Positives = 69/134 (51%), Gaps = 3/134 (2%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYS-TCNPFG-AVGDFVTAPEISQI 58 ++ L ++++ + G + Q+ C+ P+ GYY+ + G DF+TA EI Sbjct: 41 LKTPLCIELISKMSSQGYFPMSQFVKECLTHPQHGYYTAKKHVIGSEKADFITAAEI-PF 99 Query: 59 FGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVET 118 F ++++ +++ AW++ G P L+ELGPGRG +M +IL+ P L I++VE Sbjct: 100 FADVISAWIMDAWQKMGTPRAFHLIELGPGRGTLMKNILKQTKYSNPHLLHFLQIHLVEV 159 Query: 119 SERLTLIQKKQLAS 132 QK LA Sbjct: 160 GAARMEEQKSTLAE 173 >gi|239791225|dbj|BAH72108.1| ACYPI009538 [Acyrthosiphon pisum] Length = 128 Score = 112 bits (280), Expect = 9e-23, Method: Composition-based stats. Identities = 37/101 (36%), Positives = 62/101 (61%), Gaps = 4/101 (3%) Query: 1 MENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFG 60 ++ L + + I+ NG +T+ +Y + YY++ N FG+ GDF+T+PEISQ++G Sbjct: 25 VQQNLTKYFQDKIRINGPITLAEYMRESLKT----YYNSGNVFGSDGDFITSPEISQLYG 80 Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVIC 101 EM+ ++L+ WE+ G PS V L+ELGPG +++I Sbjct: 81 EMVMLWLLSLWEKAGCPSPVNLIELGPGNRSYDDRYVKIIK 121 >gi|315124696|ref|YP_004066700.1| hypothetical protein ICDCCJ07001_1184 [Campylobacter jejuni subsp. jejuni ICDCCJ07001] gi|315018418|gb|ADT66511.1| conserved hypothetical protein [Campylobacter jejuni subsp. jejuni ICDCCJ07001] Length = 316 Score = 112 bits (280), Expect = 9e-23, Method: Composition-based stats. Identities = 63/297 (21%), Positives = 107/297 (36%), Gaps = 24/297 (8%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCV 80 +F + + YY G GDF TA + +FG +LA + ++ + Sbjct: 3 FSDFFHAWLHES---YYKNAVSIGKNGDFFTAVSVGNLFGTLLAKHFLNLIDKKILQLPL 59 Query: 81 RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWY 140 LVE+G G + D L + +L+P+ FS +S +++E E+L +QKK L + Sbjct: 60 ELVEIGANEGYLSRDFLSALLELRPEIFSQISFFVIEPHEKLRTLQKKTLEE--VEFTHK 117 Query: 141 TSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNF 200 SL + F NE FDS + + E F + + N Sbjct: 118 NSLKECHFKNAFFFCNELFDSFACELIDHDKMAFVE-----------NFKLIFKNMDENL 166 Query: 201 LTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTY 260 +T E S + + ++ C+ DYG + +L+ + H Sbjct: 167 ITKCKALNLKKGELSLELENFFKDLNQ--TCERFIFAGFDYG-TLNPQSFSLRIYQKHEV 223 Query: 261 VSPLV-----NPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAF 312 SP G++DL+ +V+F L + Y Q G Sbjct: 224 FSPFEVSLKDFFGKSDLTYNVNFTHLQKLIKEYDFKPLAFKKQSLAFMDFGFEDLLE 280 >gi|114576976|ref|XP_001167113.1| PREDICTED: hypothetical protein isoform 2 [Pan troglodytes] Length = 306 Score = 110 bits (276), Expect = 3e-22, Method: Composition-based stats. Identities = 54/198 (27%), Positives = 83/198 (41%), Gaps = 12/198 (6%) Query: 162 LPIKQFVMTEHGIRERMIDIDQH--DSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRD 219 + + T G RE +DID D L F + + D E P Sbjct: 86 ISVHLVEKTPQGWREVFVDIDPQVSDKLRFVLAPSATPAEAFIQHD-ETRDHVEVCPDAG 144 Query: 220 REMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 ++ +S R+A GG A+V DYG+ ++ DT + H L+ PG ADL++ VDF Sbjct: 145 VIIEELSQRIALTGGAALVADYGHDGTKT-DTFRGFCDHKLHDVLIAPGTADLTADVDFS 203 Query: 280 RLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR---KDILLDSVKRLVSTSA 336 L +A K+ G Q FL+ +GI R L+ ++ + LL L+ Sbjct: 204 YLRRMA-QGKVASLGPIKQHTFLKNMGIDVRLKVLLDKSNEPSVRQQLLQGYDMLM---- 258 Query: 337 DKKSMGELFKILVVSHEK 354 + K MGE F + + Sbjct: 259 NPKKMGERFNFFALLPHQ 276 Score = 46.6 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 8/30 (26%), Positives = 16/30 (53%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADP 32 ++R ++ IK G +TV +Y + +P Sbjct: 41 TPMLRHLMYKIKSTGPITVAEYMKEVLTNP 70 >gi|195365466|ref|XP_002045652.1| GM16874 [Drosophila sechellia] gi|194133194|gb|EDW54710.1| GM16874 [Drosophila sechellia] Length = 100 Score = 109 bits (273), Expect = 5e-22, Method: Composition-based stats. Identities = 24/58 (41%), Positives = 35/58 (60%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGE 61 L +++ I G + V +Y + +P+ GYY + FG GDF+T+PEISQIFGE Sbjct: 41 SLAKQLRAKILSTGPIPVAEYMREVLTNPQAGYYMNRDVFGREGDFITSPEISQIFGE 98 >gi|320582809|gb|EFW97026.1| hypothetical protein HPODL_1736 [Pichia angusta DL-1] Length = 329 Score = 109 bits (273), Expect = 6e-22, Method: Composition-based stats. Identities = 79/314 (25%), Positives = 125/314 (39%), Gaps = 38/314 (12%) Query: 76 FPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYG- 134 R++E GPG+G MM + V + D + + I MVE S+ L Q K L Sbjct: 19 KDKIFRVIEFGPGKGSMMRGLAMVFKQYIID--NPVEIVMVEKSDILIREQHKLLCKSQK 76 Query: 135 --------------------DKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGI 174 + N L F+VA+EFFD+LPI +++ T+HG Sbjct: 77 LEQVDDYNFESITEWGQPITWQKNDLVELDLDSKYMNFVVAHEFFDALPIDRYIKTKHGW 136 Query: 175 RE-----RMIDIDQHDSLVFNIGDHEIKSNFLTC-----SDYFLGAIFENSPCRDREMQS 224 RE R + + + H +F+ + +GA E S Sbjct: 137 REYLVDVREPEAGRPGKFGLVVAPHATPGSFIPATNERYNKLAVGANVEISADAHMYASQ 196 Query: 225 ISDRLAC-DGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSS 283 + + D G A++IDYG + ++L+ +K H +V P PG+ DLS VDF LS Sbjct: 197 FAKIINSGDVGGALIIDYGPKDTIPINSLRGIKDHKFVDPFSEPGKIDLSVDVDFLGLSR 256 Query: 284 IAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDS-VKRLVSTSADKKSMG 342 + L + TQ +FL G+ L + ++ +RL + + MG Sbjct: 257 LFESQGLKTM-IETQSQFLNSSGLPSILGGLAFSMKEQQQRFETLYQRLT--GSAPQDMG 313 Query: 343 ELFKILVVSHEKVE 356 +K L V + Sbjct: 314 RAYKALQVYRNANK 327 >gi|325518328|gb|EGC98059.1| hypothetical protein B1M_43535 [Burkholderia sp. TJI49] Length = 173 Score = 109 bits (273), Expect = 6e-22, Method: Composition-based stats. Identities = 36/156 (23%), Positives = 67/156 (42%), Gaps = 14/156 (8%) Query: 205 DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDY------GYLQSRVGDT-LQAVKG 257 D G + E ++++ L G ++IDY Y R T + + Sbjct: 6 DVDDGYVTETHEAALAFVRTVCTMLGR--GAVLLIDYGFPAHEYYHPQRDRGTLMCHYRH 63 Query: 258 HTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLM-K 316 H + P + PG D+++HV+F + + + G T+Q +FL GI ++ Sbjct: 64 HAHDDPFLYPGLQDITAHVEFTGIYEAGVATGADLLGYTSQARFLLDAGITDALAAIDPS 123 Query: 317 QTARKDILLDSVKRLVSTSADKKSMGELFKILVVSH 352 + ++V++L+S + MGELFK++ S Sbjct: 124 DIKQFLPAANAVQKLIS----EAEMGELFKVIAFSR 155 >gi|145596794|ref|YP_001161091.1| hypothetical protein Strop_4285 [Salinispora tropica CNB-440] gi|145306131|gb|ABP56713.1| protein of unknown function DUF185 [Salinispora tropica CNB-440] Length = 338 Score = 109 bits (271), Expect = 1e-21, Method: Composition-based stats. Identities = 70/354 (19%), Positives = 119/354 (33%), Gaps = 29/354 (8%) Query: 18 QMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFP 77 + + P+ G++ + G F T+ S F L + P Sbjct: 2 PIRWRDAMEQALYGPD-GFFVSGA--GPADHFRTSVHASPAFAAALHRLIATVDAALDHP 58 Query: 78 SCVRLVELGPGRGIMMLDILRVICKLKPDFF-SVLSIYMVETSERLTLIQKKQLASYGDK 136 + +V++G GRG ++ + I L + VE + R A Sbjct: 59 EQLAVVDIGAGRGELLRTLEVSIAGATDTSLPDRLRLTAVERAPR--------PADLPAG 110 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHE- 195 I W + G L+A E+ D++P+ V T G R ++D + V E Sbjct: 111 ITWTNQIPAGVTG--LLLATEWLDNVPLDLAVATPEGWRYLLVDPATGEETVGTPVSPED 168 Query: 196 ---IKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS--RVGD 250 + + +D G E RD RL G A+ +DYG+L+ Sbjct: 169 ADWLTRWWSPPADDDPGTRAEIGRTRDDAWADALGRLDR--GLAVTVDYGHLRQARPTTG 226 Query: 251 TLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQR 310 TL + VSP+ + D+++H+ +++ L +Q L LG R Sbjct: 227 TLTGYRNGRQVSPVPDGS-CDVTAHIAVDSVAASGAQVGRAPYTLVSQRTALRALGADGR 285 Query: 311 ---AFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVELMPFV 361 A L + V+ D +G ++ V L P V Sbjct: 286 RPPLERASTDPAGYLRALSAAS-AVAELIDPAGLGG--HWWLLQPAGVTLDPLV 336 >gi|167736763|ref|ZP_02409537.1| hypothetical protein Bpse14_01794 [Burkholderia pseudomallei 14] Length = 193 Score = 109 bits (271), Expect = 1e-21, Method: Composition-based stats. Identities = 40/153 (26%), Positives = 66/153 (43%), Gaps = 14/153 (9%) Query: 208 LGAIFENSPCRDREMQSISDRLACDGGTAIVIDY------GYLQSRVGDTLQAVKGHTYV 261 G + E + ++ LA G A+ IDY Y + R TL H Sbjct: 29 EGYVTETHDAAAAFVGTVCAMLAR--GAALFIDYGFPRHEYYHRQRAQGTLMCHYRHRAH 86 Query: 262 -SPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR 320 P V PG D+++HV+F + + + G T+Q +FL GI + A+ Sbjct: 87 GDPFVYPGLQDITAHVEFSAVYEAGVGAGAELLGYTSQARFLLNAGITDVLAEIDPSDAQ 146 Query: 321 K-DILLDSVKRLVSTSADKKSMGELFKILVVSH 352 + ++V++L+S + MGELFK++ S Sbjct: 147 RFLPAANAVQKLIS----EAEMGELFKVIAFSR 175 >gi|167717730|ref|ZP_02400966.1| hypothetical protein BpseD_01851 [Burkholderia pseudomallei DM98] Length = 176 Score = 109 bits (271), Expect = 1e-21, Method: Composition-based stats. Identities = 40/153 (26%), Positives = 66/153 (43%), Gaps = 14/153 (9%) Query: 208 LGAIFENSPCRDREMQSISDRLACDGGTAIVIDY------GYLQSRVGDTLQAVKGHTYV 261 G + E + ++ LA G A+ IDY Y + R TL H Sbjct: 12 EGYVTETHDAAAAFVGTVCAMLAR--GAALFIDYGFPRHEYYHRQRAQGTLMCHYRHRAH 69 Query: 262 -SPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR 320 P V PG D+++HV+F + + + G T+Q +FL GI + A+ Sbjct: 70 GDPFVYPGLQDITAHVEFSAVYEAGVGAGAELLGYTSQARFLLNAGITDVLAEIDPSDAQ 129 Query: 321 K-DILLDSVKRLVSTSADKKSMGELFKILVVSH 352 + ++V++L+S + MGELFK++ S Sbjct: 130 RFLPAANAVQKLIS----EAEMGELFKVIAFSR 158 >gi|167518474|ref|XP_001743577.1| hypothetical protein [Monosiga brevicollis MX1] gi|163777539|gb|EDQ91155.1| predicted protein [Monosiga brevicollis MX1] Length = 223 Score = 107 bits (266), Expect = 4e-21, Method: Composition-based stats. Identities = 42/183 (22%), Positives = 66/183 (36%), Gaps = 4/183 (2%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGE 61 ++L+ IV I+ +G +++ Y + P GYYS FG GDF+TAP +S +FGE Sbjct: 27 TDELLDHIVTRIELSGPLSIADYMQEVLTSPIAGYYSRDGQFGGQGDFITAPGVSHMFGE 86 Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 +L + L R P L + Sbjct: 87 VL----DKHLPATKIDINLIESSLLLSAEQEATICNRERTSPAPLQDPSLPGPYRKADAD 142 Query: 122 LTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDI 181 ++ + N L +VANEF D+ PI Q ER++D+ Sbjct: 143 RRQLRWFRRLDQLLTENADQKTHGSSLDPVIVVANEFLDAAPIYQLQYQNDQWHERLVDV 202 Query: 182 DQH 184 + Sbjct: 203 NPD 205 >gi|297201413|ref|ZP_06918810.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083] gi|197713821|gb|EDY57855.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083] Length = 347 Score = 105 bits (262), Expect = 1e-20, Method: Composition-based stats. Identities = 61/329 (18%), Positives = 111/329 (33%), Gaps = 29/329 (8%) Query: 26 ALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVEL 85 + P+ G+Y P G G F T+ S +F E +A L E P+ + V++ Sbjct: 17 EAALYGPD-GFY--RRPEGPAGHFRTSVHASPLFAEAVARLLCRLDEALDRPARLDFVDM 73 Query: 86 GPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLAD 145 GRG + V+ L + + +Y VE + A+ +I W + Sbjct: 74 AAGRGELA---GGVLAALPAEVNARTRVYAVEV--------ADRPAALDHRIEWLSEPPK 122 Query: 146 VPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSD 205 G L ANE+ D++P++ G+ R++ + Sbjct: 123 DITG--LLFANEWLDNVPVEIAETDSSGLPRRVLVRRDGRERLGEPVAGAEAEWLDRWWP 180 Query: 206 YFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS--RVGDTLQAVKGHTYVSP 263 R+ S G A+ +DY + S + TL + +P Sbjct: 181 LPAEEGLRAEIGLPRDEAWASAVATVGRGLAVAVDYAHTASSRPLFGTLTGFREGRESAP 240 Query: 264 LVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDI 323 + + DL++HV + + TQ L LGI L + + Sbjct: 241 VPDGS-CDLTAHVALDACALPGGR-------VLTQRDALRALGIEGARPPLSLASTQPAE 292 Query: 324 LLDSVKRL--VSTSADKKSMGELFKILVV 350 + ++ R + +G+ F L+ Sbjct: 293 YVRALARAGQAAELTAAGGLGD-FGWLIQ 320 >gi|58699449|ref|ZP_00374192.1| proline dehydrogenase/delta-1-pyrroline-5-carboxylate dehydrogenase [Wolbachia endosymbiont of Drosophila ananassae] gi|58534036|gb|EAL58292.1| proline dehydrogenase/delta-1-pyrroline-5-carboxylate dehydrogenase [Wolbachia endosymbiont of Drosophila ananassae] Length = 287 Score = 105 bits (262), Expect = 1e-20, Method: Composition-based stats. Identities = 43/160 (26%), Positives = 79/160 (49%), Gaps = 11/160 (6%) Query: 193 DHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTL 252 + + +T +F GA+ E ++ + ++ + G A+++DYGY+ TL Sbjct: 136 WIPVSATQMTNGKFFNGAVVEICSVGVEILKKLEKKIYNNKGAALIVDYGYVYPAYKSTL 195 Query: 253 QAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAF 312 Q++K H Y + L N G +D+++ V+FQ L + TQ +FL GI +R Sbjct: 196 QSIKQHKYANFLENVGNSDITALVNFQALRDSLKHVDCE---ILTQREFLYLFGIKERTQ 252 Query: 313 SLMKQTA--RKDILLDSVKRLVSTSADKKSMGELFKILVV 350 +LMK + +K+ + RL ++MG LFK +++ Sbjct: 253 ALMKSASDEQKNRIFSEFLRLT------ENMGTLFKAMLL 286 >gi|159040211|ref|YP_001539464.1| hypothetical protein Sare_4720 [Salinispora arenicola CNS-205] gi|157919046|gb|ABW00474.1| protein of unknown function DUF185 [Salinispora arenicola CNS-205] Length = 346 Score = 105 bits (261), Expect = 1e-20, Method: Composition-based stats. Identities = 65/347 (18%), Positives = 116/347 (33%), Gaps = 21/347 (6%) Query: 25 FALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVE 84 + P+ G++ + G F T+ +S +F L + G P+ + +V+ Sbjct: 9 MEQALYGPD-GFFVSGT--GPASHFRTSVHVSPVFAAALHRLVTTVDAALGHPASIDVVD 65 Query: 85 LGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLA 144 +G GRG ++ I I + ++ V + + L Sbjct: 66 VGAGRGELLRAIHASIAGATDRPTTTSLVHRVRLTAVERAPRPADLPKEIAWRREIPVGV 125 Query: 145 DVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCS 204 L+A E+ D++ + V T G R ++D + D E Sbjct: 126 A-----GVLLATEWLDNVALDLAVATPAGWRYLLVDPATGNETEGAPVDREDADWLARWW 180 Query: 205 DYFLGAIFENSPCRDREMQSISDRLAC------DGGTAIVIDYGYLQSRVG--DTLQAVK 256 G E+ P E+ D D G A+ +DYG+L+ TL + Sbjct: 181 SSAAGDDLESGPGTRAEIGRARDDAWADAVGRLDRGLAVTVDYGHLRQNRPVAGTLTGYR 240 Query: 257 GHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMK 316 V P+ + D+++HV ++ + L TQ L LG R L + Sbjct: 241 NGRQVPPVPDGS-CDVTAHVAVDSVADAGRRVGRFPYTLVTQRAALRALGADGRRPPLDR 299 Query: 317 QTARKDILLDSVKRL--VSTSADKKSMGELFKILVVSHEKVELMPFV 361 L ++ V+ D +G ++ V L P V Sbjct: 300 AATDPVGYLRALSAASTVAELIDPAGLGG--HWWLLQPAGVTLDPLV 344 >gi|269128688|ref|YP_003302058.1| hypothetical protein Tcur_4493 [Thermomonospora curvata DSM 43183] gi|268313646|gb|ACZ00021.1| protein of unknown function DUF185 [Thermomonospora curvata DSM 43183] Length = 329 Score = 105 bits (261), Expect = 2e-20, Method: Composition-based stats. Identities = 74/344 (21%), Positives = 124/344 (36%), Gaps = 27/344 (7%) Query: 25 FALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVE 84 + E G+YS F T+ S + E L L+ E G PS + LV+ Sbjct: 7 MQQALYG-EGGFYSRGER--PAAHFRTSVHASPRYAEALLRLLVQVDEALGRPSRLDLVD 63 Query: 85 LGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLA 144 +G G G ++ I P + L+ VE + R + D+I W T Sbjct: 64 IGAGCGGLIAQITEAA---DPALAARLNPVAVELAPR--------PPALADRIAWRTEPP 112 Query: 145 DVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCS 204 + G ++ANE+ D++P+ MT G R ++D + E Sbjct: 113 ETITG--LVIANEWLDNIPLDVVEMTGDGPRTVLVDPATGAERLGPAPCAEDLEWLRRWW 170 Query: 205 DYF-LGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQ--SRVGDTLQAVKGHTYV 261 G E R S+ RLA GG A+ +DY + + +L + V Sbjct: 171 PLRDPGDRAEVGRPRCAAWASVVRRLA--GGLAVAVDYCHTRDTRPPCGSLTGYRDGHAV 228 Query: 262 SPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIW-QRAFSLMKQTAR 320 P+ + D+++HV ++ LTTQ + L LG+ R + Sbjct: 229 PPVPDGS-CDITAHVALDACAAGGSAAGATATALTTQRRALRALGLTGARPPITLAHRDP 287 Query: 321 KDILLDSVKRL--VSTSADKKSMGELFKILVVSHEKVELMPFVN 362 + + +++R + D +G F L + P N Sbjct: 288 RAYVA-ALRRAGEEAELIDPTGLGG-FGWLAQTKHIPLPAPLAN 329 >gi|296271379|ref|YP_003654011.1| hypothetical protein Tbis_3428 [Thermobispora bispora DSM 43833] gi|296094166|gb|ADG90118.1| protein of unknown function DUF185 [Thermobispora bispora DSM 43833] Length = 334 Score = 104 bits (259), Expect = 2e-20, Method: Composition-based stats. Identities = 75/348 (21%), Positives = 122/348 (35%), Gaps = 30/348 (8%) Query: 18 QMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFP 77 +T + P G+Y P F T+ S F + + E+ G P Sbjct: 2 WLTWRAAMERALYGPG-GFYLRERP---ARHFRTSVGASPAFADAVIRLAERVDEELGHP 57 Query: 78 SCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKI 137 LV++G G G + V+ P + L I V+ + R A +I Sbjct: 58 DAFDLVDVGSGDGKLPAL---VLRGAPPRLAARLRITAVDLAPR--------PAGLDPRI 106 Query: 138 NWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIK 197 W L G ++ANE+ D++P+ T G R ++D + + + E Sbjct: 107 TWAAELPGRITG--LVIANEWLDNVPVDVVEQTASGPRLVLVDPETGEERLGPPPSPEDL 164 Query: 198 SNFLTCSDYFL-GAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYL--QSRVGDTLQA 254 + G E RD S+ RL+ GTA+ IDY + TL Sbjct: 165 AWLARWWPLEEPGHRAEVGRPRDEAWASVIVRLSR--GTAVAIDYAHRAGDRPPFGTLAG 222 Query: 255 VKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGI-WQR-AF 312 +G V P+ + D+++HV ++ LTTQ + L LG+ R Sbjct: 223 HRGGAPVPPVPDGS-CDITAHVALDACAAAGERAGATATRLTTQREALRALGLRGARPPL 281 Query: 313 SLMKQTARKDILLDSVKRLVS--TSADKKSMGELFKILVVSHEKVELM 358 L ++ R + ++ R D +G F L S Sbjct: 282 ELARRDPR--GYVRALGRATEEAELLDPGGLGG-FTWLTQSRGLRPAP 326 >gi|302509860|ref|XP_003016890.1| hypothetical protein ARB_05183 [Arthroderma benhamiae CBS 112371] gi|291180460|gb|EFE36245.1| hypothetical protein ARB_05183 [Arthroderma benhamiae CBS 112371] Length = 293 Score = 102 bits (254), Expect = 9e-20, Method: Composition-based stats. Identities = 49/173 (28%), Positives = 77/173 (44%), Gaps = 23/173 (13%) Query: 209 GAIFENSPCRDREMQSISDRL-------------ACDGGTAIVIDYGYLQSRVGDTLQAV 255 G+ E SP Q I+ + G A+++DYG + ++L+ + Sbjct: 117 GSTIEISPESHTYAQEIARLIGGPNPTDKNPSPTRTPAGAALILDYGPSSTIPVNSLRGI 176 Query: 256 KGHTYVSPLVNPGQADLSSHVDFQRLSSIAILY--KLYINGLTTQGKFLEGLGIWQRAFS 313 K H VSP PG+ DLS+ VDF L+ A+ + + G QG FL LGI +RA Sbjct: 177 KNHQVVSPFATPGEVDLSADVDFTGLAESALNASPGVEVYGPNEQGSFLRSLGIAERAAQ 236 Query: 314 LM---KQTARKDILLDSVKRLVSTSADKKSMGELFKILVVSHE---KVELMPF 360 L+ K ++ + S +RLV MG ++K + + E K + F Sbjct: 237 LLRNVKDEEKRKQIESSWQRLVERGG--GGMGRIYKAMAIVPESGGKRRPVGF 287 >gi|317122768|ref|YP_004102771.1| hypothetical protein Tmar_1961 [Thermaerobacter marianensis DSM 12885] gi|315592748|gb|ADU52044.1| protein of unknown function DUF185 [Thermaerobacter marianensis DSM 12885] Length = 497 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 25/79 (31%), Positives = 41/79 (51%), Gaps = 1/79 (1%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNP-FGAVGDFVTAPEISQIFGEM 62 +L+R I + I+++G + +Y L + PE+GYY+ P G GDF+T+P FG Sbjct: 6 ELVRLIHDEIRRHGAIPFARYMDLALHHPEYGYYAQERPLIGREGDFLTSPSFHPAFGRT 65 Query: 63 LAIFLICAWEQHGFPSCVR 81 + + E GF +R Sbjct: 66 VWRQVREMLELLGFRPRLR 84 Score = 84.8 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 60/305 (19%), Positives = 105/305 (34%), Gaps = 37/305 (12%) Query: 82 LVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYT 141 ++E+G G G + D+L L +V+ S RL Q++ + + + Sbjct: 162 ILEIGAGGGHLARDLLLAARADGYGP-GSLEYVIVDESRRLQERQRELITAAWPQAPVRW 220 Query: 142 SLADVPLGFT-FLVANEFFDSLPIKQFVMTEH---------------------GIRERMI 179 G ++ NE + P+ + V RE + Sbjct: 221 VPRVEQAGPVHVVLMNELMSAFPVHRLVWKPAAATGEAGPGRLGRSGNRRPLGEWRELYV 280 Query: 180 DIDQHDSLVFNIGDHEIKSNFLTCSD----YFLGAIFENSPCRDREMQSISDRLACDGGT 235 + V G D G I + + +++I+ LA Sbjct: 281 TVQ-EGRFVQVEGPVSEPRALEILRDEGIEPRPGQIVDVNVGAGDMLRAIAATLAR-RAF 338 Query: 236 AIVIDYG------YLQSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILY 288 I +DYG Y R T++ + P PG+ D+++ +DF L + Sbjct: 339 VITVDYGGPAEMVYSPQRPRGTVRGYYRQQVLDDPFARPGEQDITADLDFTYLQRLGRRL 398 Query: 289 KLYINGLTTQGKFLEGLGIWQ-RAFSLMKQTARKDILLDSVKRLVSTSADKKSMGELFKI 347 L GL Q FL LGI + A L ++ + D+ D + V + +GE F + Sbjct: 399 GLRDLGLLPQEAFLLNLGIEEAEALPLARRAWQGDLEADQELQRVYALYAPEGLGESFWV 458 Query: 348 LVVSH 352 LV + Sbjct: 459 LVQAK 463 >gi|307069512|ref|YP_003877989.1| hypothetical protein ZICARI_014 [Candidatus Zinderia insecticola CARI] gi|306482772|gb|ADM89643.1| conserved hypothetical protein [Candidatus Zinderia insecticola CARI] Length = 344 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 61/334 (18%), Positives = 128/334 (38%), Gaps = 27/334 (8%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCV 80 ++ + + + GYY+ + D++T+ +IS IF I L + Sbjct: 33 FSEFINIILYKKKIGYYNNFLFYNKK-DYLTSSQISNIF----TISLANFFFFFLKKEFC 87 Query: 81 RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWY 140 +++E G G G +IL+ + K Y++E S L I+KK+ + Sbjct: 88 KILEFGGGNGKFAFNILKELKKFNLFPK----YYIIEKSRYLKNIKKKKYKNIKWIKKIP 143 Query: 141 TSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKS-N 199 + + + NE DS+P + + I I+ + VF K + Sbjct: 144 KNFS------GIIFLNEVLDSIPFDLIIKNNNNYYNSNIIINNELNFVFKNKKINKKYLS 197 Query: 200 FLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHT 259 F + Y+ F N+ + + + G + + K + Sbjct: 198 FFIKNRYYNLYKFINNIFLNIKNNKNIIIIIDYGNLN-------NFYIKNNFMCFYKHFS 250 Query: 260 YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTA 319 + +P PG D+++ V+F+ + + + YK I + Q F+ +GI Sbjct: 251 HGNPFSFPGLQDITNFVNFKFILKLIVKYKFKILNFSNQENFILNIGILDFLKKKKYNKI 310 Query: 320 RKDILLDSVKRLVSTSADKKSMGELFKILVVSHE 353 + ++ +K+++ MGE+FK+L++ ++ Sbjct: 311 KYFNKINFLKKIIL----PYEMGEIFKVLILGNK 340 >gi|257460542|ref|ZP_05625643.1| dihydrodipicolinate reductase [Campylobacter gracilis RM3268] gi|257441873|gb|EEV17015.1| dihydrodipicolinate reductase [Campylobacter gracilis RM3268] Length = 365 Score = 101 bits (252), Expect = 1e-19, Method: Composition-based stats. Identities = 63/380 (16%), Positives = 113/380 (29%), Gaps = 67/380 (17%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHG----- 75 +F + + YY+ G GDF TA + FG +A ++ G Sbjct: 3 FSDFFEGWLNER---YYANAAKIGKSGDFYTAVSVGSFFGICIAHEILRLSADFGATQAL 59 Query: 76 ---------------------------------FPSCVRLVELGPGRGIMMLDILRVICK 102 + + +VE+G G ++ DI + I Sbjct: 60 SLDAATSPIASRQNFAANPGEPNLDIVLAEKSQNNAKIAIVEIGSHDGRLLCDIAQAIFT 119 Query: 103 LKP-DFFSVLSIYMVETSERLTLIQKKQLASYG---DKINWYTSLADVPLGFTFLVANEF 158 L + S ++E ERL +Q+ + + S + VANE Sbjct: 120 LGGVAALNRFSFAIIEPHERLRELQRASFVECFGGEIALKHFASAREAKFKDVIFVANEL 179 Query: 159 FDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCR 218 FD+ + + + F G I Sbjct: 180 FDAFKCEAVDGENML-------FIKSGAAKFAPIKEGEILTIARRFGISRGEIPVGYFRF 232 Query: 219 DREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYV------SPLVNPGQADL 272 RE+ + + R I DYG + + +L+ + H + G++DL Sbjct: 233 AREICASAQR-----FYFIAFDYGQMGASGDFSLRIYRNHEVFSFFEVQNLSDFYGKSDL 287 Query: 273 SSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLV 332 + V+F+ L + Q L G Q M+++ K L+ Sbjct: 288 TYDVNFEILRAAFEDAGAAATDFKRQIAALMDFGAVQLLELFMQKSGEKGY----HNALL 343 Query: 333 STSADKKSMGELFKILVVSH 352 + + GE FK++ Sbjct: 344 QFNHLRAEFGEKFKMIKFKK 363 >gi|294630640|ref|ZP_06709200.1| conserved hypothetical protein [Streptomyces sp. e14] gi|292833973|gb|EFF92322.1| conserved hypothetical protein [Streptomyces sp. e14] Length = 414 Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats. Identities = 64/331 (19%), Positives = 112/331 (33%), Gaps = 28/331 (8%) Query: 26 ALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVEL 85 + P G+Y P G G F T+ S +F +A L E G P+ + V++ Sbjct: 55 RDALYGP-AGFY--RRPEGPAGHFRTSVHASPLFATAVARLLCRVDEALGRPARLDFVDM 111 Query: 86 GPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLAD 145 GRG + V+ L + + ++ VE + D+I W D Sbjct: 112 AAGRGKLAAG---VLGALPAEVAARARVHAVEI--------AGRPDGLDDRIAWLPEPPD 160 Query: 146 VPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSD 205 G L ANE+ D++P++ + G+ R++ + Sbjct: 161 GLTG--LLFANEWLDNVPVEVAEVDPEGVPRRVLVRRDGAERLGEPVGGAEAEWLARWWP 218 Query: 206 YF--LGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYL--QSRVGDTLQAVKGHTYV 261 G E RD RL G A+ +DY + TL + Sbjct: 219 LPGEPGLRAEIGLPRDTAWAGAVSRLRR--GLAVAVDYAHTAADRPPFGTLTGFRDGRET 276 Query: 262 SPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARK 321 +P+ + DL++HV ++ + TQ L LGI L ++ Sbjct: 277 APVPDGS-CDLTAHVALDACAASGPAP--TRARVLTQRAALHALGITGVRPPLTLASSDP 333 Query: 322 DILLDSVKRL--VSTSADKKSMGELFKILVV 350 + ++ R + +G+ F L+ Sbjct: 334 ARYVRALARAGEAAELTAPGGLGD-FGWLLQ 363 >gi|238061442|ref|ZP_04606151.1| hypothetical protein MCAG_02408 [Micromonospora sp. ATCC 39149] gi|237883253|gb|EEP72081.1| hypothetical protein MCAG_02408 [Micromonospora sp. ATCC 39149] Length = 348 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 59/293 (20%), Positives = 106/293 (36%), Gaps = 19/293 (6%) Query: 25 FALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVE 84 + + P+ G++ + G G F T+ S F + + GFP+ +V+ Sbjct: 7 MSRALYGPD-GFFVSGA--GPAGHFRTSVHASPAFAAAVFRLVSRLDAALGFPAPFDMVD 63 Query: 85 LGPGRGIMMLDILRV---------ICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGD 135 +G GRG ++ + + + P S + + + + + Sbjct: 64 VGAGRGELLRALADLAREGAAGTILSSAFPAGTSPAPVPLAHRLRLTAVELAPRPPALPS 123 Query: 136 KINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHE 195 +I W + + G L+A E+ D++P+ V TE G R ++D E Sbjct: 124 EIRWTDEIPEAITG--LLIATEWLDNVPLDVAVPTEDGWRYLLVDPATGRETPGPPLTPE 181 Query: 196 IKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY--LQSRVGDTLQ 253 K+ T E RD L G A+ +DYG+ V TL Sbjct: 182 DKTWLTTWHPCPGAGRVEIGRDRDLAWAGAVRCLRR--GMAVAVDYGHLRDSRPVHGTLT 239 Query: 254 AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 +G V P+ + D+++HV ++S L +Q + L LG Sbjct: 240 GYRGGRQVPPVPDGS-CDVTAHVAMDSVASAGERVARCAYTLVSQREALRALG 291 >gi|169598546|ref|XP_001792696.1| hypothetical protein SNOG_02078 [Phaeosphaeria nodorum SN15] gi|160704417|gb|EAT90290.2| hypothetical protein SNOG_02078 [Phaeosphaeria nodorum SN15] Length = 323 Score = 98.6 bits (244), Expect = 1e-18, Method: Composition-based stats. Identities = 38/122 (31%), Positives = 61/122 (50%), Gaps = 7/122 (5%) Query: 236 AIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILY--KLYIN 293 A+++DYG + +T + ++GH VSP +PG DLS+ VDF L+ A+ + ++ Sbjct: 185 ALILDYGPANTIPANTFRGIRGHQTVSPFTSPGLVDLSADVDFLALAESALDASPGVEVH 244 Query: 294 GLTTQGKFLEGLGIWQRAFSL---MKQTARKDILLDSVKRLVSTSADKKSMGELFKILVV 350 G Q FL +GI +RA L K A + L KRL+ MG+ +K + + Sbjct: 245 GPVEQSFFLSTMGIKERAERLLKGAKDEATRQRLETGWKRLIDRG--PNGMGKTYKAMAL 302 Query: 351 SH 352 Sbjct: 303 LP 304 >gi|149602001|ref|XP_001518481.1| PREDICTED: hypothetical protein, partial [Ornithorhynchus anatinus] Length = 84 Score = 98.3 bits (243), Expect = 1e-18, Method: Composition-based stats. Identities = 25/58 (43%), Positives = 36/58 (62%) Query: 4 KLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGE 61 ++R ++ +K G +TV +Y + +P GYY + G GDFVT+PEISQIFGE Sbjct: 27 SMLRHLLAKVKATGPITVAEYMREALTNPAKGYYVHHDVLGEKGDFVTSPEISQIFGE 84 >gi|332529562|ref|ZP_08405518.1| hypothetical protein HGR_06586 [Hylemonella gracilis ATCC 19624] gi|332040912|gb|EGI77282.1| hypothetical protein HGR_06586 [Hylemonella gracilis ATCC 19624] Length = 191 Score = 97.9 bits (242), Expect = 2e-18, Method: Composition-based stats. Identities = 38/159 (23%), Positives = 64/159 (40%), Gaps = 18/159 (11%) Query: 207 FLGAIFENSPCRDREMQSISDRLAC-----DGGTAIVIDY------GYLQSRVGDTLQAV 255 + E + M++++ +L G A IDY Y R T+ Sbjct: 10 EHDYLTETHAQAEAFMRTVAGKLLRGAREGRLGAAFFIDYGFPEVEYYHPQRHMGTVMCH 69 Query: 256 KGHT-YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSL 314 + H PL + G D+++HV+F ++ L + G T+QG+FL G L Sbjct: 70 RAHQSDTDPLSDVGLKDITAHVNFTGIALAGQEAGLEVLGYTSQGRFLFNCG----LARL 125 Query: 315 MKQTARKDILLDSVKRLV--STSADKKSMGELFKILVVS 351 M +++ R V S + MGELFK++ + Sbjct: 126 MAPDETAADTPEALARRVQASKLVMEHEMGELFKVIGFA 164 >gi|29831222|ref|NP_825856.1| hypothetical protein SAV_4679 [Streptomyces avermitilis MA-4680] gi|29608337|dbj|BAC72391.1| hypothetical protein [Streptomyces avermitilis MA-4680] Length = 352 Score = 97.5 bits (241), Expect = 3e-18, Method: Composition-based stats. Identities = 57/283 (20%), Positives = 92/283 (32%), Gaps = 26/283 (9%) Query: 26 ALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVEL 85 + P G+Y P G G F T+ S +F +A L E G PS + V++ Sbjct: 24 QEALYGP-AGFY--RRPEGPAGHFRTSVHASPLFAGAVARLLRDLDEALGRPSELAFVDM 80 Query: 86 GPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLAD 145 GRG + + V+ L PD S Y VE + +I W + Sbjct: 81 AAGRGEL---VTGVLAALPPDIASRARGYAVEL--------AGRPEDLDHRIEWLAEPPE 129 Query: 146 VPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSD 205 G L ANE+ D++P++ + G+ ++ D Sbjct: 130 GITG--LLFANEWLDNVPVEVAEVDSAGVARLVLVRDDGTERHGEPVAGAESQWLTRWWP 187 Query: 206 YFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY--LQSRVGDTLQAVKGHTYVSP 263 R++ S G A+ +DY + TL + P Sbjct: 188 PAPEEGLRAEIGLPRDIAWASAVGTVHRGLAVAVDYAHFVDTRPPFGTLTGFREGRETLP 247 Query: 264 LVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 + + DL++HV + L +Q L LG Sbjct: 248 VPDGS-CDLTAHVALDACALPGAR-------LLSQRDALRALG 282 >gi|307327038|ref|ZP_07606228.1| protein of unknown function DUF185 [Streptomyces violaceusniger Tu 4113] gi|306887336|gb|EFN18332.1| protein of unknown function DUF185 [Streptomyces violaceusniger Tu 4113] Length = 340 Score = 97.5 bits (241), Expect = 3e-18, Method: Composition-based stats. Identities = 71/327 (21%), Positives = 118/327 (36%), Gaps = 30/327 (9%) Query: 26 ALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVEL 85 + G++ P G G F T+ S +F +A L E G P+ + LV+L Sbjct: 16 EQALYGAG-GFF--RRPEGPAGHFRTSVHASPLFARAVAELLRRVDEALGHPAELALVDL 72 Query: 86 GPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLAD 145 G GRG ++ +L + L D L Y VE ++ A ++I W + + Sbjct: 73 GAGRGELLTRVLALAPGLPDDLDRRLRPYAVER--------AERPAGLAERIAWLDAPPE 124 Query: 146 VPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDH------EIKSN 199 G L ANE+ D++P+ E G+ R+ +GD + Sbjct: 125 GCTG--LLFANEWLDNVPVDVAETDEDGVPRRVEVDLAARDGTERLGDPVDGEDAAWLAR 182 Query: 200 FLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY--LQSRVGDTLQAVKG 257 + +D G E R+ L G A+ +DYG+ TL + Sbjct: 183 WWPLADAEPGLRAEIGHPREAAWAGAVRSLRA--GLAVAVDYGHERAARPPFGTLTGFRE 240 Query: 258 HTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQ 317 V P+ + + DL++HV GL TQ + L LG+ R L Sbjct: 241 GREVRPVPDGSR-DLTAHVA----MDACAAAAGPGAGLMTQREALRALGLDARRPPLALA 295 Query: 318 TARKDILLDSVKRL--VSTSADKKSMG 342 + + ++ + D +G Sbjct: 296 STDPAGYVRALGTAGETAELTDPAGLG 322 >gi|302553184|ref|ZP_07305526.1| conserved hypothetical protein [Streptomyces viridochromogenes DSM 40736] gi|302470802|gb|EFL33895.1| conserved hypothetical protein [Streptomyces viridochromogenes DSM 40736] Length = 336 Score = 95.9 bits (237), Expect = 8e-18, Method: Composition-based stats. Identities = 56/285 (19%), Positives = 95/285 (33%), Gaps = 30/285 (10%) Query: 26 ALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVEL 85 + P G+Y G G F T+ S +F +A L E G P + V++ Sbjct: 23 REALYGPG-GFYRRAE--GPAGHFRTSVHASSLFAGAVARLLCRVDEALGRPGALDFVDM 79 Query: 86 GPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLAD 145 GRG + +L + D + Y VE + +I W + Sbjct: 80 AAGRGELAAGVLDAL---PADVAARTRAYAVEV--------AARPEGLDHRIEWRDTPPK 128 Query: 146 VPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSD 205 G L ANE+ D++P+ + G+ ++ + + Sbjct: 129 GSNG--LLFANEWLDNVPVDVVEVDAAGVARLVLVREDGTERLGEPVGAADARWLERWWP 186 Query: 206 YF--LGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYL--QSRVGDTLQAVKGHTYV 261 G E RD S + L G A+ +DY + TL + Sbjct: 187 LPGEAGLRAEIGLPRDEAWASAAGTLER--GLAVAVDYAHTADARPPFGTLTGFRDGRET 244 Query: 262 SPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 + + + D+++HV L + A+ L L Q + L LG Sbjct: 245 TAVPDGS-CDITAHV---ALDACALPGGL----LLPQREALRALG 281 >gi|302870303|ref|YP_003838940.1| hypothetical protein Micau_5860 [Micromonospora aurantiaca ATCC 27029] gi|302573162|gb|ADL49364.1| protein of unknown function DUF185 [Micromonospora aurantiaca ATCC 27029] Length = 366 Score = 94.8 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 53/323 (16%), Positives = 112/323 (34%), Gaps = 51/323 (15%) Query: 18 QMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFP 77 + + P+ G++ + G F T+ S +F L L G P Sbjct: 4 PLRWRDAMERALYGPD-GFFVSGA--GPAAHFRTSVHASPVFTACLLRQLETVDAALGHP 60 Query: 78 SCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKI 137 + + +V++G GRG ++ ++ + + P+ + L VE + +I Sbjct: 61 ARLDVVDVGAGRGELLREL---LAQAAPELAARLHPVAVER--------ATRPPDLPSQI 109 Query: 138 NWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIG---DH 194 +W + + D G L+A E+ D++P+ V T G R +++ + + D Sbjct: 110 DWRSDIPDDITG--LLLATEWLDNVPVDVAVHTPEGWRLLLVNPKTGEETIGPPPSPTDT 167 Query: 195 EIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACD---------------------- 232 +++ + + + + Q + + Sbjct: 168 TWLTHWWPSAAPPNAPVIKEFVASEGPDQDTNSLINGAGEGEGGGGWRGEIGWSRDLAWA 227 Query: 233 -------GGTAIVIDYGY--LQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSS 283 G A+ +DYG+ TL +G V P+ + D+++HV ++S Sbjct: 228 EAVGKVARGLALAVDYGHLRDSRPADGTLTGYRGGRQVPPVPDGS-CDVTAHVAMDSVAS 286 Query: 284 IAILYKLYINGLTTQGKFLEGLG 306 + +Q + L LG Sbjct: 287 AGERVARCAYSMVSQREALRALG 309 >gi|300175564|emb|CBK20875.2| unnamed protein product [Blastocystis hominis] Length = 264 Score = 94.8 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 48/158 (30%), Positives = 78/158 (49%), Gaps = 11/158 (6%) Query: 205 DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPL 264 D G E P +Q I L G+A+ IDYG + ++++ +K H +VS L Sbjct: 84 DAKEGDRIEVCPEACMLVQEIVMWLEETKGSALFIDYGNDYA-SQNSIRGIKNHQFVSYL 142 Query: 265 VNPGQADLSSHVDFQRLSSIAILYK---LYINGLTTQGKFLEGLGIWQRAFSLM---KQT 318 PG+ D+++ VDFQ L + K + +G QG FL GLGI +R + + Sbjct: 143 QEPGEVDITADVDFQALRKVVENSKSASVKYHGPIPQGYFLCGLGIEERVRQMAETVEDD 202 Query: 319 ARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEKVE 356 A+ D ++ S +RLV +K MGE++K+ ++ Sbjct: 203 AKVDDIIKSAERLV----NKDQMGEVYKVCCLTTGGQP 236 >gi|302544368|ref|ZP_07296710.1| conserved hypothetical protein [Streptomyces hygroscopicus ATCC 53653] gi|302461986|gb|EFL25079.1| conserved hypothetical protein [Streptomyces himastatinicus ATCC 53653] Length = 336 Score = 94.4 bits (233), Expect = 3e-17, Method: Composition-based stats. Identities = 65/333 (19%), Positives = 108/333 (32%), Gaps = 33/333 (9%) Query: 26 ALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVEL 85 + G++ G G F T+ S ++ + L E G P + LV+L Sbjct: 16 ERALYG-TDGFFRRAE--GPAGHFRTSVHASPLYARAVTELLRRVDEALGRPPELALVDL 72 Query: 86 GPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLAD 145 G GRG ++ +L + L Y VE + ++ W ++ + Sbjct: 73 GAGRGELLTGVLAAAAGQPHGLAARLRPYAVER--------AARPDGADARVEWRETVPE 124 Query: 146 VPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNI----GDHEIKSNFL 201 G L ANE+ D++P+ G+ R+ + D E + + Sbjct: 125 GFTG--LLFANEWLDNVPVDIAETDADGVPRRVEVSAADGAERLGAEVTGEDAEWLARWW 182 Query: 202 TCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY--LQSRVGDTLQAVKGHT 259 + G E RD G A+ +DYG+ TL + Sbjct: 183 PLAGAEPGLRAEIGRPRDAAWTEAVR--CVGAGLAVAVDYGHERAARPPFGTLTGFRDGR 240 Query: 260 YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIW-QR--AFSLMK 316 V P+ + G DL++HV ++ L Q L LG+ R K Sbjct: 241 EVRPVPD-GTCDLTAHVALDACAAGTDA------RLVPQRDALRHLGLDGGRPPLALASK 293 Query: 317 QTARKDILLDSVKRLVSTSADKKSMGELFKILV 349 A L + + D +G F L Sbjct: 294 DPAAYVRALSAAGEA-AELLDPAGLGG-FGWLA 324 >gi|289770895|ref|ZP_06530273.1| conserved hypothetical protein [Streptomyces lividans TK24] gi|289701094|gb|EFD68523.1| conserved hypothetical protein [Streptomyces lividans TK24] Length = 362 Score = 93.3 bits (230), Expect = 5e-17, Method: Composition-based stats. Identities = 59/288 (20%), Positives = 96/288 (33%), Gaps = 23/288 (7%) Query: 26 ALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVEL 85 + PE G+Y P G G F T+ S +F +A L E G P+ + V++ Sbjct: 26 QEALYGPE-GFYRAG-PEGPAGHFRTSVHASPLFAGAVARLLCRVDEALGRPAVLDFVDM 83 Query: 86 GPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLAD 145 GRG ++ V+ L D + Y VE + ++ W D Sbjct: 84 AAGRGELVAG---VLAALPADVAARTRAYAVEV--------AARPEGLDRRVQWLARPPD 132 Query: 146 VPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSD 205 G L ANE+ D++P+ + G+ R++ + Sbjct: 133 GVTG--LLFANEWLDNVPVDVAEVDAEGVARRVLVRGDGAERLGEPVAGAEAEWLARWWP 190 Query: 206 YFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS--RVGDTLQAVKGHTYVSP 263 R+ S A D G A+ DY + S TL + P Sbjct: 191 MPAEEGRRAEIGLPRDEAWASAVAALDAGLAVAADYAHSVSARPPFGTLTGFREGRETEP 250 Query: 264 LVNPGQADLSSHVDFQRLS----SIAILYKLYINGLT-TQGKFLEGLG 306 + + D+++HV + + N LT Q + L LG Sbjct: 251 VPDGS-CDITAHVALDACAAAHTARCTPEYAPPNALTRPQREILRALG 297 >gi|21221819|ref|NP_627598.1| hypothetical protein SCO3391 [Streptomyces coelicolor A3(2)] gi|4585837|emb|CAB40931.1| hypothetical protein [Streptomyces coelicolor A3(2)] Length = 362 Score = 93.3 bits (230), Expect = 5e-17, Method: Composition-based stats. Identities = 59/288 (20%), Positives = 96/288 (33%), Gaps = 23/288 (7%) Query: 26 ALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVEL 85 + PE G+Y P G G F T+ S +F +A L E G P+ + V++ Sbjct: 26 QEALYGPE-GFYRAG-PEGPAGHFRTSVHASPLFAGAVARLLCRVDEALGRPAVLDFVDM 83 Query: 86 GPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLAD 145 GRG ++ V+ L D + Y VE + ++ W D Sbjct: 84 AAGRGELVAG---VLAALPADVVARTRAYAVEV--------AARPEGLDRRVQWLARPPD 132 Query: 146 VPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSD 205 G L ANE+ D++P+ + G+ R++ + Sbjct: 133 GVTG--LLFANEWLDNVPVDVAEVDAEGVARRVLVRGDGAERLGEPVAGAEAEWLARWWP 190 Query: 206 YFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS--RVGDTLQAVKGHTYVSP 263 R+ S A D G A+ DY + S TL + P Sbjct: 191 MPAEEGRRAEIGLPRDEAWASAVAALDAGLAVAADYAHSVSARPPFGTLTGFREGRETEP 250 Query: 264 LVNPGQADLSSHVDFQRLS----SIAILYKLYINGLT-TQGKFLEGLG 306 + + D+++HV + + N LT Q + L LG Sbjct: 251 VPDGS-CDITAHVALDACAAAHTARCTPEYAPPNALTRPQREILRALG 297 >gi|167950157|ref|ZP_02537231.1| hypothetical protein Epers_28339 [Endoriftia persephone 'Hot96_1+Hot96_2'] Length = 364 Score = 92.1 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 53/281 (18%), Positives = 90/281 (32%), Gaps = 29/281 (10%) Query: 35 GYY-STCNPFGAVGDFVT-----APEISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPG 88 GYY + FG GDFVT AP + Q + SC R + Sbjct: 2 GYYVAGSRKFGEAGDFVTRTGGLAPCLPS---------ASHTRRQSCWVSCHRAIFWSLV 52 Query: 89 RGIMMLDILRVICKLKPDFFSVLSIYMV-ETSERLTLIQKKQLASYGDKINWYTSLADVP 147 + ++ + + + + Y++ E S L Q++ L ++ S Sbjct: 53 QAPAFWACGFLLAETWNGWRQLPNRYLILELSPELQQRQRQILRERVPQLLERVSWLSQM 112 Query: 148 LGFT--FLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDH------EIKSN 199 F++ANE D++P +F +E GI E + + ++K Sbjct: 113 PSRFEGFVLANELLDAMPASRFRHSEAGIEEGFVVWQEEGFRDHFARPETVGFSEQVKRR 172 Query: 200 FLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYG----YLQSRVGDTLQAV 255 G + E + ++ I Y Y R TL Sbjct: 173 LAGLPVGSGGYVSELNLRLGPWFAALGGAFERGAVLLIDYGYPRSEYYHPQRSEGTLMCH 232 Query: 256 KGHTYV-SPLVNPGQADLSSHVDFQRLSSIAILYKLYINGL 295 H P PG D+++ VDF ++ A + G Sbjct: 233 YRHRAHADPYRWPGLQDITAQVDFTAVAEAAEQAGFALAGY 273 >gi|315503420|ref|YP_004082307.1| hypothetical protein ML5_2634 [Micromonospora sp. L5] gi|315410039|gb|ADU08156.1| protein of unknown function DUF185 [Micromonospora sp. L5] Length = 366 Score = 92.1 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 56/323 (17%), Positives = 109/323 (33%), Gaps = 51/323 (15%) Query: 18 QMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFP 77 + + P+ G++ + G F T+ S +F L L G P Sbjct: 4 PLRWRDAMERALYGPD-GFFVSGA--GPAAHFRTSVHASPVFTACLLRQLETVDAALGHP 60 Query: 78 SCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKI 137 + + +V++G GRG ++ +L + P+ + L VE + +I Sbjct: 61 ARLDVVDVGAGRGELLRALLA---QAAPELAARLHPVAVER--------ATRPPDLPPEI 109 Query: 138 NWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIG---DH 194 W + D G L+A E+ D++P+ V T G R +++ + V D Sbjct: 110 AWRPDIPDGITG--LLLATEWLDNVPVDVAVHTPEGWRLLLVNPKTGEETVGPPPSPTDT 167 Query: 195 EIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACD---------------------- 232 +++ + + + + Q + + Sbjct: 168 TWLTHWWPSAAPPNAPVIKEFVASEGPDQDTNSLINGAGEGEGGGGWRAEIGWSRDLAWA 227 Query: 233 -------GGTAIVIDYGY--LQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSS 283 G A+ +DYG+ TL +G V P+ + D+++HV ++S Sbjct: 228 GAVGKVARGLALAVDYGHLRDSRPADGTLTGYRGGRQVPPVPDGS-CDVTAHVAMDSVAS 286 Query: 284 IAILYKLYINGLTTQGKFLEGLG 306 L +Q + L LG Sbjct: 287 AGERVARCAYSLVSQREALRALG 309 >gi|239980310|ref|ZP_04702834.1| hypothetical protein SalbJ_12773 [Streptomyces albus J1074] gi|291452175|ref|ZP_06591565.1| conserved hypothetical protein [Streptomyces albus J1074] gi|291355124|gb|EFE82026.1| conserved hypothetical protein [Streptomyces albus J1074] Length = 349 Score = 89.4 bits (220), Expect = 7e-16, Method: Composition-based stats. Identities = 59/291 (20%), Positives = 105/291 (36%), Gaps = 29/291 (9%) Query: 26 ALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVEL 85 + + G+Y+ G G F T+ S +F +A L G P + V++ Sbjct: 15 EEALYGAD-GFYTGEA--GPAGHFRTSVHASALFAGAVARLLGAVDRALGGPPELAFVDM 71 Query: 86 GPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLAD 145 G GRG + V+ L P + ++Y VE + + +I W + + Sbjct: 72 GAGRGELSAG---VLGALPPALAARTTVYAVERAVG-------RPPGLDPRIAWRATPPE 121 Query: 146 VPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFN----IGDHEIKSNFL 201 G L ANE+ D++P + + G R++++ + D + Sbjct: 122 GVTG--LLFANEWLDNVPAEVAEVAPDGTV-RLVEVAPDGTERLGAGVSAEDSAWLERWW 178 Query: 202 TCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY--LQSRVGDTLQAVKGHT 259 + GA E RD + + G A+ +DYG+ TL + Sbjct: 179 PLAGTAPGARAELGGTRDAAWAKAAGTVRR--GLAVAVDYGHERAARPPFGTLTGFREGR 236 Query: 260 YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGL----TTQGKFLEGLG 306 P+ + DL++HV ++ +G TQ + L GLG Sbjct: 237 ETRPVPDGS-CDLTAHVALDACAAAFAATPAGASGPGPALLTQREALHGLG 286 >gi|297192922|ref|ZP_06910320.1| conserved hypothetical protein [Streptomyces pristinaespiralis ATCC 25486] gi|297151560|gb|EDY66580.2| conserved hypothetical protein [Streptomyces pristinaespiralis ATCC 25486] Length = 330 Score = 89.0 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 57/284 (20%), Positives = 101/284 (35%), Gaps = 31/284 (10%) Query: 26 ALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVEL 85 + P G+Y P G G F T+ S++F +A L G L+++ Sbjct: 19 ETALYGPR-GFY--RRPEGPAGHFRTSVHASRLFAAAVARLLTSTAGSLGLEEP-ALIDV 74 Query: 86 GPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLAD 145 G GRG ++ +L + P + Y VE + A +I W + Sbjct: 75 GAGRGELLTGVLAALPPGLP-----VRAYAVER--------AARPAGLDPRIEWTAEIPR 121 Query: 146 VPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSD 205 G L ANE+ D++P+ + HG+ R++ + + + + Sbjct: 122 GLHG--LLFANEWLDNVPVDVAEVDAHGVVRRVLVREDGEERLGDEVAGADADWLARWWP 179 Query: 206 Y-FLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYL--QSRVGDTLQAVKGHTYVS 262 G E RD + + + G A+ +DY + TL + V Sbjct: 180 LGEPGTRAEIGRPRDEAWAAAAG--SLASGLAVAVDYAHTAGDRPPFGTLTGFRDGREVR 237 Query: 263 PLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 P+ + D+++HV ++ LT Q + L LG Sbjct: 238 PVPDGS-CDITAHVAMDACAAPGSAE------LTGQREALRRLG 274 >gi|256397514|ref|YP_003119078.1| hypothetical protein Caci_8414 [Catenulispora acidiphila DSM 44928] gi|256363740|gb|ACU77237.1| protein of unknown function DUF185 [Catenulispora acidiphila DSM 44928] Length = 345 Score = 88.6 bits (218), Expect = 1e-15, Method: Composition-based stats. Identities = 59/307 (19%), Positives = 107/307 (34%), Gaps = 33/307 (10%) Query: 18 QMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFP 77 +T + PE G+Y G F T+ + +F E + L+ E+ G P Sbjct: 7 WLTWRTAMEQALYGPE-GFYRRPGA-GPAAHFRTSAH-NPVFAEAVGRLLLRVDERLGAP 63 Query: 78 SCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKI 137 + + V++ G G + ++ + P+ + L V+ R + + Sbjct: 64 ARLDFVDMAAGGGELTAGVVEWLTAAAPEVAARLRAVAVDLRPR--------PEGLPEAV 115 Query: 138 NWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQ-HDSLVFNIGDHEI 196 W S +G L+ANE+ D++ + G+ + D + D Sbjct: 116 EWTGSAPSGVVG--LLIANEWLDNVVCDVAEVGADGVARIVEVDPDSGDERAGELPDAAQ 173 Query: 197 KSNFLTCSDYFLG-----AIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQ--SRVG 249 ++ G A E RD +S+ DRL G A+ +DY + Sbjct: 174 QAWLERWWPLSEGGDSAGARAEIGLDRDAAWRSVVDRLER--GVAVAVDYSHTAAGRPRF 231 Query: 250 DTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGL---------TTQGK 300 TL + V + + D+++HV ++ L TTQ + Sbjct: 232 GTLTGYREGHQVPAVPDGS-CDITAHVALDACAAALDSAALDSGAPRDPMVETVLTTQRE 290 Query: 301 FLEGLGI 307 +L LGI Sbjct: 291 YLRELGI 297 >gi|271970329|ref|YP_003344525.1| hypothetical protein Sros_9162 [Streptosporangium roseum DSM 43021] gi|270513504|gb|ACZ91782.1| conserved hypothetical protein [Streptosporangium roseum DSM 43021] Length = 321 Score = 88.6 bits (218), Expect = 1e-15, Method: Composition-based stats. Identities = 62/290 (21%), Positives = 106/290 (36%), Gaps = 19/290 (6%) Query: 18 QMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFP 77 +T + + G+Y P G F T+ S +F E + L+ + G P Sbjct: 2 WLTWRDAMEHALYG-DNGFYLRERP---SGHFRTSVGASPVFAEAVLRELVAVDDALGAP 57 Query: 78 SCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKI 137 + LV++G G G++ V+ P S L + V+ + A ++I Sbjct: 58 PVIDLVDIGAGEGLLASG---VLAAAPPGLRSRLRVTGVDL--------AHRPARLPEEI 106 Query: 138 NWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIK 197 W S+ D G ++ANE+ D++P+ T G R ++D + + Sbjct: 107 GWTASVPDGIHG--LVIANEWLDNVPVDIAEQTADGPRLVLVDPSNGAERLGDRPQAADL 164 Query: 198 SNFLTCSDYFL-GAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVK 256 + G E RD S+ RLA AI + TL + Sbjct: 165 AWLARWWPMRAAGERAEIGRPRDEAWASVIVRLARGRAVAIDYAHPVDDRPPCGTLAGYR 224 Query: 257 GHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 V+P+ + D+++HV ++ LTTQ + L LG Sbjct: 225 DGAAVAPVPDGS-CDITAHVALDACAAAGERAGAATTALTTQRQALRALG 273 >gi|67522110|ref|XP_659116.1| hypothetical protein AN1512.2 [Aspergillus nidulans FGSC A4] gi|40744669|gb|EAA63825.1| hypothetical protein AN1512.2 [Aspergillus nidulans FGSC A4] Length = 883 Score = 87.9 bits (216), Expect = 2e-15, Method: Composition-based stats. Identities = 47/298 (15%), Positives = 82/298 (27%), Gaps = 60/298 (20%) Query: 22 DQYFALCVADPEFGYYSTCNPF---GAVGDF----------------------------- 49 ++ + +P +GY+S G DF Sbjct: 4 REFIDDSLYNPHYGYFSKHATIFSPGEPFDFNNIEDGPAFHRLLGERYTEFEDMLDEKQP 63 Query: 50 -------VTAPEI-SQIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVI 100 T E+ +GE +A +L+ ++ +P + E+G G G MM++IL I Sbjct: 64 DEARQLWHTPTELFRPYYGETIARYLVSNYKLTLYPYHDLIIYEMGAGNGTMMINILDFI 123 Query: 101 CKLKPDFFSVLSIYMVETSERLTLIQKKQL------ASYGDKI----NWYTSLADVPLGF 150 + + ++E S L +Q K L A + D + Sbjct: 124 RDTDYEVYQRTKFRIIEISPALAGLQMKNLTDSLYAAGHLDHVEIINKSIFEWDTYVHSP 183 Query: 151 TFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGA 210 F +A E FD+ + + F+ + FL Sbjct: 184 CFFLALEVFDNFAHDAIRYDTKTEMPQQGGVLIDGDGEFHEFWTP---KLDPLASRFL-- 238 Query: 211 IFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPG 268 + R + RLA + Y T + L N Sbjct: 239 RVRQAAARREFPSPLGPRLARQIRGTLPFQKPYTMPEYIPTRLM----QFFDILDNYF 292 >gi|294813231|ref|ZP_06771874.1| DUF185 domain-containing protein [Streptomyces clavuligerus ATCC 27064] gi|326441659|ref|ZP_08216393.1| hypothetical protein SclaA2_11372 [Streptomyces clavuligerus ATCC 27064] gi|294325830|gb|EFG07473.1| DUF185 domain-containing protein [Streptomyces clavuligerus ATCC 27064] Length = 370 Score = 87.5 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 58/290 (20%), Positives = 96/290 (33%), Gaps = 37/290 (12%) Query: 26 ALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFP------SC 79 + P G+Y P G G F T+ S +F +A L + Sbjct: 62 ESALYGPG-GFY--RRPEGPAGHFRTSVHASPLFAGAVARLLRETARRLADEGRLEPSGA 118 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINW 139 V V++ GRG + +L + P + Y VE ++ A +I W Sbjct: 119 VSFVDMAAGRGELATGVLAALPDGFP-----VRAYAVER--------AERPAGLDPRITW 165 Query: 140 YTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSN 199 + G L ANE+ D++P+ + E G R++ + + Sbjct: 166 TAAPPPGISG--LLFANEWLDNVPVDVAEVDEDGTVRRVLVDEDGTERLGPPVTGADARW 223 Query: 200 FLTCSDY-FLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYL--QSRVGDTLQAVK 256 GA E RD + + G A+ +DY + TL + Sbjct: 224 LDRWWPLARPGARAEIGRSRDAAWARAAGSVRA--GLAVAVDYAHRSGDRPPFGTLTGFR 281 Query: 257 GHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 V P+ + D+++HV + LTTQ + L LG Sbjct: 282 SGREVRPVPDGS-CDITAHVALDACALPGGT-------LTTQREALRALG 323 >gi|242799675|ref|XP_002483429.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500] gi|218716774|gb|EED16195.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500] Length = 545 Score = 87.1 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 36/219 (16%), Positives = 66/219 (30%), Gaps = 51/219 (23%) Query: 22 DQYFALCVADPEFGYYSTCNPF---GAVGDFV-------------------------TAP 53 ++ + +P +GY+S G DF T P Sbjct: 147 REFIEDSLYNPHYGYFSKHATIFHPGEPFDFSRIEDGPHFHRLLGERYTEFEDKLDETNP 206 Query: 54 EIS------------QIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVI 100 +I+ +GE +A +L+ + +P + E+G G G +ML+IL I Sbjct: 207 DIARQLWHTPTELFRPYYGEAIARYLVTNYRLSLYPYHDLIIYEMGAGNGTLMLNILDFI 266 Query: 101 CKLKPDFFSVLSIYMVETSERLTLIQKKQLA----------SYGDKINWYTSLADVPLGF 150 P+ ++ ++E S L +Q + L + Sbjct: 267 RDSDPEVYARTKFKIIEISSSLAKLQWENLHASLSAGGHTGHVQIINKSIFDWNEYIHSP 326 Query: 151 TFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVF 189 F +A E FD+ + + F Sbjct: 327 CFFLALEVFDNFSHDAIRYDYETEMPQQGGVVIDSDGEF 365 >gi|295665446|ref|XP_002793274.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb01] gi|226278188|gb|EEH33754.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb01] Length = 502 Score = 87.1 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 37/225 (16%), Positives = 66/225 (29%), Gaps = 51/225 (22%) Query: 22 DQYFALCVADPEFGYYSTCNPF---GAVGDF----------------------------- 49 + + +P +GY+S G DF Sbjct: 105 RDFIEDSLYNPNYGYFSKHATIFNPGEPFDFNSMADGPEFNRLLGQRYKEFEDKLDAVKY 164 Query: 50 -------VTAPEI-SQIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVI 100 T E+ +GE +A +L+ ++ FP + E+G G G +ML++L I Sbjct: 165 DESRQLWHTPTELFRPYYGEAIARYLVTNYKLTLFPYHDLIIYEMGAGNGTLMLNVLDYI 224 Query: 101 CKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGD----------KINWYTSLADVPLGF 150 + P+ + ++E S L +Q+K L + Sbjct: 225 RDVDPEVYQRTKFKIIEISPSLAKLQQKNLKNSIHSSGHRGHAEIINQSIFDWNTYVHSP 284 Query: 151 TFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHE 195 F +A E FD+ + R + F Sbjct: 285 CFFLALEVFDNFSHDAIRYDLETGQPRQGCVLIDADGEFYEYYIP 329 >gi|169602737|ref|XP_001794790.1| hypothetical protein SNOG_04372 [Phaeosphaeria nodorum SN15] gi|160706242|gb|EAT88132.2| hypothetical protein SNOG_04372 [Phaeosphaeria nodorum SN15] Length = 401 Score = 86.7 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 37/203 (18%), Positives = 70/203 (34%), Gaps = 51/203 (25%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGD------------FV------------------ 50 + + + +P +GY+S + GD F Sbjct: 3 LRDFIEDSLYNPNYGYFSKQVVIFSPGDPFDFNSMSTEDEFFQQLRQRYISFEDSLDYQE 62 Query: 51 ---------TAPEI-SQIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRV 99 T E+ S +GE +A +L+ ++ + +P + E+G G G MML+IL Sbjct: 63 PNELRQLWHTPTELFSPYYGEAIARYLVEDYKYNAYPYHDLNIYEMGAGNGTMMLNILDY 122 Query: 100 ICKLKPDFFSVLSIYMVETSERLTLIQKKQL------ASYGDKI----NWYTSLADVPLG 149 I + P+ + ++E S +L +Q++ L + DK+ Sbjct: 123 IRDVHPEVYERTKFKIIEISSQLAHLQQQGLGQSAYARGHSDKVEIVNRSIFDWDIYVSS 182 Query: 150 FTFLVANEFFDSLPIKQFVMTEH 172 + +A E FD+ Sbjct: 183 PCYFLALEVFDNFAHDALKYDFE 205 >gi|299745089|ref|XP_001831465.2| hypothetical protein CC1G_08994 [Coprinopsis cinerea okayama7#130] gi|298406428|gb|EAU90312.2| hypothetical protein CC1G_08994 [Coprinopsis cinerea okayama7#130] Length = 470 Score = 86.7 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 49/374 (13%), Positives = 106/374 (28%), Gaps = 76/374 (20%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAV----GDFVTAPEI--------------------- 55 V + + +P +GY+ A DF P + Sbjct: 24 VRDFIEDSLYNPNYGYFPKQATIFAHDSHSFDF---PSLRDSIEFQEEVAKKYAGYGSDS 80 Query: 56 ----------------SQIFGEMLAIFLICAWEQHGFPS-CVRLVELGPGRGIMMLDILR 98 +G+ +A ++ + FP + E+G G G + +DIL Sbjct: 81 YEGPGRQLWHTPTELFKPWYGQAIAQCVVSEYLLKYFPYEDFVIYEIGAGNGTLAMDILN 140 Query: 99 VICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKI----NWYTSLADVPLGFTFLV 154 + + P + ++E S L +QKK+L S + + + Sbjct: 141 YLQEAHPMVYERTRYNIIEISGSLVQLQKKKLRSAHPCVKITHQSVFHWNKREPSPCYFI 200 Query: 155 ANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFEN 214 A E D+ ++ + + F+I + ++ + Sbjct: 201 AMEVVDNFAHDILRYDLDTMQPLQGMVTISEQNDFDIVYTPVTDPLISSMLGIRNKLRHQ 260 Query: 215 SP--CRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADL 272 P R + A + Y+ +R+ +Q ++ H P +D Sbjct: 261 PPVNKLLRYSPFLRKMYRGLPFAANLSKEEYIPTRLLSLMQTLRNH---FPRHRLLLSDF 317 Query: 273 SSHVD-FQRLSSIAILYKLYINGL------TTQGKF----------LEGLGIWQRAFSLM 315 SS D +++ + + + QG F L + ++ Sbjct: 318 SSLPDSIPGINAPVVQTRFRNTTIPCSTLLVKQGYFDIFFPTDFERLRDM-----YEHIL 372 Query: 316 KQTARKDILLDSVK 329 + + + Sbjct: 373 SNPNHPTNISNEAR 386 >gi|322712023|gb|EFZ03596.1| hypothetical protein MAA_00670 [Metarhizium anisopliae ARSEF 23] Length = 509 Score = 86.7 bits (213), Expect = 6e-15, Method: Composition-based stats. Identities = 40/221 (18%), Positives = 69/221 (31%), Gaps = 52/221 (23%) Query: 21 VDQYFALCVADPEFGYYSTCNPF---GAVGDFVT-----A--PEIS-------------- 56 + + + +P +GY+S G DF T A E+ Sbjct: 109 LRDFIDDSLYNPSYGYFSKQAVIFSPGEPFDFTTLRDDLAFQSELGRRYTSFEDHLDDVE 168 Query: 57 -----------------QIFGEMLAIFLICAWEQHGFPSCVR-LVELGPGRGIMMLDILR 98 +GE +A +L+ + +P + E+G GRG +ML+IL Sbjct: 169 GENPTRQLWHTPTELFRPYYGEAIARYLVTNYRLTTYPYDDLLIYEMGAGRGTLMLNILD 228 Query: 99 VICKLKPDFFSVLSIYMVETSERLTLIQKKQL------ASYGDKI----NWYTSLADVPL 148 I ++ P ++ ++E S L +Q K L + DK+ Sbjct: 229 YIREVDPQVYARTRYNIIEISTNLASLQNKHLLSTAESRGHRDKVDIVNRSIFEWDQYVP 288 Query: 149 GFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVF 189 F +A E FD+ + F Sbjct: 289 SPCFFLAMEVFDNFSHDCIRYDVATEEPLQGHVLIDGDGDF 329 >gi|85119707|ref|XP_965696.1| hypothetical protein NCU02565 [Neurospora crassa OR74A] gi|28927508|gb|EAA36460.1| conserved hypothetical protein [Neurospora crassa OR74A] gi|38567141|emb|CAE76436.1| conserved hypothetical protein [Neurospora crassa] Length = 531 Score = 86.3 bits (212), Expect = 6e-15, Method: Composition-based stats. Identities = 40/263 (15%), Positives = 79/263 (30%), Gaps = 55/263 (20%) Query: 21 VDQYFALCVADPEFGYYSTCNPF---GAVGDF---------------------------- 49 + + + +P +GY+S G DF Sbjct: 109 LRDFIEDSLYNPNYGYFSKQVTIFTPGEPFDFPNLRDETEFQNVLSQRYVDFEDKLDEVA 168 Query: 50 --------VTAPEI-SQIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRV 99 T E+ +GE +A +L+ ++ +P + E+G GRG +ML+IL Sbjct: 169 PSDTRQLWYTPTELFRPYYGEAIARYLVANYKLTTYPYHDLIIYEMGAGRGTLMLNILDY 228 Query: 100 ICKLKPDFFSVLSIYMVETSERLTLIQKK----------QLASYGDKINWYTSLADVPLG 149 I + PD ++ ++E S+ L +Q L Sbjct: 229 IRDMDPDVYARTQYKVIEISDSLAKVQNSTLMRSAAGRGHLNKVEIINQSIFDWTQPVPS 288 Query: 150 FTFLVANEFFDSLPIKQFVMT-EHGIRERMIDIDQHDSLVFNIGDH---EIKSNFLTCSD 205 F +A E FD+ + + + + F + + + D Sbjct: 289 PCFFLAFEVFDNFAHDAIRYDLATETPMQAVVLISESNDFFEFYSPVLDPVAARYFRVRD 348 Query: 206 YFLGAIFENSPCRDREMQSISDR 228 G ++ + + +S Sbjct: 349 AATGGRYKVPYPTTKLGKWVSSI 371 >gi|295837268|ref|ZP_06824201.1| conserved hypothetical protein [Streptomyces sp. SPB74] gi|295826438|gb|EFG64854.1| conserved hypothetical protein [Streptomyces sp. SPB74] Length = 467 Score = 86.3 bits (212), Expect = 7e-15, Method: Composition-based stats. Identities = 57/304 (18%), Positives = 99/304 (32%), Gaps = 37/304 (12%) Query: 25 FALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVE 84 A + P G+Y + P G F T+ S + +A L G P + V+ Sbjct: 1 MATALYGPG-GFYRSAGP-GPAAHFRTSVHASPRYAHAVARLLTEVDAVLGGPPVLDFVD 58 Query: 85 LGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLA 144 +G GRG ++ V+ L + L + VE + R D + Sbjct: 59 IGAGRGELLTG---VLAALPEKTAARLRPHGVERAPR---------PEGLDPRIGWGEEL 106 Query: 145 DVPLGFTFLVANEFFDSLPIK----------QFVMTEHGIRERM--IDIDQHDSLVFNIG 192 L ANE+ D++P+ + V+ ER+ + Sbjct: 107 PGHGLTGLLFANEWLDNVPVDVVETDTEGVARLVLVAPDGTERLGEPVTGTDADWLARWW 166 Query: 193 DHEIKSNFLTCSDYFLGAI------FENSPCRDREMQSISDRLACDGGTAIVIDYGY--L 244 + + G P R+ + G A+ +DY + Sbjct: 167 PPDSPTTDRADGAGPHGVTGAREAGTRAEPGTGRDAAWAAAVSCLGRGLAVAVDYAHVAH 226 Query: 245 QSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEG 304 TL + V+P+ + G DL++HV ++ + + LT Q + L Sbjct: 227 ARPAFGTLTGFRAGREVAPVPD-GTTDLTAHVALDACAAATAHHGPAV--LTRQREALRA 283 Query: 305 LGIW 308 LGI Sbjct: 284 LGIT 287 >gi|255945633|ref|XP_002563584.1| Pc20g10950 [Penicillium chrysogenum Wisconsin 54-1255] gi|211588319|emb|CAP86424.1| Pc20g10950 [Penicillium chrysogenum Wisconsin 54-1255] Length = 494 Score = 85.9 bits (211), Expect = 8e-15, Method: Composition-based stats. Identities = 38/220 (17%), Positives = 69/220 (31%), Gaps = 51/220 (23%) Query: 22 DQYFALCVADPEFGYYSTCNPF---GAVGDF-------------------------VTAP 53 ++ + +P +GY+S G DF T P Sbjct: 97 REFIDDSLYNPHYGYFSKHATIFSPGEPFDFNNIEDGPAFHKLLGERYTEFEDKLDATNP 156 Query: 54 EIS------------QIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVI 100 +I+ +GE +A +L+ ++ +P + E+G G G MML+IL I Sbjct: 157 DIARQLWHTPTELFRPYYGETIARYLVSNYKLTLYPYHDLIIYEMGAGNGTMMLNILDFI 216 Query: 101 CKLKPDFFSVLSIYMVETSERLTLIQKKQL------ASYGDKI----NWYTSLADVPLGF 150 + + ++E S L +Q K L + D + Sbjct: 217 RDTDYEVYQRTKFKIIEISPALADLQYKNLTDKLSAKGHRDHVEIVNRSIFDWDTYVHSP 276 Query: 151 TFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFN 190 F +A E FD+ + + + F+ Sbjct: 277 CFFLALEVFDNFAHDTIRYQQGTEMPQQGGVLIDADGEFH 316 >gi|289615776|emb|CBI57517.1| unnamed protein product [Sordaria macrospora] Length = 508 Score = 85.9 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 40/263 (15%), Positives = 80/263 (30%), Gaps = 55/263 (20%) Query: 21 VDQYFALCVADPEFGYYSTCNPF---GAVGDF---------------------------- 49 + + + +P +GY+S G DF Sbjct: 109 LRDFIEDSLYNPNYGYFSKQVTIFTPGEPFDFPAFRDETEFQNVLSQRYVEFEDKLDEVA 168 Query: 50 --------VTAPEI-SQIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRV 99 T E+ +GE +A +L+ ++ +P + E+G GRG +ML+IL Sbjct: 169 PSDTRQLWYTPTELFRPYYGEAIARYLVANYKLTTYPYHDLIIYEMGAGRGTLMLNILDY 228 Query: 100 ICKLKPDFFSVLSIYMVETSERLTLIQKK----------QLASYGDKINWYTSLADVPLG 149 I + PD ++ ++E S+ L +Q L Sbjct: 229 IRDMDPDVYARTQYKVIEISDSLAKVQSSTLMRSAAGRGHLNKVEIINQSIFDWTQPVPS 288 Query: 150 FTFLVANEFFDSLPIKQFVMTEHGIR-ERMIDIDQHDSLVFNIGDH---EIKSNFLTCSD 205 F +A E FD+ + + + + + F + + + D Sbjct: 289 PCFFLAFEVFDNFAHDAIRYDLTTEQPMQAVVLISESNDFFEFYSPVLDPVAARYFRVRD 348 Query: 206 YFLGAIFENSPCRDREMQSISDR 228 G ++ + + +S Sbjct: 349 AATGGRYKVPYPTTKLGKWVSSI 371 >gi|310789964|gb|EFQ25497.1| hypothetical protein GLRG_00641 [Glomerella graminicola M1.001] Length = 531 Score = 85.2 bits (209), Expect = 1e-14, Method: Composition-based stats. Identities = 46/265 (17%), Positives = 84/265 (31%), Gaps = 56/265 (21%) Query: 22 DQYFALCVADPEFGYYSTCNPFGAVG------------DFV------------------- 50 + + +P +GY+ST + G F Sbjct: 132 RDFIEDSLYNPAYGYFSTQAVIFSPGQPFDFPAFRDEPHFYRELGRRYTEFEDELDDKEG 191 Query: 51 ---------TAPEI-SQIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRV 99 T E+ +GE +A +L+ + +P + E+G GRG +ML+IL Sbjct: 192 VNSARPLWHTPTELFRPYYGEAIARYLVSNYRLTTYPYHDLIIYEMGAGRGTLMLNILDY 251 Query: 100 ICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGD----------KINWYTSLADVPLG 149 I L P ++ ++E S L +Q+ L S D S ++ Sbjct: 252 IRDLDPQVYARTQYRIIEISPALASLQRHHLLSTADSRGHASKVEIINKSIFSWSERVPS 311 Query: 150 FTFLVANEFFDSLPIKQFVMTEHGIR-ERMIDIDQHDSLVFNIGDHEIKS---NFLTCSD 205 F +A E FD+ + + D+ + ++ FL Sbjct: 312 PCFFLAMEVFDNFSHDCVRYDLATEEPLQGSVLIDADNDFYEFYSPDLDPVLARFLRVRH 371 Query: 206 YFLGAIFENSPCRDREMQSISDRLA 230 G + +R ++ + RL Sbjct: 372 AATGGRYPTPYPANRMLRQLKSRLP 396 >gi|212541100|ref|XP_002150705.1| conserved hypothetical protein [Penicillium marneffei ATCC 18224] gi|210068004|gb|EEA22096.1| conserved hypothetical protein [Penicillium marneffei ATCC 18224] Length = 504 Score = 85.2 bits (209), Expect = 1e-14, Method: Composition-based stats. Identities = 37/219 (16%), Positives = 67/219 (30%), Gaps = 51/219 (23%) Query: 22 DQYFALCVADPEFGYYSTCNPF---GAVGDF----------------VT---------AP 53 ++ + +P +GY+S G DF T P Sbjct: 106 REFIEDSLYNPHYGYFSKHATIFHPGEPFDFSRIEDGPHFHRLLGERYTEFEDKLDEINP 165 Query: 54 EIS------------QIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVI 100 +I+ +GE +A +L+ + +P + E+G G G +ML+IL I Sbjct: 166 DIARQLWHTPTELFRPYYGEAIARYLVTNYRLSLYPYHDLIIYEMGAGNGTLMLNILDFI 225 Query: 101 CKLKPDFFSVLSIYMVETSERLTLIQKKQLA----------SYGDKINWYTSLADVPLGF 150 P+ ++ ++E S L +QK+ L + Sbjct: 226 RDSDPEVYARTKFKIIEISSSLAKVQKENLQASLSAGGHTGHVQIINKSIFDWNEYIHSP 285 Query: 151 TFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVF 189 F +A E FD+ + + F Sbjct: 286 CFFLALEVFDNFSHDAIRYDYETEMPQQGGVVIDSDGEF 324 >gi|225561951|gb|EEH10231.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR] Length = 502 Score = 84.8 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 36/225 (16%), Positives = 64/225 (28%), Gaps = 51/225 (22%) Query: 22 DQYFALCVADPEFGYYSTCNPF---GAVGDF----------------------------- 49 + + +P +GY+S G DF Sbjct: 105 RDFIEDSLYNPHYGYFSKHATIFGPGEPFDFNNMADGPEFDRVLGQRYQEFEDKLDAVEY 164 Query: 50 -------VTAPEI-SQIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVI 100 T E+ +GE +A +L+ ++ FP + E+G G G +ML++L I Sbjct: 165 DESRQLWHTPTELFRPYYGEAIARYLVANYKLTLFPYHDLTIYEMGAGNGTLMLNVLDYI 224 Query: 101 CKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGD----------KINWYTSLADVPLGF 150 + P+ + ++E S L +Q++ L Sbjct: 225 RDVDPEVYQRTKFKIIEISPSLADLQQQNLNKSIHSSGHRGHAEIINRSIFDWNTYVHSP 284 Query: 151 TFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHE 195 F +A E FD+ R + F Sbjct: 285 CFFLALEVFDNFSHDAIRYDLETGDPRQGCVLIDTDGEFYEYYVP 329 >gi|154283663|ref|XP_001542627.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1] gi|150410807|gb|EDN06195.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1] Length = 428 Score = 84.8 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 36/225 (16%), Positives = 64/225 (28%), Gaps = 51/225 (22%) Query: 22 DQYFALCVADPEFGYYSTCNPF---GAVGDF----------------------------- 49 + + +P +GY+S G DF Sbjct: 31 RDFIEDSLYNPHYGYFSKHATIFGPGEPFDFNNMADGPEFNRLLGQRYQEFEDKLDAVEY 90 Query: 50 -------VTAPEI-SQIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVI 100 T E+ +GE +A +L+ ++ FP + E+G G G +ML++L I Sbjct: 91 DESRQLWHTPTELFRPYYGEAIARYLVANYKLTLFPYHDLTIYEMGAGNGTLMLNVLDYI 150 Query: 101 CKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGD----------KINWYTSLADVPLGF 150 + P+ + ++E S L +Q++ L Sbjct: 151 RDVDPEVYQRTKFKIIEISPSLADLQQQNLNKSIHSSGHRGHVEIINRSIFDWNTYVHSP 210 Query: 151 TFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHE 195 F +A E FD+ R + F Sbjct: 211 CFFLALEVFDNFSHDAIRYDLETGDPRQGCVLIDTDGEFYEYYVP 255 >gi|315046326|ref|XP_003172538.1| hypothetical protein MGYG_05129 [Arthroderma gypseum CBS 118893] gi|311342924|gb|EFR02127.1| hypothetical protein MGYG_05129 [Arthroderma gypseum CBS 118893] Length = 503 Score = 84.8 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 38/226 (16%), Positives = 65/226 (28%), Gaps = 53/226 (23%) Query: 22 DQYFALCVADPEFGYYSTCNPF---GAVGDFVTAPEISQIF------------------- 59 + + +P +GY+S G DF + E F Sbjct: 106 RDFIEDSLYNPHYGYFSKHATIFTPGEPFDF-NSIEDGPAFNKLLDQRYVEFEDKLDESN 164 Query: 60 -------------------GEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRV 99 GE +A +L+ ++ FP + E+G G G MML+IL Sbjct: 165 YDETRQLWHTPTELFRPYYGEAIARYLVTNYKLTLFPYHDLIIYEMGAGNGTMMLNILDY 224 Query: 100 ICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGD----------KINWYTSLADVPLG 149 I ++P+ + ++E S L +Q++ L Sbjct: 225 IRDVEPEVYQRTKYKIIEISSSLANLQQQNLNHSIHAGGHGGHAEIINRSIFDWNTYVHS 284 Query: 150 FTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHE 195 F +A E FD+ + R + F Sbjct: 285 PCFFLALEVFDNFGHDVIRYDMETGQPRQGCVLIDADGEFYEYYVP 330 >gi|312215501|emb|CBX95453.1| hypothetical protein [Leptosphaeria maculans] Length = 519 Score = 84.8 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 48/308 (15%), Positives = 95/308 (30%), Gaps = 51/308 (16%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGD------------FV------------------ 50 + + + +P +GY+S + GD F Sbjct: 98 LRDFIEDSLYNPNYGYFSKQVVIFSPGDPFKFNEMESEHDFFQQLRHRYTAFEDALDYQE 157 Query: 51 ---------TAPEI-SQIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRV 99 T E+ S +GE +A +L+ ++ + +P + E+G G G MML+IL Sbjct: 158 PNDLRQLWHTPTELFSPYYGEAIARYLVEDYKFNFYPYHDLNIYEMGAGNGTMMLNILDY 217 Query: 100 ICKLKPDFFSVLSIYMVETSERLTLIQKKQL------ASYGDKI----NWYTSLADVPLG 149 I + P+ + ++E S +L +Q+K L + DK+ Sbjct: 218 IRDVYPEVYERTKFRIIEISSQLADLQQKGLGHSAYARGHSDKVEIVNRSIFDWDQYVSS 277 Query: 150 FTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG 209 + +A E FD+ + T Sbjct: 278 PCYFLALEVFDNFAHDALKYDFETGAPYQSHVVIDPRGELFEYYSRTLDPVATQFLERRH 337 Query: 210 AIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQ 269 A +N P + + + + + + Y+ +R+ + L++ Sbjct: 338 AACKNYPHPLQGSSMMHNLRSLVPWHSNLSQPEYIPTRLMQFFYMLYEKFPNHKLISSDF 397 Query: 270 ADLSSHVD 277 LS V+ Sbjct: 398 HKLSDSVE 405 >gi|327355579|gb|EGE84436.1| hypothetical protein BDDG_07381 [Ajellomyces dermatitidis ATCC 18188] Length = 502 Score = 84.8 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 36/226 (15%), Positives = 64/226 (28%), Gaps = 51/226 (22%) Query: 22 DQYFALCVADPEFGYYSTCNPF---GAVGDF----------------------------- 49 + + +P +GY+S G DF Sbjct: 105 RDFIEDSLYNPHYGYFSKHATIFSPGEPFDFNSMADGPEFNRLLGQRYQEFEDKLDAVQY 164 Query: 50 -------VTAPEI-SQIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVI 100 T E+ +GE +A +L+ ++ FP + E+G G G +ML++L I Sbjct: 165 DESRQLWHTPTELFRPYYGEAIARYLVTNYKLTLFPYHDLTIYEMGAGNGTLMLNVLDYI 224 Query: 101 CKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGD----------KINWYTSLADVPLGF 150 + P+ + ++E S L +Q++ L Sbjct: 225 RDVDPEVYQRTKFKIIEISPSLANLQQQNLNKSIHSSGHRGHAEIINRSIFDWNTYVHSP 284 Query: 151 TFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 F +A E FD+ R + F Sbjct: 285 CFFLALEVFDNFSHDAIRYDLETGEPRQGCVLIDTDGEFYEYYIPK 330 >gi|261197878|ref|XP_002625341.1| conserved hypothetical protein [Ajellomyces dermatitidis SLH14081] gi|239595304|gb|EEQ77885.1| conserved hypothetical protein [Ajellomyces dermatitidis SLH14081] gi|239607731|gb|EEQ84718.1| conserved hypothetical protein [Ajellomyces dermatitidis ER-3] Length = 502 Score = 84.8 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 36/226 (15%), Positives = 64/226 (28%), Gaps = 51/226 (22%) Query: 22 DQYFALCVADPEFGYYSTCNPF---GAVGDF----------------------------- 49 + + +P +GY+S G DF Sbjct: 105 RDFIEDSLYNPHYGYFSKHATIFSPGEPFDFNSMADGPEFNRLLGQRYQEFEDKLDAVQY 164 Query: 50 -------VTAPEI-SQIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVI 100 T E+ +GE +A +L+ ++ FP + E+G G G +ML++L I Sbjct: 165 DESRQLWHTPTELFRPYYGEAIARYLVTNYKLTLFPYHDLTIYEMGAGNGTLMLNVLDYI 224 Query: 101 CKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGD----------KINWYTSLADVPLGF 150 + P+ + ++E S L +Q++ L Sbjct: 225 RDVDPEVYQRTKFKIIEISPSLANLQQQNLNKSIHSSGHRGHAEIINRSIFDWNTYVHSP 284 Query: 151 TFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 F +A E FD+ R + F Sbjct: 285 CFFLALEVFDNFSHDAIRYDLETGEPRQGCVLIDTDGEFYEYYIPK 330 >gi|330920710|ref|XP_003299115.1| hypothetical protein PTT_10050 [Pyrenophora teres f. teres 0-1] gi|311327332|gb|EFQ92791.1| hypothetical protein PTT_10050 [Pyrenophora teres f. teres 0-1] Length = 496 Score = 84.4 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 38/203 (18%), Positives = 70/203 (34%), Gaps = 51/203 (25%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGD------------FV------------------ 50 + + + +P +GY+S + GD F Sbjct: 98 LRDFIDDSLYNPNYGYFSKQVVIFSPGDPFEFNAMNSEHEFFQQLRHRYTAFEDELDYQE 157 Query: 51 ---------TAPEI-SQIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRV 99 T E+ S +GE +A +L+ ++ + +P + E+G G G MML+IL Sbjct: 158 PNDLRQLWHTPTELFSPYYGEAIARYLVEDYKYNFYPYHDLNIYEMGAGNGTMMLNILDF 217 Query: 100 ICKLKPDFFSVLSIYMVETSERLTLIQKKQL------ASYGDKI----NWYTSLADVPLG 149 I + P+ + ++E S +L +Q+K L + DK+ Sbjct: 218 IRDVHPEVYERTKFKIIEISSQLADLQQKGLGHSAYARGHSDKVEIVNRSIFDWNVYVSS 277 Query: 150 FTFLVANEFFDSLPIKQFVMTEH 172 + +A E FD+ Sbjct: 278 PCYFLALEVFDNFAHDALKYDFE 300 >gi|326472109|gb|EGD96118.1| hypothetical protein TESG_03577 [Trichophyton tonsurans CBS 112818] gi|326477026|gb|EGE01036.1| hypothetical protein TEQG_00090 [Trichophyton equinum CBS 127.97] Length = 503 Score = 84.0 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 38/225 (16%), Positives = 66/225 (29%), Gaps = 51/225 (22%) Query: 22 DQYFALCVADPEFGYYSTCNPF---GAVGDF----------------------------- 49 + + +P +GY+S G DF Sbjct: 106 RDFIEDSLYNPHYGYFSKHATIFSPGEPFDFNNIEDGPTFNKLLDQRYVEFEDKLDETNY 165 Query: 50 -------VTAPEI-SQIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVI 100 T E+ +GE +A +L+ ++ FP + E+G G G MML+IL I Sbjct: 166 DETRQLWHTPTELFRPYYGEAIARYLVTNYKLTLFPYHDLIIYEMGAGNGTMMLNILDYI 225 Query: 101 CKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGD----------KINWYTSLADVPLGF 150 ++P+ + ++E S L +Q++ L Sbjct: 226 RDVEPEVYQRTKYKIIEISSSLANLQQQNLNHSIHAGGHGGHAEIINRSIFDWNTYVHSP 285 Query: 151 TFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHE 195 F +A E FD+ + R + F Sbjct: 286 CFFLALEVFDNFGHDVIRYDMETGQPRQGCVLIDSDGEFYEYYVP 330 >gi|327305267|ref|XP_003237325.1| hypothetical protein TERG_02047 [Trichophyton rubrum CBS 118892] gi|326460323|gb|EGD85776.1| hypothetical protein TERG_02047 [Trichophyton rubrum CBS 118892] Length = 503 Score = 84.0 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 38/225 (16%), Positives = 66/225 (29%), Gaps = 51/225 (22%) Query: 22 DQYFALCVADPEFGYYSTCNPF---GAVGDF----------------------------- 49 + + +P +GY+S G DF Sbjct: 106 RDFIEDSLYNPHYGYFSKHATIFSPGEPFDFNNIEDGPTFNKLLDQRYVEFEDKLDETNY 165 Query: 50 -------VTAPEI-SQIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVI 100 T E+ +GE +A +L+ ++ FP + E+G G G MML+IL I Sbjct: 166 DETRQLWHTPTELFRPYYGEAIARYLVTNYKLTLFPYHDLIIYEMGAGNGTMMLNILDYI 225 Query: 101 CKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGD----------KINWYTSLADVPLGF 150 ++P+ + ++E S L +Q++ L Sbjct: 226 RDVEPEVYQRTKYKIIEISSSLANLQQQNLNHSIHAGGHGGHAEIINRSIFDWNTYVHSP 285 Query: 151 TFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHE 195 F +A E FD+ + R + F Sbjct: 286 CFFLALEVFDNFGHDVIRYDMETGQPRQGCVLIDSDGEFYEYYVP 330 >gi|325091394|gb|EGC44704.1| conserved hypothetical protein [Ajellomyces capsulatus H88] Length = 502 Score = 84.0 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 36/225 (16%), Positives = 64/225 (28%), Gaps = 51/225 (22%) Query: 22 DQYFALCVADPEFGYYSTCNPF---GAVGDF----------------------------- 49 + + +P +GY+S G DF Sbjct: 105 RDFIEDSLYNPHYGYFSKHATIFGPGEPFDFNNMADGPEFNRLLGQRYQEFEDKLDAVEY 164 Query: 50 -------VTAPEI-SQIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVI 100 T E+ +GE +A +L+ ++ FP + E+G G G +ML++L I Sbjct: 165 DESRQLWHTPTELFRPYYGEAIARYLVANYKLTLFPYHDLTIYEMGAGNGTLMLNVLDYI 224 Query: 101 CKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGD----------KINWYTSLADVPLGF 150 + P+ + ++E S L +Q++ L Sbjct: 225 RDVDPEVYQRTKFKIIEISPSLADLQQQNLNRSIHSSGHRGHAEIINRSIFDWNTYVHSP 284 Query: 151 TFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHE 195 F +A E FD+ R + F Sbjct: 285 CFFLALEVFDNFSHDTIRYDLETGDPRQGCVLIDTDGEFYEYYVP 329 >gi|240275566|gb|EER39080.1| conserved hypothetical protein [Ajellomyces capsulatus H143] Length = 502 Score = 83.6 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 36/225 (16%), Positives = 64/225 (28%), Gaps = 51/225 (22%) Query: 22 DQYFALCVADPEFGYYSTCNPF---GAVGDF----------------------------- 49 + + +P +GY+S G DF Sbjct: 105 RDFIEDSLYNPHYGYFSKHATIFGPGEPFDFNNMADGPEFNRLLGQRYQEFEDKLDAVEY 164 Query: 50 -------VTAPEI-SQIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVI 100 T E+ +GE +A +L+ ++ FP + E+G G G +ML++L I Sbjct: 165 DESRQLWHTPTELFRPYYGEAIARYLVANYKLTLFPYHDLTIYEMGAGNGTLMLNVLDYI 224 Query: 101 CKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGD----------KINWYTSLADVPLGF 150 + P+ + ++E S L +Q++ L Sbjct: 225 RDVDPEVYQRTKFKIIEISPSLADLQQQNLNRSIHSSGHRGHAEIINRSIFDWNTYVHSP 284 Query: 151 TFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHE 195 F +A E FD+ R + F Sbjct: 285 CFFLALEVFDNFSHDTIRYDLETGDPRQGCVLIDTDGEFYEYYVP 329 >gi|296805961|ref|XP_002843800.1| conserved hypothetical protein [Arthroderma otae CBS 113480] gi|238845102|gb|EEQ34764.1| conserved hypothetical protein [Arthroderma otae CBS 113480] Length = 428 Score = 83.6 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 38/225 (16%), Positives = 66/225 (29%), Gaps = 51/225 (22%) Query: 22 DQYFALCVADPEFGYYSTCNPF---GAVGDF----------------------------- 49 + + +P +GY+S G DF Sbjct: 31 RDFIEDSLYNPHYGYFSKHATIFTPGEPFDFNNIEDGPAFNKLLDQRYVEFEDKLDEVNY 90 Query: 50 -------VTAPEI-SQIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVI 100 T E+ +GE +A +L+ ++ FP + E+G G G MML+IL I Sbjct: 91 DETRQLWHTPTELFRPYYGEAIARYLVTNYKLTLFPYHDLIIYEMGAGNGTMMLNILDYI 150 Query: 101 CKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGD----------KINWYTSLADVPLGF 150 ++P+ + ++E S L +Q++ L Sbjct: 151 RDVEPEVYQRTKYKIIEISSSLASLQQRNLNHSIHAGGHGGHAEIINRSIFDWDTYVHSP 210 Query: 151 TFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHE 195 F +A E FD+ + R + F Sbjct: 211 CFFLALEVFDNFGHDVIRYDMETGQPRQGCVLIDSDGEFYEYYVP 255 >gi|145616048|ref|XP_361063.2| hypothetical protein MGG_03606 [Magnaporthe oryzae 70-15] gi|145009818|gb|EDJ94474.1| hypothetical protein MGG_03606 [Magnaporthe oryzae 70-15] Length = 402 Score = 83.6 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 45/258 (17%), Positives = 80/258 (31%), Gaps = 55/258 (21%) Query: 22 DQYFALCVADPEFGYYSTCNPF---GAVGDF----------------------------- 49 + + +P +GY+S G DF Sbjct: 4 RDFIQDSLYNPNYGYFSKQVVIFSPGEPFDFPALRDENDFHSLLSQRYTDFEDDLDSTQP 63 Query: 50 -------VTAPEI-SQIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVI 100 T E+ +GE +A +L+ + +P + E+G GRG +ML+IL I Sbjct: 64 SDTRQLWFTPTELFRPYYGEAIARYLMANYVLTSYPYHDLIIYEMGAGRGTLMLNILDFI 123 Query: 101 CKLKPDFFSVLSIYMVETSERLTLIQKKQL------ASYGDKI----NWYTSLADVPLGF 150 +P + ++E S L +Q +QL + DK+ A Sbjct: 124 RDTEPSVYDRTKYRIIEISSSLAEMQNRQLRSSAAARGHADKVEIINRSILDWAQPEPSP 183 Query: 151 TFLVANEFFDSLPIKQFVMTEHGIR-ERMIDIDQHDSLVFNIGDH---EIKSNFLTCSDY 206 F +A E FD+ + + + H F + + F Sbjct: 184 CFFLAFEVFDNFSHDCIRYDLATEQPMQGTVLIDHRGDFFEFYEPKLDPLAERFFRVRHA 243 Query: 207 FLGAIFENSPCRDREMQS 224 G +E +R ++ Sbjct: 244 ATGGRYETPYPANRLLRK 261 >gi|259486836|tpe|CBF85017.1| TPA: conserved hypothetical protein [Aspergillus nidulans FGSC A4] Length = 498 Score = 83.6 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 45/284 (15%), Positives = 79/284 (27%), Gaps = 56/284 (19%) Query: 22 DQYFALCVADPEFGYYSTCNPF---GAVGDF----------------------------- 49 ++ + +P +GY+S G DF Sbjct: 101 REFIDDSLYNPHYGYFSKHATIFSPGEPFDFNNIEDGPAFHRLLGERYTEFEDMLDEKQP 160 Query: 50 -------VTAPEI-SQIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVI 100 T E+ +GE +A +L+ ++ +P + E+G G G MM++IL I Sbjct: 161 DEARQLWHTPTELFRPYYGETIARYLVSNYKLTLYPYHDLIIYEMGAGNGTMMINILDFI 220 Query: 101 CKLKPDFFSVLSIYMVETSERLTLIQKKQL------ASYGDKI----NWYTSLADVPLGF 150 + + ++E S L +Q K L A + D + Sbjct: 221 RDTDYEVYQRTKFRIIEISPALAGLQMKNLTDSLYAAGHLDHVEIINKSIFEWDTYVHSP 280 Query: 151 TFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGA 210 F +A E FD+ + + F+ + FL Sbjct: 281 CFFLALEVFDNFAHDAIRYDTKTEMPQQGGVLIDGDGEFHEFWTP---KLDPLASRFL-- 335 Query: 211 IFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQA 254 + R + RLA + Y T Sbjct: 336 RVRQAAARREFPSPLGPRLARQIRGTLPFQKPYTMPEYIPTRLM 379 >gi|302506967|ref|XP_003015440.1| hypothetical protein ARB_06566 [Arthroderma benhamiae CBS 112371] gi|291179012|gb|EFE34800.1| hypothetical protein ARB_06566 [Arthroderma benhamiae CBS 112371] Length = 420 Score = 83.6 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 38/225 (16%), Positives = 66/225 (29%), Gaps = 51/225 (22%) Query: 22 DQYFALCVADPEFGYYSTCNPF---GAVGDF----------------------------- 49 + + +P +GY+S G DF Sbjct: 4 RDFIEDSLYNPHYGYFSKHATIFSPGEPFDFNNIEDGPAFNKLLDQRYVEFEDKLDETNY 63 Query: 50 -------VTAPEI-SQIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVI 100 T E+ +GE +A +L+ ++ FP + E+G G G MML+IL I Sbjct: 64 DETRQLWHTPTELFRPYYGEAIARYLVTNYKLTLFPYHDLIIYEMGAGNGTMMLNILDYI 123 Query: 101 CKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGD----------KINWYTSLADVPLGF 150 ++P+ + ++E S L +Q++ L Sbjct: 124 RDVEPEVYQRTKYKIIEISSSLANLQQQNLNHSIHAGGHGGHAEVINRSIFDWNTYVHSP 183 Query: 151 TFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHE 195 F +A E FD+ + R + F Sbjct: 184 CFFLALEVFDNFGHDVIRYDMETGQPRQGCVLIDSDGEFYEYYVP 228 >gi|302415837|ref|XP_003005750.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102] gi|261355166|gb|EEY17594.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102] Length = 504 Score = 83.6 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 41/250 (16%), Positives = 75/250 (30%), Gaps = 56/250 (22%) Query: 21 VDQYFALCVADPEFGYYSTCNPF---GAVGDFVTAP----------EIS----------- 56 + + + +P +GY+S G DF P E+ Sbjct: 105 MRDFVDDSLYNPNYGYFSKNAVIFSPGQPFDF---PAFSHEPAFYKELGHRYTEFEDALD 161 Query: 57 --------------------QIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLD 95 +GE +A +L+ + +P + E+G GRG +ML+ Sbjct: 162 DKEGVKDDRPLWHTPTELFRPYYGEAIARYLVSNYRLTTYPYHDLIIYEMGAGRGTLMLN 221 Query: 96 ILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQL---ASYGDKI----NWYTSLADVPL 148 IL I + P ++ ++E S L +Q + L + + DK+ + Sbjct: 222 ILDYIRDMDPQVYARTKYKIIEISSSLAALQGQNLLRASGHADKVDIINKSIFDWNERVP 281 Query: 149 GFTFLVANEFFDSLPIKQFVMTEHGIR-ERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYF 207 F +A E FD+ + + D + D + Sbjct: 282 SPCFFLALEVFDNFAHDCLRYDLRTEEPLQARVLIDDDGDFYEFYDTRLDPVAARFLRVR 341 Query: 208 LGAIFENSPC 217 A P Sbjct: 342 HAATAGRYPA 351 >gi|226291020|gb|EEH46448.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18] Length = 502 Score = 83.2 bits (204), Expect = 5e-14, Method: Composition-based stats. Identities = 36/225 (16%), Positives = 65/225 (28%), Gaps = 51/225 (22%) Query: 22 DQYFALCVADPEFGYYSTCNPF---GAVGDF----------------------------- 49 + + +P +GY+S G DF Sbjct: 105 RDFIEDSLYNPNYGYFSKHATIFNPGEPFDFNSMADGPEFNRLLGQRYKEFEDKLDAVKY 164 Query: 50 -------VTAPEI-SQIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVI 100 T E+ +GE +A +L+ ++ FP + E+G G G +ML++L I Sbjct: 165 DESRQLWHTPTELFRPYYGEAIARYLVTNYKLTLFPYHDLIIYEMGAGNGTLMLNVLDYI 224 Query: 101 CKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGD----------KINWYTSLADVPLGF 150 + + + ++E S L +Q+K L + Sbjct: 225 RDVDLEVYQRTKFKIIEISPSLAKLQQKNLKNSIHSSGHRGHAEIINQSIFDWNTYVHSP 284 Query: 151 TFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHE 195 F +A E FD+ + R + F Sbjct: 285 CFFLALEVFDNFSHDAIRYDLETGQPRQGCVLIDPDGEFYEYYIP 329 >gi|225679300|gb|EEH17584.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03] Length = 536 Score = 83.2 bits (204), Expect = 5e-14, Method: Composition-based stats. Identities = 36/225 (16%), Positives = 65/225 (28%), Gaps = 51/225 (22%) Query: 22 DQYFALCVADPEFGYYSTCNPF---GAVGDF----------------------------- 49 + + +P +GY+S G DF Sbjct: 139 RDFIEDSLYNPNYGYFSKHATIFNPGEPFDFNSMADGPEFNRLLGQRYKEFEDKLDAVKY 198 Query: 50 -------VTAPEI-SQIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVI 100 T E+ +GE +A +L+ ++ FP + E+G G G +ML++L I Sbjct: 199 DESRQLWHTPTELFRPYYGEAIARYLVTNYKLTLFPYHDLIIYEMGAGNGTLMLNVLDYI 258 Query: 101 CKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGD----------KINWYTSLADVPLGF 150 + + + ++E S L +Q+K L + Sbjct: 259 RDVDLEVYQRTKFKIIEISPSLAKLQQKNLKNSIHSSGHRGHAEIINQSIFDWNTYVHSP 318 Query: 151 TFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHE 195 F +A E FD+ + R + F Sbjct: 319 CFFLALEVFDNFSHDAIRYDLETGQPRQGCVLIDTDGEFYEYYIP 363 >gi|302659570|ref|XP_003021473.1| hypothetical protein TRV_04414 [Trichophyton verrucosum HKI 0517] gi|291185375|gb|EFE40855.1| hypothetical protein TRV_04414 [Trichophyton verrucosum HKI 0517] Length = 420 Score = 83.2 bits (204), Expect = 5e-14, Method: Composition-based stats. Identities = 39/225 (17%), Positives = 66/225 (29%), Gaps = 51/225 (22%) Query: 22 DQYFALCVADPEFGYYSTCNPF---GAVGDF----------------------------- 49 + + +P +GY+S G DF Sbjct: 4 RDFIEDSLYNPHYGYFSKHATIFSPGEPFDFNNIEDGPTFNKLLDQRYVEFEDKLDETNY 63 Query: 50 -------VTAPEI-SQIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVI 100 T E+ +GE +A +L+ ++ FP + E+G G G MML+IL I Sbjct: 64 DETRQLWHTPTELFRPYYGEAIARYLVTNYKLTLFPYHDLIIYEMGAGNGTMMLNILDYI 123 Query: 101 CKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGD----------KINWYTSLADVPLGF 150 ++PD + ++E S L +Q++ L Sbjct: 124 RDVEPDVYQRTKYKIIEISSSLANLQQQNLNHSIHAGGHGGHAEIINRSIFDWNTYVHSP 183 Query: 151 TFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHE 195 F +A E FD+ + R + F Sbjct: 184 CFFLALEVFDNFGHDVIRYDMETGQPRQGCVLIDSDGEFYEYYVP 228 >gi|189203937|ref|XP_001938304.1| hypothetical protein PTRG_07972 [Pyrenophora tritici-repentis Pt-1C-BFP] gi|187985403|gb|EDU50891.1| hypothetical protein PTRG_07972 [Pyrenophora tritici-repentis Pt-1C-BFP] Length = 496 Score = 83.2 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 37/203 (18%), Positives = 70/203 (34%), Gaps = 51/203 (25%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGD------------FV------------------ 50 + + + +P +GY++ + GD F Sbjct: 98 LRDFIDDSLYNPNYGYFAKQVVIFSPGDPFEFNAMSSEHEFFQQLRHRYTAFEDELDYQE 157 Query: 51 ---------TAPEI-SQIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRV 99 T E+ S +GE +A +L+ ++ + +P + E+G G G MML+IL Sbjct: 158 PNDLRQLWHTPTELFSPYYGEAIARYLVEDYKYNFYPYHDLNIYEMGAGNGTMMLNILDF 217 Query: 100 ICKLKPDFFSVLSIYMVETSERLTLIQKKQL------ASYGDKI----NWYTSLADVPLG 149 I + P+ + ++E S +L +Q+K L + DK+ Sbjct: 218 IRDVHPEVYERTKFKIIEISSQLADLQQKGLGHSAYARGHSDKVEIVNRSIFDWNVYVSS 277 Query: 150 FTFLVANEFFDSLPIKQFVMTEH 172 + +A E FD+ Sbjct: 278 PCYFLALEVFDNFAHDALKYDFE 300 >gi|291438085|ref|ZP_06577475.1| conserved hypothetical protein [Streptomyces ghanaensis ATCC 14672] gi|291340980|gb|EFE67936.1| conserved hypothetical protein [Streptomyces ghanaensis ATCC 14672] Length = 368 Score = 82.8 bits (203), Expect = 6e-14, Method: Composition-based stats. Identities = 66/337 (19%), Positives = 112/337 (33%), Gaps = 47/337 (13%) Query: 29 VADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPG 88 + P G+Y P G G F T+ S +F +A L E G P+ + V++ G Sbjct: 54 LYGPG-GFY--RRPEGPAGHFRTSVHASPLFAGAVARLLCRVDEALGRPAVLDFVDMAAG 110 Query: 89 RGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLADVPL 148 RG ++ +L + D + + Y VE + A +I W + Sbjct: 111 RGELVTGVLTAL---PADVAARVRAYAVEL--------ADRPAGLDHRIEWRADPPERVT 159 Query: 149 GFTFLVANEFF----------DSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKS 198 G L ANE+ D + + V+ ER+ + + + Sbjct: 160 G--LLFANEWLDNVPVEVVEVDPAGVPRLVLVRADGTERLGEPVAGAEARWLARWWPL-- 215 Query: 199 NFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYL--QSRVGDTLQAVK 256 G E RDR + + LA G A+ +DY + TL + Sbjct: 216 ------GGEEGLRAEIGLPRDRAWAAAAGTLAR--GLAVAVDYAHTAGARPPFGTLTGFR 267 Query: 257 GHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIW-QR--AFS 313 SP+ + D+++HV ++ L + +Q L LG+ R Sbjct: 268 EGRETSPVPDGS-CDITAHVALDACAAACALPGARLL---SQRDALRSLGLTGDRPPLAR 323 Query: 314 LMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVV 350 A L + + + G+ F LV Sbjct: 324 ASTDPAGYVRALATASQA-AELTAPGGFGD-FGWLVQ 358 >gi|239929758|ref|ZP_04686711.1| hypothetical protein SghaA1_16145 [Streptomyces ghanaensis ATCC 14672] Length = 380 Score = 82.8 bits (203), Expect = 7e-14, Method: Composition-based stats. Identities = 66/337 (19%), Positives = 112/337 (33%), Gaps = 47/337 (13%) Query: 29 VADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPG 88 + P G+Y P G G F T+ S +F +A L E G P+ + V++ G Sbjct: 66 LYGPG-GFY--RRPEGPAGHFRTSVHASPLFAGAVARLLCRVDEALGRPAVLDFVDMAAG 122 Query: 89 RGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLADVPL 148 RG ++ +L + D + + Y VE + A +I W + Sbjct: 123 RGELVTGVLTAL---PADVAARVRAYAVEL--------ADRPAGLDHRIEWRADPPERVT 171 Query: 149 GFTFLVANEFF----------DSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKS 198 G L ANE+ D + + V+ ER+ + + + Sbjct: 172 G--LLFANEWLDNVPVEVVEVDPAGVPRLVLVRADGTERLGEPVAGAEARWLARWWPL-- 227 Query: 199 NFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYL--QSRVGDTLQAVK 256 G E RDR + + LA G A+ +DY + TL + Sbjct: 228 ------GGEEGLRAEIGLPRDRAWAAAAGTLAR--GLAVAVDYAHTAGARPPFGTLTGFR 279 Query: 257 GHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIW-QR--AFS 313 SP+ + D+++HV ++ L + +Q L LG+ R Sbjct: 280 EGRETSPVPDGS-CDITAHVALDACAAACALPGARLL---SQRDALRSLGLTGDRPPLAR 335 Query: 314 LMKQTARKDILLDSVKRLVSTSADKKSMGELFKILVV 350 A L + + + G+ F LV Sbjct: 336 ASTDPAGYVRALATASQA-AELTAPGGFGD-FGWLVQ 370 >gi|170087904|ref|XP_001875175.1| predicted protein [Laccaria bicolor S238N-H82] gi|164650375|gb|EDR14616.1| predicted protein [Laccaria bicolor S238N-H82] Length = 477 Score = 82.8 bits (203), Expect = 7e-14, Method: Composition-based stats. Identities = 50/330 (15%), Positives = 100/330 (30%), Gaps = 49/330 (14%) Query: 21 VDQYFALCVADPEFGYYSTCNPFG----AVGDFV-------------------------- 50 V + + +P +GY+ DF Sbjct: 87 VRDFIEDSLYNPHYGYFPKQATIFDTQKTSFDFSSFRDSVEFQEEVARKYAAYGADKHDG 146 Query: 51 -------TAPEI-SQIFGEMLAIFLICAWEQHGFPS-CVRLVELGPGRGIMMLDILRVIC 101 T E+ +G A L+ + FP + E+G G G + +DIL + Sbjct: 147 PGRQLWHTPTELFKPWYGRAAAQCLVSEYLLKYFPYEDFIIYEIGAGNGTLAMDILNFLQ 206 Query: 102 KLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKI----NWYTSLADVPLGFTFLVANE 157 + PD + ++E S L +Q+K+L S + F +A E Sbjct: 207 ERYPDVYERTRYNIIEISGSLVKLQRKKLQSLHPCVKVNHKSIFQWKTREPAPCFFIAME 266 Query: 158 FFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPC 217 D+ ++ + F++ + ++ + N P Sbjct: 267 VIDNFAHDVVRYDLRTLKPYQGVVTISKEGEFDMLYEPVSDPLISTFLKTRSHLKHNPPI 326 Query: 218 RDREMQSISDRLACDGGTAI--VIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADL--S 273 S + R + Y+ +R+ + LQ ++ H L+ +DL + Sbjct: 327 NRLLRASPALRSLYTNLPFAPNLSAVEYVPTRLLNLLQTLRNHFPRHRLLLSDFSDLPDT 386 Query: 274 SH-VDFQRLSSIAILYKLYING-LTTQGKF 301 ++ + + + + L QG F Sbjct: 387 IPGINAPVVQTRFRNVTVPCSTLLVKQGYF 416 >gi|116180546|ref|XP_001220122.1| hypothetical protein CHGG_00901 [Chaetomium globosum CBS 148.51] gi|88185198|gb|EAQ92666.1| hypothetical protein CHGG_00901 [Chaetomium globosum CBS 148.51] Length = 342 Score = 82.8 bits (203), Expect = 7e-14, Method: Composition-based stats. Identities = 39/226 (17%), Positives = 68/226 (30%), Gaps = 51/226 (22%) Query: 21 VDQYFALCVADPEFGYYSTCNPF---GAVGDF---------------------------- 49 + + + +P +GY+S G DF Sbjct: 30 MRDFIEDSLYNPTYGYFSKQVVIFTPGEPFDFPSLYDELDFHSLLSRRYVEFENAMDAVS 89 Query: 50 --------VTAPEI-SQIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRV 99 T E+ +GE +A +LI ++ +P + E+G GRG +ML+IL Sbjct: 90 PSDTRQLWYTPTELFRPYYGEAIARYLIANYKLTTYPYHDLIIYEMGAGRGTLMLNILDH 149 Query: 100 ICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYG----------DKINWYTSLADVPLG 149 I + P + ++E S +L IQ QL + Sbjct: 150 IRDVDPAVYDRTKYKIIEISTQLATIQNSQLKQSFAARGHADKVEIINQSIFDWTNPVPS 209 Query: 150 FTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHE 195 F +A E FD+ + + ++ F Sbjct: 210 PCFFLAFEVFDNFAHDILRYDMETQKPLQGMVLIDENGDFYEFYTP 255 >gi|171687677|ref|XP_001908779.1| hypothetical protein [Podospora anserina S mat+] gi|170943800|emb|CAP69452.1| unnamed protein product [Podospora anserina S mat+] Length = 527 Score = 82.8 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 37/201 (18%), Positives = 63/201 (31%), Gaps = 51/201 (25%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVG------------DF------------------- 49 + + + +P +GY+S + G DF Sbjct: 128 MRDFIEDSLYNPNYGYFSKQVVIFSPGEPFNFPSLHDEIDFQTILSKRYVEFEDALDAVS 187 Query: 50 --------VTAPEI-SQIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRV 99 T E+ +GE +A +LI ++ +P + E+G GRG +ML+IL Sbjct: 188 PTDTRQLWYTPTELFRPYYGEAIARYLIANYKLTTYPYHDLIIYEMGAGRGTLMLNILDY 247 Query: 100 ICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGD----------KINWYTSLADVPLG 149 I + P ++ ++E S +L IQ L Sbjct: 248 IRDVDPAVYARTQYKIIEISTQLATIQNNHLLKNSHARGHAQKVEIINQSIFDWKTKVPS 307 Query: 150 FTFLVANEFFDSLPIKQFVMT 170 F +A E FD+ Sbjct: 308 PCFFLAFEVFDNFAHDVVRYD 328 >gi|320010049|gb|ADW04899.1| protein of unknown function DUF185 [Streptomyces flavogriseus ATCC 33331] Length = 333 Score = 82.1 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 64/328 (19%), Positives = 102/328 (31%), Gaps = 26/328 (7%) Query: 26 ALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVEL 85 + E G+Y P G G F T+ S +F +A L + G V LV++ Sbjct: 13 ETALYGEE-GFY--RRPEGPAGHFRTSVHASPLFATAVARLLAGTARELGT-DSVDLVDV 68 Query: 86 GPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLAD 145 G GRG ++ +L + + VE + R + W+ L Sbjct: 69 GAGRGELLTGVLAALRDEALAPGLTVRARAVELAPR--------PPGLDPAVEWHAELPR 120 Query: 146 VPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLT--C 203 G L ANE+ D++P G+ ++ + + +L+ Sbjct: 121 GVRG--LLFANEWLDNVPTDVAEADADGVPRYVLVRTADGAERLGDPVTGADARWLSRWW 178 Query: 204 SDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSP 263 G E RD LA A+ + TL + V P Sbjct: 179 PLARPGDRAEIGRPRDEAWARAVASLAAGTAVAVDYAHVRASRPPFGTLTGFREGREVPP 238 Query: 264 LVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGI-WQR--AFSLMKQTAR 320 + + DL+SHV L Q L LGI +R A Sbjct: 239 VPDGS-CDLTSHVA----LDACAAAADGTAELLDQRTALRELGISGERPPLAQASADPAG 293 Query: 321 KDILLDSVKRLVSTSADKKSMGELFKIL 348 L S + + + +G+ F L Sbjct: 294 YVRALASAGQA-AELTARGGLGD-FGWL 319 >gi|154296394|ref|XP_001548628.1| hypothetical protein BC1G_13023 [Botryotinia fuckeliana B05.10] gi|150843384|gb|EDN18577.1| hypothetical protein BC1G_13023 [Botryotinia fuckeliana B05.10] Length = 428 Score = 82.1 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 38/230 (16%), Positives = 70/230 (30%), Gaps = 51/230 (22%) Query: 22 DQYFALCVADPEFGYYSTCNPF---GAVGDF----------------VT---------AP 53 + + +P +GY+S G DF T +P Sbjct: 30 RDFIEDSLYNPSYGYFSKQVVIFTPGEPFDFNSLEDEPAFHRLLGQRYTEFEDELDLKSP 89 Query: 54 EIS------------QIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVI 100 + +GE +A +LI ++ +P + E+G G G +ML+IL I Sbjct: 90 NETRQLWHTPTELFRPYYGEAIARYLINNYKISQYPYHDLIIYEMGAGNGTLMLNILDYI 149 Query: 101 CKLKPDFFSVLSIYMVETSERLTLIQKKQL------ASYGDKINWY----TSLADVPLGF 150 +P+ ++ ++E S L +QK L + K+ + Sbjct: 150 RATEPEVYARTKFKIIEISSNLASLQKSHLLRNANSRGHSSKVEIINKSVFEWNQIVPSP 209 Query: 151 TFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNF 200 F +A E FD+ + ++ F Sbjct: 210 CFFLAMEVFDNFAHDSIRYDPVTEEPLQGTVLISNTGDFYDFYTPTIDPI 259 >gi|119195547|ref|XP_001248377.1| hypothetical protein CIMG_02148 [Coccidioides immitis RS] gi|320040179|gb|EFW22112.1| DUF185 domain-containing protein [Coccidioides posadasii str. Silveira] Length = 499 Score = 81.7 bits (200), Expect = 1e-13, Method: Composition-based stats. Identities = 39/219 (17%), Positives = 66/219 (30%), Gaps = 51/219 (23%) Query: 22 DQYFALCVADPEFGYYSTCNPF---GAVGDF--------------------------VTA 52 + + +P +GY+S G DF VT Sbjct: 102 RDFIEDSLYNPYYGYFSKHATIFTPGEPFDFNNIDDGPAFNRLVDQRYAEFEDKLDAVTP 161 Query: 53 PEISQI-----------FGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVI 100 Q+ +GE +A +L+ ++ FP + E+G G G +ML+IL I Sbjct: 162 NHTRQLWHTPTELFRPYYGEAIARYLVTNYKLTLFPYHDLIIYEMGAGNGTLMLNILDYI 221 Query: 101 CKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK----------INWYTSLADVPLGF 150 ++ P+ + ++E S L Q+K L Sbjct: 222 REVDPEVYQRTKFKIIEISSHLADTQQKTLNGSIYDDGHRGHVEIINRSIFEWDTYVHSP 281 Query: 151 TFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVF 189 F +A E FD+ + R + F Sbjct: 282 CFFLALEVFDNFAHDAIRYDLQTGQPRQGCVLVDSDGEF 320 >gi|242211533|ref|XP_002471604.1| predicted protein [Postia placenta Mad-698-R] gi|220729280|gb|EED83157.1| predicted protein [Postia placenta Mad-698-R] Length = 428 Score = 81.7 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 37/225 (16%), Positives = 69/225 (30%), Gaps = 41/225 (18%) Query: 21 VDQYFALCVADPEFGYY-------STCNPF----------------------GAVGD--- 48 V + + +P +GY+ +T +P G GD Sbjct: 76 VRDFIEDSLYNPHYGYFPKQADIFTTTDPIHFTSLRNTVEFQEEVGRRYAEYGPDGDGPG 135 Query: 49 ---FVTAPEI-SQIFGEMLAIFLICAWEQHGFPS-CVRLVELGPGRGIMMLDILRVICKL 103 + T E+ +G+ +A L+ + FP + E+G G G + DIL I + Sbjct: 136 RQIWHTPTELFQPWYGQAIAQCLVSEYLLKYFPYEDFVIYEIGAGNGTLARDILDYIQER 195 Query: 104 KPDFFSVLSIYMVETSERLTLIQKKQLASYGDKI----NWYTSLADVPLGFTFLVANEFF 159 P+ + ++E S L +Q+++LA + F +A E Sbjct: 196 YPEVYDRTRYRIIEISGNLARLQREKLADKHPCVDIVHKSIFRWDTRESAPCFFLAMEVI 255 Query: 160 DSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCS 204 D+ + + F + T Sbjct: 256 DNFAHDMIRYDLRTLEPYQGLVTIDAHGDFGTHYTRTRYRNETVP 300 >gi|328883288|emb|CCA56527.1| Conserved hypothetical protein [Streptomyces venezuelae ATCC 10712] Length = 376 Score = 81.7 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 63/313 (20%), Positives = 99/313 (31%), Gaps = 52/313 (16%) Query: 26 ALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVEL 85 + P G+Y P G VG F T+ S +F +A L E+ G LV++ Sbjct: 16 ERALYGPG-GFYL--RPEGPVGHFRTSVHASPLFAAAVARLLAEVAEELGTTEID-LVDV 71 Query: 86 GPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLAD 145 G GRG ++ V+ + Y VE + R A +I W L Sbjct: 72 GAGRGELLTG---VLAVAGDAHDLTVRPYAVERAPR--------PAGLDPRIEWSDRLPS 120 Query: 146 VPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSD 205 G + ANE+ D++P+ G + + Sbjct: 121 GVRG--LVFANEWLDNVPVDVAEADAEGTVRYVEVRTDGTERLGGPVSGPDAEWLARWWP 178 Query: 206 YFL-GAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQ--SRVGDTLQAVKGHTYVS 262 GA E RD + LA G A+ +DY +++ +L + V Sbjct: 179 LREPGARAEIGRPRDEAWAAAVGSLAA--GRAVAVDYAHVRESRPPFGSLTGFRAGREVP 236 Query: 263 PLVNPGQADLSSHVDFQRL-----------------------------SSIAILYKLYIN 293 P+ + DL+SHV +S + Sbjct: 237 PVPDGS-CDLTSHVALDACAAAGGDGGGASEAAAGHGGGDPEAAGGHGASGSRGGGWPEA 295 Query: 294 GLTTQGKFLEGLG 306 G+ TQ + L LG Sbjct: 296 GIVTQREALGRLG 308 >gi|317123769|ref|YP_004097881.1| hypothetical protein Intca_0609 [Intrasporangium calvum DSM 43043] gi|315587857|gb|ADU47154.1| protein of unknown function DUF185 [Intrasporangium calvum DSM 43043] Length = 325 Score = 81.3 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 60/329 (18%), Positives = 113/329 (34%), Gaps = 33/329 (10%) Query: 18 QMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAW-EQHGF 76 + Q + + P+ G+Y T G F TA + G +LA L+ W + G Sbjct: 3 PVPWHQAWQAALYAPDLGFYVTRG--GPSAHFTTA--THGVPGAVLARALLRLWHRERGE 58 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 +V++G GRG + ++ + P V + I + Sbjct: 59 SPPAVVVDVGAGRGELASHLVGA---VDPRTSVVAVDVVPRPEGLDARISWVESPGGAAL 115 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFN--IGDH 194 L D ++ +E+ D +P M + G ++ + + Sbjct: 116 PEDLDRLDD-----ALVIGHEWLDVVPCVIAEMDQEGHLREVLVAPDTGAERLGGPLEPL 170 Query: 195 EIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQ--SRVGDTL 252 + D G E RD + R+ G + IDYG+L TL Sbjct: 171 DAAWVEAHWPDASPGDRVEVGRARDTAWADLVRRVER--GLVVAIDYGHLVGGRPSQGTL 228 Query: 253 QAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAF 312 A + T P+ + D+++HV L++ + TQ L LG+ R Sbjct: 229 TAYRSGTQTEPVPDGS-CDITAHVAMDSLAADLLA---------TQRDVLRSLGV--RGA 276 Query: 313 SLMKQTARKDILLDSVKRLVSTSADKKSM 341 + A++D + + R + ++ + + Sbjct: 277 TPEHALAQRDPM--AYLRALERASAEAEL 303 >gi|302686936|ref|XP_003033148.1| hypothetical protein SCHCODRAFT_67112 [Schizophyllum commune H4-8] gi|300106842|gb|EFI98245.1| hypothetical protein SCHCODRAFT_67112 [Schizophyllum commune H4-8] Length = 496 Score = 81.3 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 45/333 (13%), Positives = 96/333 (28%), Gaps = 53/333 (15%) Query: 21 VDQYFALCVADPEFGYY--------STCNPF---------------------------GA 45 V + + +P +GY+ + PF G Sbjct: 105 VRDFIEDSLYNPNYGYFPNQAAILDTRNQPFEFNKMRNLAEFQERIAEKYLEYGEEKPGT 164 Query: 46 VG----DFVTAPEI-SQIFGEMLAIFL-ICAWEQHGFPSCVRLVELGPGRGIMMLDILRV 99 +G T E+ +G+ LA L ++ + E+G G G + DIL Sbjct: 165 LGRQLWH--TPTELFRPWYGQALARCLSAEYLLKYFPYDDFNIYEIGAGNGTLAKDILDY 222 Query: 100 ICKLKPDFFSVLSIYMVETSERLTLIQKKQLASY---GDKINWYTSLADVPLGFTFLVAN 156 + + P + ++E S L +QK++L + F +A Sbjct: 223 LRDVYPAVYERTRYNIIEISGNLAELQKRRLRDHDCVHVHNKSVFHWNTHDPAPCFFIAT 282 Query: 157 EFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSP 216 E D+ ++ + F+ + + + P Sbjct: 283 EVVDNFAHDAIRWDLQTLKPHQEMVVIDHEGEFDSLYTPVTDPLIKDYLRLREYLNHPPP 342 Query: 217 C--RDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSS 274 R +++ + Y+ +R+ ++ ++ P D S Sbjct: 343 VNRLLRASETLRRAFVNLPLAPNMSQVEYIPTRLLSLIRTLRN---YFPRHRLLLTDFSE 399 Query: 275 HVD-FQRLSSIAILYKLYINGLTTQGKFLEGLG 306 D +++ + + + +TT L G Sbjct: 400 LPDAIPGINAPVVQMRWENSSITT-STLLVKHG 431 >gi|224059974|ref|XP_002300022.1| predicted protein [Populus trichocarpa] gi|222847280|gb|EEE84827.1| predicted protein [Populus trichocarpa] Length = 52 Score = 80.9 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 19/47 (40%), Positives = 32/47 (68%) Query: 15 KNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGE 61 + G ++V +Y + +P+FG+Y + FG DF+T+PE+SQ+FGE Sbjct: 6 RGGSISVAEYMEEVLMNPKFGFYINRDVFGVERDFITSPEVSQMFGE 52 >gi|317036299|ref|XP_001398051.2| hypothetical protein ANI_1_1002144 [Aspergillus niger CBS 513.88] Length = 514 Score = 80.9 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 35/219 (15%), Positives = 66/219 (30%), Gaps = 51/219 (23%) Query: 22 DQYFALCVADPEFGYYSTCNPFGAVG---------D---FV----------------TAP 53 ++ + + +GY+S G D F T P Sbjct: 115 REFIDDSLYNHNYGYFSKHATIFRPGEPFYFNEIEDGPAFYRMLGERYHEFEDQLDETNP 174 Query: 54 EIS------------QIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVI 100 +++ +GE +A +L+ ++ +P + E+G G G MM++IL I Sbjct: 175 DVARQLWHTPTELFRPYYGETIARYLVSNYKLTLYPYHDLIIYEMGAGNGTMMMNILDYI 234 Query: 101 CKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKI----------NWYTSLADVPLGF 150 + + ++E S L +Q + L D Sbjct: 235 RDTDYEVYQRTKYKIIEISPSLASLQMQNLTDSLDAAGHSDRVEIINRSIFDWDTYVHSP 294 Query: 151 TFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVF 189 F +A E FD+ Q + + + F Sbjct: 295 CFFLALEVFDNFSHDQIRYDLKTELPQQGGVLIDANGEF 333 >gi|156054680|ref|XP_001593266.1| hypothetical protein SS1G_06188 [Sclerotinia sclerotiorum 1980] gi|154703968|gb|EDO03707.1| hypothetical protein SS1G_06188 [Sclerotinia sclerotiorum 1980 UF-70] Length = 402 Score = 80.2 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 36/230 (15%), Positives = 68/230 (29%), Gaps = 51/230 (22%) Query: 22 DQYFALCVADPEFGYYSTCNPF---GAVGDF----------------VT---------AP 53 + + +P +GY+S G DF T +P Sbjct: 4 RDFIEDSLYNPSYGYFSKQVVIFTPGEPFDFNSLEDEPAFHRLLGQRYTEFEDELDLKSP 63 Query: 54 EIS------------QIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVI 100 + +GE +A +L+ ++ +P + E+G G G +ML+IL I Sbjct: 64 NETRQLWHTPTELFRPYYGEAIARYLVNNYKIWQYPYHDLIIYEMGAGNGTLMLNILDYI 123 Query: 101 CKLKPDFFSVLSIYMVETSERLTLIQKK----------QLASYGDKINWYTSLADVPLGF 150 +P+ ++ ++E S L +QK L+ + Sbjct: 124 RLTEPEVYARTKFKIIEISSNLASLQKSHLLRNANSRGHLSKVEIINKSVFEWNQIVPSP 183 Query: 151 TFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNF 200 F +A E FD+ + ++ F Sbjct: 184 CFFLAMEVFDNFAHDSIRYDPVTEEPLQGTVLISNTGEFYEFYTPTIDPI 233 >gi|302535178|ref|ZP_07287520.1| conserved hypothetical protein [Streptomyces sp. C] gi|302444073|gb|EFL15889.1| conserved hypothetical protein [Streptomyces sp. C] Length = 337 Score = 80.2 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 57/292 (19%), Positives = 101/292 (34%), Gaps = 28/292 (9%) Query: 17 GQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGF 76 G + + P G+Y G G F T+ S ++ +A L + G Sbjct: 18 GPVRWRAAMEAALYGPG-GFYVRPGGPGPAGHFRTSVHASALYAAAVARLLEWVDAELGR 76 Query: 77 PSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK 136 P+ + LV++G GRG ++ V+ L + + + Y VE + + Sbjct: 77 PARLDLVDMGAGRGELLAG---VLAALPAETAARVRPYGVER--------AAEPEGLDPR 125 Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + W + G L ANE+ D++P+ + + R D + Sbjct: 126 VRWVAEPPEGATG--LLFANEWLDNVPLDVAEDGRYVLVARDGTETPGG--PLEDADRDW 181 Query: 197 KSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQ--SRVGDTLQA 254 + + G E RD + L G A+ +DY + + TL Sbjct: 182 LERWWPGAAGPDGGRAEIGRARDEAWAAAVATLDR--GLAVAVDYAHTRAARPPYGTLTG 239 Query: 255 VKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 + +P+ + G D+++HV + GL TQ L LG Sbjct: 240 FRDGRETAPVPDGG-CDVTAHVALDACAGPGA-------GLLTQRAALTALG 283 >gi|115491225|ref|XP_001210240.1| conserved hypothetical protein [Aspergillus terreus NIH2624] gi|114197100|gb|EAU38800.1| conserved hypothetical protein [Aspergillus terreus NIH2624] Length = 427 Score = 79.8 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 38/222 (17%), Positives = 67/222 (30%), Gaps = 51/222 (22%) Query: 22 DQYFALCVADPEFGYYSTCNPF---GAVGDF----------------------------- 49 ++ + +P +GY+S G DF Sbjct: 30 REFIDDSLYNPHYGYFSKHATIFTPGEPFDFNSMADGPTFHRLLEERYTEFEDRLDEIQP 89 Query: 50 -------VTAPEI-SQIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVI 100 T E+ +GE +A +L+ ++ +P + E+G G G MML+IL I Sbjct: 90 DTARQLWHTPTELFKPYYGETIARYLVSNYKLTLYPYHDLIIYEMGAGNGTMMLNILDFI 149 Query: 101 CKLKPDFFSVLSIYMVETSERLTLIQKKQL------ASYGDKI----NWYTSLADVPLGF 150 + + ++E S L +Q K L A + D I Sbjct: 150 RDTDYEVYQRTKFKIIEISPALASLQMKNLMDSAYAAGHLDHIEIINKSIFDWDTYVHSP 209 Query: 151 TFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIG 192 F +A E D+ + + + F+ Sbjct: 210 CFFLALEVIDNFSHDAIRYDRKTEQPQQGGVLIDADGEFHEY 251 >gi|326778532|ref|ZP_08237797.1| protein of unknown function DUF185 [Streptomyces cf. griseus XylebKG-1] gi|326658865|gb|EGE43711.1| protein of unknown function DUF185 [Streptomyces cf. griseus XylebKG-1] Length = 337 Score = 79.0 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 59/288 (20%), Positives = 101/288 (35%), Gaps = 24/288 (8%) Query: 26 ALCVADPEFGYYST--CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLV 83 + E G+Y + +P G G F T+ S +F +A L + G V LV Sbjct: 13 QAALYGDE-GFYRSPLRSPRGPAGHFRTSVHASPLFAAAVARLLTGTARELGT-GTVALV 70 Query: 84 ELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSL 143 ++G GRG + +L + L ++ Y VE + +I W Sbjct: 71 DVGSGRGELPTGVLAALDALPGRPD--VTAYAVEV--------AARPPGLDPRIEWCARP 120 Query: 144 ADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLV--FNIGDHEIKSNFL 201 + G L ANE+ D++P + G+ + + + ++ Sbjct: 121 PEGVTG--LLFANEWLDNVPAEVAEADPDGVPRYVRVRASDGAERLGEPVSGADLAWLER 178 Query: 202 TCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQ--SRVGDTLQAVKGHT 259 G E RD L G A+ +DY +++ TL +G Sbjct: 179 WWPLSAPGERAEIGRPRDTAWARAVGSLTA--GLAVAVDYPHVRGGRPPFGTLTGFRGGR 236 Query: 260 YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYIN-GLTTQGKFLEGLG 306 V P+ + DL++HV ++ + + LT Q L LG Sbjct: 237 EVRPVPDGS-CDLTAHVALDACAAAVAEAGIDASTELTDQRAALRRLG 283 >gi|320584180|gb|EFW98391.1| High affinity polyamine permease [Pichia angusta DL-1] Length = 1029 Score = 78.6 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 38/234 (16%), Positives = 78/234 (33%), Gaps = 55/234 (23%) Query: 22 DQYFALCVADPEFGYYSTCNPF-------------------------------------- 43 + + +P +GY++ Sbjct: 4 SDFIEDSLYNPSYGYFAKQATIFQHDEPIVYKELSGQDEFMQKWISAYDQYDQKMVPTLK 63 Query: 44 -GAVGDF-VTAPEI------SQIF----GEMLAIFLICAWEQHGFPSCV-RLVELGPGRG 90 F +T P + +++F GE LA +L+ ++ + +P + E+G G G Sbjct: 64 HSKENHFQLTKPSLQLWHTPTELFQPYYGEALARYLLVNYKLNQYPYNDLTIYEIGGGNG 123 Query: 91 IMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYT----SLADV 146 +M +IL I + +P+ + +VE S RL Q K + + K++ Sbjct: 124 TLMTNILDFIQRTQPEVYERTRYNIVEISSRLFDKQIKNKSKHSHKVHLINQSVLDWNKP 183 Query: 147 PLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNF 200 + F+VA E FD+L + + + ++ F + + Sbjct: 184 VIEPCFVVALEVFDNLAHDVVRYDINTNQPYQGYVVIDENNDFKEVFSPELNPW 237 >gi|169766396|ref|XP_001817669.1| hypothetical protein AOR_1_1092174 [Aspergillus oryzae RIB40] gi|238483107|ref|XP_002372792.1| conserved hypothetical protein [Aspergillus flavus NRRL3357] gi|83765524|dbj|BAE55667.1| unnamed protein product [Aspergillus oryzae] gi|220700842|gb|EED57180.1| conserved hypothetical protein [Aspergillus flavus NRRL3357] Length = 503 Score = 78.2 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 37/220 (16%), Positives = 68/220 (30%), Gaps = 51/220 (23%) Query: 22 DQYFALCVADPEFGYYSTCNPF---GAVGDF----------------VT---------AP 53 ++ + +P +GY+S G DF T P Sbjct: 106 REFIDDSLYNPHYGYFSKHATIFSPGEPFDFNNIEDGPAFHRMLGDRYTEFEDHLDEVQP 165 Query: 54 EIS------------QIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVI 100 +I+ +GE +A +L+ ++ +P + E+G G G MM++IL I Sbjct: 166 DIARQLWHTPTELFRPYYGETIARYLVSNYKLTLYPYHDLIIYEMGAGNGTMMINILDFI 225 Query: 101 CKLKPDFFSVLSIYMVETSERLTLIQKKQL------ASYGDKI----NWYTSLADVPLGF 150 + + ++E S L +Q K L A + D + Sbjct: 226 RDTDYEVYQRTKFKIIEISSNLAGLQMKNLMDSINAAGHLDHVEIINKSIFDWDTYVHSP 285 Query: 151 TFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFN 190 F +A E D+ + + F+ Sbjct: 286 CFFLALEVIDNFSHDAIRYDTATELPQQGGVLIDADGEFH 325 >gi|119484572|ref|XP_001262065.1| COG1565 domain protein [Neosartorya fischeri NRRL 181] gi|119410221|gb|EAW20168.1| COG1565 domain protein [Neosartorya fischeri NRRL 181] Length = 440 Score = 78.2 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 65/200 (32%), Gaps = 51/200 (25%) Query: 22 DQYFALCVADPEFGYYSTCNPF---GAVGDFV-------------------------TAP 53 ++ + +P +GY+S G DF T P Sbjct: 43 REFIDDSLYNPHYGYFSKHATIFSPGEPFDFNNIEDGPAFHRMLGERYTEFEDKLDETQP 102 Query: 54 EIS------------QIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVI 100 +++ +GE +A +L+ ++ +P + E+G G G MM++IL I Sbjct: 103 DVARQLWHTPTELFRPYYGETVARYLVSNYKLTLYPYHDLIIYEMGAGNGTMMINILDFI 162 Query: 101 CKLKPDFFSVLSIYMVETSERLTLIQKKQL------ASYGDKI----NWYTSLADVPLGF 150 + + ++E S L +Q K L A + D + Sbjct: 163 RDTDYEVYQRTKFKIIEISPALASLQMKNLTDSVNAAGHMDHVEIINRSIFDWDTYVHSP 222 Query: 151 TFLVANEFFDSLPIKQFVMT 170 F +A E FD+ Sbjct: 223 CFFLALEVFDNFSHDAIRYD 242 >gi|70983594|ref|XP_747324.1| conserved hypothetical protein [Aspergillus fumigatus Af293] gi|66844950|gb|EAL85286.1| conserved hypothetical protein [Aspergillus fumigatus Af293] gi|159123670|gb|EDP48789.1| conserved hypothetical protein [Aspergillus fumigatus A1163] Length = 440 Score = 77.8 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 65/200 (32%), Gaps = 51/200 (25%) Query: 22 DQYFALCVADPEFGYYSTCNPF---GAVGDFV-------------------------TAP 53 ++ + +P +GY+S G DF T P Sbjct: 43 REFIDDSLYNPHYGYFSKHATIFSPGEPFDFNNIEDGPAFHRMLGERYTEFENKLDETQP 102 Query: 54 EIS------------QIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVI 100 +++ +GE +A +L+ ++ +P + E+G G G MM++IL I Sbjct: 103 DVARQLWHTPTELFRPYYGETVARYLVSNYKLTLYPYHDLIIYEMGAGNGTMMINILDFI 162 Query: 101 CKLKPDFFSVLSIYMVETSERLTLIQKKQL------ASYGDKI----NWYTSLADVPLGF 150 + + ++E S L +Q K L A + D + Sbjct: 163 RDTDYEVYQRTKFKIIEISPALASLQMKNLTDSVNAAGHMDHVEIINRSIFDWDTYVHSP 222 Query: 151 TFLVANEFFDSLPIKQFVMT 170 F +A E FD+ Sbjct: 223 CFFLALEVFDNFSHDAIRYD 242 >gi|311896795|dbj|BAJ29203.1| hypothetical protein KSE_33950 [Kitasatospora setae KM-6054] Length = 334 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 62/330 (18%), Positives = 101/330 (30%), Gaps = 28/330 (8%) Query: 25 FALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVE 84 + P+ G+Y P G G F T+ S +F + L G P + V+ Sbjct: 10 IEHALYRPDGGFYRG--PQGPAGHFRTSVHASALFAGAVVRLLAGVDAALGRPERLAFVD 67 Query: 85 LGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLA 144 +G GRG + L + ++ + +Y VE + A ++W Sbjct: 68 VGAGRGELTLAVRGLL---PAGLRDRVDLYAVEL--------ADRPAGLPAGVDWRAEPP 116 Query: 145 DVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCS 204 G L ANE+ D++P+ G + D + Sbjct: 117 AGVRG--LLFANEWLDNVPLDVAERGPDGALRYVEVAPDGTERPGGPLDPADAAWARRW- 173 Query: 205 DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPL 264 + G E RD + LA A + TL + +P+ Sbjct: 174 -WPDGERVELGGPRDAAWAAAVGALAGGLAVAADYAHTAAARPPFGTLTGFRAGRECAPV 232 Query: 265 VNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGI-WQR--AFSLMKQTARK 321 + DL++HV ++ L TTQ L LG+ R A Sbjct: 233 PDGS-TDLTAHVALDAAAAPGHPALL-----TTQRAALRALGVSGARPPLALASADPAAY 286 Query: 322 DILLDSVKRLVSTSADKKSMGELFKILVVS 351 L + D +G F LV + Sbjct: 287 LRALSGAGEA-AELTDPAGLG-AFHWLVQA 314 >gi|322694714|gb|EFY86536.1| hypothetical protein MAC_07398 [Metarhizium acridum CQMa 102] Length = 531 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 40/214 (18%), Positives = 67/214 (31%), Gaps = 52/214 (24%) Query: 28 CVADPEFGYYSTCNPF---GAVGDFVT-----A--PEIS--------------------- 56 + +P +GY+S G DF T A E+ Sbjct: 138 SLYNPSYGYFSKQAVIFSPGEPFDFTTLRDDLAFQSELGRRYTSFEDHLDDVEGENPTRQ 197 Query: 57 ----------QIFGEMLAIFLICAWEQHGFPSCVR-LVELGPGRGIMMLDILRVICKLKP 105 +GE +A +L+ + +P + E+G GRG +ML+IL I ++ P Sbjct: 198 LWHTPTELFRPYYGEAIARYLVTNYRLTTYPYDDLLIYEMGAGRGTLMLNILDYIREVDP 257 Query: 106 DFFSVLSIYMVETSERLTLIQKKQL------ASYGDKI----NWYTSLADVPLGFTFLVA 155 ++ ++E S L +Q K L + DK+ F +A Sbjct: 258 QVYARTRYNIIEISTNLASLQNKHLLSTAESRGHTDKVDIVNRSIFGWDQYVPSPCFFLA 317 Query: 156 NEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVF 189 E FD+ + F Sbjct: 318 MEVFDNFSHDCIRYDVATEEPLQGHVLIDGDGDF 351 >gi|282866347|ref|ZP_06275392.1| protein of unknown function DUF185 [Streptomyces sp. ACTE] gi|282558743|gb|EFB64300.1| protein of unknown function DUF185 [Streptomyces sp. ACTE] Length = 330 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 60/285 (21%), Positives = 99/285 (34%), Gaps = 24/285 (8%) Query: 26 ALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVEL 85 + + G+Y P G G F T+ S +F +A L+ + G V LV++ Sbjct: 13 RTALYG-DGGFY--RRPEGPAGHFRTSVHASGLFASAVARLLVRTARELGS-DTVDLVDV 68 Query: 86 GPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLAD 145 G GRG ++ +L + P + Y VE + R I W + Sbjct: 69 GAGRGELLTGVLAALPSEAPSL--TVRAYAVEIAPR--------PPGLRPDITWGAGIPQ 118 Query: 146 VPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDH--EIKSNFLTC 203 G L ANE+ D++P G+ R++ + + + Sbjct: 119 GARG--LLFANEWLDNVPADVAEADADGVARRVLVRRSDGAERLGDPVTGADARWLERWW 176 Query: 204 SDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQ--SRVGDTLQAVKGHTYV 261 G E RD LA GG A+ +DY +++ TL + V Sbjct: 177 PLARPGDRAEIGRPRDEAWAEAVGSLA--GGLAVAVDYAHVRAARPPFGTLTGFRSGREV 234 Query: 262 SPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 P+ + +DL++HV ++ L Q L LG Sbjct: 235 PPVPDGS-SDLTAHVALDACAAATAHLGGGAE-LLDQRTALRELG 277 >gi|297683498|ref|XP_002819414.1| PREDICTED: protein midA homolog, mitochondrial-like [Pongo abelii] Length = 187 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 44/159 (27%), Positives = 70/159 (44%), Gaps = 10/159 (6%) Query: 195 EIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQA 254 E K F+ + FE P + +S +A GG A+V DYG+ + +T + Sbjct: 2 EKKVKFMLSIPHMEQD-FEVFPDAGVITEELSQCIALTGGAAMVADYGHDGT-NTETFRG 59 Query: 255 VKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSL 314 GH + L+ PG ADL + VDF L +A K+ G Q F + +GI + L Sbjct: 60 FCGHKFHDVLIAPGAADLKADVDFSYLQRMAQE-KVASLGPIKQYTFFKNMGIEVQLKIL 118 Query: 315 MKQTAR---KDILLDSVKRLVSTSADKKSMGELFKILVV 350 ++++ + LL L+ + K MGE I + Sbjct: 119 LEKSNETSVRQQLLQGYDMLM----NPKKMGERLNIFAL 153 >gi|121719862|ref|XP_001276629.1| COG1565 domain protein [Aspergillus clavatus NRRL 1] gi|119404841|gb|EAW15203.1| COG1565 domain protein [Aspergillus clavatus NRRL 1] Length = 504 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 44/284 (15%), Positives = 82/284 (28%), Gaps = 56/284 (19%) Query: 22 DQYFALCVADPEFGYYSTCNPF---GAVGDFV-------------------------TAP 53 ++ + +P +GY+S G +F T P Sbjct: 107 REFIDDSLYNPHYGYFSKHATIFSPGEPFEFNNIEDGPAFHRMLGERYTEFEDKLDETKP 166 Query: 54 EIS------------QIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVI 100 +I+ +GE +A +L+ ++ +P + E+G G G MM++IL I Sbjct: 167 DIARQLWHTPTELFRPYYGETIARYLVSNYKLTLYPYHDLIIYEMGAGNGTMMINILDFI 226 Query: 101 CKLKPDFFSVLSIYMVETSERLTLIQKKQL------ASYGDKI----NWYTSLADVPLGF 150 + + ++E S L +Q K L A + D + Sbjct: 227 RDTDYEVYQRTKFKIIEISPALASLQMKNLTDSINAAGHMDHVEIINRSIFDWDTYVHSP 286 Query: 151 TFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGA 210 F +A E FD+ + I F+ + + FL Sbjct: 287 CFFLALEVFDNFSHDAIRYDTKTELPQQGSILIDADGEFHEYYNP---QIDPVASRFL-- 341 Query: 211 IFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQA 254 + R + +L ++ Y T Sbjct: 342 RVRQAAARRPFPSPLGPKLMRQVRGSLPFQSQYTMPEYIPTRLM 385 >gi|50556550|ref|XP_505683.1| YALI0F20878p [Yarrowia lipolytica] gi|49651553|emb|CAG78492.1| YALI0F20878p [Yarrowia lipolytica] Length = 530 Score = 77.1 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 50/343 (14%), Positives = 100/343 (29%), Gaps = 85/343 (24%) Query: 18 QMTVDQYFALCVADPEFGYY-------STCNPF--------------------------- 43 +MT + + +P +GY+ +T PF Sbjct: 97 RMTAKDFMDDSLYNPYYGYFSRNVEIFTTDKPFDYTNINDVDDFLNTWTGEYSKYAANSP 156 Query: 44 ---GA-------------VGDFV----------TAPEI-SQIFGEMLAIFLICAWEQHGF 76 G DF T E+ +GE LA +L+ ++ + Sbjct: 157 TLTGKTKDTSHSGTFPAQDKDFTSRGASKQLWHTPTELFKPYYGEALARYLLVNYKLSLY 216 Query: 77 PSCV-RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGD 135 P + E+G G G +M +IL I +P+ + ++E S++L Q+ L + Sbjct: 217 PYKDLIIYEMGGGNGTLMTNILDYIRDTQPEVYERTRYKVIEISDQLAHKQQSALHTKAS 276 Query: 136 KINW----------YTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHD 185 + + F VA E FD+ ++ + Sbjct: 277 ETGHRSKVEIINKSIFDWKEPVNDPCFFVALEVFDNFAHDIVRYDNRTLQPYQGHVLVDA 336 Query: 186 SLVFNIGDHEIKSNFLT---------CSDYFLGAIFENSPCRDREMQSISDRLACDGGTA 236 + F+ + + DY + P + + L + Sbjct: 337 NGDFHEVYTQKLDEWTKLFLQIRTEALEDYSPDSRLGFHPLSTPWLYKEARNLM-WPFHS 395 Query: 237 IVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 + D ++ +R + L +K P +D + + Sbjct: 396 ELSDPEFIPTRYLEFLYILKK---YFPQARLLSSDFTHLPETT 435 >gi|320588314|gb|EFX00783.1| cog1565 domain containing protein [Grosmannia clavigera kw1407] Length = 521 Score = 76.7 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 38/221 (17%), Positives = 69/221 (31%), Gaps = 66/221 (29%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVG-----------------------DF-------- 49 + + + +P +GY+S G DF Sbjct: 102 MRDFIEDSLYNPHYGYFSKQVVIFTPGEPFRFTEMRDEPEFSAILSQRYSDFEDGLDAAA 161 Query: 50 --------------VTAPEI-SQIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMM 93 T E+ +GE +A +L+ + FP + E+G GRG MM Sbjct: 162 VAQGGEPSETRQLWYTPTELFRPHYGEAVARYLVANYMLTSFPYDDLIIYEMGAGRGTMM 221 Query: 94 LDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQL---------------ASYGDKIN 138 L+IL + +P ++ ++E S L +Q+++L +GD++ Sbjct: 222 LNILDYLRVHEPAVYARTRYRIIEISSSLASLQRRELHGSPATSPAATAAADRGHGDRVE 281 Query: 139 WY----TSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIR 175 F +A E FD+ + Sbjct: 282 IINQSVFDWTTPEPSPCFFLAFEVFDNFAHDCVRYDLASQQ 322 >gi|291297630|ref|YP_003508908.1| hypothetical protein Snas_0094 [Stackebrandtia nassauensis DSM 44728] gi|290566850|gb|ADD39815.1| protein of unknown function DUF185 [Stackebrandtia nassauensis DSM 44728] Length = 311 Score = 76.7 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 63/331 (19%), Positives = 113/331 (34%), Gaps = 40/331 (12%) Query: 19 MTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPS 78 + Q + P G+Y+ +P F T+ + +F +A + + P Sbjct: 2 LPWSQAMRRALYGPG-GFYTLNSP---NRHFRTSAQF-PLFATAIAELVRRTDSRLDHPE 56 Query: 79 CVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKIN 138 LV++G G G ++ +L D L VE ++I Sbjct: 57 VFNLVDVGAGSGELLRRVLDE-----EDLNGRLRPVAVELRP--------APPGLDERIE 103 Query: 139 WYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE--HGIRERMIDIDQHDSLVFNIGDHEI 196 W TSL +G L+A E+ D++P+ V E + + + Sbjct: 104 WRTSLPGGIIG--LLMACEYLDNMPLDIAVADAAGAVRYELVDPGSGETRPGQPVSSVDA 161 Query: 197 KSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYL--QSRVGDTLQA 254 G E RD+E + ++ GT + +DYG+ T+ Sbjct: 162 DWLRAWWPLGQAGQRAEIGRSRDQEWARLCGNVSR--GTVVAVDYGHTKDCRPPLGTVTG 219 Query: 255 VKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGI-WQRAFS 313 V+P+ + + D+++HV L GL TQ L LGI R Sbjct: 220 FLDGHQVAPVPDGTR-DITAHVAMDSLGDG---------GLWTQRDALRELGISGSRPPL 269 Query: 314 LMKQTARKDILLDSVKRL--VSTSADKKSMG 342 + +T + + ++ R + D +G Sbjct: 270 ELSRTNPAEYIR-ALSRAGDAAELTDPAGLG 299 >gi|255554130|ref|XP_002518105.1| conserved hypothetical protein [Ricinus communis] gi|223542701|gb|EEF44238.1| conserved hypothetical protein [Ricinus communis] Length = 444 Score = 75.9 bits (185), Expect = 8e-12, Method: Composition-based stats. Identities = 33/236 (13%), Positives = 68/236 (28%), Gaps = 47/236 (19%) Query: 2 ENKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC--------------------- 40 + I + + V + + P GY+S Sbjct: 27 TSPFQALYSTHIVGDKPILVRDFIHSALYHPLHGYFSQRSRSVGVLEKSIKFNQLQGRKA 86 Query: 41 -----NPFGAVGD---FVTAPEI-SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGI 91 + D F T E+ + + +A ++ +R+ E+G G G Sbjct: 87 YLNHLDKIYKQSDISWF-TPVELFNPWYAHGIAEAIM---RTANLSVPLRIYEIGGGSGT 142 Query: 92 MMLDILRVICKLKPDF-FSVLSIYMVETSERLTLIQKKQLASYGDKINWYT--------- 141 I+ I P ++ ++ VE S L IQK+ + ++ + Sbjct: 143 CAKGIMDYIMLNAPSRVYNTMTYTSVEISPSLAEIQKETVGEVRSHLSKFRVECRDAADR 202 Query: 142 -SLADVPLGFTFLVANEFFDSLPIKQF--VMTEHGIRERMIDIDQHDSLVFNIGDH 194 DV +++ E D+LP +E ++ + + Sbjct: 203 SGWGDVEQQPCWVIMLEVLDNLPHDLIYSENQILPWKEVWVEKQHDKKTLSELYKP 258 >gi|302754006|ref|XP_002960427.1| hypothetical protein SELMODRAFT_75281 [Selaginella moellendorffii] gi|300171366|gb|EFJ37966.1| hypothetical protein SELMODRAFT_75281 [Selaginella moellendorffii] Length = 406 Score = 75.9 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 32/223 (14%), Positives = 71/223 (31%), Gaps = 44/223 (19%) Query: 21 VDQYFALCVADPEFGYYSTC-NPFGAVG---DF-------------------------VT 51 V Y + P+FGY+S+ + G++ DF T Sbjct: 1 VRDYIRQALYHPKFGYFSSRPDVVGSMREPLDFSGISGGRFAYRKHIAELYKQNDMSWFT 60 Query: 52 APEI-SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV 110 E+ +G +A +++ +++ E+G G G ++L I + + D ++ Sbjct: 61 PVELFQPWYGFAVAEYILQT---MNPQFPLKIYEIGGGTGTCASNVLGYIRERQGDVYNT 117 Query: 111 LSIYMVETSERLTLIQKKQLAS----------YGDKINWYTSLADVPLGFTFLVANEFFD 160 + V+ SE L Q+ +L + + ++V E D Sbjct: 118 MHYTSVDVSEGLANKQRHRLCEEEEHGDKFSVECRNARDKSGWGEQDESPCYVVMLEVLD 177 Query: 161 SLPIKQFVMTEHGIRERMIDIDQH-DSLVFNIGDHEIKSNFLT 202 ++P + F ++ + ++ Sbjct: 178 NMPHDLLFRESSKSPWMETYVQYDSSESRFVEHHRPVEDSLVS 220 >gi|302767740|ref|XP_002967290.1| hypothetical protein SELMODRAFT_86626 [Selaginella moellendorffii] gi|300165281|gb|EFJ31889.1| hypothetical protein SELMODRAFT_86626 [Selaginella moellendorffii] Length = 406 Score = 75.1 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 32/223 (14%), Positives = 70/223 (31%), Gaps = 44/223 (19%) Query: 21 VDQYFALCVADPEFGYYSTC-NPFGAVG---DF-------------------------VT 51 V Y + P+FGY+S+ + G++ DF T Sbjct: 1 VRDYIRQALYHPKFGYFSSRPDVVGSMREPLDFSGISGGRFAYRKHIAELYKQNDMSWFT 60 Query: 52 APEI-SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV 110 E+ +G +A +++ +++ E+G G G ++L I + + D + Sbjct: 61 PVELFQPWYGFAVAEYILQT---MNPQFPLKIYEIGGGTGTCASNVLGYIRERQGDVYKT 117 Query: 111 LSIYMVETSERLTLIQKKQLAS----------YGDKINWYTSLADVPLGFTFLVANEFFD 160 + V+ SE L Q+ +L + + ++V E D Sbjct: 118 MHYTSVDVSEGLANKQRHRLCEEEEHGDKFSVECRNARDKSGWGEQDESPCYVVMLEVLD 177 Query: 161 SLPIKQFVMTEHGIRERMIDIDQH-DSLVFNIGDHEIKSNFLT 202 ++P + F ++ + ++ Sbjct: 178 NMPHDLLFRESSKSPWMETYVQYDSSESRFVEHHRPVEDSLVS 220 >gi|328875278|gb|EGG23643.1| hypothetical protein DFA_05777 [Dictyostelium fasciculatum] Length = 1298 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 38/258 (14%), Positives = 77/258 (29%), Gaps = 40/258 (15%) Query: 21 VDQYFALCVADPEFGYYSTCNPFG--AVGDF---------------------------VT 51 + + + + ++GY++T A F T Sbjct: 849 MRDFIQDSLYNTKYGYFATKPVITSIAPTHFKQLNTLESKDQYIDYLQHIYKQHQHSWYT 908 Query: 52 APEI-SQIFGEMLAIFLICAW--EQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFF 108 EI + + ++I + + +R+ E+G G G L +L + + D + Sbjct: 909 PVEIFQPYYSNAITRYIIEKYLEKNQDLSIPLRIYEIGAGSGTNALCMLNYLREHHKDLY 968 Query: 109 SVLSIYMVETSERLTLIQKKQLASYGDKIN-------WYTSLADVPLGFTFLVANEFFDS 161 + ++E S L Q +++ I + F+V E D+ Sbjct: 969 EITEFTIIEISRLLATQQLERIKREHPHIKVQVYNSSIFNWTHKREDQECFIVMTEVIDN 1028 Query: 162 LPIKQFVMTEHGIRE-RMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDR 220 LP Q V+ +G+ E + + K T D Sbjct: 1029 LPHDQIVLNGNGVFETIVQTHLSTMNHFDQQEQEGEKEGEETTIDNLKNKKNRKELVHFE 1088 Query: 221 EMQSISDRLACDGGTAIV 238 + QS+ D + + Sbjct: 1089 QQQSVRDPIIKEYLNLFS 1106 >gi|224055903|ref|XP_002298698.1| predicted protein [Populus trichocarpa] gi|222845956|gb|EEE83503.1| predicted protein [Populus trichocarpa] Length = 432 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 34/231 (14%), Positives = 70/231 (30%), Gaps = 45/231 (19%) Query: 6 IRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTC------------------------- 40 I + V + + DP+ GY+S Sbjct: 19 KSHFSTNIVGEKPVLVRDFIHSALYDPKHGYFSQRSRSVGVLERSIRFNQLEGRKAYMNH 78 Query: 41 -NPFGAVGD--FVTAPEI-SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDI 96 + D + T E+ + +A ++ + +++ E+G G GI I Sbjct: 79 LDKIYKRSDISWFTPVELFKPWYAHGIAEAIMRTSQ---LSVPLQIYEIGGGSGICAKGI 135 Query: 97 LRVICKLKPDF-FSVLSIYMVETSERLTLIQKKQLASYGDKINWYT----------SLAD 145 L I P ++ ++ VE S L IQK+ + ++ + D Sbjct: 136 LDYIMLNAPARIYNNMTYTSVEISPSLAEIQKETVGEVRSHLSKFRVECRDAADRSGWGD 195 Query: 146 VPLGFTFLVANEFFDSLPIKQF--VMTEHGIRERMIDIDQHDSLVFNIGDH 194 + +++ E D+LP +E ++ +F + Sbjct: 196 IKQQPCWVIMLEVLDNLPHDLVYSENQVFPWKEVWVEKQHDKESLFELYKP 246 >gi|182437896|ref|YP_001825615.1| hypothetical protein SGR_4103 [Streptomyces griseus subsp. griseus NBRC 13350] gi|178466412|dbj|BAG20932.1| conserved hypothetical protein [Streptomyces griseus subsp. griseus NBRC 13350] Length = 337 Score = 73.6 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 58/288 (20%), Positives = 98/288 (34%), Gaps = 24/288 (8%) Query: 26 ALCVADPEFGYYST--CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLV 83 + E G+Y + +P G G F T+ S +F +A L + G V LV Sbjct: 13 QAALYGDE-GFYRSPSRSPRGPAGHFRTSVHASPLFAAAVARLLTGTARELGT-GTVALV 70 Query: 84 ELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSL 143 ++G GRG + +L + L ++ Y VE + +I W Sbjct: 71 DVGSGRGELPTGVLAALDALPWRPD--VTAYAVEV--------AARPPGLDPRIEWCAQP 120 Query: 144 ADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLV--FNIGDHEIKSNFL 201 + G L ANE+ D++P + G+ + + + ++ Sbjct: 121 PEGVTG--LLFANEWLDNVPAEVAEADPDGVPRYVRVRASDGAERLGEPVSGADLAWLER 178 Query: 202 TCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQ--SRVGDTLQAVKGHT 259 G E RD L G A+ +DY +++ TL + Sbjct: 179 WWPLTAPGERAEIGRTRDAAWARAVGSLTA--GLAVAVDYAHVRGARPPFGTLAGFRDGR 236 Query: 260 YVSPLVNPGQADLSSHVDFQRLSSIAILYKLYING-LTTQGKFLEGLG 306 V P+ + DL++HV ++ LT Q L LG Sbjct: 237 EVRPVPDGS-CDLTAHVALDACAAAVAAAVPGAGAELTDQRAALRRLG 283 >gi|134083609|emb|CAL00524.1| unnamed protein product [Aspergillus niger] Length = 564 Score = 73.6 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 35/212 (16%), Positives = 64/212 (30%), Gaps = 51/212 (24%) Query: 29 VADPEFGYYSTCNPFGAVG---------D---FV----------------TAPEIS---- 56 + + +GY+S G D F T P+++ Sbjct: 164 LYNHNYGYFSKHATIFRPGEPFYFNEIEDGPAFYRMLGERYHEFEDQLDETNPDVARQLW 223 Query: 57 --------QIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVICKLKPDF 107 +GE +A +L+ ++ +P + E+G G G MM++IL I + Sbjct: 224 HTPTELFRPYYGETIARYLVSNYKLTLYPYHDLIIYEMGAGNGTMMMNILDYIRDTDYEV 283 Query: 108 FSVLSIYMVETSERLTLIQKKQLASYGDKI----------NWYTSLADVPLGFTFLVANE 157 + ++E S L +Q + L D F +A E Sbjct: 284 YQRTKYKIIEISPSLASLQMQNLTDSLDAAGHSDRVEIINRSIFDWDTYVHSPCFFLALE 343 Query: 158 FFDSLPIKQFVMTEHGIRERMIDIDQHDSLVF 189 FD+ Q + + + F Sbjct: 344 VFDNFSHDQIRYDLKTELPQQGGVLIDANGEF 375 >gi|296421701|ref|XP_002840403.1| hypothetical protein [Tuber melanosporum Mel28] gi|295636618|emb|CAZ84594.1| unnamed protein product [Tuber melanosporum] Length = 487 Score = 73.6 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 55/381 (14%), Positives = 105/381 (27%), Gaps = 90/381 (23%) Query: 21 VDQYFALCVADPEFGYYST----------------------CNPFGAVGDFVTA------ 52 V + + +P +G++S G T+ Sbjct: 91 VRDFIDDSLYNPNYGFFSKFAVIFSTEKPFPFNLIRDESEFQRVLGEQ---YTSFEDKLD 147 Query: 53 -PEISQI--------------FGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDI 96 E++ + +GE +A +L+ ++ FP + E+G G G MM +I Sbjct: 148 EVEMNPVRQLWHTPTELFKPYYGEAIARYLVENYKLTLFPYSDLIIYEMGAGNGTMMSNI 207 Query: 97 LRVICKLKPDFFSVLSIYMVETSERLTLIQKK------QLASYGDKINWYTSLADVPLGF 150 L + +PD + ++E S+ L +Q+ Sbjct: 208 LDHVRDTEPDVYQRTKYKIIEISKNLANLQRARAAAGGHSEKIEIANESIFDWNTYVSDP 267 Query: 151 TFLVANEFFDSLPIKQFVMTEHGIR--------------------------ERMIDIDQH 184 F +A E FD+ + ER + I Q Sbjct: 268 CFFLALEVFDNFAHDVIRYNPQTEQPLQGIVLIDAAGDFTELYTPRLDPIAERYLRIRQK 327 Query: 185 DSLVFNIGDHEIKSNFLTCSDYFLG-----AIFENSP----CRDREMQSI--SDRLACDG 233 + + D F+ I E P ++ + RL Sbjct: 328 VARPGYNHPLKGPKLIRRLKDQFIPFSENLTIPEYIPTKVLGFFDILREHFPAHRLVVSD 387 Query: 234 GTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYIN 293 A+ + + V T +P V G D+ DF+ + + + Sbjct: 388 FHALPDAVKGINAPVVQTRYNRTSIPVTTPFVQQGLFDILFPTDFEVMDDMYKAMTGKLT 447 Query: 294 GLTTQGKFLEGLGIWQRAFSL 314 + Q FL + ++ Sbjct: 448 RVMKQEDFLRSWAYLEETETI 468 >gi|164659722|ref|XP_001730985.1| hypothetical protein MGL_1984 [Malassezia globosa CBS 7966] gi|159104883|gb|EDP43771.1| hypothetical protein MGL_1984 [Malassezia globosa CBS 7966] Length = 627 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 39/201 (19%), Positives = 67/201 (33%), Gaps = 8/201 (3%) Query: 56 SQIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIY 114 S +G LA +L+ ++ H FP + ELG G G + DIL + + +PD +S + Sbjct: 242 SPHYGHALARYLVAEYKLHLFPYHDLVIYELGGGAGTLARDILDYMEEFEPDVYSRTRYH 301 Query: 115 MVETSERLTLIQKKQLASYGDK------INWYTSLADVPLGFTFLVANEFFDSLPIKQFV 168 +VE S+RL QK +L + + + F+VA E D+L Sbjct: 302 IVEISDRLAAQQKARLLHHLRQGTVEVVPRDFLQWDKDVEDPCFIVALEVLDNLAHDVVR 361 Query: 169 MTEHGIRERMIDIDQHDSLVFN-IGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISD 227 + ++ + + F + + + F P S Sbjct: 362 YSTDDLQPYQGLVSIDRTGDFVELWEPVQDPLIQRYLNLFHSVRRPLLPPGAPFYLSWIP 421 Query: 228 RLACDGGTAIVIDYGYLQSRV 248 Y L Sbjct: 422 SSLRHTLHESAPFYPNLTQPH 442 >gi|222619736|gb|EEE55868.1| hypothetical protein OsJ_04506 [Oryza sativa Japonica Group] Length = 446 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 60/209 (28%), Gaps = 47/209 (22%) Query: 15 KNGQMTVDQYFALCVADPEFGYYSTCN-PFG-------------------------AVGD 48 +N + V + + DP GY+S + P G D Sbjct: 35 ENKPILVRDFVRSALYDPNHGYFSKRSGPVGVLDSSIRFNQLDGRSAYMQYLDKLYKKHD 94 Query: 49 ---FVTAPEI-SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRV-ICKL 103 F T E+ + +A +++ E+G G G IL + Sbjct: 95 IAWF-TPVELFKPWYAYAIA---ASILRTANLSVPLKIYEIGGGSGTCAKCILDYMMLNA 150 Query: 104 KPDFFSVLSIYMVETSERLTLIQKK-------QLASYGDKINWYTS---LADVPLGFTFL 153 P ++ + VE S L Q + L+ + + T ++ Sbjct: 151 PPKVYNTMKYISVEISSSLAEKQLETVGEVRSHLSKFMVECRDATDRAGWGRKDPQPCWV 210 Query: 154 VANEFFDSLPIKQFVM--TEHGIRERMID 180 + E D+LP E I+ Sbjct: 211 LMLEVLDNLPHDLVYSPDQVSPWMEVWIE 239 >gi|218189586|gb|EEC72013.1| hypothetical protein OsI_04882 [Oryza sativa Indica Group] Length = 446 Score = 71.3 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 60/209 (28%), Gaps = 47/209 (22%) Query: 15 KNGQMTVDQYFALCVADPEFGYYSTCN-PFG-------------------------AVGD 48 +N + V + + DP GY+S + P G D Sbjct: 35 ENKPILVRDFVRSALYDPNHGYFSKRSGPVGVLDSSIRFNQLDGRSAYMQYLDKLYKKHD 94 Query: 49 ---FVTAPEI-SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRV-ICKL 103 F T E+ + +A +++ E+G G G IL + Sbjct: 95 IAWF-TPVELFKPWYAYAIA---ASILRTANLSVPLKIYEIGGGSGTCAKCILDYMMLNA 150 Query: 104 KPDFFSVLSIYMVETSERLTLIQKK-------QLASYGDKINWYTS---LADVPLGFTFL 153 P ++ + VE S L Q + L+ + + T ++ Sbjct: 151 PPKVYNTMKYISVEISSSLAEKQLETVGEVRSHLSKFMVECRDATDRAGWGRKDPRPCWV 210 Query: 154 VANEFFDSLPIKQFVM--TEHGIRERMID 180 + E D+LP E I+ Sbjct: 211 LMLEVLDNLPHDLVYSPDQVSPWMEVWIE 239 >gi|239052765|ref|NP_001132810.2| hypothetical protein LOC100194300 [Zea mays] gi|238908737|gb|ACF81812.2| unknown [Zea mays] Length = 451 Score = 70.9 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 35/237 (14%), Positives = 64/237 (27%), Gaps = 47/237 (19%) Query: 12 LIKKNGQMTVDQYFALCVADPEFGYYSTCN-PFG-------------------------A 45 ++ N + V + + DP GY+S P G Sbjct: 31 ILAGNKPILVRDFVRSALYDPNHGYFSKRAGPVGVLDASIRFNQLEGRSAYIQHLDKLYK 90 Query: 46 VGD---FVTAPEI-SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRV-I 100 D F T E+ + +A +++ E+G G G IL + Sbjct: 91 KHDIAWF-TPVELFKPWYAYTIA---ASILRTANLSVPLKIYEIGGGSGTCAKCILDYMM 146 Query: 101 CKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWY----------TSLADVPLGF 150 P ++ + VE S L Q + + ++ + Sbjct: 147 LNAPPKVYNDMKYISVEISSSLAEKQLETVGEVQSHLSKFTVEHRDAINRPGWGRTDPHP 206 Query: 151 TFLVANEFFDSLPIKQFVM--TEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSD 205 +++ E D+LP E I+ + S + C D Sbjct: 207 CWVLMLEVLDNLPHDLVYSPDQVSPWMEVWIEKVKGSSQASEVYKPLQDPLISRCVD 263 >gi|258575697|ref|XP_002542030.1| conserved hypothetical protein [Uncinocarpus reesii 1704] gi|237902296|gb|EEP76697.1| conserved hypothetical protein [Uncinocarpus reesii 1704] Length = 473 Score = 70.9 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 37/215 (17%), Positives = 64/215 (29%), Gaps = 51/215 (23%) Query: 26 ALCVADPEFGYYSTCNPF---GAVGDFVT-------------------------APEIS- 56 L + +P +GY+S G DF +P + Sbjct: 80 ELRLYNPHYGYFSKHATIFSPGEPFDFNNIEDGPAFNRLVDQRYAEFEDKLDAVSPNETR 139 Query: 57 -----------QIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVICKLK 104 +GE +A +L+ + FP + E+G G G +ML+IL I ++ Sbjct: 140 QLWHTPTELFRPYYGEAIARYLVTNYRLTLFPYHDLIIYEMGAGNGTLMLNILDYIREVD 199 Query: 105 PDFFSVLSIYMVETSERLTLIQKKQLASYGD----------KINWYTSLADVPLGFTFLV 154 P+ + ++E S +L Q+ L + F + Sbjct: 200 PEVYQRTKFKIIEISSQLANTQQMTLNNSIYGDGHRGHVEVINRSIFEWNTYVHSPCFFL 259 Query: 155 ANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVF 189 A E FD+ R + F Sbjct: 260 ALEVFDNFAHDAIRYDMQTGLPRQGCVLIDTDGEF 294 >gi|195627520|gb|ACG35590.1| uncharacterized ACR, COG1565 family protein [Zea mays] Length = 451 Score = 70.5 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 32/212 (15%), Positives = 59/212 (27%), Gaps = 47/212 (22%) Query: 12 LIKKNGQMTVDQYFALCVADPEFGYYSTCN-PFG-------------------------A 45 ++ N + V + + DP GY+S P G Sbjct: 31 ILAGNKPILVRDFVRSALYDPNHGYFSKRAGPVGVLDASIRFNQLEGRSAYIQHLDKLYK 90 Query: 46 VGD---FVTAPEI-SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRV-I 100 D F T E+ + +A +++ E+G G G IL + Sbjct: 91 KHDIAWF-TPVELFKPWYAYAIA---ASILRTANLSVPLKIYEIGGGSGTCAKCILDYMM 146 Query: 101 CKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWY----------TSLADVPLGF 150 P ++ + VE S L Q + + ++ + Sbjct: 147 LNAPPKVYNDMKYISVEISSSLAEKQLETVGEVQSHLSKFTVEHRDAINRPGWGRTDPHP 206 Query: 151 TFLVANEFFDSLPIKQFVM--TEHGIRERMID 180 +++ E D+LP E I+ Sbjct: 207 CWVLMLEVLDNLPHDLVYSPDQVSPWMEVWIE 238 >gi|256374477|ref|YP_003098137.1| hypothetical protein Amir_0322 [Actinosynnema mirum DSM 43827] gi|255918780|gb|ACU34291.1| protein of unknown function DUF185 [Actinosynnema mirum DSM 43827] Length = 305 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 65/331 (19%), Positives = 99/331 (29%), Gaps = 58/331 (17%) Query: 25 FALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVE 84 + + P G+++ F TAP + E L L P + LV+ Sbjct: 8 WRTALYGPS-GFFTRGEA--PSDHFRTAPLVGPELAEALLELLRRVDTALDRPDALDLVD 64 Query: 85 LGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLA 144 LG G G + + + P L + V+ L + W L Sbjct: 65 LGAGGGELSAAVR-ALADADPALRDRLRVTAVDVGPTRDL----------PGVRWTRDLP 113 Query: 145 DVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCS 204 D G LVA+E+ D+LP Sbjct: 114 DRVTG--LLVAHEWLDALPCPVVAWPHRDPW------------------------LDRWW 147 Query: 205 DYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY------LQSRVGDTLQAVKGH 258 G E RD S R+ G A+ IDYG+ T A + Sbjct: 148 PLRPGGRAEIGAPRDAAWASAVARV---RGAALAIDYGHLADDRAAGRYPRGTFTAYRSG 204 Query: 259 TYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQT 318 V+P+ + DL++HV + A L +Q L LG+ A L +T Sbjct: 205 NQVAPVFDGS-CDLTAHVALDACAEAAG----RPWTLVSQRDALAALGLP--AEPLTART 257 Query: 319 ARKDILLDSVKRLVSTSADKKSMGELFKILV 349 R ++ D +G F L+ Sbjct: 258 PDWLTAAARASR-IAELRDPHGLG-AFGWLL 286 >gi|281200648|gb|EFA74866.1| hypothetical protein PPL_11900 [Polysphondylium pallidum PN500] Length = 486 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 30/196 (15%), Positives = 65/196 (33%), Gaps = 40/196 (20%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGA--------------VGDF---------------VT 51 + + + + E+GY+++ + D+ T Sbjct: 86 MRDFIQDSLYNKEYGYFTSKEVITSIDKSRFKELNTLKNRKDYLIYLSELYKDSQHSWFT 145 Query: 52 APEI-SQIFGEMLAIFLICAWEQHGFPS---CVRLVELGPGRGIMMLDILRVICKLKPDF 107 EI + + ++I + +++ E+G G G L IL + P+ Sbjct: 146 PVEIFQPYYSNAIGRYIIEKYNNSKMKQEKKPLKIFEVGAGSGTNALCILNFLRDQHPNL 205 Query: 108 FSVLSIYMVETSERLTLIQKKQLASYGDKIN-------WYTSLADVPLGFTFLVANEFFD 160 ++ ++E S L + Q +++ I + F++ E D Sbjct: 206 YATAQYTIIEISRLLAVKQLERIKVEHPNIKVQVYNTSIFNWTHKREDDECFILMTEVID 265 Query: 161 SLPIKQFVMTEHGIRE 176 +LP + V+ GI E Sbjct: 266 NLPHDKIVVNSDGIFE 281 >gi|307105740|gb|EFN53988.1| hypothetical protein CHLNCDRAFT_25175 [Chlorella variabilis] Length = 500 Score = 69.4 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 71/235 (30%), Gaps = 70/235 (29%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAV--G------DF----------------------- 49 V ++ + P GY+S GA G DF Sbjct: 2 VREFIQDSLYHPTEGYFSKQTATGAAVVGSLAAPLDFRRLAGQMQYLQAVQEQYRQLQAS 61 Query: 50 -VTAPEI-SQIFGEMLAIFLICAWEQHGFPSCVR--LVELGPGRGIMMLDILRVICKLKP 105 +T EI +G +A ++ W Q + + E+G G G + +IL + + P Sbjct: 62 WLTPVEIFQPHYGRAVAACILHRWRQLAAAAPPPLLIYEIGGGTGTLARNILDWLREAHP 121 Query: 106 DFFSVLSIYMVETSERLTLIQKKQLASYGDKINWY------------------------- 140 + + +E S L +Q++++A+ G + Sbjct: 122 EAYHTAQYRCIEISPVLAELQRQKVAAEGGHATRFAVRRHDAADAAAWAAADADERRQQQ 181 Query: 141 ----------TSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHD 185 F++A E D+LP + + + + Q D Sbjct: 182 EGGQHQQEEDKHHHRHHHHHCFIIACEVLDNLPHDRVRRDMSSAQWQQTLVAQAD 236 >gi|297299987|ref|XP_001089932.2| PREDICTED: protein midA homolog, mitochondrial-like, partial [Macaca mulatta] Length = 199 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 35/139 (25%), Positives = 63/139 (45%), Gaps = 3/139 (2%) Query: 212 FENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQAD 271 E P + +S +A GG A+V DYG+ ++ D + GH + L+ PG+AD Sbjct: 2 VEMFPDAGVITKELSQCIALTGGAALVADYGHDGTKTDD-FRGFCGHKFHDVLIAPGRAD 60 Query: 272 LSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRL 331 L+++VDF L +A K+ G Q F + +GI + + + + + + + + Sbjct: 61 LTANVDFSYLQRMA-QGKVASLGPIKQHTFFKNMGIDVQPKVPLDK-SNETSIRQQLLQG 118 Query: 332 VSTSADKKSMGELFKILVV 350 + K GE I + Sbjct: 119 YDMLMNPKKTGERLNIFAL 137 >gi|46108046|ref|XP_381081.1| hypothetical protein FG00905.1 [Gibberella zeae PH-1] Length = 482 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 34/172 (19%), Positives = 61/172 (35%), Gaps = 12/172 (6%) Query: 57 QIFGEMLAIFLICAWEQHGFPSCVR-LVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 +GE +A +L+ + +P + E+G GRG +ML+IL I ++ P ++ + Sbjct: 159 PYYGEAIARYLVSNYRLTTYPYHDLLIYEMGAGRGTLMLNILDYIREVDPQVYARTKFNI 218 Query: 116 VETSERLTLIQKKQL------ASYGDKI----NWYTSLADVPLGFTFLVANEFFDSLPIK 165 +E S L +Q + L + DK+ F +A E FD+ Sbjct: 219 IEISSTLAALQNRHLLATAASRGHADKVEIINRSIFDWDQYVPSPCFFLAMEVFDNFSHD 278 Query: 166 QFVMTEHGIR-ERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSP 216 + + + D + HE+ A N P Sbjct: 279 GIRYDITTEQPLQGHVLIDGDGDFYEFYSHELDPLAARYFRVRHAATAGNYP 330 >gi|297843292|ref|XP_002889527.1| hypothetical protein ARALYDRAFT_470475 [Arabidopsis lyrata subsp. lyrata] gi|297335369|gb|EFH65786.1| hypothetical protein ARALYDRAFT_470475 [Arabidopsis lyrata subsp. lyrata] Length = 447 Score = 68.6 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 35/249 (14%), Positives = 75/249 (30%), Gaps = 47/249 (18%) Query: 18 QMTVDQYFALCVADPEFGYYSTCNP-FG-------------------------AVGD--- 48 + V + + DP+ GY+S + G D Sbjct: 46 PVLVRDFIHTALYDPQKGYFSQRSKSVGVLERSIKFNQLEGRKAYMKLLEKVYKQSDISW 105 Query: 49 FVTAPEI-SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDF 107 F T E+ + +A + +++ E+G G G +L I P+ Sbjct: 106 F-TPVELFKPWYAHGIAEAI---LRTTNLSVPLKIYEIGGGSGTCAKGVLDYIMLNAPER 161 Query: 108 -FSVLSIYMVETSERLTLIQKK-------QLASYGDKINWYTSLADVPL---GFTFLVAN 156 ++ +S +E S L IQK+ L+ + + ++L+ +++ Sbjct: 162 IYNNMSYTSIEISPSLAKIQKETVAQVGSHLSKFRVECRDASNLSGWKNVEQQPCWVIML 221 Query: 157 EFFDSLPIKQFVMTEH--GIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFEN 214 E D+LP E +++ + + C + Sbjct: 222 EVLDNLPHDLVYSKSQVSPWMEVLVENKPESEALSELYKPLEDPLIKRCIEIVEHEDDPV 281 Query: 215 SPCRDREMQ 223 S ++ + Sbjct: 282 SKPKEIWSK 290 >gi|84495010|ref|ZP_00994129.1| hypothetical protein JNB_09429 [Janibacter sp. HTCC2649] gi|84384503|gb|EAQ00383.1| hypothetical protein JNB_09429 [Janibacter sp. HTCC2649] Length = 316 Score = 68.6 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 60/324 (18%), Positives = 119/324 (36%), Gaps = 43/324 (13%) Query: 28 CVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVELGP 87 + P+ G+Y G G F T+ + G +LA + ++ G R+V+L Sbjct: 13 ALYGPD-GFYRRDE--GPAGHFTTSAH-GGL-GAVLADVVGRLADEAGV---CRVVDLAC 64 Query: 88 GRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL-TLIQKKQLASYGDKINWYTSLADV 146 GRG ++ + ++PD ++ + +VE L T + + + L Sbjct: 65 GRGELLTH----LHSIRPD-LELVGVDVVERPPSLPTSVDWLRSPGGSAVPDELHDLDA- 118 Query: 147 PLGFTFLVANEFFDSLPIKQFVMTEH-GIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSD 205 ++A+E+ D +P + +RE ++D ++L + + + Sbjct: 119 -----LVLAHEWLDVVPCTIAEVDADGVLREVLVDEAGTETLGGELAEPDRAWADSWWPA 173 Query: 206 YFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQ--SRVGDTLQAVKGHTYVSP 263 G E RD + + R GT I +DYG+ + +L A + +P Sbjct: 174 TEAGERVEIGLGRDLAWRDLLSR--NRTGTTIAVDYGHTKATRPTEGSLTAYREGVLTNP 231 Query: 264 LVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDI 323 + + + DL++HV L + + TQ LG+ + + Sbjct: 232 VPDGQR-DLTAHVATDSLGADEV---------VTQRDLFHRLGLT---ATTPDHASASTD 278 Query: 324 LLDSVKRL-----VSTSADKKSMG 342 L V+ L ++ D +G Sbjct: 279 PLGYVRALGRTSTIAALTDPHGLG 302 >gi|300681443|emb|CBH32535.1| conserved hypothetical protein, expressed [Triticum aestivum] Length = 441 Score = 68.2 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 35/234 (14%), Positives = 64/234 (27%), Gaps = 49/234 (20%) Query: 16 NGQMTVDQYFALCVADPEFGYYSTCNPFGAVG--D----F-------------------- 49 + + V + + DP GY+S + G VG D F Sbjct: 32 DKPVLVRDFVRSALYDPNHGYFSKRS--GPVGVLDSAIRFHKLDGRTAYMKHLDKLYKMH 89 Query: 50 ----VTAPEI-SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRV-ICKL 103 T E+ + +A +++ E+G G G +L + Sbjct: 90 DISWFTPVELFKPWYAYAIA---ASILRTANLSVPLKIFEIGGGSGTCAKCVLDYMMLNA 146 Query: 104 KPDFFSVLSIYMVETSERLTLIQKKQLASYGDKIN----------WYTSLADVPLGFTFL 153 P ++ + VE S L Q + + ++ ++ Sbjct: 147 PPKVYNNMKYISVEISSSLAEKQLETVGEVQSHLSKFTVEHRDATDIAGWGSKDPQPCWV 206 Query: 154 VANEFFDSLPIKQFVM--TEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSD 205 + E D+LP E I+ S V + CS+ Sbjct: 207 LMLEVLDNLPHDLVYSPDQVSAWMEVWIEKVNGSSQVCEVYKPLQDPLVSRCSE 260 >gi|326518340|dbj|BAJ88199.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 442 Score = 67.8 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 38/232 (16%), Positives = 67/232 (28%), Gaps = 45/232 (19%) Query: 16 NGQMTVDQYFALCVADPEFGYYSTCN-PFG--------AVGDFVTA-------------- 52 + + V + + DP GY+S + P G D TA Sbjct: 32 DKPVLVRDFVRSALYDPNHGYFSKRSGPVGVLDSAIRFHKFDGRTAYMKHLDKLYKMHDI 91 Query: 53 -----PEI-SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRV-ICKLKP 105 E+ + +A +++ E+G G G IL + P Sbjct: 92 AWFTPVELFKPWYAYAIA---ASILRTANLSVPLKIYEIGGGSGTCAKCILDYMMLNAPP 148 Query: 106 DFFSVLSIYMVETSERLTLIQ-------KKQLASYGDKINWYTSLADVPLG---FTFLVA 155 ++ + VE S L Q + L+ + + T +A +++ Sbjct: 149 KVYNNMKYISVEISSSLAEKQLETVGEVQSHLSKFTVEHRDATDIAGWGCKDPQPCWVLM 208 Query: 156 NEFFDSLPIKQFVM--TEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSD 205 E D+LP E I+ S V + CS+ Sbjct: 209 LEVLDNLPHDLVYSPDQVSPWMEVWIEKVNGSSQVCEVYRPLQDPLVSRCSE 260 >gi|302927321|ref|XP_003054472.1| predicted protein [Nectria haematococca mpVI 77-13-4] gi|256735413|gb|EEU48759.1| predicted protein [Nectria haematococca mpVI 77-13-4] Length = 481 Score = 67.4 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 29/144 (20%), Positives = 52/144 (36%), Gaps = 11/144 (7%) Query: 57 QIFGEMLAIFLICAWEQHGFPSCVR-LVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 +GE +A +L+ + +P + E+G GRG +ML+IL I ++ P ++ + Sbjct: 158 PYYGEAIARYLVSNYRLTTYPYHDLLIYEMGAGRGTLMLNILDYIREVDPQVYARTKFNI 217 Query: 116 VETSERLTLIQKKQL------ASYGDKI----NWYTSLADVPLGFTFLVANEFFDSLPIK 165 +E S L +Q + L + DK+ F +A E FD+ Sbjct: 218 IEISSSLAALQNRHLLSTAASRGHADKVEIVNRSIFDWDQYVPSPCFFLAMEVFDNFSHD 277 Query: 166 QFVMTEHGIRERMIDIDQHDSLVF 189 + F Sbjct: 278 GIRYDVATEEPLQGHVLIDGDGDF 301 >gi|318081379|ref|ZP_07988711.1| hypothetical protein SSA3_32917 [Streptomyces sp. SA3_actF] Length = 188 Score = 67.4 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 33/144 (22%), Positives = 54/144 (37%), Gaps = 14/144 (9%) Query: 25 FALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVE 84 A + P G+Y + P G F T+ S ++ +A FL G P + V+ Sbjct: 1 MATALYGPG-GFYRSAGP-GPAAHFRTSVHASPLYARAVARFLTEVDTVLGTPPALDFVD 58 Query: 85 LGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLA 144 +G GRG ++ +L L P + + Y VE + R D + Sbjct: 59 IGAGRGELLTGVLA---ALPPATAARVRPYGVERAPR---------PEGLDARIGWGEEI 106 Query: 145 DVPLGFTFLVANEFFDSLPIKQFV 168 L ANE+ D++P+ Sbjct: 107 PGHGLTGLLFANEWLDNVPVDVVE 130 >gi|290958413|ref|YP_003489595.1| hypothetical protein SCAB_39651 [Streptomyces scabiei 87.22] gi|260647939|emb|CBG71044.1| conserved hypothetical protein [Streptomyces scabiei 87.22] Length = 293 Score = 67.4 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 50/253 (19%), Positives = 82/253 (32%), Gaps = 23/253 (9%) Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F +A L E G P + V+LG GRG ++ V+ L S Y Sbjct: 4 SPLFAAAVARLLCRVDEALGHPGELAFVDLGAGRGELVSG---VLAALPAPVSSRTRAYA 60 Query: 116 VETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIR 175 VE + +I W+ G L ANE+ D++P+ + G Sbjct: 61 VEL--------AGRPDGLDHRIEWHAEPPTGVTG--LLFANEWLDNVPVDVVEVDAAGTA 110 Query: 176 ERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGT 235 ++ + + E G R+ + G Sbjct: 111 RLVLVREDGSERLGEPVAGEAARWLDRWWPPGPGEGLRAEVGLPRDRAWAAAVAGVARGL 170 Query: 236 AIVIDYGY--LQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYIN 293 A+ +DYG+ TL + +P+ + DL++HV + Sbjct: 171 AVAVDYGHLADARPPFGTLTGFREGRETAPVPDGS-CDLTAHVALDACALPGAR------ 223 Query: 294 GLTTQGKFLEGLG 306 L TQ + L LG Sbjct: 224 -LLTQREALRALG 235 >gi|332213718|ref|XP_003255974.1| PREDICTED: protein midA homolog, mitochondrial-like [Nomascus leucogenys] Length = 186 Score = 67.4 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 39/140 (27%), Positives = 62/140 (44%), Gaps = 5/140 (3%) Query: 195 EIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQA 254 E K NF+ + FE P + +S +A GG A+V DYG+ + +T + Sbjct: 2 EKKVNFMLSIPHMEQD-FEVFPDASVITEELSQCIALTGGAALVADYGHDGT-NTETFRG 59 Query: 255 VKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSL 314 GH + L+ PG ADL + VDF + +A K+ G Q F + +GI + L Sbjct: 60 FCGHKFHDVLIAPGTADLKADVDFSYVQRMA-QGKVASLGPIKQHTFFKNMGIDVQLKIL 118 Query: 315 MKQTARKD--ILLDSVKRLV 332 + ++ LL L+ Sbjct: 119 LDKSNETSVRQLLQGYGMLM 138 >gi|333025297|ref|ZP_08453361.1| hypothetical protein STTU_2801 [Streptomyces sp. Tu6071] gi|332745149|gb|EGJ75590.1| hypothetical protein STTU_2801 [Streptomyces sp. Tu6071] Length = 408 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 58/187 (31%), Gaps = 14/187 (7%) Query: 25 FALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVE 84 A + P G+Y + P G F T+ S ++ +A L G P + V+ Sbjct: 1 MATALYGPG-GFYRSAGP-GPAAHFRTSVHASPLYARAVARLLTEVDTVLGTPPALDFVD 58 Query: 85 LGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLA 144 +G GRG ++ +L + + Y VE + R D + Sbjct: 59 IGAGRGELLTGVLAALPPATAA---RVRPYGVERAPR---------PEGLDARIGWGEEI 106 Query: 145 DVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCS 204 L ANE+ D++P+ G+ ++ + Sbjct: 107 PGHGLTGLLFANEWLDNVPVDVVETDAEGVARIVLVAPDGSERLGEPVSGADAEWLARWW 166 Query: 205 DYFLGAI 211 G + Sbjct: 167 PMGEGEV 173 >gi|18390452|ref|NP_563721.1| unknown protein [Arabidopsis thaliana] gi|14334776|gb|AAK59566.1| unknown protein [Arabidopsis thaliana] gi|21280933|gb|AAM44929.1| unknown protein [Arabidopsis thaliana] gi|21618146|gb|AAM67196.1| unknown [Arabidopsis thaliana] gi|332189636|gb|AEE27757.1| uncharacterized protein [Arabidopsis thaliana] Length = 442 Score = 66.7 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 36/249 (14%), Positives = 72/249 (28%), Gaps = 47/249 (18%) Query: 18 QMTVDQYFALCVADPEFGYYSTCNP-FG-------------------------AVGD--- 48 + V + + DP GY+S + G D Sbjct: 41 PVLVRDFIHTALYDPIQGYFSQRSKSVGVLERSIKFNQLEGRKAYMKLLEKVYKQSDISW 100 Query: 49 FVTAPEI-SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDF 107 F T E+ + +A + +++ E+G G G +L I P+ Sbjct: 101 F-TPVELFKPWYAHGIAEAI---LRTTNLSVPLKIYEIGGGSGTCAKGVLDYIMLNAPER 156 Query: 108 -FSVLSIYMVETSERLTLIQKK-------QLASYGDKINWYTSLADVPL---GFTFLVAN 156 + +S +E S L IQK+ L+ + + + LA +++ Sbjct: 157 IYKNMSYTSIEISPSLAKIQKETVAQVGSHLSKFRVECRDASDLAGWKNVEQQPCWVIML 216 Query: 157 EFFDSLPIKQFVMTEH--GIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFEN 214 E D+LP E +++ + + C + Sbjct: 217 EVLDNLPHDLVYSKSQLSPWMEVLVENKPESEALSELYKPLEDPLIKRCIEIVEHEDDPV 276 Query: 215 SPCRDREMQ 223 S ++ + Sbjct: 277 SKPKEIWSK 285 >gi|168047011|ref|XP_001775965.1| predicted protein [Physcomitrella patens subsp. patens] gi|162672623|gb|EDQ59157.1| predicted protein [Physcomitrella patens subsp. patens] Length = 401 Score = 65.9 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 33/214 (15%), Positives = 63/214 (29%), Gaps = 53/214 (24%) Query: 21 VDQYFALCVADPEFGYYSTC-NPFGA-----VGDFV------------------------ 50 V Y + +P GY++ N GA D + Sbjct: 1 VRDYIHSALYEPHKGYFAAKSNAVGAIAEPIRFDRIPGKILTGHVAGRAAYNEYLSKLYK 60 Query: 51 -------TAPEI---SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVI 100 T E+ S +G +A ++ + +++ E+G G G +IL Sbjct: 61 QHDIAWFTPVEVFQASPWYGYAVAEYI---LRTMDPSAPLKIYEVGGGTGTCANNILTYF 117 Query: 101 CKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYT----------SLADVPLGF 150 P+ + ++ VE S L QK ++ + + + V Sbjct: 118 KLRAPNVYKSMTYTSVEISAALAKKQKARVETVISHRGHFMVNCGSASDARTWGTVNASP 177 Query: 151 TFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQH 184 +++ E D+LP + Q Sbjct: 178 CYIIMFEVLDNLPHDLVYRETPKSPWLETWVVQD 211 >gi|260939920|ref|XP_002614260.1| hypothetical protein CLUG_05746 [Clavispora lusitaniae ATCC 42720] gi|238852154|gb|EEQ41618.1| hypothetical protein CLUG_05746 [Clavispora lusitaniae ATCC 42720] Length = 606 Score = 65.9 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 28/188 (14%), Positives = 60/188 (31%), Gaps = 15/188 (7%) Query: 56 SQIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIY 114 S + E LA +++ ++ + +P + E+G G G +M +IL+ I + +PD + Sbjct: 255 SPHYAEALARYIVVNYKLNQYPYHDLVIFEMGGGNGTLMCNILQYIQRHEPDIYRRTQYK 314 Query: 115 MVETSERLTLIQKKQL------------ASYGDKINWYTSLADVPLGFTFLVANEFFDSL 162 ++E S +L Q Q V F +A E FD+ Sbjct: 315 IIEISSQLASKQYSQAMKSRLVSSGLDERKLQIINKSIFRWDTVVEEPCFFIALEVFDNF 374 Query: 163 PIKQFVMTEHGIR-ERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDRE 221 + + + ++ + + P ++ Sbjct: 375 AHDVVRYDSVSGVPYQGYVLVDDAGDFYEFFSPQLSPDTNALLSLRENGRYPVLPQQNTI 434 Query: 222 MQSISDRL 229 + R+ Sbjct: 435 SG-HAKRI 441 >gi|239988678|ref|ZP_04709342.1| hypothetical protein SrosN1_15322 [Streptomyces roseosporus NRRL 11379] Length = 310 Score = 65.5 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 55/272 (20%), Positives = 90/272 (33%), Gaps = 21/272 (7%) Query: 40 CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRV 99 +P G G F T+ S +F +A L + V LV++G GRG ++ +L V Sbjct: 2 RSPEGPAGHFRTSVHASPLFAGAVARLLTGIARELDT-GTVALVDVGAGRGELLTGVLDV 60 Query: 100 ICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFF 159 + ++ Y VE + +I W G L ANE+ Sbjct: 61 LAARPDGPD--VTPYAVEV--------AARPPGLDPRIEWCAEPPAGVAG--LLFANEWL 108 Query: 160 DSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIG----DHEIKSNFLTCSDYFLGAIFENS 215 D++P++ G+ + + D + S G E Sbjct: 109 DNVPVEVAEADAEGVPRYVEVRTSDGAERPGAPVTGADAAWLERWWPLST--PGERAEIG 166 Query: 216 PCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSH 275 RD L A+ +G TL +G V P+ + + DL++H Sbjct: 167 RPRDAAWAGAVGSLTAGLAVAVDYAHGRDARPPFGTLTGFRGGREVRPVPDGSR-DLTAH 225 Query: 276 VDFQRLSSIAILYKLYI-NGLTTQGKFLEGLG 306 V ++ + LT Q L LG Sbjct: 226 VALDACAAAVAEAGADVRTELTDQRAALRRLG 257 >gi|284034315|ref|YP_003384246.1| hypothetical protein Kfla_6450 [Kribbella flavida DSM 17836] gi|283813608|gb|ADB35447.1| protein of unknown function DUF185 [Kribbella flavida DSM 17836] Length = 294 Score = 65.5 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 54/309 (17%), Positives = 101/309 (32%), Gaps = 62/309 (20%) Query: 20 TVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 + + + + + G+Y P F T+ S +F A ++ + G Sbjct: 3 SFREAWDEALYGSD-GFYRRERP---SAHFRTSVHASPLF----AAAVVRLAREIG---A 51 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINW 139 ++ ++G G G + + + LA Sbjct: 52 RQITDIGAGSGELGKAV-------------------------------EVLAPELTVTGI 80 Query: 140 YTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSN 199 P ++ANE+ D++P + + + G+ ++ +V ++ Sbjct: 81 ELDDVLPPALTGLVIANEWLDNVPCELAELADDGVPRYLLADLGPGEVVEG---KDLAWL 137 Query: 200 FLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQ--SRVGDTLQAVKG 257 G E RD + RL D G A+ IDYG+++ TL +G Sbjct: 138 ERWWPLAEPGDRAEIGVTRDAAWADVVSRL--DDGLAVAIDYGHVRAGRPAYGTLTGFRG 195 Query: 258 HTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQ 317 P+ + D+++HV L +TTQ L LG+ A ++ Sbjct: 196 GRECEPVPDGS-CDITAHVALDSLGGT----------ITTQRDALRALGVS--AGRPSQE 242 Query: 318 TARKDILLD 326 AR D L Sbjct: 243 LARTDPLRY 251 >gi|329939463|ref|ZP_08288799.1| hypothetical protein SGM_4291 [Streptomyces griseoaurantiacus M045] gi|329301692|gb|EGG45586.1| hypothetical protein SGM_4291 [Streptomyces griseoaurantiacus M045] Length = 354 Score = 65.1 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 53/303 (17%), Positives = 93/303 (30%), Gaps = 46/303 (15%) Query: 26 ALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVEL 85 + PE G+Y P G G F T+ S ++ +A L P+ + V++ Sbjct: 20 QEALYGPE-GFY--RRPEGPAGHFRTSVHASPLYAAAVARLLARVDAALDRPAELTFVDM 76 Query: 86 GPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLAD 145 GRG + V+ L + ++ VE + A +I W Sbjct: 77 AAGRGELAAG---VLAALPGALAARTRVHAVEL--------AGRPAGLDPRIAWSAEPPA 125 Query: 146 VPLGFTFLVANEFFDSLPIKQFVMTE----------HGIRERMIDIDQHDSLVFNIGDHE 195 G L ANE+ D++P+ ER+ + + Sbjct: 126 GVTG--LLFANEWLDNVPVDVAETDTAGVRRRVLVRRDGTERLGEPVTGAEADWLERWWP 183 Query: 196 IKSNFLT----------CSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQ 245 +G R++ S + G A+ +DY + + Sbjct: 184 SPPEPAPHTGTGDTGPQAGTGPVGTGLRAEIGLPRDLAWASAAGSLARGLAVAVDYAHTR 243 Query: 246 --SRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLE 303 TL + P+ + DL++HV + L L +Q + L Sbjct: 244 EARPPFGTLTGFREGRQTRPVPDGRH-DLTAHVALDACAGPGAL-------LLSQREALG 295 Query: 304 GLG 306 LG Sbjct: 296 ALG 298 >gi|321254036|ref|XP_003192941.1| hypothetical protein CGB_C6510W [Cryptococcus gattii WM276] gi|317459410|gb|ADV21154.1| conserved hypothetical protein [Cryptococcus gattii WM276] Length = 545 Score = 65.1 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 74/235 (31%), Gaps = 54/235 (22%) Query: 21 VDQYFALCVADPEFGYYSTCNPFG---AVGDF-VTA----------------------PE 54 V + + +P +GY+S G F +T+ P Sbjct: 100 VRDFIDDSLYNPNYGYFSHHATIFTPPKDG-FDITSFRDVAHFQETVAERYEREYGLEPT 158 Query: 55 IS------------------QIFGEMLAIFLICAWEQHGFP-SCVRLVELGPGRGIMMLD 95 + L ++ +++ + FP + + E+G G G M+D Sbjct: 159 EGAQGGLGRQVWHTPTELFKPYYARSLISAIVQSYKLYHFPSEPLIIYEIGAGNGSFMVD 218 Query: 96 ILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQ--LASYGDKINWYTSLADVPLG---- 149 L + PD F+ ++E S L Q+++ +G K+ G Sbjct: 219 SLTYLRDHYPDVFARTKYRIIEISGTLAKGQRERAEKEGFGKKVQVLNKDFFQWDGVGGG 278 Query: 150 --FTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLT 202 F++A E FD+ + R + S F++ I ++ Sbjct: 279 DQPCFVIALEVFDNFAHDMVRYELDTLTPRQAVVGIDASGDFSLLYETINDPLVS 333 >gi|134109343|ref|XP_776786.1| hypothetical protein CNBC2770 [Cryptococcus neoformans var. neoformans B-3501A] gi|50259466|gb|EAL22139.1| hypothetical protein CNBC2770 [Cryptococcus neoformans var. neoformans B-3501A] Length = 545 Score = 65.1 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 41/240 (17%), Positives = 76/240 (31%), Gaps = 64/240 (26%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPE-----IS----QIFGEMLA------- 64 V + + +P +GY+S T P+ IS F E +A Sbjct: 100 VRDFIDDSLYNPNYGYFSHHATI------FTPPKDGFDIISFRDVAHFQETVAERYEREY 153 Query: 65 ---------------IF------------------LICAWEQHGFP-SCVRLVELGPGRG 90 ++ ++ +++ + FP + + E+G G G Sbjct: 154 GLEPTEGAQGGLGRQVWHTPTELFKPYYARSLISAIVQSYKLYHFPSEPLIIYEIGAGNG 213 Query: 91 IMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQ--LASYGDKINWYTSLADVPL 148 M+D L + PD F+ ++E S L Q+++ +G K+ Sbjct: 214 SFMIDSLTYLRDHHPDVFARTKYRIIEISSALAKGQRERAEKEGFGKKVQVLNKDFFQWD 273 Query: 149 G------FTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLT 202 G F+VA E FD+ + R + S F + I ++ Sbjct: 274 GVGGGDQPCFVVALEVFDNFAHDMVRYELDTLTPRQAVVGIDASGDFTLLYETINDPLIS 333 >gi|328769829|gb|EGF79872.1| hypothetical protein BATDEDRAFT_11879 [Batrachochytrium dendrobatidis JAM81] Length = 435 Score = 64.7 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 30/202 (14%), Positives = 59/202 (29%), Gaps = 53/202 (26%) Query: 22 DQYFALCVADPEFGYYSTCNPFGAV-------------------GDFV------------ 50 + + + P +GY+S + GD Sbjct: 37 RDFIDMSLYHPVYGYFSKKAYIFSPPETIQFNSIKDNYEFMNHMGDMYKDVEEEYIHDSD 96 Query: 51 -------TAPEI-SQIFGEMLAIFLICAWEQHGF-PSCVRLVELGPGRGIMMLDILRVIC 101 T E+ +G +A ++ +++ + + E+G G G +M +I+ I Sbjct: 97 IARQVWHTPAELFKPYYGYAIAKHMVSEYKRDPRGADRMIIYEVGAGNGTLMANIMDYIL 156 Query: 102 KLKPDFFSVLSIYMVETSERLTLIQKKQ-------------LASYGDKINWYTSLADVPL 148 +P+ + + ++E S +LT Q K V Sbjct: 157 INEPEIYKTIEYNIIEISSQLTGKQHKSKLLKSHKKGSIGSHRGVQIINKSILEWDKVVS 216 Query: 149 GFTFLVANEFFDSLPIKQFVMT 170 G F +A E D+ Sbjct: 217 GACFFIAMEVIDNFSHDLVRYD 238 >gi|58265706|ref|XP_570009.1| hypothetical protein [Cryptococcus neoformans var. neoformans JEC21] gi|57226241|gb|AAW42702.1| conserved hypothetical protein [Cryptococcus neoformans var. neoformans JEC21] Length = 545 Score = 64.7 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 41/240 (17%), Positives = 76/240 (31%), Gaps = 64/240 (26%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPE-----IS----QIFGEMLA------- 64 V + + +P +GY+S T P+ IS F E +A Sbjct: 100 VRDFIDDSLYNPNYGYFSHHATI------FTPPKDGFDIISFRDVAHFQETVAERYEREY 153 Query: 65 ---------------IF------------------LICAWEQHGFP-SCVRLVELGPGRG 90 ++ ++ +++ + FP + + E+G G G Sbjct: 154 GLEPTEGAQGGLGRQVWHTPTELFKPYYARSLISAIVQSYKLYHFPSEPLIIYEIGAGNG 213 Query: 91 IMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQ--LASYGDKINWYTSLADVPL 148 M+D L + PD F+ ++E S L Q+++ +G K+ Sbjct: 214 SFMIDSLTYLRDHHPDVFARTKYRIIEISSALAKGQRERAEKEGFGMKVQVLNKDFFQWD 273 Query: 149 G------FTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLT 202 G F+VA E FD+ + R + S F + I ++ Sbjct: 274 GVGGGDQPCFVVALEVFDNFAHDMVRYELDTLTPRQAVVGIDASGDFTLLYETINDPLIS 333 >gi|298709956|emb|CBJ31678.1| conserved unknown protein [Ectocarpus siliculosus] Length = 549 Score = 64.7 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 38/280 (13%), Positives = 75/280 (26%), Gaps = 80/280 (28%) Query: 22 DQYFALCVADPEFGYYSTCNPFGAVGD------FV------------------------T 51 + A + + + GY++T + D F T Sbjct: 55 RDFIAKSLYNQDNGYFATKDVIN---DLPGPLEFRNMMGELHYRMDVKKAYESKLSAWMT 111 Query: 52 APEI-SQIFGEMLAIFLICAWEQH------------------------------------ 74 E S F LA ++ ++ Sbjct: 112 PVETFSPHFSHALARWMTKEAKKTSTASSERANGAGDALPRRRKLAQGRNKGTGAGACVG 171 Query: 75 GFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYG 134 G + + E G G G L++L + + +P + +VE S RL +Q +++ S Sbjct: 172 GEGESLVVYEAGGGTGTNALNVLDWLQQQEPKLYERTEYTIVEISPRLAELQTERVCSVH 231 Query: 135 DKIN-------WYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSL 187 + + +V F + E D+LP + E D Sbjct: 232 ENCRVVNANVLEWGMSGEVDPRPCFFLGMELLDNLPHDKIAWVAPSSTEEQGDAGTPQLC 291 Query: 188 VFNIGDHE---IKSNFLTCSDYFLGAIFENSPCRDREMQS 224 + + + F D + + P ++ Sbjct: 292 ETVVVETPGGGFREMFRPVRDTLIRQLLTVCPELAAMIRD 331 >gi|301108061|ref|XP_002903112.1| conserved hypothetical protein [Phytophthora infestans T30-4] gi|262097484|gb|EEY55536.1| conserved hypothetical protein [Phytophthora infestans T30-4] Length = 419 Score = 64.4 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 33/193 (17%), Positives = 64/193 (33%), Gaps = 41/193 (21%) Query: 22 DQYFALCVADPEFGYYSTCNP-----------FGA---VGDF---------------VTA 52 ++ + E GY++T FG G++ +T Sbjct: 39 REFIYASLYAKEAGYFTTQERNVLHVPTESIDFGNLWGAGEYHNMVAELYKESPEAWLTP 98 Query: 53 PEI-SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVL 111 E+ + + + LA + F + + E+G G G L IL + + PD ++ Sbjct: 99 VEVFAPYYSQALARY---MLNSPFFRQELEIFEIGGGSGSNALHILNYLKEQAPDVYAKT 155 Query: 112 SIYMVETSERLTLIQKKQLASYGD--------KINWYTSLADVPLGFTFLVANEFFDSLP 163 ++E S + Q+ ++A I + F +A E D+LP Sbjct: 156 KYTLIEISPVMAERQRNRVAVVHPEQCSVVNQDILTFADTHAPVNSQCFFLALEVLDNLP 215 Query: 164 IKQFVMTEHGIRE 176 + + E Sbjct: 216 HDKVTVQNGKWYE 228 >gi|241949645|ref|XP_002417545.1| conserved hypothetical protein [Candida dubliniensis CD36] gi|223640883|emb|CAX45200.1| conserved hypothetical protein [Candida dubliniensis CD36] Length = 540 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 54/168 (32%), Gaps = 14/168 (8%) Query: 56 SQIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIY 114 +GE LA +++ ++ + +P + E+G G G +M DIL I + +P+ + Sbjct: 193 HPYYGEALARYILVNYKLNVYPYNDLIIYEMGGGNGTLMCDILNYIREHQPEIYERTQYK 252 Query: 115 MVETSERLTLIQKKQL------------ASYGDKINWYTSLADVPLGFTFLVANEFFDSL 162 ++E S +L Q K + V F +A E FD+ Sbjct: 253 IIEISSQLASKQMKNALDNKLTSQGLDSSKLEIFNKSIFQWKKVVDDPCFFIALEVFDNF 312 Query: 163 PIKQFVMTEHGIRER-MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG 209 + + + F E+ + Sbjct: 313 SHDLVRYDNDTGKPYEGKVVIDENGDFFEFFTPELSYYTNAYLNLREN 360 >gi|66815785|ref|XP_641909.1| hypothetical protein DDB_G0278747 [Dictyostelium discoideum AX4] gi|60469994|gb|EAL67975.1| hypothetical protein DDB_G0278747 [Dictyostelium discoideum AX4] Length = 562 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 42/277 (15%), Positives = 91/277 (32%), Gaps = 42/277 (15%) Query: 21 VDQYFALCVADPEFGYYSTCNPF----------GAVGDF------------------VTA 52 + + + + ++GY+ T ++G+F +T Sbjct: 136 MRDFIQESLYNKDYGYFQTKEVIIPSDIEIPKLNSLGNFKEYTNYLHYIYKSLEHAWLTP 195 Query: 53 PEI-SQIFGEMLAIFLICAWEQH---GFPSCVRLVELGPGRGIMMLDILRVICKLKPDFF 108 EI + ++ ++I ++Q S +++ E+G G G L+IL I + + Sbjct: 196 VEIFKPYYSWSISNYIIEKFKQIQIKNSDSKLKIYEIGAGSGTNALNILNHIRDNHKEIY 255 Query: 109 SVLSIYMVETSERLTLIQKKQLASYGD-------KINWYTSLADVPLGFTFLVANEFFDS 161 + ++E S+ L Q ++ N F++ E D+ Sbjct: 256 ENMEYKIIEISKPLAQKQLTRIKQSHPKLNINITNANILEWNQAEEKDECFIILTEVIDN 315 Query: 162 LPIKQFVMTEHGIRERMIDIDQHDSLVFN---IGDHEIKSNFLTCSDYFLGAIFENSPCR 218 LP + +T +GI E ++D +++ I +I S I + Sbjct: 316 LPHDKITITSNGIFESIVDSTVFETVKGKDNIIQKKQISGFHCEDSRPLTDPIIKEYLEY 375 Query: 219 DREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAV 255 ++ M S S + + +T Sbjct: 376 EKSMDSNSLSSFINNIINFYKVNIKSFFKGDETKYLP 412 >gi|159471397|ref|XP_001693843.1| predicted protein [Chlamydomonas reinhardtii] gi|158283346|gb|EDP09097.1| predicted protein [Chlamydomonas reinhardtii] Length = 785 Score = 64.0 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 28/140 (20%), Positives = 50/140 (35%), Gaps = 30/140 (21%) Query: 21 VDQYFALCVADPEFGYYSTCNP-FGAVG-------------D-------------FVTAP 53 V + + P GY++ P G+ G D F+T Sbjct: 136 VRDFIQNSLYHPTLGYFNAPTPPVGSGGGINYWKIYCKDEFDVIIHNKYKELETSFLTPA 195 Query: 54 EI-SQIFGEMLAIFLICAWEQH--GFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV 110 E+ S +G +A ++ H + +VE+G G G + D+L + +P + Sbjct: 196 EMFSPWYGACIARHMVEHRRHHLGMEGQPLHIVEVGGGNGTLARDVLDWLRDHRPSDYRQ 255 Query: 111 LSIYMVETSERLTLIQKKQL 130 S +E S L Q ++ Sbjct: 256 TSYTCLEISTSLAARQYDKV 275 >gi|254571935|ref|XP_002493077.1| hypothetical protein [Pichia pastoris GS115] gi|238032875|emb|CAY70898.1| Hypothetical protein PAS_chr3_0847 [Pichia pastoris GS115] gi|328352908|emb|CCA39306.1| Protein midA homolog [Pichia pastoris CBS 7435] Length = 491 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 52/159 (32%), Gaps = 12/159 (7%) Query: 57 QIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 +GE LA +L+ + + +P + E+G G G +ML+IL I + +PD + + Sbjct: 160 PFYGEALARYLLVNYMLNQYPYNDLVIYEMGGGNGTLMLNILNYIREKQPDVYERTQYKI 219 Query: 116 VETSERLTLIQ-----------KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPI 164 +E S L+ Q +V F +A E FD+ Sbjct: 220 IEISHNLSTKQEETALQLKAKDHDHHGKVQIINKSIFDWKEVEPNPCFFIALEVFDNFAH 279 Query: 165 KQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTC 203 + + F + ++ Sbjct: 280 DVIRYDNTTREPYQGYVVIDEHGDFKEFFTKDLDDWANA 318 >gi|68464857|ref|XP_723506.1| hypothetical protein CaO19.4820 [Candida albicans SC5314] gi|68465234|ref|XP_723316.1| hypothetical protein CaO19.12283 [Candida albicans SC5314] gi|46445343|gb|EAL04612.1| hypothetical protein CaO19.12283 [Candida albicans SC5314] gi|46445540|gb|EAL04808.1| hypothetical protein CaO19.4820 [Candida albicans SC5314] gi|238878631|gb|EEQ42269.1| conserved hypothetical protein [Candida albicans WO-1] Length = 540 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 29/168 (17%), Positives = 54/168 (32%), Gaps = 14/168 (8%) Query: 56 SQIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIY 114 +GE LA +++ ++ + +P + E+G G G +M DIL I K +P+ + Sbjct: 193 HPYYGEALARYILVNYKLNVYPYNDLIIYEMGGGNGTLMCDILNYIRKHQPEIYEKTQYK 252 Query: 115 MVETSERLTLIQKKQL------------ASYGDKINWYTSLADVPLGFTFLVANEFFDSL 162 ++E S +L Q K + V F +A E FD+ Sbjct: 253 IIEISSQLASKQMKNALDNKLTSQGLDSSKLEIFNKSIFEWKKVVDDPCFFIALEVFDNF 312 Query: 163 PIKQFVMTEHGIRER-MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG 209 + + + F E+ + Sbjct: 313 SHDLIRYDNDSGKPYEGKVVIDENGDFFEFFTPELSYYSNAYLNLREN 360 >gi|71003794|ref|XP_756563.1| hypothetical protein UM00416.1 [Ustilago maydis 521] gi|46096094|gb|EAK81327.1| hypothetical protein UM00416.1 [Ustilago maydis 521] Length = 1878 Score = 62.8 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 26/150 (17%), Positives = 57/150 (38%), Gaps = 6/150 (4%) Query: 57 QIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 + +A +L+ ++ H +P + ELG G G + IL + + +P+ ++ + Sbjct: 1527 PYYAHAIARYLVAEYKLHNYPYDDLVVYELGAGSGALARGILDYLEQNEPEIYTRTRYKI 1586 Query: 116 VETSERLTLIQKKQLASYGDKINWYTSLADVPL-----GFTFLVANEFFDSLPIKQFVMT 170 VE S RL Q+++L + ++ F++A E FD+L + Sbjct: 1587 VEISARLAAEQRRKLGGHVERSEVVNQDILTWNRGVVQEPCFVIALEVFDNLAHDVVRFS 1646 Query: 171 EHGIRERMIDIDQHDSLVFNIGDHEIKSNF 200 + + D+ + + Sbjct: 1647 TKDLTPYQAVVSIDDTGDMHELWEPLSDAL 1676 Score = 37.0 bits (84), Expect = 4.2, Method: Composition-based stats. Identities = 4/22 (18%), Positives = 10/22 (45%) Query: 22 DQYFALCVADPEFGYYSTCNPF 43 + + +PE+GY++ Sbjct: 1347 RDFIHDSLYNPEYGYFTKQAVL 1368 >gi|323507937|emb|CBQ67808.1| conserved hypothetical protein [Sporisorium reilianum] Length = 636 Score = 62.8 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 31/216 (14%), Positives = 68/216 (31%), Gaps = 15/216 (6%) Query: 57 QIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 + +A +L+ ++ H +P + ELG G G + IL + +P+ + + Sbjct: 292 PHYAHAIARYLVAEYKLHHYPYDDLAIYELGAGSGALAHGILDYLRAHEPEIYVRTRYRI 351 Query: 116 VETSERLTLIQKKQLASYGDKINWYTSLADVP-----LGFTFLVANEFFDSLPIKQFVMT 170 VE S RL Q+++L ++G+++ F+VA E D+L Sbjct: 352 VEISARLAAEQRRKLRAFGERVEVVNEDVLGWSRGVVQEPCFVVALEVLDNLAHDVVRFG 411 Query: 171 EHGIRERMIDIDQHDSLVFNIGDHEIKSNFL-------TCSDYFLGAIFENSPCRDREMQ 223 G+ + ++ + + + S + P + Sbjct: 412 TVGLEAYQAVVSVDETGDMHELWQPLSDPLIREYLDTVPLSPLASSPLRFLPPRLREAVA 471 Query: 224 SISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHT 259 + V + + + H Sbjct: 472 KHAPFFPNLTAPTYVPT--HALRLMHTLHRCFPQHR 505 >gi|255729930|ref|XP_002549890.1| conserved hypothetical protein [Candida tropicalis MYA-3404] gi|240132959|gb|EER32516.1| conserved hypothetical protein [Candida tropicalis MYA-3404] Length = 545 Score = 62.4 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 54/185 (29%), Gaps = 14/185 (7%) Query: 56 SQIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIY 114 +GE LA +++ ++ + +P + E+G G G +M DIL I K P+ + Sbjct: 198 HPYYGEALARYILVNYKLNIYPYDDLIIYEMGGGNGTLMCDILNYIRKNHPEIYERTQYK 257 Query: 115 MVETSERLTLIQKKQL------------ASYGDKINWYTSLADVPLGFTFLVANEFFDSL 162 ++E S +L Q KQ + V F +A E FD+ Sbjct: 258 IIEISSQLANKQMKQALDNKLTSQGLDSSKLEIFNKSIFEWKQVVNDPCFFIALEVFDNF 317 Query: 163 PIKQFVMTEHGIRER-MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDRE 221 + + + ++ Sbjct: 318 SHDLIRYDNTTGQPYEGKVLIDQHGDFYEFFSPDLSHYSDAYLRLRENGKHSVLKQASTF 377 Query: 222 MQSIS 226 +S Sbjct: 378 SGKLS 382 >gi|256787002|ref|ZP_05525433.1| hypothetical protein SlivT_21148 [Streptomyces lividans TK24] Length = 312 Score = 62.4 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 50/258 (19%), Positives = 84/258 (32%), Gaps = 21/258 (8%) Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYM 115 S +F +A L E G P+ + V++ GRG ++ V+ L D + Y Sbjct: 4 SPLFAGAVARLLCRVDEALGRPAVLDFVDMAAGRGELVAG---VLAALPADVAARTRAYA 60 Query: 116 VETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIR 175 VE + ++ W D G L ANE+ D++P+ + G+ Sbjct: 61 VEV--------AARPEGLDRRVQWLARPPDGVTG--LLFANEWLDNVPVDVAEVDAEGVA 110 Query: 176 ERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGT 235 R++ + R+ S A D G Sbjct: 111 RRVLVRGDGAERLGEPVAGAEAEWLARWWPMPAEEGRRAEIGLPRDEAWASAVAALDAGL 170 Query: 236 AIVIDYGYLQS--RVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLS----SIAILYK 289 A+ DY + S TL + P+ + D+++HV + + Sbjct: 171 AVAADYAHSVSARPPFGTLTGFREGRETEPVPDGS-CDITAHVALDACAAAHTARCTPEY 229 Query: 290 LYINGLT-TQGKFLEGLG 306 N LT Q + L LG Sbjct: 230 APPNALTRPQREILRALG 247 >gi|294657831|ref|XP_460125.2| DEHA2E18942p [Debaryomyces hansenii CBS767] gi|199432982|emb|CAG88395.2| DEHA2E18942p [Debaryomyces hansenii] Length = 622 Score = 62.0 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 29/153 (18%), Positives = 54/153 (35%), Gaps = 16/153 (10%) Query: 57 QIFGEMLAIFLICAWEQHGFPSCV--RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIY 114 +GE LA +L+ ++ +G+ + E+G G G +M +IL I + +PD ++ Sbjct: 270 PYYGEALARYLLVNYKLNGYYPYNDLIIYEMGGGNGTLMCNILNYIKENQPDVYARTQYK 329 Query: 115 MVETSERLTLIQKKQL------------ASYGDKINWYTSLADVPLGFTFLVANEFFDSL 162 ++E S +L Q + + + F +A E FD+ Sbjct: 330 IIEISSQLAEKQYSNALKSKLISQGLDSSKLEIINKSIFNWDRIVEDPCFFIALEVFDNF 389 Query: 163 PIKQFVMTEHG--IRERMIDIDQHDSLVFNIGD 193 E + ID+H Sbjct: 390 AHDLIRYDNVTGEPYEGHVLIDEHGDFYEFFTP 422 Score = 36.6 bits (83), Expect = 5.7, Method: Composition-based stats. Identities = 10/40 (25%), Positives = 18/40 (45%), Gaps = 6/40 (15%) Query: 2 ENKLIRKIVN--LIKKNGQMTVDQYFALCVADPEFGYYST 39 +KL R++ IK M + + +P +GY+S Sbjct: 132 SSKLARRLHRPKKIK----MLASDFIDDSLYNPNYGYFSQ 167 >gi|255728365|ref|XP_002549108.1| conserved hypothetical protein [Candida tropicalis MYA-3404] gi|240133424|gb|EER32980.1| conserved hypothetical protein [Candida tropicalis MYA-3404] Length = 545 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 55/185 (29%), Gaps = 14/185 (7%) Query: 56 SQIFGEMLAIFLICAWEQHGFPSCV-RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIY 114 +GE LA +++ ++ + +P + E+G G G +M DIL I K +P+ + Sbjct: 198 HPYYGEALARYILMNYKLNVYPYDDLIIYEMGGGNGTLMCDILNYIRKNQPEIYERTQYK 257 Query: 115 MVETSERLTLIQKKQL------------ASYGDKINWYTSLADVPLGFTFLVANEFFDSL 162 ++E S +L Q KQ + V F +A E FD+ Sbjct: 258 IIEISSQLANKQMKQALDNKLTSQGLDSSKLEIFNKSIFEWKQVVNDPCFFIALEVFDNF 317 Query: 163 PIKQFVMTEHGIRER-MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDRE 221 + + + ++ Sbjct: 318 SHDLIRYDNATGQPYEGKVLIDQHGDFYEFFSPDLSHYSDAYLRLRENGKHSVLKQASTF 377 Query: 222 MQSIS 226 +S Sbjct: 378 SGKLS 382 >gi|149050628|gb|EDM02801.1| similar to PRO1853 homolog, isoform CRA_b [Rattus norvegicus] Length = 99 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 14/70 (20%), Positives = 25/70 (35%), Gaps = 1/70 (1%) Query: 285 AILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSADKKSMGEL 344 ++ G Q FL+ +GI R L+ + L + R + + MGE Sbjct: 1 MAQGRVASLGPVEQRTFLKNMGIDVRLKVLLDKAGDPS-LQQQLLRGYDMLMNPQKMGER 59 Query: 345 FKILVVSHEK 354 F + + Sbjct: 60 FHFFALLPHQ 69 >gi|225462783|ref|XP_002263790.1| PREDICTED: hypothetical protein [Vitis vinifera] Length = 450 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 35/274 (12%), Positives = 75/274 (27%), Gaps = 45/274 (16%) Query: 3 NKLI-RKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCN----------PF----GAVG 47 LI I + + V + + DP GY+S + F G Sbjct: 28 TPLIPAFFSTHIVGDRPVLVRDFIWSALYDPNHGYFSHRSGSVGMLEESIKFNRLQGRKA 87 Query: 48 ------------D--FVTAPEI-SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIM 92 D + T E+ + +A + +++ E+G G G Sbjct: 88 YMKYLEGIYSQNDIAWFTPVELFKPWYAYGIAEAI---LRTADLSVPLKIYEIGGGSGTC 144 Query: 93 MLDILRVICKLKPDF-FSVLSIYMVETSERLTLIQKKQLASYGDKINWY----------T 141 I+ + P ++ +S VE S L Q + + + ++ + Sbjct: 145 AKGIMDYLMLNAPARVYNSMSYISVEISSSLAKKQMETVGAVRSHLSKFRVECHDAADKN 204 Query: 142 SLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFL 201 + + +++ E D+LP + +E+ M + + + Sbjct: 205 AWGSIDQQPCWVIMLEVLDNLPHD-LIYSENQASPWMEVWVEKQHNRVALSELYKPLQDP 263 Query: 202 TCSDYFLGAIFENSPCRDREMQSISDRLACDGGT 235 E S + + Sbjct: 264 LIKRCVEIIDLEKDHNTQGRAISAARHIWSKVFP 297 >gi|150866871|ref|XP_001386609.2| hypothetical protein PICST_33621 [Scheffersomyces stipitis CBS 6054] gi|149388127|gb|ABN68580.2| predicted protein [Scheffersomyces stipitis CBS 6054] Length = 621 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 27/231 (11%), Positives = 65/231 (28%), Gaps = 18/231 (7%) Query: 57 QIFGEMLAIFLICAWEQHGFPSCV--RLVELGPGRGIMMLDILRVICKLKPDFFSVLSIY 114 +GE +A +++ + +G + E+G G G +M +IL I +PD ++ Sbjct: 269 PFYGEAIARYILVNYHLNGTYPYEDLIIYEMGGGNGTLMCNILNYIRNTQPDVYARTQYK 328 Query: 115 MVETSERLTLIQKKQ------------LASYGDKINWYTSLADVPLGFTFLVANEFFDSL 162 ++E S +L Q + + V + +A E FD+ Sbjct: 329 IIEISSQLASKQMENALKNKLVNQGLDQSKLEIINKSIFKWDKVVHDPCYFIALEVFDNF 388 Query: 163 PIKQFVMTEHGIRER-MIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDRE 221 + + + E+ + + Sbjct: 389 AHDVIRYDNVTGQPYEGKVLVDEHGDFYEFYTPELSYYTNAYLQLRENGEYSVLKQSNTL 448 Query: 222 MQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADL 272 + + D + + + + ++ + N A+ Sbjct: 449 SAKMDTF---KSLVPFITDKDKIHPLLQSSTKLKWQNSLLPFKDNMTPAEF 496 Score = 38.2 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 5/25 (20%), Positives = 12/25 (48%) Query: 19 MTVDQYFALCVADPEFGYYSTCNPF 43 M+ + + +P++GY+S Sbjct: 148 MSTADFIEDSLYNPQYGYFSKEVSI 172 >gi|20307022|gb|AAH28524.1| RIKEN cDNA 2410091C18 gene [Mus musculus] Length = 99 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 17/73 (23%), Positives = 27/73 (36%), Gaps = 7/73 (9%) Query: 285 AILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR---KDILLDSVKRLVSTSADKKSM 341 K+ G Q FL+ +GI R L+ + K LL L+ + + M Sbjct: 1 MAQGKVASLGPVEQRTFLKNMGIDVRLKVLLDKAGEPSAKQQLLGGYDMLM----NPQKM 56 Query: 342 GELFKILVVSHEK 354 GE F + + Sbjct: 57 GERFHFFALLPHQ 69 >gi|124504320|gb|AAI28475.1| 2410091C18Rik protein [Mus musculus] gi|124504511|gb|AAI28474.1| 2410091C18Rik protein [Mus musculus] gi|144719300|gb|AAI16214.1| RIKEN cDNA 2410091C18 gene [Mus musculus] gi|144719310|gb|AAI16215.1| RIKEN cDNA 2410091C18 gene [Mus musculus] Length = 99 Score = 60.5 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 17/73 (23%), Positives = 28/73 (38%), Gaps = 7/73 (9%) Query: 285 AILYKLYINGLTTQGKFLEGLGIWQRAFSLMK---QTARKDILLDSVKRLVSTSADKKSM 341 K+ G Q FL+ +GI R L+ + + K LL L+ + + M Sbjct: 1 MAQGKVASLGPVEQRTFLKNMGIDVRLKVLLDRAGEPSAKQQLLGGYDMLM----NPQKM 56 Query: 342 GELFKILVVSHEK 354 GE F + + Sbjct: 57 GERFHFFALLPHQ 69 >gi|329905396|ref|ZP_08274118.1| Uncharacterized conserved protein [Oxalobacteraceae bacterium IMCC9480] gi|327547604|gb|EGF32400.1| Uncharacterized conserved protein [Oxalobacteraceae bacterium IMCC9480] Length = 75 Score = 58.6 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 18/62 (29%), Positives = 27/62 (43%), Gaps = 5/62 (8%) Query: 293 NGLTTQGKFLEGLGIWQRAFSLMKQT-ARKDILLDSVKRLVSTSADKKSMGELFKILVVS 351 G +Q FL G+ AR ++V++L S MGELFK+L+V Sbjct: 2 LGYLSQASFLLEAGLGDLLLRTSPDDGARYLPQANAVQKLTS----PAEMGELFKVLIVG 57 Query: 352 HE 353 + Sbjct: 58 KQ 59 >gi|7959841|gb|AAF71091.1|AF116721_71 PRO1853 [Homo sapiens] Length = 99 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 17/73 (23%), Positives = 28/73 (38%), Gaps = 7/73 (9%) Query: 285 AILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR---KDILLDSVKRLVSTSADKKSM 341 K+ G Q FL+ +GI R L+ ++ + LL L+ + K M Sbjct: 1 MAQGKVASLGPIKQHTFLKNMGIDVRLKVLLDKSNEPSVRQQLLQGYDMLM----NPKKM 56 Query: 342 GELFKILVVSHEK 354 GE F + + Sbjct: 57 GERFNFFALLPHQ 69 >gi|297159344|gb|ADI09056.1| hypothetical protein SBI_05936 [Streptomyces bingchenggensis BCW-1] Length = 432 Score = 57.8 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 45/236 (19%), Positives = 82/236 (34%), Gaps = 23/236 (9%) Query: 56 SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGI----MMLDILRVICKLKPDFFSVL 111 S + +A L G P+ + +V++G GRG ++ + + L Sbjct: 4 SPRYAAAVAELLARVDRALGHPAELAMVDMGAGRGELLTGVLACAAGMAARRGSGLAERL 63 Query: 112 SIYMVETSERLTLIQKKQLASYGDKINWYTSLADVPLGF--TFLVANEFFDSLPIKQFVM 169 Y VE + R +I W L ++P L ANE+ D++P+ Sbjct: 64 RPYAVERAPR--------PPGLDPRITWLGDLGELPRDGLDGLLFANEWLDNVPVDVAEA 115 Query: 170 TEHGIRERMIDIDQHDSLVFNIG----DHEIKSNFLTCSDYFLGAIFENSPCRDREMQSI 225 G R++ + + D + + + + G E RD Sbjct: 116 DADGAWRRVLVATEDGAERLGEQVGGADADWLARWWPSARDEPGLRAEIGRPRDEAWAEA 175 Query: 226 SDRLACDGGTAIVIDYGYLQ--SRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQ 279 L G A+ +DY + + + TL + V P+ + DL++HV Sbjct: 176 VRALR--SGLAVAVDYAHERGARPLFGTLTGFREGREVRPVPDGS-CDLTAHVALD 228 >gi|90082737|dbj|BAE90550.1| unnamed protein product [Macaca fascicularis] Length = 99 Score = 57.8 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 17/71 (23%), Positives = 27/71 (38%), Gaps = 7/71 (9%) Query: 285 AILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTAR---KDILLDSVKRLVSTSADKKSM 341 K+ G Q FL+ +GI R L+ ++ + LL L+ + K M Sbjct: 1 MAQGKVASLGPIKQHTFLKNMGIDVRLKVLLDKSNEPSVRQQLLQGYDMLM----NPKKM 56 Query: 342 GELFKILVVSH 352 GE F + Sbjct: 57 GERFNFFALLP 67 >gi|291445665|ref|ZP_06585055.1| conserved hypothetical protein [Streptomyces roseosporus NRRL 15998] gi|291348612|gb|EFE75516.1| conserved hypothetical protein [Streptomyces roseosporus NRRL 15998] Length = 181 Score = 57.4 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 33/142 (23%), Positives = 56/142 (39%), Gaps = 16/142 (11%) Query: 26 ALCVADPEFGYYST--CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLV 83 + E G+Y + +P G G F T+ S +F +A L + V LV Sbjct: 36 QAALYGDE-GFYRSPVRSPEGPAGHFRTSVHASPLFAGAVARLLTGIARELDT-GTVALV 93 Query: 84 ELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSL 143 ++G GRG ++ +L V+ ++ Y VE + +I W Sbjct: 94 DVGAGRGELLTGVLDVLAARPDGPD--VTPYAVEV--------AARPPGLDPRIEWCAEP 143 Query: 144 ADVPLGFTFLVANEFFDSLPIK 165 G L ANE+ D++P++ Sbjct: 144 PAGVAG--LLFANEWLDNVPVE 163 >gi|330791097|ref|XP_003283631.1| hypothetical protein DICPUDRAFT_26158 [Dictyostelium purpureum] gi|325086491|gb|EGC39880.1| hypothetical protein DICPUDRAFT_26158 [Dictyostelium purpureum] Length = 546 Score = 57.0 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 31/262 (11%), Positives = 84/262 (32%), Gaps = 40/262 (15%) Query: 21 VDQYFALCVADPEFGYYSTCNPFGAVG-------------------DFV---------TA 52 + + + + ++GY+ T + ++ T Sbjct: 118 MRDFIQDSLYNQDYGYFQTKEVIISPNIKIPPLNSLKNFREYNNYLHYIYKSLEHAWLTP 177 Query: 53 PEI-SQIFGEMLAIFLICAWEQ-HGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV 110 E+ + +A ++I +++ E+G G G L+IL + + + Sbjct: 178 VELYKPYYSWSIANYIIDKHNNREDKSKNLKIYEIGAGSGTNALNILNFMRSNHLEIYKT 237 Query: 111 LSIYMVETSERLTLIQKKQLASYGD-------KINWYTSLADVPLGFTFLVANEFFDSLP 163 + ++E S+ L L Q ++ + + + F++ E D+LP Sbjct: 238 MEYKIIEISKPLALKQFNRIKADHPALNIQINNTSIFNWTHKTEEDECFIILTEVIDNLP 297 Query: 164 IKQFVMTEHGIRE---RMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDR 220 + V++ +GI E + + + +H+ + + I + + Sbjct: 298 HDKIVLSSNGIFESTVCSTVFETFEKSKSILSNHKSSGYYREETRQLTDPIIKEYLEYEE 357 Query: 221 EMQSISDRLACDGGTAIVIDYG 242 ++ S + ++ Sbjct: 358 HIKKSSSFFPRIPMLVSIKEFY 379 >gi|328855125|gb|EGG04253.1| hypothetical protein MELLADRAFT_49211 [Melampsora larici-populina 98AG31] Length = 462 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 36/231 (15%), Positives = 72/231 (31%), Gaps = 49/231 (21%) Query: 22 DQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM---------LAI------- 65 ++ + +P +GY++T + +PE FG++ +A Sbjct: 80 REFIEDSLCNPHYGYFATQVEI------LDSPEGGIPFGKLSNLRALEEEIANRYGPRQS 133 Query: 66 ----------FLI-----CAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV 110 + C +H +R+ E+G G G + +L I P +S Sbjct: 134 WHTPTELFKPWYANAVARCLLSRHQPNQPLRIYEIGAGNGTLCEGVLNYIRDHAPTLYSQ 193 Query: 111 LSIYMVETSERLTLIQKKQLASYGDKINWYTSLAD------------VPLGFTFLVANEF 158 + +VE S RL IQ + + G K+ + +++A E Sbjct: 194 TTYTIVEVSARLAGIQAARASRAGHKLEIIPHNVFSLPQEQGTSTSPIESEPCWVIAMEL 253 Query: 159 FDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLG 209 D+L + + ++ F+ I +L Sbjct: 254 VDNLARDVVRYDRTTGEVLVASVGTDENGEFHEFFEPICDQVNPSLSRYLQ 304 >gi|7211976|gb|AAF40447.1|AC004809_5 F13M7.11 [Arabidopsis thaliana] Length = 479 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 36/286 (12%), Positives = 74/286 (25%), Gaps = 76/286 (26%) Query: 18 QMTVDQYFALCVADPEFGYYSTCNP-FG-------------------------AVGD--- 48 + V + + DP GY+S + G D Sbjct: 41 PVLVRDFIHTALYDPIQGYFSQRSKSVGVLERSIKFNQLEGRKAYMKLLEKVYKQSDISW 100 Query: 49 FVTAPEI-SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDF 107 F T E+ + +A + +++ E+G G G +L I P+ Sbjct: 101 F-TPVELFKPWYAHGIAEAI---LRTTNLSVPLKIYEIGGGSGTCAKGVLDYIMLNAPER 156 Query: 108 -FSVLSIYMVETSERLTLIQKK-------QLASYGDKINWYTSLADVPL----------- 148 + +S +E S L IQK+ L+ + + + LA Sbjct: 157 IYKNMSYTSIEISPSLAKIQKETVAQVGSHLSKFRVECRDASDLAGWSKVLFVQHKADSG 216 Query: 149 --------------------GFTFLVANEFFDSLPIKQFVMT---EHGIRERMIDIDQHD 185 +++ E D+LP + + + + Sbjct: 217 FQFPVAMLEFVNFQTENVEQQPCWVIMLEVLDNLPHDLVYSKSQLSPWMEVLVENKPERY 276 Query: 186 SLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLAC 231 + + + D + E D + + + Sbjct: 277 REYLAQMNEALSELYKPLEDPLIKRCIEIVEHEDDPVSKPKEIWSK 322 >gi|296090510|emb|CBI40841.3| unnamed protein product [Vitis vinifera] Length = 458 Score = 54.3 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 34/297 (11%), Positives = 73/297 (24%), Gaps = 68/297 (22%) Query: 3 NKLI-RKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCN-------------------- 41 LI I + + V + + DP GY+S + Sbjct: 28 TPLIPAFFSTHIVGDRPVLVRDFIWSALYDPNHGYFSHRSGSVGMLEESIKFNRLQGIFK 87 Query: 42 ---------PFGAVGDF----------------------VTAPEI-SQIFGEMLAIFLIC 69 G F T E+ + +A + Sbjct: 88 ILNVYDSSFKLYKYGAFPMCRKAYMKYLEGIYSQNDIAWFTPVELFKPWYAYGIAEAI-- 145 Query: 70 AWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDF-FSVLSIYMVETSERLTLIQKK 128 +++ E+G G G I+ + P ++ +S VE S L Q + Sbjct: 146 -LRTADLSVPLKIYEIGGGSGTCAKGIMDYLMLNAPARVYNSMSYISVEISSSLAKKQME 204 Query: 129 QLASYGDKINWY----------TSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERM 178 + + ++ + + + +++ E D+LP + +E+ M Sbjct: 205 TVGAVRSHLSKFRVECHDAADKNAWGSIDQQPCWVIMLEVLDNLPHD-LIYSENQASPWM 263 Query: 179 IDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGT 235 + + + E S + + Sbjct: 264 EVWVEKQHNRVALSELYKPLQDPLIKRCVEIIDLEKDHNTQGRAISAARHIWSKVFP 320 >gi|145484627|ref|XP_001428323.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124395408|emb|CAK60925.1| unnamed protein product [Paramecium tetraurelia] Length = 390 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 36/203 (17%), Positives = 67/203 (33%), Gaps = 39/203 (19%) Query: 17 GQMTVDQYFALCVADPEFGYYSTCNPFGAV----------G--DFV-------------T 51 +M V + + P GY+ GA+ G D+ T Sbjct: 2 KRMLVRDFIYDRLYHPVEGYFVKNIQLGALKKPIEFKQLLGYEDYTKKLAENYPENQWLT 61 Query: 52 APEI-SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSV 110 E+ +G L ++ + +R+VE+G G G +L + +P FS Sbjct: 62 PSEVFRPYYGITLGNYINQQFR-FTRKEKLRIVEIGAGYGAACEGVLYYMRNHQPQTFSN 120 Query: 111 LSIYMVETSERLTLIQKKQLASYGDKINWYTSLADVPLGF------------TFLVANEF 158 + ++V+ S + +L+ + +L F F V E Sbjct: 121 MEYHLVDISPEACAQAQIRLSQDFKQQIKKGNLRIFNQDFLNYKQHTQNNEMWFFVFLEV 180 Query: 159 FDSLPIKQFVMTEHGIRERMIDI 181 FD+L + + + E M + Sbjct: 181 FDNLAHDKVIDGKQVYVENMQEF 203 >gi|301113137|ref|XP_002998339.1| conserved hypothetical protein [Phytophthora infestans T30-4] gi|262112633|gb|EEY70685.1| conserved hypothetical protein [Phytophthora infestans T30-4] Length = 399 Score = 52.0 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 25/151 (16%), Positives = 53/151 (35%), Gaps = 33/151 (21%) Query: 22 DQYFALCVADPEFGYYSTCNP-----------FGA---VGDF---------------VTA 52 ++ + E GY++T FG G++ +T Sbjct: 40 REFIYASLYAKEAGYFTTQERNVLHVPTESIDFGNLWGAGEYHNMVAELYKESSEAWLTP 99 Query: 53 PEI-SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVL 111 E+ + + + LA + F + + E+G G G L IL + + PD ++ Sbjct: 100 VEVFAPYYSQALARY---MLNSPFFRQELEIFEIGGGSGSNALHILNYLKEQAPDVYAKT 156 Query: 112 SIYMVETSERLTLIQKKQLASYGDKINWYTS 142 ++E S + Q+ ++A + + Sbjct: 157 KYTLIEISPVMAERQRNRVAVVHPEQCSVVN 187 >gi|207109133|ref|ZP_03243295.1| hypothetical protein HpylH_07431 [Helicobacter pylori HPKX_438_CA4C1] Length = 137 Score = 51.6 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 26/118 (22%), Positives = 44/118 (37%), Gaps = 3/118 (2%) Query: 54 EISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSI 113 +S+ FG +A ++I E+ +++VE+G G + DI + L Sbjct: 2 SLSKFFGGAIAFYIIKLLEEEKLFLPLKIVEIGAHHGHFLSDIANFLNALSVGVMEQCEF 61 Query: 114 YMVETSERLTLIQ---KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFV 168 E + L +Q KQ I L F+V+NE FD+ + Sbjct: 62 ISCEPLKELQKLQRIIFKQATQLDLMICDLKDLDFKGHESAFVVSNELFDAFACEIIK 119 >gi|190345267|gb|EDK37127.2| hypothetical protein PGUG_01225 [Meyerozyma guilliermondii ATCC 6260] Length = 322 Score = 51.6 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 23/140 (16%), Positives = 40/140 (28%), Gaps = 15/140 (10%) Query: 85 LGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKK------------QLAS 132 +G G G +M ++L I + +PD ++ ++E S +L Q + + Sbjct: 1 MGGGNGTLMCNVLNYIKENQPDVYARTHYKIIEISSQLATKQMESALRAKLTREKIDNSK 60 Query: 133 YGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHG--IRERMIDIDQHDSLVFN 190 V F +A E FD+ P E + F Sbjct: 61 VEVINQSIFEWDKVVQEPCFFIALEVFDNFPHDLVRYDNTTGEPHE-GYVLVDEQGDFFE 119 Query: 191 IGDHEIKSNFLTCSDYFLGA 210 E+ T Sbjct: 120 FFTPELSYYTDTFLRLRESD 139 >gi|146419181|ref|XP_001485554.1| hypothetical protein PGUG_01225 [Meyerozyma guilliermondii ATCC 6260] Length = 322 Score = 51.6 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 23/140 (16%), Positives = 40/140 (28%), Gaps = 15/140 (10%) Query: 85 LGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKK------------QLAS 132 +G G G +M ++L I + +PD ++ ++E S +L Q + + Sbjct: 1 MGGGNGTLMCNVLNYIKENQPDVYARTHYKIIEISSQLATKQMESALRAKLTREKIDNSK 60 Query: 133 YGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHG--IRERMIDIDQHDSLVFN 190 V F +A E FD+ P E + F Sbjct: 61 VEVINQSIFEWDKVVQEPCFFIALEVFDNFPHDLVRYDNTTGEPHE-GYVLVDEQGDFFE 119 Query: 191 IGDHEIKSNFLTCSDYFLGA 210 E+ T Sbjct: 120 FFTPELSYYTDTFLRLRESD 139 >gi|229548234|ref|ZP_04436959.1| conserved hypothetical protein [Enterococcus faecalis ATCC 29200] gi|229306637|gb|EEN72633.1| conserved hypothetical protein [Enterococcus faecalis ATCC 29200] Length = 79 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 16/75 (21%), Positives = 30/75 (40%), Gaps = 7/75 (9%) Query: 232 DGGTAIVIDYGYL------QSRVGDTLQ-AVKGHTYVSPLVNPGQADLSSHVDFQRLSSI 284 G + +DYGY R T++ + H + PG D+++ VDF ++ Sbjct: 4 QRGAMLFVDYGYNRGEFYQHDRDDGTVRAFYRHHVHNDVHRWPGLQDITASVDFTAMAEA 63 Query: 285 AILYKLYINGLTTQG 299 + + G +Q Sbjct: 64 GMHGGFELAGYCSQA 78 >gi|242059645|ref|XP_002458968.1| hypothetical protein SORBIDRAFT_03g043510 [Sorghum bicolor] gi|241930943|gb|EES04088.1| hypothetical protein SORBIDRAFT_03g043510 [Sorghum bicolor] Length = 488 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 22/183 (12%), Positives = 48/183 (26%), Gaps = 21/183 (11%) Query: 50 VTAPEI-SQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRV-ICKLKPDF 107 T E+ + +A +++ E+G G G IL + P Sbjct: 123 FTPVELFKPWYAYAIA---ASILRTANLSVPLKIYEIGGGSGTCAKCILDYMMLNAPPKV 179 Query: 108 FSVLSIYMVETSERLTLIQKKQLASYGDKINWY----------TSLADVPLGFTFLVANE 157 ++ + VE S L Q + + ++ + +++ E Sbjct: 180 YNDMKYISVEISSSLAEKQLETVGEVQSHLSKFTVEHRDAINRPGWGRKDPLPCWVLMLE 239 Query: 158 FFDSLPI------KQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAI 211 D+LP Q E++ + + D + Sbjct: 240 VLDNLPHDLVYSPDQVSPWMEVWIEKVNGRKNQGKFFEMEESSQASEVYKPLQDPLISRC 299 Query: 212 FEN 214 + Sbjct: 300 VDI 302 Score = 40.9 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 9/34 (26%), Positives = 15/34 (44%), Gaps = 1/34 (2%) Query: 12 LIKKNGQMTVDQYFALCVADPEFGYYSTCN-PFG 44 ++ N + V + + DP GY+S P G Sbjct: 31 ILAGNKPVLVRDFVRSALYDPNHGYFSKRAGPVG 64 >gi|303321440|ref|XP_003070714.1| hypothetical protein CPC735_038330 [Coccidioides posadasii C735 delta SOWgp] gi|240110411|gb|EER28569.1| hypothetical protein CPC735_038330 [Coccidioides posadasii C735 delta SOWgp] Length = 294 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 22/115 (19%), Positives = 36/115 (31%), Gaps = 10/115 (8%) Query: 85 LGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDK-------- 136 +G G G +ML+IL I ++ P+ + ++E S L Q+K L Sbjct: 1 MGAGNGTLMLNILDYIREVDPEVYQRTKFKIIEISSHLADTQQKTLNGSIYDDGHRGHVE 60 Query: 137 --INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVF 189 F +A E FD+ + R + F Sbjct: 61 IINRSIFEWDTYVHSPCFFLALEVFDNFAHDAIRYDLQTGQPRQGCVLVDSDGEF 115 >gi|327401504|ref|YP_004342343.1| hypothetical protein Arcve_1628 [Archaeoglobus veneficus SNP6] gi|327317012|gb|AEA47628.1| protein of unknown function DUF185 [Archaeoglobus veneficus SNP6] Length = 298 Score = 50.1 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 10/31 (32%), Positives = 15/31 (48%) Query: 7 RKIVNLIKKNGQMTVDQYFALCVADPEFGYY 37 I I +NG ++ + L + PE GYY Sbjct: 5 EIISRFIAENGPVSFRDFVWLLLYHPEVGYY 35 >gi|302834575|ref|XP_002948850.1| hypothetical protein VOLCADRAFT_89143 [Volvox carteri f. nagariensis] gi|300266041|gb|EFJ50230.1| hypothetical protein VOLCADRAFT_89143 [Volvox carteri f. nagariensis] Length = 632 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 36/242 (14%), Positives = 67/242 (27%), Gaps = 39/242 (16%) Query: 21 VDQYFALCVADPEFGYYSTCN-PFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSC 79 V + + P GY+S P G+VG +P I W + Sbjct: 73 VRDFIQNSLYHPTLGYFSAPTPPVGSVG----SP--------------INYWRIYCRDEF 114 Query: 80 VRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINW 139 LV ++ + +P + + S +E S L Q ++ G Sbjct: 115 NILVN------KKYQELEDWLRDNRPQDYRLTSYTCLEISTSLAARQYDKVVRQGGHGGR 168 Query: 140 YT-----------SLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLV 188 + ++ TF++ E D+LP G E + L Sbjct: 169 FHLRRGSGLDPATWGSERRWEHTFVLMMEVLDNLPHDSLER---GPWEMAGEPLADPLLC 225 Query: 189 FNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRV 248 + + D + + RD + S + L A ++ + Sbjct: 226 RTLAAVYRTPSREQQLDERFNRVLDFILARDSQPSSRDEVLWLPTACAAFLELLHTLRPN 285 Query: 249 GD 250 Sbjct: 286 HS 287 >gi|239942211|ref|ZP_04694148.1| hypothetical protein SrosN15_14530 [Streptomyces roseosporus NRRL 15998] Length = 132 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 30/126 (23%), Positives = 50/126 (39%), Gaps = 13/126 (10%) Query: 40 CNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRV 99 +P G G F T+ S +F +A L + V LV++G GRG ++ +L V Sbjct: 2 RSPEGPAGHFRTSVHASPLFAGAVARLLTGIARELDT-GTVALVDVGAGRGELLTGVLDV 60 Query: 100 ICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFF 159 + ++ Y VE + +I W G L ANE+ Sbjct: 61 LAARPDGPD--VTPYAVEV--------AARPPGLDPRIEWCAEPPAGVAG--LLFANEWL 108 Query: 160 DSLPIK 165 D++P++ Sbjct: 109 DNVPVE 114 >gi|330470488|ref|YP_004408231.1| hypothetical protein VAB18032_02735 [Verrucosispora maris AB-18-032] gi|328813459|gb|AEB47631.1| hypothetical protein VAB18032_02735 [Verrucosispora maris AB-18-032] Length = 415 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 15/73 (20%), Positives = 29/73 (39%), Gaps = 3/73 (4%) Query: 25 FALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVE 84 + G++ + G F T+ S +F + L + G PS +V+ Sbjct: 7 MERALYG-TDGFFVSGA--GPAAHFRTSVHASPVFADALLRLIHHLDGVLGHPSVFDVVD 63 Query: 85 LGPGRGIMMLDIL 97 +G GRG ++ + Sbjct: 64 VGAGRGELLRALF 76 Score = 43.6 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 36/173 (20%), Positives = 67/173 (38%), Gaps = 19/173 (10%) Query: 152 FLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLV--------FNIGDHEIKSNFLTC 203 L+A E+ D++P+ V T+HG ++D + + + S+ + Sbjct: 176 LLLATEWLDNVPLDIAVHTQHGWHYLLVDPTSGEETIGDPLSRADLDWLSTWWPSSLVPD 235 Query: 204 SDYFLGAIFENSP--------CRDREMQSISDRLACDGGTAIVIDYGY--LQSRVGDTLQ 253 SD + F +P R+R+ + G A+ +DYG+ + V TL Sbjct: 236 SDSSTESGFRATPTEHARVEIGRNRDEAWADAVGNVERGLALAVDYGHLRAERPVDGTLT 295 Query: 254 AVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 + V P+ + D+++HV ++S L +Q L LG Sbjct: 296 GYRAGRQVPPVPDGS-CDVTAHVAMDSVASAGERVARCAYLLGSQRDGLRALG 347 >gi|290996947|ref|XP_002681043.1| hypothetical protein NAEGRDRAFT_78474 [Naegleria gruberi] gi|284094666|gb|EFC48299.1| hypothetical protein NAEGRDRAFT_78474 [Naegleria gruberi] Length = 350 Score = 47.0 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 17/112 (15%), Positives = 37/112 (33%), Gaps = 6/112 (5%) Query: 85 LGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGD------KIN 138 +G G G +IL + P+ + +++E S +L QK + A + + Sbjct: 1 MGGGMGTCCRNILDYLETDYPEIYKNTEYHIIEISSQLHEQQKIRCAHHVTNGKLKLHHD 60 Query: 139 WYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFN 190 + V ++A E D+ + + G + + Sbjct: 61 SFMDWKQVEQDECHVIAMEVLDNCAHDKISIDADGNVKECHVFFDKGKNYYE 112 >gi|118380412|ref|XP_001023370.1| hypothetical protein TTHERM_00446060 [Tetrahymena thermophila] gi|89305137|gb|EAS03125.1| hypothetical protein TTHERM_00446060 [Tetrahymena thermophila SB210] Length = 460 Score = 46.6 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 40/277 (14%), Positives = 85/277 (30%), Gaps = 35/277 (12%) Query: 69 CAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKK 128 + F +R+VE+G G+ IL + +++ L +VE S ++ + Sbjct: 133 SQVQAKPFDDKIRIVEIGAGQASAAQSILMYFKNYEQSYYANLEYTIVEISPQMCKKALE 192 Query: 129 QLASYGDKINWYTSLADVPLGF-----------TFLVANEFFDSLPIKQFVM---TEHGI 174 +L+ K+ + + F FL+ E D++P + T+ I Sbjct: 193 KLSRDHSKLIERGQIKFINDDFVNFKPQNRDQHYFLLFLEVLDNMPHDRIYKKKNTQEDI 252 Query: 175 RERMIDIDQHDSL-VFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDG 233 E+ ++ D + + + ++ + E +++ + + Sbjct: 253 WEKQAQVEFTDEFGNEGLKEVQTDIKDPLIQEFIQILKTVPTLDHIEEEKNLQGGIIQNV 312 Query: 234 GTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYIN 293 + L+ +K P N AD F L + Sbjct: 313 IRWLFKQPKDNMFIPTFCLKVLKHINSNIPNHNVIFAD------FDMLKTAESAKMGINA 366 Query: 294 GLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKR 330 + +Q L K +KD V+R Sbjct: 367 PIVSQK--------------LSKSHEKKDFSTYLVQR 389 >gi|255070585|ref|XP_002507374.1| predicted protein [Micromonas sp. RCC299] gi|226522649|gb|ACO68632.1| predicted protein [Micromonas sp. RCC299] Length = 429 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 48/274 (17%), Positives = 87/274 (31%), Gaps = 58/274 (21%) Query: 15 KNGQMTVDQYFALCVADPEFGYYSTCN--PFGAVG------------DF----------- 49 +G + + + + + + GY++ + P GA+ D+ Sbjct: 5 TDGWL-LRDFLHQALYNRDDGYFANASSPPVGAMSRPIPFQALLGQEDYARTLARRYDAL 63 Query: 50 ----VTAPEI-SQIFGEMLAIFLICAWEQH---------------GFPSCVRLVELGPGR 89 +T EI + +A ++ A + G +R+ ELG G Sbjct: 64 ASQWLTPVEIFKPHYARAVARHILRAHKAELDAPLDDDERRRRRVGKTPPLRIYELGGGT 123 Query: 90 GIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQL-ASYGDKINWYTSLAD--- 145 G IL I P F+ VE S L +Q++ + GD ++ D Sbjct: 124 GTCAAGILAHIRDDDPTVFASTEYVGVEISPALAKMQRETVRRELGDDVHIRRDALDAGR 183 Query: 146 ---VPLGFTFLVANEFFDSLPIKQFVM----TEHGIRERMIDIDQHDSLVFNIGDHEIKS 198 V F+VA E D+LP + ++ T + D+ V E Sbjct: 184 WGPVDDDACFVVALEVLDNLPHDRVMLLRDDTRTTMTRVFARRRGDDNAVDGFEQREEPL 243 Query: 199 NFLTCSDYFLGAIFENSPCRDREM-QSISDRLAC 231 S E+ ++++DR Sbjct: 244 RDPLISRALEAIGDEDESFSFGFTVKAMADRWIR 277 >gi|195329888|ref|XP_002031642.1| GM26109 [Drosophila sechellia] gi|194120585|gb|EDW42628.1| GM26109 [Drosophila sechellia] Length = 123 Score = 42.8 bits (99), Expect = 0.090, Method: Composition-based stats. Identities = 25/73 (34%), Positives = 40/73 (54%), Gaps = 1/73 (1%) Query: 61 EMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLS-IYMVETS 119 +++ I+L+ W + G PS +LVELGPGRG + D+L+V+ K K + + + S Sbjct: 2 KLVGIWLVSEWRKMGCPSPFQLVELGPGRGTLARDVLKVLTKFKQKYRTSDPSQSDYQRS 61 Query: 120 ERLTLIQKKQLAS 132 R L+ QL Sbjct: 62 SRRHLLIGAQLPE 74 >gi|56784395|dbj|BAD82434.1| hypothetical protein [Oryza sativa Japonica Group] Length = 245 Score = 42.4 bits (98), Expect = 0.096, Method: Composition-based stats. Identities = 9/31 (29%), Positives = 15/31 (48%), Gaps = 1/31 (3%) Query: 15 KNGQMTVDQYFALCVADPEFGYYSTCN-PFG 44 +N + V + + DP GY+S + P G Sbjct: 35 ENKPILVRDFVRSALYDPNHGYFSKRSGPVG 65 >gi|331236750|ref|XP_003331033.1| hypothetical protein PGTG_12996 [Puccinia graminis f. sp. tritici CRL 75-36-700-3] gi|309310023|gb|EFP86614.1| hypothetical protein PGTG_12996 [Puccinia graminis f. sp. tritici CRL 75-36-700-3] Length = 521 Score = 42.0 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 28/241 (11%), Positives = 70/241 (29%), Gaps = 26/241 (10%) Query: 57 QIFGEMLAIFLIC-------AWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFS 109 + +A +++ ++ G + E+G G G + + IL I + P+ + Sbjct: 160 PWYAWSMAHYIVEKHLEKRDQLQEEGKELK--IYEIGAGNGTLCVGILDYIKEHHPNLYE 217 Query: 110 VLSIYMVETSERLTLIQKKQL--ASYGDKINWYTS---------LADVPLGFTFLVANEF 158 +E S RL Q++++ + + + + +++ E Sbjct: 218 KTRYTTIELSRRLADRQREKIDRSGHQGRARVINRSILGLTSSPELEPSHEPCWVLGMEV 277 Query: 159 FDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCR 218 D+L + F+ I T ++ L + + Sbjct: 278 LDNLARDVIRRDRVTGTPLQSIVITDQLGDFHERFIPI----TTQNNPNLLEYLDIFDQQ 333 Query: 219 DREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPG--QADLSSHV 276 + +S + Q + T + + + N +D S Sbjct: 334 QTKTTQLSLFFEKLWAKYLPFRSNLTQHQFIPTNYLILLQSIFNFFPNHRLILSDFSHLP 393 Query: 277 D 277 + Sbjct: 394 N 394 >gi|325518324|gb|EGC98056.1| hypothetical protein B1M_43540 [Burkholderia sp. TJI49] Length = 86 Score = 42.0 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 20/82 (24%), Positives = 37/82 (45%), Gaps = 6/82 (7%) Query: 116 VETSERLTLIQKKQL----ASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTE 171 V+ S L Q+ + + K+ W +L + G +V NE D++P++ F + Sbjct: 1 VDLSGELRERQRDTIASAAPAQVAKVRWLDALPERFDG--VVVGNEVLDAMPVRLFAKGD 58 Query: 172 HGIRERMIDIDQHDSLVFNIGD 193 RER + +D + VF+ Sbjct: 59 GAWRERGVAVDARQAFVFDDRP 80 >gi|301610113|ref|XP_002934596.1| PREDICTED: myosin-IXa [Xenopus (Silurana) tropicalis] Length = 2551 Score = 38.9 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 27/251 (10%), Positives = 63/251 (25%), Gaps = 11/251 (4%) Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 L ++ R P + + +F + + Sbjct: 1947 LGALFEQILDKTMKQQHPRSWSESP-----LRVWINTFKVFLDEFVTEYKPLNYTPGKIQ 2001 Query: 123 TLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDID 182 +KK+ D + + + E+ SL + + ++ Sbjct: 2002 KTERKKRRKKDSDIVEEHNGHIFKITQYNIPTYCEYCSSL----IWIMDRAAVCKLCRYA 2057 Query: 183 QHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACD-GGTAIVIDY 241 H I K N S G +R + + ++L + + Sbjct: 2058 CHKKCCSLINVACNKKNDSELSTRQFGVDLSRLTNEERLVPVLLEKLISYIEMHGLYTEG 2117 Query: 242 GYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKF 301 Y + + ++ ++ N D + HV + +F Sbjct: 2118 IYRKPGSTNKIRELRQSLDTDI-ENVNLDDYNIHVIASVFKQWLRELPNPLMTFELYEEF 2176 Query: 302 LEGLGIWQRAF 312 L +G+ +R Sbjct: 2177 LRSMGLGERKE 2187 >gi|110288644|gb|ABG65920.1| NB-ARC domain containing protein, expressed [Oryza sativa Japonica Group] gi|125583083|gb|EAZ24014.1| hypothetical protein OsJ_07739 [Oryza sativa Japonica Group] Length = 923 Score = 38.9 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 29/279 (10%), Positives = 80/279 (28%), Gaps = 30/279 (10%) Query: 101 CKLKPDFFSVLSIYMVE-TSERLTLIQKKQLASYGDKINWYTSLADVPLGF---TFLVAN 156 + D +VE L + + +N + + D G + + Sbjct: 255 KEFPKDVDVTDYRSLVETIRLYLEKKRYVLVLDDVWSVNVWFDIKDAFSGGKHGRIIFTS 314 Query: 157 EFFDS--LPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFEN 214 ++ L + + ++ + + + F+ Sbjct: 315 RIYEVALLAPESQKINLQPLQNHYAWDLFCKEAFWKSENRSCPVELHPWAQRFVDKCKGL 374 Query: 215 SPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSS 274 + +S + + + Y + T + ++ ++ DL Sbjct: 375 PIAIVCIGRLLSFK----SANLLEWENVYRNLEMQFTNNYI---LDMNIILKVSLEDLPH 427 Query: 275 HVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVST 334 + + + + ++ Q K+L L I + + +++ D + L++ Sbjct: 428 N-----MKNCFLYCSMFPENYVMQRKWLVRLWIAEGFIEESEHKTLEEVAEDYLTELINR 482 Query: 335 ------------SADKKSMGELFKILVVSHEKVELMPFV 361 D M ++F++L +S + E FV Sbjct: 483 CLLVEVKRNESGYIDDFQMHDIFRVLALSKAREENFCFV 521 >gi|324325110|gb|ADY20370.1| putative phage tail tape measure protein [Bacillus thuringiensis serovar finitimus YBT-020] Length = 1083 Score = 38.2 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 23/182 (12%), Positives = 52/182 (28%), Gaps = 9/182 (4%) Query: 55 ISQIFGEMLAI---FLICAWEQHGFPSCVRLVELGPGRG---IMMLDILRVICKLKPDFF 108 I + F +M+A ++ W L E G + + + ++ Sbjct: 542 ICKWFSDMIAAIGEWIAPWWNPIKEWLVKTLFEWTIGLRDWWNAISEWFTTTKEDFVNWL 601 Query: 109 SVLSIYMVE-TSERLTLIQKKQLASYGDKINWYTSLADVPLGF--TFLVANEFFDSLPIK 165 S +V+ S + + + W+ L + L + A E ++ I Sbjct: 602 SGWWSAIVDWFSVSTEEWKTRLGDWHQAIAEWWDKLPEDTLKWLENISKALEEWNKKQID 661 Query: 166 QFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSI 225 Q V E + + ++ N + I+ ++ + Sbjct: 662 QLVEDFKKWWEVIDNWFTETKEKWSEKLETWSENIQKWWEEMPDKIYSWFTGWWDKISTW 721 Query: 226 SD 227 D Sbjct: 722 YD 723 >gi|207110467|ref|ZP_03244629.1| hypothetical protein HpylH_15475 [Helicobacter pylori HPKX_438_CA4C1] Length = 85 Score = 38.2 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 13/76 (17%), Positives = 24/76 (31%), Gaps = 2/76 (2%) Query: 278 FQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVSTSAD 337 F + + + TQ L +G+ K + L ++ K + Sbjct: 4 FLLVRFLFEKNHAKFSFFKTQANALLDMGLMGLLEIFSKSVGYERYLKEAAK--IKPLIS 61 Query: 338 KKSMGELFKILVVSHE 353 +GE FK L + Sbjct: 62 PGGLGERFKALEFVKK 77 >gi|116178944|ref|XP_001219321.1| hypothetical protein CHGG_00100 [Chaetomium globosum CBS 148.51] gi|88184397|gb|EAQ91865.1| hypothetical protein CHGG_00100 [Chaetomium globosum CBS 148.51] Length = 557 Score = 37.8 bits (86), Expect = 2.5, Method: Composition-based stats. Identities = 47/313 (15%), Positives = 91/313 (29%), Gaps = 41/313 (13%) Query: 58 IFGEMLAIFLICAWE--QHGFPSCVRLVELGP------GRGIMMLDILRVICKLKPDFFS 109 +FG L + ++C + + P+ R +G G + ++ ++ + D + Sbjct: 96 LFGIALVMTIVCLNKHGKLHLPAEKRFYPIGRRWQWYWGSFVCATAMISLLVSVDVDRYF 155 Query: 110 VLSIYMVETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVM 169 + + ++ TS L+Q +A + + + S + +F D P Sbjct: 156 LPELPVILTSFFWYLMQIGAVALVWEAVRHWGSWMER----------QFIDPDPFSLRDD 205 Query: 170 TEHGIRERMIDIDQHDSLVFNIGDH-EIKSNFLTCSDYFLGAIFENSPCRDREMQSISDR 228 E + + + L N + Y I E P + Sbjct: 206 DRRSRVEFYLPLLAYLFLWLNFFMIVPRNWTAIQHQRYPQQTIDEAEPTATDARFKAAAF 265 Query: 229 LACDGGTAIVIDYGYL----QSRVGDTLQAVKGHTYVSPLVNPGQADLS----------- 273 L + R + G +PL L+ Sbjct: 266 LLAVCWLITAFSLRHSIKYYCPRNRGIFNRIIGFVRYTPLRFALIMPLAAAVIAYQGLVA 325 Query: 274 SHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSLMKQTARKDILLDSVKRLVS 333 H D+ L + +Y G T L + +L + +R V Sbjct: 326 FHFDYSPLKVGGLNAAIYAGGYTP---AL----LIVWIQALFGFFNPNEDRELQRQRRVR 378 Query: 334 TSADKKSMGELFK 346 T A + MG +FK Sbjct: 379 TQALNREMGLVFK 391 >gi|156349174|ref|XP_001621948.1| hypothetical protein NEMVEDRAFT_v1g143109 [Nematostella vectensis] gi|156208312|gb|EDO29848.1| predicted protein [Nematostella vectensis] Length = 388 Score = 37.8 bits (86), Expect = 2.6, Method: Composition-based stats. Identities = 15/230 (6%), Positives = 40/230 (17%), Gaps = 18/230 (7%) Query: 64 AIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKP-----------DFFSVLS 112 A + W + + +C R G ++++ Sbjct: 32 ACWRRNCWTRGTWWACWRWNSWTRGTWWACWRWNSWTRGTWWACWRRNSWTRGNWWACWR 91 Query: 113 IYMVETSERLTLIQKKQLASYGDKINWYTS------LADVPLGFTFLVANEFFDSLPIKQ 166 S + W + G ++ + Sbjct: 92 WNSWTRSTWWACWRWNCWTRGTWWACWRRNSWTRSTWWACWRGNSWTRGTWWA-CWRRNS 150 Query: 167 FVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSIS 226 + R + + + + C C + Sbjct: 151 WTRGTWWACWRWNSWTRGTWWACWKWNSWTRGTWWACWRRNSWTRGTWWACWRWNSWTRG 210 Query: 227 DRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHV 276 AC + + R + T + + + Sbjct: 211 TWWACWKWNSWTRGTWWACWRWNSWTRGTWRFTGTGCWYVRQGSRCTWRI 260 >gi|326926348|ref|XP_003209364.1| PREDICTED: myosin-IXa-like [Meleagris gallopavo] Length = 2452 Score = 37.4 bits (85), Expect = 3.5, Method: Composition-based stats. Identities = 18/176 (10%), Positives = 47/176 (26%), Gaps = 4/176 (2%) Query: 135 DKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDH 194 D + + ++ E+ SL ++M + + + Sbjct: 1969 DVVEEHNGHIFKATQYSIPTYCEYCSSL---IWIMDRASVCKLCKYACHKKCCLKTTTKC 2025 Query: 195 EIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQA 254 K + S F + + + + + + Y +S + ++ Sbjct: 2026 SKKYDPELSSRQFGVELARLTSEERAVPVLVEKLINYIEMHGLYTEGIYRKSGSTNKIKE 2085 Query: 255 VKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQR 310 ++ N D + HV + +FL +G+ +R Sbjct: 2086 LRQGLDTDI-DNVNLDDYNIHVIASVFKQWLRDLPNPLMTFELYEEFLRAMGLQER 2140 >gi|118095489|ref|XP_413711.2| PREDICTED: similar to myosin-IXa [Gallus gallus] Length = 2547 Score = 37.4 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 18/176 (10%), Positives = 47/176 (26%), Gaps = 4/176 (2%) Query: 135 DKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDH 194 D + + ++ E+ SL ++M + + + Sbjct: 1995 DVVEEHNGHIFKATQYSIPTYCEYCSSL---IWIMDRASVCKLCKYACHKKCCLKTTTKC 2051 Query: 195 EIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQA 254 K + S F + + + + + + Y +S + ++ Sbjct: 2052 SKKYDPELSSRQFGVELSRLTSEDRAVPVLVEKLINYIEMHGLYTEGIYRKSGSTNKIKE 2111 Query: 255 VKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQR 310 ++ N D + HV + +FL +G+ +R Sbjct: 2112 LRQGLDTDI-DNVNLDDYNIHVIASVFKQWLRDLPNPLMTFELYEEFLRAMGLQER 2166 >gi|85711390|ref|ZP_01042449.1| putative type I site-specific deoxyribonuclease LldI chain protein [Idiomarina baltica OS145] gi|85694891|gb|EAQ32830.1| putative type I site-specific deoxyribonuclease LldI chain protein [Idiomarina baltica OS145] Length = 511 Score = 37.0 bits (84), Expect = 4.9, Method: Composition-based stats. Identities = 25/171 (14%), Positives = 48/171 (28%), Gaps = 22/171 (12%) Query: 45 AVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGI-MMLDILRVICKL 103 G+F T PEIS + E+L + + G G +M +VI Sbjct: 183 KAGEFYTPPEISDLIAELL-----------DPQPGDSICDPACGSGSLLMKCGRKVIANH 231 Query: 104 KPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLADVP--------LGFTFLVA 155 +++ + ++ L + KI W ++ + + F + A Sbjct: 232 DSKEYALFGQEAIGSTWSLAKMNMFLHGEDNHKIEWGDTIRNPKLLDKNGDLMLFDIVTA 291 Query: 156 NE--FFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCS 204 N D + R + F + E + Sbjct: 292 NPPFSLDKWGHDDAENDKFSRFRRGVPPKTKGDYAFILHMIETLKPASSSK 342 >gi|168278385|dbj|BAG11072.1| transcriptional repressor NF-X1 [synthetic construct] Length = 832 Score = 36.6 bits (83), Expect = 5.8, Method: Composition-based stats. Identities = 31/228 (13%), Positives = 60/228 (26%), Gaps = 8/228 (3%) Query: 75 GFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYG 134 G +L+ELG +D + K+ S+ + T E+L S Sbjct: 597 GQTPLSQLLELGSSSRKTCMDPVPSCGKVCGKPLPCGSLDFIHTCEKLCHEGDCGPCSRT 656 Query: 135 DKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFN--IG 192 I+ S + +E + K+ R + +I D I Sbjct: 657 SVISCRCSFR-TKELPCTSLKSEDATFMCDKRCNKKRLCGRHKCNEICCVDKEHKCPLIC 715 Query: 193 DHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTL 252 +++ C + C+ S + G + I T Sbjct: 716 GRKLRCGLHRCEEP-----CHRGNCQTCWQASFDELTCHCGASVIYPPVPCGTRPPECTQ 770 Query: 253 QAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 + H P+ + ++ + +TQ K Sbjct: 771 TCARVHECDHPVYHSCHSEEKCPPCTFLTQKWCMGKHESHYWASTQKK 818 >gi|55733580|emb|CAH93467.1| hypothetical protein [Pongo abelii] Length = 816 Score = 36.6 bits (83), Expect = 5.8, Method: Composition-based stats. Identities = 31/228 (13%), Positives = 60/228 (26%), Gaps = 8/228 (3%) Query: 75 GFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYG 134 G +L+ELG +D + K+ S+ + T E+L S Sbjct: 581 GQTPLSQLLELGSSSRKTCMDPVPSCGKVCGKPLPCGSLDFIHTCEKLCHEGDCGPCSRT 640 Query: 135 DKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFN--IG 192 I+ S + +E + K+ R + +I D I Sbjct: 641 SVISCRCSFR-TKELPCTSLKSEDATFMCDKRCNKKRLCGRHKCNEICCVDKEHKCPLIC 699 Query: 193 DHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTL 252 +++ C + C+ S + G + I T Sbjct: 700 GRKLRCGLHRCEEP-----CHRGNCQTCWQASFDELTCHCGASVIYPPVPCGTRPPECTQ 754 Query: 253 QAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGK 300 + H P+ + ++ + +TQ K Sbjct: 755 TCARVHECDHPVYHSCHSEEKCPPCTFLTQKWCMGKHESHYWASTQKK 802 >gi|297296805|ref|XP_001089813.2| PREDICTED: myosin-IXa [Macaca mulatta] Length = 2267 Score = 36.6 bits (83), Expect = 5.9, Method: Composition-based stats. Identities = 23/208 (11%), Positives = 61/208 (29%), Gaps = 12/208 (5%) Query: 135 DKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDH 194 D + + ++ E+ SL ++M + + + Sbjct: 1711 DLVEEHNGHIFKATQYSIPTYCEYCSSL---IWIMDRASVCKLCKYACHKKCCLKTTAKC 1767 Query: 195 EIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQA 254 K + S F + + + + + + Y +S + ++ Sbjct: 1768 SKKYDPELSSRQFGVELSRLTSEDRTVPLVVEKLINYIEMHGLYTEGIYRKSGSTNKIKE 1827 Query: 255 VKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSL 314 ++ N D + HV + +FL +G+ +R ++ Sbjct: 1828 LRQGLDTDA-ENVNLDDYNIHVIASVFKQWLRDLPNPLMTFELYEEFLRAMGLQERKETI 1886 Query: 315 ------MKQTARKDILLDSVKRLVSTSA 336 + Q +R L++++RL+ Sbjct: 1887 RGVYSVIDQLSRT--HLNTLERLIFHLV 1912 >gi|327285360|ref|XP_003227402.1| PREDICTED: myosin-IXa-like [Anolis carolinensis] Length = 2574 Score = 36.6 bits (83), Expect = 6.3, Method: Composition-based stats. Identities = 26/249 (10%), Positives = 68/249 (27%), Gaps = 12/249 (4%) Query: 94 LDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFL 153 + ++ + + +KK+ D + + ++ Sbjct: 1970 RVWVNTFKVFLDEYMTEYMPLDYTAPKMTKTERKKRRKKETDVVEEHNGHIFKATQYSIP 2029 Query: 154 VANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFE 213 EF SL ++M + + + K + S F + Sbjct: 2030 TYCEFCSSL---IWIMDRASVCKLCKYACHRKCCLKTTTKCSKKYDPELSSRQFGVELSR 2086 Query: 214 NSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLS 273 + A+ + Y +S + ++ ++ + D + Sbjct: 2087 LTSEERTVPVLFEKLTNYIEMHALYTEGIYRKSGSTNKIKELRQGLDTDI-ESINLDDYN 2145 Query: 274 SHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSL------MKQTARKDILLDS 327 HV + +FL +G+ +R ++ + Q +R L + Sbjct: 2146 IHVIASVFKQWLRDLPNPLMTFELYDEFLRAMGLQERKEAIRGVYSVIDQLSRT--HLHT 2203 Query: 328 VKRLVSTSA 336 ++RL+ Sbjct: 2204 LERLIFHLV 2212 >gi|149041872|gb|EDL95713.1| myosin IXA, isoform CRA_a [Rattus norvegicus] Length = 2626 Score = 36.6 bits (83), Expect = 6.5, Method: Composition-based stats. Identities = 27/276 (9%), Positives = 74/276 (26%), Gaps = 18/276 (6%) Query: 67 LICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQ 126 E+ E + + ++ + + L + Sbjct: 1998 FEQILEKTMRFEQRDWNE------SPVRVWVNTFKVFLDEYMNEFKTLDSTAPKVLKTER 2051 Query: 127 KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDS 186 KK+ D + + ++ E+ SL ++M + + Sbjct: 2052 KKRRKKETDLVEEHNGHMFKATQYSIPTYCEYCSSL---IWIMDRASVCKLCKYACHKKC 2108 Query: 187 LVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS 246 + K + S F + + + + + + Y +S Sbjct: 2109 CLKTTAKCSKKYDPELSSRQFGVELSRLTSEDRAVPLVVEKLINYIEMHGLYTEGIYRKS 2168 Query: 247 RVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 + ++ ++ + D + HV + +FL +G Sbjct: 2169 GSTNKIKELRQGLDTDA-ESVNLDDYNIHVIASVFKQWLRDLPNPLMTFELYEEFLRAMG 2227 Query: 307 IWQRAFSL------MKQTARKDILLDSVKRLVSTSA 336 + +R ++ + Q +R L +++RL+ Sbjct: 2228 LQERKETIRGVYSVIDQLSRT--HLSTLERLIFHLV 2261 >gi|19705443|ref|NP_599162.1| myosin-IXa [Rattus norvegicus] gi|81872884|sp|Q9Z1N3|MYO9A_RAT RecName: Full=Myosin-IXa; AltName: Full=Myr 7; AltName: Full=Unconventional myosin-9a gi|3955026|emb|CAA04946.1| myosin-RhoGAP protein, Myr 7 [Rattus norvegicus] Length = 2626 Score = 36.6 bits (83), Expect = 6.6, Method: Composition-based stats. Identities = 27/276 (9%), Positives = 74/276 (26%), Gaps = 18/276 (6%) Query: 67 LICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQ 126 E+ E + + ++ + + L + Sbjct: 1998 FEQILEKTMRFEQRDWNE------SPVRVWVNTFKVFLDEYMNEFKTLDSTAPKVLKTER 2051 Query: 127 KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDS 186 KK+ D + + ++ E+ SL ++M + + Sbjct: 2052 KKRRKKETDLVEEHNGHMFKATQYSIPTYCEYCSSL---IWIMDRASVCKLCKYACHKKC 2108 Query: 187 LVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS 246 + K + S F + + + + + + Y +S Sbjct: 2109 CLKTTAKCSKKYDPELSSRQFGVELSRLTSEDRAVPLVVEKLINYIEMHGLYTEGIYRKS 2168 Query: 247 RVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 + ++ ++ + D + HV + +FL +G Sbjct: 2169 GSTNKIKELRQGLDTDA-ESVNLDDYNIHVIASVFKQWLRDLPNPLMTFELYEEFLRAMG 2227 Query: 307 IWQRAFSL------MKQTARKDILLDSVKRLVSTSA 336 + +R ++ + Q +R L +++RL+ Sbjct: 2228 LQERKETIRGVYSVIDQLSRT--HLSTLERLIFHLV 2261 >gi|229080887|ref|ZP_04213402.1| Phage tail tape measure protein, TP901 [Bacillus cereus Rock4-2] gi|228702383|gb|EEL54854.1| Phage tail tape measure protein, TP901 [Bacillus cereus Rock4-2] Length = 1126 Score = 36.6 bits (83), Expect = 6.9, Method: Composition-based stats. Identities = 27/209 (12%), Positives = 57/209 (27%), Gaps = 32/209 (15%) Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGR---GIMMLDILRVICKLKPDFFSVLSIYMV--- 116 + +L W + E G GI + + + ++ S ++ Sbjct: 548 IGNWLSYWWNSISTWITSKASEWGFQLLAWGIAIKNWFISLPGNIAEWISNWWNTILNWL 607 Query: 117 -------------------ETSERLTLIQKKQLASYGDKINWYTSLADVPL----GFTFL 153 E L I ++L + + I+ + Sbjct: 608 VEKQTAWSLQLSLWGTAIQEWFSSLPEIISQKLTEWWNAISTWFESTKESWKTKLDEWNT 667 Query: 154 VANEFFDSLP--IKQFVMTEHGIRERMIDIDQHD-SLVFNIGDHEIKSNFLTCSDYFLGA 210 E+F+ LP I +++ I E+ + FN I+ F + + + Sbjct: 668 TIGEWFEKLPGNIYNWLLNVSQILEQWNNEQIQKIIDDFNTWWASIEEWFNSTKENWTTK 727 Query: 211 IFENSPCRDREMQSISDRLACDGGTAIVI 239 + E +SI R+ Sbjct: 728 LNEWGAIISNWWESIPSRITEWFNGWWNP 756 >gi|149041873|gb|EDL95714.1| myosin IXA, isoform CRA_b [Rattus norvegicus] Length = 2540 Score = 36.2 bits (82), Expect = 7.0, Method: Composition-based stats. Identities = 27/276 (9%), Positives = 74/276 (26%), Gaps = 18/276 (6%) Query: 67 LICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQ 126 E+ E + + ++ + + L + Sbjct: 1930 FEQILEKTMRFEQRDWNE------SPVRVWVNTFKVFLDEYMNEFKTLDSTAPKVLKTER 1983 Query: 127 KKQLASYGDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDS 186 KK+ D + + ++ E+ SL ++M + + Sbjct: 1984 KKRRKKETDLVEEHNGHMFKATQYSIPTYCEYCSSL---IWIMDRASVCKLCKYACHKKC 2040 Query: 187 LVFNIGDHEIKSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQS 246 + K + S F + + + + + + Y +S Sbjct: 2041 CLKTTAKCSKKYDPELSSRQFGVELSRLTSEDRAVPLVVEKLINYIEMHGLYTEGIYRKS 2100 Query: 247 RVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLG 306 + ++ ++ + D + HV + +FL +G Sbjct: 2101 GSTNKIKELRQGLDTDA-ESVNLDDYNIHVIASVFKQWLRDLPNPLMTFELYEEFLRAMG 2159 Query: 307 IWQRAFSL------MKQTARKDILLDSVKRLVSTSA 336 + +R ++ + Q +R L +++RL+ Sbjct: 2160 LQERKETIRGVYSVIDQLSRT--HLSTLERLIFHLV 2193 >gi|119902006|ref|XP_599652.3| PREDICTED: myosin IXA, partial [Bos taurus] Length = 555 Score = 36.2 bits (82), Expect = 7.3, Method: Composition-based stats. Identities = 22/206 (10%), Positives = 60/206 (29%), Gaps = 12/206 (5%) Query: 137 INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEI 196 + + ++ E+ SL ++M + + + Sbjct: 1 VEEHNGHIFKATQYSIPTYCEYCSSL---IWIMDRASVCKLCKYACHKKCCLKTTAKCSK 57 Query: 197 KSNFLTCSDYFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVK 256 K + S F + + + + + + Y +S + ++ ++ Sbjct: 58 KYDPELSSRQFGVELSRLTSEDRTVPLVVEKLINYIEMHGLYTEGIYRKSGSTNKIKELR 117 Query: 257 GHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSL-- 314 N D + HV + +FL +G+ +R ++ Sbjct: 118 QGLDTDA-ENVNLDDYNIHVIASVFKQWLRDLPNPLMTFELYEEFLRAMGLQERKETIRG 176 Query: 315 ----MKQTARKDILLDSVKRLVSTSA 336 + Q +R L++++RL+ Sbjct: 177 VYSVIDQLSRT--HLNTLERLIFHLV 200 >gi|26324820|dbj|BAC26164.1| unnamed protein product [Mus musculus] Length = 692 Score = 36.2 bits (82), Expect = 7.4, Method: Composition-based stats. Identities = 25/249 (10%), Positives = 71/249 (28%), Gaps = 12/249 (4%) Query: 94 LDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFL 153 + ++ + + L +KK+ D + + ++ Sbjct: 103 RVWVNTFKVFLDEYMNEFKTLDSTAPKVLKTERKKRRKKETDLVEEHNGHIFKATQYSIP 162 Query: 154 VANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFE 213 E+ SL ++M + + + K + S F + Sbjct: 163 TYCEYCSSL---IWIMDRASVCKLCKYACHKKCCLKTTAKCSKKYDPELSSRQFGVELSR 219 Query: 214 NSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLS 273 + + + + + Y +S + ++ ++ + D + Sbjct: 220 LTSEDRAVPLVVEKLINYIEMHGLYTEGIYRKSGSTNKIKELRQGLDTDA-ESVNLDDYN 278 Query: 274 SHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSL------MKQTARKDILLDS 327 HV + +FL +G+ +R ++ + Q +R L++ Sbjct: 279 IHVIASVFKQWLRDLPNPLMTFELYEEFLRAMGLQERKETIRGVYSVIDQLSRT--HLNT 336 Query: 328 VKRLVSTSA 336 ++RL+ Sbjct: 337 LERLIFHLV 345 >gi|26325770|dbj|BAC26639.1| unnamed protein product [Mus musculus] Length = 626 Score = 36.2 bits (82), Expect = 7.5, Method: Composition-based stats. Identities = 25/249 (10%), Positives = 71/249 (28%), Gaps = 12/249 (4%) Query: 94 LDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFL 153 + ++ + + L +KK+ D + + ++ Sbjct: 37 RVWVNTFKVFLDEYMNEFKTLDSTAPKVLKTERKKRRKKETDLVEEHNGHIFKATQYSIP 96 Query: 154 VANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFE 213 E+ SL ++M + + + K + S F + Sbjct: 97 TYCEYCSSL---IWIMDRASVCKLCKYACHKKCCLKTTAKCSKKYDPELSSRQFGVELSR 153 Query: 214 NSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLS 273 + + + + + Y +S + ++ ++ + D + Sbjct: 154 LTSEDRAVPLVVEKLINYIEMHGLYTEGIYRKSGSTNKIKELRQGLDTDA-ESVNLDDYN 212 Query: 274 SHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSL------MKQTARKDILLDS 327 HV + +FL +G+ +R ++ + Q +R L++ Sbjct: 213 IHVIASVFKQWLRDLPNPLMTFELYEEFLRAMGLQERKETIRGVYSVIDQLSRT--HLNT 270 Query: 328 VKRLVSTSA 336 ++RL+ Sbjct: 271 LERLIFHLV 279 >gi|241896922|ref|NP_766606.2| myosin-IXa [Mus musculus] Length = 2631 Score = 36.2 bits (82), Expect = 7.7, Method: Composition-based stats. Identities = 25/249 (10%), Positives = 71/249 (28%), Gaps = 12/249 (4%) Query: 94 LDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFL 153 + ++ + + L +KK+ D + + ++ Sbjct: 2024 RVWVNTFKVFLDEYMNEFKTLDSTAPKVLKTERKKRRKKETDLVEEHNGHIFKATQYSIP 2083 Query: 154 VANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFE 213 E+ SL ++M + + + K + S F + Sbjct: 2084 TYCEYCSSL---IWIMDRASVCKLCKYACHKKCCLKTTAKCSKKYDPELSSRQFGVELSR 2140 Query: 214 NSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLS 273 + + + + + Y +S + ++ ++ + D + Sbjct: 2141 LTSEDRAVPLVVEKLINYIEMHGLYTEGIYRKSGSTNKIKELRQGLDTDA-ESVNLDDYN 2199 Query: 274 SHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSL------MKQTARKDILLDS 327 HV + +FL +G+ +R ++ + Q +R L++ Sbjct: 2200 IHVIASVFKQWLRDLPNPLMTFELYEEFLRAMGLQERKETIRGVYSVIDQLSRT--HLNT 2257 Query: 328 VKRLVSTSA 336 ++RL+ Sbjct: 2258 LERLIFHLV 2266 >gi|148694038|gb|EDL25985.1| mCG9271 [Mus musculus] Length = 2546 Score = 36.2 bits (82), Expect = 8.2, Method: Composition-based stats. Identities = 25/249 (10%), Positives = 71/249 (28%), Gaps = 12/249 (4%) Query: 94 LDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFL 153 + ++ + + L +KK+ D + + ++ Sbjct: 1957 RVWVNTFKVFLDEYMNEFKTLDSTAPKVLKTERKKRRKKETDLVEEHNGHIFKATQYSIP 2016 Query: 154 VANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFE 213 E+ SL ++M + + + K + S F + Sbjct: 2017 TYCEYCSSL---IWIMDRASVCKLCKYACHKKCCLKTTAKCSKKYDPELSSRQFGVELSR 2073 Query: 214 NSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLS 273 + + + + + Y +S + ++ ++ + D + Sbjct: 2074 LTSEDRAVPLVVEKLINYIEMHGLYTEGIYRKSGSTNKIKELRQGLDTDA-ESVNLDDYN 2132 Query: 274 SHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSL------MKQTARKDILLDS 327 HV + +FL +G+ +R ++ + Q +R L++ Sbjct: 2133 IHVIASVFKQWLRDLPNPLMTFELYEEFLRAMGLQERKETIRGVYSVIDQLSRT--HLNT 2190 Query: 328 VKRLVSTSA 336 ++RL+ Sbjct: 2191 LERLIFHLV 2199 >gi|205829208|sp|Q8C170|MYO9A_MOUSE RecName: Full=Myosin-IXa; AltName: Full=Unconventional myosin-9a Length = 2542 Score = 36.2 bits (82), Expect = 8.7, Method: Composition-based stats. Identities = 25/249 (10%), Positives = 71/249 (28%), Gaps = 12/249 (4%) Query: 94 LDILRVICKLKPDFFSVLSIYMVETSERLTLIQKKQLASYGDKINWYTSLADVPLGFTFL 153 + ++ + + L +KK+ D + + ++ Sbjct: 1953 RVWVNTFKVFLDEYMNEFKTLDSTAPKVLKTERKKRRKKETDLVEEHNGHIFKATQYSIP 2012 Query: 154 VANEFFDSLPIKQFVMTEHGIRERMIDIDQHDSLVFNIGDHEIKSNFLTCSDYFLGAIFE 213 E+ SL ++M + + + K + S F + Sbjct: 2013 TYCEYCSSL---IWIMDRASVCKLCKYACHKKCCLKTTAKCSKKYDPELSSRQFGVELSR 2069 Query: 214 NSPCRDREMQSISDRLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLS 273 + + + + + Y +S + ++ ++ + D + Sbjct: 2070 LTSEDRAVPLVVEKLINYIEMHGLYTEGIYRKSGSTNKIKELRQGLDTDA-ESVNLDDYN 2128 Query: 274 SHVDFQRLSSIAILYKLYINGLTTQGKFLEGLGIWQRAFSL------MKQTARKDILLDS 327 HV + +FL +G+ +R ++ + Q +R L++ Sbjct: 2129 IHVIASVFKQWLRDLPNPLMTFELYEEFLRAMGLQERKETIRGVYSVIDQLSRT--HLNT 2186 Query: 328 VKRLVSTSA 336 ++RL+ Sbjct: 2187 LERLIFHLV 2195 >gi|218188331|gb|EEC70758.1| hypothetical protein OsI_02174 [Oryza sativa Indica Group] Length = 629 Score = 35.9 bits (81), Expect = 9.3, Method: Composition-based stats. Identities = 17/127 (13%), Positives = 40/127 (31%), Gaps = 16/127 (12%) Query: 251 TLQAVKGHTYVSPLVNPGQADLSSHVDFQRL----SSIAILYKLYINGLTTQGKFLEGLG 306 + ++ V + L + + ++ Q K+L L Sbjct: 101 WENVYRNLEMQFTNNYILDMNIILKVSLEDLPHNMKNCFLYCSMFPENYVMQRKWLVRLW 160 Query: 307 IWQRAFSLMKQTARKDILLDSVKRLVST------------SADKKSMGELFKILVVSHEK 354 I + + +++ D + L++ D M ++F++L +S + Sbjct: 161 IAEGFIEESEHKTLEEVAEDYLTELINRCLLVEVKRNESGYIDDFQMHDIFRVLALSKAR 220 Query: 355 VELMPFV 361 E FV Sbjct: 221 EENFCFV 227 >gi|284048513|ref|YP_003398852.1| type I restriction-modification system, M subunit [Acidaminococcus fermentans DSM 20731] gi|283952734|gb|ADB47537.1| type I restriction-modification system, M subunit [Acidaminococcus fermentans DSM 20731] Length = 857 Score = 35.9 bits (81), Expect = 9.5, Method: Composition-based stats. Identities = 14/69 (20%), Positives = 28/69 (40%), Gaps = 7/69 (10%) Query: 45 AVGDFVTAPEISQIFGEMLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLK 104 G+F T E+SQ+ E++A +L + + + G G ++++I K Sbjct: 184 KAGEFYTPHEVSQLMSEIIAHYLQ-------GREEISIYDPTSGSGSLLINIGHAAAKYM 236 Query: 105 PDFFSVLSI 113 D + Sbjct: 237 KDANKIRYY 245 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.312 0.130 0.387 Lambda K H 0.267 0.0403 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 2,727,551,548 Number of Sequences: 14124377 Number of extensions: 136709470 Number of successful extensions: 381558 Number of sequences better than 10.0: 1201 Number of HSP's better than 10.0 without gapping: 1123 Number of HSP's successfully gapped in prelim test: 78 Number of HSP's that attempted gapping in prelim test: 376124 Number of HSP's gapped (non-prelim): 1497 length of query: 362 length of database: 4,842,793,630 effective HSP length: 140 effective length of query: 222 effective length of database: 2,865,380,850 effective search space: 636114548700 effective search space used: 636114548700 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.0 bits) S2: 82 (36.2 bits)