BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 003940
         (784 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|359479833|ref|XP_002267103.2| PREDICTED: spermatogenesis-associated protein 20-like [Vitis
           vinifera]
          Length = 819

 Score = 1252 bits (3239), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 595/713 (83%), Positives = 648/713 (90%), Gaps = 1/713 (0%)

Query: 68  RPLAVISHRPIHPYKVVAMAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWG 127
           R L +   R +H  KV+AMAER+  + SHS +K+TNRLAAEHSPYLLQHAHNPVDW+ WG
Sbjct: 43  RTLPLFPRRHVHTLKVLAMAERSMKTASHS-HKYTNRLAAEHSPYLLQHAHNPVDWYPWG 101

Query: 128 EEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDV 187
           EEAF+E+RKRDVPIFLSIGYSTCHWCHVMEVESFE+EGVAKLLNDWFVSIKVDREERPDV
Sbjct: 102 EEAFSESRKRDVPIFLSIGYSTCHWCHVMEVESFENEGVAKLLNDWFVSIKVDREERPDV 161

Query: 188 DKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDK 247
           DKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPP+DKYGRPGFKT+LRKVKDAW+ 
Sbjct: 162 DKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPDDKYGRPGFKTVLRKVKDAWEN 221

Query: 248 KRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAP 307
           KRD+L +SGAFAIEQLSEALSA+ASSNKL D +PQ AL LCAEQL+ +YD  +GGFGSAP
Sbjct: 222 KRDVLVKSGAFAIEQLSEALSATASSNKLADGIPQQALHLCAEQLAGNYDPEYGGFGSAP 281

Query: 308 KFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVD 367
           KFPRPVEIQ+MLYH KKLE++GKSGEA+E  KMV F+LQCMA+GG+HDH+GGGFHRYSVD
Sbjct: 282 KFPRPVEIQLMLYHYKKLEESGKSGEANEVLKMVAFSLQCMARGGVHDHIGGGFHRYSVD 341

Query: 368 ERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAE 427
           E WHVPHFEKMLYDQGQLAN YLD FS+TKDVFYS + RDILDYLRRDMIGP GEIFSAE
Sbjct: 342 ECWHVPHFEKMLYDQGQLANAYLDVFSITKDVFYSCVSRDILDYLRRDMIGPEGEIFSAE 401

Query: 428 DADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNE 487
           DADSAE+E A RKKEGAFY+WTSKEVED++GEHA LFK+HYY+KP+GNCDLSRMSDPHNE
Sbjct: 402 DADSAESEDAARKKEGAFYIWTSKEVEDVIGEHASLFKDHYYIKPSGNCDLSRMSDPHNE 461

Query: 488 FKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNG 547
           FKGKNVLIE N +SA ASKLGMP+EKYL+ILG CRRKLFDVR  RPRPHLDDKVIVSWNG
Sbjct: 462 FKGKNVLIERNCASAMASKLGMPVEKYLDILGTCRRKLFDVRLNRPRPHLDDKVIVSWNG 521

Query: 548 LVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS 607
           L ISSFARASKILKSEAE   F FPVVG D KEYMEVAE AASFIR+ LYDEQT RL+HS
Sbjct: 522 LAISSFARASKILKSEAEGTKFRFPVVGCDPKEYMEVAEKAASFIRKWLYDEQTRRLRHS 581

Query: 608 FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 667
           FRNGPSKAPGFLDDYAFLISGLLD+YEFG  T WLVWAIELQ+TQDELFLD+EGGGYFNT
Sbjct: 582 FRNGPSKAPGFLDDYAFLISGLLDIYEFGGNTNWLVWAIELQDTQDELFLDKEGGGYFNT 641

Query: 668 TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 727
            GEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL S+VAGS  + +R+NAEH LAVFETRL
Sbjct: 642 PGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLTSMVAGSWFERHRRNAEHLLAVFETRL 701

Query: 728 KDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           KDMAMAVPLMCC ADM SVPSRK VVLVGHKSSV+FE+MLAAAHA YD N+TV
Sbjct: 702 KDMAMAVPLMCCGADMFSVPSRKQVVLVGHKSSVEFEDMLAAAHAQYDPNRTV 754


>gi|296086616|emb|CBI32251.3| unnamed protein product [Vitis vinifera]
          Length = 754

 Score = 1233 bits (3189), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 584/688 (84%), Positives = 633/688 (92%), Gaps = 1/688 (0%)

Query: 93  STSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHW 152
           + SHS +K+TNRLAAEHSPYLLQHAHNPVDW+ WGEEAF+E+RKRDVPIFLSIGYSTCHW
Sbjct: 3   TASHS-HKYTNRLAAEHSPYLLQHAHNPVDWYPWGEEAFSESRKRDVPIFLSIGYSTCHW 61

Query: 153 CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 212
           CHVMEVESFE+EGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP
Sbjct: 62  CHVMEVESFENEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 121

Query: 213 DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASAS 272
           DLKPLMGGTYFPP+DKYGRPGFKT+LRKVKDAW+ KRD+L +SGAFAIEQLSEALSA+AS
Sbjct: 122 DLKPLMGGTYFPPDDKYGRPGFKTVLRKVKDAWENKRDVLVKSGAFAIEQLSEALSATAS 181

Query: 273 SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSG 332
           SNKL D +PQ AL LCAEQL+ +YD  +GGFGSAPKFPRPVEIQ+MLYH KKLE++GKSG
Sbjct: 182 SNKLADGIPQQALHLCAEQLAGNYDPEYGGFGSAPKFPRPVEIQLMLYHYKKLEESGKSG 241

Query: 333 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 392
           EA+E  KMV F+LQCMA+GG+HDH+GGGFHRYSVDE WHVPHFEKMLYDQGQLAN YLD 
Sbjct: 242 EANEVLKMVAFSLQCMARGGVHDHIGGGFHRYSVDECWHVPHFEKMLYDQGQLANAYLDV 301

Query: 393 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 452
           FS+TKDVFYS + RDILDYLRRDMIGP GEIFSAEDADSAE+E A RKKEGAFY+WTSKE
Sbjct: 302 FSITKDVFYSCVSRDILDYLRRDMIGPEGEIFSAEDADSAESEDAARKKEGAFYIWTSKE 361

Query: 453 VEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 512
           VED++GEHA LFK+HYY+KP+GNCDLSRMSDPHNEFKGKNVLIE N +SA ASKLGMP+E
Sbjct: 362 VEDVIGEHASLFKDHYYIKPSGNCDLSRMSDPHNEFKGKNVLIERNCASAMASKLGMPVE 421

Query: 513 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 572
           KYL+ILG CRRKLFDVR  RPRPHLDDKVIVSWNGL ISSFARASKILKSEAE   F FP
Sbjct: 422 KYLDILGTCRRKLFDVRLNRPRPHLDDKVIVSWNGLAISSFARASKILKSEAEGTKFRFP 481

Query: 573 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 632
           VVG D KEYMEVAE AASFIR+ LYDEQT RL+HSFRNGPSKAPGFLDDYAFLISGLLD+
Sbjct: 482 VVGCDPKEYMEVAEKAASFIRKWLYDEQTRRLRHSFRNGPSKAPGFLDDYAFLISGLLDI 541

Query: 633 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 692
           YEFG  T WLVWAIELQ+TQDELFLD+EGGGYFNT GEDPSVLLRVKEDHDGAEPSGNSV
Sbjct: 542 YEFGGNTNWLVWAIELQDTQDELFLDKEGGGYFNTPGEDPSVLLRVKEDHDGAEPSGNSV 601

Query: 693 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHV 752
           SVINLVRL S+VAGS  + +R+NAEH LAVFETRLKDMAMAVPLMCC ADM SVPSRK V
Sbjct: 602 SVINLVRLTSMVAGSWFERHRRNAEHLLAVFETRLKDMAMAVPLMCCGADMFSVPSRKQV 661

Query: 753 VLVGHKSSVDFENMLAAAHASYDLNKTV 780
           VLVGHKSSV+FE+MLAAAHA YD N+TV
Sbjct: 662 VLVGHKSSVEFEDMLAAAHAQYDPNRTV 689


>gi|255559290|ref|XP_002520665.1| conserved hypothetical protein [Ricinus communis]
 gi|223540050|gb|EEF41627.1| conserved hypothetical protein [Ricinus communis]
          Length = 874

 Score = 1218 bits (3152), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 587/695 (84%), Positives = 638/695 (91%), Gaps = 1/695 (0%)

Query: 86  MAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSI 145
           MAER PA T+ + +KHTNRLAAEHSPYLLQHAHNPVDW+ WGEEAFAEAR+RDVPIFLSI
Sbjct: 1   MAER-PAETTSTSHKHTNRLAAEHSPYLLQHAHNPVDWYPWGEEAFAEARRRDVPIFLSI 59

Query: 146 GYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWP 205
           GYSTCHWCHVMEVESFEDE VAKLLNDWFVSIKVDREERPDVDKVYMT+VQALYGGGGWP
Sbjct: 60  GYSTCHWCHVMEVESFEDESVAKLLNDWFVSIKVDREERPDVDKVYMTFVQALYGGGGWP 119

Query: 206 LSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 265
           LSVFLSPDLKPLMGGTYFPPED YGRPGFKT+LRKVKDAWDKKRD+L +SGAFAIEQLSE
Sbjct: 120 LSVFLSPDLKPLMGGTYFPPEDNYGRPGFKTLLRKVKDAWDKKRDVLIKSGAFAIEQLSE 179

Query: 266 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 325
           ALSASAS+NKLPD LPQNALR CAEQLS+SYD+RFGGFGSAPKFPRPVEIQ+MLYH+KKL
Sbjct: 180 ALSASASTNKLPDGLPQNALRSCAEQLSQSYDARFGGFGSAPKFPRPVEIQLMLYHAKKL 239

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
           ED+ K  +A EG KMV  +LQCMAKGGIHDH+GGGFHRYSVDERWHVPHFEKMLYDQGQL
Sbjct: 240 EDSEKVDDAKEGFKMVFSSLQCMAKGGIHDHIGGGFHRYSVDERWHVPHFEKMLYDQGQL 299

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
           AN+YLDAFS+T DVFYS++ RDILDYLRRDMIG  GEIFSAEDADSAE EGA +K+EGAF
Sbjct: 300 ANIYLDAFSITNDVFYSFVSRDILDYLRRDMIGQKGEIFSAEDADSAEHEGAKKKREGAF 359

Query: 446 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 505
           YVWT KE++DILGEHA LFK+HYY+KP GNCDLSRMSDPH EFKGKNVLIELND SA AS
Sbjct: 360 YVWTDKEIDDILGEHATLFKDHYYIKPLGNCDLSRMSDPHKEFKGKNVLIELNDPSALAS 419

Query: 506 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 565
           K G+P+EKY +ILGE +R LFDVR++RPRPHLDDKVIVSWNGL IS+FARASKILK E+E
Sbjct: 420 KHGLPIEKYQDILGESKRMLFDVRARRPRPHLDDKVIVSWNGLAISAFARASKILKRESE 479

Query: 566 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 625
              +NFPVVG D +EY+EVAE+AA+FIR+HLY+EQT RLQHSFRNGPSKAPGFLDDYAFL
Sbjct: 480 GTRYNFPVVGCDPREYIEVAENAATFIRKHLYEEQTRRLQHSFRNGPSKAPGFLDDYAFL 539

Query: 626 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 685
           ISGLLDLYEFG G  WLVWA ELQNTQDELFLD+EGGGYFNT GEDPSVLLRVKEDHDGA
Sbjct: 540 ISGLLDLYEFGGGIYWLVWATELQNTQDELFLDKEGGGYFNTPGEDPSVLLRVKEDHDGA 599

Query: 686 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 745
           EPSGNSVS INL+RLAS+V GSKS+ YR NAEH LAVFETRLKDMAMAVPLMCCAADM+S
Sbjct: 600 EPSGNSVSAINLIRLASMVTGSKSECYRHNAEHLLAVFETRLKDMAMAVPLMCCAADMIS 659

Query: 746 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           VPSRK VVLVGHK S + ++MLAAAH SYD NKTV
Sbjct: 660 VPSRKQVVLVGHKPSSELDDMLAAAHESYDPNKTV 694


>gi|449436537|ref|XP_004136049.1| PREDICTED: spermatogenesis-associated protein 20-like [Cucumis
           sativus]
          Length = 855

 Score = 1212 bits (3135), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 582/759 (76%), Positives = 650/759 (85%), Gaps = 9/759 (1%)

Query: 23  LCFFRTLDNSSSMLERLLCSSSLHHFLSHKTKLSSLPRNYLYPF-RRPLAVISHRPIHPY 81
             FF +   SSSML       SL HF S  +     PR   +PF   P +     PI+P+
Sbjct: 42  FSFFPSQFPSSSMLPFF----SLRHFNSSISPSLPFPR---FPFLSSPFSFRFSTPIYPH 94

Query: 82  KVVAMAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPI 141
           KV AMA R+ +  S   + +TNRLA EHSPYLLQHAHNPV+W+ WGEEAFAEA+KR+VPI
Sbjct: 95  KVFAMAARS-SGGSSHSHGYTNRLATEHSPYLLQHAHNPVNWYPWGEEAFAEAQKRNVPI 153

Query: 142 FLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGG 201
           FLSIGYSTCHWCHVMEVESFE++ VAKLLNDWFVSIKVDREERPDVDKVYMTYVQALY G
Sbjct: 154 FLSIGYSTCHWCHVMEVESFENKEVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYSG 213

Query: 202 GGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIE 261
           GGWPLSVFLSPDLKPLMGGTYFPP+DKYGRPGFKT+LRKVKDAWD KRD+L +SG FAIE
Sbjct: 214 GGWPLSVFLSPDLKPLMGGTYFPPDDKYGRPGFKTVLRKVKDAWDNKRDVLVKSGTFAIE 273

Query: 262 QLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYH 321
           QLSEAL+ +ASSNKLP+ELPQNAL LCAEQLS+SYD  FGGFGSAPKFPRPVE Q+MLY+
Sbjct: 274 QLSEALATTASSNKLPEELPQNALHLCAEQLSQSYDPNFGGFGSAPKFPRPVEAQLMLYY 333

Query: 322 SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 381
           +K+LE++GKS EA E   MV+F LQCMA+GGIHDHVGGGFHRYSVDE WHVPHFEKMLYD
Sbjct: 334 AKRLEESGKSDEAEEILNMVIFGLQCMARGGIHDHVGGGFHRYSVDECWHVPHFEKMLYD 393

Query: 382 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 441
           QGQ+ NVYLDAFS+TKDVFYS++ RD+LDYLRRDMIG  GEI+SAEDADSAE+EGATRKK
Sbjct: 394 QGQITNVYLDAFSITKDVFYSWVSRDVLDYLRRDMIGTQGEIYSAEDADSAESEGATRKK 453

Query: 442 EGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 501
           EGAFYVWT KE++DILGEHA  FKEHYY+KP+GNCDLSRMSDPH+EFKGKNVLIE+   S
Sbjct: 454 EGAFYVWTRKEIDDILGEHADFFKEHYYIKPSGNCDLSRMSDPHDEFKGKNVLIEMKSVS 513

Query: 502 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 561
             AS   MP+EKYL ILGECR+KLF+VR +RP+PHLDDKVIVSWNGL ISSFARASKIL+
Sbjct: 514 EMASNHSMPVEKYLEILGECRQKLFEVRERRPKPHLDDKVIVSWNGLTISSFARASKILR 573

Query: 562 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 621
           +E E   F FPVVG D KEY +VAE AA FI+  LYDEQTHRLQHSFRNGPSKAPGFLDD
Sbjct: 574 NEKEGTRFYFPVVGCDPKEYFDVAEKAALFIKTKLYDEQTHRLQHSFRNGPSKAPGFLDD 633

Query: 622 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 681
           YAFLI GLLDLYE+G G  WLVWAIELQ TQDELFLDREGGGY+NTTGED SV+LRVKED
Sbjct: 634 YAFLIGGLLDLYEYGGGLNWLVWAIELQATQDELFLDREGGGYYNTTGEDKSVILRVKED 693

Query: 682 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 741
           HDGAEPSGNSVS INLVRL+S+V+GS+S+YYRQNAEH LAVFE RLK+MA+AVPL+CCAA
Sbjct: 694 HDGAEPSGNSVSAINLVRLSSLVSGSRSNYYRQNAEHLLAVFEKRLKEMAVAVPLLCCAA 753

Query: 742 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
            M S+PSRK VVLVGHK+S  FE  LAAAHASYD N+TV
Sbjct: 754 GMFSIPSRKQVVLVGHKNSTQFETFLAAAHASYDPNRTV 792


>gi|449498445|ref|XP_004160539.1| PREDICTED: LOW QUALITY PROTEIN: spermatogenesis-associated protein
           20-like [Cucumis sativus]
          Length = 855

 Score = 1201 bits (3108), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 578/759 (76%), Positives = 646/759 (85%), Gaps = 9/759 (1%)

Query: 23  LCFFRTLDNSSSMLERLLCSSSLHHFLSHKTKLSSLPRNYLYPF-RRPLAVISHRPIHPY 81
             FF +   SSSML       SL HF S  +     PR   +PF   P +     PI+P+
Sbjct: 42  FSFFPSQFPSSSMLPFF----SLRHFNSSISPSLPFPR---FPFLSSPFSFRFSTPIYPH 94

Query: 82  KVVAMAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPI 141
           KV AMA R+ +  S   + +TNRLA EHSPYLLQHAHNPV+W+ WGEEAFAEA+KR+VPI
Sbjct: 95  KVFAMAARS-SGGSSHSHGYTNRLATEHSPYLLQHAHNPVNWYPWGEEAFAEAQKRNVPI 153

Query: 142 FLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGG 201
           FLSIGYSTCHWCHVMEVESFE++ VAKLLNDWFVSIKVDREERPDVDKVYMTYVQALY G
Sbjct: 154 FLSIGYSTCHWCHVMEVESFENKEVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYSG 213

Query: 202 GGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIE 261
           GGWPLSVFLSPDLKPLMGGTYFPP+DKYGRPGFKT+LRKVKDAWD KRD+L +SG FAIE
Sbjct: 214 GGWPLSVFLSPDLKPLMGGTYFPPDDKYGRPGFKTVLRKVKDAWDNKRDVLVKSGTFAIE 273

Query: 262 QLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYH 321
           QLSEAL+ +ASSNKLP+ELPQNAL LCAEQLS+SYD  FGGFGSAPKFPRPVE Q+MLY+
Sbjct: 274 QLSEALATTASSNKLPEELPQNALHLCAEQLSQSYDPNFGGFGSAPKFPRPVEAQLMLYY 333

Query: 322 SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 381
           +K+LE++GKS EA E   MV+F LQCMA+GGIHDHVGGGFHRYSVDE WHVPHFEKMLYD
Sbjct: 334 AKRLEESGKSDEAEEILNMVIFGLQCMARGGIHDHVGGGFHRYSVDECWHVPHFEKMLYD 393

Query: 382 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 441
           QG + NVYLDAFS+TKD  YS++ RD+LDYLRRDMIG  GEI+SAEDADSAE+EGATR K
Sbjct: 394 QGXITNVYLDAFSITKDXLYSWVSRDVLDYLRRDMIGTQGEIYSAEDADSAESEGATRXK 453

Query: 442 EGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 501
           EGAFYVWT KE++DILGEHA  FKEHYY+KP+GNCDLSRMSDPH+EFKGKNVLIE+   S
Sbjct: 454 EGAFYVWTRKEIDDILGEHADFFKEHYYIKPSGNCDLSRMSDPHDEFKGKNVLIEMKSVS 513

Query: 502 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 561
             AS   MP+EKYL ILGECR+KLF+VR +RP+PHLDDKVIVSWNGL ISSFARASKIL+
Sbjct: 514 EMASNHSMPVEKYLEILGECRQKLFEVRERRPKPHLDDKVIVSWNGLTISSFARASKILR 573

Query: 562 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 621
           +E E   F FPVVG D KEY +VAE AA FI+  LYDEQTHRLQHSFRNGPSKAPGFLDD
Sbjct: 574 NEKEGTRFYFPVVGCDPKEYFDVAEKAALFIKTKLYDEQTHRLQHSFRNGPSKAPGFLDD 633

Query: 622 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 681
           YAFLI GLLDLYE+G G  WLVWAIELQ TQDELFLDREGGGY+NTTGED SV+LRVKED
Sbjct: 634 YAFLIGGLLDLYEYGGGLNWLVWAIELQATQDELFLDREGGGYYNTTGEDKSVILRVKED 693

Query: 682 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 741
           HDGAEPSGNSVS INLVRL+S+V+GS+S+YYRQNAEH LAVFE RLK+MA+AVPL+CCAA
Sbjct: 694 HDGAEPSGNSVSAINLVRLSSLVSGSRSNYYRQNAEHLLAVFEKRLKEMAVAVPLLCCAA 753

Query: 742 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
            M S+PSRK VVLVGHK+S  FE  LAAAHASYD N+TV
Sbjct: 754 GMFSIPSRKQVVLVGHKNSTQFETFLAAAHASYDPNRTV 792


>gi|115432144|gb|ABI97349.1| cold-induced thioredoxin domain-containing protein [Ammopiptanthus
           mongolicus]
          Length = 839

 Score = 1173 bits (3034), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 574/707 (81%), Positives = 627/707 (88%), Gaps = 1/707 (0%)

Query: 75  HRPIHPYKVVAMAERTPASTSHSR-NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAE 133
           H P  P K+++MA  + +S++HS   K+TNRLA+E SPYLLQHAHNPVDW+ WGEEAF+E
Sbjct: 66  HLPFRPLKLLSMATSSSSSSTHSHSQKYTNRLASEQSPYLLQHAHNPVDWYPWGEEAFSE 125

Query: 134 ARKRDVPIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMT 193
           A +RDVPIFLSIGYSTCHWCHVMEVESFEDE VAKLLNDWFVSIKVDREERPDVDKVYMT
Sbjct: 126 ASRRDVPIFLSIGYSTCHWCHVMEVESFEDEEVAKLLNDWFVSIKVDREERPDVDKVYMT 185

Query: 194 YVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLA 253
           YVQALYGGGGWPLSVFLSPDLKPLMGGTYFPP+DKYGRPGFKTILRKVK+AWD KRDML 
Sbjct: 186 YVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPDDKYGRPGFKTILRKVKEAWDSKRDMLI 245

Query: 254 QSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPV 313
           +SGAF IEQLSEALSAS+ S+KLPD +P  AL LC+EQLS SYDS+FGGFGSAPKFPRPV
Sbjct: 246 KSGAFTIEQLSEALSASSVSDKLPDGVPDEALNLCSEQLSGSYDSKFGGFGSAPKFPRPV 305

Query: 314 EIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVP 373
           E  +MLYHS+KLEDTGK G A+E QKMV F LQCMAKGGIHDH+GGGFHRYSVDE WHVP
Sbjct: 306 EFNLMLYHSRKLEDTGKLGAANESQKMVFFNLQCMAKGGIHDHIGGGFHRYSVDECWHVP 365

Query: 374 HFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAE 433
           HFEKMLYDQGQLANVYLDAFS+TKD FYS I +DILDYLRRDMIGP GEIFSAEDADSAE
Sbjct: 366 HFEKMLYDQGQLANVYLDAFSITKDTFYSCISQDILDYLRRDMIGPEGEIFSAEDADSAE 425

Query: 434 TEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 493
            EGATRKKEGAFY+WTSKEVEDILG+HA LFKEHYY+K +GNCDLSRMSDPH+EFKGKNV
Sbjct: 426 IEGATRKKEGAFYIWTSKEVEDILGDHAALFKEHYYIKQSGNCDLSRMSDPHDEFKGKNV 485

Query: 494 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSF 553
           LIE  D+S  ASK GM +E Y  ILGECRRKLF+VRS+R RPHLDDKVIVSWNGL ISSF
Sbjct: 486 LIERKDTSEMASKYGMSVETYQEILGECRRKLFEVRSRRSRPHLDDKVIVSWNGLAISSF 545

Query: 554 ARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPS 613
           ARASKILK EAE   FNFPVVG++ KEY+ +AE AA FIR+ LYD +THRL HSFRN PS
Sbjct: 546 ARASKILKREAEGTKFNFPVVGTEPKEYLVIAEKAAFFIRKQLYDVETHRLHHSFRNSPS 605

Query: 614 KAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPS 673
           KAPGFLDDYAFLISGLLDLYEFG G  WL+WA ELQ TQD LFLDR+GGGYFN  GEDPS
Sbjct: 606 KAPGFLDDYAFLISGLLDLYEFGGGINWLLWAFELQETQDALFLDRDGGGYFNNAGEDPS 665

Query: 674 VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA 733
           VLLRVKEDHDGAEPSGNSVS INL+RLAS+VAGSK+  Y++NAEH LAVFE RLKDMAMA
Sbjct: 666 VLLRVKEDHDGAEPSGNSVSAINLIRLASMVAGSKAADYKRNAEHLLAVFEKRLKDMAMA 725

Query: 734 VPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           VPLMCCAADML VPSRK VV+VG +S  +FE+MLAAAHASYD N+TV
Sbjct: 726 VPLMCCAADMLRVPSRKQVVVVGERSFEEFESMLAAAHASYDPNRTV 772


>gi|356570951|ref|XP_003553646.1| PREDICTED: spermatogenesis-associated protein 20-like [Glycine max]
          Length = 755

 Score = 1169 bits (3025), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 554/688 (80%), Positives = 610/688 (88%), Gaps = 1/688 (0%)

Query: 93  STSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHW 152
           ++SHS + H NRLA+E SPYLLQHAHNPV W+ WGEEAFAEAR+RD PIFLSIGYSTCHW
Sbjct: 2   ASSHS-HIHINRLASEQSPYLLQHAHNPVHWYPWGEEAFAEARRRDAPIFLSIGYSTCHW 60

Query: 153 CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 212
           CHVMEVESFEDE VAKLLNDWFVSIKVDREERPDVDKVYM+YVQALYGGGGWPLSVFLSP
Sbjct: 61  CHVMEVESFEDEAVAKLLNDWFVSIKVDREERPDVDKVYMSYVQALYGGGGWPLSVFLSP 120

Query: 213 DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASAS 272
           DLKPLMGGTYFPP+DKYGRPGFKTILRK+K+AWD KRDML + G++AIEQLSEA+SAS+ 
Sbjct: 121 DLKPLMGGTYFPPDDKYGRPGFKTILRKLKEAWDSKRDMLIKRGSYAIEQLSEAMSASSD 180

Query: 273 SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSG 332
           S+KLPD +P +ALRLC+EQLS SYDS+FGGFGSAPKFPRPVEI +MLYHSKKLEDTGK  
Sbjct: 181 SDKLPDGVPADALRLCSEQLSGSYDSKFGGFGSAPKFPRPVEINLMLYHSKKLEDTGKLD 240

Query: 333 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 392
            A+  QKMV F+LQCMAKGG+HDH+GGGFHRYSVDE WHVPHFEKMLYDQGQLANVYLDA
Sbjct: 241 GANRIQKMVFFSLQCMAKGGMHDHIGGGFHRYSVDECWHVPHFEKMLYDQGQLANVYLDA 300

Query: 393 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 452
           FS+TKD FYSYI RDILDYLRRDMIGP GEIFSAEDADSAETEGA RKKEGAFY+WT KE
Sbjct: 301 FSITKDTFYSYISRDILDYLRRDMIGPEGEIFSAEDADSAETEGAARKKEGAFYIWTGKE 360

Query: 453 VEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 512
           V DILGEHA LF+EHYY+K +GNC+LS MSDPH+EFKGKNVLIE  + S  ASK GM +E
Sbjct: 361 VADILGEHAALFEEHYYIKQSGNCNLSGMSDPHDEFKGKNVLIERKEPSELASKYGMSIE 420

Query: 513 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 572
            Y  ILGECR KLF+VRS+RP+PHLDDKVIVSWNGL ISSFARASKILK E E   F FP
Sbjct: 421 TYQEILGECRHKLFEVRSRRPKPHLDDKVIVSWNGLAISSFARASKILKGEVEGTKFYFP 480

Query: 573 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 632
           VVG++ K Y+ +AE AA FI + LY+ +THRL HSFR+ PSKAP FLDDYAFLISGLLDL
Sbjct: 481 VVGTEAKGYLRIAEKAAFFIWKQLYNVETHRLHHSFRHSPSKAPAFLDDYAFLISGLLDL 540

Query: 633 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 692
           YEFG G  WL+WAIELQ TQD LFLDR GGGYFN TGED SVLLRVKEDHDGAEPSGNSV
Sbjct: 541 YEFGGGINWLLWAIELQETQDALFLDRTGGGYFNNTGEDSSVLLRVKEDHDGAEPSGNSV 600

Query: 693 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHV 752
           S INL+RLAS+VAGSK+++Y+QNAEH LAVFE RLKDMAMAVPLMCCAADML VPSRK V
Sbjct: 601 SAINLIRLASMVAGSKAEHYKQNAEHLLAVFERRLKDMAMAVPLMCCAADMLHVPSRKQV 660

Query: 753 VLVGHKSSVDFENMLAAAHASYDLNKTV 780
           V+VG ++S DFENMLAAAHA YD N+TV
Sbjct: 661 VVVGERTSGDFENMLAAAHALYDPNRTV 688


>gi|356505532|ref|XP_003521544.1| PREDICTED: spermatogenesis-associated protein 20-like [Glycine max]
          Length = 809

 Score = 1162 bits (3005), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 574/732 (78%), Positives = 635/732 (86%), Gaps = 8/732 (1%)

Query: 49  LSHKTKLSSLPRNYLYPFRRPLAVISHRPIHPYKVVAMAERTPASTSHSRNKHTNRLAAE 108
           L H+     LPR   + FR+P    S+      KV++MA     S+  S + HTNRLA+E
Sbjct: 19  LLHRFSPLLLPR---FLFRQPPFPSSNFKPLTLKVLSMA-----SSHSSHHIHTNRLASE 70

Query: 109 HSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFEDEGVAK 168
            SPYLLQHAHNPVDW+ WGEEAFAEAR+RD PIFLSIGYSTCHWCHVMEVESFEDE VAK
Sbjct: 71  QSPYLLQHAHNPVDWYPWGEEAFAEARRRDAPIFLSIGYSTCHWCHVMEVESFEDEAVAK 130

Query: 169 LLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDK 228
           LLNDWFVSIKVDREERPDVDKVYM+YVQALYGGGGWPLSVFLSPDLKPLMGGTYFPP+DK
Sbjct: 131 LLNDWFVSIKVDREERPDVDKVYMSYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPDDK 190

Query: 229 YGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLC 288
           YGRPGFKTILRKVK+AWD KRDML +SG++AIEQLSEA+SAS+ S+KLPD +P +ALRLC
Sbjct: 191 YGRPGFKTILRKVKEAWDSKRDMLIKSGSYAIEQLSEAMSASSDSDKLPDGVPADALRLC 250

Query: 289 AEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCM 348
           +EQLS SYDS+FGGFGSAPKFPRPVEI +MLYHSKKLEDTGK G A+  Q+MV F+LQCM
Sbjct: 251 SEQLSGSYDSKFGGFGSAPKFPRPVEINLMLYHSKKLEDTGKLGVANGSQQMVFFSLQCM 310

Query: 349 AKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDI 408
           AKGGIHDH+GGGFHRYSVDE WHVPHFEKMLYDQGQLANVYLDAFS+TKD FYSYI RDI
Sbjct: 311 AKGGIHDHIGGGFHRYSVDECWHVPHFEKMLYDQGQLANVYLDAFSITKDTFYSYISRDI 370

Query: 409 LDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHY 468
           LDYLRRDMIGP GEIFSAEDADSAETEGA RKKEGAFY+WTSKEVED+LGEHA LF+EHY
Sbjct: 371 LDYLRRDMIGPEGEIFSAEDADSAETEGAARKKEGAFYIWTSKEVEDLLGEHAALFEEHY 430

Query: 469 YLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDV 528
           Y+K  GNCDLS MSDPH+EFKGKNVLIE  + S  ASK GM +E Y  ILGECR KLF+V
Sbjct: 431 YIKQLGNCDLSGMSDPHDEFKGKNVLIERKEPSELASKYGMSVETYQEILGECRHKLFEV 490

Query: 529 RSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESA 588
           RS+RP+PHLDDKVIVSWNGL ISSFARASKILK EAE   F FPV+G++ KEYM +AE A
Sbjct: 491 RSRRPKPHLDDKVIVSWNGLAISSFARASKILKGEAEGTKFYFPVIGTEPKEYMGIAEKA 550

Query: 589 ASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIEL 648
           ASFIR+ LY+ +THRL HSFR+ PSKAP FLDDYAFLISGLLDLYEFG G  WL+WAIEL
Sbjct: 551 ASFIRKQLYNVETHRLHHSFRHSPSKAPAFLDDYAFLISGLLDLYEFGGGISWLLWAIEL 610

Query: 649 QNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSK 708
           Q TQD LFLD+ GGGYFN TGED SVLLRVKEDHDGAEPSGNSVS INL+RLAS+VAGSK
Sbjct: 611 QETQDALFLDKTGGGYFNNTGEDASVLLRVKEDHDGAEPSGNSVSAINLIRLASMVAGSK 670

Query: 709 SDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLA 768
           +++Y++NAEH LAVFE RLKDMAMAVPLMCCAADML V SRK VV+VG ++S DFENMLA
Sbjct: 671 AEHYKRNAEHLLAVFEKRLKDMAMAVPLMCCAADMLRVLSRKQVVVVGERTSEDFENMLA 730

Query: 769 AAHASYDLNKTV 780
           AAHA YD N+TV
Sbjct: 731 AAHAVYDPNRTV 742


>gi|224132400|ref|XP_002321330.1| predicted protein [Populus trichocarpa]
 gi|222862103|gb|EEE99645.1| predicted protein [Populus trichocarpa]
          Length = 756

 Score = 1157 bits (2993), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 568/678 (83%), Positives = 617/678 (91%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL+AEHSPYLLQHAHNPV+W+ WGEEAFAEAR+RDVPIFLSIGYSTCHWCHVM+VESFE
Sbjct: 16  NRLSAEHSPYLLQHAHNPVNWYPWGEEAFAEARRRDVPIFLSIGYSTCHWCHVMKVESFE 75

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA+LLND FVS+KVDREERPDVDKVYMT+VQALYGGGGWPLSVF+SPDLKPLMGGTY
Sbjct: 76  DEEVAELLNDSFVSVKVDREERPDVDKVYMTFVQALYGGGGWPLSVFISPDLKPLMGGTY 135

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPP+DKYGRPGFKTILRKVKDAW  KRD L +SGAFAIEQLSEALSASASS KLPDEL Q
Sbjct: 136 FPPDDKYGRPGFKTILRKVKDAWFSKRDTLVKSGAFAIEQLSEALSASASSKKLPDELSQ 195

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
           NAL LCAEQLS+SYDSR+GGFGSAPKFPRPVEIQ+MLYHSKKL+D G   E+ +G +MV 
Sbjct: 196 NALHLCAEQLSQSYDSRYGGFGSAPKFPRPVEIQLMLYHSKKLDDAGNYSESKKGLQMVF 255

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
           FTLQCMA+GGIHDH+GGGFHRYSVDERWHVPHFEKMLYDQGQL NVYLDAFS+T DVFYS
Sbjct: 256 FTLQCMARGGIHDHIGGGFHRYSVDERWHVPHFEKMLYDQGQLVNVYLDAFSITNDVFYS 315

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            + RDILDYLRRDMIGP GEIFSAEDADSAE E A +KKEGAFY+WTS+E++D+LGEHA 
Sbjct: 316 SLSRDILDYLRRDMIGPEGEIFSAEDADSAEREDAKKKKEGAFYIWTSQEIDDLLGEHAT 375

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
           LFK+HYY+KP GNCDLSRMSDP +EFKGKNVLIEL D+SA A K G+PLEKYL+ILGECR
Sbjct: 376 LFKDHYYVKPLGNCDLSRMSDPQDEFKGKNVLIELTDTSAPAKKYGLPLEKYLDILGECR 435

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYM 582
           +KLFD RS+ PRPHLDDKVIVSWNGL ISS ARASKIL  EAE   +NFPVVG D KEYM
Sbjct: 436 QKLFDARSRGPRPHLDDKVIVSWNGLAISSLARASKILMGEAEGTKYNFPVVGCDPKEYM 495

Query: 583 EVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWL 642
             AE AASFIRRHLY+EQ HRL+HSFRNGPSKAPGFLDDYAFLISGLLDLYE G G  WL
Sbjct: 496 TAAEKAASFIRRHLYNEQAHRLEHSFRNGPSKAPGFLDDYAFLISGLLDLYEVGGGIHWL 555

Query: 643 VWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLAS 702
           VWA ELQN QDELFLDREGGGYFNT GEDPSVLLRVKEDHDGAEPSGNSVS INL+RLAS
Sbjct: 556 VWATELQNKQDELFLDREGGGYFNTPGEDPSVLLRVKEDHDGAEPSGNSVSAINLIRLAS 615

Query: 703 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 762
           ++ GSKS+YYRQNAEH LAVFE+RLKDMAMAVPLMCCAADM+SVPS K VVLVGHKSS++
Sbjct: 616 MMTGSKSEYYRQNAEHLLAVFESRLKDMAMAVPLMCCAADMISVPSHKQVVLVGHKSSLE 675

Query: 763 FENMLAAAHASYDLNKTV 780
           F+ MLAAAHASYD N+TV
Sbjct: 676 FDKMLAAAHASYDPNRTV 693


>gi|357511183|ref|XP_003625880.1| Spermatogenesis-associated protein [Medicago truncatula]
 gi|355500895|gb|AES82098.1| Spermatogenesis-associated protein [Medicago truncatula]
          Length = 809

 Score = 1154 bits (2985), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 571/750 (76%), Positives = 637/750 (84%), Gaps = 24/750 (3%)

Query: 43  SSLHHFLSHKTKLSSLPRNYLYPFRRPLAVISHRPIHPYKVVAMAERTPASTSHS-RNKH 101
           S L+ F  H  K       +  PF+     +        KV++MA     ++SHS ++K 
Sbjct: 8   SVLNRFFYHNQKHFPTSTKFRTPFKFSRVTLP-------KVLSMA-----TSSHSDQHKF 55

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRLA+E SPYLLQHAHNPVDW+ WGEEAFAEAR+RD PIFLSIGYSTCHWCHVMEVESF
Sbjct: 56  TNRLASEQSPYLLQHAHNPVDWYPWGEEAFAEARRRDAPIFLSIGYSTCHWCHVMEVESF 115

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           EDEG+AKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPL+VFLSPDLKPLMGGT
Sbjct: 116 EDEGIAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLTVFLSPDLKPLMGGT 175

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFPPEDKYGRPGFKTILRKVK+AW+ KRDML +SG FAIEQLSEALS+S++S+KLPD + 
Sbjct: 176 YFPPEDKYGRPGFKTILRKVKEAWENKRDMLVKSGTFAIEQLSEALSSSSNSDKLPDGVS 235

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           ++ALRLC+EQLS++YDS +GGFGSAPKFPRPVEI +MLY SKKLEDTGK   A++ QKMV
Sbjct: 236 EDALRLCSEQLSENYDSEYGGFGSAPKFPRPVEINLMLYKSKKLEDTGKLDGANKSQKMV 295

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWH-----------VPHFEKMLYDQGQLANVYL 390
            FTLQCMAKGG+HDHVGGGFHRYSVDE WH           VPHFEKMLYDQGQLANVYL
Sbjct: 296 FFTLQCMAKGGVHDHVGGGFHRYSVDECWHDIYSLSSYTHAVPHFEKMLYDQGQLANVYL 355

Query: 391 DAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTS 450
           DAFS+TKD FYS + RDILDYLRRDMIGP GEIFSAEDADSAE EG TRKKEGAFYVWTS
Sbjct: 356 DAFSITKDTFYSSLSRDILDYLRRDMIGPEGEIFSAEDADSAENEGDTRKKEGAFYVWTS 415

Query: 451 KEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 510
           KEVED+LGEHA LF+EHYY+K  GNCDLS MSDPHNEFKGKNVLIE  DSS  ASK GM 
Sbjct: 416 KEVEDLLGEHAALFEEHYYIKQMGNCDLSEMSDPHNEFKGKNVLIERKDSSEMASKYGMS 475

Query: 511 LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
           +E Y  ILGECRRKLF+VR KRP+PHLDDKVIVSWNGLVISSFARASKILK EAE   FN
Sbjct: 476 IETYQEILGECRRKLFEVRLKRPKPHLDDKVIVSWNGLVISSFARASKILKGEAEGIKFN 535

Query: 571 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLL 630
           FPVVG++ KEY+ +A+ AASFI+  LY+ +THRLQHSFRN PSKAPGFLDDYAFLISGLL
Sbjct: 536 FPVVGTEPKEYLRIADKAASFIKNQLYNTETHRLQHSFRNSPSKAPGFLDDYAFLISGLL 595

Query: 631 DLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGN 690
           DLYEFG    WL+WAIELQ TQD LFLD++GGGYFN TGED SVLLRVKEDHDGAEPSGN
Sbjct: 596 DLYEFGGEINWLLWAIELQETQDTLFLDKDGGGYFNNTGEDSSVLLRVKEDHDGAEPSGN 655

Query: 691 SVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRK 750
           SVS +NL+RLAS+V+GSK+++Y++NAEH LAVFE RLKD AMAVPLMCCAADML VPSRK
Sbjct: 656 SVSALNLIRLASLVSGSKAEHYKRNAEHLLAVFEKRLKDTAMAVPLMCCAADMLRVPSRK 715

Query: 751 HVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
            VVLVG ++S +FE+ML AAHA YD N+TV
Sbjct: 716 QVVLVGERTSEEFESMLGAAHALYDPNRTV 745


>gi|297813987|ref|XP_002874877.1| hypothetical protein ARALYDRAFT_911883 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297320714|gb|EFH51136.1| hypothetical protein ARALYDRAFT_911883 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 812

 Score = 1135 bits (2935), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 548/742 (73%), Positives = 620/742 (83%), Gaps = 10/742 (1%)

Query: 45  LHHFLSHKTKLSSLPRNYLY------PFRRPLAVISHRPIHPYKVVAMAERTPASTSHSR 98
           LH F S    LSSLPR  +        F  P   I  RPI   KV+AMAE + +ST  + 
Sbjct: 15  LHRFAS----LSSLPRRRIIVRIPNPSFSSPFPPILSRPISSGKVLAMAEESSSSTPSTS 70

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
            KHTNRLAAEHSPYLLQHAHNPVDW+ WGE+AF EARKRDVPIFLSIGYSTCHWCHVMEV
Sbjct: 71  QKHTNRLAAEHSPYLLQHAHNPVDWYPWGEDAFEEARKRDVPIFLSIGYSTCHWCHVMEV 130

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE VAKLLND FVSIKVDREERPDVDKVYM++VQALYGGGGWPLSVFLSPDLKPLM
Sbjct: 131 ESFEDEEVAKLLNDSFVSIKVDREERPDVDKVYMSFVQALYGGGGWPLSVFLSPDLKPLM 190

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
           GGTYFPP D YGRPGFKT+L+KVKDAWD KRD L +SG +AIE+L++ALSASA ++KL D
Sbjct: 191 GGTYFPPNDNYGRPGFKTLLKKVKDAWDSKRDTLVKSGTYAIEELTKALSASAGADKLSD 250

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
            + + A+ +CA+QLS+SYDS FGGFGSAPKFPRPVEIQ+MLY+ KKL+++GK+ EA E Q
Sbjct: 251 GISREAVSICAKQLSRSYDSEFGGFGSAPKFPRPVEIQLMLYYFKKLKESGKTSEADEEQ 310

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
            MVLF+LQ MA GG+HDH+GGGFHRYSVDE WHVPHFEKMLYDQGQLANVYLD F +TKD
Sbjct: 311 SMVLFSLQGMANGGMHDHIGGGFHRYSVDECWHVPHFEKMLYDQGQLANVYLDGFIITKD 370

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
           V YSY+ +DILDYLRRDMI P G IFSAEDADS E EGA RKKEGAFY+W+S E++++LG
Sbjct: 371 VIYSYVAKDILDYLRRDMIAPEGGIFSAEDADSFEFEGAKRKKEGAFYIWSSDEIDEVLG 430

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
           E+A LFKEHYY+K +GNCDLS  SDPHNEF GKNVLIE N+ SA ASK  + +EKY  IL
Sbjct: 431 ENADLFKEHYYVKKSGNCDLSSRSDPHNEFAGKNVLIERNEMSAMASKFSLSVEKYQEIL 490

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
           GECR+KLFDVR  RP+PHLDDK+IVSWNGLVISSFARASK+LK+E ES  + FPVV S  
Sbjct: 491 GECRKKLFDVRLNRPKPHLDDKIIVSWNGLVISSFARASKMLKAEPESTKYCFPVVNSQP 550

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
           +EY+EVAE AA FIR +LYDEQ+ RLQHS+R GPSKAP FLDDYAFLI+GLLDLYE G G
Sbjct: 551 EEYIEVAEKAALFIRGNLYDEQSRRLQHSYRQGPSKAPAFLDDYAFLIAGLLDLYENGGG 610

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
            +WL WAI+LQ TQDEL+LDREGG YFNT G+D SVLLRVKEDHDGAEPSGNSVS INLV
Sbjct: 611 IEWLKWAIKLQETQDELYLDREGGAYFNTEGQDSSVLLRVKEDHDGAEPSGNSVSAINLV 670

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHK 758
           RLASIV G K+D Y   A   LAVFE RL++MA+AVPLMCCAADM+SVPSRK VVLVG K
Sbjct: 671 RLASIVTGEKADSYLNTAHRLLAVFELRLREMAVAVPLMCCAADMISVPSRKQVVLVGSK 730

Query: 759 SSVDFENMLAAAHASYDLNKTV 780
           SS +  NML+AAH+ YD NKTV
Sbjct: 731 SSPELNNMLSAAHSVYDPNKTV 752


>gi|30679394|ref|NP_192229.3| uncharacterized protein [Arabidopsis thaliana]
 gi|332656888|gb|AEE82288.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 818

 Score = 1134 bits (2934), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 547/733 (74%), Positives = 620/733 (84%), Gaps = 7/733 (0%)

Query: 55  LSSLPRN------YLYPFRRPLAVISHRPIHPYKVVAMAERTPASTSHSR-NKHTNRLAA 107
           LS+LPR       +   F  P   I  RPI   KV+AMAE + +S++ S   KHTNRLAA
Sbjct: 26  LSTLPRRRNIVRIHNPSFSSPFPPILSRPISSGKVLAMAEESSSSSTSSTSQKHTNRLAA 85

Query: 108 EHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFEDEGVA 167
           EHSPYLLQHAHNPVDW+ WGEEAF EARKRDVPIFLSIGYSTCHWCHVMEVESFEDE VA
Sbjct: 86  EHSPYLLQHAHNPVDWYPWGEEAFEEARKRDVPIFLSIGYSTCHWCHVMEVESFEDEEVA 145

Query: 168 KLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPED 227
           KLLN+ FVSIKVDREERPDVDKVYM++VQALYGGGGWPLSVFLSPDLKPLMGGTYFPP D
Sbjct: 146 KLLNNSFVSIKVDREERPDVDKVYMSFVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPND 205

Query: 228 KYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL 287
            YGRPGFKT+L+KVKDAW+ KRD L +SG +AIE+LS+ALSAS  ++KL D + + A+  
Sbjct: 206 NYGRPGFKTLLKKVKDAWNSKRDTLVKSGTYAIEELSKALSASTGADKLSDGISREAVST 265

Query: 288 CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQC 347
           CA+QLS+SYDS FGGFGSAPKFPRPVEIQ+MLYH KKL+++GK+ EA E + MVLF+LQ 
Sbjct: 266 CAKQLSRSYDSEFGGFGSAPKFPRPVEIQLMLYHYKKLKESGKTSEADEEKSMVLFSLQG 325

Query: 348 MAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRD 407
           MA GG+HDH+GGGFHRYSVDE WHVPHFEKMLYDQGQLANVYLD FS+TKDV YSY+ RD
Sbjct: 326 MANGGMHDHIGGGFHRYSVDECWHVPHFEKMLYDQGQLANVYLDGFSITKDVMYSYVARD 385

Query: 408 ILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEH 467
           ILDYLRRDMI P G IFSAEDADS E EGA RKKEGAFY+WTS E++++LGE+A LFKEH
Sbjct: 386 ILDYLRRDMIAPEGGIFSAEDADSFEFEGAKRKKEGAFYIWTSDEIDEVLGENADLFKEH 445

Query: 468 YYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFD 527
           YY+K +GNCDLS  SDPHNEF GKNVLIE N++SA ASK  + +EKY  ILGECRRKLFD
Sbjct: 446 YYVKKSGNCDLSSRSDPHNEFAGKNVLIERNETSAMASKFSLSVEKYQEILGECRRKLFD 505

Query: 528 VRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAES 587
           VR KRP+PHLDDK+IVSWNGLVISSFARASKILK+E ES  + FPVV S  ++Y+EVAE 
Sbjct: 506 VRLKRPKPHLDDKIIVSWNGLVISSFARASKILKAEPESTKYYFPVVNSQPEDYIEVAEK 565

Query: 588 AASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIE 647
           AA FIR +LYDEQ+ RLQHS+R GPSKAP FLDDYAFLISGLLDLYE G G +WL WAI+
Sbjct: 566 AALFIRGNLYDEQSRRLQHSYRQGPSKAPAFLDDYAFLISGLLDLYENGGGIEWLKWAIK 625

Query: 648 LQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGS 707
           LQ TQDEL+LDREGG YFNT G+DPSVLLRVKEDHDGAEPSGNSVS INLVRLASIVAG 
Sbjct: 626 LQETQDELYLDREGGAYFNTEGQDPSVLLRVKEDHDGAEPSGNSVSAINLVRLASIVAGE 685

Query: 708 KSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENML 767
           K++ Y   A   LAVFE RL+++A+AVPLMCC+ADM+SVPSRK VVLVG KSS +  NML
Sbjct: 686 KAESYLNTAHRLLAVFELRLRELAVAVPLMCCSADMISVPSRKQVVLVGSKSSPELTNML 745

Query: 768 AAAHASYDLNKTV 780
           +AAH+ YD NKTV
Sbjct: 746 SAAHSVYDPNKTV 758


>gi|17064908|gb|AAL32608.1| predicted protein of unknown function [Arabidopsis thaliana]
 gi|34098807|gb|AAQ56786.1| At4g03200 [Arabidopsis thaliana]
          Length = 756

 Score = 1119 bits (2894), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 528/682 (77%), Positives = 594/682 (87%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
            KHTNRLAAEHSPYLLQHAHNPVDW+ WGEEAF EARKRDVPIFLSIGYSTCHWCHVMEV
Sbjct: 15  QKHTNRLAAEHSPYLLQHAHNPVDWYPWGEEAFEEARKRDVPIFLSIGYSTCHWCHVMEV 74

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE VAKLLN+ FVSIKVDREERPDVDKVYM++VQALYGGGGWPLSVFLSPDLKPLM
Sbjct: 75  ESFEDEEVAKLLNNSFVSIKVDREERPDVDKVYMSFVQALYGGGGWPLSVFLSPDLKPLM 134

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
           GGTYFPP D YGRPGFKT+L+KVKDAW+ KRD L +SG +AIE+LS+ALSAS  ++KL D
Sbjct: 135 GGTYFPPNDNYGRPGFKTLLKKVKDAWNSKRDTLVKSGTYAIEELSKALSASTGADKLSD 194

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
            + + A+  CA+QLS+SYDS FGGFGSAPKFPRPVEIQ+MLYH KKL+++GK+ EA E +
Sbjct: 195 GISREAVSTCAKQLSRSYDSEFGGFGSAPKFPRPVEIQLMLYHYKKLKESGKTSEADEEK 254

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
            MVLF+LQ MA GG+HDH+GGGFHRYSVDE WHVPHFEKMLYDQGQLANVYLD FS+TKD
Sbjct: 255 SMVLFSLQGMANGGMHDHIGGGFHRYSVDECWHVPHFEKMLYDQGQLANVYLDGFSITKD 314

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
           V YSY+ RDILDYLRRDMI P G IFSAEDADS E EGA RKKEGAFY+WTS E++++LG
Sbjct: 315 VMYSYVARDILDYLRRDMIAPEGGIFSAEDADSFEFEGAKRKKEGAFYIWTSDEIDEVLG 374

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
           E+A LFKEHYY+K +GNCDLS  SDPHNEF GKNVLIE N++SA ASK  + +EKY  IL
Sbjct: 375 ENADLFKEHYYVKKSGNCDLSSRSDPHNEFAGKNVLIERNETSAMASKFSLSVEKYQEIL 434

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
           GECRRKLFDVR KRP+PHLDDK+IVSWNGLVISSFARASKILK+E ES  + FPVV S  
Sbjct: 435 GECRRKLFDVRLKRPKPHLDDKIIVSWNGLVISSFARASKILKAEPESTKYYFPVVNSQP 494

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
           ++Y+EVAE AA FIR +LYDEQ+ RLQHS+R GPSKAP FLDDYAFLISGLLDLYE G G
Sbjct: 495 EDYIEVAEKAALFIRGNLYDEQSRRLQHSYRQGPSKAPAFLDDYAFLISGLLDLYENGGG 554

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
            +WL WAI+LQ TQDEL+LDREGG YFNT G+DPSVLLRVKEDHDGAEPSGNSVS INLV
Sbjct: 555 IEWLKWAIKLQETQDELYLDREGGAYFNTEGQDPSVLLRVKEDHDGAEPSGNSVSAINLV 614

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHK 758
           RLASIVAG K++ Y   A   LAVFE RL+++A+AVPLMCC+ADM+SVPSRK VVLVG K
Sbjct: 615 RLASIVAGEKAESYLNTAHRLLAVFELRLRELAVAVPLMCCSADMISVPSRKQVVLVGSK 674

Query: 759 SSVDFENMLAAAHASYDLNKTV 780
           SS +  NML+AAH+ YD NKTV
Sbjct: 675 SSPELTNMLSAAHSVYDPNKTV 696


>gi|319428654|gb|ADV56678.1| hypothetical protein [Phaseolus vulgaris]
          Length = 804

 Score = 1079 bits (2791), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 546/744 (73%), Positives = 599/744 (80%), Gaps = 63/744 (8%)

Query: 93  STSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCH- 151
           ++SHS + HTNRLA++ SPYLLQHAHNPVDW+ WGEEAFAEAR+RDVPIFLSI    C  
Sbjct: 2   ASSHSLHNHTNRLASQQSPYLLQHAHNPVDWYPWGEEAFAEARRRDVPIFLSICVIDCEV 61

Query: 152 -------------WC-HVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQA 197
                        W  H+  VESFED  VAKLLNDWFVSIKVDREERPDVDK       A
Sbjct: 62  GCCGVVDGDSVRSWLQHLSLVESFEDAAVAKLLNDWFVSIKVDREERPDVDK-------A 114

Query: 198 LYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILR-------------KVKDA 244
           LYGGGGWPLSVFLSPDLKPLMGGTYFPP+DKYGRPGFKTILR             KVK A
Sbjct: 115 LYGGGGWPLSVFLSPDLKPLMGGTYFPPDDKYGRPGFKTILRFLFVYSSVPAFSRKVKQA 174

Query: 245 WDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFG 304
           WD KRDML +SGAFAIEQLSEA+S S++S+KLPD +P +ALRLC+EQLS  YDS+FGGFG
Sbjct: 175 WDSKRDMLIKSGAFAIEQLSEAMSISSTSDKLPDGVPADALRLCSEQLSGGYDSKFGGFG 234

Query: 305 SAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRY 364
           SAPKFPRPVEI +MLYHSKKLE+TGK   A+  QKMVLF+LQCMAKGGIHDH+GGGFHRY
Sbjct: 235 SAPKFPRPVEINLMLYHSKKLEETGKLDGANGSQKMVLFSLQCMAKGGIHDHIGGGFHRY 294

Query: 365 SVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIF 424
           SVDE WHVPHFEKMLYDQGQLANVYLDAFS+TKD FYSYI RDILDYLRRDMIGP GEIF
Sbjct: 295 SVDECWHVPHFEKMLYDQGQLANVYLDAFSITKDTFYSYISRDILDYLRRDMIGPEGEIF 354

Query: 425 SAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDP 484
           SAEDADSAETEGA RKKEGAFY+W SKEV+DILGEHA LF+EHYY+K +GNCDLS MSDP
Sbjct: 355 SAEDADSAETEGAARKKEGAFYIWASKEVQDILGEHAALFEEHYYIKQSGNCDLSGMSDP 414

Query: 485 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 544
           HNEFK KNVLIE  + S  ASK GM +E Y  ILGECRRKLF+ RS+RP+PHLDDKVIVS
Sbjct: 415 HNEFKEKNVLIERKELSELASKYGMSVETYQEILGECRRKLFEARSRRPKPHLDDKVIVS 474

Query: 545 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 604
           WNGL +SSFARASKILKSEAE   F FPVVG++ KEYM +AE AA FIR+ LYD +T RL
Sbjct: 475 WNGLAVSSFARASKILKSEAEGTKFYFPVVGTEPKEYMRIAEKAAFFIRKELYDVETRRL 534

Query: 605 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 664
            HSFR  PSKAPGFLDDYAFLISGLLDLYEFG G  WL+WAIELQ TQD LFLD+ GGGY
Sbjct: 535 YHSFRRSPSKAPGFLDDYAFLISGLLDLYEFGGGVSWLLWAIELQETQDSLFLDKAGGGY 594

Query: 665 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL---- 720
           FN TGEDPSVLLRVKEDHDGAEPSGNSVS INL+RLAS+V+GSK++ YR+NAEH L    
Sbjct: 595 FNNTGEDPSVLLRVKEDHDGAEPSGNSVSAINLIRLASMVSGSKAENYRRNAEHLLVCKL 654

Query: 721 ------------------------AVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVG 756
                                   AVFE RLKDMAMAVPLMCCAADML VPSRK VV+VG
Sbjct: 655 LSLFPLKAFSSHICANNGGMGLFEAVFEKRLKDMAMAVPLMCCAADMLRVPSRKQVVVVG 714

Query: 757 HKSSVDFENMLAAAHASYDLNKTV 780
            ++S +FENML AAHA YD N+TV
Sbjct: 715 GRTSEEFENMLTAAHALYDPNRTV 738


>gi|319428671|gb|ADV56694.1| hypothetical protein [Phaseolus vulgaris]
          Length = 804

 Score = 1078 bits (2787), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 545/744 (73%), Positives = 599/744 (80%), Gaps = 63/744 (8%)

Query: 93  STSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCH- 151
           ++SHS + HTNRLA++ SPYLLQHAHNPVDW+ WGEEAFAEAR+RDVPIFLSI    C  
Sbjct: 2   ASSHSLHNHTNRLASQQSPYLLQHAHNPVDWYPWGEEAFAEARRRDVPIFLSICVIDCEV 61

Query: 152 -------------WC-HVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQA 197
                        W  H+  VESFED  VAKLLNDWFVSIKVDREERPDVDK       A
Sbjct: 62  GCCGVVDGDSVRSWLQHLSLVESFEDAAVAKLLNDWFVSIKVDREERPDVDK-------A 114

Query: 198 LYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILR-------------KVKDA 244
           LYGGGGWPLSVFLSPDLKPLMGGTYFPP+DKYGRPGFKTILR             KVK A
Sbjct: 115 LYGGGGWPLSVFLSPDLKPLMGGTYFPPDDKYGRPGFKTILRFLFVYSSVPAFSRKVKQA 174

Query: 245 WDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFG 304
           WD KRDML +SGAFAIEQLSEA+S S++S+KLPD +P +ALRLC+EQLS  YDS+FGGFG
Sbjct: 175 WDSKRDMLIKSGAFAIEQLSEAMSISSTSDKLPDGVPADALRLCSEQLSGGYDSKFGGFG 234

Query: 305 SAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRY 364
           SAPKFPRPVEI +MLYHSKKLE+TGK   A+  QKMVLF+LQCMAKGGIHDH+GGGFHRY
Sbjct: 235 SAPKFPRPVEINLMLYHSKKLEETGKLDGANGSQKMVLFSLQCMAKGGIHDHIGGGFHRY 294

Query: 365 SVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIF 424
           SVDE WHVPHFEKMLYDQGQLANVYLDAFS+TKD FYSYI RDILDYLRRDMIGP GEIF
Sbjct: 295 SVDECWHVPHFEKMLYDQGQLANVYLDAFSITKDTFYSYISRDILDYLRRDMIGPEGEIF 354

Query: 425 SAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDP 484
           SAEDADSAETEGA RKKEGAFY+W SKEV+DILGEHA LF+EHYY+K +GNCDLS MSDP
Sbjct: 355 SAEDADSAETEGAARKKEGAFYIWASKEVQDILGEHAALFEEHYYIKQSGNCDLSGMSDP 414

Query: 485 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 544
           HNEFK KNVLIE  + S  ASK GM +E Y  ILGECRRKLF+ RS+RP+PHLDDKVIVS
Sbjct: 415 HNEFKEKNVLIERKELSELASKYGMSVETYQEILGECRRKLFEARSRRPKPHLDDKVIVS 474

Query: 545 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 604
           WNGL +SSFARASKILKSEAE   F FPVVG++ KEYM +AE AA FIR+ LYD +T RL
Sbjct: 475 WNGLAVSSFARASKILKSEAEGTKFYFPVVGTEPKEYMRIAEKAAFFIRKELYDVETRRL 534

Query: 605 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 664
            HSFR  PSKAPGFLDDYAFLISGLLDLYEFG G  WL+WAIELQ TQD LFLD+ GGGY
Sbjct: 535 YHSFRRSPSKAPGFLDDYAFLISGLLDLYEFGGGISWLLWAIELQETQDSLFLDKAGGGY 594

Query: 665 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL---- 720
           FN TGEDPSVLLRVKEDHDGAEPSGNSVS INL+RLAS+V+GSK++ Y++NAEH L    
Sbjct: 595 FNNTGEDPSVLLRVKEDHDGAEPSGNSVSAINLIRLASMVSGSKAENYKRNAEHLLVCKL 654

Query: 721 ------------------------AVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVG 756
                                   AVFE RLKDMAMAVPLMCCAADML VPSRK VV+VG
Sbjct: 655 LVLFLLKAFSSHICANNGGMGLFEAVFEKRLKDMAMAVPLMCCAADMLRVPSRKQVVVVG 714

Query: 757 HKSSVDFENMLAAAHASYDLNKTV 780
            ++S +FENML AAHA YD N+TV
Sbjct: 715 GRTSEEFENMLTAAHALYDPNRTV 738


>gi|147817761|emb|CAN68939.1| hypothetical protein VITISV_028994 [Vitis vinifera]
          Length = 1575

 Score = 1077 bits (2784), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 526/674 (78%), Positives = 570/674 (84%), Gaps = 26/674 (3%)

Query: 130 AFAEARKRDVPIF-----LSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREER 184
           A AE    D   F     +S G+     CHVMEVESFE+EGVAKLLNDWFVSIKVDREER
Sbjct: 60  AMAETEHEDSIAFSQHFMVSDGWKPLVRCHVMEVESFENEGVAKLLNDWFVSIKVDREER 119

Query: 185 PDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILR----- 239
           PDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPP+DKYGRPGFKT+LR     
Sbjct: 120 PDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPDDKYGRPGFKTVLRMSIFV 179

Query: 240 -------------KVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALR 286
                        KVKDAW+ KRD+L +SGAFAIEQLSEALSA+ASSNKL D +PQ AL 
Sbjct: 180 FVLAILLYLYSFRKVKDAWENKRDVLVKSGAFAIEQLSEALSATASSNKLADGIPQQALH 239

Query: 287 LCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQ 346
           LCAEQL+ +YD  +GGFGSAPKFPRPVEIQ+MLYH KKLE++GKSGEA+E  KMV F+LQ
Sbjct: 240 LCAEQLAGNYDPEYGGFGSAPKFPRPVEIQLMLYHYKKLEESGKSGEANEVLKMVAFSLQ 299

Query: 347 CMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICR 406
           CMA+GG+HDH+GGGFHRYSVDE WHVPHFEKMLYDQGQLAN YLD FS+TKDVFYS + R
Sbjct: 300 CMARGGVHDHIGGGFHRYSVDECWHVPHFEKMLYDQGQLANAYLDVFSITKDVFYSCVSR 359

Query: 407 DILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKE 466
           DILDYLRRDMIGP GEIFSAEDADSAE+E A RKKEGAFY+WTSKEVED++GEHA LFK+
Sbjct: 360 DILDYLRRDMIGPEGEIFSAEDADSAESEDAARKKEGAFYIWTSKEVEDVIGEHASLFKD 419

Query: 467 HYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLF 526
           HYY+KP+GNCDLSRMSDPHNEFKGKNVLIE N +SA ASKLGMP+EKYL+ILG CRRKLF
Sbjct: 420 HYYIKPSGNCDLSRMSDPHNEFKGKNVLIERNCASAMASKLGMPVEKYLDILGTCRRKLF 479

Query: 527 DVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAE 586
           DVR  RPRPHLDDKVIVSWNGL ISSFARASKILKSEAE   F FPVVG D KEYMEVAE
Sbjct: 480 DVRLNRPRPHLDDKVIVSWNGLAISSFARASKILKSEAEGTKFRFPVVGCDPKEYMEVAE 539

Query: 587 SAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAI 646
            AASFIR+ LYDEQT RL+HSFRNGPSKAPGFLDDYAFLISGLLD+YEFG  T WLVWAI
Sbjct: 540 KAASFIRKWLYDEQTRRLRHSFRNGPSKAPGFLDDYAFLISGLLDIYEFGGNTNWLVWAI 599

Query: 647 ELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAG 706
           ELQ+TQ                GEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL S+VAG
Sbjct: 600 ELQDTQAWTLYPVPSP---ILGGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLTSMVAG 656

Query: 707 SKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENM 766
           S  + +R+NAEH LAVFETRLKDMAMAVPLMCC ADM SVPSRK VVLVGHKSSV+FE+M
Sbjct: 657 SWFERHRRNAEHLLAVFETRLKDMAMAVPLMCCGADMFSVPSRKQVVLVGHKSSVEFEDM 716

Query: 767 LAAAHASYDLNKTV 780
           LAAAHA YD N+TV
Sbjct: 717 LAAAHAQYDPNRTV 730


>gi|242059825|ref|XP_002459058.1| hypothetical protein SORBIDRAFT_03g045190 [Sorghum bicolor]
 gi|241931033|gb|EES04178.1| hypothetical protein SORBIDRAFT_03g045190 [Sorghum bicolor]
          Length = 821

 Score = 1028 bits (2659), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 496/681 (72%), Positives = 572/681 (83%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           +  NRLAAEHSPYLLQHAHNPVDW+ WG+EAF +AR +DVPIFLSIGYSTCHWCHVMEVE
Sbjct: 73  RKPNRLAAEHSPYLLQHAHNPVDWYPWGDEAFQKARAKDVPIFLSIGYSTCHWCHVMEVE 132

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFE+E VAKLLNDWFVSIKVDREERPDVDKVYMTYV AL+GGGGWPLSVFLSPDLKPLMG
Sbjct: 133 SFENEEVAKLLNDWFVSIKVDREERPDVDKVYMTYVSALHGGGGWPLSVFLSPDLKPLMG 192

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           GTYFPP+DKYGRPGFKT+LRKVK+AW+ KR+ L +SG   IEQL +ALS  ASS  +P++
Sbjct: 193 GTYFPPDDKYGRPGFKTVLRKVKEAWETKREALERSGNLVIEQLRDALSTKASSQDVPND 252

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
           L   ++  C EQL+  YD +FGGFGSAPKFPRPVE  +MLY  +K  + GK  EA   +K
Sbjct: 253 LAAVSVDQCVEQLASRYDPKFGGFGSAPKFPRPVEDYIMLYKFRKHMEAGKESEALNIKK 312

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           MV  TL CMA+GG+HDHVGGGFHRYSVDE WH+PHFEKMLYDQGQ+ NVYLD F +T D 
Sbjct: 313 MVTHTLDCMARGGVHDHVGGGFHRYSVDECWHIPHFEKMLYDQGQIVNVYLDTFLITGDE 372

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 459
           +YS + RDILDYLRRDMIG  GEIFSAEDADSAE EGA RKKEGAFYVWTSKE+ED LGE
Sbjct: 373 YYSIVARDILDYLRRDMIGKEGEIFSAEDADSAEYEGAPRKKEGAFYVWTSKEIEDTLGE 432

Query: 460 HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
           +A LFK HYY+K +GNCDLS MSDPHNEF  KNVLIE   +S+ ASK G  L++Y  ILG
Sbjct: 433 NAELFKNHYYVKSSGNCDLSPMSDPHNEFSCKNVLIERKPASSMASKCGKSLDEYSQILG 492

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
           +CR+KLF VRSKRPRPHLDDKVIVSWNGL IS+FARAS+ILKS     +FNFPV G +  
Sbjct: 493 DCRQKLFHVRSKRPRPHLDDKVIVSWNGLAISAFARASQILKSGPSGTLFNFPVTGCNPV 552

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
           EY+EVAE+AA+FI+  LYD  + RL HS+RNGPSKAPGFLDDYAFLISGLLDLYEFG  T
Sbjct: 553 EYLEVAENAANFIKEKLYDASSKRLHHSYRNGPSKAPGFLDDYAFLISGLLDLYEFGGKT 612

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
           +WL+WA++LQ TQD+LFLD++GGGYFNT GEDPSVLLRVKED+DGAEPSGNSV+ INL+R
Sbjct: 613 EWLLWAVQLQVTQDDLFLDKQGGGYFNTPGEDPSVLLRVKEDYDGAEPSGNSVAAINLIR 672

Query: 700 LASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKS 759
           L+SI   SKS  Y+ + EH LAVFETRL+ +++A+PLMCCAADMLSVPSRK VVLVG K 
Sbjct: 673 LSSIFDVSKSTGYKSSVEHLLAVFETRLRQLSIALPLMCCAADMLSVPSRKQVVLVGQKG 732

Query: 760 SVDFENMLAAAHASYDLNKTV 780
           S +F++M+AA  + YD N+TV
Sbjct: 733 SEEFQDMVAATFSLYDPNRTV 753


>gi|357131648|ref|XP_003567448.1| PREDICTED: spermatogenesis-associated protein 20-like [Brachypodium
           distachyon]
          Length = 814

 Score = 1015 bits (2624), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 488/685 (71%), Positives = 570/685 (83%)

Query: 96  HSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHV 155
           H      NRLAAEHSPYLLQHAHNPVDW+ WG+EAF +ARK DVPIFLSIGYSTCHWCHV
Sbjct: 61  HGGPGKPNRLAAEHSPYLLQHAHNPVDWYPWGDEAFEKARKMDVPIFLSIGYSTCHWCHV 120

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           MEVESFE+E VAK+LNDWFVSIKVDREERPDVDKVYMTYV ALYGGGGWPLSVFLSP+LK
Sbjct: 121 MEVESFENEEVAKILNDWFVSIKVDREERPDVDKVYMTYVSALYGGGGWPLSVFLSPNLK 180

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           PLMGGTYFPP+DKYGRPGFKT+LR+VK+AW+ KRD L Q+G   IEQL +ALSA A+S  
Sbjct: 181 PLMGGTYFPPDDKYGRPGFKTVLRRVKEAWETKRDALEQAGNVVIEQLRDALSAKATSQD 240

Query: 276 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
           +P+++    +  C E+L+ +YD +FGGFGSAPKFPRPVE  +MLY  +K  +  +  E  
Sbjct: 241 VPNDVAVVYVDTCVEKLASNYDPKFGGFGSAPKFPRPVEDCIMLYKFRKHMEARRESEGQ 300

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
              KMV  TLQCMA+GG+HDHVGGGFHRYSVDE WHVPHFEKMLYDQGQ+ANVYLD F +
Sbjct: 301 NILKMVTHTLQCMARGGVHDHVGGGFHRYSVDECWHVPHFEKMLYDQGQIANVYLDTFLI 360

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           T D  YS + RDILDYLRRDMIG  GEIFSAEDADS+E EGA RKKEG+FYVWTSKE+ED
Sbjct: 361 TGDECYSSVARDILDYLRRDMIGEEGEIFSAEDADSSEYEGAPRKKEGSFYVWTSKEIED 420

Query: 456 ILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
            LGE A LFK HYY+K +GNCDLS MSDPHNEF GKNVLIE    S  ASK G  +++Y 
Sbjct: 421 TLGEDAELFKNHYYVKSSGNCDLSGMSDPHNEFSGKNVLIERKPGSLVASKSGKSVDEYS 480

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
            ILG+CR+KLFDVRSKRPRPHLDDKVIVSWNGL IS+FARAS+ILKS +    F FPV G
Sbjct: 481 QILGDCRQKLFDVRSKRPRPHLDDKVIVSWNGLAISAFARASQILKSGSIGTRFYFPVTG 540

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
               EY++VAE AA+FI++ LYD  + RL HS+RNGP+KAPGFLDDYAFLI+GLLD+YE+
Sbjct: 541 CHPIEYLQVAEKAATFIKQKLYDASSKRLHHSYRNGPAKAPGFLDDYAFLINGLLDIYEY 600

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
           G  T+WL+WA++LQ  QD+LFLDR+GGGYFNT GEDPSVLLRVKED+DGAEPSGNS++ I
Sbjct: 601 GGKTEWLLWAVQLQVIQDQLFLDRQGGGYFNTPGEDPSVLLRVKEDYDGAEPSGNSMAAI 660

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 755
           NL+RL+SI   +KS+ Y++N EH LAVFETRL+++ +A+PLMCCAADMLSVPSRK VVLV
Sbjct: 661 NLIRLSSIFDAAKSEGYKRNVEHLLAVFETRLRELGIALPLMCCAADMLSVPSRKQVVLV 720

Query: 756 GHKSSVDFENMLAAAHASYDLNKTV 780
           G K S +F++M+AA  +SYD N+TV
Sbjct: 721 GDKGSTEFQDMVAATFSSYDPNRTV 745


>gi|186511491|ref|NP_001118924.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332656889|gb|AEE82289.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 685

 Score = 1000 bits (2585), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 475/625 (76%), Positives = 540/625 (86%)

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           MEVESFEDE VAKLLN+ FVSIKVDREERPDVDKVYM++VQALYGGGGWPLSVFLSPDLK
Sbjct: 1   MEVESFEDEEVAKLLNNSFVSIKVDREERPDVDKVYMSFVQALYGGGGWPLSVFLSPDLK 60

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           PLMGGTYFPP D YGRPGFKT+L+KVKDAW+ KRD L +SG +AIE+LS+ALSAS  ++K
Sbjct: 61  PLMGGTYFPPNDNYGRPGFKTLLKKVKDAWNSKRDTLVKSGTYAIEELSKALSASTGADK 120

Query: 276 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
           L D + + A+  CA+QLS+SYDS FGGFGSAPKFPRPVEIQ+MLYH KKL+++GK+ EA 
Sbjct: 121 LSDGISREAVSTCAKQLSRSYDSEFGGFGSAPKFPRPVEIQLMLYHYKKLKESGKTSEAD 180

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
           E + MVLF+LQ MA GG+HDH+GGGFHRYSVDE WHVPHFEKMLYDQGQLANVYLD FS+
Sbjct: 181 EEKSMVLFSLQGMANGGMHDHIGGGFHRYSVDECWHVPHFEKMLYDQGQLANVYLDGFSI 240

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           TKDV YSY+ RDILDYLRRDMI P G IFSAEDADS E EGA RKKEGAFY+WTS E+++
Sbjct: 241 TKDVMYSYVARDILDYLRRDMIAPEGGIFSAEDADSFEFEGAKRKKEGAFYIWTSDEIDE 300

Query: 456 ILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
           +LGE+A LFKEHYY+K +GNCDLS  SDPHNEF GKNVLIE N++SA ASK  + +EKY 
Sbjct: 301 VLGENADLFKEHYYVKKSGNCDLSSRSDPHNEFAGKNVLIERNETSAMASKFSLSVEKYQ 360

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
            ILGECRRKLFDVR KRP+PHLDDK+IVSWNGLVISSFARASKILK+E ES  + FPVV 
Sbjct: 361 EILGECRRKLFDVRLKRPKPHLDDKIIVSWNGLVISSFARASKILKAEPESTKYYFPVVN 420

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
           S  ++Y+EVAE AA FIR +LYDEQ+ RLQHS+R GPSKAP FLDDYAFLISGLLDLYE 
Sbjct: 421 SQPEDYIEVAEKAALFIRGNLYDEQSRRLQHSYRQGPSKAPAFLDDYAFLISGLLDLYEN 480

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
           G G +WL WAI+LQ TQDEL+LDREGG YFNT G+DPSVLLRVKEDHDGAEPSGNSVS I
Sbjct: 481 GGGIEWLKWAIKLQETQDELYLDREGGAYFNTEGQDPSVLLRVKEDHDGAEPSGNSVSAI 540

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 755
           NLVRLASIVAG K++ Y   A   LAVFE RL+++A+AVPLMCC+ADM+SVPSRK VVLV
Sbjct: 541 NLVRLASIVAGEKAESYLNTAHRLLAVFELRLRELAVAVPLMCCSADMISVPSRKQVVLV 600

Query: 756 GHKSSVDFENMLAAAHASYDLNKTV 780
           G KSS +  NML+AAH+ YD NKTV
Sbjct: 601 GSKSSPELTNMLSAAHSVYDPNKTV 625


>gi|222619828|gb|EEE55960.1| hypothetical protein OsJ_04681 [Oryza sativa Japonica Group]
          Length = 791

 Score =  973 bits (2514), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 477/709 (67%), Positives = 567/709 (79%), Gaps = 29/709 (4%)

Query: 96  HSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHV 155
           H   +  NRLAAEHSPYLLQHA+NPVDW+ WGEEAF +AR++DVPIFLS        CHV
Sbjct: 17  HGVGRSPNRLAAEHSPYLLQHAYNPVDWYPWGEEAFEKARRKDVPIFLS-----SMKCHV 71

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           MEVESFE++ +AK+LND FVSIKVDREERPDVDKVYMTYV ALYGGGGWPLSVFLSP+LK
Sbjct: 72  MEVESFENDEIAKILNDGFVSIKVDREERPDVDKVYMTYVSALYGGGGWPLSVFLSPNLK 131

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           PLMGGTYFPP+DKYGR GFKTILRKVK+AW+ KRD L ++G   I+QL +ALSA ASS  
Sbjct: 132 PLMGGTYFPPDDKYGRTGFKTILRKVKEAWETKRDALEKTGNVVIKQLRDALSAKASSQD 191

Query: 276 LPDELPQNALRLCAE------------------------QLSKSYDSRFGGFGSAPKFPR 311
           +P++L   ++  C E                        QL+ SYD +FGG+GSAPKFPR
Sbjct: 192 MPNDLAVVSVDNCVEKTRFKNRDKNNIRSSIADSQLISMQLAGSYDPKFGGYGSAPKFPR 251

Query: 312 PVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWH 371
           PVE  +MLY  +K  ++G+  E+    KM+  TLQCMA+GG+HDHVGGGFHRYSVDE WH
Sbjct: 252 PVENCVMLYKFRKHLESGQVSESQNIMKMITHTLQCMARGGVHDHVGGGFHRYSVDECWH 311

Query: 372 VPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADS 431
           VPHFEKMLYDQGQ+ANVYLD F +T D +YS + RDILDYLRRDMIG  GEI+SAEDADS
Sbjct: 312 VPHFEKMLYDQGQIANVYLDTFLITGDEYYSSVARDILDYLRRDMIGEEGEIYSAEDADS 371

Query: 432 AETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGK 491
           AE +GA RK+EGAFYVWT+KE+ED LGE++ LFK HYY+K +GNCDLSRMSDPH+EFKGK
Sbjct: 372 AEYDGAPRKREGAFYVWTNKEIEDTLGENSELFKNHYYVKSSGNCDLSRMSDPHDEFKGK 431

Query: 492 NVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVIS 551
           NVLIE   +S  ASK G  +++Y  ILG+CR KLFDVRSKRPRPHLDDKVIVSWNGL IS
Sbjct: 432 NVLIERKQASLMASKCGKSVDEYAQILGDCRHKLFDVRSKRPRPHLDDKVIVSWNGLAIS 491

Query: 552 SFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG 611
           +FARAS+ILKSE     F FP+ G + +EY+ VAE AA FI+  LYD  ++RL HS+RNG
Sbjct: 492 AFARASQILKSEPTGTRFCFPITGCNPEEYLGVAEKAARFIKEKLYDSSSNRLNHSYRNG 551

Query: 612 PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGED 671
           P+KAPGFLDDYAFLI+GLLDLYE+G   +WL+WA  LQ  QDELFLD++GGGYFNT GED
Sbjct: 552 PAKAPGFLDDYAFLINGLLDLYEYGGKIEWLMWAAHLQVIQDELFLDKQGGGYFNTPGED 611

Query: 672 PSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMA 731
           PSVLLRVKED+DGAEPSGNSV+ INL+RL+SI   +KSD Y+ N EH LAVF+TRL+++ 
Sbjct: 612 PSVLLRVKEDYDGAEPSGNSVAAINLIRLSSIFDAAKSDGYKCNVEHLLAVFQTRLRELG 671

Query: 732 MAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           +A+PLMCCAADMLSVPSRK VVLVG+K S +F +M+AAA ++YD N+TV
Sbjct: 672 IALPLMCCAADMLSVPSRKQVVLVGNKESTEFRDMVAAAFSTYDPNRTV 720


>gi|218189686|gb|EEC72113.1| hypothetical protein OsI_05096 [Oryza sativa Indica Group]
          Length = 806

 Score =  963 bits (2490), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 477/724 (65%), Positives = 567/724 (78%), Gaps = 44/724 (6%)

Query: 96  HSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHV 155
           H   +  NRLAAEHSPYLLQHA+NPVDW+ WGEEAF +AR++DVPIFLS        CHV
Sbjct: 17  HGVGRSPNRLAAEHSPYLLQHAYNPVDWYPWGEEAFEKARRKDVPIFLS-----SMKCHV 71

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           MEVESFE++ +AK+LND FVSIKVDREERPDVDKVYMTYV ALYGGGGWPLSVFLSP+LK
Sbjct: 72  MEVESFENDEIAKILNDGFVSIKVDREERPDVDKVYMTYVSALYGGGGWPLSVFLSPNLK 131

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           PLMGGTYFPP+DKYGRPGFKTILRKVK+AW+ K D L ++G   I+QL +ALSA ASS  
Sbjct: 132 PLMGGTYFPPDDKYGRPGFKTILRKVKEAWETKCDALEKTGNVVIKQLRDALSAKASSQD 191

Query: 276 LPDELPQNALRLCAE------------------------QLSKSYDSRFGGFGSAPKFPR 311
           +P++L   ++  C E                        QL+ SYD +FGG+GSAPKFPR
Sbjct: 192 IPNDLAVVSVDNCVEKTRFKNRDKNNIRSSIADSQLISMQLAGSYDPKFGGYGSAPKFPR 251

Query: 312 PVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWH 371
           PVE  +MLY  +K  ++G+  E+    KM+  TLQCMA+GG+HDHVGGGFHRYSVDE WH
Sbjct: 252 PVENCVMLYKFRKHLESGQVSESQNIMKMITHTLQCMARGGVHDHVGGGFHRYSVDECWH 311

Query: 372 VPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADS 431
           VPHFEKMLYDQGQ+ANVYLD F +T D +YS + RDILDYLRRDMIG  GEI+SAEDADS
Sbjct: 312 VPHFEKMLYDQGQIANVYLDTFLITGDEYYSSVARDILDYLRRDMIGEEGEIYSAEDADS 371

Query: 432 AETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGK 491
           AE +GA RK+EGAFYVWT+KE+ED LGE++ LFK HYY+K +GNCDLSRMSDPH+EFKGK
Sbjct: 372 AEYDGAPRKREGAFYVWTNKEIEDTLGENSELFKNHYYVKSSGNCDLSRMSDPHDEFKGK 431

Query: 492 NVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVIS 551
           NVLIE   +S  ASK G  +++Y  ILG+CR KLFDVRSKRPRPHLDDKVIVSWNGL IS
Sbjct: 432 NVLIERKQASLMASKCGKSVDEYAQILGDCRHKLFDVRSKRPRPHLDDKVIVSWNGLAIS 491

Query: 552 SFARASKILKSEAESAMFNFPVVGSD---------------RKEYMEVAESAASFIRRHL 596
           +FARAS+ILKSE     F FP+ G +                +EY+ VAE AA FI+  L
Sbjct: 492 AFARASQILKSEPTGTRFCFPITGCNFSLVKQSLGCACPYMPEEYLGVAEKAARFIKEKL 551

Query: 597 YDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELF 656
           YD  ++RL HS+RNGP+KAPGFLDDYAFLI+GLLDLYE+G   +WL+WA  LQ  QDELF
Sbjct: 552 YDSSSNRLNHSYRNGPAKAPGFLDDYAFLINGLLDLYEYGGKIEWLMWAAHLQVIQDELF 611

Query: 657 LDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNA 716
           LD++GGGYFNT GEDPSVLLRVKED+DGAEPSGNSV+ INL+RL+SI   +KSD Y+ N 
Sbjct: 612 LDKQGGGYFNTPGEDPSVLLRVKEDYDGAEPSGNSVAAINLIRLSSIFDAAKSDGYKCNV 671

Query: 717 EHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDL 776
           EH LAVF+TRL+++ +A+PLMCCAADMLSVPSRK VVLVG+K S +F +M+AAA ++YD 
Sbjct: 672 EHLLAVFQTRLRELGIALPLMCCAADMLSVPSRKQVVLVGNKESTEFRDMVAAAFSTYDP 731

Query: 777 NKTV 780
           N+TV
Sbjct: 732 NRTV 735


>gi|168008753|ref|XP_001757071.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162691942|gb|EDQ78302.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 772

 Score =  935 bits (2416), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 435/691 (62%), Positives = 546/691 (79%), Gaps = 6/691 (0%)

Query: 92  ASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCH 151
            STSH   KHTNRLA EHSPYLLQHAHNPVDW+ WGEEAFA+AR+ D PIFLS+GYSTCH
Sbjct: 10  GSTSH---KHTNRLAKEHSPYLLQHAHNPVDWYPWGEEAFAKAREEDKPIFLSVGYSTCH 66

Query: 152 WCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLS 211
           WCHVMEVESFE+E +AKL N+WFV+IKVDREERPDVDKVYMTYVQA  GGGGWP+SVFL+
Sbjct: 67  WCHVMEVESFENEEIAKLQNEWFVNIKVDREERPDVDKVYMTYVQASQGGGGWPMSVFLT 126

Query: 212 PDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASA 271
           P+LKP++GGTYFPP+DKYGRPGFKT+L++V++ W+ K+D+L +SG   ++QL+EA +A A
Sbjct: 127 PELKPIVGGTYFPPDDKYGRPGFKTVLKRVREVWESKKDVLRESGKQVVQQLAEATAAVA 186

Query: 272 SSNKLPDE-LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGK 330
            S +L +  +P  A+ LCA QLSK +DS+ GGFG APKFPRPVE+ +M+ + K+LE  GK
Sbjct: 187 PSTELTESSVPAQAVTLCANQLSKGFDSKLGGFGGAPKFPRPVEVALMMRNYKRLEQQGK 246

Query: 331 SGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYL 390
              A++  +M LF+LQCMA GG+HDHVGGGFHRYSVDE WHVPHFEKMLYD  QL NVYL
Sbjct: 247 EQYATKALEMALFSLQCMANGGMHDHVGGGFHRYSVDEYWHVPHFEKMLYDNAQLVNVYL 306

Query: 391 DAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTS 450
           DAF+++KD+ YSY+ RD+LDYL RDM  P G I+SAEDADSAET  +T+KKEG FY+WT 
Sbjct: 307 DAFAVSKDLTYSYVARDVLDYLIRDMTHPEGGIYSAEDADSAETTSSTKKKEGLFYIWTL 366

Query: 451 KEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 509
           +E+E++LG E A +F  +YY+K  GNCDLSRMSDPH EF GKNVLI+ ++    A+K G 
Sbjct: 367 QEIEEVLGKEQAQMFIAYYYVKAEGNCDLSRMSDPHGEFGGKNVLIKRSNVDI-ATKFGK 425

Query: 510 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 569
             E     LG+CR KL   RS+RP PHLDDKVIV+WNGL IS+FARAS+IL +E     +
Sbjct: 426 MPEDVSQYLGQCRAKLHAYRSQRPHPHLDDKVIVAWNGLAISAFARASRILLNEPSGVRY 485

Query: 570 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 629
            FPV G   KEY+ VAE AA FI+  LY+E+T RL  S+RNGPSKAPGFLDDYAFLI+GL
Sbjct: 486 EFPVTGCHPKEYLVVAERAAHFIKSKLYNEKTKRLTRSYRNGPSKAPGFLDDYAFLIAGL 545

Query: 630 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 689
           LDL+E G   KWL WA+ELQ++QDE FLD+EGG Y+ T   DPS+L R+KED+DGAEPSG
Sbjct: 546 LDLFECGGDYKWLQWALELQSSQDEQFLDKEGGAYYITPEGDPSILFRMKEDYDGAEPSG 605

Query: 690 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR 749
           NSV+ INL+RL+S+V G  ++     AEH LAV+E R+K++AMAVPL+CCA D  SV ++
Sbjct: 606 NSVAAINLLRLSSLVTGDLAESVHTTAEHLLAVYEQRVKEVAMAVPLLCCAFDSFSVAAK 665

Query: 750 KHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           + +++ G ++S D + ++ A HA +D ++ V
Sbjct: 666 RQIIIAGVRNSPDTDALMTACHAPFDPDRNV 696


>gi|302824870|ref|XP_002994074.1| hypothetical protein SELMODRAFT_163314 [Selaginella moellendorffii]
 gi|300138080|gb|EFJ04861.1| hypothetical protein SELMODRAFT_163314 [Selaginella moellendorffii]
          Length = 769

 Score =  900 bits (2327), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 417/684 (60%), Positives = 538/684 (78%), Gaps = 1/684 (0%)

Query: 98  RNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVME 157
           ++KH+NRL  E+SPYLLQHAHNPVDW+ WGEEAFA+A+  D PIFLS+GYSTCHWCHVME
Sbjct: 18  KHKHSNRLLHENSPYLLQHAHNPVDWYPWGEEAFAKAKAEDKPIFLSVGYSTCHWCHVME 77

Query: 158 VESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPL 217
           VESFE E VAKLLNDWFVSIKVDREERPDVDK+YMT+VQA  GGGGWP+SVFL+P+LKP+
Sbjct: 78  VESFESEEVAKLLNDWFVSIKVDREERPDVDKIYMTFVQASQGGGGWPMSVFLTPELKPI 137

Query: 218 MGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLP 277
           +GGTYFPPED YGRPGFKT+LR+VK+ WD ++ +L  +G   I+QL+EA++A A+S ++ 
Sbjct: 138 VGGTYFPPEDNYGRPGFKTVLRRVKENWDSRKAVLRNAGDNVIQQLAEAMAACATSLQVS 197

Query: 278 DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEG 337
             + + A++LCA QL K +D++ GGFGSAPKFPRPVE+ +ML + K+L+  GK+  + + 
Sbjct: 198 GGVAEQAVQLCASQLMKGFDAKLGGFGSAPKFPRPVELNLMLRYYKRLDQAGKASLSKKA 257

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
            +M  F LQCMA+GG+HDHVGGGFHRYSVD+ WHVPHFEKMLYDQ QLAN YLD + +T+
Sbjct: 258 LEMASFNLQCMARGGMHDHVGGGFHRYSVDDYWHVPHFEKMLYDQAQLANAYLDVYLVTR 317

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           D  ++ + RDILDYL RDM  P G IFSAEDADS E  G+++KKEGAFYVWT+KE+ED+L
Sbjct: 318 DTMHACVARDILDYLNRDMTHPEGGIFSAEDADSLEPSGSSKKKEGAFYVWTAKEIEDVL 377

Query: 458 G-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
           G + A +F  HYY++  GNC+LSRMSDPHNEF GKNVLIE    + + +K G  +E+  +
Sbjct: 378 GKDRAQIFAAHYYVREQGNCNLSRMSDPHNEFLGKNVLIERQSLADTVAKFGKTVEETAD 437

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
           +LG+CR  L   RSKRPRPHLDDKVIV+WNGL IS+++RAS+ L++E E     FP +G 
Sbjct: 438 LLGQCRELLHAHRSKRPRPHLDDKVIVAWNGLAISAYSRASRFLRAEPEGLKHYFPDMGC 497

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
           D K+Y+ VAE  A F++  +Y+    RLQ S+R  PS+APGFLDDYAFLI+GLLDLYE  
Sbjct: 498 DPKDYLIVAERIAKFVKDKIYNASAKRLQRSYRKSPSQAPGFLDDYAFLIAGLLDLYEAS 557

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
             TKWL W  ELQ  QD LFLD+EGGGYF+T   D S+L R+KED+DGAEPSGNSV+ IN
Sbjct: 558 GDTKWLAWVFELQEVQDHLFLDKEGGGYFSTAEGDSSILFRMKEDYDGAEPSGNSVAAIN 617

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVG 756
           L+RLASI  G +   + + A+H LAVFE ++K++AMAVPLMCCA D+L+VPS++ +++ G
Sbjct: 618 LLRLASICHGEEGKLFLERAQHLLAVFEGKVKELAMAVPLMCCAYDVLAVPSKRQILVAG 677

Query: 757 HKSSVDFENMLAAAHASYDLNKTV 780
            K+S +F+ ++  +H  +D + T+
Sbjct: 678 AKTSGEFDALVTTSHLFFDPDSTI 701


>gi|4262148|gb|AAD14448.1| predicted protein of unknown function [Arabidopsis thaliana]
 gi|7270190|emb|CAB77805.1| predicted protein of unknown function [Arabidopsis thaliana]
          Length = 794

 Score =  859 bits (2220), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 449/742 (60%), Positives = 514/742 (69%), Gaps = 129/742 (17%)

Query: 55  LSSLPRN------YLYPFRRPLAVISHRPIHPYKVVAMAERTPASTSHSR-NKHTNRLAA 107
           LS+LPR       +   F  P   I  RPI   KV+AMAE + +S++ S   KHTNRLAA
Sbjct: 106 LSTLPRRRNIVRIHNPSFSSPFPPILSRPISSGKVLAMAEESSSSSTSSTSQKHTNRLAA 165

Query: 108 EHSPYLLQHAHNP---------VDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           EHSPYLLQHAHNP         VDW+ WGEEAF EARKRDV                   
Sbjct: 166 EHSPYLLQHAHNPIDFMVYVKKVDWYPWGEEAFEEARKRDV------------------- 206

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
                                DREERPDVDK       ALYGGGGWPLSVFLSPDLKPLM
Sbjct: 207 ---------------------DREERPDVDK-------ALYGGGGWPLSVFLSPDLKPLM 238

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
           GGTYFPP D YGRPGFKT+L+KVKDAW+ KRD L +SG +AIE+LS+ALSAS  ++KL D
Sbjct: 239 GGTYFPPNDNYGRPGFKTLLKKVKDAWNSKRDTLVKSGTYAIEELSKALSASTGADKLSD 298

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
            + + AL+                                        ++GK+ EA E +
Sbjct: 299 GISREALK----------------------------------------ESGKTSEADEEK 318

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
            MVLF+LQ MA GG+HDH+GGGFHRYSVDE WHVPHFEKMLYDQGQLANVYLD FS+TKD
Sbjct: 319 SMVLFSLQGMANGGMHDHIGGGFHRYSVDECWHVPHFEKMLYDQGQLANVYLDGFSITKD 378

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
           V YSY+ RDILDYLRRDMI P G IFSAEDADS E EGA RKKEGAFY+WTS E++++LG
Sbjct: 379 VMYSYVARDILDYLRRDMIAPEGGIFSAEDADSFEFEGAKRKKEGAFYIWTSDEIDEVLG 438

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
           E+A LFKEHYY+K +GNCDLS  SDPHNEF GKNVLIE N++SA ASK  + +EKY  IL
Sbjct: 439 ENADLFKEHYYVKKSGNCDLSSRSDPHNEFAGKNVLIERNETSAMASKFSLSVEKYQEIL 498

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
           GECRRKLFDVR KRP+PHLDDK+IVSWNGLVISSFARASKILK+E ES  + FPVV S  
Sbjct: 499 GECRRKLFDVRLKRPKPHLDDKIIVSWNGLVISSFARASKILKAEPESTKYYFPVVNSQP 558

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
           ++Y+EVAE AA FIR +LYDEQ+ RLQHS+R GPSKAP FLDDYAFLISGLLDLYE G G
Sbjct: 559 EDYIEVAEKAALFIRGNLYDEQSRRLQHSYRQGPSKAPAFLDDYAFLISGLLDLYENGGG 618

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
            +WL WAI+LQ TQ                           +DHDGAEPSGNSVS INLV
Sbjct: 619 IEWLKWAIKLQETQ--------------------------AKDHDGAEPSGNSVSAINLV 652

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHK 758
           RLASIVAG K++ Y   A   LAVFE RL+++A+AVPLMCC+ADM+SVPSRK VVLVG K
Sbjct: 653 RLASIVAGEKAESYLNTAHRLLAVFELRLRELAVAVPLMCCSADMISVPSRKQVVLVGSK 712

Query: 759 SSVDFENMLAAAHASYDLNKTV 780
           SS +  NML+AAH+ YD NKTV
Sbjct: 713 SSPELTNMLSAAHSVYDPNKTV 734


>gi|384252567|gb|EIE26043.1| hypothetical protein COCSUDRAFT_52662 [Coccomyxa subellipsoidea
           C-169]
          Length = 796

 Score =  722 bits (1863), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 361/702 (51%), Positives = 474/702 (67%), Gaps = 13/702 (1%)

Query: 90  TPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYST 149
           T    + +  K TNRLA+E SPYLLQHAHNPVDW+ WGEEAF +AR  + PIFLS+GY+T
Sbjct: 13  TSQQPTKTNPKFTNRLASEESPYLLQHAHNPVDWYPWGEEAFEKARTENKPIFLSVGYAT 72

Query: 150 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 209
           CHWCHVME ESFE E +AKL+ND FV+IKVD+EER DVD+VYMTYVQA  GGGGWP+SVF
Sbjct: 73  CHWCHVMERESFESEAIAKLMNDSFVNIKVDKEERSDVDRVYMTYVQATSGGGGWPMSVF 132

Query: 210 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 269
           L+PDL+P +GGTY+PP+D YGRPGF T+L+++ D W  +++ + +  A  + QL+EA+  
Sbjct: 133 LTPDLQPFLGGTYYPPQDAYGRPGFSTVLKRIADVWRSRKNEVIEQSADTMRQLNEAIQP 192

Query: 270 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLY-HSKKLED- 327
                +LP+      +  C   L+  +D   GGFG+APKFPRP EI ++L  H +  +D 
Sbjct: 193 QGGKAELPEGAAGRFIESCYSMLASRFDPTLGGFGAAPKFPRPAEINLLLVEHLRASQDR 252

Query: 328 ------TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 381
                    SG   +   M   TLQ MA GG++DHVGGGFHRYSVDE WHVPHFEKMLYD
Sbjct: 253 EASSATASSSGRRRDALGMAETTLQRMAAGGMYDHVGGGFHRYSVDEHWHVPHFEKMLYD 312

Query: 382 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 441
            GQLA  YLDA+  T DV Y+ + R ILDYL RDM  P G  +SAEDADS +  G  +K 
Sbjct: 313 NGQLAQTYLDAYRATGDVRYARVARGILDYLHRDMTHPEGGFYSAEDADSLDASG--KKS 370

Query: 442 EGAFYVWTSKEVEDILG---EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 498
           EGAFYVW++ E++++LG   E   +FK+HYY+K +GN DLS  SD H EF G N LIE  
Sbjct: 371 EGAFYVWSADEIDEVLGTDSERGRVFKQHYYVKASGNTDLSPRSDQHGEFTGLNCLIERE 430

Query: 499 DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 558
              A+A+K G+ +E+    L + R+ L + RS+RPRPHLDDKV+ +WNGL I +FA AS+
Sbjct: 431 SVKATATKFGLSVEETEGTLAKARQLLHERRSQRPRPHLDDKVVTAWNGLAIGAFANASR 490

Query: 559 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGF 618
           +L +E +     FPV G   K+Y+  A  AA F+R  ++D    RL+ SF  GPS   GF
Sbjct: 491 VLANEPQPPTPLFPVEGRPAKDYLTDAIRAAEFVRDKVWDADARRLRRSFCRGPSDVGGF 550

Query: 619 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 678
            DDYAFL+SGLLDL+      +WL +A++LQ  QDELF D   GGYF+TTGEDPS+LLR+
Sbjct: 551 ADDYAFLVSGLLDLHAASGDAQWLQFALQLQAAQDELFWDDAAGGYFSTTGEDPSILLRM 610

Query: 679 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 738
           KED+DGAEP+ +S++  NL+RLA++     S+  R  A  + A F  RL +M++A+P MC
Sbjct: 611 KEDYDGAEPAPSSIAAANLLRLAALTDPDASEPLRARASAAAAAFRERLAEMSLAMPQMC 670

Query: 739 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           CA  +L     + V++ G   + D E +L AA A +  +K V
Sbjct: 671 CALHLLDSGHLRQVIIAGRLGAADTEALLDAAQAIFAPDKAV 712


>gi|260801315|ref|XP_002595541.1| hypothetical protein BRAFLDRAFT_56926 [Branchiostoma floridae]
 gi|229280788|gb|EEN51553.1| hypothetical protein BRAFLDRAFT_56926 [Branchiostoma floridae]
          Length = 741

 Score =  667 bits (1720), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 346/710 (48%), Positives = 448/710 (63%), Gaps = 49/710 (6%)

Query: 92  ASTSHSR--NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYST 149
           AS+S SR   KH NRLA E SPYLLQH HNPVDW+ WGE+AF +A+K + PIFLS+GYST
Sbjct: 6   ASSSGSRKGGKHKNRLAEEKSPYLLQHCHNPVDWYPWGEDAFKKAKKENKPIFLSVGYST 65

Query: 150 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 209
           CHWCHVME ESFE E V K++N+ FV++KVDREERPDVDKVYM+++QA  GGGGWP+SV+
Sbjct: 66  CHWCHVMERESFESEEVGKIMNEHFVNVKVDREERPDVDKVYMSFIQATSGGGGWPMSVW 125

Query: 210 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE-ALS 268
           L+PDLKP+ GGTYFPP+D  GRPGF TIL ++ + W   +D L Q G   I+ L E ++S
Sbjct: 126 LTPDLKPIAGGTYFPPKDHMGRPGFSTILTRISEQWKNNKDKLIQQGNMVIDALKELSVS 185

Query: 269 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 328
           A  S+  LP    Q +++ C +QL  SYD  FGGFG APKFP+PV    +      ++ T
Sbjct: 186 AVDSTATLPG---QESVKKCLDQLDNSYDEEFGGFGHAPKFPQPVNFNFLFRVWSSMKGT 242

Query: 329 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 388
               EA     M L TL+ MAKGG++DH+G GFHRYS D  WHVPHFEKMLYDQGQLA  
Sbjct: 243 ---PEAQRALDMALETLRFMAKGGMYDHIGQGFHRYSTDRTWHVPHFEKMLYDQGQLAVA 299

Query: 389 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 448
           Y DA+ +TKD  ++ I RDIL Y+ RD+    G  +SAEDADS    G   KKEGAF VW
Sbjct: 300 YCDAYQITKDPIFADIARDILLYVSRDLSDRQGGFYSAEDADSLPNPGHKTKKEGAFCVW 359

Query: 449 TSKEVEDILGEH---------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 499
            + E+ ++LGE          A LF +HY +  +GN    +  DPH E  GKNVLI    
Sbjct: 360 EADEIRNLLGEKLPHYDDMTFADLFAKHYNINRSGNVAFDQ--DPHGELAGKNVLIVRGS 417

Query: 500 SSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 559
              +A   G+   +   +LG+CR  LF VR KRP PH DDK+I +WNGL+IS FARA+++
Sbjct: 418 VENTAKAFGLEAAQVEEVLGKCRDILFKVRRKRPPPHRDDKMITAWNGLMISGFARAAQV 477

Query: 560 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP------- 612
           L  EA               +Y++ A  AA F+R+ +YD+ T +L  S  + P       
Sbjct: 478 L-GEA---------------QYLDRAVKAAKFVRKKMYDDSTGKLLRSCYHDPEMDRVTQ 521

Query: 613 --SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGE 670
             +   GF DDYAFLI GLLDLYE     +W+ WA +LQ  QDELF D EG  YF  +G 
Sbjct: 522 IANPIDGFADDYAFLIRGLLDLYEASYNEEWVEWAAQLQRKQDELFWDSEGLAYFTVSGA 581

Query: 671 DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDM 730
           DPSVL+R+KED DGAEPS NSVS  NL+RLAS       + +R  +   +  F  RL  +
Sbjct: 582 DPSVLIRMKEDQDGAEPSANSVSAGNLLRLASF---HDDEGWRNKSVQLMTAFGARLAAI 638

Query: 731 AMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
            +A+P M  A  +    + K +++ G+    D + +L   H+S++ NK +
Sbjct: 639 PLALPEMVSAL-IFYQQTPKQIIIAGNPRDRDTKALLQCVHSSFNPNKIL 687


>gi|326515716|dbj|BAK07104.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 532

 Score =  659 bits (1701), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 324/463 (69%), Positives = 381/463 (82%)

Query: 318 MLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEK 377
           MLY  +K  + G+  EA    KMV  TLQCMA+GG+HDHVGGGFHRYSVDE WHVPHFEK
Sbjct: 1   MLYKFRKHMEAGQKSEAENIMKMVTHTLQCMARGGVHDHVGGGFHRYSVDECWHVPHFEK 60

Query: 378 MLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGA 437
           MLYDQGQ+AN YLD + +T D +YS + RDILDYLRRDMIG  GEIFSAEDADSAE EG 
Sbjct: 61  MLYDQGQIANAYLDTYVITGDEYYSSVARDILDYLRRDMIGEDGEIFSAEDADSAEYEGD 120

Query: 438 TRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 497
            RKKEG+FYVWTS+E+ED LGE+A LFK HYY+K +GNCDLS MSDPHNEF GKNVLIE 
Sbjct: 121 ARKKEGSFYVWTSQEIEDTLGENAELFKNHYYVKSSGNCDLSGMSDPHNEFSGKNVLIER 180

Query: 498 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 557
              S  ASK G  +++Y  ILGECR+KLFDVRSKRPRPHLDDKVIVSWNGL IS+FARAS
Sbjct: 181 KPGSLMASKYGKSVDEYYGILGECRQKLFDVRSKRPRPHLDDKVIVSWNGLAISAFARAS 240

Query: 558 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 617
           +ILKS      F FPV G D  EY++VAE AA+FI+  LYD  + RL HS+RNGP+KAPG
Sbjct: 241 QILKSGPPGTKFYFPVTGCDPVEYLQVAEKAANFIKEKLYDAGSKRLHHSYRNGPAKAPG 300

Query: 618 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 677
           FLDDYAFLI+GLLDL+E+G   +WL+WAIELQ  QDELFLD++GGGYFNT GEDPSVLLR
Sbjct: 301 FLDDYAFLINGLLDLFEYGGKMEWLLWAIELQVIQDELFLDKQGGGYFNTPGEDPSVLLR 360

Query: 678 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 737
           VKED+DGAEPSGNS++ IN+VRL+SI+  +KS+ Y++N EH LAVFETRLK++ +A+PLM
Sbjct: 361 VKEDYDGAEPSGNSMAAINMVRLSSILDAAKSEGYKRNVEHLLAVFETRLKELGIALPLM 420

Query: 738 CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           CCAADML+VPSRK VVLVG K+S +F++M+ AA  SYD N+TV
Sbjct: 421 CCAADMLTVPSRKQVVLVGDKASPEFQDMVVAAFLSYDPNRTV 463


>gi|302838582|ref|XP_002950849.1| hypothetical protein VOLCADRAFT_81232 [Volvox carteri f.
           nagariensis]
 gi|300263966|gb|EFJ48164.1| hypothetical protein VOLCADRAFT_81232 [Volvox carteri f.
           nagariensis]
          Length = 890

 Score =  648 bits (1671), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 338/731 (46%), Positives = 446/731 (61%), Gaps = 50/731 (6%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           +++TNRLA+E SPYLLQHAHNPVDW+ WGEEAFA AR  D PIFLS+GY+TCHWCHVME 
Sbjct: 26  HQYTNRLASEQSPYLLQHAHNPVDWYPWGEEAFARARAEDKPIFLSVGYATCHWCHVMER 85

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFE E VA+LLN  F+SIKVDREERPDVD+VYMTYVQA+ G GGWP+SV+L+P L+P  
Sbjct: 86  ESFESEEVAELLNRDFISIKVDREERPDVDRVYMTYVQAVSGSGGWPMSVWLTPSLEPFY 145

Query: 219 GGTYFPPEDKY-----GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASS 273
           GGTY+PP+D++       PGF T+L ++   W   R  L      A        +A+ + 
Sbjct: 146 GGTYYPPKDRFVGGQLALPGFSTVLLRIGSLWRTNRQDLKSKVEAAAAPAGPTEAAANAG 205

Query: 274 NKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGE 333
             LP  L   A+  C   L++ YD+ +GGFG APKFPRP EI ++L  + +  + G    
Sbjct: 206 AALPPSLAAAAVDACGHDLARRYDAEYGGFGGAPKFPRPSEINLLLRAAVRQMEQGDQLA 265

Query: 334 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
           A   + M L +L  MA GG++D +GGGFHRYSVDE WHVPHFEKMLYD  QLA  YL AF
Sbjct: 266 AQRRRSMALHSLTAMASGGMYDQLGGGFHRYSVDELWHVPHFEKMLYDNPQLALSYLAAF 325

Query: 394 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAE------------------TE 435
            LT D  Y+ + R +LDYL RDM  PGG ++SAEDADS +                   E
Sbjct: 326 QLTADKQYALVARGVLDYLLRDMTSPGGGLYSAEDADSEDPHSYMTSTTTAAAAAPAAME 385

Query: 436 GATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL 494
             + +KEGAFY+W   EV  +LG E    F   Y +   GNC+ S  SDPH EF+GKNV 
Sbjct: 386 AGSERKEGAFYIWDHSEVVSVLGPELGPFFCLVYGIDEEGNCNRSSRSDPHGEFEGKNVP 445

Query: 495 IELNDSSASASKLGMPL----EKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVI 550
                 + +A++LG+P      +    L   R  L   R+ RPRP LDDK++ +WNG+ I
Sbjct: 446 YIATQPAVAAARLGLPYGDDAAEAARRLSAAREALHAARASRPRPSLDDKIVTAWNGMGI 505

Query: 551 SSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQ----THRLQH 606
            +FA AS++L SE +     FP  G     Y++ A   A+F+R HL+D        RL+ 
Sbjct: 506 GAFAVASRVLASEQQVERL-FPSEGRAPAAYLDAAVRVAAFVREHLWDPAAGGGVGRLRR 564

Query: 607 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 666
           S+  GPS   GF DDY+ L+SGLLDLYE G G +WL WA++LQ  QD+LF D + GGYF+
Sbjct: 565 SYCKGPSAVAGFADDYSALVSGLLDLYECGGGREWLEWALQLQAVQDQLFWDPQSGGYFS 624

Query: 667 T-----TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIV---------AGSKSDY- 711
           T        DPS+ +R+K+D+DGAEP+ +SV+  NL+RLA ++         A + + + 
Sbjct: 625 TPDPASADADPSIRIRIKDDYDGAEPTASSVAASNLLRLADMIQERPLYDTTASTTTGHA 684

Query: 712 --YRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAA 769
             Y + A  +LA F  R+    +AVP MCCAA   S    + V++ G   + D   +L A
Sbjct: 685 MPYDEAARRTLAAFSARITQAPLAVPQMCCAAHTFSKRPLRQVIVAGTAGATDTGALLDA 744

Query: 770 AHASYDLNKTV 780
            H+ Y  +K V
Sbjct: 745 VHSPYCPDKVV 755


>gi|348502030|ref|XP_003438572.1| PREDICTED: spermatogenesis-associated protein 20 [Oreochromis
           niloticus]
          Length = 748

 Score =  631 bits (1627), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 327/710 (46%), Positives = 441/710 (62%), Gaps = 49/710 (6%)

Query: 84  VAMAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFL 143
           +A     P+STSH   +HTNRLA E SPYLLQHAHNPVDW+ WG++AF +A+  D PIFL
Sbjct: 1   MASGSEGPSSTSH---RHTNRLAKERSPYLLQHAHNPVDWYPWGKDAFDKAKTEDKPIFL 57

Query: 144 SIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGG 203
           S+GYSTCHWCHVME ESFEDE + K+L++ FV IK+DREERPDVDKVYMT+VQA  GGGG
Sbjct: 58  SVGYSTCHWCHVMERESFEDEEIGKILSENFVCIKLDREERPDVDKVYMTFVQATSGGGG 117

Query: 204 WPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL 263
           WP+SV+L+P+L+P +GGTYFPP D+ GRPGFKT+L ++ D W   R  L  SG   IE L
Sbjct: 118 WPMSVWLTPELRPFIGGTYFPPRDRGGRPGFKTVLTRIIDQWQNNRPALESSGERIIEAL 177

Query: 264 SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK 323
            +  + +A++ + P   P  A R C +QL+ S++  +GGF  APKFP PV +  ++ +  
Sbjct: 178 KKGTTITANAGQSPPLAPDVANR-CFQQLAHSFEEEYGGFRDAPKFPSPVNLMFLISYWT 236

Query: 324 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 383
               T    E  E  +M L TL+ MA GGIHDH+  GFHRYS D  WHVPHFEKMLYDQ 
Sbjct: 237 VNRST---SEGVEALQMALHTLRMMALGGIHDHIAQGFHRYSTDSSWHVPHFEKMLYDQA 293

Query: 384 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 443
           QLA  Y+ A  ++ + F++ + +D+L Y+ RD+    G  +SAEDADS    G   K+EG
Sbjct: 294 QLAVAYITASQVSGEQFFAEVAKDVLLYVSRDLSDKSGGFYSAEDADSVPALGGPEKREG 353

Query: 444 AFYVWTSKEVEDIL----------GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 493
           AF VWT+ EV ++L             A +F  HY +K  GN  ++   DPH E +G+NV
Sbjct: 354 AFCVWTASEVRELLPDVVEGAAGNATLADIFMHHYGVKEQGN--VAPEQDPHGELQGQNV 411

Query: 494 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSF 553
           LI       +A++ G+ +EK   +L   R K+ +VR  RPRPHLD K++ SWNGL++S++
Sbjct: 412 LIVRYSVELTAARFGITVEKVNELLASARAKMAEVRKSRPRPHLDTKMLASWNGLMLSAY 471

Query: 554 ARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG-- 611
           AR   +L                  K+ +E A  A  F++ HL+D +   +  S   G  
Sbjct: 472 ARVGAVLGD----------------KDLVERAVKAGGFLKEHLWDAKRQTILRSCYRGDQ 515

Query: 612 -------PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 664
                  PS + GFLDDYAF+I GLLDLYE    T+WL WA ELQ  QD LF D +GGGY
Sbjct: 516 MEVQQISPSIS-GFLDDYAFIICGLLDLYEATLQTEWLQWAEELQLRQDVLFWDDQGGGY 574

Query: 665 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 724
           F +   D +VLL++KED DGAEPS NSVS  NL+RL+      +   + Q ++  L  F 
Sbjct: 575 FCSDPTDSTVLLQLKEDQDGAEPSANSVSAFNLLRLSHYTGRQE---WLQKSQQLLTAFS 631

Query: 725 TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASY 774
            RL  + +A+P M  A  M    + K +V+ G + + D  ++LAA ++ +
Sbjct: 632 DRLTTVPIALPEMVRAL-MAQHYTLKQIVICGQRDAPDTTSLLAAVNSLF 680


>gi|270011341|gb|EFA07789.1| hypothetical protein TcasGA2_TC005347 [Tribolium castaneum]
          Length = 804

 Score =  627 bits (1618), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 322/659 (48%), Positives = 418/659 (63%), Gaps = 39/659 (5%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           S +  TNRLA E SPYLLQHA NPVDW+ WG+EAF  A+K +  IFLS+GYSTCHWCHVM
Sbjct: 70  STSTKTNRLALEKSPYLLQHATNPVDWYPWGQEAFDRAKKENKLIFLSVGYSTCHWCHVM 129

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
           E ESFEDE VAK++N  F+++KVDREERPDVDK+YM ++QA  GGGGWP+SVFL+P L+P
Sbjct: 130 EKESFEDEEVAKIMNQHFINVKVDREERPDVDKLYMAFIQASVGGGGWPMSVFLTPTLEP 189

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKL 276
           L GGTYFPPEDKYGRPGFKT+L+ + + W  K+  +A SG +++E L +      S+ + 
Sbjct: 190 LAGGTYFPPEDKYGRPGFKTVLKSIAEQWRTKQSAIANSGKYSLEVLRKVSEREISAKQD 249

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
            +   ++  + C  QLS SY+  FGGF + PKFP+P  +  + +   +      S +   
Sbjct: 250 INVPGEDVWKKCLLQLSHSYEDDFGGFSAQPKFPQPCNLNFLFHMYSR---DKHSEQGFR 306

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
              M L TL+ MA GGIHDHV  GF RYSVD+RWHVPHFEKMLYDQ QLA  Y DAF +T
Sbjct: 307 CLHMCLNTLRKMAYGGIHDHVNCGFARYSVDDRWHVPHFEKMLYDQAQLAVSYADAFVVT 366

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
           KD F++ + RDIL Y+ RD+  P G  + AEDADS   EGA+ K+EGAF VW  +E+  +
Sbjct: 367 KDDFFAEVLRDILLYVSRDLSHPLGGFYGAEDADSYPYEGASHKREGAFCVWEFEEISKL 426

Query: 457 LGE-------HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 509
           LGE       H  LF  HY +K  GN + ++  DPH+E + KN+L+       ++ K   
Sbjct: 427 LGETKTDDISHRDLFIYHYNVKEDGNVNPAQ--DPHHELEKKNILVCFGSFEDTSRKFKT 484

Query: 510 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 569
            +E    IL  C   L+  R KRP+PH+D K++ SWNGL+IS FA+A  +LK +      
Sbjct: 485 SVETVKEILKSCHEILYKERQKRPKPHVDTKIVTSWNGLMISGFAKAGFVLKDQ------ 538

Query: 570 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG--------PSKAPGFLDD 621
                     EY+  A  AA+FI++ LY+EQ   L      G        P+   GFLDD
Sbjct: 539 ----------EYINRAILAATFIKKFLYNEQDKTLLRCCYKGDNAKIVQTPTPVNGFLDD 588

Query: 622 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 681
           YAFLI GLLDLYE      WL WA  LQ  QD LF D +G GYF +   D S+L+R KED
Sbjct: 589 YAFLIRGLLDLYEASLDADWLSWAEVLQEQQDRLFWDTKGSGYFTSPANDSSILIRGKED 648

Query: 682 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 740
            DGAEP GNS++V NL+RLA+ +   ++D  R  A  +L VF  RLK + +A+P M  A
Sbjct: 649 QDGAEPCGNSIAVHNLIRLAAYL--DRAD-LRAKAGRTLTVFADRLKSIPVALPEMTSA 704


>gi|317419139|emb|CBN81176.1| Spermatogenesis-associated protein 20 [Dicentrarchus labrax]
          Length = 748

 Score =  623 bits (1606), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 324/698 (46%), Positives = 432/698 (61%), Gaps = 44/698 (6%)

Query: 95  SHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCH 154
           S S  +HTNRLA E SPYLLQHAHNPVDW+ WG+EAF +A+  D PIFLS+GYSTCHWCH
Sbjct: 9   SSSPQRHTNRLAKERSPYLLQHAHNPVDWYPWGQEAFDKAKNEDKPIFLSVGYSTCHWCH 68

Query: 155 VMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDL 214
           VME ESFEDE + K+L+D FV IK+DREERPDVDKVYMT+VQA  GGGGWP+SV+L+P+L
Sbjct: 69  VMERESFEDEEIGKILSDNFVCIKLDREERPDVDKVYMTFVQATSGGGGWPMSVWLTPEL 128

Query: 215 KPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSN 274
           +P +GGTYFPP D   RPG KT+L ++ + W   R  L  SG   +E L +  + +A+  
Sbjct: 129 RPFIGGTYFPPRDHARRPGLKTVLTRIMEQWQNNRPALESSGERILEALKKGTAVAANPG 188

Query: 275 KLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEA 334
           + P   P  A R C +QL+ SY+  +GGF  APKFP PV +  ++ +      T    E 
Sbjct: 189 ESPPLAPDVANR-CFQQLAHSYEEEYGGFRDAPKFPTPVNLMFLMSYWSVNRST---SEG 244

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
            E  +M L TL+ MA GGIHDHV  GFHRYS D  WHVPHFEKMLYDQ QLA  Y+ A  
Sbjct: 245 VEALQMALHTLRMMALGGIHDHVAQGFHRYSTDSSWHVPHFEKMLYDQAQLAVAYITASQ 304

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
           ++ +  ++ + +DIL Y+ RD+    G  +SAEDADS    G   K+EGAF VWT+ EV 
Sbjct: 305 VSGEQLFADVAKDILLYVTRDLSDKSGGFYSAEDADSVPASGGPEKREGAFCVWTATEVR 364

Query: 455 DIL----------GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 504
           ++L             A +F  HY +K  GN  ++   DPH E +G+NVLI       +A
Sbjct: 365 ELLPDVVEGATGSATQADIFMHHYGVKVQGN--VAPEQDPHGELQGQNVLIVRYSVELTA 422

Query: 505 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 564
           +  G+ +EK   +L   R K+ +VR  RP PHLD K++ SWNGL++S++AR   +L  +A
Sbjct: 423 AHFGISVEKVNELLASARGKMAEVRKSRPCPHLDTKMLGSWNGLMLSAYARVGAVLGDKA 482

Query: 565 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD-EQTHRLQHSFRNGPSKA-------P 616
                            +E A  A +F++ HL+D EQ   L+  +R    +         
Sbjct: 483 ----------------LLERAAQAGNFLKEHLWDAEQQTILRSCYRGDEMEVQQISPPIS 526

Query: 617 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 676
           GFLDDYAF+I GLLDLYE    T+WL WA ELQ  QDELFLD +GGGYF++   D +VLL
Sbjct: 527 GFLDDYAFIICGLLDLYEATLQTEWLQWAEELQLRQDELFLDDQGGGYFSSDPSDNTVLL 586

Query: 677 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 736
           ++KED DGAEPSGNSVS  NL+RL+      +   + Q ++  LA F  RL  + +A+P 
Sbjct: 587 QLKEDQDGAEPSGNSVSASNLLRLSHYTGRQE---WLQRSQQLLAAFTDRLTRVPIALPE 643

Query: 737 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASY 774
           M     M    + K +V+ G + + D  ++LA  ++ +
Sbjct: 644 MVRTL-MAQHYTLKQIVICGQRDAPDTASLLATINSLF 680


>gi|326672402|ref|XP_001920588.3| PREDICTED: spermatogenesis-associated protein 20 [Danio rerio]
          Length = 818

 Score =  620 bits (1599), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 334/758 (44%), Positives = 459/758 (60%), Gaps = 51/758 (6%)

Query: 46  HHFLSHKTKLSSLPRNYLYPFRRPLAVISHRPIHPYKVVAMAERTPASTSHSRN-----K 100
           HH L+ K + + LP +Y +  ++ + V +      ++   +   + AS S S +     K
Sbjct: 35  HHTLT-KNRCARLPHDYWFG-QKSVPVSTRLSWDSFRFSGVFFFSMASGSDSPDRLKTPK 92

Query: 101 HTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVES 160
           +TNRL+ E S YLLQHAHNPVDW+ WG+EAF +A+  D PIFLS+GYSTCHWCHVME ES
Sbjct: 93  YTNRLSQEKSSYLLQHAHNPVDWYPWGQEAFDKAKCEDKPIFLSVGYSTCHWCHVMERES 152

Query: 161 FEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGG 220
           FEDE + K+L+D FV IKVDREERPDVDKVYMT+VQA  GGGGWP+SV+L+PDLKP +GG
Sbjct: 153 FEDEEIGKILSDNFVCIKVDREERPDVDKVYMTFVQATSGGGGWPMSVWLTPDLKPFIGG 212

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFPP D   RPG KT+L ++ + W   R+ L  SG   +E L +  + SAS  +     
Sbjct: 213 TYFPPRDSGRRPGLKTVLLRIIEQWQTNRETLESSGERVLEALRKGTAISASPGETLPPG 272

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
           P  A R C +QL+ S++  +GGF  APKFP PV ++ ++           S E +E  +M
Sbjct: 273 PDVANR-CYQQLAHSFEEEYGGFREAPKFPSPVNLKFLMSFWAV---NRSSSEGAEALQM 328

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
            L TL+ MA GGIHDHV  GFHRYS D  WHVPHFEKMLYDQGQLA  Y+ A+ ++ +  
Sbjct: 329 ALHTLRMMALGGIHDHVAQGFHRYSTDSSWHVPHFEKMLYDQGQLAVAYITAYQVSGEQL 388

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL--- 457
           ++ + RD+L Y+ RD+    G  +SAEDADS  T  +T K+EGAF VWT+ E+ ++L   
Sbjct: 389 FADVARDVLLYVSRDLSDKSGGFYSAEDADSFPTVESTEKREGAFCVWTAGEIRELLPDI 448

Query: 458 -------GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 510
                     A +F  HY +K  GN D ++  DPH E +G+NVLI       +A+  G+ 
Sbjct: 449 VEGATGGATQADIFMHHYGVKEQGNVDPAQ--DPHGELQGQNVLIVRYSVELTAAHFGIS 506

Query: 511 LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
           + +   +L E R KL +VR  RP PHLD K++ SWNGL++S FAR   +L  +A      
Sbjct: 507 VNRLSELLSEARAKLAEVRRARPPPHLDTKMLASWNGLMLSGFARVGAVLGDKA------ 560

Query: 571 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG--------PSKAPGFLDDY 622
                      +E AE AA F++ HL+DE   R+ HS   G         S   GFLDDY
Sbjct: 561 ----------LLERAERAACFLQDHLWDEDGQRILHSCYRGNNMEVEQVASPITGFLDDY 610

Query: 623 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 682
           AF++ GLLDL+E     +WL WA ELQ  QD+LF D +G GYF +   DP++LL +K+D 
Sbjct: 611 AFVVCGLLDLFEATQKFRWLQWAEELQLRQDQLFWDSQGSGYFCSDPSDPTLLLALKQDQ 670

Query: 683 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 742
           DGAEPS NSVS +NL+RL+      + D+  Q +E  L  F  RL  + +A+P M     
Sbjct: 671 DGAEPSANSVSAMNLLRLSHFTG--RQDWI-QRSEQLLTAFSDRLLKVPIALPDMVRGV- 726

Query: 743 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           M    + K +V+ G   + D  ++++  ++ +  +K +
Sbjct: 727 MAHHYTLKQIVICGLPDAEDTASLISCVNSLFLPHKVL 764


>gi|189240570|ref|XP_973977.2| PREDICTED: similar to predicted protein [Tribolium castaneum]
          Length = 754

 Score =  619 bits (1597), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 322/677 (47%), Positives = 419/677 (61%), Gaps = 57/677 (8%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           S +  TNRLA E SPYLLQHA NPVDW+ WG+EAF  A+K +  IFLS+GYSTCHWCHVM
Sbjct: 2   STSTKTNRLALEKSPYLLQHATNPVDWYPWGQEAFDRAKKENKLIFLSVGYSTCHWCHVM 61

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
           E ESFEDE VAK++N  F+++KVDREERPDVDK+YM ++QA  GGGGWP+SVFL+P L+P
Sbjct: 62  EKESFEDEEVAKIMNQHFINVKVDREERPDVDKLYMAFIQASVGGGGWPMSVFLTPTLEP 121

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKL 276
           L GGTYFPPEDKYGRPGFKT+L+ + + W  K+  +A SG +++E L +      S+ + 
Sbjct: 122 LAGGTYFPPEDKYGRPGFKTVLKSIAEQWRTKQSAIANSGKYSLEVLRKVSEREISAKQD 181

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
            +   ++  + C  QLS SY+  FGGF + PKFP+P  +  + +   +      S +   
Sbjct: 182 INVPGEDVWKKCLLQLSHSYEDDFGGFSAQPKFPQPCNLNFLFHMYSR---DKHSEQGFR 238

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
              M L TL+ MA GGIHDHV  GF RYSVD+RWHVPHFEKMLYDQ QLA  Y DAF +T
Sbjct: 239 CLHMCLNTLRKMAYGGIHDHVNCGFARYSVDDRWHVPHFEKMLYDQAQLAVSYADAFVVT 298

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
           KD F++ + RDIL Y+ RD+  P G  + AEDADS   EGA+ K+EGAF VW  +E+  +
Sbjct: 299 KDDFFAEVLRDILLYVSRDLSHPLGGFYGAEDADSYPYEGASHKREGAFCVWEFEEISKL 358

Query: 457 LGE-------HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 509
           LGE       H  LF  HY +K  GN + ++  DPH+E + KN+L+       ++ K   
Sbjct: 359 LGETKTDDISHRDLFIYHYNVKEDGNVNPAQ--DPHHELEKKNILVCFGSFEDTSRKFKT 416

Query: 510 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 569
            +E    IL  C   L+  R KRP+PH+D K++ SWNGL+IS FA+A  +LK +      
Sbjct: 417 SVETVKEILKSCHEILYKERQKRPKPHVDTKIVTSWNGLMISGFAKAGFVLKDQ------ 470

Query: 570 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL------------------------- 604
                     EY+  A  AA+FI++ LY+EQ   L                         
Sbjct: 471 ----------EYINRAILAATFIKKFLYNEQDKTLLRCCYKGDNAKIVQTVANLLSKSQP 520

Query: 605 -QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG 663
             +S    P+   GFLDDYAFLI GLLDLYE      WL WA  LQ  QD LF D +G G
Sbjct: 521 TLNSINRRPTPVNGFLDDYAFLIRGLLDLYEASLDADWLSWAEVLQEQQDRLFWDTKGSG 580

Query: 664 YFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 723
           YF +   D S+L+R KED DGAEP GNS++V NL+RLA+ +   ++D  R  A  +L VF
Sbjct: 581 YFTSPANDSSILIRGKEDQDGAEPCGNSIAVHNLIRLAAYL--DRAD-LRAKAGRTLTVF 637

Query: 724 ETRLKDMAMAVPLMCCA 740
             RLK + +A+P M  A
Sbjct: 638 ADRLKSIPVALPEMTSA 654


>gi|410895871|ref|XP_003961423.1| PREDICTED: spermatogenesis-associated protein 20-like [Takifugu
           rubripes]
          Length = 748

 Score =  619 bits (1596), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 322/699 (46%), Positives = 429/699 (61%), Gaps = 44/699 (6%)

Query: 94  TSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWC 153
           +S   ++ TNRLA E SPYLLQHAHNPVDW+ WG+EAF +AR  D PIFLS+GYSTCHWC
Sbjct: 8   SSTPTHRGTNRLAKERSPYLLQHAHNPVDWYPWGQEAFDKARNEDKPIFLSVGYSTCHWC 67

Query: 154 HVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPD 213
           HVME ESFEDE + K+L+D FV IK+DREERPDVDKVYMT++QA  G GGWP+SV+L+PD
Sbjct: 68  HVMERESFEDEEIGKILSDNFVCIKLDREERPDVDKVYMTFIQATSGSGGWPMSVWLTPD 127

Query: 214 LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASS 273
           L+P +GGTYFPP D   RPG KT+L ++ D W   R  L  +G   +E L +  + +A +
Sbjct: 128 LRPFIGGTYFPPRDHGRRPGLKTVLMRIIDQWTNNRSALESNGNKILEALKKGTAIAADA 187

Query: 274 NKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGE 333
              P   P +  + C +QL+ SY+  +GGF  +PKFP PV +  ++ +      T    E
Sbjct: 188 GTSPPFAP-DVTKRCFQQLANSYEEEYGGFRDSPKFPSPVNLMFLMSYWCMNRST---SE 243

Query: 334 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
             E  +M L TL+ MA GGIHDHV  GFHRYS D  WHVPHFEKMLYDQ QLA  Y+ A 
Sbjct: 244 GVEALQMALHTLRMMALGGIHDHVSQGFHRYSTDSSWHVPHFEKMLYDQAQLAVAYITAS 303

Query: 394 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
            ++ + FY+ + +DIL Y+ RD+    G  +SAEDADS    G T K+EGAF +WT+ EV
Sbjct: 304 QVSGEQFYADVAKDILCYVSRDLSDKSGGFYSAEDADSLPHCGGTEKREGAFCIWTASEV 363

Query: 454 EDIL----------GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 503
            ++L             A +F  HY +K  GN  +S   DPH E +G+NVLI       +
Sbjct: 364 RELLPDVVEGTAGSATQADIFMHHYGVKEQGN--VSPEQDPHGELQGQNVLIVRYSLELT 421

Query: 504 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 563
           A+  G+ +E+  N+L   R K+ ++R  RPRPHLD K++ SWNGL++S++AR   +L  +
Sbjct: 422 AAHFGVSIEEVTNLLASARAKMAEIRKSRPRPHLDTKMLASWNGLMLSAYARVGAVLGDK 481

Query: 564 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPS--------KA 615
           A                 +E A  AA+F++ H++D +   L  S   G            
Sbjct: 482 A----------------LLERAVQAANFLQEHMWDPEQQTLLRSCYLGDDMELQQISPPI 525

Query: 616 PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL 675
            GFLDDYAF+I GLLDL+E    T+WL WA ELQ  QD+LF D EGGGYF +   D +VL
Sbjct: 526 SGFLDDYAFIICGLLDLHEATLQTEWLRWAEELQLRQDKLFWDDEGGGYFCSDPSDFTVL 585

Query: 676 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 735
           LR+KED DGAEPS NSVS  NL+RL+      +   + Q +E  LA F  RL  + +A+P
Sbjct: 586 LRLKEDQDGAEPSANSVSAFNLLRLSEYTGKQE---WLQKSERLLAAFTDRLTKVPIALP 642

Query: 736 LMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASY 774
            M  A  M    + K +V+ G + S D   +LA  ++ +
Sbjct: 643 EMVRAL-MAQHYTLKKIVICGKRDSPDTVTLLATVNSLF 680


>gi|327264961|ref|XP_003217277.1| PREDICTED: spermatogenesis-associated protein 20-like [Anolis
           carolinensis]
          Length = 739

 Score =  618 bits (1593), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 321/709 (45%), Positives = 437/709 (61%), Gaps = 44/709 (6%)

Query: 90  TPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYST 149
           T    SHS   HTNRL  E SPYLLQHAHNPVDW+ WG+EAF +A+K D  IFLS+GYST
Sbjct: 3   TGGKDSHSSALHTNRLVHEKSPYLLQHAHNPVDWYPWGQEAFDKAKKEDKLIFLSVGYST 62

Query: 150 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 209
           CHWCHVME ESF++E +A++LN+ FVSIKVDREERPDVDKVYMT+VQA   GGGWP+SV+
Sbjct: 63  CHWCHVMEHESFQNEEIAQILNENFVSIKVDREERPDVDKVYMTFVQATSSGGGWPMSVW 122

Query: 210 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 269
           L+PDLKP +GGTYFPPED   + GF+T+L ++ + W + R  L ++    +  L   +  
Sbjct: 123 LTPDLKPFVGGTYFPPEDGIYQVGFRTVLIRILEQWKRNRAALLENSQKILSALLARVDV 182

Query: 270 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 329
                ++P  L +  +  C +QLS+SYD  +GGF   PKFP PV +  +  +      T 
Sbjct: 183 GVRGEEIPPSL-KEVMSRCFQQLSESYDEEYGGFSETPKFPTPVNMNFLFSYWALHRST- 240

Query: 330 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 389
              E +   +M L TL+ MA GGIHDH+  GFHRYS D+RWHVPHFEKMLYDQGQLA V+
Sbjct: 241 --SEGARALQMALHTLKMMAYGGIHDHIAQGFHRYSTDQRWHVPHFEKMLYDQGQLAVVF 298

Query: 390 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 449
             AF ++ D F++ I  DIL Y  RD+    G  +SAEDADS  T  + +K+EGAF VWT
Sbjct: 299 AKAFQISGDEFFADIVADILLYASRDLSDKSGGFYSAEDADSYPTAKSEKKQEGAFCVWT 358

Query: 450 SKEVEDILGEH----------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 499
           ++E+  +L +           A +F  HY +K  GN  ++ M DPHNE KGKNVLI    
Sbjct: 359 AEEIRHLLPDLIEGSPERKSVADVFMHHYGVKEDGN--VNPMKDPHNELKGKNVLIVQYS 416

Query: 500 SSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 559
              +A++ G+ LE+   +L + R +L+  R++RPRPHLD K++ SWNGL+IS FA++  I
Sbjct: 417 LELTAARFGLGLEQLKTMLVKSRDQLYKARAQRPRPHLDTKMLASWNGLMISGFAQSGAI 476

Query: 560 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG------PS 613
           L                 +KEY++ A + A F+R ++++    +L  S   G       S
Sbjct: 477 L----------------GKKEYVDRAVNTADFLRNYMFNASNGKLLRSCYQGKENSVDKS 520

Query: 614 KAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGED 671
             P  GFL+DY F+I  L DLYE      WL WA++LQ+ QDELF D +G  YF T   D
Sbjct: 521 SVPIHGFLEDYVFVIQALFDLYEASLNPSWLEWAVQLQHKQDELFWDPKGFAYFTTEASD 580

Query: 672 PSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMA 731
           PS+LLR+K+D DGAEPS NSV+V NL+R AS     +   + + A   L+ F  RL  + 
Sbjct: 581 PSLLLRMKDDQDGAEPSPNSVAVSNLLRAASYTGHKE---WVKKAGQILSAFSERLLKIP 637

Query: 732 MAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           + +P M  A     + ++K VV+ G     D   +L   ++++  N+ +
Sbjct: 638 VVLPEMARATAAFHL-TQKQVVICGDPKGEDTRELLHCYYSTFTPNRVL 685


>gi|363740931|ref|XP_420103.3| PREDICTED: spermatogenesis-associated protein 20 [Gallus gallus]
          Length = 737

 Score =  610 bits (1574), Expect = e-172,   Method: Compositional matrix adjust.
 Identities = 321/699 (45%), Positives = 432/699 (61%), Gaps = 44/699 (6%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           +  NRL  E SPYL QHAHNPVDW+ WG+EAF +A++ +  IFLS+GYSTCHWCHVME E
Sbjct: 11  RRANRLIYERSPYLQQHAHNPVDWYPWGQEAFDKAKRENKLIFLSVGYSTCHWCHVMEEE 70

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SF+++ + ++++  FV IKVDREERPDVDKVYMT+VQA  GGGGWP+SV+L+PDL+P +G
Sbjct: 71  SFKNQEIGEIMSKNFVCIKVDREERPDVDKVYMTFVQATSGGGGWPMSVWLTPDLRPFVG 130

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           GTYFPPED     GF+T+L ++ + W + ++ L QS    +E L  +LS   + ++    
Sbjct: 131 GTYFPPEDSAHHVGFRTVLLRIAEQWRQNQEALLQSSQRILEAL-RSLSRVGTQDQQAAP 189

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
             Q  L  C +QLS SYD  +GGF   PKFP PV +  +  +      T    E +   +
Sbjct: 190 PAQEVLTTCFQQLSGSYDEEYGGFSQCPKFPTPVNLNFLFTYWALHRTT---PEGARALQ 246

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           M L TL+ MA GGIHDH+G GFHRYS D  WHVPHFEKMLYDQGQLA VY  AF ++ D 
Sbjct: 247 MSLHTLKMMAHGGIHDHIGQGFHRYSTDRHWHVPHFEKMLYDQGQLAVVYSRAFQISGDE 306

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL-- 457
           F++ +  DIL Y  RD+  P G  +SAEDADS  T  ++ K+EGAF VW ++EV  +L  
Sbjct: 307 FFADVAADILLYASRDLGSPAGGFYSAEDADSYPTATSSEKREGAFCVWAAEEVRALLPD 366

Query: 458 -----GEHAIL---FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 509
                 E   L   F  HY +K  GN  +S   DPH E +GKNVLI  +    +A+  G+
Sbjct: 367 PVEGAAEGTTLGDVFMHHYGVKEDGN--VSPRKDPHKELQGKNVLIAHSSPELTAAHFGL 424

Query: 510 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 569
              +   +L E RR+L   R++RPRPHLD K++ SWNGL+IS FA+A  +L         
Sbjct: 425 EPGQLSAVLQEGRRRLQAARAQRPRPHLDTKMLASWNGLMISGFAQAGAVLA-------- 476

Query: 570 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG------PSKAP--GFLDD 621
                   ++EY+  A  AA F+RRHL++  + RL  S   G       S AP  GFL+D
Sbjct: 477 --------KQEYVSRAAQAAGFVRRHLWEPGSGRLLRSCYRGEADVVEQSAAPIHGFLED 528

Query: 622 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 681
           Y F+I GL DLYE      WL WA++LQ+TQD+LF D +G  YF++   DPS+LLR+K+D
Sbjct: 529 YVFVIQGLFDLYEASLDQSWLEWALQLQHTQDKLFWDPKGFAYFSSEAGDPSLLLRLKDD 588

Query: 682 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 741
            DGAEP+ NSV+V NL+R AS    S    + + A   LA F  RL+ + +A+P M  A 
Sbjct: 589 QDGAEPAANSVTVTNLLRAASY---SGHMEWVEKAGQILAAFSERLQKIPLALPEMARAT 645

Query: 742 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
            +    + K VV+ G     D + ML+  H+++  NK +
Sbjct: 646 AVFH-HTLKQVVICGDPQGEDTKEMLSCVHSTFIPNKVL 683


>gi|156368209|ref|XP_001627588.1| predicted protein [Nematostella vectensis]
 gi|156214502|gb|EDO35488.1| predicted protein [Nematostella vectensis]
          Length = 735

 Score =  608 bits (1568), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 314/710 (44%), Positives = 427/710 (60%), Gaps = 50/710 (7%)

Query: 92  ASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCH 151
           A ++ +  K TNRL  E SPYLLQH +NPVDW+ WG+EAF +A+K   PIFLS+GYSTCH
Sbjct: 2   AESTDTSPKFTNRLVNEKSPYLLQHKNNPVDWYPWGDEAFQKAKKEQKPIFLSVGYSTCH 61

Query: 152 WCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLS 211
           WCHVME ESFEDE +AK+LN+ F+ +KVDREERPDVD+VYMTY+QA+ GGGGWP+S++L+
Sbjct: 62  WCHVMERESFEDENIAKILNENFIPVKVDREERPDVDRVYMTYIQAMVGGGGWPMSLWLT 121

Query: 212 PDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE-ALSAS 270
           PDLKP + GTYFPP D  GRPGF T+L  +   WD  +    Q     +  + E A    
Sbjct: 122 PDLKPFVAGTYFPPNDMAGRPGFGTVLGHIIKQWDTNKPKFTQQSTIVMNAILEHASEIG 181

Query: 271 ASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHSKKLEDTG 329
             +  +P+   +  +    + +SKS+D   GGFG APKFP+P     +  YH  K     
Sbjct: 182 LDAKDMPN---KEVIEKLYQGMSKSFDEELGGFGGAPKFPQPATFNFLFKYHLLK----N 234

Query: 330 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 389
            + E      + L TL+CM KGGIHDHVG GFHRYS D  WHVPHFEKMLYDQ Q+A  Y
Sbjct: 235 GTEEGERALHICLKTLECMGKGGIHDHVGQGFHRYSTDRFWHVPHFEKMLYDQAQIAAAY 294

Query: 390 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 449
              + +TKD  ++  CRDIL Y+ RD+    G  +SAEDADS  +  AT+K EGAFYVW 
Sbjct: 295 AMGYQMTKDEKFAETCRDILLYVMRDLSHKLGGFYSAEDADSLPSPNATKKTEGAFYVWE 354

Query: 450 SKEVEDILGEH-----------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 498
            +E++D+L +            + LF +HY ++  GN  +    DPH E   KNVLI   
Sbjct: 355 EQELKDLLSDSLPTKGGGSILLSELFNKHYGVQAEGN--VKPHQDPHKELVKKNVLIVRG 412

Query: 499 DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 558
               +   L +  ++    L + R  LF+ R KRP PHLDDK+I SWNGL+IS FAR+ +
Sbjct: 413 SLQDTIKDLDVEEDEAKEQLAKAREILFEERKKRPAPHLDDKMITSWNGLMISGFARSGQ 472

Query: 559 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG------- 611
           +L  E                 Y+  A  AA F+R HLYD+ +  L  S   G       
Sbjct: 473 VLGEEV----------------YILRAIKAAEFVRTHLYDKSSGELLRSCYRGDKDSIAQ 516

Query: 612 -PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGE 670
             +   G+  DY +LI+GLLDLYE     +WL WA ELQ+  DELFLD+E GGYF  T  
Sbjct: 517 IATPIKGYGCDYVYLINGLLDLYEASFDEQWLKWAEELQDKADELFLDKEKGGYFEVTEA 576

Query: 671 DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDM 730
           D S+L+R+K++ DGAEPS NS++V+NL+RL + V   +   YR  A+    V+E+RL+ +
Sbjct: 577 DKSILVRLKDEQDGAEPSANSLAVMNLMRLGNFVDCQR---YRDQAQRIFMVYESRLRQI 633

Query: 731 AMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
            +A+P +       ++   K +++ G + + D + ++   H+ Y  NK +
Sbjct: 634 PLALPELVSNFITHNL-GMKQIIIAGDRDADDTKLLMRCVHSHYIPNKVL 682


>gi|47211932|emb|CAF92441.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 833

 Score =  607 bits (1566), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 323/713 (45%), Positives = 425/713 (59%), Gaps = 69/713 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYLLQHAHNPVDW+ WG+EAF +AR  D PIFLS+GYSTCHWCHVME ESFE
Sbjct: 1   NRLAKERSPYLLQHAHNPVDWYPWGQEAFDKARNEDKPIFLSVGYSTCHWCHVMERESFE 60

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE + K+LND FV IK+DREERPDVDKVYMT+VQA  GGGGWP+SV+L+PDL+P +GGTY
Sbjct: 61  DEEIGKILNDNFVCIKLDREERPDVDKVYMTFVQATSGGGGWPMSVWLTPDLRPFIGGTY 120

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPP D  GRPG KT+L ++ D W   R  L  +G   +E L +  + ++ +   P   P 
Sbjct: 121 FPPRDHGGRPGLKTVLMRIIDQWRNNRPTLESNGNKILEALRKGTAIASDAGSSPAFAPD 180

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
            A R C +QL+ SY+  +GGF  APKFP PV +  ++ +      T    E  E  +M L
Sbjct: 181 VAKR-CFQQLANSYEEEYGGFREAPKFPSPVNLMFLMSYWCVNRSTS---EGVEALQMAL 236

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL+ MA GGI+DHV  GFHRYS D  WHVPHFEKMLYDQ QLA  Y+ A   + + FY+
Sbjct: 237 HTLRMMALGGINDHVSQGFHRYSTDSSWHVPHFEKMLYDQAQLAVAYITASQASGEQFYA 296

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL----- 457
            + +D+L Y+ RD+    G  +SAEDADSA   G   K+EGAF +WT+ EV ++L     
Sbjct: 297 DVAKDVLRYVSRDLSDKSGGFYSAEDADSAPPSGGAEKREGAFCIWTASEVRELLPDVVK 356

Query: 458 -----GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 512
                   A +F  HY +K  GN  +S   DPH E +G+NVLI       +A+  G+ +E
Sbjct: 357 GASASATQADIFMHHYGVKEQGN--VSPEQDPHGELQGQNVLIVRYSLELTAAHFGISVE 414

Query: 513 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 572
           +   +L   R K+  VR  RPRPHLD K++ SWNGL++S++AR   +L            
Sbjct: 415 EVSALLASARAKMAAVRKSRPRPHLDTKMLASWNGLMLSAYARVGAVLGD---------- 464

Query: 573 VVGSDRKEYMEVAESAASFIRRHLYD-EQTHRLQHSF----------------------- 608
                 K  +E A  AA+F++ HL+D EQ   L+  +                       
Sbjct: 465 ------KTLLERAAQAANFLQEHLWDPEQQIVLRSCYLGDNMELQQMTIKLNLPELSNEN 518

Query: 609 -------RNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG 661
                  R+ P    GFLDDYAF+I GLLDL+E    T+WL WA ELQ  QD+LF D +G
Sbjct: 519 NYETVTQRSQPIS--GFLDDYAFIICGLLDLHEATLQTEWLRWAEELQLRQDKLFWDEQG 576

Query: 662 GGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 721
           GGYF +   D +VLL++KED DGAEPS NSVS  NL+RL+      +   + Q ++  LA
Sbjct: 577 GGYFCSDPSDSTVLLQLKEDQDGAEPSANSVSAFNLLRLSHYTGRQE---WLQKSQRLLA 633

Query: 722 VFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASY 774
            F  RL    +A+P M  A  M    + K +V+ G + S D   +L+  ++ +
Sbjct: 634 AFTDRLTRAPIALPEMVRAL-MAQHYTLKQIVICGQRDSPDTAALLSTVNSLF 685


>gi|193215110|ref|YP_001996309.1| hypothetical protein Ctha_1399 [Chloroherpeton thalassium ATCC
           35110]
 gi|193088587|gb|ACF13862.1| protein of unknown function DUF255 [Chloroherpeton thalassium ATCC
           35110]
          Length = 710

 Score =  606 bits (1563), Expect = e-170,   Method: Compositional matrix adjust.
 Identities = 312/691 (45%), Positives = 421/691 (60%), Gaps = 46/691 (6%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K  NRL+ E SPYLLQHA+NPVDWFAWG+EAF +AR  + PIFLSIGYSTCHWCHVME E
Sbjct: 6   KEPNRLSREKSPYLLQHAYNPVDWFAWGDEAFEKARSEEKPIFLSIGYSTCHWCHVMERE 65

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFE+E +A++LN+ FVSIKVDREE PD+DKVYMTYVQA  G GGWP+SV+L+P+LKP  G
Sbjct: 66  SFENEEIARILNEHFVSIKVDREEHPDLDKVYMTYVQASTGSGGWPMSVWLTPELKPFFG 125

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSN-KLPD 278
           GTYFPP D YGRPGF ++L K+ ++W + R+ + Q+     EQL       A +  K+PD
Sbjct: 126 GTYFPPSDSYGRPGFGSMLLKIAESWQQSRERVLQAAGNISEQLQAFSEMQAEAGAKVPD 185

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM--LYHSKKLEDTGKSGEASE 336
           E    A +    Q    +D  +GGFG+APKFPRP  +  +   +H  K E          
Sbjct: 186 EA---AFQNTFAQFESVFDKDWGGFGNAPKFPRPAILNFLFTFFHQTKNE---------A 233

Query: 337 GQKMVLFTLQCMAKGGIHDHV------GGGFHRYSVDERWHVPHFEKMLYDQGQLANVYL 390
             +M L TL+ MA GG+HDH+      GGGF RYS D  WHVPHFEKMLYD  QLA+ YL
Sbjct: 234 ALRMALHTLRKMADGGMHDHISVPGKGGGGFARYSTDAYWHVPHFEKMLYDNAQLASAYL 293

Query: 391 DAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTS 450
           DA+ +T D F++   RDI +Y+  DM  P G  +SAEDADS     +  K EGAFYVW  
Sbjct: 294 DAYQITSDRFFADTARDIFNYVLCDMTAPEGGFYSAEDADSLAAPESPEKTEGAFYVWER 353

Query: 451 KEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 509
            E++ +LG+ A  +F   Y + P GN  +    DPH EFKGKN+LI     S +A + G 
Sbjct: 354 AEIDALLGDEASQIFSFIYGVHPGGNASV----DPHGEFKGKNILIRRATLSQAAQEFGK 409

Query: 510 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 569
                  ++ + R +LFD R +RPRPH DDK++ +WNGL+IS+FA+   +L         
Sbjct: 410 SEADIAEVMAKSRERLFDARLQRPRPHRDDKILTAWNGLMISAFAKGYMVL--------- 460

Query: 570 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 629
                  D   Y+  A+ AA F+   LY+++T  L   +R+G S   G  DDYAF +  L
Sbjct: 461 -------DEATYLHAAQKAADFVIEKLYNKETGGLLRRYRDGESAIDGKADDYAFFVQAL 513

Query: 630 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 689
           +DLYE     K+L  A++L   Q+ LF D + GG+F++T E+ SV+ R+K+D DGAEPS 
Sbjct: 514 IDLYEASFQFKYLSLALDLAEKQNALFYDAQNGGFFSSTSENKSVIFRLKDDQDGAEPSA 573

Query: 690 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR 749
           NSV+ +NL+RL+ +   +  + +RQ AE ++  F   L +    +P M  A   L     
Sbjct: 574 NSVAALNLLRLSQM---ADREDFRQKAEATVNFFGKILSEAGNQMPQMFAALSFLK-QKP 629

Query: 750 KHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           K ++L G   S +   +  A  + Y+  K +
Sbjct: 630 KQIILTGAPDSPELRALRKAIDSVYEPVKVL 660


>gi|321473187|gb|EFX84155.1| hypothetical protein DAPPUDRAFT_47524 [Daphnia pulex]
          Length = 661

 Score =  606 bits (1562), Expect = e-170,   Method: Compositional matrix adjust.
 Identities = 320/679 (47%), Positives = 433/679 (63%), Gaps = 56/679 (8%)

Query: 92  ASTSHSRNKHT-NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTC 150
           +S++   +KH  N+L    SPYLLQHA NPV W+ WGEEA  +A++ +  IFLS+GYSTC
Sbjct: 4   SSSAGGCHKHDPNQLIKSKSPYLLQHAFNPVQWYPWGEEAIKKAKEENKLIFLSVGYSTC 63

Query: 151 HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 210
           HWCHVME ESFEDE VA+L+N  F++IKVDREERPDVDK+YM++VQA+ G GGWP+SV++
Sbjct: 64  HWCHVMEKESFEDENVAELMNSEFINIKVDREERPDVDKMYMSFVQAITGRGGWPMSVWM 123

Query: 211 SPDLKPLMGGTYFPPEDK-YGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 269
           +P+LKP+ GGTY+PP+D+ YG+PGFKTIL+ + + W +       SG    E++  AL+ 
Sbjct: 124 TPELKPVYGGTYYPPDDRYYGQPGFKTILKSLAEQWKENPGKFKASG----EKIMTALAR 179

Query: 270 SASSNKLPDELPQ--NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 327
           S++  +  D++P   +   LC +QL  SY+ +FGGF  APKFP+PV + ++L      +D
Sbjct: 180 SSTLGR-GDQVPSAFDCGHLCFQQLRGSYEPKFGGFSKAPKFPQPVNMNLLLRWHVLSDD 238

Query: 328 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 387
              S  A +   M L TL+ MAKGGI DHV  GF RYS DE+WHVPHFEKMLYDQ QLA 
Sbjct: 239 AADSDLALD---MCLHTLRMMAKGGIFDHVRLGFARYSTDEKWHVPHFEKMLYDQAQLAL 295

Query: 388 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 447
           VY DA+ LTKD  ++ +  DIL Y+  D+  P G  +SAEDADS    G+  K+EGAF V
Sbjct: 296 VYTDAYLLTKDQDFARVASDILTYVSNDLSDPSGGFYSAEDADSYPETGSDEKREGAFCV 355

Query: 448 WTSKEVEDILGEHAI------------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 495
           W+ KE++ +L                 +   H+ ++P+GN D     DPH+E KG+NVLI
Sbjct: 356 WSHKEIQSVLASQPAPSQVGPDVTVSDIVCYHFDIRPSGNVD--PYQDPHDELKGQNVLI 413

Query: 496 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 555
                  +A+K G+ ++    +L      + + R +RPRPHLDDK++ SWNGL+IS+ AR
Sbjct: 414 IRGSDEETAAKFGLSMDVLRELLETALSTMREARQRRPRPHLDDKMLASWNGLMISALAR 473

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS-FRNGPSK 614
           A +IL                 R  Y+E A  AA F+R+HLYD Q+ RL  S +R G  +
Sbjct: 474 AGQILG----------------RDTYVERAAKAAEFVRQHLYDGQSGRLLRSCYRGGDGQ 517

Query: 615 AP----------GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 664
                       GFLDDYAF+I GLLDLY      KW+ WA ELQ  QD+LF D   GGY
Sbjct: 518 QDAVSQNAEPIGGFLDDYAFVIRGLLDLYTACQDEKWIQWADELQQKQDQLFWDPSQGGY 577

Query: 665 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 724
           F++   DPS+L+R+KE+ DGAEPSGNS++V NL RLA  VA  +SD YR  A  +L +F+
Sbjct: 578 FSSAAGDPSILIRLKEEQDGAEPSGNSIAVGNLERLA--VAVDRSD-YRDQARRTLCLFQ 634

Query: 725 TRLKDMAMAVPLMCCAADM 743
            RL  + +++P M  A  +
Sbjct: 635 DRLAKIPVSLPEMVAALQL 653


>gi|116626220|ref|YP_828376.1| hypothetical protein Acid_7180 [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116229382|gb|ABJ88091.1| protein of unknown function DUF255 [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 704

 Score =  602 bits (1551), Expect = e-169,   Method: Compositional matrix adjust.
 Identities = 323/684 (47%), Positives = 426/684 (62%), Gaps = 39/684 (5%)

Query: 101 HTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVES 160
           HTNRLA E SPYLLQHAHNPVDW  WG EAF  AR+ + PIFLSIGYSTCHWCHVME ES
Sbjct: 2   HTNRLAQEKSPYLLQHAHNPVDWQPWGPEAFERARQENKPIFLSIGYSTCHWCHVMERES 61

Query: 161 FEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGG 220
           FE+E +A LLN  +++IKVDREERPDVD++YMT+VQA  G GGWP+SV+L+P+L+P  GG
Sbjct: 62  FENEEIAALLNRDYIAIKVDREERPDVDRIYMTFVQATTGSGGWPMSVWLTPELEPFFGG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFPPE+++G PGF +IL ++   W   R  + +S    IEQL + +  + S   +    
Sbjct: 122 TYFPPENRWGHPGFGSILTQIAGVWRDNRPQVVESARDVIEQLKKHVEVAPSHGGV--AF 179

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQ--MMLYHSKKLEDTGKSGEASEGQ 338
            Q  L        +++D+R GGFG+APKFPR V I   ++ Y+++    TG      E  
Sbjct: 180 DQATLDSGFSVFRRTFDTRTGGFGAAPKFPR-VSIHHFLLRYYAR----TGN----KEAL 230

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
            MVL TL+ MA+GG++D +GGGFHRYSVD+RW VPHFEKMLYDQ Q+A  YL+AF +T D
Sbjct: 231 DMVLLTLREMARGGMNDQLGGGFHRYSVDDRWFVPHFEKMLYDQAQIAISYLEAFQVTGD 290

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET-EGATRKKEGAFYVWTSKEVEDIL 457
             Y+   R I DY+ RDM   GG  +SAEDADS  T E  T K EGAFY+W+ +E+  ++
Sbjct: 291 AQYADTARAIFDYVLRDMTDSGGGFYSAEDADSIITPEQPTLKGEGAFYIWSMEEIHALV 350

Query: 458 GEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
           G  A   F   Y ++  GN +    +DPH EF GKN+L + +    +A   G P  +   
Sbjct: 351 GAPASDWFCYRYGVREGGNVE----NDPHGEFTGKNILYQQHTLEQTAEHFGQPAGEMDA 406

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
            L    R L   R+KR RPHLDDK++ SWNGL+IS+FA+   +L+    +          
Sbjct: 407 TLDNAARILLQARAKRVRPHLDDKILTSWNGLMISAFAKGGAVLEEPRYAEA-------- 458

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
                   A  AA+F+   L D  +  L   +R G +  PGFLDDYAF + GLLDLYE  
Sbjct: 459 --------ARRAAAFVAGRLCDAASGTLLRRYREGDAAIPGFLDDYAFFVQGLLDLYEAQ 510

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
                L  AI L   Q ELF DRE G +F+T   DP ++LRVKED+DGAEPSGNSVSV+N
Sbjct: 511 FDLSHLQLAIRLTEKQLELFEDREAGAFFSTIDGDPELVLRVKEDYDGAEPSGNSVSVMN 570

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVG 756
           LVRLA I   +  D +RQ+A  +L+ F +RL    MAVP +  A + ++   R+ ++  G
Sbjct: 571 LVRLAQI---TNRDQFRQSAGRALSAFASRLSVAPMAVPQLLAACEFVTGQPRE-IIFAG 626

Query: 757 HKSSVDFENMLAAAHASYDLNKTV 780
            + S + + ML   H  +  N+ V
Sbjct: 627 TRDSAELQAMLHELHRRFIPNRVV 650


>gi|241111177|ref|XP_002399229.1| spermatogenesis-associated protein, putative [Ixodes scapularis]
 gi|215492917|gb|EEC02558.1| spermatogenesis-associated protein, putative [Ixodes scapularis]
          Length = 745

 Score =  600 bits (1546), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 318/684 (46%), Positives = 424/684 (61%), Gaps = 43/684 (6%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYLLQHA+NPVDW+ WG+EA A A+  D PIFLS+GYSTCHWCHVME ESFE
Sbjct: 20  NRLAGEKSPYLLQHANNPVDWYPWGDEAIARAKSEDKPIFLSVGYSTCHWCHVMERESFE 79

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +  +A+L+N+ FV++KVDREERPD+D+VYMTY+QA  GGGGWP+SV+L+PDLKP++GGTY
Sbjct: 80  NADIARLMNEHFVNVKVDREERPDLDRVYMTYIQATSGGGGWPMSVWLTPDLKPIVGGTY 139

Query: 223 FPPEDKY-GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           FPP+D+Y GRPGFKT+L  + +   +  ++L Q+         EA +A+++S        
Sbjct: 140 FPPDDRYFGRPGFKTLLAAIAEQGSRIVEILRQASDLRSSDEREAGAAASTSGSEAVPRA 199

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
                 C EQLS+SYD   GGFG APKFP+ V +  +L H+   ++    GEA+   +M 
Sbjct: 200 STVAATCFEQLSRSYDEAMGGFGKAPKFPQCVNLNFLLRHAVASQE---PGEAARALEMC 256

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
           + TL  MA+GGIHDHV  GFHRYS D  WHVPHFEKMLYDQ QLA  YL+AF  T+D   
Sbjct: 257 VNTLNKMARGGIHDHVAKGFHRYSTDGGWHVPHFEKMLYDQAQLARAYLEAFQATRDPHL 316

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH- 460
           + + RD+LDY+ RD+    G  +SAEDADS     +  KKEGAF VW   EV  +L E  
Sbjct: 317 AQVARDVLDYVERDLSHQSGGFYSAEDADSLPEASSGEKKEGAFCVWEEAEVRRLLPEPL 376

Query: 461 --------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 512
                   A LF  ++ ++  GN D   M DPH+E KGKNVL+      + A + G+ L 
Sbjct: 377 PGCPGRTVADLFCRYFGVEAGGNVD--PMQDPHDELKGKNVLVVRESQESLAERFGLELP 434

Query: 513 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 572
              ++L + RR L + R +RPRPHLDDK + +WNGL++S FA A+K+L            
Sbjct: 435 VLHSLLEDARRVLLEARQRRPRPHLDDKFLAAWNGLMVSGFATAAKVL------------ 482

Query: 573 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPS--------KAPGFLDDYAF 624
               DR+ Y   A  A +F+ +HLYDE    L  S   G            PG L+DYAF
Sbjct: 483 ---GDRR-YAGRALQAVAFLGQHLYDEDRKSLLRSAYRGEGGHVTQTARPIPGVLEDYAF 538

Query: 625 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 684
            + GLLD YE       L+ A ELQ+ QD  F D + GGYF ++GED  +LLR+K+D DG
Sbjct: 539 TVQGLLDTYEACFEAPCLLRAEELQDAQDARFWDPDQGGYFLSSGEDAHLLLRLKDDQDG 598

Query: 685 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 744
           AEPS NSVS+ NLVRL+ ++  +++D  R+ A+     +  RL  + +A+P M C    L
Sbjct: 599 AEPSPNSVSLSNLVRLSVLL--NRAD-LRERAQRLAEAYARRLSLLPLALPEMVCGLLRL 655

Query: 745 SVPSRKHVVLVGHKSSVDFENMLA 768
                + VV+ G K     + +L+
Sbjct: 656 QA-GPQEVVVAGGKDHPGTQELLS 678


>gi|340370640|ref|XP_003383854.1| PREDICTED: spermatogenesis-associated protein 20 [Amphimedon
           queenslandica]
          Length = 741

 Score =  599 bits (1545), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 332/714 (46%), Positives = 444/714 (62%), Gaps = 57/714 (7%)

Query: 93  STSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHW 152
           ST+    +  NRLA E SPYLLQHA NPVDW+ WGEEAF ++R  + PIFLS+GYSTCHW
Sbjct: 2   STNSCSKRLLNRLAGEKSPYLLQHATNPVDWYPWGEEAFTKSRNENKPIFLSVGYSTCHW 61

Query: 153 CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 212
           CHVME ESFE + VAK+LND FVSIKVDREERPDVDKVYMT+VQA  G GGWP+SVFL+P
Sbjct: 62  CHVMERESFESDTVAKVLNDHFVSIKVDREERPDVDKVYMTFVQATQGSGGWPMSVFLTP 121

Query: 213 DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASAS 272
           +LKP +GGTYFPPED +  P F TIL  V + W K  D + Q     ++ L  A++ S+S
Sbjct: 122 ELKPFLGGTYFPPEDSFRSPSFLTILNAVHEQWTKDHDNIKQKMNPLMKALQAAVAGSSS 181

Query: 273 SNKLPDELPQNA-LRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTG 329
            N    +LP  A ++  AE L+  +DS++GGFG + KFP+PV + ++L  Y      + G
Sbjct: 182 LNP---QLPGTACIQKAAEMLADRFDSKYGGFGQSMKFPQPVILDLLLRIYARYPSSEMG 238

Query: 330 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 389
               AS     VLFTL+ M+ GG+HDH+G GFHRYS D  WHVPHFEKMLYDQ QL   Y
Sbjct: 239 DGALAS-----VLFTLEAMSNGGMHDHIGQGFHRYSTDPYWHVPHFEKMLYDQAQLVVTY 293

Query: 390 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 449
           L A+ +TKD  +     DIL+Y+ RD+    G  +SAEDADS    G   KKEGAF VWT
Sbjct: 294 LSAYQITKDDKFKETAVDILEYVLRDLGDKDGGFYSAEDADSYRCHGDKEKKEGAFCVWT 353

Query: 450 SKEVEDILGEH----------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 499
            +E++ IL +           A LF   + +K  GN   ++  DPH E   +NVLI    
Sbjct: 354 WEEIQSILLDPLPGGDTDKTLADLFSSRFGVKKGGNVRPNQ--DPHGELINQNVLIIKKS 411

Query: 500 SSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 559
               +S+  + +E+  ++L E + +L+ +R++RP+PH DDK++ +WNGL++S+ +RAS++
Sbjct: 412 FEELSSEFSLEVEQVKSLLMEAKDRLYKMRAERPKPHRDDKILTAWNGLMVSALSRASQV 471

Query: 560 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD-EQTHRLQHSFRN-----GPS 613
           L                   EY+E A+SAASFIR  LYD E++  L++++R+       S
Sbjct: 472 LGG----------------SEYLERAKSAASFIRDSLYDKEKSVLLRNAYRDENDVLSVS 515

Query: 614 KAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLD------REGGGYFNT 667
              GF DDYAFLI GL+DLYE      WL WA+ELQ  QD LFLD       E GGYF+T
Sbjct: 516 TVEGFADDYAFLIRGLIDLYEASHDPLWLKWALELQEQQDRLFLDIKGEEGEEKGGYFST 575

Query: 668 TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 727
           +G D S+LLR+K+  DGAEPS NSVS  NL+RL+S    S+    R  +E+    F + +
Sbjct: 576 SGMDDSILLRMKDGEDGAEPSANSVSAENLLRLSSFFDKSE---LRSKSENIFKTFNSSM 632

Query: 728 KDMAMAVPLMCCA-ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
            +   A+  +  A    L  P  K V++VG  S  D + +L+  H+ +  NKT+
Sbjct: 633 MEHPPAMAALIGAFISYLQKP--KQVIIVGLISGDDTQALLSCIHSHFIPNKTL 684


>gi|357626408|gb|EHJ76509.1| hypothetical protein KGM_19065 [Danaus plexippus]
          Length = 813

 Score =  598 bits (1543), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 321/695 (46%), Positives = 420/695 (60%), Gaps = 51/695 (7%)

Query: 83  VVAMAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIF 142
           ++ MA    + +S +  KHTN+L  E SPYLLQHAHNPVDW+ W +EA   A++ +  IF
Sbjct: 71  IIKMAS---SESSATPKKHTNKLVNEKSPYLLQHAHNPVDWYPWCQEAIDRAKQENKLIF 127

Query: 143 LSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGG 202
           LS+GYSTCHWCHVME ESFE E VAK++N+ F++IKVDREERPD+D+VYM +V A  GGG
Sbjct: 128 LSVGYSTCHWCHVMERESFESEDVAKIMNEHFINIKVDREERPDLDRVYMLFVMATTGGG 187

Query: 203 GWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQ 262
           GWP+SVFL+PDL+P+ GGTYFPPED++GRPGFKTIL  +   W + +    ++    ++ 
Sbjct: 188 GWPMSVFLTPDLRPVTGGTYFPPEDRWGRPGFKTILLSLAKKWKENQTQFLEASINIMDA 247

Query: 263 LSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS 322
           L    +    +N +P E   N    C  +   +++  FGGFG+APKFP+   I   L+H 
Sbjct: 248 LQNISNVKVETNSVPGEATWNK---CVRRYITNFEPHFGGFGTAPKFPQ-ASIFNFLFHF 303

Query: 323 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 382
              +   ++ E  +  +M L TL  ++KGGIHDHV  GF RYSVD  WHVPHFEKMLYDQ
Sbjct: 304 YARDK--QNPEGKQCLEMCLHTLTKISKGGIHDHVASGFARYSVDNDWHVPHFEKMLYDQ 361

Query: 383 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 442
            QL   Y DA+  TK+ +Y+ + RDI+ Y+ RD+    G  +SAEDADS    GA +KKE
Sbjct: 362 AQLMVAYTDAYLATKEEYYADVVRDIVKYVNRDLRHDLGGYYSAEDADSYPVFGADKKKE 421

Query: 443 GAFYVWTSKEVEDILGEHAI-------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 495
           GAF VW   E+  ++G+  +       +F +++ ++ +GN  +S  SDPH E   KNVLI
Sbjct: 422 GAFCVWEYDEINSLIGDKKVGNVSYLEIFCDYFNVEESGN--VSPESDPHGELTNKNVLI 479

Query: 496 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 555
                  +ASK  +  ++   +L EC   L++ RSKRPRPHLD K++ SWNGL IS  A 
Sbjct: 480 IYGSEEETASKFEITKDQLKQVLKECIDILYEARSKRPRPHLDTKMLCSWNGLAISGLAH 539

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSF------- 608
           A +                G   K ++E A   A+FI+ HLYD++   L HS        
Sbjct: 540 AGQ----------------GLGEKSFVEDAIKTANFIKEHLYDQENKTLLHSCYKAEDGN 583

Query: 609 ---RNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 665
               N P K  GFLDDYAFLI GLLDLYE      WL WA ELQ  Q+ELF D + GGYF
Sbjct: 584 ITQTNPPIK--GFLDDYAFLIRGLLDLYEASLDLHWLNWARELQEKQNELFWDSDNGGYF 641

Query: 666 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKS----DYYRQNAEHSLA 721
             + ED SV+LR+KED DGAEPSGNSVS  NL RLA+    S +    D  R  A+  L 
Sbjct: 642 TCSAEDTSVVLRLKEDQDGAEPSGNSVSCHNLQRLAAYADKSSAEEGGDRERDMAKKVLM 701

Query: 722 VFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVG 756
            F  RL D   A P M  A  M    S   V++ G
Sbjct: 702 AFAKRLIDSPTASPEMMSAL-MFFTDSPTQVLISG 735


>gi|449283068|gb|EMC89771.1| Spermatogenesis-associated protein 20, partial [Columba livia]
          Length = 682

 Score =  598 bits (1542), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 315/698 (45%), Positives = 427/698 (61%), Gaps = 50/698 (7%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           +HTNRL  E SPYLLQHAHNPVDW+ WG+EAF +A+K +  IFLS+GYSTCHWCHVME E
Sbjct: 17  RHTNRLINEKSPYLLQHAHNPVDWYPWGQEAFDKAKKENKLIFLSVGYSTCHWCHVMEEE 76

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SF+++ + ++++  FV IKVDREERPDVDKVYMT+  A  GGGGWP+SV+L+PDLKP  G
Sbjct: 77  SFKNKEIGEIMSKNFVCIKVDREERPDVDKVYMTF--ATSGGGGWPMSVWLTPDLKPFAG 134

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           GTYFPPED   R GF+T+L ++ + W + +D L +S    +E L           + P  
Sbjct: 135 GTYFPPEDGVHRVGFRTVLLRIAEQWKENKDSLLESSRKILEALQHVSEIRVRGQESPPP 194

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
             +  +  C +QLS SYD  +GGF  +PKFP PV +   L+    L  T  + E +   +
Sbjct: 195 -SKEVMATCFQQLSNSYDEDYGGFSKSPKFPSPVNLN-FLFTYWALHRT--TPEGARALQ 250

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           M L TL+ MA GGIHDH+  GFHRYS D+ WHVPHFEKMLYDQGQLA  Y  AF ++ D 
Sbjct: 251 MALHTLKMMAHGGIHDHIDQGFHRYSTDQHWHVPHFEKMLYDQGQLAATYSRAFQISGDQ 310

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 459
           F++ + +DIL Y+ RD+    G  +SAEDADS  T  +  K+EGAF VW ++E+  +L +
Sbjct: 311 FFADVAQDILLYVSRDLSDQAGGFYSAEDADSYPTTASKEKREGAFCVWAAEEIRALLPD 370

Query: 460 H----------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 509
                        +F  HY +K TGN  +S M DPH E KGKNVLI       +A++ G+
Sbjct: 371 PVEGATEGTTLGDVFMHHYGVKETGN--VSPMQDPHQELKGKNVLIVRCSPEVTAAQFGL 428

Query: 510 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 569
            L +   +L E R++L   R++RPRPHLD K++ +WNGL+IS FA+A  +L         
Sbjct: 429 ELGRLGAVLQEGRQRLSTARAQRPRPHLDTKMLAAWNGLMISGFAQAGTVL--------- 479

Query: 570 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG------PSKAP--GFLDD 621
                  D++EY+  A  AA+F+R+HL+D  + RL  S   G       S  P  GFL+D
Sbjct: 480 -------DKQEYVSRAAQAAAFLRKHLFDPTSGRLLRSCYRGRDNTVEQSAVPIQGFLED 532

Query: 622 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 681
           Y F+I  L DLYE      WL WA++LQ+ QD+LF D +G  YF++   DPS+LLR+K D
Sbjct: 533 YVFVIQALFDLYEASLEQDWLEWALQLQHMQDKLFWDSKGFAYFSSEAGDPSLLLRLKGD 592

Query: 682 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 741
            DGAEP+ NSV+V NL+R A   A  +   + + A   LA F  RL+     +P+M  A 
Sbjct: 593 QDGAEPTANSVTVTNLLRAACYSAHME---WVEKAGQILAAFSERLQK----IPIMARAT 645

Query: 742 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKT 779
            +    + K V++ G     D + ML   H+ +  NK 
Sbjct: 646 AVFH-HTLKQVIICGDPQGEDTKEMLRCVHSVFSPNKV 682


>gi|328702149|ref|XP_001952649.2| PREDICTED: spermatogenesis-associated protein 20-like
           [Acyrthosiphon pisum]
          Length = 784

 Score =  597 bits (1538), Expect = e-167,   Method: Compositional matrix adjust.
 Identities = 331/733 (45%), Positives = 430/733 (58%), Gaps = 61/733 (8%)

Query: 69  PLAVISHRPIHPYKVVAMAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGE 128
           P + ++ RP +   +      T    S S  K  NRLA E SPYLLQHA NPV W+ WG+
Sbjct: 23  PKSQLTIRPPNYNYIKRFQSSTVNLNSRSMEKIKNRLAQERSPYLLQHAENPVQWYPWGD 82

Query: 129 EAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVD 188
           EAF +AR     IFLS+GYSTCHWCHVME ESFE++ VA ++N+ +V+IKVDREERPDVD
Sbjct: 83  EAFEKARSEKKLIFLSVGYSTCHWCHVMEHESFENQDVAAVMNEHYVNIKVDREERPDVD 142

Query: 189 KVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW--- 245
           ++YMT+VQA  G GGWP+SVFL+PDLKP+ GGTY+PPED YGRPGFKTIL  +   W   
Sbjct: 143 QLYMTFVQAASGQGGWPMSVFLTPDLKPIGGGTYYPPEDAYGRPGFKTILLHMAKRWKSD 202

Query: 246 --------DKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYD 297
                    K   +L  + AF I QL   LS     N      P+  +  C  QL + YD
Sbjct: 203 SKSMLENSSKMMKILNDTTAFDI-QLGTELSNIMKPN------PKTWIT-CYSQLQRIYD 254

Query: 298 SRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHV 357
             +GGFG  PKFP+P  +  + + S K+    KS E  +  +M L TLQ M  GGIHDH+
Sbjct: 255 DEWGGFGMPPKFPQPTILDFLFHISHKM---SKSYEGKKSLEMALETLQKMTMGGIHDHI 311

Query: 358 GGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI 417
           G GF RYS DE+WHVPHFEKMLYDQ QLA  Y  AF +TK   YS +  DIL Y+ RD+ 
Sbjct: 312 GQGFARYSTDEKWHVPHFEKMLYDQAQLAVSYTTAFQITKHEQYSDVVHDILQYVSRDLS 371

Query: 418 GPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH---------AILFKEHY 468
              G  +SAEDADS  T  +T+K+EGAF  WT +EV+ +L +          + LF  H+
Sbjct: 372 HKLGGFYSAEDADSLPTVDSTKKREGAFCTWTQEEVKTLLDQPLDSNPDIKLSELFCWHF 431

Query: 469 YLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDV 528
            + P GN      SDPH E  G+NVLIE      +A K  + +E     L   +  LF+ 
Sbjct: 432 SVLPNGNVRPD--SDPHGELLGQNVLIEFRSKENTAKKFQITVENVEKELKIAKSILFEA 489

Query: 529 RSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESA 588
           R KRPRPHLD+K+I SWNGL+I+++ARA+  L  E                EY + A  A
Sbjct: 490 RKKRPRPHLDNKIITSWNGLMITAYARAASALNVE----------------EYKQRAIKA 533

Query: 589 ASFIRRHLYDEQTHRLQHSFRNG-------PSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
           A F++ H ++     L+  + N             GFL+DYAFLI GLLDLYE    +KW
Sbjct: 534 AEFLKTHAWNNSV-LLRSCYVNDIGDIANIEKPIAGFLNDYAFLIRGLLDLYECTLQSKW 592

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           L WA ELQ  QDELF D+E  GY++++ +DPS++LR K DHDGAEPSGNS+S +NL+RL+
Sbjct: 593 LKWADELQEQQDELFWDKEKFGYYSSSDKDPSIILRFKSDHDGAEPSGNSISALNLLRLS 652

Query: 702 SIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSV 761
            +   S+   YR   +     F  RL   + A+P +  A   L   S   V + G   + 
Sbjct: 653 ILTEKSE---YRSKIDPLFLAFAGRLSGSSSALPALVSAL-TLHCDSITSVYVTGDLDNP 708

Query: 762 DFENMLAAAHASY 774
           + E +L+A    Y
Sbjct: 709 ELEALLSAIRQRY 721


>gi|345485510|ref|XP_001604421.2| PREDICTED: spermatogenesis-associated protein 20-like [Nasonia
           vitripennis]
          Length = 797

 Score =  593 bits (1529), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 321/711 (45%), Positives = 432/711 (60%), Gaps = 50/711 (7%)

Query: 90  TPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYST 149
           T +   + +NKH N+LA E SPYLLQHA NPVDW+ WGEEA  +AR+ D  IFLS+GYST
Sbjct: 55  TSSDMGNKQNKHLNKLALEKSPYLLQHATNPVDWYPWGEEALEKARREDKLIFLSVGYST 114

Query: 150 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 209
           CHWCHVME ESFE+  VAK++N +FV+IKVDREERPD+D+VYMT++Q++ G GGWP+SVF
Sbjct: 115 CHWCHVMEKESFENPEVAKIMNRYFVNIKVDREERPDIDRVYMTFIQSISGHGGWPMSVF 174

Query: 210 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 269
           L+PDL P+ GGTYFPP DKYG+PGF  IL  +   W + +  L +SG+  ++ L +++ +
Sbjct: 175 LTPDLTPITGGTYFPPVDKYGQPGFSRILESIATKWIESKQDLLKSGSKILQVLKKSVES 234

Query: 270 SASSNKLPDE--LPQ-NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 326
                K P+E  +P  +    C +QL   ++  FGGF  APKFP+PV   ++     + +
Sbjct: 235 -----KDPEEASVPSVDCANTCVKQLINGFEPSFGGFSRAPKFPQPVNFNLLFLMYAR-D 288

Query: 327 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 386
            TG++G+  +   M + TL  MA GGIHDHVG GF RYSVD +WHVPHFEKMLYDQGQL 
Sbjct: 289 PTGETGK--QCLNMCVHTLTKMANGGIHDHVGQGFSRYSVDGKWHVPHFEKMLYDQGQLL 346

Query: 387 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 446
             Y +A+  +KD  ++ I  DI+ Y+ RD+  P G  +SAEDADS  +   T KKEGAFY
Sbjct: 347 RSYSEAYLASKDPLFAEIVNDIVTYVARDLRHPEGGFYSAEDADSFPSFEDTEKKEGAFY 406

Query: 447 VWTSKEVEDILGE---------HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 497
           VW  ++VE +L +          + LF  H+ +KP GN  + R  DPH E   +NVLI  
Sbjct: 407 VWRYEDVESLLDKVISEKEGLTLSDLFCYHFNVKPEGN--VQRQQDPHGELMNQNVLIAF 464

Query: 498 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 557
              + +A    + ++     L +    LF+ R+KRPRPHLDDK++ +WNGLVIS  + A+
Sbjct: 465 GSIAETAEHFKLSIDSVKAHLEKSISILFEERNKRPRPHLDDKIVTAWNGLVISGLSHAA 524

Query: 558 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-- 615
             L                D  +Y + AE AA FI R+LY++    L  S   G S    
Sbjct: 525 SAL----------------DNPKYTKFAEDAARFIERYLYNKDDKVLLRSCYRGDSDQIL 568

Query: 616 ------PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTG 669
                  GF  DYAF I GLLDLYE      WL +A ELQ+ QD LF D + GGYF+TT 
Sbjct: 569 QTSVPIKGFQVDYAFAIRGLLDLYEVSFNAHWLEFAEELQDIQDSLFWDDKSGGYFSTTT 628

Query: 670 EDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKD 729
           +D SV+LR+K+D DGAEPSGNSV+  NLVRLAS +   ++D     AE  L+  +  L  
Sbjct: 629 DDRSVILRLKDDQDGAEPSGNSVACGNLVRLASYL--DRTD-LSSKAEKLLSSMQEILIQ 685

Query: 730 MAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
             +A P +  A   L + S   V ++G K + D + +L    +     K V
Sbjct: 686 FPVACPELVTALVTL-IDSTTQVYIIGKKDTDDTKQLLKVLQSKLVPGKIV 735


>gi|281208328|gb|EFA82504.1| DUF255 family protein [Polysphondylium pallidum PN500]
          Length = 863

 Score =  590 bits (1521), Expect = e-165,   Method: Compositional matrix adjust.
 Identities = 304/690 (44%), Positives = 423/690 (61%), Gaps = 37/690 (5%)

Query: 93  STSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHW 152
           S+ + ++KHTNRL  E SPYLLQHAHNPVDW+ WG+EAF  A+++D  IFLS+GYSTCHW
Sbjct: 106 SSLNKQHKHTNRLINEKSPYLLQHAHNPVDWYPWGQEAFDAAKQQDKLIFLSVGYSTCHW 165

Query: 153 CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 212
           CHVME ESFEDE +AK++ND FV+IKVDREERPD+DK+YMTY+    G GGWP+SV+L+P
Sbjct: 166 CHVMERESFEDETIAKVMNDLFVNIKVDREERPDIDKIYMTYITETSGSGGWPMSVWLTP 225

Query: 213 DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASAS 272
           DL+P+ GGTYFPP  KYGR GF  I +K+   W   R  + +SGA  I  L E       
Sbjct: 226 DLRPITGGTYFPPTTKYGRGGFPDICKKISTMWKDDRKRVLESGASFITYLKE---EKPK 282

Query: 273 SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSG 332
            NK    +  + L+ C  ++ K +D  FGGF  APKFPR             L    +  
Sbjct: 283 GNK-DAAISFDTLKTCHSEIVKRFDPEFGGFSEAPKFPRTSIFNF-------LHRVHRRF 334

Query: 333 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 392
           E+    + + FTL+ M++GGI+DH+ GGFHRYSV E W VPHFEKMLYDQGQ+ +VYLDA
Sbjct: 335 ESDNTLEKLHFTLEKMSRGGIYDHLAGGFHRYSVTEDWKVPHFEKMLYDQGQIVSVYLDA 394

Query: 393 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 452
           + ++K+  +  +   +++Y+ RD+    G  +SAEDADS + +G   K EGAFYVW   E
Sbjct: 395 YQISKNEHFKDVATGVIEYVLRDLTHVDGGFYSAEDADSLDDKG--EKTEGAFYVWDYSE 452

Query: 453 VEDILGEHAIL--FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 510
           ++  + E + L  F   + + P GN  +S   DPH EF  KN++++ +     ++KL +P
Sbjct: 453 IKKAVPEESDLEIFNFIFGISPNGN--VSASEDPHGEFLDKNIIMQFHTFEECSNKLNIP 510

Query: 511 LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
           +E+    + + +  L  +R+KR RPHLDDK+I SWN L+IS+ +++              
Sbjct: 511 VEQVKQSIEKSKVSLLKLRAKRARPHLDDKIITSWNALMISALSKS-------------- 556

Query: 571 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLL 630
           F ++G  R  Y+E A+ +  FI+ +LY+ +   L  ++R GPSK  GF DDYAFLI  LL
Sbjct: 557 FQLLGEQR--YLEAAKKSVHFIKTNLYNAEKQTLIRNYREGPSKVEGFTDDYAFLIQALL 614

Query: 631 DLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGN 690
           DLYE      +L WA+ELQ  QD+LF D+EG GYF+++G D S+L R+KE+HDGAEPS  
Sbjct: 615 DLYECCFDIAYLEWAVELQAKQDKLFWDKEGHGYFSSSGLDSSILSRLKEEHDGAEPSCQ 674

Query: 691 SVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRK 750
           SV+  NL+R+ +++     D Y  NA   L      L    +  P M  +      P+  
Sbjct: 675 SVACNNLIRIGNML---HDDDYTDNALLLLESVSLYLHRAPIVFPQMVVSLANHLEPTYT 731

Query: 751 HVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
                  KSS +  ++L   H  Y  NK +
Sbjct: 732 -FSFAADKSSAELRSLLDTIHTFYMPNKVL 760


>gi|328874248|gb|EGG22614.1| DUF255 family protein [Dictyostelium fasciculatum]
          Length = 815

 Score =  586 bits (1511), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 308/662 (46%), Positives = 419/662 (63%), Gaps = 42/662 (6%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           +++TNRL  E SPYLLQHAHNPVDW+ WG EAF EA+K+D  IFLS+GYSTCHWCHVME 
Sbjct: 101 HEYTNRLINEKSPYLLQHAHNPVDWYPWGTEAFEEAKKQDKLIFLSVGYSTCHWCHVMER 160

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFE+  +A+++N+ FV+IKVDREERPD+DK+YMTY+  ++G GGWP+SV+L+PDL PL 
Sbjct: 161 ESFENPDIARIMNELFVNIKVDREERPDIDKLYMTYITEVFGHGGWPMSVWLTPDLAPLT 220

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
           GGTYF  +  +GRPGF    +++ + W K ++M    GA  I+ L E  S     N +  
Sbjct: 221 GGTYFSSKASHGRPGFGVRCQQIANIWKKDKEMAISRGASFIDYLKE--SKPKGDNNVA- 277

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
            L    +  C   ++K +DS +GGF  APKFPR       +Y+  +L   G    +SE  
Sbjct: 278 -LSNATITKCTGMITKQFDSVYGGFSDAPKFPR-----CSVYN--ELNVCG----SSEDL 325

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
           + + FTL  MA GGIHDH+GGGFHRYSV E W VPHFEKMLYDQGQ+ANVY+DA+  TK+
Sbjct: 326 EQLDFTLLKMACGGIHDHLGGGFHRYSVTEDWRVPHFEKMLYDQGQIANVYIDAYLRTKN 385

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             +  +  DIL Y++RD+    G  +SAEDADS   E    K+EGAFYVWT +E+E +LG
Sbjct: 386 PLFRQVVYDILHYVQRDLTDSQGGFYSAEDADSLNKE-TNEKQEGAFYVWTLQEIEKLLG 444

Query: 459 E--HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
                 +    + +KP+GN D S  SDPH E  GKN+L +++ +  +ASK     EK   
Sbjct: 445 SALDTEVVAYMFDVKPSGNVDPS--SDPHGELTGKNILHKVHTTEETASKFNHTPEKIEE 502

Query: 517 ILGECRRKLFDVRS-KRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
           I+   ++ L++ R+  R RPHLDDK+I +WNGL+IS+FARA ++                
Sbjct: 503 IVERSKKILYEYRTNNRVRPHLDDKIITAWNGLMISAFARAYQVF--------------- 547

Query: 576 SDRKEYMEVAESAASFIRR-HLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE 634
              KE++  A+ A  FI+  +LY E    L  ++R+GPS   GF DDYAFLI  LLDLYE
Sbjct: 548 -GEKEFLVSAQRAVEFIQSGNLYQESNQILIRNYRHGPSNVEGFSDDYAFLIQALLDLYE 606

Query: 635 FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
                  L WA++LQ  Q ELF D + GG+F T G DP++L R KE+HDGAEPS  SVS 
Sbjct: 607 ASFDESHLRWALQLQKKQIELFWDEKEGGFFTTNGRDPTLLSRQKEEHDGAEPSAQSVSS 666

Query: 695 INLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVL 754
            NL+RL++++     D + + A+ ++      L+   + +P M CA   L  P  + + +
Sbjct: 667 CNLLRLSNML---HLDEFEERAQKTMEGSSIYLEKAPLVMPQMVCALKYLIDPFYQ-ITV 722

Query: 755 VG 756
           VG
Sbjct: 723 VG 724


>gi|171910219|ref|ZP_02925689.1| hypothetical protein VspiD_03585 [Verrucomicrobium spinosum DSM
           4136]
          Length = 723

 Score =  585 bits (1509), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 314/687 (45%), Positives = 415/687 (60%), Gaps = 32/687 (4%)

Query: 90  TPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYST 149
           TP +T+    KHTN LA E SPYLLQHAHNPV+W  WGE AF +ARK D PI LSIGYST
Sbjct: 6   TPPATT---PKHTNALATEKSPYLLQHAHNPVNWLPWGEAAFEQARKADKPILLSIGYST 62

Query: 150 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 209
           CHWCHVME ESFE+E  A++LN+ F+SIKVDREERPDVD  YMTY QA+ GGGGWPL+V+
Sbjct: 63  CHWCHVMERESFENEETAQVLNEHFISIKVDREERPDVDLTYMTYAQAVSGGGGWPLNVW 122

Query: 210 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW-DKKRDMLAQSGAFAIEQLSEALS 268
           L+P+LKP   GTYFPPED+ GR GF+ +  K+ + W D +  ++ +SGA AI++L E + 
Sbjct: 123 LTPELKPFFAGTYFPPEDRGGRMGFRALCLKIAEVWKDDRAGVMERSGA-AIQKLQEYIE 181

Query: 269 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 328
                +  P +     ++   + +S ++D   GGF  APKFPRPV + ++    K L   
Sbjct: 182 DEQKHHDAPFDA---VMKKAYDDVSNAFDYHEGGFSGAPKFPRPVTLNLLGRLKKHLALK 238

Query: 329 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 388
            +  E++    M   TL CMA GGI DHVGGGFHRYSVD  WHVPH+EKMLYDQ QL   
Sbjct: 239 KEESESNWAVAMGKTTLTCMANGGIRDHVGGGFHRYSVDGYWHVPHYEKMLYDQAQLLTA 298

Query: 389 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 448
           Y++    T    ++ I R+I++Y++RD+  P G  +SAEDADS   +  T K EGAFYVW
Sbjct: 299 YVEGHQHTGLKSFAAIAREIVEYVKRDLRHPEGAFYSAEDADSYTDDTRTTKGEGAFYVW 358

Query: 449 TSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 507
            + E++++LG E   +F+  Y  +  GN      SDPH E KG N L        +A   
Sbjct: 359 KAAEIDELLGKEEGSIFRYAYGARRDGNARPE--SDPHEELKGLNTLFRAYSPKKTAEYF 416

Query: 508 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 567
            +  +K   IL   R+ LF+ R KRP PHLDDKV+ +WNGL+IS  ARA+  L       
Sbjct: 417 KLEEDKVAEILERGRKVLFEAREKRPHPHLDDKVLTAWNGLMISGLARAAGAL------- 469

Query: 568 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 627
                    +   ++E+A  +A FI  HL D+ ++ L+ S+R G S   GF  DYA LI 
Sbjct: 470 ---------NEPSFLELATQSAQFIYDHLSDKGSN-LRRSWREGVSTVHGFASDYALLIQ 519

Query: 628 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 687
           GLLDLYE G   KWL WA  LQ   +  + D E GGYF+ +   P+ +L+VKED+D AEP
Sbjct: 520 GLLDLYEAGFDVKWLQWAAALQEEFETKYGDPEKGGYFSVSKAIPNSVLQVKEDYDSAEP 579

Query: 688 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 747
           S NSV+ +NL RLA ++A    +  R+     L +F   L++    VP M  A D  S  
Sbjct: 580 SPNSVAAMNLFRLARMLA---REDLRERGAKVLRLFGKSLEESPFTVPAMVAALD-FSHY 635

Query: 748 SRKHVVLVGHKSSVDFENMLAAAHASY 774
               +VL G K    F+ +  A  + Y
Sbjct: 636 GEVEIVLAGSKDDAGFQTLATAVRSRY 662


>gi|427788829|gb|JAA59866.1| Hypothetical protein [Rhipicephalus pulchellus]
          Length = 766

 Score =  585 bits (1507), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 319/703 (45%), Positives = 425/703 (60%), Gaps = 60/703 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYLLQHA NPVDW+ WG+ AF +A+  D  IFLS+GYSTCHWCHVME ESFE
Sbjct: 20  NRLAQEKSPYLLQHASNPVDWYPWGDAAFKKAKDEDKLIFLSVGYSTCHWCHVMERESFE 79

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           ++ +AK++ND FV++KVDREERPDVD+VYMTY+QA  GGGGWP+S++L+PDLKP++GGTY
Sbjct: 80  NDDIAKIMNDNFVNVKVDREERPDVDRVYMTYIQATSGGGGWPMSIWLTPDLKPVVGGTY 139

Query: 223 FPPEDK-YGRPGFKTILRKVKDAWDKKRDMLAQSGA--FAI-EQLSE-----------AL 267
           FPP+D+ YG+PGFKT+L  + + W K R  L   G   F I EQ S+           + 
Sbjct: 140 FPPDDRYYGQPGFKTLLTSLAEQWRKNRTKLIDQGTRIFQILEQTSDVRVFGGDGVPTSP 199

Query: 268 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 327
             S ++ K P     +    C  QL +SYD   GGFG APKFP+ V +  +L +   L  
Sbjct: 200 RGSEANQKCP--FAPDVATTCYRQLERSYDVSMGGFGRAPKFPQCVNLNFLLRYRAVLLQ 257

Query: 328 TGKSGEA----SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 383
                EA     +  +M + TL+ MA+GGIHDH+G GFHRYS D +WHVPHFEKMLYDQ 
Sbjct: 258 GDPPPEAKTAVDKALEMTVHTLRMMAQGGIHDHIGKGFHRYSTDGKWHVPHFEKMLYDQA 317

Query: 384 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 443
           QL   Y +A+ +T D   + + RDIL Y+ RD+  P G  +SAEDADS    G   K+EG
Sbjct: 318 QLTRTYSEAYQVTHDRRLADVARDILCYVERDLSHPSGGFYSAEDADSYPEHGDKEKREG 377

Query: 444 AFYVWTSKEVEDILGEH---------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL 494
           AF VW   EV  +L E          A +   +Y ++ +GN D   M DPH+E K KNVL
Sbjct: 378 AFCVWEESEVYRLLTEPLPSCPTKTVADIVCRYYDIRKSGNVD--PMQDPHDELKRKNVL 435

Query: 495 IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFA 554
           I      + A+  G+ +     +L   R  LF+ R +RP+PHLDDK + SWNGL+IS FA
Sbjct: 436 IVRESKESVAACYGLEVGVLDALLERARETLFEARLRRPKPHLDDKFLTSWNGLMISGFA 495

Query: 555 RASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS-FR---- 609
            A++ L         N PV       Y++ A     FI++HLY+ +   L  S +R    
Sbjct: 496 IAARTL---------NQPV-------YLDRALKCVEFIKKHLYNPKKKTLIRSAYRGEDG 539

Query: 610 ---NGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 666
               G     G L+DYAFLI  LLD+YE       L+WA ELQ+ QD LF D++  GYF 
Sbjct: 540 SVVQGSQPIDGVLEDYAFLIQALLDVYEASFDVSCLMWAEELQDKQDRLFWDKKDMGYFL 599

Query: 667 TTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 726
           + GEDP+V+LR+K+D DGAEPS NSVS+ NLVRL+ ++   + D  RQ AE   +V+  R
Sbjct: 600 SNGEDPTVVLRLKDDQDGAEPSSNSVSLNNLVRLSVLL---QRDELRQRAEKLASVYGQR 656

Query: 727 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAA 769
           +  + +A+P M C    L     + VV+ G +     + +L+ 
Sbjct: 657 MILVPLALPEMVCGLMRLQA-GPQEVVIAGPRDDPGTKELLSC 698


>gi|449479427|ref|XP_002191427.2| PREDICTED: spermatogenesis-associated protein 20 [Taeniopygia
           guttata]
          Length = 753

 Score =  583 bits (1504), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 304/699 (43%), Positives = 411/699 (58%), Gaps = 58/699 (8%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           +HTNRL  E SPYLLQHAHNPVDW+ WG+EAF +A+  +  IFLS+GYSTCHWCHVME E
Sbjct: 41  RHTNRLINEKSPYLLQHAHNPVDWYPWGQEAFDKAKTENKLIFLSVGYSTCHWCHVMEEE 100

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SF+ + +  ++N+ FV IKVDREERPDVDKVYMT+VQA  GGGGWP+SV+L+PDLKP  G
Sbjct: 101 SFKSKEIGDIMNEHFVCIKVDREERPDVDKVYMTFVQATSGGGGWPMSVWLTPDLKPFAG 160

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           GTYFPPED     GF+T+L ++ + W + +D L  S    +E L             P  
Sbjct: 161 GTYFPPEDGVNHVGFRTVLLRIAEQWKENKDALLGSSQRILEALRHTSEIRVQGQASPPP 220

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
             +  +  C +QLS+SYD  +GGF   PKFP PV +  +  +    + T    E +   +
Sbjct: 221 -AKEVMDTCFQQLSRSYDEEYGGFSKCPKFPSPVNLNFLFTYWALHQTT---PEGARALQ 276

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           M L TL+ MA GGIHDH+G GFHRYS+D+ WHVPHFEKMLYDQGQLA +Y  AF ++ D 
Sbjct: 277 MALHTLKMMALGGIHDHIGQGFHRYSIDQHWHVPHFEKMLYDQGQLAAIYSKAFQISGDE 336

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 459
           F++ + RDIL Y+ RD+    G  +SA+DADS  T  +  K+EGAF VW +KE+  +L +
Sbjct: 337 FFADVVRDILLYVSRDLSDQAGGFYSAQDADSYPTTTSREKREGAFCVWAAKELRALLPD 396

Query: 460 H----------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 509
                      A +F  HY +K  GN D +R  DP+ E KGKNVLI       +A+K G+
Sbjct: 397 PVEGATEGTTLADVFMHHYGVKEAGNVDPAR--DPYQELKGKNVLIVRCAPELTAAKFGL 454

Query: 510 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 569
              +   +L EC+++L   R++RP+PHLD K++ +WNGL+IS FA+A   L  +      
Sbjct: 455 EPGRLSTLLQECQQRLSSARAQRPQPHLDTKMLAAWNGLMISGFAQAGAALSEQG----- 509

Query: 570 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL--------QHSFRNGPSKAPGFLDD 621
                      Y+  A  AA+F+R HL+D  + +L         +S   G     GFL+D
Sbjct: 510 -----------YVSRAAQAAAFLRTHLFDPDSGKLLRSCYQGMHNSVEQGAVPIQGFLED 558

Query: 622 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 681
           Y F+I  L DLYE      WL WA+ LQ+ QD+LF D +G  YF+T   DPS+LLR+K+D
Sbjct: 559 YVFVIQALFDLYEVSLEQGWLEWALHLQHMQDKLFWDPKGFAYFSTEASDPSLLLRLKDD 618

Query: 682 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 741
            DGAEP+ NSV+V NL               +Q     L     R+  + + VP M    
Sbjct: 619 QDGAEPAPNSVAVTNLRE------------KKQTRSEQL-----RVPMITVVVPEMLRTT 661

Query: 742 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
            +    + K VV+ G     D + ML    + +  NK +
Sbjct: 662 AVFH-HTLKQVVICGDPQGEDTKEMLHCVRSVFSPNKVL 699


>gi|193787397|dbj|BAG52603.1| unnamed protein product [Homo sapiens]
          Length = 742

 Score =  583 bits (1504), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 316/714 (44%), Positives = 432/714 (60%), Gaps = 59/714 (8%)

Query: 91  PASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTC 150
           P+ST     +  NRL  E SPYLLQHA+NPVDW+ WGEEAF +ARK + PIFLS+GYSTC
Sbjct: 10  PSSTPQ---RVPNRLIHEKSPYLLQHAYNPVDWYPWGEEAFDKARKENKPIFLSVGYSTC 66

Query: 151 HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 210
           HWCH+ME ESF++E + +LL++ FVS+KVDREERPDVDKVYMT+VQA   GGGWP++V+L
Sbjct: 67  HWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPDVDKVYMTFVQATSSGGGWPMNVWL 126

Query: 211 SPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS 270
           +P+L+P +GGTYFPPED   R GF+T+L ++++ W + +D L ++     ++++ AL A 
Sbjct: 127 TPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWKQNKDTLLENS----QRVTTALLAR 182

Query: 271 ASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 325
           +  +    +LP +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S +L
Sbjct: 183 SEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRL 242

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
              G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLYDQ QL
Sbjct: 243 TQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHVPHFEKMLYDQAQL 297

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
           A  Y  AF L+ D FYS + + IL Y+ R +    G  +SAEDADS    G  R KEGA+
Sbjct: 298 AVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSGGFYSAEDADSPPERG-QRPKEGAY 356

Query: 446 YVWTSKEVEDILGEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 495
           YVWT KEV+ +L E  +          L  +HY L   GN  +S   DP  E +G+NVL 
Sbjct: 357 YVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLT 414

Query: 496 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 555
                  +A++ G+ +E    +L     KLF  R  RP+PHLD K++ +WNGL++S +A 
Sbjct: 415 VRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGYAV 474

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP--- 612
              +L              G DR   +  A + A F++RH++D  + RL  +   GP   
Sbjct: 475 TGAVL--------------GQDR--LINYATNGAKFLKRHMFDVASGRLMRTCYTGPGGT 518

Query: 613 ---SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 667
              S  P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD LF D +GGGYF +
Sbjct: 519 VEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLEWALRLQDTQDRLFWDSQGGGYFCS 578

Query: 668 TGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 726
             E  + L LR+K+D DGAEPS NSVS  NL+RL     G K   +       L  F  R
Sbjct: 579 EAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSER 635

Query: 727 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           ++ + +A+P M  A       + K +V+ G + + D + ++   H+ Y  NK +
Sbjct: 636 MRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKDTKALVQCVHSVYIPNKVL 688


>gi|385648253|ref|NP_001245301.1| spermatogenesis-associated protein 20 isoform 2 precursor [Homo
           sapiens]
 gi|311033529|sp|Q8TB22.3|SPT20_HUMAN RecName: Full=Spermatogenesis-associated protein 20; AltName:
           Full=Sperm-specific protein 411; Short=Ssp411; Flags:
           Precursor
          Length = 786

 Score =  583 bits (1502), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 314/714 (43%), Positives = 433/714 (60%), Gaps = 59/714 (8%)

Query: 91  PASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTC 150
           P+ST     +  NRL  E SPYLLQHA+NPVDW+ WG+EAF +ARK + PIFLS+GYSTC
Sbjct: 54  PSSTPQ---RVPNRLIHEKSPYLLQHAYNPVDWYPWGQEAFDKARKENKPIFLSVGYSTC 110

Query: 151 HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 210
           HWCH+ME ESF++E + +LL++ FVS+KVDREERPDVDKVYMT+VQA   GGGWP++V+L
Sbjct: 111 HWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPDVDKVYMTFVQATSSGGGWPMNVWL 170

Query: 211 SPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS 270
           +P+L+P +GGTYFPPED   R GF+T+L ++++ W + ++ L ++     ++++ AL A 
Sbjct: 171 TPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWKQNKNTLLENS----QRVTTALLAR 226

Query: 271 ASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 325
           +  +    +LP +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S +L
Sbjct: 227 SEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRL 286

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
              G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLYDQ QL
Sbjct: 287 TQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHVPHFEKMLYDQAQL 341

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
           A  Y  AF L+ D FYS + + IL Y+ R +    G  +SAEDADS    G  R KEGA+
Sbjct: 342 AVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSGGFYSAEDADSPPERG-QRPKEGAY 400

Query: 446 YVWTSKEVEDILGEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 495
           YVWT KEV+ +L E  +          L  +HY L   GN  +S   DP  E +G+NVL 
Sbjct: 401 YVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLT 458

Query: 496 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 555
                  +A++ G+ +E    +L     KLF  R  RP+PHLD K++ +WNGL++S +A 
Sbjct: 459 VRYSLELTAARFGLDVEAVRTLLNSGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGYAV 518

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP--- 612
              +L              G DR   +  A + A F++RH++D  + RL  +   GP   
Sbjct: 519 TGAVL--------------GQDR--LINYATNGAKFLKRHMFDVASGRLMRTCYTGPGGT 562

Query: 613 ---SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 667
              S  P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD+LF D +GGGYF +
Sbjct: 563 VEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLEWALRLQDTQDKLFWDSQGGGYFCS 622

Query: 668 TGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 726
             E  + L LR+K+D DGAEPS NSVS  NL+RL     G K   +       L  F  R
Sbjct: 623 EAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSER 679

Query: 727 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           ++ + +A+P M  A       + K +V+ G + + D + ++   H+ Y  NK +
Sbjct: 680 MRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKDTKALVQCVHSVYIPNKVL 732


>gi|84040225|gb|AAI11030.1| SPATA20 protein [Homo sapiens]
 gi|119615009|gb|EAW94603.1| spermatogenesis associated 20, isoform CRA_a [Homo sapiens]
          Length = 786

 Score =  582 bits (1501), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 314/714 (43%), Positives = 432/714 (60%), Gaps = 59/714 (8%)

Query: 91  PASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTC 150
           P+ST     +  NRL  E SPYLLQHA+NPVDW+ WG+EAF +ARK + PIFLS+GYSTC
Sbjct: 54  PSSTPQ---RVPNRLIHEKSPYLLQHAYNPVDWYPWGQEAFDKARKENKPIFLSVGYSTC 110

Query: 151 HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 210
           HWCH+ME ESF++E + +LL++ FVS+KVDREERPDVDKVYMT+VQA   GGGWP++V+L
Sbjct: 111 HWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPDVDKVYMTFVQATSSGGGWPMNVWL 170

Query: 211 SPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS 270
           +P+L+P +GGTYFPPED   R GF+T+L ++++ W + ++ L ++     ++++ AL A 
Sbjct: 171 TPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWKQNKNTLLENS----QRVTTALLAR 226

Query: 271 ASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 325
           +  +    +LP +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S +L
Sbjct: 227 SEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRL 286

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
              G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLYDQ QL
Sbjct: 287 TQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHVPHFEKMLYDQAQL 341

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
           A  Y  AF L+ D FYS + + IL Y+ R +    G  +SAEDADS    G  R KEGA+
Sbjct: 342 AVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSGGFYSAEDADSPPERG-QRPKEGAY 400

Query: 446 YVWTSKEVEDILGEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 495
           YVWT KEV+ +L E  +          L  +HY L   GN  +S   DP  E +G+NVL 
Sbjct: 401 YVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLT 458

Query: 496 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 555
                  +A++ G+ +E    +L     KLF  R  RP+PHLD K++ +WNGL++S +A 
Sbjct: 459 VRYSLELTAARFGLDVEAVRTLLNSGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGYAV 518

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP--- 612
              +L              G DR   +  A + A F++RH++D  + RL  +   GP   
Sbjct: 519 TGAVL--------------GQDR--LINYATNGAKFLKRHMFDVASGRLMRTCYTGPGGT 562

Query: 613 ---SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 667
              S  P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD LF D +GGGYF +
Sbjct: 563 VEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLEWALRLQDTQDRLFWDSQGGGYFCS 622

Query: 668 TGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 726
             E  + L LR+K+D DGAEPS NSVS  NL+RL     G K   +       L  F  R
Sbjct: 623 EAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSER 679

Query: 727 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           ++ + +A+P M  A       + K +V+ G + + D + ++   H+ Y  NK +
Sbjct: 680 MRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKDTKALVQCVHSVYIPNKVL 732


>gi|134085853|ref|NP_001076876.1| spermatogenesis-associated protein 20 [Bos taurus]
 gi|133777605|gb|AAI23690.1| SPATA20 protein [Bos taurus]
 gi|296476477|tpg|DAA18592.1| TPA: spermatogenesis associated 20 [Bos taurus]
          Length = 789

 Score =  582 bits (1501), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 324/759 (42%), Positives = 441/759 (58%), Gaps = 71/759 (9%)

Query: 60  RNYLYPFRRPLAVISHRPIH--------------PYKVVAMAERTPASTSHSRNKHTNRL 105
           R +L P   P+  +S+R                 P        RT  S S +  K  NRL
Sbjct: 10  RGFLLPGAGPVLALSYRGSSARDKDRSVTVSSSVPMPAGGKGSRTNCSQS-TPQKVPNRL 68

Query: 106 AAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFEDEG 165
             E SPYLLQHA+NPVDW+ WG+EAF +A+K + PIFLS+GYSTCHWCH+ME ESF++E 
Sbjct: 69  INEKSPYLLQHAYNPVDWYPWGQEAFDKAKKENKPIFLSVGYSTCHWCHMMEEESFQNEE 128

Query: 166 VAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPP 225
           + +LL++ FVS+KVDREERPDVDKVYMT+VQA   GGGWP+SV+L+PDL+P +GGTYFPP
Sbjct: 129 IGRLLSEDFVSVKVDREERPDVDKVYMTFVQATSSGGGWPMSVWLTPDLQPFVGGTYFPP 188

Query: 226 EDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNAL 285
           ED   R GF+T+L +++D W + +  L ++     ++++ AL A ++ +    +LP +A 
Sbjct: 189 EDGLTRVGFRTVLMRIRDQWKQNKSTLLENS----QRVTTALLARSAISMGDRQLPPSAA 244

Query: 286 RL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKM 340
            +   C +QL + YD  +GGF  APKFP PV +  +   + S +L   G     S  Q+M
Sbjct: 245 TMNSRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQM 299

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
            L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLYDQ QL   Y  AF ++ D F
Sbjct: 300 ALHTLKMMANGGIRDHVGQGFHRYSTDRQWHVPHFEKMLYDQAQLTVAYSQAFQISGDEF 359

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
           YS + + IL Y+ R++    G  +SAEDADS    G  R KEGAFYVWT KEV+ +L E 
Sbjct: 360 YSEVAKGILQYVVRNLSHRSGGFYSAEDADSPPERG-MRPKEGAFYVWTVKEVQHLLPEP 418

Query: 461 AI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 510
            +          L  +HY L   GN  +S   DP  E +G+NVL        +A++ G+ 
Sbjct: 419 VLGATEPLTSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLD 476

Query: 511 LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
           +E    +L     KLF  R  RP+PHLD K++ +WNGL++S FA    +L  E    + N
Sbjct: 477 VEAVRTLLNSGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGFAVTGAVLGQE---RVIN 533

Query: 571 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDY 622
           + + G             A F++RH++D  + RL  +   G       S  P  GFL+DY
Sbjct: 534 YAING-------------AKFLKRHMFDVASGRLMRTCYAGSGGTVEHSNPPCWGFLEDY 580

Query: 623 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKED 681
           AF++ GLLDLYE    + WL WA+ LQ+TQD LF D  GGGYF +  E  + L LR+K+D
Sbjct: 581 AFVVRGLLDLYEASQESAWLEWALRLQDTQDRLFWDSRGGGYFCSEAELGAGLPLRLKDD 640

Query: 682 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 741
            DGAEPS NSVS  NL+RL     G K   +       L  F  R++ + +A+P M  A 
Sbjct: 641 QDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRAL 697

Query: 742 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
                 + K +V+ G   + D + +L   H+ Y  NK +
Sbjct: 698 SA-HQQTLKQIVICGDPQAKDTKALLQCVHSIYIPNKVL 735


>gi|41351283|gb|AAH65526.1| SPATA20 protein [Homo sapiens]
          Length = 742

 Score =  582 bits (1500), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 315/714 (44%), Positives = 432/714 (60%), Gaps = 59/714 (8%)

Query: 91  PASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTC 150
           P+ST     +  NRL  E SPYLLQHA+NPVDW+ WGEEAF +ARK + PIFLS+GYSTC
Sbjct: 10  PSSTPQ---RVPNRLIHEKSPYLLQHAYNPVDWYPWGEEAFDKARKENKPIFLSVGYSTC 66

Query: 151 HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 210
           HWCH+ME ESF++E + +LL++ FVS+KVDREERPDVDKVYMT+VQA   GGGWP++V+L
Sbjct: 67  HWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPDVDKVYMTFVQATSSGGGWPMNVWL 126

Query: 211 SPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS 270
           +P+L+P +GGTYFPPED   R GF+T+L ++++ W + ++ L ++     ++++ AL A 
Sbjct: 127 TPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWKQNKNTLLENS----QRVTTALLAR 182

Query: 271 ASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 325
           +  +    +LP +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S +L
Sbjct: 183 SEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRL 242

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
              G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLYDQ QL
Sbjct: 243 TQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHVPHFEKMLYDQAQL 297

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
           A  Y  AF L+ D FYS + + IL Y+ R +    G  +SAEDADS    G  R KEGA+
Sbjct: 298 AVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSGGFYSAEDADSPPERG-QRPKEGAY 356

Query: 446 YVWTSKEVEDILGEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 495
           YVWT KEV+ +L E  +          L  +HY L   GN  +S   DP  E +G+NVL 
Sbjct: 357 YVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLT 414

Query: 496 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 555
                  +A++ G+ +E    +L     KLF  R  RP+PHLD K++ +WNGL++S +A 
Sbjct: 415 VRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGYAV 474

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP--- 612
              +L              G DR   +  A + A F++RH++D  + RL  +   GP   
Sbjct: 475 TGAVL--------------GQDR--LINYATNGAKFLKRHMFDVASGRLMRTCYTGPGGT 518

Query: 613 ---SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 667
              S  P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD LF D +GGGYF +
Sbjct: 519 VEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLEWALRLQDTQDRLFWDSQGGGYFCS 578

Query: 668 TGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 726
             E  + L LR+K+D DGAEPS NSVS  NL+RL     G K   +       L  F  R
Sbjct: 579 EAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSER 635

Query: 727 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           ++ + +A+P M  A       + K +V+ G + + D + ++   H+ Y  NK +
Sbjct: 636 MRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKDTKALVQCVHSVYIPNKVL 688


>gi|385648255|ref|NP_001245302.1| spermatogenesis-associated protein 20 isoform 3 [Homo sapiens]
          Length = 742

 Score =  582 bits (1500), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 314/714 (43%), Positives = 433/714 (60%), Gaps = 59/714 (8%)

Query: 91  PASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTC 150
           P+ST     +  NRL  E SPYLLQHA+NPVDW+ WG+EAF +ARK + PIFLS+GYSTC
Sbjct: 10  PSSTPQ---RVPNRLIHEKSPYLLQHAYNPVDWYPWGQEAFDKARKENKPIFLSVGYSTC 66

Query: 151 HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 210
           HWCH+ME ESF++E + +LL++ FVS+KVDREERPDVDKVYMT+VQA   GGGWP++V+L
Sbjct: 67  HWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPDVDKVYMTFVQATSSGGGWPMNVWL 126

Query: 211 SPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS 270
           +P+L+P +GGTYFPPED   R GF+T+L ++++ W + ++ L ++     ++++ AL A 
Sbjct: 127 TPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWKQNKNTLLENS----QRVTTALLAR 182

Query: 271 ASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 325
           +  +    +LP +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S +L
Sbjct: 183 SEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRL 242

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
              G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLYDQ QL
Sbjct: 243 TQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHVPHFEKMLYDQAQL 297

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
           A  Y  AF L+ D FYS + + IL Y+ R +    G  +SAEDADS    G  R KEGA+
Sbjct: 298 AVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSGGFYSAEDADSPPERG-QRPKEGAY 356

Query: 446 YVWTSKEVEDILGEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 495
           YVWT KEV+ +L E  +          L  +HY L   GN  +S   DP  E +G+NVL 
Sbjct: 357 YVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLT 414

Query: 496 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 555
                  +A++ G+ +E    +L     KLF  R  RP+PHLD K++ +WNGL++S +A 
Sbjct: 415 VRYSLELTAARFGLDVEAVRTLLNSGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGYAV 474

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP--- 612
              +L              G DR   +  A + A F++RH++D  + RL  +   GP   
Sbjct: 475 TGAVL--------------GQDR--LINYATNGAKFLKRHMFDVASGRLMRTCYTGPGGT 518

Query: 613 ---SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 667
              S  P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD+LF D +GGGYF +
Sbjct: 519 VEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLEWALRLQDTQDKLFWDSQGGGYFCS 578

Query: 668 TGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 726
             E  + L LR+K+D DGAEPS NSVS  NL+RL     G K   +       L  F  R
Sbjct: 579 EAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSER 635

Query: 727 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           ++ + +A+P M  A       + K +V+ G + + D + ++   H+ Y  NK +
Sbjct: 636 MRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKDTKALVQCVHSVYIPNKVL 688


>gi|158257042|dbj|BAF84494.1| unnamed protein product [Homo sapiens]
          Length = 742

 Score =  582 bits (1499), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 315/714 (44%), Positives = 431/714 (60%), Gaps = 59/714 (8%)

Query: 91  PASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTC 150
           P+ST     +  NRL  E SPYLLQHA+NPVDW+ WGEEAF +ARK   PIFLS+GYSTC
Sbjct: 10  PSSTPQ---RVPNRLIHEKSPYLLQHAYNPVDWYPWGEEAFDKARKESKPIFLSVGYSTC 66

Query: 151 HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 210
           HWCH+ME ESF++E + +LL++ FVS+KVDREERPDVDKVYMT+VQA   GGGWP++V+L
Sbjct: 67  HWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPDVDKVYMTFVQATSSGGGWPMNVWL 126

Query: 211 SPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS 270
           +P+L+P +GGTYFPPED   R GF+T+L ++++ W + ++ L ++     ++++ AL A 
Sbjct: 127 TPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWKQNKNTLLENS----QRVTTALLAR 182

Query: 271 ASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 325
           +  +    +LP +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S +L
Sbjct: 183 SEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRL 242

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
              G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLYDQ QL
Sbjct: 243 TQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHVPHFEKMLYDQAQL 297

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
           A  Y  AF L+ D FYS + + IL Y+ R +    G  +SAEDADS    G  R KEGA+
Sbjct: 298 AVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSGGFYSAEDADSPPERG-QRPKEGAY 356

Query: 446 YVWTSKEVEDILGEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 495
           YVWT KEV+ +L E  +          L  +HY L   GN  +S   DP  E +G+NVL 
Sbjct: 357 YVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLT 414

Query: 496 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 555
                  +A++ G+ +E    +L     KLF  R  RP+PHLD K++ +WNGL++S +A 
Sbjct: 415 VRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGYAV 474

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP--- 612
              +L              G DR   +  A + A F++RH++D  + RL  +   GP   
Sbjct: 475 TGAVL--------------GQDR--LINYATNGAKFLKRHMFDVASGRLMRTCYTGPGGT 518

Query: 613 ---SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 667
              S  P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD LF D +GGGYF +
Sbjct: 519 VEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLEWALRLQDTQDRLFWDSQGGGYFCS 578

Query: 668 TGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 726
             E  + L LR+K+D DGAEPS NSVS  NL+RL     G K   +       L  F  R
Sbjct: 579 EAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSER 635

Query: 727 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           ++ + +A+P M  A       + K +V+ G + + D + ++   H+ Y  NK +
Sbjct: 636 MRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKDTKALVQCVHSVYIPNKVL 688


>gi|440910483|gb|ELR60277.1| Spermatogenesis-associated protein 20 [Bos grunniens mutus]
          Length = 789

 Score =  582 bits (1499), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 324/759 (42%), Positives = 441/759 (58%), Gaps = 71/759 (9%)

Query: 60  RNYLYPFRRPLAVISHRPIH--------------PYKVVAMAERTPASTSHSRNKHTNRL 105
           R +L P   P+  +S+R                 P        RT  S S +  K  NRL
Sbjct: 10  RGFLLPGAGPVLALSYRGSSARDKDRSVTVSSSVPMPAGGKGSRTNCSQS-TPQKVPNRL 68

Query: 106 AAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFEDEG 165
             E SPYLLQHA+NPVDW+ WG+EAF +A+K + PIFLS+GYSTCHWCH+ME ESF++E 
Sbjct: 69  INEKSPYLLQHAYNPVDWYPWGQEAFDKAKKENKPIFLSVGYSTCHWCHMMEEESFQNEE 128

Query: 166 VAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPP 225
           + +LL++ FVS+KVDREERPDVDKVYMT+VQA   GGGWP+SV+L+PDL+P +GGTYFPP
Sbjct: 129 IGRLLSEDFVSVKVDREERPDVDKVYMTFVQATSSGGGWPMSVWLTPDLQPFVGGTYFPP 188

Query: 226 EDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNAL 285
           ED   R GF+T+L +++D W + +  L ++     ++++ AL A ++ +    +LP +A 
Sbjct: 189 EDGLTRVGFRTVLMRIRDQWKQNKSTLLENS----QRVTTALLARSAISMGDRQLPPSAA 244

Query: 286 RL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKM 340
            +   C +QL + YD  +GGF  APKFP PV +  +   + S +L   G     S  Q+M
Sbjct: 245 TMNSRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQM 299

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
            L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLYDQ QL   Y  AF ++ D F
Sbjct: 300 ALHTLKMMANGGIRDHVGQGFHRYSTDRQWHVPHFEKMLYDQAQLTVAYSQAFQISGDEF 359

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
           YS + + IL Y+ R++    G  +SAEDADS    G  R KEGAFYVWT KEV+ +L E 
Sbjct: 360 YSEVAKGILQYVVRNLSHRSGGFYSAEDADSPPERG-MRPKEGAFYVWTVKEVQHLLPEP 418

Query: 461 AI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 510
            +          L  +HY L   GN  +S   DP  E +G+NVL        +A++ G+ 
Sbjct: 419 VLGATEPLTSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLD 476

Query: 511 LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
           +E    +L     KLF  R  RP+PHLD K++ +WNGL++S FA    +L  E    + N
Sbjct: 477 VEAVRTLLNSGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGFAVTGAVLGQE---RVIN 533

Query: 571 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDY 622
           + + G             A F++RH++D  + RL  +   G       S  P  GFL+DY
Sbjct: 534 YAING-------------AKFLKRHMFDVASGRLMRTCYAGSGGTVEHSNPPCWGFLEDY 580

Query: 623 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKED 681
           AF++ GLLDLYE    + WL WA+ LQ+TQD LF D  GGGYF +  E  + L LR+K+D
Sbjct: 581 AFVVRGLLDLYEASQESAWLEWALRLQDTQDRLFWDSRGGGYFCSEAELGAGLPLRLKDD 640

Query: 682 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 741
            DGAEPS NSVS  NL+RL     G K   +       L  F  R++ + +A+P M  A 
Sbjct: 641 QDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRAL 697

Query: 742 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
                 + K +V+ G   + D + +L   H+ Y  NK +
Sbjct: 698 SA-HQQTLKQIVICGDPQAKDTKALLQCVHSIYIPNKVL 735


>gi|119615011|gb|EAW94605.1| spermatogenesis associated 20, isoform CRA_c [Homo sapiens]
          Length = 742

 Score =  582 bits (1499), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 314/714 (43%), Positives = 432/714 (60%), Gaps = 59/714 (8%)

Query: 91  PASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTC 150
           P+ST     +  NRL  E SPYLLQHA+NPVDW+ WG+EAF +ARK + PIFLS+GYSTC
Sbjct: 10  PSSTPQ---RVPNRLIHEKSPYLLQHAYNPVDWYPWGQEAFDKARKENKPIFLSVGYSTC 66

Query: 151 HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 210
           HWCH+ME ESF++E + +LL++ FVS+KVDREERPDVDKVYMT+VQA   GGGWP++V+L
Sbjct: 67  HWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPDVDKVYMTFVQATSSGGGWPMNVWL 126

Query: 211 SPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS 270
           +P+L+P +GGTYFPPED   R GF+T+L ++++ W + ++ L ++     ++++ AL A 
Sbjct: 127 TPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWKQNKNTLLENS----QRVTTALLAR 182

Query: 271 ASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 325
           +  +    +LP +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S +L
Sbjct: 183 SEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRL 242

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
              G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLYDQ QL
Sbjct: 243 TQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHVPHFEKMLYDQAQL 297

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
           A  Y  AF L+ D FYS + + IL Y+ R +    G  +SAEDADS    G  R KEGA+
Sbjct: 298 AVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSGGFYSAEDADSPPERG-QRPKEGAY 356

Query: 446 YVWTSKEVEDILGEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 495
           YVWT KEV+ +L E  +          L  +HY L   GN  +S   DP  E +G+NVL 
Sbjct: 357 YVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLT 414

Query: 496 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 555
                  +A++ G+ +E    +L     KLF  R  RP+PHLD K++ +WNGL++S +A 
Sbjct: 415 VRYSLELTAARFGLDVEAVRTLLNSGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGYAV 474

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP--- 612
              +L              G DR   +  A + A F++RH++D  + RL  +   GP   
Sbjct: 475 TGAVL--------------GQDR--LINYATNGAKFLKRHMFDVASGRLMRTCYTGPGGT 518

Query: 613 ---SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 667
              S  P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD LF D +GGGYF +
Sbjct: 519 VEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLEWALRLQDTQDRLFWDSQGGGYFCS 578

Query: 668 TGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 726
             E  + L LR+K+D DGAEPS NSVS  NL+RL     G K   +       L  F  R
Sbjct: 579 EAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSER 635

Query: 727 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           ++ + +A+P M  A       + K +V+ G + + D + ++   H+ Y  NK +
Sbjct: 636 MRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKDTKALVQCVHSVYIPNKVL 688


>gi|31542723|ref|NP_073738.2| spermatogenesis-associated protein 20 isoform 1 precursor [Homo
           sapiens]
 gi|19263653|gb|AAH25255.1| Spermatogenesis associated 20 [Homo sapiens]
          Length = 802

 Score =  581 bits (1498), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 314/714 (43%), Positives = 433/714 (60%), Gaps = 59/714 (8%)

Query: 91  PASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTC 150
           P+ST     +  NRL  E SPYLLQHA+NPVDW+ WG+EAF +ARK + PIFLS+GYSTC
Sbjct: 70  PSSTPQ---RVPNRLIHEKSPYLLQHAYNPVDWYPWGQEAFDKARKENKPIFLSVGYSTC 126

Query: 151 HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 210
           HWCH+ME ESF++E + +LL++ FVS+KVDREERPDVDKVYMT+VQA   GGGWP++V+L
Sbjct: 127 HWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPDVDKVYMTFVQATSSGGGWPMNVWL 186

Query: 211 SPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS 270
           +P+L+P +GGTYFPPED   R GF+T+L ++++ W + ++ L ++     ++++ AL A 
Sbjct: 187 TPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWKQNKNTLLENS----QRVTTALLAR 242

Query: 271 ASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 325
           +  +    +LP +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S +L
Sbjct: 243 SEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRL 302

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
              G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLYDQ QL
Sbjct: 303 TQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHVPHFEKMLYDQAQL 357

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
           A  Y  AF L+ D FYS + + IL Y+ R +    G  +SAEDADS    G  R KEGA+
Sbjct: 358 AVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSGGFYSAEDADSPPERG-QRPKEGAY 416

Query: 446 YVWTSKEVEDILGEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 495
           YVWT KEV+ +L E  +          L  +HY L   GN  +S   DP  E +G+NVL 
Sbjct: 417 YVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLT 474

Query: 496 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 555
                  +A++ G+ +E    +L     KLF  R  RP+PHLD K++ +WNGL++S +A 
Sbjct: 475 VRYSLELTAARFGLDVEAVRTLLNSGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGYAV 534

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP--- 612
              +L              G DR   +  A + A F++RH++D  + RL  +   GP   
Sbjct: 535 TGAVL--------------GQDR--LINYATNGAKFLKRHMFDVASGRLMRTCYTGPGGT 578

Query: 613 ---SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 667
              S  P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD+LF D +GGGYF +
Sbjct: 579 VEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLEWALRLQDTQDKLFWDSQGGGYFCS 638

Query: 668 TGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 726
             E  + L LR+K+D DGAEPS NSVS  NL+RL     G K   +       L  F  R
Sbjct: 639 EAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSER 695

Query: 727 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           ++ + +A+P M  A       + K +V+ G + + D + ++   H+ Y  NK +
Sbjct: 696 MRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKDTKALVQCVHSVYIPNKVL 748


>gi|426347559|ref|XP_004041417.1| PREDICTED: spermatogenesis-associated protein 20 isoform 3 [Gorilla
           gorilla gorilla]
          Length = 786

 Score =  581 bits (1497), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 311/714 (43%), Positives = 432/714 (60%), Gaps = 59/714 (8%)

Query: 91  PASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTC 150
           P+ST     +  NRL  E SPYLLQHA+NPVDW+ WG+EAF +ARK + PIFLS+GYSTC
Sbjct: 54  PSSTPQ---RVPNRLIHEKSPYLLQHAYNPVDWYPWGQEAFDKARKENKPIFLSVGYSTC 110

Query: 151 HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 210
           HWCH+ME ESF++E + +LL++ FVS+KVDREERPDVDKVYMT+VQA   GGGWP++V+L
Sbjct: 111 HWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPDVDKVYMTFVQATSSGGGWPMNVWL 170

Query: 211 SPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS 270
           +P+L+P +GGTYFPPED   R GF+T+L ++++ W + ++ L ++     ++++ AL A 
Sbjct: 171 TPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWKQNKNTLLENS----QRVTTALLAR 226

Query: 271 ASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 325
           +  +    +LP +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S +L
Sbjct: 227 SEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRL 286

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
              G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +WH+PHFEKMLYDQ QL
Sbjct: 287 TQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHIPHFEKMLYDQAQL 341

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
           A  Y  AF ++ D FYS + + IL Y+ + +    G  +SAEDADS    G  R KEGA+
Sbjct: 342 AVAYSQAFQISGDEFYSDVAKGILQYVAQSLSHRSGGFYSAEDADSPPERG-LRPKEGAY 400

Query: 446 YVWTSKEVEDILGEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 495
           YVWT KEV+ +L E  +          L  +HY L   GN  +S   DP  E +G+NVL 
Sbjct: 401 YVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLT 458

Query: 496 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 555
                  +A++ G+ +E    +L     KLF  R  RP+PHLD K++ +WNGL++S +A 
Sbjct: 459 VRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGYAV 518

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP--- 612
              +L              G DR   +  A + A F++RH++D  + RL  +    P   
Sbjct: 519 TGAVL--------------GQDR--LINYATNGAKFLKRHMFDVASGRLMRTCYTSPGGT 562

Query: 613 ---SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 667
              S  P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD LF D +GGGYF +
Sbjct: 563 VDHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLEWALRLQDTQDRLFWDSQGGGYFCS 622

Query: 668 TGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 726
             E  + L LR+K+D DGAEPS NSVS  NL+RL     G K   +       L  F  R
Sbjct: 623 EAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSER 679

Query: 727 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           ++ + +A+P M CA       + K +V+ G + + D + ++   H+ Y  NK +
Sbjct: 680 MRRVPVALPEMVCALSA-QQQTLKQIVICGDRQAKDTKALVQCVHSVYIPNKVL 732


>gi|119615010|gb|EAW94604.1| spermatogenesis associated 20, isoform CRA_b [Homo sapiens]
          Length = 802

 Score =  581 bits (1497), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 314/714 (43%), Positives = 432/714 (60%), Gaps = 59/714 (8%)

Query: 91  PASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTC 150
           P+ST     +  NRL  E SPYLLQHA+NPVDW+ WG+EAF +ARK + PIFLS+GYSTC
Sbjct: 70  PSSTPQ---RVPNRLIHEKSPYLLQHAYNPVDWYPWGQEAFDKARKENKPIFLSVGYSTC 126

Query: 151 HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 210
           HWCH+ME ESF++E + +LL++ FVS+KVDREERPDVDKVYMT+VQA   GGGWP++V+L
Sbjct: 127 HWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPDVDKVYMTFVQATSSGGGWPMNVWL 186

Query: 211 SPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS 270
           +P+L+P +GGTYFPPED   R GF+T+L ++++ W + ++ L ++     ++++ AL A 
Sbjct: 187 TPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWKQNKNTLLENS----QRVTTALLAR 242

Query: 271 ASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 325
           +  +    +LP +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S +L
Sbjct: 243 SEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRL 302

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
              G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLYDQ QL
Sbjct: 303 TQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHVPHFEKMLYDQAQL 357

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
           A  Y  AF L+ D FYS + + IL Y+ R +    G  +SAEDADS    G  R KEGA+
Sbjct: 358 AVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSGGFYSAEDADSPPERG-QRPKEGAY 416

Query: 446 YVWTSKEVEDILGEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 495
           YVWT KEV+ +L E  +          L  +HY L   GN  +S   DP  E +G+NVL 
Sbjct: 417 YVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLT 474

Query: 496 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 555
                  +A++ G+ +E    +L     KLF  R  RP+PHLD K++ +WNGL++S +A 
Sbjct: 475 VRYSLELTAARFGLDVEAVRTLLNSGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGYAV 534

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP--- 612
              +L              G DR   +  A + A F++RH++D  + RL  +   GP   
Sbjct: 535 TGAVL--------------GQDR--LINYATNGAKFLKRHMFDVASGRLMRTCYTGPGGT 578

Query: 613 ---SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 667
              S  P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD LF D +GGGYF +
Sbjct: 579 VEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLEWALRLQDTQDRLFWDSQGGGYFCS 638

Query: 668 TGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 726
             E  + L LR+K+D DGAEPS NSVS  NL+RL     G K   +       L  F  R
Sbjct: 639 EAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSER 695

Query: 727 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           ++ + +A+P M  A       + K +V+ G + + D + ++   H+ Y  NK +
Sbjct: 696 MRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKDTKALVQCVHSVYIPNKVL 748


>gi|403279582|ref|XP_003931326.1| PREDICTED: spermatogenesis-associated protein 20 [Saimiri
           boliviensis boliviensis]
          Length = 742

 Score =  581 bits (1497), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 314/719 (43%), Positives = 433/719 (60%), Gaps = 61/719 (8%)

Query: 91  PASTSHSRNKHT-----NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSI 145
           PA    SR+  T     NRL  E SPYLLQHA+NPVDW+ WG+EAF +ARK + PIFLS+
Sbjct: 2   PAGGKGSRSSSTPQRVPNRLIHEKSPYLLQHAYNPVDWYPWGQEAFDKARKENKPIFLSV 61

Query: 146 GYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWP 205
           GYSTCHWCH+ME ESF++E + +LL++ FVS+KVDREERPDVDKVYMT+VQA   GGGWP
Sbjct: 62  GYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPDVDKVYMTFVQATSSGGGWP 121

Query: 206 LSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 265
           ++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W + ++ L ++     ++++ 
Sbjct: 122 MNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWKQNKNALLENS----QRVTT 177

Query: 266 ALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--Y 320
           AL A +  +    +LP +A  +   C +QL + YD  +GGF  APKFP PV +  +   +
Sbjct: 178 ALLARSEISMGDRQLPPSAATMNSRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYW 237

Query: 321 HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 380
            S +L   G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLY
Sbjct: 238 LSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHVPHFEKMLY 292

Query: 381 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 440
           DQ QLA  Y  AF ++ D FYS + +DIL Y+ R +    G  +SAEDADS    G  R 
Sbjct: 293 DQAQLAVAYSQAFQISGDEFYSDVAKDILQYVTRSLSHRSGGFYSAEDADSPPERG-MRP 351

Query: 441 KEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKG 490
           KEGA+YVWT+ EV+ +L E  +          LF +HY L   GN  +S   DP  E +G
Sbjct: 352 KEGAYYVWTANEVQQLLPEPVLGATEPLTSGQLFMKHYGLTEAGN--ISSSQDPKGELQG 409

Query: 491 KNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVI 550
           +NVL        +A++ G+ +E    +L     KLF  R  RP+PHLD K++ +WNGL++
Sbjct: 410 QNVLTVRYSLELTAARFGLDVEGVRTLLNTGLEKLFQARKHRPKPHLDSKMLAAWNGLMV 469

Query: 551 SSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRN 610
           S +A    +L              G DR   +  A + A F++RH++D  + RL  +   
Sbjct: 470 SGYAVTGAVL--------------GQDR--LINYATNGAKFLKRHMFDVASGRLMRTCYT 513

Query: 611 GP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG 662
                   S  P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD LF D +GG
Sbjct: 514 SSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLEWALRLQDTQDRLFWDSQGG 573

Query: 663 GYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 721
           GYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL     G K   +       L 
Sbjct: 574 GYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLT 630

Query: 722 VFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
            F  R++ + +A+P M  A       + K +V+ G + + D + ++   H+ Y  NK +
Sbjct: 631 AFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKDTKALVQCVHSIYIPNKVL 688


>gi|426347561|ref|XP_004041418.1| PREDICTED: spermatogenesis-associated protein 20 isoform 4 [Gorilla
           gorilla gorilla]
          Length = 786

 Score =  581 bits (1497), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 311/714 (43%), Positives = 432/714 (60%), Gaps = 59/714 (8%)

Query: 91  PASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTC 150
           P+ST     +  NRL  E SPYLLQHA+NPVDW+ WG+EAF +ARK + PIFLS+GYSTC
Sbjct: 54  PSSTPQ---RVPNRLIHEKSPYLLQHAYNPVDWYPWGQEAFDKARKENKPIFLSVGYSTC 110

Query: 151 HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 210
           HWCH+ME ESF++E + +LL++ FVS+KVDREERPDVDKVYMT+VQA   GGGWP++V+L
Sbjct: 111 HWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPDVDKVYMTFVQATSSGGGWPMNVWL 170

Query: 211 SPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS 270
           +P+L+P +GGTYFPPED   R GF+T+L ++++ W + ++ L ++     ++++ AL A 
Sbjct: 171 TPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWKQNKNTLLENS----QRVTTALLAR 226

Query: 271 ASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 325
           +  +    +LP +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S +L
Sbjct: 227 SEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRL 286

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
              G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +WH+PHFEKMLYDQ QL
Sbjct: 287 TQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHIPHFEKMLYDQAQL 341

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
           A  Y  AF ++ D FYS + + IL Y+ + +    G  +SAEDADS    G  R KEGA+
Sbjct: 342 AVAYSQAFQISGDEFYSDVAKGILQYVAQSLSHRSGGFYSAEDADSPPERG-LRPKEGAY 400

Query: 446 YVWTSKEVEDILGEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 495
           YVWT KEV+ +L E  +          L  +HY L   GN  +S   DP  E +G+NVL 
Sbjct: 401 YVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLT 458

Query: 496 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 555
                  +A++ G+ +E    +L     KLF  R  RP+PHLD K++ +WNGL++S +A 
Sbjct: 459 VRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGYAV 518

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP--- 612
              +L              G DR   +  A + A F++RH++D  + RL  +    P   
Sbjct: 519 TGAVL--------------GQDR--LINYATNGAKFLKRHMFDVASGRLMRTCYTSPGGT 562

Query: 613 ---SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 667
              S  P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD LF D +GGGYF +
Sbjct: 563 VDHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLEWALRLQDTQDRLFWDSQGGGYFCS 622

Query: 668 TGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 726
             E  + L LR+K+D DGAEPS NSVS  NL+RL     G K   +       L  F  R
Sbjct: 623 EAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSER 679

Query: 727 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           ++ + +A+P M CA       + K +V+ G + + D + ++   H+ Y  NK +
Sbjct: 680 MRRVPVALPEMVCALSA-QQQTLKQIVICGDRQAKDTKALVQCVHSVYIPNKVL 732


>gi|426347555|ref|XP_004041415.1| PREDICTED: spermatogenesis-associated protein 20 isoform 1 [Gorilla
           gorilla gorilla]
          Length = 742

 Score =  580 bits (1496), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 311/714 (43%), Positives = 432/714 (60%), Gaps = 59/714 (8%)

Query: 91  PASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTC 150
           P+ST     +  NRL  E SPYLLQHA+NPVDW+ WG+EAF +ARK + PIFLS+GYSTC
Sbjct: 10  PSSTPQ---RVPNRLIHEKSPYLLQHAYNPVDWYPWGQEAFDKARKENKPIFLSVGYSTC 66

Query: 151 HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 210
           HWCH+ME ESF++E + +LL++ FVS+KVDREERPDVDKVYMT+VQA   GGGWP++V+L
Sbjct: 67  HWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPDVDKVYMTFVQATSSGGGWPMNVWL 126

Query: 211 SPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS 270
           +P+L+P +GGTYFPPED   R GF+T+L ++++ W + ++ L ++     ++++ AL A 
Sbjct: 127 TPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWKQNKNTLLENS----QRVTTALLAR 182

Query: 271 ASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 325
           +  +    +LP +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S +L
Sbjct: 183 SEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRL 242

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
              G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +WH+PHFEKMLYDQ QL
Sbjct: 243 TQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHIPHFEKMLYDQAQL 297

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
           A  Y  AF ++ D FYS + + IL Y+ + +    G  +SAEDADS    G  R KEGA+
Sbjct: 298 AVAYSQAFQISGDEFYSDVAKGILQYVAQSLSHRSGGFYSAEDADSPPERG-LRPKEGAY 356

Query: 446 YVWTSKEVEDILGEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 495
           YVWT KEV+ +L E  +          L  +HY L   GN  +S   DP  E +G+NVL 
Sbjct: 357 YVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLT 414

Query: 496 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 555
                  +A++ G+ +E    +L     KLF  R  RP+PHLD K++ +WNGL++S +A 
Sbjct: 415 VRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGYAV 474

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP--- 612
              +L              G DR   +  A + A F++RH++D  + RL  +    P   
Sbjct: 475 TGAVL--------------GQDR--LINYATNGAKFLKRHMFDVASGRLMRTCYTSPGGT 518

Query: 613 ---SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 667
              S  P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD LF D +GGGYF +
Sbjct: 519 VDHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLEWALRLQDTQDRLFWDSQGGGYFCS 578

Query: 668 TGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 726
             E  + L LR+K+D DGAEPS NSVS  NL+RL     G K   +       L  F  R
Sbjct: 579 EAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSER 635

Query: 727 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           ++ + +A+P M CA       + K +V+ G + + D + ++   H+ Y  NK +
Sbjct: 636 MRRVPVALPEMVCALSA-QQQTLKQIVICGDRQAKDTKALVQCVHSVYIPNKVL 688


>gi|426347557|ref|XP_004041416.1| PREDICTED: spermatogenesis-associated protein 20 isoform 2 [Gorilla
           gorilla gorilla]
          Length = 802

 Score =  580 bits (1495), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 311/714 (43%), Positives = 432/714 (60%), Gaps = 59/714 (8%)

Query: 91  PASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTC 150
           P+ST     +  NRL  E SPYLLQHA+NPVDW+ WG+EAF +ARK + PIFLS+GYSTC
Sbjct: 70  PSSTPQ---RVPNRLIHEKSPYLLQHAYNPVDWYPWGQEAFDKARKENKPIFLSVGYSTC 126

Query: 151 HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 210
           HWCH+ME ESF++E + +LL++ FVS+KVDREERPDVDKVYMT+VQA   GGGWP++V+L
Sbjct: 127 HWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPDVDKVYMTFVQATSSGGGWPMNVWL 186

Query: 211 SPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS 270
           +P+L+P +GGTYFPPED   R GF+T+L ++++ W + ++ L ++     ++++ AL A 
Sbjct: 187 TPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWKQNKNTLLENS----QRVTTALLAR 242

Query: 271 ASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 325
           +  +    +LP +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S +L
Sbjct: 243 SEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRL 302

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
              G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +WH+PHFEKMLYDQ QL
Sbjct: 303 TQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHIPHFEKMLYDQAQL 357

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
           A  Y  AF ++ D FYS + + IL Y+ + +    G  +SAEDADS    G  R KEGA+
Sbjct: 358 AVAYSQAFQISGDEFYSDVAKGILQYVAQSLSHRSGGFYSAEDADSPPERG-LRPKEGAY 416

Query: 446 YVWTSKEVEDILGEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 495
           YVWT KEV+ +L E  +          L  +HY L   GN  +S   DP  E +G+NVL 
Sbjct: 417 YVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLT 474

Query: 496 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 555
                  +A++ G+ +E    +L     KLF  R  RP+PHLD K++ +WNGL++S +A 
Sbjct: 475 VRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGYAV 534

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP--- 612
              +L              G DR   +  A + A F++RH++D  + RL  +    P   
Sbjct: 535 TGAVL--------------GQDR--LINYATNGAKFLKRHMFDVASGRLMRTCYTSPGGT 578

Query: 613 ---SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 667
              S  P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD LF D +GGGYF +
Sbjct: 579 VDHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLEWALRLQDTQDRLFWDSQGGGYFCS 638

Query: 668 TGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 726
             E  + L LR+K+D DGAEPS NSVS  NL+RL     G K   +       L  F  R
Sbjct: 639 EAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSER 695

Query: 727 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           ++ + +A+P M CA       + K +V+ G + + D + ++   H+ Y  NK +
Sbjct: 696 MRRVPVALPEMVCALSA-QQQTLKQIVICGDRQAKDTKALVQCVHSVYIPNKVL 748


>gi|343958896|dbj|BAK63303.1| SPATA20 protein [Pan troglodytes]
          Length = 742

 Score =  580 bits (1495), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 314/714 (43%), Positives = 431/714 (60%), Gaps = 59/714 (8%)

Query: 91  PASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTC 150
           P+ST     +  NRL  E SPYLLQHA+NPVDW+ WG+EAF +ARK + PIFLS+GYSTC
Sbjct: 10  PSSTPQ---RVPNRLIHEKSPYLLQHAYNPVDWYPWGQEAFDKARKENKPIFLSVGYSTC 66

Query: 151 HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 210
           HWCH+ME ESF+DE + +LL++ FVS+KVDREERPDVDKVYM +VQA   GGGWP++V+L
Sbjct: 67  HWCHMMEEESFQDEEIGRLLSEDFVSVKVDREERPDVDKVYMMFVQATSSGGGWPMNVWL 126

Query: 211 SPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS 270
           +P+L+P +GGTYFPPED   R GF+T+L ++++ W + ++ L ++     ++++ AL A 
Sbjct: 127 TPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWKQNKNTLLENS----QRVTTALLAR 182

Query: 271 ASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 325
           +  +    +LP +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S +L
Sbjct: 183 SEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRL 242

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
              G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLYDQ QL
Sbjct: 243 TQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHVPHFEKMLYDQAQL 297

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
           A  Y  AF L+ D FYS + + IL Y+ R +    G  +SAEDADS    G  R KEGA+
Sbjct: 298 AVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSGGFYSAEDADSPPERG-LRPKEGAY 356

Query: 446 YVWTSKEVEDILGEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 495
           YVWT KEV+ +L E  +          L  +HY L   GN  +S   DP  E +G+NVL 
Sbjct: 357 YVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLT 414

Query: 496 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 555
                  +A++ G+ +E    +L     KLF  R  RP+PHLD K++ +WNGL++S +A 
Sbjct: 415 VRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGYAV 474

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP--- 612
              +L              G DR   +  A + A F++RH++D  + RL  +   GP   
Sbjct: 475 TGAVL--------------GQDR--LINYATNGAKFLKRHMFDVASGRLMRTCYTGPGGT 518

Query: 613 ---SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 667
              S  P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD LF D +GGGYF +
Sbjct: 519 VEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLEWALRLQDTQDRLFWDSQGGGYFCS 578

Query: 668 TGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 726
             E  + L LR+K+D DGAEPS NSVS  NL+RL     G K   +       L  F  R
Sbjct: 579 EAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSER 635

Query: 727 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           ++ + +A+P M  A       + K +V+ G + + D + ++   H+ Y  NK +
Sbjct: 636 MRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKDTKALVQCVHSVYIPNKVL 688


>gi|73966409|ref|XP_548202.2| PREDICTED: spermatogenesis-associated protein 20 [Canis lupus
           familiaris]
          Length = 789

 Score =  580 bits (1494), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 313/725 (43%), Positives = 435/725 (60%), Gaps = 57/725 (7%)

Query: 80  PYKVVAMAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDV 139
           P  +     RT  S S  + K  NRL  E SPYLLQHA+NPVDW+ WG+EAF +ARK + 
Sbjct: 44  PMPIGGKGSRTNCSPSVPQ-KVPNRLINEKSPYLLQHAYNPVDWYPWGQEAFDKARKENK 102

Query: 140 PIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALY 199
           PIFLS+GYSTCHWCH+ME ESF++E +  LLN+ FVS+KVDREERPDVDKVYMT+VQA  
Sbjct: 103 PIFLSVGYSTCHWCHMMEEESFQNEEIGHLLNEDFVSVKVDREERPDVDKVYMTFVQATS 162

Query: 200 GGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFA 259
            GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W + ++ L ++    
Sbjct: 163 SGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWKQNKNTLLENS--- 219

Query: 260 IEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQ 316
            ++++ AL A +  +    ++P +A  +   C +QL + YD  +GGF  APKFP PV + 
Sbjct: 220 -QRVTTALLARSEISMGDRQVPPSAATMNSRCFQQLDEGYDEEYGGFAEAPKFPTPVILN 278

Query: 317 MML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPH 374
            +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +WH+PH
Sbjct: 279 FLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHIPH 333

Query: 375 FEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET 434
           FEKMLYDQ QLA  Y  AF ++ D FYS + + IL Y+ R++    G  +SAEDADS   
Sbjct: 334 FEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKGILQYVARNLSHRSGGFYSAEDADSPPE 393

Query: 435 EGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLKPTGNCDLSRMSDP 484
            G  R +EGAFYVWT KEV+++L E  +          L  +HY L   GN  +S   DP
Sbjct: 394 RG-MRPREGAFYVWTVKEVQNLLPEPVLGATEPLTSGQLLMKHYGLTEAGN--ISPSQDP 450

Query: 485 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 544
             E +G+NVL        +A++ G+ ++    +L     KLF  R  RP+PHLD K++ +
Sbjct: 451 KGELQGQNVLTVRYSLELTAARFGLDVDAVRTLLNTGLEKLFQARKHRPKPHLDSKMLAA 510

Query: 545 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 604
           WNGL++S +A    +L  E    + N+ + G             A F++RH++D  + RL
Sbjct: 511 WNGLMVSGYAVTGAVLGQE---RLINYAING-------------AKFLKRHMFDVASGRL 554

Query: 605 QHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELF 656
             +   GP      S  P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD LF
Sbjct: 555 MRTCYAGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESSWLEWALRLQDTQDRLF 614

Query: 657 LDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQN 715
            D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+R+     G K   +   
Sbjct: 615 WDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRMHGFT-GHKD--WMDK 671

Query: 716 AEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYD 775
               L  F  R++ + +A+P M  A       + K +V+ G   + D + +L   H+ Y 
Sbjct: 672 CVCLLTAFSERMRRVPVALPEMVRALSAHQQ-TLKQIVICGDPQAKDTKALLQCVHSIYI 730

Query: 776 LNKTV 780
            NK +
Sbjct: 731 PNKVL 735


>gi|114669341|ref|XP_001170552.1| PREDICTED: spermatogenesis-associated protein 20 isoform 4 [Pan
           troglodytes]
 gi|397493180|ref|XP_003817490.1| PREDICTED: spermatogenesis-associated protein 20 isoform 3 [Pan
           paniscus]
          Length = 786

 Score =  579 bits (1493), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 313/714 (43%), Positives = 431/714 (60%), Gaps = 59/714 (8%)

Query: 91  PASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTC 150
           P+ST     +  NRL  E SPYLLQHA+NPVDW+ WG+EAF +ARK + PIFLS+GYSTC
Sbjct: 54  PSSTPQ---RVPNRLIHEKSPYLLQHAYNPVDWYPWGQEAFDKARKENKPIFLSVGYSTC 110

Query: 151 HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 210
           HWCH+ME ESF++E + +LL++ FVS+KVDREERPDVDKVYM +VQA   GGGWP++V+L
Sbjct: 111 HWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPDVDKVYMMFVQATSSGGGWPMNVWL 170

Query: 211 SPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS 270
           +P+L+P +GGTYFPPED   R GF+T+L ++++ W + ++ L ++     ++++ AL A 
Sbjct: 171 TPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWKQNKNTLLENS----QRVTTALLAR 226

Query: 271 ASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 325
           +  +    +LP +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S +L
Sbjct: 227 SEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRL 286

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
              G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLYDQ QL
Sbjct: 287 TQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHVPHFEKMLYDQAQL 341

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
           A  Y  AF L+ D FYS + + IL Y+ R +    G  +SAEDADS    G  R KEGA+
Sbjct: 342 AVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSGGFYSAEDADSPPERG-LRPKEGAY 400

Query: 446 YVWTSKEVEDILGEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 495
           YVWT KEV+ +L E  +          L  +HY L   GN  +S   DP  E +G+NVL 
Sbjct: 401 YVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLT 458

Query: 496 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 555
                  +A++ G+ +E    +L     KLF  R  RP+PHLD K++ +WNGL++S +A 
Sbjct: 459 VRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGYAV 518

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP--- 612
              +L              G DR   +  A + A F++RH++D  + RL  +   GP   
Sbjct: 519 TGAVL--------------GQDR--LINYATNGAKFLKRHMFDVASGRLMRTCYTGPGGT 562

Query: 613 ---SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 667
              S  P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD LF D +GGGYF +
Sbjct: 563 VEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLEWALRLQDTQDRLFWDSQGGGYFCS 622

Query: 668 TGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 726
             E  + L LR+K+D DGAEPS NSVS  NL+RL     G K   +       L  F  R
Sbjct: 623 EAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSER 679

Query: 727 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           ++ + +A+P M  A       + K +V+ G + + D + ++   H+ Y  NK +
Sbjct: 680 MRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKDTKALVQCVHSVYIPNKVL 732


>gi|114669339|ref|XP_511882.2| PREDICTED: spermatogenesis-associated protein 20 isoform 8 [Pan
           troglodytes]
 gi|397493178|ref|XP_003817489.1| PREDICTED: spermatogenesis-associated protein 20 isoform 2 [Pan
           paniscus]
 gi|410211920|gb|JAA03179.1| spermatogenesis associated 20 [Pan troglodytes]
 gi|410266782|gb|JAA21357.1| spermatogenesis associated 20 [Pan troglodytes]
 gi|410349593|gb|JAA41400.1| spermatogenesis associated 20 [Pan troglodytes]
          Length = 802

 Score =  579 bits (1492), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 313/714 (43%), Positives = 431/714 (60%), Gaps = 59/714 (8%)

Query: 91  PASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTC 150
           P+ST     +  NRL  E SPYLLQHA+NPVDW+ WG+EAF +ARK + PIFLS+GYSTC
Sbjct: 70  PSSTPQ---RVPNRLIHEKSPYLLQHAYNPVDWYPWGQEAFDKARKENKPIFLSVGYSTC 126

Query: 151 HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 210
           HWCH+ME ESF++E + +LL++ FVS+KVDREERPDVDKVYM +VQA   GGGWP++V+L
Sbjct: 127 HWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPDVDKVYMMFVQATSSGGGWPMNVWL 186

Query: 211 SPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS 270
           +P+L+P +GGTYFPPED   R GF+T+L ++++ W + ++ L ++     ++++ AL A 
Sbjct: 187 TPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWKQNKNTLLENS----QRVTTALLAR 242

Query: 271 ASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 325
           +  +    +LP +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S +L
Sbjct: 243 SEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRL 302

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
              G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLYDQ QL
Sbjct: 303 TQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHVPHFEKMLYDQAQL 357

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
           A  Y  AF L+ D FYS + + IL Y+ R +    G  +SAEDADS    G  R KEGA+
Sbjct: 358 AVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSGGFYSAEDADSPPERG-LRPKEGAY 416

Query: 446 YVWTSKEVEDILGEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 495
           YVWT KEV+ +L E  +          L  +HY L   GN  +S   DP  E +G+NVL 
Sbjct: 417 YVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLT 474

Query: 496 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 555
                  +A++ G+ +E    +L     KLF  R  RP+PHLD K++ +WNGL++S +A 
Sbjct: 475 VRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGYAV 534

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP--- 612
              +L              G DR   +  A + A F++RH++D  + RL  +   GP   
Sbjct: 535 TGAVL--------------GQDR--LINYATNGAKFLKRHMFDVASGRLMRTCYTGPGGT 578

Query: 613 ---SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 667
              S  P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD LF D +GGGYF +
Sbjct: 579 VEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLEWALRLQDTQDRLFWDSQGGGYFCS 638

Query: 668 TGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 726
             E  + L LR+K+D DGAEPS NSVS  NL+RL     G K   +       L  F  R
Sbjct: 639 EAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSER 695

Query: 727 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           ++ + +A+P M  A       + K +V+ G + + D + ++   H+ Y  NK +
Sbjct: 696 MRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKDTKALVQCVHSVYIPNKVL 748


>gi|410349595|gb|JAA41401.1| spermatogenesis associated 20 [Pan troglodytes]
          Length = 802

 Score =  578 bits (1491), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 313/714 (43%), Positives = 431/714 (60%), Gaps = 59/714 (8%)

Query: 91  PASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTC 150
           P+ST     +  NRL  E SPYLLQHA+NPVDW+ WG+EAF +ARK + PIFLS+GYSTC
Sbjct: 70  PSSTPQ---RVPNRLIHEKSPYLLQHAYNPVDWYPWGQEAFDKARKENKPIFLSVGYSTC 126

Query: 151 HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 210
           HWCH+ME ESF++E + +LL++ FVS+KVDREERPDVDKVYM +VQA   GGGWP++V+L
Sbjct: 127 HWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPDVDKVYMMFVQATSSGGGWPMNVWL 186

Query: 211 SPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS 270
           +P+L+P +GGTYFPPED   R GF+T+L ++++ W + ++ L ++     ++++ AL A 
Sbjct: 187 TPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWKQNKNTLLENS----QRVTTALLAR 242

Query: 271 ASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 325
           +  +    +LP +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S +L
Sbjct: 243 SEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRL 302

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
              G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLYDQ QL
Sbjct: 303 TQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHVPHFEKMLYDQAQL 357

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
           A  Y  AF L+ D FYS + + IL Y+ R +    G  +SAEDADS    G  R KEGA+
Sbjct: 358 AVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSGGFYSAEDADSPPERG-LRPKEGAY 416

Query: 446 YVWTSKEVEDILGEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 495
           YVWT KEV+ +L E  +          L  +HY L   GN  +S   DP  E +G+NVL 
Sbjct: 417 YVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLT 474

Query: 496 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 555
                  +A++ G+ +E    +L     KLF  R  RP+PHLD K++ +WNGL++S +A 
Sbjct: 475 VRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGYAV 534

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP--- 612
              +L              G DR   +  A + A F++RH++D  + RL  +   GP   
Sbjct: 535 TGAVL--------------GQDR--LINYATNGAKFLKRHMFDVASGRLMRTCYTGPGGT 578

Query: 613 ---SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 667
              S  P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD LF D +GGGYF +
Sbjct: 579 VEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLEWALRLQDTQDRLFWDSQGGGYFCS 638

Query: 668 TGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 726
             E  + L LR+K+D DGAEPS NSVS  NL+RL     G K   +       L  F  R
Sbjct: 639 EAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSER 695

Query: 727 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           ++ + +A+P M  A       + K +V+ G + + D + ++   H+ Y  NK +
Sbjct: 696 MRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKDTKALVQCVHSVYIPNKVL 748


>gi|410051894|ref|XP_003953187.1| PREDICTED: spermatogenesis-associated protein 20 [Pan troglodytes]
          Length = 786

 Score =  578 bits (1491), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 313/714 (43%), Positives = 431/714 (60%), Gaps = 59/714 (8%)

Query: 91  PASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTC 150
           P+ST     +  NRL  E SPYLLQHA+NPVDW+ WG+EAF +ARK + PIFLS+GYSTC
Sbjct: 54  PSSTPQ---RVPNRLIHEKSPYLLQHAYNPVDWYPWGQEAFDKARKENKPIFLSVGYSTC 110

Query: 151 HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 210
           HWCH+ME ESF++E + +LL++ FVS+KVDREERPDVDKVYM +VQA   GGGWP++V+L
Sbjct: 111 HWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPDVDKVYMMFVQATSSGGGWPMNVWL 170

Query: 211 SPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS 270
           +P+L+P +GGTYFPPED   R GF+T+L ++++ W + ++ L ++     ++++ AL A 
Sbjct: 171 TPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWKQNKNTLLENS----QRVTTALLAR 226

Query: 271 ASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 325
           +  +    +LP +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S +L
Sbjct: 227 SEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRL 286

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
              G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLYDQ QL
Sbjct: 287 TQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHVPHFEKMLYDQAQL 341

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
           A  Y  AF L+ D FYS + + IL Y+ R +    G  +SAEDADS    G  R KEGA+
Sbjct: 342 AVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSGGFYSAEDADSPPERG-LRPKEGAY 400

Query: 446 YVWTSKEVEDILGEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 495
           YVWT KEV+ +L E  +          L  +HY L   GN  +S   DP  E +G+NVL 
Sbjct: 401 YVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLT 458

Query: 496 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 555
                  +A++ G+ +E    +L     KLF  R  RP+PHLD K++ +WNGL++S +A 
Sbjct: 459 VRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGYAV 518

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP--- 612
              +L              G DR   +  A + A F++RH++D  + RL  +   GP   
Sbjct: 519 TGAVL--------------GQDR--LINYATNGAKFLKRHMFDVASGRLMRTCYTGPGGT 562

Query: 613 ---SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 667
              S  P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD LF D +GGGYF +
Sbjct: 563 VEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLEWALRLQDTQDRLFWDSQGGGYFCS 622

Query: 668 TGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 726
             E  + L LR+K+D DGAEPS NSVS  NL+RL     G K   +       L  F  R
Sbjct: 623 EAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSER 679

Query: 727 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           ++ + +A+P M  A       + K +V+ G + + D + ++   H+ Y  NK +
Sbjct: 680 MRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKDTKALVQCVHSVYIPNKVL 732


>gi|116487451|gb|AAI25719.1| LOC779596 protein [Xenopus (Silurana) tropicalis]
          Length = 770

 Score =  578 bits (1491), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 311/718 (43%), Positives = 426/718 (59%), Gaps = 60/718 (8%)

Query: 81  YKVVAMAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVP 140
           ++V  MA    + ++ +     NRL  E S YL QHA NPVDW+ WG+EAF+ A +   P
Sbjct: 55  FEVCKMA----SGSTQTPTGRVNRLINEKSLYLQQHARNPVDWYPWGQEAFSRAAREMKP 110

Query: 141 IFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG 200
           IFLS+GYSTCHWCHVME ESFEDE + ++LN+ F+ +KVDREERPDVDKVYMT++QA   
Sbjct: 111 IFLSVGYSTCHWCHVMERESFEDEEIGRILNENFICVKVDREERPDVDKVYMTFLQATDS 170

Query: 201 GGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAI 260
           GGGWP+SV+L+PDL+P +GGTYFPPED   R  F+T+L ++ + W + R       AF  
Sbjct: 171 GGGWPMSVWLTPDLRPFVGGTYFPPEDGVRRVSFRTVLLRIVEQWKENR-------AFLC 223

Query: 261 EQLSEALSASASSNKL------PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVE 314
           E+    LS   SS+ +      P  LP    +LC +QL + +D  +GGFG  PKFP PV 
Sbjct: 224 ERSERILSVLQSSSDIDGAAEPPPSLPVQ--KLCFQQLERIFDEEYGGFGEFPKFPTPVN 281

Query: 315 IQMM--LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHV 372
              +  L+   K      S E ++   M + TL+ M  GGIHDH+G GFHRYS D+ WHV
Sbjct: 282 FSFLFCLWALSK-----GSPEGTQALHMAVHTLKWMMYGGIHDHIGKGFHRYSTDQTWHV 336

Query: 373 PHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSA 432
           PHFEKMLYDQ QLA  Y +AF ++    +S    DIL Y+ +++    G  +SAEDADS 
Sbjct: 337 PHFEKMLYDQAQLAVAYAEAFQISGKEIFSDAAHDILQYVLQNLSDDAGGFYSAEDADSL 396

Query: 433 ETEGATRKKEGAFYVWTSKEVEDILGE--------HAILFKEHYYLKPTGNCDLSRMSDP 484
               +  KKEGAF  WT+KE++ +L +           +F  HY +K  GN   S+  D 
Sbjct: 397 PNAQSKEKKEGAFATWTAKEIQQLLPDMEEANGNTFGDIFMHHYGMKEEGNVSASQ--DI 454

Query: 485 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 544
           H E +G+NVLI  +    +A+K G+ + +   IL  CR +L+  R  RP P  D K++ S
Sbjct: 455 HGELQGQNVLIVRSSLELTAAKFGLDVARVQTILSMCRDRLYKARRLRPPPQRDTKILAS 514

Query: 545 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 604
           WNGL++S  AR   IL+ E                 Y+E A+ AASF+  ++YD ++  L
Sbjct: 515 WNGLMLSGLARCGVILRDEG----------------YIERAKLAASFLHENMYDLKSGIL 558

Query: 605 QHSFRNG----PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDRE 660
             SF  G        PGFLDDYAF++ GLLDLYE      +L WA++LQ+ QD+LF D +
Sbjct: 559 LRSFYKGHQPIADLVPGFLDDYAFMVRGLLDLYEACLDQFYLEWALQLQDRQDQLFWDAK 618

Query: 661 GGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 720
           G GYF +   D S+LLR+K+D DGAEPSGNSVSV+NL+RLA     ++   + + +   L
Sbjct: 619 GSGYFCSDASDSSILLRLKDDQDGAEPSGNSVSVVNLLRLACYTGRTE---FTERSGQIL 675

Query: 721 AVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK 778
           A F  RL  +  ++P M    +M+   + K VV+ G K   +   +L AA + Y  NK
Sbjct: 676 AAFSERLLKVPASLPEM-VRGNMIYHQTVKQVVVCGDKEDPNTRELLEAAQSMYVPNK 732


>gi|114669347|ref|XP_001170636.1| PREDICTED: spermatogenesis-associated protein 20 isoform 7 [Pan
           troglodytes]
 gi|397493176|ref|XP_003817488.1| PREDICTED: spermatogenesis-associated protein 20 isoform 1 [Pan
           paniscus]
          Length = 742

 Score =  578 bits (1491), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 313/714 (43%), Positives = 431/714 (60%), Gaps = 59/714 (8%)

Query: 91  PASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTC 150
           P+ST     +  NRL  E SPYLLQHA+NPVDW+ WG+EAF +ARK + PIFLS+GYSTC
Sbjct: 10  PSSTPQ---RVPNRLIHEKSPYLLQHAYNPVDWYPWGQEAFDKARKENKPIFLSVGYSTC 66

Query: 151 HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 210
           HWCH+ME ESF++E + +LL++ FVS+KVDREERPDVDKVYM +VQA   GGGWP++V+L
Sbjct: 67  HWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPDVDKVYMMFVQATSSGGGWPMNVWL 126

Query: 211 SPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS 270
           +P+L+P +GGTYFPPED   R GF+T+L ++++ W + ++ L ++     ++++ AL A 
Sbjct: 127 TPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWKQNKNTLLENS----QRVTTALLAR 182

Query: 271 ASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 325
           +  +    +LP +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S +L
Sbjct: 183 SEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRL 242

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
              G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLYDQ QL
Sbjct: 243 TQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHVPHFEKMLYDQAQL 297

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
           A  Y  AF L+ D FYS + + IL Y+ R +    G  +SAEDADS    G  R KEGA+
Sbjct: 298 AVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSGGFYSAEDADSPPERG-LRPKEGAY 356

Query: 446 YVWTSKEVEDILGEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 495
           YVWT KEV+ +L E  +          L  +HY L   GN  +S   DP  E +G+NVL 
Sbjct: 357 YVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLT 414

Query: 496 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 555
                  +A++ G+ +E    +L     KLF  R  RP+PHLD K++ +WNGL++S +A 
Sbjct: 415 VRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGYAV 474

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP--- 612
              +L              G DR   +  A + A F++RH++D  + RL  +   GP   
Sbjct: 475 TGAVL--------------GQDR--LINYATNGAKFLKRHMFDVASGRLMRTCYTGPGGT 518

Query: 613 ---SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 667
              S  P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD LF D +GGGYF +
Sbjct: 519 VEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLEWALRLQDTQDRLFWDSQGGGYFCS 578

Query: 668 TGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 726
             E  + L LR+K+D DGAEPS NSVS  NL+RL     G K   +       L  F  R
Sbjct: 579 EAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSER 635

Query: 727 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           ++ + +A+P M  A       + K +V+ G + + D + ++   H+ Y  NK +
Sbjct: 636 MRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKDTKALVQCVHSVYIPNKVL 688


>gi|10437433|dbj|BAB15051.1| unnamed protein product [Homo sapiens]
          Length = 786

 Score =  578 bits (1491), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 313/714 (43%), Positives = 430/714 (60%), Gaps = 59/714 (8%)

Query: 91  PASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTC 150
           P+ST     +  NRL  E SPYLLQHA+NPVDW+ WG+EAF +ARK + PIFLS+GYSTC
Sbjct: 54  PSSTPQ---RVPNRLIHEKSPYLLQHAYNPVDWYPWGQEAFDKARKENKPIFLSVGYSTC 110

Query: 151 HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 210
           HWCH+ME ESF++E + +LL++ FVS+KVDREERPDVDKVYMT+VQA   GGGWP++V+L
Sbjct: 111 HWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPDVDKVYMTFVQATSSGGGWPMNVWL 170

Query: 211 SPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS 270
           +P+L+P +GGTYFPPED   R GF+T+L ++++ W + ++ L ++     ++++ AL A 
Sbjct: 171 TPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWKQNKNTLLENS----QRVTTALLAR 226

Query: 271 ASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 325
           +  +    +LP +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S +L
Sbjct: 227 SEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRL 286

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
              G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLYDQ QL
Sbjct: 287 TQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHVPHFEKMLYDQAQL 341

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
           A  Y  AF L+ D  YS + + IL Y+ R +    G  +SAEDADS    G  R KEGA+
Sbjct: 342 AVAYSQAFQLSGDELYSDVAKGILQYVARSLSHRSGGFYSAEDADSPPERG-QRPKEGAY 400

Query: 446 YVWTSKEVEDILGEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 495
           YVWT KEV+ +L E  +          L  +HY L   GN  +S   DP  E +G+NVL 
Sbjct: 401 YVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLT 458

Query: 496 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 555
                  +A++ G+ +E    +L     KLF  R  RP+PHLD K++ +WNGL++S +A 
Sbjct: 459 VRYSLELTAARFGLDVEAVRTLLNSGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGYAV 518

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP--- 612
              +L              G DR   +  A + A F+ RH++D  + RL  +   GP   
Sbjct: 519 TGAVL--------------GQDR--LINYATNGAKFLERHMFDVASGRLMRTCYTGPGGT 562

Query: 613 ---SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 667
              S  P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD LF D +GGGYF +
Sbjct: 563 VEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLEWALRLQDTQDRLFWDSQGGGYFCS 622

Query: 668 TGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 726
             E  + L LR+K+D DGAEPS NSVS  NL+RL     G K   +       L  F  R
Sbjct: 623 EAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSER 679

Query: 727 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           ++ + +A+P M  A       + K +V+ G + + D + ++   H+ Y  NK +
Sbjct: 680 MRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKDTKALVQCVHSVYIPNKVL 732


>gi|189500022|ref|YP_001959492.1| hypothetical protein Cphamn1_1072 [Chlorobium phaeobacteroides BS1]
 gi|189495463|gb|ACE04011.1| protein of unknown function DUF255 [Chlorobium phaeobacteroides
           BS1]
          Length = 712

 Score =  578 bits (1490), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 304/676 (44%), Positives = 417/676 (61%), Gaps = 46/676 (6%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           +  N LA E SPYLLQHA+NP  W+ WGEEAF +AR  D P+FLS+GYSTCHWCHVME E
Sbjct: 6   RRPNLLAEETSPYLLQHAYNPAAWYPWGEEAFEKARNEDKPVFLSVGYSTCHWCHVMERE 65

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFE++ +A+LLN  FV +KVDREERPD+D++YMTYVQA  G GGWP+SV+L+PDLKP  G
Sbjct: 66  SFENDRIAELLNRAFVPVKVDREERPDIDRLYMTYVQATTGSGGWPMSVWLTPDLKPFFG 125

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           G+YFPPED+YG+PGF ++L  ++ AW + R+    +     EQL EALS        P+ 
Sbjct: 126 GSYFPPEDRYGKPGFHSLLLSIERAWKEDRNRFLSAAEGMTEQL-EALSLQK-----PET 179

Query: 280 LP--QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEG 337
           +P  +      A+  +  +D   GGFG+APKFP+P  ++ +L +S     TG      E 
Sbjct: 180 VPLDEQVFHHAAKTFAGMFDKEDGGFGNAPKFPQPSILEFLLAYSYF---TGN----QEA 232

Query: 338 QKMVLFTLQCMAKGGIHDHV------GGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLD 391
           ++MVL +L+ MA GGIHDH+      GGGF RYS D RWHVPHFEKMLYD  QLA V  +
Sbjct: 233 KEMVLLSLRKMASGGIHDHLGIKNLGGGGFARYSTDVRWHVPHFEKMLYDNAQLAVVATE 292

Query: 392 AFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSK 451
           A+ +T +  Y+ +  DIL+Y+  DM    G  +SAEDADS     +  KKEGAFY W+ +
Sbjct: 293 AYQITGENLYANLADDILNYVLCDMTDNKGGFYSAEDADSFPNSKSKAKKEGAFYTWSIQ 352

Query: 452 EVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 510
           E+   L      +F   Y ++  GN     + DPH EF G+N+L   ND  A+A++  MP
Sbjct: 353 EITAKLDPLETDIFCFIYGVESDGNA----LDDPHLEFTGRNILFARNDIEAAAAQFSMP 408

Query: 511 LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
            E    I  + R KLF  R+ RPRPHLDDK++ SWNGL+IS+ ++AS +L+S+       
Sbjct: 409 SEIIREITDDAREKLFHSRNDRPRPHLDDKILTSWNGLMISALSKASCVLRSQ------- 461

Query: 571 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLL 630
                     Y++ A  AA FI  +LY     RL   +R+G +   G  DDY+F I GLL
Sbjct: 462 ---------NYLDAALKAAEFILNNLYSTTDGRLLRRYRSGQAGIGGKADDYSFFIQGLL 512

Query: 631 DLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGN 690
           DLYE  S  ++L  A++L   Q ELF D + GG+FN   +D SV +R+KED+DGAEPS N
Sbjct: 513 DLYEASSEHRYLSNAVKLMEKQIELFFDDKSGGFFNAASDDSSVPIRMKEDYDGAEPSPN 572

Query: 691 SVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRK 750
           S++  +L RLA ++     D +R+ A+ ++A F   LK+    +P +   A ML     +
Sbjct: 573 SINTFSLYRLADMM---DRDDFREIADKTIAYFSKSLKENGRQLPCLLKTA-MLPFYGTR 628

Query: 751 HVVLVGHKSSVDFENM 766
            V+L G + +   +N+
Sbjct: 629 QVILTGERHNETMKNL 644


>gi|340721576|ref|XP_003399194.1| PREDICTED: spermatogenesis-associated protein 20-like [Bombus
           terrestris]
          Length = 831

 Score =  578 bits (1490), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 303/660 (45%), Positives = 411/660 (62%), Gaps = 47/660 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL+ E SPYLLQHA NPVDW+ W +EA  +A K +  IFLS+GYSTCHWCHVME ESF 
Sbjct: 101 NRLSLEKSPYLLQHATNPVDWYPWCDEALEKASKENKCIFLSVGYSTCHWCHVMEKESFT 160

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           ++ +A+++N  F++IKVD+EERPD+DK+YMT++QA  G GGWP+SVFL+ DLKP++GGTY
Sbjct: 161 NKEIAEIMNKNFINIKVDKEERPDIDKIYMTFIQATSGHGGWPMSVFLTADLKPIIGGTY 220

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPPED + + GFKTIL  V   W++ R  L + G+  +E L  ++S   +S K+ D    
Sbjct: 221 FPPEDTFRQIGFKTILLSVAQKWNQSRSKLTEIGSTNLETLC-SISKIPNSLKVHDTPSL 279

Query: 283 NALRLCAEQLSKSYDSRFGGFGS-----APKFPRPVEIQMMLYHSKKLEDTGKSGEASEG 337
              ++C +Q    ++ +FGGFGS     +PKFP+PV +   L+H    +   +S      
Sbjct: 280 ECSKICIQQFVNGFEPKFGGFGSTYNMQSPKFPQPVNLN-FLFHMYARQPNVES--VRPC 336

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
             M ++TL+ M+ GGIHDHVG GF RY+ D  WHVPHFEKMLYDQGQL   Y DA+ +TK
Sbjct: 337 LHMSVYTLKKMSFGGIHDHVGQGFSRYATDGEWHVPHFEKMLYDQGQLMKSYADAYLVTK 396

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           D F++ I  DI  Y+ RD+    G  +SAEDADS  T  A  KKEGAFYVW++ E++ IL
Sbjct: 397 DNFFAEIVDDIATYVIRDLRHKEGGFYSAEDADSYPTHDAHAKKEGAFYVWSAVEIKSIL 456

Query: 458 GEH---------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 508
            +          + +F  H+ +  +GN  +    DPH E K KNVLI  N+   +A    
Sbjct: 457 NKEVSDETHVKLSDIFCRHFNVNESGN--VKSHQDPHGEIKEKNVLIAYNEIEETARYFN 514

Query: 509 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 568
           +P+E+    L E    L+ VRS RPRPHLDDK+I +WNGL+IS  A              
Sbjct: 515 LPVEETKMYLKEACSMLYKVRSARPRPHLDDKIITAWNGLMISGLA-------------- 560

Query: 569 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS-FRNGP-------SKAPGFLD 620
             F     + K+Y+E A  AA FI+ +L+DE  + L HS +R+         +  PGFLD
Sbjct: 561 --FGGAAVNNKQYIERAADAAKFIKEYLFDETKNILLHSCYRDEKDTIIQISTPIPGFLD 618

Query: 621 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 680
           DYAF+I GLLDLYE     +WL +A +LQ+ QD+ F D + GGYF+TT  DPS++LR+KE
Sbjct: 619 DYAFVIKGLLDLYESDLNEEWLEFAEKLQHLQDQYFWDEKDGGYFSTTSSDPSIILRLKE 678

Query: 681 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 740
            +DGAEPSGNS++  NL+RLA  +     D ++  A H   VF   L    + VP +  A
Sbjct: 679 AYDGAEPSGNSIAAENLLRLADYLG---CDEFKDKAAHLFRVFRHLLMQSPVTVPQLTSA 735


>gi|297700798|ref|XP_002827419.1| PREDICTED: spermatogenesis-associated protein 20 isoform 1 [Pongo
           abelii]
          Length = 786

 Score =  577 bits (1488), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 310/710 (43%), Positives = 429/710 (60%), Gaps = 56/710 (7%)

Query: 95  SHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCH 154
           S +  +  NRL  E SPYLLQHA+NPVDW+ WG+EAF +ARK + PIFLS+GYSTCHWCH
Sbjct: 55  SSAPQRVPNRLIHEKSPYLLQHAYNPVDWYPWGQEAFDKARKENKPIFLSVGYSTCHWCH 114

Query: 155 VMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDL 214
           +ME ESF++E + +LL++ FVS+KVDREERPDVDKVYMT+VQA   GGGWP++V+L+P+L
Sbjct: 115 MMEEESFQNEEIGRLLSEDFVSVKVDREERPDVDKVYMTFVQATSSGGGWPMNVWLTPNL 174

Query: 215 KPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSN 274
           +P +GGTYFPPED   R GF+T+L ++++ W + ++ L ++     ++++ AL A +  +
Sbjct: 175 QPFVGGTYFPPEDGLTRVGFRTVLLRIREQWKQNKNTLLENS----QRVTTALLARSEIS 230

Query: 275 KLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTG 329
               +LP +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S +L   G
Sbjct: 231 VGDRQLPPSAATMNNRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRLTQDG 290

Query: 330 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 389
                S  Q+M L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLYDQ QLA  Y
Sbjct: 291 -----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHVPHFEKMLYDQAQLAVAY 345

Query: 390 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 449
             AF ++ D FYS + + IL Y+ R +    G  +SAEDADS    G  R KEGA+YVWT
Sbjct: 346 SQAFQISGDEFYSDMAKGILQYVARSLSHRSGGFYSAEDADSPPERG-MRPKEGAYYVWT 404

Query: 450 SKEVEDILGEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 499
            KEV+ +L E  +          L  +HY L   GN  +S   DP  E +G+NVL     
Sbjct: 405 VKEVQQLLPEPVLGATEPLTSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLTVRYS 462

Query: 500 SSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 559
              +A++ G+ +E    +L     KLF  R  RP+PHLD K++ +WNGL++S +A    +
Sbjct: 463 LELTAARFGLDVEAVRTLLNTGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGYAVTGAV 522

Query: 560 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP------S 613
           L              G DR   +  A + A F++RH++D  + RL  +   G       S
Sbjct: 523 L--------------GQDR--LINYATNGAKFLKRHMFDVASGRLMRTCYTGSGGTVEHS 566

Query: 614 KAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGED 671
             P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD LF D +GGGYF +  E 
Sbjct: 567 NPPCWGFLEDYAFVVRGLLDLYEASQESAWLEWALRLQDTQDRLFWDSQGGGYFCSEAEL 626

Query: 672 PSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDM 730
            + L LR+K+D DGAEPS NSVS  NL+RL     G K   +       L  F  R++ +
Sbjct: 627 GAGLPLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSERMRRV 683

Query: 731 AMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
            +A+P M  A       + K +V+ G + + D + ++   H+ Y  NK +
Sbjct: 684 PVALPEMVRALSA-QQQTLKQIVICGDRQAKDTKALVQCVHSVYIPNKVL 732


>gi|402899621|ref|XP_003912789.1| PREDICTED: spermatogenesis-associated protein 20 isoform 2 [Papio
           anubis]
          Length = 802

 Score =  577 bits (1488), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 321/751 (42%), Positives = 441/751 (58%), Gaps = 63/751 (8%)

Query: 59  PRNYLYPFRRPLAVISHRPIHPYKVVAMAERTPASTSHSRNKHT-----NRLAAEHSPYL 113
           PR +  P R P    S R       V+ +   PA    S    T     NRL  E SPYL
Sbjct: 32  PRTW--PHRNPSRGSSSRDKDRSATVSSSVPMPAGGKGSHPSSTPQRVPNRLIHEKSPYL 89

Query: 114 LQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDW 173
           LQHA+NPVDW+ WG+EAF +ARK + PIFLS+GYSTCHWCH+ME ESF++E + +LL++ 
Sbjct: 90  LQHAYNPVDWYPWGQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSED 149

Query: 174 FVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPG 233
           FVS+KVDREERPDVDKVYMT+VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R G
Sbjct: 150 FVSVKVDREERPDVDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVG 209

Query: 234 FKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAE 290
           F+T+L ++++ W + ++ L ++     ++++ AL A +  +    +LP +A  +   C +
Sbjct: 210 FRTVLLRIREQWKQNKNTLLENS----QRVTTALLARSEISMGDRQLPPSAATMNNRCFQ 265

Query: 291 QLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCM 348
           QL + YD  +GGF  APKFP PV +  +   + S +L   G     S  Q+M L TL+ M
Sbjct: 266 QLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMM 320

Query: 349 AKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDI 408
           A GGI DHVG GFHRYS D +WHVPHFEKMLYDQ QLA  Y  AF ++ D FYS + + I
Sbjct: 321 ANGGIRDHVGQGFHRYSTDCQWHVPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKGI 380

Query: 409 LDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI------ 462
           L Y+ R +    G  +SAEDADS    G  R KEGA+YVWT KEV+ +L E  +      
Sbjct: 381 LQYVARSLSHRSGGFYSAEDADSPPERG-MRPKEGAYYVWTVKEVQQLLPEPVLGATEPL 439

Query: 463 ----LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
               L  +HY L   GN  +S   DP  E +G+NVL        +A++ G+ +E    +L
Sbjct: 440 TSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLL 497

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
                KLF  R  RP+PHLD K++ +WNGL++S +A    +L              G DR
Sbjct: 498 NTGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR 543

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLL 630
              +  A + A F++RH++D  + RL  +   G       S  P  GFL+DYAF++ GLL
Sbjct: 544 --LISYATNGAKFLKRHMFDVASGRLMRTCYTGSGGTVEHSSPPCWGFLEDYAFVVRGLL 601

Query: 631 DLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSG 689
           DLYE    + WL WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+D DGAEPS 
Sbjct: 602 DLYEASQESAWLEWALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSA 661

Query: 690 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR 749
           NSVS  NL+RL     G K   +       L  F  R++ + +A+P M  A       + 
Sbjct: 662 NSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTL 717

Query: 750 KHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           K +V+ G + + D + ++   H+ Y  NK +
Sbjct: 718 KQIVICGDRQAKDTKALVQCVHSVYIPNKVL 748


>gi|355753994|gb|EHH57959.1| hypothetical protein EGM_07713, partial [Macaca fascicularis]
          Length = 777

 Score =  577 bits (1487), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 321/751 (42%), Positives = 441/751 (58%), Gaps = 63/751 (8%)

Query: 59  PRNYLYPFRRPLAVISHRPIHPYKVVAMAERTPASTSHSRNKHT-----NRLAAEHSPYL 113
           PR +  P R P    S R       V+ +   PA    S    T     NRL  E SPYL
Sbjct: 7   PRTW--PHRNPSRGSSSRDKDRSATVSSSVPMPAGGKGSHPSSTPQRVPNRLIHEKSPYL 64

Query: 114 LQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDW 173
           LQHA+NPVDW+ WG+EAF +ARK + PIFLS+GYSTCHWCH+ME ESF++E + +LL++ 
Sbjct: 65  LQHAYNPVDWYPWGQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSED 124

Query: 174 FVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPG 233
           FVS+KVDREERPDVDKVYMT+VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R G
Sbjct: 125 FVSVKVDREERPDVDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVG 184

Query: 234 FKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAE 290
           F+T+L ++++ W + ++ L ++     ++++ AL A +  +    +LP +A  +   C +
Sbjct: 185 FRTVLLRIREQWKQNKNTLLENS----QRVTTALLARSEISMGDRQLPPSAATMNNRCFQ 240

Query: 291 QLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCM 348
           QL + YD  +GGF  APKFP PV +  +   + S +L   G     S  Q+M L TL+ M
Sbjct: 241 QLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMM 295

Query: 349 AKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDI 408
           A GGI DHVG GFHRYS D +WHVPHFEKMLYDQ QLA  Y  AF ++ D FYS + + I
Sbjct: 296 ANGGIRDHVGQGFHRYSTDCQWHVPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKGI 355

Query: 409 LDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI------ 462
           L Y+ R +    G  +SAEDADS    G  R KEGA+YVWT KEV+ +L E  +      
Sbjct: 356 LQYVARSLSHRSGGFYSAEDADSPPERG-MRPKEGAYYVWTVKEVQQLLPEPVLGATEPL 414

Query: 463 ----LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
               L  +HY L   GN  +S   DP  E +G+NVL        +A++ G+ +E    +L
Sbjct: 415 TSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLL 472

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
                KLF  R  RP+PHLD K++ +WNGL++S +A    +L              G DR
Sbjct: 473 NTGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR 518

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLL 630
              +  A + A F++RH++D  + RL  +   G       S  P  GFL+DYAF++ GLL
Sbjct: 519 --LINYATNGAKFLKRHMFDVASGRLMRTCYTGSGGTVEHSNPPCWGFLEDYAFVVRGLL 576

Query: 631 DLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSG 689
           DLYE    + WL WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+D DGAEPS 
Sbjct: 577 DLYEASQESAWLEWALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSA 636

Query: 690 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR 749
           NSVS  NL+RL     G K   +       L  F  R++ + +A+P M  A       + 
Sbjct: 637 NSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTL 692

Query: 750 KHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           K +V+ G + + D + ++   H+ Y  NK +
Sbjct: 693 KQIVICGDRQAKDTKALVQCVHSVYIPNKVL 723


>gi|109114321|ref|XP_001099622.1| PREDICTED: spermatogenesis-associated protein 20 isoform 4 [Macaca
           mulatta]
 gi|355568523|gb|EHH24804.1| hypothetical protein EGK_08527 [Macaca mulatta]
          Length = 802

 Score =  577 bits (1487), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 321/751 (42%), Positives = 441/751 (58%), Gaps = 63/751 (8%)

Query: 59  PRNYLYPFRRPLAVISHRPIHPYKVVAMAERTPASTSHSRNKHT-----NRLAAEHSPYL 113
           PR +  P R P    S R       V+ +   PA    S    T     NRL  E SPYL
Sbjct: 32  PRTW--PHRNPSRGSSSRDKDRSATVSSSVPMPAGGKGSHPSSTPQRVPNRLIHEKSPYL 89

Query: 114 LQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDW 173
           LQHA+NPVDW+ WG+EAF +ARK + PIFLS+GYSTCHWCH+ME ESF++E + +LL++ 
Sbjct: 90  LQHAYNPVDWYPWGQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSED 149

Query: 174 FVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPG 233
           FVS+KVDREERPDVDKVYMT+VQA   GGGWP++V+L+P+L+P +GGTYFPPED   R G
Sbjct: 150 FVSVKVDREERPDVDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVG 209

Query: 234 FKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL---CAE 290
           F+T+L ++++ W + ++ L ++     ++++ AL A +  +    +LP +A  +   C +
Sbjct: 210 FRTVLLRIREQWKQNKNTLLENS----QRVTTALLARSEISMGDRQLPPSAATMNNRCFQ 265

Query: 291 QLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCM 348
           QL + YD  +GGF  APKFP PV +  +   + S +L   G     S  Q+M L TL+ M
Sbjct: 266 QLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMALHTLKMM 320

Query: 349 AKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDI 408
           A GGI DHVG GFHRYS D +WHVPHFEKMLYDQ QLA  Y  AF ++ D FYS + + I
Sbjct: 321 ANGGIRDHVGQGFHRYSTDCQWHVPHFEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKGI 380

Query: 409 LDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI------ 462
           L Y+ R +    G  +SAEDADS    G  R KEGA+YVWT KEV+ +L E  +      
Sbjct: 381 LQYVARSLSHRSGGFYSAEDADSPPERG-MRPKEGAYYVWTVKEVQQLLPEPVLGATEPL 439

Query: 463 ----LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
               L  +HY L   GN  +S   DP  E +G+NVL        +A++ G+ +E    +L
Sbjct: 440 TSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLL 497

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
                KLF  R  RP+PHLD K++ +WNGL++S +A    +L              G DR
Sbjct: 498 NTGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGYAVTGAVL--------------GQDR 543

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYAFLISGLL 630
              +  A + A F++RH++D  + RL  +   G       S  P  GFL+DYAF++ GLL
Sbjct: 544 --LINYATNGAKFLKRHMFDVASGRLMRTCYTGSGGTVEHSNPPCWGFLEDYAFVVRGLL 601

Query: 631 DLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSG 689
           DLYE    + WL WA+ LQ+TQD LF D +GGGYF +  E  + L LR+K+D DGAEPS 
Sbjct: 602 DLYEASQESAWLEWALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSA 661

Query: 690 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR 749
           NSVS  NL+RL     G K   +       L  F  R++ + +A+P M  A       + 
Sbjct: 662 NSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALSA-QQQTL 717

Query: 750 KHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           K +V+ G + + D + ++   H+ Y  NK +
Sbjct: 718 KQIVICGDRQAKDTKALVQCVHSVYIPNKVL 748


>gi|109114323|ref|XP_001099418.1| PREDICTED: spermatogenesis-associated protein 20 isoform 2 [Macaca
           mulatta]
          Length = 786

 Score =  577 bits (1487), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 312/714 (43%), Positives = 431/714 (60%), Gaps = 59/714 (8%)

Query: 91  PASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTC 150
           P+ST     +  NRL  E SPYLLQHA+NPVDW+ WG+EAF +ARK + PIFLS+GYSTC
Sbjct: 54  PSSTPQ---RVPNRLIHEKSPYLLQHAYNPVDWYPWGQEAFDKARKENKPIFLSVGYSTC 110

Query: 151 HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 210
           HWCH+ME ESF++E + +LL++ FVS+KVDREERPDVDKVYMT+VQA   GGGWP++V+L
Sbjct: 111 HWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPDVDKVYMTFVQATSSGGGWPMNVWL 170

Query: 211 SPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS 270
           +P+L+P +GGTYFPPED   R GF+T+L ++++ W + ++ L ++     ++++ AL A 
Sbjct: 171 TPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWKQNKNTLLENS----QRVTTALLAR 226

Query: 271 ASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 325
           +  +    +LP +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S +L
Sbjct: 227 SEISMGDRQLPPSAATMNNRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRL 286

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
              G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLYDQ QL
Sbjct: 287 TQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDCQWHVPHFEKMLYDQAQL 341

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
           A  Y  AF ++ D FYS + + IL Y+ R +    G  +SAEDADS    G  R KEGA+
Sbjct: 342 AVAYSQAFQISGDEFYSDVAKGILQYVARSLSHRSGGFYSAEDADSPPERG-MRPKEGAY 400

Query: 446 YVWTSKEVEDILGEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 495
           YVWT KEV+ +L E  +          L  +HY L   GN  +S   DP  E +G+NVL 
Sbjct: 401 YVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLT 458

Query: 496 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 555
                  +A++ G+ +E    +L     KLF  R  RP+PHLD K++ +WNGL++S +A 
Sbjct: 459 VRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGYAV 518

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP--- 612
              +L              G DR   +  A + A F++RH++D  + RL  +   G    
Sbjct: 519 TGAVL--------------GQDR--LINYATNGAKFLKRHMFDVASGRLMRTCYTGSGGT 562

Query: 613 ---SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 667
              S  P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD LF D +GGGYF +
Sbjct: 563 VEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLEWALRLQDTQDRLFWDSQGGGYFCS 622

Query: 668 TGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 726
             E  + L LR+K+D DGAEPS NSVS  NL+RL     G K   +       L  F  R
Sbjct: 623 EAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSER 679

Query: 727 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           ++ + +A+P M  A       + K +V+ G + + D + ++   H+ Y  NK +
Sbjct: 680 MRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKDTKALVQCVHSVYIPNKVL 732


>gi|332246333|ref|XP_003272309.1| PREDICTED: LOW QUALITY PROTEIN: spermatogenesis-associated protein
           20 [Nomascus leucogenys]
          Length = 802

 Score =  577 bits (1486), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 312/714 (43%), Positives = 431/714 (60%), Gaps = 59/714 (8%)

Query: 91  PASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTC 150
           P+ST     +  NRL  E SPYLLQHA+NPVDW+ WG+EAF +ARK + PIFLS+GYSTC
Sbjct: 70  PSSTPQ---RVPNRLIHEKSPYLLQHAYNPVDWYPWGQEAFDKARKENKPIFLSVGYSTC 126

Query: 151 HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 210
           HWCH+ME ESF++E + +LL++ FVS+KVDREERPDVDKVYMT+VQA   GGGWP++V+L
Sbjct: 127 HWCHMMEKESFQNEEIGRLLSEDFVSVKVDREERPDVDKVYMTFVQATSSGGGWPMNVWL 186

Query: 211 SPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS 270
           +P+L+P +GGTYFPPED   R GF+T+L ++++ W + ++ L +S     ++++ AL A 
Sbjct: 187 APNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWKQNKNTLLESS----QRVTTALLAR 242

Query: 271 ASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 325
           +  +    +LP +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S +L
Sbjct: 243 SEISVGDRQLPPSAATMSNRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRL 302

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
              G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLYDQ QL
Sbjct: 303 TQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDCQWHVPHFEKMLYDQAQL 357

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
           A  Y  AF ++ D FYS + + IL Y+ R +    G  +SAEDADS    G    KEGA+
Sbjct: 358 AVAYSQAFQISGDEFYSDVAKGILQYVARSLSHRSGGFYSAEDADSPPERGMX-PKEGAY 416

Query: 446 YVWTSKEVEDILGE----------HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 495
           YVWT KE + +L E             L  +HY L   GN  +S   DP  E +G+NVL 
Sbjct: 417 YVWTVKEFQQLLPEPVPGATEPLTSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLT 474

Query: 496 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 555
                  +A++ G+ +E    +L     KLF  R  RP+PHLD+K++ +WNGL++S +A 
Sbjct: 475 VRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKHRPKPHLDNKMLAAWNGLMVSGYAV 534

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP--- 612
              +L              G DR   +  A + A F++RH++D  + RL  +   G    
Sbjct: 535 TGAVL--------------GQDR--LINYATNGAKFLKRHMFDVASGRLIRTCYTGSGGT 578

Query: 613 ---SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 667
              S  P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD+LF D +GGGYF +
Sbjct: 579 VEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLEWALRLQDTQDKLFWDSQGGGYFCS 638

Query: 668 TGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 726
             E  + L LR+K+D DGAEPS NSVS  NL+RL     G K   +       L  F  R
Sbjct: 639 EAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSER 695

Query: 727 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           ++ + +A+P M CA       + K +V+ G + + D + ++   H+ Y  NK +
Sbjct: 696 MRRVPVALPEMVCALSA-QQQTLKQIVICGDRQAKDTKALVRCVHSVYIPNKVL 748


>gi|402899623|ref|XP_003912790.1| PREDICTED: spermatogenesis-associated protein 20 isoform 3 [Papio
           anubis]
          Length = 786

 Score =  577 bits (1486), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 312/714 (43%), Positives = 431/714 (60%), Gaps = 59/714 (8%)

Query: 91  PASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTC 150
           P+ST     +  NRL  E SPYLLQHA+NPVDW+ WG+EAF +ARK + PIFLS+GYSTC
Sbjct: 54  PSSTPQ---RVPNRLIHEKSPYLLQHAYNPVDWYPWGQEAFDKARKENKPIFLSVGYSTC 110

Query: 151 HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 210
           HWCH+ME ESF++E + +LL++ FVS+KVDREERPDVDKVYMT+VQA   GGGWP++V+L
Sbjct: 111 HWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPDVDKVYMTFVQATSSGGGWPMNVWL 170

Query: 211 SPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS 270
           +P+L+P +GGTYFPPED   R GF+T+L ++++ W + ++ L ++     ++++ AL A 
Sbjct: 171 TPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWKQNKNTLLENS----QRVTTALLAR 226

Query: 271 ASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 325
           +  +    +LP +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S +L
Sbjct: 227 SEISMGDRQLPPSAATMNNRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRL 286

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
              G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLYDQ QL
Sbjct: 287 TQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDCQWHVPHFEKMLYDQAQL 341

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
           A  Y  AF ++ D FYS + + IL Y+ R +    G  +SAEDADS    G  R KEGA+
Sbjct: 342 AVAYSQAFQISGDEFYSDVAKGILQYVARSLSHRSGGFYSAEDADSPPERG-MRPKEGAY 400

Query: 446 YVWTSKEVEDILGEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 495
           YVWT KEV+ +L E  +          L  +HY L   GN  +S   DP  E +G+NVL 
Sbjct: 401 YVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLT 458

Query: 496 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 555
                  +A++ G+ +E    +L     KLF  R  RP+PHLD K++ +WNGL++S +A 
Sbjct: 459 VRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGYAV 518

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP--- 612
              +L              G DR   +  A + A F++RH++D  + RL  +   G    
Sbjct: 519 TGAVL--------------GQDR--LISYATNGAKFLKRHMFDVASGRLMRTCYTGSGGT 562

Query: 613 ---SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 667
              S  P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD LF D +GGGYF +
Sbjct: 563 VEHSSPPCWGFLEDYAFVVRGLLDLYEASQESAWLEWALRLQDTQDRLFWDSQGGGYFCS 622

Query: 668 TGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 726
             E  + L LR+K+D DGAEPS NSVS  NL+RL     G K   +       L  F  R
Sbjct: 623 EAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSER 679

Query: 727 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           ++ + +A+P M  A       + K +V+ G + + D + ++   H+ Y  NK +
Sbjct: 680 MRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKDTKALVQCVHSVYIPNKVL 732


>gi|109114325|ref|XP_001099321.1| PREDICTED: spermatogenesis-associated protein 20 isoform 1 [Macaca
           mulatta]
          Length = 742

 Score =  576 bits (1485), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 312/714 (43%), Positives = 431/714 (60%), Gaps = 59/714 (8%)

Query: 91  PASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTC 150
           P+ST     +  NRL  E SPYLLQHA+NPVDW+ WG+EAF +ARK + PIFLS+GYSTC
Sbjct: 10  PSSTPQ---RVPNRLIHEKSPYLLQHAYNPVDWYPWGQEAFDKARKENKPIFLSVGYSTC 66

Query: 151 HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 210
           HWCH+ME ESF++E + +LL++ FVS+KVDREERPDVDKVYMT+VQA   GGGWP++V+L
Sbjct: 67  HWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPDVDKVYMTFVQATSSGGGWPMNVWL 126

Query: 211 SPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS 270
           +P+L+P +GGTYFPPED   R GF+T+L ++++ W + ++ L ++     ++++ AL A 
Sbjct: 127 TPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWKQNKNTLLENS----QRVTTALLAR 182

Query: 271 ASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 325
           +  +    +LP +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S +L
Sbjct: 183 SEISMGDRQLPPSAATMNNRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRL 242

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
              G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLYDQ QL
Sbjct: 243 TQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDCQWHVPHFEKMLYDQAQL 297

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
           A  Y  AF ++ D FYS + + IL Y+ R +    G  +SAEDADS    G  R KEGA+
Sbjct: 298 AVAYSQAFQISGDEFYSDVAKGILQYVARSLSHRSGGFYSAEDADSPPERG-MRPKEGAY 356

Query: 446 YVWTSKEVEDILGEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 495
           YVWT KEV+ +L E  +          L  +HY L   GN  +S   DP  E +G+NVL 
Sbjct: 357 YVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLT 414

Query: 496 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 555
                  +A++ G+ +E    +L     KLF  R  RP+PHLD K++ +WNGL++S +A 
Sbjct: 415 VRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGYAV 474

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP--- 612
              +L              G DR   +  A + A F++RH++D  + RL  +   G    
Sbjct: 475 TGAVL--------------GQDR--LINYATNGAKFLKRHMFDVASGRLMRTCYTGSGGT 518

Query: 613 ---SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 667
              S  P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD LF D +GGGYF +
Sbjct: 519 VEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLEWALRLQDTQDRLFWDSQGGGYFCS 578

Query: 668 TGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 726
             E  + L LR+K+D DGAEPS NSVS  NL+RL     G K   +       L  F  R
Sbjct: 579 EAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSER 635

Query: 727 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           ++ + +A+P M  A       + K +V+ G + + D + ++   H+ Y  NK +
Sbjct: 636 MRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKDTKALVQCVHSVYIPNKVL 688


>gi|182413448|ref|YP_001818514.1| hypothetical protein Oter_1630 [Opitutus terrae PB90-1]
 gi|177840662|gb|ACB74914.1| protein of unknown function DUF255 [Opitutus terrae PB90-1]
          Length = 751

 Score =  576 bits (1485), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 321/697 (46%), Positives = 411/697 (58%), Gaps = 48/697 (6%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N LA E SPYLLQHA NPV+W  WGE AFA+AR    PIFLSIGY+TCHWCHVM  ESFE
Sbjct: 3   NALAQEKSPYLLQHADNPVNWLPWGEAAFAKARAEQKPIFLSIGYATCHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +E VA+LLN+ FV+IKVDREERPDVD+VYMTYVQA+ G GGWPLS +L+PDLKP  GGTY
Sbjct: 63  NEAVAQLLNESFVAIKVDREERPDVDRVYMTYVQAMTGHGGWPLSAWLTPDLKPFFGGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE--------------ALS 268
           FPPED+ GR GF  ILR +   W  +R+ L   G   I  L E                S
Sbjct: 123 FPPEDRQGRAGFAAILRAIAHGWSTEREKLVAEGERVIAALREHQQSKTADVSKSTGGES 182

Query: 269 ASASSNKLPDELPQN-------ALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYH 321
           A A      D L          A     +   +++D   GGFG APKFPR   +   L+ 
Sbjct: 183 AGAEIGSGIDALIHQLHERGAPAFERGFQYFYEAFDPEHGGFGGAPKFPRASNLS-FLFR 241

Query: 322 SKKLEDTGKSGEA-SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 380
           +  L+  G + EA +E  ++   TLQ MA+GGIHDHVGGGFHRYSVDERW VPHFEKMLY
Sbjct: 242 AAALQ--GVASEAGAEAIRLASATLQAMARGGIHDHVGGGFHRYSVDERWFVPHFEKMLY 299

Query: 381 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEG---- 436
           DQ Q+A   L+A   T D  ++++ RDIL Y+ RD+  P G  +SAEDADSA        
Sbjct: 300 DQAQIALNALEAKQATGDERFAWLARDILTYVLRDLAHPDGGFYSAEDADSAAANAEPGH 359

Query: 437 ATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 496
             +K EGAFYVW   E+E +LG+ A L  EH+ +KP GN  +    DPH EF GKNVL +
Sbjct: 360 GGKKVEGAFYVWAQSEIEQVLGDEARLVCEHFGVKPDGN--VPGQLDPHGEFTGKNVLAQ 417

Query: 497 LNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARA 556
               + +A    +  E     L     +L  VR++RPRP  DDK+I +WNGL+IS+ A+A
Sbjct: 418 AQPLATTAKAHELTPEMASERLQAALERLRAVRAQRPRPLRDDKIITAWNGLMISALAKA 477

Query: 557 SKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP 616
             +L+   ++A             Y+  A   A F+ R L+D     L  S+R G S   
Sbjct: 478 HVVLELAEDAA----------ETLYLGAATRTAEFVERELFDRDRAILFRSWRGGRSAVE 527

Query: 617 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 676
           GF +DYAF+I GLLDLYE G   +WL WA  LQ T D  F D E GGYFN+  +DP ++L
Sbjct: 528 GFAEDYAFMIQGLLDLYEAGFDVRWLQWAERLQATMDARFWDAEHGGYFNSASDDPHLVL 587

Query: 677 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDY------YRQNAEHSLAVFETRLKDM 730
           R+KED+DGAEP+ +SV+ +NL+RL  ++    +        YR+    ++  F+ +    
Sbjct: 588 RLKEDYDGAEPAPSSVAAMNLLRLGVMIERPGAAAAAGGIDYRERGLRTILAFQEQWSQT 647

Query: 731 AMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENML 767
             A+P M CA +   +P   HVVL G      F  +L
Sbjct: 648 PQALPQMLCALERALMPP-AHVVLAGQPGDEAFRALL 683


>gi|297700800|ref|XP_002827420.1| PREDICTED: spermatogenesis-associated protein 20 isoform 2 [Pongo
           abelii]
          Length = 802

 Score =  576 bits (1485), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 310/710 (43%), Positives = 429/710 (60%), Gaps = 56/710 (7%)

Query: 95  SHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCH 154
           S +  +  NRL  E SPYLLQHA+NPVDW+ WG+EAF +ARK + PIFLS+GYSTCHWCH
Sbjct: 71  SSAPQRVPNRLIHEKSPYLLQHAYNPVDWYPWGQEAFDKARKENKPIFLSVGYSTCHWCH 130

Query: 155 VMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDL 214
           +ME ESF++E + +LL++ FVS+KVDREERPDVDKVYMT+VQA   GGGWP++V+L+P+L
Sbjct: 131 MMEEESFQNEEIGRLLSEDFVSVKVDREERPDVDKVYMTFVQATSSGGGWPMNVWLTPNL 190

Query: 215 KPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSN 274
           +P +GGTYFPPED   R GF+T+L ++++ W + ++ L ++     ++++ AL A +  +
Sbjct: 191 QPFVGGTYFPPEDGLTRVGFRTVLLRIREQWKQNKNTLLENS----QRVTTALLARSEIS 246

Query: 275 KLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTG 329
               +LP +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S +L   G
Sbjct: 247 VGDRQLPPSAATMNNRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRLTQDG 306

Query: 330 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 389
                S  Q+M L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLYDQ QLA  Y
Sbjct: 307 -----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHVPHFEKMLYDQAQLAVAY 361

Query: 390 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 449
             AF ++ D FYS + + IL Y+ R +    G  +SAEDADS    G  R KEGA+YVWT
Sbjct: 362 SQAFQISGDEFYSDMAKGILQYVARSLSHRSGGFYSAEDADSPPERG-MRPKEGAYYVWT 420

Query: 450 SKEVEDILGEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 499
            KEV+ +L E  +          L  +HY L   GN  +S   DP  E +G+NVL     
Sbjct: 421 VKEVQQLLPEPVLGATEPLTSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLTVRYS 478

Query: 500 SSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 559
              +A++ G+ +E    +L     KLF  R  RP+PHLD K++ +WNGL++S +A    +
Sbjct: 479 LELTAARFGLDVEAVRTLLNTGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGYAVTGAV 538

Query: 560 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP------S 613
           L              G DR   +  A + A F++RH++D  + RL  +   G       S
Sbjct: 539 L--------------GQDR--LINYATNGAKFLKRHMFDVASGRLMRTCYTGSGGTVEHS 582

Query: 614 KAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGED 671
             P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD LF D +GGGYF +  E 
Sbjct: 583 NPPCWGFLEDYAFVVRGLLDLYEASQESAWLEWALRLQDTQDRLFWDSQGGGYFCSEAEL 642

Query: 672 PSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDM 730
            + L LR+K+D DGAEPS NSVS  NL+RL     G K   +       L  F  R++ +
Sbjct: 643 GAGLPLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSERMRRV 699

Query: 731 AMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
            +A+P M  A       + K +V+ G + + D + ++   H+ Y  NK +
Sbjct: 700 PVALPEMVRALSA-QQQTLKQIVICGDRQAKDTKALVQCVHSVYIPNKVL 748


>gi|297700802|ref|XP_002827421.1| PREDICTED: spermatogenesis-associated protein 20 isoform 3 [Pongo
           abelii]
          Length = 742

 Score =  576 bits (1484), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 309/702 (44%), Positives = 426/702 (60%), Gaps = 56/702 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYLLQHA+NPVDW+ WG+EAF +ARK + PIFLS+GYSTCHWCH+ME ESF+
Sbjct: 19  NRLIHEKSPYLLQHAYNPVDWYPWGQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQ 78

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +E + +LL++ FVS+KVDREERPDVDKVYMT+VQA   GGGWP++V+L+P+L+P +GGTY
Sbjct: 79  NEEIGRLLSEDFVSVKVDREERPDVDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTY 138

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPPED   R GF+T+L ++++ W + ++ L ++     ++++ AL A +  +    +LP 
Sbjct: 139 FPPEDGLTRVGFRTVLLRIREQWKQNKNTLLENS----QRVTTALLARSEISVGDRQLPP 194

Query: 283 NALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEG 337
           +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S +L   G     S  
Sbjct: 195 SAATMNNRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRA 249

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
           Q+M L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLYDQ QLA  Y  AF ++ 
Sbjct: 250 QQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQISG 309

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           D FYS + + IL Y+ R +    G  +SAEDADS    G  R KEGA+YVWT KEV+ +L
Sbjct: 310 DEFYSDMAKGILQYVARSLSHRSGGFYSAEDADSPPERG-MRPKEGAYYVWTVKEVQQLL 368

Query: 458 GEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 507
            E  +          L  +HY L   GN  +S   DP  E +G+NVL        +A++ 
Sbjct: 369 PEPVLGATEPLTSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARF 426

Query: 508 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 567
           G+ +E    +L     KLF  R  RP+PHLD K++ +WNGL++S +A    +L       
Sbjct: 427 GLDVEAVRTLLNTGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGYAVTGAVL------- 479

Query: 568 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP------SKAP--GFL 619
                  G DR   +  A + A F++RH++D  + RL  +   G       S  P  GFL
Sbjct: 480 -------GQDR--LINYATNGAKFLKRHMFDVASGRLMRTCYTGSGGTVEHSNPPCWGFL 530

Query: 620 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRV 678
           +DYAF++ GLLDLYE    + WL WA+ LQ+TQD LF D +GGGYF +  E  + L LR+
Sbjct: 531 EDYAFVVRGLLDLYEASQESAWLEWALRLQDTQDRLFWDSQGGGYFCSEAELGAGLPLRL 590

Query: 679 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 738
           K+D DGAEPS NSVS  NL+RL     G K   +       L  F  R++ + +A+P M 
Sbjct: 591 KDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMV 647

Query: 739 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
            A       + K +V+ G + + D + ++   H+ Y  NK +
Sbjct: 648 RALSA-QQQTLKQIVICGDRQAKDTKALVQCVHSVYIPNKVL 688


>gi|402899619|ref|XP_003912788.1| PREDICTED: spermatogenesis-associated protein 20 isoform 1 [Papio
           anubis]
          Length = 742

 Score =  576 bits (1484), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 312/714 (43%), Positives = 431/714 (60%), Gaps = 59/714 (8%)

Query: 91  PASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTC 150
           P+ST     +  NRL  E SPYLLQHA+NPVDW+ WG+EAF +ARK + PIFLS+GYSTC
Sbjct: 10  PSSTPQ---RVPNRLIHEKSPYLLQHAYNPVDWYPWGQEAFDKARKENKPIFLSVGYSTC 66

Query: 151 HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 210
           HWCH+ME ESF++E + +LL++ FVS+KVDREERPDVDKVYMT+VQA   GGGWP++V+L
Sbjct: 67  HWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPDVDKVYMTFVQATSSGGGWPMNVWL 126

Query: 211 SPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS 270
           +P+L+P +GGTYFPPED   R GF+T+L ++++ W + ++ L ++     ++++ AL A 
Sbjct: 127 TPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWKQNKNTLLENS----QRVTTALLAR 182

Query: 271 ASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 325
           +  +    +LP +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S +L
Sbjct: 183 SEISMGDRQLPPSAATMNNRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRL 242

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
              G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLYDQ QL
Sbjct: 243 TQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDCQWHVPHFEKMLYDQAQL 297

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
           A  Y  AF ++ D FYS + + IL Y+ R +    G  +SAEDADS    G  R KEGA+
Sbjct: 298 AVAYSQAFQISGDEFYSDVAKGILQYVARSLSHRSGGFYSAEDADSPPERG-MRPKEGAY 356

Query: 446 YVWTSKEVEDILGEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 495
           YVWT KEV+ +L E  +          L  +HY L   GN  +S   DP  E +G+NVL 
Sbjct: 357 YVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLT 414

Query: 496 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 555
                  +A++ G+ +E    +L     KLF  R  RP+PHLD K++ +WNGL++S +A 
Sbjct: 415 VRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGYAV 474

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP--- 612
              +L              G DR   +  A + A F++RH++D  + RL  +   G    
Sbjct: 475 TGAVL--------------GQDR--LISYATNGAKFLKRHMFDVASGRLMRTCYTGSGGT 518

Query: 613 ---SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 667
              S  P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD LF D +GGGYF +
Sbjct: 519 VEHSSPPCWGFLEDYAFVVRGLLDLYEASQESAWLEWALRLQDTQDRLFWDSQGGGYFCS 578

Query: 668 TGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 726
             E  + L LR+K+D DGAEPS NSVS  NL+RL     G K   +       L  F  R
Sbjct: 579 EAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSER 635

Query: 727 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           ++ + +A+P M  A       + K +V+ G + + D + ++   H+ Y  NK +
Sbjct: 636 MRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKDTKALVQCVHSVYIPNKVL 688


>gi|344285393|ref|XP_003414446.1| PREDICTED: spermatogenesis-associated protein 20 [Loxodonta
           africana]
          Length = 789

 Score =  575 bits (1483), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 313/714 (43%), Positives = 427/714 (59%), Gaps = 56/714 (7%)

Query: 91  PASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTC 150
           P+       +  NRL  E SPYLLQHA+NPVDW+ WG+EAF +ARK + PIFLS+GYSTC
Sbjct: 54  PSCPPSIPQRAPNRLVNEKSPYLLQHAYNPVDWYPWGQEAFDKARKENKPIFLSVGYSTC 113

Query: 151 HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 210
           HWCH+ME ESF++E + +LL++ FVS+KVDREERPDVDKVYMT+VQA   GGGWP+SV+L
Sbjct: 114 HWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPDVDKVYMTFVQATSSGGGWPMSVWL 173

Query: 211 SPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS 270
           +P+L+P +GGTYFPPED   R GF+T+L +++D W + R+ L ++     ++++ AL A 
Sbjct: 174 TPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIRDQWKQNRNTLLENS----QRVTAALLAR 229

Query: 271 ASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 325
           +  +    +LP +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S ++
Sbjct: 230 SEISMGDRQLPPSAATMNSRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRI 289

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
              G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +W VPHFEKMLYDQ QL
Sbjct: 290 TQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWLVPHFEKMLYDQAQL 344

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
           A  Y  AF ++ D FYS + + IL Y+ R +    G  +SAEDADS    G  R KEGAF
Sbjct: 345 AVAYSQAFQISGDEFYSDVAKGILQYVSRSLSHRSGGFYSAEDADSPPERG-MRPKEGAF 403

Query: 446 YVWTSKEVEDILGEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 495
           Y+WT KE++ +L E  +          L  +HY L   GN  +S   DP  E +G+NVL 
Sbjct: 404 YLWTVKEIQQLLPEPVLGASEPLTSGQLLTKHYGLTEAGN--ISPNQDPKGELQGQNVLN 461

Query: 496 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 555
                  +A++ G+ +E    +L     KLF VR  RPRPHLD K++ +WNGL++S +A 
Sbjct: 462 VRYSLELTAARFGLDVEAVRTLLNLGLEKLFQVRKHRPRPHLDSKMLAAWNGLMVSGYAV 521

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP--- 612
              +L              G DR   +  A + A F++RH++D  T RL  +   G    
Sbjct: 522 TGAVL--------------GMDR--LINCAINGAKFLKRHMFDVATGRLMRTCYAGSGGT 565

Query: 613 ---SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 667
              S  P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD LF D  GGGYF +
Sbjct: 566 VEHSDPPCWGFLEDYAFVVRGLLDLYEASQESAWLEWALRLQDTQDRLFWDSRGGGYFCS 625

Query: 668 TGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 726
             E  + L LR+K+D DGAEPS NSVS  NL+RL     G K   +       L  F  R
Sbjct: 626 EAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSER 682

Query: 727 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           ++ + +A+P M  A       + K +V+ G   + D + ++   H+ Y  NK +
Sbjct: 683 MRRVPVALPEMVRALSA-HQQTLKQIVICGDPQAKDTKALVQCVHSVYIPNKVL 735


>gi|307166116|gb|EFN60365.1| Spermatogenesis-associated protein 20 [Camponotus floridanus]
          Length = 754

 Score =  574 bits (1480), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 312/708 (44%), Positives = 429/708 (60%), Gaps = 51/708 (7%)

Query: 83  VVAMAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIF 142
           + + ++ +  +TSHS  K  NRL+ E SPYLLQHA NPV+W+ WG+EA  +A+K D  IF
Sbjct: 1   MASTSKSSAKNTSHSSAKKLNRLSLEKSPYLLQHATNPVEWYPWGDEALEKAKKEDKLIF 60

Query: 143 LSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGG 202
           LS+GYSTCHWCHVME ESFE+E +A+++N+ FV+IKVDREERPD+D++YMT+VQA  G G
Sbjct: 61  LSVGYSTCHWCHVMEKESFENEDIARIMNENFVNIKVDREERPDIDRIYMTFVQAKSGHG 120

Query: 203 GWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQ 262
           GWP+SVFLSPDL P+ GGTYFPP+ KYG  GFK++L  V   W +++  + +S A  +E+
Sbjct: 121 GWPMSVFLSPDLMPVTGGTYFPPDGKYGLIGFKSLLLAVAKEWTQQKSNIIKSAANIVER 180

Query: 263 LSEALSASASSNKLPDELPQ-NALRLCAEQLSKSYDSRFGGFGS-----APKFPRPVEIQ 316
           L + +       K  D  P      LC   L+  Y+ +FGGF S     +PKFP PV   
Sbjct: 181 LKDIVECKQGLKK-DDGFPTAECALLCVHLLANGYEPKFGGFSSRSWMNSPKFPEPVNFN 239

Query: 317 MMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFE 376
             L+ +  L  +  S    +  +M L TL  MA GGIHDHVG GF RYSVD  WHVPHFE
Sbjct: 240 -FLFSTYAL--STSSELRKQCLEMCLHTLTKMAYGGIHDHVGQGFSRYSVDGEWHVPHFE 296

Query: 377 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEG 436
           KMLYDQ Q+   Y DA+ +TKD FYS I  DI  Y+ RD+    G  +SAEDADS     
Sbjct: 297 KMLYDQAQIIQAYADAYVITKDSFYSDIVDDIATYVVRDLRHKEGGFYSAEDADSLPEPQ 356

Query: 437 ATRKKEGAFYVWTSKEVEDIL-----GEHAILFKE----HYYLKPTGNCDLSRMSDPHNE 487
           A+ K+EGAFYVW  KEV+ +L     G   + F +    H+ +K  GN  + +  DPH E
Sbjct: 357 ASAKREGAFYVWPYKEVKTLLDKKIPGNDNVRFSDLICYHFNVKKEGN--VRKAQDPHGE 414

Query: 488 FKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNG 547
             GKNV I  +    +A   G+ +E   + + E  + LF+ RSKRPRPHLDDK++ +WNG
Sbjct: 415 LTGKNVFIVYDGIEQTAEHFGISVENTKSYIKEACQILFEERSKRPRPHLDDKIVTAWNG 474

Query: 548 LVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS 607
           L+IS FARA   ++++                +Y+E+A  AA F++++L+D+    L  S
Sbjct: 475 LMISGFARAGAAVRND----------------KYVELATDAAKFVKQYLFDKNKGVLLRS 518

Query: 608 FRNG------PSKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR 659
              G       +  P  GF DDYAF++ GLLDLYE     +WL +A ELQ+ QD LF D 
Sbjct: 519 CYRGEDDRIMQTSVPIHGFHDDYAFVVKGLLDLYEANFDAQWLEFAEELQDIQDRLFWDS 578

Query: 660 EGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 719
           + GGYF+T  E+  ++LR+K+ HDGAEPS NS++  NL+RLA+ +  S+    +  A   
Sbjct: 579 QDGGYFSTV-ENSQMILRMKDAHDGAEPSSNSIACSNLLRLATYLDRSE---LKDKAGQL 634

Query: 720 LAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENML 767
           L+ F   L +M +  P +  A  +L   +   + + G   + D   ML
Sbjct: 635 LSAFGKGLTEMPIMFPQLTLA--LLEYHNATQIYIAGRPDAEDTIEML 680


>gi|410298424|gb|JAA27812.1| spermatogenesis associated 20 [Pan troglodytes]
          Length = 802

 Score =  573 bits (1478), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 311/714 (43%), Positives = 429/714 (60%), Gaps = 59/714 (8%)

Query: 91  PASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTC 150
           P+ST     +  NRL  E SPYLLQHA+NPVDW+ WG+EAF +ARK + PIFLS+G  TC
Sbjct: 70  PSSTPQ---RVPNRLIHEKSPYLLQHAYNPVDWYPWGQEAFDKARKENKPIFLSVGSPTC 126

Query: 151 HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 210
           HWCH+ME ESF++E + +LL++ FVS+KVDREERPDVDKVYM +VQA   GGGWP++V+L
Sbjct: 127 HWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPDVDKVYMMFVQATSSGGGWPMNVWL 186

Query: 211 SPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS 270
           +P+L+P +GGTYFPPED   R GF+T+L ++++ W + ++ L ++     ++++ AL A 
Sbjct: 187 TPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWKQNKNTLLENS----QRVTTALLAR 242

Query: 271 ASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKL 325
           +  +    +LP +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S +L
Sbjct: 243 SEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRL 302

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
              G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLYDQ QL
Sbjct: 303 TQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHVPHFEKMLYDQAQL 357

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
           A  Y  AF L+ D FYS + + IL Y+ R +    G  +SAEDADS    G  R KEGA+
Sbjct: 358 AVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRSGGFYSAEDADSPPERG-LRPKEGAY 416

Query: 446 YVWTSKEVEDILGEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 495
           YVWT KEV+ +L E  +          L  +HY L   GN  +S   DP  E +G+NVL 
Sbjct: 417 YVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLT 474

Query: 496 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 555
                  +A++ G+ +E    +L     KLF  R  RP+PHLD K++ +WNGL++S +A 
Sbjct: 475 VRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGYAV 534

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP--- 612
              +L              G DR   +  A + A F++RH++D  + RL  +   GP   
Sbjct: 535 TGAVL--------------GQDR--LINYATNGAKFLKRHMFDVASGRLMRTCYTGPGGT 578

Query: 613 ---SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 667
              S  P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD LF D +GGGYF +
Sbjct: 579 VEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLEWALRLQDTQDRLFWDSQGGGYFCS 638

Query: 668 TGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 726
             E  + L LR+K+D DGAEPS NSVS  NL+RL     G K   +       L  F  R
Sbjct: 639 EAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSER 695

Query: 727 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           ++ + +A+P M  A       + K +V+ G + + D + ++   H+ Y  NK +
Sbjct: 696 MRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKDTKALVQCVHSVYIPNKVL 748


>gi|344252175|gb|EGW08279.1| Spermatogenesis-associated protein 20 [Cricetulus griseus]
          Length = 1263

 Score =  573 bits (1478), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 308/706 (43%), Positives = 425/706 (60%), Gaps = 56/706 (7%)

Query: 99   NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
             K  NRL  E SPYLLQHA+NPVDW+ WG+EAF +A+K + PIFLS+GYSTCHWCH+ME 
Sbjct: 536  QKTPNRLIHEKSPYLLQHAYNPVDWYPWGQEAFDKAKKENKPIFLSVGYSTCHWCHMMEE 595

Query: 159  ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
            ESF++E + +LLN+ FVS+KVDREERPDVDKVYMT+VQA   GGGWP++V+++P L+P +
Sbjct: 596  ESFQNEEIGRLLNEDFVSVKVDREERPDVDKVYMTFVQATSSGGGWPMNVWMTPSLQPFV 655

Query: 219  GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GGTYFPPED   R GF+T+L +++D W + ++ L ++     ++++ AL A +  +    
Sbjct: 656  GGTYFPPEDGLTRVGFRTVLTRIRDQWKQNKNTLLENS----QRVTTALLARSEISVGDR 711

Query: 279  ELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGE 333
            ++P +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S +L   G    
Sbjct: 712  QVPPSAATMNTRCFQQLDEGYDEEYGGFAEAPKFPTPVILNFLFSYWLSHRLAQDG---- 767

Query: 334  ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
             S  Q+M L TL+ MA GGI DHVG GFHRYS D +WH+PHFEKMLYDQ QLA VY  AF
Sbjct: 768  -SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHIPHFEKMLYDQAQLAVVYSQAF 826

Query: 394  SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
             ++ D FYS + + IL Y+ R +    G  +SAEDADSA   G  + KEGAFYVWT +E+
Sbjct: 827  QISGDEFYSDVAKGILQYVTRSLSHRSGGFYSAEDADSAPERG-MKPKEGAFYVWTVQEI 885

Query: 454  EDILGE----------HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 503
            + +L E             L  +HY L   GN + ++  DP  E +G+NVL        +
Sbjct: 886  QQLLPEPVGGASEPLTSGQLLMKHYGLSEAGNINSNQ--DPKGELQGQNVLTVRYSLELT 943

Query: 504  ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 563
            A++ G+ +E    +L     KLF  R  RP+ HLD K++ +WNGL++S FA    +L   
Sbjct: 944  AARFGLDVEAVSTLLNTGLEKLFQARKHRPKAHLDSKMLAAWNGLMVSGFAVTGAVL--- 1000

Query: 564  AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP------SKAP- 616
                       G D+   +  A + A F++RH++D  + RL+ +   G       S  P 
Sbjct: 1001 -----------GMDK--LVTQATNGAKFLKRHMFDVASGRLKRTCYAGTGGSVEHSNPPC 1047

Query: 617  -GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL 675
             GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD LF D  GGGYF +  E  S L
Sbjct: 1048 WGFLEDYAFVVRGLLDLYEASQESSWLEWALRLQDTQDRLFWDSRGGGYFCSEAELGSDL 1107

Query: 676  -LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV 734
             LR+K+D DGAEPS NSVS  NL+RL     G K   +       L  F  R++ + +A+
Sbjct: 1108 PLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSERMRRVPVAL 1164

Query: 735  PLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
            P M  A       + K +V+ G     D + +L   H+ Y  NK +
Sbjct: 1165 PEMVRALSA-QQETLKQIVICGDPQGKDTKALLQCVHSIYLPNKVL 1209


>gi|348562581|ref|XP_003467088.1| PREDICTED: spermatogenesis-associated protein 20-like [Cavia
           porcellus]
          Length = 789

 Score =  573 bits (1477), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 310/715 (43%), Positives = 429/715 (60%), Gaps = 60/715 (8%)

Query: 92  ASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCH 151
           +S  ++  K  NRL  E SPYLLQHA+NPVDW++WG+EAF +A+K + PIFLS+GYSTCH
Sbjct: 55  SSAINTTQKTPNRLINEKSPYLLQHAYNPVDWYSWGQEAFDKAKKENKPIFLSVGYSTCH 114

Query: 152 WCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLS 211
           WCH+ME E+F++E +A+LLN+ FVS+KVDREERPDVDKVYMT+VQA   GGGWP++V+L+
Sbjct: 115 WCHMMEEETFQNEEIARLLNEDFVSVKVDREERPDVDKVYMTFVQATSSGGGWPMNVWLT 174

Query: 212 PDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASA 271
           P L+P +GGTYFPPED   R GF+T+L +++D W + ++ L  S     ++++ AL A +
Sbjct: 175 PSLQPFVGGTYFPPEDGLTRVGFRTVLLRIRDQWKQNKNTLLDSS----QRVTTALLARS 230

Query: 272 SSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 326
             +    ++P  A  +   C +QL + YD  +GGF  APKFP PV +  +   +   ++ 
Sbjct: 231 EISMGDRQMPPTAATMSSRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLGHRMA 290

Query: 327 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 386
             G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +W VPHFEKMLYDQGQLA
Sbjct: 291 QDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWQVPHFEKMLYDQGQLA 345

Query: 387 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 446
             Y  AF ++ D FYS + + IL Y+ R +    G  +SAEDADS    G  R KEGAFY
Sbjct: 346 VSYSQAFQISGDEFYSDVAKGILQYVSRSLSHRSGGFYSAEDADSPPERG-MRPKEGAFY 404

Query: 447 VWTSKEVEDILGE----------HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 496
           VWT KEV+ +L E             L  +HY L  TGN  ++   D   E  G+NVL  
Sbjct: 405 VWTVKEVQRLLPEAVPGATEPLTAGQLLIKHYGLTETGN--INTCQDSKGELHGQNVLTV 462

Query: 497 LNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARA 556
                 +A++ G+ +E   ++L     KL   R +RP+PHLD K++ +WNGL++S +A  
Sbjct: 463 RYSLELTAARFGLEVEAVRSLLTAGVDKLLQARKQRPKPHLDSKMLAAWNGLMVSGYAVT 522

Query: 557 SKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP 616
             +L              G D+   +  A + A F++RH++D  T RL+ +   G     
Sbjct: 523 GAVL--------------GIDK--LVHSATNCAKFLKRHMFDVATGRLRRTCYAGTGTTV 566

Query: 617 --------GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTT 668
                   GFL+DYAF++ GLLDLYE    + WL WA+ LQ+ QD LF D +GGGYF + 
Sbjct: 567 EHRDPPCWGFLEDYAFVVRGLLDLYEASQESAWLEWALRLQDAQDRLFWDSQGGGYFCSE 626

Query: 669 GE-DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 727
            E   S+ LRVK+D DGAEPS NSV+  NL+RL         D+  + A   L  F  R+
Sbjct: 627 AELGGSLPLRVKDDQDGAEPSANSVAAHNLLRLHGFTG--HKDWLDKCA-CLLTAFSERM 683

Query: 728 KDMAMAVPLMCCAADMLSVPSR--KHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           + + +A+P M  A   LS   +  K +V+ G +++ D   +L   HA Y  NK +
Sbjct: 684 RRVPVALPEMVRA---LSAHQQGLKQIVICGERTAKDTRALLQCVHALYIPNKVL 735


>gi|354478455|ref|XP_003501430.1| PREDICTED: spermatogenesis-associated protein 20 [Cricetulus
           griseus]
          Length = 789

 Score =  572 bits (1475), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 308/706 (43%), Positives = 425/706 (60%), Gaps = 56/706 (7%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
            K  NRL  E SPYLLQHA+NPVDW+ WG+EAF +A+K + PIFLS+GYSTCHWCH+ME 
Sbjct: 62  QKTPNRLIHEKSPYLLQHAYNPVDWYPWGQEAFDKAKKENKPIFLSVGYSTCHWCHMMEE 121

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESF++E + +LLN+ FVS+KVDREERPDVDKVYMT+VQA   GGGWP++V+++P L+P +
Sbjct: 122 ESFQNEEIGRLLNEDFVSVKVDREERPDVDKVYMTFVQATSSGGGWPMNVWMTPSLQPFV 181

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
           GGTYFPPED   R GF+T+L +++D W + ++ L ++     ++++ AL A +  +    
Sbjct: 182 GGTYFPPEDGLTRVGFRTVLTRIRDQWKQNKNTLLENS----QRVTTALLARSEISVGDR 237

Query: 279 ELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGE 333
           ++P +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S +L   G    
Sbjct: 238 QVPPSAATMNTRCFQQLDEGYDEEYGGFAEAPKFPTPVILNFLFSYWLSHRLAQDG---- 293

Query: 334 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
            S  Q+M L TL+ MA GGI DHVG GFHRYS D +WH+PHFEKMLYDQ QLA VY  AF
Sbjct: 294 -SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHIPHFEKMLYDQAQLAVVYSQAF 352

Query: 394 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
            ++ D FYS + + IL Y+ R +    G  +SAEDADSA   G  + KEGAFYVWT +E+
Sbjct: 353 QISGDEFYSDVAKGILQYVTRSLSHRSGGFYSAEDADSAPERG-MKPKEGAFYVWTVQEI 411

Query: 454 EDILGE----------HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 503
           + +L E             L  +HY L   GN + ++  DP  E +G+NVL        +
Sbjct: 412 QQLLPEPVGGASEPLTSGQLLMKHYGLSEAGNINSNQ--DPKGELQGQNVLTVRYSLELT 469

Query: 504 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 563
           A++ G+ +E    +L     KLF  R  RP+ HLD K++ +WNGL++S FA    +L   
Sbjct: 470 AARFGLDVEAVSTLLNTGLEKLFQARKHRPKAHLDSKMLAAWNGLMVSGFAVTGAVL--- 526

Query: 564 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP------SKAP- 616
                      G D+   +  A + A F++RH++D  + RL+ +   G       S  P 
Sbjct: 527 -----------GMDK--LVTQATNGAKFLKRHMFDVASGRLKRTCYAGTGGSVEHSNPPC 573

Query: 617 -GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL 675
            GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD LF D  GGGYF +  E  S L
Sbjct: 574 WGFLEDYAFVVRGLLDLYEASQESSWLEWALRLQDTQDRLFWDSRGGGYFCSEAELGSDL 633

Query: 676 -LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV 734
            LR+K+D DGAEPS NSVS  NL+RL     G K   +       L  F  R++ + +A+
Sbjct: 634 PLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSERMRRVPVAL 690

Query: 735 PLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           P M  A       + K +V+ G     D + +L   H+ Y  NK +
Sbjct: 691 PEMVRALSA-QQETLKQIVICGDPQGKDTKALLQCVHSIYLPNKVL 735


>gi|226533705|ref|NP_001152785.1| spermatogenesis-associated protein 20 [Sus scrofa]
 gi|226354712|gb|ACO50965.1| spermatogenesis associated 20 [Sus scrofa]
          Length = 789

 Score =  572 bits (1474), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 314/725 (43%), Positives = 426/725 (58%), Gaps = 57/725 (7%)

Query: 80  PYKVVAMAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDV 139
           P        RT  S S +  K  NRL  E SPYLLQHA+NPVDW+ WG+EAF +ARK + 
Sbjct: 44  PMPAGGKGSRTNCSQS-APQKTPNRLINEKSPYLLQHAYNPVDWYPWGQEAFDKARKENK 102

Query: 140 PIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALY 199
           PIFLS+GYSTCHWCH+ME ESF++E + +LL++ FVS+KVDREERPDVDKVYMT+VQA  
Sbjct: 103 PIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPDVDKVYMTFVQATS 162

Query: 200 GGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFA 259
            GGGWP+SV+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W + +  L ++    
Sbjct: 163 SGGGWPMSVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWKQNKKTLLENS--- 219

Query: 260 IEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQ 316
            ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF  APKFP PV + 
Sbjct: 220 -QRVTTALLARSEISMGDRQLPPSAATMNSRCFQQLDEGYDEEYGGFAEAPKFPTPVILS 278

Query: 317 MML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPH 374
            +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +WHVPH
Sbjct: 279 FLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHVPH 333

Query: 375 FEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET 434
           FEKMLYDQ QL   Y  AF ++ D FYS + + IL Y+ R++    G  +SAEDADS   
Sbjct: 334 FEKMLYDQAQLTVAYSQAFQISGDEFYSDVAKGILQYVARNLSHRSGGFYSAEDADSPPG 393

Query: 435 EGATRKKEGAFYVWTSKEVEDILGEH----------AILFKEHYYLKPTGNCDLSRMSDP 484
            G  R KEGAFY+WT KEV+ +L EH            L  +HY L   GN  +S   DP
Sbjct: 394 RG-MRPKEGAFYLWTVKEVQQLLPEHVPGATEPLTSGQLLMKHYGLTEAGN--ISPSQDP 450

Query: 485 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 544
             E +G+NVL        +A++ G+  E    +L     KLF  R  RP+PHLD K++ +
Sbjct: 451 KGELQGQNVLTVRYSLELTAARFGLDAEAVQTLLNTGLEKLFQARKHRPKPHLDSKMLAA 510

Query: 545 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 604
           WNGL++S FA    +L  E    + N+ + G             A F++RH++D  + RL
Sbjct: 511 WNGLMVSGFAVTGAVLGQE---RLINYAING-------------AKFLKRHMFDVASGRL 554

Query: 605 QHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELF 656
             +   G       S  P  GFL+DY F++ GLLDLYE    + WL WA+ LQ+ QD LF
Sbjct: 555 MRTCYAGSGGTVEHSNPPCWGFLEDYTFVVRGLLDLYEASQESAWLEWALRLQDMQDRLF 614

Query: 657 LDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQN 715
            D  GGGYF +  E  + L LR+K+D DGAEPS N VS  NL+RL     G K   +   
Sbjct: 615 WDSRGGGYFCSEAELGAGLPLRLKDDQDGAEPSANFVSAHNLLRLHGFT-GHKD--WMDK 671

Query: 716 AEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYD 775
               L  F  R++ + +A+P M  A       + K +V+ G   + D + +L   H+ Y 
Sbjct: 672 CVCLLTAFSERMRRVPVALPEMVRALSAHQQ-TLKQIVICGDPQAKDTKALLQCVHSIYI 730

Query: 776 LNKTV 780
            NK +
Sbjct: 731 PNKVL 735


>gi|242004841|ref|XP_002423285.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212506287|gb|EEB10547.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 774

 Score =  572 bits (1474), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 309/707 (43%), Positives = 424/707 (59%), Gaps = 73/707 (10%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           NK +NRLA E SPYLLQH+ NPVDW+ WG EAF+ A K +  IFLS+GYSTCHWCHVME 
Sbjct: 62  NKVSNRLALEKSPYLLQHSTNPVDWYPWGNEAFSRAVKENKLIFLSVGYSTCHWCHVMEK 121

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFE+E +AK++N+ FV +KVDREERPDVDK+YM +VQ                   P+ 
Sbjct: 122 ESFENEEIAKIMNENFVCVKVDREERPDVDKLYMLFVQ-------------------PIF 162

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS-----ASASS 273
           GGTYFPP D + RPGFK++L  + + W + R   +++G   ++ + ++ S      + S+
Sbjct: 163 GGTYFPPSDFHERPGFKSVLLILAEQWRENRQKFSENGRKIMDYIEQSSSLDNSILNPSA 222

Query: 274 NKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM--LYHSKKLEDTGKS 331
              PD    + +  C   L KSY+  +GGF  APKFP  V +  +  LY  +   + GK+
Sbjct: 223 VNPPD---ISCIEKCYNSLFKSYEKNYGGFSEAPKFPHLVNLNFLFHLYAREPKSERGKT 279

Query: 332 GEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLD 391
             A     M + TL+ MA GGIHDH+G GF RYSVD +WHVPHFEKMLYDQGQLA  Y  
Sbjct: 280 ALA-----MCIHTLKMMANGGIHDHIGKGFSRYSVDNKWHVPHFEKMLYDQGQLAVSYAT 334

Query: 392 AFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSK 451
           A+  TK+ F+S +   IL Y+ RD+  P G  +SAEDADS     +T KKEGAFYVWT +
Sbjct: 335 AYLTTKNQFFSEVLEGILSYVDRDLSHPDGGFYSAEDADSLSAPDSTEKKEGAFYVWTYE 394

Query: 452 EVEDILGE---------HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 502
           +++  L +         +A +F E++ +K  GN + S+  DPHNE K +NVLI  +  +A
Sbjct: 395 DIKKHLPQKIPESSELTYADVFCEYFNVKANGNVNPSK--DPHNELKNQNVLIITDSEAA 452

Query: 503 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 562
            A+K  +  E+   IL E ++ LF++R+KRPRPHLDDK++ SWNGL+IS +A+A ++L +
Sbjct: 453 VAAKFNLSEERVKQILDESKKILFNLRAKRPRPHLDDKILTSWNGLMISGYAKAGQVLGN 512

Query: 563 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL--------QHSFRNGPSK 614
                             Y++ A  AA FIR+HLY   T  L         ++     + 
Sbjct: 513 S----------------HYVQRAIGAAKFIRQHLYKNDTKTLLRSCYKSSDNTISQIATP 556

Query: 615 APGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSV 674
             GFLDDYAFLI GLLDLYE      W+ WA  LQ TQD LF D  G GYF++   D S+
Sbjct: 557 INGFLDDYAFLIRGLLDLYEASFDPIWIEWAESLQETQDTLFWDEGGAGYFSSPSGDSSI 616

Query: 675 LLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV 734
           L+R+KEDHDGAEP GNSVSV NL+RL + +  ++   Y+  A   LA F +RLK M + +
Sbjct: 617 LVRMKEDHDGAEPCGNSVSVSNLLRLGAYLDKAE---YKDRAGKLLAAFTSRLKKMPVIL 673

Query: 735 PLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVS 781
           P M  A  +L       +++ G K+  D   +L    + +  N+ ++
Sbjct: 674 PEMVSAL-LLYHDGPTQILITGKKTDPDTAALLNVVQSRFIPNRILA 719


>gi|383859631|ref|XP_003705296.1| PREDICTED: spermatogenesis-associated protein 20 [Megachile
           rotundata]
          Length = 744

 Score =  572 bits (1473), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 312/707 (44%), Positives = 419/707 (59%), Gaps = 66/707 (9%)

Query: 93  STSHSRN---KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYST 149
           + S+S+N   + TNRLA E SPYLLQHA NPVDW+ W  EA  +A+K D  IFLS+GYST
Sbjct: 2   AASNSKNVKPQKTNRLALEKSPYLLQHATNPVDWYPWCTEALEKAKKEDKLIFLSVGYST 61

Query: 150 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 209
           CHWCHVME ESF ++ +A ++N  FV+IKVD  ERPD+DK+YM +VQA  G GGWP+SVF
Sbjct: 62  CHWCHVMEKESFTNKEIADIMNKHFVNIKVDNGERPDIDKIYMAFVQATTGHGGWPMSVF 121

Query: 210 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 269
           L+PDLKP+ GGTYFPPED + + GFKTIL  + D W+  +  + + G+   + L +    
Sbjct: 122 LTPDLKPVFGGTYFPPEDTFRQTGFKTILLNIADKWNSLKTKITEVGSANFKTLKDISKV 181

Query: 270 SASSNKLPDELPQ-NALRLCAEQLSKSYDSRFGGFGSA-----PKFPRPVEIQMM--LYH 321
             +S K   E+P      +CA QL+  ++  FGGF S+     PKFP+PV    +  +Y 
Sbjct: 182 PQTSKK--HEVPSLECSNVCALQLASEFEPEFGGFTSSFDMHTPKFPQPVIFNFLFHMYA 239

Query: 322 SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 381
               E+  KS        M ++TL+ +A GGIHDH+G GF RY+ D +WHVPHFEKMLYD
Sbjct: 240 RHPNEELAKS-----CLHMCVYTLKKIAFGGIHDHIGQGFSRYATDGKWHVPHFEKMLYD 294

Query: 382 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 441
           QGQL   Y DA+  TKD +++ I  DI  Y+ RD+    G  +SAEDADS  T  A  K 
Sbjct: 295 QGQLMKSYADAYVTTKDNYFAEIVDDIAAYVIRDLRHQEGGFYSAEDADSYATSDAHEKL 354

Query: 442 EGAFYVWTSKEVEDILGEH--------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 493
           EGAFYVWT+ E++ +L +         + +F  H+ +K +GN  +    DP  E  GKNV
Sbjct: 355 EGAFYVWTAAEIKSLLDKKVSSENIKLSDIFCHHFNVKESGN--VKGYQDPRGELTGKNV 412

Query: 494 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSF 553
           LI   D   +A      +E+  N L +    L++ R  RPRPHLDDK+I SWNGL+IS  
Sbjct: 413 LIVYEDIDDTAKHFNCTVEEIKNYLKDACSILYEARQARPRPHLDDKIITSWNGLMISGL 472

Query: 554 ARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS-FRNGP 612
           A    ++                D K+Y+E A  AA FI+R+L+DE    L HS +RN  
Sbjct: 473 AYGGAVV----------------DNKQYIEYATDAAKFIKRYLFDEAKDILLHSCYRNAE 516

Query: 613 SKAP-------GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 665
           +K         GFLDDYAF+I GLLDLYE G   +WL +A  LQ+ QD+L  D   GGYF
Sbjct: 517 NKITQINEPIHGFLDDYAFVIKGLLDLYEAGFDEQWLEFAERLQDIQDKLLWDETSGGYF 576

Query: 666 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 725
            TT +DPS+++R+KE HDGAEPSGNS+S  NL+RLA  +  S     +         F  
Sbjct: 577 TTTSDDPSIIVRLKEAHDGAEPSGNSISAENLLRLAYYLGRSD---LKDKVVRLFGAFRH 633

Query: 726 RLKDMAMAVPLMCCAADMLSVPSRKH-----VVLVGHKSSVDFENML 767
            L    +AVP       ++S   R H     + +VG + + D +++L
Sbjct: 634 LLTQRPIAVP------QLVSALVRYHDDATQIYVVGKRGAKDTDDLL 674


>gi|380028980|ref|XP_003698161.1| PREDICTED: spermatogenesis-associated protein 20 [Apis florea]
          Length = 746

 Score =  571 bits (1472), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 299/707 (42%), Positives = 425/707 (60%), Gaps = 53/707 (7%)

Query: 92  ASTSHSRNKH---TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYS 148
           A+TS+  N      N L  E SPYLLQHA NPVDW+ W +EA  +A+K D  IFLS+GYS
Sbjct: 2   ATTSNLENIQIAKNNHLNLEKSPYLLQHATNPVDWYPWCDEALEKAKKEDKCIFLSVGYS 61

Query: 149 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 208
           TCHWCHVME ESF+++ +A ++N  F++IKVD+EERPD+D++YMT+VQA  G GGWP+SV
Sbjct: 62  TCHWCHVMEKESFKNKEIAIIMNKNFINIKVDKEERPDIDRIYMTFVQATTGHGGWPMSV 121

Query: 209 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 268
           FL+PDLKP+ GGTYFPPED   + GFKTIL  +   W++ +  + ++G+  +E L + +S
Sbjct: 122 FLTPDLKPIFGGTYFPPEDTSRQTGFKTILLSIAQKWNQSKTKINEAGSTNLEIL-QNIS 180

Query: 269 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-----APKFPRPVEIQMMLYHSK 323
               ++KL D        +C +QL   ++ +FGGFGS     +PKFP+PV    + +   
Sbjct: 181 KIPHTSKLHDIPSLECSEICIQQLENEFEPKFGGFGSIYNMQSPKFPQPVNFNFLFHMYA 240

Query: 324 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 383
           +  +   +  A     M ++TL+ M+ GGIHDHVG GF RY+ D  WHVPHFEKMLYDQ 
Sbjct: 241 RQPN---ADLARLCLHMCVYTLKKMSYGGIHDHVGQGFSRYATDGEWHVPHFEKMLYDQA 297

Query: 384 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 443
           QL   Y DA+  TK+ +++ I  DI  Y+ RD+    G  +SAEDADS  T  A+ KKEG
Sbjct: 298 QLMKSYADAYLATKNNYFAEIVNDIATYVIRDLRHKEGGFYSAEDADSYPTYDASAKKEG 357

Query: 444 AFYVWTSKEVEDILGEHAIL-----------FKEHYYLKPTGNCDLSRMSDPHNEFKGKN 492
           AFY+WT+ E++ +L +  +L           F  H+ +K  GN  +    DPH E +GKN
Sbjct: 358 AFYIWTAIEIKSLLNKELLLSNEKHIKLSDIFCHHFNIKELGN--IKSYQDPHGELEGKN 415

Query: 493 VLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISS 552
           VLI  N+   +A    +P+E+    L E    L+  RS RPRPHLDDK+I +WNGL+IS 
Sbjct: 416 VLIMYNEIEETAKHFNLPVEEVKMHLMEACSILYKARSTRPRPHLDDKIITAWNGLMISG 475

Query: 553 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS----- 607
            A                F     + K+Y++ A  A  FI+R+L+D+  + L HS     
Sbjct: 476 LA----------------FGGTAVNNKQYVKYAVDAIKFIKRYLFDKTKNILLHSCYRDE 519

Query: 608 ---FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 664
                   +  PGFLDDYAF+I GLLDLYE     +WL +A +LQ+ QD+ F D   GGY
Sbjct: 520 KNIITQMSTPIPGFLDDYAFVIKGLLDLYESDLNEEWLEFAEKLQDLQDQFFWDETNGGY 579

Query: 665 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 724
           F+TT  DPS++LR+KE +DGAEPSGNS++  NL+RLA  +  S+   ++  A      F 
Sbjct: 580 FSTTSNDPSIILRLKEAYDGAEPSGNSIAAENLLRLADYLGRSE---FKDKAVRLFGTFR 636

Query: 725 TRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAH 771
             L    +++P +  A  +        + +VG +++ D +++L+  +
Sbjct: 637 HLLIKRPVSIPQLVSAL-IRYHDDATQIYVVGKRNAKDTDDLLSVIY 682


>gi|307213879|gb|EFN89140.1| Spermatogenesis-associated protein 20 [Harpegnathos saltator]
          Length = 755

 Score =  571 bits (1471), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 312/700 (44%), Positives = 422/700 (60%), Gaps = 55/700 (7%)

Query: 93  STSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHW 152
           +TSH   K  NRL+ E SPYLLQHA NPV+W+ WG+EA  +A+K D  IFLS+GYSTCHW
Sbjct: 11  NTSHFGAKKLNRLSLEKSPYLLQHATNPVEWYPWGDEALEQAKKEDKMIFLSVGYSTCHW 70

Query: 153 CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 212
           CHVME ESFE+E +A ++ND F++IKVDREERPD+D++YMT+VQA  G GGWP+SVFL+P
Sbjct: 71  CHVMEKESFENEEIAHIMNDNFINIKVDREERPDIDRIYMTFVQAKSGHGGWPMSVFLAP 130

Query: 213 DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASAS 272
           +L P+ GGTYFPP+D+YG  GFK++L +V   W ++++ + +SGA  + +L + +    S
Sbjct: 131 NLTPVTGGTYFPPDDRYGLIGFKSLLLEVAKKWAQQKNDIIKSGANIVSRLKDMVERRQS 190

Query: 273 SNKLPDELPQNALR-LCAEQLSKSYDSRFGGFGS-----APKFPRPVEIQMM--LYHSKK 324
             K  D  P      LC   L+  Y+ +FGGFGS     APKFP PV    +  +Y    
Sbjct: 191 L-KEGDGFPTVECGFLCVHLLANGYEPKFGGFGSQFRMNAPKFPEPVNFNFLFSVYALSN 249

Query: 325 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 384
           L +  K     E  +M L TL  MA GGIHDHVG GF RYSVD  WHVPHFEKMLYDQ Q
Sbjct: 250 LSELRK-----ECLEMCLHTLTKMAYGGIHDHVGQGFSRYSVDGEWHVPHFEKMLYDQAQ 304

Query: 385 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 444
           +   Y DA+ +TKD FYS I  DI  Y+ RD+    G  +SAEDADS     ++ K+EGA
Sbjct: 305 IIQAYADAYVITKDSFYSDIVDDIAKYVERDLRHKEGGFYSAEDADSLPESKSSAKREGA 364

Query: 445 FYVWTSKEVEDIL-----GEHAILFKE----HYYLKPTGNCDLSRMSDPHNEFKGKNVLI 495
           FYVWT  EV+ +L     G + + F +    H+ +K  GN  + +  DPH E  GKNVLI
Sbjct: 365 FYVWTYDEVKSLLNKKVPGRNNVRFFDLICYHFNVKKEGN--VRKAQDPHGELTGKNVLI 422

Query: 496 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 555
                  +A    + LE     + +    LF  RSKRPRPHLDDK++ +WNGL+IS FAR
Sbjct: 423 AYEAVEKTAEHFNISLEDTKTYIKQACLILFKERSKRPRPHLDDKMVTAWNGLMISGFAR 482

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSF------R 609
           A   +++                 +Y+E+A  AA F+ ++L+D+    L  S       R
Sbjct: 483 AGAAVRN----------------SKYVELATDAAKFVEQYLFDKNKGTLLRSCYREEDDR 526

Query: 610 NGPSKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 667
              +  P  GF DDYAF++ GLLDLY+      WL  A +LQ+TQDELF D + GGYF+T
Sbjct: 527 IIQTSVPIYGFHDDYAFVVKGLLDLYQANFDVHWLELAEQLQDTQDELFWDSQDGGYFST 586

Query: 668 TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 727
             ED  ++LR+K+ HDGAEPS NS++  NL+RLA+ +  ++    ++ A   L  F   L
Sbjct: 587 V-EDSQMILRMKDAHDGAEPSSNSIACSNLLRLAAFLDRNE---LKEKAAQLLRAFGKGL 642

Query: 728 KDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENML 767
            ++ +  P M  A  +L       + ++G   + D   ML
Sbjct: 643 TEIPIMFPQMTLA--LLDYHYTTQIYIIGKSDAEDTNEML 680


>gi|194217119|ref|XP_001499729.2| PREDICTED: spermatogenesis-associated protein 20-like [Equus
           caballus]
          Length = 889

 Score =  569 bits (1467), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 312/725 (43%), Positives = 429/725 (59%), Gaps = 57/725 (7%)

Query: 80  PYKVVAMAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDV 139
           P        RT  S + +  K  NRL  E SPYL QHA+NPVDW+ WG+EAF +ARK + 
Sbjct: 144 PMPAGGKGSRTNCSQA-TPQKVPNRLINEKSPYLQQHAYNPVDWYPWGQEAFDKARKENK 202

Query: 140 PIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALY 199
           PIFLS+GYSTCHWCH+ME ESF++E + +LLN+ FVS+KVDREERPDVDKVYMT+VQA  
Sbjct: 203 PIFLSVGYSTCHWCHMMEEESFQNEEIGRLLNEDFVSVKVDREERPDVDKVYMTFVQATS 262

Query: 200 GGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFA 259
            GGGWP++V+L+P+L+P +GGTYFPPED   R GF T+L+++++ W + ++ L ++    
Sbjct: 263 SGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFHTVLQRIREQWKQNKNTLLENS--- 319

Query: 260 IEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQ 316
            ++++ AL A +  +    +LP +A  +   C +QL + YD  +GGF  APKFP PV + 
Sbjct: 320 -QRVTTALLARSEISMGDRQLPPSAATMNSRCFQQLDEGYDEEYGGFAEAPKFPTPVILS 378

Query: 317 MML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPH 374
            +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +WHVPH
Sbjct: 379 FLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHVPH 433

Query: 375 FEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET 434
           FEKMLYDQ QLA  Y  AF ++ D FYS + + IL Y+ R++    G  +SAEDADS   
Sbjct: 434 FEKMLYDQAQLAVAYSQAFQISGDEFYSDVAKGILQYVTRNLSHRSGGFYSAEDADSPPE 493

Query: 435 EGATRKKEGAFYVWTSKEVEDILGE----------HAILFKEHYYLKPTGNCDLSRMSDP 484
            G  R KEGAFYVWT KEV+ +L E             L  +HY L   GN  +S   DP
Sbjct: 494 RG-MRPKEGAFYVWTVKEVQQLLPEPVPGATEPLTSGQLLMKHYGLTEAGN--ISSNQDP 550

Query: 485 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 544
             E  G+NVL        +A++ G+ ++    +L     KLF  R  RP+PHLD K++ +
Sbjct: 551 KGELHGQNVLTVRYSLELTAARFGLDVDAVRTLLNTGLEKLFQARKHRPKPHLDSKMLAA 610

Query: 545 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 604
           WNGL++S +A    +L  E    + N+ +             + A F++RH++D  + RL
Sbjct: 611 WNGLMVSGYAVTGAVLGLE---RLINYAI-------------NCAKFLKRHMFDVASGRL 654

Query: 605 QHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELF 656
             +   G       S  P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD LF
Sbjct: 655 MRTCYAGSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEATQESAWLEWALRLQDTQDRLF 714

Query: 657 LDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQN 715
            D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL     G K   +   
Sbjct: 715 WDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDK 771

Query: 716 AEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYD 775
               L  F  R++ + +A+P M  A       + K +V+ G   +   + +L   H+ Y 
Sbjct: 772 CVCLLTAFSERMRRVPVALPEMVRALSAHQQ-TLKQIVICGDPQAKGTKALLQCVHSIYI 830

Query: 776 LNKTV 780
            NK +
Sbjct: 831 PNKVL 835


>gi|350406875|ref|XP_003487911.1| PREDICTED: spermatogenesis-associated protein 20-like [Bombus
           impatiens]
          Length = 831

 Score =  568 bits (1465), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 306/705 (43%), Positives = 421/705 (59%), Gaps = 51/705 (7%)

Query: 92  ASTSHSRN---KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYS 148
           AST++S N   +  NRL+ E SPYLLQHA NPVDW+ W +EA  +A K +  IFLS+GYS
Sbjct: 87  ASTNNSGNMPIQKKNRLSLEKSPYLLQHATNPVDWYPWCDEALEKASKENKCIFLSVGYS 146

Query: 149 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 208
           TCHWCHVME ESF ++ +A+++N  F++IKVD+EERPD+D++YMT++QA  G GGWP+SV
Sbjct: 147 TCHWCHVMEKESFTNKEIAEIMNKNFINIKVDKEERPDIDRIYMTFIQATSGHGGWPMSV 206

Query: 209 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 268
           FL+ DLKP++GGTYFPPED + + GFKTIL  V   W++ R  L + G+  +E L  ++S
Sbjct: 207 FLTTDLKPIVGGTYFPPEDTFRQTGFKTILLSVAQKWNQSRSKLTEIGSTNLETL-HSIS 265

Query: 269 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-----APKFPRPVEIQMMLYHSK 323
               S K+ D       ++C +QL   ++ +FGGFGS     +PKFP+PV     L+H  
Sbjct: 266 KIPDSLKVHDIPSLECSKICIQQLVNEFEPKFGGFGSTYNMQSPKFPQPVNFN-FLFHMY 324

Query: 324 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 383
             +   +S        M ++TL+ M+ GGIHDHVG GF RY+ D  WHVPHFEKMLYDQG
Sbjct: 325 ARQPNVES--VRPCLYMSVYTLKRMSFGGIHDHVGQGFSRYATDGEWHVPHFEKMLYDQG 382

Query: 384 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 443
           QL   Y DA+ +TKD +++ I  DI  Y+ RD+    G  +SAEDADS        KKEG
Sbjct: 383 QLMKSYADAYLVTKDNYFAEIVDDIATYVIRDLRHKEGGFYSAEDADSYPMHDTHAKKEG 442

Query: 444 AFYVWTSKEVEDILGEH---------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL 494
           AFYVW++ E++ +L +          + +F  H+ +  +GN  +    DPH E   KNVL
Sbjct: 443 AFYVWSAMEIKSLLNKEVSDENHVKLSDIFCRHFNVNESGN--VKSHQDPHGEMGQKNVL 500

Query: 495 IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFA 554
           I  N+   +A    +P+E+    L E    L+ VRS RPRPHLDDK+I SWNGL+IS  A
Sbjct: 501 IAYNEIEETARYFNLPIEETKMYLKEACSMLYKVRSARPRPHLDDKIITSWNGLMISGLA 560

Query: 555 RASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS------- 607
                           F     + K+Y+E A  AA FI+ +L+DE  + L HS       
Sbjct: 561 ----------------FGGAAVNNKQYIEHAADAAKFIKEYLFDETKNILLHSCYRDEKG 604

Query: 608 -FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 666
                 +  PGFLDDYAF+I GLLDLYE     +WL +A +LQ+ QD+ F D   GGYF 
Sbjct: 605 TITQMSTPIPGFLDDYAFVIKGLLDLYESDLNEEWLEFAEKLQHLQDQYFWDETNGGYFL 664

Query: 667 TTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 726
           TT  DPS++LR+KE +DGAEPSGNS++  NL+RLA  +     D ++  A      F   
Sbjct: 665 TTSSDPSIILRLKEVYDGAEPSGNSIAAENLLRLADYLG---CDEFKDKAARLFGAFRYL 721

Query: 727 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAH 771
           L    +AVP +  A  +        + +VG + + D + +L   +
Sbjct: 722 LMQRPVAVPQLTSAL-VRYHDDAAQIYVVGKRGAKDTDELLRVIY 765


>gi|110598780|ref|ZP_01387040.1| Protein of unknown function DUF255 [Chlorobium ferrooxidans DSM
           13031]
 gi|110339607|gb|EAT58122.1| Protein of unknown function DUF255 [Chlorobium ferrooxidans DSM
           13031]
          Length = 712

 Score =  568 bits (1465), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 303/688 (44%), Positives = 415/688 (60%), Gaps = 53/688 (7%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           +  NRL  E SPYLLQHAHNPVDW+AWGEEAF +A + + PIFLS+GYSTCHWCHVME E
Sbjct: 6   RKPNRLIREKSPYLLQHAHNPVDWYAWGEEAFEKAERENRPIFLSVGYSTCHWCHVMERE 65

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFE+  +A++LN +FV +KVDREE PD+D++YM YVQ+  G GGWP+SV+L+PD  P  G
Sbjct: 66  SFENPDIAEVLNRYFVPVKVDREELPDLDRLYMEYVQSTTGRGGWPMSVWLTPDRNPFYG 125

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDML--AQSGAFAIEQLSEALSASASSNKLP 277
           G+YFPPED+YG  GFKTIL  +   W+   + +  A SG F+  Q      A++ +  LP
Sbjct: 126 GSYFPPEDRYGMTGFKTILLSIASLWESDEEKIRDASSGFFSDLQ----AFAASRAAALP 181

Query: 278 DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEG 337
            E    A   C   L  ++D  +GGF  APKFPRPV +  +  H+        SG  S+ 
Sbjct: 182 PE--DEAQHNCFRWLESTFDPVYGGFSGAPKFPRPVLLNFLFSHAY------YSGN-SKA 232

Query: 338 QKMVLFTLQCMAKGGIHDHV------GGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLD 391
           ++M LFTL+ MA+GGIHDH+      GGGF RYS DERWHVPHFEKMLYD  QLA  YL+
Sbjct: 233 REMALFTLRRMAEGGIHDHISVTGKGGGGFARYSTDERWHVPHFEKMLYDNAQLAVSYLE 292

Query: 392 AFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSK 451
           AF  + +  +  +  DI +Y+  DM  P G  +SAEDADS E+E  T KKEGAFY+W + 
Sbjct: 293 AFQCSGEPLFRSVAEDIFNYVLSDMTAPEGGFYSAEDADSLESESGTEKKEGAFYLWRAD 352

Query: 452 EVEDILG--EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 509
           E+ + +G  E A +F   Y ++  GN     ++DPH EF G+N+L++      +A + G 
Sbjct: 353 ELHEAIGNAEQAAIFSFVYGVRAEGNA----LNDPHGEFTGRNILMQQVSVEETAVRFGK 408

Query: 510 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 569
              +  ++L E RRKL+  RS RPRP LDDK++ SWN L+IS+ ++  ++L SE      
Sbjct: 409 TAVEIRDVLDEARRKLYTARSGRPRPFLDDKILTSWNALMISALSKGFRVLHSE------ 462

Query: 570 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 629
                     E +  A  AA F+   LYD ++ RL   +R+G +   G +DDYAF +  L
Sbjct: 463 ----------ECLTAARKAADFLLETLYDRRSCRLLRRYRDGSAAIAGKVDDYAFFVQAL 512

Query: 630 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 689
           +DLYE      +L  A+EL   Q  LF D   GGYF++  +D +V +R KE +DGAEPS 
Sbjct: 513 IDLYEASFEIVYLKAALELAEVQKTLFCDALHGGYFSSASDDQTVPVRQKESYDGAEPSA 572

Query: 690 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR 749
           NSV+ +NL+RL  +    K ++  Q AE   + F T L   + A+P M  A +     +R
Sbjct: 573 NSVTALNLLRLGELTG--KEEFALQ-AEELFSAFGTTLASQSHALPQMLVALNF----AR 625

Query: 750 K---HVVLVGHKSSVDFENMLAAAHASY 774
           K    ++  G   + + E + A A   Y
Sbjct: 626 KRGCRILFSGDLHATEMERLRAVAGERY 653


>gi|223935696|ref|ZP_03627612.1| protein of unknown function DUF255 [bacterium Ellin514]
 gi|223895704|gb|EEF62149.1| protein of unknown function DUF255 [bacterium Ellin514]
          Length = 701

 Score =  568 bits (1465), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 314/690 (45%), Positives = 420/690 (60%), Gaps = 53/690 (7%)

Query: 93  STSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHW 152
           +T  + + HTNRLA E SPYLLQH +NPVDW+ WGEEAFA+ARK + PIFLSIGYSTCHW
Sbjct: 18  TTKSAVHTHTNRLAREKSPYLLQHQYNPVDWYGWGEEAFAKARKENKPIFLSIGYSTCHW 77

Query: 153 CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 212
           CHVME ESFE E + K LN+ FVSIKVDREERPDVDK+YMT+VQ+  G GGWPL+ FL+P
Sbjct: 78  CHVMERESFEKEEIGKYLNEHFVSIKVDREERPDVDKIYMTFVQSTSGQGGWPLNCFLTP 137

Query: 213 DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASAS 272
           DLKP  GGTYFPPE KYGRP F  +L+ +   W+ +   +  S     EQL++ ++A  +
Sbjct: 138 DLKPFYGGTYFPPESKYGRPSFLDLLKHINQLWETRHGDVTNSAVQLHEQLAQ-MTAKET 196

Query: 273 SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSG 332
           +N L   L Q  L   A QL + YDSR GGFG APKFP+P +   +L +       G   
Sbjct: 197 TNGL--ALTQAVLNKAAGQLKEMYDSRNGGFGDAPKFPQPSQPAFLLRY-------GVHS 247

Query: 333 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 392
              E   MVL T   MA+GGIHD +GGGF RY+VD +W VPHFEKMLYD  QL N+YLDA
Sbjct: 248 NDQEAIAMVLNTCDHMARGGIHDQIGGGFARYAVDAKWLVPHFEKMLYDNAQLVNLYLDA 307

Query: 393 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 452
           + ++ +  Y+   RD++ Y+ RDM    G  +SAEDADS   EG    KEG FY WT  E
Sbjct: 308 YLVSGETRYADTARDVIGYVLRDMTHAEGGFYSAEDADS---EG----KEGKFYCWTRVE 360

Query: 453 VEDILG--EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 510
           +  +L   E  +  K   Y   T   +    SDP      +NVL  ++ +   A +   P
Sbjct: 361 LAKLLTPEEFNVAVK---YFGITEGGNFVDHSDPE-PLPNQNVLSIVDSNLPRADE---P 413

Query: 511 LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
           L      L   ++K+F  RSKR RPHLDDK++ SWNGL++S+ ARA  +L          
Sbjct: 414 L------LQSAKQKMFAARSKRVRPHLDDKILASWNGLMLSAIARAYAVLGD-------- 459

Query: 571 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLL 630
                   KEY+  AE   SF++  L+D +T  L H +R+G        + YAFL++G++
Sbjct: 460 --------KEYLTAAEHNLSFLQSKLWDAKTKTLYHRWRDGERDTAQLHETYAFLLNGVV 511

Query: 631 DLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGN 690
           DLYE     + L +AI L +     F D   GG++ + G  P ++LR+KED+DGAEPSGN
Sbjct: 512 DLYEATLDPRHLEFAISLADAMIAKFYDPAEGGFWQSAGA-PDLILRIKEDYDGAEPSGN 570

Query: 691 SVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRK 750
           SV+ + L++LA+I    ++D YR+ AE ++ +F  RL+    AVP M  A D  S+   K
Sbjct: 571 SVATLTLLKLAAIT--DRAD-YRKAAEGTMRLFADRLQRFPQAVPYMLMAVD-FSLQEPK 626

Query: 751 HVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
            VV+ G+++  + + +L AAH+ Y   K V
Sbjct: 627 RVVIAGNRAEPEAQKLLRAAHSVYQPAKVV 656


>gi|350590464|ref|XP_003483066.1| PREDICTED: spermatogenesis-associated protein 20-like [Sus scrofa]
          Length = 749

 Score =  568 bits (1464), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 306/698 (43%), Positives = 418/698 (59%), Gaps = 56/698 (8%)

Query: 107 AEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFEDEGV 166
           A   PYLLQHA+NPVDW+ WG+EAF +ARK + PIFLS+GYSTCHWCH+ME ESF++E +
Sbjct: 30  AREVPYLLQHAYNPVDWYPWGQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEI 89

Query: 167 AKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPE 226
            +LL++ FVS+KVDREERPDVDKVYMT+VQA   GGGWP+SV+L+P+L+P +GGTYFPPE
Sbjct: 90  GRLLSEDFVSVKVDREERPDVDKVYMTFVQATSSGGGWPMSVWLTPNLQPFVGGTYFPPE 149

Query: 227 DKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALR 286
           D   R GF+T+L ++++ W + +  L ++     ++++ AL A +  +    +LP +A  
Sbjct: 150 DGLTRVGFRTVLLRIREQWKQNKKTLLENS----QRVTTALLARSEISMGDRQLPPSAAT 205

Query: 287 L---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMV 341
           +   C +QL + YD  +GGF  APKFP PV +  +   + S +L   G     S  Q+M 
Sbjct: 206 MNSRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRAQQMA 260

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
           L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLYDQ QL   Y  AF ++ D FY
Sbjct: 261 LHTLKMMANGGIRDHVGQGFHRYSTDRQWHVPHFEKMLYDQAQLTVAYSQAFQISGDEFY 320

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH- 460
           S + + IL Y+ R++    G  +SAEDADS    G  R KEGAFY+WT KEV+ +L EH 
Sbjct: 321 SDVAKGILQYVARNLSHRSGGFYSAEDADSPPERG-MRPKEGAFYLWTVKEVQQLLPEHV 379

Query: 461 ---------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPL 511
                      L  +HY L   GN  +S   DP  E +G+NVL        +A++ G+ +
Sbjct: 380 PGATEPLTSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLTVRYSLELTAARFGLDV 437

Query: 512 EKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNF 571
           E    +L     KLF  R  RP+PHLD K++ +WNGL++S FA    +L  E    + N+
Sbjct: 438 EAVQTLLNTGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGFAVTGAVLGQE---RLINY 494

Query: 572 PVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP------SKAP--GFLDDYA 623
            + G             A F++RH++D  + RL  +   G       S  P  GFL+DY 
Sbjct: 495 AING-------------AKFLKRHMFDVASGRLMRTCYAGSGGTVEHSNPPCWGFLEDYT 541

Query: 624 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDH 682
           F++ GLLDLYE    + WL WA+ LQ+TQD LF D  GGGYF +  E  + L LR+K+D 
Sbjct: 542 FVVRGLLDLYEASQESAWLEWALRLQDTQDRLFWDSRGGGYFCSEAELGAGLPLRLKDDQ 601

Query: 683 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 742
           DGAEPS NSVS  NL+RL     G K   +       L  F  R++ + +A+P M  A  
Sbjct: 602 DGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLTAFSERMRRVPVALPEMVRALS 658

Query: 743 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
                + K +V+ G   + D + +L   H+ Y  NK +
Sbjct: 659 A-HQQTLKQIVICGDPQAKDTKALLQCVHSIYIPNKVL 695


>gi|426237729|ref|XP_004012810.1| PREDICTED: LOW QUALITY PROTEIN: spermatogenesis-associated protein
           20 [Ovis aries]
          Length = 795

 Score =  567 bits (1462), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 313/719 (43%), Positives = 422/719 (58%), Gaps = 59/719 (8%)

Query: 89  RTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYS 148
           RT  S S +  K  NRL  E SPYLLQHA+NPVDW+ WG+EAF +A+K + PIFLS+GYS
Sbjct: 55  RTNCSQS-TPPKVPNRLINEKSPYLLQHAYNPVDWYPWGQEAFDKAKKENKPIFLSVGYS 113

Query: 149 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 208
           TCHWCH+ME ESF++E + +LL++ FVS+KVDREERPDVDKVYMT+VQA   GGGWP+SV
Sbjct: 114 TCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPDVDKVYMTFVQATSSGGGWPMSV 173

Query: 209 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 268
           +L+P+L+P +GGTYFPPED   R GF+T+L +++D W + +  L ++       L  A S
Sbjct: 174 WLTPNLQPFVGGTYFPPEDGLTRVGFRTVLMRIRDQWKQNKSTLLENSQRVTTALL-ARS 232

Query: 269 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLE 326
           A +  ++     P+ +   C +QL + YD  +GGF  APKFP PV +  +   + S +L 
Sbjct: 233 AISMGDRQXSAAPRPS--RCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRLT 290

Query: 327 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 386
             G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLYDQ QL 
Sbjct: 291 QDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHVPHFEKMLYDQAQLT 345

Query: 387 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 446
             Y  AF ++ D FYS + + IL Y+ R++    G  +SAEDADS    G  R KEGAFY
Sbjct: 346 VAYSQAFQISGDEFYSEVAKGILQYVARNLSHRSGGFYSAEDADSPPERG-MRPKEGAFY 404

Query: 447 VWTSKEVEDILGEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 496
           VWT KEV+ +L E  +          L  +HY L   GN  +S   DP  E +G+NVL  
Sbjct: 405 VWTVKEVQHLLPEPVLGATEPLTSGQLLMKHYGLTEAGN--ISPSQDPKGELQGQNVLTV 462

Query: 497 LNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARA 556
                 +A++ G+ +E    +L     KLF  R  RP+PHLD K++ +WNGL++S FA  
Sbjct: 463 RYSLELTAARFGLDVEAVRTLLNSGLEKLFQARKHRPKPHLDSKMLAAWNGLMVSGFAVT 522

Query: 557 SKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP---- 612
             +L  E                  +  A + A F++RH++D  + RL  +   G     
Sbjct: 523 GAVLGQE----------------RVVSYAINGAKFLKRHMFDVASGRLMRTCYAGAGGTV 566

Query: 613 --SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTT 668
             S  P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD LF D  GGGYF + 
Sbjct: 567 EHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLEWALRLQDTQDRLFWDSRGGGYFCSE 626

Query: 669 GEDPSVL-------LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 721
            E  + L       LR+++D DGAEPS NSVS  NL+RL     G K   +       L 
Sbjct: 627 AELGAGLPWGGGLPLRLEDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLT 683

Query: 722 VFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
            F  R++ + +A+P M  A       + K +V+ G   + D + +L   H+ Y  NK +
Sbjct: 684 AFSERMRRVPVALPEMVRALSA-HQQTLKQIVICGDPQAKDTKALLQCVHSIYIPNKVL 741


>gi|328781619|ref|XP_393124.4| PREDICTED: spermatogenesis-associated protein 20 [Apis mellifera]
          Length = 804

 Score =  567 bits (1461), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 296/692 (42%), Positives = 418/692 (60%), Gaps = 48/692 (6%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           +N L  E SPYLLQHA NPVDW+ W +EA  +A+K D  IFLS+GYSTCHWCH+ME ESF
Sbjct: 74  SNHLNLEKSPYLLQHATNPVDWYPWCDEALEKAKKEDKCIFLSVGYSTCHWCHIMEKESF 133

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           +++ +A ++N  F++IKVD+EERPD+D++YMT+VQA  G GGWP+SVFL+PDLKP+ GGT
Sbjct: 134 KNKEIAIIMNKNFINIKVDKEERPDIDRIYMTFVQATTGHGGWPMSVFLTPDLKPIFGGT 193

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFPPED   + GFKTIL  +   W++ +  + ++G+  +E L + +S    ++KL D   
Sbjct: 194 YFPPEDTSRQTGFKTILLSIAQKWNQSKTKINEAGSTNLEIL-QNISKIPHTSKLHDIPS 252

Query: 282 QNALRLCAEQLSKSYDSRFGGFGS-----APKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
               ++C +QL   ++ +FGGFGS     +PKFP+PV     L+H    +  G    A  
Sbjct: 253 LECSKICIQQLENEFEPKFGGFGSTYNMQSPKFPQPVNFN-FLFHMYARQPNGDL--ARL 309

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
              M ++TL+ M+ GGIHDHVG GF RY+ D  WHVPHFEKMLYDQ QL   Y DA+  T
Sbjct: 310 CLHMCVYTLKKMSYGGIHDHVGQGFSRYATDGEWHVPHFEKMLYDQAQLMKSYADAYLAT 369

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
           K+ +++ I  DI  Y+ RD+    G  +SAEDADS  T  A+ KKEGAFYVWT+ E++ +
Sbjct: 370 KNNYFAEIVNDIATYVIRDLRHKEGGFYSAEDADSYPTYDASAKKEGAFYVWTAMEIKSL 429

Query: 457 LGEH---------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 507
           L +          + +F  H+ +K  GN  +    DPH E +GKNVLI  N+   +A   
Sbjct: 430 LNKELSDEKHIKLSDVFCHHFNIKELGN--IKSYQDPHGELEGKNVLIMYNEIEETAKHF 487

Query: 508 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 567
            +P+E+    L E    L+  RS RPRPHLDDK+I +WNGL+IS  A             
Sbjct: 488 NLPVEEMKMHLMEACSILYKARSTRPRPHLDDKIITAWNGLMISGLA------------- 534

Query: 568 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS--------FRNGPSKAPGFL 619
              F     + K+Y+E A  A  FI+R+L+D+  + L HS             +  PGFL
Sbjct: 535 ---FGGTAVNNKQYIEYAVDAIKFIKRYLFDKTKNILLHSCYRDEKNIITQMSTPIPGFL 591

Query: 620 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK 679
           DDYAF+I GLLDLYE     +WL +A +LQ+ QD+ F D    GYF+TT  D S++LR+K
Sbjct: 592 DDYAFVIKGLLDLYESDLNEEWLEFAEKLQDLQDQFFWDETNAGYFSTTSNDLSIILRLK 651

Query: 680 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 739
           E +DGAEPSGNS++  NL+RLA  +  S+    +  A      F   L    +++P +  
Sbjct: 652 EAYDGAEPSGNSIAAENLLRLADYLGRSE---LKDKAVRLFGTFRHLLIKRPVSIPQLVS 708

Query: 740 AADMLSVPSRKHVVLVGHKSSVDFENMLAAAH 771
           A  +        + +VG +++ D +++L+  +
Sbjct: 709 AL-IRYHDDTTQIYVVGKRNAKDTDDLLSVIY 739


>gi|351713578|gb|EHB16497.1| Spermatogenesis-associated protein 20, partial [Heterocephalus
           glaber]
          Length = 806

 Score =  567 bits (1461), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 309/731 (42%), Positives = 430/731 (58%), Gaps = 68/731 (9%)

Query: 84  VAMAERTPASTSHSRN--------KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEAR 135
           V+ +E  PA    SR         K  NRL  E SPYLLQHA+NPVDW+ WG+EAF +AR
Sbjct: 58  VSSSETMPAGGKGSRTSGATNTAQKVPNRLIDEKSPYLLQHAYNPVDWYPWGQEAFGKAR 117

Query: 136 KRDVPIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYV 195
           K + PIFLS+GYSTCHWCH+ME E+F++E + +LL++ FVS+KVDREE+PDVDKVYMT+V
Sbjct: 118 KENKPIFLSVGYSTCHWCHMMEEETFQNEEIGRLLSEDFVSVKVDREEQPDVDKVYMTFV 177

Query: 196 QALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQS 255
           QA   GGGWP++V+L+P L+P +GGTYFPPED   R GF+T+L +++D W + +  L +S
Sbjct: 178 QATSSGGGWPMNVWLTPSLQPFVGGTYFPPEDGLTRVGFRTVLLRIRDQWKQNKSTLLES 237

Query: 256 GAFAIEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRP 312
                ++++ AL A +  +    + P  A  +   C +QL + YD  +GGF  APKFP P
Sbjct: 238 S----QRVTTALLARSEISMGDRQAPPLAATMNSRCFQQLDEGYDEEYGGFAEAPKFPIP 293

Query: 313 VEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERW 370
           V +  +   +   +L   G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +W
Sbjct: 294 VILSFLFSYWLGHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQW 348

Query: 371 HVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDAD 430
             PHFEKMLYDQ QLA  Y  AF ++ D FYS I + IL Y+ R +    G  +SAED+D
Sbjct: 349 QGPHFEKMLYDQAQLAVSYSQAFQISGDEFYSDIAKGILQYVDRSLSHRSGGFYSAEDSD 408

Query: 431 SAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLKPTGNCDLSR 480
           SA   G  + +EGAFY+WT +E++ +L E  +          L  +HY L   GN  L +
Sbjct: 409 SAPERG-MQPREGAFYMWTVRELQCLLPEPVVGASEPLTVGQLLTKHYGLTEAGNVSLCQ 467

Query: 481 MSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDK 540
             DP  E +G+NVL        +A++ G+ +E    +L     KLF VR +RP+PHLD K
Sbjct: 468 --DPKGELQGQNVLTVRYSLELTAARFGLDVEAVRGLLTSGLDKLFQVRKQRPKPHLDSK 525

Query: 541 VIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQ 600
           ++ +WNGL++S +A    +L  E                  +  A ++A F++RH++D  
Sbjct: 526 MLTAWNGLMVSGYAVTGAVLGIE----------------RLVNRATNSAKFLKRHMFDVA 569

Query: 601 THRLQHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQ 652
           T RL+ +   G       S  P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQ
Sbjct: 570 TGRLKRTCYAGTGASVEHSTPPRWGFLEDYAFVVRGLLDLYEASQESAWLEWALRLQDTQ 629

Query: 653 DELFLDREGGGYFNTTGE-DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDY 711
           D LF D  GGGYF +  E  P + LRVK+D DGAEPS NSV+  NL+RL      ++   
Sbjct: 630 DRLFWDSRGGGYFCSEAELGPGLPLRVKDDQDGAEPSANSVAAHNLLRLHGF---TRHKD 686

Query: 712 YRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR--KHVVLVGHKSSVDFENMLAA 769
           +       L  F  R++ + +A+P M      LS   +  K +V+ G   + D + +L  
Sbjct: 687 WLDKCVCLLTAFSERMRRVPVALPEM---VRTLSTHQQGLKQIVICGDAQAKDTKALLQC 743

Query: 770 AHASYDLNKTV 780
            H+ Y  NK +
Sbjct: 744 VHSLYIPNKVL 754


>gi|148683975|gb|EDL15922.1| spermatogenesis associated 20, isoform CRA_a [Mus musculus]
          Length = 745

 Score =  567 bits (1460), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 306/707 (43%), Positives = 424/707 (59%), Gaps = 60/707 (8%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K  NRL  E SPYLLQHA+NPVDW+ WG+EAF +A+K + PIFLS+GYSTCHWCH+ME E
Sbjct: 19  KTANRLINEKSPYLLQHAYNPVDWYPWGQEAFDKAKKENKPIFLSVGYSTCHWCHMMEEE 78

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SF++E + +LLN+ F+ + VDREERPDVDKVYMT+VQA   GGGWP++V+L+P L+P +G
Sbjct: 79  SFQNEEIGRLLNENFICVMVDREERPDVDKVYMTFVQATSSGGGWPMNVWLTPGLQPFVG 138

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           GTYFPPED   R GF+T+L ++ D W   ++ L ++     ++++ AL A +  +    +
Sbjct: 139 GTYFPPEDGLTRVGFRTVLMRICDQWKLNKNTLLENS----QRVTTALLARSEISVGDRQ 194

Query: 280 LPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEA 334
           +P +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S +L   G     
Sbjct: 195 IPASAATMNSRCFQQLDEGYDEEYGGFAEAPKFPTPVILNFLFSYWLSHRLTQDG----- 249

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
           S  Q+M L TL+ MA GGI DHVG GFHRYS D +WH+PHFEKMLYDQ QL+ VY  AF 
Sbjct: 250 SRAQQMALHTLKMMANGGIQDHVGQGFHRYSTDRQWHIPHFEKMLYDQAQLSVVYTQAFQ 309

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
           ++ D FY+ + + IL Y+ R +    G  +SAEDADS    G  + +EGA+YVWT KEV+
Sbjct: 310 ISGDEFYADVAKGILQYVTRTLSHRSGGFYSAEDADSPPERG-MKPQEGAYYVWTVKEVQ 368

Query: 455 DILGEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 504
            +L E  +          L  +HY L   GN + S+  DP+ E  G+NVL+       +A
Sbjct: 369 QLLPEPVVGASEPLTSGQLLMKHYGLSEVGNINSSQ--DPNGELHGQNVLMVRYSLELTA 426

Query: 505 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 564
           ++ G+ +E    +L     KLF  R  RP+ HLD+K++ +WNGL++S FA     L  E 
Sbjct: 427 ARYGLEVEAVRALLNTGLEKLFQARKHRPKAHLDNKMLAAWNGLMVSGFAVTGAALGMEK 486

Query: 565 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP------SKAP-- 616
             A                 A S A F++RH++D  + RL+ +   G       S  P  
Sbjct: 487 LVAQ----------------ATSGAKFLKRHMFDVSSGRLKRTCYAGTGGTVEQSNPPCW 530

Query: 617 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL- 675
           GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD+LF D  GGGYF +  E  + L 
Sbjct: 531 GFLEDYAFVVRGLLDLYEASQESSWLEWALRLQDTQDKLFWDPRGGGYFCSEAELGADLP 590

Query: 676 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 735
           LR+K+D DGAEPS NSVS  NL+RL S   G K   +       L  F  R++ + +A+P
Sbjct: 591 LRLKDDQDGAEPSANSVSAHNLLRLHSFT-GHKD--WMDKCVCLLTAFSERMRRVPVALP 647

Query: 736 LMCCAADMLSVPSR--KHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
            M      LS   +  K +V+ G   + D + +L   H+ Y  NK +
Sbjct: 648 EM---VRTLSAQQQTLKQIVICGDPQAKDTKALLQCVHSIYVPNKVL 691


>gi|194336238|ref|YP_002018032.1| hypothetical protein Ppha_1140 [Pelodictyon phaeoclathratiforme
           BU-1]
 gi|194308715|gb|ACF43415.1| protein of unknown function DUF255 [Pelodictyon phaeoclathratiforme
           BU-1]
          Length = 737

 Score =  566 bits (1458), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 305/693 (44%), Positives = 415/693 (59%), Gaps = 49/693 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L AE SPYLLQHA NPV W AWGEEAF +AR  + PIFLS+GYSTCHWCHVME ESFE
Sbjct: 25  NSLIAEKSPYLLQHALNPVAWLAWGEEAFKKARGENKPIFLSVGYSTCHWCHVMEDESFE 84

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +  +AKLLN  FV +KVDREE PD+D++YM+YVQA  G GGWP+SV+L+P+L P  GG+Y
Sbjct: 85  NPEIAKLLNAHFVPVKVDREELPDLDRLYMSYVQASTGRGGWPMSVWLTPELNPFYGGSY 144

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRD-MLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           FPPE++YG PGFKTIL  +   W+ +R+ ++++SG+F         S  A S   P   P
Sbjct: 145 FPPEERYGMPGFKTILITITRYWENEREKIISESGSFFA-------SLGAVSRTTPSSQP 197

Query: 282 --QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
             + A + C E L  +YD  FGGFG APKFPRPV +  +  H+    D        +  +
Sbjct: 198 DAEMAQKKCFEWLEANYDPMFGGFGRAPKFPRPVLLNFLFNHAYHTGD-------KKALR 250

Query: 340 MVLFTLQCMAKGGIHDHV------GGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
           M L TL  MA+GGIHDH+      GGGF RYS D+RWHVPHFEKMLYD  QLA   L+AF
Sbjct: 251 MALHTLHKMAEGGIHDHLGIIGKGGGGFARYSTDQRWHVPHFEKMLYDNAQLAISCLEAF 310

Query: 394 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
             + D FY     DI +Y+  DM  P G  +SAEDAD+  T G+ +K+EGA Y+W++ E+
Sbjct: 311 QCSGDNFYKRTAEDIFNYVLCDMRSPQGGFYSAEDADTLLTHGSEQKQEGALYLWSADEI 370

Query: 454 EDILG--EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPL 511
            + L   E A +F   Y ++  GN +     DPH EF GKN+L++       A   G  +
Sbjct: 371 RETLADEELATIFSFTYGIRDEGNAEY----DPHGEFNGKNILMQQATDEECADTFGKTV 426

Query: 512 EKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNF 571
           E+    L + R KL+  RS+RPR  LDDK++ +WNGL+IS+ A+  ++L +E        
Sbjct: 427 EEIRAALDDARTKLYHARSRRPRAFLDDKILTAWNGLMISALAKGYQVLHNET------- 479

Query: 572 PVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLD 631
                    ++  A  AA+FI   LYD+   RL   +R+G +   G  +DYAFL+ GL D
Sbjct: 480 ---------FLAAAREAANFILETLYDQANGRLLRRYRDGNAAIAGKAEDYAFLVQGLTD 530

Query: 632 LYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNS 691
           LYE  S  ++L  A++L   Q+ LF D   GGYF+T  +D +V LR+KE++DGAEPS NS
Sbjct: 531 LYEASSEVRYLQIALQLAEIQNTLFYDNAQGGYFSTAIDDHTVPLRIKEEYDGAEPSANS 590

Query: 692 VSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKH 751
           +S +NL+RLA +      D+ R+ AE ++      L + + A+P M  A +  +   + H
Sbjct: 591 ISTLNLLRLAEMTG--NEDFVRR-AEETIKSCRIMLAENSSALPQMLVAKN-FAEQRKVH 646

Query: 752 VVLVGHKSSVDFENMLAAAHASYDLNKTVSKKS 784
           +V  G   S     +    +  Y    T+S  S
Sbjct: 647 LVFSGPLDSSSMNELRQTVYEQYLPGATMSHAS 679


>gi|46485467|ref|NP_659076.2| spermatogenesis-associated protein 20 [Mus musculus]
 gi|81912951|sp|Q80YT5.1|SPT20_MOUSE RecName: Full=Spermatogenesis-associated protein 20; AltName:
           Full=Sperm-specific protein 411; Short=Ssp411; AltName:
           Full=Transcript increased in spermiogenesis 78 protein
 gi|29748049|gb|AAH50788.1| Spermatogenesis associated 20 [Mus musculus]
          Length = 790

 Score =  566 bits (1458), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 306/707 (43%), Positives = 424/707 (59%), Gaps = 60/707 (8%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K  NRL  E SPYLLQHA+NPVDW+ WG+EAF +A+K + PIFLS+GYSTCHWCH+ME E
Sbjct: 64  KTVNRLINEKSPYLLQHAYNPVDWYPWGQEAFDKAKKENKPIFLSVGYSTCHWCHMMEEE 123

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SF++E + +LLN+ F+ + VDREERPDVDKVYMT+VQA   GGGWP++V+L+P L+P +G
Sbjct: 124 SFQNEEIGRLLNENFICVMVDREERPDVDKVYMTFVQATSSGGGWPMNVWLTPGLQPFVG 183

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           GTYFPPED   R GF+T+L ++ D W   ++ L ++     ++++ AL A +  +    +
Sbjct: 184 GTYFPPEDGLTRVGFRTVLMRICDQWKLNKNTLLENS----QRVTTALLARSEISVGDRQ 239

Query: 280 LPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEA 334
           +P +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S +L   G     
Sbjct: 240 IPASAATMNSRCFQQLDEGYDEEYGGFAEAPKFPTPVILNFLFSYWLSHRLTQDG----- 294

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
           S  Q+M L TL+ MA GGI DHVG GFHRYS D +WH+PHFEKMLYDQ QL+ VY  AF 
Sbjct: 295 SRAQQMALHTLKMMANGGIQDHVGQGFHRYSTDRQWHIPHFEKMLYDQAQLSVVYTQAFQ 354

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
           ++ D FY+ + + IL Y+ R +    G  +SAEDADS    G  + +EGA+YVWT KEV+
Sbjct: 355 ISGDEFYADVAKGILQYVTRTLSHRSGGFYSAEDADSPPERG-MKPQEGAYYVWTVKEVQ 413

Query: 455 DILGEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 504
            +L E  +          L  +HY L   GN + S+  DP+ E  G+NVL+       +A
Sbjct: 414 QLLPEPVVGASEPLTSGQLLMKHYGLSEVGNINSSQ--DPNGELHGQNVLMVRYSLELTA 471

Query: 505 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 564
           ++ G+ +E    +L     KLF  R  RP+ HLD+K++ +WNGL++S FA     L  E 
Sbjct: 472 ARYGLEVEAVRALLNTGLEKLFQARKHRPKAHLDNKMLAAWNGLMVSGFAVTGAALGMEK 531

Query: 565 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP------SKAP-- 616
             A                 A S A F++RH++D  + RL+ +   G       S  P  
Sbjct: 532 LVAQ----------------ATSGAKFLKRHMFDVSSGRLKRTCYAGTGGTVEQSNPPCW 575

Query: 617 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL- 675
           GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD+LF D  GGGYF +  E  + L 
Sbjct: 576 GFLEDYAFVVRGLLDLYEASQESSWLEWALRLQDTQDKLFWDPRGGGYFCSEAELGADLP 635

Query: 676 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 735
           LR+K+D DGAEPS NSVS  NL+RL S   G K   +       L  F  R++ + +A+P
Sbjct: 636 LRLKDDQDGAEPSANSVSAHNLLRLHSFT-GHKD--WMDKCVCLLTAFSERMRRVPVALP 692

Query: 736 LMCCAADMLSVPSR--KHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
            M      LS   +  K +V+ G   + D + +L   H+ Y  NK +
Sbjct: 693 EM---VRTLSAQQQTLKQIVICGDPQAKDTKALLQCVHSIYVPNKVL 736


>gi|324505187|gb|ADY42236.1| Unknown [Ascaris suum]
          Length = 775

 Score =  565 bits (1457), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 308/694 (44%), Positives = 412/694 (59%), Gaps = 70/694 (10%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRL  E SPYLLQHAHNPVDW+ WG+EAF +A+  +  IFLS+GYSTCHWCHVM  ESF
Sbjct: 56  TNRLVNERSPYLLQHAHNPVDWYPWGDEAFTKAKTLNRLIFLSVGYSTCHWCHVMAHESF 115

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           E++ +A +LN+ FVSIKVDREERPDVDK+YMT++QA+ GGGGWP+SVFL+PDL P+ GGT
Sbjct: 116 ENQTIADILNENFVSIKVDREERPDVDKLYMTFIQAISGGGGWPMSVFLTPDLNPVTGGT 175

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFPPED+YGRPGF +ILR + + W  + D +   G FA   L+ A+  +  +N+      
Sbjct: 176 YFPPEDRYGRPGFASILRTIAEKWQLEGDQIRGQG-FA---LANAIKKAFLTNRETVPAD 231

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQK 339
           +N    C  +L+  +D  + GFG APKFP+P E+  ML  Y + K    GK        K
Sbjct: 232 ENVALTCYTELADRFDETYKGFGGAPKFPKPAELDFMLSFYANNKSTTEGKL-----ALK 286

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           MV  TL+ MA+GGIHDH+G GFHRY+VD  WHVPHFEKMLYDQ QL +VY +        
Sbjct: 287 MVGETLEAMARGGIHDHIGKGFHRYAVDAAWHVPHFEKMLYDQAQLLSVYAN-------- 338

Query: 400 FYSYIC-------RDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 452
            YS +C        DI DY+ R++  P G  +SA+DADS  +  A  K+EGAFYVWT +E
Sbjct: 339 -YSLVCGQMKEIVEDIADYVYRNLTHPEGGFYSAQDADSLPSHNAKAKREGAFYVWTEQE 397

Query: 453 VEDILG----------EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 502
           ++D L           + A  FK+++ +K  GNC     +DPH E K +NVL   +    
Sbjct: 398 IDDALKDVTVNGDSSVDVATYFKQYFGVKANGNCPSD--TDPHGELKLQNVLAMKDSHKD 455

Query: 503 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 562
           SA KLG+  +K   I+ + R+ L + R++RP PHLD K++ SWNGL+IS  +RAS     
Sbjct: 456 SARKLGISEDKLTAIIEKARQVLVEARAQRPEPHLDSKMLTSWNGLMISGLSRAS----- 510

Query: 563 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRN---------GPS 613
                      V + + E    A+    FI++++  E    L+ ++ +          P 
Sbjct: 511 -----------VAAGKPELAGRAQKVVEFIKKYMLSENGELLRTAYTDESGGVVHNSKPV 559

Query: 614 KAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPS 673
           KA  F DDYAFLI GLLDLYE       L +A ELQ   DE F D +    +  +  DPS
Sbjct: 560 KA--FADDYAFLIEGLLDLYEVTFDENLLKFASELQKQFDERFWDTDNNAGYFLSETDPS 617

Query: 674 VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA 733
           ++ R  EDHDGAEP+ NSV+ +NLVRLASI      + +R    + L     RL+     
Sbjct: 618 IMTRFMEDHDGAEPATNSVAALNLVRLASIF---DEERFRDRVANILESVSLRLRRYPSV 674

Query: 734 VPLMCCAADMLSVPSRKHVVLVGHKSSVDFENML 767
           +P M  A    S P+   VV++G +     + ML
Sbjct: 675 LPKMVTALMRHSRPA-TLVVVIGKRDDPLTQQML 707


>gi|148683976|gb|EDL15923.1| spermatogenesis associated 20, isoform CRA_b [Mus musculus]
          Length = 796

 Score =  565 bits (1456), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 306/707 (43%), Positives = 424/707 (59%), Gaps = 60/707 (8%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K  NRL  E SPYLLQHA+NPVDW+ WG+EAF +A+K + PIFLS+GYSTCHWCH+ME E
Sbjct: 70  KTANRLINEKSPYLLQHAYNPVDWYPWGQEAFDKAKKENKPIFLSVGYSTCHWCHMMEEE 129

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SF++E + +LLN+ F+ + VDREERPDVDKVYMT+VQA   GGGWP++V+L+P L+P +G
Sbjct: 130 SFQNEEIGRLLNENFICVMVDREERPDVDKVYMTFVQATSSGGGWPMNVWLTPGLQPFVG 189

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           GTYFPPED   R GF+T+L ++ D W   ++ L ++     ++++ AL A +  +    +
Sbjct: 190 GTYFPPEDGLTRVGFRTVLMRICDQWKLNKNTLLENS----QRVTTALLARSEISVGDRQ 245

Query: 280 LPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEA 334
           +P +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S +L   G     
Sbjct: 246 IPASAATMNSRCFQQLDEGYDEEYGGFAEAPKFPTPVILNFLFSYWLSHRLTQDG----- 300

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
           S  Q+M L TL+ MA GGI DHVG GFHRYS D +WH+PHFEKMLYDQ QL+ VY  AF 
Sbjct: 301 SRAQQMALHTLKMMANGGIQDHVGQGFHRYSTDRQWHIPHFEKMLYDQAQLSVVYTQAFQ 360

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
           ++ D FY+ + + IL Y+ R +    G  +SAEDADS    G  + +EGA+YVWT KEV+
Sbjct: 361 ISGDEFYADVAKGILQYVTRTLSHRSGGFYSAEDADSPPERG-MKPQEGAYYVWTVKEVQ 419

Query: 455 DILGEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 504
            +L E  +          L  +HY L   GN + S+  DP+ E  G+NVL+       +A
Sbjct: 420 QLLPEPVVGASEPLTSGQLLMKHYGLSEVGNINSSQ--DPNGELHGQNVLMVRYSLELTA 477

Query: 505 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 564
           ++ G+ +E    +L     KLF  R  RP+ HLD+K++ +WNGL++S FA     L  E 
Sbjct: 478 ARYGLEVEAVRALLNTGLEKLFQARKHRPKAHLDNKMLAAWNGLMVSGFAVTGAALGMEK 537

Query: 565 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP------SKAP-- 616
             A                 A S A F++RH++D  + RL+ +   G       S  P  
Sbjct: 538 LVAQ----------------ATSGAKFLKRHMFDVSSGRLKRTCYAGTGGTVEQSNPPCW 581

Query: 617 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL- 675
           GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD+LF D  GGGYF +  E  + L 
Sbjct: 582 GFLEDYAFVVRGLLDLYEASQESSWLEWALRLQDTQDKLFWDPRGGGYFCSEAELGADLP 641

Query: 676 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 735
           LR+K+D DGAEPS NSVS  NL+RL S   G K   +       L  F  R++ + +A+P
Sbjct: 642 LRLKDDQDGAEPSANSVSAHNLLRLHSFT-GHKD--WMDKCVCLLTAFSERMRRVPVALP 698

Query: 736 LMCCAADMLSVPSR--KHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
            M      LS   +  K +V+ G   + D + +L   H+ Y  NK +
Sbjct: 699 EM---VRTLSAQQQTLKQIVICGDPQAKDTKALLQCVHSIYVPNKVL 742


>gi|301781214|ref|XP_002926022.1| PREDICTED: LOW QUALITY PROTEIN: spermatogenesis-associated protein
           20-like [Ailuropoda melanoleuca]
          Length = 785

 Score =  565 bits (1456), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 312/725 (43%), Positives = 424/725 (58%), Gaps = 61/725 (8%)

Query: 80  PYKVVAMAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDV 139
           P  V     RT  S S +  K  NRL  E SPYLLQHA+NPVDW+ WG+EAF +ARK + 
Sbjct: 44  PMPVGGKGSRTSCSPS-TLQKVPNRLINEKSPYLLQHAYNPVDWYPWGQEAFDKARKENK 102

Query: 140 PIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALY 199
           PIFLS+GYSTCHWCH+ME ESF++E + +LL++ FVS+KVDREERPDVDKVYMT+VQA  
Sbjct: 103 PIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPDVDKVYMTFVQATS 162

Query: 200 GGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFA 259
            GGGW     L+P+L+P +GGTYFPPED   R GF T+L ++++ W + +  L ++    
Sbjct: 163 SGGGW----XLTPNLQPFVGGTYFPPEDGLTRVGFHTVLLRIREQWKQNKTTLLENS--- 215

Query: 260 IEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQ 316
            ++++ AL A +  +    ++P +A  +   C +QL + YD  +GGF  APKFP PV + 
Sbjct: 216 -QRVTTALLARSEISMGDRQVPPSAATMNSRCFQQLDEGYDEEYGGFAEAPKFPTPVILN 274

Query: 317 MML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPH 374
            +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +WH+PH
Sbjct: 275 FLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHIPH 329

Query: 375 FEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET 434
           FEKMLYDQ QLA  Y  AF ++ D FYS + + IL Y+ R++    G  +SAEDADS   
Sbjct: 330 FEKMLYDQAQLAVAYTQAFQISGDEFYSDVAKGILQYVARNLSHRSGGFYSAEDADSPPE 389

Query: 435 EGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLKPTGNCDLSRMSDP 484
            G  R KEGAFYVWT  EV+ +L E  +          LF +HY L   GN  +S   DP
Sbjct: 390 RG-MRPKEGAFYVWTVNEVQQLLPEPVLGATEPLTSGQLFMKHYGLTEAGN--ISPSQDP 446

Query: 485 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 544
             E +G+NVL        +A++ G+ ++    +L     KLF  R  RP+PHLD K++ +
Sbjct: 447 KGELQGQNVLTVRYSLELTAARFGLDVDAVRTLLNTGLEKLFQARKHRPKPHLDSKMLAA 506

Query: 545 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 604
           WNGL++S +A    +L  E                  +  A + A F++RH++D    RL
Sbjct: 507 WNGLMVSGYAVTGAVLGLE----------------RLITCAINGAKFLKRHMFDVARGRL 550

Query: 605 QHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELF 656
             +   GP      S  P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD LF
Sbjct: 551 MRTCYAGPGGTVEHSNPPSWGFLEDYAFVVRGLLDLYEASQESSWLEWALRLQDTQDRLF 610

Query: 657 LDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQN 715
            D  GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL     G K   +   
Sbjct: 611 WDSRGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDK 667

Query: 716 AEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYD 775
               L  F  R++ + +A+P M  A       + K +V+ G   + D + +L   H+ Y 
Sbjct: 668 CVCLLTAFSERMRRVPVALPEMVRALSA-HQQTLKQIVICGDPQAKDTKALLQCVHSIYI 726

Query: 776 LNKTV 780
            NK +
Sbjct: 727 PNKVL 731


>gi|390355802|ref|XP_003728630.1| PREDICTED: spermatogenesis-associated protein 20
           [Strongylocentrotus purpuratus]
          Length = 671

 Score =  562 bits (1449), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 300/644 (46%), Positives = 399/644 (61%), Gaps = 47/644 (7%)

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           ME ESFE+  + KL+N+ +VSIKVDREERPDVD+VYMT++QA  GGGGWP+SV+L+PDLK
Sbjct: 1   MERESFENVDIGKLMNEHYVSIKVDREERPDVDRVYMTFIQATAGGGGWPMSVWLTPDLK 60

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           PLMGGTYFPP D++GRPGF TIL+ +   W + R+ L Q     IE L  A+   ++S+ 
Sbjct: 61  PLMGGTYFPPHDRFGRPGFPTILQSIARQWGENREALEQQSTKIIEALQAAVKVKSTSD- 119

Query: 276 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM--LYHSKKLEDTGKSGE 333
            P  L    +  C +QL+ S+D+++GGFG APKFP+PV    +  LY S      G+S  
Sbjct: 120 -PSPLGTEVMEKCFKQLTDSFDNQYGGFGGAPKFPQPVNFNFLFRLYSSPP----GESEI 174

Query: 334 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
              G KM L TL+ MAKGGIHDHV  GFHRYS D  WHVPHFEKMLYDQGQLA  YLDA+
Sbjct: 175 GERGLKMCLHTLKMMAKGGIHDHVSQGFHRYSTDRFWHVPHFEKMLYDQGQLAVAYLDAY 234

Query: 394 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
            +TK+  ++ + RDIL+Y+ RD+    G  +SAEDADS      T KKEGAF VWT  EV
Sbjct: 235 QITKEAVFADVARDILEYVGRDLSDKAGGFYSAEDADSLPAADETHKKEGAFCVWTDTEV 294

Query: 454 EDILGEH---------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 504
              L +          A +F +HY +K  GN D  +  DPH E K +NVLI      ++A
Sbjct: 295 RTHLSDMVEGSDSVTLADVFCKHYDIKTGGNVDFEQ--DPHGELKDQNVLIARGSVDSTA 352

Query: 505 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 564
           S LG+        L   RR L +VR +RPRPHLDDK++ +WNGL+IS F+RA ++L++  
Sbjct: 353 SMLGLTEGTVEAALETARRTLHEVRLERPRPHLDDKMLTAWNGLMISGFSRAGQVLQA-- 410

Query: 565 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTH-RLQHSFRNG-------PSKAP 616
                          E+ + AE A +FIR+HLYD  T   L+ ++RN        P    
Sbjct: 411 --------------PEFTQRAEQAVTFIRQHLYDPSTGCLLRSAYRNKEGDIAQIPIPIQ 456

Query: 617 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 676
           GF+DDY FLI GLLDLYE     +W+ WA +LQ   DEL  D E GGYF+TT +D S+LL
Sbjct: 457 GFVDDYCFLIRGLLDLYEANYDEQWIEWASQLQEKLDELLWDTENGGYFSTTDKDSSILL 516

Query: 677 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 736
           R+KED DGAEPS NSV+ +NL+RL+  +  ++ D Y++ A    +VF  RL+ + +A+P 
Sbjct: 517 RLKEDQDGAEPSANSVACMNLLRLSHYL--NRPD-YQEKASKLFSVFGERLQKIPIALPE 573

Query: 737 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           M  A  +    + K +++ G   + D   +L   H  Y  NK +
Sbjct: 574 MASAL-LFQESTAKQIIICGDPQAEDTRLLLQCVHTHYLPNKVL 616


>gi|126343214|ref|XP_001376429.1| PREDICTED: spermatogenesis-associated protein 20 [Monodelphis
           domestica]
          Length = 744

 Score =  562 bits (1449), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 300/702 (42%), Positives = 423/702 (60%), Gaps = 56/702 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYLLQHA+NPVDWF WG+EAF +A+K + PIFLS+GYSTCHWCHVME ESF+
Sbjct: 21  NRLIHEKSPYLLQHAYNPVDWFPWGQEAFDKAKKENKPIFLSVGYSTCHWCHVMEEESFQ 80

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           ++ + ++L++ FVSIKVDREERPDVDKVYMT+VQA   GGGWP++V+L+PDL+P +GGTY
Sbjct: 81  NKDIGQILSEDFVSIKVDREERPDVDKVYMTFVQATSSGGGWPMNVWLTPDLQPFVGGTY 140

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPPED   R GF+T+L ++++ W + + ML  +     ++++ +L A +       ELP 
Sbjct: 141 FPPEDGVTRVGFRTVLLRIREQWKQNKAMLMANS----QRVTASLLARSEICMGDRELPP 196

Query: 283 NALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEG 337
           +A  +   C +QL + YD   GGF   PKFP PV +  +   + + ++   G        
Sbjct: 197 SASAVSNRCFQQLEEVYDEEHGGFAEVPKFPTPVILSFLFSYWATHRMATDG-----FRA 251

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
           Q+M + TL+ MA GGI DHVG GFHRYS D +WH+PHFEKMLYDQ QLA  Y+ AF ++ 
Sbjct: 252 QQMAMHTLKMMANGGIRDHVGQGFHRYSTDRQWHIPHFEKMLYDQAQLAVAYIQAFQISG 311

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           D F++ I +DIL Y+ +++    G   SAEDADS   EG  + KEGA+Y+W  KE++D+L
Sbjct: 312 DEFFADIAKDILQYVSQNLSHQSGGFCSAEDADSM-PEGEKKPKEGAYYLWKVKEIKDLL 370

Query: 458 GEH----------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 507
            +             LF +HY +   GN  +    DPH E +G+NVL        +A++ 
Sbjct: 371 PDPVEGSNEPLTLGQLFMKHYGITENGN--IGSTQDPHGELQGQNVLTVRYSMDLTAARY 428

Query: 508 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 567
           G+  E    +L   R KL   R +RPRP LD K++ +WNGL++S +A     L +E    
Sbjct: 429 GLEAEAVRTLLDIGREKLIQTRKRRPRPRLDSKMLAAWNGLMVSGYAITGATLGNE---- 484

Query: 568 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP--------SKAPGFL 619
                       E ++ A   A F++RHL+D  + RL      G         S+  GFL
Sbjct: 485 ------------EMIKQAIDGAKFLKRHLFDVSSGRLIRGCYAGAGGTVEQSSSQWWGFL 532

Query: 620 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRV 678
           +DYAF+I GLLDLYE    + WL WA++LQ+ QD+LF D +GGGYF    E  + L LR+
Sbjct: 533 EDYAFVIRGLLDLYEASRESAWLEWALKLQDMQDKLFWDTQGGGYFCNEVELRNDLPLRL 592

Query: 679 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 738
           K+D DG+EPS NSVS  NL+R+       + DY  +  +  L  F  RL  + +A+P M 
Sbjct: 593 KDDQDGSEPSANSVSAHNLLRIHGYTG--RRDYMEKCVK-LLTAFSDRLWKVPVALPEMV 649

Query: 739 CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
            A  ++   + K VV+ G   + D + ++   H+ Y  NK +
Sbjct: 650 RAL-IIQQQTVKQVVICGSPQTTDTQALINCVHSVYVPNKVL 690


>gi|40786501|ref|NP_955434.1| spermatogenesis-associated protein 20 [Rattus norvegicus]
 gi|81871190|sp|Q6T393.1|SPT20_RAT RecName: Full=Spermatogenesis-associated protein 20; AltName:
           Full=Sperm-specific protein 411; Short=Ssp411
 gi|38156445|gb|AAR12892.1| sperm protein SSP411 [Rattus norvegicus]
          Length = 789

 Score =  561 bits (1447), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 301/706 (42%), Positives = 423/706 (59%), Gaps = 56/706 (7%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
            K  NRL  E SPYLLQHAHNPVDW+ WG+EAF +A+K + PIFLS+GYSTCHWCH+ME 
Sbjct: 62  QKTANRLINEKSPYLLQHAHNPVDWYPWGQEAFDKAKKENKPIFLSVGYSTCHWCHMMEE 121

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESF++E +  LLN+ FVS+ VDREERPDVDKVYMT+VQA   GGGWP++V+L+P L+P +
Sbjct: 122 ESFQNEEIGHLLNENFVSVMVDREERPDVDKVYMTFVQATSSGGGWPMNVWLTPSLQPFV 181

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
           GGTYFPPED   R GF+T+L ++ D W + ++ L ++     ++++ AL A +  +    
Sbjct: 182 GGTYFPPEDGLTRVGFRTVLMRICDQWKQNKNTLLENS----QRVTTALLARSEISVGDR 237

Query: 279 ELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGE 333
           +LP +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S ++   G    
Sbjct: 238 QLPPSAATMNSRCFQQLDEGYDEEYGGFAEAPKFPTPVILNFLFSYWLSHRVTQDG---- 293

Query: 334 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
            S  Q+M L TL+ MA GGI DHVG GFHRYS D +WH+PHFEKMLYDQ QL+ VY  AF
Sbjct: 294 -SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHIPHFEKMLYDQAQLSVVYCQAF 352

Query: 394 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
            ++ D F+S + + IL Y+ R++    G  +SAEDADS    G  + +EGA Y+WT KEV
Sbjct: 353 QISGDEFFSDVAKGILQYVTRNLSHRSGGFYSAEDADSPPERG-VKPQEGALYLWTVKEV 411

Query: 454 EDILGE----------HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 503
           + +L E             L  +HY L   GN + ++  D + E  G+NVL   +    +
Sbjct: 412 QQLLPEPVGGASEPLTSGQLLMKHYGLSEAGNINPTQ--DVNGEMHGQNVLTVRDSLELT 469

Query: 504 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 563
            ++ G+ +E    +L     KLF  R  RP+ HLD+K++ +WNGL++S FA A  +L  E
Sbjct: 470 GARYGLEVEAVRALLNTGLEKLFQARKHRPKAHLDNKMLAAWNGLMVSGFAVAGSVLGME 529

Query: 564 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP------SKAP- 616
                           + +  A + A F++RH++D  + RL+ +   G       S  P 
Sbjct: 530 ----------------KLVTQATNGAKFLKRHMFDVSSGRLKRTCYAGAGGTVEQSNPPC 573

Query: 617 -GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL 675
            GFL+DYAF++ GLLDLYE    + WL WA+ LQ+ QD+LF D  GGGYF +  E  + L
Sbjct: 574 WGFLEDYAFVVRGLLDLYEASQESSWLEWALRLQDIQDKLFWDSHGGGYFCSEAELGTDL 633

Query: 676 -LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV 734
            LR+K+D DGAEPS NSVS  NL+RL  +  G K   +       L  F  R++ + +A+
Sbjct: 634 PLRLKDDQDGAEPSANSVSAHNLLRLHGLT-GHKD--WMDKCVCLLTAFSERMRRVPVAL 690

Query: 735 PLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           P M  A       + K +V+ G   + D + +L   H+ Y  NK +
Sbjct: 691 PEMVRALSA-QQQTLKQIVICGDPQAKDTKALLQCVHSIYIPNKVL 735


>gi|149053889|gb|EDM05706.1| spermatogenesis associated 20 [Rattus norvegicus]
          Length = 745

 Score =  561 bits (1447), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 302/706 (42%), Positives = 423/706 (59%), Gaps = 56/706 (7%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
            K  NRL  E SPYLLQHAHNPVDW+ WG+EAF +A+K + PIFLS+GYSTCHWCH+ME 
Sbjct: 18  QKTANRLINEKSPYLLQHAHNPVDWYPWGQEAFDKAKKENKPIFLSVGYSTCHWCHMMEE 77

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESF++E +  LLN+ FVS+ VDREERPDVDKVYMT+VQA   GGGWP++V+L+P L+P +
Sbjct: 78  ESFQNEEIGHLLNENFVSVMVDREERPDVDKVYMTFVQATSSGGGWPMNVWLTPSLQPFV 137

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
           GGTYFPPED   R GF+T+L ++ D W + ++ L ++     ++++ AL A +  +    
Sbjct: 138 GGTYFPPEDGLTRVGFRTVLMRICDQWKQNKNTLLENS----QRVTTALLARSEISVGDR 193

Query: 279 ELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGE 333
           +LP +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S ++   G    
Sbjct: 194 QLPPSAATMNSRCFQQLDEGYDEEYGGFAEAPKFPTPVILNFLFSYWLSHRVTQDG---- 249

Query: 334 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
            S  Q+M L TL+ MA GGI DHVG GFHRYS D +WH+PHFEKMLYDQ QL+ VY  AF
Sbjct: 250 -SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHIPHFEKMLYDQAQLSVVYCQAF 308

Query: 394 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
            ++ D F+S + + IL Y+ R++    G  +SAEDADS    G  + +EGA Y+WT KEV
Sbjct: 309 QISGDEFFSDVAKGILQYVTRNLSHRSGGFYSAEDADSPPERG-VKPQEGALYLWTVKEV 367

Query: 454 EDILGE----------HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 503
           + +L E             L  +HY L   GN + ++  D + E  G+NVL        +
Sbjct: 368 QQLLPEPVGGASEPLTSGQLLMKHYGLSEAGNINPTQ--DVNGEMHGQNVLTVRYSLELT 425

Query: 504 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 563
           A++ G+ +E    +L     KLF  R  RP+ HLD+K++ +WNGL++S FA A  +L  E
Sbjct: 426 AARYGLEVEAVRALLNTGLEKLFQARKHRPKAHLDNKMLAAWNGLMVSGFAVAGSVLGME 485

Query: 564 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP------SKAP- 616
                           + +  A + A F++RH++D  + RL+ +   G       S  P 
Sbjct: 486 ----------------KLVTQATNGAKFLKRHMFDVSSGRLKRTCYAGAGGTVEQSNPPC 529

Query: 617 -GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL 675
            GFL+DYAF++ GLLDLYE    + WL WA+ LQ+ QD+LF D  GGGYF +  E  + L
Sbjct: 530 WGFLEDYAFVVRGLLDLYEASQESSWLEWALRLQDIQDKLFWDSHGGGYFCSEAELGTDL 589

Query: 676 -LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV 734
            LR+K+D DGAEPS NSVS  NL+RL  +  G K   +       L  F  R++ + +A+
Sbjct: 590 PLRLKDDQDGAEPSANSVSAHNLLRLHGLT-GHKD--WMDKCVCLLTAFSERMRRVPVAL 646

Query: 735 PLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           P M  A       + K +V+ G   + D + +L   H+ Y  NK +
Sbjct: 647 PEMVRALSA-QQQTLKQIVICGDPQAKDTKALLQCVHSIYIPNKVL 691


>gi|320168532|gb|EFW45431.1| spermatogenesis-associated protein 20 [Capsaspora owczarzaki ATCC
           30864]
          Length = 832

 Score =  560 bits (1443), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 305/732 (41%), Positives = 418/732 (57%), Gaps = 96/732 (13%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRLA E SPYLLQHAHNPVDW   G EAF +AR+R +PIFLS+GYSTCHWCHVME +SF
Sbjct: 22  TNRLATEKSPYLLQHAHNPVDW---GPEAFQKARERQLPIFLSVGYSTCHWCHVMEEQSF 78

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
            + G+A ++N  FV+IKVDREERPDVD+VYM ++ A  G GGWP+SV+L+P+L P+ GGT
Sbjct: 79  MNPGIASIMNKNFVNIKVDREERPDVDRVYMAFITATTGHGGWPMSVWLTPELTPIFGGT 138

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE-- 279
           YFPPEDK+G PGF  +L K+   W  +RD +   G   ++ L + + A     +  +E  
Sbjct: 139 YFPPEDKWGTPGFPFLLAKIAALWSSRRDEILLKGRGIMQLLEQGIDARLQPTEESNEGA 198

Query: 280 -------LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML------------Y 320
                    ++ L L   +  + +D + GGFG APKFPRPV +Q +L             
Sbjct: 199 VSDAKQDSARDWLELAFTKFEEEFDPQLGGFGGAPKFPRPVILQFLLNLYAHFSRVTASL 258

Query: 321 HSKKLEDTGKSGEAS------------------------------------EGQKMVLFT 344
            ++  + T     AS                                    +  +M   T
Sbjct: 259 KAQATDATPSPTSASPRLAGAPVAAAAATTLSASPKLKGSRRLSVAERNCLQTMRMCTTT 318

Query: 345 LQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYI 404
           L  M +GG++DH+GGGFHRYSVD+ WHVPHFEKML+DQ QLA  Y   F LT+   Y+ +
Sbjct: 319 LDAMHRGGLYDHLGGGFHRYSVDQFWHVPHFEKMLFDQAQLALTYAMGFQLTRIPAYAQV 378

Query: 405 CRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE----- 459
           CRD L Y+ RD+  P G  FSAEDADS  +  +  K EGA+YVW+ +E+   L +     
Sbjct: 379 CRDTLAYVLRDLAHPLGGFFSAEDADSLPSVTSESKSEGAYYVWSYEEISTTLSQGDCAA 438

Query: 460 -------HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 512
                     +F   + ++P GN  + R S+PH E   KN L +      +A    +PL 
Sbjct: 439 GVASNATDLAVFCYAFGVRPQGN--IRRESNPHGELARKNHLFQEYTLQETADHFHLPLA 496

Query: 513 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 572
              N L   R +L  +R+ RPRPHLDDK+I +WNGL+IS+ A+A  ++    E  +F   
Sbjct: 497 DVANRLENARARLHGIRAARPRPHLDDKIIAAWNGLMISALAKAGGVV----EEPLF--- 549

Query: 573 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLISGLLD 631
                    +  A+ AA F+R  +Y+ ++ +L  S+R+G  SK  GFL DYAF+I GLLD
Sbjct: 550 ---------IHAAQKAARFLRGSMYNTESGQLVRSWRDGSASKVGGFLSDYAFVIQGLLD 600

Query: 632 LYEFGSGTKWLVWAIELQNTQDELFLDRE-GGGYFNTTGEDPSVLLRVKEDHDGAEPSGN 690
           LYE    T WL WA++LQ+ QDELF D   GGGYF T+  DPS+L+R+K + D AEP+GN
Sbjct: 601 LYEVDGDTTWLEWALQLQSKQDELFHDPNGGGGYFVTSTHDPSILVRLKCEEDSAEPAGN 660

Query: 691 SVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRK 750
           S++ INL+RLA++V   +    R  A   +   +    +   A+P+M  A   L  P+ +
Sbjct: 661 SIAAINLLRLANLVNRPE---MRDRAAALITSHQFLFSNAPTALPMMLSALQFLHSPNVQ 717

Query: 751 HVVLVGHKSSVD 762
            VVLV   S  D
Sbjct: 718 -VVLVTKNSPTD 728


>gi|395328680|gb|EJF61071.1| hypothetical protein DICSQDRAFT_161788 [Dichomitus squalens
           LYAD-421 SS1]
          Length = 791

 Score =  560 bits (1443), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 308/694 (44%), Positives = 418/694 (60%), Gaps = 47/694 (6%)

Query: 60  RNYLYPFRRPLAVISHRPIHPYKVVAMAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHN 119
           R + +P  RP  + S   I   K++        ST+H  + H NRLA   SPYLLQHA N
Sbjct: 33  RIHKFPLARPTTIPSRTHIFA-KIM--------STAHGGSGHKNRLAKAKSPYLLQHAEN 83

Query: 120 PVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKV 179
           PVDW+ WG+EAF +A+    PIFLS+GYS CHWCHV+  ESFEDE  AK++N+++V+IKV
Sbjct: 84  PVDWYEWGQEAFDKAKLESKPIFLSVGYSACHWCHVLAHESFEDEVTAKIMNEYYVNIKV 143

Query: 180 DREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILR 239
           DREERPDVD++YMT++QA  GGGGWP+SV+L+PDL P   GTYFPP +      F+ +L 
Sbjct: 144 DREERPDVDRLYMTFLQATTGGGGWPMSVWLTPDLHPFFAGTYFPPGN------FRQVLI 197

Query: 240 KVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSR 299
           K+ + W++  +    SG   IE L ++  A+  S      L +  L     QL K +D++
Sbjct: 198 KLAEIWERDPERCIASGKQIIEVLQQSSKAAPESGVDVKPLAEKILT----QLQKRFDAK 253

Query: 300 FGGFGSAPKFPRPVEIQMML-----YHSKKLEDTGKSGEASE-GQKMVLFTLQCMAKGGI 353
            GGFG APKFP P +    L     Y+      T +  E++E  + M +FT+  +  GGI
Sbjct: 254 EGGFGRAPKFPSPSQTMYPLARIAAYYLNNSSATAQEKESAEKARDMAVFTMTKIYNGGI 313

Query: 354 HDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD-----VFYSYICRDI 408
            D VGGGF RYSVDERWHVPHFEKMLYD+ QL +  L+ + L             + +DI
Sbjct: 314 RDVVGGGFSRYSVDERWHVPHFEKMLYDEAQLLSSALELYQLLPSGSHDKTTLELMAKDI 373

Query: 409 LDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHY 468
           + Y+ RD+  P G  +SAEDADS  +  +T KKEGAFYVWT+K+++++L   A LFK H+
Sbjct: 374 VSYVARDLRSPQGGFYSAEDADSLPSHESTVKKEGAFYVWTAKQLDELLDADAELFKYHF 433

Query: 469 YLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDV 528
            +K  GNCD S   D   E KG+NVL   +    +A K G   E+    L      L + 
Sbjct: 434 GVKAEGNCDPSH--DIQGELKGQNVLFTAHTLEETAQKFGKAYEEVQKTLEVNLATLREY 491

Query: 529 RSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAES 587
           R+K RPRPHLDDK++  WNGL+IS  ++  ++L S +E A           K+ +++AE 
Sbjct: 492 RNKHRPRPHLDDKILACWNGLMISGLSKTYEVLHSHSEIA-----------KKALQLAED 540

Query: 588 AASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIE 647
           +A+F+R HLYDE++  L  S+R GP    G  DDYAFLI GLLDLYE  +  ++L+WA+ 
Sbjct: 541 SATFLRAHLYDEKSGTLWRSYREGPGPT-GQADDYAFLIQGLLDLYEASAKEEYLLWALR 599

Query: 648 LQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGS 707
           LQ  QDELF D EGGGYF  +  D  +L+R+K+  DGAEPS  SV+V NL RLA     +
Sbjct: 600 LQEKQDELFYDPEGGGYF-ASAPDEHILVRMKDAQDGAEPSAVSVAVSNLQRLAHFAEDN 658

Query: 708 KSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 741
            S +  +    +LA     LK    A+  M  AA
Sbjct: 659 HSAFTEKTTS-TLASNGQFLKQAPHALAYMVSAA 691


>gi|409047490|gb|EKM56969.1| hypothetical protein PHACADRAFT_92450 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 717

 Score =  559 bits (1441), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 306/692 (44%), Positives = 416/692 (60%), Gaps = 51/692 (7%)

Query: 93  STSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHW 152
           +T H  + H NRLA   SPYLLQHA NPVDW+ WG EAF +A++ D PIFLS+GYS CHW
Sbjct: 7   ATGHGGSHHPNRLAKAKSPYLLQHAENPVDWYEWGPEAFEKAKREDKPIFLSVGYSACHW 66

Query: 153 CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 212
           CHV+  ESFEDE  AKL+N+ +V++KVDREERPDVD++YMT++QA  GGGGWP+SV+L+P
Sbjct: 67  CHVLAHESFEDEVTAKLMNERYVNVKVDREERPDVDRLYMTFLQATSGGGGWPMSVWLTP 126

Query: 213 DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASAS 272
           DL P   GTYFP      +  F+  L K+ + W++ R+ L +SG   IEQL  + +AS  
Sbjct: 127 DLHPFFAGTYFP------KGQFRQALEKLANFWEEDRERLVESGKGIIEQLKSSSNASIC 180

Query: 273 SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE--DTGK 330
           S                ++L + YDS  GGFG APKFP P +    L     L   D   
Sbjct: 181 SQ-------------VYKRLERLYDSVHGGFGGAPKFPSPSQTTHFLARLAALNIGDEKL 227

Query: 331 SGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYL 390
             EA + + M + T+  +  GGI D VGGGF RYSVD+ WHVPHFEKMLYD+ QL +  L
Sbjct: 228 KSEALKARDMAVQTMVKIYNGGIRDVVGGGFSRYSVDDHWHVPHFEKMLYDEAQLLSSAL 287

Query: 391 DAFSLTKDVFYSYICR-------DILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 443
           +   L      S  C+       DI+ Y+ RD+    G  +SAEDADS  +  +T KKEG
Sbjct: 288 ELAQLLP--IDSVECKTLEAMANDIIIYVSRDLRNSEGAFYSAEDADSLPSSDSTIKKEG 345

Query: 444 AFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 503
           AFYVWTS +++++LG+++ +FK HY +K  GNCD     D   E KG+NVL   +    +
Sbjct: 346 AFYVWTSAQLDELLGDNSDVFKFHYGVKSNGNCDPKH--DVQGELKGQNVLYTAHTVEDT 403

Query: 504 ASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKS 562
           A K G+P E+    L +C   L   R + RPRPHLDDK++  WNGL++S  A+AS++L+ 
Sbjct: 404 ARKFGIPAEQVQVTLDQCLAHLKRYRDENRPRPHLDDKILTCWNGLMLSGLAKASEVLEG 463

Query: 563 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 622
           +A +A              +++AE +A+FI++ LYDE+T  L+ S+R GP    G  DDY
Sbjct: 464 QAANA--------------LKLAEDSAAFIKKELYDEKTGELRRSYRQGPGPT-GQADDY 508

Query: 623 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 682
           AFLI GLLDLYE     +++ WAI LQ  QDELF D EGGGYF  +  DP +L+R+K+  
Sbjct: 509 AFLIQGLLDLYEASGKEEYVTWAIRLQEKQDELFHDTEGGGYF-ASAPDPHILVRMKDAQ 567

Query: 683 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 742
           DGAEPS  SV++ NL RLA   A  +   YR+ A+  L      L+    A+  M  AA 
Sbjct: 568 DGAEPSAVSVTLYNLNRLAHF-AEDRHGEYREKAQSILRSNSQLLEHAPFALATMVSAA- 625

Query: 743 MLSVPSRKHVVLVGHKSSVDFENMLAAAHASY 774
           + +    +  ++ G  S+ D    L A   ++
Sbjct: 626 LTAQRGYRQFIVSGEASNSDTTRFLHAIRHTF 657


>gi|392558461|gb|EIW51649.1| hypothetical protein TRAVEDRAFT_137028 [Trametes versicolor
           FP-101664 SS1]
          Length = 739

 Score =  559 bits (1440), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 300/638 (47%), Positives = 401/638 (62%), Gaps = 48/638 (7%)

Query: 93  STSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHW 152
           STS++  +H NRLA   SPYLLQHA NPVDW+ WG+EAF +A+K + PIFLS+GYS CHW
Sbjct: 2   STSNTSTRHVNRLAKAKSPYLLQHAENPVDWYEWGQEAFDKAKKENKPIFLSVGYSACHW 61

Query: 153 CHVMEVESFEDEGVAKLLNDWFVSI-KVDREERPDVDKVYMTYVQALYGGGGWPLSVFLS 211
           CHV+  ESFEDE  AK++N+ +V++ KVDREERPDVD++YMT++QA  GGGGWP+SV+L+
Sbjct: 62  CHVLAHESFEDEITAKMMNEHYVNVKKVDREERPDVDRLYMTFLQASTGGGGWPMSVWLT 121

Query: 212 PDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASA 271
           PDL P   GTYFPP    GR  F+ IL ++ D W   R+   +S    +E L E      
Sbjct: 122 PDLHPFFAGTYFPP----GR--FRQILDRLADVWTYDRERCIESAGKVLETLKE------ 169

Query: 272 SSNKLPDELPQNALRL------CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSK 323
           SSN  P   PQ+++ L        ++L K +D   GGFG APKFP P +    L  Y + 
Sbjct: 170 SSNIAPS--PQDSVELKPLPQEVFQRLQKRFDGVNGGFGGAPKFPSPAQTTHFLARYAAS 227

Query: 324 KLEDTGKSGE----ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 379
            L D   S E    A   + M ++++  +  GGI D VGGGF RYSVDERWHVPHFEKML
Sbjct: 228 HLSDLNASNEDKKNAQAARDMAVYSMIKIYNGGIRDVVGGGFSRYSVDERWHVPHFEKML 287

Query: 380 YDQGQLANVYLDAFSL----TKD-VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET 434
           YD+ QL +  LD + L    ++D      + +DI+ Y+  D+  P G  +SAEDADS  T
Sbjct: 288 YDEAQLLSSSLDLYQLLTTPSRDKKTLELMAKDIVSYVANDLRSPEGGFYSAEDADSLPT 347

Query: 435 EGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL 494
             +  KKEGAFYVWTS++++++LG  A LF+ H+ ++  GNCD     D   E KG+NVL
Sbjct: 348 HDSIVKKEGAFYVWTSEQLDELLGADAELFEYHFGVEADGNCDPGH--DIQGELKGQNVL 405

Query: 495 IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSF 553
              + S  +A K G  +E    ILG   + L D R K RPRPHLDDK++  WNGL+IS  
Sbjct: 406 FTAHTSEETADKFGKSVEDTEKILGAGLKTLRDYRDKHRPRPHLDDKILTCWNGLMISGL 465

Query: 554 ARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPS 613
           AR S++L  + + A            + +++AE++A+FIR HL+DEQ+ +L  S+R GP 
Sbjct: 466 ARTSEVLGHDKDVA-----------SKALDMAEASAAFIRGHLFDEQSGKLWRSYREGPG 514

Query: 614 KAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPS 673
              G  DDYAFLI G LDLYE  +  + L+WA+ LQ  QDELF D E GGYF  +  D  
Sbjct: 515 PT-GQADDYAFLIQGFLDLYEASANEEHLLWALRLQEKQDELFYDPEDGGYF-ASAPDEH 572

Query: 674 VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDY 711
           +L+R+K+  DGAEPS  SV++ NL RLA +     +DY
Sbjct: 573 ILIRMKDAQDGAEPSAVSVTLANLQRLAHLAEDRHADY 610


>gi|395536753|ref|XP_003770376.1| PREDICTED: spermatogenesis-associated protein 20 [Sarcophilus
           harrisii]
          Length = 744

 Score =  558 bits (1438), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 296/710 (41%), Positives = 422/710 (59%), Gaps = 52/710 (7%)

Query: 93  STSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHW 152
           S + ++ +  NRL  E SPYLLQHA+NPVDWF WG+EAF +A+  + PIFLS+GYSTCHW
Sbjct: 11  SHNQTQLQVPNRLIHEKSPYLLQHAYNPVDWFPWGQEAFDKAKNENKPIFLSVGYSTCHW 70

Query: 153 CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 212
           CHVME ESF ++ + ++L++ FVS+KVDREE PDVDKVYMT+VQA   GGGWP++V+L+P
Sbjct: 71  CHVMEEESFRNKEIGEILSEDFVSVKVDREEHPDVDKVYMTFVQATSSGGGWPMNVWLTP 130

Query: 213 DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASAS 272
           DL+P +GGTYFPPED   R GF+T+L +++D W + + ML ++     ++++ +L A + 
Sbjct: 131 DLQPFVGGTYFPPEDGLTRVGFRTVLLRIRDQWKQNKAMLLENS----QRVTASLLARSE 186

Query: 273 SNKLPDELPQNA---LRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 329
                 ELP  A    + C +QL + YD   GGF  APKFP PV +  +  +      T 
Sbjct: 187 ITVGDRELPPTASAVSKRCFQQLEEVYDEEHGGFAEAPKFPTPVILSFLFSYWAAHRMT- 245

Query: 330 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 389
              E    Q+M + +L+ MA GGI DHVG GFHRYS D +WH+PHFEKMLYDQ QLA  Y
Sbjct: 246 --SEGFRAQQMAMHSLKMMANGGIRDHVGQGFHRYSTDRQWHIPHFEKMLYDQAQLAVAY 303

Query: 390 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 449
             AF ++ D  +S + + IL Y+ +++  P G  +SAEDADS   EG  + KEGA+Y+WT
Sbjct: 304 TQAFQVSGDELFSDVAKGILQYVSQNLSHPSGGFYSAEDADSV-PEGEVKPKEGAYYLWT 362

Query: 450 SKEVEDILGEH----------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 499
             E++D+L E             LF +HY +  TGN  +    DP  E +G+NVL     
Sbjct: 363 VNEIKDLLPEPVEGATEPLSLGQLFMKHYGVTETGN--IGSTQDPQGELQGQNVLTVRYS 420

Query: 500 SSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 559
              +A++ G+  E    +L   R KL  +R +R RP LD K++ +WNG+++S +A A  +
Sbjct: 421 MDLTAARFGLEAETVRKLLDTGREKLVQIRKRRSRPRLDIKMLAAWNGMMVSGYAIAGAV 480

Query: 560 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL--------QHSFRNG 611
           L  E                E +  A   A F++RHL+D  + RL          +    
Sbjct: 481 LGKE----------------ELINQAIDGAKFLKRHLFDVSSGRLFRGCYATIGGTVEQS 524

Query: 612 PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGED 671
            S+  GFL+DYAF+I GLLDLYE    + WL WA+ LQ+ QD+LF D +GGGYF +  E 
Sbjct: 525 SSQFWGFLEDYAFVIRGLLDLYEASGESAWLEWALRLQDMQDKLFWDTQGGGYFCSEAEL 584

Query: 672 PSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDM 730
              L LR+K+D DG+EPS NSVS  NL+R+ +     + D+  +  +  L  F  RL+ +
Sbjct: 585 GGNLPLRLKDDQDGSEPSANSVSAHNLLRIHAYTG--RRDWMDKCVK-LLTAFSDRLRRV 641

Query: 731 AMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
            +A+P M  A   +   + K +V+ G     D + ++   H+ Y  NK +
Sbjct: 642 PVALPEMVRAL-CIQQQTIKQIVICGSPQGQDTKALIDCVHSIYVPNKVL 690


>gi|427779347|gb|JAA55125.1| Hypothetical protein [Rhipicephalus pulchellus]
          Length = 816

 Score =  554 bits (1427), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 319/753 (42%), Positives = 426/753 (56%), Gaps = 110/753 (14%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYLLQHA NPVDW+ WG+ AF +A+  D  IFLS+GYSTCHWCHVME ESFE
Sbjct: 20  NRLAQEKSPYLLQHASNPVDWYPWGDAAFKKAKDEDKLIFLSVGYSTCHWCHVMERESFE 79

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           ++ +AK++ND FV++KVDREERPDVD+VYMTY+QA  GGGGWP+S++L+PDLKP++GGTY
Sbjct: 80  NDDIAKIMNDNFVNVKVDREERPDVDRVYMTYIQATSGGGGWPMSIWLTPDLKPVVGGTY 139

Query: 223 FPPEDK-YGRPGFKTILRKVKDAWDKKRDMLAQSGA--FAI-EQLSE-----------AL 267
           FPP+D+ YG+PGFKT+L  + + W K R  L   G   F I EQ S+           + 
Sbjct: 140 FPPDDRYYGQPGFKTLLTSLAEQWRKNRTKLIDQGTRIFQILEQTSDVRVFGGDGVPTSP 199

Query: 268 SASASSNKLPDELPQNALRLCAEQ---------LSKSYDSR-FGG--------------- 302
             S ++ K P     +    C  Q         L ++ D R FGG               
Sbjct: 200 RGSEANQKCP--FAPDVATTCYRQLXGTRIFQILEQTSDVRVFGGDGVPTSPRGSEANQK 257

Query: 303 -------------------------FGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEA--- 334
                                    FG APKFP+ V +  +L +   L       EA   
Sbjct: 258 CPFAPDVATTCYRQLERSYDVSMGGFGRAPKFPQCVNLNFLLRYRAVLLQGDPPPEAKTA 317

Query: 335 -SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
             +  +M + TL+ MA+GGIHDH+G GFHRYS D +WHVPHFEKMLYDQ QL   Y +A+
Sbjct: 318 VDKALEMTVHTLRMMAQGGIHDHIGKGFHRYSTDGKWHVPHFEKMLYDQAQLTRTYSEAY 377

Query: 394 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
            +T D   + + RDIL Y+ RD+  P G  +SAEDADS    G   K+EGAF VW   EV
Sbjct: 378 QVTHDRRLADVARDILCYVERDLSHPSGGFYSAEDADSYPEHGDKEKREGAFCVWEESEV 437

Query: 454 EDILGEH---------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 504
             +L E          A +   +Y ++ +GN D   M DPH+E K KNVLI      + A
Sbjct: 438 YRLLTEPLPSCPTKTVADIVCRYYDIRKSGNVD--PMQDPHDELKRKNVLIVRESKESVA 495

Query: 505 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 564
           +  G+ +     +L   R  LF+ R +RP+PHLDDK + SWNGL+IS FA A++ L    
Sbjct: 496 ACYGLEVGVLDALLERARETLFEARLRRPKPHLDDKFLTSWNGLMISGFAIAARTL---- 551

Query: 565 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS-FR-------NGPSKAP 616
                N PV       Y++ A     FI++HLY+ +   L  S +R        G     
Sbjct: 552 -----NQPV-------YLDRALKCVEFIKKHLYNPKKKTLIRSAYRGEDGSVVQGSQPID 599

Query: 617 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 676
           G L+DYAFLI  LLD+YE       L+WA ELQ+ QD LF D++  GYF + GEDP+V+L
Sbjct: 600 GVLEDYAFLIQALLDVYEASFDVSCLMWAEELQDKQDRLFWDKKDMGYFLSNGEDPTVVL 659

Query: 677 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 736
           R+K+D DGAEPS NSVS+ NLVRL+ ++   + D  RQ AE   +V+  R+  + +A+P 
Sbjct: 660 RLKDDQDGAEPSSNSVSLNNLVRLSVLL---QRDELRQRAEKLASVYGQRMILVPLALPE 716

Query: 737 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAA 769
           M C    L     + VV+ G +     + +L+ 
Sbjct: 717 MVCGLMRLQA-GPQEVVIAGPRDDPGTKELLSC 748


>gi|66826709|ref|XP_646709.1| DUF255 family protein [Dictyostelium discoideum AX4]
 gi|60474801|gb|EAL72738.1| DUF255 family protein [Dictyostelium discoideum AX4]
          Length = 824

 Score =  553 bits (1425), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 299/705 (42%), Positives = 420/705 (59%), Gaps = 59/705 (8%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K+TN+L  E SPYLL+HAHNPVDW  WGEEAF  AR  D  IFLS+GY  CHWC+VME E
Sbjct: 90  KYTNKLINEKSPYLLKHAHNPVDWLPWGEEAFKIARDNDKLIFLSVGYMACHWCNVMERE 149

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
            FE+  +AK++N++ V+IK+DREERPD+DK+YMTY+  + G GGWP+S++L+P L P+ G
Sbjct: 150 CFENVEIAKVMNEYCVNIKIDREERPDIDKIYMTYLTEISGSGGWPMSIWLTPQLHPITG 209

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           GTYF PE KYGRPGF  +++K+   W K R+M+ +     I+ L E       +N L  +
Sbjct: 210 GTYFAPEAKYGRPGFPDLIKKLDKLWRKDREMVQERADSFIKFLKEEKPMGNINNALSSQ 269

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
                +  C +Q+ K YD   GG+  APKFPR     ++L   K  ED  K  +     K
Sbjct: 270 ----TIEKCFQQIMKGYDPIDGGYSDAPKFPRCSIFNLLLMTLK--EDYSK--QVGSLDK 321

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           +V FTL+ MA GG++D VGGGFHRYSV   W +PHFEKMLYD  QLA+VYLDA+ +TK  
Sbjct: 322 LV-FTLEKMANGGMYDQVGGGFHRYSVTSDWMIPHFEKMLYDNAQLASVYLDAYQITKSP 380

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 459
            +  + ++IL Y+   +    G  FSAEDADS   E    K+EGAFYVW+ ++++  + +
Sbjct: 381 LFERVAKEILHYVSTKLTHTLGGFFSAEDADSLNLE-INEKQEGAFYVWSYQDIKKAIQD 439

Query: 460 H--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI---ELNDSSASASKLGMPLEKY 514
                ++  H+ L   GN D     DPHNEFK KNV+     L +++A   K    +EK 
Sbjct: 440 KDDIEIYSFHHGLIENGNVD--PKDDPHNEFKDKNVITIVKSLKETAAYFKKTQEEIEKS 497

Query: 515 LNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
           LN   + + KLF  R + +P+P LDDK+IVSWNGL++SSF +A ++ K E          
Sbjct: 498 LN---QSKEKLFKFREQFKPKPQLDDKIIVSWNGLMVSSFCKAYQLFKDE---------- 544

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDE--------------QTHRLQHSFRNGPSKAPGFL 619
                 +Y+  A  +  FI+ HLYD                  RL  ++++GPSK   F 
Sbjct: 545 ------KYLNSAIKSIEFIKTHLYDSVGDDNDYDDEDDKLNNCRLIRNYKDGPSKIHAFT 598

Query: 620 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK 679
           DDY+FLI  LLDLY+     K L WA++LQ  QD LF D E GGY++T+G D S+L R+K
Sbjct: 599 DDYSFLIQALLDLYQVTFDYKHLEWAMKLQKQQDNLFYDLENGGYYSTSGLDKSILSRMK 658

Query: 680 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 739
           E+HDGAEPS  S+SV NL++L SI   + ++ Y++ A+ +L      L+   +  P M C
Sbjct: 659 EEHDGAEPSPQSISVSNLLKLYSI---TYNEAYKEKAKKTLENCSLYLEKAPLVFPQMVC 715

Query: 740 AADMLSVPSRKHVVLV----GHKSSVDFENMLAAAHASYDLNKTV 780
           +   L + S   ++L      ++      ++L   H++Y  NK +
Sbjct: 716 SL-YLYLNSINTIILSTNSNDNQQKQQLLSILDEIHSNYIPNKLI 759


>gi|391227735|ref|ZP_10263942.1| thioredoxin domain containing protein [Opitutaceae bacterium TAV1]
 gi|391223228|gb|EIQ01648.1| thioredoxin domain containing protein [Opitutaceae bacterium TAV1]
          Length = 734

 Score =  553 bits (1425), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 306/685 (44%), Positives = 400/685 (58%), Gaps = 37/685 (5%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL+A  SPYLLQHA NPV W  WGEEAFA AR    PIFLSIGYSTCHWCHVM  ESFE
Sbjct: 3   NRLSAARSPYLLQHARNPVHWQEWGEEAFARARAEQKPIFLSIGYSTCHWCHVMARESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +E VA +LN  FVSIKVDREERPDVDKVYM YVQA+ G GGWPLSV+L+PDLKP  GGTY
Sbjct: 63  NEAVAAVLNKHFVSIKVDREERPDVDKVYMAYVQAMTGHGGWPLSVWLAPDLKPFYGGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAW---DKKRDMLAQS--------GAFAIEQLSEALSASA 271
           FPPED+ GR G  ++L  +   W   D++R  +A+S        G +A +Q+        
Sbjct: 123 FPPEDRSGRSGLLSVLDVIARGWNDDDERRKFVAESSRVIDVLAGYYAGKQVR-----PD 177

Query: 272 SSNKLPD--ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 329
            +  +P   E   +A   C  QL +S+DS  GGFG APKFPR   +  +   +       
Sbjct: 178 PATPMPPLYETGGDAFERCYLQLGESFDSTHGGFGGAPKFPRASNLDFLFRVAAIQGPET 237

Query: 330 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 389
           ++G   E   M   TL+ M  GGIHDHVGGGFHRYSVD+ W VPHFEKMLYDQ Q+A   
Sbjct: 238 ETGR--EAVSMAASTLRHMIAGGIHDHVGGGFHRYSVDDAWFVPHFEKMLYDQAQIAVNL 295

Query: 390 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 449
           LDA   T D  Y++  R  LDY+ RD+  P G  FSAEDAD+A   GAT   EGAFYVWT
Sbjct: 296 LDAALFTGDERYAWAARATLDYVLRDLTHPDGGFFSAEDADAAPAHGATEHVEGAFYVWT 355

Query: 450 SKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 508
           + E+   L  + A L + H  + P    ++    DPH E +GKN+L ++   + +A+ LG
Sbjct: 356 AGELRRALSPDAARLVESHLGINPGPEGNVPPTLDPHGELRGKNILRQVRPLAETAAALG 415

Query: 509 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 568
           +        L      L  +R+ RPRPHLDDKVI +WNGL +S+FARA+    +      
Sbjct: 416 LEPAAAAERLAAALETLQAIRAARPRPHLDDKVITAWNGLALSAFARAATSPAA------ 469

Query: 569 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 628
                +   R  Y++ A  AA F+ R L D     L  ++R     + GF +DYA  I+G
Sbjct: 470 ----CLDDRRDRYLDAARRAARFVERELCDAGRGVLYRAWRGERGASEGFAEDYACFIAG 525

Query: 629 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 688
           LLDL++      WL  A  LQ T D  F D   GGYFN+   DP ++LR+KED+DGAEP+
Sbjct: 526 LLDLHDATFDAHWLRLAERLQQTMDARFRDEVAGGYFNSPAGDPHIVLRLKEDYDGAEPA 585

Query: 689 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD-MLSVP 747
            +S++  NL RL+S++     +     A  ++     +      A+P M CA + +L+ P
Sbjct: 586 PSSIAAANLQRLSSLL---HDETLHARAVDTVEALRGQWSQTPHALPAMLCALERILAEP 642

Query: 748 SRKHVVLVGHKSSVDFENMLAAAHA 772
            +  VV+ G  ++  F  ++A   A
Sbjct: 643 VQ--VVIAGDPAAPGFRALVAVVRA 665


>gi|449543699|gb|EMD34674.1| hypothetical protein CERSUDRAFT_86096 [Ceriporiopsis subvermispora
           B]
          Length = 737

 Score =  553 bits (1424), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 306/686 (44%), Positives = 416/686 (60%), Gaps = 35/686 (5%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           S  +  NRLA   SPYLLQHA NPVDW+ WG+EAF  A++ + PIFLS+GYS CHWCHV+
Sbjct: 9   SAERKQNRLADSKSPYLLQHAENPVDWYEWGQEAFDAAKRHNKPIFLSVGYSACHWCHVL 68

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
             ESFEDE  AK++N+ +V+IKVDREERPDVD++YMT++QA  GGGGWP+SV+L+P+L P
Sbjct: 69  AHESFEDEVTAKIMNEHYVNIKVDREERPDVDRLYMTFLQATTGGGGWPMSVWLTPELHP 128

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKL 276
              GTYFP      +  F+ +L K+ + W+      A+ G   IEQL  A S  A S  +
Sbjct: 129 FFAGTYFP------QGQFRQVLLKLAEVWNNDPARCAEVGKSVIEQLRNA-SNIAPSASI 181

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEA 334
           P  +   ++ +   +L K YDSR GGFG APKFP+P +    L  Y +  + DT    +A
Sbjct: 182 PS-ISAASISIY-RRLEKRYDSRHGGFGGAPKFPQPSQTTHFLARYAALNMRDTTTKKDA 239

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
            + + M + T+  +  GGI D VGGGF RYSVDERWHVPHFEKMLYD+GQL +  ++   
Sbjct: 240 EQARDMAVETMVKIYNGGIRDVVGGGFSRYSVDERWHVPHFEKMLYDEGQLLSSAIELSL 299

Query: 395 L-----TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 449
           L      +      +  DI+ Y+ RD+  P G  +SAEDADS  +  +T KKEGAFYVWT
Sbjct: 300 LLPCDAPERTTLQLMAADIVTYVARDLRSPEGGFYSAEDADSLPSSDSTVKKEGAFYVWT 359

Query: 450 SKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 509
           +K+++D+LG  A  FK H+ ++  GNCD S   D   E KG+NVL   +    +A K G 
Sbjct: 360 AKQLDDLLGAEAEAFKYHFGVEAKGNCDPSH--DIQGELKGQNVLYTAHTPEETAKKFGR 417

Query: 510 PLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 568
            +E+   +L     KL + R K RPRPHLDDK++  WNGL+IS  ++AS++L    E + 
Sbjct: 418 SIEETGQLLKGSLAKLKEYRDKERPRPHLDDKILTCWNGLMISGLSKASEVLDESFELS- 476

Query: 569 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 628
                     ++ +++AE +A+FIR+ LYDE T  L+ S+R GP    G  DDYAFLI G
Sbjct: 477 ----------EKALQLAEDSATFIRQRLYDESTGELRRSYREGPGPT-GQADDYAFLIQG 525

Query: 629 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 688
           LLDLYE     ++ +WAI LQ  QDELF D EGGGYF ++  DP +L+R+K+  DGAEPS
Sbjct: 526 LLDLYEASGKEEYALWAIRLQEKQDELFWDSEGGGYF-SSAPDPHILVRMKDPQDGAEPS 584

Query: 689 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPS 748
             SV+  NL RL S  A  +   Y++ A   L      L     A+  M   A +L+   
Sbjct: 585 AQSVAFWNLQRL-SHFAEDRHGAYQEKARGVLETDAQILGQAPYALAAMVSGA-LLAEKG 642

Query: 749 RKHVVLVGHKSSVDFENMLAAAHASY 774
            K  + V   S  +  + L A H+ +
Sbjct: 643 LKQFI-VTKPSYSEAASFLKAVHSRF 667


>gi|431890790|gb|ELK01669.1| Spermatogenesis-associated protein 20 [Pteropus alecto]
          Length = 777

 Score =  552 bits (1423), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 308/725 (42%), Positives = 427/725 (58%), Gaps = 69/725 (9%)

Query: 80  PYKVVAMAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDV 139
           P        RT  S S  + K +NRL  E SPYLLQHA+NPVDW+ WG+EAF +ARK + 
Sbjct: 44  PMPAGGKGSRTNCSQSMPQ-KVSNRLINEKSPYLLQHAYNPVDWYPWGQEAFDKARKENK 102

Query: 140 PIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALY 199
           PIFLS+GYSTCHWCH+ME ESF++E + +LLN+ FVS+KVDREERPDVDKVYMT+VQA  
Sbjct: 103 PIFLSVGYSTCHWCHMMEEESFQNEEIGRLLNEDFVSVKVDREERPDVDKVYMTFVQATS 162

Query: 200 GGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFA 259
            GGGWP++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W + ++ L ++    
Sbjct: 163 SGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRIGFRTVLLRIREQWKQNKNTLLENS--- 219

Query: 260 IEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQ 316
            ++++ AL A +  +    +LP +A  +   C +QL + YD  +            V + 
Sbjct: 220 -QRVTTALLARSEISTGDRQLPPSAATMNSRCFQQLDEGYDEEY------------VILN 266

Query: 317 MML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPH 374
            +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +WHVPH
Sbjct: 267 FLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHVPH 321

Query: 375 FEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET 434
           FEKMLYDQGQLA  Y  AF ++ D FYS + + IL Y+ R++    G  +SAEDADS   
Sbjct: 322 FEKMLYDQGQLAVAYSQAFQISGDEFYSDVAKGILQYVSRNLSHRSGGFYSAEDADSPPE 381

Query: 435 EGATRKKEGAFYVWTSKEVEDILGE----------HAILFKEHYYLKPTGNCDLSRMSDP 484
            G  R KEGAFYVWT KEV+ +L E             L  +HY L   GN  +S   DP
Sbjct: 382 RG-MRPKEGAFYVWTVKEVQQLLPESVHGATEPLTSGQLLMKHYGLTEAGN--ISPNQDP 438

Query: 485 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 544
             E +G+NVL        +A++ G+ +E    +L     KLF  R  RP+PHLD K++ +
Sbjct: 439 KGELQGQNVLTVRYSLELTAARFGLDVEAIRTLLNTGLEKLFQARKHRPKPHLDSKMLAA 498

Query: 545 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 604
           WNGL++S +A    +L  E    + N+             A + A F++RH++D  + RL
Sbjct: 499 WNGLMVSGYAITGAVLGME---RLVNY-------------ATNGAKFLKRHMFDVASGRL 542

Query: 605 QHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELF 656
             +   G       S  P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD+LF
Sbjct: 543 MRTCYAGSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASLESAWLEWALRLQDTQDKLF 602

Query: 657 LDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQN 715
            D  GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL     G K   + + 
Sbjct: 603 WDSRGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMEK 659

Query: 716 AEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYD 775
               L  F  R++ + +A+P M  A  +    + K +V+ G   + D + ++   H+ Y 
Sbjct: 660 CVCLLTAFSERMRRVPVALPEMVRAL-LAHQQTLKQIVICGDPQAKDTKALVQCVHSIYI 718

Query: 776 LNKTV 780
            NK +
Sbjct: 719 PNKVL 723


>gi|373850029|ref|ZP_09592830.1| hypothetical protein Opit5DRAFT_0884 [Opitutaceae bacterium TAV5]
 gi|372476194|gb|EHP36203.1| hypothetical protein Opit5DRAFT_0884 [Opitutaceae bacterium TAV5]
          Length = 734

 Score =  552 bits (1423), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 305/685 (44%), Positives = 401/685 (58%), Gaps = 37/685 (5%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL+A  SPYLLQHA NPV W  WGEEAFA AR    PIFLSIGYSTCHWCHVM  ESFE
Sbjct: 3   NRLSAARSPYLLQHARNPVHWQEWGEEAFARARAEQKPIFLSIGYSTCHWCHVMARESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +E VA +LN+ FVSIKVDREERPDVDKVYM YVQA+ G GGWPLSV+L+PDLKP  GGTY
Sbjct: 63  NEAVAAVLNEHFVSIKVDREERPDVDKVYMAYVQAMTGHGGWPLSVWLAPDLKPFYGGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWD---KKRDMLAQS--------GAFAIEQLSEALSASA 271
           FPPED+ GR G  ++L  +   W+   ++R  +A+S        G +A +Q+        
Sbjct: 123 FPPEDRSGRSGLLSVLDVIIQGWNDDGERRKFVAESSRVIDVLAGYYAGKQVR-----PD 177

Query: 272 SSNKLPD--ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 329
            +  +P   E   +A   C  QL +S+DS  GGFG APKFPR   +  +   +       
Sbjct: 178 PATPMPPLYETGGDAFERCYLQLGESFDSTHGGFGGAPKFPRASNLDFLFRVAAIQGPET 237

Query: 330 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 389
           ++G   E   M   TL+ M  GGIHDHVGGGFHRYSVD+ W VPHFEKMLYDQ Q+A   
Sbjct: 238 ETGR--EAVSMAASTLRHMIAGGIHDHVGGGFHRYSVDDAWFVPHFEKMLYDQAQIAVNL 295

Query: 390 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 449
           LDA   T D  Y++  R  LDY+ RD+  P G  FSAEDAD+A   GAT   EGAFYVWT
Sbjct: 296 LDAALFTGDERYAWAARATLDYVLRDLTHPDGGFFSAEDADAAPAHGATEHVEGAFYVWT 355

Query: 450 SKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 508
           + E+   L  + A L + H  + P    ++    DPH E +GKN+L ++   + +A+ LG
Sbjct: 356 ADELRRALSPDAARLVESHLGINPGSEGNVPPALDPHGELRGKNILRQVRPLAETAAALG 415

Query: 509 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 568
           +        L      L  +R+ RPRPHLDDKVI +WNGL +S+FARA+    +      
Sbjct: 416 LEPAAAAERLAAALETLQAIRTARPRPHLDDKVITAWNGLALSAFARAATSPAA------ 469

Query: 569 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 628
                +   R  Y++ A  AA F+ R L D     L  ++R     + GF +DYA  I+G
Sbjct: 470 ----CLDDRRDRYLDAARRAARFVERELCDAGRGVLYRAWRGERGASEGFAEDYACFIAG 525

Query: 629 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 688
           LLDL++      WL  A  LQ T D  F D   GGYFN+   DP ++LR+KED+DGAEP+
Sbjct: 526 LLDLHDATFDAHWLRLAERLQQTMDARFRDEIAGGYFNSPAGDPHIVLRLKEDYDGAEPA 585

Query: 689 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD-MLSVP 747
            +S++  NL RL+S++     +     A  ++     +      A+P M CA + +L+ P
Sbjct: 586 PSSIAASNLQRLSSLL---HDETLHARAVDTVEALRGQWSQTPHALPAMLCALERILAEP 642

Query: 748 SRKHVVLVGHKSSVDFENMLAAAHA 772
            +  VV+ G  ++  F  ++A   A
Sbjct: 643 VQ--VVIAGDPAAPGFRALVAVVRA 665


>gi|189346882|ref|YP_001943411.1| hypothetical protein Clim_1372 [Chlorobium limicola DSM 245]
 gi|189341029|gb|ACD90432.1| protein of unknown function DUF255 [Chlorobium limicola DSM 245]
          Length = 706

 Score =  551 bits (1421), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 305/687 (44%), Positives = 410/687 (59%), Gaps = 56/687 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           ++  N LA E SPYLLQHA NPVDW  WG EAF ++R+R+ PIFLS+GY+TCHWCHVME 
Sbjct: 5   SRQPNLLAKEKSPYLLQHAFNPVDWQPWGPEAFRKSRERNKPIFLSVGYATCHWCHVMER 64

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFE+E  A+LLN  F+ +KVDREE PD+D++YMTYVQA  G GGWP+SV+L+PDLKP  
Sbjct: 65  ESFENEETARLLNGSFIPVKVDREELPDLDRLYMTYVQASTGRGGWPMSVWLTPDLKPFY 124

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
           GG+YFPPED+YG PGF+T+L  +   W+     + ++     EQL    S+    + LP+
Sbjct: 125 GGSYFPPEDRYGMPGFRTVLTSIAQLWNTDPARITEASRIFFEQLQS--SSPMGKSGLPE 182

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
           +    A   C   L+ +YD   GGFG APKFPRP  +  +  H+     TG    AS   
Sbjct: 183 K--GEAQEACFRWLASAYDPLRGGFGGAPKFPRPALLTFLFSHAFH---TGNREAAS--- 234

Query: 339 KMVLFTLQCMAKGGIHDHV------GGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 392
            M L TL+ MA+GGIHDHV      GGGF RYS DERWH+PHFEKMLYD  QLA  YL+A
Sbjct: 235 -MALHTLKKMAEGGIHDHVHSMGKGGGGFARYSTDERWHLPHFEKMLYDNAQLAASYLEA 293

Query: 393 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 452
           F ++ +  ++ I  DI +Y+  DM  P G  +SAEDADS        K+EGAFYVW+ KE
Sbjct: 294 FQISGETLFARIAEDIFNYILHDMQSPEGGFYSAEDADSFPDGETQEKREGAFYVWSWKE 353

Query: 453 VEDILGE--HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 510
           V  +  E     LF   Y +KP GN       DPH EF GKNVL+E +            
Sbjct: 354 VMSLPAEPDKLELFARTYGMKPEGNVS----EDPHGEFGGKNVLMEQSAPEKHE------ 403

Query: 511 LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
            +  +  L E R+ L++ R +R RP LDDK+I SWNGL+IS+FA+  ++L  E       
Sbjct: 404 -KDTVAALDEVRQLLYEKRLQRSRPLLDDKIITSWNGLMISAFAKGYRVLGHE------- 455

Query: 571 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLL 630
                    EY+  A +AA FI  HLY+E   RL   +R+G +   G  +DYAF + GL+
Sbjct: 456 ---------EYLRAARNAADFILVHLYEENEGRLLRRYRDGDAAITGKAEDYAFFVRGLI 506

Query: 631 DLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGN 690
           DLY+     ++L  A  L  T + LF D   GGYF+T  +D +V +R+KE++DGAEP+ +
Sbjct: 507 DLYQACFDNRYLDAADRLCETCNRLFYDHADGGYFSTATDDNTVPVRLKEEYDGAEPAAS 566

Query: 691 SVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRK 750
           SV ++NL+ LA ++ G+++  Y   AE     F T L   + A+PLM  A +     +RK
Sbjct: 567 SVGILNLLDLA-VMTGNEA--YEGMAEACFRGFGTMLSHNSPALPLMLAALNN----ARK 619

Query: 751 H---VVLVGHKSSVDFENMLAAAHASY 774
                VL G+  S   + +L   ++ Y
Sbjct: 620 GGILAVLAGNMQSPRMQELLKTLNSRY 646


>gi|254445309|ref|ZP_05058785.1| conserved hypothetical protein [Verrucomicrobiae bacterium DG1235]
 gi|198259617|gb|EDY83925.1| conserved hypothetical protein [Verrucomicrobiae bacterium DG1235]
          Length = 715

 Score =  551 bits (1419), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 302/682 (44%), Positives = 416/682 (60%), Gaps = 36/682 (5%)

Query: 92  ASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCH 151
           A  S+S  K  N LA   SPYLLQH  NPVDW  WGEEAFAEAR+R VPIFLSIGYSTCH
Sbjct: 3   AEMSNSSGKKRNALAKSRSPYLLQHTSNPVDWREWGEEAFAEARERGVPIFLSIGYSTCH 62

Query: 152 WCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLS 211
           WCHVM  ESFEDEG+A  +ND FV++K+DREERPDVD++YM+YVQ+  G GGWP+SV+L+
Sbjct: 63  WCHVMAHESFEDEGIAGRMNDLFVNVKLDREERPDVDRIYMSYVQSTTGSGGWPMSVWLT 122

Query: 212 PDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASA 271
           PDLKP  GGTYFPPEDKYGR GF T++ ++   W  +R  L + G     + S+AL A +
Sbjct: 123 PDLKPFYGGTYFPPEDKYGRVGFLTLVERIGQLWRDERATLLEYG-----EKSQALLADS 177

Query: 272 SSNKLPDELPQ--NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 329
           +S  L D + +   A+ LC EQL   YD ++GGFG APKFP P   QM+      ++   
Sbjct: 178 ASRNLSDGIGEAAGAIDLCLEQLDTEYDEQWGGFGGAPKFPMPGYFQML------VDGIS 231

Query: 330 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 389
           + G A    +M+  +L+ MA GGI DHVG GFHRYSVD+ WHVPH+EKMLYDQGQLA +Y
Sbjct: 232 RRGNARL-TEMLAGSLEKMADGGIWDHVGSGFHRYSVDKYWHVPHYEKMLYDQGQLAGIY 290

Query: 390 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 449
            +A+ LT    ++ + + I+ Y+ RD+ G  GE+F+AEDADSA  + A++  EGAFYVW+
Sbjct: 291 AEAYRLTGRDSFAAVAKGIVRYVARDLQGAAGELFAAEDADSALPDDASKHGEGAFYVWS 350

Query: 450 SKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 509
             E++ +LGE A LF   Y +K  GN      SDPH E KG N L+ +        +  +
Sbjct: 351 KAELDGLLGEDAALFASAYDVKAGGNARPE--SDPHGELKGMNTLMRVASDGELGKRFSL 408

Query: 510 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 569
            +      LG C   LF+ R  RPRPHLDDK +VSWN L+IS    A K+ ++  ++   
Sbjct: 409 EVSAVRERLGACLGVLFEKRDGRPRPHLDDKALVSWNALMISG---ACKVYQACGDA--- 462

Query: 570 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 629
                     + +E+A+ AA F+   ++D    R    +R G  +  GF +DYA      
Sbjct: 463 ----------DALELAKKAAVFLFAEMWDAGEGRFARVYRGGCGEQGGFAEDYAAAAGAC 512

Query: 630 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 689
           LDLYE      W+  A E+       F D + GG+F T   D +VL+R+++D+DGAEP+ 
Sbjct: 513 LDLYEATFDAVWVERAREVLQQLKLRFWDEQRGGFFATEVGDANVLVRLRDDYDGAEPAA 572

Query: 690 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR 749
           +S++ + L+RLA+++   K    R     ++  F  + K    A+PLM  AA    + S 
Sbjct: 573 SSLAALALLRLAALLDDEK---LRVLGRETIEAFGEQWKRSPRAMPLMLVAASRF-LESD 628

Query: 750 KHVVLVGHKSSVDFENMLAAAH 771
           + +V+VG   + +   ++A A+
Sbjct: 629 QQIVVVGDLEAAETRELIACAN 650


>gi|452825593|gb|EME32589.1| hypothetical protein Gasu_03590 [Galdieria sulphuraria]
          Length = 822

 Score =  550 bits (1418), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 294/696 (42%), Positives = 413/696 (59%), Gaps = 51/696 (7%)

Query: 101 HTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVES 160
            TNRLA E SPYLLQHA+NPVDW+ W EEAF +A++ + PIFLS+GYSTCHWCHVME ES
Sbjct: 106 RTNRLANEKSPYLLQHANNPVDWYPWSEEAFGKAKEENKPIFLSVGYSTCHWCHVMEKES 165

Query: 161 FEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGG 220
           FE+E +A +LN +FVS+KVDREERPDVD VYMT+VQA  G GGWP+S+FL+PDL P +G 
Sbjct: 166 FENEQIASILNTYFVSVKVDREERPDVDGVYMTFVQATNGNGGWPMSIFLTPDLVPFVGT 225

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TY PP+       F + L+++ + W   ++ + Q G+  +  L + L A    + L    
Sbjct: 226 TYLPPDR------FASALQQIAEKWRTSKEAIEQEGSRVLNALQQYLDAPRKDDSL---- 275

Query: 281 PQNALRLCAEQ----LSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
             N    C EQ      + +D  +GGFG+APKFPRPV    +   +    D GK+  A +
Sbjct: 276 --NITTSCLEQGYMEAKEMFDEEYGGFGTAPKFPRPVVYDFLF--TLYWFDGGKTERAKD 331

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
              M L TL  MAKGGIHDH+GGGFHRYSVD+ WHVPHFEKMLYDQ QL   YLDA+ +T
Sbjct: 332 CLNMALQTLSNMAKGGIHDHLGGGFHRYSVDQYWHVPHFEKMLYDQSQLLQSYLDAYLIT 391

Query: 397 KDVFYSYICRDILDYLRRDMIGPG-GEIFSAEDADSAE-------TEGATRKKEGAFYVW 448
           KD  +     DIL Y+ RDM     G  FSAEDADS E       +  +  KKEGAFY W
Sbjct: 392 KDESFRDTAIDILSYVLRDMTDKNTGAFFSAEDADSLEPFSTDSSSINSETKKEGAFYTW 451

Query: 449 TSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 507
           T  E + ILG   + L  EH+ +KP GN      SDP  E  GKNVL      +  +  +
Sbjct: 452 TDFECKLILGPTTSKLISEHFDIKPEGNARPG--SDPFGELGGKNVLYIAKSLTEVSKSM 509

Query: 508 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 567
           G+   +    + E ++KL++ R++R RPHLDDK+I SWN ++I S  +A  +L+ E    
Sbjct: 510 GVSEAEANVAIQEAKQKLWEQRNRRARPHLDDKIITSWNAMMIYSLVKAYIVLEDE---- 565

Query: 568 MFNFPVVGSDRKEYMEVAESAASFIRRHLYD---EQTHRLQHSFRNGPSKAPGFLDDYAF 624
                       +Y++ A  AA+F++ ++ +   ++T  +  S+R G S   GF++DYA 
Sbjct: 566 ------------QYLQKAMDAATFLKSYMIETTSQETTLIYRSYREGRSDVEGFVEDYAH 613

Query: 625 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 684
            I   L ++E     +WL +AI+LQNTQD  F D   GGYF+T+ +  ++LLR K+D+DG
Sbjct: 614 TIRAFLSVFEATGNEEWLKYAIQLQNTQDATFYDEVNGGYFSTSSQAKNILLRRKDDYDG 673

Query: 685 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 744
           +EPS ++VS  NL RL +I   +K   Y +  + ++  F   +      VP M     +L
Sbjct: 674 SEPSPSAVSGWNLFRLGAITGDTK---YYEKFKSTINAFSIPVNKAPFGVPAMLINCCLL 730

Query: 745 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
              + + V++V +       +++ A  + ++ N+ +
Sbjct: 731 LKEATRVVLVVDNMKEPRTRDLVNAVVSRFEPNRVL 766


>gi|451946132|ref|YP_007466727.1| thioredoxin domain-containing protein [Desulfocapsa sulfexigens DSM
           10523]
 gi|451905480|gb|AGF77074.1| thioredoxin domain-containing protein [Desulfocapsa sulfexigens DSM
           10523]
          Length = 710

 Score =  546 bits (1406), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 296/678 (43%), Positives = 396/678 (58%), Gaps = 44/678 (6%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           +K TN L  E SPYLLQH +NPVDW+ W EEA + A   D PIFLSIGYSTCHWCHVM  
Sbjct: 13  SKQTNHLFHEKSPYLLQHVNNPVDWYPWSEEALSRAVSEDKPIFLSIGYSTCHWCHVMAH 72

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           +SFED+ +A  LN +F+ IKVDREERPDVD++YM   QA+ G GGWP+S+FL PD +P  
Sbjct: 73  QSFEDQEIADFLNSYFIPIKVDREERPDVDQIYMAATQAMTGSGGWPMSLFLFPDTRPFY 132

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFPP   YGRPGF  IL+ +K AW   R+ L+ S     EQ++  L    S  ++  
Sbjct: 133 AGTYFPPRADYGRPGFMEILQAIKTAWLTDRESLSLSA----EQVTSLLRKDTSDGRVS- 187

Query: 279 ELPQNA-LRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEG 337
             P+ A L     QL +SYD ++GGFG APKFPRPV I  +L + K    TG+       
Sbjct: 188 --PEKAWLDKGFSQLEESYDPKYGGFGQAPKFPRPVVIDFLLRYYKS---TGRKA----A 238

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
           + M L TL+ MA GG++D +GGGFHRYSVD RW VPHFEKMLYDQ QL   YL AF LT 
Sbjct: 239 RDMALVTLEQMAGGGMYDQIGGGFHRYSVDGRWRVPHFEKMLYDQSQLVFAYLSAFQLTG 298

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           D  Y  I  ++L+Y+ RDM  P G  +SAEDADS          EGAFY+WT +E++ +L
Sbjct: 299 DSAYKEIVVEVLEYVLRDMRHPEGGFYSAEDADSVNPYNLEEHGEGAFYLWTEEEIDTLL 358

Query: 458 GE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
            E  A L K +Y +K  GN     + DP  EF G+N+     + S  A ++G+  E+  +
Sbjct: 359 TEKQAALIKAYYGVKAKGNA----LHDPQKEFTGRNIFYRDKELSEVAREVGLSEEEARD 414

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
           IL + RR L   R  R  PHLDDK++ SWNGL+IS+FARA+ +L                
Sbjct: 415 ILQDARRSLLSHRQDRTAPHLDDKILTSWNGLMISAFARAAMVLGE-------------- 460

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
             K Y+  A  A  F+   L  +    L   +R+G ++    LDDY+FL+ GLLDLY   
Sbjct: 461 --KRYLAAANQATDFLLDRLTVD--GELVRRWRDGDARYAAGLDDYSFLVQGLLDLYLAS 516

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
             +  L  A++L      +F D +GG  F  T +   +L R++  +DGAEPSGNSV+V+N
Sbjct: 517 HDSIRLQAAVDLTEKMIRIFADEKGG--FYDTPQSTQLLTRMRAAYDGAEPSGNSVAVMN 574

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVG 756
           L+RLA +   ++   +   A  S+  F   L     A+P+M  A D   +   + +V+ G
Sbjct: 575 LLRLAGLTGNNE---WVALATESIESFGKTLSTYPPAMPMMLSAMD-FQMDKPRQIVIAG 630

Query: 757 HKSSVDFENMLAAAHASY 774
              + D   +L+  H+ Y
Sbjct: 631 TLEADDTRELLSEVHSRY 648


>gi|301620517|ref|XP_002939623.1| PREDICTED: spermatogenesis-associated protein 20-like [Xenopus
           (Silurana) tropicalis]
          Length = 775

 Score =  545 bits (1405), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 301/718 (41%), Positives = 412/718 (57%), Gaps = 81/718 (11%)

Query: 81  YKVVAMAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVP 140
           ++V  MA    + ++ +     NRL  E S YL QHA NPVDW                 
Sbjct: 82  FEVCKMA----SGSTQTPTGRVNRLINEKSLYLQQHARNPVDW----------------- 120

Query: 141 IFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG 200
               +GYSTCHWCHVME ESFEDE + ++LN+ F+ +KVDREERPDVDKVYMT++QA   
Sbjct: 121 ----VGYSTCHWCHVMERESFEDEEIGRILNENFICVKVDREERPDVDKVYMTFLQATDS 176

Query: 201 GGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAI 260
           GGGWP+SV+L+PDL+P +GGTYFPPED   R  F+T+L ++ + W + R       AF  
Sbjct: 177 GGGWPMSVWLTPDLRPFVGGTYFPPEDGVRRVSFRTVLLRIVEQWKENR-------AFLC 229

Query: 261 EQLSEALSASASSNKL------PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVE 314
           E+    LS   SS+ +      P  LP    +LC +QL + +D  +GGFG  PKFP PV 
Sbjct: 230 ERSERILSVLQSSSDIDGAAEPPPSLPVQ--KLCFQQLERIFDEEYGGFGEFPKFPTPVN 287

Query: 315 IQMM--LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHV 372
              +  L+   K      S E ++   M + TL+ M  GGIHDH+G GFHRYS D+ WHV
Sbjct: 288 FSFLFCLWALSK-----GSPEGTQALHMAVHTLKWMMYGGIHDHIGKGFHRYSTDQTWHV 342

Query: 373 PHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSA 432
           PHFEKMLYDQGQLA  Y +AF ++    +S    DIL Y+ +++    G  +SAEDADS 
Sbjct: 343 PHFEKMLYDQGQLAVAYAEAFQISGKEIFSDAAHDILQYVLQNLSDDAGGFYSAEDADSL 402

Query: 433 ETEGATRKKEGAFYVWTSKEVEDILGE--------HAILFKEHYYLKPTGNCDLSRMSDP 484
               +  KKEGAF  WT+KE++ +L +           +F  HY +K  GN   S+  D 
Sbjct: 403 PNAQSKEKKEGAFATWTAKEIQQLLPDMEEANGNTFGDIFMHHYGMKEEGNVSASQ--DI 460

Query: 485 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 544
           H E +G+NVLI  +    +A+K G+ + +   IL  CR +L+  R  RP P  D  ++ S
Sbjct: 461 HGELQGQNVLIVRSSLELTAAKFGLDVARVQTILSMCRDRLYKARRLRPPPQRDTNILAS 520

Query: 545 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 604
           WNGL++S  AR   IL+ E                EY+E A+ AASF+  ++YD ++  L
Sbjct: 521 WNGLMLSGLARCGVILRDE----------------EYIERAKLAASFLHENMYDLKSGIL 564

Query: 605 QHSFRNG----PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDRE 660
             SF  G        PGFLDDYAF++ GLLDLYE      +L WA++LQ+ QD+LF D +
Sbjct: 565 LRSFYKGHQPIADLVPGFLDDYAFMVRGLLDLYEACLDQFYLEWALQLQDRQDQLFWDAK 624

Query: 661 GGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 720
           G GYF +   D S+LLR+K+D DGAEPSGNSVSV+NL+RLA     ++   + + +   L
Sbjct: 625 GSGYFCSDASDSSILLRLKDDQDGAEPSGNSVSVVNLLRLACYTGRTE---FTERSGQIL 681

Query: 721 AVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK 778
           A F  RL  +  ++P M    +M+   + K VV+ G K   +   +L AA + Y  NK
Sbjct: 682 AAFSERLLKVPASLPEM-VRGNMIYHQTVKQVVVCGDKEDPNTRELLEAAQSMYVPNK 738


>gi|395826687|ref|XP_003786547.1| PREDICTED: spermatogenesis-associated protein 20 [Otolemur
           garnettii]
          Length = 752

 Score =  541 bits (1395), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 296/707 (41%), Positives = 414/707 (58%), Gaps = 60/707 (8%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K     A    P LL  A   +  + WG+EAF +ARK + PIFLS+GYSTCHWCH+ME E
Sbjct: 26  KQLGSQAPPQPPGLLSDAPLALHRYPWGQEAFDKARKENKPIFLSVGYSTCHWCHMMEEE 85

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SF++E + +LL++ F+S+KVDREERPDVDKVYMT+VQA   GGGWP++V+L+P+L+P +G
Sbjct: 86  SFQNEEIGRLLSEDFISVKVDREERPDVDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVG 145

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           GTYFPPED   R GF+T+L +++D W + ++ L ++     ++++ AL A +  +    +
Sbjct: 146 GTYFPPEDGLTRVGFRTVLLRIRDQWKQNKNTLLENS----QRVTTALLARSEISMGDRQ 201

Query: 280 LPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYH--SKKLEDTGKSGEA 334
           LP +A  +   C +QL + YD  +GGF  APKFP PV +  + ++  + +L   G     
Sbjct: 202 LPPSAATMNSRCFQQLDEGYDEEYGGFAEAPKFPTPVILNFLFFYWLNHRLTQDG----- 256

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
           S  Q+M L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLYDQ QLA  Y  AF 
Sbjct: 257 SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHVPHFEKMLYDQAQLAVAYSHAFQ 316

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
           ++ D F+S + + IL Y+ R +    G  + AEDADS    G  R KEGAFYVWT KEV+
Sbjct: 317 ISGDEFFSDVAKGILQYVSRSLTHRFGGFYCAEDADSPPERG-MRPKEGAFYVWTVKEVQ 375

Query: 455 DILGE----------HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 504
            +L E             L  +HY L   GN  LS+  DP  E +G+NVL        +A
Sbjct: 376 HLLPEPIPGATEPLTSGQLLMKHYGLTEAGNIGLSQ--DPKGELQGQNVLTVRYSLELTA 433

Query: 505 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 564
           ++ G+ +E    +L     KLF  R  RP+PHLD+K++ +WNGL++S +A    +L  E 
Sbjct: 434 ARFGLDVEAVRTLLNTGLEKLFQARKHRPKPHLDNKMLAAWNGLMVSGYAVTGAVLGIE- 492

Query: 565 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP------SKAP-- 616
                          + +  A S A F++RH++D  T RL  +   G       S  P  
Sbjct: 493 ---------------KLINCATSGAKFLKRHMFDVATGRLMRTCYTGSGGTVEHSNPPCW 537

Query: 617 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL- 675
           GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD LF D +GGGYF +  E  + L 
Sbjct: 538 GFLEDYAFVVRGLLDLYEASQESAWLEWALRLQDTQDRLFWDCQGGGYFCSEAELGAGLP 597

Query: 676 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 735
           LR+K+D DGAEPS NSVS  NL+RL           +       L  F  R++ + +A+P
Sbjct: 598 LRLKDDQDGAEPSANSVSAHNLLRLHGFTGHRD---WMDKCVCLLTAFSERMRRVPVALP 654

Query: 736 LMCCAADMLSVPSR--KHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
            M      LS   +  K +V+ G + + D + ++   H+ Y  NK +
Sbjct: 655 EM---VRTLSAHQQTLKQIVICGDRQAKDTKALVQCVHSMYIPNKVL 698


>gi|170067981|ref|XP_001868692.1| spermatogenesis-associated protein 20 [Culex quinquefasciatus]
 gi|167863990|gb|EDS27373.1| spermatogenesis-associated protein 20 [Culex quinquefasciatus]
          Length = 763

 Score =  538 bits (1386), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 305/707 (43%), Positives = 407/707 (57%), Gaps = 64/707 (9%)

Query: 91  PASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTC 150
           P+ TS +  KHTNRL  E SPYLLQHAHNPVDW+ WGEEA A AR  +  IFLS+GYSTC
Sbjct: 19  PSGTS-TPPKHTNRLINEKSPYLLQHAHNPVDWYPWGEEAIARARAENKLIFLSVGYSTC 77

Query: 151 HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 210
           HWCHVME ESFE E VA+++N+ FV++KVDREERPD+DK+YMT++  + G GGWP+SV+L
Sbjct: 78  HWCHVMEKESFESEEVAEIMNENFVNVKVDREERPDIDKLYMTFILLINGSGGWPMSVWL 137

Query: 211 SPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS 270
           +PDL P+ GGTYFPP+D++G PGF TIL K+K  W    + L ++G   I+ + + +   
Sbjct: 138 TPDLAPITGGTYFPPKDRWGMPGFTTILLKLKIKWATDGEDLKETGRSIIQAIQKNVE-- 195

Query: 271 ASSNKLPDELP---QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 327
              +K   ELP   +   R       +++D  +GG    PKFP   ++  +++H   L+ 
Sbjct: 196 -EKHKEEPELPLTVEEKFRQAIMIYRRNFDPVWGGSMGEPKFPEVSKLN-LIFHLHLLD- 252

Query: 328 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 387
                 AS+   +VL TL  MA GGIHDHV GGF RYSVD++WHVPHFEKMLYDQGQL  
Sbjct: 253 -----PASKLLGVVLNTLDKMAAGGIHDHVFGGFARYSVDKKWHVPHFEKMLYDQGQLLM 307

Query: 388 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 447
            Y + +  T+   Y  +   I  YL +D+  P G  +S EDADS     +  K EGAFY 
Sbjct: 308 AYANGYKATRKPLYLEVADSIFKYLCKDLRHPAGGFYSGEDADSLPAWDSKDKIEGAFYA 367

Query: 448 WTSKEVEDILGEH------------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 495
           WT  E++D+   +              +F EHY ++PTGN + S  SDPH    GKN+LI
Sbjct: 368 WTFSEIKDLFNANLEKFGDLGKLNPVEVFTEHYDVQPTGNVEPS--SDPHGHLLGKNILI 425

Query: 496 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 555
                  +A KL    E    IL      L +VR KRPRPHLD K+I +WNGL++S  A 
Sbjct: 426 VYGSLRETALKLDTSEEVVAKILKVGNELLHEVRDKRPRPHLDTKIICAWNGLILSGLAE 485

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPS-- 613
            S++  +              +R EY+EVA    +FIR +L+D +  +L  SF    S  
Sbjct: 486 LSRVKDA-------------PNRAEYLEVAAKLVAFIRENLFDAKAGKLLRSFYGDDSDK 532

Query: 614 ----KAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 667
               + P  GF+DDYAFLI GL+D Y     T  L WA ELQ  QD LF D   G YF +
Sbjct: 533 AKSLEVPIYGFIDDYAFLIKGLIDYYRASLDTSALRWARELQEIQDRLFWDDTSGAYFYS 592

Query: 668 TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS----LAVF 723
                +V++R+KEDHDGAEP GNSV+  NL+ L         DY+ + A H     L  +
Sbjct: 593 EANSANVVVRLKEDHDGAEPCGNSVAAHNLLLLG--------DYFAEGAFHERARKLLDY 644

Query: 724 ETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAA 770
            + +      +P M  AA ++    R  ++++G K   D  N L  A
Sbjct: 645 FSNVAPFGYVLPKMMSAA-LMEEHGRDMLIVIGPKG--DQTNALVDA 688


>gi|158296880|ref|XP_317217.4| AGAP008252-PA [Anopheles gambiae str. PEST]
 gi|157014924|gb|EAA12337.5| AGAP008252-PA [Anopheles gambiae str. PEST]
          Length = 813

 Score =  535 bits (1379), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 300/694 (43%), Positives = 396/694 (57%), Gaps = 57/694 (8%)

Query: 92  ASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCH 151
           A+++ +  K TNRL  E SPYLLQHAHNPV+W+ WGEEA   AR  +  IFLS+GYSTCH
Sbjct: 66  ANSNGTEPKFTNRLKQEKSPYLLQHAHNPVEWYPWGEEAIQRARAENKLIFLSVGYSTCH 125

Query: 152 WCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLS 211
           WCHVME ESFE+E VAK++N+ F++IKVDREERPD+DK+YM ++  + G GGWP+SV+L+
Sbjct: 126 WCHVMEKESFENEEVAKIMNEHFINIKVDREERPDIDKLYMMFILLINGSGGWPMSVWLT 185

Query: 212 PDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS--- 268
           PDL P+ GGTYFPP D++G PGF T+L K+   W   +D L  +G   IE +   +    
Sbjct: 186 PDLAPVTGGTYFPPNDRWGMPGFTTVLTKLASKWSTDKDDLVTTGRSVIEAIRRNVDHKR 245

Query: 269 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 328
           A    +    E  +   +       ++YD  +GG   APKFP   ++ +M +H    E  
Sbjct: 246 ADEVEDATNMETLEAKFKQAVNMYQRNYDMVWGGSLGAPKFPEASKLNLM-FHLHVQEPK 304

Query: 329 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 388
            K         +VL TL  MA GGIHDHV GGF RYSVD++WHVPHFEKMLYDQGQL ++
Sbjct: 305 HKV------LGVVLNTLDKMAAGGIHDHVFGGFARYSVDKKWHVPHFEKMLYDQGQLLSL 358

Query: 389 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 448
           Y + + LTK   Y  +   I  YL +D+  P G  +S EDADS  T  +  K EGAFY W
Sbjct: 359 YANGYRLTKKPSYLAVADAIYRYLCKDLRHPAGGFYSGEDADSLPTAESEEKIEGAFYAW 418

Query: 449 TSKEVEDILGEHAILFKE------------HYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 496
           T  EV+++LG +   F E            HY +K  GN   S  SDPH    GKN+LI 
Sbjct: 419 TYDEVKELLGANGEKFGELGGVDPVAVYAAHYDVKEEGNVKPS--SDPHGHLLGKNILIV 476

Query: 497 LNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARA 556
                 +A K    +E    IL      L +VR KRPRPHLD K++ +WNGLV+S  ++ 
Sbjct: 477 YGSVRETAEKFNTTVEIVERILKTGNELLHEVRDKRPRPHLDTKILCAWNGLVLSGLSQL 536

Query: 557 SKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG----- 611
           + +  +               R EY+  AE    FIR +LYD Q  +L  S   G     
Sbjct: 537 ACVKDAPG-------------RSEYLATAEELVKFIRANLYDVQARKLLRSCYGGAEESL 583

Query: 612 PSKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTG 669
            S+ P  GF+DDYAFLI GL+D Y        L WA ELQ+ QDELF D + G YF +  
Sbjct: 584 ASERPIYGFIDDYAFLIKGLIDYYVASLDEHALHWAKELQDIQDELFWDTKHGAYFYSEA 643

Query: 670 EDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQN--AEHSLAVFE--T 725
             P+V +R+KEDHDGAEP GNSV+  NL+ L        SDY+ +    E +  +F+   
Sbjct: 644 NSPNVAVRLKEDHDGAEPCGNSVAAHNLLLL--------SDYFEEERLKEKARTLFDYFA 695

Query: 726 RLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKS 759
                   +P M  AA +L    R  +++VG +S
Sbjct: 696 HTAHFGYVLPEMMSAA-LLEEQGRNTLIVVGPES 728


>gi|290982332|ref|XP_002673884.1| predicted protein [Naegleria gruberi]
 gi|284087471|gb|EFC41140.1| predicted protein [Naegleria gruberi]
          Length = 600

 Score =  532 bits (1370), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 279/609 (45%), Positives = 374/609 (61%), Gaps = 49/609 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           +K+TNRLA E SPYLLQHAHNPVDW+ WGEEAF +AR  + PIFLSIGYSTCHWCHVME 
Sbjct: 10  HKYTNRLAKEASPYLLQHAHNPVDWYPWGEEAFEKARNENKPIFLSIGYSTCHWCHVMEK 69

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFE+E +A ++N  FV+IKVDREERPD+D+VYMT+VQ   G GGWPLS FL+P LKP+ 
Sbjct: 70  ESFENEEIAAIMNQNFVNIKVDREERPDIDRVYMTFVQLTTGSGGWPLSCFLTPQLKPIF 129

Query: 219 GGTYFPPEDKY--GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKL 276
           GGTYFPP++    G   F ++L K+ + W  KR+ L   G   +  L +A +   +  + 
Sbjct: 130 GGTYFPPKESIYRGNISFPSLLNKIHNMWTNKREALVSQGDKIVSVLKKAFTEKENEEE- 188

Query: 277 PDELPQNALRLCAEQLS-------KSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 329
           P +   + L+   E ++        S+D+ +GGF  APKFPRPV I  +L    + +D  
Sbjct: 189 PAKSADHILKFAHEYVASTVEDFLSSFDTVYGGFSQAPKFPRPVVIDFLLRSYYEEKDDR 248

Query: 330 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 389
           +  +       V FTL  MA+GG++DH+GGGFHRYSVD  WHVPHFEKM+YDQGQLA V+
Sbjct: 249 RKLDIINS---VTFTLDKMARGGLYDHLGGGFHRYSVDTYWHVPHFEKMMYDQGQLAIVF 305

Query: 390 LDAFSLTKDVFYSYICRDILDYLRRDM-IGPGGEI---FSAEDADSAETEGATRKKEGAF 445
            +A+  T++ +Y  I  +IL Y+ RDM +G   ++   FSAEDADS  T  +  K+EGAF
Sbjct: 306 AEAYKATRNEYYKQILEEILLYIERDMSLGESSDMIGFFSAEDADSLPTFDSKEKREGAF 365

Query: 446 YVWTSKEVEDILG---------EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 496
           Y W  ++V DI+          + + +F   + LK  GN   S  SDPH E  G NVL  
Sbjct: 366 YAWDYQQVVDIIDNMVPHIGSVKPSDIFSFMFDLKQDGNVRQS--SDPHGELTGLNVLYM 423

Query: 497 LNDSSASASKLG-MPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFA 554
                 +  +   +P E   N++ +C+  LF  R+K +PRPHLDDK+I +WN  VIS+F+
Sbjct: 424 DKSLKETQDRFSTIPPESVANVIMDCKDILFKERNKMKPRPHLDDKIITAWNAYVISAFS 483

Query: 555 RASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSK 614
           R++ +L                    Y+++AE AA+FI   LYD +T  L   F+    K
Sbjct: 484 RSALLLSEPG----------------YLKIAERAANFIYEKLYDRETKVLHRIFKKNSEK 527

Query: 615 ---APGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGED 671
                GFL DYA +IS L+DLYE     KWL WA ELQ+ QD  F D+  GGYF   G D
Sbjct: 528 ERNIAGFLSDYANMISALIDLYEASGSIKWLNWAFELQDIQDSYFYDQTNGGYFEERGND 587

Query: 672 PSVLLRVKE 680
           P+++ R+KE
Sbjct: 588 PTIIYRLKE 596


>gi|390463544|ref|XP_002748471.2| PREDICTED: spermatogenesis-associated protein 20 [Callithrix
           jacchus]
          Length = 783

 Score =  531 bits (1369), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 297/719 (41%), Positives = 413/719 (57%), Gaps = 80/719 (11%)

Query: 91  PASTSHSRNKHT-----NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSI 145
           PA    SR   T     NRL  E SPYLLQHA+NPVDW+ WG+EAF +ARK + PIFLS+
Sbjct: 62  PAGGKGSRPSSTPQRVPNRLIHEKSPYLLQHAYNPVDWYPWGQEAFDKARKENKPIFLSV 121

Query: 146 GYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWP 205
           GYSTCHWCH+ME ESF++E + +LL++                    T+V A   GGGWP
Sbjct: 122 GYSTCHWCHMMEEESFQNEEIGRLLSE-------------------GTFVSATSSGGGWP 162

Query: 206 LSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 265
           ++V+L+P+L+P +GGTYFPPED   R GF+T+L ++++ W + ++ L ++     ++++ 
Sbjct: 163 MNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIREQWKQNKNALLENS----QRVTT 218

Query: 266 ALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--Y 320
           AL A +  +    +LP +A  +   C +QL + YD  +GGF  APKFP PV +  +   +
Sbjct: 219 ALLARSEISVGDRQLPPSAATVNSRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYW 278

Query: 321 HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 380
            S +L   G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLY
Sbjct: 279 LSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHVPHFEKMLY 333

Query: 381 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 440
           DQ QLA  Y  AF ++ D FYS + +DIL Y+ R +    G  +SAEDADS    G  R 
Sbjct: 334 DQAQLAVAYSQAFQISGDEFYSDVAKDILQYVTRSLSHRSGGFYSAEDADSPPERG-MRP 392

Query: 441 KEGAFYVWTSKEVEDILGEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKG 490
           KEGA+YVWT KEV+ +L E  +          LF +HY L   GN  +S   DP  E +G
Sbjct: 393 KEGAYYVWTVKEVQQLLPEPVLGATELLTSGQLFTKHYGLTEAGN--ISPSQDPKGELQG 450

Query: 491 KNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVI 550
           +NVL        +A++ G+ +E    +L     KLF  R  RP+PHLD K++ +WNGL++
Sbjct: 451 QNVLTVRYSLELTAARFGLGVEAVRTLLNTGLEKLFQARKHRPKPHLDSKMLAAWNGLMV 510

Query: 551 SSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRN 610
           S +A    +L              G DR   +  A + A F++RH++D  + RL  +   
Sbjct: 511 SGYAVTGAVL--------------GQDR--LINYATNGAKFLKRHMFDVASGRLMRTCYT 554

Query: 611 GP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG 662
           G       S  P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+TQD LF D +GG
Sbjct: 555 GSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLEWALRLQDTQDRLFWDSQGG 614

Query: 663 GYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 721
           GYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL     G K   +       L 
Sbjct: 615 GYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDKCVCLLT 671

Query: 722 VFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
            F  R++ + +A+P M  A       + K +V+ G + + D + ++   H+ Y  NK +
Sbjct: 672 AFSERMRRVPVALPEMVRALSA-QQQTLKQIVICGDRQAKDTKALVQCVHSVYIPNKVL 729


>gi|403182450|gb|EAT47160.2| AAEL001725-PA [Aedes aegypti]
          Length = 749

 Score =  531 bits (1369), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 291/679 (42%), Positives = 387/679 (56%), Gaps = 47/679 (6%)

Query: 98  RNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVME 157
           + KHTNRL  E SPYLLQHAHNPVDW+ WGEEA A A+  +  IFLS+GYSTCHWCHVME
Sbjct: 11  KPKHTNRLINEKSPYLLQHAHNPVDWYPWGEEAIARAKAENKLIFLSVGYSTCHWCHVME 70

Query: 158 VESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPL 217
            ESFE+E VA ++N+ F++IKVDREERPD+DK+YMT++  + G GGWP+SV+L+PDL P+
Sbjct: 71  KESFENEQVADIMNENFINIKVDREERPDIDKLYMTFILLINGSGGWPMSVWLTPDLAPV 130

Query: 218 MGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLP 277
            GGTYFPP+D++G PGF TIL K+K+ W    + LA +G   I+ +   +          
Sbjct: 131 TGGTYFPPKDRWGMPGFTTILLKLKNKWITDGEDLASTGKSIIDAIQRNVEEKHQEEAER 190

Query: 278 DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEG 337
              P+   R       +++D  +GG   APKFP   ++ ++ +   +   T   G     
Sbjct: 191 VFTPEEKYRQAVTIYKRNFDPVWGGSLGAPKFPEVSKLNLIFHAHLQDPSTKILG----- 245

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
             +VL TL+ MA GGI+DHV GGF RYSVD++WHVPHFEKMLYDQGQL   Y + +  T+
Sbjct: 246 --VVLNTLEKMAAGGIYDHVFGGFARYSVDKKWHVPHFEKMLYDQGQLLMAYANGYKTTR 303

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
              Y  +   I  Y+ +D+  P G  +S EDADS  T  +T K EGAFY WT  EV D+L
Sbjct: 304 KPLYLEVADSIYRYISKDLQHPAGGFYSGEDADSLPTWESTDKIEGAFYAWTFAEVRDLL 363

Query: 458 GEH------------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 505
             +              +F EHY ++ TGN + S  SDPH    GKN+ I       +A 
Sbjct: 364 KANLDKFGDIGKVDPVEVFTEHYDIQETGNVEPS--SDPHGHLLGKNIPIVYGSVRETAD 421

Query: 506 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 565
           K     E    IL      L +VR KRPRPHLD K+I +WNGL++S  ++ S I  +   
Sbjct: 422 KFETTAEVVGKILKVGNELLHEVRDKRPRPHLDTKIICAWNGLILSGLSQLSCIKDA--- 478

Query: 566 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP--------G 617
                      +R  Y++      SFIR +LYD Q  +L  S     S           G
Sbjct: 479 ----------PNRDNYLKSCSKLVSFIRENLYDVQARKLLRSCYGDESDQAKSLETPIYG 528

Query: 618 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 677
           F+DDYAFLI GL+D Y     T  L WA ELQ  QDELF D + G YF +     +V++R
Sbjct: 529 FIDDYAFLIKGLIDYYRASLDTGALSWAKELQEIQDELFWDHKHGAYFYSEANSANVVVR 588

Query: 678 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 737
           +KEDHDGAEP GNSVS  NL+ L       ++  +R+ A    + F + +      +P M
Sbjct: 589 LKEDHDGAEPCGNSVSAHNLIMLGDYF---ETAAFREKANKLFSYF-SNVTPFGYVLPEM 644

Query: 738 CCAADMLSVPSRKHVVLVG 756
             A  +L    R  +V+VG
Sbjct: 645 MSAM-LLQENGRDMLVVVG 662


>gi|405953510|gb|EKC21160.1| Spermatogenesis-associated protein 20 [Crassostrea gigas]
          Length = 682

 Score =  531 bits (1368), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 297/698 (42%), Positives = 400/698 (57%), Gaps = 93/698 (13%)

Query: 94  TSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWC 153
           TS S N+  NRL+ E SPYLLQHA NPVDW+ WG+EAF +++  +  IFLS+GYSTCHWC
Sbjct: 7   TSKS-NEKRNRLSKELSPYLLQHASNPVDWYPWGQEAFDKSKVENKLIFLSVGYSTCHWC 65

Query: 154 HVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPD 213
           HVME ESFE+E + ++LN+ FVSIKVDREERPDVD+VYMT++QA  GGGGWP+SV+L+P+
Sbjct: 66  HVMERESFENEEIGRILNENFVSIKVDREERPDVDRVYMTFIQATVGGGGWPMSVWLTPE 125

Query: 214 LKPLMGGTYFPPEDK-YGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS-A 271
           LKPL GGTYFPP+D+ YGRPGFKT+L  + + W  K  +L +  +  +  L E  SAS A
Sbjct: 126 LKPLFGGTYFPPDDRYYGRPGFKTVLTSLAEQWKTKGPVLKEQSSVILRTLQEGTSASEA 185

Query: 272 SSNKLPDELPQNALRLCAE----QLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 327
               LPD      L+ C E    QL +S+D   GGF   PKFP+PV    +     K +D
Sbjct: 186 QGQSLPD------LKDCTEKLYYQLERSFDQEDGGFSKEPKFPQPVNFNFLFRLYAKYKD 239

Query: 328 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 387
           +  S  A+   +M  FTL  MAKGGI DH+                              
Sbjct: 240 SF-SDMANSSLEMATFTLNKMAKGGIFDHIS----------------------------- 269

Query: 388 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 447
                  +TK   ++ + RDI +Y  RD++ P G  +SAEDADS  T  +  KKEGAF V
Sbjct: 270 ------KITKQDNFAEVVRDIAEYTMRDLLNPCGGFYSAEDADSLPTAESPEKKEGAFCV 323

Query: 448 WTSKEVEDILGEH-------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 500
           WT ++++DIL E        A +F  H+ +K  GN D   M DPH+E   +NVLI  +  
Sbjct: 324 WTYQQIQDILKEKVKDNLSLAQIFCYHFNIKEKGNVD--PMQDPHDELLNQNVLIVKDSV 381

Query: 501 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 560
             +A K  +   +  ++L +CR  L+  R  RPRPHLDDK++ +WNGL+IS  ++A + L
Sbjct: 382 EETAQKFSLNPVEVKDVLEKCRTLLYKERQNRPRPHLDDKIVAAWNGLMISGLSKAGQAL 441

Query: 561 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 620
               ES              +++ A   ASF++ H+                S   GF+D
Sbjct: 442 ---GESL-------------FVDQAVKTASFLQSHM---------------SSPIEGFVD 470

Query: 621 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 680
           DYA++I GLLDLYE     +W+ WA ELQ  Q+ LF D EGG YF+ +G D S++LR+K+
Sbjct: 471 DYAYVIRGLLDLYEVCQDEQWVQWAEELQERQNGLFWDSEGGAYFSNSGRDASIVLRLKD 530

Query: 681 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 740
           D DGAEP  NSVSV NLVRL +++       Y + A   L VF  RL  + +A+P M C 
Sbjct: 531 DQDGAEPCPNSVSVSNLVRLGALLNNQD---YTEKAVTILKVFYERLTKIPIAIPEMVCG 587

Query: 741 ADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK 778
             +L   + K +VLVG  +S D   +       Y  NK
Sbjct: 588 LILLQ-DTPKQIVLVGDPNSDDLTALKNCVAKHYLPNK 624


>gi|330805805|ref|XP_003290868.1| hypothetical protein DICPUDRAFT_155404 [Dictyostelium purpureum]
 gi|325078993|gb|EGC32616.1| hypothetical protein DICPUDRAFT_155404 [Dictyostelium purpureum]
          Length = 740

 Score =  530 bits (1366), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 279/654 (42%), Positives = 399/654 (61%), Gaps = 34/654 (5%)

Query: 92  ASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCH 151
            +T++   K+TN+L  E SPYL++HAH+PV+W+ W +EAF  A+K+D  IFLS+GY  CH
Sbjct: 6   TTTTNKEYKYTNKLINEKSPYLIKHAHDPVNWYPWCDEAFELAKKQDKLIFLSVGYMACH 65

Query: 152 WCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLS 211
           WC VM  E FE+  ++K++ND F++IKVDREERPD+DK+YMT++    GGGGWP+S++L+
Sbjct: 66  WCSVMHKECFENPSISKVMNDLFINIKVDREERPDIDKLYMTFLTETTGGGGWPMSIWLT 125

Query: 212 PDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASA 271
           P L+P+  GTYF PE K+GR  F  + +K+ + W   R+ + + G   IE L E      
Sbjct: 126 PSLQPISAGTYFAPEPKFGRAAFPELCKKLNEIWKNDRETVIERGNSFIEYLKEDKPKGN 185

Query: 272 SSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKS 331
             N L +E     +  C EQ+ K YD   GGF  APKFPR      +L  S   ++  KS
Sbjct: 186 LDNALSEE----TVSKCIEQILKGYDPDDGGFTDAPKFPRCSIFNFLL--SASTQEQLKS 239

Query: 332 GEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLD 391
            + S  +K+  FTL  MA GGI+D +G GFHRYSV   W +PHFEKMLYDQGQL  VYLD
Sbjct: 240 SKESILEKL-FFTLSKMAYGGIYDQIGFGFHRYSVTPDWKIPHFEKMLYDQGQLVPVYLD 298

Query: 392 AFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSK 451
           ++ L+K+  +  I +  L Y++  +    G  FSAEDADS     +  K EGAFY+W  +
Sbjct: 299 SYILSKNELFKNISKSTLKYVQNYLTHKDGGFFSAEDADSFNE--SNEKSEGAFYIWNFE 356

Query: 452 EVEDIL---GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 508
           +++  L    E   ++   Y L   GN  ++   DPHNEF  KN+++ +  +  +A+   
Sbjct: 357 DIKKALENDKEAIEIYSFIYGLVENGN--VNPKDDPHNEFIDKNIIMRIKSNQDAANYFK 414

Query: 509 MPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 567
              ++  + L   R+KL   R   +PRP LDDK+IV+WNGL+IS+FARA +I        
Sbjct: 415 KSTKEIESSLESSRKKLLTYRDTFKPRPPLDDKIIVAWNGLMISAFARAYQI-------- 466

Query: 568 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 627
              FP    D + Y+E A+ A  FI+ +LY++ T  L  +F++ PS    F DDYA LI 
Sbjct: 467 ---FP----DEESYLESAKRATKFIKDNLYNQATKTLIRNFKDSPSLIHAFADDYASLIQ 519

Query: 628 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDRE-GGGYFNTTGEDPSVLLRVKEDHDGAE 686
           GLLDLY+     ++L WAIELQ  QD+LF D +  GGYF+T+G+D S+L R+KE+HDGAE
Sbjct: 520 GLLDLYQCTFEIEYLEWAIELQEKQDQLFYDSQLPGGYFSTSGDDKSILHRLKEEHDGAE 579

Query: 687 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 740
            S  S+SV NL++L S+    +   Y++ A  +L      L+   + +P M C+
Sbjct: 580 NSCQSISVSNLLKLYSVTYNQE---YKEKALATLDSCSLYLEKAPIVMPQMMCS 630


>gi|21674102|ref|NP_662167.1| hypothetical protein CT1279 [Chlorobium tepidum TLS]
 gi|21647257|gb|AAM72509.1| conserved hypothetical protein [Chlorobium tepidum TLS]
          Length = 710

 Score =  530 bits (1365), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 291/678 (42%), Positives = 386/678 (56%), Gaps = 42/678 (6%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K  N+L  E SPYLLQHA NPVDW  WGEEAF+ AR+   PIFLS GYSTCHWCHVME E
Sbjct: 3   KQPNKLIREKSPYLLQHAWNPVDWHPWGEEAFSRARETGRPIFLSSGYSTCHWCHVMEHE 62

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFE+   A LLN  FV +K+DREE PDVD +YM +VQA  G GGWP+SV+++PDLKP  G
Sbjct: 63  SFENAETAALLNRHFVPVKLDREEHPDVDHLYMMFVQATTGRGGWPMSVWMTPDLKPFFG 122

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           G+YFP  +++G P F+++L  + + W+  R  L  S    ++QLS        +    DE
Sbjct: 123 GSYFPATERWGMPSFRSVLEHLANLWEHDRPRLLASAGSIMDQLSGLTRPQEGT----DE 178

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
           +       C   L + +D+ +GGFG  PKFPRP  +  +  H+     TG          
Sbjct: 179 VTDAHASACLAALERGFDAEWGGFGGEPKFPRPAVLSFLFSHAVA---TGN----RHALD 231

Query: 340 MVLFTLQCMAKGGIHDH------VGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
           M L TL+ MA GGIHDH       GGGF RYS D  WHVPHFEKMLYD  QLA  YL+A+
Sbjct: 232 MALLTLRKMAAGGIHDHLGVAGLGGGGFARYSTDRFWHVPHFEKMLYDNAQLAASYLEAY 291

Query: 394 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
             + D  ++   RDI  Y+  DM  P G  +SAEDADS +  G+  K+EGAFY+WT +E+
Sbjct: 292 QASGDELFANTARDIFHYVLCDMTSPEGAFWSAEDADSLDPYGSGEKREGAFYLWTEQEI 351

Query: 454 EDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 512
             +L  E A LF   Y ++  GN       DPH EF GKN+LI     +  A    +P+E
Sbjct: 352 TGLLDPEEATLFIATYGIRSDGNAPF----DPHGEFTGKNILIRTMSDNELAGTFEIPIE 407

Query: 513 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 572
                L   R+KLF+ R KRPRP LDDK++ SWNGL++S+ A+ S +L            
Sbjct: 408 TVGKRLNSARKKLFEARKKRPRPGLDDKILTSWNGLMLSALAKGSLVLGD---------- 457

Query: 573 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 632
                    +E AE AA FI   L D ++ +L   +R+G +   G   DYA LI GLLDL
Sbjct: 458 ------TTLLEAAERAARFILDTLCDSKSGKLLRRYRDGQAAIEGKAADYACLILGLLDL 511

Query: 633 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 692
           Y     + WL  AI+L   Q E F D+E G +++T  ED SV LR+ ED+D AEPS NSV
Sbjct: 512 YSASFDSDWLRAAIKLAEAQIERFFDQEAGVFYSTAVEDHSVPLRMIEDNDNAEPSANSV 571

Query: 693 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHV 752
           + +N +RLA+I      D +R  A  ++  F   L     A+PL+   A  ++  S   +
Sbjct: 572 NALNYLRLAAITG---RDEFRTIALRTIRHFSGTLDANPSALPLLLV-ARQIATASPVQI 627

Query: 753 VLVGHKSSVDFENMLAAA 770
           +  G + +     ++A A
Sbjct: 628 IFAGKRGNPALAKLVATA 645


>gi|225156854|ref|ZP_03724957.1| protein of unknown function DUF255 [Diplosphaera colitermitum TAV2]
 gi|224802800|gb|EEG21050.1| protein of unknown function DUF255 [Diplosphaera colitermitum TAV2]
          Length = 758

 Score =  530 bits (1364), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 314/713 (44%), Positives = 415/713 (58%), Gaps = 53/713 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   SPYL QHA NPV W  WGE AFAEA  R VPIFLSIGYSTCHWCHVM  ESFE
Sbjct: 3   NRLAFARSPYLQQHAGNPVHWQEWGEAAFAEAHARQVPIFLSIGYSTCHWCHVMARESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +E VA +LN+ FVSIKVDREERPDVD++YM YVQA+ G GGWPLS +L+PDLKP  GGTY
Sbjct: 63  NESVAAVLNEHFVSIKVDREERPDVDRIYMAYVQAMTGRGGWPLSAWLTPDLKPFYGGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAW--DKKRDMLAQSGAFAIEQLS------EALSASASSN 274
           FPP D+ GRPGF  +L  + +AW  + +R  L    A  I+ L+      +  S  A + 
Sbjct: 123 FPPHDQQGRPGFLAVLHAITEAWSDEAERHKLVAESARVIQALTDYHAGKQHASVPAHTR 182

Query: 275 KLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEA 334
            L D    +A   C  QL +S+D   GGFG APKFPR   +   L+    ++ T +S   
Sbjct: 183 PLHDRA-ADAFEHCFLQLRESFDPAHGGFGGAPKFPRASNLD-FLFRVAAIQGT-QSEVG 239

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
            E  K+   TL+ M  GGIHDHVGGGFHRY+VDE W VPHFEKMLYDQ Q+A   LDA  
Sbjct: 240 REAVKLATTTLRHMIAGGIHDHVGGGFHRYAVDETWLVPHFEKMLYDQAQIAVNLLDAAL 299

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSA----ETEGATRK----KEGAFY 446
           +T D  Y+++ R  LDY+ RD+  P G  FSAEDADSA    + + + R      EGAFY
Sbjct: 300 VTGDERYAWVARSTLDYVLRDLRHPAGGFFSAEDADSAVPHDDGDASPRAHGNHAEGAFY 359

Query: 447 VWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMS------DPHNEFKGKNVLIELND 499
           VWT+ E+  IL  + A  F  H+ +  + + + +         DPH E  GKN+L     
Sbjct: 360 VWTTAELRRILPSDTADRFILHFGVAGSHDANAAEAGNVPPAHDPHGELSGKNILHHTRP 419

Query: 500 SSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 559
            + +A+ LG+               L  VR+ RPRPHLDDK+I +WNGL I++FARA+  
Sbjct: 420 IAETAAALGLDPAALAAEFARALETLRAVRAARPRPHLDDKIITAWNGLAITAFARAAAS 479

Query: 560 LKSEAESAMFNFPVVGSDRKE-YMEVAESAASFIRRHLYDEQTHR------LQHSFRNGP 612
             +  +           DR+E Y++ A +AA FI R LYD+          L  ++R+G 
Sbjct: 480 PAACLD-----------DRREFYLDAALTAARFIERELYDDDGGDAPARCILWRNWRDGR 528

Query: 613 SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDP 672
             + GF +DYAFLI+GLLDL+E      WL  A  LQ T D LF D   GGYFNT    P
Sbjct: 529 GASEGFAEDYAFLIAGLLDLHEATLDPHWLRRAARLQETMDHLFWDDAHGGYFNTPAGSP 588

Query: 673 SVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAM 732
            ++LR+KED+DGAEP+  S++  NL RL+++    + D     A  ++     + +    
Sbjct: 589 HLVLRLKEDYDGAEPAPGSIAAANLQRLSALF---QDDTLHARAVRTVESLRGQWETTPH 645

Query: 733 AVPLMCCAAD-MLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVSKKS 784
           A+P +  A + +L  P++  ++L G   S DF  + A   A    +KT+ + +
Sbjct: 646 ALPALLFALERILEEPAQ--IILAGDPRSHDFRALAAVLRAR---DKTLRRHT 693


>gi|193212931|ref|YP_001998884.1| hypothetical protein Cpar_1281 [Chlorobaculum parvum NCIB 8327]
 gi|193086408|gb|ACF11684.1| protein of unknown function DUF255 [Chlorobaculum parvum NCIB 8327]
          Length = 708

 Score =  530 bits (1364), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 284/688 (41%), Positives = 406/688 (59%), Gaps = 42/688 (6%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           +  NRL  E SPYLLQHA NPVDW  WGEEAF +A+++++PIFLS GYSTCHWCHVME E
Sbjct: 3   QQPNRLINEKSPYLLQHAWNPVDWHPWGEEAFRKAQQQELPIFLSSGYSTCHWCHVMERE 62

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFED  +A  LN  FV +K+DREE PD+D+ YM +VQA     GWP+SV+++PD KP  G
Sbjct: 63  SFEDPEIAGFLNAHFVPVKLDREEHPDIDRFYMLFVQATTSNAGWPMSVWMTPDRKPFFG 122

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           G+YFPP +++G P F+++L  +   W+  R  L  S    ++QL +     +    + D 
Sbjct: 123 GSYFPPAERWGMPSFRSVLETLARMWEHDRPKLLASAGSIMDQLFDIAKPQSGPGDVSD- 181

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
              +A R C E L++ +D+ +GGFG+APKFP+P  +  +  H+ +   TG    A     
Sbjct: 182 --AHAAR-CFEALAQRFDAEWGGFGNAPKFPQPSILGFLFSHAAR---TGNQTAAD---- 231

Query: 340 MVLFTLQCMAKGGIHDHV------GGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
           M L TL+ MA GG+HD +      GGGF RYS D  WHVPHFEKMLYD  QLA  YL+A+
Sbjct: 232 MALVTLRKMAAGGLHDQLGVTGRGGGGFARYSTDRFWHVPHFEKMLYDNAQLAASYLEAY 291

Query: 394 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
            LT +  ++   RDI +Y+  DM  P G  +SAEDADS +  G+  K+EG FYVWT +E+
Sbjct: 292 QLTGEALFADTARDIFNYVLCDMTSPEGGFWSAEDADSLDPNGSGEKREGTFYVWTEEEI 351

Query: 454 EDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 512
            ++L  + A+LF E Y ++P GN  +    DPH EF G+N+L          ++ G+ ++
Sbjct: 352 GNLLDPDEAVLFMEAYGVRPEGNAPV----DPHGEFIGRNILKRTASDEELTNRFGLSMD 407

Query: 513 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 572
           +    L E R KLF+ R  RPRP LDDK++V+WNG++IS+ A+ + +L+           
Sbjct: 408 EASRRLKEARSKLFESRLTRPRPGLDDKILVAWNGMMISALAKGALVLRD---------- 457

Query: 573 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 632
                 K+ +E AE AA FI   LYD  T +L   +R+G +   G   DYA +I  L+DL
Sbjct: 458 ------KKLLEAAERAALFILGTLYDSATGKLLRRYRDGEAAIDGKASDYACMIQALIDL 511

Query: 633 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 692
           Y+     ++L  AI L  TQ E F D++ G +++T  +D S  LR+ ED+D AEPS NSV
Sbjct: 512 YQASLDPEYLSTAIALAETQIERFFDQKQGVFYSTAFDDESAPLRMIEDNDTAEPSPNSV 571

Query: 693 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHV 752
           S  N +RLA++      D  R+ A  ++  F + L    +A+PLM  A  M    +   +
Sbjct: 572 SAFNYLRLAAMTG---RDELREIALRTINFFSSTLDANPVALPLMLAARAMADT-APAQL 627

Query: 753 VLVGHKSSVDFENMLAAAHASYDLNKTV 780
           ++ G +S    +  + AA   +    T+
Sbjct: 628 IVSGKRSDPAIQRFVEAASRHFQPELTI 655


>gi|157123455|ref|XP_001653842.1| hypothetical protein AaeL_AAEL001725 [Aedes aegypti]
          Length = 752

 Score =  526 bits (1354), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 291/682 (42%), Positives = 387/682 (56%), Gaps = 50/682 (7%)

Query: 98  RNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVME 157
           + KHTNRL  E SPYLLQHAHNPVDW+ WGEEA A A+  +  IFLS+GYSTCHWCHVME
Sbjct: 11  KPKHTNRLINEKSPYLLQHAHNPVDWYPWGEEAIARAKAENKLIFLSVGYSTCHWCHVME 70

Query: 158 VESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPL 217
            ESFE+E VA ++N+ F++IKVDREERPD+DK+YMT++  + G GGWP+SV+L+PDL P+
Sbjct: 71  KESFENEQVADIMNENFINIKVDREERPDIDKLYMTFILLINGSGGWPMSVWLTPDLAPV 130

Query: 218 MGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLP 277
            GGTYFPP+D++G PGF TIL K+K+ W    + LA +G   I+ +   +          
Sbjct: 131 TGGTYFPPKDRWGMPGFTTILLKLKNKWITDGEDLASTGKSIIDAIQRNVEEKHQEEAER 190

Query: 278 DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEG 337
              P+   R       +++D  +GG   APKFP   ++ ++ +   +   T   G     
Sbjct: 191 VFTPEEKYRQAVTIYKRNFDPVWGGSLGAPKFPEVSKLNLIFHAHLQDPSTKILG----- 245

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
             +VL TL+ MA GGI+DHV GGF RYSVD++WHVPHFEKMLYDQGQL   Y + +  T+
Sbjct: 246 --VVLNTLEKMAAGGIYDHVFGGFARYSVDKKWHVPHFEKMLYDQGQLLMAYANGYKTTR 303

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
              Y  +   I  Y+ +D+  P G  +S EDADS  T  +T K EGAFY WT  EV D+L
Sbjct: 304 KPLYLEVADSIYRYISKDLQHPAGGFYSGEDADSLPTWESTDKIEGAFYAWTFAEVRDLL 363

Query: 458 GEH------------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 505
             +              +F EHY ++ TGN + S  SDPH    GKN+ I       +A 
Sbjct: 364 KANLDKFGDIGKVDPVEVFTEHYDIQETGNVEPS--SDPHGHLLGKNIPIVYGSVRETAD 421

Query: 506 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 565
           K     E    IL      L +VR KRPRPHLD K+I +WNGL++S  ++ S I  +   
Sbjct: 422 KFETTAEVVGKILKVGNELLHEVRDKRPRPHLDTKIICAWNGLILSGLSQLSCIKDA--- 478

Query: 566 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP--------G 617
                      +R  Y++      SFIR +LYD Q  +L  S     S           G
Sbjct: 479 ----------PNRDNYLKSCSKLVSFIRENLYDVQARKLLRSCYGDESDQAKSLETPIYG 528

Query: 618 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 677
           F+DDYAFLI GL+D Y     T  L WA ELQ  QDELF D + G YF +     +V++R
Sbjct: 529 FIDDYAFLIKGLIDYYRASLDTGALSWAKELQEIQDELFWDHKHGAYFYSEANSANVVVR 588

Query: 678 VKE---DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV 734
           +KE   DHDGAEP GNSVS  NL+ L       ++  +R+ A    + F + +      +
Sbjct: 589 LKEGKLDHDGAEPCGNSVSAHNLIMLGDYF---ETAAFREKANKLFSYF-SNVTPFGYVL 644

Query: 735 PLMCCAADMLSVPSRKHVVLVG 756
           P M  A  +L    R  +V+VG
Sbjct: 645 PEMMSAM-LLQENGRDMLVVVG 665


>gi|268530908|ref|XP_002630580.1| Hypothetical protein CBG13036 [Caenorhabditis briggsae]
          Length = 724

 Score =  521 bits (1342), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 293/677 (43%), Positives = 392/677 (57%), Gaps = 51/677 (7%)

Query: 101 HTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVES 160
           HTNRLA+E SPYLLQHA+NP+DWF WGEEAF +AR+ + PIFLS+GYSTCHWCHVME ES
Sbjct: 10  HTNRLASEKSPYLLQHANNPIDWFPWGEEAFQKARESNKPIFLSVGYSTCHWCHVMEKES 69

Query: 161 FEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGG 220
           FE+E  AKLLND FV+IKVDREERPDVDK+YM +V A  G GGWP+SVFL+PDL P+ GG
Sbjct: 70  FENENTAKLLNDNFVAIKVDREERPDVDKLYMAFVVAASGHGGWPMSVFLTPDLHPITGG 129

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFPP+D  G  GF TIL  + + W K+ + L   GA  I+ L   L+ S   N+  D  
Sbjct: 130 TYFPPDDNRGMLGFPTILNMIHEEWQKEGENLKARGAQIIKLLQPKLN-SGDVNRSED-- 186

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
                R    +   S+DSR GGFG APKFP+P ++  ++  +   +    S  + E  KM
Sbjct: 187 ---VFRAIFTRHQSSFDSRLGGFGGAPKFPKPSDLDFLICMANT-DPILNSESSKESVKM 242

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
           +  TL+ MA GGIHDH+G GFHRYSVD  WHVPHFEKMLYDQ QL   Y D + LT    
Sbjct: 243 IQKTLESMADGGIHDHIGNGFHRYSVDAEWHVPHFEKMLYDQSQLLATYSDFYRLTGRKL 302

Query: 401 --YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
                I  DI  Y+++     GG  +SAEDADS     +T+K EGAF VW  +E++ +LG
Sbjct: 303 DNIKTIVDDIFQYMQKISHKDGG-FYSAEDADSLPRHDSTKKMEGAFCVWEKEEIKILLG 361

Query: 459 EHAI-------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPL 511
           E  I       +F ++  ++  GN  +SR SDPH E K KNVL +L      A    + +
Sbjct: 362 EMKIGSANLVDVFNDYLDVEENGN--VSRSSDPHGELKNKNVLRKLLTDEECAINHDITV 419

Query: 512 EKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNF 571
           ++ +  +   ++ L++ R+KRP PHLD K++ +W GL I+   +A +             
Sbjct: 420 DELIEGMQRAKKILWEARTKRPSPHLDSKMVTAWQGLAITGLVKAYQ------------- 466

Query: 572 PVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPS--------KAPGFLDDYA 623
               ++  +Y+E AE  A F++++L   +   L+ S   GP+        +   F DDYA
Sbjct: 467 ---ATNDTKYIERAEKCAEFVQKYL--AENGELKRSVYLGPTGEVEQGNQEMKAFSDDYA 521

Query: 624 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 683
           F+I  LLDLY       +L  AIELQ   D  F    G GYF +   D  V +R+ ED D
Sbjct: 522 FMIQALLDLYTTLGKDDYLKNAIELQKICDSKFW--SGNGYFISEQTDEKVSVRMIEDQD 579

Query: 684 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 743
           GAEP+  S++  NL+R   I+   + + YR+ A         RL  + +A+P M  A + 
Sbjct: 580 GAEPTATSIASNNLLRFYDIL---EDEEYREKAHQCFRGASERLNKVPIALPKMAVALNR 636

Query: 744 LSVPSRKHVVLVGHKSS 760
               S    VLVG   S
Sbjct: 637 WQKGSIT-FVLVGEPDS 652


>gi|423073704|ref|ZP_17062443.1| hypothetical protein HMPREF0322_01864 [Desulfitobacterium hafniense
           DP7]
 gi|361855545|gb|EHL07513.1| hypothetical protein HMPREF0322_01864 [Desulfitobacterium hafniense
           DP7]
          Length = 706

 Score =  520 bits (1339), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 299/676 (44%), Positives = 390/676 (57%), Gaps = 52/676 (7%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           NK  NRL  E SPYLLQHAHNPVDW+ WGEEAFA+A+  + PIFLSIGYSTCHWCHVME 
Sbjct: 12  NKVPNRLLQEKSPYLLQHAHNPVDWYPWGEEAFAKAKAENKPIFLSIGYSTCHWCHVMER 71

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPD-LKPL 217
           ESFEDE VA+L+N +FV IKVDREERPDVD +YM + QAL G GGWPL++FL+PD  KP 
Sbjct: 72  ESFEDEEVAQLINRYFVPIKVDREERPDVDHIYMEFCQALTGSGGWPLTLFLTPDERKPF 131

Query: 218 MGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDML---AQSGAFAIEQLSEALSASASSN 274
             GTYFP E +YGRPG   +L ++ + W K +  +   A S   A+    E   +S +  
Sbjct: 132 YAGTYFPKESRYGRPGILDLLSQLGELWAKDQPKIRGSADSIYKAVTSREEPSVSSLTPA 191

Query: 275 KLPDELP--QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSG 332
           +  D +P  +  L    + L KS+D ++GGFG APKFP P  +  +L ++    D G   
Sbjct: 192 QQDDFIPWAKEILDTAFQTLQKSFDRQYGGFGRAPKFPTPHHLTFLLRYA---HDHGDGL 248

Query: 333 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 392
           EA +   MV  TL+ M +GGI DHVG GF RYS D RW VPHFEKMLYD   LA  YL+ 
Sbjct: 249 EAQQASLMVRTTLERMGQGGIFDHVGFGFARYSTDRRWLVPHFEKMLYDNALLAIAYLET 308

Query: 393 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 452
           +    D +     R+I  Y+ RDM  P G  +SAEDADS   EG     EG FYVWT +E
Sbjct: 309 YQAEHDPYDGQKAREIFAYVLRDMTAPEGGFYSAEDADS---EGV----EGKFYVWTPQE 361

Query: 453 VEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN-DSSASASKLGMP 510
           + +ILG E   L+ + Y + P GN            F+GK++   L+ D  A  S     
Sbjct: 362 IHEILGNEEGRLYCQAYGITPEGN------------FEGKSIPNLLDTDWEALESDWQQS 409

Query: 511 LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
           L      L + R KLF VR +R  PH DDK++ SWNGL+I++ A+ +++L   A      
Sbjct: 410 LSALKERLEKSREKLFAVRKERIPPHKDDKILTSWNGLMIAALAKGTQVLGEPA------ 463

Query: 571 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLL 630
                     Y E AE A  FIR++LY  Q  RL   +R+G S   G+LDDYAFLI GL+
Sbjct: 464 ----------YAEAAEQAVYFIRKNLYANQ--RLLARYRDGDSAHLGYLDDYAFLIWGLI 511

Query: 631 DLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGN 690
           +LY+     + L +A++LQ  QDELF D    GYF T  +   +L+R KE +DGA PSGN
Sbjct: 512 ELYQASGQKEHLEFALQLQREQDELFWDGAKSGYFLTGRDAEELLIRPKEIYDGATPSGN 571

Query: 691 SVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRK 750
           S+S +NL+RLA +      +   + A   +  F+  L            A       SR+
Sbjct: 572 SISALNLIRLARLTGDGMLE---ERAYEQINAFKATLAAYPSGYSAFLQAIQFALQESRE 628

Query: 751 HVVLVGHKSSVDFENM 766
            ++L G     + ENM
Sbjct: 629 -IILAGSLQHPELENM 643


>gi|403418379|emb|CCM05079.1| predicted protein [Fibroporia radiculosa]
          Length = 791

 Score =  520 bits (1338), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 305/716 (42%), Positives = 399/716 (55%), Gaps = 80/716 (11%)

Query: 101 HTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVES 160
           H NRL+   SPYLLQHA NPVDW+ WG EAF +AR+ D PIFLS+GYS CHWCHV+  ES
Sbjct: 15  HLNRLSHAKSPYLLQHAENPVDWYEWGPEAFEKARQEDKPIFLSVGYSACHWCHVLAHES 74

Query: 161 FEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGG 220
           FED+  A L+N+ +++IKVDREERPDVD++YMT++QA  GGGGWP+S++L+P+L P   G
Sbjct: 75  FEDKVTANLMNEHYINIKVDREERPDVDRLYMTFLQASSGGGGWPMSIWLTPELHPFFAG 134

Query: 221 TYFPPEDKYGRPG-FKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
              P    Y  PG F+ +L K+ D W+   D    SG   IE L +A +  + +    DE
Sbjct: 135 PSLPVPQTYFPPGRFRQVLYKLADIWESDPDRCRASGKQIIESLRDATNVKSGT----DE 190

Query: 280 LPQNALRLCA-EQLSKSYDSRFGGFGSAPKFPRPVEIQMML-------YHSKK------- 324
           LP  +L L    +L+K +D+R+GGF SAPKFP+P +    L        HSK        
Sbjct: 191 LPVVSLALTVYARLAKRFDTRYGGFSSAPKFPQPSQTTQFLARYAALRMHSKDSGAGEQK 250

Query: 325 ----------LEDTGKSG-----------------EASEGQKMVLFTLQCMAKGGIHDHV 357
                      E  G+ G                 EA   + M   TL  + KGGIHD V
Sbjct: 251 NADEVLKHLDAESLGEDGKDSKLSEPSSKPKSKQEEAEHARDMAAETLVQIYKGGIHDVV 310

Query: 358 GGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL------------TKDVFYSYIC 405
            GGF RYSVDERWHVPHFEKMLYDQ QL    L+  SL            T+    + + 
Sbjct: 311 EGGFARYSVDERWHVPHFEKMLYDQAQLLTSALELASLLPHSSDGPPLSSTRTTLLA-LA 369

Query: 406 RDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFK 465
           R IL YL R +  P G  +SAEDADS     +T+ KEGAFY WT+ +   ILGE A +  
Sbjct: 370 RSILIYLPRHLTSPEGGFYSAEDADSLPAADSTKTKEGAFYTWTANQFSRILGEDAEVAV 429

Query: 466 EHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKL 525
             Y +K  GNCD   M D   E KG+NVL   +    +A K G P+E+    L     KL
Sbjct: 430 WAYGVKEDGNCD--PMHDIQGELKGQNVLFMAHTPEEAAEKFGRPVEEVRCALQHSLDKL 487

Query: 526 FDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEV 584
              R + RPRPHLDDK++  WNGL+IS  ARA++  +             G +  + + +
Sbjct: 488 RAFRDENRPRPHLDDKILTCWNGLMISGLARATETFE-------------GEEAVQALTL 534

Query: 585 AESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVW 644
           AE +A+F+R  LY+E +  L  S+R G +   G  DDYAFLI GLLDLYE     ++++W
Sbjct: 535 AERSAAFLRAQLYNEASGELTRSWREG-AGPKGQADDYAFLIQGLLDLYEACGKEEYVIW 593

Query: 645 AIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIV 704
           AI LQ  QDELF D EG GYF  +  D  +L+R+K+  DGAEPS  SV++ NL+RL S  
Sbjct: 594 AIRLQEKQDELFFDAEGCGYF-ASAPDEHILIRMKDAQDGAEPSAVSVTLSNLLRL-SHF 651

Query: 705 AGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSS 760
           A  +   Y + A+  LA     L     A+  M  AA M      K ++L    +S
Sbjct: 652 AEDRHKEYDEKAKSILASNAQLLGAAPYALAAMVSAA-MCREKGYKQIILTESPAS 706


>gi|194334203|ref|YP_002016063.1| hypothetical protein Paes_1395 [Prosthecochloris aestuarii DSM 271]
 gi|194312021|gb|ACF46416.1| protein of unknown function DUF255 [Prosthecochloris aestuarii DSM
           271]
          Length = 720

 Score =  519 bits (1336), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 298/694 (42%), Positives = 401/694 (57%), Gaps = 42/694 (6%)

Query: 94  TSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWC 153
           T   +NK  N L+ E SPYLLQHA+NPV W AWG +AF  + + D PIFLS+GYSTCHWC
Sbjct: 2   TMKEKNKVPNALSKEKSPYLLQHAYNPVQWLAWGPDAFNTSLREDKPIFLSVGYSTCHWC 61

Query: 154 HVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPD 213
           HVME ESFE++ +A++LN  FV +K+DREERPD+D++YM YVQA  G GGWP+SV+L+P+
Sbjct: 62  HVMERESFENDEIAQVLNHSFVPVKIDREERPDIDRLYMAYVQASTGSGGWPMSVWLTPE 121

Query: 214 LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASS 273
           LKP  GGTY+PPED++GRPGF ++L  + DAW + R  L        + +   L + +++
Sbjct: 122 LKPFYGGTYYPPEDRFGRPGFLSLLHSIADAWKEDRKKLEH----VADGIQSQLKSFSTA 177

Query: 274 NKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGE 333
              P+ L +  L     Q+S  +D   GGF SAPKFPRP  +  +  ++     TG+   
Sbjct: 178 APHPESLGEKVLDDAFMQISSHFDPVAGGFSSAPKFPRPSILTFLFNYAYF---TGR--- 231

Query: 334 ASEGQKMVLFTLQCMAKGGIHDHV------GGGFHRYSVDERWHVPHFEKMLYDQGQLAN 387
             E   M L TL+ MA+GGIHDH+      GGGF RY+ D  WHVPHFEKMLYD   LA 
Sbjct: 232 -EEASAMALLTLERMARGGIHDHLGVKGKGGGGFARYATDALWHVPHFEKMLYDNALLAL 290

Query: 388 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 447
            +L+AF LTK+  Y+    DI +Y+  DM  P G  +SAEDADS     +  K EG FYV
Sbjct: 291 SFLEAFQLTKETLYAQTAEDIFNYVLCDMTSPEGAFYSAEDADSFPDRESKTKIEGGFYV 350

Query: 448 WTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 506
           WT  E+ ++L      +F   Y +K  GN     + DPH  F+ KN+L    D   +A  
Sbjct: 351 WTKTEIAELLDPLEEQIFSFRYGVKQNGNV----LEDPHGTFERKNILSLKADEETTAKH 406

Query: 507 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 566
             +P ++  N+      KLF  R +RPRP  DDK+I SWN L+IS+ A+ S++L++    
Sbjct: 407 FDLPTDQVANLSRSAIEKLFQARMRRPRPDRDDKIITSWNALMISALAKGSRVLQN---- 462

Query: 567 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 626
                        +Y+  AE AA FI  +L++  T  L   +  G S   G  +DYAFLI
Sbjct: 463 ------------TDYLTAAEKAAGFIGDNLFENGTGNLLRRYCKGESGITGQAEDYAFLI 510

Query: 627 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 686
            GLLDLYE       L  A EL   Q E F D E GG+FN + ++ SV +R+KED+DGAE
Sbjct: 511 QGLLDLYEASFDDSLLHKAQELAERQCEHFYDDEHGGFFNASSQEASVPIRLKEDYDGAE 570

Query: 687 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 746
           PS NSVSV+N  RL  ++ G +  +Y   AE +L  F   L    M +P M      L  
Sbjct: 571 PSANSVSVMNFSRLW-LMTGKQ--HYLDIAEKTLYYFSAILAANGMQLPEMLAGYARLLH 627

Query: 747 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           PS   V+L G +S   F+ +  +    Y    TV
Sbjct: 628 PSNT-VILTGSQSDPAFKALKKSVEQLYLPGTTV 660


>gi|119357268|ref|YP_911912.1| hypothetical protein Cpha266_1460 [Chlorobium phaeobacteroides DSM
           266]
 gi|119354617|gb|ABL65488.1| protein of unknown function DUF255 [Chlorobium phaeobacteroides DSM
           266]
          Length = 720

 Score =  517 bits (1331), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 283/654 (43%), Positives = 378/654 (57%), Gaps = 46/654 (7%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           +  NRL  E SPYLLQHA NPVDW+ WG EAFA+A+K   PIFLS+GYSTCHWCHVME E
Sbjct: 6   RKPNRLIDEKSPYLLQHAENPVDWYPWGVEAFAKAKKESKPIFLSVGYSTCHWCHVMERE 65

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFED   A LLN  FV +KVDREE PD+D++YMT+VQ+  G GGWP+SV+L+PDL P  G
Sbjct: 66  SFEDPRTALLLNTNFVPVKVDREEYPDLDRLYMTFVQSTTGRGGWPMSVWLTPDLDPFYG 125

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           G+YFPP D+YG PGF T+L  +   W      +    A   +QL+     SA S K    
Sbjct: 126 GSYFPPVDRYGMPGFNTLLTSIARLWQTDPQSILDRSALFFQQLN-----SAESVKTEGS 180

Query: 280 LP-QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHSKKLEDTGKSGEASEG 337
           LP ++A   C   L  S+D  FGGFG+APKFPRPV +  +  YH      TG      + 
Sbjct: 181 LPSKDAANRCFRWLEDSFDRDFGGFGNAPKFPRPVLLDFLFNYHYH----TGN----EQA 232

Query: 338 QKMVLFTLQCMAKGGIHDHV------GGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLD 391
             M LFTL+ MA+GGIHDH+      GGGF RYS D  WH+PHFEKMLYD  QLA  ++ 
Sbjct: 233 LAMALFTLRKMAEGGIHDHLGIPEKGGGGFSRYSTDPFWHLPHFEKMLYDNAQLAISFVQ 292

Query: 392 AFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSK 451
           AF  + D FY+ +  DI +Y+  D+    G  +SAEDADS   + ++  +EGAFY W+ +
Sbjct: 293 AFQCSGDSFYAEVADDIFNYVLTDLASSEGAFYSAEDADSLPEQSSSVLEEGAFYRWSHE 352

Query: 452 EVEDI-LGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 509
           EV  +     +I LF   Y ++P GN     ++DPHNEF G N+L + +          M
Sbjct: 353 EVLRLPCSRRSIELFSRLYGIRPEGNV----LNDPHNEFAGLNILKKESSIEEIGRIFSM 408

Query: 510 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 569
             ++    L E R  L + R  RPRP LDDK++ SWNGL+IS+ AR  ++          
Sbjct: 409 REKEVAEALEEVRLALHNARLARPRPFLDDKILASWNGLMISALARGYRVFGD------- 461

Query: 570 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 629
                    K  +  A  A  F+   LY+  T +L   +RNG +   G  DDYAF + GL
Sbjct: 462 ---------KRLLLAANRATEFLLSTLYNRHTGKLLRRYRNGSAGIDGKADDYAFFVQGL 512

Query: 630 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 689
           LDLYE     + +  AI L  T   LF D   GG+ +T  +D S+  R++E++DGAEP+ 
Sbjct: 513 LDLYEADFDPRHIETAIALTETVILLFEDTIKGGFSSTASDDTSLPARMREEYDGAEPAA 572

Query: 690 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 743
           NSV  +NL+RL+ +    +   Y + AE+    F++ L   + A+P M  A + 
Sbjct: 573 NSVLAMNLLRLSEMTGEER---YNEKAENIFKAFDSILDTNSHALPAMLVALNF 623


>gi|386812871|ref|ZP_10100096.1| conserved hypothetical protein [planctomycete KSU-1]
 gi|386405141|dbj|GAB62977.1| conserved hypothetical protein [planctomycete KSU-1]
          Length = 704

 Score =  516 bits (1328), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 286/668 (42%), Positives = 393/668 (58%), Gaps = 57/668 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA+NPVDW+AWGEEAF +A + + P+FLSIGYSTCHWCHVME ESFE
Sbjct: 26  NRLIHEKSPYLQQHAYNPVDWYAWGEEAFQKAIRENKPVFLSIGYSTCHWCHVMEYESFE 85

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VAK+LN+ FVSIKVDREERPD+D +Y+T  QA+ G GGWPL++FL+P+ KP   GTY
Sbjct: 86  DEEVAKILNENFVSIKVDREERPDLDNIYITVCQAMTGSGGWPLNLFLTPEKKPFFAGTY 145

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE-LP 281
           FP  ++YG PGF  IL+K+ D W   ++ +  S     EQ+++ + ++A S   P E L 
Sbjct: 146 FPKTERYGNPGFIAILKKISDLWKTNKESVIASS----EQITKVIQSAAIST--PGEILT 199

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           +  L+    QL  ++DS +GGFGSAPKFP P     +L   K+  D           ++V
Sbjct: 200 KETLQHAYAQLRDNFDSIYGGFGSAPKFPTPHNYTFLLRWWKRSND-------PTALEIV 252

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             TL+ M +GGI+D +GGGFHRYS DE W VPHFEKMLYDQ   A  Y + +  T  VFY
Sbjct: 253 EKTLERMGRGGIYDQLGGGFHRYSTDEYWLVPHFEKMLYDQALAAIAYTETYQATGKVFY 312

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE-H 460
           +   R I  Y+ RDM  P G  +SAEDADS   EG     EG FYVWT  E+  ILGE  
Sbjct: 313 ADSVRGIFTYVLRDMTSPEGGFYSAEDADS---EGV----EGKFYVWTPDEIIKILGEKE 365

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL-GMPLEKYLNILG 519
             +F ++Y +   GN            F+ KN+L  ++    + SK+ G+   +   +L 
Sbjct: 366 GNIFCDYYDVSKEGN------------FEEKNIL-HVDKPVDTFSKMRGIKPAELEEVLR 412

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
             R KLF VR KR  PH DDK++ +WNGL+I++ A+ ++ L                +  
Sbjct: 413 TAREKLFSVREKRIHPHKDDKILTAWNGLMIAALAKGAQAL----------------NEP 456

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
           +Y + A  AA FI   L  ++   L   +R+G +  PG+LDDYA+ + GL+DLYE     
Sbjct: 457 KYTQAAMRAADFILNTL-RQKDGTLLRRYRSGEASIPGYLDDYAYFVWGLIDLYEATFEV 515

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
           K+L  A EL N   E F D +GGG+F +  ++  ++ + KE +DGA PSGNSV++ N++R
Sbjct: 516 KYLKIARELNNHMIENFQDEKGGGFFFSGKKNEQLITQTKEIYDGATPSGNSVALFNILR 575

Query: 700 LASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKS 759
           L  I   ++   + + AE  +  F   +K          CA D +  P+ K +V+ G   
Sbjct: 576 LGRITGNTE---FEKIAEQIIRAFGETIKQHPSGYTQFLCALDFVLGPT-KEIVIAGEPG 631

Query: 760 SVDFENML 767
           S D E +L
Sbjct: 632 SDDTERIL 639


>gi|89894906|ref|YP_518393.1| hypothetical protein DSY2160 [Desulfitobacterium hafniense Y51]
 gi|89334354|dbj|BAE83949.1| hypothetical protein [Desulfitobacterium hafniense Y51]
          Length = 699

 Score =  515 bits (1326), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 298/676 (44%), Positives = 389/676 (57%), Gaps = 52/676 (7%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           NK  NRL  E SPYLLQHAHNPVDW+ WGEEAFA+A+  D PIFLSIGYSTCHWCHVME 
Sbjct: 5   NKVPNRLLQEKSPYLLQHAHNPVDWYPWGEEAFAKAKAEDKPIFLSIGYSTCHWCHVMER 64

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPD-LKPL 217
           ESFEDE VA+L+N +FV IKVDREERPDVD +YM + QAL G GGWPL++FL+PD  KP 
Sbjct: 65  ESFEDEEVAQLINRYFVPIKVDREERPDVDHIYMEFCQALTGSGGWPLTLFLTPDERKPF 124

Query: 218 MGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS--EALSASASSNK 275
             GTYFP E +YGRPG   +L ++ + W K +  +  S     + ++  E  S S+ +  
Sbjct: 125 YAGTYFPKESRYGRPGILDLLSQLGELWAKDQPKIRGSADSIYKAVTSREEPSVSSLTPA 184

Query: 276 LPDEL---PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSG 332
           L D+     +  L    + L KS+D ++GGFG APKFP P  +  +L ++    D     
Sbjct: 185 LQDDFIPWAKEILDTAFQTLQKSFDRQYGGFGRAPKFPTPHHLTFLLRYA---HDHSDGL 241

Query: 333 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 392
           EA +   MV  TL+ M +GGI DHVG GF RYS D  W VPHFEKMLYD   LA  YL+ 
Sbjct: 242 EAQQAALMVRTTLERMGQGGIFDHVGFGFARYSTDRHWLVPHFEKMLYDNALLAIAYLEN 301

Query: 393 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 452
           +    D       R+I  Y+ RDM  P G  +SAEDADS   EG     EG FYVWT +E
Sbjct: 302 YQAQHDPHDEQKAREIFSYVLRDMTAPEGGFYSAEDADS---EGV----EGKFYVWTPQE 354

Query: 453 VEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN-DSSASASKLGMP 510
           + +ILG E   L+ + Y + P GN            F+GK++   L+ D  A  S+    
Sbjct: 355 IHEILGSEEGRLYCQAYGVSPEGN------------FEGKSIPNLLDTDWEALGSERQHS 402

Query: 511 LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
           LE     L + R KLF VR +R  PH DDK++ SWNGL+IS+ A+ +++L   A      
Sbjct: 403 LEVLKRRLEKSREKLFAVRKERIPPHKDDKILTSWNGLMISALAKGAQVLGEPA------ 456

Query: 571 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLL 630
                     Y E AE A  FIR++LY  Q  RL   +R+G S   G+LDDYAFLI GL+
Sbjct: 457 ----------YAEAAEQAVYFIRKNLYANQ--RLLARYRDGDSAHLGYLDDYAFLIWGLI 504

Query: 631 DLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGN 690
           +LY+     + L +A++LQ  QDELF D    GYF T  +   +L+R KE +DGA PSGN
Sbjct: 505 ELYQASGQKEHLEFALQLQREQDELFWDGAKSGYFLTGRDAEELLIRPKEIYDGATPSGN 564

Query: 691 SVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRK 750
           S+S +NL+RLA +      +   + A   +  F+  L            A       SR+
Sbjct: 565 SISALNLIRLARLTGDGMLE---ERAYEQINAFKATLATYPSGYSAFLQAIQFALQESRE 621

Query: 751 HVVLVGHKSSVDFENM 766
            ++L G     + +NM
Sbjct: 622 -IILAGSLQHPELKNM 636


>gi|156058630|ref|XP_001595238.1| hypothetical protein SS1G_03327 [Sclerotinia sclerotiorum 1980]
 gi|154701114|gb|EDO00853.1| hypothetical protein SS1G_03327 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 797

 Score =  513 bits (1321), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 273/638 (42%), Positives = 384/638 (60%), Gaps = 27/638 (4%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NR     SPY+  H+ NPV W  WG+EA   AR+ +  +F+SIGYS+CHWCH+ME ESF
Sbjct: 40  VNRAGESKSPYVRAHSSNPVAWQLWGDEAIDLARRENKLLFVSIGYSSCHWCHIMERESF 99

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           E+E VA +LN  F+ IK+DREERPD+D++YM +VQA  G GGWPL+VFL+P L+P+ GGT
Sbjct: 100 ENEEVAAILNSSFIPIKIDREERPDIDRIYMNFVQATTGSGGWPLNVFLTPSLEPVFGGT 159

Query: 222 YFPPEDKY----GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLP 277
           Y+P   K      +  F  IL K+   W ++     Q  A  ++QL +  +    SN+L 
Sbjct: 160 YWPGPSKTKAFEDQVDFLGILDKLSTVWSEQERRCRQDSAQILQQLKDFANEGTLSNRLG 219

Query: 278 DELPQNALRLCAE---QLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS---KKLEDTGKS 331
           D +    + L  E     +KS+D + GGFGSAPKFP P ++  +L  S   + + D    
Sbjct: 220 DAVDNIDIELLEEATQHFAKSFDKKNGGFGSAPKFPTPSKLAFLLRLSQFPQAVLDIVGI 279

Query: 332 GEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLD 391
            +    + + + TL+ MA+GGIHDH+G GF RYSV   W +PHFEKMLYD  QL ++YLD
Sbjct: 280 PDCENAKNIAITTLRKMARGGIHDHIGNGFARYSVTADWSLPHFEKMLYDNAQLLHIYLD 339

Query: 392 AFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSK 451
           AF L++D  +  +  DI DYL   +  P G  +S+EDADS    G T K+EGA+YVWT +
Sbjct: 340 AFLLSRDPEFLGVAYDIADYLTITLFHPQGGFYSSEDADSYYKAGDTEKREGAYYVWTKR 399

Query: 452 EVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 510
           E E+ILG EH  +    + +   GN  +++ +DPH+EF  +NVL   +  SA A++ GM 
Sbjct: 400 EFENILGTEHEPILSAFFNVTSHGN--VAQENDPHDEFMDQNVLAISSTPSALANQFGMK 457

Query: 511 LEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 569
             + + ++ E + KL   R + R +P +DDK+IVSWNG+ I + ARAS ++        F
Sbjct: 458 EAEIIKVIKEGKAKLRKRREADRVKPDMDDKIIVSWNGIAIGALARASAVING------F 511

Query: 570 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 629
           + PV   D   Y++ A   A FI+ +LYDE++  L   +R G     GF DDYAFL+ GL
Sbjct: 512 D-PVKAQD---YLDAALKTAKFIKENLYDEKSKILYRIWREGRGDTQGFADDYAFLMEGL 567

Query: 630 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 689
           +DLYE     KWL WA ELQ +Q   F D   GG+F+T    P+V+LR+KE  D AEPS 
Sbjct: 568 IDLYEATFDEKWLQWADELQQSQINFFYDTNKGGFFSTIASAPNVILRLKEGMDSAEPST 627

Query: 690 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 727
           N  S  NL RL+SI+     + Y + A  ++  FE+ +
Sbjct: 628 NGTSSSNLYRLSSIL---NDESYAKKANETVKSFESEM 662


>gi|302814858|ref|XP_002989112.1| hypothetical protein SELMODRAFT_1701 [Selaginella moellendorffii]
 gi|300143213|gb|EFJ09906.1| hypothetical protein SELMODRAFT_1701 [Selaginella moellendorffii]
          Length = 354

 Score =  513 bits (1320), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 237/354 (66%), Positives = 293/354 (82%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E+SPYLLQHAHNPVDW+ WGEEAFA+A+  D PIFLS+GYSTCHWCHVMEVESFE
Sbjct: 1   NRLLHENSPYLLQHAHNPVDWYPWGEEAFAKAKAEDKPIFLSVGYSTCHWCHVMEVESFE 60

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
            E VAKLLNDWFVSIKVDREERPDVDKVYMT+VQA  GGGGWP+SVFL+P+LKP++GGTY
Sbjct: 61  SEEVAKLLNDWFVSIKVDREERPDVDKVYMTFVQASQGGGGWPMSVFLTPELKPIVGGTY 120

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPPED YGRPGFKT+LR+VK+ WD ++ +L  +G   I+QL+EA++A A+S ++   + +
Sbjct: 121 FPPEDNYGRPGFKTVLRRVKENWDSRKAVLRNAGDNVIQQLAEAMAACATSLQVSGGVAE 180

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
            A++LCA QL K +D++ GGFGSAPKFPRPVE+ +ML + K+L+  GK+  + +  +M  
Sbjct: 181 QAVQLCASQLMKGFDAKLGGFGSAPKFPRPVELNLMLRYYKRLDQAGKASLSKKALEMAS 240

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
           F LQCMA+GG+HDHVGGGFHRYSVD+ WHVPHFEKMLYDQ QLAN YLD + +T+D  ++
Sbjct: 241 FNLQCMARGGMHDHVGGGFHRYSVDDYWHVPHFEKMLYDQAQLANAYLDVYLVTRDTMHA 300

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
            + RDILDYL RDM  P G IFSAEDADS E  G+++KKEGAFYVWT+KEV ++
Sbjct: 301 CVARDILDYLNRDMTHPEGGIFSAEDADSLEPSGSSKKKEGAFYVWTAKEVRNL 354


>gi|414153807|ref|ZP_11410129.1| conserved hypothetical protein [Desulfotomaculum hydrothermale Lam5
           = DSM 18033]
 gi|411454828|emb|CCO08033.1| conserved hypothetical protein [Desulfotomaculum hydrothermale Lam5
           = DSM 18033]
          Length = 691

 Score =  512 bits (1319), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 284/679 (41%), Positives = 389/679 (57%), Gaps = 57/679 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
            + TN L  E SPYLLQHAHNPV+WF WGEEAFA+A+  D PIFLSIGYSTCHWCHVME 
Sbjct: 6   TRSTNLLINEKSPYLLQHAHNPVNWFPWGEEAFAKAKAEDKPIFLSIGYSTCHWCHVMER 65

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFE   VA++LN +FVSIKVDREERPDVD++YM+  QAL G GGWPL+V ++P  KP  
Sbjct: 66  ESFESADVAEVLNKYFVSIKVDREERPDVDQIYMSVCQALTGSGGWPLTVIMTPQQKPFF 125

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFP E  YGRPG   IL ++   W+ +R  L   G    EQL+  L   A+ +  P 
Sbjct: 126 AGTYFPKETNYGRPGLIEILTRIAWLWEHERPSLLAMG----EQLTAHLHQEAAVS--PG 179

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
           +LP + L      L+++YD+ +GGFG+APKFP P  +  +L +  K +         +  
Sbjct: 180 QLPADILDQAYRLLARNYDASYGGFGTAPKFPTPHNLMFLLRYYYKTKQ-------PQAL 232

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
            MV  TL  M +GGI+DH+G GF RYSVD +W VPHFEKMLYD   LA  +L+ + +T +
Sbjct: 233 TMVEETLDAMHRGGIYDHIGFGFARYSVDHKWLVPHFEKMLYDNALLALAFLETYQVTGN 292

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
           + +  I ++I  Y+ RDM  P G  +SAEDADS  T       EG FY+W  +EV DILG
Sbjct: 293 MRFGRIAKEIFAYVLRDMTSPEGGFYSAEDADSEGT-------EGKFYLWQPQEVVDILG 345

Query: 459 E-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGMPLEKYL 515
           +    +F  +Y +   GN            F+G N+  LI   D    A++LG+ L   +
Sbjct: 346 QPDGEIFCRYYNITAQGN------------FEGSNIPNLIG-QDPRRFAAELGIELADLV 392

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
             + +CR  LF  RSKR  P  DDK++ +WNGL+I++ +R +++  SE            
Sbjct: 393 KGMEKCRSLLFKARSKRVHPFKDDKILTAWNGLMIAALSRGARVFHSEV----------- 441

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
                Y   A  A +FI + L      RL   FR+G +  P +LDDYAFL  GLL+LYE 
Sbjct: 442 -----YRTAAVKAVNFINQRL-RRPDGRLLARFRDGEAAFPAYLDDYAFLAWGLLELYEA 495

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
              T +L  A+ L     ELFLD++ GG+F    +   ++ R KE +DGA PSGNSV+ +
Sbjct: 496 TFDTDYLAEAVRLTEDMIELFLDQQHGGFFFYGKDSEQLISRPKEIYDGALPSGNSVAAV 555

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 755
           NL+RLA +   + +D + + A   L  F  +++           AA +L  P  + +VL 
Sbjct: 556 NLIRLARL---TGNDRFAELAHRQLTGFAQQVEQYPAGYSFFMIAAYLLQEPPLE-IVLT 611

Query: 756 GHKSSVDFENMLAAAHASY 774
           G  +      M+     ++
Sbjct: 612 GEAADDSLRRMIQTVQRAF 630


>gi|219669354|ref|YP_002459789.1| hypothetical protein Dhaf_3335 [Desulfitobacterium hafniense DCB-2]
 gi|219539614|gb|ACL21353.1| protein of unknown function DUF255 [Desulfitobacterium hafniense
           DCB-2]
          Length = 699

 Score =  512 bits (1318), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 297/679 (43%), Positives = 390/679 (57%), Gaps = 52/679 (7%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           NK  NRL  E SPYLLQHAHNPVDW+ WGEEAFA+A+  D PIFLSIGYSTCHWCHVME 
Sbjct: 5   NKVPNRLLQEKSPYLLQHAHNPVDWYPWGEEAFAKAKAEDKPIFLSIGYSTCHWCHVMER 64

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPD-LKPL 217
           ESFEDE VA+L+N +FV IKVDREERPDVD +YM + QAL G GGWPL++FL+PD  KP 
Sbjct: 65  ESFEDEEVAQLINRYFVPIKVDREERPDVDHIYMEFCQALTGSGGWPLTLFLTPDERKPF 124

Query: 218 MGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS--EALSASASSNK 275
             GTYFP E +YGRPG   +L ++ + W K +  +  S     + ++  E  S S+ +  
Sbjct: 125 YAGTYFPKESRYGRPGILDLLSQLGELWAKDQPKIRGSADSIYKAVTSREEPSVSSLTPA 184

Query: 276 LPDEL---PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSG 332
           L D+     +  L    + L KS+D ++GGFG APKFP P  +  +L ++    D     
Sbjct: 185 LQDDFIPWAKEILDTAFQTLQKSFDRQYGGFGRAPKFPTPHHLTFLLRYA---HDHSDGL 241

Query: 333 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 392
           EA +   MV  TL+ M +GGI DHVG GF RYS D  W VPHFEKMLYD   LA  YL+ 
Sbjct: 242 EAQQAALMVRTTLERMGQGGIFDHVGFGFARYSTDRHWLVPHFEKMLYDNALLAIAYLEN 301

Query: 393 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 452
           +    D       R+I  Y+ RDM  P G  +SAEDADS   EG     EG FYVWT +E
Sbjct: 302 YQAQHDPHDEQKAREIFSYVLRDMTAPEGGFYSAEDADS---EGV----EGKFYVWTPQE 354

Query: 453 VEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN-DSSASASKLGMP 510
           + +ILG E   L+ + Y + P GN            F+GK++   L+ D  A  S+    
Sbjct: 355 IHEILGSEEGRLYCQAYGVSPEGN------------FEGKSIPNLLDTDWEALGSERQHS 402

Query: 511 LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
           LE     L + R KLF VR +R  PH DDK++ SWNGL+I++ A+ +++L   A      
Sbjct: 403 LEVLKRRLEKSREKLFAVRKERIPPHKDDKLLTSWNGLMIAALAKGAQVLGEPA------ 456

Query: 571 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLL 630
                     Y E  E A  FIR++LY  Q  RL   +R+G S   G+LDDYAFLI GL+
Sbjct: 457 ----------YAEAVEQAVYFIRKNLYANQ--RLLARYRDGDSAHLGYLDDYAFLIWGLI 504

Query: 631 DLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGN 690
           +LY+     + L +A++LQ  QDELF D    GYF T  +   +L+R KE +DGA PSGN
Sbjct: 505 ELYQASGKKEHLEFALQLQREQDELFWDGAKSGYFLTGRDAEELLIRPKEIYDGATPSGN 564

Query: 691 SVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRK 750
           S+S +NL+RLA +    + +   + A   +  F+  L            A       SR+
Sbjct: 565 SISALNLIRLARLTGDGELE---KRAYEQINAFKATLSTYPSGYSAFLQAIQFALQESRE 621

Query: 751 HVVLVGHKSSVDFENMLAA 769
            ++L G     + +NM  A
Sbjct: 622 -IILAGPLQHPELKNMKTA 639


>gi|431794219|ref|YP_007221124.1| thioredoxin domain-containing protein [Desulfitobacterium
           dichloroeliminans LMG P-21439]
 gi|430784445|gb|AGA69728.1| thioredoxin domain protein [Desulfitobacterium dichloroeliminans
           LMG P-21439]
          Length = 698

 Score =  510 bits (1314), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 292/678 (43%), Positives = 395/678 (58%), Gaps = 51/678 (7%)

Query: 96  HSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHV 155
           +S+N   NRL  E SPYLLQHA+NPVDW+ WG+EAFA+A+ ++ PIFLSIGYSTCHWCHV
Sbjct: 2   NSKNGAPNRLINEKSPYLLQHAYNPVDWYPWGQEAFAKAKTQNRPIFLSIGYSTCHWCHV 61

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           ME ESFED  VA LLN +F++IKVDREERPDVD +YM + QAL G GGWPL++ ++PD K
Sbjct: 62  MERESFEDHEVADLLNRYFIAIKVDREERPDVDHIYMEFCQALIGSGGWPLTILMTPDQK 121

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAW---DKKRDMLAQSGAFAIEQLSEALSASAS 272
           P   GTYFP E +YGRPG   +L ++ + W   +KK    A+S   A+    E  +AS  
Sbjct: 122 PFYAGTYFPKESRYGRPGIIDVLHQLGELWRVDEKKVLSSAESIYTAVTTHKELPNASVV 181

Query: 273 SNKLPDELPQNALRLCA--EQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGK 330
           S++  D  P   + L A  +   +S+DS++GGF  APKFP P  +  +L ++    D G+
Sbjct: 182 SSQEDDFRPWAKVILEAAFQTFQESFDSQYGGFRQAPKFPTPHNLTFLLRYAY---DHGQ 238

Query: 331 SGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYL 390
           + +A +   MV  TL  M +GGI+DH+G GF RYS D+ W VPHFEKMLYD   LA  YL
Sbjct: 239 APKAQQATHMVRTTLDAMGQGGIYDHIGFGFARYSTDQHWLVPHFEKMLYDNALLAIAYL 298

Query: 391 DAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTS 450
           +++ +          R+I  Y+ RDM+ P G  +SAEDADS   EG     EG FYVWT 
Sbjct: 299 ESYQVQHLPRDEQKVREIFAYVLRDMVSPEGGFYSAEDADS---EGV----EGKFYVWTP 351

Query: 451 KEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN-DSSASASKLG 508
           +E+ ++LG E   L+   Y +   GN            F+GKN+   L+ + +A A +  
Sbjct: 352 QEIHELLGSEAGQLYCRAYDITRDGN------------FEGKNIPNLLHTEWTALAEEFN 399

Query: 509 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 568
           +  E+    L E R+ LF  R KR  PH DDK++ SWNGL+I++ A+ ++IL        
Sbjct: 400 LSREELSLQLEEARKVLFQAREKRIHPHKDDKILTSWNGLMIAALAKGAQIL-------- 451

Query: 569 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 628
                   D   Y + AE A SFI  +LY +Q  RL   +R+  S   G+LDDYAFLI G
Sbjct: 452 --------DDTTYTDAAEKAVSFIINYLYPKQ--RLLARYRDRDSAHLGYLDDYAFLIWG 501

Query: 629 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 688
           L++LY        L  A+ LQ  QDELFLD E  GYF T  +   +L+R KE +DGA PS
Sbjct: 502 LIELYSATGKKDHLGLALSLQKAQDELFLDTEQLGYFLTGHDAEELLIRPKEIYDGATPS 561

Query: 689 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPS 748
           GNSVS  NL+RLA +       ++ + A   L  F++ L   +    +   A       S
Sbjct: 562 GNSVSACNLIRLARLTGDI---HWEKRANEQLMAFKSSLSTHSAGYTMFLQALQYALAQS 618

Query: 749 RKHVVLVGHKSSVDFENM 766
           R+ +VL G     +   M
Sbjct: 619 RE-IVLAGPIQHAELSKM 635


>gi|28210673|ref|NP_781617.1| thymidylate kinase [Clostridium tetani E88]
 gi|28203111|gb|AAO35554.1| thymidylate kinase [Clostridium tetani E88]
          Length = 713

 Score =  509 bits (1312), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 281/681 (41%), Positives = 396/681 (58%), Gaps = 67/681 (9%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           N+  NRLA E SPYLLQHA+NPVDW+ WGEEAF +A++ D PIFLSIGYSTCHWCHVME 
Sbjct: 41  NRVPNRLAQEKSPYLLQHAYNPVDWYPWGEEAFQKAKEEDKPIFLSIGYSTCHWCHVMER 100

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE VAK+LND F+SIKVDREERPD+D +YMT+ QA+ G GGWPL++ ++PD KP  
Sbjct: 101 ESFEDEEVAKVLNDNFISIKVDREERPDIDNIYMTFCQAVTGSGGWPLTIIMTPDKKPFF 160

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFP ED+YG  G   IL+++ + W   R+++  S    ++ +S+ +S S       +
Sbjct: 161 AGTYFPKEDRYGVRGLMYILKEMSNQWKNNRELILNSSEKLLKDMSQYISVSQR-----E 215

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
           +L +  ++ C E L +SYD   GGF  APKFP   ++  +L + +  +D        E  
Sbjct: 216 DLNKEVIKECFEVLKESYDPIHGGFYDAPKFPTSHKLMFLLRYYRLYKD-------EEAL 268

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
            +V  TL+ M KGGI DH+G GF RYS D++W VPHFEKMLYD   L   Y + + +TK+
Sbjct: 269 NIVEKTLKSMYKGGIFDHIGYGFSRYSTDDKWLVPHFEKMLYDNAMLTIAYAEMYQITKE 328

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             Y  I    + Y+ RDM    G  +SAEDADS   EG     EG FYVWT +E+EDILG
Sbjct: 329 ELYKEIIEKTISYVIRDMKDKKGAFYSAEDADS---EGV----EGKFYVWTLEEIEDILG 381

Query: 459 -EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIE--LNDSSASASKLGMPLEK 513
            E A LF ++Y +   GN            F+G+N+  LIE  L D              
Sbjct: 382 KEDAKLFSKYYGITDRGN------------FEGENIPNLIETPLEDLEPDVK-------- 421

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
             + L   R+ LF  R KR  PH D K++ SWNGL+I++ A + ++LK            
Sbjct: 422 --DKLENIRKTLFINREKRIHPHKDTKILTSWNGLMIAALAYSGRVLK------------ 467

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
               RK+Y+E AE A  FI ++L DE   R+   +R+G     G L+DY+FLI  L++LY
Sbjct: 468 ----RKDYIESAEEAVKFIMKNLIDENG-RIYVRYRDGERAHKGHLEDYSFLIWALIELY 522

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           +    T+++  A+++     ELF D E  G+F+T  +   ++L++KE +D A PSGNSV+
Sbjct: 523 QSTFKTEYIEKALKINYDMIELFWDEENHGFFHTGKDGEELILKLKESYDSAIPSGNSVA 582

Query: 694 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVV 753
           + N+VRL+ I   SK D   +  + +L  F  R+K    +      +     + S + V+
Sbjct: 583 MYNMVRLSRITGDSKLD---EIIQQNLNYFSGRIKSTLESHTFFLISYMHYVLESEEIVI 639

Query: 754 LVGHKSSVDFENMLAAAHASY 774
           + G    + F+ M+   +  Y
Sbjct: 640 VKGEDEDI-FKAMIKVINEKY 659


>gi|374856309|dbj|BAL59163.1| hypothetical conserved protein [uncultured candidate division OP1
           bacterium]
          Length = 683

 Score =  509 bits (1312), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 276/682 (40%), Positives = 391/682 (57%), Gaps = 56/682 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
            +H NRL  E SPYLLQHA+NPVDW+ WGEEA  +AR+ D PI LSIGYS CHWCHVME 
Sbjct: 2   TQHPNRLVHETSPYLLQHAYNPVDWYPWGEEALHKARREDRPIVLSIGYSACHWCHVMER 61

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           E FE+  +A+ LN+ FVSIKVDREERPD+D++YMT VQ L G GGWPL+VFL+PDLKP  
Sbjct: 62  ECFENPQIAQYLNEHFVSIKVDREERPDLDEIYMTAVQLLTGQGGWPLTVFLTPDLKPFF 121

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
           GGTYFPPED++GRPGF T+L+ +   + K+R+ + +      EQL++ L A        +
Sbjct: 122 GGTYFPPEDRWGRPGFLTVLKAITALYQKEREKIVEQA----EQLTQYLQALQQPRPSSE 177

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
            L ++ ++       +S+D   GGFG APKFP  +E+ ++L +  +  D       ++  
Sbjct: 178 LLTRDLIQRAYLSALQSFDREHGGFGGAPKFPHSLELSLLLRYWHRTRD-------ADAL 230

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
            +V F+L+ MA+GGI+D +GGGFHRYSVD +W VPHFEKMLYD   L   YL+A+ +T+ 
Sbjct: 231 HVVEFSLEQMARGGIYDQLGGGFHRYSVDAQWAVPHFEKMLYDNALLVWTYLEAYQITQK 290

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             Y  +  + LDY+ R+M    G  F+++DADS +        EGAFY+WT +E+E +LG
Sbjct: 291 ALYRRVVEETLDYVLREMTSSAGGFFASQDADSPD-------GEGAFYLWTPEEIEAVLG 343

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
             A   K   Y    G   + R      EF               A+K+ M + +    L
Sbjct: 344 A-ADGAKACEYFGVAGGASVLRSPYTLEEF---------------AAKMKMTISECEGWL 387

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
              + KLF  R +RP+P  D+K++ +WNGL+IS+  RA ++L  E               
Sbjct: 388 ARVKEKLFAAREQRPKPARDEKMLTAWNGLMISALVRAYQVLGHE--------------- 432

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
            +Y+  A  AA F    LY +    L+HS ++G +K PG+LDDYAFLI  LLDLYE    
Sbjct: 433 -KYLHAAHDAAHFCLNSLYRDGA--LKHSCKDGIAKIPGYLDDYAFLILALLDLYESDFD 489

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
            +W+  A  L  T  E F D  GGG+F T+ +   + +R K  +DGA PSGNS + + L+
Sbjct: 490 LRWVHAAKTLSATLIEKFWDEHGGGFFFTSSDHEKLPVRPKSFYDGATPSGNSAATMALL 549

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHK 758
           RL  +   +     R  AE +L +    ++    A+  M  A D    P+ + + +VG +
Sbjct: 550 RLVELTGDAA---LRVKAEQTLRLCRDFMEQAPQALSYMLSALDFYLGPTTQ-IAIVGAR 605

Query: 759 SSVDFENMLAAAHASYDLNKTV 780
                +  + +  A +  NK V
Sbjct: 606 GDARTQQFVESIRARFLPNKIV 627


>gi|341899864|gb|EGT55799.1| hypothetical protein CAEBREN_04954 [Caenorhabditis brenneri]
          Length = 731

 Score =  509 bits (1311), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 282/673 (41%), Positives = 385/673 (57%), Gaps = 53/673 (7%)

Query: 101 HTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVES 160
           + NRL  E SPYLLQHA+NP+DW+ WGEEAF +A++ + PIFLS+GYSTCHWCHVME ES
Sbjct: 19  YKNRLGQEKSPYLLQHANNPIDWYPWGEEAFQKAKETNKPIFLSVGYSTCHWCHVMEKES 78

Query: 161 FEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGG 220
           FE+E  AK+LN+ FV+IKVDREERPDVDK+YM +V A  G GGWP+SVFL+PDL P+ GG
Sbjct: 79  FENENTAKILNENFVAIKVDREERPDVDKLYMAFVVAASGHGGWPMSVFLTPDLHPITGG 138

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFPP+D  G  GF TIL  +   W K+ + L   GA  I+ L   +  S   N+  D  
Sbjct: 139 TYFPPDDNRGMLGFPTILNMIHTEWQKEGENLRTRGAQIIKLLQPEMK-SGDVNRSED-- 195

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
                         ++DSR GGFG APKFP+  +   ++  +        S E  E   M
Sbjct: 196 ---VFESIYSHKKSTFDSRLGGFGRAPKFPKAPDFDFLIAFASS---QSNSKEKQESIMM 249

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT--KD 398
           +  TL+ MA GGIHDH+G GFHRYSVD  WH+PHFEKM+YDQ QL   Y +   LT  K 
Sbjct: 250 LQKTLESMADGGIHDHIGNGFHRYSVDSEWHIPHFEKMIYDQSQLLASYSEFHRLTEKKH 309

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
                +  DI +Y+++     GG  ++AEDADS  T  +T K EGAF  W   E++ +LG
Sbjct: 310 ENIKLVINDIFEYMQKISHKDGG-FYAAEDADSLPTHESTEKVEGAFCAWERDEIKQLLG 368

Query: 459 EHAI-------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPL 511
           E  I       +F +++ ++  GN  +++ SDPH E K KNVL +L      A+  G+ +
Sbjct: 369 EKKIESASLFDVFVDYFDVEENGN--VAKSSDPHGELKNKNVLRKLLTDEECATNHGITV 426

Query: 512 EKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNF 571
           E+  N + E R  L+  R+KRP PHLD K++ +W GL I+   +A +             
Sbjct: 427 EQLKNGIDEAREILWIARTKRPSPHLDSKMVTAWQGLAITGLVKAYQ------------- 473

Query: 572 PVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS--------FRNGPSKAPGFLDDYA 623
               ++  +Y+E AE  A+F+ ++L  E+   L+ S           G  +   F DDYA
Sbjct: 474 ---ATNEPKYVERAEKCAAFVEKYL--EENGELRRSVYLGDNGEVEQGNQRMKAFSDDYA 528

Query: 624 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 683
           FLI GLLDLY      ++L  +I+LQ T DE F    G GYF +   D  V +R+ ED D
Sbjct: 529 FLIQGLLDLYTVAGKNEYLERSIKLQKTCDEKFWS--GNGYFISEKSDEVVSVRMIEDQD 586

Query: 684 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 743
           GAEP+  S++  NL+R   I+   +++ YR+ A         RL  + +A+P M  A   
Sbjct: 587 GAEPTATSIASNNLLRFYDIL---ENEEYRERANQCFRGASERLNKIPIALPKMAVALQR 643

Query: 744 LSVPSRKHVVLVG 756
             + S    VLVG
Sbjct: 644 WQLGSTT-FVLVG 655


>gi|308274671|emb|CBX31270.1| Spermatogenesis-associated protein 20 [uncultured Desulfobacterium
           sp.]
          Length = 633

 Score =  509 bits (1311), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 266/613 (43%), Positives = 374/613 (61%), Gaps = 40/613 (6%)

Query: 108 EHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFEDEGVA 167
           E SPYLLQHA+NPV+W+ WG+EA   A K D PI LSIGYSTCHWCHVME ESF D  +A
Sbjct: 3   EKSPYLLQHAYNPVNWYPWGDEAINRAAKEDKPIILSIGYSTCHWCHVMENESFTDHEIA 62

Query: 168 KLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPED 227
           K++ND F+ IKVDREERPD+D++Y++ V AL G  GWPL+VFL+P LKP  GGTYFP E 
Sbjct: 63  KIMNDNFICIKVDREERPDLDRIYISAVTALTGSAGWPLNVFLTPKLKPFFGGTYFPAES 122

Query: 228 KYGRPGFKTILRKVKDAWDK---KRDMLAQSGAFAIEQLSEALSASASSNKL---PDELP 281
            +G   +  +L ++   W      +D+++ S     E++++ +  + S +K+    ++  
Sbjct: 123 NFGITSWPDLLNRITSVWKDPVVHKDIISSS-----EKITDIIIKNLSYDKVFSTAEKHK 177

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           Q+ L    +  S SYD ++ GFG APKFP P  I+ +L +    +   +   A     M 
Sbjct: 178 QSHLDDAFKYYSSSYDEKYAGFGKAPKFPSPSIIKFILAYFSYAKKINEPAVAKRTIDMA 237

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
            +TL+ MAKGGI+D + GGFHRYS DE+WH+PHFEKMLYD  QL NVYL+A+ +T D F+
Sbjct: 238 DYTLKAMAKGGIYDQLRGGFHRYSTDEKWHIPHFEKMLYDNAQLVNVYLEAYQITSDKFF 297

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADS-------AETEGATRKKEGAFYVWTSKEVE 454
           + I ++  DY+  DM    G  +SAEDADS         ++ A  K EGAFYVW+ KE++
Sbjct: 298 AQIAKETCDYILSDMTSSPGGFYSAEDADSYPGQISEKGSDDAHNKVEGAFYVWSKKELD 357

Query: 455 DILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
            IL E+ A +F   + +   GN       DPH  FK KN+L   +  + +A K  M  +K
Sbjct: 358 KILEENTAEIFSYFFGVMEEGNA----AHDPHGYFKKKNILYVKHSINETAKKYNMAPDK 413

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
              I+ + + KL   RS R RPHLDDK++ SWNGL+IS+FA+A K+L             
Sbjct: 414 VELIINDAKNKLLKARSSRERPHLDDKILTSWNGLMISAFAKAYKVL------------- 460

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
            GSD+  Y++ A++AA FI  +LYD+ T +L   +R G     G   DYAF I GL+DLY
Sbjct: 461 -GSDK--YLQAAKNAAEFIISNLYDKNTGKLFRRWREGERAVLGMGSDYAFYICGLIDLY 517

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGE-DPSVLLRVKEDHDGAEPSGNSV 692
           E  S  KWL  A+ L     +LF D +  G++ T+ + D ++++R K+D D   P+  SV
Sbjct: 518 ESDSDKKWLETAVMLSEEYIKLFYDEQFAGFYITSPDHDKNLIIRAKDDSDSVIPAHGSV 577

Query: 693 SVINLVRLASIVA 705
           ++ NL+RL+ I  
Sbjct: 578 AIQNLLRLSKITG 590


>gi|333922724|ref|YP_004496304.1| hypothetical protein Desca_0499 [Desulfotomaculum carboxydivorans
           CO-1-SRB]
 gi|333748285|gb|AEF93392.1| hypothetical protein Desca_0499 [Desulfotomaculum carboxydivorans
           CO-1-SRB]
          Length = 692

 Score =  508 bits (1307), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 281/680 (41%), Positives = 393/680 (57%), Gaps = 56/680 (8%)

Query: 98  RNKHT-NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           R +H  NRL  E SPYLLQHA+NPVDW+ WGEEAF +A++ + P+FLSIGYSTCHWCHVM
Sbjct: 3   RTEHKPNRLIHEKSPYLLQHAYNPVDWYPWGEEAFEKAKRENKPVFLSIGYSTCHWCHVM 62

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
           E ESFE E VA++LN ++V+IKVDREERPD+D++YMT  QAL G GGWPL++ ++PD KP
Sbjct: 63  ERESFESEDVAEVLNKYYVAIKVDREERPDIDQIYMTVCQALTGQGGWPLNIIMTPDQKP 122

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKL 276
              GTYFP    YG+PG   IL+++ D W K R  L       + +L+  +  + +  +L
Sbjct: 123 FFAGTYFPKNSNYGKPGLIDILQQIADLWAKDRQQLLGISDQLMARLN--MKTATAPGQL 180

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
             E+   A RL A    + +DS +GGFG+ PKFP P  + ++L   KK           +
Sbjct: 181 SPEVLDKAYRLFA----RHFDSTYGGFGNPPKFPTPHNLMLLLRCWKKTSQ-------KK 229

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
              MV  TL  M +GGI+DH+G GF RYS D RW VPHFEKMLYD   LA  +L+ + + 
Sbjct: 230 ALTMVEDTLDAMHRGGIYDHIGFGFSRYSTDRRWLVPHFEKMLYDNALLAIAFLETYQIN 289

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
           ++  +S + ++I  Y+ RDM  P G  +SAEDADS   EG     EG FYVW  +EVE +
Sbjct: 290 RNPRFSRVAKEIFTYVLRDMTAPEGGFYSAEDADS---EGV----EGKFYVWHPQEVEQV 342

Query: 457 LGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN-DSSASASKLGMPLEKY 514
           LG+    LF  +Y + P GN            F+G ++   +N D    A +L + LE  
Sbjct: 343 LGQIDGQLFCRYYDITPRGN------------FEGASIPNLINQDPLKFAQELDITLEDL 390

Query: 515 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 574
           ++ L +CR+ LF  R KR  PH DDK++ SWNGL+I++ AR +++L  E           
Sbjct: 391 VDGLEKCRQLLFAQREKRVHPHKDDKILTSWNGLMIAALARGARVLGDE----------- 439

Query: 575 GSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE 634
                +Y + AE A  FI  +L      RL   +R+G +  P +LDDYAFLI GLL+LYE
Sbjct: 440 -----KYSQAAEKAVDFIYHNL-QRADGRLLARYRDGEAAYPAYLDDYAFLIWGLLELYE 493

Query: 635 FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
                K L  A++L ++  +LF DR+ GG+F    +   ++ R KE +DGA PSGNSV+ 
Sbjct: 494 ATFDIKHLEQAVQLTDSMIDLFWDRQNGGFFFYGKDSEQLISRPKEIYDGAIPSGNSVAT 553

Query: 695 INLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVL 754
           +NL RLA +   ++   Y + A   L VF   L+   +       AA +   P  + +VL
Sbjct: 554 VNLFRLARLTGRNR---YEELATKQLQVFAGELEHYPIGYSYFMIAAYLNQEPPTE-IVL 609

Query: 755 VGHKSSVDFENMLAAAHASY 774
            G +     + M+      +
Sbjct: 610 SGKREDSALKQMIDVVQKEF 629


>gi|195120756|ref|XP_002004887.1| GI20164 [Drosophila mojavensis]
 gi|193909955|gb|EDW08822.1| GI20164 [Drosophila mojavensis]
          Length = 747

 Score =  508 bits (1307), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 276/636 (43%), Positives = 358/636 (56%), Gaps = 48/636 (7%)

Query: 90  TPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYST 149
           T   T     KHTNRLAA  SPYLLQHAHNPVDW+ W EEAF  AR  +  IFLS+GYST
Sbjct: 3   TGGETKAETPKHTNRLAASKSPYLLQHAHNPVDWYPWCEEAFERARSENKLIFLSVGYST 62

Query: 150 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 209
           CHWCHVME ESFED   A+++N  FV+IKVDREERPD+DKVYM ++    G GGWP+SV+
Sbjct: 63  CHWCHVMEHESFEDAATAEVMNKHFVNIKVDREERPDIDKVYMQFLLMSKGSGGWPMSVW 122

Query: 210 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 269
           L+PDL+PL  GTYFPP+ +YG P F  +L  +   W   RD L ++G+  ++ +    SA
Sbjct: 123 LTPDLEPLAAGTYFPPKPRYGMPSFTMVLESIAKKWVADRDSLKKAGSTLLQAMQTNQSA 182

Query: 270 SASSNKLPDELPQNALRLCAEQLSKS-YDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 328
             S+    +    +A    A  + K  +D +  GFG  PKFP    +  + +     +D 
Sbjct: 183 GTSAEMAFERGSGDAKLAEAVAVHKQRFDQQHAGFGREPKFPEVPRLNFLFHAYLVTKDV 242

Query: 329 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 388
                  +   MVL TL  + +GGI+DH+ GGF RY+    WH  HFEKMLYDQGQL   
Sbjct: 243 -------DVLDMVLQTLDHIGRGGINDHIFGGFARYATTRDWHNVHFEKMLYDQGQLMAA 295

Query: 389 YLDAFSLTKDV-FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 447
           Y +A+ LT+   F  Y  R I +YL +D+  P G  ++ EDADS  T   T K EGAFY 
Sbjct: 296 YANAYKLTRSKEFLGYADR-IYEYLIKDLRHPAGGFYAGEDADSLPTHEDTVKVEGAFYA 354

Query: 448 WTSKEVEDILGEHAILFKE------------HYYLKPTGNCDLSRMSDPHNEFKGKNVLI 495
           WT  EV+    +    FK+            HY LKP+GN  +S  SDPH    GKN+LI
Sbjct: 355 WTWDEVKQAFQKEESCFKDISAARAFEIYSFHYDLKPSGN--VSPSSDPHGHLTGKNILI 412

Query: 496 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 555
                  + S   M LEK   +L      L  +R +RPRPHLD K+I  WNGLV+S  A+
Sbjct: 413 VRGSEEDTCSNFNMELEKLQQLLRTANEILHKIRDQRPRPHLDTKIICGWNGLVLSGLAK 472

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS-------- 607
            +    ++              R  Y+  A+    F+R+HLYDE    L  S        
Sbjct: 473 LANCGTAK--------------RDAYLATAKQLMEFVRKHLYDEDEKLLLRSCYGAGVAD 518

Query: 608 --FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYF 665
                  ++  GFLDDYAFLI GLLD Y+     + L W+  LQ TQD+LF D + G YF
Sbjct: 519 DTLEQNATRIEGFLDDYAFLIKGLLDYYKASLEMEALNWSKTLQETQDKLFWDEDKGAYF 578

Query: 666 NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
            +    P+V++R+KEDHDGAEP GNSV+  NL  L+
Sbjct: 579 FSQQNAPNVIVRLKEDHDGAEPCGNSVAARNLTLLS 614


>gi|333374035|ref|ZP_08465926.1| thymidylate kinase [Desmospora sp. 8437]
 gi|332968513|gb|EGK07575.1| thymidylate kinase [Desmospora sp. 8437]
          Length = 702

 Score =  507 bits (1306), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 293/693 (42%), Positives = 397/693 (57%), Gaps = 53/693 (7%)

Query: 84  VAMAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFL 143
           V +A+R     S+   +  NRL  E SPYLLQHA+NPVDW+ W + AFA+ARK D PIFL
Sbjct: 3   VPLAKREVEKLSNHEGREPNRLIQEKSPYLLQHAYNPVDWYPWSDAAFAKARKEDKPIFL 62

Query: 144 SIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGG 203
           SIGYSTCHWCHVME ESFED  VA+LLN  +++IKVDREERPDVD +YM+  QAL G GG
Sbjct: 63  SIGYSTCHWCHVMERESFEDVEVAQLLNREYIAIKVDREERPDVDNIYMSVCQALTGHGG 122

Query: 204 WPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL 263
           WPL++ ++P+ +P   GTYFP +   G  G   IL +V  AW ++R+ +  +G      +
Sbjct: 123 WPLTIIMTPEKEPFFAGTYFPKQAVQGMQGLMEILGQVARAWREEREQVLDAGRKITRAV 182

Query: 264 SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK 323
              L  S S +   +EL +        Q   +YD ++GGFG+APKFPRP ++  +L + K
Sbjct: 183 QTQLKVSESGDLGKEELAE-----AYRQFKSTYDPQYGGFGTAPKFPRPHDLLFLLRYWK 237

Query: 324 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 383
                 +SGE      MV  TL  M +GGI+DHVG GF RY+VD  W VPHFEKMLYD  
Sbjct: 238 ------ESGEPF-ALSMVEETLDGMRRGGIYDHVGFGFARYAVDREWLVPHFEKMLYDNA 290

Query: 384 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 443
            LA  YL+A+ +TK   Y+   R+I  Y+ R M  P G  +SAEDADS   EG    +EG
Sbjct: 291 LLAYAYLEAYQVTKKDAYAGTAREIFTYVLRGMTSPEGGFYSAEDADS---EG----EEG 343

Query: 444 AFYVWTSKEVEDILGEHA-ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 502
            FYVW   EV+++LGE A  LF E Y + P GN +  +MS P+   +  + L E+ D   
Sbjct: 344 KFYVWNPSEVKEVLGEEAGELFCECYDITPHGNFE-QKMSIPN---RIHSSLQEIAD--- 396

Query: 503 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 562
              + G  +E+    L   R KLF  R +R  PH DDK++ SWNGL+I++ A+ +++L  
Sbjct: 397 ---RRGRDVEELREQLEVSREKLFRAREERVHPHKDDKILTSWNGLMIAALAKGARVLGD 453

Query: 563 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 622
           E+                Y E AE AASFI   L DE+  RL   +R+G +  PG++DDY
Sbjct: 454 ES----------------YAEAAEKAASFILERLRDEKG-RLLARYRDGEAAIPGYVDDY 496

Query: 623 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 682
           AFL+ GL++LYE     ++L  A+EL     ELF D E GG + T  +   +L R KE +
Sbjct: 497 AFLVWGLIELYEATFRPRYLKSALELTREMLELFGDEEEGGLYFTGRDAEKLLTRTKEVY 556

Query: 683 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 742
           DGA PSGNSV+ +NL RLA +   +     R+ A+  +  F   +     A      A  
Sbjct: 557 DGAVPSGNSVAALNLARLARLTGDTG---LREQADRQIRAFAGSVGQAPTAFSFFLTAVQ 613

Query: 743 -MLSVPSRKHVVLVGHKSSVDFENMLAAAHASY 774
             L  P  K +V+ G     D E M+     ++
Sbjct: 614 FFLGTP--KEIVIAGPDGDHDTELMIRRVQQAF 644


>gi|341876361|gb|EGT32296.1| hypothetical protein CAEBREN_30752 [Caenorhabditis brenneri]
          Length = 745

 Score =  506 bits (1303), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 285/691 (41%), Positives = 389/691 (56%), Gaps = 67/691 (9%)

Query: 101 HTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVES 160
           + NRL  E SPYLLQHA+NP+DW+ WGEEAF +A++ + PIFLS+GYSTCHWCHVME ES
Sbjct: 19  YKNRLGQEKSPYLLQHANNPIDWYPWGEEAFQKAKETNKPIFLSVGYSTCHWCHVMEKES 78

Query: 161 FEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGG 220
           FE+E  AK+LN+ FV+IKVDREERPDVDK+YM +V A  G GGWP+SVFL+PDL P+ GG
Sbjct: 79  FENENTAKILNENFVAIKVDREERPDVDKLYMAFVVAASGHGGWPMSVFLTPDLHPITGG 138

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFPP+D  G  GF TIL  +   W K+ + L   GA  I+ L   +  S   N+  D  
Sbjct: 139 TYFPPDDNRGMLGFPTILNMIHTEWQKEGENLRTRGAQIIKLLQPEIK-SGDVNRSED-- 195

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
                +        ++DSR GGFG APKFP+  +   ++  +        S E  E   M
Sbjct: 196 ---VFKSIYSHKKSTFDSRLGGFGRAPKFPKAPDFDFLIAFAS---SQSNSEEKQESIMM 249

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
           +  TL+ MA GGIHDH+G GFHRYSVD  WH+PHFEKM+YDQ QL   Y +  SLT+   
Sbjct: 250 LQKTLESMADGGIHDHIGNGFHRYSVDSEWHIPHFEKMIYDQSQLLASYSEFHSLTEKKH 309

Query: 401 YSY--ICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
            S   +  DI +Y+++     GG  ++AEDADS  T  +T K EGAF  W   E++ +LG
Sbjct: 310 ESIKLVINDIFEYMQKISHKDGG-FYAAEDADSLPTHESTEKVEGAFCAWERDEIKQLLG 368

Query: 459 EHAI-------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPL 511
           E  I       +F +++ ++  GN  +++ SDPH E K KNVL +L      A+  G+ +
Sbjct: 369 EKKIESASLFDVFVDYFDVEENGN--VAKSSDPHGELKNKNVLRKLLTDEECATNHGITV 426

Query: 512 EKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNF 571
           E+  N + E R  L+  R+KRP PHLD K++ +W GL I+   +A +             
Sbjct: 427 EQLKNGIDEAREILWIARTKRPSPHLDSKMVTAWQGLAITGLVKAYQ------------- 473

Query: 572 PVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS--------FRNGPSKAPGFLDDYA 623
               ++  +Y+E AE  A+F+ ++L  E+   L+ S           G  +   F DDYA
Sbjct: 474 ---ATNEPKYLERAEKCAAFVEKYL--EENGELRRSVYLGDNGEVEQGNQRMKAFSDDYA 528

Query: 624 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE--- 680
           FLI GLLDLY      ++L   IELQ T DE F    G GYF +   D  V +R+ E   
Sbjct: 529 FLIQGLLDLYTVAGKNEYLERCIELQKTCDEKFWS--GNGYFISEKSDEEVSVRMIEGKI 586

Query: 681 -----------DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKD 729
                      D DGAEP+  S++  NL+R   I+   +++ YR+ A         RL  
Sbjct: 587 ILSNFYKKNFSDQDGAEPTATSIASNNLLRFYDIL---ENEEYREKANQCFRGASERLNK 643

Query: 730 MAMAVPLMCCAADMLSVPSRKHVVLVGHKSS 760
           + +A+P M  A     + S    VLVG  +S
Sbjct: 644 IPIALPKMAVALQRWQLGSTT-FVLVGDPTS 673


>gi|298710386|emb|CBJ25450.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 808

 Score =  505 bits (1301), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 309/730 (42%), Positives = 411/730 (56%), Gaps = 76/730 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYLLQHAHNPVDW  WG+EAF+ A++ D PIFLS+GYSTCHWCHVME ESFE
Sbjct: 24  NRLAEETSPYLLQHAHNPVDWMPWGQEAFSRAKEEDKPIFLSVGYSTCHWCHVMERESFE 83

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
            + VAK+LN+ FVSIKVDREERPDVD+ +MT+VQA  GGGGWP+SV+L+PDLKP +G TY
Sbjct: 84  SQTVAKVLNENFVSIKVDREERPDVDQCFMTFVQATSGGGGWPMSVWLTPDLKPFVGATY 143

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL-- 280
           FP         F +IL+ + D W   R+ + + G   +  L E LS +A+++  P     
Sbjct: 144 FPEMR------FVSILKTLADKWSSDREEVVKQGDHIVRLLQERLSETAAASGDPLAFLA 197

Query: 281 ---PQNALRLCAEQLSKSYDSRFGGFGSAP---KFPRPVEIQMMLYHSKKLEDTGKSGEA 334
               + A+R     L K +D   GG+G      KFP+P  + ++L  + +LE  G S   
Sbjct: 198 LDKSREAVREGVRVLDKGHDDVLGGWGGGRGGMKFPQPSRMNLLL-RAHRLEGEG-SALG 255

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
           +    MV  TL+ MAKGGI+D++  GF RYS D RWHVPHFEKMLYDQ QL   Y++AF 
Sbjct: 256 ARALAMVETTLKAMAKGGIYDYLFDGFARYSTDPRWHVPHFEKMLYDQSQLVTAYVEAFQ 315

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
           +T D  Y+ + R +L Y+ RDM   GG  +SAEDADS   EGAT KKEGAF VWT  ++ 
Sbjct: 316 VTGDTAYADVARGVLRYVLRDMTDEGGGFYSAEDADSLPFEGATEKKEGAFCVWTEPDLR 375

Query: 455 DIL-GEHAI--------------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND 499
            +L GE  +              LF   Y ++P GN D +   D H E   +NVL +   
Sbjct: 376 RLLDGEEGVALPGEGGQTVPVSSLFCRVYGVRPEGNVDPA--VDAHGELTSQNVLFKSET 433

Query: 500 SSASASKLGMPL--EKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 557
              +A  LG+    E+    +   R  L   R KRP PHLDDKV+ SWNGL+IS+ ARAS
Sbjct: 434 VRVAAEALGLTCSGEEAEAAMTGARATLVAARRKRPAPHLDDKVLTSWNGLMISALARAS 493

Query: 558 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY------DEQTHRLQHSFRNG 611
           +          F+      +   Y+  A  AA F+R +LY       E    L  S+RNG
Sbjct: 494 Q---------AFSSSPPSEESLAYLGAATKAAEFVRENLYRSGSGDGETAGTLLRSWRNG 544

Query: 612 -PSKAPGFLDDYAFLISGLLDLYEF----GSGTKWLVWAIELQNTQDELFL--DREGGGY 664
             S   GF DDYAFLI GL+DLYE      +G +WL WA ELQ   DE F      GGGY
Sbjct: 545 RASPVEGFADDYAFLIRGLIDLYEADPRRDTGWRWLRWARELQAEMDEGFKCPSEAGGGY 604

Query: 665 FN-----TTGEDPS------------VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGS 707
           ++     + GE               +  R++ D+DGAEP   SV+  NL+RL+    G 
Sbjct: 605 YSSRALESEGETKGDGETEGGSGSGVLPYRLRTDYDGAEPGAGSVAADNLLRLSGYFGGE 664

Query: 708 KSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENML 767
           +    R+ A   LA     L +   A P +  A+ + ++   K V++ G  +  + + ++
Sbjct: 665 EGKVLREKAAEQLAA-AFALPETPQAYPEL-TASLVTALLGPKQVIISGDPAGAETQALM 722

Query: 768 AAAHASYDLN 777
           +AA  S+  N
Sbjct: 723 SAAQRSFCPN 732


>gi|323703366|ref|ZP_08115015.1| protein of unknown function DUF255 [Desulfotomaculum nigrificans
           DSM 574]
 gi|323531635|gb|EGB21525.1| protein of unknown function DUF255 [Desulfotomaculum nigrificans
           DSM 574]
          Length = 692

 Score =  504 bits (1299), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 280/680 (41%), Positives = 391/680 (57%), Gaps = 56/680 (8%)

Query: 98  RNKHT-NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           R +H  NRL  E SPYLLQHA+NPVDW+ WGEEAF +A++ + P+FLSIGYSTCHWCHVM
Sbjct: 3   RTEHKPNRLIHEKSPYLLQHAYNPVDWYPWGEEAFEKAKRENKPVFLSIGYSTCHWCHVM 62

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
           E ESFE E VA++LN ++V+IKVDREERPD+D++YMT  QAL G GGWPL++ ++PD KP
Sbjct: 63  ERESFESEDVAEVLNKYYVAIKVDREERPDIDQIYMTVCQALTGQGGWPLNIIMTPDQKP 122

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKL 276
              GTYFP    YG+PG   IL+++ D W K R  L        +QL   L+   ++   
Sbjct: 123 FFAGTYFPKNSNYGKPGLIDILQQIADLWAKNRQQLLGIS----DQLMARLNMKTATA-- 176

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
           P +L    L       ++ +DS +GGFG+ PKFP P  + ++L   KK           +
Sbjct: 177 PGQLSPEVLDKAYLLFARHFDSTYGGFGNPPKFPTPHNLMLLLRCWKKTSQ-------KK 229

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
              MV  TL  M +GGI+DH+G GF RYS D RW VPHFEKMLYD   LA  +L+ + + 
Sbjct: 230 ALTMVEDTLDAMHRGGIYDHIGFGFSRYSTDRRWLVPHFEKMLYDNALLAIAFLETYQIN 289

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
           ++  +S + ++I  Y+ RDM  P G  +SAEDADS   EG     EG FYVW  +EVE +
Sbjct: 290 RNPRFSRVAKEIFTYVLRDMTAPEGGFYSAEDADS---EGV----EGKFYVWHPQEVEQV 342

Query: 457 LGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN-DSSASASKLGMPLEKY 514
           LG+    LF  +Y + P GN            F+G ++   +N D    A +L + LE  
Sbjct: 343 LGQIDGQLFCRYYDITPRGN------------FEGASIPNLINQDPLKFAQELDITLEDL 390

Query: 515 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 574
           ++ L +CR+ LF  R KR  PH DDK++ SWNGL+I++ AR +++L  E           
Sbjct: 391 VDGLEKCRQLLFAQREKRVHPHKDDKILTSWNGLMIAALARGARVLGDE----------- 439

Query: 575 GSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE 634
                +Y + AE A  FI  +L      RL   +R+G +  P +LDDYAFLI GLL+LYE
Sbjct: 440 -----KYSQAAEKAVDFIYHNL-QRADGRLLARYRDGEAAYPAYLDDYAFLIWGLLELYE 493

Query: 635 FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
                K L  A++L ++  +LF DR+ GG+F    +   ++ R KE +DGA PSGNSV+ 
Sbjct: 494 ATFDIKHLEQAVQLTDSMIDLFWDRQNGGFFFYGKDSEQLISRPKEIYDGAIPSGNSVAT 553

Query: 695 INLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVL 754
           +NL RLA +   ++ + Y + A   L VF   L+   +       AA +   P  + +VL
Sbjct: 554 VNLFRLARL---TERNRYEELATKQLQVFAGELEHYPIGYSYFMIAAYLNQEPPTE-IVL 609

Query: 755 VGHKSSVDFENMLAAAHASY 774
            G +     + M+      +
Sbjct: 610 SGKREDSALKQMIDVVQKEF 629


>gi|20129985|ref|NP_610953.1| CG8613 [Drosophila melanogaster]
 gi|7303195|gb|AAF58258.1| CG8613 [Drosophila melanogaster]
 gi|60677913|gb|AAX33463.1| RE10908p [Drosophila melanogaster]
          Length = 808

 Score =  504 bits (1298), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 282/689 (40%), Positives = 382/689 (55%), Gaps = 77/689 (11%)

Query: 66  FRRPLAVISH----RPIHPYKVVAMAERTPASTSHSRN---KHTNRLAAEHSPYLLQHAH 118
           FRR L ++ +    RP+   K   MA    AS   S+    K  NRL A  SPYLLQHA+
Sbjct: 33  FRRNLRLLHNSCRSRPVSNQKFRTMATGGEASKEVSKEEPAKQGNRLVASKSPYLLQHAY 92

Query: 119 NPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIK 178
           NPVDW+ WGEEAF +AR  +  IFLS+GYSTCHWCHVME ESFE+   A ++N+ FV+IK
Sbjct: 93  NPVDWYPWGEEAFEKARSENKIIFLSVGYSTCHWCHVMEHESFENPETAAIMNENFVNIK 152

Query: 179 VDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTIL 238
           VDREERPD+DK+YM ++    G GGWP+SV+L+P L PL+ GTYFPP+ +YG P F T+L
Sbjct: 153 VDREERPDIDKIYMQFLLMSKGSGGWPMSVWLTPTLAPLVAGTYFPPKSRYGMPSFNTVL 212

Query: 239 RKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL--CAEQLSKS- 295
           + +   W+  ++ L  +G+  +  L +   ASA        +P+ A       E+LS++ 
Sbjct: 213 KSIARKWETDKESLLATGSSLLSALQKNQDASA--------VPEAAFGAGSAIEKLSEAI 264

Query: 296 ------YDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMA 349
                 +D   GGFGS PKFP    +  + +     +D        +   MV+ TL  + 
Sbjct: 265 NVHRQRFDQTHGGFGSEPKFPEVPRLNFLFHGYLVTKD-------PDVLDMVIETLTQIG 317

Query: 350 KGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDIL 409
           KGGIHDH+ GGF RY+  + WH  HFEKMLYDQGQL   + +A+ +T+D  Y      I 
Sbjct: 318 KGGIHDHIFGGFARYATTQDWHNVHFEKMLYDQGQLMMAFANAYKVTRDEIYLRYADKIH 377

Query: 410 DYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE-----------DILG 458
            YL +D+  P G  ++ EDADS  T     K EGAFY WT  E++           DI  
Sbjct: 378 KYLIKDLRHPLGGFYAGEDADSLPTHEDKVKVEGAFYAWTWDEIQAAFKDQAQRFDDITP 437

Query: 459 EHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
           E A  ++  HY LKP GN  +   SDPH    GKN+LI       + +   +  +++  +
Sbjct: 438 ERAFEIYAYHYGLKPPGN--VPAYSDPHGHLTGKNILIVRGSEEDTCANFKLEEDRFKKL 495

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
           L      L  +R KRPRPHLD K+I +WNGLV+S   +                    ++
Sbjct: 496 LATTNDILHVIRDKRPRPHLDTKIICAWNGLVLSGLCKLGN--------------CYSAN 541

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRL----------QHSFRNGPSKAPGFLDDYAFLIS 627
           R++YM+ A+    F+R+ +YD +   L            +     S+  GFLDDYAFLI 
Sbjct: 542 REQYMQTAKELLDFLRKEMYDPEQKLLIRSCYGVAVGDETLEKNASQIDGFLDDYAFLIK 601

Query: 628 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 687
           GLLD Y+       L WA  LQ+TQD+LF D   G YF +  + P+V++R+KEDHDGAEP
Sbjct: 602 GLLDYYKATLDVDVLHWAKALQDTQDKLFWDERNGAYFFSQQDAPNVIVRLKEDHDGAEP 661

Query: 688 SGNSVSVINLVRLASIVAGSKSDYYRQNA 716
            GNSVS  NLV LA         YY +NA
Sbjct: 662 CGNSVSAHNLVLLAH--------YYDENA 682


>gi|410980751|ref|XP_003996739.1| PREDICTED: spermatogenesis-associated protein 20 [Felis catus]
          Length = 773

 Score =  503 bits (1295), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 298/725 (41%), Positives = 399/725 (55%), Gaps = 73/725 (10%)

Query: 80  PYKVVAMAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDV 139
           P        RT  S S +  K  NRL  E SPYLLQHA+NPVDW+ WG EAF +ARK + 
Sbjct: 44  PMPAGGKGSRTNCSPS-TPQKVPNRLINEKSPYLLQHAYNPVDWYPWGPEAFDKARKENK 102

Query: 140 PIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALY 199
           PIFLS+GYSTCHWCH+ME ESF++E + +LL++ FVS+KVDREERPDVDKVYMT++Q   
Sbjct: 103 PIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREERPDVDKVYMTFIQVSS 162

Query: 200 GGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFA 259
               W             +GG   PP   +        L +    W + ++ L ++    
Sbjct: 163 VSTYW------------AVGGXXXPPPTPHADLQVCPCLPQ----WKQNKNTLLENS--- 203

Query: 260 IEQLSEALSASASSNKLPDELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQ 316
            ++++ AL A +  +    +LP +   +   C +QL +SYD  +GGF  APKFP PV + 
Sbjct: 204 -QRVTAALLARSEISMGDRQLPPSGATMNSRCFQQLDESYDEEYGGFAEAPKFPTPVILS 262

Query: 317 MML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPH 374
            +   + S +L   G     S  Q+M L TL+ MA GGI DHVG GFHRYS D +WH+PH
Sbjct: 263 FLFSYWLSHRLTQDG-----SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHIPH 317

Query: 375 FEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET 434
           FEKMLYDQ QLA  Y  AF ++ D FYS + R IL Y+ R++    G   SAEDADS   
Sbjct: 318 FEKMLYDQAQLAVAYSQAFQISGDEFYSDVARGILQYVARNLSHRSGGFCSAEDADSPPE 377

Query: 435 EGATRKKEGAFYVWTSKEVEDILGE----------HAILFKEHYYLKPTGNCDLSRMSDP 484
            G  + KEGAFYVWT KEV+ +L E             L  +HY L   GN  +S   DP
Sbjct: 378 RG-MQPKEGAFYVWTVKEVQQLLSEPVPGATEPLTSGQLLMKHYGLTEAGN--ISPSQDP 434

Query: 485 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 544
             E  G+NVL        +A++ G+ +E    +L     KLF  R  RPRPHLD K++ S
Sbjct: 435 KGELHGRNVLTVRYSLELTAARFGLDVEAVRTLLNTGLEKLFQARKHRPRPHLDSKMLAS 494

Query: 545 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 604
           WNGL++S FA    +L  E    + N+             A + A F++RH++D  + RL
Sbjct: 495 WNGLMVSGFAVTGAVLGLE---RLINY-------------ATNGAKFLKRHMFDVASGRL 538

Query: 605 QHSFRNGP------SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELF 656
             +   G       S  P  GFL+DYAF++ GLLDLYE    + WL WA+ LQ+ QD LF
Sbjct: 539 MRTCYAGSGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESSWLEWALRLQDAQDRLF 598

Query: 657 LDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQN 715
            D +GGGYF +  E  + L LR+K+D DGAEPS NSVS  NL+RL     G K   +   
Sbjct: 599 WDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHGFT-GHKD--WMDK 655

Query: 716 AEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYD 775
               L  F  RL+ + +A+P M  A       + K +V+ G   + D + +L   H+ Y 
Sbjct: 656 CVSLLTAFSERLRRVPVALPEMVRALSAHQQ-TLKQIVICGDPQAKDTKALLQCVHSIYI 714

Query: 776 LNKTV 780
            NK +
Sbjct: 715 PNKVL 719


>gi|283778260|ref|YP_003369015.1| hypothetical protein Psta_0467 [Pirellula staleyi DSM 6068]
 gi|283436713|gb|ADB15155.1| protein of unknown function DUF255 [Pirellula staleyi DSM 6068]
          Length = 709

 Score =  503 bits (1295), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 284/677 (41%), Positives = 392/677 (57%), Gaps = 64/677 (9%)

Query: 96  HSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHV 155
           H      NRLA+E SPYLLQH +NPVDW+ W  EA   +R  D PIFLSIGYS CHWCHV
Sbjct: 6   HCETTMPNRLASESSPYLLQHQNNPVDWYPWSSEALERSRAEDKPIFLSIGYSACHWCHV 65

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           ME ESFE + +A  LN+ FV IKVDREERPD+D++YM  VQ + G GGWP+SVFL+P+ K
Sbjct: 66  MEHESFESQEIADYLNEHFVCIKVDREERPDLDQIYMDAVQLMTGRGGWPMSVFLTPEGK 125

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDM-LAQSGAFAIEQLSEALSASASSN 274
           P  GGTY+PP D+ G PGF  ++R V DAW  +R+  L+Q+      +L++ L + A+SN
Sbjct: 126 PFFGGTYWPPTDRQGMPGFSRVIRAVIDAWKNRREQALSQA-----TELTDHLGSLATSN 180

Query: 275 KLPDELPQNALR--------LCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 326
             P +LP +  R          A +LS+++DSR+GGFGSAPKFP  ++++++L   ++  
Sbjct: 181 T-PAQLPLSVSRSMVDGWMETAAARLSRAFDSRYGGFGSAPKFPHSMDLELLLLEWQR-- 237

Query: 327 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 386
                    +  +M L TL+ M+ GGI+DH+GGGF RYSVDERW VPHFEKMLYD   L 
Sbjct: 238 -----SARVDVAEMTLVTLEKMSAGGIYDHLGGGFARYSVDERWLVPHFEKMLYDNSLLL 292

Query: 387 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 446
              + A+  T D  ++   R+  +YL RDM    G I+S EDADS   EG    +EG FY
Sbjct: 293 RALVRAYQATGDAKFAATMRETCNYLLRDMTDELGGIYSTEDADS---EG----EEGKFY 345

Query: 447 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 505
           VW   E+ ++LG E    F + Y + P GN            F+    ++ L+ S A  S
Sbjct: 346 VWKPAEIYEVLGPERGSRFCQVYDVAPGGN------------FEHGFSILNLSRSIADWS 393

Query: 506 KLG-MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 564
           +L  MPLE   N L E R  LFDVR KR  P  DDK++ SWN L I + A  + +L    
Sbjct: 394 RLWEMPLEVLSNELAEDRAILFDVREKRVHPGKDDKILTSWNALAIDALAEVAGVL---- 449

Query: 565 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 624
                       D   Y+  A+ AA F+ +HL D    RL H++R+G +K   +LDDYA+
Sbjct: 450 ------------DEPRYLLAAQRAADFVLQHLRDSDG-RLLHTWRHGRAKLAAYLDDYAY 496

Query: 625 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 684
           L+  L+ LYE    T+WL  A+EL +     F D E GG+F T  +  +++ R K+ HDG
Sbjct: 497 LVHALVSLYEADFHTRWLSAAVELADQMIAHFSDHERGGFFFTADDHEALITRAKDMHDG 556

Query: 685 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 744
           + PSG+S++ + L RL  I        Y   +E ++      +     A  +M  AAD+L
Sbjct: 557 SVPSGSSMAALALARLGKITGKQA---YLLASERAILAASGSVTANPTASAVMIQAADLL 613

Query: 745 SVPSRKHVVLVGHKSSV 761
             P+ + +VL G ++ V
Sbjct: 614 VGPTSE-IVLAGPEAEV 629


>gi|330916342|ref|XP_003297383.1| hypothetical protein PTT_07767 [Pyrenophora teres f. teres 0-1]
 gi|311329963|gb|EFQ94518.1| hypothetical protein PTT_07767 [Pyrenophora teres f. teres 0-1]
          Length = 747

 Score =  503 bits (1295), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 282/676 (41%), Positives = 384/676 (56%), Gaps = 29/676 (4%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K  NRL+   SPY+  H +NPV W  WG EA   A+K +  IF+SIGY+ CHWCHVME E
Sbjct: 18  KLKNRLSESRSPYVRGHMNNPVAWQMWGPEAIELAKKSNRLIFISIGYAACHWCHVMERE 77

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFE++ VA LLN+ F+ IK+DREERPDVD++YM YVQA  G GGWPL+ F++PDL+P+ G
Sbjct: 78  SFENDEVANLLNENFIPIKIDREERPDVDRIYMNYVQATTGSGGWPLNAFITPDLEPIFG 137

Query: 220 GTYFP-PEDKYG---RPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL---SEALSASAS 272
           GTY+P P          GF  IL K++D W  +R    +S      QL   +E  + S  
Sbjct: 138 GTYWPGPGSTMAMGEHIGFVGILEKIRDVWRDQRQRCLESAKEITAQLRDFAEDGNISRK 197

Query: 273 SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK---KLEDTG 329
               P+ L  + L    E   K YD    GFG APKFP P  ++ +L  S+    + +  
Sbjct: 198 DGAAPEGLDLDTLDEAYEHFKKRYDKAHAGFGGAPKFPTPSNLRFLLKLSQYPSAVREVL 257

Query: 330 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 389
            + + +  + M L TL  M KGGIHD +G GF RYSV + W +PHFEKMLYDQ QL  VY
Sbjct: 258 GAKDCTHAKDMALATLDAMNKGGIHDQIGNGFARYSVTKDWSLPHFEKMLYDQAQLLPVY 317

Query: 390 LDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 448
           LDA+ +T+   +     DI  YL    M    G  FS+EDADS        K+EGAFYVW
Sbjct: 318 LDAYLMTRSPEHLSAVHDIAAYLTSPPMQAESGGFFSSEDADSLYRPNDKEKREGAFYVW 377

Query: 449 TSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 507
           T KE + ILG+  A +   +Y +K  GN  ++   D H+E   +NVL         A + 
Sbjct: 378 TLKEFQQILGDRDAEILARYYNVKDEGN--VAPEHDAHDELINQNVLAITTTKPDLAQQF 435

Query: 508 GMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 566
           G+  ++  NIL E R+KL D R+K RPRP LDDK++VSWNGL I + AR S  L S+  +
Sbjct: 436 GLSEDEVNNILEEGRQKLLDHRNKERPRPGLDDKIVVSWNGLAIGALARTSAALSSQDPT 495

Query: 567 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 626
                       ++Y+  AE AASF+R HLY+  +  L   +R GP  APGF DDYA+LI
Sbjct: 496 R----------SQKYLAAAEKAASFLRAHLYNPTSKTLIRVYREGPGDAPGFADDYAYLI 545

Query: 627 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 686
           SGL+DLYE      +L WA +LQ TQ  +F D++  G+F+T  +   +++R+K+  D AE
Sbjct: 546 SGLIDLYEATFNDTYLQWADDLQQTQLAMFWDKQHLGFFSTPEDQKDLIMRLKDGMDNAE 605

Query: 687 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 746
           P  N VS  NL RL +++   + + Y + A  + + FE  +       P M  A  ++  
Sbjct: 606 PGTNGVSAQNLDRLGALL---EHEDYTKKARDTASAFEAEIMQHPFLFPTMMDAV-VVGK 661

Query: 747 PSRKHVVLVGHKSSVD 762
               H V+ G    V+
Sbjct: 662 LGNSHSVITGEGKKVE 677


>gi|189195556|ref|XP_001934116.1| hypothetical protein PTRG_03783 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187979995|gb|EDU46621.1| hypothetical protein PTRG_03783 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 748

 Score =  503 bits (1295), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 281/676 (41%), Positives = 384/676 (56%), Gaps = 29/676 (4%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K  NRL+   SPY+  H +NPV W  WG EA   A+K +  IF+SIGY+ CHWCHVME E
Sbjct: 19  KLKNRLSESRSPYVRGHMNNPVAWQMWGPEAIELAKKSNRLIFISIGYAACHWCHVMERE 78

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFE++ VAKLLN+ F+ IK+DREERPDVD++YM YVQA  G GGWPL+ F++PDL+P+ G
Sbjct: 79  SFENDEVAKLLNENFIPIKIDREERPDVDRIYMNYVQATTGSGGWPLNAFITPDLEPIFG 138

Query: 220 GTYFP-PEDKYG---RPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL---SEALSASAS 272
           GTY+P P          GF  IL K++D W  +R    +S      QL   +E  + S  
Sbjct: 139 GTYWPGPGSTMAMGEHIGFVGILEKIRDVWRDQRQRCLESAKEITAQLRDFAEDGNISRK 198

Query: 273 SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK---KLEDTG 329
               P+ L  + L    E   K YD    GFG APKFP P  ++ +L  S+    + +  
Sbjct: 199 DGAAPEGLDLDTLDEAYEHFKKRYDKAHAGFGGAPKFPTPSNLRFLLKLSQYPSAVREVL 258

Query: 330 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 389
            + + +  + M L TL  M KGGIHD +G GF RYSV + W +PHFEKMLYDQ QL  VY
Sbjct: 259 SAKDCTHAKDMALATLDAMNKGGIHDQIGNGFARYSVTKDWSLPHFEKMLYDQAQLLPVY 318

Query: 390 LDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 448
           LDA+ +T+   +     DI  YL    M    G  FS+EDADS        K+EGAFYVW
Sbjct: 319 LDAYLMTRSPEHLSAVHDIATYLTSPPMQAESGGFFSSEDADSLYRPNDKEKREGAFYVW 378

Query: 449 TSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 507
           T KE + ILG+  A +   +Y ++  GN  ++   D H+E   +NVL         A + 
Sbjct: 379 TLKEFQQILGDRDAEILARYYNVQDEGN--VAPEHDAHDELINQNVLAVTTTKPDLAQQF 436

Query: 508 GMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 566
           G+  ++   IL E R+KL D R+K RPRP LDDK++VSWNGL I + AR S  L S+  +
Sbjct: 437 GLSEDEVNKILEEGRQKLLDHRNKERPRPGLDDKIVVSWNGLAIGALARTSAALSSQDPT 496

Query: 567 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 626
                       ++Y+  AE AA+F+R HLY+  +  L   +R GP  APGF DDYA+LI
Sbjct: 497 R----------SQKYLAAAEKAATFLRAHLYNSTSKTLIRVYREGPGDAPGFADDYAYLI 546

Query: 627 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 686
           SGL+DLYE      +L WA +LQ TQ  +F D++  G+F+T  +   +++R+K+  D AE
Sbjct: 547 SGLIDLYEATFNDTYLQWADDLQQTQLAMFWDKQHLGFFSTPEDQKDLIMRLKDGMDNAE 606

Query: 687 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 746
           P  N VS  NL RL +++   + + Y + A  + + FE  +       P M  A  ++  
Sbjct: 607 PGTNGVSAQNLDRLGALL---EHEDYTKKARDTASAFEAEIMQHPFLFPTMMDAV-VVGK 662

Query: 747 PSRKHVVLVGHKSSVD 762
               H V+ G    VD
Sbjct: 663 LGISHSVITGEGKKVD 678


>gi|374302064|ref|YP_005053703.1| hypothetical protein [Desulfovibrio africanus str. Walvis Bay]
 gi|332555000|gb|EGJ52044.1| protein of unknown function DUF255 [Desulfovibrio africanus str.
           Walvis Bay]
          Length = 691

 Score =  503 bits (1295), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 294/681 (43%), Positives = 387/681 (56%), Gaps = 45/681 (6%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           KHTNRL  E SPYLLQHAHNPVDW  WGEEAF  A ++D P+FLSIGYSTCHWCHVME E
Sbjct: 3   KHTNRLVGEKSPYLLQHAHNPVDWHPWGEEAFRTATEQDKPVFLSIGYSTCHWCHVMERE 62

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFED+ VAKLLN+ FV IKVDREERPD+D VYMT  Q + G GGWPL+V ++PD KP   
Sbjct: 63  SFEDDEVAKLLNEAFVCIKVDREERPDIDNVYMTVCQMMTGHGGWPLTVLMTPDKKPFFS 122

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           GTYFP     GR G   ++ KV+D W  +R+ L QS     E L   L   A   +L D 
Sbjct: 123 GTYFPKSSLSGRMGLMELVPKVQDLWRTRREDLVQSADKVTEAL-RGLERPAVGGELGDS 181

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
           +   A R    QLS+ +D  FGGFG APKFP P     +L   +    TG +   +    
Sbjct: 182 VLFKAER----QLSERFDEAFGGFGGAPKFPTP---HNLLLLLRMFRRTGNARNLA---- 230

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           MV  TL  M +GGI+DH+G GFHRYS D+RW +PHFEKMLYDQ QL   Y++A+ LT+  
Sbjct: 231 MVEKTLTTMRRGGIYDHLGYGFHRYSTDQRWLLPHFEKMLYDQAQLLMAYVEAYQLTRKP 290

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 459
            Y    ++I++Y+RRD+  P G  +SAEDADS   EG    +EG FYVW+ KE+  +LG+
Sbjct: 291 IYKRTAQEIVEYVRRDLQHPDGPFYSAEDADS---EG----EEGKFYVWSEKEIRSVLGK 343

Query: 460 HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
            A  F   Y + P GN     + +  +   G NVL         A +LGM   +    L 
Sbjct: 344 KADPFIRAYDILPEGNF----LDEATHRRTGANVLHLQRPLDILAKELGMSELELETTLA 399

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
           + RR LF VR +R RP  DDKV+  WNGL+I++ + A+K L                D +
Sbjct: 400 DQRRLLFHVRERRVRPLRDDKVLTDWNGLMIAALSMAAKAL----------------DEE 443

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
            ++  A +AA FI   +   +  RL H FR+G       L DYAFLI GL++LYE G  +
Sbjct: 444 LFVRAATAAADFILSRM--RKDGRLLHRFRDGEVAIEATLTDYAFLIWGLVELYEAGLDS 501

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
           + L  A++L    ++ F D + GGY+ T      +L+R K+  DGA PSGNSV++  L++
Sbjct: 502 RHLEAALDLTEIMNKQFWDPKDGGYYFTAESAEQLLVRQKDLFDGAIPSGNSVAMHVLLK 561

Query: 700 LASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKS 759
           L+ +               S A   T   +  +    + C  D    PS   VV+VG ++
Sbjct: 562 LSRLTGRPNLANRAAAVARSAARQAT---EHPVGFTQLLCGVDFSIGPS-AEVVIVGKRN 617

Query: 760 SVDFENMLAAAHASYDLNKTV 780
           + +   ML   HASY  NK +
Sbjct: 618 APETRAMLRKLHASYIPNKVL 638


>gi|156742936|ref|YP_001433065.1| hypothetical protein Rcas_2990 [Roseiflexus castenholzii DSM 13941]
 gi|156234264|gb|ABU59047.1| protein of unknown function DUF255 [Roseiflexus castenholzii DSM
           13941]
          Length = 696

 Score =  503 bits (1295), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 284/685 (41%), Positives = 392/685 (57%), Gaps = 51/685 (7%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
            +  NRL  E SPYLLQHA+NPVDW+ WGEEAFA A+  D PI LS+GY+ CHWCHVME 
Sbjct: 7   TRRPNRLINETSPYLLQHAYNPVDWYPWGEEAFARAQAEDKPILLSVGYAACHWCHVMEH 66

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE  A L+N +FV++KVDREERPDVD +YMT VQA+ G GGWP++VFL+PD  P  
Sbjct: 67  ESFEDEETAALMNRYFVNVKVDREERPDVDSIYMTAVQAMTGSGGWPMTVFLTPDGTPFF 126

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLP- 277
            GTYFPPED++  P F+ +LR V +A+  +R+ L   G   +E++ E     AS  ++P 
Sbjct: 127 AGTYFPPEDRWQMPSFQRVLRSVAEAYATRRNDLLARGRELVERMRE-----ASMMQIPG 181

Query: 278 DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEG 337
             L   AL      L +++D  +GGFG APKFP+P+ ++ +L ++ +   TG+      G
Sbjct: 182 STLTPAALDSAFMGLQQAFDPEYGGFGRAPKFPQPMTLEFLLRYAAR---TGR------G 232

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
            +M+  TL+ MA+GG++D +GGGFHRYSVD +W VPHFEKMLYD   LA VYL+ F  T 
Sbjct: 233 MEMLERTLRAMAEGGMYDQIGGGFHRYSVDAQWLVPHFEKMLYDNALLARVYLETFQATG 292

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           + FY  I  + L Y+ R+M  P G  FS +DADS  T  AT K EGAF+VWT  E+ + L
Sbjct: 293 NAFYRRIAEETLTYMLREMQHPDGGFFSTQDADSLPTADATHKHEGAFFVWTPAEIREAL 352

Query: 458 GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
           G  A +F   Y +   GN            F+GKN+L      +  A  +GM +E+  +I
Sbjct: 353 GADATVFSALYGVTDRGN------------FEGKNILHVQRSPAEVARVMGMSVERVESI 400

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
               RR LF VR  RP+P LDDKV+ +WNG+ + +FA  + +L                D
Sbjct: 401 AERGRRVLFAVRQHRPKPELDDKVLTAWNGMALRAFALGAIVL----------------D 444

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLISGLLDLYEFG 636
           R+EY   A   A F+ R L       L+ S+R G  +  P FL+DYA L  GLL LYE  
Sbjct: 445 REEYRTAAVRCAEFVLRELRRADGELLR-SWRQGVANPTPAFLEDYALLADGLLALYEAT 503

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
              +WL+ A  L +   E F D   GG+++T      +++R ++  D A PSG+S +   
Sbjct: 504 FDPRWLLEARALADALLERFWDDGIGGFYDTGSHHEQLVIRPRDTGDNATPSGSSAAADV 563

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM-LSVPSRKHVVLV 755
           L+RLA I    +   YR+ A   L+     ++           AA+  LS P  + + L+
Sbjct: 564 LLRLALIFDEPR---YRERALTVLSAMAPLMERYPTGFGRYLAAAEFALSQP--REIALI 618

Query: 756 GHKSSVDFENMLAAAHASYDLNKTV 780
           G   + D   + A A   +  N+ V
Sbjct: 619 GDPEAADTRALAAIALKPFLPNRVV 643


>gi|308480509|ref|XP_003102461.1| hypothetical protein CRE_04116 [Caenorhabditis remanei]
 gi|308261193|gb|EFP05146.1| hypothetical protein CRE_04116 [Caenorhabditis remanei]
          Length = 746

 Score =  503 bits (1294), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 286/696 (41%), Positives = 390/696 (56%), Gaps = 73/696 (10%)

Query: 75  HRPIHPYKVVAMAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEA 134
           +RP+H   +V     T          + NRL  E SPYLLQHA+NP+DW+ WGEEAF +A
Sbjct: 3   NRPVHASNLVFRMFAT----------YKNRLGLEKSPYLLQHANNPIDWYPWGEEAFKKA 52

Query: 135 RKRDVPIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTY 194
           ++ + PIFLS+GYSTCHWCHVME ESFE+E  AK+LN+ F++IKVDREERPDVDK+YM +
Sbjct: 53  KESNKPIFLSVGYSTCHWCHVMEKESFENENTAKILNENFIAIKVDREERPDVDKLYMAF 112

Query: 195 V---------------QALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILR 239
           V               QA  G GGWP+SVFL+P+L P+ GGTYFPP+D  G  GF TIL 
Sbjct: 113 VVVYLNFCFTSSFSFFQAASGHGGWPMSVFLTPELHPITGGTYFPPDDNRGMLGFSTILN 172

Query: 240 KVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSR 299
            ++  W K+ D L + G   I +L +  +AS   NK      +   +        S+DSR
Sbjct: 173 MIQTEWKKEGDNLRKRGEQII-KLLQPETASGDVNK-----SEEVFQSIYSHKQSSFDSR 226

Query: 300 FGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGG 359
            GGFG APKFP+  ++  ++  S       KS E++    M+  TL+ MA GGIHDH+G 
Sbjct: 227 LGGFGGAPKFPKASDLDFLIAFSSADSCGDKSKEST---TMLQKTLESMADGGIHDHIGT 283

Query: 360 GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT--KDVFYSYICRDILDYLRRDMI 417
           GFHRYSVD  WHVPHFEKMLYDQ QL   Y D   LT  K+    ++  DI +Y+++   
Sbjct: 284 GFHRYSVDGEWHVPHFEKMLYDQSQLLATYSDFHRLTGKKNENIKFVINDIFEYMQKISH 343

Query: 418 GPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHY-----YLKP 472
             GG  +SAEDADS     +  K EGAF VW  +E++ +L E  I   + +     Y   
Sbjct: 344 KEGG-FYSAEDADSLPKNDSKEKMEGAFCVWEKEEIKKLLCERKIGSADLFDVVADYFDV 402

Query: 473 TGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKR 532
             N ++ R SDPH E K KNVL +L      A+   + +E+    + E ++ L++ R+KR
Sbjct: 403 EDNGNVPRSSDPHGELKNKNVLRKLLTDDECAANHSLTVEELKRGIEEAKQILWEARTKR 462

Query: 533 PRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFI 592
           P PHLD K++ +W  L IS   +A +                 ++  +Y+E AE  A+F+
Sbjct: 463 PSPHLDSKMVTAWQALAISGLVKAYQ----------------ATEDVKYIERAEKCAAFV 506

Query: 593 RRHLYDEQTHRLQHS--------FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVW 644
           R++L  E+   L+ S           G      F DDYAF+I GLLDLY      ++L  
Sbjct: 507 RKYL--EENGELKRSVYLGVEGNIEQGHQNMKAFSDDYAFMIQGLLDLYTVLGKNEYLEK 564

Query: 645 AIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIV 704
           AIELQ T D+ F    G GYF +   D  V +R+ ED DGAEP+  S++  NL+RL  I+
Sbjct: 565 AIELQKTCDQKFWS--GNGYFISEQADEGVSVRMVEDQDGAEPTATSIASNNLLRLHDIL 622

Query: 705 AGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 740
              ++D YR+ A         RL    +A+P M  A
Sbjct: 623 ---ENDEYREKANKCFRGASERLNKFPIALPKMAVA 655


>gi|148656403|ref|YP_001276608.1| hypothetical protein RoseRS_2279 [Roseiflexus sp. RS-1]
 gi|148568513|gb|ABQ90658.1| protein of unknown function DUF255 [Roseiflexus sp. RS-1]
          Length = 700

 Score =  502 bits (1292), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 281/696 (40%), Positives = 394/696 (56%), Gaps = 63/696 (9%)

Query: 94  TSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWC 153
           +S+ R++  NRL    SPYLLQHA+NPVDW+ WGEEA A A+  D PI LS+GY+ CHWC
Sbjct: 2   SSNKRDRRPNRLINATSPYLLQHAYNPVDWYPWGEEALARAKAEDKPILLSVGYAACHWC 61

Query: 154 HVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPD 213
           HVME ESFEDE  A L+N  F+++KVDREERPD+D +YMT VQA+ G GGWP++VFL+PD
Sbjct: 62  HVMEHESFEDEETAALMNQHFINVKVDREERPDIDAIYMTAVQAMTGSGGWPMTVFLTPD 121

Query: 214 LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASS 273
             P   GTYFPPED++  P F+ +LR V +A+  +R+ L   G   +E++ EA+S     
Sbjct: 122 GVPFFAGTYFPPEDRWQMPSFRRVLRSVAEAYASRRNELLARGRELVERMREAISMHMPG 181

Query: 274 NKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGE 333
             L   +   A       L +++D  FGGFG APKFP+P+ ++ +L ++ +   TG+   
Sbjct: 182 GTLTPAVLDTAF----IGLQQAFDPAFGGFGRAPKFPQPMTLEFLLRYAVR---TGR--- 231

Query: 334 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
              G +M+  TL+ MA+GG++D +GGGFHRYSVD +W VPHFEKMLYD   LA VYL+ F
Sbjct: 232 ---GMEMLEMTLRRMAEGGMYDQLGGGFHRYSVDAQWLVPHFEKMLYDNALLARVYLETF 288

Query: 394 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
             T +  Y  I  + LDY+ R+M  P G  FS +DADS  T  AT K EGAF+VWT  E+
Sbjct: 289 QATGNACYRRIAEETLDYMLREMHHPEGGFFSTQDADSLPTPDATHKHEGAFFVWTPAEI 348

Query: 454 EDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
            + LG  AI+F   Y +   GN            F+GKN+L         A  +GMP+E+
Sbjct: 349 REALGTDAIVFSALYGVTDQGN------------FEGKNILHVRRSPDEVARVMGMPVEQ 396

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
              I    RR LF+VR +RP P LDDKV+ +WNG+ I +FA  +                
Sbjct: 397 IETIAARGRRILFEVRQRRPMPDLDDKVLTAWNGMAIRAFALGA---------------- 440

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
           V  DR++Y   A   A F+  +L       L+   R   +  P FL+DYA L  GLL LY
Sbjct: 441 VALDREDYRIAAVRCARFVLTNLRRADGELLRSWRRGVANPTPAFLEDYALLADGLLALY 500

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           E      WL+ A  L ++  E F D   GG+++T      +++R ++  D A PSG+S +
Sbjct: 501 EATFDPHWLLEARALADSLLERFWDEGLGGFYDTGKNHEQLVIRPRDTGDNATPSGSSAA 560

Query: 694 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM---------CCAADML 744
           V  L+RLA I   ++   YR   E +L+V E+        VP+M           AA   
Sbjct: 561 VDVLLRLALIFDEAR---YR---ERALSVLES-------MVPVMQRYPTGFGRYLAAAEF 607

Query: 745 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           ++   + + L+G+    D + + A     +  N+ +
Sbjct: 608 ALGQPREIALIGNPEDADTQALAAVVLKPFLPNRVI 643


>gi|392411456|ref|YP_006448063.1| thioredoxin domain protein [Desulfomonile tiedjei DSM 6799]
 gi|390624592|gb|AFM25799.1| thioredoxin domain protein [Desulfomonile tiedjei DSM 6799]
          Length = 692

 Score =  500 bits (1288), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 282/683 (41%), Positives = 384/683 (56%), Gaps = 52/683 (7%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRLA+E SPYLLQHAHNPVDW+ WGEEAF +AR  D PIFLSIGYSTCHWCHVME ESF
Sbjct: 3   TNRLASEKSPYLLQHAHNPVDWYPWGEEAFKKARSEDKPIFLSIGYSTCHWCHVMEHESF 62

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           EDE  A  +N  FVSIKVDREERPD+D +YMT  Q + G GGWPL+V L+PDLKP   GT
Sbjct: 63  EDEETAAAMNQSFVSIKVDREERPDLDNIYMTVCQMMTGSGGWPLNVVLTPDLKPFFAGT 122

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSG---AFAIEQLSEALSASASSNKLPD 278
           YFP   ++G+ G   +  ++++ W  +R+ + +S      A+ Q+ +A S S     L  
Sbjct: 123 YFPKTSRFGKIGMVELSDRIREIWQTRRNDVLESADKVTNALRQMPDASSGSVQGKAL-- 180

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
                 L     +L K +D   GGF  APKFP P  +  +L + K+  D        +  
Sbjct: 181 ------LEQAFTELDKRFDPARGGFSPAPKFPTPHNLLFLLRYWKRTGD-------EKAL 227

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
           KMV  TL  +  GGI+DHVG GFHRYS D  W VPHFEKMLYDQ  L   Y +A+  T +
Sbjct: 228 KMVEKTLHALRLGGIYDHVGFGFHRYSTDTEWLVPHFEKMLYDQALLTMAYTEAYQATGN 287

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
            FY+   ++I+ Y+ RDM  P G  +SAEDADS   EG     EG FYVWT +E+ED+LG
Sbjct: 288 EFYADTAKEIVTYVLRDMTSPQGGFYSAEDADS---EGV----EGKFYVWTLREIEDVLG 340

Query: 459 EH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
           +  A L+   Y  +P GN       +   +  G N+   L      A+   M   +  + 
Sbjct: 341 QKDAALYSAVYNFEPEGNFH----DEASGQATGANIPHLLARFEEIAATRDMTPHELHDR 396

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
           L   R KLF  R +R  PH DDK++  WNGL+I++ A+A+++ ++               
Sbjct: 397 LRAIREKLFSTRERRVHPHKDDKILTDWNGLMIAALAKAAQVFEN--------------- 441

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGS 637
            +EY E A  AA F+   L DEQ  RL H FR+G +     +DD+AF + GLL+LYE   
Sbjct: 442 -REYGEAARKAADFLLSTLRDEQG-RLLHRFRDGEAGLTAHVDDFAFFVWGLLELYETVF 499

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
             ++L  A+EL +   + F D E GG++ T  +  ++L+R KE +DGA PSGNSVS++NL
Sbjct: 500 EPQYLAAALELNDDLLKRFWDDERGGFYFTAMDAENLLVRTKEVYDGAVPSGNSVSLLNL 559

Query: 698 VRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGH 757
           +RL  + +  + +     AE     F   L+    A   M    +      R + V++ +
Sbjct: 560 LRLGRMTSNPELE---SKAEQIAKAFAGTLRQFPSAYTQMLVGLEF--AEGRTYEVVIAN 614

Query: 758 KSSVDFENMLAAAHASYDLNKTV 780
             + D   ML     ++  NK V
Sbjct: 615 SGTEDVLPMLRIIRRNFLPNKVV 637


>gi|451845821|gb|EMD59132.1| hypothetical protein COCSADRAFT_41015 [Cochliobolus sativus ND90Pr]
          Length = 799

 Score =  499 bits (1286), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 287/680 (42%), Positives = 390/680 (57%), Gaps = 37/680 (5%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K  NRL    SPY+  H +NPV W  WG EA   A+K +  IF+SIGY+ CHWCHVME E
Sbjct: 70  KLRNRLNESRSPYVRGHMNNPVAWQIWGPEAIELAKKSNRLIFISIGYAACHWCHVMERE 129

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFE++ VAKLLN+ F+ IK+DREERPDVD++YM YVQA  G GGWPL+VF++PDL+P+ G
Sbjct: 130 SFENDEVAKLLNEHFIPIKIDREERPDVDRIYMNYVQATTGSGGWPLNVFITPDLEPIFG 189

Query: 220 GTYFP-PEDKYG---RPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           GTY+P P          GF  IL+K++D W  +R    +S      QL +       S K
Sbjct: 190 GTYWPGPGSTMAMGEHIGFIGILKKIRDVWRDQRQRCLESAKEITAQLRDFAEEGNISRK 249

Query: 276 LPDELPQNALRL-----CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK---KLED 327
             D  P   L L       E   K YD    GFG APKFP P  +  +L  S+    +++
Sbjct: 250 --DGAPNETLDLELLDEAYEHFKKRYDQVHAGFGGAPKFPTPSNLHFLLKLSQYPNPVKE 307

Query: 328 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 387
              + + +  + M L TL  M KGGIHD +G GF RYSV + W +PHFEKMLYDQ QL  
Sbjct: 308 VLGAKDCTYAKDMALATLSAMNKGGIHDQIGNGFARYSVTKDWSLPHFEKMLYDQSQLLA 367

Query: 388 VYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAETEGATRKKEGAFY 446
           VYLDA+ +T+   +     DI  YL    M    G  +S+EDADS        K+EGAFY
Sbjct: 368 VYLDAYLMTRSPEHLGAVHDIATYLTSPPMHAESGGFYSSEDADSLYRPNDKEKREGAFY 427

Query: 447 VWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 505
           VWT  E +DILGE  + +   +Y +K  GN  ++   D H+E   +NVL   + S+  A 
Sbjct: 428 VWTLNEFQDILGERDSEILARYYNVKDEGN--VAPEHDAHDELINQNVLAITSTSADLAK 485

Query: 506 KLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEA 564
           + G+  +K   IL E R+KL + R+K RPRP LDDK++VSWNGL I + AR S  L S+ 
Sbjct: 486 QFGLSEDKVEKILTEGRQKLLEHRNKERPRPGLDDKIVVSWNGLAIGALARTSAALASQD 545

Query: 565 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 624
            +            KEY+  AE AA+F+++HLY+ ++  L   +R GP  APGF DDYA+
Sbjct: 546 PAR----------SKEYLAAAEKAAAFLQKHLYNSESKTLIRVWREGPGDAPGFADDYAY 595

Query: 625 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 684
           LISGL++LYE      +L WA +LQ TQ ++F D++  G+F+T  +   +++R+K+  D 
Sbjct: 596 LISGLINLYEATFNDSYLQWADDLQKTQLKMFWDKQHLGFFSTPEDQTDLIMRLKDGMDN 655

Query: 685 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM--CCAAD 742
           AEP  N VS  NL RL +++  S+   Y Q A  + + FE  +       P M     A 
Sbjct: 656 AEPGTNGVSAQNLDRLGALLEDSE---YTQRARDTASAFEAEIMQHPFLFPSMMEAVVAG 712

Query: 743 MLSVPSRKHVVLVGHKSSVD 762
            L +   +H V+ G    VD
Sbjct: 713 KLGI---RHAVITGDGQKVD 729


>gi|218780669|ref|YP_002431987.1| hypothetical protein Dalk_2829 [Desulfatibacillum alkenivorans
           AK-01]
 gi|218762053|gb|ACL04519.1| protein of unknown function DUF255 [Desulfatibacillum alkenivorans
           AK-01]
          Length = 718

 Score =  499 bits (1286), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 290/679 (42%), Positives = 378/679 (55%), Gaps = 45/679 (6%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYLLQHA NPVDW  WG+EAF +A+K D P+FLSIGYSTCHWCHVME ESFE
Sbjct: 30  NRLIFEKSPYLLQHAANPVDWRPWGDEAFEQAKKEDKPVFLSIGYSTCHWCHVMERESFE 89

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D   A LLN  F+ IKVDREERPD+D VYM+  QA+ G GGWP+SVFL+PD +P   GTY
Sbjct: 90  DPEAAALLNRHFICIKVDREERPDIDHVYMSVTQAMTGAGGWPMSVFLTPDKEPFYAGTY 149

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP ED  GRPG   +   + + W  +R         A +Q+ +ALS  A   K  +EL  
Sbjct: 150 FPKEDHMGRPGLMRLATLLGELWKNERSKALN----AAQQVVQALS-QAQPKKGREELGP 204

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
           + L      L  SYD + GGFG   KFP P  +  +L + K+  D       +E   MV 
Sbjct: 205 HTLGKAFAGLKASYDVQQGGFGRGNKFPTPHNLTFLLRYWKRTGD-------AEALAMVE 257

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  M  GGI+DHVG G HRY+ D  W +PHFEKMLYDQ   AN  L+A+  T    Y+
Sbjct: 258 KTLTAMRMGGIYDHVGFGIHRYATDPNWLLPHFEKMLYDQALTANALLEAYQATGKEEYA 317

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHA 461
              R+I  Y+ RDM  P G  +SAEDADS   EG    +EG FYVWT+KE+ +ILG E  
Sbjct: 318 TNAREIFTYVLRDMTSPEGGFYSAEDADS---EG----EEGKFYVWTTKEITEILGKEDG 370

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
            LF   + L   GN           +  G ++     D    A+ LGM   +  + L + 
Sbjct: 371 ALFISAFNLVKGGNF----FDQATGQKTGDSIPHLQKDPGRLAADLGMEKAELESRLEKI 426

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           R  LF  R KR  P+ DDK++  WNGL+I++ A+  +IL  E                +Y
Sbjct: 427 RAALFAEREKRIHPYKDDKILTDWNGLMIAALAKGGRILGDE----------------KY 470

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
              A  AA FI   L D + H LQ  FR G +  PG LDDYAF++ GLL+LYE   G KW
Sbjct: 471 TLAAVRAADFILDALQDGEGH-LQKRFREGEAALPGLLDDYAFMVWGLLELYESTFGVKW 529

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           L  A+ L  T  +LF DR+ GG F +      + +R K+ HDGA+PSGNSV+ +NL+RLA
Sbjct: 530 LKKAVTLNETMLDLFWDRKNGGLFMSPVYGEKLFMRGKDLHDGAQPSGNSVAAVNLLRLA 589

Query: 702 SIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSV 761
            I A  +    R+ AE  L  F  +++        +  A D +  P+ + +V+ G + + 
Sbjct: 590 GITANEEC---REKAEAILQAFSGQIEAQPYVYTHLLGALDFIIGPALE-IVICGDQGAR 645

Query: 762 DFENMLAAAHASYDLNKTV 780
           D   ML   +  +  NK +
Sbjct: 646 DSTVMLDGVNQRFVPNKVL 664


>gi|108805332|ref|YP_645269.1| hypothetical protein Rxyl_2540 [Rubrobacter xylanophilus DSM 9941]
 gi|108766575|gb|ABG05457.1| protein of unknown function DUF255 [Rubrobacter xylanophilus DSM
           9941]
          Length = 685

 Score =  499 bits (1286), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 284/683 (41%), Positives = 402/683 (58%), Gaps = 56/683 (8%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRLA E SPYLLQH  NPVDW+ WGEEA   AR+ D PI LS+GYS+CHWCHVME ESF
Sbjct: 5   ANRLANETSPYLLQHKDNPVDWYPWGEEALRRARREDKPILLSVGYSSCHWCHVMERESF 64

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           EDE  A+++N+ FV+IKVDREERPD+D +YM+ +QA+  GGGWP++VFL+P+  P   GT
Sbjct: 65  EDEETARIMNEHFVNIKVDREERPDIDSIYMSALQAMTRGGGWPMTVFLTPEGVPFYAGT 124

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFPPE + G P FK +L  + DA+  +R+ + +S     E L  + +A     +L +EL 
Sbjct: 125 YFPPEPRGGMPSFKQVLLTLADAYRNRREEVLRSAESVREFLRASTTAEMPRGRLREELL 184

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
             A    AE L +  D RFGGFG APKFP+P+ ++++L H ++  D        E    V
Sbjct: 185 DGA----AEALMRQLDRRFGGFGGAPKFPQPMSLEVLLRHHRRTGD-------REALAGV 233

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             TL+ MA+GGI+D +GGGFHRY+VD RW VPHFEKMLYD   L+ +YL+A+  T D FY
Sbjct: 234 ELTLRSMARGGIYDQLGGGFHRYAVDGRWLVPHFEKMLYDNALLSRLYLEAYQATGDGFY 293

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EH 460
             I  + LDY+ RDM GP G  +SAEDADS   EG    +EG FYVWT +E+ + LG E 
Sbjct: 294 RRIAEETLDYVARDMRGPEGGFYSAEDADS---EG----EEGKFYVWTPRELREALGSED 346

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
           A L   ++ +   GN            F+G+NVL    +    A ++G+   +    + E
Sbjct: 347 ASLAAAYWGVTERGN------------FEGRNVLHVPREPEEVAREVGLSPGELGRRVRE 394

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            RR+L + R +R RP  D+KV+ +WNGL++ SFA  +++L+                R++
Sbjct: 395 IRRRLLEARGRRVRPGRDEKVLAAWNGLMLRSFAFTARVLR----------------RED 438

Query: 581 YMEVA-ESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
           Y+ +A E+AA  + R L  E   RL  S+R+G ++  G+L+DYA +  GL+ LYE    T
Sbjct: 439 YLRIACENAAFLLGRLLSPE--GRLLRSYRDGRARIAGYLEDYAMVADGLVSLYEATFET 496

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
           +WL  AI L +  DELF D   G +F+       ++ R ++ +D A PSG SV+V   V 
Sbjct: 497 RWLREAISLADAMDELFWDESAGAFFDAPAGGEELVTRPRDVYDNATPSGTSVAVD--VL 554

Query: 700 LASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM-LSVPSRKHVVLVGHK 758
           L   +   + D YR+ AE +L      L+ M  A   +  A D  L  P  + V +VG  
Sbjct: 555 LRLALLLGRED-YRRRAEAALEGLSGLLEQMPAAFGRLLGALDFHLGRP--REVAIVGRP 611

Query: 759 SSVDFENMLAAAHASYDLNKTVS 781
            + D   ++ A ++ Y  N+ ++
Sbjct: 612 DAPDTRALVDALYSVYLPNRVIA 634


>gi|195334316|ref|XP_002033829.1| GM21533 [Drosophila sechellia]
 gi|194125799|gb|EDW47842.1| GM21533 [Drosophila sechellia]
          Length = 808

 Score =  496 bits (1278), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 283/715 (39%), Positives = 386/715 (53%), Gaps = 73/715 (10%)

Query: 67  RRPLAVISH----RPIHPYKVVAMAERTPASTSHSRN---KHTNRLAAEHSPYLLQHAHN 119
           RR L ++ +    RP+   K   MA    +S   S+    K  NRL A  SPYLLQHA+N
Sbjct: 34  RRNLQLLHNSCRSRPVSNQKFRTMATGGESSKEVSKEEPAKQGNRLVASKSPYLLQHAYN 93

Query: 120 PVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKV 179
           PV+W+ WGEEAF +AR  +  IFLS+GYSTCHWCHVME ESFE    A ++N+ FV+IKV
Sbjct: 94  PVEWYPWGEEAFEKARSENKLIFLSVGYSTCHWCHVMEHESFESPETAAIMNENFVNIKV 153

Query: 180 DREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILR 239
           DREERPD+DK+YM ++    G GGWP+SV+L+P+L PL+ GTYFPP+ +YG P F  +L 
Sbjct: 154 DREERPDIDKIYMQFLLMSKGSGGWPMSVWLTPNLAPLVAGTYFPPKSRYGMPSFNAVLN 213

Query: 240 KVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRL--CAEQLSKS-- 295
            +   W+  ++ L  +G+  +  L +   ASA        +P+ A       E+LS++  
Sbjct: 214 SIARKWETDKESLLTTGSSLLSALKKNQDASA--------VPEAAFGAGSAIEKLSEAIN 265

Query: 296 -----YDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAK 350
                +D   GGFGS PKFP    +  + +     +D        +   MV+ TL  + K
Sbjct: 266 VHRQRFDQTHGGFGSEPKFPEVPRLNFLFHGYLVTKD-------PDVLDMVIETLTQIGK 318

Query: 351 GGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILD 410
           GGIHDH+ GGF RY+  + WH  HFEKMLYDQGQL   + +A+ +T+D  Y      I  
Sbjct: 319 GGIHDHIFGGFARYATTQDWHNVHFEKMLYDQGQLMVAFTNAYKVTRDEIYLGYADKIYK 378

Query: 411 YLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKE---- 466
           YL +D+  P G  ++ EDADS  T     K EGAFY WT  E++    + A  F +    
Sbjct: 379 YLIKDLRHPLGGFYAGEDADSLPTHEDKVKVEGAFYAWTWDEIQAAFKDQAQRFDDITPD 438

Query: 467 --------HYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
                   HY LKP GN  +   SDPH    GKN+LI       + +   +  +++  +L
Sbjct: 439 RAFEIYAYHYDLKPPGN--VPTYSDPHGHLTGKNILIVRGSEEDTCANFKLEADQFKKLL 496

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
                 L  +R KRPRPHLD K+I +WNGLV+S   +                    ++R
Sbjct: 497 ATTNDILHVIRDKRPRPHLDTKIICAWNGLVLSGLCKLGN--------------CYSANR 542

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRL----------QHSFRNGPSKAPGFLDDYAFLISG 628
           ++YM+ A+    F+R+ +YD +   L            +     S+  GFLDDYAFLI G
Sbjct: 543 EQYMQTAKELLDFLRKEMYDPEQKLLIRSCYGVAVGDETLEKNASQIDGFLDDYAFLIKG 602

Query: 629 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 688
           LLD Y+       L WA  LQ+TQD+LF D   G YF +  + P+V++R+KEDHDGAEPS
Sbjct: 603 LLDYYKATLDVDVLHWAKALQDTQDKLFWDERNGAYFFSQQDAPNVIVRLKEDHDGAEPS 662

Query: 689 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 743
           GNSVS  NLV LA        D + Q A   L  F   +     A+P M  A  M
Sbjct: 663 GNSVSAHNLVLLAHYY---DEDAFLQKAGKLLNFF-ADVSPFGHALPEMLSALLM 713


>gi|451995214|gb|EMD87683.1| hypothetical protein COCHEDRAFT_21080 [Cochliobolus heterostrophus
           C5]
          Length = 734

 Score =  496 bits (1278), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 287/678 (42%), Positives = 393/678 (57%), Gaps = 37/678 (5%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K  NRL+   SPY+  H +NPV W  WG+EA   A+K +  IF+SIGY+ CHWCHVME E
Sbjct: 9   KLKNRLSESRSPYVRGHMNNPVAWQIWGQEAIGLAKKSNRLIFISIGYAACHWCHVMERE 68

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFE++ VA LLN+ F+ IK+DREERPDVD++YM YVQA  G GGWPL+VF++PDL+P+ G
Sbjct: 69  SFENDEVANLLNEHFIPIKIDREERPDVDRIYMNYVQATTGSGGWPLNVFITPDLEPIFG 128

Query: 220 GTYFP-PEDKYG---RPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           GTY+P P          GF  IL+K++D W  +R    +S      QL +       S K
Sbjct: 129 GTYWPGPGSTMAMGEHIGFVGILKKIRDVWRDQRQRCLESAKEITAQLRDFAEEGNISRK 188

Query: 276 LPDELPQNALRLCAEQLSKSYDSRF---GGFGSAPKFPRPVEIQMMLYHSKK---LEDTG 329
             D  P   L L  E L ++Y++       FG APKFP P  +  +L  S+    +++  
Sbjct: 189 --DGAPNETLDL--ELLDEAYEASTTFASSFGGAPKFPTPSNLHFLLKLSQYPNLVKEVL 244

Query: 330 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 389
            + + +  + M L TL  M KGGIHD +G GF RYSV + W +PHFEKMLYDQ QL  VY
Sbjct: 245 GAKDCTRAKDMALATLSAMNKGGIHDQIGNGFARYSVTKDWSLPHFEKMLYDQSQLLAVY 304

Query: 390 LDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 448
           LDA+ +T+   +     DI  YL    M    G  +S+EDADS        K+EGAFYVW
Sbjct: 305 LDAYLMTRSPEHLEAVHDIATYLTSPPMHAESGGFYSSEDADSLYRPNDKEKREGAFYVW 364

Query: 449 TSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 507
           T KE +DILGE  + +   +Y +K  GN  ++   D H+E   +NVL   +  +  A + 
Sbjct: 365 TLKEFQDILGERDSEILARYYNVKDEGN--VAPEHDAHDELINQNVLAITSTPADLAKQF 422

Query: 508 GMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 566
           G+  EK   IL E R+KL + R+K RPRP LDDK++VSWNGL I + AR S  L S+  +
Sbjct: 423 GLSEEKVKRILTEGRQKLLEHRNKERPRPGLDDKIVVSWNGLAIGALARTSAALASQDPT 482

Query: 567 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 626
                       KEY+  AE AA+F+++HLY  ++  L   +R GP  APGF DDYA+LI
Sbjct: 483 R----------SKEYLAAAEKAAAFVQKHLYHSESKTLIRVWREGPGDAPGFADDYAYLI 532

Query: 627 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 686
           SGL+DLYE      +L WA +LQ TQ ++F D++  G+F+T  +   +++R+K+  D AE
Sbjct: 533 SGLIDLYEATFNDSYLQWADDLQKTQLKMFWDKQHLGFFSTPEDQTDLIMRLKDGMDNAE 592

Query: 687 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA--ADML 744
           P  N VS  NL RL +++  S+   Y Q A  + + FE  +       P M  A  A  L
Sbjct: 593 PGTNGVSAQNLDRLGALLEDSE---YTQRARDTASAFEAEIMQHPFLFPSMMDAVVAGKL 649

Query: 745 SVPSRKHVVLVGHKSSVD 762
            +    H V+ G+   VD
Sbjct: 650 GI---THAVITGNGQKVD 664


>gi|25147430|ref|NP_495615.2| Protein B0495.5 [Caenorhabditis elegans]
 gi|21264548|sp|Q09214.2|YP65_CAEEL RecName: Full=Uncharacterized protein B0495.5
 gi|351065503|emb|CCD61473.1| Protein B0495.5 [Caenorhabditis elegans]
          Length = 729

 Score =  496 bits (1278), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 275/686 (40%), Positives = 386/686 (56%), Gaps = 48/686 (6%)

Query: 91  PASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTC 150
           P +     + + NRL  E SPYLLQHA+NP+DW+ WG+EAF +A+  + PIFLS+GYSTC
Sbjct: 7   PITVIRMTSTYKNRLGQEKSPYLLQHANNPIDWYPWGQEAFQKAKDNNKPIFLSVGYSTC 66

Query: 151 HWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFL 210
           HWCHVME ESFE+E  AK+LND FV+IKVDREERPDVDK+YM +V A  G GGWP+SVFL
Sbjct: 67  HWCHVMEKESFENEATAKILNDNFVAIKVDREERPDVDKLYMAFVVASSGHGGWPMSVFL 126

Query: 211 SPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS 270
           +PDL P+ GGTYFPP+D  G  GF TIL  +   W K+ + L Q GA  I +L +  +AS
Sbjct: 127 TPDLHPITGGTYFPPDDNRGMLGFPTILNMIHTEWKKEGESLKQRGAQII-KLLQPETAS 185

Query: 271 ASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGK 330
              N+      +   +        S+DSR GGFG APKFP+  ++  ++  +       +
Sbjct: 186 GDVNR-----SEEVFKSIYSHKQSSFDSRLGGFGRAPKFPKACDLDFLITFAAS---ENE 237

Query: 331 SGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYL 390
           S +A +   M+  TL+ MA GGIHDH+G GFHRYSV   WH+PHFEKMLYDQ QL   Y 
Sbjct: 238 SEKAKDSIMMLQKTLESMADGGIHDHIGNGFHRYSVGSEWHIPHFEKMLYDQSQLLATYS 297

Query: 391 DAFSLT--KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 448
           D   LT  K     ++  DI  Y+++     GG  ++AEDADS     ++ K EGAF  W
Sbjct: 298 DFHKLTERKHDNVKHVINDIYQYMQKISHKDGG-FYAAEDADSLPNHNSSNKVEGAFCAW 356

Query: 449 TSKEVEDILGEHAI-------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 501
             +E++ +LG+  I       +  +++ ++ +GN  ++R SDPH E K KNVL +L    
Sbjct: 357 EKEEIKQLLGDKKIGSASLFDVVADYFDVEDSGN--VARSSDPHGELKNKNVLRKLLTDE 414

Query: 502 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 561
             A+   + + +    + E +  L++ R++RP PHLD K++ SW GL I+   +A +   
Sbjct: 415 ECATNHEISVAELKKGIDEAKEILWNARTQRPSPHLDSKMVTSWQGLAITGLVKAYQ--- 471

Query: 562 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHR------LQHSFRNGPSKA 615
                         ++  +Y++ AE  A FI + L D    R             G  + 
Sbjct: 472 -------------ATEETKYLDRAEKCAEFIGKFLDDNGELRRSVYLGANGEVEQGNQEI 518

Query: 616 PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL 675
             F DDYAFLI  LLDLY      ++L  A+ELQ   D  F +  G GYF +   D  V 
Sbjct: 519 RAFSDDYAFLIQALLDLYTTVGKDEYLKKAVELQKICDVKFWN--GNGYFISEKTDEDVS 576

Query: 676 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 735
           +R+ ED DGAEP+  S++  NL+RL  I+   + + YR+ A         RL  + +A+P
Sbjct: 577 VRMIEDQDGAEPTATSIASNNLLRLYDIL---EKEEYREKANQCFRGASERLNTVPIALP 633

Query: 736 LMCCAADMLSVPSRKHVVLVGHKSSV 761
            M  A     + S   V++   KS +
Sbjct: 634 KMAVALHRWQIGSTTFVLVGDPKSEL 659


>gi|169597471|ref|XP_001792159.1| hypothetical protein SNOG_01521 [Phaeosphaeria nodorum SN15]
 gi|160707528|gb|EAT91170.2| hypothetical protein SNOG_01521 [Phaeosphaeria nodorum SN15]
          Length = 756

 Score =  496 bits (1276), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 284/676 (42%), Positives = 381/676 (56%), Gaps = 38/676 (5%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL    SPY+  H +NPV W  WG EA   A+K +  IF+SIGY+ CHWCHVME ESFE
Sbjct: 21  NRLNESRSPYVRGHMNNPVAWQQWGPEALELAKKSNRLIFISIGYAACHWCHVMERESFE 80

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           ++ VA +LN  F+ IK+DREERPD+D++YM YVQA  GGGGWPL+ F++PDL+P+ GGTY
Sbjct: 81  NQEVADILNKNFIPIKIDREERPDIDRIYMNYVQATTGGGGWPLNAFITPDLEPIFGGTY 140

Query: 223 FP-PEDKY---GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
           +P PE      G PGF  IL K++D W  +R     S      QL +       S K   
Sbjct: 141 WPGPESTMAMEGHPGFVGILEKIRDVWQNQRQRCLDSAKEITAQLRDFAEDGNISRKDGA 200

Query: 279 E-------LPQNALRLC----AEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---YHSKK 324
           E       L  +A  +C     +   + YD    GFGSAPKFP P  +  +L    + K+
Sbjct: 201 EHDHLDLDLLDDAYEVCEADGPQHFKRRYDQAHAGFGSAPKFPTPSNLHFLLKLNTYPKQ 260

Query: 325 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 384
                 + + S  QKMVL TL  M KGGIHD +G GF RYSV + W +PHFEKMLYDQ Q
Sbjct: 261 TAQILTAEDISNAQKMVLATLDKMNKGGIHDQIGNGFARYSVTKDWSLPHFEKMLYDQAQ 320

Query: 385 LANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAETEGATRKKEG 443
           L  VYLDA+  TK         DI  YL    M    G  FS+EDADS        K+EG
Sbjct: 321 LLPVYLDAYLATKRPEMLEAVHDIATYLTTPPMQAESGGFFSSEDADSLYRPSDKEKREG 380

Query: 444 AFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSS 501
           AFYVWT KE ++ILG+  A +   +Y ++  GN  ++   D H+E   +NVL I  N  +
Sbjct: 381 AFYVWTLKEFQEILGDRDAEILARYYNVRDEGN--VAPEHDAHDELINQNVLAINNNTPT 438

Query: 502 ASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKIL 560
             A +  +  ++  +IL   R+KL D R+K RPRP LDDK++VSWNGL I + AR +  +
Sbjct: 439 DVAKQFALSEDELQSILRSGRQKLLDHRNKERPRPALDDKIVVSWNGLAIGALARTAAAI 498

Query: 561 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 620
            ++  S             +Y+  AE AA FI++ LY+  +  L   +R GP  APGF D
Sbjct: 499 SAQDPSR----------SSQYLAAAEKAAHFIQKELYNPTSKTLTRVYREGPGDAPGFAD 548

Query: 621 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 680
           DYA+LISGL+DLYE       L WA ELQ TQ  +F D++  G+F+T      +++R+K+
Sbjct: 549 DYAYLISGLIDLYEATFNPSNLQWADELQQTQLSMFWDKQHLGFFSTPENQTDLIMRLKD 608

Query: 681 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 740
             D AEP  N VS  NL RL +++  ++   Y + A  +++ FE  +       P M  A
Sbjct: 609 GMDNAEPGTNGVSARNLDRLGALLEDAE---YVKKARDTVSAFEAEIMQHPFLFPSMLDA 665

Query: 741 ADMLSVPSRKHVVLVG 756
                +  R HVV+ G
Sbjct: 666 VVAGKLGMR-HVVVTG 680


>gi|391342665|ref|XP_003745636.1| PREDICTED: spermatogenesis-associated protein 20 [Metaseiulus
           occidentalis]
          Length = 728

 Score =  495 bits (1275), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 300/709 (42%), Positives = 396/709 (55%), Gaps = 92/709 (12%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRL  E SPYLLQHAHNPV WF+W +EAF  AR+ +  IFLSIGYSTCHWCHVME ESF
Sbjct: 8   VNRLVNERSPYLLQHAHNPVAWFSWEDEAFEAARRDNKLIFLSIGYSTCHWCHVMERESF 67

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           E+E VAK+LND +VSIKVDREERPD+DK+YMTYVQ   G  GWPLSV+L+P+LKP+ GGT
Sbjct: 68  ENEEVAKILNDRYVSIKVDREERPDIDKIYMTYVQVTSGHSGWPLSVWLTPELKPIFGGT 127

Query: 222 YFPPED-KYGRPGFKTILRKVKDAW------------DKKRDMLAQSGAFAIEQLSEALS 268
           YFPPED +YG  GFKTIL  + D W            D+   MLA++       L E L 
Sbjct: 128 YFPPEDNQYGLAGFKTILLMLDDKWHSSKNEKIKADSDRITAMLARAS-----NLRENLE 182

Query: 269 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPV--EIQMMLYHSKKLE 326
           A+ S        P   ++ C+  L K       GF   P+FP+ V     M L+H +   
Sbjct: 183 AAESFQ------PSQCIKDCSLILQK----HLIGFVKEPRFPQCVNGNFYMNLFHFQN-- 230

Query: 327 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 386
                     G  +V   L+ MA GGIHDH+GGGFHRY+VD  W VPHFEKMLYDQ Q+ 
Sbjct: 231 -------NRMGVDIVERQLKEMATGGIHDHLGGGFHRYTVDAAWQVPHFEKMLYDQAQIL 283

Query: 387 NVYLDAFSLTK-----DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET-EGATRK 440
            +Y     +         F+  +   I DY+ RD+  P G  +SAEDADS E+ + +  K
Sbjct: 284 ALYCSYLRMPGIKPEIASFFGGVATGIADYVMRDLSHPQGGFYSAEDADSLESFDSSDHK 343

Query: 441 KEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI---- 495
           KEGAFYVWT  E++ IL  + A +F E + +   GN D     D   E   +N L     
Sbjct: 344 KEGAFYVWTMAEIQKILSKKEAKVFCEFFGVDEQGNVDPHH--DAQGELLNQNTLFYRYP 401

Query: 496 -----ELNDSSASAS-KLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGL 548
                 +ND +     + G PL++   IL   +RKL   R   RPRPHLD+K++ +WNGL
Sbjct: 402 DSYDQNINDMAKVIDLEDGDPLDE---ILESAKRKLLQRRLESRPRPHLDNKIVSAWNGL 458

Query: 549 VISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS- 607
           +I++ A+AS +LK                R  Y E A  A  FIR +L+D +  RL  S 
Sbjct: 459 MIAALAKASVVLK----------------RPAYAERALKAVDFIRANLFDRENQRLYRSA 502

Query: 608 FRNGPSKA----------PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFL 657
           +  G   A          PG L+DYAF+ISGLL LY+     + L++A  LQ++Q+  F 
Sbjct: 503 YTEGEGDAARVEQLEKPIPGVLEDYAFVISGLLQLYDATLDEQLLLFAKILQDSQNRQFW 562

Query: 658 DREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAE 717
           D   GGYF  +G   +++  +K+DHDGAEPS NSVS+ NL+RL  I      + YR  A 
Sbjct: 563 DETNGGYFLFSGGGSNIIYVLKDDHDGAEPSANSVSIANLIRLYHIF---DHEPYRTKAN 619

Query: 718 HSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENM 766
            ++ +F  RL  + +A+P M  +   L  P  K ++        DF+ +
Sbjct: 620 KTVKLFAERLSKVPIALPEMVSSLMYLVEPPTKIILSAEDDEISDFKRV 668


>gi|194883110|ref|XP_001975647.1| GG20445 [Drosophila erecta]
 gi|190658834|gb|EDV56047.1| GG20445 [Drosophila erecta]
          Length = 805

 Score =  495 bits (1275), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 280/690 (40%), Positives = 382/690 (55%), Gaps = 60/690 (8%)

Query: 81  YKVVAMAERTPASTSHSR-NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDV 139
           ++ +A     P   S +   K  NRL A  SPYLLQHA+NPVDW+ WGEEAF +AR+ + 
Sbjct: 54  FRTMATGGEAPKEESGAEPAKQGNRLVASKSPYLLQHAYNPVDWYPWGEEAFEKARRENK 113

Query: 140 PIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALY 199
            IFLS+GYSTCHWCHVME ESFE+   A  LN+ FVSIK+DREERPD+DK+YM ++    
Sbjct: 114 IIFLSVGYSTCHWCHVMEHESFENPDTAAFLNEHFVSIKLDREERPDIDKIYMKFLLMTK 173

Query: 200 GGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFA 259
           G GGWP++V+L+PDL PL+ GTYFP + +YG   F  +L+ +   W+  ++ L  +G+  
Sbjct: 174 GSGGWPMNVWLTPDLVPLVAGTYFPHKPQYGMHSFIVVLKTIAKKWNADKEFLLTTGSSM 233

Query: 260 IEQLSEALSASASSNKLPDELPQNALRLCAEQLS---KSYDSRFGGFGSAPKFPRPVEIQ 316
           +  + E+ SA+  S K       +A+   +E ++   + +D  +GGFGS PKFP    I 
Sbjct: 234 LSTILESQSAAEVSFK-----EGSAIDKLSEAINIHKQRFDETYGGFGSEPKFPEVPRIN 288

Query: 317 MMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFE 376
            + +     +D        +   MV+ TL  + KGGI+DH+ GGF RY+  E WH  HFE
Sbjct: 289 FLFHAYLVTKDV-------DVLDMVIETLNQIGKGGINDHIFGGFARYATTEDWHNVHFE 341

Query: 377 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEG 436
           KMLYDQGQL   + +A+ +++D  +      I  YL +D+  P G  ++ EDADS  T  
Sbjct: 342 KMLYDQGQLMGAFANAYKVSRDETFLGYGDKIYKYLVKDLSHPMGGFYAGEDADSLPTHE 401

Query: 437 ATRKKEGAFYVWTSKEVE-----------DILGEHAI-LFKEHYYLKPTGNCDLSRMSDP 484
              K EGAFY WT  E++           DI  E A  ++  HY LKP GN   S  SDP
Sbjct: 402 DKVKVEGAFYAWTWDEIQAAVQDQAQRFDDITAERAFEIYAYHYDLKPPGNVKAS--SDP 459

Query: 485 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVS 544
           H    GKN+LI       + +   +  +K   +L      L  +R +RPRPHLD K+I +
Sbjct: 460 HGHLTGKNILIIRGSEEDTCANFKLEADKLKKLLATTNDILHVLREQRPRPHLDTKIICA 519

Query: 545 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 604
           WNGLV+S   + +                  ++R++YM+ AE    F+R+ +YD +  RL
Sbjct: 520 WNGLVLSGLCKLAN--------------CYSANREQYMQTAEKLLDFLRKEMYDPERKRL 565

Query: 605 QHSF-----------RNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQD 653
             S            +N P +  GFLDDYAFLI GLLD Y+       L WA ELQ TQD
Sbjct: 566 IRSCYGVAVGDETLEKNEP-QIDGFLDDYAFLIKGLLDYYKATLDVDVLHWAKELQETQD 624

Query: 654 ELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYR 713
            LF D + G YF +  + P++++R KEDHDGAEP GNSVS  NLV LA     S    Y 
Sbjct: 625 TLFWDDQNGAYFFSQQDAPNIIMRYKEDHDGAEPCGNSVSAGNLVLLAHYYDESA---YI 681

Query: 714 QNAEHSLAVFETRLKDMAMAVPLMCCAADM 743
           Q A   L  F   +     A+P M  A  M
Sbjct: 682 QKAGKLLNFF-ADVSPFGHALPEMLSALLM 710


>gi|195029929|ref|XP_001987824.1| GH19740 [Drosophila grimshawi]
 gi|193903824|gb|EDW02691.1| GH19740 [Drosophila grimshawi]
          Length = 747

 Score =  495 bits (1274), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 274/638 (42%), Positives = 357/638 (55%), Gaps = 52/638 (8%)

Query: 90  TPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYST 149
           T + T     K +NRLA   SPYLLQHA+NPVDW+ W EEAF  AR  +  IFLS+GYST
Sbjct: 3   TGSETKAPPPKPSNRLATSKSPYLLQHANNPVDWYPWCEEAFERARSENKLIFLSVGYST 62

Query: 150 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 209
           CHWCHVME ESFED   A ++N  FV+IKVDREERPD+DKVYM ++    G GGWP+SV+
Sbjct: 63  CHWCHVMEHESFEDADTAAVMNKHFVNIKVDREERPDIDKVYMQFLLMSKGSGGWPMSVW 122

Query: 210 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 269
           L+P+L PL  GTYFPP+ +YG P F  +L  +   W   R  L  +G+  ++ L    +A
Sbjct: 123 LTPELAPLAAGTYFPPKARYGMPSFTMVLESIAKKWQTDRAALQNAGSILMDALKANQNA 182

Query: 270 SASSNKLPDELPQNALRLCAEQLS---KSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 326
           SA      +  P +A    AE L+   + +D + GGFG  PKFP    +  + +     +
Sbjct: 183 SAVGEAAFE--PGSADAKLAEALNVHKQRFDQQHGGFGREPKFPEVSRLNFLFHAYLVSK 240

Query: 327 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 386
           D        +   MVL TL  + +GGI+DH+ GGF RY+    WH  HFEKMLYDQGQL 
Sbjct: 241 DV-------DVLDMVLQTLDHIGRGGINDHIFGGFARYATTRDWHNVHFEKMLYDQGQLM 293

Query: 387 NVYLDAFSLTK-DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
             + +A+ LT+ + F  Y  R I +YL +D+  P G  F+ EDADS  T   T K EGAF
Sbjct: 294 AAFANAYKLTRSEEFLGYADR-IYEYLLKDLRHPAGGFFAGEDADSLPTHKDTVKVEGAF 352

Query: 446 YVWTSKEVEDILGEHAILFKE------------HYYLKPTGNCDLSRMSDPHNEFKGKNV 493
           Y WT +EV+D        F +            HY +KP GN  +   SDPH    GKNV
Sbjct: 353 YAWTWQEVQDAFRAQKTHFNDVSPDRAFDIYSFHYDMKPGGN--VPPDSDPHGHLTGKNV 410

Query: 494 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSF 553
           LI       + S   + L++   +L      L  VR KRPRPHLD K+I SWNGLV+S  
Sbjct: 411 LIVRGSEEDTCSNFNVELDQLKPLLRTANDILHAVRDKRPRPHLDTKIICSWNGLVLSGL 470

Query: 554 ARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL--------- 604
           A+ +     +              R  Y++ A+    F+R HLYDE+   L         
Sbjct: 471 AKLANCGTGK--------------RNAYLKTAKELVQFLRTHLYDEEQQVLLRSCYGAGV 516

Query: 605 -QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG 663
             ++      +  GFLDDYAFLI GLLD Y+       L WA ELQ TQD+LF D + G 
Sbjct: 517 QDNTLEQNAVRIEGFLDDYAFLIKGLLDYYKASLDMGALRWAKELQGTQDKLFWDEKNGA 576

Query: 664 YFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           YF +  + P+V++R+KEDHDGAEP GNSV+  NL  L 
Sbjct: 577 YFYSQQDAPNVIVRLKEDHDGAEPCGNSVTARNLTLLT 614


>gi|158521543|ref|YP_001529413.1| hypothetical protein Dole_1532 [Desulfococcus oleovorans Hxd3]
 gi|158510369|gb|ABW67336.1| protein of unknown function DUF255 [Desulfococcus oleovorans Hxd3]
          Length = 641

 Score =  494 bits (1273), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 278/657 (42%), Positives = 373/657 (56%), Gaps = 50/657 (7%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           +N LA E SPYLLQHA NPVDW+ W + A A AR+ D PI LSIGY+TCHWCHVM  ESF
Sbjct: 8   SNHLADEKSPYLLQHADNPVDWYPWSDAAIARARQTDRPILLSIGYATCHWCHVMAHESF 67

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPD-LKPLMGG 220
            D   A L+N  FV +KVDREERPD+D++YMT V A+ G GGWPL+VFL P  L P  GG
Sbjct: 68  SDPDTAALMNAHFVCVKVDREERPDIDRLYMTAVSAITGSGGWPLNVFLEPHALAPFFGG 127

Query: 221 TYFPPEDKYGRPG------FKTILRKVKDAW---DKKRDMLAQSGAFAIEQLSEALSASA 271
           TYFPP     RPG      +  +L+++ DAW   DK+  +LA + +     L  AL+ + 
Sbjct: 128 TYFPP-----RPGRTLMITWPDLLQQIADAWENPDKRSSLLASADSITT-FLESALTGTR 181

Query: 272 SSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTG 329
                 D       +   +  +  YDS+ GGFG APKFP P  I  +L    +    D G
Sbjct: 182 HRPAEGDAELTGIYKKALDAFTGMYDSQSGGFGPAPKFPMPAIINFLLACAATDPAADLG 241

Query: 330 -KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 388
             + +  +   M + TL  MA+GGI+D +GGGFHRYS DERWH+PHFEKMLYD  QL   
Sbjct: 242 LDTRQREKALGMAIHTLSAMARGGIYDQLGGGFHRYSTDERWHLPHFEKMLYDNAQLLAC 301

Query: 389 YLDAFSLTKDVFYSYIC--RDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 446
             DA++LT++   S +C  R   DY+ ++M  P G  +SA+DADS E+ GA +K EGAFY
Sbjct: 302 LADAYALTEN--NSLLCRARQTADYILKEMTHPEGGFYSAQDADSPESAGAGKKVEGAFY 359

Query: 447 VWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPH-NEFKGKNVLIELNDSSASA 504
           VW ++E+E +L    A LF  H+ ++P GN     +S PH  EF  KNVL        +A
Sbjct: 360 VWEAREIESLLDAPAAKLFMSHFGVRPEGN-----VSGPHAAEFSHKNVLYGTGPVDQAA 414

Query: 505 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 564
              G+  ++  ++L   R+ L   R  RP P  DDK+I +WNGL+IS  A+  ++ +   
Sbjct: 415 KTFGLSEQETQDLLQTARQTLLAHRKHRPAPDTDDKIITAWNGLMISGLAKLYRVTR--- 471

Query: 565 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 624
                          +Y + A  AA FI+ HLYD QTH L   +R G ++  G  +DYAF
Sbjct: 472 -------------EAQYRDGAVKAARFIQTHLYDPQTHHLARIWRAGEARIDGMAEDYAF 518

Query: 625 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT-TGEDPSVLLRVKEDHD 683
           L  GL+DLYE  +   WL WAI+L       F D + GG F T  G DP +LLR+KED D
Sbjct: 519 LAQGLIDLYEANADAFWLAWAIDLSEEVLASFYDSKNGGIFMTGKGHDPHLLLRMKEDTD 578

Query: 684 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 740
              PS  SV+  N  RL++     ++D +   A  ++      L++   A PL+  A
Sbjct: 579 NVMPSAGSVAARNFYRLSAYTG--RND-FSDAARATINALIPLLEEHPSAAPLLLTA 632


>gi|374297486|ref|YP_005047677.1| thioredoxin domain-containing protein [Clostridium clariflavum DSM
           19732]
 gi|359826980|gb|AEV69753.1| thioredoxin domain protein [Clostridium clariflavum DSM 19732]
          Length = 680

 Score =  494 bits (1272), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 279/665 (41%), Positives = 375/665 (56%), Gaps = 65/665 (9%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           S NK  NRL  E SPYLLQHA+NPV+WF W  EAF +A+  D PIFLSIGYSTCHWCHVM
Sbjct: 2   STNKQANRLIHEKSPYLLQHAYNPVNWFPWSNEAFEKAKSEDKPIFLSIGYSTCHWCHVM 61

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
           E ESFED  VA++LN +F+SIKVDREERPD+D +YM   QAL G GGWPL++F++PD KP
Sbjct: 62  ERESFEDYEVAEILNKYFISIKVDREERPDIDHIYMNVCQALTGHGGWPLTIFMTPDKKP 121

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKL 276
              GTYFP  D+ G  G  +IL  V +AW   R+ L +   + I  ++E        ++ 
Sbjct: 122 FFAGTYFPKNDRMGMSGLMSILESVHNAWTTDREALLKESEYIINAINEHNELLEQDHE- 180

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---YHSKKLEDTGKSGE 333
             EL ++ L     +L  ++D+ FGGFGSAPKFP P  +  +L   Y++K+         
Sbjct: 181 -GELTEDILDKAYSELKFAFDNIFGGFGSAPKFPTPHNLFFLLRYWYNTKE--------- 230

Query: 334 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
                 MV  TL CM KGGI+DH+G GF RYS D +W VPHFEKMLYD   L+  YL+A+
Sbjct: 231 -EYALTMVEKTLACMHKGGIYDHIGFGFSRYSTDRKWLVPHFEKMLYDNALLSIAYLEAY 289

Query: 394 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
             TK   Y+ I  +I  Y+ RDM  P G  +SAEDADS   EG     EG FYVW+  EV
Sbjct: 290 QATKKRDYADIAEEIFTYVLRDMTSPEGGFYSAEDADS---EGM----EGKFYVWSMDEV 342

Query: 454 EDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 512
           + +LGE H   + ++Y + P GN            F+G N+         +  K  +P E
Sbjct: 343 KKVLGEQHGEKYCKYYDITPHGN------------FEGFNI--------PNLIKGNIPDE 382

Query: 513 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 572
           +    + ECR+KLF+ R KR  PH DDK++ SWNGL+I++ A   ++L  E         
Sbjct: 383 E-RPFIEECRKKLFEYREKRVHPHKDDKILTSWNGLMIAALAIGGRVLGKE--------- 432

Query: 573 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 632
                  +Y+  AE AA FI   L      RL   +R+G S  PG++DDYAF I GL++L
Sbjct: 433 -------KYITAAERAAKFISSKLVS-NNGRLLARYRDGESAFPGYVDDYAFFIWGLIEL 484

Query: 633 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 692
           YE      +L  +++L +   + F D   GG F    +   ++ R KE +DGA PSGNSV
Sbjct: 485 YETTYKPVYLKQSLKLNDDLIKYFWDENNGGLFYYGSDSEQLITRPKETYDGAIPSGNSV 544

Query: 693 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHV 752
           S +N +RLA +   S  +     A      F   +++ AM       A  + +    K V
Sbjct: 545 STLNFLRLARLTGRSDLE---DKAYIQFKTFSRNIENFAMGHSFFLTAL-LFAKSKSKEV 600

Query: 753 VLVGH 757
           V+VG+
Sbjct: 601 VIVGN 605


>gi|332020712|gb|EGI61117.1| Spermatogenesis-associated protein 20 [Acromyrmex echinatior]
          Length = 746

 Score =  494 bits (1272), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 295/693 (42%), Positives = 395/693 (56%), Gaps = 65/693 (9%)

Query: 92  ASTSHSRNK-----HTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIG 146
           ASTS   +K       NRL  E SPYLLQHA NPVDW++WG+EA  +A+K +  IF+SIG
Sbjct: 2   ASTSRQDSKSEPEVKKNRLRLERSPYLLQHATNPVDWYSWGDEALEKAKKENKIIFVSIG 61

Query: 147 YSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQA--LYGGGGW 204
           YSTCHWCHVME ESF++E VAK++N+ +V+IKVDREERPD+D + M ++QA  L G GGW
Sbjct: 62  YSTCHWCHVMEKESFKNEEVAKIMNENYVNIKVDREERPDIDMMCMMFIQASRLRGHGGW 121

Query: 205 PLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS 264
           PL+VFL+PDL P+ GGTYF          F   L ++   W + RD + +S A   ++L 
Sbjct: 122 PLNVFLTPDLMPITGGTYF------SCAMFTLYLTRIVKEWTEGRDKMVKSAAIVSDRLK 175

Query: 265 EALSASASSNKLPDELPQ-NALRLCAEQLSKSYDSRFGGFGS-------APKFPRPVEIQ 316
           E LS S    K  D +P  +   LCA  L   YD  +GGFGS       +PKFP P  + 
Sbjct: 176 E-LSTSRHDIK-DDGVPAIDCAFLCAHVLLNIYDEEYGGFGSSSATNPNSPKFPEPTNLN 233

Query: 317 MMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFE 376
            +L     L  +    E S      L TL+ M+ GG+HDHVG GFHRY+VD RW VPHFE
Sbjct: 234 FLL-SMHVLSTSTMLVEMSLNAS--LNTLRKMSFGGLHDHVGKGFHRYTVDARWKVPHFE 290

Query: 377 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEG 436
           KMLYDQ QL   Y+DA+ +TKD F+S I  DI  Y+ R +    G  FSA DADS  T  
Sbjct: 291 KMLYDQAQLIQCYVDAYIITKDSFFSDIVDDIATYVLRMLTHMEGGFFSAVDADSLPTFD 350

Query: 437 ATRKKEGAFYVWTSKEVEDIL-----GEHAI----LFKEHYYLKPTGNCDLSRMSDPHNE 487
           A  K+EGAFYVW+   ++ +L     G+  +    L   H+ ++  GN  + R  DPH E
Sbjct: 351 APAKREGAFYVWSYDNLKALLKKKVPGKDNVTYFDLICRHFSVRKEGN--VERPQDPHGE 408

Query: 488 FKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNG 547
             GKNVL   +    +A+   + +++    + E    L++ RS RP P LDDK++ SWNG
Sbjct: 409 LTGKNVLSMQSGIEDTANHFKLNVKETQKYIKEACTTLYEDRSHRPWPSLDDKMVTSWNG 468

Query: 548 LVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS 607
           L+IS  ARA   +K+                K+Y+E A  AA+F+ ++L+++    L  S
Sbjct: 469 LMISGLARAGIAVKN----------------KDYVEAATEAATFVEKYLFNKDKRILLRS 512

Query: 608 -FRNGPSK-------APGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR 659
            +R    K        PGF +DYAF + GLLDLYE      W+ +A ELQ+ QD LF D 
Sbjct: 513 CYRRRDDKIVQRSDPIPGFHEDYAFFVKGLLDLYEATFNPHWVEFAEELQDIQDRLFWDS 572

Query: 660 EGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 719
           E GGYF    E P +L R K+  DG++PSGNS++  NL+RLA  +     D  R  AE  
Sbjct: 573 EDGGYFAMAEESP-ILTRTKDSDDGSQPSGNSIACSNLLRLAIYL---DRDDLRHKAEKL 628

Query: 720 LAVFETRLKDMAMAVPLMCCAADMLSVPSRKHV 752
           L  F  +L +   A P M  A      P++ +V
Sbjct: 629 LCAFGNKLANCPAACPQMMLALIEFHHPTQIYV 661


>gi|91201579|emb|CAJ74639.1| conserved hypothetical protein [Candidatus Kuenenia
           stuttgartiensis]
          Length = 729

 Score =  494 bits (1272), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 275/685 (40%), Positives = 384/685 (56%), Gaps = 55/685 (8%)

Query: 98  RNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVME 157
           +    N L  E SPYL QHA+NPVDW+ WG+EAF +A+     IFLSIGYSTCHWCHVME
Sbjct: 46  KTNKPNHLIHEKSPYLQQHAYNPVDWYPWGKEAFEKAKAESKVIFLSIGYSTCHWCHVME 105

Query: 158 VESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPL 217
            ESFEDE VAK+LN+++V+IKVDREERPD+D VYMT  QA+ G GGWPL++FL+ + K  
Sbjct: 106 TESFEDEEVAKILNEYYVAIKVDREERPDIDNVYMTVCQAMTGSGGWPLTLFLTSEGKSF 165

Query: 218 MGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLP 277
             GTYFP  ++ G PG   +L ++ + W+  ++ +  S +  + +L +  +AS    K P
Sbjct: 166 YAGTYFPKTERLGNPGLIALLTQIANLWNTNKESIIAS-SLQVTKLIDTETASKGEEK-P 223

Query: 278 DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEG 337
           D      L+   EQLS  +DS +GGFG++PKFP P     +L   K+  +       +  
Sbjct: 224 D---VRTLKTAYEQLSDRFDSLYGGFGTSPKFPTPHNFTFLLRWWKRSNN-------AFA 273

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
            +MV  +L+ MA+GGIHDH+GGGFHRYS DE W  PHFEKMLYDQ  LA  Y++ +  TK
Sbjct: 274 LEMVEKSLELMARGGIHDHLGGGFHRYSTDEYWLTPHFEKMLYDQALLAISYIETYQATK 333

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
              YS I +DI DY+ RDM  P G  +SAEDADS   EG     EG FYVW  +E+++ L
Sbjct: 334 KDLYSAIAKDIFDYVLRDMTSPEGGFYSAEDADS---EGI----EGKFYVWKPEEIKEAL 386

Query: 458 GEHAILFKEHYYLKPTGN--CDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
           GE              GN  CD   +SD  N F+ KN+L        +A    M  +   
Sbjct: 387 GEK------------DGNIFCDFYDVSDIGN-FEDKNILHADKPLHIAAKLENMSPDALE 433

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
             L   R+KL  +R KR +PH D K+I SWNGL+IS+ +R ++ +               
Sbjct: 434 KRLANSRKKLLSIREKRIKPHKDTKIITSWNGLMISALSRGAQAM--------------- 478

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
            D  +Y  VA  AA FI   L  E    L+  +  G S   GFLDDYAF ++GL+DLYE 
Sbjct: 479 -DEPKYTNVAMCAADFILNTLLQENKILLRR-YCQGESAIAGFLDDYAFFVNGLIDLYEA 536

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
               K+L  A+++     + FLD   GG+F +   +  +  + K+ +DGA PSGNS++++
Sbjct: 537 TFQEKYLQAALQINEEMIKNFLDENEGGFFLSGKSNEKLFTQTKDIYDGATPSGNSIALL 596

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 755
           NL+RL  I        Y   A++ +  F   +           CA D    P+ K +++ 
Sbjct: 597 NLLRLGRITGNPS---YEALADNLIKTFSGTILQYPSGYTQFMCALDFALGPT-KEIIVA 652

Query: 756 GHKSSVDFENMLAAAHASYDLNKTV 780
           G +   D +++L    + +  NK +
Sbjct: 653 GEREGNDTKDILREIRSRFLPNKVL 677


>gi|195430492|ref|XP_002063288.1| GK21469 [Drosophila willistoni]
 gi|194159373|gb|EDW74274.1| GK21469 [Drosophila willistoni]
          Length = 752

 Score =  494 bits (1272), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 272/628 (43%), Positives = 357/628 (56%), Gaps = 52/628 (8%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K  NRL    SPYLLQHA+NPVDW+ W EEAF  ARK +  IFLS+GYSTCHWCHVME E
Sbjct: 18  KSGNRLINSKSPYLLQHAYNPVDWYPWCEEAFELARKENKLIFLSVGYSTCHWCHVMEHE 77

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFE+   A ++N  FV+IKVDREERPD+DKVYM ++    G GGWP+SV+L+PDL PL  
Sbjct: 78  SFENPETAAVMNKHFVNIKVDREERPDIDKVYMQFLLLSKGSGGWPMSVWLTPDLAPLAA 137

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           GTYFPP  ++G P F  +L  + + W   R+ L ++G+  ++ L +   A+A +    + 
Sbjct: 138 GTYFPPHSRWGMPSFTKVLESIANKWQTDRESLLKAGSTVLKALQKNQDAAAVAEAAFE- 196

Query: 280 LPQNALRLCAEQLS---KSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
            P +A     E L+   + YD   GGFG  PKFP    +  + +     +D        +
Sbjct: 197 -PGSAEEKLMEALNVHKQRYDQAHGGFGREPKFPEIPRLNFLFHAYLVTKDV-------D 248

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
              MV+ TL  + +GGI+DHV GGF RY+    WH  HFEKMLYDQGQL   Y +A+ LT
Sbjct: 249 VLDMVMQTLDHIGRGGINDHVFGGFCRYATTRDWHNVHFEKMLYDQGQLMAAYANAYKLT 308

Query: 397 K-DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           + D+F SY  + I  YL +D+  P G  ++ EDADS  T   T K EGAFY WT  E+++
Sbjct: 309 RSDLFLSYADK-IYRYLIKDLRHPAGGFYAGEDADSLPTHQDTVKVEGAFYAWTWSEIQE 367

Query: 456 ILGEHAILFKE------------HYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 503
                A  F E            HY L+P GN  +   SDPH    GKN+LI       +
Sbjct: 368 TFKSQAQCFGEVSPERAFEIYTFHYDLQPKGN--VPPASDPHGHLTGKNILIVKGSEEDT 425

Query: 504 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 563
            S   + LE+   IL      L  VR KRPRPHLD K+I  WNGLV+S  ++ +    ++
Sbjct: 426 CSNFNLELEQLQQILETANDILHSVRDKRPRPHLDTKIICGWNGLVLSGLSKLANCGTTK 485

Query: 564 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS----------FRNGPS 613
                         R EYM+ A+    F+RR +YD++   LQ S                
Sbjct: 486 --------------RDEYMQTAKELVDFLRREMYDKERKLLQRSCYGSGVEDNTLEKNEL 531

Query: 614 KAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPS 673
           +  GFLDDYAFLI GLLD Y+       L WA ELQ +QD+LF D++ G YF +    P+
Sbjct: 532 QIEGFLDDYAFLIKGLLDYYKASLDLSVLSWAKELQESQDKLFWDQQNGAYFFSQQNAPN 591

Query: 674 VLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           V++R+KEDHDGAEP GNSVS  NL  L+
Sbjct: 592 VIVRLKEDHDGAEPCGNSVSARNLTLLS 619


>gi|195382934|ref|XP_002050183.1| GJ22002 [Drosophila virilis]
 gi|194144980|gb|EDW61376.1| GJ22002 [Drosophila virilis]
          Length = 747

 Score =  494 bits (1271), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 268/638 (42%), Positives = 354/638 (55%), Gaps = 52/638 (8%)

Query: 90  TPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYST 149
           T   T     KH NRLAA  SPYLLQHAHNPVDW+ W EEAF  AR  +  IFLS+GYST
Sbjct: 3   TGGETKAQSPKHINRLAASKSPYLLQHAHNPVDWYPWCEEAFERARSENKLIFLSVGYST 62

Query: 150 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 209
           CHWCHVME ESFED   A ++N  FV+IKVDREERPD+DKVYM ++    G GGWP+SV+
Sbjct: 63  CHWCHVMEHESFEDADTAAVMNKHFVNIKVDREERPDIDKVYMQFLLMSKGSGGWPMSVW 122

Query: 210 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 269
           L+PDL PL  GTYFPP+ +YG P F  +L  +   W   R  L ++G+  +E +    +A
Sbjct: 123 LTPDLAPLAAGTYFPPKARYGMPSFTMVLESIAKKWQTDRTSLKKAGSTLMEAMRANQNA 182

Query: 270 SASSNKLPDELPQNALRLCAEQLS---KSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 326
              +    +  P +A    AE L+   + +D    GFG  PKFP    +  + +     +
Sbjct: 183 GTDAEAAFE--PGSADAKLAEALAVHKQRFDQEHAGFGREPKFPEVPRLNFLFHAYLVSK 240

Query: 327 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 386
           D        +   MVL TL  + +GGI+DH+ GGF RY+    WH  HFEKMLYDQGQL 
Sbjct: 241 DV-------DVLDMVLQTLDHIGRGGINDHIFGGFARYATTRDWHNVHFEKMLYDQGQLM 293

Query: 387 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 446
             Y +A+ LT+   +      I +YL +D+  P G  ++ EDADS  T   T K EGAFY
Sbjct: 294 AAYANAYKLTRSKEFLRYADRIYEYLIKDLRHPAGGFYAGEDADSLPTHADTVKVEGAFY 353

Query: 447 VWTSKEVEDILGEHAILFKE------------HYYLKPTGNCDLSRMSDPHNEFKGKNVL 494
            WT  EV+         F +            HY +KP GN  +   SDPH    GKN+L
Sbjct: 354 AWTWDEVKQAFEAQQARFNDVSPARVFEIYCFHYGMKPAGN--VPPASDPHGHLTGKNIL 411

Query: 495 IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFA 554
           I       + S   + + +   +L      L  +R +RPRPHLD K+I  WNGLV+S  +
Sbjct: 412 IVRGSEEDTCSNFNLEMAQLSQLLETANDILHKIRDQRPRPHLDTKIICGWNGLVLSGLS 471

Query: 555 RASKILKSEAESAMFNFPVVGSDRKE-YMEVAESAASFIRRHLYD-EQTHRLQHSFRNG- 611
           + +                 G+D+++ Y+  A+    F+R HLYD EQ   L+  +  G 
Sbjct: 472 KLAN---------------CGTDKRDAYLATAKQLMDFLRTHLYDGEQKLLLRSCYGAGV 516

Query: 612 --------PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG 663
                   P++  GFLDDYAFL+ GLLD Y+       L WA ELQ TQD+LF D + G 
Sbjct: 517 QDNTLEQNPTRIEGFLDDYAFLVKGLLDYYKASLDMSALHWAKELQVTQDKLFWDEKNGA 576

Query: 664 YFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           YF +    P+V++R+KEDHDGAEP GNSV+  NL  L+
Sbjct: 577 YFFSQQNAPNVIVRLKEDHDGAEPCGNSVAARNLTLLS 614


>gi|347839355|emb|CCD53927.1| similar to DUF255 domain protein [Botryotinia fuckeliana]
          Length = 823

 Score =  493 bits (1270), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 266/658 (40%), Positives = 389/658 (59%), Gaps = 29/658 (4%)

Query: 85  AMAERTPASTSHSRN---KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPI 141
            M  +   +    RN   +  NR +   SPY+  H+ NPV W  WG+EA   AR+ +  +
Sbjct: 16  GMLGKATTTVPEQRNDIVQLVNRASESKSPYVRAHSANPVAWQLWGDEAIDLARRENKLL 75

Query: 142 FLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGG 201
           F+SIGYS+CHWCH+ME ESFE+E VA +LN  F+ IK+DREERPD+D++YM +VQA  G 
Sbjct: 76  FVSIGYSSCHWCHIMERESFENEEVAAILNSSFIPIKIDREERPDIDRIYMNFVQATTGS 135

Query: 202 GGWPLSVFLSPDLKPLMGGTYF----PPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGA 257
           GGWPL+VFL+P L+P+ GGTY+       D   +  F  IL K+   W ++     Q  A
Sbjct: 136 GGWPLNVFLTPSLEPVFGGTYWRGPSKTTDFEDQVDFLGILDKLSTVWSEQESRCRQDSA 195

Query: 258 FAIEQLSEALSASASSNKLP---DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVE 314
            +++QL +  +    SN+L    D +    L    E  + SYD   GGFGSAPKFP P +
Sbjct: 196 QSLQQLKDFANEGTLSNRLGEGVDNIDLELLEEVTEHFASSYDKANGGFGSAPKFPTPSK 255

Query: 315 IQMML---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWH 371
           I  +L      + + D     +    +++ + TL+ MA+GGIHDH+G GF RYS    W 
Sbjct: 256 IAFLLRLGQFPQAVVDIVGLPDCQNAREIAITTLRKMARGGIHDHIGNGFARYSATADWS 315

Query: 372 VPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADS 431
           +PHFEKMLYD  QL ++YLD F L++D  +  +  DI +YL   +    G  +S+EDADS
Sbjct: 316 LPHFEKMLYDNAQLLHLYLDGFLLSRDPEFLGVAYDIANYLTTTLSHSEGGFYSSEDADS 375

Query: 432 AETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGK 491
               G + K+EGA+YVWT +E E+ILG    L    ++   TG+ ++ + +DPH+EF  +
Sbjct: 376 YYKNGDSEKREGAYYVWTKREFENILGSERGLILSAFF-NVTGHGNVGQENDPHDEFMDQ 434

Query: 492 NVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVI 550
           NVL   +  SA AS+ G+   + + ++ E + +L   R + R +P +DDKV+VSWNG+ +
Sbjct: 435 NVLAISSTPSALASQFGIKESEIIKVIKEGKAQLRRRRETDRVKPAMDDKVVVSWNGIAV 494

Query: 551 SSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRN 610
            + AR S ++        F+ PV     +EY++ A  AA+FI+++LYD++   L   +R 
Sbjct: 495 GALARLSSVING------FD-PVKA---QEYLDAALKAATFIKKNLYDDKAKILYRIWRE 544

Query: 611 GPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG-GGYFNTTG 669
           G     GF DDYAFLI GL+DLYE     KWL WA ELQ +Q  LF D+ G G +F+TT 
Sbjct: 545 GRGDTQGFADDYAFLIEGLIDLYETTFDEKWLQWADELQQSQINLFYDKNGTGAFFSTTV 604

Query: 670 EDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 727
             P+V+LR+K+  D +EPS N +S  NL RL+S+      + Y + A+ ++  FE  +
Sbjct: 605 SAPNVILRLKDAMDSSEPSTNGISSSNLYRLSSMF---NDESYAKKAKETVKSFEAEM 659


>gi|365158244|ref|ZP_09354475.1| hypothetical protein HMPREF1015_02341 [Bacillus smithii 7_3_47FAA]
 gi|363621167|gb|EHL72387.1| hypothetical protein HMPREF1015_02341 [Bacillus smithii 7_3_47FAA]
          Length = 678

 Score =  493 bits (1270), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 283/668 (42%), Positives = 390/668 (58%), Gaps = 60/668 (8%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           ++ K  NRL  E SPYLLQHA+NPVDW+ WG EAF +A+  + P+F+SIGYSTCHWCHVM
Sbjct: 2   TKGKKANRLIQEKSPYLLQHAYNPVDWYPWGNEAFEKAKSENKPVFVSIGYSTCHWCHVM 61

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
           E ESFED  VA+LLN +FV+IKVDREERPD+D VYMT  Q + G GGWPL+VFL+PD KP
Sbjct: 62  ERESFEDPEVAELLNQYFVAIKVDREERPDIDSVYMTVCQMMTGQGGWPLTVFLTPDKKP 121

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKL 276
              GTYFP   +YGRPG   IL ++  A+ +  D +A  G+  +E L E      +  K 
Sbjct: 122 FYAGTYFPKNSQYGRPGMMDILPQLHRAYHQDPDRIADIGSRLVEALKE-----EAGRKS 176

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEA 334
             ++ + A+    EQL+  +DS +GGFG APKFP P ++  +   YH         +GE 
Sbjct: 177 EGDVTEEAVHKGFEQLAGKFDSLYGGFGEAPKFPSPHQLLFLFRYYHM--------TGEE 228

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
           S   KM   TL  MA GGI+DH+GGGF RYS D  W VPHFEKMLYD   L   Y +A+ 
Sbjct: 229 S-ALKMAEKTLDSMAAGGIYDHIGGGFSRYSTDGMWLVPHFEKMLYDNALLMYAYTEAYQ 287

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
           +TK+  Y  I  +I D++ R+M  P G  +SA DADS   EG    +EG FYVW+ +E+ 
Sbjct: 288 ITKNERYRRIVLEIADFVAREMTHPEGGFYSAIDADS---EG----EEGKFYVWSKEEIM 340

Query: 455 DILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN-DSSASASKLGMPLE 512
           D+LGE    +F E Y++   GN            F+GKN+L  L  D    A+   + +E
Sbjct: 341 DVLGEETGTIFSELYHVTDQGN------------FEGKNILHLLQTDLETIAANHELSIE 388

Query: 513 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 572
           +  N++ + ++ LF  R KR +PH+DDKV+ SWNGL+I++ A+A  +         F+ P
Sbjct: 389 ELENLMSKAKQFLFQAREKRVKPHVDDKVLTSWNGLMIAALAKAGSV---------FDDP 439

Query: 573 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 632
            + S        A  A +F+ ++++ E+  RL   FR G +K  G+LDDYAFL+ G L+L
Sbjct: 440 GLLSQ-------ARKAMAFLEKYVWKEK--RLMARFREGEAKYRGYLDDYAFLLWGTLEL 490

Query: 633 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 692
           +        L +AIEL+N   E F D E GG+F T  +   +L+R K  +DGA PSGNSV
Sbjct: 491 FLAEDDLHMLSFAIELKNALFERFWD-ENGGFFFTDRDGEELLVREKPGYDGAYPSGNSV 549

Query: 693 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHV 752
           +   L RLA +    +     +  E  +  F   L    +++  M  AA  L    R+ V
Sbjct: 550 AAYQLWRLAKLTGDIE---LMKRVEMCVRSFSKELNAFPVSMLYMLEAAMALFAQGRE-V 605

Query: 753 VLVGHKSS 760
           +++G   S
Sbjct: 606 IVIGSNGS 613


>gi|410661555|ref|YP_006913926.1| Thymidylate kinase [Dehalobacter sp. CF]
 gi|409023911|gb|AFV05941.1| Thymidylate kinase [Dehalobacter sp. CF]
          Length = 741

 Score =  493 bits (1270), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 288/706 (40%), Positives = 399/706 (56%), Gaps = 68/706 (9%)

Query: 101 HTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVES 160
           + NRLA E SPYLLQHA NPVDWF WGEEAF +A++ + P+FLSIGYSTCHWCHVME ES
Sbjct: 7   NANRLAGEKSPYLLQHALNPVDWFPWGEEAFQKAKEENKPVFLSIGYSTCHWCHVMERES 66

Query: 161 FEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGG 220
           FED+ VA +LN  ++ +KVDREERPD+D++YMTY Q + G GGWPL+V ++PD +P   G
Sbjct: 67  FEDKEVAAILNRSYIPVKVDREERPDIDQLYMTYCQVMTGAGGWPLTVLMTPDKQPFFAG 126

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL----SASASSNKL 276
           TYFP    YGRPG   IL +V + W  ++D + Q+ A   E ++       +A+++  K 
Sbjct: 127 TYFPKHSHYGRPGLMDILSQVGELWQTEKDKVIQTAAELYETVTRHYRGDKNATSAVPKN 186

Query: 277 PDELP---------------QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYH 321
              LP               +  L    E L   +DS++GGFGSAPKFP P  +  +L +
Sbjct: 187 KQTLPFTEKEKDSGDIAIWGKTLLGKGYELLENKFDSKYGGFGSAPKFPAPHNLGFLLRY 246

Query: 322 SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 381
           S  +E+       S+   MV  TL  MA GGI DH+G GF RYS D  W VPHFEKMLYD
Sbjct: 247 S--MEEP-----QSKALAMVEKTLDSMADGGIFDHIGFGFARYSTDHYWLVPHFEKMLYD 299

Query: 382 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 441
              LA VYL+A+  TK+  Y  + ++I  Y+ RDM    G  +SAEDADS   EG    +
Sbjct: 300 NAGLALVYLEAYQRTKNQKYRRVAQNIFGYVLRDMTSAEGGFYSAEDADS---EG----E 352

Query: 442 EGAFYVWTSKEVEDILGEHAILFKEHYYL----KPTGN---------CDLSRMSDPHNEF 488
           EG +Y+W+  E+   L +     ++   L    KP            CD   ++D  N +
Sbjct: 353 EGKYYLWSKDEIRKTLQDGIESLQKERELKNGFKPLSKQKEEVADIYCDAYGITDEGN-Y 411

Query: 489 KGKNVL-----IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIV 543
           +GKN+      + + D ++  S  G  L + L+I   C   LF  R KR RP  DDK++V
Sbjct: 412 EGKNIPSRIFHVGVGDLTSRYSLTGDELGEMLDI---CNTILFSAREKRVRPAKDDKILV 468

Query: 544 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHR 603
           SWNGL+I + A+  ++L  +            +D+K  +  AE+AA FIR  ++D +  R
Sbjct: 469 SWNGLMIGALAKGVQVLSGDLSWE--------NDKKSLLLTAENAAGFIRDKMFDSRG-R 519

Query: 604 LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG 663
           L   +R G +  PG+LDDYAFL+ GLL+LY     T++L  AI LQ  Q++LF D   GG
Sbjct: 520 LLARYREGEAGIPGYLDDYAFLVHGLLELYTACGKTEYLEQAIFLQEEQEKLFRDETNGG 579

Query: 664 YFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 723
           Y+ T  +   +LLR KE +DGA PSGNS+S  NL RL  +   SK   +++ AE  +  F
Sbjct: 580 YYFTGCDAEELLLRPKEIYDGAMPSGNSMSACNLGRLWRLTGLSK---WQERAEKQINSF 636

Query: 724 ETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAA 769
            T ++D          A    ++   + +VL G  ++   E M  A
Sbjct: 637 RTTVEDYPPGYTAFLQAI-QYTLNQGEELVLSGSSANQTLEKMQTA 681


>gi|331269923|ref|YP_004396415.1| thymidylate kinase [Clostridium botulinum BKT015925]
 gi|329126473|gb|AEB76418.1| thymidylate kinase [Clostridium botulinum BKT015925]
          Length = 671

 Score =  493 bits (1269), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 255/607 (42%), Positives = 362/607 (59%), Gaps = 59/607 (9%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           N  +N+L  E SPYLLQHAHNPVDW+ W EEAF +A+K D PIFLSIGYS+CHWCHVME 
Sbjct: 4   NDKSNKLINEKSPYLLQHAHNPVDWYPWCEEAFLKAKKEDKPIFLSIGYSSCHWCHVMEK 63

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE VAK+LND ++SIKVDREERPDVD  YMT+ Q++ G GGWPL++ ++P+ KP  
Sbjct: 64  ESFEDEEVAKILNDKYISIKVDREERPDVDNTYMTFCQSVTGSGGWPLTIIMTPEQKPFF 123

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFP +  YGRPGF  IL+++ D W   ++ +  +    +  + E +S   S      
Sbjct: 124 AGTYFPKKSMYGRPGFIQILKQISDEWKSNKNNIINTSNELLNTMEEHISQDKSG----- 178

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
           E+ +  L+    +++  YD+++GGFG++PKFP P ++ ++L + K   +    G      
Sbjct: 179 EINETILQDAVIEMNYYYDNKYGGFGASPKFPTPHKLMLLLINYKVYNNKNALG------ 232

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
            MV  TL+CM KGGI DH+G GF RYS DE+W VPHFEKMLYD   LA VY  A+ +T  
Sbjct: 233 -MVENTLKCMYKGGIFDHIGFGFSRYSTDEKWLVPHFEKMLYDNALLAYVYTQAYQVTGK 291

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
            FY  +   I  Y+ RDM  P G  +SAEDADS   EG     EG FYVWT  E+E ILG
Sbjct: 292 SFYKEVAEKIFKYILRDMTSPEGGFYSAEDADS---EGV----EGKFYVWTLHEIESILG 344

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
           E A  F   Y +   GN            F+G N+           + +G  L+  ++ L
Sbjct: 345 EDAKEFCNIYNITKNGN------------FEGSNI----------PNLIGKDLDD-IDKL 381

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
              R+KLF+VR KR  P  DDK++ +WN L+I + A A ++ ++E               
Sbjct: 382 ESLRKKLFEVREKRIHPFKDDKILTAWNALMIVALAYAGRVFENE--------------- 426

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
            +Y+  A+ A +FI  +L   +  RL   FR+G +    +L+DY+FL+  L++LYE    
Sbjct: 427 -KYINRAKKAYNFIENNLI-RKDGRLLARFRHGEAAYIAYLEDYSFLVWALMELYEATFD 484

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
           +K+L  A+   +   +LF D E  G+F++  +   ++L +K+ +D A PSGNS++ +NL+
Sbjct: 485 SKYLKQALHFTDEMIKLFWDEESYGFFHSGKDGEKLILNLKDSYDMAIPSGNSIAAMNLI 544

Query: 699 RLASIVA 705
           +L+ I  
Sbjct: 545 KLSKITG 551


>gi|410658568|ref|YP_006910939.1| Thymidylate kinase [Dehalobacter sp. DCA]
 gi|409020923|gb|AFV02954.1| Thymidylate kinase [Dehalobacter sp. DCA]
          Length = 741

 Score =  493 bits (1269), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 288/706 (40%), Positives = 399/706 (56%), Gaps = 68/706 (9%)

Query: 101 HTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVES 160
           + NRLA E SPYLLQHA NPVDWF WGEEAF +A++ + P+FLSIGYSTCHWCHVME ES
Sbjct: 7   NANRLAGEKSPYLLQHALNPVDWFPWGEEAFQKAKEENKPVFLSIGYSTCHWCHVMERES 66

Query: 161 FEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGG 220
           FED+ VA +LN  ++ +KVDREERPD+D++YMTY Q + G GGWPL+V ++PD +P   G
Sbjct: 67  FEDKEVAAILNRSYIPVKVDREERPDIDQLYMTYCQVMTGAGGWPLTVLMTPDKQPFFAG 126

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL----SASASSNKL 276
           TYFP    YGRPG   IL +V + W  ++D + Q+ A   E ++       +A+++  K 
Sbjct: 127 TYFPKHSHYGRPGLMDILSQVGELWQTEKDKVIQTAAELYETVTRHYRGDKNATSAVPKN 186

Query: 277 PDELP---------------QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYH 321
              LP               +  L    E L   +DS++GGFGSAPKFP P  +  +L +
Sbjct: 187 KQTLPFTEKEKDSGDIAIWGKTLLGKGYELLENKFDSKYGGFGSAPKFPAPHNLGFLLRY 246

Query: 322 SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 381
           S  +E+       S+   MV  TL  MA GGI DH+G GF RYS D  W VPHFEKMLYD
Sbjct: 247 S--MEEP-----QSKALAMVEKTLDSMADGGIFDHIGFGFARYSTDHYWLVPHFEKMLYD 299

Query: 382 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 441
              LA VYL+A+  TK+  Y  + ++I  Y+ RDM    G  +SAEDADS   EG    +
Sbjct: 300 NAGLALVYLEAYQRTKNQKYRRVAQNIFGYVLRDMTSAEGGFYSAEDADS---EG----E 352

Query: 442 EGAFYVWTSKEVEDILGEHAILFKEHYYL----KPTGN---------CDLSRMSDPHNEF 488
           EG +Y+W+  E+   L +     ++   L    KP            CD   ++D  N +
Sbjct: 353 EGKYYLWSKDEIRKTLQDGIESLQKERELKNGFKPLSKQKEEVADIYCDAYGITDEGN-Y 411

Query: 489 KGKNVL-----IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIV 543
           +GKN+      + + D ++  S  G  L + L+I   C   LF  R KR RP  DDK++V
Sbjct: 412 EGKNIPSRIFHVGVGDLTSRYSLTGDELGEMLDI---CNTILFSAREKRVRPAKDDKILV 468

Query: 544 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHR 603
           SWNGL+I + A+  ++L  +            +D+K  +  AE+AA FIR  ++D +  R
Sbjct: 469 SWNGLMIGALAKGVQVLSGDLSWE--------NDKKSLLLTAENAAGFIRDKMFDSRG-R 519

Query: 604 LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG 663
           L   +R G +  PG+LDDYAFL+ GLL+LY     T++L  AI LQ  Q++LF D   GG
Sbjct: 520 LLARYREGEAGIPGYLDDYAFLVHGLLELYTACGKTEYLEQAIFLQEEQEKLFRDETNGG 579

Query: 664 YFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 723
           Y+ T  +   +LLR KE +DGA PSGNS+S  NL RL  +   SK   +++ AE  +  F
Sbjct: 580 YYFTGCDAEELLLRPKEIYDGAMPSGNSMSACNLGRLWRLTGLSK---WQERAEKQINSF 636

Query: 724 ETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAA 769
            T ++D          A    ++   + +VL G  ++   E M  A
Sbjct: 637 RTTVEDYPPGYTAFLQAI-QYALNQGEELVLSGSSANQTLEKMQTA 681


>gi|168186605|ref|ZP_02621240.1| thymidylate kinase [Clostridium botulinum C str. Eklund]
 gi|169295490|gb|EDS77623.1| thymidylate kinase [Clostridium botulinum C str. Eklund]
          Length = 693

 Score =  493 bits (1269), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 269/679 (39%), Positives = 384/679 (56%), Gaps = 63/679 (9%)

Query: 96  HSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHV 155
             +  + N+L  E SPYLLQHAHNPVDW+ W EEAF +A++ D PIFLSIGYS+CHWCHV
Sbjct: 9   QGKQSNPNKLINEKSPYLLQHAHNPVDWYPWCEEAFIKAKEEDKPIFLSIGYSSCHWCHV 68

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           ME ESFEDE VAKLLND ++SIKVDREERPDVD +YMT+ QA+ G GGWPL++ ++PD K
Sbjct: 69  MEKESFEDEEVAKLLNDKYISIKVDREERPDVDNIYMTFCQAVTGSGGWPLTIIMAPDQK 128

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           P   GTYFP +  YGRPG   IL ++ D W+  RD +  +    +  + E  S   S   
Sbjct: 129 PFFAGTYFPKKRMYGRPGLIQILNQIADEWENNRDGVINASNELLNTMKEHTSQDKSG-- 186

Query: 276 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
              E+ +N L+   +++   YD  +GGFG APKFP P ++ ++L + K+  +        
Sbjct: 187 ---EINENVLQDAIKEMKHYYDESYGGFGIAPKFPTPHKLMLLLTYYKEYNN-------K 236

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
               MV  TL+CM KGGI DH+G GF RYS DE+W VPHFEKMLYD   LA VY   + +
Sbjct: 237 IALHMVENTLKCMYKGGIFDHIGFGFSRYSTDEKWLVPHFEKMLYDNALLAYVYTQTYQI 296

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           T  +FY  +   I  Y+ RDM  P G  +SAEDADS   EG     EG FY+WT  EVE+
Sbjct: 297 TGKLFYKEVAEKIFTYVLRDMTSPEGGFYSAEDADS---EGV----EGKFYLWTLHEVEN 349

Query: 456 ILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
           IL E A  F   Y +   GN            F+G N+           + +G  LE   
Sbjct: 350 ILKEDAKEFCNTYDITKGGN------------FEGSNI----------PNLIGKDLEN-T 386

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
           + L   R+KLF VR KR  P  DDK++ +WN L+IS+ A A ++ +++            
Sbjct: 387 DKLENLRKKLFQVREKRVHPFKDDKILTAWNALMISALAYAGRVFENQ------------ 434

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
               EY++ A+ A +FI  +L   +  RL   FR+G +    +++DY+FL+  LL+LYE 
Sbjct: 435 ----EYIDRAKEAYNFIENNLI-RKDGRLLARFRHGEAAYIAYIEDYSFLVWALLELYEA 489

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
              +K+L  A++  +   +LF D E  G+F++  +   ++L +K+ +D A PSGNSV+ +
Sbjct: 490 TFESKFLKEALQFTDEMIKLFWDEESYGFFHSGKDGEKLILNLKDSYDTAIPSGNSVAAM 549

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 755
           NL++L+ I   +      + A   L  F   +K+   +  +          PS K +++ 
Sbjct: 550 NLIKLSKITGDNS---LGEKAYKMLEGFGGNIKESLQSHSIFLMVYMNYIRPS-KQIIIA 605

Query: 756 GHKSSVDFENMLAAAHASY 774
             K    F++M+   +  +
Sbjct: 606 SKKEDKVFKDMIREVNKRF 624


>gi|410671814|ref|YP_006924185.1| hypothetical protein Mpsy_2614 [Methanolobus psychrophilus R15]
 gi|409170942|gb|AFV24817.1| hypothetical protein Mpsy_2614 [Methanolobus psychrophilus R15]
          Length = 703

 Score =  493 bits (1268), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 277/696 (39%), Positives = 391/696 (56%), Gaps = 48/696 (6%)

Query: 86  MAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSI 145
           M E  P    H+     NRLA E SPYLLQHAHNPVDW+ WGEEAF +A++ D PIFLSI
Sbjct: 1   MQENKPDDNEHN----VNRLAGEKSPYLLQHAHNPVDWYPWGEEAFNKAKQDDKPIFLSI 56

Query: 146 GYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWP 205
           GYSTCHWCHVME ESFED  VA+L+N+ FV IKVDREERPD+D +YM+  QAL G GGWP
Sbjct: 57  GYSTCHWCHVMERESFEDPQVAELMNEAFVPIKVDREERPDIDTIYMSVCQALTGRGGWP 116

Query: 206 LSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 265
           LS+ ++PD KP M  TY P E +YG  G   I+  V + W ++R+ L  +     E++  
Sbjct: 117 LSIIMTPDKKPFMAATYIPRESRYGMAGMLDIVPAVSNMWTRQREELIANA----EEIVS 172

Query: 266 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 325
           A+S  A  +     L ++ L    + L  S+D    GFG+APKFP P  ++ +L + K+ 
Sbjct: 173 AISGGARDSTEGPGLDESTLDRTYQLLRSSFDPSSAGFGNAPKFPTPHHLKFLLRYWKR- 231

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
               K  +A E   M   TL+ M KGGI+DH+G GFHRYS D RW VPHFEKMLYDQ  +
Sbjct: 232 ---SKEDKALE---MAEETLKAMRKGGIYDHIGFGFHRYSTDSRWLVPHFEKMLYDQALI 285

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
           +   ++ +  T++  Y     ++  Y+ RDM  P G  +SAEDADS +       +EG F
Sbjct: 286 SIALVETYQATQNPEYRENAEEVFSYVLRDMHSPEGGFYSAEDADSED-------EEGRF 338

Query: 446 YVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 504
           Y+WT +E+ED+LGE  A LFKE ++  P GN  L   S  H    G+N+L        +A
Sbjct: 339 YLWTEQELEDVLGEMDAGLFKEVFHTSPGGNF-LDEASMTHT---GRNILHLEESLREAA 394

Query: 505 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 564
            + G   +++   L   RRKLF+ R  R  P  DDK++  WN L+I + ++A++      
Sbjct: 395 ERRGEDYDRFRQSLESSRRKLFEHREMRVHPSKDDKIMTDWNSLMIVALSKAARAF---- 450

Query: 565 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 624
                       D   Y + A   A FI   +      RL H +R+G     GFLDDYAF
Sbjct: 451 ------------DEPAYAQEAALTADFILSKMISPNG-RLFHRYRDGEVAVEGFLDDYAF 497

Query: 625 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 684
            I GL++LY+    T++L  A+   +     F D   GG+F+T  +   +++R KE +DG
Sbjct: 498 FIWGLIELYQATFNTEYLRNALRFNDQLILHFRDSIHGGFFHTADDSEKLIMRSKEIYDG 557

Query: 685 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 744
           A PSGNSV  +NL+ L  I   +      + A   + +F  ++  M +    + CA D  
Sbjct: 558 AIPSGNSVCALNLLHLGRITGNTD---LEKKAYEIMQLFSGQVSKMPVGYTQLMCALDFA 614

Query: 745 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           + PSR+ +V+ G   S + + +++  +  +  NK +
Sbjct: 615 AGPSRE-IVVAGDPESEETQGIISDINREFVPNKVI 649


>gi|195485941|ref|XP_002091297.1| GE13577 [Drosophila yakuba]
 gi|194177398|gb|EDW91009.1| GE13577 [Drosophila yakuba]
          Length = 809

 Score =  492 bits (1267), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 267/642 (41%), Positives = 361/642 (56%), Gaps = 58/642 (9%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K  NRL A  SPYLLQHA+NPVDW+ WGEEAF +AR  +  IFLS+GYSTCHWCHVME E
Sbjct: 75  KQGNRLVASKSPYLLQHAYNPVDWYPWGEEAFEKARSENKIIFLSVGYSTCHWCHVMEHE 134

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFE    A ++N+ FV+IKVDREERPD+DK+YM ++    G GGWP+SV+L+P L PL+ 
Sbjct: 135 SFESPVTAAIMNEKFVNIKVDREERPDIDKIYMQFLLMSKGSGGWPMSVWLTPTLAPLVA 194

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           GTYFPP+ +YG P F  +L+ +   W+  ++ L  +G+  +  L +   ASA +      
Sbjct: 195 GTYFPPKSRYGMPSFNAVLKSIAKKWETDKESLLTAGSTLLTALQKNQDASAVAEAAFG- 253

Query: 280 LPQNALRLCAEQLS---KSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
              +A+   +E ++   + +D   GGFGS PKFP    I  + +     +D       ++
Sbjct: 254 -VGSAIEKLSEAINVHKQRFDQTHGGFGSEPKFPEVPRINFLFHAYLVTKD-------AD 305

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
              MV+ TL  + KGGI+DH+ GGF RY+  E WH  HFEKMLYDQGQL   + +A+ +T
Sbjct: 306 VLDMVIETLTQIGKGGINDHIFGGFARYATTEDWHNVHFEKMLYDQGQLMAAFANAYKVT 365

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE-- 454
           +D  +      I  YL +D+  P G  ++ EDADS  T     K EGAFY WT  E++  
Sbjct: 366 RDETFLGYADKIYKYLLKDLRHPLGGFYAGEDADSLPTHEDNVKVEGAFYAWTWDEIQAA 425

Query: 455 ---------DILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 504
                    DI  E A  ++  HY LKP GN  +   SDPH    GKN+LI       S 
Sbjct: 426 FKDQAQRLDDITPERAFEIYAYHYDLKPPGN--VPAYSDPHGHLTGKNILIVRGSEEDSI 483

Query: 505 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 564
           +   +  +K+  +L      L  VR +RPRPHLD K+I +WNGLV+S   +         
Sbjct: 484 ANFSLEADKFKKLLATTNDILHVVREQRPRPHLDTKIICAWNGLVLSGLCKLGN------ 537

Query: 565 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL----------QHSFRNGPSK 614
                      ++R +YM+ A+    F+R+ +YD +   L            +     S+
Sbjct: 538 --------CYSANRDQYMQTAKELLDFLRKEMYDPEKKLLIRSCYGVAVGDETLEKNESQ 589

Query: 615 APGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSV 674
             GFLDDYAFLI GLLD Y+       L WA  LQ+TQD+LF D   G YF +  + P+V
Sbjct: 590 IDGFLDDYAFLIKGLLDYYKATLDVDVLHWAKALQDTQDKLFWDERNGAYFFSQQDAPNV 649

Query: 675 LLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNA 716
           ++R+KEDHDGAEP GNSVS  NLV L          YY +NA
Sbjct: 650 IVRLKEDHDGAEPCGNSVSARNLVLLGH--------YYDENA 683


>gi|195583350|ref|XP_002081485.1| GD11041 [Drosophila simulans]
 gi|194193494|gb|EDX07070.1| GD11041 [Drosophila simulans]
          Length = 808

 Score =  492 bits (1267), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 284/702 (40%), Positives = 382/702 (54%), Gaps = 69/702 (9%)

Query: 76  RPIHPYKVVAMAERTPASTSHSRN---KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFA 132
           RP+   K   MA    AS   S+    K  NRL A  SPYLLQHA+NPVDW+ WGEEAF 
Sbjct: 47  RPVSNQKFRTMATGGGASKEVSKEEPAKQGNRLVASKSPYLLQHAYNPVDWYPWGEEAFE 106

Query: 133 EARKRDVPIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYM 192
           +AR  +  IFLS+GYSTCHWCHVME ESFE    A ++N+ FV+IKVDREERPD+DK+YM
Sbjct: 107 KARSENKLIFLSVGYSTCHWCHVMEHESFESPETAAIMNENFVNIKVDREERPDIDKIYM 166

Query: 193 TYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDML 252
            ++    G GGWP+SV+L+P+L PL+ GTYFPP+ +YG P F  +L+ +   W+  ++ L
Sbjct: 167 QFLLMSKGSGGWPMSVWLTPNLAPLVAGTYFPPKSRYGMPSFNAVLKSIARKWETDKESL 226

Query: 253 AQSGAFAIEQLSEALSASASSNKLPDELPQNALRL--CAEQLSKS-------YDSRFGGF 303
             +G+  +  L +   ASA        +P+ A       E+LS++       +D   GGF
Sbjct: 227 LSTGSSLLSALQKNQDASA--------VPEAAFGAGSAIEKLSEAINVHRQRFDQTHGGF 278

Query: 304 GSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHR 363
           GS PKFP    +  + +     +D        +   MV+ TL  + KGGIHDH+ GGF R
Sbjct: 279 GSEPKFPEVPRLNFLFHGYLVTKD-------PDVLDMVIETLTQIGKGGIHDHIFGGFAR 331

Query: 364 YSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEI 423
           Y+  + WH  HFEKMLYDQGQL   + +A+ +T+D  Y      I  YL +D+  P G  
Sbjct: 332 YATTQDWHNVHFEKMLYDQGQLIVAFTNAYKVTRDEIYLGYADKIYKYLIKDLRHPLGGF 391

Query: 424 FSAEDADSAETEGATRKKEGAFYVWTSKEV-----------EDILGEHAI-LFKEHYYLK 471
           ++ EDADS  T     K EGAFY WT  E+           EDI  E A  ++  HY LK
Sbjct: 392 YAGEDADSLPTHEDKVKVEGAFYAWTWDEIQAAFKDQAQRFEDITPERAFEIYAYHYDLK 451

Query: 472 PTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK 531
           P GN  +   SDPH    GKN+LI       + +   +  +++  +L      L  +R K
Sbjct: 452 PPGN--VPTYSDPHGHLTGKNILIVRGSEEDTCANFKLEADQFKKLLATTNDILHVIRDK 509

Query: 532 RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASF 591
           RPRPHLD K+I +WNGLV+S   +                    ++R++YM+ A+    F
Sbjct: 510 RPRPHLDTKIICAWNGLVLSGLCKLGN--------------CYSANREQYMQTAKELLDF 555

Query: 592 IRRHLYDEQTHRL----------QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
           +R+ +YD +   L            +     S+  GFLDDYAFLI GLLD Y+       
Sbjct: 556 LRKEMYDPEQKLLIRSCYGVAVGDETLEKNASQIDGFLDDYAFLIKGLLDYYKATLDVDV 615

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           L WA  LQ+TQD+LF D   G YF +  + P+V++R+KEDHDGAEP GNSVS  NLV LA
Sbjct: 616 LHWAKALQDTQDKLFWDERNGAYFFSQQDAPNVIVRLKEDHDGAEPCGNSVSAHNLVLLA 675

Query: 702 SIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM 743
                   D + Q A   L  F   +     A+P M  A  M
Sbjct: 676 HYY---DEDAFLQKAGKLLNFF-ADVSPFGHALPEMLSALLM 713


>gi|253681418|ref|ZP_04862215.1| dTMP kinase [Clostridium botulinum D str. 1873]
 gi|253561130|gb|EES90582.1| dTMP kinase [Clostridium botulinum D str. 1873]
          Length = 671

 Score =  492 bits (1266), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 265/678 (39%), Positives = 385/678 (56%), Gaps = 63/678 (9%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           ++N  +NRL  E SPYLLQHA+NPVDW+ W EEAF +A++ + PIFLSIGYS+CHWCHVM
Sbjct: 2   NKNSKSNRLINEKSPYLLQHAYNPVDWYPWCEEAFLKAKQDNKPIFLSIGYSSCHWCHVM 61

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
           E ESFEDE VAK+LND ++SIKVDREERPDVD  YMT+ QA+ G GGWPL++ ++P+ KP
Sbjct: 62  EKESFEDEEVAKILNDKYISIKVDREERPDVDNTYMTFCQAVTGSGGWPLTIIMTPEQKP 121

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKL 276
              GTYFP +  YGRPG   IL+++ D W   +D +  +    +  + E +S        
Sbjct: 122 FFAGTYFPKKSMYGRPGIIQILKQISDEWKNNKDNIINTSNKLLNTMKERVSQDKW---- 177

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
            +E+ ++ L     +++  YD+++GGFG APKFP P ++ ++L + K   D    G    
Sbjct: 178 -EEINESILHDAIMEMNYYYDNKYGGFGIAPKFPTPHKLMLLLIYYKVYNDKSALG---- 232

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
              MV  TL+CM KGGI DH+G GF RYS DE+W VPHFEKMLYD   LA VY +A+ +T
Sbjct: 233 ---MVENTLKCMYKGGIFDHIGFGFSRYSTDEKWLVPHFEKMLYDNALLAYVYTEAYQVT 289

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
              FY  +   I  Y+ RDM  P G  +SAEDADS   EG     EG FYVW+ +E++ I
Sbjct: 290 GKSFYKEVAEKIFTYILRDMTSPEGGFYSAEDADS---EGV----EGKFYVWSLEEIQSI 342

Query: 457 LGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
           LGE A  F   Y +   GN            F+GKN+           + +G  LE  ++
Sbjct: 343 LGEDAKEFCNTYDITEKGN------------FEGKNI----------PNLIGKDLEN-ID 379

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
            L + R KLF VR KR  P  DDK++ +WN L+I S + A ++                 
Sbjct: 380 KLKDLRNKLFKVREKRVHPFKDDKILTAWNALMIVSLSYAGRVF---------------- 423

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
           + KEY+  ++ A  FI  +L   +  RL   FR+G +    +L+DY+FL+  L++LYE  
Sbjct: 424 ENKEYINRSKKAYDFIENNLI-RKDGRLLARFRHGEAAYIAYLEDYSFLVWALMELYEAT 482

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
             + +L  A+   +   +LF D E  G+F++  +   ++L +K+ +D A PSGNSV+ +N
Sbjct: 483 FESNYLKQALNFTDKMIKLFWDEESYGFFHSGRDGEKLILNLKDSYDTAIPSGNSVAAMN 542

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVG 756
           L++L+ I   +      + A      F   +K+   +  +   +      PSR+ +V+  
Sbjct: 543 LIKLSKITGDNS---LGEKAYKMFQCFGGNIKESLQSHSIFLISYMNYIKPSRQ-IVIAS 598

Query: 757 HKSSVDFENMLAAAHASY 774
            K    F+ M+   +  +
Sbjct: 599 EKEDRLFKEMIKEVNKRF 616


>gi|407917811|gb|EKG11113.1| protein of unknown function DUF255 [Macrophomina phaseolina MS6]
          Length = 747

 Score =  490 bits (1262), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 279/695 (40%), Positives = 380/695 (54%), Gaps = 32/695 (4%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRL+   SPY+  H HNPV W  WG E    A+K +  +F+SIGY+ CHWCHVME ESF
Sbjct: 19  VNRLSESRSPYVRGHMHNPVAWQMWGPETIELAKKTNRLLFVSIGYAACHWCHVMERESF 78

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           E+  +A +LN  F+ +KVDREERPDVD++YM YVQA  G GGWPL+VF++PDL+P+ GGT
Sbjct: 79  ENPEIANILNKNFIPVKVDREERPDVDRIYMNYVQATTGSGGWPLNVFITPDLEPIFGGT 138

Query: 222 YFPPEDKY----GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL----SEALSASASS 273
           Y+P           P F  IL ++KD W  +R    +S      QL     E   +    
Sbjct: 139 YWPGPGSTTVLGDHPSFLEILERIKDVWQTQRQKCLESAKEVTAQLREFAQEGTISKGGE 198

Query: 274 NKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---YHSKKLEDTGK 330
             + D L    L       +  YD ++ GFG APKFP P  I  +L    + + +E    
Sbjct: 199 GAVGDGLDLELLEEAYTHFANKYDKQYAGFGKAPKFPTPTNISFLLRLAQYPEAVEHVVG 258

Query: 331 SGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYL 390
             E +  ++M + TL+ MA+GGIHD +G GF RYSV   W +PHFEKMLYDQ QL   YL
Sbjct: 259 DRECAHAKEMAVETLRRMARGGIHDQIGNGFARYSVTRDWSLPHFEKMLYDQSQLLTAYL 318

Query: 391 DAFSLTKDVFYSYICRDILDYL-RRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 449
           DA  +T D        DI  YL    +  P G  FS+EDADS        K+EGAFYVWT
Sbjct: 319 DAHIITNDSELLDAAHDIATYLTTHPLQSPDGGFFSSEDADSLYRPNDKEKREGAFYVWT 378

Query: 450 SKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 508
            KE + ILGE  A +   +Y ++  GN  +S   D H+E   +NVL   +   A A + G
Sbjct: 379 RKEFKSILGEKDAEVCARYYNVRENGN--VSPEHDAHDELINQNVLAISSTPDALAKEFG 436

Query: 509 MPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 567
           +  ++   IL   RR+L + R+K RPRP LDDK++V WNGL I + AR S  L++     
Sbjct: 437 LSKDEVTKILESGRRRLLEHRNKERPRPGLDDKIVVGWNGLAIGALARFSAYLQASGSKE 496

Query: 568 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 627
                    DR  Y+  AE A   I+  LY      L+  +R GP +AP F DDYAFLIS
Sbjct: 497 --------PDR--YISAAEKAVKLIKTKLYSAADGTLKRVYREGPGEAPAFADDYAFLIS 546

Query: 628 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 687
           GL+DLYE      +L +A +LQ TQ +LF D   G +F+T      ++LR+KE  D AEP
Sbjct: 547 GLIDLYEATFDDSYLEFADQLQRTQIKLFWDSTSGAFFSTAEGQADLILRLKEGMDNAEP 606

Query: 688 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 747
           S N +S  NL RL +++   + DY ++ A+ +   FE  L       P M      L + 
Sbjct: 607 STNGISASNLYRLGALL--EEPDYTKR-AKETCEAFEAELMQHPFLFPSMLNGIVALRL- 662

Query: 748 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVSK 782
             K +V+ G   +V  E  ++ A +  + N T+++
Sbjct: 663 GMKSIVVSGSGENV--EKAISKARSRVNTNTTIAR 695


>gi|194756922|ref|XP_001960719.1| GF13496 [Drosophila ananassae]
 gi|190622017|gb|EDV37541.1| GF13496 [Drosophila ananassae]
          Length = 797

 Score =  490 bits (1261), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 274/671 (40%), Positives = 374/671 (55%), Gaps = 64/671 (9%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K  NRL +  SPYLLQHA+NPVDW+ W +EAF +AR+ +  IFLS+GYSTCHWCHVME E
Sbjct: 63  KQGNRLVSSKSPYLLQHAYNPVDWYPWSDEAFEKARRENKLIFLSVGYSTCHWCHVMEHE 122

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFE    A ++N+ FV+IKVDREERPD+DKVYM ++    G GGWP+SV+L+PDL PL+ 
Sbjct: 123 SFESPETAAIMNEHFVNIKVDREERPDIDKVYMQFLLMSKGSGGWPMSVWLTPDLAPLVA 182

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           GTYFPP+ +YG P F T+L+ +   W   ++ L ++G+     L +AL  +  +  +P+ 
Sbjct: 183 GTYFPPKTRYGMPSFTTVLQNIAKKWQTDKESLIEAGS----TLVDALKRNQDAEAVPEA 238

Query: 280 L--PQNALRLCAEQLS---KSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEA 334
              P +A    +E ++   + +D   GGFGS PKFP    +  + +     +D       
Sbjct: 239 AFEPGSAEAKLSEAITVHKQRFDQTHGGFGSEPKFPEVPRLNFLFHGYLVTKDV------ 292

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
            +   MVL +L  + +GGI+DH+ GGF RY+    WH  HFEKMLYDQGQL   Y +A+ 
Sbjct: 293 -DVLDMVLQSLDHIGRGGINDHIFGGFARYATTRDWHNVHFEKMLYDQGQLMAAYANAYK 351

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
           LT+   +      I  YL +D+  P G  ++ EDADS  T   T K EGAFY WT +E++
Sbjct: 352 LTRSETFLGYADKIYKYLVKDLRHPLGGFYAGEDADSLPTHKDTVKVEGAFYAWTWEEIQ 411

Query: 455 DILGEHAILFKE------------HYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 502
                 A  F+             HY LKP GN  +   SDPH    GKN+LI      A
Sbjct: 412 SAFKNQAERFEGVSPERAFEIYSFHYGLKPQGN--VPTYSDPHGHLTGKNILIVKGSDEA 469

Query: 503 SASKLGM---PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 559
           + S   +   PLEK L+   +    L  +R +RPRPHLD K+I +WNGLV+S  ++ +  
Sbjct: 470 TCSNFNLEAEPLEKLLDTANDI---LHVLRDQRPRPHLDTKIICAWNGLVLSGLSKLANC 526

Query: 560 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS----------FR 609
             ++              R+EYM+ A+    F+R+ +YD +   L  S            
Sbjct: 527 GTAK--------------RQEYMQTAKELLEFLRKEMYDSERKLLLRSCYGVAVGDPRLE 572

Query: 610 NGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTG 669
              S+  GFLDDY+FLI GLLD Y+       L WA ELQ TQD+LF D   G YF +  
Sbjct: 573 KNESEIEGFLDDYSFLIKGLLDYYKASLDLSALNWAKELQETQDKLFWDERNGAYFFSQR 632

Query: 670 EDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKD 729
           + P+V++R+K+DHDGAEP GNSVS  NL  L+        D Y Q A   L  F   +  
Sbjct: 633 DSPNVIVRLKDDHDGAEPCGNSVSARNLTLLSHYY---DEDAYLQRAGKLLNFF-ADVSP 688

Query: 730 MAMAVPLMCCA 740
              A+P M  A
Sbjct: 689 FGHALPEMLSA 699


>gi|306811901|gb|ADN05998.1| YyaL-like conserved hypothetical protein [uncultured Myxococcales
           bacterium]
          Length = 800

 Score =  490 bits (1261), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 264/647 (40%), Positives = 377/647 (58%), Gaps = 50/647 (7%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           + TNRL  E SPYLLQHAHNPV+W+AW +EAFA A++ + PIFLS+GYSTCHWCHVME E
Sbjct: 88  RFTNRLIRESSPYLLQHAHNPVNWYAWSDEAFARAKRENKPIFLSVGYSTCHWCHVMERE 147

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFEDE +A  LN  F++IKVDREERPD+D VYM  V  L G GGWP++V ++PD +P  G
Sbjct: 148 SFEDEEIAAYLNRHFIAIKVDREERPDIDSVYMKAVTILTGRGGWPMTVIMTPDKEPFFG 207

Query: 220 GTYFPPEDKY--GRPGFKTILRKVKDAW-DKKRDMLAQSGAFAIEQLSEALSASASSNKL 276
           GTYFPP   +  GR G   IL  +   + ++  +++A++     ++LS+ +  +A+    
Sbjct: 208 GTYFPPRKGFRGGRAGLIDILADMLGLYRNEPTEVVARA-----QELSQRVEQAAAIKPG 262

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
           P       + + A+ L + +D   GGFG APKFP+P  + ++L ++++  D G +     
Sbjct: 263 PGVPSDKVIVVAAQNLGRMFDPVDGGFGGAPKFPQPSRLSLLLRYARRTRDKGATA---- 318

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
              MV  TL  MA GGI+D VGGGFHRYS D +W VPHFEKMLYD  QLA VYL+A+  T
Sbjct: 319 ---MVATTLDKMAAGGIYDQVGGGFHRYSTDAQWLVPHFEKMLYDNAQLAVVYLEAWQHT 375

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
            D  Y  + R+ILDY+ R+M  P G  +SA DADS    G    +EG F+ WT  E+E +
Sbjct: 376 GDSGYERVAREILDYVAREMTSPEGGFYSATDADSPTPSG--HDEEGWFFTWTPDELERL 433

Query: 457 LGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
           LG   A +F   + +   GN            F+G+N+L  +      AS+LG+  ++  
Sbjct: 434 LGAGDAAVFSSAFGVTKPGN------------FEGRNILHRVKSDQELASELGLAPKRVG 481

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
            ++   +  L+D R+ RP P  D+K+I +WNG++ ++FA+A  +L +EA           
Sbjct: 482 EMIRRAQSTLYDARASRPPPIRDEKIIAAWNGMMGAAFAKAGWML-AEA----------- 529

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
                Y+EVA  A  F+   +  +    L  ++R+G   +  FLDDYAF+++  LDLYE 
Sbjct: 530 ----RYVEVAARAVQFVLEQMRTKDGA-LVRTYRDGKKGSASFLDDYAFMVAASLDLYEA 584

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
                W+  A+ELQ  QD  +LD + GGY+ T  +   +L+R K  +D A PSGNSV+  
Sbjct: 585 TGDAAWIERAVELQTDQDLRYLDEQTGGYYLTAADGEVLLVREKPAYDRAVPSGNSVAAN 644

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 742
           NL+RL       K   +R+ AE   A    ++       PL+  A D
Sbjct: 645 NLLRLHDFNGDPK---WRRRAERLFASLAFQVTRSPTGFPLLLVALD 688


>gi|386002945|ref|YP_005921244.1| hypothetical protein Mhar_2269 [Methanosaeta harundinacea 6Ac]
 gi|357211001|gb|AET65621.1| hypothetical protein Mhar_2269 [Methanosaeta harundinacea 6Ac]
          Length = 698

 Score =  489 bits (1260), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 284/690 (41%), Positives = 378/690 (54%), Gaps = 61/690 (8%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K  NRLA E SPYLL+HA NPVDW+ WGEEAF  A + D P+FLSIGYSTCHWCHVM  E
Sbjct: 2   KKKNRLAFEKSPYLLEHAENPVDWYPWGEEAFTRAEREDKPVFLSIGYSTCHWCHVMAAE 61

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFEDE VA+LLN  FV IKVDREERPD+D VYM   Q + G GGWPL+VFL+PD KP   
Sbjct: 62  SFEDEEVARLLNATFVPIKVDREERPDLDAVYMAVAQMMTGSGGWPLTVFLTPDKKPFFA 121

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
            TY P E ++GR G   ++ ++   W  +R ML          LS A   +++  + P E
Sbjct: 122 ATYIPKESRFGRIGILDLIPRIGHLWKNERAML----------LSSAEEVASALRRPPPE 171

Query: 280 LP-----QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEA 334
           +P     +  ++   + L   +D+  GGFG APKFP P     +L H ++  D G     
Sbjct: 172 VPGLRLEEATIKAAYQGLVARFDAANGGFGGAPKFPSPTTFLFLLRHWRRTGDPG----- 226

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
             G +M   TL+ M +GGI DH+GGGFHRYS D  W +PHFEKMLYDQ  ++   L+A  
Sbjct: 227 --GVQMTEVTLRAMRRGGIFDHLGGGFHRYSTDLHWRLPHFEKMLYDQAMISLACLEAHQ 284

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
            T    Y+ I R++ DYL RD+  P G  +SAEDADS   EG    +EG FY+WT  EV 
Sbjct: 285 ATGKAEYATIAREVFDYLLRDLAAPEGGFYSAEDADS---EG----EEGRFYLWTLPEVR 337

Query: 455 DILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL---IELNDSSASASKLGMP 510
            +L  + A L    ++L+  GN       +      GKNVL   I L D    A ++G+P
Sbjct: 338 AVLDPDEAELAARIFHLQEEGNF----REEATGRLTGKNVLAMKIPLED---HAREMGIP 390

Query: 511 LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
           +      L   R KLF  R  R RP  DDK++  WNGL I++ AR +++L          
Sbjct: 391 VGDLREWLEAAREKLFAAREGRARPKKDDKILADWNGLAIAALARGAQVL---------- 440

Query: 571 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLL 630
               G  R E  E A+ AA  +   + DE+  RL H +R G +   G LDDYA ++ GLL
Sbjct: 441 ----GDRRLE--EAADRAADLVLHRMRDERG-RLLHRYRGGDAGILGNLDDYANMVWGLL 493

Query: 631 DLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGN 690
           +LYE G   + L  A+ L     E F DR+GGG+F T  +   +++R K+ HDGA P+GN
Sbjct: 494 ELYEAGFRPERLEAALALARDMVERFRDRDGGGFFFTPEDGEELIVRRKDGHDGALPAGN 553

Query: 691 SVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRK 750
           +V+  NL+RLA +    + +         L  F  + +    A   +  A D    PS  
Sbjct: 554 AVAAFNLLRLARMTGDPELEVI---GSEGLQAFAAQARGSPSAFLHLLSALDFALGPS-S 609

Query: 751 HVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
            VV+VG   S +   ML A  + +   K V
Sbjct: 610 EVVVVGEAGSPETAEMLKALRSRFLPRKVV 639


>gi|94985364|ref|YP_604728.1| hypothetical protein Dgeo_1263 [Deinococcus geothermalis DSM 11300]
 gi|94555645|gb|ABF45559.1| protein of unknown function DUF255 [Deinococcus geothermalis DSM
           11300]
          Length = 678

 Score =  489 bits (1260), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 259/594 (43%), Positives = 342/594 (57%), Gaps = 45/594 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYLLQHA NPVDW+ WGE AFAEAR+RDVP+ LSIGYSTCHWCHVM  ESFE
Sbjct: 2   NRLAQETSPYLLQHAENPVDWWPWGEAAFAEARRRDVPVLLSIGYSTCHWCHVMAHESFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D   A+ +N  FV+IKVDREERPDVD VYMT  Q + G GGWP++VFL+PD KP   GTY
Sbjct: 62  DPSTAEFMNKHFVNIKVDREERPDVDSVYMTATQLMTGQGGWPMTVFLTPDGKPFYAGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPPED+YG PGF+ +L  V  AW + RD L  +     + L+E +  ++   +   +LP 
Sbjct: 122 FPPEDRYGMPGFRRLLASVAQAWAQDRDKLTGNA----QTLTEHIREASRPRRGAGDLPT 177

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
           + LR   + L + YD+  GGFGSAPKFP P  +  +L                EG+ M L
Sbjct: 178 DFLRRGVDNLRRVYDADLGGFGSAPKFPAPTTLDFLLTQ-------------PEGRDMAL 224

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL+ M +GGI+D +GGGFHRYSVDERW VPHFEKMLYD  QL    L A+  T D  ++
Sbjct: 225 HTLRMMGRGGIYDQLGGGFHRYSVDERWLVPHFEKMLYDNAQLTRTLLRAWQFTGDPTFT 284

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            + R+ L YL R+M+ P G  FSA+DAD+   EG T       + WT +E+ ++LG    
Sbjct: 285 RLARETLAYLEREMLAPQGGFFSAQDADTQGVEGLT-------FTWTPQEIREVLGAGP- 336

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHN-EFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
                  L+  G  +    +DPH  E+  +NVL  L   +  A  LG   E     L   
Sbjct: 337 --DTDLVLRVYGVTEEGNFADPHRPEYGRRNVLHVLTPPAELARDLGESAEALSARLDAA 394

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           RRKL   R +RP+P  D KV+ SWNGL +++FA A +IL                    Y
Sbjct: 395 RRKLLTAREQRPQPGTDRKVLTSWNGLALAAFADAGRILGE----------------GHY 438

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
           +E+A   A F+R+HL       L+H++++G ++  G L+D+A    GL+ LY+ G     
Sbjct: 439 LEIARRNADFVRQHLRLPDGT-LRHTYKDGEARVEGLLEDHALYGLGLVALYQAGGDLAH 497

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
           L WA EL       F D E G + +T G   ++L R  +  D A  S N+ + +
Sbjct: 498 LAWARELWGIVRRDFWDGEAGLFRSTGGRAETLLTRQAQGFDAAVLSDNAAAAL 551


>gi|225181777|ref|ZP_03735215.1| protein of unknown function DUF255 [Dethiobacter alkaliphilus AHT
           1]
 gi|225167551|gb|EEG76364.1| protein of unknown function DUF255 [Dethiobacter alkaliphilus AHT
           1]
          Length = 697

 Score =  488 bits (1257), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 278/688 (40%), Positives = 386/688 (56%), Gaps = 48/688 (6%)

Query: 95  SHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCH 154
           +++ N+  NRL  E SPYLLQHA+NPVDW+ WG+EAF +A+  D PIFLS+GYSTCHWCH
Sbjct: 2   NNTENQKANRLIDEKSPYLLQHAYNPVDWYPWGDEAFEKAKNEDKPIFLSVGYSTCHWCH 61

Query: 155 VMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDL 214
           VME ESFEDE VA+ LN  FV IKVDREERPD+D +YM   QA+ G GGWPL++ +SPD 
Sbjct: 62  VMERESFEDEEVARELNRVFVCIKVDREERPDIDNIYMAVCQAMTGSGGWPLTIVMSPDK 121

Query: 215 KPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSN 274
           +P   GTYFP +  +GR G   + ++++  W   RD +  +        S   S  A S 
Sbjct: 122 RPFFAGTYFPKKTSFGRMGVIDLAQRIEMLWKTSRDKINSTAD------SVMTSLQAMSK 175

Query: 275 KLPDELP-QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGE 333
             P +LP + AL+    +L   +D   GGFG APKFP P  +  +L + K      +SG 
Sbjct: 176 VTPGDLPGEEALQGGFAKLEGRFDPDHGGFGYAPKFPSPHNLTFLLRYWK------RSGN 229

Query: 334 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
           A +  +MV  TL  MA+GG++DH+G GFHRYS D  W +PHFEKMLYDQ  LA  YL+A+
Sbjct: 230 A-KALEMVEKTLLAMARGGVYDHIGFGFHRYSTDREWLLPHFEKMLYDQALLAVTYLEAY 288

Query: 394 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
             T    Y+   R+I  Y+ RDM  P G  +SAEDADS   EG    +EG FYVW + E+
Sbjct: 289 QATGKEVYAQTAREIFGYVLRDMTSPQGGFYSAEDADS---EG----EEGKFYVWETNEI 341

Query: 454 EDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 512
             ILGE  A +F   Y ++  GN       +   +  G N+          A +L +   
Sbjct: 342 VHILGEADAAIFNAAYNIREDGNF----TDETTGKKTGANIPHLRKTYQELAQELSLEPN 397

Query: 513 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 572
           +  + L   R+KLF VR KR  PH DDK++  WNGL+I++ A   +IL  E         
Sbjct: 398 ELKDRLEAMRQKLFAVRKKRIHPHKDDKILTDWNGLMIAALAMGGRILNDE--------- 448

Query: 573 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 632
                   Y + A+ AA FI  HL  ++  RL   FR   +  P  LDDYAF + GL++L
Sbjct: 449 -------NYNKSAKKAAGFILSHL--KKDGRLLKRFREDEASLPAHLDDYAFFVWGLIEL 499

Query: 633 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 692
           YE    T +L  A+ L  T  + F D + G ++ T  +   VL+R +E +DGA PSGNSV
Sbjct: 500 YETTFDTDFLKEALSLNKTMIKHFWDHDNGSFYFTADDAEDVLVRHRELYDGAVPSGNSV 559

Query: 693 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHV 752
           + +N +RL  I   ++ +   Q AE     F   ++ +      M  A + ++ PS + +
Sbjct: 560 AAMNNLRLGRITGNTELE---QIAEKIARAFTDEIEKVPQGYTQMLSAINFMAGPSLE-I 615

Query: 753 VLVGHKSSVDFENMLAAAHASYDLNKTV 780
           V+ G   + D ++ML    +++  NK V
Sbjct: 616 VIAGEAQAQDTKDMLQKLCSTFVPNKVV 643


>gi|198457071|ref|XP_001360541.2| GA21208 [Drosophila pseudoobscura pseudoobscura]
 gi|198135846|gb|EAL25116.2| GA21208 [Drosophila pseudoobscura pseudoobscura]
          Length = 803

 Score =  488 bits (1256), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 263/624 (42%), Positives = 353/624 (56%), Gaps = 50/624 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL +  SPYLLQHA+NPVDW+ WGEEAF  AR  +  IFLS+GYSTCHWCHVME ESFE
Sbjct: 72  NRLVSSKSPYLLQHAYNPVDWYPWGEEAFERARTENKLIFLSVGYSTCHWCHVMEHESFE 131

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +   A ++N+ FV+IKVDREERPD+DK+YMT++Q   GGGGWP+S++L+PDL P+  GTY
Sbjct: 132 NLETAAVMNEHFVNIKVDREERPDIDKIYMTFLQMTKGGGGWPMSIWLTPDLAPITAGTY 191

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPP  +YG P FKT+L  +   W   R  L +SG+  +  L +   ASA +    +  P 
Sbjct: 192 FPPTGRYGMPSFKTVLLAIAQQWQTNRQTLIESGSSILNALKQNEDASAVAEAAFE--PG 249

Query: 283 NALRLCAEQLS---KSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
           +A    AE +    + +D   GGFG+ PKFP    +  + +     +D            
Sbjct: 250 SASAKLAEAIGVHKRRFDRTNGGFGTEPKFPEVPRLNFLFHAYLVSKDVSV-------LD 302

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           +VL TL  + +GGI+DH+ GGF RY+    WH  HFEKMLYDQGQL   Y +A+ LT+  
Sbjct: 303 LVLQTLDHIGRGGINDHIFGGFARYATTADWHNVHFEKMLYDQGQLMAAYSNAYKLTRSA 362

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE----- 454
            +      I  Y+ +D+  P G  ++ EDADS      T K EGAFY WT  E+E     
Sbjct: 363 TFLTYADKIYKYIMKDLRHPLGGFYAGEDADSLPDHKDTVKVEGAFYAWTWNEIEAAFKD 422

Query: 455 ------DILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 507
                 D+L + A  ++  HY LKP GN  +   SDPH    GKN+LI       + S  
Sbjct: 423 QAKRFDDVLPKRAFEIYAFHYGLKPKGN--VPTHSDPHGHLTGKNILIVRGSDEETCSNF 480

Query: 508 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 567
            +  EK   +L      L  +R +RPRPHLD K+I +WNGL++S  ++ +     +    
Sbjct: 481 DLQPEKLDKLLETANDILHVLRDQRPRPHLDTKIICAWNGLMLSGLSKLANCGTVK---- 536

Query: 568 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS----------FRNGPSKAPG 617
                     R+EY++ A+    F+R+ +YD +   L  S               S+  G
Sbjct: 537 ----------REEYIKAAKELVDFLRKEMYDPEQKLLVRSCYGVAVGDPTLEKNESQIDG 586

Query: 618 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 677
           FLDDYAFLI GLLD Y+       L WA ELQ TQD+LF D + G YF +    P+V++R
Sbjct: 587 FLDDYAFLIKGLLDYYKASLDLSALRWAKELQETQDKLFWDEQNGAYFFSQQNAPNVIVR 646

Query: 678 VKEDHDGAEPSGNSVSVINLVRLA 701
           +KE  DGAEP GNSVS  NL  L+
Sbjct: 647 LKEGDDGAEPCGNSVSARNLTLLS 670


>gi|167629725|ref|YP_001680224.1| thioredoxin [Heliobacterium modesticaldum Ice1]
 gi|167592465|gb|ABZ84213.1| conserved hypothetical protein containing a thioredoxin domain
           [Heliobacterium modesticaldum Ice1]
          Length = 687

 Score =  488 bits (1255), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 287/677 (42%), Positives = 379/677 (55%), Gaps = 57/677 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           ++  NRL  E SPYLLQHA+NPV+W+ WGEEAF  A+++D P+FLS+GYSTCHWCHVME 
Sbjct: 6   SRKPNRLIQEKSPYLLQHAYNPVEWYPWGEEAFTRAKEQDKPVFLSVGYSTCHWCHVMER 65

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE VA  LN+ F+S+KVDREERPDVD +YMT  QA+ G GGWPL+V ++PD KP  
Sbjct: 66  ESFEDEEVAAYLNEHFISVKVDREERPDVDHIYMTVCQAITGHGGWPLTVIMTPDKKPFF 125

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFP   + G  G   IL  V D W   R  L  +G    + L   + A+ S+  L D
Sbjct: 126 AGTYFPKRSRQGLAGLLDILEAVVDQWKNDRGKLVAAGDRVTQHLQREVQAN-SAGSLDD 184

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
               + LR  A  L K +D  +GGFG APKFP P  +  +L   K +        A E  
Sbjct: 185 ---ASILRGYA-WLQKRFDDVYGGFGHAPKFPTPHNLLFLLRCDKLI-------NAKEAL 233

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
            MV  TL+ M  GGI+DH+G GF RYS DE+W VPHFEKMLYD  QLA  YL+A+ +T  
Sbjct: 234 PMVEKTLRQMHAGGIYDHLGYGFSRYSTDEKWLVPHFEKMLYDNAQLAMAYLEAYQVTAK 293

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             Y+ + R+I  Y+ RDM  P G  +SAEDADS   EG     EG FY+WT +EV++ILG
Sbjct: 294 DEYAEVAREIFSYVLRDMHAPEGGFYSAEDADS---EGV----EGKFYLWTPQEVKEILG 346

Query: 459 EH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
           E    LF + Y +   GN            F+G+N+   LN   A       P+  +  I
Sbjct: 347 EETGKLFCQWYDITEKGN------------FEGQNI---LNRIDADRRPFTPPM-GWHQI 390

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
           L +   KLF  R KR  P  D+K++ +WNGL+I++ A   +IL                 
Sbjct: 391 LTDAEEKLFVAREKRVHPLKDEKILTAWNGLMIAALAMGFRILYD--------------- 435

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGS 637
            + Y++ A  AA FI   L D++  RL   +R+G +   G++DDYAF+I  L++LY+  +
Sbjct: 436 -RSYLDAAIGAADFIWEKLRDDKG-RLLARYRDGEAAYKGYIDDYAFMIWALIELYQADT 493

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
              WL  A+ LQ  Q+ LF D + GGYF    +   +L R KE +DGA PSGNSVS +NL
Sbjct: 494 NPLWLKRALTLQEDQNRLFWDPDQGGYFFYGSDSEELLTRPKEIYDGATPSGNSVSALNL 553

Query: 698 VRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGH 757
           +RLA I    ++ Y RQ AE  L  F   +            A      P  K VV+V  
Sbjct: 554 LRLARITG--RNAYARQ-AETLLESFSGNINAQPAGHTFALMALLFARRPG-KEVVVVAD 609

Query: 758 KSSVDFENMLAAAHASY 774
           +    F   L   H+ +
Sbjct: 610 RKRETFRQELERLHSPF 626


>gi|195150279|ref|XP_002016082.1| GL10685 [Drosophila persimilis]
 gi|194109929|gb|EDW31972.1| GL10685 [Drosophila persimilis]
          Length = 803

 Score =  487 bits (1254), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 265/624 (42%), Positives = 354/624 (56%), Gaps = 50/624 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL +  SPYLLQHA+NPVDW+ WGEEAF  AR  +  IFLS+GYSTCHWCHVME ESFE
Sbjct: 72  NRLVSSKSPYLLQHAYNPVDWYPWGEEAFERARTENKLIFLSVGYSTCHWCHVMEHESFE 131

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +   A ++N+ FV+IKVDREERPD+DK+YMT++Q   GGGGWP+S++L+PDL P+  GTY
Sbjct: 132 NLETAAVMNEHFVNIKVDREERPDIDKIYMTFLQMTKGGGGWPMSIWLTPDLAPITAGTY 191

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPP  +YG P FKT+L  +   W   R  L +SG+  +  L +   ASA +    +  P 
Sbjct: 192 FPPTGRYGMPSFKTVLLAIAQQWQTNRQTLIESGSSILNALKKNEDASAVAEAAFE--PG 249

Query: 283 NALRLCAEQLS---KSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
           +A    AE +    + +D   GGFG+ PKFP    +  + +     +D            
Sbjct: 250 SASAKLAEAIGVHKRRFDRTNGGFGTEPKFPEVPRLNFLFHAYLVSKDVSV-------LD 302

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           +VL TL  + +GGI+DH+ GGF RY+    WH  HFEKMLYDQGQL   Y +A+ LT+  
Sbjct: 303 LVLQTLDHIGRGGINDHIFGGFARYATTADWHNVHFEKMLYDQGQLMAAYSNAYKLTRSA 362

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE----- 454
            +      I  Y+ +D+  P G  ++ EDADS      T K EGAFY WT  E+E     
Sbjct: 363 TFLTYADKIYKYIMKDLRHPLGGFYAGEDADSLPDHKDTVKVEGAFYAWTWNEIEAAFKD 422

Query: 455 ------DILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 507
                 D+L + A  ++  HY LKP GN  +   SDPH    GKN+LI       + S  
Sbjct: 423 QAKRFDDVLPKRAFEIYAFHYGLKPKGN--VPTHSDPHGHLTGKNILIVRGSDEETCSNF 480

Query: 508 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 567
            +  EK   +L      L  +R +RPRPHLD K+I +WNGL++S  ++            
Sbjct: 481 DLQPEKLDKLLETANDILHVLRDQRPRPHLDTKIICAWNGLMLSGLSK------------ 528

Query: 568 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS----------FRNGPSKAPG 617
           + N   V   R+EY++ A+    F+R+ +YD +   L  S               S+  G
Sbjct: 529 LANCGTV--KREEYIKAAKELVDFLRKEMYDPEQKLLVRSCYGVAVGDPTLEKNESQIDG 586

Query: 618 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 677
           FLDDYAFLI GLLD Y+       L WA ELQ TQD+LF D + G YF +    P+V++R
Sbjct: 587 FLDDYAFLIKGLLDYYKASLDLSALRWAKELQETQDKLFWDEQNGAYFFSQQNAPNVIVR 646

Query: 678 VKEDHDGAEPSGNSVSVINLVRLA 701
           +KE  DGAEP GNSVS  NL  L+
Sbjct: 647 LKEGDDGAEPCGNSVSARNLTLLS 670


>gi|220931972|ref|YP_002508880.1| putative glutamate--cysteine ligase/putative amino acid ligase
           [Halothermothrix orenii H 168]
 gi|219993282|gb|ACL69885.1| putative glutamate--cysteine ligase/putative amino acid ligase
           [Halothermothrix orenii H 168]
          Length = 691

 Score =  486 bits (1252), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 273/683 (39%), Positives = 382/683 (55%), Gaps = 59/683 (8%)

Query: 96  HSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHV 155
           ++++K+TNRL  E SPYLLQHAHNPVDW+ WG +AF +A+  D PIFLSIGYSTCHWCHV
Sbjct: 4   YTKSKYTNRLINEKSPYLLQHAHNPVDWYPWGNDAFMKAKSEDKPIFLSIGYSTCHWCHV 63

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           ME ESF+DE VA+LLN+ F+SIKVDREERPD+D VYM   QAL G GGWPL++ L+PD K
Sbjct: 64  MERESFKDEEVARLLNENFISIKVDREERPDIDAVYMNVCQALTGSGGWPLTILLTPDKK 123

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           P  GGTY P   + GR G   +L +V + W K  + + ++       +  +++  +    
Sbjct: 124 PFFGGTYIPKNSRGGRMGLIDLLSRVTELWSKNNEKIIKNADKITSSIQRSMTDDSYKGH 183

Query: 276 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
               L +N L    + L   +D  +GGFG+APKFP P ++  +L++  +           
Sbjct: 184 KETSLGKNTLEKAFDDLKVVFDVEYGGFGTAPKFPIPHQLIFLLHYWYR----------- 232

Query: 336 EGQKMVLF----TLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLD 391
            G  M L+    TL  M  GGI DH+G GFHRYS D +W +PHFEKMLYDQ  L   Y +
Sbjct: 233 TGNDMALYMVEKTLTAMRCGGIFDHIGYGFHRYSTDRKWILPHFEKMLYDQALLTYSYSE 292

Query: 392 AFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSK 451
           A+  T++  +    ++I+DY+RR++    G  +SA+D   AE+EG     EG +Y W+ K
Sbjct: 293 AYLATENKKFLTTIKEIIDYVRRELKSDRGGFYSAQD---AESEGV----EGKYYTWSVK 345

Query: 452 EVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPL 511
           E+E+ILG+ A  F E Y LK  GN     + +   +  GKNVL   N             
Sbjct: 346 EIENILGKQADRFIETYSLKSDGNF----IDEATGKKTGKNVLYLRNYKEEVEELK---- 397

Query: 512 EKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNF 571
                   + R KLF VR +R  P  DDK++  WNGL+I+  ARA +             
Sbjct: 398 --------KEREKLFKVRQRRRPPFKDDKILTDWNGLMIAGLARAGQ------------- 436

Query: 572 PVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLD 631
               +   EY+ +A  AA FI  +LY    +RL H FR G     G L+DYAF I GLL+
Sbjct: 437 ---ATGEIEYITMAREAADFIINNLYSSD-NRLYHRFRKGEVSIKGNLNDYAFFIWGLLE 492

Query: 632 LYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNS 691
           LY+     K+L  A++L + Q   F D + GG++ T  ++  +L+R KE +DGA PSGNS
Sbjct: 493 LYQDTFEVKYLKKALKLIDQQLNYFWDNKNGGFYFTPDDEEEILVRQKEIYDGATPSGNS 552

Query: 692 VSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKH 751
           VS+ NL R+  +   S    Y + AE+ L VF  ++K+   +  +     + L  P    
Sbjct: 553 VSIWNLYRIGHLTGNSD---YEEIAENILRVFSDKIKNDPASYSMALIGLNSLLGPGYD- 608

Query: 752 VVLVGHKSSVDFENMLAAAHASY 774
           VV+VG K+      +L +    Y
Sbjct: 609 VVVVGDKNKAKTHKILYSLKNEY 631


>gi|384917096|ref|ZP_10017228.1| conserved hypothetical protein [Methylacidiphilum fumariolicum
           SolV]
 gi|384525484|emb|CCG93101.1| conserved hypothetical protein [Methylacidiphilum fumariolicum
           SolV]
          Length = 727

 Score =  486 bits (1252), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 267/664 (40%), Positives = 376/664 (56%), Gaps = 33/664 (4%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L+ E SPYLLQHAHNPV W  W E    +A++ + PIFLS+GYSTCHWCHVM  ESFE
Sbjct: 2   NTLSKEKSPYLLQHAHNPVQWQPWTEATIQKAKELNRPIFLSVGYSTCHWCHVMAEESFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +  VA+LLN +++ +KVDREERPD+D+ YM +VQA  G GGWP+SV+L+PDL+P  GGTY
Sbjct: 62  NPTVAELLNAFYIPVKVDREERPDIDQFYMEFVQAFCGQGGWPMSVWLTPDLEPFFGGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP E K+GRPGF  +L+K+ + W   R  L Q G   + ++ E++  S      P+ L Q
Sbjct: 122 FPLESKWGRPGFIDLLKKIANLWQSHRSALQQQGQEILNKMRESILCSIEIESQPN-LTQ 180

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
            A R   EQL  ++D  +GGF   PKFPRP  +   L+ +   ++     + ++  KM L
Sbjct: 181 IA-RKTVEQLWGNFDRVYGGFSPPPKFPRP-NLFFFLFRAGSFKELPDPLQ-NKAMKMAL 237

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
           FTLQ M+ GGIHD + GGFHRYSVD +W +PHFEKMLYDQ  L + YL+AF +T D  + 
Sbjct: 238 FTLQKMSCGGIHDILEGGFHRYSVDAQWRLPHFEKMLYDQAHLGSAYLEAFQMTSDFLFK 297

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
                + +YL   +  P G  +SAEDADS  + G   K EGA+Y+WT +E+E IL E  +
Sbjct: 298 ETATALFEYLFSHLYNPAGGFYSAEDADSLNSSG--EKAEGAYYLWTMEELEKILEE--V 353

Query: 463 LFKEH-----YYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
           + KE       +   T   +L+         + KN+L      SA A +L MP+E+  ++
Sbjct: 354 VGKERSKVLASFFGATNQGNLAEGLGTEPSMRLKNMLFFSKPLSALAEELKMPIEETKDL 413

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
           L + +  L + R KRP+P LDDK+I +WNG  IS+ A+A  +L                 
Sbjct: 414 LLKAKTALKEARLKRPKPFLDDKIITAWNGYAISALAKAYMVLAD--------------- 458

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGS 637
              Y+  A+  A FI  HL+D  +  L   +RNG    PGF  DYA L + LLDL+E   
Sbjct: 459 -SRYLNEAKKTADFILEHLWDADSKILYRIYRNGRGSIPGFASDYASLAASLLDLFEADQ 517

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
             KWL+ A   Q   +E F D     Y +   E  + +++ +E++DGAEP+  S+S   L
Sbjct: 518 DEKWLLQAKMFQELLEEKFADPYRHQYLSRAVETAATIIQTREEYDGAEPATLSLSAYAL 577

Query: 698 VRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGH 757
            +L SI    K   +++  E         L+    A+P         SVP  + +++VG 
Sbjct: 578 WKLFSITGEEK---WKKRLEELFNSAWPILERFPTALPYFLGVYLEYSVPPIE-IIIVGE 633

Query: 758 KSSV 761
           K  +
Sbjct: 634 KDDL 637


>gi|134300686|ref|YP_001114182.1| hypothetical protein Dred_2853 [Desulfotomaculum reducens MI-1]
 gi|134053386|gb|ABO51357.1| protein of unknown function DUF255 [Desulfotomaculum reducens MI-1]
          Length = 690

 Score =  486 bits (1251), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 273/684 (39%), Positives = 389/684 (56%), Gaps = 57/684 (8%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           + +NRL  E SPYLLQHAHNPVDW+ WG EAF  A++ D PIFLSIGYSTCHWCHVME E
Sbjct: 6   QKSNRLINEKSPYLLQHAHNPVDWYPWGNEAFDMAKRVDKPIFLSIGYSTCHWCHVMERE 65

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFE E VAK+LN+ FVSIKVDREERPD+D++YM   Q+L G GGWPL++ ++PD KP   
Sbjct: 66  SFESEEVAKILNEHFVSIKVDREERPDIDQIYMNVCQSLTGSGGWPLTIMMTPDQKPFFA 125

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           GTYFP + +YGRPG   IL  V   W  +R  L + G    ++L   + + AS+   P +
Sbjct: 126 GTYFPKQAQYGRPGITEILENVASLWKNERQHLLEVG----DKLVSHMQSEAST--APGQ 179

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
           LP + L       +++YD+ +GGFG+APKFP P  +  +L +        K+GEA +   
Sbjct: 180 LPADILDKAYHIFAQNYDATYGGFGTAPKFPTPHNLMFLLRYWH------KTGEA-KALS 232

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           MV  TL  M +GGI+DH+G GF RYS D++W VPHFEKMLYD   LA  + + + +T + 
Sbjct: 233 MVEETLDAMHRGGIYDHIGFGFSRYSTDKKWLVPHFEKMLYDNALLALAFTETYQITGNP 292

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 459
            +  + ++I  Y+ RDM  P G  +SAEDADS   EG     EG FYVW  +EV  +LG+
Sbjct: 293 RFGRVAKEIFTYILRDMTSPEGGFYSAEDADS---EGV----EGKFYVWRPEEVISLLGQ 345

Query: 460 -HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGMPLEKYLN 516
               L+ ++Y +  TGN            F+G+++  LI   D    +  L + L   + 
Sbjct: 346 VDGELYCQYYDITSTGN------------FEGESIPNLIG-QDPFKFSQDLEITLGDLVE 392

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
            L  CR+ LF+ R+KR  P+ DDK++ +WNGL+I++ AR +++ +S              
Sbjct: 393 GLEACRKTLFEERAKRIHPYKDDKILTAWNGLMIAALARGAQVFQS-------------- 438

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
             K Y+E A +A  FI   L      RL   +R   +  P +LDDYAF+I GLL+LY+  
Sbjct: 439 --KRYLEAASNAMGFIFDRL-QRNDGRLLARYREYEAAYPAYLDDYAFVIWGLLELYQAT 495

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
              + L  A+ L +   +LF D + GG++    +   ++ R K+ +DGA PSGNSV+ +N
Sbjct: 496 FEPRHLQNAVYLTDDMIDLFYDDKQGGFYFYGKDSEQLISRPKDIYDGAIPSGNSVATVN 555

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVG 756
           L +LA +   S+   Y + A   L VF   L             A +   P  + +V+ G
Sbjct: 556 LFKLARLTGNSR---YEELANQQLQVFADELARYPAGYSFFMMGAYLQQEPPME-IVIAG 611

Query: 757 HKSSVDFENMLAAAHASYDLNKTV 780
            K     + M+     ++  N +V
Sbjct: 612 TKEDPSLQQMINTLRQNFLPNASV 635


>gi|302392081|ref|YP_003827901.1| hypothetical protein [Acetohalobium arabaticum DSM 5501]
 gi|302204158|gb|ADL12836.1| protein of unknown function DUF255 [Acetohalobium arabaticum DSM
           5501]
          Length = 686

 Score =  486 bits (1251), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 273/679 (40%), Positives = 385/679 (56%), Gaps = 68/679 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYLLQHA+NPVDW++W +EAF +A+  D P+FLSIGYSTCHWCHVME ESFE
Sbjct: 10  NRLIEEQSPYLLQHAYNPVDWYSWSDEAFKKAKTEDKPVFLSIGYSTCHWCHVMERESFE 69

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA++LN  FV+IKVDREERPD+D +YMT  Q L G GGWPL+V ++P+ KP   GTY
Sbjct: 70  DEEVAEILNRSFVAIKVDREERPDIDNIYMTVCQTLTGRGGWPLTVIMTPEKKPFFAGTY 129

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP E   G+PG   IL +V+ AW KKR  L ++     E++  AL     ++K      +
Sbjct: 130 FPKEAGRGQPGLMDILIRVEQAWKKKRQPLLETS----EEILSALERVNDTDKNDSASME 185

Query: 283 NALRLCAE---QLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
               L  E       ++D  +GGFG+APKFP P  +  +L + K       +GE  +  +
Sbjct: 186 EMSGLAKEAFISFVANFDEDYGGFGTAPKFPTPHNLMFLLRYWK------STGE-EKALE 238

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           MV  TL  M +GG++DH+G GF RYS DE+W VPHFEKMLYD   LA  YL+A+ +T   
Sbjct: 239 MVETTLDNMYRGGMYDHLGYGFARYSTDEKWLVPHFEKMLYDNALLAVTYLEAYQITDKE 298

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 459
            Y+ I R+I  Y+ RD+  P G  +SAEDADS        ++EG FYVWT  E++ ILG 
Sbjct: 299 DYADIAREIFTYVLRDLTSPEGGFYSAEDADS-------EREEGKFYVWTPNEIKKILGN 351

Query: 460 HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LI--ELNDSSASASKLGMPLEKYL 515
                 E +       C +  ++D  N F+GK++  LI  EL+ S               
Sbjct: 352 KQ---GEEF-------CQVYNITDEGN-FEGKSIPNLIGTELDKSEVDKK---------- 390

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
                 R++LF  R KR  PH DDK++ SWNGL+I++ A  +++L  E            
Sbjct: 391 --FAAERKELFKAREKRVHPHKDDKILTSWNGLMIAALAIGARVLNDE------------ 436

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
                Y + A+ AA FI ++L  +   RL   +RNG +   G++DDYAF I GL++LYE 
Sbjct: 437 ----RYQQAAKEAAEFIWQNLRRDGNGRLLARYRNGEADYYGYVDDYAFFIWGLIELYET 492

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
              T++L  A EL N   E F D+E GG +    +   +L R KE +DGA PSGNSV+ +
Sbjct: 493 TFETEYLEKAAELNNDLIEYFWDKEQGGLYFYGYDSEELLTRPKEIYDGAIPSGNSVATL 552

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 755
           NL+RLA ++  ++ +   + A      F +R+ +  +A      +  + +    + +V+ 
Sbjct: 553 NLLRLAKLIGDTELE---EKARQQFEYFGSRITNKPIASSYFLLSW-LFAQNGGREIVIA 608

Query: 756 GHKSSVDFENMLAAAHASY 774
           G++     E M+   H  +
Sbjct: 609 GNREETVTEEMVQVLHQEF 627


>gi|392375956|ref|YP_003207789.1| hypothetical protein DAMO_2917 [Candidatus Methylomirabilis
           oxyfera]
 gi|258593649|emb|CBE69990.1| conserved protein of unknown function [Candidatus Methylomirabilis
           oxyfera]
          Length = 1103

 Score =  486 bits (1250), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 266/669 (39%), Positives = 381/669 (56%), Gaps = 54/669 (8%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           +HTNRL  E SPYLLQHAHNPVDW+ WGEEA   AR+ + PI LSIGYS CHWCHVM  E
Sbjct: 15  RHTNRLIHETSPYLLQHAHNPVDWYPWGEEALRRAREENRPILLSIGYSACHWCHVMAHE 74

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQAL-YGGGGWPLSVFLSPDLKPLM 218
           SFE E +A+L+N +FV IKVDREERPD+D +YM    AL +G GGWP++VFL+PDL+P  
Sbjct: 75  SFESEQIAELMNRYFVCIKVDREERPDLDAIYMAATLALNHGQGGWPMTVFLTPDLQPFF 134

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFPP D  GRPGF TIL +V   W ++ D L        ++++E L  S S   LP 
Sbjct: 135 AGTYFPPRDGLGRPGFPTILNRVAQVWREQPDALRTQS----DKITEGLRES-SRPSLPM 189

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
            + +  +       + ++D  FGGFG+APKFP    + ++L H +   D       +   
Sbjct: 190 PVGRAEIAAAVAHFAATFDPTFGGFGAAPKFPAATALSLLLRHHQHTGD-------AHAL 242

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
           +MV  TL  MA+GGI+D +GGGF RYS DERW +PHFEKMLYD   LA  YL+AF +  D
Sbjct: 243 QMVRTTLDAMARGGIYDQIGGGFARYSTDERWLIPHFEKMLYDNALLARTYLEAFQVAGD 302

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             Y  I  ++LDY+ R+M    G  +SA DADS   EG     EG FYVWT  E+E ILG
Sbjct: 303 PSYRQIATELLDYILREMTALEGGFYSATDADS---EGV----EGKFYVWTPAEIEAILG 355

Query: 459 -EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
            E A  F  +Y + PTGN            ++G+++      ++  A+KLG+ +E+    
Sbjct: 356 QEEARRFCAYYDITPTGN------------WEGRSIPNIRRTAAQVAAKLGVSVEELAAS 403

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
           +   + K+++ R KR  P LDDK++ +WNGL++S+ A   ++L                 
Sbjct: 404 IDRTQPKVYEARRKRVPPGLDDKILTAWNGLMVSAMAEGYRVLGE--------------- 448

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGS 637
            + +++ A  AA F+   L      RL  ++R+G +    +L+DYA L  GL+DLYE G 
Sbjct: 449 -RRHLDAAVRAADFLLSTLL-RPDGRLLRTYRSGVAHLNAYLEDYACLCEGLIDLYEAGG 506

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
            T++L  A+ L       F D E G +  T+ +  +++LR +E  DGA PSGN+V+   L
Sbjct: 507 ETRYLREAVRLAERMPGDFADEESGAFHTTSRDHETLILRYREGTDGATPSGNAVAASAL 566

Query: 698 VRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGH 757
            RL+  +     + +R+ AE +++ +  ++     A        D+L +     + L+G+
Sbjct: 567 TRLSFHL---NREEWRRAAEQAISAYGQQIARYPHAFAKSLAVVDLL-LEGPVELCLIGN 622

Query: 758 KSSVDFENM 766
            +    E +
Sbjct: 623 PAEAGCEAL 631


>gi|374994065|ref|YP_004969564.1| thioredoxin domain-containing protein [Desulfosporosinus orientis
           DSM 765]
 gi|357212431|gb|AET67049.1| thioredoxin domain-containing protein [Desulfosporosinus orientis
           DSM 765]
          Length = 702

 Score =  485 bits (1248), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 278/686 (40%), Positives = 389/686 (56%), Gaps = 71/686 (10%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           +K TNRL  E SPYLLQHA+NPV+W+ WGEEAF  +++ + PIFLSIGYSTCHWCHVME 
Sbjct: 5   SKPTNRLINEKSPYLLQHAYNPVNWYPWGEEAFTLSKRENKPIFLSIGYSTCHWCHVMER 64

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE VA LLN WF+SIKVDREERPDVD +YM + QAL G GGWPL++ ++P+ KP  
Sbjct: 65  ESFEDEAVAALLNRWFISIKVDREERPDVDHMYMAFCQALTGSGGWPLTIIMTPEKKPFF 124

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDML----------AQSGAFAIEQLSEALS 268
            GTYFP  + +G  G   +L +V   W    + L           QSG    ++ S  + 
Sbjct: 125 AGTYFPKTEHHGYHGLMELLEQVGTLWRTSENKLRESADQIVAAVQSGLALPKKASTPID 184

Query: 269 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 328
            S +++       ++ +      L +++D R+GGFG APKFP P  +  +L ++      
Sbjct: 185 NSQNTSDSNKAWEKDVIDKAYAALEQNFDPRYGGFGRAPKFPSPHTLTFLLRYA------ 238

Query: 329 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 388
            ++   S    MV  TL  MA+GG++DH+G GF RYS DE+W +PHFEKMLYD   LA  
Sbjct: 239 -ENHPQSNALAMVRKTLNGMARGGMYDHIGFGFARYSTDEKWLIPHFEKMLYDNALLALA 297

Query: 389 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 448
           YL++F +T    ++ + +DI  Y+ RDM  P G  +SAEDAD+ +       +EG F+VW
Sbjct: 298 YLESFQVTHSPEHAKVAQDIFTYVLRDMTSPEGGFYSAEDADAED-------QEGKFHVW 350

Query: 449 TSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELN----DSS 501
           T +EVE +L  E A  +   Y +   GN            F+GK++  L++ N    D  
Sbjct: 351 TPQEVEAVLDMETAQKYCSVYDISAKGN------------FEGKSIPNLLQGNIHKLDQE 398

Query: 502 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 561
           +S +++ +     +  L   R+ LF  R KR  PH DDK++ SWNGL+I++ A+ +++L 
Sbjct: 399 SSLAEVDV-----IKSLESARQALFSAREKRIHPHKDDKILTSWNGLMIAALAKGAQVLG 453

Query: 562 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 621
           +                K Y+E  E AA FI  HL      RL   +R G S   G+LDD
Sbjct: 454 N----------------KTYLEAGEKAADFILTHL-RRVDGRLLARYREGDSAILGYLDD 496

Query: 622 YAFLISGLLDLYEFGSGTK-WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 680
           Y+F I GLL+LY F SG   +L  A+ LQ  QD LF D + GGYF T  +   +L R KE
Sbjct: 497 YSFFIWGLLELY-FASGKPLFLQTALLLQEEQDRLFFDTQRGGYFLTGSDGEKLLFRPKE 555

Query: 681 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 740
            +DGA PSGNS++ +NL+R   +  GSK  Y+++ AE  L  F T L+           A
Sbjct: 556 SYDGAIPSGNSITTLNLLRFGQLT-GSK--YWKEKAEQQLLDFRTVLEAHPSGYTAFLQA 612

Query: 741 ADMLSVPSRKHVVLVGHKSSVDFENM 766
                 P+++ ++L G   S +   M
Sbjct: 613 LQFALHPTQE-LILAGSLDSEELSMM 637


>gi|347753644|ref|YP_004861209.1| hypothetical protein Bcoa_3257 [Bacillus coagulans 36D1]
 gi|347586162|gb|AEP02429.1| hypothetical protein Bcoa_3257 [Bacillus coagulans 36D1]
          Length = 689

 Score =  484 bits (1246), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 283/674 (41%), Positives = 388/674 (57%), Gaps = 58/674 (8%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           + N+  NRL  E SPYLLQHA NPVDW+ W E+AFA+A++ + P+F+SIGYSTCHWCHVM
Sbjct: 2   AENRRFNRLIHEKSPYLLQHARNPVDWYPWSEDAFAKAKQENKPVFVSIGYSTCHWCHVM 61

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
           E ESFE+E VA++LN+ FV+IKVDREERPD+D +YM   Q + G GGWPLSVFL+P+  P
Sbjct: 62  ERESFENEEVARILNEKFVAIKVDREERPDIDAIYMLVCQMMTGQGGWPLSVFLTPEKVP 121

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKL 276
              GTYFP E +YG PGFK +L  +   + +  D +   G     Q+ +AL AS    K 
Sbjct: 122 FYAGTYFPRESRYGMPGFKEVLLYLSQQYTENPDRIKDVGV----QVKQALEASREKGK- 176

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
              L +  +    +   + +D R+GGFG APKFP P  +  +L ++K  E+      A++
Sbjct: 177 QTALTKETIGRAFQAYKQGFDPRYGGFGKAPKFPMPHSLVFLLMYAKFYENRDALAMATK 236

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
                  TL  +A+GGI+DH+G GF RYSVDE++ VPHFEKMLYD   L   Y DAF +T
Sbjct: 237 -------TLDGLARGGIYDHIGYGFSRYSVDEKFLVPHFEKMLYDNALLVLAYTDAFRMT 289

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
           K+  Y  I  +I+ Y+ RDM  P G  +SAEDADS   EG    KEG FYVWT  EV+D+
Sbjct: 290 KNAQYKKITEEIITYVLRDMAHPDGGFYSAEDADS---EG----KEGKFYVWTPAEVKDV 342

Query: 457 LGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS-ASKLGMPLEKY 514
           LGE    LF + Y +   GN            F+GKN+  ++     S A K G+     
Sbjct: 343 LGEQLGTLFCQAYGITGQGN------------FEGKNIPNQITTHLESIAKKEGISPAAL 390

Query: 515 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 574
              L   R+ LF  R KR RP  DDK++ +WNGL+I++ A+A ++         F+ P  
Sbjct: 391 AEKLETARQSLFQHREKRVRPFRDDKILTAWNGLMIAALAKAGRV---------FHQP-- 439

Query: 575 GSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE 634
                 Y++ AE A SFIR +L   Q  R+   +R+G  K  GF+D+YAFL+ G ++LYE
Sbjct: 440 -----SYVQAAEKAVSFIRDNLI--QNDRVMVRYRDGEVKNKGFIDEYAFLLWGYMELYE 492

Query: 635 FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
                 +L  A +L     +LF D  GGG+F +  +D  +L+R KE +DGA PSGNSV+ 
Sbjct: 493 STFAPFYLAEAKKLAGNMIDLFWDGHGGGFFFSGNDDEPLLVRQKESYDGALPSGNSVAA 552

Query: 695 INLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVL 754
             L+RL+ +      +   +  +    VF   + D   A  +M  A  M +  + K VV+
Sbjct: 553 CQLLRLSKLTGDFTLE---EKVQQLFQVFSKDIHDEPTAHAMMLQAG-MHAQQATKEVVI 608

Query: 755 V---GHKSSVDFEN 765
           V     K  VDF N
Sbjct: 609 VMDDETKEVVDFIN 622


>gi|390559056|ref|ZP_10243426.1| conserved hypothetical protein [Nitrolancetus hollandicus Lb]
 gi|390174366|emb|CCF82718.1| conserved hypothetical protein [Nitrolancetus hollandicus Lb]
          Length = 685

 Score =  484 bits (1246), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 272/682 (39%), Positives = 385/682 (56%), Gaps = 53/682 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYLLQHA NPVDW+ WG+EA A AR++D PI LSIGYS+CHWCHVM  ESFE
Sbjct: 3   NRLKNETSPYLLQHADNPVDWYPWGKEALAAAREQDKPILLSIGYSSCHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +  +A ++N+ F++IKVDREERPD+D +YM  VQ L G GGWP++VFL+PD++P   GTY
Sbjct: 63  NPDIAAIMNENFINIKVDREERPDLDAIYMAAVQMLSGQGGWPMTVFLTPDMRPFYAGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPPED+   PGF  IL  V DA+  +R+ + ++     ++L+    A+  S  +   +  
Sbjct: 123 FPPEDRPPMPGFARILDLVADAYRDRREDIDETAEQISDELNHHFQAAIESLAISPSILD 182

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
           +  R    +L+  +D   GGFG+ PKFP  + ++ ML   +    TG    +    +MV 
Sbjct: 183 DGAR----KLALQFDQSNGGFGNEPKFPPSMSLEFML---RTYVRTG----SKRALEMVT 231

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
           FTL  MA+GGI+D +GGGFHRYSVD  W VPHFEKMLYD   LA +Y   +  T    Y 
Sbjct: 232 FTLDRMARGGIYDQIGGGFHRYSVDAIWLVPHFEKMLYDNALLARIYTLGYQATGKDLYR 291

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-A 461
            I      Y+ R+M+ P G  +SA+DADS   EG    +EG FY+WT +E E +LG   A
Sbjct: 292 RIAEQTFTYVLREMMSPEGGFYSAQDADS---EG----EEGKFYIWTPQEFETVLGRRDA 344

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
            + K ++ + P GN            F+GKN+L    +    A + G+ LE+  + + E 
Sbjct: 345 SIAKRYFGIMPDGN------------FEGKNILTAPREPERIAEQFGISLEELESTIAEI 392

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           R KL+  RS R  P  DDKV+ +WN L++ SFA  + +                  R + 
Sbjct: 393 RGKLYQARSTRVWPGRDDKVLTAWNALMLRSFAEGATVFG----------------RADL 436

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
           +EVA   A FIR +LY  Q   L  ++  G +K  G+L+DYA+LI  LL LYE      W
Sbjct: 437 LEVAVRNARFIRDNLY--QDGHLLRTYTAGQAKLNGYLEDYAYLIDALLSLYEATFNASW 494

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           + WA EL +T  + F D E GG+F+T      ++ R KE  D A PSGNSV+   L+RL+
Sbjct: 495 IAWAQELTDTMVKEFWDHENGGFFSTGTSHEELVARPKELFDSATPSGNSVAADVLLRLS 554

Query: 702 SIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSV 761
            ++   ++D YR+     L       K+       +  A D  ++ S + + LVG  S+ 
Sbjct: 555 HLLG--RND-YRERGMAVLKKHGMLAKEYPHGTARLLLAYD-FALSSPREIALVGDPSAE 610

Query: 762 DFENMLAAAHASYDLNKTVSKK 783
             +++LA     Y  +K V+ +
Sbjct: 611 ATQSLLAVVQQPYLPHKVVALR 632


>gi|449300572|gb|EMC96584.1| hypothetical protein BAUCODRAFT_33944 [Baudoinia compniacensis UAMH
           10762]
          Length = 739

 Score =  484 bits (1245), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 269/643 (41%), Positives = 369/643 (57%), Gaps = 32/643 (4%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNR     SPY+  H  NP  W  W  E    AR+ +  +F+SIGYS CHWCHVM  ESF
Sbjct: 9   TNRCGESKSPYVRSHMDNPTAWQLWTPETLELARQTNRLLFVSIGYSACHWCHVMAHESF 68

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           +D  +A+LLN+ F+ IK+DREERPD+D+ YM ++QA  GGGGWPL+VF++PDL+P+ GGT
Sbjct: 69  DDPRIAQLLNEHFIPIKIDREERPDIDRQYMDFLQATSGGGGWPLNVFVTPDLEPIFGGT 128

Query: 222 YFP-PED---KYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE-------ALSAS 270
           Y+P P+    + G  GF+ IL KV   W ++   L ++G     QL E            
Sbjct: 129 YWPGPKSERAQMGGTGFEQILVKVAQMWKEQESKLRENGKQITAQLKEFAQEGTLGGRTD 188

Query: 271 ASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLY---HSKKLED 327
             ++   D L  + +          +DS++GGFGSAPKFP PV ++ ++    H   +++
Sbjct: 189 GKTSDGDDGLELDLIEEAYNHYKGRFDSKYGGFGSAPKFPTPVHLKALVRFGCHPHTVKE 248

Query: 328 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 387
                E    + M + TL+CMAKGGI D VG GF RYSV   W +PHFEKMLYD  QL  
Sbjct: 249 IVGDKEVKHARYMAVKTLECMAKGGIKDQVGHGFARYSVTRDWSLPHFEKMLYDNAQLLP 308

Query: 388 VYLDAFSLTKDVFYSYICRDILDYLRRD-MIGPGGEIFSAEDADSAETEGATRKKEGAFY 446
           +YLDA+ LTK   +     D+  YL  + M    G I ++EDADS  T     K+EGAFY
Sbjct: 309 LYLDAYLLTKTDLFLETVHDVATYLTTEPMQSSLGGINASEDADSLPTAIDHHKREGAFY 368

Query: 447 VWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 505
           VWT  E +++L  E A +   ++ ++P GN D  R  D   E  G+N L    D+   AS
Sbjct: 369 VWTLDEFKELLTDEEATVCARYWNVQPNGNVD--RRYDHQGELVGRNTLCVQYDTPDLAS 426

Query: 506 KLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEA 564
           +LGM   +   ++G  R+KL + R K RP P LDDK++ +WNGL I   ARAS  L S A
Sbjct: 427 ELGMSDSEVKRLIGSGRKKLLEYRDKNRPLPSLDDKIVTAWNGLAIGGLARASAALSSMA 486

Query: 565 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 624
             +           + Y+  AE AA+ I++HL+D +T  L+  +R GP +  GF DDYAF
Sbjct: 487 PDSA----------QAYLAGAERAAACIKQHLFDAKTGTLRRVYREGPGETQGFADDYAF 536

Query: 625 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 684
           LISGLLDLYE      +L +A  LQ TQ +LF D     +F+T    P +L+R K+  D 
Sbjct: 537 LISGLLDLYEATFDDSYLSFADTLQQTQVKLFWDDNKYAFFSTPANQPDILVRTKDAMDN 596

Query: 685 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 727
           AEPS N VS  NL RL+S++   K   Y + A+ ++A FE  +
Sbjct: 597 AEPSTNGVSAQNLFRLSSLLNDEK---YEKMAKRTVAAFEVEI 636


>gi|406878261|gb|EKD27217.1| hypothetical protein ACD_79C00804G0001 [uncultured bacterium]
          Length = 713

 Score =  483 bits (1244), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 265/663 (39%), Positives = 381/663 (57%), Gaps = 45/663 (6%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           ++TN L  E SPYLLQHAHNPVDW+ W EEAF +ARK D P+FLSIGYSTCHWCHVME E
Sbjct: 6   ENTNHLVNEKSPYLLQHAHNPVDWYPWSEEAFDKARKEDKPVFLSIGYSTCHWCHVMEEE 65

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SF  + +A +LN  F+SIKVDREERPD+D VYM  VQ + G GGWPL+VF++PD K   G
Sbjct: 66  SFSGKTIADILNRDFISIKVDREERPDIDSVYMNAVQKMTGSGGWPLNVFITPDKKIFYG 125

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           GTYF PE        K IL  ++D W  KR+ + +     +  ++E   A   + ++ D 
Sbjct: 126 GTYFAPEQ------LKIILSSIEDLWKNKREKILKPSEELMNLMNEETLARNHTTEVSDV 179

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
           +   A      Q    YDS +GGFG+ PKFP       +L +  + ++           +
Sbjct: 180 VFNTAFEFLLSQ----YDSMYGGFGTFPKFPSSQTFSFLLRYYYRTKN-------KTALE 228

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           MV  ++  +  GGI+D +G G HRYS D++W +PHFEKMLYDQ  +  V+L+ + +T++ 
Sbjct: 229 MVKNSISHILDGGIYDQLGSGIHRYSTDQKWFLPHFEKMLYDQALITKVFLEIYQITREE 288

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET-EGATRKKEGAFYVWTSKEVEDILG 458
            Y+   RDIL+++ R+M  P G  +SA DADS    E + +K EGAFY+W  KE+  ILG
Sbjct: 289 KYAEAARDILEFVLREMTSPEGVFYSALDADSFNNDENSVKKTEGAFYIWEKKEIIRILG 348

Query: 459 -EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
            +   +F  +Y ++  GN      +D H EF  KNVL   N+ + +A    M  ++  N 
Sbjct: 349 NKTGEIFCYYYGIQEDGNVS----NDSHGEFIRKNVLAVSNNLTNTAKHFNMQHKEIENE 404

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
           L    + LF  R KRP+P LDDK++  WN L+IS+FA+   IL                +
Sbjct: 405 LNRSHQLLFHSREKRPKPFLDDKILTDWNALMISAFAKGGLIL----------------N 448

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGS 637
              Y+  + ++A+F+   L  E+   L H +R+  +  PGFLDDYAF I+ LLDLYE   
Sbjct: 449 EPRYVNASINSANFVLSRLKTEKG-TLLHRYRDQIAGIPGFLDDYAFFINSLLDLYEATF 507

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNT-TGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
              +L  A+ L +   ELF D+  GG+F T  G +  +  R+KE +DGA PSGNS+++IN
Sbjct: 508 EGIYLKEALALNDKMLELFEDKVNGGFFLTAVGTETILQNRIKEFYDGAYPSGNSIALIN 567

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVG 756
           L++L+ I   ++ +  +Q+++ S+      L     A  LM   A   S+     +V+V 
Sbjct: 568 LIKLSRI---TQKNILKQSSKKSIDFISEALSKFPTAY-LMSLIALNNSLEPENEIVIVS 623

Query: 757 HKS 759
           + S
Sbjct: 624 NDS 626


>gi|85858097|ref|YP_460299.1| thymidylate kinase [Syntrophus aciditrophicus SB]
 gi|85721188|gb|ABC76131.1| thymidylate kinase [Syntrophus aciditrophicus SB]
          Length = 691

 Score =  483 bits (1244), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 280/676 (41%), Positives = 377/676 (55%), Gaps = 58/676 (8%)

Query: 94  TSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWC 153
           ++ S     NRL  E SPYLLQHA NPVDW+ WGEEAF +AR+ D PIFLSIGYSTCHWC
Sbjct: 4   STRSTGSFRNRLQQEKSPYLLQHASNPVDWYPWGEEAFEKARREDKPIFLSIGYSTCHWC 63

Query: 154 HVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPD 213
           HVM  ESFE+E VA+LLN+ F+SIKVDREERPD+DK+YM   Q L GGGGWPL++ ++PD
Sbjct: 64  HVMAHESFENEEVARLLNESFISIKVDREERPDIDKLYMAVCQLLTGGGGWPLTILMTPD 123

Query: 214 LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASS 273
            +P   GTY P E + G  G   ++  + + W K+R+ + ++      +++ AL      
Sbjct: 124 RRPFYAGTYIPRESRSGMVGMLVLIPGLSEVWRKERNRILETAG----EITTALQGMDQG 179

Query: 274 NKLPDELPQN-ALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSG 332
              P ELP +  L    + L + +D+R+GGF SAPKFP       M  HS  L   G+  
Sbjct: 180 G--PGELPLDRVLHEAYDDLRRRFDARYGGFDSAPKFP-------MAQHSFFLLRYGRRQ 230

Query: 333 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 392
           E S+   +V  TLQ M +GGI+D VG GFHRYS D +W +PHFEKMLYDQ  LA  Y +A
Sbjct: 231 ENSQALAIVEKTLQSMRRGGIYDAVGFGFHRYSTDAQWRLPHFEKMLYDQALLAMAYTEA 290

Query: 393 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 452
           F       Y    R+IL Y+ RDM  P G  +SAEDAD+A        +EGAFY+WT++E
Sbjct: 291 FQAAGQSLYKKTAREILTYVLRDMTAPEGGFYSAEDADTA-------GEEGAFYLWTAEE 343

Query: 453 VEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS-KLGMPL 511
           +  +L           Y  P G               GK  ++  + S    S  L +P 
Sbjct: 344 LRQVLPTEEAELMIRVYAIPEG---------------GKPSVLHCSSSYPELSVDLDLPE 388

Query: 512 EKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNF 571
           E+ L  L   R+KLF  R+KR RP  DDK++  WNGL+I++ ARA+ +         F  
Sbjct: 389 ERLLERLESARQKLFLQRAKRIRPLRDDKILTDWNGLMIAAMARAAAV---------FEE 439

Query: 572 PVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLD 631
           PV       Y++ A  A  FI  +L D +  RL H +R G +  P  LDDYAFLI GL++
Sbjct: 440 PV-------YLQAAREAVRFILENLRDPR-GRLLHRWREGEAAMPAVLDDYAFLIWGLIE 491

Query: 632 LYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNS 691
            YE       L  A+ L       F D   GGYF T  +  S+L+R KE +DGA PSGNS
Sbjct: 492 AYEATFDANLLQTALSLDEELTAHFWDNASGGYFYTPDDGESLLVRQKESYDGAIPSGNS 551

Query: 692 VSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKH 751
           V+++NL+RL+ +   +  +   + A  +   F   ++ ++ A      A D L+ PS   
Sbjct: 552 VAMLNLLRLSRLTGQAGLE---ERAVATAQAFADSIRSLSAAHTSFMVALDYLAGPS-AE 607

Query: 752 VVLVGHKSSVDFENML 767
           VV+ G     D  +ML
Sbjct: 608 VVIAGSPEGTDTRDML 623


>gi|87306323|ref|ZP_01088470.1| hypothetical protein DSM3645_08327 [Blastopirellula marina DSM
           3645]
 gi|87290502|gb|EAQ82389.1| hypothetical protein DSM3645_08327 [Blastopirellula marina DSM
           3645]
          Length = 688

 Score =  483 bits (1244), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 288/685 (42%), Positives = 392/685 (57%), Gaps = 60/685 (8%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRL  E SPYLLQHA NPVDW  W + A AEA + D PIFLSIGYS CHWCHVME ESF
Sbjct: 2   ANRLTHESSPYLLQHAANPVDWRPWDQAAIAEAVEADKPIFLSIGYSACHWCHVMEHESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           E++ +A  LN+ FVSIKVDREERPD+D++YM  VQ L G GGWP+SVFL+P LKP  GGT
Sbjct: 62  ENQEIADYLNEHFVSIKVDREERPDLDQIYMNAVQMLTGRGGWPMSVFLTPQLKPFFGGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDM-LAQSGAFAIEQLSEALSASASSNKLPDEL 280
           Y+PP  + G PGF  +L+ V DAW+ +R + L QS  FA E+L E   A  S  ++   L
Sbjct: 122 YWPPTPRGGMPGFDQVLKAVMDAWENRRAIALEQSEKFA-ERLQEIGQAEDSGEQIDLHL 180

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
             +A +     L   YD R GGFG APKFP  ++I++ L +S++         +S   +M
Sbjct: 181 LDDAYKY----LESIYDFRHGGFGGAPKFPHTMDIEVCLRYSRR-------QPSSRALEM 229

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
            +  L  MA+GGI+DH+GGGF RYSVD RW VPHFEKMLYD   LA VY+D +  T    
Sbjct: 230 AIHNLDQMARGGIYDHLGGGFARYSVDARWLVPHFEKMLYDNALLAGVYIDGYRATGRED 289

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE- 459
           ++ + R+  DY+   +    G   S EDADS   EG    +EG FYVWT +E+ DILGE 
Sbjct: 290 FARVARETCDYVLHYLTDEAGGFQSTEDADS---EG----EEGKFYVWTPQEIVDILGEG 342

Query: 460 HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL---IELNDSSASASKLGMPLEKYLN 516
               F E + +  +GN            F+GKN+L     + D  A+++   + L + L+
Sbjct: 343 EGRRFCEIFDVSESGN------------FEGKNILNLPQSIEDWGAASNLDVVELRRELD 390

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
           +    R++L  VR KR RP  DDKV+VSWNGL+I S ARA+  L                
Sbjct: 391 V---ARQQLLQVRDKRIRPAKDDKVLVSWNGLMIDSLARAAGALSE-------------- 433

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
              +Y+  AE AA F+   + D+ + RL HS+R+G +K   +LDDYA L +  + LYE  
Sbjct: 434 --PKYLIAAERAADFVFDKMIDD-SGRLLHSYRHGVAKLAAYLDDYANLANACISLYEAS 490

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
              +WL  AIEL N     F D  GGGY+ T  +   ++ R K+ +D + PSGNS++ + 
Sbjct: 491 FAERWLKRAIELTNLMMRHFGDPVGGGYYFTADDHEKLIARNKDLYDNSVPSGNSMAAVV 550

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVG 756
           L+RL++++  ++       A  ++ V    +K    A   M  A D    P+R+ VV+ G
Sbjct: 551 LLRLSALLGNTE---LLDEAVTTIRVAAPLMKKHPTATGQMLAAVDRYLGPARE-VVIFG 606

Query: 757 HKSSVDFENMLAAAHASYDLNKTVS 781
           +  S      LA    SY  N  ++
Sbjct: 607 NADSGATHEFLAELRRSYTPNSAIA 631


>gi|387929306|ref|ZP_10131983.1| hypothetical protein PB1_12859 [Bacillus methanolicus PB1]
 gi|387586124|gb|EIJ78448.1| hypothetical protein PB1_12859 [Bacillus methanolicus PB1]
          Length = 685

 Score =  483 bits (1243), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 272/613 (44%), Positives = 366/613 (59%), Gaps = 53/613 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           NK  NRL AE SPYLLQHAHNPVDW+ WGEEAF +AR  + P+F+SIGYSTCHWCHVME 
Sbjct: 4   NKTPNRLIAEKSPYLLQHAHNPVDWYPWGEEAFQKARTENKPVFVSIGYSTCHWCHVMER 63

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE VA+LLN+ FVSIKVDREERPD+D +YM   Q + G GGWPLSVF++PD KP  
Sbjct: 64  ESFEDEEVARLLNERFVSIKVDREERPDIDSIYMNICQMMNGHGGWPLSVFMTPDQKPFF 123

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFP E +YG PGFK ++ ++ D + K RD + +  + A E L    SA  SS +LP 
Sbjct: 124 AGTYFPKESRYGVPGFKEVITQLHDQYMKNRDQIEKIASDAAEALKH--SARESSAELPS 181

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
               + L    +QL+ S++S +GGFG APKFP P  +  +L + K    TGK        
Sbjct: 182 ---ADVLHKTYQQLAGSFNSFYGGFGDAPKFPIPHNLMFLLKYYKW---TGKEM----AL 231

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
           KMV  TL  MA GGI+DH+G GF RYSVD  W VPHFEKMLYD   L   Y +A+ +TK+
Sbjct: 232 KMVEKTLVSMANGGIYDHIGFGFARYSVDVMWLVPHFEKMLYDNALLLYTYSEAYQVTKN 291

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             Y  I   I++++ R+M    G  FSA DADS   EG    +EG +YVW+ +E+ D+LG
Sbjct: 292 SKYKEIAEQIIEFITREMTNEEGAFFSAIDADS---EG----EEGKYYVWSKEEILDVLG 344

Query: 459 EH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGMPLEKYL 515
           +     F   Y +   GN            F+GKN+  LI  N    + ++ G+ LE+  
Sbjct: 345 DKDGEFFCRVYDITSGGN------------FEGKNIPNLIHTN-IVKTVAEAGLNLEEGK 391

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
             L E R+KLF+ R +R  PHLDDK++ SWN L+I+  A+A +  ++             
Sbjct: 392 AKLEESRQKLFEKRQERVYPHLDDKILTSWNALMIAGLAKAGQAFQN------------- 438

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
              K ++E AE A  FI   L       L   +R+G SK   +LDD+AFL+  LL+LYE 
Sbjct: 439 ---KNHVEKAEKALRFIEEKLV--VNGELMARYRDGESKFRAYLDDWAFLLWALLELYEA 493

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
               ++L  A        + F D + GG++ T  +  ++++R K+ +DGA PSGNSV+ +
Sbjct: 494 TFSMEYLDKARNTAEKMKKHFWDEQDGGFYFTRSDGEALIVREKQVYDGALPSGNSVAAV 553

Query: 696 NLVRLASIVAGSK 708
           +L+RL      +K
Sbjct: 554 SLLRLGHFTGETK 566


>gi|125972813|ref|YP_001036723.1| hypothetical protein Cthe_0291 [Clostridium thermocellum ATCC
           27405]
 gi|281417012|ref|ZP_06248032.1| protein of unknown function DUF255 [Clostridium thermocellum JW20]
 gi|385779271|ref|YP_005688436.1| hypothetical protein Clo1313_1937 [Clostridium thermocellum DSM
           1313]
 gi|419721660|ref|ZP_14248818.1| hypothetical protein AD2_1363 [Clostridium thermocellum AD2]
 gi|419725407|ref|ZP_14252450.1| hypothetical protein YSBL_1257 [Clostridium thermocellum YS]
 gi|125713038|gb|ABN51530.1| hypothetical protein Cthe_0291 [Clostridium thermocellum ATCC
           27405]
 gi|281408414|gb|EFB38672.1| protein of unknown function DUF255 [Clostridium thermocellum JW20]
 gi|316940951|gb|ADU74985.1| hypothetical protein Clo1313_1937 [Clostridium thermocellum DSM
           1313]
 gi|380771156|gb|EIC05033.1| hypothetical protein YSBL_1257 [Clostridium thermocellum YS]
 gi|380782356|gb|EIC11996.1| hypothetical protein AD2_1363 [Clostridium thermocellum AD2]
          Length = 680

 Score =  483 bits (1243), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 274/675 (40%), Positives = 381/675 (56%), Gaps = 64/675 (9%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           S  K  NRL  E SPYLLQHA+NPVDW+ W +EAF +A++ + PIFLSIGYSTCHWCHVM
Sbjct: 2   SAYKQANRLIHEKSPYLLQHAYNPVDWYPWCDEAFEKAKRENKPIFLSIGYSTCHWCHVM 61

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
           E ESFEDE VA++LN  FVSIKVDREERPD+D +YMT  QAL G GGWPL++ ++PD KP
Sbjct: 62  ESESFEDEEVAEILNKNFVSIKVDREERPDIDSIYMTACQALTGHGGWPLTIIMTPDKKP 121

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKL 276
              GTYFP +D+ G PG  +IL+ V + W  ++D LA+  +  +  +SE++      +  
Sbjct: 122 FFAGTYFPKKDRMGMPGLISILKSVHNTWVNEKDSLAKYSSKVVSVISESIDDDYYYS-- 179

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
            DE+ ++       Q    +D+ +GGFG+APKFP P  +  +L +  K         A E
Sbjct: 180 VDEITEDIFEDAFSQFKYDFDNIYGGFGNAPKFPMPHNLYFLLRYWHK---------AKE 230

Query: 337 GQKMVLF--TLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
              +V+   TL  M  GGI+DH+G GF RYS DE+W VPHFEKMLYD   LA  YL+ + 
Sbjct: 231 EYALVMVEKTLDSMYSGGIYDHIGFGFCRYSTDEKWLVPHFEKMLYDNALLAIAYLETYQ 290

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
            TK+  Y+ I ++I  Y+ RDM  P G  +SAEDADS   EG    +EG FY+W+  E++
Sbjct: 291 ATKNKKYADIAKEIFTYVLRDMTSPEGGFYSAEDADS---EG----EEGKFYIWSPTEIK 343

Query: 455 DILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
           ++LGE     F ++Y +   GN            F+G N+   +N +     K  + L  
Sbjct: 344 EVLGESDGEKFCKYYNITEEGN------------FEGLNIPNLINSTIPDEDKEFVEL-- 389

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
                  CR+KLFD R KR  PH DDK++ +WNGL+I++ A   ++L  E          
Sbjct: 390 -------CRKKLFDHREKRVHPHKDDKILTAWNGLMIAALAIGGRVLGIE---------- 432

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
                 +Y   AE A+ FI   L      RL   +R+G +    +LDDYAFLI  L++LY
Sbjct: 433 ------KYTLAAEKASEFIFSKLV-RPDGRLLARYRDGEAAFLAYLDDYAFLIWALIELY 485

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           E      +L  A+EL N   + F D + GG F    +   ++ R KE +DGA PSGNSV+
Sbjct: 486 ETTYKPMYLKKAMELTNDMIKYFWDNKKGGLFIYGSDSEQLITRPKEIYDGAIPSGNSVA 545

Query: 694 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVV 753
            +N +RL+ +    + +   + A    A+F +++  M         A  + S      VV
Sbjct: 546 ALNFLRLSRLTGQQELE---EKAHQMFALFGSKIDSMPQGYAFFLTAM-LFSKSKSNEVV 601

Query: 754 LVGHKSSVDFENMLA 768
           LVG     D +NML+
Sbjct: 602 LVGSNEK-DTQNMLS 615


>gi|385811559|ref|YP_005847955.1| thioredoxin domain-containing protein [Ignavibacterium album JCM
           16511]
 gi|383803607|gb|AFH50687.1| Thioredoxin domain protein [Ignavibacterium album JCM 16511]
          Length = 692

 Score =  483 bits (1242), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 265/665 (39%), Positives = 383/665 (57%), Gaps = 45/665 (6%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           N+  N+L  E SPYLLQHA+NPVDWF W EEAF +A++ D PIFLSIGYSTCHWCHVME 
Sbjct: 2   NRKPNKLINEKSPYLLQHAYNPVDWFPWCEEAFEKAKREDKPIFLSIGYSTCHWCHVMER 61

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE VAKL+ND F+SIKVDREERPD+D VYM   Q + GGGGWPL++ ++PD KP  
Sbjct: 62  ESFEDEEVAKLMNDTFISIKVDREERPDIDGVYMAVCQMITGGGGWPLTIVMTPDKKPFF 121

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFP  +++GR G   ++ K+ D W  +R+ +  S     E+++++++   S  K  +
Sbjct: 122 AGTYFPKYNRFGRIGMLELITKLNDIWKNRREEVLNSA----EEITKSIN-KISHKKSDE 176

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
           E+ +  L    ++ S+ +D  +GGFG+APKFP P  +  +L + ++ ++           
Sbjct: 177 EIDEKILDKAFDEYSRRFDKEYGGFGNAPKFPTPHNLLFLLRYYRRTKNLS-------AL 229

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
           K+V  TL  M KGGI+D +G GF RYS D+ W VPHFEKMLYD   L   + +AF +T +
Sbjct: 230 KIVEKTLTEMRKGGIYDQIGFGFARYSTDKYWLVPHFEKMLYDNALLLMAFSEAFQITGN 289

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
            FY     +I +Y+ RDM  P G  FSAEDADS   EG    +EG FY+WT  E+ ++L 
Sbjct: 290 DFYKTTSEEIAEYVLRDMTHPEGGFFSAEDADS---EG----EEGKFYLWTEVEIRELLT 342

Query: 459 -EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
            + A    + + ++P GN       +      G N+L         A+ L M    ++  
Sbjct: 343 KDEADFIIKVFNIEPNGNW----YDEARGVRTGNNILHLKKSYKELANDLSMSENDFIKN 398

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
           L   R+K+FD R KR  PH DDK++  WN L+IS+  ++S IL                D
Sbjct: 399 LSSIRKKMFDWRKKRVHPHKDDKILTDWNSLMISALIKSSVIL----------------D 442

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGS 637
           + ++++ A  A  F++++L+  ++ +L H FR   S   G +DDYAF I   LDL+E  S
Sbjct: 443 KNKFLQAAMKADKFVKKYLF--RSEKLLHRFRESESAIDGNIDDYAFFIQAQLDLFEATS 500

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
             ++L+ AI L       F D + GGYF T+ +   +++R KE +DGA PSGNSV ++NL
Sbjct: 501 EAEFLLTAIRLNEILFHKFWDDKSGGYFFTSEDSEKLIVRQKEIYDGAIPSGNSVQLLNL 560

Query: 698 VRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGH 757
           +RL  +   +    Y + A+  +  F + +  M        C  D LS  S + V+    
Sbjct: 561 LRLYELTGNA---VYYEIAQKQVKAFASEVSRMPSVFAQFLCGFDFLSGASVQLVITAKD 617

Query: 758 KSSVD 762
           K+  D
Sbjct: 618 KNVAD 622


>gi|357039905|ref|ZP_09101696.1| hypothetical protein DesgiDRAFT_2812 [Desulfotomaculum gibsoniae
           DSM 7213]
 gi|355357268|gb|EHG05044.1| hypothetical protein DesgiDRAFT_2812 [Desulfotomaculum gibsoniae
           DSM 7213]
          Length = 688

 Score =  482 bits (1241), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 277/674 (41%), Positives = 380/674 (56%), Gaps = 52/674 (7%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRLA E SPYLLQHA+NPVDW+ W +EAF  A++ ++PIFLSIGYSTCHWCHVME ESF
Sbjct: 2   VNRLAKEKSPYLLQHANNPVDWYPWSDEAFKRAQRFNLPIFLSIGYSTCHWCHVMERESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           ED+ VA  LN  FVSIKVDREERPD+D++YMT  QAL G GGWPL+V ++PD KP   GT
Sbjct: 62  EDQEVADALNHHFVSIKVDREERPDIDQIYMTVCQALTGQGGWPLTVIMTPDKKPFFAGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP   ++GR G   I+ +V D W   RD L Q+     EQ+            L DE  
Sbjct: 122 YFPKRSRWGRAGLLDIIEQVADKWTNDRDKLIQASDMITEQVQ-----FTPGGYLADEPL 176

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
            +      +Q  +S+D ++GGFG APKFP P  +  ++ + K      ++GE +    M 
Sbjct: 177 ADISARGYKQFRQSFDKQYGGFGLAPKFPTPHNLLFLMRYWK------QNGEEA-ALNMA 229

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             TLQ + +GGI+DH+G GF RYS DE+W VPHFEKMLYD   LA  +L+ +  T++ FY
Sbjct: 230 KKTLQSIYRGGINDHIGFGFSRYSTDEKWLVPHFEKMLYDNALLALAFLEVYQATQNDFY 289

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EH 460
           +   R I  Y+ RDM  P G  +SAEDADS   EG     EG FYVW+  EV  +LG E+
Sbjct: 290 AGAARQIFTYVLRDMTHPEGGFYSAEDADS---EGV----EGKFYVWSPAEVYQVLGREN 342

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
             ++ + Y +  +GN +   +          N++  L +    A KLG+     L +L E
Sbjct: 343 GDIYCKVYNITESGNFESKSIP---------NLISALPEE--HARKLGIETRALLQLLEE 391

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R+KLF+ R++R  P  DDKV+ +WNGL++++ AR + +L              G  R  
Sbjct: 392 SRQKLFNHRARRVHPFKDDKVLTAWNGLMMAALARGAAVL--------------GDVR-- 435

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           Y + A  A  FI RH    +  RL   +R+G S   G+LDDYAF+I GLL+LY       
Sbjct: 436 YRDAAVKAEQFI-RHKLQRRDGRLLARYRDGESDLNGYLDDYAFVIWGLLELYRATFQAV 494

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
           +L  AI+L +   +LF D+E GG+F    +   ++ R KE +DGA PSGNSV   NL++L
Sbjct: 495 YLSRAIDLTHHVRDLFWDQEQGGFFFYGTDSEQLIARPKEIYDGAMPSGNSVMAANLLQL 554

Query: 701 ASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSS 760
           A+I   S+ +   + AE  + +F                A    + P+   +V+ G +  
Sbjct: 555 AAITGNSELE---ELAERQIDIFAGTAAQHPRGYAYFLTALLFATGPT-SEIVITGQRDD 610

Query: 761 VDFENMLAAAHASY 774
                ML  A   Y
Sbjct: 611 PQVAEMLRLAQRQY 624


>gi|322420309|ref|YP_004199532.1| hypothetical protein GM18_2810 [Geobacter sp. M18]
 gi|320126696|gb|ADW14256.1| protein of unknown function DUF255 [Geobacter sp. M18]
          Length = 742

 Score =  482 bits (1241), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 272/657 (41%), Positives = 378/657 (57%), Gaps = 48/657 (7%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           ++TNRL  E SPYLLQHAHNPV+WF WG+EAF  AR+   P+ +SIGY+TCHWCHVME E
Sbjct: 50  RYTNRLFLETSPYLLQHAHNPVNWFPWGDEAFELARRLHRPLLVSIGYATCHWCHVMEEE 109

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFEDE VA+ LN  F++IKVDREERPDVD VYMT V A+   GGWPL+VF++PD KP  G
Sbjct: 110 SFEDESVAEFLNGNFIAIKVDREERPDVDTVYMTAVHAMGLQGGWPLNVFVAPDRKPFYG 169

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           GTY PP D  G  GF T+LR++++++D   D ++++G    E +   L+ +       + 
Sbjct: 170 GTYSPPNDYPGGLGFLTLLRRIRESFDSAPDRVSRAGVQLTEAVQTMLAPAQGEESWQEI 229

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
            P  A+RL  ++    +D R GG   APKFP  + ++++L +  +  D            
Sbjct: 230 SPDPAVRLYQDR----FDDRNGGLVGAPKFPSSLPLRLLLRYFLRTGD-------RRSLS 278

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           MV  TL+ MA GGI+D  GGGFHRY+ D  W VPHFEKMLYD   L   YL+ +  T   
Sbjct: 279 MVELTLRSMAAGGIYDQAGGGFHRYATDTSWLVPHFEKMLYDNALLTVSYLEGYQATGAA 338

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG- 458
            ++ + R+IL YL+RDM  P G  +SA DADS    G   ++EG F+ WT +E+   LG 
Sbjct: 339 EFAAVAREILRYLQRDMQAPAGGFYSATDADSLSPGG--HREEGVFFTWTPEELRGTLGP 396

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
           E   L    Y +   GN            F+G+++L      +  A  L +  ++    L
Sbjct: 397 ERGDLMAACYGVTQGGN------------FEGRSILHREKSIAELARALKLSEQELELTL 444

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
            +CR  L+  R+KRP P  D+K++ SWNGL IS+FA    IL +                
Sbjct: 445 ADCRELLYRARAKRPLPLRDEKILASWNGLAISAFASGGLILNN---------------- 488

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
            E ++VA  AA F+ +++      RL+HSF+ G +K   FLDDYAFLI+GL+DL+E    
Sbjct: 489 AELVQVAVRAAGFMLQNMV--VNGRLRHSFQEGEAKGEAFLDDYAFLIAGLIDLFEASRD 546

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
             WL  A+EL     E F DRE GG+F T      ++ R K  +DG  PSGNSV ++NL+
Sbjct: 547 ISWLERALELTAAVQEQFEDRESGGFFMTGPHHEELISREKPAYDGVIPSGNSVMIMNLL 606

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 755
           RL ++   ++       A ++LA F T+L +   A+  M  A + L   + K VV+V
Sbjct: 607 RLNTLTGATR---LLDQARNALAAFATQLANSPAALSEMLLAIEYLQQ-TPKEVVIV 659


>gi|306811868|gb|ADN05966.1| YyaL-like conserved hypothetical protein [uncultured Myxococcales
           bacterium]
          Length = 800

 Score =  482 bits (1241), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 261/646 (40%), Positives = 366/646 (56%), Gaps = 48/646 (7%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           + TNRL  E SPYLLQHAHNPV+W+AW +EAF  A++ + PIFLS+GYSTCHWCHVME E
Sbjct: 88  RFTNRLIRESSPYLLQHAHNPVNWYAWSDEAFDRAKRENKPIFLSVGYSTCHWCHVMERE 147

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFEDE +A  LN  F++IKVDREERPD+D VYMT V  L G GGWP++V ++P  +P  G
Sbjct: 148 SFEDEEIAAYLNRHFIAIKVDREERPDIDSVYMTAVTILTGRGGWPMTVIMTPHKEPFFG 207

Query: 220 GTYFPPEDKY--GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLP 277
           GTYFPP   +   R G   IL  +   +  +   +        ++LS+ +  +A+    P
Sbjct: 208 GTYFPPRKGFRGNRAGLIDILTDMLSLYKNEPTQVVARA----QELSQRVEQAAAIKPGP 263

Query: 278 DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEG 337
                  + + A+ L + +D   GGFG APKFP+P  + +++ ++++  D G +      
Sbjct: 264 GVPSDKMIVVAAQNLGRMFDPVDGGFGGAPKFPQPSRLSLLMRYARRTRDEGATA----- 318

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
             MV  TL  MA GGI+D VGGGFHRYS D +W VPHFEKMLYD  QLA VYL+A+  T 
Sbjct: 319 --MVTTTLDKMAAGGIYDQVGGGFHRYSTDAQWLVPHFEKMLYDNAQLAVVYLEAWQHTG 376

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           D  Y  + R+ILDY+ R+M  P G  +SA DADS    G    +EG F+ WT  E+E +L
Sbjct: 377 DSAYERVAREILDYVAREMTSPEGGFYSATDADSPTPSG--HDEEGWFFTWTPGELERLL 434

Query: 458 GE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
           G   A +    + +   GN            F+G+N+L  +       S+LG+  ++   
Sbjct: 435 GAGDAAVVSSAFGVTERGN------------FEGRNILHRVKADQELGSELGLAPKRVGE 482

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
           I+   R  L+D R+ RP P  D+K+I +WNG++ ++FA+A  +L +EA            
Sbjct: 483 IIRSARSTLYDARASRPPPIRDEKIIAAWNGMMGAAFAKAGWML-AEA------------ 529

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
               Y+EVA  A  F+   +  E    L  ++R G   +  FLDDYAF+++  LDLYE  
Sbjct: 530 ---RYVEVAARAVGFVLAQMRAEGGA-LVRTYREGKKGSASFLDDYAFIVAACLDLYEAT 585

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
               W+  A+ELQ  QD  +LD + GGY+ T  +   +L+R K  +D A PSGNSV+  N
Sbjct: 586 GDAAWIERAVELQTDQDLRYLDEQTGGYYLTAADGEVLLVREKPAYDRAVPSGNSVAANN 645

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 742
           L+RL       K   +R+ AE   A    ++       PL+  A D
Sbjct: 646 LLRLHDFTGDPK---WRRRAERLFAWLAFQVTRSPTGFPLLLVALD 688


>gi|15607089|ref|NP_214471.1| hypothetical protein aq_2146 [Aquifex aeolicus VF5]
 gi|2984353|gb|AAC07873.1| hypothetical protein aq_2146 [Aquifex aeolicus VF5]
          Length = 692

 Score =  482 bits (1240), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 270/687 (39%), Positives = 388/687 (56%), Gaps = 51/687 (7%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K  NRL  E SPYL QHA+NPVDW+ WGEEAF +A++ D PIFLSIGYSTCHWCHVME E
Sbjct: 3   KKPNRLIKEKSPYLRQHAYNPVDWYPWGEEAFKKAKEEDKPIFLSIGYSTCHWCHVMEKE 62

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFED  +A++LN++FV IKVDREERPDVD  YM+  QA+ G GGWPL++ ++PD +P   
Sbjct: 63  SFEDPEIAEILNNYFVPIKVDREERPDVDAFYMSVCQAMTGTGGWPLTIIMTPDKEPFFA 122

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           GTY P E  +GRPG + +L  +++ W+K R  +  +    ++ L EA   +  +     +
Sbjct: 123 GTYIPKEGMFGRPGLRDLLLTIRELWEKDRTKILNTAKHLVKALQEASRETQKA-----Q 177

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM--LYHSKKLEDTGKSGEASEG 337
           + +  +     +L  SYD  FGGFGSAPKFP P  +  +   Y+  K E         + 
Sbjct: 178 IGEETIHRAFSELFSSYDEHFGGFGSAPKFPTPHNLMFLGRYYYRYKRE---------QA 228

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
            KM+  TL  M  GGI+DHVG GFHRYS D  W +PHFEKMLYDQ  L   Y + + L K
Sbjct: 229 LKMIEKTLTNMRMGGIYDHVGFGFHRYSTDREWILPHFEKMLYDQAMLLFAYTEGYQLLK 288

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
              +     +I+D+L+RDM+ P G  +SA DADS   EG    +EG FY W+ +E++++L
Sbjct: 289 KDLFKQTVYEIVDFLKRDMLSPEGAFYSAWDADS---EG----EEGKFYTWSFEELKEVL 341

Query: 458 G-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
             E   L  + + L   GN     + +      G+NVL         A +LG+  ++   
Sbjct: 342 DPEELELAVKVFNLSQEGNY----LEEATKVKTGRNVLYIGKSYEELAKELGISEKELKE 397

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
            L   R+KLF+ R KR +P  D+K++  WNGL I++ + A K+                 
Sbjct: 398 KLERIRKKLFEAREKRVKPLRDEKILTDWNGLTIAALSYAGKVF---------------- 441

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
             KE++++A+ AA F+ +++  E    L H +  G +K  GFL+DYA+ I GL++LYE  
Sbjct: 442 GEKEWIDLAKGAADFVLKNMRTENG-LLLHRYMEGEAKYWGFLEDYAYFIWGLMELYEAT 500

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
             +K+L   I+LQ  Q + F D+E GG+F T      + +R KE +DGA PSGNSVS  N
Sbjct: 501 LDSKYLEEVIKLQEIQIKHFWDKENGGFFQTPDFFTEIPVRKKEVYDGAIPSGNSVSAYN 560

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVG 756
           L+RL  +++ S+   Y +    +L  F   + +   A      A D++ V   K +V+V 
Sbjct: 561 LIRLGRLISRSE---YEKYGTKTLEAFSWEIANFPSAHTFSIIALDLI-VNGTKELVIVP 616

Query: 757 HKSSVDFENMLAAAHASYDLNKTVSKK 783
              S  + N+ A     Y  +  + KK
Sbjct: 617 TDDS--WRNLKAQLDKEYLPDLLILKK 641


>gi|366164964|ref|ZP_09464719.1| hypothetical protein AcelC_14944 [Acetivibrio cellulolyticus CD2]
          Length = 680

 Score =  481 bits (1239), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 278/670 (41%), Positives = 378/670 (56%), Gaps = 71/670 (10%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           S NK  NRL  E SPYLLQHA+NPV+WF W +EAF +A+  D PIFLSIGYSTCHWCHVM
Sbjct: 2   STNKQANRLIHEKSPYLLQHAYNPVNWFPWSDEAFQKAKSEDKPIFLSIGYSTCHWCHVM 61

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
           E ESFED+ VA  LN  F+SIKVDREERPD+D +YM   QAL G GGWPL++F+SPD KP
Sbjct: 62  EKESFEDKEVADALNKNFISIKVDREERPDIDHIYMNVCQALTGHGGWPLTIFMSPDKKP 121

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKL 276
              GTYFP  ++ G PG  T+L  V DAW   RD+L +S     EQ+  ALS     N +
Sbjct: 122 FFAGTYFPKNNRMGMPGLLTVLESVHDAWVSNRDILTRSS----EQILNALS---DRNDI 174

Query: 277 --PD---ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKS 331
             PD   EL ++       +    +D+ +GGFGSAPKFP P  +  +L +    +D    
Sbjct: 175 LEPDSEEELSEDIFYEAFSEFKYDFDNNYGGFGSAPKFPTPHNLFFLLRYWYNTKD---- 230

Query: 332 GEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLD 391
                  KMV  TL+ M KGGI+DH+G GF RYS D +W +PHFEKMLYD   LA  YL+
Sbjct: 231 ---EYALKMVEKTLESMHKGGIYDHIGFGFSRYSTDRKWLIPHFEKMLYDNALLAIAYLE 287

Query: 392 AFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSK 451
            +  TK   Y+ I ++I  Y+ RDM    G  +SAEDADS   EG    +EG FY+W++ 
Sbjct: 288 VYQATKKSEYADIAKEIFTYVLRDMTSNEGGFYSAEDADS---EG----EEGKFYIWSAN 340

Query: 452 EVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGM 509
           EV+ +LG       E Y       C L  ++  H  F+G N+  LI+ N +         
Sbjct: 341 EVKTVLGNKD---GEKY-------CKLYDIT-AHGNFEGFNIPNLIKGNIAQEDDG---- 385

Query: 510 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 569
                   + ECR+KLF+ R KR  P+ DDK++ SWNGL+I++ A   ++L         
Sbjct: 386 -------FIEECRKKLFEFREKRVHPYKDDKILTSWNGLMIAAMAFGGRVL--------- 429

Query: 570 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 629
                G D+  Y + AE A  FI   L      RL   +R+G S  P ++DDYAFLI GL
Sbjct: 430 -----GVDK--YTKAAEKAVDFIFSKLISSDG-RLLARYRDGDSAFPAYVDDYAFLIWGL 481

Query: 630 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 689
           ++LYE      +L  +++L +   + F D   GG F+   +   ++ R KE +DGA PSG
Sbjct: 482 IELYETTYKPIYLKRSLKLNDDLIKYFWDETNGGLFHYGSDSEQLITRPKEIYDGATPSG 541

Query: 690 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR 749
           NSV+ +N +RLA +   ++ +   + A +  A F   ++  A        A  + +    
Sbjct: 542 NSVATMNFLRLARLTGQAELE---EKAYNQFATFGRSIERFARGHSFFLSAL-LFAKSKS 597

Query: 750 KHVVLVGHKS 759
           K VV+VG+++
Sbjct: 598 KEVVIVGNEN 607


>gi|407473332|ref|YP_006787732.1| thioredoxin domain-containing protein [Clostridium acidurici 9a]
 gi|407049840|gb|AFS77885.1| thioredoxin domain-containing protein [Clostridium acidurici 9a]
          Length = 682

 Score =  481 bits (1239), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 272/689 (39%), Positives = 398/689 (57%), Gaps = 69/689 (10%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           N  TNRL  E SPYLLQHA+NPV+W+ W EEAF +A++ D PIFLSIGYSTCHWCHVME 
Sbjct: 4   NVKTNRLINEKSPYLLQHAYNPVNWYPWDEEAFEKAKQEDKPIFLSIGYSTCHWCHVMER 63

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFED+ VA++LN +F+SIKVDREERPD+D +YM + QA+ G GGWP+++ ++PD KP +
Sbjct: 64  ESFEDDEVAEVLNKYFISIKVDREERPDIDSIYMNFCQAMTGSGGWPMTIIMTPDKKPFI 123

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTY+P    +GR G   +L KV + W   +D L  S    +E +   + AS   N L  
Sbjct: 124 AGTYYPKHSMHGRIGIIELLNKVNEKWKSNKDDLINSSEEILEFMKTNIVASEQGN-LDM 182

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
           E  +NA  L    L  S+D  +GGFG APKFP P  +  +L + K        G+ S   
Sbjct: 183 EDIENAFNL----LKNSFDPEYGGFGKAPKFPTPHNLNFLLRYYK------VKGDES-AL 231

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
           ++V  TL+ M KGGI DH+G GF RYSVDE+W VPHFEKMLYD   LA  Y++A+ +TK 
Sbjct: 232 EVVEKTLESMYKGGIFDHIGYGFARYSVDEKWLVPHFEKMLYDNALLAVAYIEAYQITKR 291

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             Y  I   I +++ R+M    G  +SA DADS   EG     EG FY++   E+ + LG
Sbjct: 292 DLYKEIAEKIFEFIEREMTSEEGGFYSAIDADS---EGV----EGKFYLFDHSEISEQLG 344

Query: 459 -EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
            E + LF  +Y +   GN            F+GKN+         +    G+P     ++
Sbjct: 345 LEDSELFAHYYDITYDGN------------FEGKNI--------PNLIITGLPNMDTNSV 384

Query: 518 LGE----CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
           L E    C +KL+  R+KR  PH DDK++ SWNGL+I + A   ++ K +          
Sbjct: 385 LQERLRACIKKLYTYRNKRVYPHKDDKILTSWNGLMIGALALGGRVFKDD---------- 434

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
                 +Y+E AE +A+FI  +L D +  RL   +R+G +K   +L+DYA+L+ GL++LY
Sbjct: 435 ------KYIERAERSANFILENLIDREG-RLLARYRDGETKYKAYLEDYAYLVHGLIELY 487

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           +     ++L  AI+L     +LF D   GG F    +   ++L+ KE +DGA+PSGNSV+
Sbjct: 488 QSTFKMEYLEKAIKLNQDMLDLFWDDNEGGLFIYGKDSEQLVLQHKEIYDGAQPSGNSVA 547

Query: 694 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAM--AVPLMCCAADMLSVPSRKH 751
            +NL+RL+ I+     +   + ++  L  F   +K+  +  +  LM C   + ++ S + 
Sbjct: 548 SLNLIRLSKILEDPSLE---EKSKAILKAFGGNVKNTVIGHSYLLMSC---LFNIVSTQE 601

Query: 752 VVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           +V++G+K+  D + M+   + ++    TV
Sbjct: 602 IVILGNKNDSDTQEMIDKVNDNFTPFTTV 630


>gi|269926785|ref|YP_003323408.1| hypothetical protein Tter_1680 [Thermobaculum terrenum ATCC
           BAA-798]
 gi|269790445|gb|ACZ42586.1| protein of unknown function DUF255 [Thermobaculum terrenum ATCC
           BAA-798]
          Length = 686

 Score =  481 bits (1239), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 274/679 (40%), Positives = 385/679 (56%), Gaps = 53/679 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYLLQHA NPVDW+ WG+EAF +ARK D PI LSIGYS+CHWCHVM  ESFE
Sbjct: 3   NRLAQESSPYLLQHAENPVDWYPWGQEAFDKARKEDKPILLSIGYSSCHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +  +AK++ND FV+IKVDREERPD+D +YM  VQA+ G  GWPL+VFL+PD KP  GGTY
Sbjct: 63  NPEIAKIMNDNFVNIKVDREERPDIDAIYMEAVQAMTGQAGWPLNVFLTPDGKPFFGGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPPED+ G PGFK +L  + + +  +R  + QS +   +QL +   A   S+ +  E+ +
Sbjct: 123 FPPEDRVGMPGFKRLLLWLSEVYHTRRQEIEQSASQIAQQLLQISRAELKSHDISLEILE 182

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
           +A     + L  S+D ++GGFG+APKFP+P+ ++ +L        +    +  E   MV 
Sbjct: 183 SA----CQSLKSSFDHQYGGFGTAPKFPQPMTVEYLL-------QSFIRAQQKEYLDMVT 231

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  M+ GGIHDH+GGGFHRYSVD  W +PHFEKMLYDQ  +A  YL A+ +T + +Y 
Sbjct: 232 LTLVRMSLGGIHDHLGGGFHRYSVDRTWLIPHFEKMLYDQALIARAYLHAWQVTHNSWYL 291

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            +    L Y+ +DM    G  +SA+DADS   EG    +EG +Y+W+  E++ +L E  +
Sbjct: 292 KVVNRTLQYVLKDMTSSQGGFYSAQDADS---EG----EEGKYYLWSLDEIKRVLNEREV 344

Query: 463 -LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
            L  EHY +  +GN            F+GKN+L         A    M L +   I+ E 
Sbjct: 345 ELVCEHYGVTASGN------------FEGKNILHIAKSIEDLARDHNMDLSEVEKIIDEA 392

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
             KL   R +R  P  D KV+ SWN L+ ++ A        EA  AM N         EY
Sbjct: 393 SMKLLHYRDQRTPPAKDTKVVTSWNALMSTTLA--------EAGFAMNN--------PEY 436

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
           +  ++  A F+  +L  +    L H++ +   K PGFL+DYA L + L+ LYE  S  KW
Sbjct: 437 IAASQRNAQFLLDNLVVDGL--LHHTYSDSKPKVPGFLEDYAALSNSLITLYEITSDGKW 494

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           L  A        + F   E G + +T+ +   + L+ +  +D A PSGNS++ + L+RLA
Sbjct: 495 LESARRFVQDMIDSFWKEEIGTFSDTSIKHSDIFLQPRNLYDNATPSGNSLACMALLRLA 554

Query: 702 SIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSV 761
            I    + D YR+ A   +      +     A   M C A+ L  PS + +V++G K SV
Sbjct: 555 VIF--DRQD-YREIASRVVRGLALVMSKHPTAFGHMLCVANTLLSPSVE-IVILGDKHSV 610

Query: 762 DFENMLAAAHASYDLNKTV 780
           + E +L     +Y  NK +
Sbjct: 611 NTEALLEVIRQTYIPNKIL 629


>gi|78043330|ref|YP_360543.1| hypothetical protein CHY_1723 [Carboxydothermus hydrogenoformans
           Z-2901]
 gi|77995445|gb|ABB14344.1| conserved hypothetical protein [Carboxydothermus hydrogenoformans
           Z-2901]
          Length = 686

 Score =  481 bits (1239), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 279/684 (40%), Positives = 382/684 (55%), Gaps = 57/684 (8%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           +  NRL  E SPYLLQHA+NPVDW+ WG +AF +A   D P+FLSIGYSTCHWCHVME E
Sbjct: 2   RQPNRLIHEKSPYLLQHAYNPVDWYPWGIDAFKKALMEDKPVFLSIGYSTCHWCHVMERE 61

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFEDE VA LLN  FV+IKVDREERPDVD++YMT  QA+ G GGWPL++ ++P+ KP   
Sbjct: 62  SFEDEEVADLLNKHFVAIKVDREERPDVDQIYMTACQAMTGQGGWPLTIIMTPEKKPFFA 121

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           GTYFP   K+GRPG   IL ++   W+  R+ L        ++L E +     S K   +
Sbjct: 122 GTYFPKRSKWGRPGLMEILTEIVKLWETDREQLLTIS----KRLYEFMQTIPQSKK--GD 175

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
           L +  L     +    +DS +GGFG APKFP P  +  +L + K+   TG+       +K
Sbjct: 176 LTEEVLEKAYREFLGRFDSEYGGFGPAPKFPTPHNLIFLLRYWKR---TGEEKALFMAEK 232

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
               TL+ MA+GGI+DHVG GFHRYS D  W VPHFEKMLYD   LA  YL+A+  TK  
Sbjct: 233 ----TLEAMARGGIYDHVGYGFHRYSTDREWLVPHFEKMLYDNALLAYTYLEAYQATKKE 288

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG- 458
            Y+ I R++  Y++R M  P    +SAEDADS   EG     EG +YVWT  EV+ +LG 
Sbjct: 289 KYARIAREVFTYVKRKMTSPERGFYSAEDADS---EGV----EGKYYVWTPDEVKKVLGP 341

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGMPLEKYLN 516
           E   LF   Y + P GN            F+GKN+  LI   D    A ++G    +   
Sbjct: 342 EEGELFCRVYDITPEGN------------FEGKNIPNLIH-TDIELVAQEIGKSAAELTE 388

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
            L   R+KL+  R KR  P  DDK++ SWNGL+I++ A+ +++L+ +             
Sbjct: 389 SLDRMRQKLYHEREKRVLPLKDDKILTSWNGLMIAALAKGARVLQDQ------------- 435

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
              E + +A +AA FI   L      RL   +R G +    +LDDYAFLI GL++LYE  
Sbjct: 436 ---ELLNMAHNAAEFIFSKL-RRADGRLIARYREGEAAVLAYLDDYAFLIWGLIELYEAS 491

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
               +L  A+EL     +LF D + GG F T  +   ++ R KE +DGA PSGNSV+ +N
Sbjct: 492 FEVWYLKLAVELTREMLKLFWDEKHGGLFFTGADGEELITRPKEIYDGALPSGNSVAALN 551

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVG 756
           L+RL+ ++     + + Q A   L+ F  ++ ++  A      A  +  +   K +V+ G
Sbjct: 552 LLRLSRMLG---EEDFLQKAVEILSTFAGKVSEIPSAHSFYLLAY-LFYLGPVKEIVVAG 607

Query: 757 HKSSVDFENMLAAAHASYDLNKTV 780
                D   M+   + +Y  N  V
Sbjct: 608 EPDGEDTRAMIEKINLAYLPNSVV 631


>gi|335040507|ref|ZP_08533634.1| hypothetical protein CathTA2_2248 [Caldalkalibacillus thermarum
           TA2.A1]
 gi|334179587|gb|EGL82225.1| hypothetical protein CathTA2_2248 [Caldalkalibacillus thermarum
           TA2.A1]
          Length = 715

 Score =  481 bits (1239), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 281/684 (41%), Positives = 386/684 (56%), Gaps = 55/684 (8%)

Query: 94  TSHSRN-KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHW 152
            + S+N K+TNRL  E SPYLLQHAHNPVDW+ WGEEAF +AR+ D P+FLSIGYSTCHW
Sbjct: 22  VTDSKNPKYTNRLIHEKSPYLLQHAHNPVDWYPWGEEAFEKARREDKPVFLSIGYSTCHW 81

Query: 153 CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 212
           CHVME ESFEDE +A +LN+ FVSIKVDREERPDVD +YM   QAL G GGWPL++ + P
Sbjct: 82  CHVMERESFEDEEIADILNNHFVSIKVDREERPDVDAIYMAVCQALTGHGGWPLTIVMHP 141

Query: 213 DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASAS 272
           D KP    TY P E K+GR G K IL+K+   W   R  L ++G   I+ + E  S    
Sbjct: 142 DQKPFFAATYLPKEGKWGRSGLKEILQKIHHLWLHDRKKLNEAGTNIIKAIQEMKSRPKG 201

Query: 273 SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSG 332
           +     EL +  L     Q  +++D+ +GGFG APKFP P     +L   +  + TG+  
Sbjct: 202 A-----ELTKEILHHAYAQFERTFDADYGGFGQAPKFPLPHSYLFLL---RYWQMTGE-- 251

Query: 333 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 392
              +  +M   +L+ M +GGI+DH+G GF RYSVDE+W VPHFEKMLYD   LA  Y +A
Sbjct: 252 --PKALEMTEKSLRAMHRGGIYDHLGYGFARYSVDEKWLVPHFEKMLYDNALLAYSYTEA 309

Query: 393 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 452
           +  T++ +Y  +  +I +Y++R M  P G  +SAEDADS   EG     EG FYVWT +E
Sbjct: 310 YQATRNPYYKQVTEEIFEYVQRVMTSPEGGFYSAEDADS---EGV----EGKFYVWTPEE 362

Query: 453 VEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN-DSSASASKLGMP 510
           + ++L E  A LF           CD+  +++  N F+GKN+L  ++ D    A + G+ 
Sbjct: 363 IFEVLEETEAELF-----------CDIYDVTEQGN-FEGKNILHLIDVDLEQKAKQYGLS 410

Query: 511 LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
             +    L   R KLF  R KR  PH DDK++ +WNGL+I++ A+AS             
Sbjct: 411 FAQLEQKLAAARHKLFLHREKRVHPHKDDKILTAWNGLMIAALAKASAAF---------- 460

Query: 571 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLL 630
                  R +Y+E+A  AA+ I RHL D +  RL   +R+G +    ++DDYAF I  L 
Sbjct: 461 ------GRSDYLELARRAANMIERHLTDNEG-RLLARYRDGEAHYLAYIDDYAFFIWALH 513

Query: 631 DLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGN 690
           +LY        L  A  L +   E F D++ GG+F    +   ++   KE +DGA PSGN
Sbjct: 514 ELYFASLDASCLQQAKSLLDQALERFWDKQNGGFFFYAKDAERLITNPKEIYDGATPSGN 573

Query: 691 SVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRK 750
            V   NLVR   +   S  D YR+ AE  L  F  ++ +          A  +LS  +  
Sbjct: 574 GVMAFNLVRHYLL---SGEDVYRETAEALLQAFGQQINEYPSGHAFSLLALQLLS-GNHA 629

Query: 751 HVVLVGHKSSVDFENMLAAAHASY 774
            +V+V  K    ++ M+     +Y
Sbjct: 630 ELVIVEGKDRHTYDKMVETVQRAY 653


>gi|430746011|ref|YP_007205140.1| thioredoxin domain-containing protein [Singulisphaera acidiphila
           DSM 18658]
 gi|430017731|gb|AGA29445.1| thioredoxin domain protein [Singulisphaera acidiphila DSM 18658]
          Length = 701

 Score =  481 bits (1239), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 274/693 (39%), Positives = 393/693 (56%), Gaps = 52/693 (7%)

Query: 90  TPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYST 149
           +P+ T+ + ++ +NRLA E SPYLLQHA NPVDW+ WG EAF  AR  + PIFLS+GYS 
Sbjct: 8   SPSMTASAADRPSNRLAGETSPYLLQHALNPVDWYPWGPEAFDRARAENKPIFLSVGYSA 67

Query: 150 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVF 209
           CHWCHVME ESFE+   A L+N+ F+++KVDREERPDVD++YM  VQA+   GGWP+SVF
Sbjct: 68  CHWCHVMEHESFENADTAALMNEHFINVKVDREERPDVDQIYMAAVQAMTDHGGWPMSVF 127

Query: 210 LSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA 269
           L+PDLKP   GTYFPP D  G PGF  +L  V  AW ++RD +  S     +++      
Sbjct: 128 LTPDLKPFYCGTYFPPVDGRGMPGFPRVLYSVHRAWAERRDDILISAGDLTDRIRLMGKI 187

Query: 270 SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 329
            A+S  L   L   A R     L++S+D+  GGFGSAPKFP P++++++L    +  +  
Sbjct: 188 PAASGALESVLLDQAAR----GLARSFDTIHGGFGSAPKFPHPMDLKVLLRQHARTRE-- 241

Query: 330 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 389
                +   ++V  TL  MA+GGI+D + GGF RYS DERW  PHFEKMLYD   L++VY
Sbjct: 242 -----AHPLQIVRHTLDKMARGGIYDQLLGGFARYSTDERWLAPHFEKMLYDNALLSSVY 296

Query: 390 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 449
           L+A  +T D  Y+ + R+ +DY+   M GP GEI+S EDADS   EG    +EG FYVW+
Sbjct: 297 LEAHQVTGDAEYARVARETMDYILERMTGPEGEIYSTEDADS---EG----EEGKFYVWS 349

Query: 450 SKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 508
             EV  ILG E A  F   Y +  +GN            ++ +N+L        +A++LG
Sbjct: 350 LAEVNQILGPERAKEFAAVYDVTESGN------------WEHQNILNLPMSVDQAATRLG 397

Query: 509 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 568
               +    L   R +L + R +R  P  D KV+ SWNGL++++ A  S+ILK E     
Sbjct: 398 RDERELQADLDRDRARLLEARDRRVPPGKDTKVLTSWNGLMLAALAEGSRILKDE----- 452

Query: 569 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 628
                       Y++ A  AA+F+   +   +  RL H++++G ++  G+LDDY+ LI G
Sbjct: 453 -----------RYLDAATKAAAFLLDRMRTAEG-RLLHAYKDGRARFNGYLDDYSNLIDG 500

Query: 629 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 688
           L  LYE     +W+  A+EL     + F D E GG+F T      ++ R K+  D A PS
Sbjct: 501 LTRLYEVSGEPRWIEAALELTAVMIDEFHDAEAGGFFYTGRSHEVLIARQKDFQDNATPS 560

Query: 689 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPS 748
           GN++    L+RL ++  G +S   R     +L   +  L    MA+     A D      
Sbjct: 561 GNAMVATALLRLGALT-GRES--LRTLGRSTLEAVQAYLDRAPMAMGQSLVALDFELASP 617

Query: 749 RKHVVLVGHKSSVDFENMLAAAHASYDLNKTVS 781
           R+  V+ G   + +F  ++ A +A +  +K V+
Sbjct: 618 REFAVIAGSDPA-EFRRVMEAIYAPFLPHKVVA 649


>gi|399888568|ref|ZP_10774445.1| hypothetical protein CarbS_08603 [Clostridium arbusti SL206]
          Length = 679

 Score =  481 bits (1238), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 266/671 (39%), Positives = 375/671 (55%), Gaps = 62/671 (9%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           N  +NRL  E SPYLLQHA+NPV+W+ W EEAF +A + + PIFLS+GYSTCHWCHVME 
Sbjct: 4   NSISNRLINEKSPYLLQHAYNPVNWYPWSEEAFNKANRENKPIFLSVGYSTCHWCHVMEK 63

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFED  VA+LLN +F++IKVDREERPD+D +YM+  QA+ G GGWP+++ ++ D KP  
Sbjct: 64  ESFEDNEVAELLNKYFIAIKVDREERPDIDNIYMSVCQAMTGSGGWPMTIIMTSDKKPFF 123

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTY P + +YG  G   +L K+   W + ++ L +S    ++ L + +           
Sbjct: 124 AGTYLPKKTQYGHMGLMELLNKINKLWIEDKNKLVESSNNIVDFLQDQIVHKKG------ 177

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
           E+ +  +    E L  SY+  FGGF S+PKFP P  +  +L + +   D           
Sbjct: 178 EISEKIVNDAYESLRDSYNPVFGGFSSSPKFPTPHNLNFLLRYYRAKGD-------KYAL 230

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
           +MV  TL  M  GGI DH+G GF RYSVD +W VPHFEKMLYD   LA +Y + + +T  
Sbjct: 231 QMVENTLNSMYSGGIFDHIGFGFSRYSVDSKWLVPHFEKMLYDNALLAIIYTETYQITHK 290

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             Y  I   IL+Y+ RDM    G  +SAEDADS   EG     EG FYVW  KE++ +LG
Sbjct: 291 DRYREIAMKILNYILRDMTSKQGGFYSAEDADS---EGV----EGKFYVWDKKEIKSVLG 343

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGMPLEKYLN 516
           E A  F EHY +K  GN            F+GKN+  LI  +        +   L+    
Sbjct: 344 EDADFFNEHYNIKSKGN------------FEGKNIPNLIGEDLEELEDESIKSKLDG--- 388

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
                + KLF  R KR  PH DDK++ SWNGL+I++ A A +              V G 
Sbjct: 389 ----LKEKLFSYREKRIHPHKDDKILTSWNGLMIAAMAYAGR--------------VFGI 430

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
           +R  Y E A  + SFI  +L + +  RL   +R+G +   G+LDDYAFL+ GL+++YE  
Sbjct: 431 ER--YKEAASKSISFISHNLVNHKG-RLLCRYRDGEAANLGYLDDYAFLVFGLIEMYEAT 487

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
             + +L  AIEL +   + F D + GG F    +   ++L+ KE +DGA PSGNSV+ +N
Sbjct: 488 FESFYLRKAIELNDEMVKYFWDEQNGGLFFYGKDSEELILKTKEIYDGAIPSGNSVAAMN 547

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVG 756
           ++RL+ I    K +   Q A      F  ++ ++ +A  +   +A + S  S  HVV+ G
Sbjct: 548 IIRLSRITGDKKLE---QKAGEIFNTFAEKINEVPLAY-VNTISAFLTSKISETHVVIAG 603

Query: 757 HKSSVDFENML 767
            K   + + M+
Sbjct: 604 DKDHTNTKAMI 614


>gi|268325595|emb|CBH39183.1| conserved hypothetical protein, DUF255 family [uncultured archaeon]
          Length = 685

 Score =  481 bits (1237), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 277/670 (41%), Positives = 378/670 (56%), Gaps = 73/670 (10%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K  N L  E SPYLLQHA+NPV+W+ WGEEAF  +++ D PIFLSIGYSTCHWCHVM  E
Sbjct: 2   KTPNALINEKSPYLLQHAYNPVNWYPWGEEAFRRSKEEDKPIFLSIGYSTCHWCHVMARE 61

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFE++  A+LLN  F+ IKVDREERPD+D +YM  VQ + G GGWPLSVF++PDLKP  G
Sbjct: 62  SFENKQTAELLNTNFICIKVDREERPDLDALYMKAVQMMAGTGGWPLSVFMTPDLKPFYG 121

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           GTYFPPE  +G P F  +L+ + D W +KR+ +  S     EQ++E L  S   N L +E
Sbjct: 122 GTYFPPEPIHGLPAFNELLQTITDYWHEKRERILHSS----EQITEHLRRSYQHNLLTEE 177

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGS--------APKFPRPVEI-QMMLYHSKKLEDTGK 330
           L  + L    EQL+  +DS +GGFG+         PKFP P  +  ++LYH +  E    
Sbjct: 178 LSVDMLENAFEQLNLQFDSTYGGFGAEVAAWSVKKPKFPLPSYLFFLLLYHHRTDE---- 233

Query: 331 SGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYL 390
               S   KMV  TL  MA+GGI+D + GGFHRYS D RW VPHFEKMLYD   LA VYL
Sbjct: 234 ----SYALKMVTKTLYEMARGGIYDQLAGGFHRYSTDNRWLVPHFEKMLYDNALLAQVYL 289

Query: 391 DAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTS 450
            A+ +T D F++ I  + LD++ R+M    G  +SA DADS +        EGAFYVW+ 
Sbjct: 290 WAYQVTGDKFFAQIATETLDWVLREMTDSNGGFYSAIDADSEDI-------EGAFYVWSP 342

Query: 451 KEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 509
            E+  +L  EH  +F  +Y +   GN +            GK+VL   ND     +    
Sbjct: 343 SEIISVLSEEHGEVFCRYYGVTQQGNFE-----------GGKSVLHVANDEVNKDTA--- 388

Query: 510 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 569
                  I+   ++KL + R++R RP  DDK+I  WN L+IS+FA   ++L+        
Sbjct: 389 ------GIINRSKQKLLEARNRRIRPATDDKIITGWNSLMISAFALGYQVLR-------- 434

Query: 570 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 629
                    + +++ A SA  FI   L  E   +L   +R G +   G LDD+AFLI+ L
Sbjct: 435 --------ERRFLDAATSATQFILNKLNKEG--QLFRRYRAGEAAITGTLDDHAFLIAAL 484

Query: 630 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 689
           LD+YE     KWL  A++  +   ELF D+   G+F     +  +   +KE +DG  PSG
Sbjct: 485 LDIYEASFDLKWLREALQRNDRVVELFWDKANAGFFFNRYGETDLPAAIKEAYDGPIPSG 544

Query: 690 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM-LSVPS 748
           NS++  NL+RLA++   + ++  R  A+     F  +L+   +    M CA D  LS P 
Sbjct: 545 NSIAAQNLIRLAAL---TDNEELRILAKDLFRTFGAQLEQSPLEHTQMLCALDFYLSSPM 601

Query: 749 RKHVVLVGHK 758
           +  VV+   K
Sbjct: 602 Q--VVIASQK 609


>gi|188996723|ref|YP_001930974.1| hypothetical protein SYO3AOP1_0787 [Sulfurihydrogenibium sp.
           YO3AOP1]
 gi|188931790|gb|ACD66420.1| protein of unknown function DUF255 [Sulfurihydrogenibium sp.
           YO3AOP1]
          Length = 686

 Score =  481 bits (1237), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 273/667 (40%), Positives = 367/667 (55%), Gaps = 53/667 (7%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           NK  NRL  E SPYLLQHA+NPVDW+ W +EAF +A+K D PIFLSIGYS+CHWCHVME 
Sbjct: 2   NKKPNRLINEKSPYLLQHAYNPVDWYPWCDEAFEKAKKEDKPIFLSIGYSSCHWCHVMEK 61

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE VAK+LN+ FVSIKVDREERPD+D +YM       G GGWPL++ ++PD KP  
Sbjct: 62  ESFEDEEVAKILNENFVSIKVDREERPDIDSIYMNVCLMFNGSGGWPLTIIMTPDKKPFF 121

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFP   + GR G   +L  V + W   ++ L Q     IE L       +      D
Sbjct: 122 AGTYFPKYSRPGRIGLVDLLTSVAEYWKNNKEDLIQRAEKVIEYLKNDFKGKS------D 175

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---YHSKKLEDTGKSGEAS 335
           E+ ++ +  C   L   +D  +GGF   PKFP P  I  +L   YH+K++          
Sbjct: 176 EISKDIIDACYLDLKSRFDKEYGGFSIKPKFPTPHNILFLLRYYYHTKEM---------- 225

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
           E  KM   TL  M  GG++DHVG GFHRYS D  W +PHFEKMLYDQ  L   Y +A+ L
Sbjct: 226 EALKMAEKTLINMRLGGMYDHVGFGFHRYSTDREWLLPHFEKMLYDQAMLTMAYTEAYQL 285

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           TK+ FY    ++ + Y+ RDM    G  +S+EDADS   EG    +EG FY WT  E+++
Sbjct: 286 TKNNFYKKTAQETIAYVLRDMTSKEGVFYSSEDADS---EG----EEGKFYTWTIDELKE 338

Query: 456 ILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 514
           +L +  + L  + + +K  GN     + +      G+N+L         A+ L M  ++ 
Sbjct: 339 VLNDEELSLVIKVFNVKEEGN----YLEEATGHLTGRNILYLKKPIRELANDLNMNQDQL 394

Query: 515 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 574
              L E R+KLFD R KR  P  DDKV+  WNGL+IS+ A+A K                
Sbjct: 395 ETKLEEIRKKLFDAREKRVHPQKDDKVLTDWNGLMISALAKAGK---------------- 438

Query: 575 GSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE 634
           G + ++ +E A++AA FI   ++   T  L H +++G  K  G LDDYAF   GL++LYE
Sbjct: 439 GFEDRDLIEKAKTAADFILNTMFKNDT--LYHLYKDGEVKVEGLLDDYAFFSWGLIELYE 496

Query: 635 FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
                K+L  A++L +   E F D E GG+F +      V++R KE  DGA PSGNSVS 
Sbjct: 497 ATGDIKYLKSALKLTDLMIEKFYDFENGGFFLSPKNSKDVIVRPKEAFDGAIPSGNSVSA 556

Query: 695 INLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVL 754
            NL RL  I    K   Y   A  +L  F   +K +     +      ++  P+ + VVL
Sbjct: 557 YNLYRLYLISGNEK---YYNFAIETLKAFGGEIKRLPSYHSMFNIVLMLVFYPTSE-VVL 612

Query: 755 VGHKSSV 761
            G+   V
Sbjct: 613 AGNCEKV 619


>gi|268316671|ref|YP_003290390.1| hypothetical protein Rmar_1111 [Rhodothermus marinus DSM 4252]
 gi|262334205|gb|ACY48002.1| protein of unknown function DUF255 [Rhodothermus marinus DSM 4252]
          Length = 699

 Score =  480 bits (1236), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 279/679 (41%), Positives = 377/679 (55%), Gaps = 45/679 (6%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QH  +PVDW+ W EEAF +A+  D PIFLSIGY+ CHWCHVM  ESF+
Sbjct: 3   NRLQFEKSPYLQQHKDDPVDWWPWCEEAFEKAKAEDKPIFLSIGYAACHWCHVMAHESFQ 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA+LLND F++IKVDREERPD+D +YMT  Q + G GGWPL++ ++PD KP    TY
Sbjct: 63  DEEVARLLNDAFINIKVDREERPDIDHLYMTVCQMVTGHGGWPLTIIMTPDKKPFFAATY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
            P   +YGRPG   I+ ++K+AW + RD +  S       L + +S  A S  +  E  +
Sbjct: 123 IPKRSRYGRPGLLEIIPRIKEAWQQHRDEIIASAEKLTGTLQKVMSFEAPSQIIDAEWLE 182

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
            A R    +L   +D + GGFG APKFP P  +  +L +        +SGEA   Q MV 
Sbjct: 183 IAYR----RLDDIFDRKHGGFGHAPKFPTPHTLLFLLRYWH------RSGEAHALQ-MVE 231

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  M  GGI+DHVG GFHRY+ DE W VPHFEKMLYDQ  L   Y +A+  T + FY 
Sbjct: 232 HTLVQMRLGGIYDHVGFGFHRYATDEAWRVPHFEKMLYDQALLTMAYTEAYQATGNPFYE 291

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHA 461
              R+IL Y+ RD+  P G  +S+EDADS   EG    +EG FYVWT +E+ ++LG E  
Sbjct: 292 RTAREILTYVLRDLRAPEGAFYSSEDADS---EG----EEGKFYVWTVEELREVLGPELT 344

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
            L  E + + P GN +     +   E  GKN+L       A A + G   E+    L E 
Sbjct: 345 PLAIELFNVDPEGNYE----EEATGERTGKNILYLSKPPEALARERGWTPEELEAKLEEI 400

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           R++LF  R++R RP  D+K++  WNGL+I++ ARA+++                 D   Y
Sbjct: 401 RQRLFAYRARRVRPGRDEKILTDWNGLMIAALARAAQVF----------------DEVAY 444

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
           +E A SAA F+ R ++  +  RL H +R G +  PG LDDYAFL  GLLDLYE    T +
Sbjct: 445 VEAARSAADFLLRTMHTPEG-RLWHRYREGEAGIPGMLDDYAFLTWGLLDLYETTFETSY 503

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           L  A+ L       F D  G  Y      +P +++R +E  D A PSGN+V+++NLVRL 
Sbjct: 504 LETALALTEQMLAHFWDPRGAFYMTPDDGEP-MIVRPRETLDNALPSGNAVALMNLVRLG 562

Query: 702 SIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSV 761
            +   +    Y ++A+  +  F   +K        M  A D+   P  + +VL G     
Sbjct: 563 HMTGRTA---YEEHADAMIRFFSGPVKQQPPIFTGMLIAIDLAFGPIYE-LVLAGEPDDP 618

Query: 762 DFENMLAAAHASYDLNKTV 780
               ML   H  Y   K +
Sbjct: 619 TLREMLRTIHRRYLPRKVL 637


>gi|298243436|ref|ZP_06967243.1| protein of unknown function DUF255 [Ktedonobacter racemifer DSM
           44963]
 gi|297556490|gb|EFH90354.1| protein of unknown function DUF255 [Ktedonobacter racemifer DSM
           44963]
          Length = 719

 Score =  480 bits (1236), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 280/710 (39%), Positives = 400/710 (56%), Gaps = 63/710 (8%)

Query: 89  RTP-ASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGY 147
           R+P     H   +HTNRLA E SPYLLQHAHNPVDW+ WGEEA  +AR+ D PI LS+GY
Sbjct: 6   RSPQGEQQHREPQHTNRLAHETSPYLLQHAHNPVDWYPWGEEALQKARQEDKPILLSVGY 65

Query: 148 STCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 207
           S CHWCHVME ESFE+  +A L+N  FVSIKVDREERPD+D +YM  VQA+   GGWP++
Sbjct: 66  SACHWCHVMERESFENPAIAALMNQHFVSIKVDREERPDIDNIYMQAVQAMTQQGGWPMT 125

Query: 208 VFLSPDLKPLMGGTYFPPEDK----YGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL 263
           VFL+PD +P  GGTYFPP+D+    Y  PGF+ +L  +   + ++R+ + +      + L
Sbjct: 126 VFLTPDGRPFYGGTYFPPDDRHHGQYVMPGFRRVLLSLAQLYAQEREKIEEQADELAQFL 185

Query: 264 --SEALSASASSNKLPDELPQNALRLCAEQ-LSKSYDSRFGGFGSAPKFPRPVEIQMM-- 318
              E +      N     LPQ  L + A Q L+  +D++ GGFG APKFP  + ++ +  
Sbjct: 186 RQREGMPLRRRENAT-QGLPQLDLLVVASQALANDFDAQHGGFGGAPKFPHSMALEFLLR 244

Query: 319 --LYHSKKLEDTGK-SGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHF 375
             L+ SK+    G+  G  +E   MV  +L+ MAKGG++D +GGGFHRYSVD  W VPHF
Sbjct: 245 VYLHRSKQELSLGQLPGNLTE-LGMVESSLEHMAKGGMYDQLGGGFHRYSVDAEWLVPHF 303

Query: 376 EKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETE 435
           EKMLYD   L+  YL A+ +T   FY  I  + LDY+ R+M+ P G  +S +DADS   E
Sbjct: 304 EKMLYDNALLSCAYLAAYLVTGKPFYRRIVEETLDYVAREMVSPEGGFYSTQDADS---E 360

Query: 436 GATRKKEGAFYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL 494
           G     EG F++W   EVE +L    A +F  +Y +   GN            F+GKN+L
Sbjct: 361 GV----EGKFFLWQPAEVEALLNAPDAAIFMRYYDISARGN------------FEGKNIL 404

Query: 495 IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFA 554
               +    A +L + + +   I+   R +LF  R  R +P  D+K++ SWNGL++ SFA
Sbjct: 405 HINVEVEQLAKELTLSVPEVEQIVKSGREQLFKARELRVKPGRDEKILTSWNGLMLRSFA 464

Query: 555 RASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSK 614
            A++ L                 R +Y+E+A + A+F+ R L   Q  RL  ++++G ++
Sbjct: 465 EAARHL----------------GRGDYLEIAINNANFLLRSL--RQDGRLLRTYKDGRAR 506

Query: 615 APGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSV 674
             G+L+DYAFL  GLL LY+     +W   A  L +    LF D + GG+F+T  +   +
Sbjct: 507 LKGYLEDYAFLADGLLALYQACFDPRWFAEARTLMDQAIALFADEQNGGFFDTGSDHEEL 566

Query: 675 LLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV 734
           + R K+  D A PSGNSV+   L+RLA++   S  D YR+ AE  L      L D+ +  
Sbjct: 567 VTRPKDIMDNATPSGNSVAADVLLRLAAL---SGEDAYRERAEAYL----QSLADVMVQH 619

Query: 735 PLM---CCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVS 781
           P        A   S+   + + L+G   + D + +L   +  Y  N  ++
Sbjct: 620 PQFFGQALGALDFSLTMAREIALLGSPEAADTQALLNVVNTRYLPNSVLA 669


>gi|406859397|gb|EKD12463.1| putative DUF255 domain-containing protein [Marssonina brunnea f.
           sp. 'multigermtubi' MB_m1]
          Length = 820

 Score =  478 bits (1231), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 265/641 (41%), Positives = 371/641 (57%), Gaps = 34/641 (5%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NR     SPY+  H  NPV W  WG EA   AR+ +  IF+SIGY+ CHWCHVME ESFE
Sbjct: 58  NRAGESRSPYVRAHRGNPVAWQLWGSEAVEMARRENRLIFVSIGYAACHWCHVMERESFE 117

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +E +A LLN  F+ +K+DRE RPD+D++YM +VQA  G GGWPL+VFL+PDL+P+ GGTY
Sbjct: 118 NEEIATLLNTHFIPVKIDREVRPDIDRIYMNFVQATTGSGGWPLNVFLTPDLEPVFGGTY 177

Query: 223 FPP-------EDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           +P        ED+     F  IL+K+   W ++ +   +     +EQL    +     ++
Sbjct: 178 WPGHSSGTAFEDQV---DFLGILQKLSSVWREQEERCRRDSKQILEQLKSFAADGTFGSR 234

Query: 276 LPDELPQNA-----LRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---YHSKKLED 327
           L D    +      L    +  S +YDS  GGFG APKFP P ++  +L    +   + D
Sbjct: 235 LGDGEGGDGLDIELLEEAVQHFSSTYDSTNGGFGLAPKFPTPSKLSFLLRLGQYPSIVVD 294

Query: 328 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 387
              + E    Q M + TL+ MA+GG+HD VG GF RYSV   W +PHFEKMLYD  QL +
Sbjct: 295 VVGAPECRNAQSMAVTTLRKMARGGVHDQVGNGFARYSVTADWSLPHFEKMLYDNAQLLH 354

Query: 388 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 447
           VYLDAF L++D     +  DI  YL  D+    G  +S++DADS    G + K+EGAFYV
Sbjct: 355 VYLDAFLLSRDAELLGVVYDISTYLTTDLAHAEGGFYSSQDADSLYRRGDSEKREGAFYV 414

Query: 448 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 507
           WT +E E++LGE+  +    + +  TG+ ++   +D H+EF  +NVL  ++  SA AS+ 
Sbjct: 415 WTKREFENVLGENEPILSAFFNV--TGHGNVGPENDGHDEFLDQNVLAIVSTPSALASQF 472

Query: 508 GMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 566
           GM  E+ + I+   +  L   R K R RP LDDK++ SWNGL + + AR   + K     
Sbjct: 473 GMKEEEVVRIIKAGKAALRAHREKERVRPGLDDKIVTSWNGLAVGALARTGGVFK----- 527

Query: 567 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 626
               F    S+  E +  A  AA+FI+++LYD  +  L   +R G     GF DDYAFL+
Sbjct: 528 ---GFDPAKSE--ELLGFAIKAATFIKQNLYDSSSKILYRIWREGRGDTEGFADDYAFLV 582

Query: 627 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 686
            GL+DLYE     +WL WA ELQ TQ  LF D   GG+F+T+   P ++LR+K+  D +E
Sbjct: 583 EGLIDLYEATFDEEWLKWADELQQTQISLFFDVNIGGFFSTSSTAPHLILRLKDGMDTSE 642

Query: 687 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 727
           PS N  S  NL RL+S++       Y + A+ +LA FE+ +
Sbjct: 643 PSTNGTSASNLYRLSSLL---NDLTYAEKAKQTLACFESEM 680


>gi|396464920|ref|XP_003837068.1| similar to DUF255 domain-containing protein [Leptosphaeria maculans
           JN3]
 gi|312213626|emb|CBX93628.1| similar to DUF255 domain-containing protein [Leptosphaeria maculans
           JN3]
          Length = 748

 Score =  478 bits (1231), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 275/675 (40%), Positives = 373/675 (55%), Gaps = 28/675 (4%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K  NRL+   SPY+  H +NPV W  WG EA   AR+ +  IF+SIGY+ CHWCHVME E
Sbjct: 18  KLRNRLSESRSPYVRGHRNNPVAWQEWGPEAIELARQSNRLIFISIGYAACHWCHVMERE 77

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFE++ VAK+LN+ ++ IKVDREERPDVD++YM YVQAL G GGWPL+ FL+PDL+P+ G
Sbjct: 78  SFENQEVAKILNESYIPIKVDREERPDVDRIYMNYVQALTGRGGWPLNAFLTPDLQPIFG 137

Query: 220 GTYFP---PEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL---SEALSASASS 273
           GTYF         G   F  +L K++D W  +R     S     ++L   ++  + S   
Sbjct: 138 GTYFAGPGSTTALGAQPFVAVLEKIRDLWTDQRQRCLDSAREETKKLIDFAQDGNISRQG 197

Query: 274 NKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK---KLEDTGK 330
               D L    L        + YD    GFG APKFP P  +Q +L  S+    + +   
Sbjct: 198 GAEHDGLELELLDDALSHFKRKYDPVNAGFGDAPKFPTPSNLQFLLKLSRYPTAVTELLG 257

Query: 331 SGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYL 390
           + + +  + MVL TL  M KGGIHD +G GF RYSV + W +PHFEKMLYD  QL  V+L
Sbjct: 258 ADDCTLAKTMVLKTLDAMNKGGIHDQIGNGFARYSVTKDWSLPHFEKMLYDHAQLLPVFL 317

Query: 391 DAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 449
           DA+ LTK   +     DI  YL    M    G  FS+EDADS        K+EGAFYVWT
Sbjct: 318 DAYLLTKSAAHLSAVHDIATYLTSPPMHAEHGGFFSSEDADSLYRPNDKEKREGAFYVWT 377

Query: 450 SKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 508
             E +DILGE  A +   +Y ++  GN       D H+E   +NVL      S  A + G
Sbjct: 378 LTEFQDILGERDAEILARYYNVRDEGNVHPEH--DAHDELINQNVLAISTTPSDLAKQFG 435

Query: 509 MPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 567
           +  E+   IL   R+KL   R K RPRP LDDK++VSWNGL I + AR +  L S   +A
Sbjct: 436 LSEEEVHRILTSGRQKLLFHRDKERPRPALDDKIVVSWNGLAIGALARTAAALSSSEPTA 495

Query: 568 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 627
                        Y+  AE AA+F++ +LYD  +  L   +R GP + PGF DDYA+LIS
Sbjct: 496 SHT----------YLAAAEKAATFLKENLYDPSSQTLTRVYREGPGETPGFADDYAYLIS 545

Query: 628 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 687
           GL+DLY+      +L WA +LQ +Q  LF D +  G+F+T      +++R+K+  D AEP
Sbjct: 546 GLIDLYQTTFNDSYLQWADDLQQSQIRLFWDTKHLGFFSTPAGQSDLIMRLKDGMDNAEP 605

Query: 688 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 747
             N VS  NL RL +++   + + Y + A  + + FE  L       P +  A  ++   
Sbjct: 606 GTNGVSAQNLDRLGALL---EDEAYSKRARETASAFEAELMQHPFLFPSLMDAV-VVGRL 661

Query: 748 SRKHVVLVGHKSSVD 762
             +H V+ G    V+
Sbjct: 662 GIRHSVITGEGRRVE 676


>gi|452985594|gb|EME85350.1| hypothetical protein MYCFIDRAFT_60228 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 784

 Score =  478 bits (1230), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 268/642 (41%), Positives = 365/642 (56%), Gaps = 31/642 (4%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNR     SPY+  H  NP  W  W  E    ARK +  +F+SIGYS CHWCHVM  ESF
Sbjct: 60  TNRCGESKSPYVRSHKDNPTAWQLWNPETLELARKTNRLLFVSIGYSACHWCHVMAHESF 119

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           +D  +++LLN+ F+ +K+DREERPD+D+ YM ++QA  GGGGWP++VF++PDL+P+ GGT
Sbjct: 120 DDPRISRLLNENFIPVKIDREERPDIDRQYMDFLQATNGGGGWPMNVFVTPDLEPVFGGT 179

Query: 222 YFP---PEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE-----ALSASASS 273
           Y+P    E      GF+ IL K+   W ++   + QSG     QL E     ++      
Sbjct: 180 YWPGPKSERLQAAGGFEDILIKIATTWKEQEARVRQSGKEITRQLREFAQEGSIGGKNGR 239

Query: 274 NKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML----YHSKKLEDTG 329
               DEL  + L    +     YD +  GFG APKFP PV I+ +L    Y S   E  G
Sbjct: 240 TDDEDELELDLLDDAFQHYKMRYDPKHHGFGGAPKFPTPVHIRPLLRVAAYPSVVREIVG 299

Query: 330 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 389
           +  E  E + M + TL  MAKGGI D +G GF RYSV   W +PHFEKMLYD  QL  VY
Sbjct: 300 EK-ECVEARAMAVNTLAAMAKGGIKDQIGHGFARYSVTRDWSLPHFEKMLYDNAQLLPVY 358

Query: 390 LDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 448
           LDA+ LTK   +     DI  YL    M  P G I SAEDADS+ T     K+EGA+YVW
Sbjct: 359 LDAYLLTKSPLFLETAIDIATYLTSPPMQSPLGGICSAEDADSSPTVSDKEKREGAYYVW 418

Query: 449 TSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 507
           T  E + +LG+  + +  +++ ++P GN D  + SD   E  G+N L    D    A +L
Sbjct: 419 TFDEFKQVLGDAQVDICAKYWNVRPEGNID--QRSDAQGELAGQNTLCVQYDIPDLAKEL 476

Query: 508 GMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 566
           G+P ++   ++ + R+KL   R K RPRP LDDK++ SWNGL I   AR S +L+S A +
Sbjct: 477 GLPEDEVKQMILDGRQKLLAHREKTRPRPALDDKIVTSWNGLAIGGLARTSAVLQSSAPA 536

Query: 567 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 626
                         Y+  A  A + I+ HL+D  T  L+  +R GP +  GF DDYAF +
Sbjct: 537 QA----------TRYLSSAVRAVTCIQEHLFDPATGTLKRVYREGPGETQGFADDYAFFV 586

Query: 627 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 686
           SGLLDLYE    ++WL +A  LQ TQ++LF D    G+F+T  + P +L+R K+  D AE
Sbjct: 587 SGLLDLYEATFDSRWLEFAETLQKTQNKLFWDDLKYGFFSTPADQPDILIRTKDAMDNAE 646

Query: 687 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLK 728
           PS N VS  NL RL S++  ++   Y +     +A FE  ++
Sbjct: 647 PSVNGVSAANLFRLGSLLNDAE---YEKMGRRVVACFEVEIE 685


>gi|408381411|ref|ZP_11178960.1| hypothetical protein A994_03123 [Methanobacterium formicicum DSM
           3637]
 gi|407815878|gb|EKF86441.1| hypothetical protein A994_03123 [Methanobacterium formicicum DSM
           3637]
          Length = 712

 Score =  478 bits (1230), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 277/685 (40%), Positives = 376/685 (54%), Gaps = 51/685 (7%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K+ N L  E SPYLLQH  NPVDW+ WG+EAF +A+  D PIFLSIGYSTCHWCHVM  E
Sbjct: 11  KNQNHLKNEKSPYLLQHVDNPVDWYPWGDEAFNKAKNEDKPIFLSIGYSTCHWCHVMARE 70

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SF+D  +  LLN  FV +KVDREERPD+D VYMT  Q + G GGWPL+V ++PDLKP   
Sbjct: 71  SFQDPEIGDLLNQVFVPVKVDREERPDIDSVYMTVCQMITGSGGWPLTVIMTPDLKPFFA 130

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSG---AFAIEQLSEA-----LSASA 271
           GTYFP +      G + ++  V+D WD KR  L +S      +++Q+SE      +  S 
Sbjct: 131 GTYFPKDTGPRGTGLRDLILNVRDLWDNKRGELVKSAEELTHSLQQISEGPLPQTVKGSQ 190

Query: 272 SSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKS 331
              +   EL +  L+   + LS ++D ++ GFG+  KFP P  +  +L + K    TG+ 
Sbjct: 191 GFPESSQELGEEILKQAYQSLSDNFDEKYTGFGNNQKFPTPHHLLFLLRYWKH---TGED 247

Query: 332 GEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLD 391
              +    MV  TL  M KGGI+DHVG GFHRY+VD +W VPHFEKMLYDQ  LA  Y +
Sbjct: 248 MALT----MVERTLDAMKKGGIYDHVGFGFHRYTVDRQWMVPHFEKMLYDQALLAIAYTE 303

Query: 392 AFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSK 451
           AF  T    Y     ++L+Y+ RDM  P G  +SAEDADS   EG    +EG FY+WT  
Sbjct: 304 AFQATGKTQYRETAEEVLEYILRDMRSPEGGFYSAEDADS---EG----EEGKFYLWTQD 356

Query: 452 EVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFK-GKNVLIELNDSSASASKLGM 509
           E+ D+LG +   LF E Y +   GN       D     K GKN+L         + KLG+
Sbjct: 357 EIMDLLGSNDGALFSEIYSVSEEGN-----FKDEATRVKTGKNILHRTQTWDELSKKLGI 411

Query: 510 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 569
             E+        R  LF  R  R  PH DDKV+  WNGLVI + A A    K        
Sbjct: 412 STEELWWKTETARETLFHARKSRIHPHKDDKVLTDWNGLVIVALALAGNSFK-------- 463

Query: 570 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 629
                   R++Y+  A  A  FI   L+ +   RL+H +R+G +   G LDDYA+LI GL
Sbjct: 464 --------REDYLMAAGDAVKFIMTKLHHQG--RLKHRWRDGEAAVDGNLDDYAYLIWGL 513

Query: 630 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 689
           L+LY+    +++L  A++L  T  E FLD + GG++ T+     +L+R KE +D A PSG
Sbjct: 514 LELYQATFQSEYLEIALKLNQTLLEHFLDHDNGGFYFTSDFTQKILVRQKEAYDTALPSG 573

Query: 690 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR 749
           NSV ++NL + + I+     D     + H L  +   +   + +   M  +A +L +   
Sbjct: 574 NSVQMMNLEKFSLII----DDMKISESFHGLESYFASMITQSPSAFTMFLSAIILKIGPS 629

Query: 750 KHVVLVGHKSSVDFENMLAAAHASY 774
             VV+ G K S D + +L      Y
Sbjct: 630 FQVVICGEKDSPDTQVLLNTIQKEY 654


>gi|148379048|ref|YP_001253589.1| hypothetical protein CBO1058 [Clostridium botulinum A str. ATCC
           3502]
 gi|153933571|ref|YP_001383431.1| hypothetical protein CLB_1099 [Clostridium botulinum A str. ATCC
           19397]
 gi|153935757|ref|YP_001386978.1| hypothetical protein CLC_1111 [Clostridium botulinum A str. Hall]
 gi|148288532|emb|CAL82612.1| conserved hypothetical protein [Clostridium botulinum A str. ATCC
           3502]
 gi|152929615|gb|ABS35115.1| conserved hypothetical protein [Clostridium botulinum A str. ATCC
           19397]
 gi|152931671|gb|ABS37170.1| conserved hypothetical protein [Clostridium botulinum A str. Hall]
          Length = 680

 Score =  478 bits (1229), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 269/676 (39%), Positives = 372/676 (55%), Gaps = 64/676 (9%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRL  E SPYLLQHAHNPVDW+ WGEEAF +A+  D P+FLSIGYSTCHWCHVME ESF
Sbjct: 6   TNRLINEKSPYLLQHAHNPVDWYPWGEEAFEKAKIEDKPVFLSIGYSTCHWCHVMERESF 65

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           EDE VA++LN  F+SIKVDREERPD+D +YM + QA  G GGWPL++ ++PD KP   GT
Sbjct: 66  EDEEVAEVLNKNFISIKVDREERPDIDSIYMNFCQAYTGSGGWPLTIIMTPDKKPFFAGT 125

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP   KY  PG   ILR + + W + ++ + +S    +EQ+          N    EL 
Sbjct: 126 YFPKWGKYNVPGIMDILRSISNLWREDKNKILESSNRILEQIER-----FQDNHRQGELE 180

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQK 339
           +  +   A+ L  ++DS++GGFG+ PKFP    I  +L  Y+ KK E             
Sbjct: 181 EYIIEEAAQTLLDNFDSKYGGFGTKPKFPTAHYILFLLRYYYFKKDEKV---------LD 231

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           ++  TL  M KGGI DH+G GF RYS D +W VPHFEKMLYD   L+  Y +A+  TK+ 
Sbjct: 232 VINKTLTSMYKGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALLSMAYTEAYEATKNP 291

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG- 458
            +  I   +L+Y+++ M    G  +SAEDADS   EG     EG FY+WT +E+ DILG 
Sbjct: 292 LFKDITEKVLNYVKKSMTSEKGGFYSAEDADS---EGV----EGKFYLWTKEEIMDILGE 344

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
           E   L+ + Y +   GN            F+ KN+   +N            LEK     
Sbjct: 345 EEGELYCKIYDITSKGN------------FENKNIANLINTDLKIVDNNKDKLEK----- 387

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
              R KLF+ R KR  P+ DDK++ SWN L+I +F++A + LK++               
Sbjct: 388 --IREKLFEYREKRIHPYKDDKILTSWNALMIVAFSKAGRSLKND--------------- 430

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
             Y+E+A+ +A+FI  +L DE+   L    R G     GF+DDYAF +  L++LYE    
Sbjct: 431 -NYIEIAKKSANFIIENLMDEKG-TLYARIREGERGNEGFIDDYAFFLWALIELYEASFD 488

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
             +L  +IE+ N+  +LF  +E GG++  +     +L+R KE +DGA PSGN+V+ + L 
Sbjct: 489 IYYLEKSIEVANSMIDLFWHKEDGGFYLYSKNSEKLLVRPKEIYDGATPSGNAVASLTLN 548

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHK 758
            L  I      D Y+   +     F T +K   M   L    A M ++   K + L  +K
Sbjct: 549 LLYYITG---EDRYKDLVDKQFKFFATNIKSGPM-YHLFSVIAYMYNISPVKEITLAYNK 604

Query: 759 SSVDFENMLAAAHASY 774
              DF   +   +  Y
Sbjct: 605 KDEDFYKFINEVNNRY 620


>gi|118443135|ref|YP_878469.1| thymidylate kinase [Clostridium novyi NT]
 gi|118133591|gb|ABK60635.1| thymidylate kinase [Clostridium novyi NT]
          Length = 678

 Score =  477 bits (1228), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 254/610 (41%), Positives = 365/610 (59%), Gaps = 61/610 (10%)

Query: 98  RNKHTN--RLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHV 155
           ++KH N  +L  E SPYLLQHA+NPV W+ W EEAF +A++ D PIFLSIGYS+CHWCHV
Sbjct: 8   KDKHNNPNKLINEKSPYLLQHAYNPVQWYPWCEEAFIKAKEEDKPIFLSIGYSSCHWCHV 67

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           ME ESFEDE VA++LND ++SIKVDREERPDVD +YMT+ QA+ G GGWPL++ ++PD +
Sbjct: 68  MENESFEDEEVAEILNDNYISIKVDREERPDVDNIYMTFCQAVTGSGGWPLTIIMTPDQR 127

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           P   GTYFP +  YGRPG   IL ++ D W+  ++ +  S    ++ L E   A   S +
Sbjct: 128 PFFAGTYFPKKRMYGRPGLIQILNQIADEWEINKNNIINSSDELLKTLKEH-EAQDKSGE 186

Query: 276 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
           + +E+ Q+A+    E++   YD  +GGFG APKFP P ++ ++L + K+  D        
Sbjct: 187 INEEVLQDAI----EEMKYYYDDVYGGFGIAPKFPTPHKLMLLLTYYKEYNDKNV----- 237

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
               +V  TL+CM KGGI DH+G GF RYS DE+W VPHFEKMLYD   LA VY +A+ L
Sbjct: 238 --LHIVEHTLKCMYKGGIFDHIGFGFSRYSTDEKWLVPHFEKMLYDNALLAYVYTEAYQL 295

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           T   FY  +   I  Y+ RDM  P G  +SAEDADS   EG     EG FY+W   E+E+
Sbjct: 296 TGKSFYKEVAEKIFTYILRDMTSPEGGFYSAEDADS---EGV----EGKFYLWKLNEIEN 348

Query: 456 ILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
           IL E         Y K     D++R+ +    F+G N+           + +G  +E  +
Sbjct: 349 ILKED--------YKKFCNTYDITRVGN----FEGSNI----------PNLIGKDIEN-I 385

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
           + L   R KLF +R KR  P  DDK++ +WN L+IS+ A   ++ ++             
Sbjct: 386 DKLEYIREKLFQIREKRIHPFKDDKILTAWNALMISALAYGGRVFEN------------- 432

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
              KEY++ A+ A  FI+ +L   +  RL   FR G +    +L+DY+FL+  L++LYE 
Sbjct: 433 ---KEYIKRAKDAYDFIKNNLI-RKDGRLLARFRYGEAAYIAYLEDYSFLVWALIELYEA 488

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
              +K+L  A+  Q+   +LF D +  G+F++  +   ++L +K+ +D A PSGNSV+ +
Sbjct: 489 TFESKFLKEALYFQDEMIKLFWDEKSYGFFHSGKDGEKLILNLKDSYDTAIPSGNSVAAM 548

Query: 696 NLVRLASIVA 705
           NL++L+ I  
Sbjct: 549 NLIKLSKITG 558


>gi|387817346|ref|YP_005677690.1| hypothetical protein H04402_01136 [Clostridium botulinum H04402
           065]
 gi|322805387|emb|CBZ02951.1| hypothetical protein H04402_01136 [Clostridium botulinum H04402
           065]
          Length = 680

 Score =  477 bits (1228), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 269/676 (39%), Positives = 372/676 (55%), Gaps = 64/676 (9%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRL  E SPYLLQHAHNPVDW+ WGEEAF +A+  D P+FLSIGYSTCHWCHVME ESF
Sbjct: 6   TNRLINEKSPYLLQHAHNPVDWYPWGEEAFEKAKIEDKPVFLSIGYSTCHWCHVMERESF 65

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           EDE VAK+LN  F+SIKVDREERPD+D +YM + QA  G GGWPL++ ++PD KP   GT
Sbjct: 66  EDEEVAKVLNKNFISIKVDREERPDIDSIYMNFCQAYTGSGGWPLTIIMTPDKKPFFAGT 125

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP   KY  PG   ILR + + W + ++ + +S    +EQ+          N    EL 
Sbjct: 126 YFPKWGKYNVPGIMDILRSISNLWREDKNKILESSNRILEQIER-----FQDNHREGELE 180

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQK 339
           +  +   A+ L  ++DS++GGFG+ PKFP    I  +L  Y+ KK           +   
Sbjct: 181 EYIIEEAAQTLLDNFDSKYGGFGTKPKFPTAHYILFLLRYYYFKK---------DKKILD 231

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           +V  TL  M KGGI DH+G GF RYS D +W VPHFEKMLYD   L+  Y +A+  TK+ 
Sbjct: 232 IVNKTLTSMYKGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALLSMAYTEAYEATKNP 291

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG- 458
            +  I   +L+Y+++ M    G  +SAEDADS   EG     EG FY+WT +E+ DILG 
Sbjct: 292 LFKDITEKVLNYVKKSMTSEKGGFYSAEDADS---EGV----EGKFYLWTKEEIMDILGE 344

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
           E   L+ + Y +   GN            F+ KN+   +N            LEK     
Sbjct: 345 EEGELYCKIYDITSKGN------------FENKNIANLINTDLKIVDNNKDKLEK----- 387

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
              R KLF+ R KR  P+ DDK++ SWN L+I +F++A + LK++               
Sbjct: 388 --IREKLFEYREKRIHPYKDDKILTSWNALMIVAFSKAGRSLKND--------------- 430

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
             Y+E+A+ +A+FI  +L DE+   L    R G     GF+DDYAF +  L++LYE    
Sbjct: 431 -NYIEIAKKSANFIIENLMDEKG-TLYARIREGERGNEGFIDDYAFFLWALIELYEASFD 488

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
             +L  +IE+ N+  +LF  +E GG++  +     +L+R KE +DGA PSGN+V+ + L 
Sbjct: 489 IYYLEKSIEVANSMIDLFWHKEDGGFYLYSKNSEKLLVRPKEIYDGATPSGNAVAALTLN 548

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHK 758
            L  I      D Y+   +     F T +K   M   L    A M ++   K + L  ++
Sbjct: 549 LLYYITG---EDRYKDLVDKQFKFFATNIKSGPM-YHLFSVIAYMYNISPVKEITLAYNE 604

Query: 759 SSVDFENMLAAAHASY 774
              DF   +   +  Y
Sbjct: 605 KDEDFYKFINEVNNRY 620


>gi|83816674|ref|YP_445669.1| hypothetical protein SRU_1548 [Salinibacter ruber DSM 13855]
 gi|83758068|gb|ABC46181.1| Protein of unknown function, DUF255 family [Salinibacter ruber DSM
           13855]
          Length = 701

 Score =  477 bits (1227), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 265/609 (43%), Positives = 349/609 (57%), Gaps = 38/609 (6%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYL QH  NPVDW  WG+ AFA+AR+ D PIFLSIGYSTCHWCHVME ESFE
Sbjct: 3   NRLADEQSPYLRQHKDNPVDWRPWGDAAFAKAREEDKPIFLSIGYSTCHWCHVMERESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+ VA LLND FV IKVDREERPDVD +YM   Q + G GGWPL+V L+PD KP    TY
Sbjct: 63  DDDVAALLNDGFVPIKVDREERPDVDSIYMDVCQMMRGQGGWPLTVLLTPDRKPFFAATY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAW--DKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
            P E ++ + G   +L +VK  W  D +  +L  +     EQ+++ L          D  
Sbjct: 123 LPKEGRFQQTGLMDLLPRVKQLWNSDDRAKLLDDA-----EQVTDRLQRIGDDQTDGDAP 177

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
               L   A QL++ +D   GGFGSAPKFP P  +  +L H  +   TG+    ++    
Sbjct: 178 GPTLLDDAARQLAQQFDRTHGGFGSAPKFPAPHNLLFLLRHWHR---TGEQAALNQ---- 230

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
           V  TL  M  GG+ D VG GFHRYS D++W +PHFEKMLYDQ      Y +A+  T    
Sbjct: 231 VTTTLDRMRWGGLFDQVGYGFHRYSTDQQWKLPHFEKMLYDQAMHVLAYTEAYQATGTDR 290

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
           Y    R++L Y+RRD+  P G  FSAEDADS   EG    +EGAFYVW+ +++ + L   
Sbjct: 291 YERTAREVLTYVRRDLQAPDGGFFSAEDADSLNAEGDM--EEGAFYVWSIEDIREHLEPA 348

Query: 461 -AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
            A L  + Y + P GN    R      E  GKNVL      +A+A + GM ++   + L 
Sbjct: 349 LADLVIDVYNMSPAGNYQEERT----GERTGKNVLHRDQSLAAAAEQRGMEVDVLRDHLE 404

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
             RR L D RS+RPRP LDDKV+  WNGL+ ++ A+A+++                 D  
Sbjct: 405 TARRVLLDARSERPRPGLDDKVLTDWNGLMTAALAKAARVF----------------DDA 448

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
           ++ E A     F+   ++D    RL H +R G +     LDDYAFLI GLL+LYE     
Sbjct: 449 QFEEAAVQTGRFVLDTMHDADG-RLLHRYREGEAGIQATLDDYAFLIWGLLELYETTFDA 507

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
            WL  A+E      + F D EGGG++ T  +  ++++R KE +DGA PSGNSV ++NL+R
Sbjct: 508 DWLRAAVEHMEAALDRFWDAEGGGFYMTPEDGEALIVRPKEANDGALPSGNSVQLMNLLR 567

Query: 700 LASIVAGSK 708
           LA     ++
Sbjct: 568 LARFTGRTE 576


>gi|159897570|ref|YP_001543817.1| hypothetical protein Haur_1041 [Herpetosiphon aurantiacus DSM 785]
 gi|159890609|gb|ABX03689.1| protein of unknown function DUF255 [Herpetosiphon aurantiacus DSM
           785]
          Length = 681

 Score =  477 bits (1227), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 271/681 (39%), Positives = 385/681 (56%), Gaps = 53/681 (7%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRL  E SPYLLQHA NPVDW+AWGEEA   A++ D PI LS+GYS CHWCHVM  ESF
Sbjct: 2   ANRLIHETSPYLLQHAENPVDWYAWGEEALQRAKQDDKPILLSVGYSACHWCHVMAHESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           ED   A ++N+ FV+IKVDREERPD+D +YM  VQA+   GGWP++VFL+PD  P  GGT
Sbjct: 62  EDPATAAVMNELFVNIKVDREERPDIDSLYMAAVQAMTRHGGWPMTVFLTPDGAPFYGGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFPPE ++  P F+ +L  V +A+  +R+ + QS     E L + LS      K    L 
Sbjct: 122 YFPPEPRHNMPSFQQVLHGVAEAYRDRREEVFQSAEQMREHLEDILSFDLEQVK----LS 177

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           ++ L + A++    +DSRFGG+G APKFP+ +   M+L    + ED     + ++     
Sbjct: 178 KSQLNVAAQRQMSQFDSRFGGYGGAPKFPQALIFGMVLRTWLRSEDQDALNQVTQ----- 232

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             TLQ MA GG++D +GGGF RYSVD +W VPHFEKMLYD   L+ +YL+ +  T D FY
Sbjct: 233 --TLQAMANGGMYDQLGGGFARYSVDAQWLVPHFEKMLYDNALLSQLYLETYQATHDPFY 290

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EH 460
             I  + ++Y+ RDM  P G  ++AEDADS   EG    +EG FYVW+  E++ +L  E 
Sbjct: 291 RRIAEESINYILRDMTSPDGGFYAAEDADS---EG----EEGKFYVWSLAEIQQLLSPED 343

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
           A L + ++ ++P GN            F+G  +L    D S  A +L +        +  
Sbjct: 344 AALAQLYWNIQPEGN------------FEGHAILYVPQDPSVVAKELSISEADLAQRIAV 391

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R  L   R+ R RP  D+K++ SWNG+++ S A A+ +L                D  +
Sbjct: 392 IRATLLAQRNTRIRPGRDEKILASWNGMMLRSLAFAANVL----------------DNAD 435

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           Y   A   A FI   LY  Q  +L  S+++G +K  G+L+DYA +  G+L LYE     +
Sbjct: 436 YRAAAIRNAEFITSKLY--QNGQLYRSYKDGQAKFKGYLEDYACVADGMLALYEATFDLR 493

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
           WL  AIEL  +  E F D +   +F+T  +   ++ R ++ +D A P+GNSV+V  L+RL
Sbjct: 494 WLQVAIELAESMTERFWDAQQRSFFDTASDHEQLITRPRDLYDNATPAGNSVAVDVLLRL 553

Query: 701 ASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSS 760
           A+++   +   YRQ AE  LA     L  +  A   +  AAD      R+ V L+G  + 
Sbjct: 554 ATLLDRYE---YRQYAETVLANLSGALLQLPGAFGRLLAAADFALAEPRE-VALIGDPAD 609

Query: 761 VDFENMLAAAHASYDLNKTVS 781
             F+ +L A + +Y  NK V+
Sbjct: 610 PAFKALLQATYRNYQPNKVVA 630


>gi|296132106|ref|YP_003639353.1| hypothetical protein TherJR_0579 [Thermincola potens JR]
 gi|296030684|gb|ADG81452.1| protein of unknown function DUF255 [Thermincola potens JR]
          Length = 673

 Score =  477 bits (1227), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 280/689 (40%), Positives = 388/689 (56%), Gaps = 70/689 (10%)

Query: 98  RNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVME 157
           +  +TNRL  E SPYLLQHAHNPVDW+ WG++AF +A K D PIFLSIGYSTCHWCHVME
Sbjct: 2   QTTYTNRLINEKSPYLLQHAHNPVDWYPWGDDAFRKAEKEDKPIFLSIGYSTCHWCHVME 61

Query: 158 VESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPL 217
            ESFEDE VA +LN+ +VSIKVDREERPD+D +YM+  QA+ G GGWPL+V ++PD KP 
Sbjct: 62  RESFEDEEVAAILNEHYVSIKVDREERPDIDTIYMSVCQAMTGHGGWPLTVIMTPDKKPF 121

Query: 218 MGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLP 277
             GTYFP +   G PG   IL ++ D W +++  L +SG    E+++EA+++   S+   
Sbjct: 122 FAGTYFPKKSSRGMPGLTDILIQIADLWRERKKELTESG----EKITEAVNSHLFSHTGG 177

Query: 278 DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEG 337
           D + +  L        +++D  +GGFG+APKFP P  +  +L + K       +G A E 
Sbjct: 178 D-VSKEMLDKAFAYFEENFDRLYGGFGAAPKFPTPHNLTFLLRYWK----MSGNGAALE- 231

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
             MV  TL  M +GGI+DH+G GF RYS D +W VPHFEKMLYD   LA  YL+A+  T 
Sbjct: 232 --MVEKTLDAMYRGGIYDHIGFGFARYSTDRKWLVPHFEKMLYDNALLAIAYLEAYQATG 289

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           +  Y+    +I  Y++RDMI P G  +SAEDADS   EG    +EG FYVWT +EV+++L
Sbjct: 290 NRKYAKTAEEIFTYVQRDMISPEGGFYSAEDADS---EG----EEGKFYVWTPEEVKEVL 342

Query: 458 GEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGMPLEKY 514
           G+     F   Y +   GN            F+ K++  LIE                 Y
Sbjct: 343 GDTLGRYFCRDYDITAQGN------------FESKSIPNLIETG---------------Y 375

Query: 515 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 574
           +    E R+KLF  R +R  P  DDK++ +WNGL+I++ A  ++ L              
Sbjct: 376 VEGYEEARKKLFARREQRVHPFKDDKILTAWNGLMIAAMAYGARAL-------------- 421

Query: 575 GSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE 634
               K+Y EVA  A +FI ++L  E   RL   FR+G +   G+LDDYA  + GL++LYE
Sbjct: 422 --GEKKYAEVAAKAVNFINKNLRREDG-RLSARFRDGEAAFLGYLDDYACYVWGLIELYE 478

Query: 635 FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
                 +L  A+EL N   +LF D E GG F    +  +++ R KE +DGA P+GNSV+ 
Sbjct: 479 ATFEPAYLEQALELNNDMLKLFWDEENGGLFLYGNDAENLITRPKEIYDGALPAGNSVAA 538

Query: 695 INLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVL 754
           +NL RLA +    +     + A   L  F   + +  M       A   L +     + +
Sbjct: 539 VNLFRLARLTGDRQ---LAERAREQLKAFGGSVAESPMGHSHFLMAV-WLDLTPPVDITV 594

Query: 755 VGHKSSVDFENMLAAAHASYDLNKTVSKK 783
           VG + + D E MLA  ++ +    TV  K
Sbjct: 595 VGDRKAGDTEKMLATVNSRFMPEATVILK 623


>gi|91204070|emb|CAJ71723.1| conserved hypothetical protein (thioredoxin) [Candidatus Kuenenia
           stuttgartiensis]
          Length = 758

 Score =  476 bits (1226), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 275/694 (39%), Positives = 384/694 (55%), Gaps = 54/694 (7%)

Query: 92  ASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCH 151
           +S  +   K  NRL  E SPYLLQHA NPVDW+AWG EAF +ARK + PIFLSIGYSTCH
Sbjct: 59  SSALNDAGKKHNRLIHEKSPYLLQHADNPVDWYAWGPEAFEKARKENKPIFLSIGYSTCH 118

Query: 152 WCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLS 211
           WCHVM  ESFED  VA+L+N+ F+ IKVDREERPD+D +YM   Q + G GGWPL++ ++
Sbjct: 119 WCHVMAHESFEDPEVARLMNEVFICIKVDREERPDIDNIYMRVCQMMTGSGGWPLTIVMT 178

Query: 212 PDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASA 271
           PD KP   GTY  P+  YGR G   ++ ++K+ W+ +   + +S       L +  S   
Sbjct: 179 PDKKPFYAGTYI-PKKSYGRIGMLDLVPRIKELWNIQHADIQKSANLITASLGQ-FSHDP 236

Query: 272 SSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKS 331
           S  +    L  + L+   E L++ +  + GGF ++PKFP P  +  +L + K       +
Sbjct: 237 SEAR----LDASTLKAAYELLARRFSEQHGGFSTSPKFPSPQNLLFLLRYWK------ST 286

Query: 332 GEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLD 391
           GE +   +MV+ TL  M KGGI+DH+G GFHRYS D  W VPHFEKMLYDQ  LA  Y +
Sbjct: 287 GEGN-ALRMVVKTLHSMRKGGIYDHIGYGFHRYSTDPEWLVPHFEKMLYDQAMLAMAYTE 345

Query: 392 AFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSK 451
           A+  T    +    ++I  Y+ RDM  P G   SAEDADS   EG    KEG FYVWT +
Sbjct: 346 AYLATGRKEFGETAKEIFAYVMRDMTDPKGGFCSAEDADS---EG----KEGKFYVWTEE 398

Query: 452 EVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 510
           E+   L E  A L    + ++  GN          +E  G+N    +     S +++ + 
Sbjct: 399 EIRHALKEDDANLIINVFNIEKAGNF--------KDEIAGRNTGDNILHLKKSLAEIALE 450

Query: 511 LEKYLNILGE----CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 566
            +  L+ L E     RRKLF VRSKR RPH DDK++  WNGL+I++ A+ ++        
Sbjct: 451 NKTSLDELKERVETARRKLFAVRSKRIRPHKDDKILTDWNGLMIAALAKGAQAF------ 504

Query: 567 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 626
                     D  EY+  A+ AA FI   +   Q  RL H +R G +  P F DDYAF I
Sbjct: 505 ----------DAPEYLAAAKRAADFILSDM-RRQDGRLLHRYRGGQAGIPAFADDYAFFI 553

Query: 627 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 686
            GLL+LYE      +L  A++L +   + F D + GG++ T  +   +++R KE +DGA 
Sbjct: 554 WGLLELYETNFNVNYLRTALDLNSDMIKHFWDNQNGGFYFTADDAEDLIVRQKEVYDGAI 613

Query: 687 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 746
           PSGNSV+ +NL RLA I A  + +   + A  ++  F T +K M      M         
Sbjct: 614 PSGNSVAALNLFRLARITADPELE---EKANKTMLAFSTEVKKMPAGYTQMMIGLSFGIG 670

Query: 747 PSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           P+ + +++ G+  +VD  +ML      +  NK V
Sbjct: 671 PAYE-IIIAGNPRAVDTRDMLNTLRRHFIPNKIV 703


>gi|294507561|ref|YP_003571619.1| hypothetical protein SRM_01746 [Salinibacter ruber M8]
 gi|294343889|emb|CBH24667.1| conserved hypothetical protein [Salinibacter ruber M8]
          Length = 701

 Score =  476 bits (1225), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 264/609 (43%), Positives = 348/609 (57%), Gaps = 38/609 (6%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYL QH  NPVDW  WG+ AFA+AR+ D PIFLSIGYSTCHWCHVME ESFE
Sbjct: 3   NRLADEQSPYLRQHKDNPVDWRPWGDAAFAKAREEDKPIFLSIGYSTCHWCHVMERESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+ VA LLND FV IKVDREERPDVD +YM   Q + G GGWPL+V L+PD KP    TY
Sbjct: 63  DDDVAALLNDGFVPIKVDREERPDVDSIYMDVCQMMRGQGGWPLTVLLTPDRKPFFAATY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAW--DKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
            P E ++ + G   +L +V+  W  D +  +L  +     EQ+++ L          D  
Sbjct: 123 LPKEGRFQQTGLMDLLPRVRQLWNSDDRAKLLDDA-----EQVTDRLQRIGDDQTDGDAP 177

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
               L   A QL++ +D   GGFGSAPKFP P  +  +L H  +   TG+    ++    
Sbjct: 178 GPTLLDDAARQLAQQFDRTHGGFGSAPKFPAPHNLLFLLRHWHR---TGEQAALNQ---- 230

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
           V  TL  M  GG+ D VG GFHRYS D++W +PHFEKMLYDQ      Y +A+  T    
Sbjct: 231 VTTTLDRMRWGGLFDQVGYGFHRYSTDQQWKLPHFEKMLYDQAMHVLAYTEAYQATGTDR 290

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
           Y    R++L Y+RRD+  P G  FSAEDADS   EG    +EGAFYVW+ +++ + L   
Sbjct: 291 YERTAREVLTYVRRDLQAPDGGFFSAEDADSLNAEGDM--EEGAFYVWSIEDIREHLEPA 348

Query: 461 -AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
            A L  + Y + P GN    R      E  GKNVL      +A+A + GM  +   + L 
Sbjct: 349 LADLVIDVYNMSPAGNYQEERT----GERTGKNVLHRDQSLAAAAEQRGMEADVLRDHLD 404

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
             RR L D RS+RPRP LDDKV+  WNGL+ ++ A+A+++                 D  
Sbjct: 405 TARRVLLDARSERPRPGLDDKVLTDWNGLMTAALAKAARVF----------------DEA 448

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
           ++ E A     F+   ++D    RL H +R G +     LDDYAFLI GLL+LYE     
Sbjct: 449 QFEEAAVQTGRFVLDTMHDADG-RLLHRYREGEAGIQATLDDYAFLIWGLLELYETTFDA 507

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
            WL  A+E      + F D EGGG++ T  +  ++++R KE +DGA PSGNSV ++NL+R
Sbjct: 508 DWLRAAVEHMEAALDRFWDAEGGGFYMTPEDGEALIVRPKEANDGALPSGNSVQLMNLLR 567

Query: 700 LASIVAGSK 708
           LA     ++
Sbjct: 568 LARFTGRTE 576


>gi|402218687|gb|EJT98763.1| hypothetical protein DACRYDRAFT_110659 [Dacryopinax sp. DJM-731
           SS1]
          Length = 705

 Score =  476 bits (1224), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 274/646 (42%), Positives = 373/646 (57%), Gaps = 59/646 (9%)

Query: 119 NPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIK 178
           NPVDW+ WGEEAF +A+  D P+FLS+GYSTC WCHVME ESFE+E VAK++ND  V++K
Sbjct: 17  NPVDWYPWGEEAFQKAKAEDKPVFLSVGYSTCRWCHVMERESFENEEVAKMMNDVCVNVK 76

Query: 179 VDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK-PLMGGTYFPPEDKYGRPGFKTI 237
           VDRE  PDVD+VYM YV A+ G GGWP+SV+++PD K P  GGTYFPP+        + I
Sbjct: 77  VDREVLPDVDRVYMNYVTAISGRGGWPMSVWITPDTKIPFFGGTYFPPQ------AMEQI 130

Query: 238 LRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQ----LS 293
           L +VKD W  +RD L   G    + L E  S ++ +      L Q  L L  ++    L 
Sbjct: 131 LTQVKDKWKNERDKLVPKGNSLSDILQEPASPTSPA------LSQLGLPLLRDRGLAMLG 184

Query: 294 KSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGI 353
           + YD   GGFG APKFP       +   +   ED+      + G+KM  FTL+ MA GGI
Sbjct: 185 QMYDRTHGGFGGAPKFPTQSRFSFLHLVAYLAEDSN-----NLGRKMSAFTLKKMAMGGI 239

Query: 354 HDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLR 413
           HD +G GFHRYSVD  WH+PHFE MLYD  QLA  YL  + LT D +Y  +   +L YL 
Sbjct: 240 HDQIGLGFHRYSVDAAWHIPHFEIMLYDNAQLAYHYLTYYVLTGDEYYRTVANGVLAYLD 299

Query: 414 RDMIGP---GGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYY 469
           R ++     G    SAEDA+S E EG T KKEGAFYVWT  ++   LGE     F +H+ 
Sbjct: 300 RVLLKKTDHGIAYMSAEDAESYEEEGDTIKKEGAFYVWTRAQITAALGEKDGDAFCDHFG 359

Query: 470 LKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVR 529
           +K  GN  L    DPH E +GKNVL+E   +  +A+ LG+  E+   I+   R  L + R
Sbjct: 360 VKEEGNVGLEH--DPHKELQGKNVLMEQRSAEETATALGISTEEMEGIINRGREVLREER 417

Query: 530 SKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAA 589
            KRP+PHLDDK+I SWNGL++ + A+A+  L S            G + +++       A
Sbjct: 418 DKRPKPHLDDKIIASWNGLMLKTLAQAALRLPS------------GPEPEKFYNQGIEVA 465

Query: 590 SFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQ 649
            F++  +  +   +L   +R   +   G  +DYA +I+GLL LY+       L  A+ELQ
Sbjct: 466 RFVQNQMIKD--GKLLRCYR---TNVQGVCEDYASVINGLLALYQVKLEPWLLRIAVELQ 520

Query: 650 NTQDELFLDREGGGYFNTTGE-DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASI----- 703
           + QDELF D +  GYF +  + D S ++R+K+DHDG EPS NS+S+ NLV L SI     
Sbjct: 521 DKQDELFWDEKAWGYFASAEDSDASKIMRLKDDHDGPEPSANSLSLHNLVTLDSICHATD 580

Query: 704 --------VAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 741
                   ++ S+++ Y+  A+  +  F  RL     ++P M  AA
Sbjct: 581 PFALGIPNMSESRAERYQMYAQKMVTFFTPRLLTQPASMPEMVSAA 626


>gi|20092523|ref|NP_618598.1| hypothetical protein MA3726 [Methanosarcina acetivorans C2A]
 gi|19917793|gb|AAM07078.1| conserved hypothetical protein [Methanosarcina acetivorans C2A]
          Length = 697

 Score =  475 bits (1223), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 267/683 (39%), Positives = 378/683 (55%), Gaps = 45/683 (6%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
            +  NRL  E SPYLLQHA+NPVDW+ WGEEAF +ARK + PIFLSIGYSTCHWCHVM  
Sbjct: 5   QRKPNRLINEKSPYLLQHAYNPVDWYPWGEEAFEKARKENKPIFLSIGYSTCHWCHVMAH 64

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE +A+L+N+ FVSIKVDREERPD+D +YMT  Q + G GGWPL++ ++P  KP  
Sbjct: 65  ESFEDEEIARLMNEAFVSIKVDREERPDIDNIYMTVCQIILGRGGWPLTIIMTPGKKPFF 124

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTY P + ++ + G   ++ ++K+ WD++ + +  S       +   +  S        
Sbjct: 125 AGTYIPKKSRFNQTGMTELIPRIKEIWDQQHEEVLDSAEKITSTIQNMIVESTGEGLG-- 182

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
              +  +      L  S+D  +GGFG APKFP P +I  +L + K+  D        E  
Sbjct: 183 ---EEIIEEAYNDLLNSFDPEYGGFGRAPKFPTPHKISFLLRYWKRSGD-------PEAL 232

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
            MV  TL  M  GGI+DH+G GFHRYS D  W +PHFEKMLYDQ   A  Y++A+ ++  
Sbjct: 233 DMVEHTLDNMRSGGIYDHLGSGFHRYSTDNMWLLPHFEKMLYDQALTAIAYIEAYQVSGK 292

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             Y      ILDY+ RD+  P G  +  EDAD    EG    +EG +Y+WT +EV  ILG
Sbjct: 293 DLYKETAEGILDYVLRDLTSPEGGFYCGEDAD---VEG----EEGKYYLWTIEEVMSILG 345

Query: 459 -EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
            E + L  + + LK  GN +     +      G N+   ++   + A++L +P+E+  + 
Sbjct: 346 PEDSELIIKMFNLKRGGNFE----EEIRGRKTGTNLFYMVHSPGSLAAELEIPVEEVESR 401

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
           +   R KL   R +R RP LDDKV+  WNGL+I++FA+               F V G +
Sbjct: 402 VKSAREKLLKARYERKRPSLDDKVLTDWNGLMIAAFAKG--------------FQVFGEE 447

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGS 637
           +  Y++ AE AA F+   LY  +  RL H +R+G +   G  DDYAFLI GLL+LYE G 
Sbjct: 448 K--YLKAAEKAADFLLETLYGPE-KRLHHRYRDGVAGISGTSDDYAFLIHGLLELYEAGF 504

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
             ++L  A+ L     E F D E GG++ T  +   ++ R KE  D A PSGNS  ++NL
Sbjct: 505 ELRYLKSAVSLNRELLEHFWDPENGGFYFTASDSEVLIFRKKEFTDAAIPSGNSFEMLNL 564

Query: 698 VRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGH 757
           +RL+ ++A    +   + A+     F   +K           A D    PS + V++ G 
Sbjct: 565 LRLSRLIADPGME---ETADRLERAFSKLIKKTPSGYTQFLSAFDFRLGPSYE-VIISGK 620

Query: 758 KSSVDFENMLAAAHASYDLNKTV 780
           + S D  NML    + +  NK +
Sbjct: 621 RESPDTVNMLEELWSYFTPNKVL 643


>gi|116749973|ref|YP_846660.1| hypothetical protein Sfum_2547 [Syntrophobacter fumaroxidans MPOB]
 gi|116699037|gb|ABK18225.1| protein of unknown function DUF255 [Syntrophobacter fumaroxidans
           MPOB]
          Length = 684

 Score =  474 bits (1221), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 275/680 (40%), Positives = 383/680 (56%), Gaps = 54/680 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL+AE SPYLLQHA NPVDW+ WGEEAF +A++ D P+FLSIGY+TCHWCHVME ESFE
Sbjct: 3   NRLSAEKSPYLLQHADNPVDWYPWGEEAFRKAKEEDKPVFLSIGYATCHWCHVMERESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA LLN+  V++KVDREERPD+D++YMT  QAL G GGWPLSVF++P+      G+Y
Sbjct: 63  DEEVAALLNEHVVAVKVDREERPDIDQIYMTVCQALLGSGGWPLSVFMTPEKNAFFAGSY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP   + G  GF  ++R++   W   R+ L ++G    E +      +  S   P+ L +
Sbjct: 123 FPKHARLGMAGFTDVIRRIVHMWKNDRERLLEAGRQITESIQPRPVQTVGSLPGPEVLEE 182

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
              R     LS+++D+ +GGFGS PKFP P  +  +L   ++          S+   +V 
Sbjct: 183 AYSR-----LSRAFDATWGGFGSKPKFPTPHHLTFLLRWHRR-------NPWSDALAIVE 230

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  M  GGI D VG GFHRYSVDE+W VPHFEKMLYDQ  LA  YL+AF +T    + 
Sbjct: 231 KTLDGMRDGGIFDQVGFGFHRYSVDEKWLVPHFEKMLYDQAMLALAYLEAFQVTGRERHG 290

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHA 461
            + R+I +Y+ RDM  P G  +SAEDADS   EG     EG FYVWT  EV  +LG E  
Sbjct: 291 RVAREIFEYVLRDMTDPDGGFYSAEDADS---EGV----EGRFYVWTPAEVNALLGNEIG 343

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM-PLEKYLNILGE 520
             F   + + P GN +  R S PH        L EL DS +   + G+  LE   ++L +
Sbjct: 344 ETFCRFFDITPEGNFEDGR-SIPH--------LAELADSLSDRDEPGIGGLE---DLLEK 391

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            RR LF+ R  R  P  DDK++ SWNGL+I++ ++ S+ L                  + 
Sbjct: 392 GRRLLFEARRMRVHPLKDDKILTSWNGLMIAALSKGSRALGD----------------RS 435

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           Y   A  AA FI   +    + RL   +R G +    + DDYAF I GL++LYE     +
Sbjct: 436 YALAASRAADFILDRMR-RDSGRLHRRYRKGEAAIHAYADDYAFFIWGLIELYEAAFDVR 494

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
           +L  A++LQ+   +LF D   GG+F T  +  ++++R +E +DGA PS NS + +NL+RL
Sbjct: 495 YLEEAVKLQDLMIDLFWDDAEGGFFFTPNDGENLIVREREIYDGAVPSSNSAAALNLLRL 554

Query: 701 ASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSS 760
             +V   +   + + A+  L  F   ++D   A      A D  + P+R+ VV+ G   +
Sbjct: 555 GRMVGAVR---FEEKADRLLRRFSETVRDYPSAYTQFLHAVDFAAGPTRE-VVIAGSPDN 610

Query: 761 VDFENMLAAAHASYDLNKTV 780
                M+    + +  N  V
Sbjct: 611 ATTAEMMKIVGSGFVPNTVV 630


>gi|322794007|gb|EFZ17245.1| hypothetical protein SINV_09516 [Solenopsis invicta]
          Length = 891

 Score =  474 bits (1221), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 292/742 (39%), Positives = 385/742 (51%), Gaps = 124/742 (16%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRL+ E SPYLLQHA NPVDW+ W +EA  +A+K +  IF+SIGYSTCHWCHVME ESF
Sbjct: 98  TNRLSLERSPYLLQHATNPVDWYPWCDEALEKAKKENKIIFVSIGYSTCHWCHVMEKESF 157

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQA----------LYGGGGWPLSVFLS 211
           ++E VAK++N+ +V+IKVDREERPD+D + M ++QA          L G GGWPLSVFL+
Sbjct: 158 KNEEVAKIMNEHYVNIKVDREERPDIDMMCMMFIQASLYLVSGTTRLRGHGGWPLSVFLT 217

Query: 212 PDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASA 271
           PDL P+ GGTYF          F   L ++   W   RD + +S     E+L E L+ S 
Sbjct: 218 PDLMPITGGTYF------SSSMFTLYLTRIMKEWTDGRDKMIKSATTIAERLKE-LATSR 270

Query: 272 SSNKLP-----------------------DELPQ-NALRLCAEQLSKSYDSRFGGFGSA- 306
              K+                        D +P  ++  LCA  L   YDS +GGFGS+ 
Sbjct: 271 EDIKVSECYLKFLNYFNNVFYLLIFAIQDDGVPAIDSAFLCAHVLMNIYDSEYGGFGSSS 330

Query: 307 ------PKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGG 360
                 PKFP P  +  +L        T      S+     L TL+ M+ GGIHDH+G G
Sbjct: 331 AINPNSPKFPEPSNLNFLLSMHVLTTSTMLVEMTSDA---CLNTLKKMSYGGIHDHIGKG 387

Query: 361 FHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPG 420
           FHRY+VD RW VPHFEKMLYDQ QL   Y DA+ +TKD FYS I  DI  Y+ R +    
Sbjct: 388 FHRYTVDARWKVPHFEKMLYDQAQLIQCYADAYLITKDSFYSDIVDDIATYVLRILQHME 447

Query: 421 GEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI----------LFKEHYYL 470
           G  FSAEDADS  T  A+ K+EGAFYVWT   ++ +L +  +          L   H+ +
Sbjct: 448 GGFFSAEDADSLPTSDASAKREGAFYVWTYDRLKTLLKKEKVPGKDNVTYFDLICRHFSV 507

Query: 471 KPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRS 530
           +  GN +  +  DPH E  GKNV         +AS   + +E+    L E    LF+ R+
Sbjct: 508 RKEGNVESPQ--DPHGELTGKNVFSMQAGIEDTASHFKLSVEETQKHLKEACTILFEDRT 565

Query: 531 KRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAAS 590
            RP P LDDK++ +WNGL+IS  ARA   +K+                K Y+E A  AA+
Sbjct: 566 HRPWPQLDDKMVTAWNGLMISGLARAGIAVKN----------------KTYVEAATEAAT 609

Query: 591 FIRRHLYDEQTHRLQHS------------------------------FRNGPSKAPGFLD 620
           F+ ++L+D++   L  S                              +R+ P   PGF +
Sbjct: 610 FVEKYLFDKKKRILLRSCYRRRDDKIVQRQVLSLHQSVSRCEIYDAIYRSTP--IPGFHE 667

Query: 621 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 680
           DYAF + GLLDLYE      W+ +A ELQ+ QD LF D + GGYF    E P +L R K+
Sbjct: 668 DYAFYVKGLLDLYEATFNPHWVEFAEELQDIQDRLFWDLQDGGYFAMAEESP-ILTRTKD 726

Query: 681 ---------DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMA 731
                      DGA PS NS++  NL+RLA  +     D  R  AE  L  F  +L    
Sbjct: 727 FKIPMSFVVADDGALPSSNSIACSNLLRLAIYL---DRDDLRNKAEKLLCAFGNKLVSCP 783

Query: 732 MAVPLMCCAADMLSVPSRKHVV 753
            A P M  A      P++ +V 
Sbjct: 784 AACPQMMLALIEYHHPTQIYVT 805


>gi|83590501|ref|YP_430510.1| hypothetical protein Moth_1665 [Moorella thermoacetica ATCC 39073]
 gi|83573415|gb|ABC19967.1| Protein of unknown function DUF255 [Moorella thermoacetica ATCC
           39073]
          Length = 752

 Score =  474 bits (1221), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 285/715 (39%), Positives = 381/715 (53%), Gaps = 82/715 (11%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           +  NRL  E SPYLLQHA+NPVDW+ WGEEAFA A++ D P+FLSIGYSTCHWCHVM  E
Sbjct: 5   RRPNRLIHEKSPYLLQHAYNPVDWYPWGEEAFARAKREDKPVFLSIGYSTCHWCHVMARE 64

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SF DE VA LLND F++IKVDREERPD+D+VYM   QAL G GGWPL+VFL+P+ +P   
Sbjct: 65  SFNDEEVAALLNDSFIAIKVDREERPDIDQVYMAACQALTGSGGWPLTVFLTPEKRPFYA 124

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           GTYFP  ++YGRPG   +L+ +++ W   R+ L +SGA  I+ ++   + +      P E
Sbjct: 125 GTYFPKHNRYGRPGLVELLKLIREKWATHREELEESGAELIQHVAGQFAPTP-----PGE 179

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
                L    +QL   +D  +GGF  APKFP P ++  +L + K+ ++ G          
Sbjct: 180 PGAQVLEKGWQQLRAGFDPLYGGFSEAPKFPSPHQLLFLLRYWKRYDEAG-------ALA 232

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           MV  TLQ M  GGI+DH+G GF RYS D RW VPHFEKMLYD   LA  YL+    T   
Sbjct: 233 MVEKTLQAMYCGGIYDHIGFGFARYSTDRRWLVPHFEKMLYDNALLALAYLETRQATGKA 292

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 459
            YS++ R+I  ++ RDM  P G  +SA DADS   EG    +EG FY+WT  +V ++LG 
Sbjct: 293 VYSHVAREIFTWVLRDMTSPEGGFYSALDADS---EG----EEGRFYLWTPDQVREVLGA 345

Query: 460 HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI------ELNDSSASASK------- 506
               F   Y+   T   +    S P+   +G+ +        E ND++    +       
Sbjct: 346 KEGEFFCRYF-DITAGGNFEGRSIPNLIGRGEALFAAGTSGNESNDTAGDQRQPREQGGR 404

Query: 507 -----------LGMPLEKYLNILGEC----------------RRKLFDVRSKRPRPHLDD 539
                       G P E  L   G                  R KLF  R KR  PH DD
Sbjct: 405 AGGISGGGGCAKGSPEEDRLPGRGPTTLAGFGPATAARLAAAREKLFAAREKRVHPHRDD 464

Query: 540 KVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE 599
           K++ +WNGL+I++ AR + +L                D   Y   A  AA FI  HL D 
Sbjct: 465 KILTAWNGLMIAALARGAWVL----------------DEPAYAAAAARAARFILTHLRDA 508

Query: 600 QTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR 659
           +  RLQ  +R G +  P +LDDYAFL  GL++LY+    T +L  A+ L     ELF D 
Sbjct: 509 EG-RLQARYREGQAAFPAYLDDYAFLTWGLIELYQATFETGYLREALALTRQMQELFRD- 566

Query: 660 EGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 719
           EGGGYF T      + +R +E +DGA PSGNSV+ +NL+RLA I   S+ +   + A   
Sbjct: 567 EGGGYFFTPHGAGELPVRPREVYDGAIPSGNSVAALNLLRLARITGDSRLE---EEAAAQ 623

Query: 720 LAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASY 774
           +      + +         CA D    P    +VL G + + D   +L    A+Y
Sbjct: 624 VRALAGTVAEYPRGYSFYLCALDFYLGPV-TEIVLAGERETEDTRALLRVLRAAY 677


>gi|237755775|ref|ZP_04584378.1| thymidylate kinase [Sulfurihydrogenibium yellowstonense SS-5]
 gi|237692063|gb|EEP61068.1| thymidylate kinase [Sulfurihydrogenibium yellowstonense SS-5]
          Length = 686

 Score =  474 bits (1219), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 271/667 (40%), Positives = 365/667 (54%), Gaps = 53/667 (7%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           NK  NRL  E SPYLLQHA+NPVDW+ W +EAF +A+K D PIFLSIGYS+CHWCHVME 
Sbjct: 2   NKKPNRLINEKSPYLLQHAYNPVDWYPWCDEAFEKAKKEDKPIFLSIGYSSCHWCHVMEK 61

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE VAK+LN+ +VSIKVDREERPD+D +YM       G GGWPL++ ++PD KP  
Sbjct: 62  ESFEDEEVAKILNENYVSIKVDREERPDIDSIYMNVCLMFNGSGGWPLTIIMTPDKKPFF 121

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFP   + GR G   +L  V + W   ++ L Q     IE L +          + D
Sbjct: 122 AGTYFPKYSRPGRIGLVDLLTSVAEYWKNNKEDLIQRAEKVIEYLKDDFKG------IYD 175

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---YHSKKLEDTGKSGEAS 335
           E+ ++ +  C   L   +D  +GGF   PKFP P  I  +L   YH+K+          +
Sbjct: 176 EISKDIIDACYFDLKSRFDREYGGFSIKPKFPTPHNIMFLLRYYYHTKE----------T 225

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
           E  KM   TL  M  GG++DH+G GFHRYS D  W +PHFEKMLYDQ  L   Y +A+ L
Sbjct: 226 EALKMAEKTLINMRLGGMYDHIGFGFHRYSTDREWLLPHFEKMLYDQAMLTMAYTEAYQL 285

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           TK+ FY    ++ + Y+ RDM    G  +S+EDADS   EG    +EG FY WT  E+++
Sbjct: 286 TKNNFYKKTAQETITYVLRDMTSKEGVFYSSEDADS---EG----EEGKFYTWTIDELKE 338

Query: 456 ILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 514
           +L +  + L  + + +K  GN     + +      G+N+L         A+ L M  ++ 
Sbjct: 339 VLNDEELSLVIKVFNVKEEGN----YLEEATGHLTGRNILYLKKPIRELANDLNMNQDQL 394

Query: 515 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 574
              L E RRKLFD R KR  P  DDKV+  WNGL+IS+ A+A K                
Sbjct: 395 EAKLEEIRRKLFDAREKRVHPQKDDKVLTDWNGLMISALAKAGK---------------- 438

Query: 575 GSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE 634
           G + K+ +E A+ AA FI   ++   T  L H +++G  K  G LDDY F   GL++L E
Sbjct: 439 GFEDKDLIEKAKVAADFILNTMFKNDT--LYHLYKDGEIKVEGLLDDYTFFSWGLIELCE 496

Query: 635 FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
                K+L  A++L +   E F D E GG+F +      V++R KE  DGA PSGNSVS 
Sbjct: 497 ATGDIKYLKSALKLTDLMIEKFYDFENGGFFLSPKNSKDVIVRPKEAFDGAIPSGNSVSA 556

Query: 695 INLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVL 754
            NL RL  I    K   Y   A  +L  F   +K +     +      ++  P+ + VVL
Sbjct: 557 YNLYRLYLISGNEK---YYNFAIETLKAFGGEIKRLPSYHSMFNIVLMLVFYPTSE-VVL 612

Query: 755 VGHKSSV 761
            G+   V
Sbjct: 613 AGNCEKV 619


>gi|321265830|ref|XP_003197631.1| DUF255 domain protein [Cryptococcus gattii WM276]
 gi|317464111|gb|ADV25844.1| DUF255 domain protein, putative [Cryptococcus gattii WM276]
          Length = 772

 Score =  473 bits (1217), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 288/728 (39%), Positives = 404/728 (55%), Gaps = 41/728 (5%)

Query: 68  RPLAVISHRPIHPY-KVVAMAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAW 126
           +P+A +S R I P  + +     +  S +    + +N LA   SPYLLQH  NPV W  W
Sbjct: 10  KPVA-LSLRQIRPTPRAIYHLRMSSTSATDMTPRLSNVLAKSKSPYLLQHKDNPVAWQEW 68

Query: 127 GEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 186
             E  A A+K D PIFLS GYS CHWCHV+  ESFEDE  AK++N+WFV+IKVDREERPD
Sbjct: 69  SPETIALAQKLDKPIFLSSGYSACHWCHVLAHESFEDEETAKMMNEWFVNIKVDREERPD 128

Query: 187 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 246
           VD++YM+Y+QA+ GGGGWP+SVF++P L+P   GTYFP      RP F  +L+K+ + W+
Sbjct: 129 VDRMYMSYLQAVSGGGGWPMSVFMTPKLEPFFAGTYFP------RPNFHQLLKKIHNVWE 182

Query: 247 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSA 306
           + R+   + G   IE L +      +S  L   L  +       QLS   D R+GGF +A
Sbjct: 183 EDREKCEKMGKGVIEALKDMNDTGRTSESLSQLLSTSPASKLFAQLSTMNDPRYGGFTNA 242

Query: 307 ------PKFPR-PVEIQMMLYHSKKLEDTGKSGEASE-GQKMVLFTLQCMAKGGIHDHVG 358
                 PKFP   + ++ +   +       ++ E  E  ++M +  L+ M  GGI D VG
Sbjct: 243 GSSTRGPKFPSCSITLEPLARLASIPGGGARNAEIREDAREMGMKMLRSMWSGGIRDWVG 302

Query: 359 GGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT----KDVFYSY-ICRDILDYLR 413
           GG  RYSVDE+W VPHFEKMLYDQ QL +  LD   L      D    Y +  DIL Y  
Sbjct: 303 GGMARYSVDEKWMVPHFEKMLYDQTQLVSSCLDFARLYPADHPDRLLCYDLAADILKYTL 362

Query: 414 RDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPT 473
           RD+  P G  +SAEDADSAE +GA +K EGAFY+W   E++++LG+ A LF   + ++P 
Sbjct: 363 RDLKSPEGGFWSAEDADSAEYKGA-KKSEGAFYIWKKSEIDEVLGDDAPLFNSFFGVEPD 421

Query: 474 GNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRP 533
           GN D+  + D H E + KN+L +       A + G   ++  +I+ +   KL   R +R 
Sbjct: 422 GNVDI--IHDSHGEMRDKNILHQHKTYEEVALEFGKKEDEAKDIIVQACEKLRLKREERE 479

Query: 534 RPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIR 593
           RP LDDK++ +WNGL++++ ++AS +L    + +    P            A    +F++
Sbjct: 480 RPGLDDKILTAWNGLMLTALSKASTLLPPSYDISPQCLP-----------AALGIVNFVK 528

Query: 594 RHLYDEQTHRLQHSFRNGPSKAP-GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQ 652
            H++D  T  L  S+R G  K P    DDYAFLI GLL+LYE       +++A ELQ  Q
Sbjct: 529 SHMWDSSTRTLTRSYREG--KGPQAQTDDYAFLIQGLLNLYEATGDESHVLFAEELQKRQ 586

Query: 653 DELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYY 712
           DELF D   GGYF T+ EDP VL+R+K+  DGAEPS  +VS  NL R + +++    D Y
Sbjct: 587 DELFWDDHDGGYF-TSAEDPHVLVRMKDAQDGAEPSAAAVSAHNLSRFSLLLSSEFED-Y 644

Query: 713 RQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHA 772
              AE +       +     AV         L    R+ V++VG       +  L AA  
Sbjct: 645 EARAEATYLSMGPLIAQAPRAVGYAVSGLIDLEKGYRE-VIIVGSTKDDVVKKFLKAARE 703

Query: 773 SYDLNKTV 780
           +Y  N+ +
Sbjct: 704 TYFSNQVI 711


>gi|315425009|dbj|BAJ46683.1| hypothetical conserved protein [Candidatus Caldiarchaeum
           subterraneum]
          Length = 692

 Score =  473 bits (1216), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 277/659 (42%), Positives = 381/659 (57%), Gaps = 58/659 (8%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           +  NRL  E SPYLLQHA+NPVDW+ WGEEA  +AR+ + PIFLSIGYS+CHWCHVME E
Sbjct: 13  RKPNRLINERSPYLLQHAYNPVDWYPWGEEAIKKAREENKPIFLSIGYSSCHWCHVMEKE 72

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFEDE +A+LLN +FV +KVDREERPD+D+VYM  V  + G GGWPL+VFL+PDLKP  G
Sbjct: 73  SFEDEKIAELLNTFFVPVKVDREERPDIDEVYMKAVIMMTGHGGWPLTVFLTPDLKPFFG 132

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           GTYFPP  + G  G   ILR V + W K    + +    A EQ    L +  ++ K  D 
Sbjct: 133 GTYFPPRRRGGLRGLDEILRGVAELWRKDPKQVME----AAEQNVSLLKSFYTTEK-SDT 187

Query: 280 LPQNALRLCA-EQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
            P + L + A + L+ S+DS +GGFG APKFP PV +  +  +S  LE      +     
Sbjct: 188 TPSHNLVVTAFDILATSFDSLYGGFGGAPKFPMPVYLDFLQVYS-VLE------KEPAAV 240

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
           +MV  TL+ MA+GG+ DH+GGGF RYS D  W VPHFEKMLYD   LA VY++ + +T D
Sbjct: 241 RMVSTTLENMARGGLRDHLGGGFFRYSTDRVWLVPHFEKMLYDNALLARVYMNHYLITGD 300

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
            FY  I    LD+L  +M+ PGG  +SA DADS E        EG +YVW   E+E ILG
Sbjct: 301 SFYREIGASTLDWLVSEMMNPGGGFYSAVDADSPE-------GEGEYYVWRRGELEQILG 353

Query: 459 -EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
            E A +  + Y +  TGN +            GKN+L     ++  A++LG+       +
Sbjct: 354 PELAKIAAKTYAVTDTGNFE-----------HGKNILTMRKRTAELAAELGVDEPTLKQM 402

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
           L E + KL D R KRP P +DDK+I +WNG  +S+     +                 + 
Sbjct: 403 LEEAKNKLLDARRKRPAPGVDDKIIAAWNGFAVSALCTGYR----------------ATG 446

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGS 637
            K Y++ A     FI  +++   T  L   ++NG S   GFLDDYA +++ LLD++E   
Sbjct: 447 EKRYLDAALKTIDFIISNMWLNNT--LHRIYKNGAS-INGFLDDYAAVVNALLDVFEVSF 503

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
             ++L  A+++ N   ELF D   GG++ T  ED + + R+K+ +DGA PSGN+++   L
Sbjct: 504 EPRYLAVAVDVANRMVELFWDNVDGGFYYTV-EDVAGVTRIKDAYDGATPSGNTLAAAAL 562

Query: 698 VRLASIVAGSKSDYYRQNAEHSLAVFETRLKDM-AMAVPLMCCAADMLSVPSRKHVVLV 755
           ++L+ +   +K   Y Q  E +L  F +RL+   A    L+   A   +  SR  VVLV
Sbjct: 563 LKLSELTGETK---YLQYVEETLKCFASRLEAAPAEHTGLITVLAGFHT--SRMEVVLV 616


>gi|398309078|ref|ZP_10512552.1| hypothetical protein BmojR_06022 [Bacillus mojavensis RO-H-1]
          Length = 689

 Score =  473 bits (1216), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 269/685 (39%), Positives = 378/685 (55%), Gaps = 57/685 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           N   NRL AE SPYLLQHAHNPVDW+ WGEEAF +A++ + P+ +SIGYSTCHWCHVM  
Sbjct: 4   NNKPNRLIAEKSPYLLQHAHNPVDWYPWGEEAFEKAKRENKPVLVSIGYSTCHWCHVMAH 63

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE +A LLN+ FV+IKVDREERPDVD VYM   Q + G GGWPL+VF++PD KP  
Sbjct: 64  ESFEDEEIASLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLNVFITPDQKPFY 123

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFP   KY RPGF  +L  + + +   R+ +      A   L    +A  S      
Sbjct: 124 AGTYFPKTSKYNRPGFVDVLEHLSETFANDREHVEDIAENAANHLQTKTAAKTSEG---- 179

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
            L ++A+    +QL+  +D+ +GGFG APKFP P    M++Y  +    TG+        
Sbjct: 180 -LSESAIHRTFQQLANGFDTIYGGFGQAPKFPMP---HMLMYLLRYYHTTGQENALYNVT 235

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
           K    TL  MA GGI+DH+G GF RYS D+ W VPHFEKMLYD   L   Y +A+ +T++
Sbjct: 236 K----TLDSMANGGIYDHIGYGFARYSTDDEWLVPHFEKMLYDNALLLTAYTEAYQVTQN 291

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             Y  IC  I+ +++R+M    G  FSA DAD   TEG    +EG +YVW+ +E+   LG
Sbjct: 292 SRYKDICEQIITFIQREMTHEDGSFFSALDAD---TEG----EEGKYYVWSKEEILKTLG 344

Query: 459 EH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGMPLEKYL 515
           E    L+   Y +   GN            F+GKN+  LI        A   G+  E+  
Sbjct: 345 EDLGTLYCSVYDITEKGN------------FEGKNIPNLIHTKREQIKADG-GLTEEELS 391

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
             L + R KL   R +R  PH+DDKV+ SWN L+I+  A+A+K+ +              
Sbjct: 392 RKLEDARLKLLKTREERTYPHVDDKVLTSWNALMIAGLAKAAKVFQ-------------- 437

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
               +Y+ +AE A +FI  ++  +   R+   +R+G  K  GF+DDYAFL+   LDLYE 
Sbjct: 438 --EPQYLSLAEDAITFIENNVIIDG--RVMVRYRDGEVKNKGFIDDYAFLLWAYLDLYEA 493

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
                +L  A +L     +LF D E GG++ T  +  ++++R KE +DGA PSGNSV+ +
Sbjct: 494 SFDLSYLEKAKKLSEDMIDLFWDEEHGGFYFTGHDAEALIVREKEVYDGAVPSGNSVAAV 553

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 755
            L+RL   V G  S    + AE   +VF+  ++           +      P +K +V+ 
Sbjct: 554 QLLRLGQ-VTGDLS--LIEKAETMFSVFKPEIEAYPSGHSFFMQSVLKHMTP-KKEIVIF 609

Query: 756 GHKSSVDFENMLAAAHASYDLNKTV 780
           G     D + + +A   ++  N ++
Sbjct: 610 GRPDDPDRKQITSALQQAFIPNDSI 634


>gi|295695073|ref|YP_003588311.1| hypothetical protein [Kyrpidia tusciae DSM 2912]
 gi|295410675|gb|ADG05167.1| protein of unknown function DUF255 [Kyrpidia tusciae DSM 2912]
          Length = 716

 Score =  472 bits (1215), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 275/653 (42%), Positives = 366/653 (56%), Gaps = 52/653 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYLLQHA+NPVDWF W EEAF +A++ + P+FLSIGYSTCHWCHVME ESFE
Sbjct: 8   NRLAREKSPYLLQHAYNPVDWFPWSEEAFEKAQQENKPVFLSIGYSTCHWCHVMERESFE 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  VA+LLN  FV+IKVDREERPDVD +YM   QAL G GGWPL+VFL+P+ +P   GTY
Sbjct: 68  DPEVAELLNRHFVAIKVDREERPDVDHLYMAACQALTGQGGWPLTVFLTPEKEPFYAGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP   +YGRPG   +L +V   W+K  D +  +G     Q+ EAL  +A       E+  
Sbjct: 128 FPKRSRYGRPGLMELLTRVAQLWEKGADRVKDAGRHLTGQIGEALGRAAQG-----EVDA 182

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L    EQL  SYD  FGGFG APKFPRP ++  +L +  +   +G+     E   MV 
Sbjct: 183 GTLTRAFEQLLASYDHTFGGFGHAPKFPRPHDLLFLLRYGVR---SGR----REAFDMVQ 235

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL+ M +GGI DHVG GF RYS D RW +PHFEKMLYD   L   YL+A+    D  ++
Sbjct: 236 GTLEGMRRGGIWDHVGFGFARYSTDRRWLIPHFEKMLYDNALLVLTYLEAYQALGDQRWA 295

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHA 461
              R+I+ Y+RR+M  PGG  +SAEDADS   EG    +EG FYVWT +E+ + +G E  
Sbjct: 296 QTAREIVTYVRREMTDPGGGFYSAEDADS---EG----EEGKFYVWTPQEITEAVGPEDG 348

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN-DSSASASKLGMPLEKYLNILGE 520
            +   ++ +   GN +            G++VL E++ D    A +LGM  E+    +  
Sbjct: 349 EVLCRYFGVTEEGNFE-----------GGRSVLNEIDTDVDLLARELGMTPEEIDRKVRR 397

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
               L  VR +R  PH DDK++ +WNGL+I++ AR +++L                   +
Sbjct: 398 GLEILHSVRDRRVHPHKDDKILTAWNGLMIAALARGARVLGD----------------AD 441

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           Y+  A  AA ++ R L  +   RL   +R+G +   G+LDDYAF I GLL+LY+      
Sbjct: 442 YLVSARRAAEWLWRTL-RQGDGRLLARYRDGEAGILGYLDDYAFYIWGLLELYQADGDVA 500

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
           WL  AI L      LF D + GG F T  +  ++  R K   DGA PSGNSV  ++L+ L
Sbjct: 501 WLRRAIRLAQDVRTLFWDEKEGGCFLTGSDAEALWSRPKTAEDGALPSGNSVLALDLLWL 560

Query: 701 ASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVV 753
             +        + + AE  L  F   +            A D    PS + VV
Sbjct: 561 GRLTGDPA---WERWAEAQLRAFAGAVSRYPAGYTFFLTAWDFALGPSEEIVV 610


>gi|423680595|ref|ZP_17655434.1| hypothetical protein MUY_00405 [Bacillus licheniformis WX-02]
 gi|383441701|gb|EID49410.1| hypothetical protein MUY_00405 [Bacillus licheniformis WX-02]
          Length = 681

 Score =  472 bits (1215), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 268/610 (43%), Positives = 357/610 (58%), Gaps = 59/610 (9%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRL  E SPYLLQHAHNPVDW+ WGEEAF +A++ + P+ +SIGYSTCHWCHVM  ESF
Sbjct: 3   TNRLINEKSPYLLQHAHNPVDWYPWGEEAFEKAKRENKPVLVSIGYSTCHWCHVMAHESF 62

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           EDE VAKLLN+ FVSIKVDREERPDVD +YMT  Q + G GGWPL+VFL+PD KP   GT
Sbjct: 63  EDEEVAKLLNEKFVSIKVDREERPDVDSIYMTICQMMTGQGGWPLNVFLTPDQKPFYAGT 122

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP   ++ RPGF  +++++ D + K R+ +        E+ +  L   A S+   D L 
Sbjct: 123 YFPKTSRFNRPGFVEVVKQLSDTFAKNREHVEDIA----EKAANNLRIKAKSDA-GDSLG 177

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHSKKLEDTGKSGEASEGQKM 340
           ++ LR   +QL  S+D+ +GGFGSAPKFP P  +  +L YH         SGE +     
Sbjct: 178 EDILRRTYQQLINSFDAAYGGFGSAPKFPIPHMLTFLLRYHQ-------YSGEEN-ALYS 229

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
           V+ TL  MA GGI+DHVG GF RYS D+ W VPHFEKMLYD   L   Y +A+ +TK+  
Sbjct: 230 VMKTLDSMANGGIYDHVGYGFARYSTDDEWLVPHFEKMLYDNALLLIAYTEAYQITKNER 289

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-E 459
           Y  I   I+ ++RR+M    G  +SA DAD   TEG     EG +YVW+ +EV + LG E
Sbjct: 290 YKQISEQIITFVRREMTDEKGAFYSALDAD---TEGV----EGKYYVWSKEEVLETLGDE 342

Query: 460 HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKN----VLIELNDSSASASKLGMPLEKYL 515
              L+   Y +   GN            F+G N    +   L D      +  +  E+  
Sbjct: 343 LGELYCAVYNITQEGN------------FEGHNIPNLIYTRLEDIK---DEFALTDEELQ 387

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
           N L E R KLF+ R +R  PH+DDKV+ SWN L+I+  A+A+K+         +N P   
Sbjct: 388 NKLEEARTKLFEKRQERTYPHVDDKVLTSWNALMIAGLAKAAKV---------YNAP--- 435

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
               EY+E+A +AA FI   L   Q  R+   +R+G  K  GF+DDYAFL+   ++LYE 
Sbjct: 436 ----EYLEMARAAAEFIENKLI--QDGRIMVRYRDGEVKNKGFIDDYAFLLWAYIELYEA 489

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
                 L  A +L+     LF D E GG++ T  +  ++++R KE +DGA PSGN V  +
Sbjct: 490 SLDLTDLRKAKKLEADMKGLFWDEEHGGFYFTGSDAEALIVRDKEVYDGALPSGNGVLAV 549

Query: 696 NLVRLASIVA 705
            L RL  +  
Sbjct: 550 QLSRLGRLTG 559


>gi|315426698|dbj|BAJ48323.1| conserved hypothetical protein [Candidatus Caldiarchaeum
           subterraneum]
 gi|343485462|dbj|BAJ51116.1| conserved hypothetical protein [Candidatus Caldiarchaeum
           subterraneum]
          Length = 692

 Score =  472 bits (1215), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 275/655 (41%), Positives = 376/655 (57%), Gaps = 56/655 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYLLQHA+NPVDW+ WGEEA  +AR  + PIFLSIGYS+CHWCHVME ESFE
Sbjct: 16  NRLINERSPYLLQHAYNPVDWYPWGEEAIKKARGENKPIFLSIGYSSCHWCHVMEKESFE 75

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE +A+LLN +FV +KVDREERPD+D+VYM  V  + G GGWPL+VFL+PDLKP  GGTY
Sbjct: 76  DEKIAELLNTFFVPVKVDREERPDIDEVYMKAVIMMTGHGGWPLTVFLTPDLKPFFGGTY 135

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPP  + G  G   ILR V + W K    + +    A EQ    L +  ++ K       
Sbjct: 136 FPPRRRGGLRGLDEILRGVAELWRKDPKQVME----AAEQNVSLLKSFYTTEKSVTTPSH 191

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
           N +    + L+ S+DS +GGFG APKFP PV +  +  +S  LE      + S   +MV 
Sbjct: 192 NLVVTAFDILATSFDSLYGGFGGAPKFPMPVYLDFLQVYS-VLE------KESAAVRMVS 244

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL+ MA+GG+ DH+GGGF RYS D  W VPHFEKMLYD   LA VY++ + +T D FY 
Sbjct: 245 TTLENMARGGLRDHLGGGFFRYSTDRVWLVPHFEKMLYDNALLARVYMNHYLITGDSFYR 304

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHA 461
            I    LD+L  +M+ PGG  +SA DADS E        EGA+YVW   E+  ILG E A
Sbjct: 305 EIGASTLDWLVSEMMNPGGGFYSAVDADSPE-------GEGAYYVWRLGELGQILGPELA 357

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
            +  + Y +  TGN +            GKN+L     ++  A++LG+       +L E 
Sbjct: 358 KIAAKTYAVTDTGNFE-----------HGKNILTMRKRTAELAAELGVDEPTLKQMLEEA 406

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           + KL D R KRP P +DDK+I +WNG  +S+     +                 +  K Y
Sbjct: 407 KNKLLDARRKRPAPGVDDKIIAAWNGFAVSALCTGYR----------------ATGEKRY 450

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
           ++ A     FI  +++   T  L   ++NG S   GFLDDYA +++ LLD++E     ++
Sbjct: 451 LDAALKTIDFIISNMWLNNT--LHRIYKNGAS-INGFLDDYAAVVNALLDVFEVSFEPRY 507

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           L  A+++ N   ELF D   GG++ T  ED + + R+K+ +DGA PSGN+++   L++L+
Sbjct: 508 LAVAVDVANRMVELFWDNVDGGFYYTV-EDVAGVTRIKDAYDGATPSGNTLAAAALLKLS 566

Query: 702 SIVAGSKSDYYRQNAEHSLAVFETRLKDM-AMAVPLMCCAADMLSVPSRKHVVLV 755
            +   +K   Y Q  E +L  F +RL+   A    L+   A   +  SR  VVLV
Sbjct: 567 ELTGETK---YLQYVEETLKCFASRLEAAPAEHTGLITVLAGFHT--SRMEVVLV 616


>gi|421839588|ref|ZP_16273125.1| hypothetical protein CFSAN001627_27670 [Clostridium botulinum
           CFSAN001627]
 gi|409733965|gb|EKN35825.1| hypothetical protein CFSAN001627_27670 [Clostridium botulinum
           CFSAN001627]
          Length = 680

 Score =  472 bits (1214), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 267/675 (39%), Positives = 371/675 (54%), Gaps = 62/675 (9%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRL  E SPYLLQHAHNPVDW+ WGEEAF +A+  D P+FLSIGYSTCHWCHVME ESF
Sbjct: 6   TNRLINEKSPYLLQHAHNPVDWYPWGEEAFEKAKIEDKPVFLSIGYSTCHWCHVMERESF 65

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           EDE VA++LN  F+SIKVDREERPD+D +YM + QA  G GGWPL++ ++PD KP   GT
Sbjct: 66  EDEEVAEVLNKNFISIKVDREERPDIDSIYMNFCQAYTGSGGWPLTIIMTPDKKPFFAGT 125

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP   KY  PG   ILR + + W + ++ + +S    +EQ+          N    EL 
Sbjct: 126 YFPKWGKYNVPGIMDILRSISNLWREDKNKILESSNRILEQIER-----FQDNHREGELE 180

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQK 339
           +  +    + L  ++D+++GGFG+ PKFP    I  +L  Y+ KK           +   
Sbjct: 181 EYIIEEAIKTLLDNFDNQYGGFGTYPKFPTAHYILFLLRYYYFKK---------DKKILD 231

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           +V  TL  M KGGI DH+G GF RYS D +W VPHFEKMLYD   L+  Y +A+  TK+ 
Sbjct: 232 IVNKTLTSMYKGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALLSMAYTEAYEATKNP 291

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 459
            +  I   +L+Y+++ M    G  +SAEDADS   EG     EG FY+WT +E+ DILGE
Sbjct: 292 LFKDITEKVLNYVKKSMTSEKGGFYSAEDADS---EGV----EGKFYLWTKEEIMDILGE 344

Query: 460 HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
                 E Y       C +  ++   N F+ KN+   +N            LEK      
Sbjct: 345 EE---GEFY-------CKIYDITSKGN-FENKNIANLINTDLKIVDNNKDKLEK------ 387

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
             R KLF+ R KR  P+ DDK++ SWN L+I +F++A + LK++                
Sbjct: 388 -IREKLFEYREKRIHPYKDDKILTSWNALMIVAFSKAGRSLKND---------------- 430

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
            Y+E+A+ +A+FI  +L DE+   L    R G     GF+DDYAF +  L++LYE     
Sbjct: 431 NYIEIAKKSANFIIENLMDEKG-TLYARIREGERGNEGFIDDYAFFLWALIELYEASFDI 489

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
            +L  +IE+ N+  +LF  +E GG++  +     +L+R KE +DGA PSGN+V+ + L  
Sbjct: 490 YYLEKSIEVANSMIDLFWHKEDGGFYLYSKNSEKLLVRPKEIYDGATPSGNAVASLTLNL 549

Query: 700 LASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKS 759
           L  I      D Y+   +     F T +K   M   L    A M ++   K + L  +K 
Sbjct: 550 LYYITG---EDRYKDLVDKQFKFFATNIKSGPM-YHLFSVIAYMYNISPVKEITLAYNKK 605

Query: 760 SVDFENMLAAAHASY 774
             DF   +   +  Y
Sbjct: 606 DEDFYKFINEVNNRY 620


>gi|221632535|ref|YP_002521756.1| hypothetical protein trd_0509 [Thermomicrobium roseum DSM 5159]
 gi|221156894|gb|ACM06021.1| Protein of unknown function, DUF255 family [Thermomicrobium roseum
           DSM 5159]
          Length = 687

 Score =  472 bits (1214), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 268/696 (38%), Positives = 384/696 (55%), Gaps = 81/696 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E S YL QHA NPVDW+ W EEAF  AR++D PI LSIGYS+CHWCHVME E FE
Sbjct: 3   NRLANEKSLYLRQHADNPVDWYPWCEEAFRVAREQDKPILLSIGYSSCHWCHVMERECFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +  +A+L N+ FV+IKVDREERPD+D++YM  +QA+ G GGWPL+VFL+PD KP  GGTY
Sbjct: 63  NPEIAQLQNELFVNIKVDREERPDLDELYMNALQAMTGSGGWPLNVFLTPDGKPFYGGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSG----AFAIEQLSEALSASASSNKLPD 278
           FPPED+   P +  +L  V  A+ ++R  + ++     ++  +Q    L A+    +  D
Sbjct: 123 FPPEDRGQLPAWPRVLLAVAQAYRERRADVERAAEDLVSYLQQQSRPPLQAAPLREQFLD 182

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
           E  +N        L   YD   GGFG+APKFP P++++ +L        T +   A    
Sbjct: 183 EAARN--------LVPHYDREHGGFGTAPKFPSPLQLEFLLR-------TFRRAGAPRAL 227

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
           +MVL TL  MA+GGIHD +GGGFHRY+VDE W VPHFEKMLYD   LA VY  A   + +
Sbjct: 228 EMVLQTLTAMARGGIHDQIGGGFHRYTVDEAWLVPHFEKMLYDNALLARVYTLAHLASGN 287

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
                I  + L Y++R+M G  G  F+A+DADS E        EGAFY+WT +E+  +LG
Sbjct: 288 RLCRTIAEETLVYIQREMRGDHGAFFAAQDADSEE-------GEGAFYLWTPEEIAAVLG 340

Query: 459 -EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
            + A L   ++ + P GN            F+GK++L    D    AS+ G+ L++    
Sbjct: 341 NDDAGLACRYFGVTPRGN------------FEGKSILHVAEDPVTIASEFGLSLDELEQR 388

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
           +G  R +L++ R +RP P  D+KVIV+WN L I +FA A   L                D
Sbjct: 389 IGSIRARLYEARDQRPHPARDEKVIVAWNALAIRAFAEAGTAL----------------D 432

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGS 637
           R +++ +AE AA+F+R  L+D +T  L H +  G ++ PGFLDDYA L++ L+ LYE   
Sbjct: 433 RPDFVALAERAATFLRDQLWDGKT--LYHVWEEGEARFPGFLDDYADLVNALVSLYEATF 490

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
              W+ WA +L       F+D   G +++T  +   +++R K   D   PSGN  +   L
Sbjct: 491 DPFWIAWARQLTEAILAKFIDPVAGDFYDTASDGEQLIVRPKTFIDQGTPSGNGATAEAL 550

Query: 698 VRLASIVAGSK---------SDYYRQNAEHSLAVFETRLK-DMAMAVPLMCCAADMLSVP 747
           +RL +++   +           Y +   EH +A  +  L  D A+  P            
Sbjct: 551 LRLGTLLGEHRFIDQARTLLERYAQLAVEHPIACGQLLLAMDFALGQPF----------- 599

Query: 748 SRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVSKK 783
               V ++G  +  +   +L    ASY  N+ ++ +
Sbjct: 600 ---EVAIIGDPTQPETRALLRVVQASYLPNRVLALR 632


>gi|376259602|ref|YP_005146322.1| thioredoxin domain-containing protein [Clostridium sp. BNL1100]
 gi|373943596|gb|AEY64517.1| thioredoxin domain protein [Clostridium sp. BNL1100]
          Length = 673

 Score =  471 bits (1213), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 275/646 (42%), Positives = 366/646 (56%), Gaps = 63/646 (9%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           + NK  N+L  E SPYLLQHAHNPVDW+ WG EAF+ A   D PIFLSIGYSTCHWCHVM
Sbjct: 3   TNNKMPNKLIQEKSPYLLQHAHNPVDWYPWGPEAFSRAAGEDKPIFLSIGYSTCHWCHVM 62

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
           E ESFEDE VA +LN  F+ IKVDREERPD+D +YM+  QAL G GGWPL+VFL+PD +P
Sbjct: 63  ERESFEDEDVAHILNRDFICIKVDREERPDIDSIYMSVCQALTGHGGWPLTVFLTPDRQP 122

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKL 276
              GTYFP ED  G  G  ++L  VK+AWD KRD L +S    IE +S+         K+
Sbjct: 123 FYAGTYFPKEDSRGFMGLMSLLGSVKEAWDNKRDKLLESAKSIIEHVSQ--------EKV 174

Query: 277 PDE--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEA 334
            DE  + ++ +    +    ++DS++GGFG++PKFP P  +  +L    +   T K   A
Sbjct: 175 SDEAKISKDIIHEAFKHFKYNFDSKYGGFGTSPKFPSPHTLLFLL----RYWYTEKEPFA 230

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
            E   MV  TL+ M  GGI DH+G GF RYS D++W VPHFEKMLYD   LA  Y +AFS
Sbjct: 231 LE---MVEKTLESMKNGGIFDHIGFGFSRYSTDKKWLVPHFEKMLYDNALLAIAYGEAFS 287

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
            T +  Y    R ILDY++RDM    G  +SAEDADS   EG     EG FY+W+ +E  
Sbjct: 288 ATGNKNYEETARQILDYVQRDMTSQFGAFYSAEDADS---EGV----EGKFYIWSREEAI 340

Query: 455 DILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 514
           D+LG       E Y       C L  ++   N F+G N+   +N         G   E+ 
Sbjct: 341 DVLGSKD---AEEY-------CRLFDITSSGN-FEGLNIPNLINS--------GTLTEQQ 381

Query: 515 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 574
            +   +CR+KLF  R KR  P+ DDKV+ SWNGL+ ++ A   +I               
Sbjct: 382 KSFAEDCRKKLFSHREKRIHPYKDDKVLTSWNGLMTAAMAYCGRIF-------------- 427

Query: 575 GSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE 634
           G DR  Y+E A+    FI + L      RL   +R+G +  P +L+DYAFL+ GLL+LYE
Sbjct: 428 GEDR--YIESAKRCVDFIYKKLI-RTDGRLLARYRDGEAVFPAYLEDYAFLVWGLLELYE 484

Query: 635 FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
               T +L  A++L +    LF +    G F    +   ++ R +E +DGA PSGNSV+ 
Sbjct: 485 ATFTTIYLKRALKLTDAMLNLFGENNSAGLFLYGHDSEQLISRPRESYDGAIPSGNSVAA 544

Query: 695 INLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 740
           +NL+RLA I    +   Y   A+  +  F  +++        M C+
Sbjct: 545 MNLLRLARITGHHE---YENRAKAIMDFFSNQVEVAPTGHSYMLCS 587


>gi|226948333|ref|YP_002803424.1| hypothetical protein CLM_1215 [Clostridium botulinum A2 str. Kyoto]
 gi|226841180|gb|ACO83846.1| conserved hypothetical protein [Clostridium botulinum A2 str.
           Kyoto]
          Length = 680

 Score =  471 bits (1213), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 267/675 (39%), Positives = 372/675 (55%), Gaps = 62/675 (9%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRL  E SPYLLQHAHNPVDW+ WGEEAF +A+  D P+FLSIGYSTCHWCHVME ESF
Sbjct: 6   TNRLINEKSPYLLQHAHNPVDWYPWGEEAFEKAKIEDKPVFLSIGYSTCHWCHVMERESF 65

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           EDE VA++LN  F+SIKVDREERPD+D +YM + QA  G GGWPL++ ++PD KP   GT
Sbjct: 66  EDEEVAEVLNKNFISIKVDREERPDIDSIYMNFCQAYTGSGGWPLTIIMTPDKKPFFAGT 125

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP   KY  PG   ILR + + W + ++ + +S    +EQ+          N    EL 
Sbjct: 126 YFPKWGKYNVPGIMDILRSISNLWREDKNKILESSNRILEQIER-----FQDNHREGELE 180

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQK 339
           +  +   A+ L  ++D+++GGFG+ PKFP    I  +L  Y+ KK           +   
Sbjct: 181 EYIIEEAAKTLLDNFDNQYGGFGTYPKFPTAHYILFLLRYYYFKK---------DKKILD 231

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           +V  TL  M KGGI DH+G GF RYS D +W VPHFEKMLYD   L+  Y +A+  TK+ 
Sbjct: 232 IVNKTLTSMYKGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALLSMAYTEAYEATKNP 291

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 459
            +  I   +L+Y+++ M    G  +SAEDADS   EG     EG FY+WT +E+ DILGE
Sbjct: 292 LFKDITEKVLNYVKKSMTSEKGGFYSAEDADS---EGV----EGKFYLWTKEEIMDILGE 344

Query: 460 HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
                 E Y       C +  ++   N F+ KN+   +N            LEK      
Sbjct: 345 EE---GEFY-------CKIYDITSKGN-FENKNIANLINTDLKIVDNNKDKLEK------ 387

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
             R KLF+ R KR  P+ DDK++ SWN L+I +F++A + LK++                
Sbjct: 388 -IREKLFEYREKRIHPYKDDKILTSWNALMIVAFSKAGRSLKND---------------- 430

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
            Y+E+A+ +A+FI  +L DE+   L    R G     GF+DDYAF +  L++LYE     
Sbjct: 431 NYIEIAKKSANFIIENLMDEKG-TLYARIREGERGNEGFIDDYAFFLWALIELYEASFDI 489

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
            +L  +IE+ N+  +LF  +E GG++  +     +L+R KE +DGA PSGN+V+ + L  
Sbjct: 490 YYLEKSIEVANSMIDLFWHKEDGGFYLYSKNSEKLLVRPKEIYDGATPSGNAVASLTLNL 549

Query: 700 LASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKS 759
           L  I      D Y+   +     F T +K   M   L    A M ++   K + L  ++ 
Sbjct: 550 LYYITG---EDRYKDLVDKQFKFFATNIKSGPM-YHLFSVIAYMYNISPVKEITLAYNEK 605

Query: 760 SVDFENMLAAAHASY 774
             DF   +   +  Y
Sbjct: 606 DEDFYKFINELNNRY 620


>gi|325958772|ref|YP_004290238.1| hypothetical protein Metbo_1019 [Methanobacterium sp. AL-21]
 gi|325330204|gb|ADZ09266.1| hypothetical protein Metbo_1019 [Methanobacterium sp. AL-21]
          Length = 702

 Score =  471 bits (1212), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 281/673 (41%), Positives = 372/673 (55%), Gaps = 50/673 (7%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           S+N + N L  E SPYL+QH+ NPVDW+ WG+EAF +A+K D PIFLSIGYSTCHWCHVM
Sbjct: 9   SKNSY-NHLKGEKSPYLIQHSKNPVDWYPWGDEAFEKAKKLDKPIFLSIGYSTCHWCHVM 67

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
             ESFED  VA+LLN+ FV++KVDREERPDVD VYM   Q + G GGWPL++ ++ D KP
Sbjct: 68  AHESFEDLEVAELLNNNFVAVKVDREERPDVDSVYMAACQIMTGTGGWPLTIIMTHDKKP 127

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKL 276
              GTYFP E  +G  G K +L  V D W  +R     SG    +Q+  AL    S N  
Sbjct: 128 FFAGTYFPKESSFGNIGLKDLLLNVMDIWRDERKNALDSG----DQIFRALK-EMSVNTK 182

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
             +L    L    +QLSK +D   GGFG   KFP P  +  +L + K+   TG     + 
Sbjct: 183 GKQLDSTILEKTYDQLSKVFDVENGGFGDFQKFPTPHSLMFLLRYWKR---TGNKHSLN- 238

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
              MVL TL  MA GGI+DHVG GFHRYSVD+ W VPHFEKMLYDQ  +A +Y + +S T
Sbjct: 239 ---MVLKTLDEMAMGGIYDHVGFGFHRYSVDKNWLVPHFEKMLYDQALIAMLYTEVYSAT 295

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
               Y    + I +Y+ RDM    G  +SAEDADS   EG     EG FY WT +E+  I
Sbjct: 296 GKFEYKKTAQQIYEYVLRDMTDVEGGFYSAEDADS---EGV----EGKFYYWTYEELYSI 348

Query: 457 LG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
           L  + A L  E + +K  GN      +D ++     N+L +  D    A   G+ +    
Sbjct: 349 LDKDSADLITEVFNVKKDGN-----FNDGYSNESINNILHKKRDYKKIAENKGLNISDLE 403

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
            ++ +   +LF VR KR  PH DDK++  WNGL+I+S +RA ++ + E            
Sbjct: 404 ELVDDILSELFLVREKRVHPHKDDKILTDWNGLMIASLSRAFQVFEEE------------ 451

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
               +Y++ AE+  +FI    Y  Q +RL H FR+G S   G LDDY F+I GLL++Y  
Sbjct: 452 ----KYVKAAENCVNFIMNKSY--QQNRLMHMFRDGESAVYGNLDDYTFMIWGLLEIYMA 505

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
                +L  A++L  T  E F D E GG++ T  ++  VL+R K+  D A PSGNSV  +
Sbjct: 506 TFNVDYLEKAMDLNQTVVEHFWDEENGGFYFTADDEEKVLIREKKTFDSAIPSGNSVEFL 565

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSL-AVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVL 754
           NL+RL S      +D+ + +    L  VF   +K             D    PS   VV+
Sbjct: 566 NLLRLGSFT----NDHNQMDTARKLETVFSETVKRSPTGHTQFISGVDFALGPSYS-VVI 620

Query: 755 VGHKSSVDFENML 767
           VG   S D   ML
Sbjct: 621 VGDGDSEDTIEML 633


>gi|269836164|ref|YP_003318392.1| hypothetical protein Sthe_0131 [Sphaerobacter thermophilus DSM
           20745]
 gi|269785427|gb|ACZ37570.1| protein of unknown function DUF255 [Sphaerobacter thermophilus DSM
           20745]
          Length = 685

 Score =  471 bits (1211), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 273/686 (39%), Positives = 380/686 (55%), Gaps = 61/686 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYLLQHA NPVDW+ WGEEA   AR +D PI LSIGY+ CHWCHVME ESFE
Sbjct: 3   NRLQHETSPYLLQHADNPVDWYPWGEEALEAARTQDKPILLSIGYAACHWCHVMERESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +  +A L+N  F++IKVDREERPD+D VYM   Q + G GGWPL++FL PD KP   GTY
Sbjct: 63  NPDIAALMNQHFINIKVDREERPDLDTVYMAAAQMMTGQGGWPLTIFLMPDGKPFYAGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPPED+ G PGF  +L  V +A+  +R  L ++       L+E    S     +   L  
Sbjct: 123 FPPEDRSGMPGFPRVLLAVAEAYRNRRADLERAANDIQGHLTEHFRWSLPETAITPAL-- 180

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHSKKLEDTGKSGEASEGQKMV 341
             L   A  L++ +D   GGFG APKFP P+ ++ +L Y  +   DT          ++V
Sbjct: 181 --LNEAASGLARQFDEANGGFGGAPKFPPPMALEFLLRYRLRTGSDTAL--------RIV 230

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             TL+ MA+GGIHD VGGGFHRY+VD  W VPHFEKMLYD   LA +Y   +  T   FY
Sbjct: 231 ELTLERMARGGIHDQVGGGFHRYAVDATWLVPHFEKMLYDNALLARLYTLTYQATGHPFY 290

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EH 460
           +    D ++Y+ R+M  P G  +S +DADS   EG    +EG FYVWT +E+E +LG E 
Sbjct: 291 AATALDTIEYVLREMTSPDGGFYSTQDADS---EG----EEGKFYVWTPEELEAVLGPEQ 343

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
           A +   +Y + P GN            F+GK++L       + A+   + +++ + I+G 
Sbjct: 344 APIVARYYGVHPGGN------------FEGKSILHVPEAPESVAAAFDLTIDELVEIIGP 391

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R KL+  R++R  P  D+K++  WNGL++ + A+A+  L                 R +
Sbjct: 392 AREKLYAARAQRVWPGRDEKILTDWNGLMLRALAQAAIALG----------------RSD 435

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
             + A   A+F+  HLY  +  RL HS+++G +K  G+L DYA LI+GLL LYE     +
Sbjct: 436 LRDAAVRNATFLHTHLY--RDGRLLHSYKDGEAKITGYLADYASLIAGLLALYEATFDVR 493

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
           W+ WA +L +     F D EGG +F+T+ +D  ++ R K+  D A PSGNS+   +L+RL
Sbjct: 494 WIAWARDLTDRAIADFWDNEGGAFFDTSADDAPLVARPKDAFDSATPSGNSLMAESLLRL 553

Query: 701 ASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL---MCCAADMLSVPSRKHVVLVGH 757
             +      D YRQ A   + V E R   +A   P        A  L++     + LVG 
Sbjct: 554 GLL---LGEDDYRQRA---MTVLE-RFAALAAKAPTGFGQLLCAADLALAEAHEIALVGD 606

Query: 758 KSSVDFENMLAAAHASYDLNKTVSKK 783
                   MLA     Y  ++ V+ +
Sbjct: 607 PQVPAMAEMLAVVQQPYLPHQVVALR 632


>gi|373458119|ref|ZP_09549886.1| hypothetical protein Calab_1940 [Caldithrix abyssi DSM 13497]
 gi|371719783|gb|EHO41554.1| hypothetical protein Calab_1940 [Caldithrix abyssi DSM 13497]
          Length = 684

 Score =  471 bits (1211), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 276/688 (40%), Positives = 386/688 (56%), Gaps = 64/688 (9%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           +K+TNRL  E SPYL QHAHNPVDW+ WG EA + AR+++ PI LSIGYS CHWCHVME 
Sbjct: 2   HKYTNRLIDETSPYLQQHAHNPVDWYPWGGEALSLAREQNKPILLSIGYSACHWCHVMEK 61

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE  A+L+N  FV+IKVDREERPD+D+ YM +VQ L G GGWPL+VFL+PD +P  
Sbjct: 62  ESFEDEETAQLMNRLFVNIKVDREERPDIDQHYMEFVQTLTGSGGWPLTVFLTPDGEPFY 121

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK--- 275
           GGTYFPPED+YG+P FK +L  V + + K R  L ++    ++++ E ++      K   
Sbjct: 122 GGTYFPPEDRYGKPAFKKLLVMVSEYYHKNRQQLEEN----LDKIREIMARQRREIKGRH 177

Query: 276 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
           +PD     A     ++L++ YD+  GG G APKFP    +Q+     +K    G      
Sbjct: 178 IPDT---EAWNQAVQRLTQFYDALNGGMGQAPKFP---AVQVFSLFLRKFAHHGD----K 227

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
           +  +M   TLQ MA GGI+D +GGGF RY+VDE+W VPHFEKMLYD  QLA++Y+DA+ L
Sbjct: 228 QFLRMAEHTLQRMANGGIYDQLGGGFARYAVDEKWRVPHFEKMLYDNAQLASLYIDAYRL 287

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           T++ FY  I R+ L+++RR++  P G  +S+ DADS   EG    +EG FY+W+  E+  
Sbjct: 288 TQNPFYLQIARETLEFVRRELTDPDGGFYSSLDADS---EG----QEGKFYLWSKDEILK 340

Query: 456 ILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 514
           ILG E   LF   + +   GN            F+G N+L         A++     E+ 
Sbjct: 341 ILGDETGRLFCARFGVTDGGN------------FEGSNILFVSKSFDELAAEFKKTPEEI 388

Query: 515 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 574
             ++ + R+K+   R +R RP LD K + SWNGL++S+FA A ++  +            
Sbjct: 389 EALIRQARKKMLAEREQRIRPGLDYKALTSWNGLMLSAFAAAYQVTLNPT---------- 438

Query: 575 GSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE 634
                 Y  V +    F+RR+LY  Q+ RL H +  G SK   F+DDYA+LI GLLD YE
Sbjct: 439 ------YAAVIDKNIDFVRRNLY--QSGRLLHVYSKGQSKIDAFVDDYAYLIQGLLDAYE 490

Query: 635 FGSGTKWLVWAIELQNTQDELFLDREGGGY-FNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
                 +L  A+EL    ++LF D+  GGY F  TG+D +     K + D ++PS  +V 
Sbjct: 491 ALFDEHYLQMAVELTRRANDLFWDKRHGGYFFEATGKDQAK-RHFKSETDASQPSPTAVM 549

Query: 694 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM-LSVPSRKHV 752
           + N +RL           Y Q AE  +  +  +  +   A      A D  LS P     
Sbjct: 550 LHNQLRLFHFTG---EQLYLQTAEQLMRKYGQKALENPYAFASFLNALDFYLSQPLE--- 603

Query: 753 VLVGHKSSVDFENMLAAAHASYDLNKTV 780
           +L+  K    F+       + Y  NK V
Sbjct: 604 ILILKKDQQRFDAFQKLIFSRYLPNKVV 631


>gi|58262588|ref|XP_568704.1| hypothetical protein [Cryptococcus neoformans var. neoformans
           JEC21]
 gi|57230878|gb|AAW47187.1| conserved hypothetical protein [Cryptococcus neoformans var.
           neoformans JEC21]
          Length = 773

 Score =  470 bits (1210), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 287/739 (38%), Positives = 406/739 (54%), Gaps = 45/739 (6%)

Query: 57  SLPRNYLYPFRRPLAVISHRPIHPY-KVVAMAERTPASTSHSRNKHTNRLAAEHSPYLLQ 115
           SLPR       +P+ V     I P  + +     +  S +    + +N LA   SPYLLQ
Sbjct: 4   SLPRTL-----KPIIVPFPPQIRPTPRGIYHLRMSSTSATDPTPRLSNVLAKSKSPYLLQ 58

Query: 116 HAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFV 175
           H  NPV W  W  E  A A+K D PIFLS GYS CHWCHV+  ESFEDE  AK++N+WFV
Sbjct: 59  HKDNPVAWQEWSPETIALAQKLDKPIFLSSGYSACHWCHVLAHESFEDEETAKMMNEWFV 118

Query: 176 SIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFK 235
           +IKVDREERPDVD++YM+Y+QA+ GGGGWP+S+F++P L+P   GTYFP      RP F 
Sbjct: 119 NIKVDREERPDVDRMYMSYLQAVSGGGGWPMSIFMTPKLEPFFAGTYFP------RPNFH 172

Query: 236 TILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKS 295
            +L K+ + W++ R+   + G   IE L +      +S  L   L  +       QLS  
Sbjct: 173 QLLNKIHEVWEEDREKCEKMGKGVIEVLKDMSHTGRTSESLSQLLASSPASKLFSQLSTM 232

Query: 296 YDSRFGGF---GSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS-----EGQKMVLFTLQC 347
            D+R+GGF   GS+ + P+     + L    +L      G  +     + ++M +  L+ 
Sbjct: 233 NDTRYGGFTNSGSSTRGPKFPSCSITLEPLARLASIPGGGARNAEIREDAREMGMKMLRS 292

Query: 348 MAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT----KDVFYSY 403
           M  GGI D VGGG  RYSVDE+W VPHFEKMLYDQ QL +  LD   L     +D    Y
Sbjct: 293 MWSGGIRDWVGGGMARYSVDEKWMVPHFEKMLYDQAQLVSSCLDFARLYPVDHQDRLLCY 352

Query: 404 -ICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            +  DIL Y  RD+  P G  +SAEDADSAE +GA +K EGAFY+W   E++++LG+ A 
Sbjct: 353 DLAADILKYTLRDLKSPEGGFWSAEDADSAEYKGA-KKSEGAFYIWKKTEIDEVLGDDAP 411

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
           LF   + ++P GN D+  + D H E +GKN+L +       A + G   ++   I+ +  
Sbjct: 412 LFNSFFGVQPDGNVDI--IHDSHGEMRGKNILHQHKTYEEVALEFGKREDQAKGIIIQAC 469

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYM 582
            KL   R +R RP LDDK++ +WNGL++++ ++AS +L           P     R + +
Sbjct: 470 EKLRLKREERERPGLDDKILTAWNGLMLTALSKASTLL-----------PPSYGIRSQCL 518

Query: 583 EVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP-GFLDDYAFLISGLLDLYEFGSGTKW 641
             A    +F++ H++D  T  L  S+R G  K P    DDYAFL+ GLL+LYE       
Sbjct: 519 PAALGIVNFVKSHMWDSSTRTLTRSYREG--KGPQAQTDDYAFLVQGLLNLYEATGDESH 576

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           +++A ELQ  QDELF D   GGYF  + ED  VL+R+K+  DGAEPS  +VS  NL R +
Sbjct: 577 VLFAEELQKRQDELFWDDHDGGYF-ASAEDAHVLVRMKDAQDGAEPSAAAVSAHNLSRFS 635

Query: 702 SIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSV 761
            +++ S+ + Y   AE +       +     AV         L    R+ V+++G  S  
Sbjct: 636 LLLS-SEFENYEARAEATFLSMGPLITQAPRAVGYAVSGLIDLEKGYRE-VIVIGSASDE 693

Query: 762 DFENMLAAAHASYDLNKTV 780
             +  L AA  +Y  N+ +
Sbjct: 694 VVKKFLEAARKTYFSNQVI 712


>gi|168182912|ref|ZP_02617576.1| dTMP kinase [Clostridium botulinum Bf]
 gi|182673930|gb|EDT85891.1| dTMP kinase [Clostridium botulinum Bf]
          Length = 682

 Score =  470 bits (1209), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 266/678 (39%), Positives = 371/678 (54%), Gaps = 64/678 (9%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K TNRL  E SPYLLQHAHNPVDW+ WGEEAF +A+  D P+FLSIGYSTCHWCHVME E
Sbjct: 6   KKTNRLIKEKSPYLLQHAHNPVDWYPWGEEAFEKAKIEDKPVFLSIGYSTCHWCHVMERE 65

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFEDE VA++LN  F+SIKVDREERPD+D +YM + QA  G GGWPL++ ++PD KP   
Sbjct: 66  SFEDEEVAEVLNKNFISIKVDREERPDIDSIYMNFCQAYTGSGGWPLTIIMTPDKKPFFA 125

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           GTYFP   KY  PG   ILR + + W + ++ + +S    +EQ+          N    E
Sbjct: 126 GTYFPKWGKYNVPGIMDILRSISNLWREDKNKILESSNRILEQIER-----FQDNHREGE 180

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEG 337
           L +  +    + L  ++D+++GGFG+ PKFP    I  +L  Y+ KK          ++ 
Sbjct: 181 LEEYIIEEAIKTLLDNFDNQYGGFGTKPKFPTAHYILFLLRYYYFKK---------DNKV 231

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
             ++  TL  M KGGI DH+G GF RYS D +W VPHFEKMLYD   L+  Y +A+  TK
Sbjct: 232 LDVINKTLTSMYKGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALLSMTYTEAYEATK 291

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           +  +  I   +L+Y+++ M    G  +SAEDADS   EG     EG FY+WT +E+ DIL
Sbjct: 292 NPLFKDITEKVLNYVKKSMTSEKGGFYSAEDADS---EGV----EGKFYLWTKEEIMDIL 344

Query: 458 G-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
           G E   L+ + Y +   GN            F+ KN+   +N            LEK   
Sbjct: 345 GEEEGELYCKIYNITSKGN------------FENKNIANLINTDLKIVDNNKDKLEK--- 389

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
                R KLF+ R KR  P+ DDK++ SWN L+I +F++A + LK++             
Sbjct: 390 ----IREKLFEYREKRIHPYKDDKILTSWNALMIVAFSKAGRSLKND------------- 432

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
               Y+E+A+ +A+FI  +L DE+   L    R G     GF+DDYAF +  L++LYE  
Sbjct: 433 ---NYIEIAKKSANFIIENLMDEKG-TLYARIREGERGNEGFIDDYAFFLWALIELYEAS 488

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
               +L  +IE+ N+  +LF  +E GG++  +     +L+R KE +DGA PSGN+V+ + 
Sbjct: 489 FDIYYLEKSIEVANSMIDLFWHKEDGGFYLYSKNSEKLLVRPKEIYDGATPSGNAVASLT 548

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVG 756
           L  L  I      D Y+   +     F   +K   M   L    A M +V   K + L  
Sbjct: 549 LNLLYYITG---EDRYKDLVDKQFKFFAANIKSGPM-YHLFSVMAYMYNVLPIKEITLTY 604

Query: 757 HKSSVDFENMLAAAHASY 774
            +   DF   +   +  Y
Sbjct: 605 REKDEDFYKFINEVNNRY 622


>gi|153003852|ref|YP_001378177.1| hypothetical protein Anae109_0984 [Anaeromyxobacter sp. Fw109-5]
 gi|152027425|gb|ABS25193.1| protein of unknown function DUF255 [Anaeromyxobacter sp. Fw109-5]
          Length = 725

 Score =  470 bits (1209), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 282/697 (40%), Positives = 381/697 (54%), Gaps = 69/697 (9%)

Query: 87  AERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIG 146
           A RT       R   TNRL  E SPYLLQHAHNPV W  WGEEAFAEAR+   P+FLS+G
Sbjct: 31  APRTHHLDGSGRPLFTNRLILERSPYLLQHAHNPVSWRPWGEEAFAEARRTGRPVFLSVG 90

Query: 147 YSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPL 206
           YSTCHWCHVME ESFEDE +A++LN+ +V IKVDREERPDVD +YMT VQ L GGGGWP+
Sbjct: 91  YSTCHWCHVMEGESFEDEEIARVLNERYVPIKVDREERPDVDGLYMTAVQLLTGGGGWPM 150

Query: 207 SVFLSPDLKPLMGGTYFPPED-KYGRP-GFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS 264
           SV+L+P+ +P  GGTYFP  D   G P GF +ILR++ D + +    +  + +  +  + 
Sbjct: 151 SVWLTPEKEPFFGGTYFPARDGDRGAPRGFLSILRELADLYARDAGRVQAATSSLVGAVR 210

Query: 265 EALSASAS-SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHS 322
            AL+     +  +P     + L         ++D+  GG   APKFP  + ++ +L YH 
Sbjct: 211 AALAPRGEPAASVPG---ADVLEAAFRGFRDAFDAAHGGLRGAPKFPSSLPVRFLLRYHR 267

Query: 323 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 382
           +  E        +E  +M   TL+ MA GG+HD +GGGFHRYS D  W VPHFEKMLYD 
Sbjct: 268 RARE--------AEALRMATVTLERMAAGGLHDQIGGGFHRYSTDATWLVPHFEKMLYDN 319

Query: 383 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 442
             LA  Y +A+ +T     + + R  LDYL R+M  P G ++SA DADS   EG    +E
Sbjct: 320 ALLAVAYAEAWQVTGRRELARVVRQTLDYLGREMTSPEGGLYSATDADS---EG----EE 372

Query: 443 GAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 502
           G F+VW + E+   LG  A  F   +     GN            F+G+NVL        
Sbjct: 373 GRFFVWDAAELRQRLGADAERFMRFHGATDAGN------------FEGRNVL-------- 412

Query: 503 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 562
               +  P E     L   R  L+  R +RPRP  D+K++  WNGL IS+ A   ++L  
Sbjct: 413 ---HVPRPDEDEWEALAPQRALLYAAREERPRPLRDEKILAGWNGLAISALAFGGRVLGE 469

Query: 563 EAESAMFNFPVVGSDRKEYMEVAESAASFI-RRHLYDEQTHRLQHSFRNGPSKAPGFLDD 621
           E                 Y++ A SAA F+  R + D    RL+ ++ +G +  PGFLDD
Sbjct: 470 E----------------RYVKAAASAAEFVLGRMIVD---GRLRRAWLDGAAGVPGFLDD 510

Query: 622 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 681
           +AF+  GLLDLYE     +WL  A+EL    + LF D  GG +F T  +   +L R K  
Sbjct: 511 HAFVAQGLLDLYEATFDARWLEAAVELSERLEVLFGDPRGGAWFGTAADHERLLAREKPT 570

Query: 682 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 741
           HDGAEPSG SV+++N +RL++    +  D +R  AE +L  +   L +   A   M  A 
Sbjct: 571 HDGAEPSGASVALVNALRLSAF---TTDDRWRVRAEGALRHYGRALAEHPSAFTEMLLAV 627

Query: 742 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNK 778
           D  +  +R+ VVLV  +     E  LA    S+  N+
Sbjct: 628 DFATDVARE-VVLVWPEEGPSPEPFLAVLRRSFLPNR 663


>gi|403068246|ref|ZP_10909578.1| hypothetical protein ONdio_01469 [Oceanobacillus sp. Ndiop]
          Length = 685

 Score =  470 bits (1209), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 279/687 (40%), Positives = 379/687 (55%), Gaps = 61/687 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           N +TNRL  E SPYLLQHA NPV+W+ WG+EAF  A+  + PIFLSIGYSTCHWCHVM  
Sbjct: 3   NDNTNRLIHEKSPYLLQHARNPVNWYPWGKEAFERAKLENKPIFLSIGYSTCHWCHVMAH 62

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFED  VA+LLN  ++SIKVDREERPD+D VYM   Q + G GGWPL++ ++PD  P  
Sbjct: 63  ESFEDPEVAELLNAHYISIKVDREERPDIDSVYMKVCQMMTGHGGWPLTIMMTPDKVPFY 122

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASA---SSNK 275
            GTYFP E K+G PG    L ++   + K  D +A+      E ++ AL  S    S N+
Sbjct: 123 AGTYFPKESKHGMPGILEALSQLHKKYTKDPDHIAE----VTESVTAALQKSVTEKSENR 178

Query: 276 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
           L  E  + A R    QL+K++D  +GGFG APKFP+P  +  +L H     +T       
Sbjct: 179 LTSESTEKAYR----QLAKNFDFSYGGFGPAPKFPQPQNLFFLLKHYHFTGNTS------ 228

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
              KMV  TLQ MA GGI DH+G GF RYS DE+W VPHFEKMLYD   L  VY + + +
Sbjct: 229 -ALKMVESTLQSMASGGIWDHIGYGFSRYSTDEKWLVPHFEKMLYDNALLLMVYTECYQI 287

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           TK+ FY  I   I+ ++ R+M    G  +SA DADS   EG     EG +YVW ++E+ D
Sbjct: 288 TKNPFYRQISEQIIAFVSREMTSSDGAFYSAIDADS---EGI----EGKYYVWRNEEIYD 340

Query: 456 ILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS-SASASKLGMPLEK 513
           +LGE    L+ + Y + P GN            F+GKN+   +N S   +A   GM L  
Sbjct: 341 VLGEELGELYSDIYGITPFGN------------FEGKNIPNLINTSLEKTAKDNGMSLAN 388

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
             + L   R KL   R KR  PH+DDKV+ +WNGL++++ A+A K L ++          
Sbjct: 389 LHSHLETARSKLLLAREKRTYPHVDDKVLTAWNGLMVAALAKAGKALANDT--------- 439

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
                  Y+E A  A  FI + LY  Q +RL   FR+G +K   ++DDYAFL+ G ++LY
Sbjct: 440 -------YIEKANRAIQFIEKKLY--QGNRLMARFRDGEAKFKAYIDDYAFLLWGYIELY 490

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           E    T++L  A+ L     ELF D   GG++    +   ++ + KE +DGA PSGNS +
Sbjct: 491 EATYSTEYLQKAMALIEQMTELFWDEANGGFYFNGKDSEELISKEKEIYDGAIPSGNSTA 550

Query: 694 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVV 753
            + L R+A +   +    Y    E     F       A A      +  +   P+ K VV
Sbjct: 551 ALMLTRMAYLTGETA---YLDKTEEMYFTFYEDTHQYASASAFFMQSLFVTENPA-KEVV 606

Query: 754 LVGHKSSVDFENMLAAAHASYDLNKTV 780
           ++G       + +LA    +Y  N TV
Sbjct: 607 ILGRSDDPARQKLLAKLQEAYIPNVTV 633


>gi|168178477|ref|ZP_02613141.1| conserved hypothetical protein [Clostridium botulinum NCTC 2916]
 gi|182670724|gb|EDT82698.1| conserved hypothetical protein [Clostridium botulinum NCTC 2916]
          Length = 680

 Score =  469 bits (1208), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 266/675 (39%), Positives = 371/675 (54%), Gaps = 62/675 (9%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRL  E SPYLLQHAHNPVDW+ WGEEAF +A+  D P+FLSIGYSTCHWCHVME ESF
Sbjct: 6   TNRLINEKSPYLLQHAHNPVDWYPWGEEAFEKAKIEDKPVFLSIGYSTCHWCHVMERESF 65

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           EDE VA++LN  F+SIKVDREERPD+D +YM + QA  G GGWPL++ ++PD KP   GT
Sbjct: 66  EDEEVAEVLNKNFISIKVDREERPDIDSIYMNFCQAYTGSGGWPLTIIMTPDKKPFFAGT 125

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP   KY  PG   ILR + + W + ++ + +S    +EQ+          N    EL 
Sbjct: 126 YFPKWGKYNVPGIMDILRSISNLWREDKNKILESSNRILEQIER-----FQDNHREGELE 180

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQK 339
           +  +    + L  ++D+++GGFG+ PKFP    I  +L  Y+ KK           +   
Sbjct: 181 EYIIEEAIKTLLDNFDNQYGGFGTYPKFPTAHYILFLLRYYYFKK---------DKKILD 231

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           +V  TL  M KGGI DH+G GF RYS D +W VPHFEKMLYD   L+  Y +A+  TK+ 
Sbjct: 232 IVNKTLTSMYKGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALLSMAYTEAYEATKNP 291

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 459
            +  I   +L+Y+++ M    G  +SAEDADS   EG     EG FY+WT +E+ DILGE
Sbjct: 292 LFKDITEKVLNYVKKSMTSEKGGFYSAEDADS---EGV----EGKFYLWTKEEIMDILGE 344

Query: 460 HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
                 E Y       C +  ++   N F+ KN+   +N            LEK      
Sbjct: 345 EE---GEFY-------CKIYDITSKGN-FENKNIANLINTDLKIVDNNKDKLEK------ 387

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
             R KLF+ R KR  P+ DDK++ SWN L+I +F++A + LK++                
Sbjct: 388 -IREKLFEYREKRIHPYKDDKILTSWNALMIVAFSKAGRSLKND---------------- 430

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
            Y+E+A+ +A+FI  +L DE+   L    R G     GF+DDYAF +  L++LYE     
Sbjct: 431 NYIEIAKKSANFIIENLMDEKG-TLYARIREGERGNEGFIDDYAFFLWALIELYEASFDI 489

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
            +L  +IE+ N+  +LF  +E GG++  +     +L+R KE +DGA PSGN+V+ + L  
Sbjct: 490 YYLEKSIEVANSMIDLFWHKEDGGFYLYSKNSEKLLVRPKEIYDGATPSGNAVASLTLNL 549

Query: 700 LASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKS 759
           L  I      D Y+   +     F T +K   M   L    A M ++   K + L  ++ 
Sbjct: 550 LYYITG---EDRYKDLVDKQFKFFATNIKSGPM-YHLFSVIAYMYNISPVKEITLAYNEK 605

Query: 760 SVDFENMLAAAHASY 774
             DF   +   +  Y
Sbjct: 606 DEDFYKFINELNNRY 620


>gi|452845430|gb|EME47363.1| hypothetical protein DOTSEDRAFT_41782 [Dothistroma septosporum
           NZE10]
          Length = 734

 Score =  469 bits (1208), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 268/640 (41%), Positives = 358/640 (55%), Gaps = 36/640 (5%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NR     SPY+  H  NP  W  W  E    AR+ +  +F+SIGYS CHWCHVM  ESF+
Sbjct: 15  NRCGESKSPYVRSHMDNPTAWQLWTPETLDLARQTNRLLFVSIGYSACHWCHVMAHESFD 74

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  +A+LLN++FV IK+DREERPD+D+ YM ++QA  GGGGWPL+VF++PDL+P+ GGTY
Sbjct: 75  DPRIAQLLNEYFVPIKIDREERPDIDRQYMDFLQATSGGGGWPLNVFVTPDLEPIFGGTY 134

Query: 223 FP----PEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE-----ALSASASS 273
           +P       + G   F+ IL KV   W ++ + L  SG    +QL E      +      
Sbjct: 135 WPGPRSDRAQMGGTTFEDILLKVSSMWKEQEERLRASGKEITKQLREFAQEGHIGGRDGK 194

Query: 274 NKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLY---HSKKLEDTGK 330
               D L  + L    +   K YD +FGGFG+APKFP PV I+ +L+   + K++ +   
Sbjct: 195 GDDNDGLELDLLDDAFQHYKKRYDRKFGGFGAAPKFPTPVHIRPLLHVACYPKEVREIVG 254

Query: 331 SGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYL 390
             E+ E + M + +L+ MAKGGI D +G GF RYSV   W +PHFEKMLYD  QL  VYL
Sbjct: 255 EDESIEVRAMAVKSLENMAKGGIKDQIGHGFARYSVTRDWSLPHFEKMLYDNAQLLPVYL 314

Query: 391 DAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 449
           +A+ LTK   +     DI  YL    M    G I SAEDADS  T     K+EGA+YVWT
Sbjct: 315 EAYMLTKSQLFLETTHDIAKYLTSAPMASDLGGICSAEDADSLPTAIDHHKREGAYYVWT 374

Query: 450 SKEVEDILGEHAILFKEHYY-LKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 508
             E + IL +  +     Y+ +K  GN D  +  D   E  G+N L   ++ +  A +L 
Sbjct: 375 MDEFKKILTDEEVKVCSAYWGVKSEGNID--KQHDIQGELVGQNTLCVQHEPAELARELS 432

Query: 509 MPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 567
           M  E     L   R KL   R K RPRP LDDK++ SWNGL +   ARA          A
Sbjct: 433 MSEEDVKRTLANGREKLLAYRQKDRPRPALDDKIVTSWNGLAVGGLARA---------GA 483

Query: 568 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 627
               P       EY+  AE A + IR  L+DE+   L+  +R GP +  GF DDYAFLIS
Sbjct: 484 ALGVP-------EYIAAAEKAVNCIRAQLFDEKAKTLKRVYREGPGETQGFADDYAFLIS 536

Query: 628 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 687
           GLLDLYE    ++WL +A  LQ TQ +LF D E  G+F+T    P +L R K+  D AEP
Sbjct: 537 GLLDLYESTFDSQWLEFADILQQTQTKLFWDEEKFGFFSTPANQPDILFRTKDAMDNAEP 596

Query: 688 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 727
           S N VS +NL RL S++  +    Y +  + ++A F+  +
Sbjct: 597 SVNGVSAMNLFRLGSLLYDAT---YEKMGKRTVAAFDVEI 633


>gi|86157370|ref|YP_464155.1| hypothetical protein Adeh_0943 [Anaeromyxobacter dehalogenans
           2CP-C]
 gi|85773881|gb|ABC80718.1| protein of unknown function DUF255 [Anaeromyxobacter dehalogenans
           2CP-C]
          Length = 718

 Score =  469 bits (1207), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 277/660 (41%), Positives = 383/660 (58%), Gaps = 67/660 (10%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           + TNRLA E SPYLLQHAHNPV W+AWG+EAF EAR+   P+FLS+GYSTCHWCHVME E
Sbjct: 37  RFTNRLALERSPYLLQHAHNPVSWWAWGDEAFEEARRTGRPVFLSVGYSTCHWCHVMERE 96

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFEDE +A++LN+ +V+IKVDREERPDVD VYMT VQ L G GGWP+SV+L+PD +P  G
Sbjct: 97  SFEDEEIARVLNERYVAIKVDREERPDVDAVYMTAVQLLTGSGGWPMSVWLTPDREPFFG 156

Query: 220 GTYFPPEDKYGRP--GFKTILRKVKDAWDKKRDML-AQSGAFAIEQLSEALSASASSNKL 276
           GTYFPP D    P  G  +IL ++ D W +  D + + +GA      +    A  ++  +
Sbjct: 157 GTYFPPRDGVRGPARGLLSILHEIADLWARDPDRIRSATGALVEAVRTALAPAGPAAADV 216

Query: 277 PDELP-QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
           P   P ++A+ L    L +S+D R GG   APKFP  V ++++L H +      ++GE  
Sbjct: 217 PGPEPIEHAVTL----LERSFDERHGGLRRAPKFPSNVPVRLLLRHHR------RTGE-E 265

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
              +M   TL+ MA GG+HD VGGGFHRYS D +W VPHFEKMLYD   LA  Y +A+  
Sbjct: 266 RSLRMATVTLERMAAGGLHDQVGGGFHRYSTDAQWLVPHFEKMLYDNALLAVAYAEAWQA 325

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           T    ++ + R  LDYL R++  P G ++SA DADS   EG    +EG F+ WT  E+ +
Sbjct: 326 TGRRDFARVTRQTLDYLLRELTSPEGGLYSATDADS---EG----EEGRFFTWTEAELRE 378

Query: 456 ILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
            LG+ A  F   + ++P GN            F+G+NVL            +  P E   
Sbjct: 379 ALGDRAEAFLRFHGVRPEGN------------FEGRNVL-----------HVPAPDEDAW 415

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
                 R  L+ +R +RPRP  D+KV+  WNGL IS+ A   ++L SEA           
Sbjct: 416 ESFAPDRAALYALRERRPRPLRDEKVLAGWNGLAISALALGGRVL-SEA----------- 463

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
                +++ A  AA F+   +  +   RLQ S+  G +  P +L+D+AFL+ GLLDL+E 
Sbjct: 464 ----RWVDAAARAADFVLTRMVKDG--RLQRSWLAGRAGVPAYLEDHAFLVQGLLDLHEA 517

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
               +WL  A++L   QD LF D  GGG+F +  +   +L R K  HDGAEPSG SV+ +
Sbjct: 518 SFDPRWLRSALQLAEAQDRLFGDPAGGGWFQSATDHERLLAREKPTHDGAEPSGASVAAL 577

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 755
           N +RL +  +  +   +R+ A+ +L      L +  +A+  +  A D  S   R+ VVLV
Sbjct: 578 NALRLEAFTSDPR---WRRAADGALRHHARTLAEQPLAMSELLLALDFASDAVRE-VVLV 633


>gi|424826571|ref|ZP_18251427.1| hypothetical protein IYC_01504 [Clostridium sporogenes PA 3679]
 gi|365980601|gb|EHN16625.1| hypothetical protein IYC_01504 [Clostridium sporogenes PA 3679]
          Length = 682

 Score =  469 bits (1207), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 268/679 (39%), Positives = 371/679 (54%), Gaps = 66/679 (9%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K TNRL  E SPYLLQHAHNPVDW+ WGEEAF +A+  D P+FLSIGYSTCHWCHVME E
Sbjct: 7   KKTNRLIKEKSPYLLQHAHNPVDWYPWGEEAFEKAKIEDKPVFLSIGYSTCHWCHVMERE 66

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFEDE VA++LN+ F+SIKVDREERPDVD +YM++ QA  G GGWPL++ ++PD KP   
Sbjct: 67  SFEDEDVAEILNNNFISIKVDREERPDVDNIYMSFCQAYTGSGGWPLTILMTPDKKPFFA 126

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           GTYFP   KY  PG   IL+ +   W + +  + +S    +EQ+          N   DE
Sbjct: 127 GTYFPKWGKYNIPGIMDILKSINKLWHEDKSKILESSNRILEQIER-----FQDNHGEDE 181

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEG 337
           L +  +   A+ L  ++DS++GGFG+ PKFP    I  +L  Y+ KK E           
Sbjct: 182 LEEYIIEEAAQTLIDNFDSKYGGFGTKPKFPTAHYILFLLRYYYFKKDEKV--------- 232

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
             ++  TL  M KGGI DH+G GF RYS D +W VPHFEKMLYD   L+  Y +A+  TK
Sbjct: 233 LDVINKTLTSMYKGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALLSMAYTEAYEATK 292

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           +  Y  +   IL+Y+++ M    G  +SAEDADS   EG     EG FY+WT KE+ DIL
Sbjct: 293 NPLYKVVTEKILNYVKKSMTSEEGGFYSAEDADS---EGV----EGKFYLWTKKEIIDIL 345

Query: 458 GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGMPLEKYL 515
           GE    F           C L  ++   N F+ KN+  LI+ +      +K         
Sbjct: 346 GEEDGAFY----------CKLYDITSRGN-FENKNIANLIQTDLKDVDNNK--------- 385

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
           + L   R KLF+ R KR  PH DDK++ SWN L+I +F RA +  K++            
Sbjct: 386 DKLERIREKLFEYREKRIHPHKDDKILTSWNALMIIAFCRAGRSFKND------------ 433

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
                Y+++A+ +A FI ++L DE    L    R+      GF+DDYAF +  L++LYE 
Sbjct: 434 ----NYIDIAKQSADFIIKNLMDENG-TLYARIRDEERGNEGFIDDYAFFLWALIELYEA 488

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
                +L  +IE+ ++  +LF  +E GG++  +     +++R KE +DGA PSGN+V+ +
Sbjct: 489 SFDIYYLEKSIEVADSMIDLFWHKEKGGFYLYSKNSEKLIVRPKEIYDGAMPSGNAVASL 548

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 755
            L  L  I      D Y+   +     F   +K   M   L    A M +V   K + L 
Sbjct: 549 ALSLLYYITG---EDKYKNLVDEQFKFFAANIKSGPM-YHLFSVMAYMYNVSPVKEITLA 604

Query: 756 GHKSSVDFENMLAAAHASY 774
            ++    F   +   +  Y
Sbjct: 605 YNEKDEAFYEFINEFNNRY 623


>gi|296415498|ref|XP_002837423.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295633295|emb|CAZ81614.1| unnamed protein product [Tuber melanosporum]
          Length = 773

 Score =  469 bits (1207), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 260/639 (40%), Positives = 367/639 (57%), Gaps = 53/639 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCH--WCHVMEVES 160
           N+L    SPY+  HA+NPV W  W EE    A+K +  +F+SIGY+ CH  +  VME ES
Sbjct: 60  NQLLKSQSPYVRGHAYNPVRWQLWNEETLELAKKNNRIVFVSIGYAACHCEYTIVMERES 119

Query: 161 FEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGG 220
           FE+E +A++LN+ F+ IK+DREERPD+D++YM +VQA  G GGWPL+VFL+PDL+P+ GG
Sbjct: 120 FENEEIARILNENFIPIKIDREERPDIDRIYMNFVQATTGSGGWPLNVFLTPDLQPVFGG 179

Query: 221 TYFPPEDKYG----RPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS--ASSN 274
           TY+P     G    + GF  +LRK+ + W ++ +    S +  + QL E        +  
Sbjct: 180 TYWPGPSAVGGMKDQLGFLEVLRKIANVWKEQHERCVASASDILNQLKEFTDEGLKGTGG 239

Query: 275 KLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---YHSKKLEDTGKS 331
           +  D L  + L    +     YD  +GGFG+APKFP PV +  +L        ++D    
Sbjct: 240 EPGDGLELDLLEEAYQHFMARYDPLYGGFGNAPKFPTPVNLAFLLRLGTFPATVQDIVGE 299

Query: 332 GEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLD 391
            E    + MV+ TLQ MAKGGIHDH+G GF RYSV   W++PHFEKMLYDQ QL ++Y+D
Sbjct: 300 MECENAKSMVIDTLQGMAKGGIHDHIGHGFSRYSVTANWNLPHFEKMLYDQAQLLSIYID 359

Query: 392 AFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSAETEGATRKKEGAFYVWTS 450
           A+ +TK         DI +Y+  D +  P G  +S+EDADS   +  T K+EGAFYVWT 
Sbjct: 360 AWLVTKSPAMLEAANDIAEYMCLDALKSPDGAFYSSEDADSLYRKADTEKREGAFYVWTR 419

Query: 451 KEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 509
           KE + +LGE  A +   ++ +   GN D +  +DPH+EF  +NVL   +     +   GM
Sbjct: 420 KEFDVMLGEQDASICARYWNVHRDGNVDPA--NDPHDEFIAQNVLSVASTPEKLSKMYGM 477

Query: 510 PLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 568
             E+  NI+   R+KL   R K RPRP+LDDK++ +                        
Sbjct: 478 SAERITNIISSARQKLLQHRLKERPRPNLDDKIVTT------------------------ 513

Query: 569 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 628
                     + Y + AE A SFIR++LYDE+T  L+  +R+GP +A GF DDYAFLISG
Sbjct: 514 ----------QLYKKNAEEAISFIRKNLYDEKTGILKRVYRDGPGEADGFADDYAFLISG 563

Query: 629 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 688
           LL +YE     ++L WA  LQ  Q + F D E GG+F+T+     ++LR+K+  D  EPS
Sbjct: 564 LLCMYEATFDVEYLQWADALQQKQIDAFWDAENGGFFSTSEGASDLILRLKDGLDSQEPS 623

Query: 689 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 727
            N VS  NL RL +++   K + Y   A+ + + F T L
Sbjct: 624 TNGVSANNLFRLGTLLGDPKLEEY---AQQTCSAFSTEL 659


>gi|254442730|ref|ZP_05056206.1| conserved hypothetical protein [Verrucomicrobiae bacterium DG1235]
 gi|198257038|gb|EDY81346.1| conserved hypothetical protein [Verrucomicrobiae bacterium DG1235]
          Length = 727

 Score =  469 bits (1207), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 269/677 (39%), Positives = 373/677 (55%), Gaps = 57/677 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL    SPYLLQHA NPVDW+ WG EAF +A   +  +F+SIGYSTCHWCHVM  ESF 
Sbjct: 26  NRLVDSQSPYLLQHADNPVDWYPWGPEAFEKAEAENKLVFISIGYSTCHWCHVMNRESFS 85

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE +A  LN+ +V IK+DREERPD+D VYMT+VQ L G GGWPL+V+LSPD KP  GGTY
Sbjct: 86  DEEIAAYLNEHYVCIKIDREERPDIDNVYMTFVQNLTGNGGWPLNVWLSPDKKPFFGGTY 145

Query: 223 FPPEDKYGRP-GFKTILRKVKDAW-DKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           FPP D   R  GF  +++++ D W      +LA+S +  ++ L++  + + ++N      
Sbjct: 146 FPPRDDPSRGRGFLPLIQEINDFWIQDPTGVLARSQSI-VDTLNQHSAQTLAANS----- 199

Query: 281 PQNALRLCAEQLSKS-------YDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGE 333
            +NA  L  E+LS+S       +D +  GFG+  KFP P  + ++L  +   E      +
Sbjct: 200 -ENAASL--ERLSESITAFLFIFDEQNKGFGNDQKFPSPNTLSLLLRAAATPE--LHQED 254

Query: 334 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
            S  +++ L TL  M  GGI DH+GGGFHRY+VD  W +PHFEKMLYDQ  +A+  +DA+
Sbjct: 255 RSLAKRLALETLDAMLAGGIRDHLGGGFHRYTVDAGWQLPHFEKMLYDQALIASALVDAY 314

Query: 394 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
            LT +  Y     + LDY+ RD+    G ++SAEDA+S + + +  K+EGA+Y WT+ + 
Sbjct: 315 QLTGEARYRQAATETLDYVLRDLRHENGGLYSAEDAESLDPDKSFAKREGAYYTWTTADF 374

Query: 454 EDILG--EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPL 511
           E +    E       H+ L+P GN        P   F G N L    D+     +L   L
Sbjct: 375 ERLFPHEEKRAGLAAHFSLRPAGNAPYGNF--PREIFAGYNTLRINPDAKIDPDQLAADL 432

Query: 512 EKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNF 571
                        L   RS R RPHLDDK+I SWNGL IS+ ARA  +            
Sbjct: 433 A-----------TLRQDRSTRARPHLDDKIITSWNGLAISALARAGLVF----------- 470

Query: 572 PVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLD 631
                +R +Y   A+ AA+F+  +LY  ++ +L   +R   S    F +DYA+LI+GLLD
Sbjct: 471 -----NRPDYTNAAQQAANFLLENLYQPESQQLLRLYRQDASPVAAFAEDYAYLIAGLLD 525

Query: 632 LYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNS 691
           LYE  +  +WL  A ELQ  Q++ F D E GGYF     D  V  R K+  D A PS NS
Sbjct: 526 LYEADADHRWLQKAHELQLAQNQRFADTENGGYFLFEASDDIVFNRTKQAADTAIPSPNS 585

Query: 692 VSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRK- 750
           VS  NL RLA     +    ++Q A  ++  F  +L      +P +  A  +L V  +  
Sbjct: 586 VSAKNLARLAQFFDDAS---FQQQASQTINAFAPQLDSSGTTLPTLREA--ILFVGKKPL 640

Query: 751 HVVLVGHKSSVDFENML 767
            +V+ G   +   + ML
Sbjct: 641 QIVIAGDPQTASAQAML 657


>gi|224368664|ref|YP_002602826.1| hypothetical protein HRM2_15540 [Desulfobacterium autotrophicum
           HRM2]
 gi|223691380|gb|ACN14663.1| conserved hypothetical protein [Desulfobacterium autotrophicum
           HRM2]
          Length = 766

 Score =  469 bits (1206), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 261/647 (40%), Positives = 385/647 (59%), Gaps = 46/647 (7%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K+TNRL  E SPYLLQHAHNPV+W+ WG+EAF  ARK + P+FLS+GY+TCHWCHVME E
Sbjct: 61  KYTNRLFLESSPYLLQHAHNPVNWYPWGDEAFETARKLNRPVFLSVGYATCHWCHVMEEE 120

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFE+E +A+ LN+ ++ +KVDREERPD+D +YM+ VQAL G GGWP++V+L+ D KP  G
Sbjct: 121 SFENEEIARYLNENYLCVKVDREERPDIDSIYMSAVQALTGRGGWPMNVWLTCDRKPFYG 180

Query: 220 GTYFPPE--DKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLP 277
           GTYFPP   D+    GF T+L K+  ++  +   +  +G      + + +S    +    
Sbjct: 181 GTYFPPRDGDRGADIGFLTLLEKLIQSFHAQDGRVENAGRQITAAIQQMMSPKPGTRLPG 240

Query: 278 DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEG 337
            E  QNA+        +SYDSRFGG   +PKFP  + ++++L H++   +  K  + +  
Sbjct: 241 KETIQNAVSF----YRQSYDSRFGGLSGSPKFPSSLPVRLLLRHNRNTFE--KVKQDTNI 294

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
            +M+  +L  MA GG++DHVGGGFHRYS DE W VPHFEKMLYD   LA VYL+A+  T 
Sbjct: 295 LEMIDHSLAQMAGGGMYDHVGGGFHRYSTDEHWLVPHFEKMLYDNALLAVVYLEAWQATD 354

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           +  +  +  +IL Y+ +DM    G  +SA DADS    G    +EG ++ WT +E++ IL
Sbjct: 355 NADFKRVVNEILSYVIQDMTSADGAFYSATDADSITPRG--HMEEGWYFTWTPEELDAIL 412

Query: 458 G-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
           G E++ + K +Y +  T N            F+ +++L      + +AS L +  EK   
Sbjct: 413 GKENSKIIKRYYSVGVTPN------------FEKRHILHTTKSRAETASALNITEEKLAK 460

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
           I+   R  L+  R+KRP P  D+KV+ +WN L+IS+FARA   L +              
Sbjct: 461 IIETSRELLYLERNKRPAPLRDEKVLTAWNALMISAFARAGFTLNNTV------------ 508

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
               Y++ A  AA FI  +LY +  +RL  S+++G ++   +L+DYAF I+ L+DLYE  
Sbjct: 509 ----YIDQAVRAARFIMENLYID--NRLFRSYKDGKARHNAYLEDYAFFIAALIDLYEAT 562

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
              +WL  A+EL +     + DR+ G +F T+ +  +++ R K  +D A PSGN+++++N
Sbjct: 563 HDIEWLKKALELDDVLKTFYEDRKNGAFFMTSSDHEALISREKPYYDNATPSGNAIAILN 622

Query: 697 LVRLASIVAGSKSDY-YRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 742
           L+RL S      +DY Y+Q AE +L  F  RL     A+  M  A D
Sbjct: 623 LLRLHSFT----TDYRYKQRAEKALKFFSERLNTAPSALSEMLLAID 665


>gi|410721128|ref|ZP_11360472.1| N-acylglucosamine 2-epimerase [Methanobacterium sp. Maddingley
           MBC34]
 gi|410599579|gb|EKQ54125.1| N-acylglucosamine 2-epimerase [Methanobacterium sp. Maddingley
           MBC34]
          Length = 708

 Score =  469 bits (1206), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 273/693 (39%), Positives = 376/693 (54%), Gaps = 49/693 (7%)

Query: 86  MAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSI 145
           + +     +S    K  N L  E SPYLLQHA NPVDW+ WG+EAF +A+K D PIFLSI
Sbjct: 3   IGDNMSQKSSPESGKTQNHLKDEKSPYLLQHADNPVDWYPWGDEAFDKAKKEDKPIFLSI 62

Query: 146 GYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWP 205
           GYSTCHWCHVM  ESF+D  +  LLN  FV +KVDREERPD+D VYMT  Q + G GGWP
Sbjct: 63  GYSTCHWCHVMARESFQDPEIGDLLNQVFVPVKVDREERPDIDSVYMTVCQMITGSGGWP 122

Query: 206 LSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSG---AFAIEQ 262
           L++ ++PDLKP   GTYFP +      G + ++  V D W+ KR+ L +S      +++Q
Sbjct: 123 LTIIMTPDLKPFFAGTYFPKDTGPRGTGLRDLILNVHDLWENKREDLLKSAEDLTLSLQQ 182

Query: 263 LSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS 322
           +S       S +K  ++L    L    +   +++D  + GFG+  KFP P  +  +L + 
Sbjct: 183 ISH-----RSPDKSGEQLNDGILNQTYQSQLENFDQEYAGFGTNQKFPTPHHLLFLLRYW 237

Query: 323 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 382
           K       +GE  E   MV  TL  M KGGI+DHVG GFHRY+VD +W VPHFEKMLYDQ
Sbjct: 238 K------HTGE-DEALTMVEKTLDAMRKGGIYDHVGFGFHRYTVDRKWVVPHFEKMLYDQ 290

Query: 383 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 442
             L   Y +AF  T    Y     ++L+YL RDM  P    +SAEDADS   EG    +E
Sbjct: 291 ALLVIAYTEAFQATGKTKYRETAEEVLEYLLRDMRSPEDGFYSAEDADS---EG----EE 343

Query: 443 GAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 501
           G FY+WT  E+ +ILG E   LF   Y +   GN       +   E  GKN+L       
Sbjct: 344 GKFYLWTLDEIINILGPEEGELFSRVYSVSENGNFK----DEATGEKTGKNILHRSQTWD 399

Query: 502 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 561
             + KL M  E+        R  LF  R  R  PH DDK++  WNGLVI + A A K+  
Sbjct: 400 ELSKKLEMSPEELWWKTESARETLFQAREGRVHPHKDDKILTDWNGLVIVALALAGKVFG 459

Query: 562 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 621
                           R++Y+  A  A +FI   +   Q  RL H +R+G +   G LDD
Sbjct: 460 ----------------REDYLLAATEAVNFIMTKI--NQQGRLHHRWRDGEAAVDGNLDD 501

Query: 622 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 681
           YA+LI GLL+LY+    +++L  A++L  T  E F D + GG++ T+   P +L+R KE 
Sbjct: 502 YAYLIWGLLELYQATFNSEYLKTALKLNQTILEHFWDHDNGGFYFTSDYAPEILVRQKEA 561

Query: 682 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 741
           +D A PSGNSV ++NL +L  I      D + +   ++L  + + + + + +   M  +A
Sbjct: 562 YDTALPSGNSVMMMNLEKLYLIT----EDIHIREISNALEKYFSPMIEQSPSAFTMFLSA 617

Query: 742 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASY 774
            +L       + + G K S D + ML A +  Y
Sbjct: 618 IILKRGPSFKIAITGEKDSADTKAMLNALYKKY 650


>gi|333987397|ref|YP_004520004.1| hypothetical protein MSWAN_1186 [Methanobacterium sp. SWAN-1]
 gi|333825541|gb|AEG18203.1| hypothetical protein MSWAN_1186 [Methanobacterium sp. SWAN-1]
          Length = 700

 Score =  469 bits (1206), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 277/689 (40%), Positives = 380/689 (55%), Gaps = 46/689 (6%)

Query: 92  ASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCH 151
           +S  +   K  N L  E SPYL+QHA NPVDW+ WG+EAF +A K D PIFLSIGYSTCH
Sbjct: 3   SSQENDPKKGYNHLKNEKSPYLIQHADNPVDWYPWGDEAFKKAEKEDKPIFLSIGYSTCH 62

Query: 152 WCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLS 211
           WCHVM  ESFED  VA+L+N+ FV +KVDREERPDVD++YM   Q + G GGWPL++ ++
Sbjct: 63  WCHVMAHESFEDPEVAELINEVFVPVKVDREERPDVDRIYMDVCQIMTGTGGWPLTIIMT 122

Query: 212 PDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASA 271
           PD KP   GTYFP E +YG  G K ++  V++ W + R  +  SG    EQ+   L    
Sbjct: 123 PDKKPFFAGTYFPKESRYGSTGLKDLILNVEEIWKENRKDVLNSG----EQVFRVLK-DV 177

Query: 272 SSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKS 331
           SS     E+    L    + LSK++D  +GGFG   KFP P  +  +L + K+   TG  
Sbjct: 178 SSTPRGGEIEAKILEKTYDTLSKTFDYEYGGFGDFQKFPTPHNLMFLLRYWKR---TGNK 234

Query: 332 GEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLD 391
                   MV  TL  M  GGI+DH+G GFHRYSVD  W VPHFEKMLYDQ  ++ VY++
Sbjct: 235 NAVH----MVEKTLDSMYMGGIYDHLGFGFHRYSVDPGWVVPHFEKMLYDQALISMVYIE 290

Query: 392 AFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSK 451
           AF  T +  Y  I   I  Y+ R+M  P G  +SAEDAD   TEG     EG FY+WT K
Sbjct: 291 AFQATGNEEYKRIAEQIFKYVFRNMKSPEGGFYSAEDAD---TEGV----EGKFYLWTKK 343

Query: 452 EVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 510
           E+ D L  + A L  + + +K  GN +   +     E  G N+L   +     A  LG+ 
Sbjct: 344 EIFDALDPDEAELICKIFNVKEAGNFEDETIG----EETGANILYLKSSIGELAEGLGIS 399

Query: 511 LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
             +  + L   R KLF  R  R  P  DDK++  WNGL+I++ A+A++            
Sbjct: 400 RRELEDKLETSRMKLFQNRETRVHPQKDDKILADWNGLMITALAKAAQAF---------- 449

Query: 571 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLL 630
                 D  +Y + AE AA+FI   +  E   RL H +R+  +  PG LDD+ F+I GLL
Sbjct: 450 ------DDPKYSKAAEDAANFILDKMCKEG--RLFHRYRDNEAAIPGNLDDHTFMIWGLL 501

Query: 631 DLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGN 690
           +LYE     K+L  A++L     E F D + GG++ T  +   VLL  K+ +DGA PSGN
Sbjct: 502 ELYEAVFNVKYLKKALKLNKILIEHFWDEKDGGFYFTANDSEHVLLWEKQTYDGALPSGN 561

Query: 691 SVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRK 750
           SV + NL++LA I    + +    + E +   F T+++   +       A D    PS +
Sbjct: 562 SVGIFNLIKLARITEDPELERRSIDLERA---FSTQIRRAPIVHTHFLEAIDFKVGPSYE 618

Query: 751 HVVLVGHKSSVDFENMLAAAHASYDLNKT 779
            VV+VG   + D + M+ +  + +  NK 
Sbjct: 619 -VVIVGDPEADDTKKMIQSIRSHFIPNKV 646


>gi|435854108|ref|YP_007315427.1| thioredoxin domain protein [Halobacteroides halobius DSM 5150]
 gi|433670519|gb|AGB41334.1| thioredoxin domain protein [Halobacteroides halobius DSM 5150]
          Length = 681

 Score =  468 bits (1205), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 262/639 (41%), Positives = 369/639 (57%), Gaps = 71/639 (11%)

Query: 86  MAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSI 145
           M E TP           NRLA E SPYLLQHAHNPV+W+ W EEAF +A++ + P+FLSI
Sbjct: 1   MVETTP----------VNRLANEKSPYLLQHAHNPVNWYPWSEEAFKKAQEENKPVFLSI 50

Query: 146 GYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWP 205
           GYSTCHWCHVME ESF D+ VA +LN+ FVSIKVDREERPD+D +YM+  QA+ G GGWP
Sbjct: 51  GYSTCHWCHVMERESFADQEVANVLNENFVSIKVDREERPDIDDIYMSVCQAMTGRGGWP 110

Query: 206 LSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 265
           L+V ++PD +P   GTYFP + K GRPG   IL ++   W  +++ + +S    ++ + +
Sbjct: 111 LTVVMTPDKRPFFAGTYFPKQTKRGRPGLLKILDQITKKWSNQQEKILESSEELVQAIKQ 170

Query: 266 A----LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYH 321
                 +A+ SSN L D+L + A+      L  S+D+++GGFGSAPKFP P  +  +L +
Sbjct: 171 QDMKKQAANFSSNDL-DKLVKEAV----SSLKSSFDAQYGGFGSAPKFPSPHNLMFLLRY 225

Query: 322 SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 381
                  GK     E   +V  TL  M +GGI+DH+G GF RY+ DE+W  PHFEKMLYD
Sbjct: 226 -------GKIHNDQEVLSIVEKTLDSMYQGGIYDHIGYGFSRYATDEKWLAPHFEKMLYD 278

Query: 382 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 441
              L  VYL+ + + +   Y+ I  +IL Y+ RDM    G  +SAEDADS   EG    +
Sbjct: 279 NALLTIVYLEGYQVLEKEIYAKIAEEILAYINRDMTSSKGAFYSAEDADS---EG----E 331

Query: 442 EGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 500
           EG +Y+W   EV++ LG+     F + Y + P GN            F GKN+    N  
Sbjct: 332 EGKYYLWQPGEVKEALGDKLGSQFCQTYNIIPEGN------------FAGKNI---PNLI 376

Query: 501 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 560
                KL +  E       + R+KLF  R KR RP  DDK++ +WNGL+I +FA+A KIL
Sbjct: 377 KTERDKLKINHE-----FRKARKKLFLAREKRVRPAKDDKILTAWNGLMIVAFAKAGKIL 431

Query: 561 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 620
                           D++EY+  A+ AA FI  +L  +   RL   +R G +   G+++
Sbjct: 432 ----------------DKEEYLNYAKEAADFIWDNLIRKDDGRLLARYREGEADYLGYVN 475

Query: 621 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 680
           DYAF I GL++LY+      +L  A+ L       F D+E GG++    +   ++ R K 
Sbjct: 476 DYAFYIWGLIELYQANFNANYLERALILNKDLIHFFWDQEDGGFYLYGSDGEKLITRPKR 535

Query: 681 DHDGAEPSGNSVSVINLVRLASIVAGSK-SDYYRQNAEH 718
             DGA PSGNS++ +NL++L+ +V+  + SD  +Q  E+
Sbjct: 536 VRDGALPSGNSIATLNLLKLSKLVSNQELSDMAQQQFEY 574


>gi|347754417|ref|YP_004861981.1| thioredoxin domain-containing protein [Candidatus
           Chloracidobacterium thermophilum B]
 gi|347586935|gb|AEP11465.1| Thioredoxin domain containing protein [Candidatus
           Chloracidobacterium thermophilum B]
          Length = 691

 Score =  468 bits (1205), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 271/683 (39%), Positives = 378/683 (55%), Gaps = 51/683 (7%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           +  NRL +E SPYLLQHAHNPVDW+ WG EA A A+  D PI LSIGYS CHWCHVME E
Sbjct: 8   QFVNRLISETSPYLLQHAHNPVDWYPWGPEALARAKAEDKPILLSIGYSACHWCHVMEHE 67

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
            FE+  +A L+N+ FV+IKVDREERPD+D +YM  VQ + G GGWPL+VFL+PD +P  G
Sbjct: 68  CFENPSIAALMNELFVNIKVDREERPDLDTLYMNAVQLMTGRGGWPLTVFLTPDGEPFYG 127

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           GTYFPPED+   PGF  ILR V DA+ ++R  + QS A    +L         +  L  E
Sbjct: 128 GTYFPPEDRGRMPGFPRILRSVADAYRQRRQDVRQSIAEITAELRRIHEPLDGARTLSPE 187

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
           +  +A R    +LS  +D   GGFG APKFP  + +  +L + +       +GE     +
Sbjct: 188 ILTDAYR----RLSTRFDHVHGGFGGAPKFPNSMLLSFLLRYWR------LTGEL-HALE 236

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           MV  +L  MA GG++DH+GGGFHRYS D++W VPHFEKMLYD   LA  YL+A+  T   
Sbjct: 237 MVELSLDKMASGGMYDHLGGGFHRYSTDDQWLVPHFEKMLYDNALLARTYLEAWQATGKP 296

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 459
            Y  I  + LDY+ R+M  P G  ++ +DADS   EG    +EG F+VWT +E+  +L E
Sbjct: 297 RYRQIVEETLDYVVREMTAPTGGFYATQDADS---EG----EEGRFFVWTPEEINTLLDE 349

Query: 460 -HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
             A L + ++ +   GN           E  GK VL         A    +  E   ++L
Sbjct: 350 ADADLVRRYFDVTEEGNF----------EGTGKTVLSTPLPLETVARLKEVTPEHLEHVL 399

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
              +R LF+ R +R +P  D+K + +WNGL++ SFARA+ +L                +R
Sbjct: 400 ARAKRILFEAREQRVKPARDEKCLAAWNGLMLYSFARAAAVL----------------ER 443

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
            +Y  VAE  A+F+   +Y +    L  S ++G +K PG+ +DYA    GLL LYE    
Sbjct: 444 DDYRAVAERNAAFVLGTMYVDGI--LYRSHKDGQNKFPGYQEDYACYAEGLLALYEATGN 501

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
            K+   A EL       F D +GGG+F T      ++ RVK+  D A PSGNSV+V  L+
Sbjct: 502 VKYFCAARELTEAMLAQFDDPQGGGFFFTGDRHEQLITRVKDVFDNATPSGNSVAVEVLL 561

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHK 758
           RLA +    +   YR+ AEH L    + +  M      +  A D   + S + +V+VG  
Sbjct: 562 RLALLTGEQR---YRERAEHILQTLSSSMAKMPSGFGQLLGALDFY-LASVREIVIVGPP 617

Query: 759 SSVDFENMLAAAHASYDLNKTVS 781
            + +   +      ++  ++ V+
Sbjct: 618 DAAETRELRRVVEEAFRPHRVVA 640


>gi|398407269|ref|XP_003855100.1| hypothetical protein MYCGRDRAFT_99250 [Zymoseptoria tritici IPO323]
 gi|339474984|gb|EGP90076.1| hypothetical protein MYCGRDRAFT_99250 [Zymoseptoria tritici IPO323]
          Length = 750

 Score =  468 bits (1205), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 272/645 (42%), Positives = 349/645 (54%), Gaps = 35/645 (5%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NR     SPY+  H  NP  W  W  E    ARK +  +F+SIGYS CHWCHVME ESF
Sbjct: 14  NNRCGESKSPYVRSHMDNPTAWQLWSAETLELARKTNRLLFVSIGYSACHWCHVMEHESF 73

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
            D  +A+LLN+ F+ IK+DREERPD+D+ YM ++QA  GGGGWPL+VF++PDL+P+ GGT
Sbjct: 74  SDSRIAQLLNEHFIPIKIDREERPDIDRQYMDFLQATSGGGGWPLNVFVTPDLEPIFGGT 133

Query: 222 YFP-PEDKYGR-----PGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           Y+P P  +  R       F+ +LRKV  AW ++      +      QL E         +
Sbjct: 134 YWPGPNSERARSRAAGTTFEDVLRKVSTAWKEQEQKCRANAKDITRQLREYAQEGMLGGR 193

Query: 276 LPDELPQNALRLCA------EQLSKSYDSRFGGFGSAPKFPRPVEIQMML----YHSKKL 325
              +  +N            E     YD++ GGFG APKFP PV I+ +L    Y     
Sbjct: 194 DGKQTDENDGLELDLLDDAYEHYKGRYDAKCGGFGGAPKFPTPVHIKPLLRVANYPHVVR 253

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
           E  G+  +  E ++M + TL+ MAKGGI D +G GF RYSV   W +PHFEKMLYD  QL
Sbjct: 254 EIVGEE-DCQEARRMAVHTLESMAKGGIKDQIGHGFARYSVTRDWSLPHFEKMLYDNAQL 312

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAETEGATRKKEGA 444
             VYLDA+ LTK         DI  YL    M+   G IFSAEDADS  T     K+EGA
Sbjct: 313 LPVYLDAWILTKSPLLLESVNDIATYLTSPPMVSELGGIFSAEDADSLPTPQDKHKREGA 372

Query: 445 FYVWTSKEVEDILGEHAILFKEHYY-LKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 503
           FYVW   E + IL E  +     Y+ ++  GN D  R  D   E  G+N L    +    
Sbjct: 373 FYVWMMDEFKSILSEEEVTVCAKYWGVQAQGNVD--RRFDLQGELVGQNTLCVQYEIPEL 430

Query: 504 ASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKS 562
           A +L    E+    +   R KL   R K RPRP LDDK++ SWNGL I   AR S  L+ 
Sbjct: 431 AQELSKSEEQITQTIQSGRSKLLAHREKNRPRPALDDKIVTSWNGLAIGGLARTSSALRY 490

Query: 563 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 622
                     +       Y+  A  A + I+ HL+D  T+ L+  +R GP + PGF DDY
Sbjct: 491 ----------ISPEPAAAYLAAALKATNCIKTHLFDPSTNALKRVYREGPGETPGFADDY 540

Query: 623 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 682
           AFLISGLLDLYE    + WL WA  LQ TQ  LF D E  G+F+T    P +L+RVK+  
Sbjct: 541 AFLISGLLDLYEATWDSNWLQWADTLQQTQTRLFWDEEKYGFFSTAASQPDILIRVKDAM 600

Query: 683 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 727
           D AEPS N V+  NL RL S++  S+   Y + A   +A FE  L
Sbjct: 601 DNAEPSVNGVASYNLFRLGSLLNDSE---YEKMARRIVACFEVEL 642


>gi|237794355|ref|YP_002861907.1| thymidylate kinase [Clostridium botulinum Ba4 str. 657]
 gi|229263126|gb|ACQ54159.1| dTMP kinase [Clostridium botulinum Ba4 str. 657]
          Length = 682

 Score =  468 bits (1204), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 265/678 (39%), Positives = 370/678 (54%), Gaps = 64/678 (9%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K  NRL  E SPYLLQHAHNPVDW+ WGEEAF +A+  D P+FLSIGYSTCHWCHVME E
Sbjct: 6   KKINRLIKEKSPYLLQHAHNPVDWYPWGEEAFEKAKIEDKPVFLSIGYSTCHWCHVMERE 65

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFEDE VA++LN  F+SIKVDREERPD+D +YM + QA  G GGWPL++ ++PD KP   
Sbjct: 66  SFEDEEVAEVLNKNFISIKVDREERPDIDSIYMNFCQAYTGSGGWPLTIIMTPDKKPFFA 125

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           GTYFP   KY  PG   ILR + + W + ++ + +S    +EQ+          N    E
Sbjct: 126 GTYFPKWGKYNVPGIMDILRSISNLWREDKNKILESSNRILEQIER-----FQDNHREGE 180

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEG 337
           L +  +    + L  ++D+++GGFG+ PKFP    I  +L  Y+ KK          ++ 
Sbjct: 181 LEEYIIEEAIKTLLDNFDNQYGGFGTKPKFPTAHYILFLLRYYYFKK---------DNKV 231

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
             ++  TL  M KGGI DH+G GF RYS D +W VPHFEKMLYD   L+  Y +A+  TK
Sbjct: 232 LDVINKTLTSMYKGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALLSMAYTEAYEATK 291

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           +  +  I   +L+Y+++ M    G  +SAEDADS   EG     EG FY+WT +E+ DIL
Sbjct: 292 NPLFKDITEKVLNYVKKSMTSEKGGFYSAEDADS---EGV----EGKFYLWTKEEIMDIL 344

Query: 458 G-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
           G E   L+ + Y +   GN            F+ KN+   +N            LEK   
Sbjct: 345 GEEEGELYCKIYNITSKGN------------FENKNIANLINTDLKIVDNNKDKLEK--- 389

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
                R KLF+ R KR  P+ DDK++ SWN L+I +F++A + LK++             
Sbjct: 390 ----IREKLFEYREKRIHPYKDDKILTSWNALMIVAFSKAGRSLKND------------- 432

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
               Y+E+A+ +A+FI  +L DE+   L    R G     GF+DDYAF +  L++LYE  
Sbjct: 433 ---NYIEIAKKSANFIIENLMDEKG-TLYARIREGERGNEGFIDDYAFFLWALIELYEAS 488

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
               +L  +IE+ N+  +LF  +E GG++  +     +L+R KE +DGA PSGN+V+ + 
Sbjct: 489 FDIYYLEKSIEVANSMIDLFWHKEDGGFYLYSKNSEKLLVRPKEIYDGATPSGNAVASLT 548

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVG 756
           L  L  I      D Y+   +     F   +K   M   L    A M +V   K + L  
Sbjct: 549 LNLLYYITG---EDRYKDLVDKQFKFFAANIKSGPM-YHLFSVMAYMYNVLPIKEITLTY 604

Query: 757 HKSSVDFENMLAAAHASY 774
            +   DF   +   +  Y
Sbjct: 605 REKDEDFYKFINEVNNRY 622


>gi|440784088|ref|ZP_20961509.1| thioredoxin domain-containing protein [Clostridium pasteurianum DSM
           525]
 gi|440219124|gb|ELP58339.1| thioredoxin domain-containing protein [Clostridium pasteurianum DSM
           525]
          Length = 679

 Score =  468 bits (1203), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 265/678 (39%), Positives = 371/678 (54%), Gaps = 58/678 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYLLQHAHNPVDW+ WGEEAF +A + + P+FLS+GYSTCHWCHVM  ESFE
Sbjct: 8   NRLINEKSPYLLQHAHNPVDWYPWGEEAFNKADRENKPVFLSVGYSTCHWCHVMNRESFE 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA++LN +FV+IKVDREERPD+D +YM+  QA+ G GGWPL++ ++ + KP   GTY
Sbjct: 68  DEEVAEILNKYFVAIKVDREERPDIDNIYMSVCQAITGSGGWPLTIIMTAEKKPFFAGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
            P  +KYG+ G   +L KV   W +K+D L +S    ++ L           K+ +++  
Sbjct: 128 LPKIEKYGQIGIIELLDKVNTMWIQKKDKLLESSNNIVDFLQN--DTVDKKGKINEDIID 185

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
            A       L  +YD  FGGF  +PKFP P  +  +L + K   D        E  +MV 
Sbjct: 186 EAYN----SLKNAYDPVFGGFSDSPKFPIPHNLSFLLRYYKIKGD-------REALQMVE 234

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  M  GGI DH+G GF RYSVD +W VPHFEKMLYD   LA VY + + +T    Y 
Sbjct: 235 NTLDSMYSGGIFDHIGFGFARYSVDSKWLVPHFEKMLYDNALLAIVYTETYQITHKNRYK 294

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            I + I DY  RDM    G  +SAEDADS   EG     EG FY+W   E+E+IL E A 
Sbjct: 295 EIVQKIFDYTLRDMTNEDGGFYSAEDADS---EGV----EGKFYLWDKSEIENILEEDAD 347

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
           LF  +Y +K  GN            F+G+N+   + +                N +   R
Sbjct: 348 LFNSYYNIKSKGN------------FEGRNIPNLIGEDLEELENEETK-----NKINRLR 390

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYM 582
            KLF+ R KR  PH DDK++ +WNGL+I++ A A K+ K EA                  
Sbjct: 391 EKLFNYREKRVHPHKDDKILTAWNGLMIAAMAYAGKVFKIEAYKKA-------------- 436

Query: 583 EVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWL 642
             A+ A+ FI  +L D +  RL   +R+G +   GFLDDYAF + GL++LYE      +L
Sbjct: 437 --AKKASDFILANLIDNRG-RLLCRYRDGETGNVGFLDDYAFFVFGLIELYEATFEVHYL 493

Query: 643 VWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLAS 702
             A++L     + F D E  G+F    +   ++L+ KE +DGA PSGNSV+ +NL+RL+ 
Sbjct: 494 KKAVDLNGEMIKYFWDEENSGFFFYGKDSEELILKTKEIYDGALPSGNSVAAMNLIRLSR 553

Query: 703 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 762
           I    + +   +      ++F  ++  + +       A    +VP   H+V+ G K  V+
Sbjct: 554 ITGDVQLE---EKVAEIFSLFSEKINKVPLGYINTISAFLTNTVPDI-HIVIAGDKDDVN 609

Query: 763 FENMLAAAHASYDLNKTV 780
            + ++   +  + L  +V
Sbjct: 610 TKTLIDEINKRFLLFASV 627


>gi|325107403|ref|YP_004268471.1| hypothetical protein Plabr_0826 [Planctomyces brasiliensis DSM
           5305]
 gi|324967671|gb|ADY58449.1| protein of unknown function DUF255 [Planctomyces brasiliensis DSM
           5305]
          Length = 686

 Score =  468 bits (1203), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 265/652 (40%), Positives = 375/652 (57%), Gaps = 51/652 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYLLQHAHNPVDW+ WG+EAFA AR+R+VPIFLS+GYS CHWCHVME ESFE
Sbjct: 7   NRLADETSPYLLQHAHNPVDWYPWGDEAFAAARERNVPIFLSVGYSACHWCHVMERESFE 66

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           ++ +A L+N WFV++KVDREERPD+D++YMT VQ + G GGWP+SVFL+P  +P  GGTY
Sbjct: 67  NDQIAALMNQWFVNVKVDREERPDIDQIYMTAVQLVTGQGGWPMSVFLAPSGEPFYGGTY 126

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           +PP  ++G PGF  IL+K+   W++ R+     GA    +L  A+       +    L +
Sbjct: 127 WPPTSRHGMPGFADILQKIHQYWEEHREECLAKGA----ELVTAIDQLHHHEQEKSPLQE 182

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
           + LR    +L +S D + GGFG APKFP P++++++L   ++       GE  E + +V 
Sbjct: 183 DLLRHAQHRLMQSADMQEGGFGHAPKFPHPIDLRVLLRSWRRF------GEV-ESRNVVT 235

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  MA GGI+DH+ GGF RYS D  W VPHFEKMLYD  QLA  YL+ +  T +  Y+
Sbjct: 236 LTLDKMADGGIYDHLAGGFARYSTDRYWLVPHFEKMLYDNSQLATAYLEGYQATGEERYA 295

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL-GEHA 461
            + R+ LD++ RDM       +S  DADS   EG     EG FYVW+  EV+++L  + A
Sbjct: 296 EVVRETLDFVLRDMTSSEHGFYSTLDADS---EGV----EGKFYVWSEAEVDELLEAKAA 348

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
             FK  Y +   GN            ++G N+L         A +LG   E     L + 
Sbjct: 349 EWFKHVYNVSAQGN------------WEGHNILHRTKPLQELAGELGTDRETLSASLMQS 396

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           R  L  VR +R  P  D+K+IV+WNGL++S+FA+A +IL              G DR  Y
Sbjct: 397 RETLLKVREQRIWPGRDEKIIVAWNGLMLSAFAQAGRIL--------------GEDR--Y 440

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
            + A +AA F+   L  E    L H  ++G ++  GFLDDYA L+ GL DLY      K+
Sbjct: 441 TQAACNAADFLLDTLRREDG-SLWHCRKDGRNRFNGFLDDYACLVDGLNDLYLTTLEPKY 499

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           L  A+EL +    LF D E   +  T  +   +++RV++ +D A PSG ++++  L++L 
Sbjct: 500 LQAALELADVMQRLFYDDEQKAFHYTPSDHEELVVRVRDRYDSAIPSGTNLAIHALLKLG 559

Query: 702 SIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVV 753
            I    + DY  + A   L      ++     +     A D+L  P+ + ++
Sbjct: 560 WIAG--REDYVTR-AGDCLDSVSGTMRQQPSGMGQAVVALDLLLGPTEEFIL 608


>gi|407478214|ref|YP_006792091.1| hypothetical protein Eab7_2389 [Exiguobacterium antarcticum B7]
 gi|407062293|gb|AFS71483.1| Hypothetical protein Eab7_2389 [Exiguobacterium antarcticum B7]
          Length = 677

 Score =  468 bits (1203), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 254/623 (40%), Positives = 363/623 (58%), Gaps = 55/623 (8%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRL  E SPYLLQHA NPVDW+ WGEEAF+ AR  + PIFLSIGYSTCHWCHV+  ESF
Sbjct: 3   TNRLIHEKSPYLLQHATNPVDWYPWGEEAFSLARATNKPIFLSIGYSTCHWCHVLAHESF 62

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           EDE  A++LN+ FVSIKVDREERPD+D++YMT  Q + G GGWPLSVFLSPD  P   GT
Sbjct: 63  EDEETARMLNERFVSIKVDREERPDIDQIYMTAAQLMNGQGGWPLSVFLSPDQTPFYIGT 122

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP   ++ RP F+ ++ ++ + +    + + + G   I+ L++  SA  ++ +L D L 
Sbjct: 123 YFPKTPQFNRPSFRQVILQLSEHYRTDPEKIKRVGNELIQALTDVTSAD-TTGQLDDTLI 181

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
            +      +Q  + +D + GGFG APKFP P  +  +L + +  ED           +MV
Sbjct: 182 HDTF----DQAMRQFDVQNGGFGEAPKFPSPSLLTFLLDYYRFAED-------ETALQMV 230

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
           + TL  M  GGI D +G G  RY+VDERW VPHFEKMLYD    A + ++ + ++    +
Sbjct: 231 MRTLTAMRDGGITDQIGFGLCRYTVDERWDVPHFEKMLYDNALFATLCIETYQVSGRERF 290

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHA 461
                ++  Y+ RD++ P G  +SAEDADS   EG    +EG FY +T  E+ D+LGE A
Sbjct: 291 KQYAEEVFTYIERDLLSPDGAFYSAEDADS---EG----REGTFYTFTYDELLDVLGEDA 343

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS-ASKLGMPLEKYLNILGE 520
            LF   Y   P GN            F G+NV    N S    A   G  ++K L  L +
Sbjct: 344 -LFPRFYQATPQGN------------FDGRNVFRRTNQSVQQFADDNGRTVQKTLFQLEQ 390

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R+ L  VRS+R RP  DDK++ +WN L+IS++A+A ++                 D   
Sbjct: 391 ERQTLLHVRSQRIRPFRDDKILTAWNALMISAYAKAGRVF----------------DDHH 434

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           Y +VA  A +F+  HL D+   RL+  +R G  +  GFLDDY+FL    L+L++    T 
Sbjct: 435 YTDVAIRALTFLETHLMDDD--RLRVRYREGHIQGNGFLDDYSFLTEAYLELHQTTQQTV 492

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
           ++  A+ L +   + F D E G +F T+ E+ ++L+R K+ +DG +P+GNS +V+NL+RL
Sbjct: 493 YIQQALRLTDRMIQDFGD-EQGSFFFTSVEEETLLVRPKDIYDGVKPAGNSTAVLNLIRL 551

Query: 701 ASIVAGSKSDYYRQNAEHSLAVF 723
           + +   +    YR+ A+H  +  
Sbjct: 552 SQLTGRTD---YRECAQHVFSAL 571


>gi|300855044|ref|YP_003780028.1| hypothetical protein CLJU_c18640 [Clostridium ljungdahlii DSM
           13528]
 gi|300435159|gb|ADK14926.1| conserved protein containing a thioredoxin domain [Clostridium
           ljungdahlii DSM 13528]
          Length = 675

 Score =  467 bits (1201), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 260/656 (39%), Positives = 366/656 (55%), Gaps = 64/656 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYLLQHAHNPV+W+ WG+EAF +A+  D PIFLSIGYSTCHWCHVME  SFE
Sbjct: 8   NRLINEKSPYLLQHAHNPVNWYPWGDEAFKKAKSEDKPIFLSIGYSTCHWCHVMEKGSFE 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  VA++LND F+SIKVDREERPD+D +YM   Q++ G GGWPL++ ++PD KP   GTY
Sbjct: 68  DTEVAEMLNDSFISIKVDREERPDIDSIYMNVCQSITGSGGWPLTIIMTPDQKPFFAGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE-LP 281
           FP  ++ G  G  +IL  +K AW   R  L  +        ++ L +  +SN+  +E + 
Sbjct: 128 FPKNNRDGLMGLMSILDYIKKAWKNNRSELLNAS-------TQILDSLKNSNETSNETIN 180

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           ++  +         +D  +GGFG  PKFP    +  +L +  K +D       S   +MV
Sbjct: 181 EDIFQKTFLNFKYDFDPTYGGFGDFPKFPSAHNLLFLLRYFYKTKD-------SSALEMV 233

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             TL CM KGGI+DH+G GF RYSVD +W VPHFEKMLYD   L   Y++ F  T +  Y
Sbjct: 234 EKTLDCMRKGGIYDHIGFGFSRYSVDRKWLVPHFEKMLYDNALLIIAYIETFQATGNKKY 293

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH- 460
                +IL Y+ RDM    G  +SAEDADS   EG    +EG FYVW+ +E++DIL E  
Sbjct: 294 CKTAEEILSYVLRDMTSNEGGFYSAEDADS---EG----EEGKFYVWSEEEIKDILQEED 346

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
           +  F  ++ +   GN            F+GKN+L  +N S        +P E  +  +  
Sbjct: 347 SGKFCSYFNVTKGGN------------FEGKNILNLINSS--------IP-EDDMQFIEN 385

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
           CR KLF  R KR  P+ DDK++ SWNGL+I + + A+++L                +  +
Sbjct: 386 CREKLFAEREKRIHPYKDDKILTSWNGLMIGAMSIAARVL----------------NNSK 429

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           Y + A+ A  FI ++L  +   RL   +R+G +   G+LDDY+FLI GL++LYE    T 
Sbjct: 430 YTKAAKKAVDFIYKNLV-KSDGRLLARYRDGEASFLGYLDDYSFLIWGLIELYETTYSTD 488

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
           +L  A+EL     +LF D+E GG+F    +   ++ R KE +D A PSGNSV+ +NL+RL
Sbjct: 489 YLKKALELNEDLLKLFWDKENGGFFLYGNDGEKLITRPKEIYDSAIPSGNSVATLNLLRL 548

Query: 701 ASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVG 756
           + + +      +   A+     F   +     A      +      P R+ +V  G
Sbjct: 549 SHLTSSYD---FEDKAKQLFDAFSREINSFPRACSFSLISLLFSKSPIRQIIVSAG 601


>gi|197119298|ref|YP_002139725.1| hypothetical protein Gbem_2926 [Geobacter bemidjiensis Bem]
 gi|197088658|gb|ACH39929.1| thioredoxin domain protein YyaL [Geobacter bemidjiensis Bem]
          Length = 746

 Score =  467 bits (1201), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 270/670 (40%), Positives = 381/670 (56%), Gaps = 51/670 (7%)

Query: 89  RTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYS 148
           RT    S    K+ NRL  E SPYLLQHAHNPV+WF WG+EAF  A++ + P+ +SIGY+
Sbjct: 38  RTRHLESGGEAKYMNRLFLESSPYLLQHAHNPVNWFPWGDEAFELAQRLNRPVLVSIGYA 97

Query: 149 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 208
           TCHWCHVME ESFEDE VA+ LN  F++IKVDREERPDVD +YMT V A+   GGWPL+V
Sbjct: 98  TCHWCHVMEEESFEDEEVARFLNSNFIAIKVDREERPDVDTIYMTAVHAMGMQGGWPLNV 157

Query: 209 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 268
           F +PD KP  GGTYFPP D  G  GF ++L+++++ + +  D +  +G     QL+EA+ 
Sbjct: 158 FATPDRKPFYGGTYFPPRDYAGGIGFLSLLQRIRETYRQAPDRVTHAGV----QLTEAIR 213

Query: 269 ASASSNKLPDELPQNALRL--CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 326
              +   +  E PQN + L    E   + +D++ GG   APKF         L     L 
Sbjct: 214 GMLAP--MGGEPPQNEISLERVIEAYQERFDAKNGGVVGAPKF------PSSLPLGLLLR 265

Query: 327 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 386
           D  + G+ +    M  +TL+ MA GGI+D  GGGFHRY+ D  W +PHFEKMLYD  +LA
Sbjct: 266 DHLRRGDKN-SLFMAQYTLRRMAAGGIYDQAGGGFHRYATDSAWLIPHFEKMLYDNARLA 324

Query: 387 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 446
             YL+ +  T D  ++ + R+IL YL+RDM+ P G  +SA DADS    G   ++EG F+
Sbjct: 325 AAYLEGYQATGDPQFAKVAREILRYLQRDMMSPQGAFYSATDADSLTESG--HREEGIFF 382

Query: 447 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 505
            WT +E++ +LG E A +    Y +   GN            F+G+++L         A 
Sbjct: 383 TWTPEELDAVLGTERARVVAACYGVTSEGN------------FEGRSILHREKSMQHLAE 430

Query: 506 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 565
           +L +P E+   +L E R +L+  R +RP P  D+K++ SWNGL IS+FAR   +L   A 
Sbjct: 431 ELMLPKEELERLLDEAREELYRARQRRPLPLRDEKILASWNGLAISAFARGGLVLNDPA- 489

Query: 566 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 625
                           ++ A  AA+FI + +  ++  RL HS++ G +K  GFLDDYAF 
Sbjct: 490 ---------------LLDTARRAANFILQSMMSQE--RLCHSYQEGEAKGEGFLDDYAFF 532

Query: 626 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 685
           I+GL+DL+E      WL  A+E+     E F D E GG+F T      ++ R K  +DG 
Sbjct: 533 IAGLIDLFEATGELPWLKRALEVAQQVQEQFEDSETGGFFMTGPRHEELISREKPAYDGV 592

Query: 686 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 745
            PSGNSV ++NL+RL ++       +    A+ +L  F  +L     A+  M  A D L 
Sbjct: 593 IPSGNSVMIMNLLRLNALTG---EQWMLDQAQRALDAFSIQLASAPTALSEMLLALDYLQ 649

Query: 746 VPSRKHVVLV 755
              R+ V++ 
Sbjct: 650 DLPREIVIVA 659


>gi|219849212|ref|YP_002463645.1| hypothetical protein Cagg_2330 [Chloroflexus aggregans DSM 9485]
 gi|219543471|gb|ACL25209.1| protein of unknown function DUF255 [Chloroflexus aggregans DSM
           9485]
          Length = 693

 Score =  466 bits (1200), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 266/678 (39%), Positives = 378/678 (55%), Gaps = 56/678 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA+E SPYL QHA NPVDW+ WGEEA   AR+ D P+ +SIGY+ CHWCHVM  ESF 
Sbjct: 9   NRLASEASPYLQQHADNPVDWYPWGEEALERARREDKPLLVSIGYAACHWCHVMAHESFA 68

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  +A + N++F++IKVDREERPD+D +YM   QAL G GGWPL+VF  PD  P   GTY
Sbjct: 69  DPEIAAIQNEYFINIKVDREERPDLDSIYMAAAQALTGRGGWPLNVFCLPDGTPFFAGTY 128

Query: 223 FPPE---DKYGRPGFKTILRKVKDAWDKKRDML---AQSGAFAIEQLSEALSASASSNKL 276
           FPP+   ++Y  P ++ +L  + +A+  +RD L   AQ     I+ L++ L  +A+ ++ 
Sbjct: 129 FPPDAKANRYRMPSWRQVLLSIAEAYRTRRDDLTASAQELLNHIKLLAQPLPETATVDE- 187

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
                   L   A +L + +D ++GGFG APKFP+P+ ++ +L        T   G   +
Sbjct: 188 ------ALLLEAAAKLEREFDPQYGGFGDAPKFPQPLVLEFLL-------RTHLRGHV-Q 233

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
              M+  TL+ MA GG++D VGGGFHRYSVD RW VPHFEKMLYD   LA VY  A  +T
Sbjct: 234 ALPMLHQTLEQMAHGGMYDQVGGGFHRYSVDTRWLVPHFEKMLYDNALLAEVYHLAALVT 293

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
            D F + I  +   YL RD+  P G  FS+EDADS    GA   +EGAFYVWT  E+   
Sbjct: 294 GDPFLAQIADETFAYLLRDLRHPEGAFFSSEDADSLPVPGAAHAEEGAFYVWTPDELRLA 353

Query: 457 LGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
           LG+ A +   +Y +   GN            F+GK++L     +SA A++LG+P+E+   
Sbjct: 354 LGDDATIVGAYYGVTRQGN------------FEGKSILYVPRSASAVAARLGVPVERVTE 401

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
            +   R  L   R +RPRP  D+K+I +WN L I + A AS  +                
Sbjct: 402 TVERARPILRTFREQRPRPFRDEKIITAWNALAIRALATASARV---------------- 445

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
              EY+  A   A F+  +L      RL  S+++G     GFLDDYA L   LL+L+  G
Sbjct: 446 --PEYLSAARQCADFLLANL-RRADGRLLRSWKDGRPGPAGFLDDYALLCDALLELHAAG 502

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
             T +L  AIEL     +LF D +   +F+T  + P+++ R ++  D A PSG S + + 
Sbjct: 503 GETYYLATAIELAEAMLDLFWDAQSWMFFDTGRDQPALVTRPRDLSDNATPSGTSAATMA 562

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVG 756
           L+RL ++   + +D +   AE  L      L    +    M CAAD++  P R+ + ++G
Sbjct: 563 LLRLYAL---TGNDLFATRAEQVLQQVAPMLIRFPLGFGRMLCAADLMIGPIRE-LAIIG 618

Query: 757 HKSSVDFENMLAAAHASY 774
                  + +LA A ++Y
Sbjct: 619 PSGHPATQALLAVARSAY 636


>gi|308069056|ref|YP_003870661.1| hypothetical protein PPE_02290 [Paenibacillus polymyxa E681]
 gi|305858335|gb|ADM70123.1| Conserved hypothetical protein [Paenibacillus polymyxa E681]
          Length = 688

 Score =  466 bits (1200), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 274/688 (39%), Positives = 372/688 (54%), Gaps = 63/688 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYLLQHAHNPV+WF W +EAF  A++ + PIFLS+GYSTCHWCHVM  ESFE
Sbjct: 8   NRLAKEKSPYLLQHAHNPVNWFPWSDEAFEIAKRDNKPIFLSVGYSTCHWCHVMGRESFE 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA++LN  +VSIKVDREERPDVD +YM+  Q + G GGWPL++ ++PD KP   GTY
Sbjct: 68  DEEVAEVLNRDYVSIKVDREERPDVDHIYMSICQTMTGHGGWPLTILMTPDQKPFFAGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKL----PD 278
            P E K+GR G   +L KV   W ++ + L         +LSE +        L      
Sbjct: 128 LPKEQKFGRVGLLELLDKVGTRWKEQPEELV--------ELSEQVLTEHERQDLLAGYRG 179

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
           EL + +L     + S ++D  +GGFG APKFP P  +  +L +++    TG      +  
Sbjct: 180 ELDEQSLNKAFHEYSHTFDKEYGGFGEAPKFPSPHNLSFLLRYAQH---TGN----QQAL 232

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
           +M   TL  M++GGI+DH+G GF RYSVDE+W VPHFEKMLYD   LA  Y +A+ +T  
Sbjct: 233 EMAEKTLDAMSRGGIYDHIGMGFSRYSVDEKWLVPHFEKMLYDNALLAIAYTEAWQMTGK 292

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             Y  I   I  YL RDM   GG  +SAEDADS   EG    +EG FYVW   EV  +LG
Sbjct: 293 ELYRRITEQIFTYLARDMTDAGGAFYSAEDADS---EG----EEGRFYVWDDSEVRAVLG 345

Query: 459 -EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGMPLEKYL 515
            E A  F + Y + P GN            F+G N+  LI++N   A   K  +  ++  
Sbjct: 346 DEDAAFFNDLYGITPYGN------------FEGHNIPNLIDIN-LEAYGIKHDLTEQELE 392

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
             + E R KLF  R +R  PH DDK++ SWNGL+I++ A+A +                G
Sbjct: 393 QRVSELRAKLFAAREQRVHPHKDDKILTSWNGLMIAALAKAGQ--------------AFG 438

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
             R  Y E A  A +F+  HL  E   RL   +R+G +  PG++DDY F + GL++LY+ 
Sbjct: 439 DMR--YTEQARKAETFLWNHLRQENG-RLLARYRDGEAAYPGYVDDYVFYVWGLIELYQA 495

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
                +L  A+ L     +LF D E  G F    +   ++ + KE  DGA PSGNS++  
Sbjct: 496 TFDIVYLQRALTLNQNMIDLFWDEERDGLFFYGSDSEQLIAKPKEIDDGAIPSGNSIAAY 555

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 755
           N VRLA +   S+ + Y   A      F   +         +  A  + +  + K +V+V
Sbjct: 556 NFVRLARLTGESRLENY---AAKQFKAFGGMVAHYPSGHSALLSAL-LYATGTTKEIVIV 611

Query: 756 GHKSSVDFENMLAAAHASYDLNKTVSKK 783
           GH+        + A  A +  N  V  K
Sbjct: 612 GHRDDPQTGQFIRAVRAGFRPNTVVILK 639


>gi|25326752|pir||A88216 protein B0495.5 [imported] - Caenorhabditis elegans
          Length = 722

 Score =  466 bits (1199), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 260/662 (39%), Positives = 369/662 (55%), Gaps = 47/662 (7%)

Query: 115 QHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWF 174
            HA+NP+DW+ WG+EAF +A+  + PIFLS+GYSTCHWCHVME ESFE+E  AK+LND F
Sbjct: 23  NHANNPIDWYPWGQEAFQKAKDNNKPIFLSVGYSTCHWCHVMEKESFENEATAKILNDNF 82

Query: 175 VSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGF 234
           V+IKVDREERPDVDK+YM +V A  G GGWP+SVFL+PDL P+ GGTYFPP+D  G  GF
Sbjct: 83  VAIKVDREERPDVDKLYMAFVVASSGHGGWPMSVFLTPDLHPITGGTYFPPDDNRGMLGF 142

Query: 235 KTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSK 294
            TIL  +     +KR    ++    I +L +  +AS   N+      +   +        
Sbjct: 143 PTILNMIHTEVVEKRRREFETTRAQIIKLLQPETASGDVNR-----SEEVFKSIYSHKQS 197

Query: 295 SYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIH 354
           S+DSR GGFG APKFP+  ++  ++  +    ++ K   A +   M+  TL+ MA GGIH
Sbjct: 198 SFDSRLGGFGRAPKFPKACDLDFLITFAASENESEK---AKDSIMMLQKTLESMADGGIH 254

Query: 355 DHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT--KDVFYSYICRDILDYL 412
           DH+G GFHRYSV   WH+PHFEKMLYDQ QL   Y D   LT  K     ++  DI  Y+
Sbjct: 255 DHIGNGFHRYSVGSEWHIPHFEKMLYDQSQLLATYSDFHKLTERKHDNVKHVINDIYQYM 314

Query: 413 RRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI-------LFK 465
           ++     GG  ++AEDADS     ++ K EGAF  W  +E++ +LG+  I       +  
Sbjct: 315 QKISHKDGG-FYAAEDADSLPNHNSSNKVEGAFCAWEKEEIKQLLGDKKIGSASLFDVVA 373

Query: 466 EHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKL 525
           +++ ++ +GN  ++R SDPH E K KNVL +L      A+   + + +    + E +  L
Sbjct: 374 DYFDVEDSGN--VARSSDPHGELKNKNVLRKLLTDEECATNHEISVAELKKGIDEAKEIL 431

Query: 526 FDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVA 585
           ++ R++RP PHLD K++ SW GL I+   +A +                 ++  +Y++ A
Sbjct: 432 WNARTQRPSPHLDSKMVTSWQGLAITGLVKAYQ----------------ATEETKYLDRA 475

Query: 586 ESAASFIRRHLYDEQTHR------LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
           E  A FI + L D    R             G  +   F DDYAFLI  LLDLY      
Sbjct: 476 EKCAEFIGKFLDDNGELRRSVYLGANGEVEQGNQEIRAFSDDYAFLIQALLDLYTTVGKD 535

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
           ++L  A+ELQ   D  F +  G GYF +   D  V +R+ ED DGAEP+  S++  NL+R
Sbjct: 536 EYLKKAVELQKICDVKFWN--GNGYFISEKTDEDVSVRMIEDQDGAEPTATSIASNNLLR 593

Query: 700 LASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKS 759
           L  I+   + + YR+ A         RL  + +A+P M  A     + S   V++   KS
Sbjct: 594 LYDIL---EKEEYREKANQCFRGASERLNTVPIALPKMAVALHRWQIGSTTFVLVGDPKS 650

Query: 760 SV 761
            +
Sbjct: 651 EL 652


>gi|293376087|ref|ZP_06622338.1| conserved hypothetical protein [Turicibacter sanguinis PC909]
 gi|292645289|gb|EFF63348.1| conserved hypothetical protein [Turicibacter sanguinis PC909]
          Length = 672

 Score =  466 bits (1198), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 265/665 (39%), Positives = 365/665 (54%), Gaps = 64/665 (9%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
            K  N L  E SPYLLQHA+NPV+W+ W +EAF +A++ D PIFLSIGYSTCHWCHVME 
Sbjct: 2   TKQANHLIHEKSPYLLQHAYNPVNWYPWNDEAFTKAKEEDKPIFLSIGYSTCHWCHVMEH 61

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE VA  LN+ F+SIKVDREERPD+D VYM+  QAL G GGWPL++F++P  +   
Sbjct: 62  ESFEDEDVATYLNEHFISIKVDREERPDIDTVYMSICQALTGQGGWPLTIFMTPTQQAFY 121

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFP   +YGRPGF  +L+ +   W+  R  +            +        + L  
Sbjct: 122 AGTYFPKTSRYGRPGFLDVLKNIDFNWNHHRAKVTDITKQIESHFKDLEGIETEGDSLSM 181

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
            + QN +     QL +SYD RFGGFG+APKFP P ++  +L + ++ +D          Q
Sbjct: 182 AIIQNGVN----QLKQSYDPRFGGFGTAPKFPTPHKLMFLLRYDEQTKDKSV-------Q 230

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
            MV  TL  M KGGI DH+G GF RYS DE W VPHFEKMLYD   L   Y +A+ +T++
Sbjct: 231 DMVTQTLDHMYKGGIFDHLGYGFSRYSTDEIWLVPHFEKMLYDNALLMISYTEAYQVTRE 290

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             Y  I     +Y+   +  P G  + AEDADS   EG    +EG FYV+T  E+  ILG
Sbjct: 291 PRYLSIAMQTAEYVLTQLTSPEGGFYCAEDADS---EG----EEGKFYVFTPAEIIQILG 343

Query: 459 -EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
            E    F E Y +   GN            F+GKN+L  L+            LE  +  
Sbjct: 344 HEKGHWFNEFYNVTEEGN------------FEGKNILNRLHHKK---------LELDIKE 382

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
           L  CR  L   R +R   H DDK++ SWNGL+I++FA+                 + G  
Sbjct: 383 LEACRETLLTYRLERTHLHKDDKILTSWNGLMIAAFAK-----------------LYGQT 425

Query: 578 RKE-YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
           +K  Y++ A  A  FI++HL+DE   RL   +R G S    +LDDYAFL  GL++L++  
Sbjct: 426 QKMIYLDAASKAVIFIKQHLFDET--RLLARYREGESHFKAYLDDYAFLSYGLIELHQST 483

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
           +  ++L  AI+L     +LF D E GG++ T  +  +++LR KE +DGA PSGNSV+  N
Sbjct: 484 AEVEYLELAIQLNKEMLDLFKD-EAGGFYLTGHDAETLMLRPKELYDGAMPSGNSVAAYN 542

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVG 756
           L+RLA +   +    +   AE  +     ++K   M       AA      +++ ++ V 
Sbjct: 543 LIRLAKLTGDT---LFETEAEKQIQYLAKQVKHYEMNHTFYLIAALFALSDTKELMITVT 599

Query: 757 HKSSV 761
            +  +
Sbjct: 600 KQEQI 604


>gi|345020399|ref|ZP_08784012.1| hypothetical protein OTW25_03576 [Ornithinibacillus scapharcae
           TW25]
          Length = 685

 Score =  465 bits (1197), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 268/669 (40%), Positives = 377/669 (56%), Gaps = 63/669 (9%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           N+  N L  E SPYLLQHA+NPV+W+ WGEEAF +A++ + PIFLSIGYSTCHWCHVM  
Sbjct: 4   NQQANNLITEKSPYLLQHAYNPVNWYPWGEEAFEKAKQENKPIFLSIGYSTCHWCHVMAH 63

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE VAKL+ND +++IKVDREERPDVD +YM   Q + G GGWPL++F++PD  P  
Sbjct: 64  ESFEDEEVAKLINDHYIAIKVDREERPDVDSIYMKVCQMMAGHGGWPLTIFMTPDKIPFY 123

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASA---SSNK 275
            GTYFP E KYGRPG K  L ++   +    + +A       E + EAL  +    S+N+
Sbjct: 124 AGTYFPKESKYGRPGIKEALEQLHIKYTTDPEHIAD----VTESVREALDNTIREKSNNR 179

Query: 276 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
           L  E    A     +QL + +D  +GGF  APKFP+P   Q +L+  +    +GK+    
Sbjct: 180 LTIETVDQAF----QQLGRGFDFTYGGFWEAPKFPQP---QNLLFLMRYYHFSGKTA--- 229

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
              KMV  TLQ MA GGI DH+G GF RYS DE+W VPHFEKMLYD   L  VY + + +
Sbjct: 230 -ALKMVESTLQNMAAGGIWDHIGYGFARYSTDEKWLVPHFEKMLYDNALLLMVYTECYQI 288

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           TK  FY  I   I+ +++R+M    G  +SA DADS   EG     EG +YVW  +E+ D
Sbjct: 289 TKKPFYKNIAEQIITFIKREMTSKDGAFYSAIDADS---EGV----EGKYYVWADEEIYD 341

Query: 456 ILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGMPLE 512
           ILGE    ++   Y + P GN            F+GKN+  LI  N  S  A +  + L 
Sbjct: 342 ILGEDLGEIYTTTYGITPFGN------------FEGKNIPNLIRANLESV-AEEFDLTLS 388

Query: 513 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 572
           +  + L   R  L   R KR  PH+DDKV+ SWN ++I+  A+AS++ +++         
Sbjct: 389 ELTSQLETARLTLLQEREKRVYPHVDDKVLTSWNAMMIAGLAKASRVFQNQ--------- 439

Query: 573 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 632
                  +Y+ +A+ A SF+  ++  +    L   +R G +K   +LDDYA+LI   ++L
Sbjct: 440 -------DYVTLAKRALSFLEENIVVDG--DLMARYREGETKYHAYLDDYAYLIWAYIEL 490

Query: 633 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 692
           Y+      +L  A    N   ELF D   GG+F +   +  ++   KE +DGA PSGNSV
Sbjct: 491 YQLEFDLTYLSKAKAQLNIMIELFWDPHHGGFFFSGKNNEKLISNDKEIYDGATPSGNSV 550

Query: 693 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHV 752
           + + L ++AS+    + DY  +  E     +E  +K  +  V  +     +L+    K V
Sbjct: 551 AALMLGQMASLTG--EVDYLDKINEMYSTFYEDMMKQPSAGVFFLQSL--LLTENPTKEV 606

Query: 753 VLVGHKSSV 761
           V++GH  +V
Sbjct: 607 VVLGHDENV 615


>gi|167043013|gb|ABZ07725.1| putative protein of unknown function, DUF255 [uncultured marine
           microorganism HF4000_ANIW141A21]
          Length = 678

 Score =  465 bits (1197), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 273/691 (39%), Positives = 397/691 (57%), Gaps = 64/691 (9%)

Query: 94  TSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWC 153
           T+ S+ K +NRL  E SPYLLQHAHNPVDWFAWG+EA ++A++ +  IFLSIGYSTCHWC
Sbjct: 2   TNSSKGK-SNRLINEKSPYLLQHAHNPVDWFAWGDEALSKAKRENKIIFLSIGYSTCHWC 60

Query: 154 HVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPD 213
           HVM  E+FE++  A++LN  F+ IKVDREERPD+D++YM  V ++ G GGWPL+VFL+PD
Sbjct: 61  HVMAHETFENDEAAEILNQNFIPIKVDREERPDIDELYMKAVTSMGGQGGWPLTVFLTPD 120

Query: 214 LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKR-DMLAQSGAFAIEQLSEALSASAS 272
           LKP  GGTY+P         FK++L  V + W+K+R D+  Q+ +  +E L    +    
Sbjct: 121 LKPFYGGTYYP------LSSFKSLLGSVTEIWNKQRKDVFGQANSI-VENLRRMYTPQEQ 173

Query: 273 SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSG 332
           S+    E P +A  L    L  S+D R+GGFG +PKFP P  + ++L    +  D  K+ 
Sbjct: 174 SS--ISEYPIDAAYL---NLVDSFDDRWGGFGDSPKFPTPSNLILLL----RYYDRSKNH 224

Query: 333 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 392
           +A +   MV+ TL  M+ GGI DH+ GGFHRYSVD  W + HFEKMLYD   L   YL+A
Sbjct: 225 KALD---MVVKTLDAMSSGGIQDHLAGGFHRYSVDRMWVISHFEKMLYDNALLTIAYLEA 281

Query: 393 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 452
           +    +  +    R  L+++ R+M    G  +SA+DADS +        EGA+YVW+  E
Sbjct: 282 YRCKPNDAFEKTARMTLNWILREMQSKDGAFYSAQDADSPDG-------EGAYYVWSKAE 334

Query: 453 VEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPL 511
           + DILG ++ ++  E + +   GN +           K K+VL    +    A K+G+  
Sbjct: 335 ISDILGPKNGMIVAEWFGVGDEGNFE-----------KEKSVLTTRTNLDDLAKKVGLTP 383

Query: 512 EKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNF 571
           +K + ++ + +  L   RS R +P  DDK++ SWNGL IS+ A  +++L           
Sbjct: 384 KKLVALMDKSKAALLQARSHRVKPSTDDKILTSWNGLTISALALGAQVL----------- 432

Query: 572 PVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLD 631
                DR EY+E A+ AASF+   L   +  RL   +R+G +   G L+DYAF I GLLD
Sbjct: 433 ----GDR-EYLEAAKRAASFLMETL--SEKGRLLRRYRDGEAALGGTLEDYAFFIQGLLD 485

Query: 632 LYEFGSGTKWLVWAIELQNTQDELFLDREGGG-YFNTTGEDPSVLLRVKEDHDGAEPSGN 690
           LYE     KWL  A+ L +   ELF D   GG +FN      ++++++KE +DGA PSGN
Sbjct: 486 LYEADLQIKWLQEAMRLADKMIELFWDDSSGGFFFNGKDSSDNMIVKIKEAYDGATPSGN 545

Query: 691 SVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRK 750
           SV  + L++L      S+ D YR+    ++  F  R++   MA   M  A D     SR+
Sbjct: 546 SVGALALLKLGVF---SERDEYREKGVKTIMSFFGRIESNPMAHSHMLSAVDFHLRGSRE 602

Query: 751 HVVLVGHKSSVDFENMLAAAHASYDLNKTVS 781
            +++ G  +++   +ML      Y  NK ++
Sbjct: 603 -IIVAGSDANL-INDMLHEIWRRYIPNKVLA 631


>gi|345302921|ref|YP_004824823.1| hypothetical protein Rhom172_1056 [Rhodothermus marinus
           SG0.5JP17-172]
 gi|345112154|gb|AEN72986.1| protein of unknown function DUF255 [Rhodothermus marinus
           SG0.5JP17-172]
          Length = 699

 Score =  465 bits (1197), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 277/673 (41%), Positives = 377/673 (56%), Gaps = 45/673 (6%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QH  +PVDW+ W EEAF +A+  D PIFLSIGY+ CHWCHVM  ESF+
Sbjct: 3   NRLQFEKSPYLQQHKDDPVDWWPWCEEAFEKAKAEDKPIFLSIGYAACHWCHVMAHESFQ 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA+LLND F++IKVDREERPD+D +YMT  Q + G GGWPL++ ++PD KP    TY
Sbjct: 63  DEEVARLLNDAFINIKVDREERPDIDHLYMTVCQMVTGHGGWPLTIIMTPDKKPFFAATY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
            P   +YGRPG   I+ ++K+AW + RD +  S       L + +S  A S  +  E  +
Sbjct: 123 IPKRSRYGRPGLLEIIPRIKEAWQQHRDEIIASAEKLTGTLQKVMSFEAPSQVIDAEWLE 182

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
            A R    +L   +D + GGFG APKFP P  +  +L +        +SGEA   Q MV 
Sbjct: 183 IAYR----RLDDIFDRKHGGFGHAPKFPTPHTLLFLLRYWH------RSGEAHALQ-MVE 231

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  M  GGI+DHVG GFHRY+ DE W VPHFEKMLYDQ  L   Y +A+  T + FY 
Sbjct: 232 HTLVQMRPGGIYDHVGFGFHRYATDEAWRVPHFEKMLYDQALLTMAYTEAYQATGNPFYE 291

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHA 461
              R+IL Y+ RD+  P G  +S+EDADS   EG    +EG FYVWT +E+ + LG E A
Sbjct: 292 RTAREILTYVLRDLRAPEGAFYSSEDADS---EG----EEGKFYVWTVEELREALGPELA 344

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
            L  E + + P GN +     +   E  GKN+L       A A + G   E+    L E 
Sbjct: 345 PLAIELFNVNPEGNYE----EEATGERTGKNILYLTRPPKALARERGWTPEELEAKLEEI 400

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           R++LF  R++R RP  D+K++  WNGL+I++ ARA+++                 D   Y
Sbjct: 401 RQRLFAYRAQRVRPGRDEKILTDWNGLMIAALARAAQVF----------------DEAAY 444

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
           +E A +AA F+ R +   +  RL H +R+G +  PG LDDYAFL  GLLDLYE      +
Sbjct: 445 VEAARAAADFLLRTMRTPEG-RLWHRYRDGEAGIPGMLDDYAFLTWGLLDLYEATFEESY 503

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           L  A+ L +     F D   G ++ T  +  S+++R +E  D A PSGN+V+++NLVRL 
Sbjct: 504 LETALALTDQTLAHFWDPR-GVFYMTPDDGESLIVRPRETLDNALPSGNAVALMNLVRLG 562

Query: 702 SIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSV 761
            +   +    Y ++A+  +  F   +K        M  A D+   P  + +VL G     
Sbjct: 563 HMTGRT---VYEEHADAMIRFFSGPVKQQPPIFTGMLVAIDLAFGPIYE-LVLAGEPDDP 618

Query: 762 DFENMLAAAHASY 774
               ML   H  Y
Sbjct: 619 TLREMLRTIHRRY 631


>gi|453087339|gb|EMF15380.1| hypothetical protein SEPMUDRAFT_147282 [Mycosphaerella populorum
           SO2202]
          Length = 800

 Score =  465 bits (1196), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 269/642 (41%), Positives = 358/642 (55%), Gaps = 32/642 (4%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NR A   SPY+  H  NP  W  W  E    A++ +  +F+SIGYS CHWCHVM  ESF+
Sbjct: 76  NRCAESKSPYVRSHIDNPTAWQLWTPETLELAKETNRLLFVSIGYSACHWCHVMAHESFD 135

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGGT 221
           D  +A+LLN+ F+ +K+DREERPD+D+ YM ++QA  GGGGWPL+VF++P  L+P+ GGT
Sbjct: 136 DPRIAQLLNENFIPVKIDREERPDIDRQYMDFLQATNGGGGWPLNVFVTPGGLEPIFGGT 195

Query: 222 YFPPEDK--YGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASA----SSNK 275
           Y+P  ++    R GF+ I+ KV  AW ++     QS      QL E     +      N+
Sbjct: 196 YWPKRERAQQARTGFEDIILKVSTAWREQEQRCRQSAKDITRQLREFAQEGSIGGKDVNR 255

Query: 276 LPD--ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML----YHSKKLEDTG 329
             D  EL  + L    +     YD + GGFG APKFP PV I+ +L    Y +   E  G
Sbjct: 256 TDDDAELELDLLDDAFQHYKMRYDDKHGGFGGAPKFPTPVHIRPLLRVASYPATVREIVG 315

Query: 330 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 389
           +  E  E + M L TL+ MAKGGI D +G GF RYSV   W +PHFEKMLYD  QL  VY
Sbjct: 316 EE-ECIEARSMALMTLEKMAKGGIKDQIGHGFARYSVTRDWSLPHFEKMLYDNAQLLAVY 374

Query: 390 LDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 448
           LDA+ LTK   +  I +DI  YL    M    G I SAEDADS  T     K+EGA+YVW
Sbjct: 375 LDAYLLTKSPLFLEIVKDIATYLTSAPMQSELGGIHSAEDADSFPTINDKHKREGAYYVW 434

Query: 449 TSKEVEDILGEHAILFKEHYY-LKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 507
           T +E E +L E  +     Y+ +K  GN D  R  D   E   +N L    +++  A +L
Sbjct: 435 TLEEFEQVLSEEEVKVCAKYWNVKAEGNVD--RRHDAQGELIKQNTLCVSRETAELAEEL 492

Query: 508 GMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 566
            M  +     +   R+ L   R + RP P LDDK++ SWNGL I S ARA   L+  +  
Sbjct: 493 NMAEDDVKRAIDSGRQALLAYREANRPSPSLDDKIVTSWNGLAIGSLARAGAALREVS-- 550

Query: 567 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 626
                P  GS    Y+  A  AA  I+ HL+D  +  L+  +R GP +  GF DDYAF I
Sbjct: 551 -----PEAGSS---YVSAARKAALCIQNHLFDAMSGTLRRVYREGPGETQGFADDYAFFI 602

Query: 627 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 686
           SGLLDLYE    + +L  A  LQ TQ++LF D E  G+F+T    P +L+R K+  D AE
Sbjct: 603 SGLLDLYEATFDSDFLQLADTLQETQNKLFWDPEKYGFFSTPAHQPDILIRTKDAMDNAE 662

Query: 687 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLK 728
           PS N VS  NL RL S++     + Y + A  ++A FE  ++
Sbjct: 663 PSVNGVSASNLFRLGSLL---NDEEYSKMARRTVACFEVEIE 701


>gi|15896782|ref|NP_350131.1| hypothetical protein CA_C3546 [Clostridium acetobutylicum ATCC 824]
 gi|337738753|ref|YP_004638200.1| hypothetical protein SMB_G3587 [Clostridium acetobutylicum DSM
           1731]
 gi|384460264|ref|YP_005672684.1| hypothetical protein CEA_G3552 [Clostridium acetobutylicum EA 2018]
 gi|15026641|gb|AAK81471.1|AE007851_2 Highly conserved protein containing a domain related to cellulase
           catalitic domain and a thioredoxin domain [Clostridium
           acetobutylicum ATCC 824]
 gi|325510953|gb|ADZ22589.1| Conserved hypothetical protein [Clostridium acetobutylicum EA 2018]
 gi|336292984|gb|AEI34118.1| hypothetical protein SMB_G3587 [Clostridium acetobutylicum DSM
           1731]
          Length = 677

 Score =  465 bits (1196), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 251/612 (41%), Positives = 351/612 (57%), Gaps = 59/612 (9%)

Query: 95  SHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCH 154
           S + +K +NRL  E SPYLLQHAHNPV+W++W  EAF++A+  D PIFLSIGYSTCHWCH
Sbjct: 2   SETIHKSSNRLINEKSPYLLQHAHNPVNWYSWSPEAFSKAKSEDKPIFLSIGYSTCHWCH 61

Query: 155 VMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDL 214
           VME ESFED+ VA++LN  FVSIKVDREERPD+D++YM    A+ G GGWPL++ ++P+ 
Sbjct: 62  VMERESFEDDDVAEVLNRSFVSIKVDREERPDIDEIYMNVCTAITGSGGWPLTIVMTPEQ 121

Query: 215 KPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSN 274
           KP   GTY P  ++ G  G  ++L  ++  W + ++ L + G   +  L++    +A   
Sbjct: 122 KPFFAGTYIPKNNRMGMQGLISLLENIEYQWKENQNELVEIGDKIVSSLNKDRKTTAK-- 179

Query: 275 KLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEA 334
               EL +  L     Q   ++D  +GGFGS PKFP P  +  ++ +    +D       
Sbjct: 180 ----ELSEEVLEEAFSQFKYNFDRTYGGFGSEPKFPTPHNLIFLMRYFYASKD------- 228

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
                M L TL  M +GGI+DH+G GF RYSVD++W VPHFEKMLYD   LA  Y +AF 
Sbjct: 229 KTSLNMALKTLDTMYRGGIYDHIGYGFSRYSVDKKWLVPHFEKMLYDNALLAYAYTEAFK 288

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
           +TK+  Y  I   I  Y+ RDM    G  + AEDADS   EG     EG FYVW+ KE+ 
Sbjct: 289 ITKNDNYKNIVDQIFTYILRDMTSNEGGFYCAEDADS---EGV----EGKFYVWSKKEIN 341

Query: 455 DILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
           ++LGE     F +++ +  TGN            F+G+N+L     +     K+    E 
Sbjct: 342 NVLGEDDGKKFSKYFNVTDTGN------------FEGENIL-----NLIETEKIEFEDE- 383

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
               L  CR+KLFD R KR  P+ DDK++ SWNGL+I++ A   + LK+E          
Sbjct: 384 ---FLNSCRKKLFDYREKRIHPYKDDKILTSWNGLMIAALAFGGRSLKNEI--------- 431

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
                  Y+  AE A +FI   L D    RL   +R+G +   G+L DY+FLI GL++LY
Sbjct: 432 -------YINAAEKAVTFIFTKLID-ANGRLLSRYRHGEASIKGYLTDYSFLIWGLIELY 483

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           E    ++++  AI+L N   + F D +  G F    +   ++ R KE +DGA PSGNSVS
Sbjct: 484 EATYKSEYIEKAIKLNNDLIKYFWDDKNKGLFLYGSDSEELISRPKEIYDGAIPSGNSVS 543

Query: 694 VINLVRLASIVA 705
            +N +RL+ +  
Sbjct: 544 ALNFIRLSRLTG 555


>gi|301061221|ref|ZP_07202007.1| conserved hypothetical protein [delta proteobacterium NaphS2]
 gi|300444689|gb|EFK08668.1| conserved hypothetical protein [delta proteobacterium NaphS2]
          Length = 694

 Score =  465 bits (1196), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 272/681 (39%), Positives = 384/681 (56%), Gaps = 67/681 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L  E SPYLLQHA NPVDW+ WG+ AF +A+  D P+FLS+GY+TCHWCHVM  ESFE
Sbjct: 9   NALIHEKSPYLLQHAENPVDWYPWGKGAFLKAKNEDKPVFLSVGYATCHWCHVMAHESFE 68

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D   A++LND +VSIKVDREERPD+DK+YM+  QAL G GGWPLSVFL+P+  P   GTY
Sbjct: 69  DPETARILNDHYVSIKVDREERPDLDKIYMSVCQALTGRGGWPLSVFLTPERIPFFAGTY 128

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP     G  GF  +L K+   W + R+ L  +G    ++++E L  S     +   L  
Sbjct: 129 FPKIGHQGLIGFPELLLKLGKLWKEDRERLLTAG----DEITEHLRNSELGGSVEKSLDM 184

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---YHSKKLEDTGKSGEASEGQK 339
             L     QLS+S+D R+GGFG APKFP P ++  +L     SK   D           +
Sbjct: 185 EVLNKAGVQLSRSFDPRWGGFGGAPKFPSPHQLTFLLRRHVRSKNARDL----------E 234

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           MV  TLQ M +GG+ DH+G GFHRYSVDE+W  PHFEKMLYDQ  LA  Y +A+ +T   
Sbjct: 235 MVEKTLQSMRRGGLFDHIGYGFHRYSVDEKWFAPHFEKMLYDQALLAMAYTEAYQVTGKS 294

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG- 458
           FY+ + R+I  Y+ RDM  P G  +SAEDADS   EG     EG FY+WT KEV++ILG 
Sbjct: 295 FYARVAREIFTYVLRDMTSPEGGFYSAEDADS---EGV----EGLFYLWTPKEVQEILGT 347

Query: 459 EHAILFKEHYYLKPTGNCDLSR----MSDPHNEF-KGKNVLIELNDSSASASKLGMPLEK 513
           E A LF +++ ++  GN +  R    M +P + F +G+N                M +++
Sbjct: 348 ESADLFCDYFDIRERGNFEEGRSIPHMREPLSTFAEGRN----------------MGVKR 391

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
            +++L + R KLF  R KR  P  DDK++ SWNGL+I++  +  + L   A         
Sbjct: 392 LVSLLRQGREKLFSARQKRIHPLKDDKILTSWNGLMITALFKGYRALGDAA--------- 442

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
                  Y+  A+++  FI   L  E    L   +R G +   G+LDDYAFL+  L++ Y
Sbjct: 443 -------YVTAAQNSLQFILNTLRKEDGC-LIRRYREGETAHAGYLDDYAFLVWALIEGY 494

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           E       L  A+ L +T  +LF D E GG+F T  E+ +++ R ++  DGA PSGNSV+
Sbjct: 495 ESTFNPNHLKTAMVLTHTMLDLFWDSENGGFFFTGRENETLIARSRDAQDGAIPSGNSVA 554

Query: 694 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVV 753
            + L++L  +   +    + + A   +  F  ++     A   M  A D +  P+++ VV
Sbjct: 555 ALTLLQLGRLTGDTS---FEEKANALMQAFSGQMDAYPSAHTQMLQALDFVIGPTQE-VV 610

Query: 754 LVGHKSSVDFENMLAAAHASY 774
           + G +   + + ML     ++
Sbjct: 611 IAGTRHDRNTDVMLKVIQQNF 631


>gi|386760793|ref|YP_006234010.1| hypothetical protein MY9_4222 [Bacillus sp. JS]
 gi|384934076|gb|AFI30754.1| hypothetical protein MY9_4222 [Bacillus sp. JS]
          Length = 689

 Score =  465 bits (1196), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 253/606 (41%), Positives = 353/606 (58%), Gaps = 53/606 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL AE SPYLLQHAHNPVDWF WGEEAF +A+  + P+ +SIGYSTCHWCHVM  ESFE
Sbjct: 8   NRLIAEKSPYLLQHAHNPVDWFPWGEEAFEKAKCENKPVLVSIGYSTCHWCHVMAHESFE 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE +A+LLN+ FV+IKVDREERPDVD VYM   Q + G GGWPL+VF++PD KP   GTY
Sbjct: 68  DEEIARLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLNVFITPDQKPFYAGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP   K+ RPGF  +L  + + +   R+ +      A + L    +A     K  + L +
Sbjct: 128 FPKTSKFNRPGFVDVLEHLSETFANDREHVENIAENAAKHLQTKTAA-----KTGEGLSE 182

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
           +A+    +QL+  +D+ +GGFG APKFP P    M++Y  +   +TG+        K   
Sbjct: 183 SAIHRTFQQLASGFDTIYGGFGQAPKFPMP---HMLMYLLRYYHNTGQENALYNVTK--- 236

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  MA GGI+DH+G GF RYS D+ W VPHFEKMLYD   L   Y +A+ +T++  Y 
Sbjct: 237 -TLDSMANGGIYDHIGYGFARYSTDDEWLVPHFEKMLYDNALLLTAYTEAYQVTQNSRYK 295

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHA 461
            IC  I+ +++R+M    G  FSA DAD   TEG    +EG +YVW+ +E+   LG E  
Sbjct: 296 EICEQIITFVQREMTHEDGSFFSALDAD---TEG----EEGKYYVWSREEILKTLGDELG 348

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGMPLEKYLNILG 519
            L+ + Y +   GN            F+GKN+  LI        A   G+  E+    L 
Sbjct: 349 TLYCQVYDITEEGN------------FEGKNIPNLIHSKREQIKADA-GLTEEELRLKLE 395

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
           + R++L   R +R  PH+DDKV+ SWN L+I+  A+A+K+                 +  
Sbjct: 396 DARQRLLKTREERTYPHVDDKVLTSWNALMIAGLAKAAKVY----------------EEP 439

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
           +Y+ +A+ A +FI  HL  +   R+   +R+G  K  GF+DDYAFL+   LDLYE     
Sbjct: 440 KYLSLAQDAITFIENHLIIDG--RVMVRYRDGEVKNKGFIDDYAFLLWAYLDLYEASFDL 497

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
            +L  A +L +    LF D E GG++ +  +  ++++R KE +DGA PSGNSV+ + L+R
Sbjct: 498 SYLQKAKKLTDDMIGLFWDEEHGGFYFSGHDAEALIVREKEVYDGAVPSGNSVAAVQLLR 557

Query: 700 LASIVA 705
           L  +  
Sbjct: 558 LGQVTG 563


>gi|153939114|ref|YP_001390416.1| hypothetical protein CLI_1150 [Clostridium botulinum F str.
           Langeland]
 gi|384461487|ref|YP_005674082.1| hypothetical protein CBF_1122 [Clostridium botulinum F str. 230613]
 gi|152935010|gb|ABS40508.1| conserved hypothetical protein [Clostridium botulinum F str.
           Langeland]
 gi|295318504|gb|ADF98881.1| conserved hypothetical protein [Clostridium botulinum F str.
           230613]
          Length = 680

 Score =  465 bits (1196), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 265/676 (39%), Positives = 369/676 (54%), Gaps = 64/676 (9%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRL  E SPYLLQHAHNPVDW+ WGEEAF +A+  D P+FLSIGYSTCHWCHVME ESF
Sbjct: 6   TNRLINEKSPYLLQHAHNPVDWYPWGEEAFEKAKIEDKPVFLSIGYSTCHWCHVMERESF 65

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           EDE VA++LN  F+SIKVDREERPD+D +YM + QA  G GGWPL++ ++PD  P   GT
Sbjct: 66  EDEEVAEVLNKNFISIKVDREERPDIDSIYMNFCQAYTGSGGWPLTILMTPDKNPFFAGT 125

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP   KY  PG   ILR + + W + ++ + +S    +EQ+          N    EL 
Sbjct: 126 YFPKWGKYNVPGIMDILRSISNLWREDKNKVLESSNRILEQIER-----FQDNHREGELE 180

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQK 339
           +  +    + L  ++D+++GGFG+ PKFP    I  +L  Y+ KK           +   
Sbjct: 181 EYIIEEAIKTLLDNFDNQYGGFGTYPKFPTAHYILFLLRYYYFKK---------DKKILD 231

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           +V  TL  M KGGI DH+G GF RYS D +W VPHFEKMLYD   L+  Y +A+  TK+ 
Sbjct: 232 IVNKTLTSMYKGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALLSMAYTEAYEATKNP 291

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG- 458
            +  I   IL+Y+++ M    G  +SAEDADS   EG     EG FY+WT +E+ DILG 
Sbjct: 292 LFKDITEKILNYVKKSMTSEKGGFYSAEDADS---EGV----EGKFYLWTKEEIMDILGE 344

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
           E   L+ + Y +   GN            F+ KN+   +N            LEK     
Sbjct: 345 EEGELYCKIYDITSKGN------------FENKNIANLINTDLKIVDNNKDKLEK----- 387

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
              R KLF+ R KR  P+ DDK++ SWN L+I +F++A + LK++               
Sbjct: 388 --IREKLFEYREKRIHPYKDDKILTSWNALMIVAFSKAGRSLKND--------------- 430

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
             Y+E+A+ +A+FI  +L DE+   L    R G     GF+DDYAF +  L++LYE    
Sbjct: 431 -NYIEIAKKSANFIIENLMDEKG-TLYARIREGERGNEGFIDDYAFFLWALIELYEASFD 488

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
             +L  +IE+ ++  +LF  +E GG++  +     +L+R KE +DGA PSGN+V+ + L 
Sbjct: 489 IYYLEKSIEVADSMIDLFWHKENGGFYLYSKNSEKLLVRPKEIYDGATPSGNAVASLALN 548

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHK 758
            L  I      D Y+   +     F T +K   M   L    A M ++   K + L   +
Sbjct: 549 LLYYITG---EDRYKYLVDKQFKFFATNIKSGPM-YHLFSVMAYMYNILPVKEITLAYRE 604

Query: 759 SSVDFENMLAAAHASY 774
              DF   +   +  Y
Sbjct: 605 KDEDFYKFINEVNNRY 620


>gi|406830400|ref|ZP_11089994.1| hypothetical protein SpalD1_02134 [Schlesneria paludicola DSM
           18645]
          Length = 883

 Score =  465 bits (1196), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 267/641 (41%), Positives = 361/641 (56%), Gaps = 60/641 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYLL HAHNPVDW+ WG EAF +A+K    IFLS+GYS+C+WCHVME + F 
Sbjct: 68  NRLAKETSPYLLLHAHNPVDWYPWGPEAFEKAKKEGKMIFLSVGYSSCYWCHVMERKVFM 127

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALY------GGGGWPLSVFLSPDLKP 216
           +E +AK LN  FV IKVDREERPDVD +YMT +Q  Y        GGWPLS+FL+PD KP
Sbjct: 128 NEAIAKTLNQDFVCIKVDREERPDVDDIYMTALQVYYQAIKAPASGGWPLSMFLTPDGKP 187

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKL 276
           + GGTYFPPE   G  GF  IL K+ D W    + +  +      +    +    S    
Sbjct: 188 IAGGTYFPPEATEGNEGFPAILAKLTDLWKNNHEQMVGNADIVANETRRLMRPKLSLK-- 245

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFG------SAPKFPRPVE---IQMMLYHSKKLED 327
           P E+    +      ++ S+D  FGG          PKFP P +   +Q MLY S   ED
Sbjct: 246 PVEVNAKLVESVFAAVAGSFDPEFGGIDFNPNRPDGPKFPTPTKLSFLQQMLYRSPN-ED 304

Query: 328 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 387
                      K++  TL  +A GGI DHVGGGFHRYSVD RW VPHFEKMLYDQ QLA+
Sbjct: 305 V---------SKLLDVTLLQLACGGIRDHVGGGFHRYSVDRRWDVPHFEKMLYDQAQLAD 355

Query: 388 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 447
           VY +A+  +    +  +  ++ +++ RD+  P G  +SA D   AET G     EG FYV
Sbjct: 356 VYAEAYRTSHQPLHKQVAEELFEFVARDLTAPEGGFYSAID---AETNGI----EGEFYV 408

Query: 448 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 507
           W + E++ ILG  A  FKE Y +K   + +   +     +   K   I+   + ASA+  
Sbjct: 409 WDATEIDHILGRSAAAFKEAYRVKELSDFEHGNVLRLSQKRLPKAEAIKAVATPASAT-- 466

Query: 508 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 567
           G   +++ +     R+KL +VR+KR +P  D+K++  WNGL+I ++ARA         +A
Sbjct: 467 GSEKDEFTS----SRQKLLEVRNKRKKPLRDEKLLTCWNGLMIGAYARA---------AA 513

Query: 568 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 627
             N P       EY+E+A  AA FI     D Q  RL H++ +G +K   +LDDYAFLI 
Sbjct: 514 PLNHP-------EYVEIAARAAEFILTKARDSQG-RLLHTYASGQAKLNAYLDDYAFLID 565

Query: 628 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 687
           GL+ LY+     KWL  A +LQ+ Q  LFLD   GG+F T+     +L R K   DG  P
Sbjct: 566 GLISLYDATEDVKWLKVAKQLQDDQLRLFLDESNGGFFFTSHHHEELLTRTKNCFDGVVP 625

Query: 688 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLK 728
           +GNSVS  NL+RLA++   +K   Y   A  ++ +F + ++
Sbjct: 626 AGNSVSARNLIRLAAL---TKISSYADEARATVELFASNIE 663


>gi|196232510|ref|ZP_03131362.1| protein of unknown function DUF255 [Chthoniobacter flavus Ellin428]
 gi|196223272|gb|EDY17790.1| protein of unknown function DUF255 [Chthoniobacter flavus Ellin428]
          Length = 428

 Score =  465 bits (1196), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 228/397 (57%), Positives = 275/397 (69%), Gaps = 10/397 (2%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRLA E SPYLLQH HNPVDW+ WGEEAF +AR+   PIFLSIGYSTCHWCHVM  ESF
Sbjct: 26  TNRLAHEKSPYLLQHQHNPVDWYPWGEEAFEKARREHKPIFLSIGYSTCHWCHVMAHESF 85

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           E+   AKL+N+ FV+IKVDREERPDVD+VYMTYVQA  G GGWP+SVFL+PDLKP  GGT
Sbjct: 86  ENPATAKLMNENFVNIKVDREERPDVDRVYMTYVQATTGSGGWPMSVFLTPDLKPFYGGT 145

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE-ALSASASSNKLPDEL 280
           YFPPED+YGRPGF TIL+++ +AW    + +  +   AI  L++   S  A S  +  E 
Sbjct: 146 YFPPEDRYGRPGFPTILQRLAEAWKDDHEKVLGAANDAIRALNDYTASGPAQSTAVGKE- 204

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
              A+ L   QL++S+D   GGFG APKFPRPV +  + +   +     + G+A+ G  M
Sbjct: 205 ---AIALALNQLTRSFDDELGGFGGAPKFPRPVTLNFLFHVFAREGHESRDGKAALG--M 259

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
            L TLQ MA GG+HDH+GGGFHRYSVD+ WHVPHFEKMLYDQ QLA+ YLDAF +T D  
Sbjct: 260 ALITLQKMADGGMHDHLGGGFHRYSVDKFWHVPHFEKMLYDQAQLASSYLDAFQVTHDTV 319

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
           Y    RDI DY+RRDM   GG  +SAEDADS   +G     EGAFYVWT  E+  +LGE 
Sbjct: 320 YERTARDIFDYVRRDMTDAGGGFYSAEDADSLLEKGKPEHSEGAFYVWTKDEIVHVLGED 379

Query: 461 -AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 496
            A +F   Y +   GN      SDP  EF+GKN+LI+
Sbjct: 380 AAAVFDRVYGVDAEGNA--PEGSDPQGEFRGKNILIQ 414


>gi|170761713|ref|YP_001786452.1| thymidylate kinase [Clostridium botulinum A3 str. Loch Maree]
 gi|169408702|gb|ACA57113.1| thymidylate kinase [Clostridium botulinum A3 str. Loch Maree]
          Length = 682

 Score =  464 bits (1195), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 266/678 (39%), Positives = 370/678 (54%), Gaps = 64/678 (9%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K TNRL  E SPYLLQHA+NPVDW+ WGEEAF +A+  D P+FLSIGYSTCHWCHVME E
Sbjct: 6   KKTNRLIKEKSPYLLQHAYNPVDWYPWGEEAFEKAKIEDKPVFLSIGYSTCHWCHVMERE 65

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFEDE VA+ LN  F+SIKVDREERPDVD +YM + QA  G GGWPL++ ++PD KP   
Sbjct: 66  SFEDEEVAEALNKNFISIKVDREERPDVDNIYMNFCQAYTGSGGWPLTIIMTPDKKPFFA 125

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           GTYFP   KY  PG   +LR + + W + ++ + +S     EQ+          N    E
Sbjct: 126 GTYFPKWGKYNIPGIMDVLRSISNLWREDKNKILESSNRISEQIER-----FQDNHREGE 180

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEG 337
           L +  +    + L  ++D+++GGFG+ PKFP    I  +L  Y+ KK           + 
Sbjct: 181 LEEYIIEEAIKTLLDNFDNQYGGFGTYPKFPTAHYILFLLRYYYFKK---------DKKI 231

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
             ++  TL  M KGGI DH+G GF RYS D +W VPHFEKMLYD   L+  Y +A+  TK
Sbjct: 232 LDVINKTLTNMYKGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALLSMAYTEAYEATK 291

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           +  +  I   IL+Y+++ M    G  +SAEDADS   EG     EG FY+WT +E+ DIL
Sbjct: 292 NPLFKDITEKILNYVKKSMTSEEGGFYSAEDADS---EGV----EGKFYLWTKEEIMDIL 344

Query: 458 G-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
           G E   L+ + Y +   GN            F+ KN+   +N    +       LEK   
Sbjct: 345 GEEEGELYCKIYDITSKGN------------FENKNIANLINTDLKTVDNNKDKLEK--- 389

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
                R KLF+ R KR  PH DDK++ SWN L+I +F++A + LK++             
Sbjct: 390 ----IREKLFEYREKRIHPHKDDKILTSWNALMIVAFSKAGRSLKND------------- 432

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
               Y+E+A+ +A+FI  +L DE+   L    R G     GF+DDYAF +  L++LYE  
Sbjct: 433 ---NYIEIAKKSANFIIENLMDEKG-TLYARIREGERGNEGFIDDYAFFLWALIELYEAS 488

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
               +L  +IE+ ++  +LF  +E GG++  +     +L+R KE +DGA PSGN+V+ + 
Sbjct: 489 FDIYYLEKSIEVADSMIDLFWHKESGGFYLYSKNSEKLLVRPKEIYDGATPSGNAVASLA 548

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVG 756
           L  L  I      D Y+   +     F + +K   M   L    A M +V   K + L  
Sbjct: 549 LNLLYYITG---EDRYKDLVDKQFKFFASNIKSGPM-YHLFSVMAYMYNVLPVKEITLAY 604

Query: 757 HKSSVDFENMLAAAHASY 774
            +   DF   +   +  Y
Sbjct: 605 REKDEDFYKFINEVNNRY 622


>gi|296330011|ref|ZP_06872495.1| hypothetical protein BSU6633_02824 [Bacillus subtilis subsp.
           spizizenii ATCC 6633]
 gi|305676735|ref|YP_003868407.1| hypothetical protein BSUW23_20330 [Bacillus subtilis subsp.
           spizizenii str. W23]
 gi|296153050|gb|EFG93915.1| hypothetical protein BSU6633_02824 [Bacillus subtilis subsp.
           spizizenii ATCC 6633]
 gi|305414979|gb|ADM40098.1| conserved hypothetical protein [Bacillus subtilis subsp. spizizenii
           str. W23]
          Length = 695

 Score =  464 bits (1194), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 251/606 (41%), Positives = 352/606 (58%), Gaps = 53/606 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL AE SPYLLQHAHNPV+WF WGEEAF +A++ + P+ +SIGYSTCHWCHVM  ESFE
Sbjct: 14  NRLIAEKSPYLLQHAHNPVEWFPWGEEAFEKAKRENKPVLVSIGYSTCHWCHVMAHESFE 73

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE +A+LLN+ FV+IKVDREERPDVD VYM   Q + G GGWPL+VF++PD KP   GTY
Sbjct: 74  DEEIARLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLNVFITPDQKPFYAGTY 133

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP   K+ RPGF  +L  + + +   R+ +      A + L    +A +        L +
Sbjct: 134 FPKTSKFNRPGFVDVLEHLSETFANDREHVEDIAENAAKHLQTKTAAKSGEG-----LSK 188

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
           +A+    +QL+  +D+ +GGFG APKFP P    M++Y  +   +TG+        K   
Sbjct: 189 SAIHRTFQQLANGFDTIYGGFGQAPKFPMP---HMLMYLLRYDHNTGQENALYNVTK--- 242

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  MA GGI+DH+G GF RYS D+ W VPHFEKMLYD   L   Y +A+ +T++  Y 
Sbjct: 243 -TLDSMANGGIYDHIGYGFARYSTDDEWLVPHFEKMLYDNALLLTAYTEAYQVTQNSRYK 301

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-A 461
            IC  I+ +++R+M    G  FSA DAD   TEG    +EG +YVW+ +E+   LG+   
Sbjct: 302 EICEQIITFIQREMTHEDGSFFSALDAD---TEG----EEGKYYVWSKEEILKTLGDDLG 354

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGMPLEKYLNILG 519
           +L+ + Y +   GN            F+GKN+  LI        A   G+  E+    L 
Sbjct: 355 MLYCQVYDITEEGN------------FEGKNIPNLIHTMQEQIKADA-GLTKEELSLKLE 401

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
             R++L   R +R  PH+DDKV+ SWN L+I+  A+A+K+ +                  
Sbjct: 402 NARQQLLKTREERTYPHVDDKVLTSWNALMIAGLAKAAKVYQ----------------EP 445

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
           +Y+ +AE A +FI   L  +   R+   +R+G  K  GF+DDYAFL+   LDLYE     
Sbjct: 446 KYLSLAEDAITFIENQLIIDG--RVMVRYRDGEVKNKGFIDDYAFLLWAYLDLYEASFDL 503

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
            +L  A +L +    LF D E GG++ T  +  ++++R KE +DGA PSGNSV+ + L+R
Sbjct: 504 SYLQKAKKLTDDMIGLFWDEEHGGFYFTGHDAEALIVREKEVYDGAVPSGNSVAAVQLLR 563

Query: 700 LASIVA 705
           L  +  
Sbjct: 564 LGQVTG 569


>gi|46446752|ref|YP_008117.1| hypothetical protein pc1118 [Candidatus Protochlamydia amoebophila
           UWE25]
 gi|46400393|emb|CAF23842.1| conserved hypothetical protein [Candidatus Protochlamydia
           amoebophila UWE25]
          Length = 718

 Score =  464 bits (1194), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 261/616 (42%), Positives = 360/616 (58%), Gaps = 54/616 (8%)

Query: 101 HTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVES 160
           +TNRL  E SPYLLQHAHNPVDW+ WGEEAF  A+ +D PIFLSIGY+TCHWCHVME ES
Sbjct: 37  YTNRLIHEKSPYLLQHAHNPVDWYPWGEEAFHIAKTQDKPIFLSIGYATCHWCHVMERES 96

Query: 161 FEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALY-GGGGWPLSVFLSPDLKPLMG 219
           FED  VA  +N  FVSIKVDREE P+VD +YM + Q++  G  GWPL+V L+PDL+P   
Sbjct: 97  FEDIEVADSMNQTFVSIKVDREELPEVDSLYMEFSQSMMAGAAGWPLNVILTPDLQPFFA 156

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAW-DKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            TY P    +G  G   +++++ + W  ++R+ +       +E  S+A+  +     +PD
Sbjct: 157 TTYLPSHSSHGMMGLIDLIQRIAELWSSEEREKIITQAEKIVEVFSKAVHTTGED--IPD 214

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
           E     + + A+ L K  D  +GG   APKFP   +   ML +   ++D       S   
Sbjct: 215 E---EQISITADLLYKMADPTYGGIKGAPKFPIGYQYSFMLRYYANMKD-------SRAL 264

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
            +V  TL  + +GGI+DH+GGGF RYS+DE+W VPHFEKMLYD   LA  YL+A+ LTK 
Sbjct: 265 FLVERTLDMLHRGGIYDHLGGGFSRYSIDEKWLVPHFEKMLYDNAILAQSYLEAWQLTKK 324

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             Y  + ++IL+Y+ RDM    G  +SAEDADS   EG     EG FY W  +EV++ILG
Sbjct: 325 NLYKEVAQEILNYILRDMTYSDGGFYSAEDADS---EG----HEGFFYTWKEEEVKEILG 377

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
           +H+ LF E+Y +   GN            F+G+N+L    +    ASK    +++   I 
Sbjct: 378 DHSQLFCEYYDITAEGN------------FEGRNILHTPLNLEEFASKHQQDIDQLRIIF 425

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
              R+KL+  R KR  P  DDK++ SWNGL+I SFA A         +  F+ P+     
Sbjct: 426 DNQRKKLWSAREKRIHPLKDDKILSSWNGLMIYSFAEA---------AFTFDCPL----- 471

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
             Y+E A  AA FI+  L+  Q  +L   +R G +     LD+YAF+I G L L+E  +G
Sbjct: 472 --YLEAAVKAARFIKNKLWKNQ--KLLRRWREGQAMFQAGLDEYAFMIKGALSLFEANAG 527

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
           T+WL WAIE+     + +   E G ++ T G D ++LLR  +  DGAEPSGN+V   NL+
Sbjct: 528 TEWLEWAIEMATLLKDQY-KAEEGAFYQTDGGDKNLLLRKCQFSDGAEPSGNAVHCENLL 586

Query: 699 RLASIVAGSKSDYYRQ 714
           RL  +   ++ DY  Q
Sbjct: 587 RLYQLT--NEEDYLAQ 600


>gi|253699928|ref|YP_003021117.1| hypothetical protein GM21_1299 [Geobacter sp. M21]
 gi|251774778|gb|ACT17359.1| protein of unknown function DUF255 [Geobacter sp. M21]
          Length = 750

 Score =  464 bits (1193), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 270/670 (40%), Positives = 379/670 (56%), Gaps = 51/670 (7%)

Query: 89  RTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYS 148
           RT   T     K+ NRL  E SPYLLQHAHNPV+WF WGEEAF  AR+ + P+ +SIGY+
Sbjct: 38  RTRHLTPGGEAKYMNRLFLETSPYLLQHAHNPVNWFPWGEEAFDLARRLNRPVLVSIGYA 97

Query: 149 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 208
           TCHWCHVME ESFEDE +A+ LN  F++IKVDREERPDVD VYMT V A+   GGWPL++
Sbjct: 98  TCHWCHVMEEESFEDEEIARFLNANFIAIKVDREERPDVDTVYMTAVHAMGMQGGWPLNI 157

Query: 209 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 268
           F +P+ KP  GGTYFPP D  G  GF ++LR++++ + +  D +  +G     QL+EA+ 
Sbjct: 158 FATPERKPFYGGTYFPPSDYAGGIGFLSLLRRIRETYQQAPDRVTHAGL----QLTEAIR 213

Query: 269 ASASSNKLPDELPQNALRL--CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 326
              +   +  E P+  + L    E   + +D++ GG   APKF         L     L 
Sbjct: 214 GILAP--MGGEPPEKEISLERVIEAYQERFDAKNGGVVGAPKF------PSSLPLGLLLR 265

Query: 327 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 386
           D  + GE +    M  +TL+ MA GGI+D  GGGFHRY+ D  W +PHFEKMLYD  +LA
Sbjct: 266 DYLRRGEKN-SLFMAQYTLRRMAAGGIYDQAGGGFHRYATDSTWLIPHFEKMLYDNARLA 324

Query: 387 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 446
             YL+ +  T D  ++ + R+IL YL+RDM+ P G  +SA DADS    G   ++EG F+
Sbjct: 325 AAYLEGYQATGDRHFAQVAREILRYLQRDMMSPEGAFYSATDADSLTESG--HREEGIFF 382

Query: 447 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 505
            WT +E++  LG E A +    Y +   GN            F+G+++L         A 
Sbjct: 383 TWTPEELDAALGAERARVVAACYGVTDEGN------------FEGRSILHREKSMQHLAE 430

Query: 506 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 565
           +L +P E+   +L E R +L+  R +RP P  D+K++ SWNGL IS+FAR   +L + A 
Sbjct: 431 ELMLPKEELERLLDEAREELYLARQRRPLPLRDEKILASWNGLAISAFARGGLVLNAPA- 489

Query: 566 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 625
                           ++ A  AA+F+  ++  ++  RL HS++ G +K  GFLDDYAF 
Sbjct: 490 ---------------LLDTARGAANFMLENMMSQE--RLCHSYQEGEAKGEGFLDDYAFF 532

Query: 626 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 685
           I+GL+DL+E      WL  A+E      E F D E GG+F T      ++ R K  +DG 
Sbjct: 533 IAGLIDLFEATGELPWLKRALEQARQVQEQFEDSETGGFFMTGPHHEELISREKPAYDGV 592

Query: 686 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 745
            PSGNSV ++NL+RL ++            A+ +L  F T+L     A+  M  A D L 
Sbjct: 593 IPSGNSVMIMNLLRLNALTGEQGMP---DQAQRALDAFSTQLASAPTALSEMLLALDYLQ 649

Query: 746 VPSRKHVVLV 755
              R+ V++ 
Sbjct: 650 DVPREIVIVA 659


>gi|163782790|ref|ZP_02177786.1| hypothetical protein HG1285_15681 [Hydrogenivirga sp. 128-5-R1-1]
 gi|159881911|gb|EDP75419.1| hypothetical protein HG1285_15681 [Hydrogenivirga sp. 128-5-R1-1]
          Length = 697

 Score =  464 bits (1193), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 279/664 (42%), Positives = 381/664 (57%), Gaps = 44/664 (6%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
            +  NRL  E SPYL QHA+NPVDW+ WGEEAF +A + D P+FLSIGYSTCHWCHVME 
Sbjct: 3   KRKPNRLIKEKSPYLQQHAYNPVDWYPWGEEAFEKAEREDKPVFLSIGYSTCHWCHVMER 62

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE +A++LN+ +V IKVDREERPDVD VYM+  Q + G GGWPL+V ++PD KP  
Sbjct: 63  ESFEDEEIARILNENYVPIKVDREERPDVDSVYMSVCQMMTGSGGWPLTVIMTPDKKPFF 122

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFP E  YGRPG + IL ++ + W   R    Q    A EQ+ +AL+     + + +
Sbjct: 123 AGTYFPKEGMYGRPGLRDILLRIAELWRNDR----QKVLTAAEQVVDALAKGEEESYIGE 178

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
            L ++ L     +L  +YD  +GGFG+APKFP P  +  +L + ++   TG +G+A E  
Sbjct: 179 RLDESILHKGFAELYHTYDEAYGGFGNAPKFPIPHNLMFLLRYYRR---TG-NGKALE-- 232

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
            MV  TL+ M  GGI DHVG GFHRYS D  W +PHFEKMLYD   L  VY +AF  T D
Sbjct: 233 -MVKHTLKKMRLGGIWDHVGFGFHRYSTDREWLLPHFEKMLYDNALLMLVYTEAFQATGD 291

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
            F++ +  +I +YL+RDM+ P G  +SAEDADS   EG    +EG FY WT  E+E++L 
Sbjct: 292 EFFAQVVEEIAEYLQRDMLSPEGAFYSAEDADS---EG----EEGKFYTWTLAELEELLT 344

Query: 459 EHAILFKEHYY-LKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
           E  +      + +   GN     + +      GKNVL    +    A +LG   +     
Sbjct: 345 EEELGIALRLFGIAEEGNF----LEEATRRKVGKNVLHMKKELEKYAEELGYEPDVLKQK 400

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
           L E R KLF  R KR RP  D+KV+  WNGL I++F++A                 V   
Sbjct: 401 LEEIRSKLFKRREKRVRPLRDEKVLTDWNGLAIAAFSKAG----------------VALG 444

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGS 637
           RK+++ VA+  A F+   + D++  +L H ++ G +  P FL+DYA+LI GL++LY+   
Sbjct: 445 RKDFLAVAKRTADFLLNTMVDDEG-KLLHRYKEGEAGIPAFLEDYAYLIWGLMELYQGSF 503

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
             ++L  A EL +   E F D E  G++ T      VL+R KE +DGA PSGNSV   NL
Sbjct: 504 EGEYLKRAKELTDFALEHFWDEENLGFYQTPDFGERVLVRKKEIYDGATPSGNSVMAYNL 563

Query: 698 VRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGH 757
           VRL  ++   +   Y + A+ +L  F   +     A      A D+L V     +V VG 
Sbjct: 564 VRLGRLLGLQE---YERRADQTLNAFSQVIASFPGAHTFSLLALDIL-VKGSFELVAVGD 619

Query: 758 KSSV 761
           +   
Sbjct: 620 REEA 623


>gi|326203005|ref|ZP_08192872.1| glycoside hydrolase family 76 [Clostridium papyrosolvens DSM 2782]
 gi|325987082|gb|EGD47911.1| glycoside hydrolase family 76 [Clostridium papyrosolvens DSM 2782]
          Length = 672

 Score =  464 bits (1193), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 272/665 (40%), Positives = 369/665 (55%), Gaps = 63/665 (9%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           S +K+TN+L  E SPYLLQHAHNPVDW+ WG EAFA A   D PIFLSIGYSTCHWCHVM
Sbjct: 2   SEHKYTNKLIHEKSPYLLQHAHNPVDWYPWGPEAFARAVSEDKPIFLSIGYSTCHWCHVM 61

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
           E ESFEDE VA +LN  F+ IKVDREERPD+D +YM+  Q L G GGWPL+VFL+PD +P
Sbjct: 62  ERESFEDEEVAHILNRDFICIKVDREERPDIDSIYMSVCQTLTGHGGWPLTVFLTPDRQP 121

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKL 276
              GTYFP ++  G  G  ++L  VK+AWD KR+ L +S    IE +S   S+  +    
Sbjct: 122 FYAGTYFPKDNSKGSIGLMSLLDSVKEAWDLKRESLLESAKNIIEHVSHEESSDETI--- 178

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
              + ++ +    +    ++D ++GGFG++PKFP P  +  +L    +   T K   A E
Sbjct: 179 ---ISKDIIHEAFKHFKYNFDIKYGGFGTSPKFPSPHTLLFLL----RYWYTEKEPFALE 231

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
              MV  TL+ M  GGI DH+G GF RYS D++W VPHFEKMLYD   LA  Y +A+S T
Sbjct: 232 ---MVEKTLESMKNGGIFDHIGFGFSRYSTDKKWLVPHFEKMLYDNALLAIAYGEAYSAT 288

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
            +  Y    R ILDY++RDM    G  +SAEDADS   EG     EG FY+W+ +EV  +
Sbjct: 289 GNKNYEETSRQILDYVQRDMSSQLGAFYSAEDADS---EGF----EGKFYIWSQEEVMKV 341

Query: 457 LGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGMPLEKY 514
           LG+     KE+        C+L  ++ P   F+G N+  LIE    S             
Sbjct: 342 LGQKD--GKEY--------CNLFDIT-PSGNFEGLNIPNLIETGALSQQQKSFA------ 384

Query: 515 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 574
                ECR+KLF+ R KR  P+ DDKV+ SWNGL+I++ A   +I   E           
Sbjct: 385 ----EECRKKLFNHREKRVHPYKDDKVLTSWNGLMIAAMAYCGRIFGEE----------- 429

Query: 575 GSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE 634
                 Y+E A+    FI + L      RL   +R+G +  P +L+DYAFL+ GLL+LYE
Sbjct: 430 -----RYIETAKRCVDFIYKKLI-RTDGRLLARYRDGEAMFPAYLEDYAFLVWGLLELYE 483

Query: 635 FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
               T +L  A++L +    LF +      F    +   ++ R +E +DGA PSGNSV+ 
Sbjct: 484 ATFTTIYLKRALKLTDAMLNLFGENNSAALFLYGHDSEQLISRPRESYDGAIPSGNSVAA 543

Query: 695 INLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVL 754
           +NL+RLA I    +   Y   A+  +  F  ++K        M  +       +   +V+
Sbjct: 544 MNLLRLARITGHHE---YENRAKAIMDFFNNQVKAAPTGHSYMLSSYMYSVSDNSSEIVI 600

Query: 755 VGHKS 759
            G  S
Sbjct: 601 TGENS 605


>gi|451344787|ref|YP_007443418.1| hypothetical protein KSO_000140 [Bacillus amyloliquefaciens IT-45]
 gi|449848545|gb|AGF25537.1| hypothetical protein KSO_000140 [Bacillus amyloliquefaciens IT-45]
          Length = 689

 Score =  464 bits (1193), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 270/685 (39%), Positives = 382/685 (55%), Gaps = 58/685 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           N   NRL AE SPYLLQHAHNPV+W  WGEEAF +A++ + P+ +SIGYSTCHWCHVM  
Sbjct: 4   NSTPNRLIAEKSPYLLQHAHNPVNWHPWGEEAFEKAKRENKPVLVSIGYSTCHWCHVMAH 63

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE +A +LND F+++KVDREERPDVD VYM   Q + G GGWPL+VF++PD KP  
Sbjct: 64  ESFEDEEIAGMLNDKFIAVKVDREERPDVDSVYMRICQLMTGQGGWPLNVFVTPDQKPFY 123

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFP   K+ RPGF  +L  + + +   R          +E ++E  +A       P 
Sbjct: 124 AGTYFPKTSKFNRPGFIDVLEHLSETFANDRQ--------HVEDIAENAAAHLEVKVHPA 175

Query: 279 E--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
           E  L + A+     QL+  +D+ +GGFG APKFP P    M+++  +    TGK  +A  
Sbjct: 176 EGMLGEQAVHDTYRQLAGGFDTVYGGFGQAPKFPMP---HMLMFLLRYYSYTGKE-QALA 231

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
           G   V  TL  MA GGI DH+G GF RYS D  W VPHFEKMLYD   L   Y +A+ +T
Sbjct: 232 G---VTKTLDGMANGGIFDHIGFGFARYSTDNEWLVPHFEKMLYDNALLLTAYTEAYQVT 288

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
            +  Y  I   I+ +++R+M+   G  FSA DAD   TEG    +EG +Y+W+ KE+ ++
Sbjct: 289 GNERYKQIAMQIVTFIQREMMHEDGSFFSALDAD---TEG----REGKYYIWSKKEIMNL 341

Query: 457 LG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
           LG E   L+ + Y +   GN +   +  PH  F  +  ++E  ++  +  +L   LE   
Sbjct: 342 LGDELGPLYCKVYNITDQGNFEGENI--PHLIFTRREAILE--ETGLTGHELAERLE--- 394

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
               E R KL + R  R  PH DDKV+ SWN L+I+  A+A+K+         F+ P   
Sbjct: 395 ----EARTKLLEARENRSYPHTDDKVLTSWNALMIAGLAKAAKV---------FHEP--- 438

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
               +++ +AE+A  F+ RHL  +   R+   +R G  K  GF+DDYAFLI G L+LYE 
Sbjct: 439 ----DFLSMAETAIRFLERHLMPDG--RVMVRYREGEVKNKGFIDDYAFLIWGYLELYEA 492

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
           G    +L  A  L  +  ELF D   GG+F T  +  ++L+R KE +DGA PSGNS + +
Sbjct: 493 GFHPSYLQKAKTLCTSMLELFWDERHGGFFFTGNDAETLLVREKEVYDGAVPSGNSAAAV 552

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 755
            L+RL  +          + AE   +VF+  ++    +      +    ++P +K +VL 
Sbjct: 553 QLLRLGRLTGDVS---LIEKAEAMFSVFKREIEAYPSSSAFFMQSVLAHTMP-QKEIVLF 608

Query: 756 GHKSSVDFENMLAAAHASYDLNKTV 780
           G K   D +  + A    +    T+
Sbjct: 609 GRKDDPDRKRFIEALQEHFTPAYTI 633


>gi|321313642|ref|YP_004205929.1| hypothetical protein BSn5_11430 [Bacillus subtilis BSn5]
 gi|320019916|gb|ADV94902.1| hypothetical protein BSn5_11430 [Bacillus subtilis BSn5]
          Length = 689

 Score =  464 bits (1193), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 253/627 (40%), Positives = 361/627 (57%), Gaps = 54/627 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYLLQHAHNPVDWF WGEEAF +A++ + P+ +SIGYSTCHWCHVM  ESFE
Sbjct: 8   NRLINEKSPYLLQHAHNPVDWFPWGEEAFEKAKRENKPVLVSIGYSTCHWCHVMAHESFE 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE +A+LLN+ FV+IKVDREERPDVD VYM   Q + G GGWPL+VF++PD KP   GTY
Sbjct: 68  DEEIARLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLNVFITPDQKPFYAGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP   K+ RPGF  +L  + + +   R+ +      A + L    +A +        L +
Sbjct: 128 FPKTSKFNRPGFVDVLEHLSETFANDREHVEDIAENAAKHLQTKTAAKSGEG-----LSE 182

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
           +A+    +QL+  +D+ +GGFG APKFP P    M++Y  +   +TG+        K   
Sbjct: 183 SAISRTFQQLASGFDTIYGGFGQAPKFPMP---HMLMYLLRYHHNTGQDNALYNVTK--- 236

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  MA GGI+DH+G GF RYS D+ W VPHFEKMLYD   L   Y +A+ +T++  Y 
Sbjct: 237 -TLDSMANGGIYDHIGYGFARYSTDDEWLVPHFEKMLYDNALLLTAYTEAYQVTQNSRYK 295

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-A 461
            IC  I+ +++R+M    G  FSA DAD   TEG    +EG +YVW+ +E+   LG+   
Sbjct: 296 EICEQIITFIQREMTHEDGSFFSALDAD---TEG----EEGKYYVWSKEEILKTLGDDLG 348

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI-LGE 520
            L+ + Y +   GN            F+GKN+   ++       +     EK L++ L +
Sbjct: 349 TLYCQVYDITEEGN------------FEGKNIPNLIHTKREQIKEDAGLTEKELSLKLED 396

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R++L   R +R  PH+DDKV+ SWN L+I+  A+A+K+ +                  +
Sbjct: 397 ARQQLLKTREERTYPHVDDKVLTSWNALMIAGLAKAAKVYQ----------------EPK 440

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           Y+ +A+ A +FI   L  +   R+   +R+G  K  GF+DDYAFL+   LDLYE      
Sbjct: 441 YLSLAKDAITFIENKLIIDG--RVMVRYRDGEVKNKGFIDDYAFLLWAYLDLYEASFDLS 498

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
           +L  A +L +    LF D E GG++ T  +  ++++R KE +DGA PSGNSV+ + L+RL
Sbjct: 499 FLQKAKKLTDDMISLFWDEEHGGFYFTGHDAEALIVREKEVYDGAVPSGNSVAAVQLLRL 558

Query: 701 ASIVAGSKSDYYRQNAEHSLAVFETRL 727
             +   S      + AE   +VF+  +
Sbjct: 559 GQVTGDSS---LIEKAETMFSVFKQHI 582


>gi|421729533|ref|ZP_16168663.1| hypothetical protein WYY_00569 [Bacillus amyloliquefaciens subsp.
           plantarum M27]
 gi|407076503|gb|EKE49486.1| hypothetical protein WYY_00569 [Bacillus amyloliquefaciens subsp.
           plantarum M27]
          Length = 689

 Score =  463 bits (1192), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 271/685 (39%), Positives = 381/685 (55%), Gaps = 58/685 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           N   NRL AE SPYLLQHAHNPV+W  WGEEAF +A++ + PI +SIGYSTCHWCHVM  
Sbjct: 4   NSTPNRLIAEKSPYLLQHAHNPVNWHPWGEEAFEKAKRENKPILVSIGYSTCHWCHVMAH 63

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE +A +LND F++IKVDREERPDVD VYM   Q + G GGWPL+VF++PD KP  
Sbjct: 64  ESFEDEEIAGMLNDKFIAIKVDREERPDVDSVYMRICQLMTGQGGWPLNVFVTPDQKPFY 123

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFP   K+ RPGF  +L  + + +   R          +E ++E  +A       P 
Sbjct: 124 AGTYFPKTSKFNRPGFIDVLEHLSETFANDRQ--------HVEDIAENAAAHLEVKIHPA 175

Query: 279 E--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
           E  L + A+     QL+  +D+ +GGFG APKFP P    M+++  +    TGK  +A  
Sbjct: 176 EGMLGEQAVHDTYRQLAGGFDTVYGGFGQAPKFPMP---HMLMFLLRYYSYTGKE-QALA 231

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
           G   V  TL  MA GGI DH+G GF RYS D  W VPHFEKMLYD   L   Y +A+ +T
Sbjct: 232 G---VTKTLDGMANGGIFDHIGFGFARYSTDNEWLVPHFEKMLYDNALLLTAYTEAYQVT 288

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
            +  Y  I   I+ +++R+M+   G  FSA DAD   TEG    +EG +Y+W+ KE+ ++
Sbjct: 289 GNERYKQIAMQIVTFIQREMMHEDGSFFSALDAD---TEG----REGKYYIWSKKEIMNL 341

Query: 457 LG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
           LG E   L+ + Y +   GN +   +  PH  F  +  ++E  ++  +  +L   LE   
Sbjct: 342 LGDELGPLYCKVYNITDQGNFEGENI--PHLIFTRREAILE--ETGLTGHELAERLE--- 394

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
               E R KL + R  R  PH DDKV+ SWN L+I+  A+A+K+         F+ P   
Sbjct: 395 ----EARTKLLEARENRSYPHTDDKVLTSWNALMIAGLAKAAKV---------FHEP--- 438

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
               +++ +AE+A  F+ RHL  +   R+   +R G  K  GF+DDYAFLI G L+LYE 
Sbjct: 439 ----DFLSMAETAIRFLERHLMPDG--RVMVRYREGEVKNKGFIDDYAFLIWGYLELYEA 492

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
           G    +L  A  L     ELF D   GG+F T  +  ++L+R KE +DGA PSGNS + +
Sbjct: 493 GFHPSYLQKAKTLCTNMLELFWDERHGGFFFTGNDAETLLVREKEVYDGAVPSGNSAAAV 552

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 755
            L+RL  +          + AE   +VF+  ++    +      +    ++P +K +V+ 
Sbjct: 553 QLLRLGRLTGDVS---LIEKAEAMFSVFKREIEAYPSSSAFFMQSVLAHTMP-QKEIVVF 608

Query: 756 GHKSSVDFENMLAAAHASYDLNKTV 780
           G K   D +  + A    +    T+
Sbjct: 609 GRKDDPDRKRFIEALQEHFTPAYTI 633


>gi|170757692|ref|YP_001780692.1| hypothetical protein CLD_3500 [Clostridium botulinum B1 str. Okra]
 gi|169122904|gb|ACA46740.1| conserved hypothetical protein [Clostridium botulinum B1 str. Okra]
          Length = 680

 Score =  463 bits (1192), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 264/676 (39%), Positives = 369/676 (54%), Gaps = 64/676 (9%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRL  E SPYLLQHAHNPVDW+ WGEEAF +A+  D P+FLSIGYSTCHWCHVME ESF
Sbjct: 6   TNRLMNEKSPYLLQHAHNPVDWYPWGEEAFEKAKIEDKPVFLSIGYSTCHWCHVMERESF 65

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           EDE VA++LN  F+SIKVDREERPD+D +YM + QA  G GGWPL++ ++PD  P   GT
Sbjct: 66  EDEEVAEVLNKNFISIKVDREERPDIDSIYMNFCQAYTGSGGWPLTILMTPDKNPFFAGT 125

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP   KY  PG   ILR + + W + ++ + +S    +EQ+          N    EL 
Sbjct: 126 YFPKWGKYNVPGIMDILRSISNLWREDKNKILESSNRILEQIER-----FQDNHREGELE 180

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQK 339
           +  +    + L  ++D+++GGFG+ PKFP    I  +L  Y+ KK           +   
Sbjct: 181 EYIIEEAIKTLLDNFDNQYGGFGTYPKFPTAHYILFLLRYYYFKK---------DKKILD 231

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           +V  TL  M KGGI DH+G GF RYS D +W VPHFEKMLYD   L+  Y +A+  TK+ 
Sbjct: 232 IVNKTLTSMYKGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALLSMAYTEAYEATKNP 291

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG- 458
            +  I   IL+Y+++ M    G  +SAEDADS   EG     EG FY+WT +E+ DILG 
Sbjct: 292 LFKDITEKILNYVKKSMTSDEGGFYSAEDADS---EGV----EGKFYLWTKEEIMDILGE 344

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
           E   L+ + Y +   GN            F+ KN+   +N            LEK     
Sbjct: 345 EEGELYCKIYDITSKGN------------FENKNIANLINTDLKIVDNNKDKLEK----- 387

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
              R+KLF+ R KR  P+ DDK++ SWN L+I +F++A +  K++               
Sbjct: 388 --MRKKLFEYREKRIHPYKDDKILTSWNALMIIAFSKAGRSFKND--------------- 430

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
             Y+E+A+ +A+FI  +L DE+   L    R G     GF+DDYAF +  L++LYE    
Sbjct: 431 -NYIEIAKKSANFIIENLMDERG-TLYARIREGERGNEGFIDDYAFFLWALIELYEASFD 488

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
             +L  +IE+ ++  +LF  +E GG++  +     +L+R KE +DGA PSGN+V+ + L 
Sbjct: 489 IYYLEKSIEVADSMIDLFWHKENGGFYLYSKNSEKLLVRPKEIYDGATPSGNAVASLALN 548

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHK 758
            L  I      D Y+   +     F T +K   M   L    A M ++   K + L   +
Sbjct: 549 LLYYITG---EDRYKYLVDKQFKFFATNIKSGPM-YHLFSVMAYMYNILPVKEITLAYRE 604

Query: 759 SSVDFENMLAAAHASY 774
              DF   +   +  Y
Sbjct: 605 KDEDFYKFINELNNRY 620


>gi|134119086|ref|XP_771778.1| hypothetical protein CNBN2230 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|50254378|gb|EAL17131.1| hypothetical protein CNBN2230 [Cryptococcus neoformans var.
           neoformans B-3501A]
          Length = 748

 Score =  463 bits (1191), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 277/699 (39%), Positives = 389/699 (55%), Gaps = 45/699 (6%)

Query: 102 TNRLAAEHSPYLLQHAHNPV------DWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHV 155
           +N LA   SPYLLQH  NPV       W  W  E    A+K D PIFLS GYS CHWCHV
Sbjct: 14  SNVLAKSKSPYLLQHKDNPVAANQVTQWQEWSPETITLAQKLDKPIFLSSGYSACHWCHV 73

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           +  ESFEDE  AK++N+WFV+IKVDREERPDVD++YM+Y+QA+ GGGGWP+S+F++P L+
Sbjct: 74  LAHESFEDEETAKMMNEWFVNIKVDREERPDVDRMYMSYLQAVSGGGGWPMSIFMTPKLE 133

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           P   GTYFP      RP F  +L K+ + W++ R+   + G   IE L +      +S  
Sbjct: 134 PFFAGTYFP------RPNFHQLLNKIHEVWEEDREKCEKMGKGVIEVLKDMSHTGRTSES 187

Query: 276 LPDELPQNALRLCAEQLSKSYDSRFGGF---GSAPKFPRPVEIQMMLYHSKKLEDTGKSG 332
           L   L  +       QLS   D+R+GGF   GS+ + P+     + L    +L      G
Sbjct: 188 LSQLLASSPASKLFSQLSTMNDTRYGGFTNSGSSTRGPKFPSCSITLEPLARLASIPGGG 247

Query: 333 EAS-----EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 387
             +     + ++M +  L+ M  GGI D VGGG  RYSVDE+W VPHFEKMLYDQ QL +
Sbjct: 248 ARNAEIREDAREMGMKMLRSMWSGGIRDWVGGGMARYSVDEKWMVPHFEKMLYDQAQLVS 307

Query: 388 VYLDAFSLT----KDVFYSY-ICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 442
             LD   L     +D    Y +  DIL Y  RD+  P G  +SAEDADSAE +GA +K E
Sbjct: 308 SCLDFARLYPVDHQDRLLCYDLAADILKYTLRDLKSPEGGFWSAEDADSAEYKGA-KKSE 366

Query: 443 GAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 502
           GAFY+W   E++++LG+ A LF   + ++P GN D+  + D H E +GKN+L +      
Sbjct: 367 GAFYIWKKTEIDEVLGDDAPLFNSFFGVQPDGNVDI--IHDSHGEMRGKNILHQHKTYEE 424

Query: 503 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 562
            A + G   ++   I+ +   KL   R +R RP LDDK++ +WNGL++++ ++AS +L  
Sbjct: 425 VALEFGKREDQAKGIIIQACEKLRLKREERERPGLDDKILTAWNGLMLTALSKASTLL-- 482

Query: 563 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP-GFLDD 621
                    P     R + +  A    +F++ H++D  T  L  S+R G  K P    DD
Sbjct: 483 ---------PPSYGIRSQCLPAALGIVNFVKSHMWDSSTRTLTRSYREG--KGPQAQTDD 531

Query: 622 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 681
           YAFL+ GLL+LYE       +++A ELQ  QDELF D   GGYF  + ED  VL+R+K+ 
Sbjct: 532 YAFLVQGLLNLYEATGDESHVLFAEELQKRQDELFWDDHDGGYF-ASAEDAHVLVRMKDA 590

Query: 682 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 741
            DGAEPS  +VS  NL R + +++ S+ + Y   AE +       +     AV       
Sbjct: 591 QDGAEPSAAAVSAHNLSRFSLLLS-SEFENYEARAEATFLSMGPLITQAPRAVGYAVSGL 649

Query: 742 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
             L    R+ V+++G  S    +  L AA  +Y  N+ +
Sbjct: 650 IDLEKGYRE-VIVIGSASDEVVKKFLEAARKTYFSNQVI 687


>gi|350268373|ref|YP_004879680.1| hypothetical protein GYO_4496 [Bacillus subtilis subsp. spizizenii
           TU-B-10]
 gi|349601260|gb|AEP89048.1| conserved hypothetical protein [Bacillus subtilis subsp. spizizenii
           TU-B-10]
          Length = 689

 Score =  463 bits (1191), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 267/685 (38%), Positives = 380/685 (55%), Gaps = 57/685 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           N   NRL  E SPYLLQHAHNPVDWF WGEEAF +A++ + P+ +SIGYSTCHWCHVM  
Sbjct: 4   NNKPNRLINEKSPYLLQHAHNPVDWFPWGEEAFEKAKRENKPVLVSIGYSTCHWCHVMAH 63

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE +A+LLN+ FV+IKVDREERPDVD VYM   Q + G GGWPL+VF++PD KP  
Sbjct: 64  ESFEDEEIARLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLNVFITPDQKPFY 123

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFP   K+ RPGF  +L  + + +   R+ +      A + L    +A +       
Sbjct: 124 AGTYFPKTSKFNRPGFVDVLEHLSETFANDREHVEDIAENAAKHLQTKTAAKSGEG---- 179

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
            L ++A+    +QL+  +D+ +GGFG APKFP P    M++Y  +   +T    E     
Sbjct: 180 -LSESAIHRTFQQLANGFDTIYGGFGQAPKFPMP---HMLMYLLRYHHNT----EQENAL 231

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
             V  TL  MA GGI+DH+G GF RYS DE W VPHFEKMLYD   L   Y +A+ +T++
Sbjct: 232 YNVTKTLDSMANGGIYDHIGYGFARYSTDEEWLVPHFEKMLYDNALLLTAYTEAYQVTQN 291

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             Y  IC  I+ +++R+M    G  FSA DAD   TEG    +EG +YVW+ +E+   LG
Sbjct: 292 SRYKEICEQIITFIQREMTHEDGSFFSALDAD---TEG----EEGKYYVWSKEEILRTLG 344

Query: 459 EH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGMPLEKYL 515
           +    L+ + Y +   GN            F+GKN+  LI        A   G+  E+  
Sbjct: 345 DDLGTLYCQVYDITEEGN------------FEGKNIPNLIHTKRKQIKADA-GLTEEELS 391

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
             L   R+ L   R +R  PH+DDKV+ SWN L+I+  A+A+K+ +              
Sbjct: 392 LKLEGARQLLLKTREERTYPHVDDKVLTSWNALMIAGLAKAAKVYQ-------------- 437

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
               +Y+ +A+ A +FI  HL  +   R+   +R+G  K  GF+DDYAFL+   LDLYE 
Sbjct: 438 --EPKYLSLAKDAITFIENHLIIDG--RVMVRYRDGEVKNKGFIDDYAFLLWAYLDLYEA 493

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
                +L  A +L +    LF D E GG++ T  +  ++++R KE +DGA PSGNSV+ +
Sbjct: 494 SFDLSYLQKAKKLTDDMIGLFWDEEHGGFYFTGHDAEALIVREKEVYDGAVPSGNSVAAV 553

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 755
            L+RL   V G  S    + AE   +VF+  + D   +       + +  V  +K +V+ 
Sbjct: 554 QLLRLGQ-VTGDLS--LIEKAETMFSVFKPDI-DAYPSGHAFFMQSVLKHVMPKKEIVIF 609

Query: 756 GHKSSVDFENMLAAAHASYDLNKTV 780
           G       + ++ A   ++  N ++
Sbjct: 610 GSADDPARKQIITALQKAFKPNDSI 634


>gi|298675032|ref|YP_003726782.1| hypothetical protein Metev_1104 [Methanohalobium evestigatum
           Z-7303]
 gi|298288020|gb|ADI73986.1| protein of unknown function DUF255 [Methanohalobium evestigatum
           Z-7303]
          Length = 728

 Score =  463 bits (1191), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 267/707 (37%), Positives = 381/707 (53%), Gaps = 70/707 (9%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           KH N L  E SPYLLQHA+NPV+W+ WG+EAF +A+  D PIFLSIGYSTCHWCHVME E
Sbjct: 10  KHPNHLINEKSPYLLQHAYNPVNWYPWGDEAFEKAKNEDKPIFLSIGYSTCHWCHVMENE 69

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFED  +A++LND FV IKVDREERPD+D  YM   QAL G GGWPL++ ++P+ KP   
Sbjct: 70  SFEDPEIAQILNDNFVCIKVDREERPDIDSTYMDVCQALTGRGGWPLTIIMTPEKKPFSA 129

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDK-KRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            TY P E ++G  G   +L ++ D W K KR++++++     EQ++ ++    + +    
Sbjct: 130 ATYLPKESRFGLTGLIDLLPRISDMWSKQKRELVSRA-----EQITSSVEEVFTKSPKTR 184

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
           EL    L    E L ++YD  +GGFG+APKFP P  +  ++ + ++  +       ++  
Sbjct: 185 ELSNQELDSAYESLLENYDPEYGGFGNAPKFPSPHNLMFLMRYWERTSN-------NKAL 237

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
           +MV  TL+ M  GGI+DH+G GFHRYS D  W +PHFEKMLYDQ  L+  Y++ +  T  
Sbjct: 238 EMVEKTLKNMRIGGIYDHIGFGFHRYSTDRYWMIPHFEKMLYDQALLSMAYIEVYQATGK 297

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
           + Y    RD+  Y  RD+    G  +SA DADS   EG     EG FY WT  E+  IL 
Sbjct: 298 IEYKNTARDVFTYALRDLTSKEGGFYSAVDADS---EGV----EGKFYTWTYDEIHKILS 350

Query: 459 E-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIE------------------- 496
           +  A +    + +K  GN    +  +      GKN+  LIE                   
Sbjct: 351 KSEANIVTNLFNIKKEGNFRDEKTGN----LTGKNIPHLIETPLYIDVEPDEELDEFHEK 406

Query: 497 LNDSSASASKLGMPLEKYL---NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSF 553
           LN++          L K +     L   RRKLF+ R  R  P  DDK++  WNGL+I++ 
Sbjct: 407 LNEAREKRGAWKRNLLKTIYSQRRLEVARRKLFEARENRVHPAKDDKILTDWNGLMIAAL 466

Query: 554 ARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPS 613
           ++ +++                   KEY   A  AA FI +++ D  + +L H +R+G S
Sbjct: 467 SKGAQVFND----------------KEYANSARKAADFIIKNMSD-SSGQLMHRYRDGDS 509

Query: 614 KAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPS 673
              GF+DDYAFL  GL++LYE     K+L  A+E  N     F D   GG++ T     +
Sbjct: 510 DIHGFIDDYAFLTWGLIELYETTFEVKYLEKALEFNNYLINHFWDDNNGGFYFTPDNAET 569

Query: 674 VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA 733
            ++R KE +DGA PSGNSV+++NL+RL  +    +     + A  S+  F   L    +A
Sbjct: 570 PIVRKKEIYDGASPSGNSVALMNLMRLGRMTGNPE---LEKKASDSIKSFSKSLSRNPIA 626

Query: 734 VPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
                 A D +  PS + VV+ G   S D +NM+ +    +   K V
Sbjct: 627 STHSMQALDFVQGPSSE-VVITGDFQSEDTQNMINSLRTEFIPRKVV 672


>gi|435851537|ref|YP_007313123.1| thioredoxin domain protein [Methanomethylovorans hollandica DSM
           15978]
 gi|433662167|gb|AGB49593.1| thioredoxin domain protein [Methanomethylovorans hollandica DSM
           15978]
          Length = 717

 Score =  462 bits (1190), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 268/687 (39%), Positives = 386/687 (56%), Gaps = 52/687 (7%)

Query: 94  TSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWC 153
            S   +K  N L  E SPYLLQHA+NPV W+ WGE+AF  +R  + PIFLSIGYSTCHWC
Sbjct: 11  VSEGGSKTPNFLINEKSPYLLQHAYNPVQWYPWGEKAFERSRAENKPIFLSIGYSTCHWC 70

Query: 154 HVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPD 213
           HVME ESFED  VA+L+N  F+ IKVDREERPD+D VYM   QA+ G GGWPL++ ++P+
Sbjct: 71  HVMEKESFEDPDVARLMNATFICIKVDREERPDIDSVYMAICQAITGRGGWPLTILMTPN 130

Query: 214 LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS---AS 270
            +P    TY P + ++G PG   ++  +   W ++++ + Q+      +L  ALS     
Sbjct: 131 KEPFFAATYIPKKSRFGNPGMLDLIPHIAKVWTQQQEDILQTA----RELKAALSPQMVQ 186

Query: 271 ASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGK 330
           AS+     E+ +  L     QL  ++D + GGFG APKFP P  +  +L + ++   TGK
Sbjct: 187 ASAKSTGTEINEKTLHSGYSQLLSAFDWQAGGFGRAPKFPSPHNLTFLLRYWQR---TGK 243

Query: 331 SGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYL 390
                E  +MV  TL  M  GGI+DHVG GFHRYS D +W VPHFEKMLYDQ  L   Y 
Sbjct: 244 ----LEALQMVTKTLDGMRGGGIYDHVGFGFHRYSTDGQWLVPHFEKMLYDQAMLIMAYT 299

Query: 391 DAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTS 450
           + F +T    +  +  +I++Y+ RDM    G  + AEDADS   EG     EG FY+W  
Sbjct: 300 EGFQVTGIEDHRQVAAEIIEYVLRDMCSAEGAFYCAEDADS---EGM----EGKFYLWKK 352

Query: 451 KEVEDILG-EHAILFKEHYYLKPTGNC--DLSRMSDPHNEFKGKNVLIELNDSSASASKL 507
           +E+ D+L  E A L  + Y +   GN   ++S +S        +N+L        +A +L
Sbjct: 353 EEIYDLLPLEVANLVCKVYDISSEGNYKEEISGIS------TRQNILHLARPMQEAAQEL 406

Query: 508 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 567
           G+ L++    L   R+ LF  R KR  P  DDKV+  WNGL+I++  +AS+         
Sbjct: 407 GISLDELKAKLEPARKILFAAREKRVHPSKDDKVLTDWNGLMIAALCKASRAF------- 459

Query: 568 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 627
                    +R EY + A   A FI +H+      RL H +R+G +   GFL+DYAFL+ 
Sbjct: 460 ---------ERPEYAQAASRTADFILQHM-SSHDGRLLHRYRDGEASISGFLEDYAFLVW 509

Query: 628 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 687
           GL++LY+     K+L  A+ L + Q   F+D E GG+F+T  +  ++L R K+ +DGA P
Sbjct: 510 GLIELYQATFEKKYLEHALRLNSLQIRDFMDVE-GGFFHTANDSETLLFRNKDLYDGAMP 568

Query: 688 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 747
           SGNSVSV+NL++L+ +   +  +   + A  S+  F  ++  M MA      A D  + P
Sbjct: 569 SGNSVSVLNLLKLSRLTGDTDLE---EKASTSMKAFSGQIDAMPMAYSQFLHALDFTAGP 625

Query: 748 SRKHVVLVGHKSSVDFENMLAAAHASY 774
           + + VV+ G     +   M++ A  S+
Sbjct: 626 AYE-VVIAGDPDDPNTREMISLAGRSF 651


>gi|311070619|ref|YP_003975542.1| hypothetical protein BATR1942_18470 [Bacillus atrophaeus 1942]
 gi|310871136|gb|ADP34611.1| hypothetical protein BATR1942_18470 [Bacillus atrophaeus 1942]
          Length = 687

 Score =  462 bits (1190), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 267/684 (39%), Positives = 388/684 (56%), Gaps = 64/684 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYLLQHAHNPV+W+ WGEEAF +A++ + P+ +SIGYSTCHWCHVM  ESFE
Sbjct: 8   NRLINEKSPYLLQHAHNPVNWYPWGEEAFEKAKRENKPVLVSIGYSTCHWCHVMAHESFE 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE +A+LLN+ FV+IKVDREERPDVD VYM   Q + G GGWPL+VF++PD KP   GTY
Sbjct: 68  DEEIARLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLNVFITPDQKPFYAGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD---E 279
           FP   K+ RPGF  +L  + + +   R+         +E+++E  S S    K P+    
Sbjct: 128 FPKTSKFNRPGFIDVLEHLSNTFANDREH--------VEEIAENAS-SHLQIKTPEGNGT 178

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
           L + AL    +QL   +D+ +GGFG APKFP P    M++Y  +  + TG+        K
Sbjct: 179 LTKEALHRTFQQLMSGFDTVYGGFGQAPKFPMP---HMLMYLLRYHQYTGQENALYNVTK 235

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
               TL  MA GGI+DHVG GF RYS D+ W VPHFEKMLYD   L   Y +A+ +T+D 
Sbjct: 236 ----TLDSMANGGIYDHVGYGFARYSTDDEWLVPHFEKMLYDNALLLTAYTEAYQVTQDS 291

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG- 458
            Y +I   I+ +++R+M    G  +SA DAD   TEG     EG +YVW+  E+ + LG 
Sbjct: 292 RYQHIVEQIITFIQREMTHEDGSFYSALDAD---TEGV----EGKYYVWSKDEIIETLGD 344

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGMPLEKYLN 516
           E   L+   Y +  +GN            F+G N+  LI        A +  +  ++   
Sbjct: 345 ELGELYCAIYNITSSGN------------FEGHNIPNLIHTKLDKVKA-EFDLNEQEINK 391

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
            LGE R+KL   R  R  PH+DDKV+ SWN L+I+  A+A+K+ ++              
Sbjct: 392 QLGEARQKLLKKRETRTYPHVDDKVLTSWNALMIAGLAKAAKVFQA-------------- 437

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
              EY+ +A++AA+FI + L  +   R+   +R+G  K  GF+DDYAFL+   ++LYE G
Sbjct: 438 --PEYLNMAQAAAAFIEKKLIIDG--RVMVRYRDGEVKNKGFIDDYAFLLWAYIELYEAG 493

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
               +L  A +L     +LF D++ GG++ T  +  ++L+R KE +DGA PSGNSV+ + 
Sbjct: 494 YDLAYLQKAKDLSAKMLDLFWDQKHGGFYFTGHDAEALLVREKEVYDGAVPSGNSVAAVQ 553

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVG 756
           L+RL  +  G  S    + AE   + F+  ++           +     +P +K +V+ G
Sbjct: 554 LLRLGQLT-GELS--LIEKAEKMFSAFKRDVEAYPSGHSFFMQSVLTHMMP-KKEIVIFG 609

Query: 757 HKSSVDFENMLAAAHASYDLNKTV 780
            K     +++++A   ++  N +V
Sbjct: 610 RKDDSQRQHIISALQQAFQPNFSV 633


>gi|430756760|ref|YP_007207432.1| hypothetical protein A7A1_1268 [Bacillus subtilis subsp. subtilis
           str. BSP1]
 gi|430021280|gb|AGA21886.1| Hypothetical protein YyaL [Bacillus subtilis subsp. subtilis str.
           BSP1]
          Length = 689

 Score =  462 bits (1189), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 248/605 (40%), Positives = 352/605 (58%), Gaps = 51/605 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYLLQHAHNPVDWF WGEEAF +A++ + P+ +SIGYSTCHWCHVM  ESFE
Sbjct: 8   NRLINEKSPYLLQHAHNPVDWFPWGEEAFEKAKRENKPVLVSIGYSTCHWCHVMAHESFE 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE +A+LLN+ FV+IKVDREERPDVD VYM   Q + G GGWPL+VF++PD KP   GTY
Sbjct: 68  DEEIARLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLNVFITPDQKPFYAGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP   K+ RPGF  +L  + + +   R+ +      A + L    +A +        L +
Sbjct: 128 FPKTSKFNRPGFVDVLEHLSETFANDREHVEDIAENAAKHLQTKTAAKSGEG-----LSE 182

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
           +A+    +QL+  +D+ +GGFG APKFP P    M++Y  +   +TG+        K   
Sbjct: 183 SAISRTFQQLASGFDTIYGGFGQAPKFPMP---HMLMYLLRYHHNTGQDNALYNVTK--- 236

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  MA GGI+DH+G GF RYS D+ W VPHFEKMLYD   L   Y +A+ +T++  Y 
Sbjct: 237 -TLDSMANGGIYDHIGYGFARYSTDDEWLVPHFEKMLYDNALLLTAYTEAYQVTQNSRYK 295

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-A 461
            IC  I+ +++R+M    G  FSA DAD   TEG    +EG +YVW+ +E+   LG+   
Sbjct: 296 EICEQIITFIQREMTHEDGSFFSALDAD---TEG----EEGKYYVWSKEEILKTLGDDLG 348

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI-LGE 520
            L+ + Y +   GN            F+GKN+   ++       +     EK L++ L +
Sbjct: 349 TLYCQVYDITEEGN------------FEGKNIPNLIHTKREQIKEDAGLTEKELSLKLED 396

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R++L   R +R  PH+DDKV+ SWN L+I+  A+A+K+ +                  +
Sbjct: 397 ARQQLLKTREERTYPHVDDKVLTSWNALMIAGLAKAAKVYQ----------------EPK 440

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           Y+ +A+ A +FI   L  +   R+   +R+G  K  GF+DDYAFL+   LDLYE      
Sbjct: 441 YLSLAKDAITFIENKLIIDG--RVMVRYRDGEVKNKGFIDDYAFLLWAYLDLYEASFDLS 498

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
           +L  A +L +    LF D E GG++ T  +  ++++R KE +DGA PSGNSV+ + L+RL
Sbjct: 499 YLQKAKKLTDDMISLFWDEEHGGFYFTGHDAEALIVREKEVYDGAVPSGNSVAAVQLLRL 558

Query: 701 ASIVA 705
             +  
Sbjct: 559 GQVTG 563


>gi|163846817|ref|YP_001634861.1| hypothetical protein Caur_1244 [Chloroflexus aurantiacus J-10-fl]
 gi|222524638|ref|YP_002569109.1| hypothetical protein Chy400_1363 [Chloroflexus sp. Y-400-fl]
 gi|163668106|gb|ABY34472.1| protein of unknown function DUF255 [Chloroflexus aurantiacus
           J-10-fl]
 gi|222448517|gb|ACM52783.1| protein of unknown function DUF255 [Chloroflexus sp. Y-400-fl]
          Length = 693

 Score =  462 bits (1189), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 267/689 (38%), Positives = 385/689 (55%), Gaps = 54/689 (7%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           ++  NRLA E SPYL QHA NPVDW+ WGEEA   AR+ D PI +SIGY+ CHWCHVM  
Sbjct: 5   SRPLNRLAHEASPYLQQHADNPVDWYPWGEEALERARREDKPILVSIGYAACHWCHVMAH 64

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESF D  VA + N++F++IKVDREERPD+D +YM   QAL G GGWPL+VF  PD  P  
Sbjct: 65  ESFADPEVAAVQNEYFINIKVDREERPDLDNIYMAAAQALTGRGGWPLNVFCLPDGTPFF 124

Query: 219 GGTYFPPEDKYGR---PGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
            GTYFPP+ K  R   PG++ +L  V +A+  +R  +  S    +E +         +  
Sbjct: 125 AGTYFPPDAKAARYRMPGWRQVLLSVAEAYKTRRADVTASAHELLEHI------KLLTRP 178

Query: 276 LPDELP--QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGE 333
           LP+ LP  +  L   A Q+ + +D ++GGFG APKFP+PV ++ +L        T   G+
Sbjct: 179 LPETLPLDEELLMAAAAQIGREFDPQYGGFGDAPKFPQPVVLEFLLR-------THLRGD 231

Query: 334 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
             +   M+  TL+ MA+GG++D VGGGFHRYSVDERW VPHFEKMLYD   LA VY  A 
Sbjct: 232 V-QALPMLQQTLEQMARGGMYDQVGGGFHRYSVDERWLVPHFEKMLYDNALLAEVYHLAA 290

Query: 394 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
            +T D F + I  +   Y+ RD+  P G  FS+EDADS  T GA+  +EGAFYVWT  E+
Sbjct: 291 QVTGDTFLARIADETFTYMLRDLRHPDGAFFSSEDADSLPTPGASHAEEGAFYVWTPDEL 350

Query: 454 EDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
              LG+ A+L   +Y +   GN            F+G+++L     ++A A+ LG+ +E+
Sbjct: 351 RAALGDDAVLVGAYYGVTRQGN------------FEGRSILHVPRPAAAVAAMLGVSVER 398

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
               +   R  L   R +RPRP  D+KVI +WN + I + A AS  + +           
Sbjct: 399 LEATVARARPILRTFRERRPRPFRDEKVITAWNAMAIRALAVASSRVPA----------- 447

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
                  Y++ A   A F+  +L  +   RL  S+++G      FLDDYA     L++L+
Sbjct: 448 -------YLDAARQCADFLLTNLRRDDG-RLLRSWKDGRPGPAAFLDDYALFCDALIELH 499

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
             G  T++L  AI+L +   +LF D + G +F+T  + P+++ R ++  D A PSG+S +
Sbjct: 500 AAGGDTRYLATAIDLADAMIDLFWDDQAGMFFDTGRDQPALVTRPRDLSDNATPSGSSAA 559

Query: 694 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVV 753
            + L+RL +I    +   Y   A  +L      LK   +    M CAAD+   P R+ + 
Sbjct: 560 TVALLRLYAITGRER---YETRAMQTLQQTTPLLKRFPLGFGRMLCAADLALGPLRE-LA 615

Query: 754 LVGHKSSVDFENMLAAAHASYDLNKTVSK 782
           ++G       + MLA A ++Y     +++
Sbjct: 616 IIGPPDHPVTQAMLAVARSAYRPRLVIAR 644


>gi|161528699|ref|YP_001582525.1| hypothetical protein Nmar_1191 [Nitrosopumilus maritimus SCM1]
 gi|160340000|gb|ABX13087.1| protein of unknown function DUF255 [Nitrosopumilus maritimus SCM1]
          Length = 675

 Score =  462 bits (1189), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 253/598 (42%), Positives = 350/598 (58%), Gaps = 49/598 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L  E SPYLLQHAHNPVDW+ W +EA  +A+  + PIFLSIGYS+CHWCHVM  ESFE
Sbjct: 4   NNLIHETSPYLLQHAHNPVDWYGWNDEALKKAKDENKPIFLSIGYSSCHWCHVMAHESFE 63

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +E VAK +N+ FV+IKVDREERPD+D +Y    Q   G GGWPLS+FL+PD KP   GTY
Sbjct: 64  NEEVAKFMNENFVNIKVDREERPDIDDIYQKACQIATGQGGWPLSIFLTPDQKPFYVGTY 123

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP  D YGRPGF +I R++  AW +K   + +S    ++ L++    S SS     +L +
Sbjct: 124 FPILDSYGRPGFGSICRQLSQAWKEKPKDIEKSADNFLDALNKTEKVSISS-----KLER 178

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L   A  L +  DS +GGFGSAPKFP    +  +  ++K    +G S     G K   
Sbjct: 179 TILDEAAMNLFQLGDSAYGGFGSAPKFPNAANVSFLFRYAKI---SGLSKFTEFGLK--- 232

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL+ MA GGI D +GGGFHRYS D +W VPHFEKMLYD   +   Y +AF +TKD FY 
Sbjct: 233 -TLKKMANGGIFDQIGGGFHRYSTDAKWLVPHFEKMLYDNALIPVNYAEAFQITKDPFYL 291

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            + +  LD++ R+M  P G  +SA DADS   EG     EG FYVW   E+++ILG+ A 
Sbjct: 292 DVLKKTLDFVLREMTSPEGGFYSAYDADS---EGV----EGKFYVWKKSEIKEILGDDAD 344

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
           +F   Y     GN            ++G N+L    + S  A   G   EK   IL  C 
Sbjct: 345 IFCLFYDATDGGN------------WEGNNILCNNLNISTVAFNFGTTEEKVREILQACS 392

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYM 582
           +KL DVRSKR  P LDDK++VSWN L+I++FA+  ++                ++   Y+
Sbjct: 393 KKLLDVRSKRVAPGLDDKILVSWNSLMITAFAKGYRV----------------TNESRYL 436

Query: 583 EVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWL 642
           + A+   SFI  +L+     +L  +++N  +K  G+L+DY++ ++ LLD++E     K+L
Sbjct: 437 DAAKDCISFIENNLF--SGDKLLRTYKNKTAKIDGYLEDYSYFVNCLLDVFEIEPDPKYL 494

Query: 643 VWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
             A++L +   E F D E   +F T+     +++R K ++D + PSGNSVS   ++RL
Sbjct: 495 KLALKLGHHLVEHFWDSENNSFFMTSDNHEKLIIRPKSNYDLSLPSGNSVSAFVMLRL 552


>gi|384177739|ref|YP_005559124.1| hypothetical protein I33_4252 [Bacillus subtilis subsp. subtilis
           str. RO-NN-1]
 gi|349596963|gb|AEP93150.1| conserved hypothetical protein [Bacillus subtilis subsp. subtilis
           str. RO-NN-1]
          Length = 689

 Score =  462 bits (1189), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 249/609 (40%), Positives = 353/609 (57%), Gaps = 51/609 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           N   NRL AE SPYLLQHAHNPVDW+ WGEEAF +A++ + P+ +SIGYSTCHWCHVM  
Sbjct: 4   NNKPNRLIAEKSPYLLQHAHNPVDWYPWGEEAFEKAKRENKPVLVSIGYSTCHWCHVMAH 63

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE +A+LLN+ FV+IKVDREERPDVD VYM   Q + G GGWPL+VF++PD KP  
Sbjct: 64  ESFEDEEIARLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLNVFITPDQKPFY 123

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFP   K+ RPGF  +L  + + +   R+ +      A + L    +A +       
Sbjct: 124 AGTYFPKTSKFNRPGFVDVLEHLSETFANDREHVEDIAENAAKHLQTKTAAKSGEG---- 179

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
            L ++A+    +QL+  +D+ +GGFG APKFP P    M++Y  +   +TG+        
Sbjct: 180 -LSESAIHRTFQQLASGFDTIYGGFGQAPKFPMP---HMLMYLLRYHHNTGQENALYNVT 235

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
           K    TL  MA GGI+DH+G GF RYS D+ W VPHFEKMLYD   L   Y +A+ +T++
Sbjct: 236 K----TLDSMANGGIYDHIGYGFARYSTDDEWLVPHFEKMLYDNALLLTAYTEAYQVTQN 291

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             Y  IC  I+ +++R+M    G  FSA DAD   TEG    +EG +YVW+ +E+   LG
Sbjct: 292 SRYKEICEQIITFIQREMTHEDGSFFSALDAD---TEG----EEGKYYVWSKEEILKTLG 344

Query: 459 EH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
           +    L+ + Y +   GN            F+GKN+   ++       +     EK L++
Sbjct: 345 DDLGTLYCQVYDITEEGN------------FEGKNIPNLIHTKWEQIKEDAGLTEKELSL 392

Query: 518 -LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
            L + R++L   R +R  PH+DDKV+ SWN L+I+  A+A+K+ +               
Sbjct: 393 KLEDARQQLLKTREERTYPHVDDKVLTSWNALMIAGLAKAAKVYQ--------------- 437

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
              +Y+ +A+ A +FI   L  +   R+   +R G  K  GF+DDYAFL+   LDLYE  
Sbjct: 438 -EPKYLSLAKDAITFIENKLIIDG--RVMVRYRGGEVKNKGFIDDYAFLLWAYLDLYEAS 494

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
               +L  A +L +    LF D E GG++ T  +  ++++R KE +DGA PSGNSV+ + 
Sbjct: 495 FDLSYLQKAKKLTDDMIGLFWDEEHGGFYFTGHDAEALIVREKEVYDGAVPSGNSVAAVQ 554

Query: 697 LVRLASIVA 705
           L+RL  +  
Sbjct: 555 LLRLGQVTG 563


>gi|297622269|ref|YP_003703703.1| hypothetical protein [Truepera radiovictrix DSM 17093]
 gi|297163449|gb|ADI13160.1| protein of unknown function DUF255 [Truepera radiovictrix DSM
           17093]
          Length = 704

 Score =  462 bits (1188), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 256/581 (44%), Positives = 341/581 (58%), Gaps = 50/581 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL+ E SPYLLQHA NPVDWF WGEEAFA+AR  D PI LS+GY+ CHWCHVM  ESFE
Sbjct: 28  NRLSRETSPYLLQHAENPVDWFPWGEEAFAKARAEDKPILLSVGYAACHWCHVMAHESFE 87

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +  +A L+N  FV++KVDREERPDVD VYM+ VQA+ G GGWP++V L+PD KP  GGTY
Sbjct: 88  NPEIADLMNAHFVNVKVDREERPDVDAVYMSAVQAMTGSGGWPMTVALTPDGKPFFGGTY 147

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           +PPED+ G PGFK +L  + +AW  +RD + ++       L++     A+    P  L +
Sbjct: 148 YPPEDRLGHPGFKRVLLSLAEAWRSRRDEVLRAAETLTNHLADLNKLPAAGEPSPGALGE 207

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L      L +++D + GGFG APKFP    +  +L   +            E ++M  
Sbjct: 208 EVLAEAVRALQRTFDPQHGGFGGAPKFPPHGALAFLLRRPE-----------PEAREMAY 256

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  MA GGI D +GGGF RYSVD RW VPHFEKMLYD  QL  VY +A++ T+   Y 
Sbjct: 257 VTLDKMAAGGIFDQLGGGFARYSVDARWLVPHFEKMLYDNAQLVGVYAEAYAQTRRARYR 316

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            +    L +++R++  P G  +SA DADS   EG    +EG FYVW + E  D+LGE A 
Sbjct: 317 EVVEATLAFVQRELTSPEGCFYSALDADS---EG----EEGKFYVWRADEF-DVLGEDAA 368

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
           L K ++ +   GN            F+G+NVL   +  +A A + G+        L   +
Sbjct: 369 LAKVYFGVSAAGN------------FEGRNVLFVPHPPAAVAERFGLSEAALAARLARVK 416

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYM 582
           R LF++RS+R RP LDDKV+ SWNGL+I +FARA ++L  +A                Y+
Sbjct: 417 RALFEIRSRRTRPGLDDKVLASWNGLMIGAFARAGRVLAEDA----------------YL 460

Query: 583 EVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWL 642
           E A  AA  +R  L  E   RL H+FR G +K  G L+DYA L  GLL+LY       WL
Sbjct: 461 EAARRAARGVRSALLRE--GRLWHTFRGGEAKVEGLLEDYALLGLGLLELYRATLEGPWL 518

Query: 643 VWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 683
           +WA+EL       F D E GG+F+T  +  ++++R KE  D
Sbjct: 519 LWALELAEVIAARFTDPE-GGFFSTAADAEALVVRPKELFD 558


>gi|16081134|ref|NP_391962.1| hypothetical protein BSU40820 [Bacillus subtilis subsp. subtilis
           str. 168]
 gi|221312064|ref|ZP_03593911.1| hypothetical protein Bsubs1_22036 [Bacillus subtilis subsp.
           subtilis str. 168]
 gi|221316389|ref|ZP_03598194.1| hypothetical protein BsubsN3_21942 [Bacillus subtilis subsp.
           subtilis str. NCIB 3610]
 gi|221321302|ref|ZP_03602596.1| hypothetical protein BsubsJ_21895 [Bacillus subtilis subsp.
           subtilis str. JH642]
 gi|221325585|ref|ZP_03606879.1| hypothetical protein BsubsS_22051 [Bacillus subtilis subsp.
           subtilis str. SMY]
 gi|402778252|ref|YP_006632196.1| protein YyaL [Bacillus subtilis QB928]
 gi|586842|sp|P37512.1|YYAL_BACSU RecName: Full=Uncharacterized protein YyaL
 gi|467366|dbj|BAA05212.1| unknown [Bacillus subtilis]
 gi|2636629|emb|CAB16119.1| conserved hypothetical protein [Bacillus subtilis subsp. subtilis
           str. 168]
 gi|402483431|gb|AFQ59940.1| YyaL [Bacillus subtilis QB928]
 gi|407962936|dbj|BAM56176.1| hypothetical protein BEST7613_7245 [Bacillus subtilis BEST7613]
 gi|407966948|dbj|BAM60187.1| hypothetical protein BEST7003_3986 [Bacillus subtilis BEST7003]
          Length = 689

 Score =  462 bits (1188), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 249/605 (41%), Positives = 353/605 (58%), Gaps = 51/605 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYLLQHAHNPVDWF WGEEAF +A++ + P+ +SIGYSTCHWCHVM  ESFE
Sbjct: 8   NRLINEKSPYLLQHAHNPVDWFPWGEEAFEKAKRENKPVLVSIGYSTCHWCHVMAHESFE 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE +A+LLN+ FV+IKVDREERPDVD VYM   Q + G GGWPL+VF++PD KP   GTY
Sbjct: 68  DEEIARLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLNVFITPDQKPFYAGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP   K+ RPGF  +L  + + +   R+ +      A + L    +A     K  + L +
Sbjct: 128 FPKTSKFNRPGFVDVLEHLSETFANDREHVEDIAENAAKHLQTKTAA-----KTGEGLSE 182

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
           +A+    +QL+  +D+ +GGFG APKFP P    M++Y  +   +TG+        K   
Sbjct: 183 SAIHRTFQQLASGFDTIYGGFGQAPKFPMP---HMLMYLLRYDHNTGQENALYNVTK--- 236

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  MA GGI+DH+G GF RYS D+ W VPHFEKMLYD   L   Y +A+ +T++  Y 
Sbjct: 237 -TLDSMANGGIYDHIGYGFARYSTDDEWLVPHFEKMLYDNALLLTAYTEAYQVTQNSRYK 295

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-A 461
            IC  I+ +++R+M    G  FSA DAD   TEG    +EG +YVW+ +E+   LG+   
Sbjct: 296 EICEQIITFIQREMTHEDGSFFSALDAD---TEG----EEGKYYVWSKEEILKTLGDDLG 348

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI-LGE 520
            L+ + Y +   GN            F+GKN+   ++       +     EK L++ L +
Sbjct: 349 TLYCQVYDITEEGN------------FEGKNIPNLIHTKREQIKEDAGLTEKELSLKLED 396

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R++L   R +R  PH+DDKV+ SWN L+I+  A+A+K+ +                  +
Sbjct: 397 ARQQLLKTREERTYPHVDDKVLTSWNALMIAGLAKAAKVYQ----------------EPK 440

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           Y+ +A+ A +FI   L  +   R+   +R+G  K  GF+DDYAFL+   LDLYE      
Sbjct: 441 YLSLAKDAITFIENKLIIDG--RVMVRYRDGEVKNKGFIDDYAFLLWAYLDLYEASFDLS 498

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
           +L  A +L +    LF D E GG++ T  +  ++++R KE +DGA PSGNSV+ + L+RL
Sbjct: 499 YLQKAKKLTDDMISLFWDEEHGGFYFTGHDAEALIVREKEVYDGAVPSGNSVAAVQLLRL 558

Query: 701 ASIVA 705
             +  
Sbjct: 559 GQVTG 563


>gi|73667810|ref|YP_303825.1| hypothetical protein Mbar_A0261 [Methanosarcina barkeri str.
           Fusaro]
 gi|72394972|gb|AAZ69245.1| conserved hypothetical protein [Methanosarcina barkeri str. Fusaro]
          Length = 711

 Score =  461 bits (1187), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 264/685 (38%), Positives = 377/685 (55%), Gaps = 45/685 (6%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           + +K  NRL  E SPYLLQHA+NPV W+ WGEEAF +ARK + PIFLSIGYSTCHWCHVM
Sbjct: 17  TEHKKPNRLINEKSPYLLQHAYNPVKWYPWGEEAFEKARKENKPIFLSIGYSTCHWCHVM 76

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
             ESFEDE +A+L+N  FV IKVDREERPD+D VYMT  Q + G GGWPL++ ++PD+KP
Sbjct: 77  AHESFEDEEIARLMNRAFVCIKVDREERPDIDNVYMTVCQIILGRGGWPLNIIMTPDMKP 136

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKL 276
              GTY P   ++ + G   ++ ++++ W+++   + +S       +   +S  A     
Sbjct: 137 FFAGTYIPKNSRFSQTGMLELVPRIEEIWNRQHTEVLESADKITSTIQNMISEPAGEG-- 194

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
              + ++ +    E+L  S+D+ +GGFG APKFP   +I  +L + +      +SG   E
Sbjct: 195 ---IGESIMEEAYEELLTSFDNEYGGFGRAPKFPTSHKIFFLLRYWR------RSGN-PE 244

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
              MV +TL+ M +GGIHDH+G GFHRYS D  W VPHFEKMLYDQ  +A  Y + + +T
Sbjct: 245 ALHMVEYTLENMYRGGIHDHLGSGFHRYSTDNVWIVPHFEKMLYDQALIATAYTEIYQVT 304

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
               Y      ILDY+ RD+    G  +  EDAD    EG    +EG +Y+WT +EV  +
Sbjct: 305 GKRLYKEAAEGILDYVLRDLTSQEGGFYCGEDAD---VEG----EEGKYYLWTLEEVRTV 357

Query: 457 LG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
           L  E + L  + + L  TGN +     +      G N+        + A++L +P +   
Sbjct: 358 LSPEESELITKVFNLSETGNFE----EEIRGRKTGTNIFYMPRSLESLAAELNIPADDVD 413

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
           + +   + KL   R KR RP  DDK++  WNGL+I++ A+               F   G
Sbjct: 414 SRVKTAKAKLLLARDKRKRPAKDDKILTDWNGLMIAALAKG--------------FQAFG 459

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
            ++  Y++ AE AA FI + LY+    RL H +R+G +   G  DDYAFLI GLL+LYE 
Sbjct: 460 EEK--YLKAAEKAADFILKVLYNPD-RRLLHRYRDGKTGISGTADDYAFLIHGLLELYEA 516

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
           G    +L  A+ L     E F D   GG F T  +  +++ R KE  D A PSGNS+ ++
Sbjct: 517 GFKLDYLKAALCLNREFLEHFWDPIQGGLFFTADDSEALIFRKKEFSDAAIPSGNSIEML 576

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 755
           NL+RL+ I A S+ +   Q  E +   F   ++ +         A D    P+ + VV+V
Sbjct: 577 NLLRLSRITADSELEDRAQGLERA---FSKLIQKIPSGYTQFLSALDFGLGPAYQ-VVIV 632

Query: 756 GHKSSVDFENMLAAAHASYDLNKTV 780
           G   S D   ML      +  NK +
Sbjct: 633 GEHESPDTGQMLEELWTYFIPNKVL 657


>gi|384161675|ref|YP_005543748.1| YyaL [Bacillus amyloliquefaciens TA208]
 gi|328555763|gb|AEB26255.1| YyaL [Bacillus amyloliquefaciens TA208]
          Length = 689

 Score =  461 bits (1186), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 264/669 (39%), Positives = 371/669 (55%), Gaps = 62/669 (9%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           +KHTN L  E SPYLLQHAHNPVDWF WG+EAF +A++ + P+ +SIGYSTCHWCHVM  
Sbjct: 4   HKHTNMLITEKSPYLLQHAHNPVDWFPWGDEAFEKAKRENKPVLISIGYSTCHWCHVMAH 63

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE +A +LND F++IKVDREERPDVD VYM   Q + G GGWPL+VF++PD KP  
Sbjct: 64  ESFEDEEIAGMLNDKFIAIKVDREERPDVDSVYMRICQLMTGQGGWPLNVFVTPDQKPFY 123

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFP   K+ RPGF  +L  + + +   R          +E ++E  +A       P 
Sbjct: 124 AGTYFPKTSKFNRPGFIDVLEHLSETFANDRQ--------HVEDIAENAAAHLEVKVHPT 175

Query: 279 E--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
           E  L + A+     QL+  +D+ +GGFG APKFP P    M+L+  +    TGK  +A  
Sbjct: 176 EGMLGEQAVHDTYRQLAGGFDTVYGGFGQAPKFPMP---HMLLFLLRYYSYTGKE-QALA 231

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
           G   V  TL  MA GGI DH+G GF RYS D  W VPHFEKMLYD   L + Y +A+ +T
Sbjct: 232 G---VTKTLDGMANGGIFDHIGFGFARYSTDNEWLVPHFEKMLYDNALLLSAYTEAYQVT 288

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
            +  Y  I   I+ +++R+M+   G  FSA DAD   TEG    +EG +Y+W+ KE+ ++
Sbjct: 289 NNERYKQIATQIVTFIQREMMHEDGSFFSALDAD---TEG----REGKYYIWSKKEIMNL 341

Query: 457 LGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGMPLEK 513
           LG+    L+ + Y +   GN            F+G+N+  LI      A   + G+   +
Sbjct: 342 LGDQLGSLYCKVYNITEQGN------------FEGENIPNLI-FTRREAILEETGLTEHE 388

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
               L   R+KL + R  R  PH DDKV+ SWN L+I+  A+A+K+              
Sbjct: 389 LTERLEGARKKLLEARENRSYPHTDDKVLTSWNALMIAGLAKAAKVFHEPG--------- 439

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
                  ++ +AE+A  F+ RHL  +   R+   +R G  K  GF+DDYAFLI   L+LY
Sbjct: 440 -------FLSMAETAIRFLERHLIPDG--RVMVRYREGEVKNKGFIDDYAFLIWAYLELY 490

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           E G    +L  A  L  +  +LF D   GG+F T  +  ++L+R KE +DGA PSGNS +
Sbjct: 491 EAGFNPSYLKKAKTLCTSMLDLFWDERHGGFFFTGNDAETLLVREKEVYDGAVPSGNSAA 550

Query: 694 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVV 753
            + L+RL  +          + AE   +VF+  ++    +      +  +  +  +K +V
Sbjct: 551 AVQLLRLGRLTGDVS---LIEKAEAMFSVFKREIEAYPSSSAFFMQSV-LAHIMPQKEIV 606

Query: 754 LVGHKSSVD 762
           + G K   D
Sbjct: 607 VFGSKDDPD 615


>gi|407462858|ref|YP_006774175.1| hypothetical protein NKOR_06800 [Candidatus Nitrosopumilus
           koreensis AR1]
 gi|407046480|gb|AFS81233.1| hypothetical protein NKOR_06800 [Candidatus Nitrosopumilus
           koreensis AR1]
          Length = 675

 Score =  461 bits (1186), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 250/598 (41%), Positives = 351/598 (58%), Gaps = 49/598 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L  E SPYLLQHAHNPVDW+ W  EA  +A+  + PIFLSIGYS+CHWCHVM  ESFE
Sbjct: 4   NNLIHETSPYLLQHAHNPVDWYGWNSEALKKAKDENKPIFLSIGYSSCHWCHVMAHESFE 63

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +E VA+ +N+ FV+IKVDREERPD+D +Y    Q   G GGWPLS+FL+PD KP   GTY
Sbjct: 64  NEEVAQFMNENFVNIKVDREERPDIDDIYQKVCQIATGQGGWPLSIFLTPDQKPFYVGTY 123

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP  D YGRPGF +I R++  AW +K   + +S    ++ L++    S      P +L +
Sbjct: 124 FPVLDSYGRPGFGSICRQLAQAWKEKPHDIEKSANNFLDALNKTEKIST-----PSKLER 178

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L   A  L +  DS +GGFGSAPKFP    +  +  ++K    +G S     G K   
Sbjct: 179 TILDEAAMNLFQLGDSTYGGFGSAPKFPNAANVSFLFRYAKL---SGLSKFTEFGLK--- 232

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL+ MA GGI D +GGGFHRYS D +W VPHFEKMLYD   +   Y +AF +TKD FY 
Sbjct: 233 -TLKKMANGGIFDQIGGGFHRYSTDAKWLVPHFEKMLYDNALIPVNYAEAFQITKDPFYL 291

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            I +  LD++ R+M  P G  +SA DADS   EG     EG FYVW   E+++ILG+ + 
Sbjct: 292 DILKKTLDFVLREMTSPEGGFYSAYDADS---EGV----EGKFYVWKKSEIKEILGDDSD 344

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
           +F  +Y +   GN            ++G N+L    + S  A   G+  EK   IL  C 
Sbjct: 345 IFCLYYDVTDGGN------------WEGNNILCNNLNISTVAFNFGITEEKVREILQSCS 392

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYM 582
           +KL DVRSKR  P LDDK++VSWN L+I++FA+  ++                ++   Y+
Sbjct: 393 KKLLDVRSKRIAPGLDDKILVSWNALMITAFAKGCRV----------------TNDSRYL 436

Query: 583 EVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWL 642
             A++  SFI  +L+     +L  +++N  +K  G+L+DY++ ++ LLD++E     K+L
Sbjct: 437 NAAKTCISFIEDNLF--SGDKLLRTYKNKTAKIDGYLEDYSYFVNCLLDVFEIEPDPKYL 494

Query: 643 VWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
             A++L +   + F D E   +F T+     +++R K ++D + PSGNSVS   ++RL
Sbjct: 495 KLALKLGHHLVDHFWDSENNSFFMTSDNHEKLIIRPKSNYDLSLPSGNSVSAFAMLRL 552


>gi|444911449|ref|ZP_21231624.1| Thymidylate kinase [Cystobacter fuscus DSM 2262]
 gi|444718207|gb|ELW59023.1| Thymidylate kinase [Cystobacter fuscus DSM 2262]
          Length = 683

 Score =  461 bits (1185), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 265/679 (39%), Positives = 381/679 (56%), Gaps = 61/679 (8%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRL  E SPYL QHA NPVDW+ WGEEAFA AR  D P+ LS+GYS CHWCHVM  ESF
Sbjct: 2   ANRLEREPSPYLRQHASNPVDWYPWGEEAFARARAEDKPLLLSVGYSACHWCHVMAHESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           EDE +A+L+N+ F+++KVDREERPDVD++Y   VQ +  GGGWPL+VFL+PDL P  GGT
Sbjct: 62  EDEAIARLMNEGFINVKVDREERPDVDQLYQGVVQLMGQGGGWPLTVFLTPDLVPFFGGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKR-DMLAQSGAFAIEQLSE----ALSASASSNKL 276
           YFPP+D+YGRPGF  +LR + +AW   R ++L+Q+  F  E L E     L A+ ++ K 
Sbjct: 122 YFPPKDRYGRPGFPKVLRALSEAWATNRGELLSQAREFR-EGLGELALHGLDAAPAALK- 179

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
           P+++    L L      +  D   GGFG APKFP P+ + ++L   ++  + G+      
Sbjct: 180 PEDIVSMGLSLL-----ERMDGVNGGFGGAPKFPNPMNVALVLRAWRR--EPGQDAL--- 229

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
            ++ VL TL+ MA+GG++D +GGGFHRYSVDERW VPHFEKMLYD  QL ++Y +A  + 
Sbjct: 230 -KQAVLLTLEKMARGGVYDQLGGGFHRYSVDERWAVPHFEKMLYDNAQLLHLYAEAQQVE 288

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
               +  +  +  +Y+RR+M    G  ++ +DAD   TEG    +EG F+VW  ++V ++
Sbjct: 289 PRPLWRKVVEETAEYVRREMTDARGGFYATQDAD---TEG----EEGRFFVWLPEQVREV 341

Query: 457 L-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
           L  E A L   H+ +   GN +            G+ VL       + A +L  P+E+  
Sbjct: 342 LPPELAELALRHFRVTALGNFE-----------HGRTVLESAVSVESLAEELQRPVEEVA 390

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
           + L E RR+LF+ R +R +P  DDK++  WNGL+I   A A ++                
Sbjct: 391 SGLSEARRRLFEARERRVKPGRDDKILAGWNGLMIRGLAFAGRVF--------------- 435

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
            DR +++E A  AA F+   L+D Q  RL  S++ G ++ PGF++DY  L +GL  LY+ 
Sbjct: 436 -DRADWVESARKAADFVLAELWDGQ--RLSRSYQEGQARIPGFVEDYGDLAAGLTALYQA 492

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
               ++L  A  L  T + LF D E G Y         +++      D A PSG S    
Sbjct: 493 TFEPRYLEAAEALVRTAETLFWDEERGAYLTAPRTQGDLVVATYATFDNAFPSGASTLTE 552

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 755
             V LA++ +  +   Y +  E  ++    +L+   M    +  AAD L V     V   
Sbjct: 553 AQVALAALTSNKQ---YLELPERYVSRMGEQLRKNPMGYGHLALAADAL-VDGAPSVTFA 608

Query: 756 GHKSSVDFENMLAAAHASY 774
           G + +V  E +LA +   Y
Sbjct: 609 GTREAV--EPLLAVSRTVY 625


>gi|392962639|ref|ZP_10328068.1| glycoside hydrolase family 76 [Pelosinus fermentans DSM 17108]
 gi|421053373|ref|ZP_15516355.1| glycoside hydrolase family 76 [Pelosinus fermentans B4]
 gi|421058355|ref|ZP_15521061.1| glycoside hydrolase family 76 [Pelosinus fermentans B3]
 gi|421066419|ref|ZP_15528029.1| glycoside hydrolase family 76 [Pelosinus fermentans A12]
 gi|421073618|ref|ZP_15534678.1| hypothetical protein FA11_0867 [Pelosinus fermentans A11]
 gi|392442414|gb|EIW20004.1| glycoside hydrolase family 76 [Pelosinus fermentans B4]
 gi|392444040|gb|EIW21515.1| hypothetical protein FA11_0867 [Pelosinus fermentans A11]
 gi|392451880|gb|EIW28849.1| glycoside hydrolase family 76 [Pelosinus fermentans DSM 17108]
 gi|392456062|gb|EIW32823.1| glycoside hydrolase family 76 [Pelosinus fermentans A12]
 gi|392460977|gb|EIW37218.1| glycoside hydrolase family 76 [Pelosinus fermentans B3]
          Length = 683

 Score =  461 bits (1185), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 265/678 (39%), Positives = 373/678 (55%), Gaps = 53/678 (7%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           +K  NRL  E SPYLLQHA+NPVDW  W +EAF +A++ D P+F S GYS CHWCHVME 
Sbjct: 2   DKKPNRLIKEKSPYLLQHAYNPVDWHPWCDEAFEKAKREDKPVFFSSGYSCCHWCHVMER 61

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           E FED+ VA LLN  F++IKVDREERPDVD +YM+  QAL G GGWPL++ ++P+ KP  
Sbjct: 62  ECFEDQEVADLLNQHFIAIKVDREERPDVDGIYMSVCQALTGQGGWPLTIIMAPNKKPFF 121

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFP   K GR G   +L  +   W+  R  + ++G   +  L     AS       +
Sbjct: 122 AGTYFPKHRKMGRMGLLELLTTLHQHWENNRSEIIKAGNEIVSILQRPKPASEEGQVGEE 181

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
            L Q  L     +L  SYDS+ GGFGSAPKFP P +I  +L + +  ++        +  
Sbjct: 182 LLKQAYL-----ELENSYDSQCGGFGSAPKFPTPHKITFLLRYWQHFKE-------PKAL 229

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
            MV  TL  M +GGI+DH+G GF RYS D++W VPHFEKMLYD   L   YL+A+  T +
Sbjct: 230 AMVEKTLMSMWQGGIYDHLGYGFARYSTDQKWLVPHFEKMLYDNALLCTSYLEAYQCTGN 289

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             ++ I  +IL Y+ RDM+   G  +SAEDADS   EG     EG FYV+T KEV +ILG
Sbjct: 290 GEFARIAEEILTYVMRDMMDKSGGFYSAEDADS---EGV----EGKFYVFTRKEVLEILG 342

Query: 459 -EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN-DSSASASKLGMPLEKYLN 516
            E   LF + Y +   GN +            G ++   +  D    A K+   +E    
Sbjct: 343 EEEGTLFADFYQISSQGNFE-----------HGTSIPNRIGRDLEEYARKVKWTVESLSA 391

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
           +L + R KL+ VR KR  PH DDK++ +WNGL+I++FA+A+K+LK               
Sbjct: 392 LLEQGREKLYHVREKRIHPHKDDKILTAWNGLMIAAFAKAAKVLK--------------- 436

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
            + +Y  VAE  A+FI   L  +   RL   +R G +    ++DDYAFL+  L+++YE  
Sbjct: 437 -QSKYANVAEQGAAFIYEKLM-KADGRLLARYREGEAAHQAYIDDYAFLLMALIEVYEAT 494

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
              ++L  A+ L    + LF D   GG++    +   +++R KE +DGA PSGNSV+ + 
Sbjct: 495 CNNQYLHRAVTLAKDMEALFGDNTEGGFYFYGNDGEELIVRPKEIYDGAIPSGNSVAALA 554

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVG 756
           L +L  I   +    +   AE  L+ F   +   A        A D   V     +++ G
Sbjct: 555 LQKLGDI---TDDRGFSDIAERLLSSFAGEVSRYAAGYTYFMMAVDYY-VADNTKIIIAG 610

Query: 757 HKSSVDFENMLAAAHASY 774
            K + D + ML   ++ +
Sbjct: 611 DKEAADTKAMLDVINSCF 628


>gi|405123962|gb|AFR98725.1| cold-induced thioredoxin domain-containing protein [Cryptococcus
           neoformans var. grubii H99]
          Length = 745

 Score =  460 bits (1184), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 279/695 (40%), Positives = 392/695 (56%), Gaps = 40/695 (5%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           +N LA   SPYLLQH  NPV W  W  E  A A+K D PIFLS GYS CHWCHV+  ESF
Sbjct: 14  SNVLAKSKSPYLLQHKDNPVAWQEWSPETIALAQKLDKPIFLSSGYSACHWCHVLAHESF 73

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           EDE  AK++N+WFV+IKVDREERPDVD++YM+Y+QA+ GGGGWP+S+F++P L+P   GT
Sbjct: 74  EDEETAKMMNEWFVNIKVDREERPDVDRMYMSYLQAVSGGGGWPMSIFMTPKLEPFFAGT 133

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP      RP F  +L K+ + W++ R+   + G   IE L +      +S  L   L 
Sbjct: 134 YFP------RPNFHQLLNKIHEVWEEDREKCEKMGKGVIEALKDMSDTGRTSESLSQLLS 187

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSA------PKFPR-PVEIQMMLYHSKKLEDTGKSGEA 334
            +       QLS   D+R+GGF +A      PKFP   + ++ +   +       ++ E 
Sbjct: 188 SSPASKLFAQLSTMNDTRYGGFTNAGSSTRGPKFPSCSITLEPLARLASIPGGGARNAEI 247

Query: 335 SE-GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
            E  ++M +  L+ M  GGI D VGGG  RYSVDE+W VPHFEKMLYDQ QL +  LD  
Sbjct: 248 REDAREMGMKMLRSMWSGGIRDWVGGGMARYSVDEKWMVPHFEKMLYDQAQLVSSCLDFA 307

Query: 394 SLT----KDVFYSY-ICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK--KEGAFY 446
            L     +D    Y +  DIL Y  RD+  P G  +SAEDADSAE +GA +    EGAFY
Sbjct: 308 RLYPANHQDRLLCYDLAADILKYTLRDLKSPEGGFWSAEDADSAEYKGAKKSVLPEGAFY 367

Query: 447 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 506
           +W   E+++ILG+ A LF   + ++P GN ++  + D H E +GKN+L +       A +
Sbjct: 368 IWKKTEIDEILGDDAPLFDSFFGVEPDGNVNI--IHDSHGEMRGKNILHQHKTYEEVALE 425

Query: 507 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 566
            G   ++  +I+ E   KL   R +R RP LDDK++ +WNGL++++ ++AS +L S    
Sbjct: 426 FGKREDQAKDIIIEACEKLRLKREERERPGLDDKILTAWNGLMLTALSKASTLLPSSYGI 485

Query: 567 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP-GFLDDYAFL 625
           +    P            A    +F++ H++D  T  L  S+R G  K P    DDYAFL
Sbjct: 486 SSQCLP-----------AALGIVNFVKSHMWDPSTRTLTRSYREG--KGPQAQTDDYAFL 532

Query: 626 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 685
           I GLL+LYE       +++A ELQ  QDELF D + GGYF  + ED  VL+R+K+  DGA
Sbjct: 533 IQGLLNLYEATGDESHVLFAEELQKRQDELFWDDDDGGYF-ASAEDAHVLVRMKDAQDGA 591

Query: 686 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 745
           EPS  +VS  NL R + +++ S+ + Y   AE +       +     AV         L 
Sbjct: 592 EPSAAAVSAHNLSRFSLLLS-SEFENYEARAEATFLSMGPLITQAPRAVGYAVSGLIDLE 650

Query: 746 VPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
              R+ V+++G  +    +  L AA  +Y  N+ +
Sbjct: 651 KGYRE-VIVIGSANDEMIKEFLKAARETYFSNQVI 684


>gi|440792869|gb|ELR14077.1| Hypothetical protein ACA1_367000 [Acanthamoeba castellanii str.
           Neff]
          Length = 865

 Score =  459 bits (1181), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 276/711 (38%), Positives = 374/711 (52%), Gaps = 121/711 (17%)

Query: 84  VAMAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFL 143
           ++ A  TPA+    R +  NRLAAE SPYLLQH HNPVDW+AWGEEAFA+A++ + PIFL
Sbjct: 207 LSTAPTTPAAVPPQRKE--NRLAAEKSPYLLQHKHNPVDWYAWGEEAFAKAKRENKPIFL 264

Query: 144 SIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGG 203
                               E +++LLND FVSIKVDREERPDVD++YMTYV A  G GG
Sbjct: 265 --------------------EKISRLLNDNFVSIKVDREERPDVDRLYMTYVTATTGHGG 304

Query: 204 WPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL 263
           WPLSVFL+PDLKPL+GGTYFPP  KYGRPGF T++  V   W +K+D L          L
Sbjct: 305 WPLSVFLTPDLKPLVGGTYFPPTSKYGRPGFDTLIHNVDKVWREKQDQLKAEADNTAHAL 364

Query: 264 SEALS-ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YH 321
            E ++ A      + D+  + A     + L++SYD   GGF  APKFPR   +  +   +
Sbjct: 365 QEYMTVAGKEVEGIDDDSIEIAYDAALKSLAESYDEEHGGFTRAPKFPRLATLNFLFRVY 424

Query: 322 SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 381
             + E    + +A++   M L TL  MA+GGI+DH+G           W VPHFEKMLYD
Sbjct: 425 GHRKEGLELNEKATKAMDMALVTLTKMARGGIYDHIGN----------WLVPHFEKMLYD 474

Query: 382 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 441
           Q QL   YL A+ +T +  ++ +  D+L+Y+   +  P G  +SAEDADS  +  +  K 
Sbjct: 475 QSQLTMAYLSAYQITDEPVFADVAEDVLEYVTTKITSPEGAFYSAEDADSLVSPDSDEKV 534

Query: 442 EGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 500
           EGAFYVW   EV   LGE    +F   Y + P GN  +   +D   E K KNVL E   +
Sbjct: 535 EGAFYVWEYDEVIKALGEQDGKIFAHRYGVLPEGN--VPAPADIQGELKHKNVLAEKLTA 592

Query: 501 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 560
             +A + G  ++    +  E + KL   R KRPRPHLDDK+I SWNGL+IS++ARAS++L
Sbjct: 593 EETALEFGFKVDYVDKLTMESKAKLKHERDKRPRPHLDDKIITSWNGLMISAYARASEVL 652

Query: 561 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 620
                             K Y E A   A FIR  LYD+Q                    
Sbjct: 653 GD----------------KRYAESASKCAQFIRDQLYDDQ-------------------- 676

Query: 621 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 680
                              + ++WA +               GYFNT  +DPS+L RV++
Sbjct: 677 -------------------EAILWARQR--------------GYFNTVKDDPSLLARVRD 703

Query: 681 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA------VFETRL-----KD 729
           D DGAEPS NS+S +NLVRL  +     SD + + AE + +      +   RL     KD
Sbjct: 704 DQDGAEPSSNSISAMNLVRLWHMTG---SDDWYKKAEATFSSCKGPIITPLRLTVCPAKD 760

Query: 730 MAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
             + VP M C+ D  S  + K +V+ G  ++ D   +L    + +  N+ +
Sbjct: 761 APLMVPQMLCSLD-FSRATAKQIVIAGDPNAEDTAALLKEVRSQFIPNRVL 810


>gi|302037753|ref|YP_003798075.1| hypothetical protein NIDE2440 [Candidatus Nitrospira defluvii]
 gi|300605817|emb|CBK42150.1| conserved protein of unknown function (modular protein) [Candidatus
           Nitrospira defluvii]
          Length = 1236

 Score =  459 bits (1181), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 259/693 (37%), Positives = 379/693 (54%), Gaps = 55/693 (7%)

Query: 94  TSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWC 153
           TS +  +  NRL  + SPYLLQHA+NPVDW+ WG EA A+A K + PI LSIGYS+CHWC
Sbjct: 2   TSTTPGREPNRLIRQTSPYLLQHAYNPVDWYPWGPEALAQAAKLNRPILLSIGYSSCHWC 61

Query: 154 HVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQAL-YGGGGWPLSVFLSP 212
           HVME ESFE+E +A+L+N  FV IKVDREERPD+D++YM    AL    GGWP++VFL+P
Sbjct: 62  HVMERESFENEAIARLMNHHFVCIKVDREERPDLDEIYMQATLALNRNQGGWPMTVFLTP 121

Query: 213 DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASAS 272
           D KP   GTYFPPED++GRPGF T+L+K+ + W+K    +    A    +L +   A + 
Sbjct: 122 DQKPFFAGTYFPPEDRWGRPGFPTLLKKIAEYWEKDHAGVVAQAATLTARLQDGSHAPS- 180

Query: 273 SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSG 332
               P  + +  L +   Q ++ +D++ GGFG APKFP    + ++L+   + +D     
Sbjct: 181 ----PTTVGEAELDMAVTQFAEDFDAKLGGFGGAPKFPPATGLSLLLHCYHRTKD----- 231

Query: 333 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 392
              +   MV  TL  MA GGI+D +G GF RYS D+RW VPHFEKMLYD   LA VY++A
Sbjct: 232 --PQTLTMVRTTLDAMAAGGIYDQIGDGFARYSTDDRWLVPHFEKMLYDNALLARVYVEA 289

Query: 393 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 452
           F +T D  Y  +  + LDY+ ++M  P G  +SA DADS   EG     EG F+VWT  E
Sbjct: 290 FQVTADPNYRRVACETLDYILKEMTSPEGGFYSATDADS---EGV----EGKFFVWTPDE 342

Query: 453 VEDILG--EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 510
           +  +L   E       +Y + P GN            ++ KNVL      ++ A +LG+ 
Sbjct: 343 IRAVLSNEEDVRRICTYYDVTPAGN------------WEHKNVLHTAKPVASVAKELGLT 390

Query: 511 LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
           +E     +   +  L+  R+KR  P LDDKVI +WNG++IS+ A A ++         F+
Sbjct: 391 VEDLQATIDRVKPLLYAARAKRVPPGLDDKVITAWNGMMISAMAEAGRV---------FD 441

Query: 571 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLL 630
            P        Y   AE A  F+   L  +   RL  ++R G +    +L+DYA+   GL+
Sbjct: 442 MP-------RYRAAAERACEFLLTTL-SKPDGRLLRTYRAGTAHLDAYLEDYAYFAEGLI 493

Query: 631 DLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGN 690
           D YE G   ++L  A+ L       F D + GG+F T     ++++R +E  DGA PSGN
Sbjct: 494 DTYEAGGHERYLSAAVRLAERILADFSDGQQGGFFTTATGHEALIVRSREGPDGATPSGN 553

Query: 691 SVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRK 750
           +V+   L RL+        + +RQ A  ++  +  ++     A        D+L+     
Sbjct: 554 AVAAAALARLSYHFG---REDFRQAAAGAVRAYGRQIARYPRAFAKSLIVVDLLT-SGPV 609

Query: 751 HVVLVGHKSSVDFENMLAAAHASYDLNKTVSKK 783
            + ++G     +   + AA   +Y  N+ ++ +
Sbjct: 610 EIAVIGAPDDSNTVALRAAVSRTYIPNRVIASR 642


>gi|21226721|ref|NP_632643.1| hypothetical protein MM_0619 [Methanosarcina mazei Go1]
 gi|20905010|gb|AAM30315.1| conserved protein [Methanosarcina mazei Go1]
          Length = 700

 Score =  459 bits (1181), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 257/683 (37%), Positives = 372/683 (54%), Gaps = 45/683 (6%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
            K  NRL  E SPYLLQHA+NPVDW+ WGEEAF +ARK + P+FLSIGYSTCHWCH+M  
Sbjct: 8   QKEPNRLIKEKSPYLLQHAYNPVDWYPWGEEAFEKARKENKPVFLSIGYSTCHWCHMMAH 67

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE VA L+N+ FVSIKVDREERPD+D +YMT  Q + G GGWPL++ ++P  KP  
Sbjct: 68  ESFEDEEVAGLMNEAFVSIKVDREERPDIDNIYMTVCQIILGRGGWPLNIIMTPGKKPFF 127

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTY P   ++ + G   ++ ++K+ W+++ + +  S       + E +  S+       
Sbjct: 128 AGTYIPKNTRFNQIGMLELVPRIKEIWEQQHEEVLDSAEKITSTIQEMIKESSGEG---- 183

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
            L +  +    E+L  S+D+ +GGF  APKFP P +I  +L + ++  +        E  
Sbjct: 184 -LGEEVIEEVYEELLSSFDTEYGGFSGAPKFPTPHKISFLLRYWRRSRN-------PEAL 235

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
            M  +TL  M +GGI+DH+G GFHRYS D  W +PHFEKMLYDQ   A  Y +A+ +T  
Sbjct: 236 HMAEYTLDKMRRGGIYDHLGSGFHRYSTDSMWLLPHFEKMLYDQALTAIAYTEAYQVTGK 295

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             Y      ILDY+ RD+  P G  +  EDAD         ++EG +Y+WT +E+  IL 
Sbjct: 296 DLYKETAEGILDYVLRDLTSPEGGFYCGEDAD-------VEREEGKYYLWTLEEIRSILD 348

Query: 459 -EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
            E + L  + + L+  GN +     +      G N+        + A+K+ +P+E+    
Sbjct: 349 PEDSELIIKMFNLREEGNFE----EEIRGRETGTNLFYMARSPGSLAAKMKIPVEEVEKK 404

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
           +   R KL   R +R RP LDDK++  WNGL+I++FA+               + V G  
Sbjct: 405 VKAAREKLLKARYERKRPSLDDKILTDWNGLMIAAFAKG--------------YQVFGEQ 450

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGS 637
           R  Y++ AE AA FI   LY      L H +R+G +   G  DDYAFLI GLL+LYE G 
Sbjct: 451 R--YLKAAEKAADFILMALYS-PGDGLLHRYRDGVAGISGTSDDYAFLIHGLLELYEAGF 507

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
             ++L  A+ L +   E F D   GG + T  +  +++ R KE  D A P+GNS  ++NL
Sbjct: 508 KMRYLKAAVSLNSELLECFWDPVNGGLYFTANDSEALIFRKKEFMDSAIPTGNSFEMLNL 567

Query: 698 VRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGH 757
           +RL+ I+A    +   + A+     F  ++            A D    PS + V++ G 
Sbjct: 568 LRLSRIIADPGLE---ETADKLERAFSKQIMKAPSGYTQFLSAFDFRLGPSYE-VIISGK 623

Query: 758 KSSVDFENMLAAAHASYDLNKTV 780
             + D E ML    + +  NK +
Sbjct: 624 AEASDTEQMLKELWSYFVPNKVL 646


>gi|194017545|ref|ZP_03056156.1| YyaL [Bacillus pumilus ATCC 7061]
 gi|194010817|gb|EDW20388.1| YyaL [Bacillus pumilus ATCC 7061]
          Length = 687

 Score =  459 bits (1180), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 254/608 (41%), Positives = 346/608 (56%), Gaps = 49/608 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           N+  N L  E SPYLLQHAHNPV W+ WG+EAF +A++ + P+ +SIGY+TCHWCHVM  
Sbjct: 4   NQTPNPLITEKSPYLLQHAHNPVHWYPWGQEAFDKAKRENKPVLVSIGYATCHWCHVMAH 63

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFED+ VA +LN+ F+SIKVDREERPD+D +YM+  Q + G GGWPL+VF++PD KP  
Sbjct: 64  ESFEDQQVADILNEHFISIKVDREERPDIDSMYMSVCQMMTGQGGWPLNVFVTPDQKPFY 123

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFP    YGRPGF   L +++DA+   RD +      A   L    +    S     
Sbjct: 124 AGTYFPKRSAYGRPGFIEALTQLRDAYHNDRDHIESLAEKATNNLRIKAAGQTEST---- 179

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
            L Q A+     QL  S+D+  GGFGSAPKFP P    M+ +  +  E TG+        
Sbjct: 180 -LTQEAIHKAYYQLMSSFDTLHGGFGSAPKFPAP---HMLSFLMRYYEWTGQEN----AL 231

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
             V+ TL  MA GGI+DHVG GF RYS DE+W VPHFEKMLYD   L   Y +A+ LT+ 
Sbjct: 232 YAVMKTLDGMANGGIYDHVGSGFSRYSTDEKWLVPHFEKMLYDNALLMEAYTEAYQLTQQ 291

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             Y  +   ++ +++RDM+ PGG  +SA DADS   EG    KEG +YVW+  E+   LG
Sbjct: 292 PEYEKLVHRLIHFIKRDMMNPGGSFYSAIDADS---EG----KEGQYYVWSKDEIMTHLG 344

Query: 459 EH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
           E    LF   Y++   GN + + +  PH       +    +D  AS S     L+  L  
Sbjct: 345 EDLGALFCAIYHITEEGNFEGANI--PH------TISTSFDDIKASFSIDDHALQSKLQ- 395

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
             E R  L  VR +RP P +DDKV+ SWN L+ISS A+A ++  +E              
Sbjct: 396 --EARHILQSVRQQRPAPLVDDKVLTSWNALMISSLAKAGRVFGAE-------------- 439

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGS 637
             E + +A+ A SF+  HL   Q  RL   +R G  K  GF++DYA ++   + LYE   
Sbjct: 440 --EAIRMAKQAMSFLETHLV--QHDRLMVRYREGDVKHLGFIEDYAHMLKAYMSLYEATF 495

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
              WL  A  +     ELF D+E GG+F +  +  ++++R KE +DGA PSGNS ++  L
Sbjct: 496 ELAWLEKATAIAKNMFELFWDKEKGGFFFSGSDAEALIVREKEVYDGAMPSGNSTALKQL 555

Query: 698 VRLASIVA 705
           + L+ +  
Sbjct: 556 LMLSRLTG 563


>gi|340345243|ref|ZP_08668375.1| Thioredoxin [Candidatus Nitrosoarchaeum koreensis MY1]
 gi|339520384|gb|EGP94107.1| Thioredoxin [Candidatus Nitrosoarchaeum koreensis MY1]
          Length = 675

 Score =  459 bits (1180), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 248/606 (40%), Positives = 353/606 (58%), Gaps = 49/606 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L  E SPYLLQHA NPVDW+AW +E+  +A+  + PIFLS+GYS CHWCHVM  ESFE
Sbjct: 4   NHLIHETSPYLLQHAENPVDWYAWNDESLKKAKDENKPIFLSVGYSACHWCHVMAHESFE 63

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           ++ VAK +N+ FV+IKVDREERPD+D +Y    Q   G GGWPLS+FL+PD KP   GTY
Sbjct: 64  NDEVAKFMNENFVNIKVDREERPDLDDIYQKVCQIATGQGGWPLSIFLTPDQKPFYVGTY 123

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP  D YGRPGF +I R++  AW +K   + +S    +  L +A +      K+P +L +
Sbjct: 124 FPVLDSYGRPGFGSITRQLAQAWKEKPKDIEKSADNFLSALQKAETV-----KIPSKLEK 178

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L   A  L +  D+ +GGFGSAPKFP    +  +  ++K    TG     S+  +  L
Sbjct: 179 VILDEAAMNLFQLGDAAYGGFGSAPKFPNAANVSFLFRYAKL---TG----LSKFNEFAL 231

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  MAKGGI D +GGGFHRYS D +W VPHFEKMLYD   +   Y +A+ +T+D FY 
Sbjct: 232 KTLNKMAKGGIFDQIGGGFHRYSTDAKWLVPHFEKMLYDNALIPVNYAEAYQITQDQFYL 291

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            +    L ++ R+M    G  +SA DADS   EG     EG FYVW   E+++ILG+ A 
Sbjct: 292 EVLHKTLGFVLREMTSKEGGFYSAYDADS---EGV----EGKFYVWKKSEIKEILGDDAE 344

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
           +F  +Y +   GN            ++G ++L    + SA A   GMP EK   IL  C 
Sbjct: 345 IFCLYYDVTDGGN------------WEGNSILCNNINISAVAFHFGMPEEKIKEILVRCS 392

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYM 582
            KL +VRSKR  P LDDKV+ SWN L+I++FA+  ++                +   +Y+
Sbjct: 393 EKLLNVRSKRVPPGLDDKVLTSWNALMITAFAKGYRV----------------TGETKYL 436

Query: 583 EVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWL 642
           + A++  SFI   L D+   +L  +++N  +K  G+L+DY++  + LLD++E     K+L
Sbjct: 437 DAAKNCVSFIETKLLDDT--KLLRTYKNNVAKIDGYLEDYSYFANALLDVFEIEPEAKYL 494

Query: 643 VWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLAS 702
             A++L +   + F D E   +F T+ +   +++R K ++D + PSGNSVS   ++RL  
Sbjct: 495 NLAVKLGHHLVDHFWDPESSSFFMTSDDHEKLIIRPKSNYDLSLPSGNSVSCFVMLRLYH 554

Query: 703 IVAGSK 708
           +    K
Sbjct: 555 LTQEEK 560


>gi|328951864|ref|YP_004369198.1| hypothetical protein Desac_0120 [Desulfobacca acetoxidans DSM
           11109]
 gi|328452188|gb|AEB08017.1| protein of unknown function DUF255 [Desulfobacca acetoxidans DSM
           11109]
          Length = 693

 Score =  459 bits (1180), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 264/677 (38%), Positives = 374/677 (55%), Gaps = 52/677 (7%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           N   NRL  E SPYL QHA+N VDW  WG EA  +A   D PI LSIGYSTCHWCHVM  
Sbjct: 4   NARPNRLLYETSPYLRQHAYNLVDWHPWGPEALEKAHLEDRPILLSIGYSTCHWCHVMAH 63

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           E FED  +A+L+N+WF++IKVDREERPD+D +YM  VQ + G GGWPL+VFL+P+LKP  
Sbjct: 64  ECFEDPEIARLMNEWFINIKVDREERPDLDDIYMHAVQMITGRGGWPLTVFLTPELKPFY 123

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
           GGTYFPP D+ G PGF  +L+ + D++  K+  +    A  +EQ    L+ + +S + P 
Sbjct: 124 GGTYFPPIDRGGLPGFPRLLQALHDSYKNKKSNIHNVIA-TLEQNMRILALTPASGQAPS 182

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
                AL    E     +D   GGF  APKFP   ++     H        ++G+    Q
Sbjct: 183 ---LAALDQLIEHNLADFDEGNGGFRGAPKFPPSQDLGFWACHYH------RTGQPKVLQ 233

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
            + L TLQ MA+GG++D + GGFHRYSVD+ W +PHFEKMLYD  QLA  YL+A+ +T D
Sbjct: 234 SLSL-TLQKMARGGLYDQLRGGFHRYSVDDVWLIPHFEKMLYDNAQLARRYLEAYQITGD 292

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
           VF + + +  LDY+  +M  P G  ++A+DADS   EG     EG F+VWT +++ ++ G
Sbjct: 293 VFLAQVAQQTLDYVLAEMTAPEGVFYAAQDADS---EGV----EGRFFVWTPEQIAEVAG 345

Query: 459 -EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
            + A L    + +   GN +            G +VL    + +  A +  + +++  ++
Sbjct: 346 AQRAPLICAAFGVTQEGNFE-----------HGASVLHRPQNEAQLAEQFSLNMDEMRHV 394

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
           L E RR+L+  R +R RPH D+K+I +WN L+IS+ A  S++L                D
Sbjct: 395 LTEARRRLWQGREQRVRPHRDEKIITAWNALMISALAYGSQVL----------------D 438

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGS 637
            + Y   A +AA FI     + Q  RL   +     +   FLDD+AF I+ LLDLYE   
Sbjct: 439 NRTYRGAAITAAQFILGR--EAQAGRLLRIWAATDRQGSAFLDDFAFFIAALLDLYETDF 496

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
              WL  A+ L    +  F DRE GGYF+T  +   +L+R K   D A PSGNSV V NL
Sbjct: 497 SPAWLAAAVRLSKEVETSFYDREAGGYFSTPVDHEKLLVRPKNFFDLAIPSGNSVMVHNL 556

Query: 698 VRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGH 757
           +RL         DY+ + A+ +L   +T + +    +  +  A +    P+   + LVG+
Sbjct: 557 IRLHRFT--DNPDYFLR-AQETLTRLQTLMMENPRGLSHLAAATEDFLAPTLA-ITLVGN 612

Query: 758 KSSVDFENMLAAAHASY 774
            +      MLA  +  Y
Sbjct: 613 PTEPALAEMLAVVYRHY 629


>gi|172058552|ref|YP_001815012.1| hypothetical protein Exig_2546 [Exiguobacterium sibiricum 255-15]
 gi|171991073|gb|ACB61995.1| protein of unknown function DUF255 [Exiguobacterium sibiricum
           255-15]
          Length = 677

 Score =  458 bits (1179), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 262/675 (38%), Positives = 371/675 (54%), Gaps = 65/675 (9%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRL  E SPYLLQHA NPVDW+ WGEEAFA AR  + PIFLSIGYSTCHWCHV+  ESF
Sbjct: 3   TNRLINEKSPYLLQHATNPVDWYPWGEEAFAAARSANKPIFLSIGYSTCHWCHVLAHESF 62

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           EDE  A++LND F+SIKVDREERPD+D++YMT  Q + G GGWPLSVF+SPD  P   GT
Sbjct: 63  EDEETARMLNDRFISIKVDREERPDIDQIYMTAAQMMNGQGGWPLSVFMSPDQTPFYIGT 122

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP   ++ RP F+ +L ++ + +    D + + G    +++ +AL+A  + +   D L 
Sbjct: 123 YFPKTPQFNRPSFRQVLLQLSEHYRTDPDKIKRVG----QEIIQALTAVTTFDS-EDPLD 177

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           +  +    +Q  + YD   GGFG+APKFP P  +  +L       D  +  E     +MV
Sbjct: 178 EALVHETFDQAMRQYDVENGGFGTAPKFPSPSLLTFLL-------DYYRFAEDETALQMV 230

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
           + TL  M  GGI DHVG G +RY+VDERW +PHFEKMLYD    A + ++ + ++    +
Sbjct: 231 MRTLTAMRDGGITDHVGFGLYRYTVDERWEIPHFEKMLYDNALFATLCIETYQVSGRERF 290

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHA 461
                +I  Y+ RD+  P G  +SAEDADS   EG    +EG FY +T  E+ D+LG+ A
Sbjct: 291 KQYAEEIFAYIERDLSSPDGAFYSAEDADS---EG----REGLFYTFTFDELTDLLGQDA 343

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS-KLGMPLEKYLNILGE 520
           + F   Y   P GN            F+G+ V      S    S      ++  L  L +
Sbjct: 344 V-FPLLYQATPQGN------------FEGRIVFRRTGQSIQQLSADRNTAVQDILIQLEQ 390

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            RR L   RS+R RP  DDKV+ SWN L+IS++A+A ++   E                 
Sbjct: 391 ERRTLLLFRSQRTRPFRDDKVLTSWNALMISAYAKAGRVFNDE----------------R 434

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           Y + A  A +F+  HL D+   RL   +R G  +  G+LDDY+FL    L+L++      
Sbjct: 435 YTKFARQALTFLETHLMDDD--RLHVRYRQGHIQGNGYLDDYSFLTEAYLELHQTTQHIP 492

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
           +L  AI L       F D E G +F T+ ED ++L+R K+ +D  +P+GNS +V NL+RL
Sbjct: 493 YLKQAIRLTERMIGDFSD-EDGSFFFTSFEDETLLMRPKDVYDVVKPAGNSTAVSNLLRL 551

Query: 701 ASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHV----VLVG 756
           + +   +    YR  A+ + +   + +K            A +LSV +R  +    ++V 
Sbjct: 552 SQLTGRTD---YRDQAQRNFSTLASEIKSQPTGF------ASLLSVYTRTLMEPKELIVL 602

Query: 757 HKSSVDFENMLAAAH 771
            +S  D  + L   H
Sbjct: 603 TESYTDVASFLTQLH 617


>gi|442804077|ref|YP_007372226.1| N-acylglucosamine 2-epimerase family protein [Clostridium
           stercorarium subsp. stercorarium DSM 8532]
 gi|442739927|gb|AGC67616.1| N-acylglucosamine 2-epimerase family protein [Clostridium
           stercorarium subsp. stercorarium DSM 8532]
          Length = 679

 Score =  458 bits (1178), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 254/607 (41%), Positives = 352/607 (57%), Gaps = 60/607 (9%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           +  NRL  E SPYLLQHA+NPVDWF W +EAF +A+  + P+FLSIGYSTCHWCHVME E
Sbjct: 9   RKANRLINEKSPYLLQHAYNPVDWFPWCDEAFNKAKSENKPVFLSIGYSTCHWCHVMERE 68

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFEDE VA +LN  FV+IKVDREERPD+D +YMT+ QA+ G GGWPL++ ++PD KP   
Sbjct: 69  SFEDEEVADILNKHFVAIKVDREERPDIDHIYMTFCQAITGHGGWPLTIIMTPDKKPFFA 128

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           GTYFP  D++G PG  TIL+    AW++ +  L + G    EQ+  ++  S  ++   + 
Sbjct: 129 GTYFPKNDRHGMPGLVTILKSAHRAWEENKKDLERLG----EQILNSV-YSEDNDYQHEV 183

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
           L +  +    +QL  S+D  +GGFG+APKFP P  +  +L +         +GE  +  +
Sbjct: 184 LSETIIDDIYKQLESSFDPVYGGFGNAPKFPAPHNLLFLLRYWY------ATGE-KKALE 236

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           MV  TL  M KGGI+DH+G GF RYS D +W +PHFEKMLYD   LA  Y +A+  TK  
Sbjct: 237 MVEKTLDSMHKGGIYDHIGFGFCRYSTDRKWLIPHFEKMLYDNALLAMAYSEAYQATKKD 296

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG- 458
            Y+ I  +I  Y+ RDM  P G  +SAEDADS   EG     EG FY WT +EV  +LG 
Sbjct: 297 KYARIAAEIYKYIERDMTSPEGAFYSAEDADS---EGV----EGFFYTWTYEEVMSVLGD 349

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
           E    F   + + P+GN            F+G+N+   +N   + +  + +         
Sbjct: 350 EDGKRFCGIFDITPSGN------------FEGRNIPNLINADPSDSDFIEI--------- 388

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
             CR+KLF+ R KR RP  DDK++ SWN L+ +S A   +ILK                 
Sbjct: 389 --CRKKLFETREKRIRPFKDDKILTSWNALMAASLAVGGRILKD---------------- 430

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
              + +A+ A SFI+  L  E   RL   +R+G +  P FLDDYA+L    ++LY+    
Sbjct: 431 MNLINMAKKAVSFIKAKLVREDG-RLLARYRDGSADIPAFLDDYAYLQWAYIELYQSTHE 489

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
             +L+ A+ +    + LFLD E GG+F    +   ++ R K+ +DGA PSGNSV  +NL+
Sbjct: 490 PGYLIDAVSINEEINGLFLDDEKGGFFFYGNDAERLITRPKDAYDGAMPSGNSVMAMNLL 549

Query: 699 RLASIVA 705
           +L+ I  
Sbjct: 550 KLSQITG 556


>gi|325288476|ref|YP_004264657.1| hypothetical protein Sgly_0289 [Syntrophobotulus glycolicus DSM
           8271]
 gi|324963877|gb|ADY54656.1| protein of unknown function DUF255 [Syntrophobotulus glycolicus DSM
           8271]
          Length = 752

 Score =  458 bits (1178), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 269/680 (39%), Positives = 379/680 (55%), Gaps = 73/680 (10%)

Query: 95  SHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCH 154
           S ++N  +NRL  E SPYLLQHAHNPVDW+ WG EAF +A K + P+FLSIGYSTCHWCH
Sbjct: 2   SAAKNGVSNRLIHEKSPYLLQHAHNPVDWYPWGIEAFEKAAKENKPVFLSIGYSTCHWCH 61

Query: 155 VMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDL 214
           VME ESFED+ VA+ LN  F+++KVDREERPD+D  YMT+ QAL G GGWPL++ ++PD 
Sbjct: 62  VMERESFEDKEVAEKLNKSFIAVKVDREERPDIDHTYMTFCQALTGAGGWPLTILMTPDK 121

Query: 215 KPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLA------------------QSG 256
           KP   GTYF      GR G   +L    + W  +++ +                   Q  
Sbjct: 122 KPFFAGTYFAKNSGGGRVGLIDVLDYTSEKWKNEKEKILTSAEELYTVVSSHYGGKDQET 181

Query: 257 AFAIEQLSEALSASASSNKLPDEL---PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPV 313
            F  E L E +  + +  +  D++    +  +    E L+K++D +FGGFG APKFP P 
Sbjct: 182 VFKKEGLLEEVRYADARKQTKDDIMVWGKQMIEKGYEMLAKTFDPKFGGFGHAPKFPSPH 241

Query: 314 EIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVP 373
            +  ++       D           +MV  TL  MA GGI+D +G GF RYS D  W VP
Sbjct: 242 TLGFLMRCHLDRPD-------QNALEMVRKTLDLMADGGIYDQIGYGFSRYSTDRFWLVP 294

Query: 374 HFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAE 433
           HFEKMLYD   LA  YL+A+ LT +  Y  + R+I  Y+ R+M  P G  +SAEDADS  
Sbjct: 295 HFEKMLYDNATLAYTYLEAYQLTHEQRYGQVAREIFSYVLREMCSPEGGFYSAEDADS-- 352

Query: 434 TEGATRKKEGAFYVWTSKEVEDILGEHAILFKE-------------------HYYLKPTG 474
            EG    +EG +Y+WT +EV + L    +  +E                   H  + P  
Sbjct: 353 -EG----EEGKYYIWTYQEVMETLTAELLRIQENRASLDQPDGRDIFQSQFAHPDVLPGL 407

Query: 475 NCDLSRMSDPHNEFKGKNVLIEL-NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRP 533
            C+  +++   N F+GKN+L  L +D    A K  +P ++++  +  C   L  VR +R 
Sbjct: 408 YCEAYQITKEGN-FEGKNILNRLFSDWRDLARKASIPFDEFVRAIRYCNTILLRVRERRV 466

Query: 534 RPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP----VVGSDRKEYMEVAESAA 589
           RP  DDK++VSWNGL+I++ A+ +++L         +FP     V  +   Y+  AE AA
Sbjct: 467 RPIRDDKILVSWNGLMIAALAKGAQVL---------SFPDQTFAVHENASLYLTQAEKAA 517

Query: 590 SFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQ 649
           +FI  ++      RL   +R+G ++ P +LDDYAF I GLL+LY       +L  AIELQ
Sbjct: 518 NFIDDNMRSSDG-RLFARYRHGEAQYPAYLDDYAFYIFGLLELYTACGKPVYLQRAIELQ 576

Query: 650 NTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKS 709
             Q+ LF D E GGYF T  +   +L R KE +DGA PSGNS++V+NL +L  +   +K 
Sbjct: 577 QQQENLFRDTEKGGYFFTGKDSEELLFRPKEVYDGALPSGNSLAVLNLTKLWKMTGDNK- 635

Query: 710 DYYRQNAEHSLAVFETRLKD 729
             ++  AE ++  F   +K+
Sbjct: 636 --WKNIAEGNIQSFHAEMKE 653


>gi|384267593|ref|YP_005423300.1| hypothetical protein BANAU_3964 [Bacillus amyloliquefaciens subsp.
           plantarum YAU B9601-Y2]
 gi|380500946|emb|CCG51984.1| putative protein yyaL [Bacillus amyloliquefaciens subsp. plantarum
           YAU B9601-Y2]
          Length = 689

 Score =  457 bits (1176), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 268/685 (39%), Positives = 380/685 (55%), Gaps = 58/685 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           N   N L  E SPYLLQHAHNPV+W  WGEEAF +A++ + P+ +SIGYSTCHWCHVM  
Sbjct: 4   NSKPNSLITEKSPYLLQHAHNPVNWHPWGEEAFEKAKRENKPVLVSIGYSTCHWCHVMAH 63

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE +A +LND F++IKVDREERPDVD VYM   Q + G GGWPL+VF++PD KP  
Sbjct: 64  ESFEDEEIAGMLNDKFIAIKVDREERPDVDSVYMRICQLMTGQGGWPLNVFVTPDQKPFY 123

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFP   KY RPGF  +L  + + +   R          +E ++E  +A       P 
Sbjct: 124 AGTYFPKTSKYNRPGFIDVLEHLSETFANDRQ--------HVEDIAENAAAHLEVKIHPA 175

Query: 279 E--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
           E  L + A+     QL+  +D+ +GGFG APKFP P    M+++  +    TGK  +A  
Sbjct: 176 EGMLGEQAVHDTYRQLAGGFDTVYGGFGQAPKFPMP---HMLMFLLRYYSYTGKE-QALA 231

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
           G   V  TL  MA GGI DH+G GF RYS D  W VPHFEKMLYD   L   Y +A+ +T
Sbjct: 232 G---VTKTLDGMANGGIFDHIGFGFARYSTDNEWLVPHFEKMLYDNALLLPAYTEAYQVT 288

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
            +  Y  I   I+ +++R+M+   G  FSA DAD   TEG    +EG +Y+W+ KE+ ++
Sbjct: 289 GNERYKQIAMQIVTFIQREMMHEDGSFFSALDAD---TEG----REGKYYIWSKKEIMNL 341

Query: 457 LG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
           LG E   L+ + Y +   GN +   +  PH  F  +  ++E  ++  + ++L   LE   
Sbjct: 342 LGDELGPLYCKVYNITDQGNFEGENI--PHLIFTRREAILE--ETGLTGNELAERLE--- 394

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
               E R KL + R  R  PH DDKV+ SWN L+I+  A+A+K+         F+ P   
Sbjct: 395 ----EARTKLLEARENRSYPHTDDKVLTSWNALMIAGLAKAAKV---------FHEP--- 438

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
               +++ +AE+A  F+ RHL  +   R+   +R G  K  GF+DDYAFLI   L+LYE 
Sbjct: 439 ----DFLSMAETAIRFLERHLMPDG--RVMVRYREGEVKNKGFIDDYAFLIWAYLELYEA 492

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
           G    +L  A  L  +  ELF D   GG+F T  +  ++L+R KE +DGA PSGNS + +
Sbjct: 493 GFHPSYLQKAKTLCTSMLELFWDERHGGFFFTGNDAETLLVREKEVYDGAVPSGNSAAAV 552

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 755
            L+RL  +          + AE   +VF+  ++    +      +    ++P +K +V+ 
Sbjct: 553 QLLRLGRLTGDIS---LIEKAEAMFSVFKREIEAYPSSNAFFMQSVLAHTMP-QKEIVVF 608

Query: 756 GHKSSVDFENMLAAAHASYDLNKTV 780
           G K   D +  + A    +    T+
Sbjct: 609 GSKDDPDRKRFIEALQEHFTPAYTI 633


>gi|187778206|ref|ZP_02994679.1| hypothetical protein CLOSPO_01798 [Clostridium sporogenes ATCC
           15579]
 gi|187775134|gb|EDU38936.1| hypothetical protein CLOSPO_01798 [Clostridium sporogenes ATCC
           15579]
          Length = 683

 Score =  457 bits (1176), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 259/657 (39%), Positives = 363/657 (55%), Gaps = 66/657 (10%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRL  E SPYLLQHAHNPVDW+ WGEEAF +A+    P+FLSIGYSTCHWCHVME ESF
Sbjct: 9   TNRLIKEKSPYLLQHAHNPVDWYPWGEEAFEKAKIEVKPVFLSIGYSTCHWCHVMERESF 68

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           EDE VA++LN+ F+SIKVDREERPD+D +YM + QA  G GGWPL++ ++PD KP   GT
Sbjct: 69  EDEDVAEILNENFISIKVDREERPDIDSIYMNFCQAYTGSGGWPLTILMTPDKKPFFAGT 128

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP   K+  PG   IL+ +   W + ++ + +S    +EQ+          N   DEL 
Sbjct: 129 YFPKWGKHNIPGIMDILKSINKLWREDKNKVLESSNRILEQIER-----FQDNHGEDELE 183

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQK 339
           +  +   A+ L  ++DS++GGFG+ PKFP    I  +L  Y+ KK           +   
Sbjct: 184 EYIIEEAAQTLLDNFDSKYGGFGTKPKFPTAHYILFLLRYYYFKK---------DKKVLD 234

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           ++  TL  M KGGI DH+G GF RYS D +W VPHFEKMLYD   L+  Y +A+  TK+ 
Sbjct: 235 VINKTLTSMYKGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALLSMAYTEAYEATKNP 294

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 459
            Y  +   IL+Y+++ M    G  +SAEDADS   EG     EG FY+WT KE+ DILGE
Sbjct: 295 LYKVVTEKILNYVKKSMTSEEGGFYSAEDADS---EGV----EGKFYLWTKKEIMDILGE 347

Query: 460 HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGMPLEKYLNI 517
               F           C L  ++   N F+ KN+  LI+ +      +K         + 
Sbjct: 348 EDGAFY----------CKLYDITSRGN-FEKKNIANLIQTDLKDVDNNK---------DK 387

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
           L   R KLF+ R KR  PH DDK++ SWN L+I +F RA +  K++              
Sbjct: 388 LERIREKLFEYREKRIHPHKDDKILTSWNALMIIAFCRAGRSFKND-------------- 433

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGS 637
              Y+++A+ +A FI ++L DE+   L    R       GF+DDYAF +  L++LYE   
Sbjct: 434 --NYIDIAKQSADFIIKNLMDEKG-TLYARIREEERGNEGFIDDYAFFLWALIELYEASF 490

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
              +L  +IE+ ++  +LF  +E GG++  +     +++R KE +DGA PSGN+V+ + L
Sbjct: 491 DIYYLEKSIEVADSMIDLFWHKEKGGFYLYSKNSEKLIVRPKEIYDGAMPSGNAVASLAL 550

Query: 698 VRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVL 754
             L  I      D Y+   +     F   +K   M   L    A M ++   + + L
Sbjct: 551 SLLYYITG---EDKYKNLVDKQFKFFAANIKSGPM-YHLFSVIAYMYNISPVQEITL 603


>gi|443631576|ref|ZP_21115757.1| hypothetical protein BSI_08280 [Bacillus subtilis subsp.
           inaquosorum KCTC 13429]
 gi|443349381|gb|ELS63437.1| hypothetical protein BSI_08280 [Bacillus subtilis subsp.
           inaquosorum KCTC 13429]
          Length = 689

 Score =  457 bits (1176), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 262/687 (38%), Positives = 381/687 (55%), Gaps = 69/687 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYLLQHAHNPVDW+ WGEEAF +A++ + P+ +SIGYSTCHWCHVM  ESFE
Sbjct: 8   NRLINEKSPYLLQHAHNPVDWYPWGEEAFEKAKRENKPVLVSIGYSTCHWCHVMAHESFE 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  +A+LLN+ FV+IKVDREERPDVD VYM   Q + G GGWPL+VF++PD KP   GTY
Sbjct: 68  DAEIARLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLNVFITPDQKPFYAGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP   K+ RPGF  +L  + + +   R+ +      A + L    +A +        L +
Sbjct: 128 FPKTSKFNRPGFVDVLEHLSETFANDREHVEDIAENAAKHLQTKTAAKSGEG-----LSE 182

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
           +A      QL+  +D+ +GGFG APKFP P    M++Y  +   +TG+        K   
Sbjct: 183 SATHRTFLQLANGFDTIYGGFGQAPKFPMP---HMLMYLLRYHHNTGQENALYNVTK--- 236

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  MA GGI+DH+G GF RYS D+ W VPHFEKMLYD   L   Y +A+ +T++  Y 
Sbjct: 237 -TLDSMANGGIYDHIGYGFARYSTDDEWLVPHFEKMLYDNALLLTAYTEAYQVTQNSRYK 295

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-A 461
            IC  I+ +++R+M    G  FSA DAD   TEG    +EG +YVW+  E+   LG+   
Sbjct: 296 EICEQIITFIQREMTHEDGSFFSALDAD---TEG----EEGKYYVWSKDEILKTLGDDLG 348

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIE------LNDSSASASKLGMPLEK 513
            L+ + Y +   GN            F+GKN+  LI       + D+S +  +L + LE 
Sbjct: 349 TLYCQVYDITEKGN------------FEGKNIPNLIHTKREQLIADASLTKEELNLKLE- 395

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
                 + R++L  +R +R  PH+DDKV+ SWN L+I+  A+A+K+ +            
Sbjct: 396 ------DARQQLLKIREERTYPHVDDKVLTSWNALMIAGLAKAAKVYQ------------ 437

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
                 +Y+ +A+ A +FI   L  +   R+   +R+G  K  GF+DDYAFL+   LDLY
Sbjct: 438 ----EPKYLSLAKDAITFIENKLIIDG--RVMVRYRDGEVKNKGFIDDYAFLLWAYLDLY 491

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           E      +L  A +L +    LF D E GG++ T  +  ++++R KE +DGA PSGNSV+
Sbjct: 492 EASFDLSYLRKAKKLTDDMIGLFWDEEHGGFYFTGHDAEALIVREKEVYDGAMPSGNSVA 551

Query: 694 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVV 753
            + L+RL   V G  S    + AE   +VF+  +            +     +P +K +V
Sbjct: 552 AVQLLRLGQ-VTGDLS--LIEKAESMFSVFKPDIDAYPSGHAFFMQSVLKHLMP-KKEIV 607

Query: 754 LVGHKSSVDFENMLAAAHASYDLNKTV 780
           + G+      + ++ A   ++  N ++
Sbjct: 608 IFGNADDPARKQIITALQKAFKPNDSI 634


>gi|188585586|ref|YP_001917131.1| hypothetical protein Nther_0959 [Natranaerobius thermophilus
           JW/NM-WN-LF]
 gi|179350273|gb|ACB84543.1| protein of unknown function DUF255 [Natranaerobius thermophilus
           JW/NM-WN-LF]
          Length = 686

 Score =  457 bits (1176), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 254/609 (41%), Positives = 345/609 (56%), Gaps = 64/609 (10%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRLA E SPYLLQHAHNPVDWF W EEAF +A+K D PIFLSIGYSTCHWCHVME ESF
Sbjct: 10  VNRLANEKSPYLLQHAHNPVDWFPWSEEAFEKAKKEDKPIFLSIGYSTCHWCHVMEQESF 69

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           ED  +A +LN  F+SIKVDREERPD+D +YM+  QAL G GGWPL+VFL+ D  P   GT
Sbjct: 70  EDHEIAGILNKNFISIKVDREERPDIDAIYMSACQALTGRGGWPLTVFLNHDKNPFYAGT 129

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP E++ G PG K IL KV   W   R  L   G    + +       A     P  + 
Sbjct: 130 YFPKENRLGMPGLKDILEKVSSKWQNDRYELINIGNEITQAVEHHFFTHA-----PGNVT 184

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQK 339
           + +L +   QL +++D  +GGFGSAPKFP P  +  +L  YH      TG          
Sbjct: 185 EESLHIAFSQLEENFDEEYGGFGSAPKFPSPHNLYFLLRYYHL-----TGNES----ALH 235

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           MV  TL  M +GGI+DH+G GF RYS D++W VPHFEKMLYD   LA  YL+ + +T++ 
Sbjct: 236 MVKKTLTSMYRGGIYDHIGYGFCRYSTDKKWLVPHFEKMLYDNALLAIAYLEVYEITRNN 295

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 459
           F+  I ++I  Y+ R++  P G  +SAEDADS   EG    +EG FYV+T +EV ++LGE
Sbjct: 296 FFKEIAQEIFTYVSRELTSPEGGFYSAEDADS---EG----EEGKFYVFTPQEVIEVLGE 348

Query: 460 -HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
                F + Y +   GN            F+  N +  L   +    +    L       
Sbjct: 349 VRGQEFCKQYNITANGN------------FEHGNSIPNLIGKNPEKDEFQKDL------- 389

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
               +KLF+ R +R  P  DDK++ SWNGL+I++ A+ S++L  E               
Sbjct: 390 ----KKLFEYREQREHPFKDDKILTSWNGLMIAALAKGSRVLNDE--------------- 430

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
             Y+ +A+S+  FI ++L      RL   +R+G +  PGFLDDYA+L+ GL++LY     
Sbjct: 431 -RYLNMAQSSYRFIEKNLIT-NNQRLLTRYRDGEASIPGFLDDYAYLVWGLIELYNASFE 488

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
             +L  A+   +   +LF D++ GG +    +  +++ R KE  D A PSGNSV+  NL+
Sbjct: 489 PYYLEKALIFNDEMIKLFWDQDQGGLYLYGHDSETLVSRPKEIDDSALPSGNSVATRNLL 548

Query: 699 RLASIVAGS 707
            L  +   +
Sbjct: 549 ELFHLTGKT 557


>gi|429507366|ref|YP_007188550.1| hypothetical protein B938_19420 [Bacillus amyloliquefaciens subsp.
           plantarum AS43.3]
 gi|429488956|gb|AFZ92880.1| hypothetical protein B938_19420 [Bacillus amyloliquefaciens subsp.
           plantarum AS43.3]
          Length = 689

 Score =  456 bits (1174), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 266/685 (38%), Positives = 379/685 (55%), Gaps = 58/685 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           N   N L  E SPYLLQHAHNPV+W  WG+EAF +A++ + P+ +SIGYSTCHWCHVM  
Sbjct: 4   NSKPNSLITEKSPYLLQHAHNPVNWHPWGKEAFEKAKRENKPVLVSIGYSTCHWCHVMAH 63

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE +A +LND F++IKVDREERPDVD VYM   Q + G GGWPL+VF++PD KP  
Sbjct: 64  ESFEDEEIAGILNDKFIAIKVDREERPDVDSVYMRICQLMTGQGGWPLNVFVTPDQKPFY 123

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFP   K+ RPGF  +L  + + +   R          +E ++E  +A       P 
Sbjct: 124 AGTYFPKTSKFNRPGFIDVLEHLSETFANDRQ--------HVEDIAENAAAHLEVKVHPT 175

Query: 279 E--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
           E  L + A+     QL+  +D+ +GGFG APKFP P    M+++  +    TGK  +A  
Sbjct: 176 EGMLGEQAVHDTYRQLAGGFDTVYGGFGQAPKFPMP---HMLMFLLRYYSYTGKE-QALA 231

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
           G   V  TL  MA GGI DH+G GF RYS D  W VPHFEKMLYD   L   Y +A+ +T
Sbjct: 232 G---VTKTLDGMANGGIFDHIGFGFARYSTDNEWLVPHFEKMLYDNALLLTAYTEAYQVT 288

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
            +  Y  I   I+ +++R+M+   G  FSA DAD   TEG    +EG +Y+W+ KE+ ++
Sbjct: 289 GNERYKQIAMQIVTFIQREMMHEDGSFFSALDAD---TEG----REGKYYIWSKKEIMNL 341

Query: 457 LG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
           LG E   L+ + Y +   GN +   +  PH  F  +  ++E  ++  +  +L   LE   
Sbjct: 342 LGDELGPLYCKVYNITDQGNFEGENI--PHLIFTRREAILE--ETGLTGHELAERLE--- 394

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
               E R KL + R  R  PH DDKV+ SWN L+I+  A+A+K+         F+ P   
Sbjct: 395 ----EARTKLLEARENRSYPHTDDKVLTSWNALMIAGLAKAAKV---------FHEP--- 438

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
               +++ +AE+A  F+ RHL  +   R+   +R G  K  GF+DDYAFLI   L+LYE 
Sbjct: 439 ----DFLSMAETAIRFLERHLMPDG--RVMVRYREGEVKNKGFIDDYAFLIWAYLELYEA 492

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
           G    +L  A  L  +  ELF D   GG+F T  +  ++L+R KE +DGA PSGNS + +
Sbjct: 493 GFNPSYLQKAKTLCTSMLELFWDERHGGFFFTGNDAETLLVREKEVYDGAVPSGNSATAV 552

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 755
            L+RL  +          + AE   +VF+  ++    +      +    ++P +K +V+ 
Sbjct: 553 QLLRLGRLTGDIS---LIEKAEAMFSVFKREIEAYPSSNAFFMQSVLAHTMP-QKEIVVF 608

Query: 756 GHKSSVDFENMLAAAHASYDLNKTV 780
           G K   D +  + A    +    T+
Sbjct: 609 GRKDDPDRKRFIEALQEHFTPAYTI 633


>gi|310641971|ref|YP_003946729.1| cellulase catalitic domain protein and a thioredoxin domain protein
           [Paenibacillus polymyxa SC2]
 gi|386040955|ref|YP_005959909.1| hypothetical protein PPM_2265 [Paenibacillus polymyxa M1]
 gi|309246921|gb|ADO56488.1| cellulase catalitic domain protein and a thioredoxin domain protein
           [Paenibacillus polymyxa SC2]
 gi|343096993|emb|CCC85202.1| hypothetical protein PPM_2265 [Paenibacillus polymyxa M1]
          Length = 691

 Score =  456 bits (1174), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 261/620 (42%), Positives = 355/620 (57%), Gaps = 53/620 (8%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           S +   NRLA E SPYLLQHA+NPV+WF W +EAF  A++ + PIFLSIGYSTCHWCHVM
Sbjct: 2   STSSKPNRLAKEKSPYLLQHAYNPVNWFPWSDEAFEIAKRDNKPIFLSIGYSTCHWCHVM 61

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
           E ESFED+ VA++LN  +VSIKVDREERPDVD +YM+  + + G GGWPL++ ++PD KP
Sbjct: 62  ERESFEDQEVAEVLNQDYVSIKVDREERPDVDHIYMSICETMTGHGGWPLTIMMTPDQKP 121

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQ-SGAFAIEQLSEALSASASSNK 275
              GTY P E K+GR G   +L KV   W ++ D L + S     E   + L A      
Sbjct: 122 FFAGTYLPKEQKFGRVGLLELLGKVGIRWKEQPDELMELSEQVLTEHERQDLLAGYRG-- 179

Query: 276 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
              EL    L     + S ++D  +GGFG APKFP P  +  +L +++    TG      
Sbjct: 180 ---ELDDQCLNKAFHEYSHTFDHEYGGFGEAPKFPSPHNLSFLLRYAQH---TGN----Q 229

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
           +  +MV  TL  M++GGI+DHVG GF RYSVDE+W VPHFEKMLYD   LA  Y +A+ +
Sbjct: 230 QALEMVEKTLDAMSRGGIYDHVGMGFSRYSVDEKWLVPHFEKMLYDNALLAITYTEAWQV 289

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           T    Y  I   I  Y+ RDM   GG  +SAEDADS   EG    +EG FYVW+  E++ 
Sbjct: 290 TGKRLYRQITEQIFTYIARDMTDAGGAFYSAEDADS---EG----EEGRFYVWSDSEIKA 342

Query: 456 ILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGMPLE 512
           +LG E A  F + Y + P GN            F+G N+  LI++N   A  +K  +   
Sbjct: 343 VLGDEDASFFNDLYGITPYGN------------FEGHNIPNLIDIN-LEAYGNKHDLTEP 389

Query: 513 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 572
           +    + E + KLF  R +R  P  DDK++ SWNGL+I++ A+A +              
Sbjct: 390 ELEQRVSELKDKLFTAREQRVHPQKDDKILTSWNGLMIAALAKAGQ-------------- 435

Query: 573 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 632
             G  R  Y E A  A +F+  HL  E   RL   +R+G +   G++DDYAF + GL++L
Sbjct: 436 AFGDTR--YTEQARKAETFLWNHLRREDG-RLLARYRDGQAAYLGYVDDYAFYVWGLIEL 492

Query: 633 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 692
           Y+     ++L  A+ L     +LF D E  G F T  +   ++ R KE +DGA PSGNS+
Sbjct: 493 YQATFDVQYLQRALTLNQNMIDLFWDEERDGLFFTGSDSEQLISRPKEIYDGAIPSGNSI 552

Query: 693 SVINLVRLASIVAGSKSDYY 712
           +  N VRLA +   ++ + Y
Sbjct: 553 AAHNFVRLARLTGETRLEDY 572


>gi|385266996|ref|ZP_10045083.1| hypothetical protein MY7_3797 [Bacillus sp. 5B6]
 gi|385151492|gb|EIF15429.1| hypothetical protein MY7_3797 [Bacillus sp. 5B6]
          Length = 689

 Score =  456 bits (1173), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 267/685 (38%), Positives = 378/685 (55%), Gaps = 58/685 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           N   N L  E SPYLLQHAHNPV+W  WGEEAF +A++ + P+ +SIGYSTCHWCHVM  
Sbjct: 4   NGIANSLITEKSPYLLQHAHNPVNWHPWGEEAFEKAKRENKPVLVSIGYSTCHWCHVMAH 63

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE +A +LND F++IKVDREERPDVD VYM   Q + G GGWPL+VF++PD KP  
Sbjct: 64  ESFEDEEIAGMLNDKFIAIKVDREERPDVDSVYMRICQLMTGQGGWPLNVFVTPDQKPFY 123

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFP   K+ RPGF  +L  + + +   R          +E ++E  +A       P 
Sbjct: 124 AGTYFPKTSKFNRPGFIDVLEHLSETFANDRQ--------HVEDIAENAAAHLEVKVHPA 175

Query: 279 E--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
           E  L + A+     QL+  +D+ +GGFG APKFP P    M+++  +    TGK  +A  
Sbjct: 176 EGMLGEQAVHDTYRQLAGGFDTVYGGFGQAPKFPMP---HMLMFLLRYYSYTGKE-QALA 231

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
           G   V  TL  MA GGI DH+G GF RYS D  W VPHFEKMLYD   L   Y +A+ +T
Sbjct: 232 G---VTKTLDGMANGGIFDHIGFGFARYSTDNEWLVPHFEKMLYDNALLLTAYTEAYQVT 288

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
            +  Y  I   I+ +++R+M+   G  FSA DAD   TEG    +EG +Y+W+ KE+ ++
Sbjct: 289 GNERYKQIAMQIVTFIQREMMHEDGSFFSALDAD---TEG----REGKYYIWSKKEIMNL 341

Query: 457 LG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
           LG E   L+ + Y +   GN +   +  PH  F  +  ++E   +  +  +L   LE   
Sbjct: 342 LGDELGPLYCKVYNITDQGNFEGENI--PHLIFTRREAILE--GTGLTGHELAERLE--- 394

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
               E R KL + R  R  PH DDKV+ SWN L+I+  A+A+K+         F+ P   
Sbjct: 395 ----EARTKLLEARENRSYPHTDDKVLTSWNALMIAGLAKAAKV---------FHEP--- 438

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
               +++ +AE+A  F+ RHL  +   R+   +R G  K  GF+DDYAFLI   L+LYE 
Sbjct: 439 ----DFLSMAETAIRFLERHLMPDG--RVMVRYREGEVKNKGFIDDYAFLIWAYLELYEA 492

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
           G    +L  A  L  +  ELF D   GG+F T  +  ++L+R KE +DGA PSGNS + +
Sbjct: 493 GFNPSYLQKAKTLCTSMLELFWDERHGGFFFTGNDAETLLVREKEVYDGAVPSGNSAAAV 552

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 755
            L+RL  +          + AE   +VF+  ++    +      +    ++P +K +V+ 
Sbjct: 553 QLLRLGRLTGDIS---LIEKAEAMFSVFKREIEAYPSSNAFFMQSVLAHTMP-QKEIVVF 608

Query: 756 GHKSSVDFENMLAAAHASYDLNKTV 780
           G K   D +  + A    +    T+
Sbjct: 609 GRKDDPDRKRFIEALQEHFTPAYTI 633


>gi|220927673|ref|YP_002504582.1| hypothetical protein Ccel_0215 [Clostridium cellulolyticum H10]
 gi|219998001|gb|ACL74602.1| protein of unknown function DUF255 [Clostridium cellulolyticum H10]
          Length = 673

 Score =  456 bits (1172), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 275/670 (41%), Positives = 373/670 (55%), Gaps = 79/670 (11%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           + N+  N+L  E SPYLLQHAHNPVDW+ WG EAF+ A   D PIFLSIGYSTCHWCHVM
Sbjct: 3   TNNRMPNKLINEKSPYLLQHAHNPVDWYPWGPEAFSRAVSEDKPIFLSIGYSTCHWCHVM 62

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
           E ESFEDE VA +LN  F+ IKVDREERPD+D +YM+  QAL G GGWPL+VFL+PD +P
Sbjct: 63  ERESFEDEEVAHILNRDFICIKVDREERPDIDSIYMSVCQALTGHGGWPLTVFLTPDKQP 122

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKL 276
              GTYFP ED  G  G  ++L  VK+AWD KR+ L  S    I  +S+   +  S    
Sbjct: 123 FYAGTYFPKEDSKGLMGLISLLGSVKEAWDNKREHLLVSAENIINHVSKESISKDSKISS 182

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEA 334
             ++ Q A          ++DS++GGFG++PKFP P  +  +L  +++KK          
Sbjct: 183 --DIIQEAF----AHFKYNFDSKYGGFGTSPKFPSPHTLLFLLRYWYTKK---------E 227

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
               +MV  TL+ M  GGI DH+G GF RYS D++W VPHFEKMLYD   LA  Y +A+S
Sbjct: 228 PYALEMVEKTLESMKNGGIFDHIGFGFSRYSTDKKWLVPHFEKMLYDNALLAIAYGEAYS 287

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
            T +  Y    R ILDY++RDM    G  +SAEDADS   EG     EG FY+W+ +EV 
Sbjct: 288 ATGNKNYEETARQILDYVQRDMSSQLGAFYSAEDADS---EGV----EGKFYIWSKEEVI 340

Query: 455 DILG-----EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKL 507
           ++LG     E+  +F     + P+GN            F+G N+  LIE           
Sbjct: 341 NVLGSKDGEEYCRIFD----ISPSGN------------FEGLNIPNLIE----------T 374

Query: 508 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 567
           G   E+  +   +CR+KLF  R KR  P+ DDK++ +WNGL+ ++ A   ++L       
Sbjct: 375 GTLPEQQKSFAEDCRKKLFTHREKRIHPYKDDKILTAWNGLMTAAMAYCGRVL------- 427

Query: 568 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 627
                  G D+  Y+E A+    FI + L      RL   +R G +  P +L+DYAFL+ 
Sbjct: 428 -------GEDK--YIESAKRCIDFISKKLV-RTDGRLLARYREGEAVFPAYLEDYAFLVW 477

Query: 628 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 687
           GLL+LYE    T +L  A++L +    LF +    G F    +   ++ R +E +DGA P
Sbjct: 478 GLLELYEATFTTLYLKRALKLTDAMLNLFGENNSTGLFLYGHDSEQLIARPRESYDGAIP 537

Query: 688 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 747
           SGNSV+ +NL+RLA I    +   Y   A+  +  F T++         M C+  M SV 
Sbjct: 538 SGNSVAAMNLLRLARITGRHE---YENRAKAIMDFFGTQINAAPTGHSYMLCSY-MYSVS 593

Query: 748 S-RKHVVLVG 756
                VV+ G
Sbjct: 594 DISSEVVIAG 603


>gi|383762697|ref|YP_005441679.1| hypothetical protein CLDAP_17420 [Caldilinea aerophila DSM 14535 =
           NBRC 104270]
 gi|381382965|dbj|BAL99781.1| hypothetical protein CLDAP_17420 [Caldilinea aerophila DSM 14535 =
           NBRC 104270]
          Length = 689

 Score =  456 bits (1172), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 268/673 (39%), Positives = 373/673 (55%), Gaps = 53/673 (7%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           S  +HTNRL  E SPYLLQHAHNPVDW+ WGEEA   AR  D PIFLSIGYS CHWCHVM
Sbjct: 2   STRQHTNRLIHETSPYLLQHAHNPVDWYPWGEEALQRARAEDKPIFLSIGYSACHWCHVM 61

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
           E ESFEDE  A L+N+ FV+IKVDREERPD+D +YM  VQA+ G GGWP+SV+L+PD KP
Sbjct: 62  ERESFEDEETAALMNELFVNIKVDREERPDLDAIYMDAVQAMTGQGGWPMSVWLTPDGKP 121

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKL 276
             GGTYFP E +YG P F+ +LR V +A+ ++R+M+        E+L+  L  +AS    
Sbjct: 122 FYGGTYFPKEPRYGMPSFQQVLRAVAEAYRERREMVEGQA----ERLASMLQRTASLRAE 177

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
             EL +  L     Q+ + +D   GGFGS PKFP+P+ +   L    +   TG      +
Sbjct: 178 GGELGEEILEEALGQMRQYFDEEEGGFGSQPKFPQPMTLDFALTQYLR---TGN----LD 230

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
              M   TL+ MA GGI+D +GGGFHRYSVD  W VPHFEKMLYD  QL   YL A+ +T
Sbjct: 231 ALYMAELTLEKMAHGGIYDQLGGGFHRYSVDAIWLVPHFEKMLYDNAQLLRTYLHAWQVT 290

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
           +   +  +  + +DY+ R+M  P G  +SA+DADS   EG     EG F++W+ +EVE +
Sbjct: 291 QRPLFRRVVEETIDYVLREMTAPDGGFYSAQDADS---EG----HEGKFFLWSQQEVESL 343

Query: 457 LGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
           L  H A +F ++Y +   GN            F+GKN+L  +      A +  +   +  
Sbjct: 344 LDPHTAAIFCDYYGVSAHGN------------FEGKNILSVVRSIEQVAQRFRIGEAEVE 391

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
           + L   R  LF  R KR +P  D+K++  WNGL+I + A    +L               
Sbjct: 392 DALRRARAILFAHREKRIKPARDEKILTEWNGLMIHALAECGVVL--------------- 436

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
            +R++ +  A  AA FI   +  +   RL  S+++G ++   +L+DYA LI GL+ LYE 
Sbjct: 437 -ERQDALAAAVRAAEFILAQM-SQPDGRLYRSYKDGRARFNAYLEDYASLIRGLIALYEA 494

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
               +WL  A  L     E F D   GG+F T  +   ++ R K+  D A PSGNS++  
Sbjct: 495 TFDLRWLGEATRLAQIMFEQFHD-PAGGFFQTGVDHEQLVARRKDFVDNAVPSGNSLAAE 553

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 755
            L+RL+  +   +   YR  A   L + +  +         + C  D    PS++ + +V
Sbjct: 554 ALLRLSVFLDKPE---YRTEAGRILLMMKDAMARQPTGFGRLLCVLDAYLSPSQE-IAIV 609

Query: 756 GHKSSVDFENMLA 768
           G +       +LA
Sbjct: 610 GRRDDPATAALLA 622


>gi|165970642|gb|AAI58572.1| Spata20 protein [Rattus norvegicus]
          Length = 550

 Score =  456 bits (1172), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 229/515 (44%), Positives = 320/515 (62%), Gaps = 43/515 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
            K  NRL  E SPYLLQHAHNPVDW+ WG+EAF +A+K + PIFLS+GYSTCHWCH+ME 
Sbjct: 62  QKTANRLINEKSPYLLQHAHNPVDWYPWGQEAFDKAKKENKPIFLSVGYSTCHWCHMMEE 121

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESF++E +  LLN+ FVS+ VDREERPDVDKVYMT+VQA   GGGWP++V+L+P L+P +
Sbjct: 122 ESFQNEEIGHLLNENFVSVMVDREERPDVDKVYMTFVQATSSGGGWPMNVWLTPSLQPFV 181

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
           GGTYFPPED   R GF+T+L ++ D W + ++ L ++     ++++ AL A +  +    
Sbjct: 182 GGTYFPPEDGLTRVGFRTVLMRICDQWKQNKNTLLENS----QRVTTALLARSEISVGDR 237

Query: 279 ELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGE 333
           +LP +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S ++   G    
Sbjct: 238 QLPPSAATMNSRCFQQLDEGYDEEYGGFAEAPKFPTPVILNFLFSYWLSHRVTQDG---- 293

Query: 334 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
            S  Q+M L TL+ MA GGI DHVG GFHRYS D +WH+PHFEKMLYDQ QL+ VY  AF
Sbjct: 294 -SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHIPHFEKMLYDQAQLSVVYCQAF 352

Query: 394 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
            ++ D F+S + + IL Y+ R++    G  +SAEDADS    G  + +EGA Y+WT KEV
Sbjct: 353 QISGDEFFSDVAKGILQYVTRNLSHRSGGFYSAEDADSPPERG-VKPQEGALYLWTVKEV 411

Query: 454 EDILGE----------HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 503
           + +L E             L  +HY L   GN + ++  D + E  G+NVL        +
Sbjct: 412 QQLLPEPVGGASEPLTSGQLLMKHYGLSEAGNINPTQ--DVNGEMHGQNVLTVRYSLELT 469

Query: 504 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 563
           A++ G+ +E    +L     KLF  R  RP+ HLD+K++ +WNGL++S FA A  +L  E
Sbjct: 470 AARYGLEVEAVRALLNTGLEKLFQARKHRPKAHLDNKMLAAWNGLMVSGFAVAGSVLGME 529

Query: 564 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD 598
                           + +  A + A F++RH++D
Sbjct: 530 ----------------KLVTQATNGAKFLKRHMFD 548


>gi|154688185|ref|YP_001423346.1| hypothetical protein RBAM_037900 [Bacillus amyloliquefaciens FZB42]
 gi|154354036|gb|ABS76115.1| YyaL [Bacillus amyloliquefaciens FZB42]
          Length = 689

 Score =  455 bits (1171), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 266/685 (38%), Positives = 378/685 (55%), Gaps = 58/685 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           N   N L  E SPYLLQHAHNPV+W  WG+EAF +A++ + P+ +SIGYSTCHWCHVM  
Sbjct: 4   NSKPNSLITEKSPYLLQHAHNPVNWHPWGKEAFEKAKRENKPVLVSIGYSTCHWCHVMAH 63

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE +A +LND F++IKVDREERPDVD VYM   Q + G GGWPL+VF++PD KP  
Sbjct: 64  ESFEDEEIAGMLNDKFIAIKVDREERPDVDSVYMRICQLMTGQGGWPLNVFVTPDQKPFY 123

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFP   K+ RPGF  +L  + + +   R          +E ++E  +A       P 
Sbjct: 124 AGTYFPKTSKFNRPGFIDVLEHLSETFANDRQ--------HVEDIAENAAAHLEVKVHPT 175

Query: 279 E--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
           E  L + A+     QL+  +D+ +GGFG APKFP P    M+++  +    TGK  +A  
Sbjct: 176 EGMLGEQAVHDTYRQLAGGFDTVYGGFGQAPKFPMP---HMLMFLLRYYSYTGKE-QALA 231

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
           G   V  TL  MA GGI DH+G GF RYS D  W VPHFEKMLYD   L   Y +A+ +T
Sbjct: 232 G---VTKTLDGMANGGIFDHIGYGFARYSTDNEWLVPHFEKMLYDNALLLTAYTEAYQVT 288

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
            +  Y  I   I+ +++R+M    G  FSA DAD   TEG    +EG +Y+W+ KE+ ++
Sbjct: 289 GNERYKQIAMQIVMFIQREMTHEDGSFFSALDAD---TEG----REGKYYIWSKKEIMNL 341

Query: 457 LG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
           LG E   L+ + Y +   GN +   +  PH  F  +  ++E  ++  +  +L   LE   
Sbjct: 342 LGDELGPLYCKVYNITDQGNFEGENI--PHLIFTRREAILE--ETGLTGHELAERLE--- 394

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
               E R KL + R  R  PH DDKV+ SWN L+I+  A+A+K+         F+ P   
Sbjct: 395 ----EARTKLLEARENRSYPHTDDKVLTSWNALMITGLAKAAKV---------FHEP--- 438

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
               +++ +AE+A  F+ RHL  +   R+   +R G  K  GF+DDYAFLI   L+LYE 
Sbjct: 439 ----DFLSMAETAIRFLERHLMPDG--RVMVRYREGEVKNKGFIDDYAFLIWAYLELYEA 492

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
           G    +L  A  L  +  ELF D   GG+F T  +  ++L+R KE +DGA PSGNS + +
Sbjct: 493 GFNPSYLQKAKTLCTSMLELFWDERHGGFFFTGNDAETLLVREKEVYDGAVPSGNSAAAV 552

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 755
            L+RL  +          + AE   +VF+  ++    +      +    ++P +K +V+ 
Sbjct: 553 QLLRLGRLTGDIS---LIEKAEAMFSVFKREIEAYPSSNAFFMQSVLAHTMP-QKEIVVF 608

Query: 756 GHKSSVDFENMLAAAHASYDLNKTV 780
           G K   D +  + A    +    T+
Sbjct: 609 GRKDDPDRKRFIEALQEHFTPAYTI 633


>gi|375308642|ref|ZP_09773925.1| hypothetical protein WG8_2450 [Paenibacillus sp. Aloe-11]
 gi|375079269|gb|EHS57494.1| hypothetical protein WG8_2450 [Paenibacillus sp. Aloe-11]
          Length = 690

 Score =  454 bits (1168), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 257/616 (41%), Positives = 352/616 (57%), Gaps = 57/616 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYLLQHA+NP+DW++W  EAF +A+K + PIFLS+GYS+CHWCHVM+ ESFE
Sbjct: 10  NRLIHEKSPYLLQHAYNPIDWYSWESEAFEKAKKENKPIFLSVGYSSCHWCHVMKRESFE 69

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE +A++LN  +VSIKVDREERPDVD +YM+  Q + G GGWPL++ ++PD KP   GTY
Sbjct: 70  DEEIAEILNRDYVSIKVDREERPDVDHIYMSICQTMTGHGGWPLTILMTPDQKPFFAGTY 129

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLP---DE 279
            P E K+GR G   +L KV   W ++ + L       +E   + L+     + L     E
Sbjct: 130 LPKEQKFGRVGLLELLDKVGTRWKEQPEEL-------VELSEQVLTEHERQDMLAGYRGE 182

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
           L + +L     Q S ++D  +GGFG APKFP P  +  +L +++    TG      +  +
Sbjct: 183 LDEQSLNKAFHQYSHTFDKEYGGFGEAPKFPSPHILSFLLRYAQH---TGN----QQALE 235

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           MV  TL  M +GGI+DHVG GF RYSVDE+W VPHFEKMLYD   LA  Y + + +T   
Sbjct: 236 MVEKTLDAMYRGGIYDHVGMGFSRYSVDEKWLVPHFEKMLYDNALLAIAYTETWQVTGKE 295

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG- 458
            Y  I   I  Y+ R+M   GG  +SAEDADS   EG    +EG FYVW   EV  +LG 
Sbjct: 296 LYRQITEQIFTYIAREMTDAGGAFYSAEDADS---EG----EEGRFYVWDDSEVRAVLGD 348

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGMPLEKYLN 516
           E A  F + Y + P GN            F+G N+  LI++N   A   K  +  ++  +
Sbjct: 349 EDASFFNDLYGITPYGN------------FEGHNIPNLIDIN-LEAYGLKHDLTKQELED 395

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
            + E R KLF  R KR  PH DDK++ SWNGL+I + A+A +                  
Sbjct: 396 RVRELRDKLFAAREKRVHPHKDDKILTSWNGLMIVALAKAGQAFGDVT------------ 443

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
               Y E A+ A SF+  HL      RL   +R+G +  PG+LDDYAF + GL++LY+  
Sbjct: 444 ----YTERAQKAESFLWSHL-RRVDGRLLARYRDGDAAYPGYLDDYAFYVWGLIELYQAT 498

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
              ++L  A+ L     +LF D E  G F    +   ++ + KE +DGA PSGNS++  N
Sbjct: 499 FDVQYLQRALTLNQNMIDLFWDEEHHGLFFYGKDSEQLIAKPKEIYDGAIPSGNSIAAHN 558

Query: 697 LVRLASIVAGSKSDYY 712
           LVRLA +   ++ + Y
Sbjct: 559 LVRLARLTGEARLEDY 574


>gi|408403905|ref|YP_006861888.1| hypothetical protein Ngar_c12930 [Candidatus Nitrososphaera
           gargensis Ga9.2]
 gi|408364501|gb|AFU58231.1| protein of unknown function DUF255 [Candidatus Nitrososphaera
           gargensis Ga9.2]
          Length = 695

 Score =  454 bits (1167), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 259/611 (42%), Positives = 354/611 (57%), Gaps = 52/611 (8%)

Query: 96  HSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHV 155
           H+     N+LA E SPYLLQHA+NPVDW++WGEEA   A+K D PIFLS+GYS CHWCHV
Sbjct: 5   HASRGKPNKLAKETSPYLLQHAYNPVDWYSWGEEALERAKKEDKPIFLSVGYSACHWCHV 64

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           M  ESFED+ +AK++N+ F++IKVDREERPD+D +Y    Q   G GGWPLSVFL+PD K
Sbjct: 65  MAHESFEDDEIAKIMNEHFINIKVDREERPDLDDIYQRVCQLATGTGGWPLSVFLTPDQK 124

Query: 216 PLMGGTYFPPE-DKYGRPGFKTILRKVKDAW-DKKRDMLAQSGAF--AIEQLSEALSASA 271
           P   GTYFP E   Y  PGFKTIL ++  A+  KK+++ A SG F  A+ Q +  ++  A
Sbjct: 125 PFYVGTYFPKEGGHYNMPGFKTILLQLATAYKSKKQEIEAASGEFMDALAQTARDVALGA 184

Query: 272 SSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKS 331
           +       L ++ L   A  L +  D  +GGFG APKFP    +  +L   +  + +G S
Sbjct: 185 AGKA---SLERSILDEAAVGLLQMGDPIYGGFGQAPKFPNASNLMFLL---RYYDISGMS 238

Query: 332 GEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLD 391
                 +  V FT   MA GGIHD +GGGF RY+ D++W VPHFEKMLYD   LA +Y +
Sbjct: 239 C----FKDFVAFTADKMAAGGIHDQLGGGFARYATDQKWLVPHFEKMLYDNALLAQLYSE 294

Query: 392 AFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSK 451
            + +TK   Y  I R  LD++ R+M  P G  +SA+DADS   EG    +EG FYVW+ K
Sbjct: 295 LYQITKAEKYLQITRKTLDFVIREMTHPEGGFYSAQDADS---EG----EEGKFYVWSKK 347

Query: 452 EVEDILGEHAI--LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 509
           E+  ILG+ A   +F EHY +   GN            F+GKN+L      S+   + G 
Sbjct: 348 EIASILGDQAATDIFCEHYGVTEGGN------------FEGKNILNVRVPVSSVGLRYGK 395

Query: 510 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 569
             E+   I+ +   KLF  R KR RP  D+K++ SWNGL+IS FA+   I          
Sbjct: 396 TPEQTAQIIADASAKLFAAREKRVRPARDEKILTSWNGLMISGFAKGYGI---------- 445

Query: 570 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 629
                 +  ++Y++ A+ A  FI   +      RL H+F++G SK   +LDDYAF   GL
Sbjct: 446 ------TGDQKYLQAAKDAVKFIETKIVTGDG-RLLHTFKDGKSKLNAYLDDYAFYTGGL 498

Query: 630 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 689
           LDL+   S  ++L  A++  +     F D +    F T+ +   +++R K  +D A PSG
Sbjct: 499 LDLFAIDSRQEYLDKAVKYTDFMLAHFWDEKEENLFFTSDDHEKLIVRTKSFYDLAIPSG 558

Query: 690 NSVSVINLVRL 700
           NSV+  NL+RL
Sbjct: 559 NSVAASNLLRL 569


>gi|297566141|ref|YP_003685113.1| hypothetical protein [Meiothermus silvanus DSM 9946]
 gi|296850590|gb|ADH63605.1| protein of unknown function DUF255 [Meiothermus silvanus DSM 9946]
          Length = 665

 Score =  453 bits (1166), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 263/630 (41%), Positives = 349/630 (55%), Gaps = 62/630 (9%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRLA E SPYLLQHAHNPVDWF WGEEAFA+A+  D PIFLS+GY+TCHWCHVME ESF
Sbjct: 2   ANRLALETSPYLLQHAHNPVDWFPWGEEAFAKAKAEDKPIFLSVGYATCHWCHVMERESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           ED   A+LLN++FV +KVDREE PDVD VYM  +QAL G GGWP+S+FL+PDLKP  GGT
Sbjct: 62  EDPETAQLLNEFFVPVKVDREELPDVDHVYMMALQALTGSGGWPMSLFLTPDLKPFYGGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFPPED++G P F  +L+ +   W  +R+ +  S     + L + L        LP +L 
Sbjct: 122 YFPPEDRHGLPSFARVLKTIASTWQNRREEVLGSADELTQHLHKLL--VPRGGPLPQDLH 179

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
             AL+    QL++++D+  GGFG APKFP+   +  +L  + K +             M+
Sbjct: 180 AQALK----QLARAHDATHGGFGGAPKFPQAPTLTYLLALAWKGDPLAWG--------ML 227

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             TL  MA+GGI+D VGGGFHRY+VD  W VPHFEKMLYD  QLA VYL    LT    Y
Sbjct: 228 ELTLDKMAEGGIYDQVGGGFHRYAVDGIWRVPHFEKMLYDNAQLAWVYLGMSRLTGKTLY 287

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHA 461
             +  + LDYL R+M  P G  +SA+DADS   EG     EG FYVW+ +EV  +LG  A
Sbjct: 288 RRVTLETLDYLLREMQHPEGGFYSAQDADS---EGV----EGKFYVWSEQEVRAVLGSDA 340

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
               + + +   GN            ++G NVL       A   +LG+    +   L E 
Sbjct: 341 EAALKLFGVSQAGN------------WEGVNVLEARYPEPALRQELGLDEATFARWLEEV 388

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           + KL+  R +R  P  DDK++  WNGL + +FA A +IL  EA                Y
Sbjct: 389 KAKLYQARRQRIPPLTDDKILADWNGLALRAFAAAGRILGKEA----------------Y 432

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
           +E A   A F+   +  +    L+HS+R G  +   +L D A    GLL+ Y+     +W
Sbjct: 433 LEAARKNAEFVTSRMMRDGL--LRHSWRGGKLRPEAYLSDQASYGLGLLETYQATGEMRW 490

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           L  A  L       F D   GG+F+ +G    + LR K+  DG  P GNS +   L+RLA
Sbjct: 491 LEAARTLAEGILTHFRD-PNGGFFDASGG--GLPLRAKDVFDGPYPGGNSAAAELLIRLA 547

Query: 702 SI--------VAGSKSDYYRQNAEHSLAVF 723
           ++         A    +++ Q   HS + F
Sbjct: 548 ALYEREDWAEAARGAIEFHAQGLAHSPSAF 577


>gi|255306584|ref|ZP_05350755.1| hypothetical protein CdifA_08327 [Clostridium difficile ATCC 43255]
          Length = 678

 Score =  453 bits (1166), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 243/615 (39%), Positives = 353/615 (57%), Gaps = 65/615 (10%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           N+  N L  E SPYLLQHA+NP++W++W +EAF +A++ D PIFLS+GYSTCHWCHVME 
Sbjct: 4   NRKPNNLINEKSPYLLQHAYNPINWYSWNDEAFKKAKEEDKPIFLSVGYSTCHWCHVMEK 63

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE VA+++N  FV+IKVD+EERPDVD VYMT  QA+ G GGWP+++ ++PD KP  
Sbjct: 64  ESFEDEEVAEIMNRNFVAIKVDKEERPDVDSVYMTVCQAMTGSGGWPMTIIMTPDKKPFF 123

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFP   +Y RPG   +L  V + W+  RD+L +SG   IE L +      +   L  
Sbjct: 124 AGTYFPKYSRYNRPGVIDLLENVSEKWNTSRDILIKSGDEIIEALKDDFGVKNTEGDLSK 183

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASE 336
           E+  +++R+        YD ++GGFG+APKFP P  +  ++  Y  +K +D         
Sbjct: 184 EMLSSSVRV----FKAIYDEKYGGFGNAPKFPSPQNLMFLMKYYSIEKDKDV-------- 231

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
             KMV  TL  M +GG+ DH+G GF RYS D++W  PHFEKMLYD   L   +LDA+ +T
Sbjct: 232 -LKMVEKTLDGMYRGGLFDHIGFGFSRYSTDKKWLAPHFEKMLYDNAMLTIAFLDAYKIT 290

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
           K   Y  I    +DY+ R+M    G  +SA+DADS   EG    +EG FY +   E+ ++
Sbjct: 291 KKELYKEIAIKTIDYVVREMKDKEGGFYSAQDADS---EG----EEGKFYTFNPLEIIEV 343

Query: 457 LGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGMPLEK 513
           LGE   I F  ++ +  +GN            F+GK++  LI+               E+
Sbjct: 344 LGEEDGIFFNNYFDITSSGN------------FEGKSIPNLIK-----------NKEYER 380

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
           +   + +  +K+F+ R +R   H DDK++ SWN L+I +  +A   LK++          
Sbjct: 381 HNEKIADLSKKVFEYRKERTSLHKDDKILTSWNALMIVALTKAYSTLKNDI--------- 431

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
                  Y+E +    +FI  +L +E + RL   +R+G S    +LDDYAFLI   ++LY
Sbjct: 432 -------YLEYSNKCLNFINNNLVNE-SGRLLARYRDGSSDYLAYLDDYAFLIWAYIELY 483

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           E     K+L  A+ L  +   LF D E  G++    +  +++ R K+ +DGA PSGNSV 
Sbjct: 484 ESTFNMKYLEKALNLNESCINLFWDYEKSGFYIYGKDSENLIARPKDLYDGAIPSGNSVQ 543

Query: 694 VINLVRLASIVAGSK 708
           + NL+RLA I   ++
Sbjct: 544 LYNLIRLAKITGDNR 558


>gi|222056570|ref|YP_002538932.1| hypothetical protein Geob_3488 [Geobacter daltonii FRC-32]
 gi|221565859|gb|ACM21831.1| protein of unknown function DUF255 [Geobacter daltonii FRC-32]
          Length = 705

 Score =  453 bits (1165), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 265/625 (42%), Positives = 351/625 (56%), Gaps = 61/625 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL    SPYLLQHA NPVDWF WGEEAFA+AR  D PIFLSIGY+TCHWCHVM  ESFE
Sbjct: 34  NRLIFADSPYLLQHAENPVDWFQWGEEAFAKARAEDKPIFLSIGYATCHWCHVMAHESFE 93

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  VAK LND FV+IKVDREERPD+D  +M   Q + G GGWPL+V L+PD KP    TY
Sbjct: 94  DREVAKALNDSFVAIKVDREERPDIDDQFMAVAQMISGSGGWPLNVLLTPDKKPFFAATY 153

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAF---AIEQLSEALSASASSNKLPDE 279
            P E + G PG   +L ++   W ++RD + +S +    ++E+L+    A A       E
Sbjct: 154 LPKERRMGVPGIIDLLERISRFWQRERDKVEESCSTIMASLERLNRTEPAYAGG-----E 208

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
           L + A      QL+  YD  +GGFG APKFP P  I  +L          K+G   E  +
Sbjct: 209 LEEAAF----NQLAAMYDDDWGGFGQAPKFPMPHYISFLL-------RCWKAGR-PEALQ 256

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           M   TL  M +GGI+D +G G HRYSVD +W VPHFEKMLYDQ  +A  + +AF  T   
Sbjct: 257 MAEHTLTRMRQGGIYDQLGFGIHRYSVDRQWLVPHFEKMLYDQALVAIAFAEAFQATGKN 316

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 459
           +Y  + R+IL+Y   +M G  G   SA+DAD   TEG    +EG FY+W + EV+++LGE
Sbjct: 317 YYREVVREILNYCLVEMTGIDGGFCSAQDAD---TEG----QEGKFYLWAAAEVKEVLGE 369

Query: 460 HAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
            A  LF   + +   GN            F+GKN+L      ++ A + G+  E +   L
Sbjct: 370 EAARLFCRLFDITEKGN------------FEGKNILHLPVSIASFADREGLIAESFKGEL 417

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
            + R KL  VR KR RP  D KV+ +WNGL+I++ A+   +   E               
Sbjct: 418 IKWRAKLLTVRQKRVRPLRDAKVLTAWNGLLIAALAKGYGVTGDET-------------- 463

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
             Y+  AESA + I   L  ++  RL  S+  G +K P FL+DYAFL  GLL+LY+    
Sbjct: 464 --YLRAAESAVTIILEKLQTKEG-RLSRSYHLGQAKIPAFLEDYAFLGWGLLELYQVSLH 520

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
             +L  A+ L      LF    GGG+++   +   VL+R K  +DGA PSGNS++ +NL+
Sbjct: 521 QGYLFQALRLARDMIRLF-SAPGGGFYDNGMDAEEVLIRQKNAYDGAMPSGNSIAAMNLL 579

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVF 723
           RL  I+   K D      EH +  F
Sbjct: 580 RLGKIL---KDDSLETAGEHGVGAF 601


>gi|91772578|ref|YP_565270.1| hypothetical protein Mbur_0543 [Methanococcoides burtonii DSM 6242]
 gi|91711593|gb|ABE51520.1| Protein of unknown function DUF255 [Methanococcoides burtonii DSM
           6242]
          Length = 703

 Score =  453 bits (1165), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 252/683 (36%), Positives = 372/683 (54%), Gaps = 45/683 (6%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E +PYLLQHA++ VDW+ W EEAF +A+  D PIFLSIGYSTCHWCHVM  ESF 
Sbjct: 10  NRLINEKNPYLLQHANDSVDWYPWTEEAFEKAKNEDKPIFLSIGYSTCHWCHVMAKESFR 69

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           ++ VAK++ND FVSIKVDREERPD+D VYM   Q + G GGWPL++ ++P+  P +  TY
Sbjct: 70  NKDVAKMMNDTFVSIKVDREERPDIDSVYMDICQKMNGSGGWPLTIIMTPEKVPFIAATY 129

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
            P +  +GR G   I+  ++  W ++ + + +        LSE      S N   +E+ +
Sbjct: 130 IPLKSGFGRKGMLEIIPWIEHLWKEEHNKIVEQTELIKTALSE-----KSENSHNEEVTE 184

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             +      L+ ++D+  GGFG++PKFP P  I  +L + K      ++G  +  Q MV 
Sbjct: 185 EIIHRTYTYLANNFDNENGGFGTSPKFPSPHNISYLLRYWK------RTGNPTALQ-MVE 237

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TLQ M KGGI+DH+G GFHRYS D  W VPHFEKMLYDQ  L   Y +A+  T    YS
Sbjct: 238 RTLQAMRKGGIYDHIGFGFHRYSTDSSWLVPHFEKMLYDQALLIIAYTEAYQATNKEEYS 297

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHA 461
               +I++Y+ RDM  P G  + A DADS E        EG FY W   E+E IL  E  
Sbjct: 298 NTANEIIEYILRDMTSPDGGFYCAGDADSEEV-------EGRFYTWELSEIESILNREDH 350

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
            +F++ + ++P GN        P+    GKN+L    D  +   +  +  ++  +I+  C
Sbjct: 351 PIFRDAFNVRPEGNFLEESTHRPN----GKNILHLEKDLESIEKQYNITRKEIDHIIERC 406

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           R++LF  R KR  P  DDK++  WNGL++++ + + +++ +                K Y
Sbjct: 407 RKQLFSTREKRIHPSKDDKILTDWNGLMLAALSISGRVMGN----------------KRY 450

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
           +++A+  A  +      E    L H++ +      GFLDDYAF   GL++LYE      +
Sbjct: 451 IDIAKRNADLLISERMKENG-ELYHNYSSNKEPTIGFLDDYAFFTWGLIELYEATFEVTY 509

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           L  A++L +   E F D   GG+F+T+ +  ++L R KE +DGA PSGNSV + NL++L+
Sbjct: 510 LAKALQLTDYMIENFKDTINGGFFHTSNKSETLLFRKKEVYDGAIPSGNSVEINNLLKLS 569

Query: 702 SIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSV 761
            +    + +     A  +   F + +  M           D+   PS + +V+ G   S 
Sbjct: 570 KLTGNPELN---SEAIDTSNAFASTIYAMPFGYTHFIAGLDLALAPSVE-IVIAGELDSE 625

Query: 762 DFENMLAAAHASYDLNKTVSKKS 784
           D + ML   +  +   KTV  KS
Sbjct: 626 DTQLMLNNINEEFIPGKTVIVKS 648


>gi|255100682|ref|ZP_05329659.1| hypothetical protein CdifQCD-6_07712 [Clostridium difficile
           QCD-63q42]
          Length = 678

 Score =  453 bits (1165), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 243/615 (39%), Positives = 352/615 (57%), Gaps = 65/615 (10%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           N+  N L  E SPYLLQHA+NP++W++W +EAF +A++ D PIFLS+GYSTCHWCHVME 
Sbjct: 4   NRKPNNLINEKSPYLLQHAYNPINWYSWNDEAFKKAKEEDKPIFLSVGYSTCHWCHVMEK 63

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE VA+++N  FV+IKVD+EERPDVD VYMT  QA+ G GGWP+++ ++PD KP  
Sbjct: 64  ESFEDEEVAEIMNRNFVAIKVDKEERPDVDSVYMTVCQAMTGSGGWPMTIIMTPDKKPFF 123

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFP   +Y RPG   +L  V + W+  RD+L +SG   IE L +      +   L  
Sbjct: 124 AGTYFPKYSRYNRPGVIDLLENVSEKWNTSRDILIKSGDEIIEALKDDFGVKNTEGDLSK 183

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASE 336
           E+  +++R+        YD  +GGFG+APKFP P  +  ++  Y  +K +D         
Sbjct: 184 EMLSSSVRV----FKAIYDENYGGFGNAPKFPSPQNLMFLMKYYSIEKDKDV-------- 231

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
             KMV  TL  M +GG+ DH+G GF RYS D++W  PHFEKMLYD   L   +LDA+ +T
Sbjct: 232 -LKMVEKTLDGMYRGGLFDHIGFGFSRYSTDKKWLAPHFEKMLYDNAMLTIAFLDAYKIT 290

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
           K   Y  I    +DY+ R+M    G  +SA+DADS   EG    +EG FY +   E+ ++
Sbjct: 291 KKELYKEIAIKTIDYVVREMKDKEGGFYSAQDADS---EG----EEGKFYTFNPLEIIEV 343

Query: 457 LGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGMPLEK 513
           LGE   I F  ++ +  +GN            F+GK++  LI+               E+
Sbjct: 344 LGEEDGIFFNNYFDITSSGN------------FEGKSIPNLIK-----------NKEYER 380

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
           +   + +  +K+F+ R +R   H DDK++ SWN L+I +  +A   LK++          
Sbjct: 381 HNEKIADLSKKVFEYRKERTSLHKDDKILTSWNALMIVALTKAYSTLKNDI--------- 431

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
                  Y+E +    +FI  +L +E + RL   +R+G S    +LDDYAFLI   ++LY
Sbjct: 432 -------YLEYSNKCLNFINNNLVNE-SGRLLARYRDGSSDYLAYLDDYAFLIWAYIELY 483

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           E     K+L  A+ L  +   LF D E  G++    +  +++ R K+ +DGA PSGNSV 
Sbjct: 484 ESTFNMKYLEKALNLNESCINLFWDYEKSGFYIYGKDSENLIARPKDLYDGAIPSGNSVQ 543

Query: 694 VINLVRLASIVAGSK 708
           + NL+RLA I   ++
Sbjct: 544 LYNLIRLAKITGDNR 558


>gi|448382091|ref|ZP_21561926.1| hypothetical protein C478_06099 [Haloterrigena thermotolerans DSM
           11522]
 gi|445662325|gb|ELZ15095.1| hypothetical protein C478_06099 [Haloterrigena thermotolerans DSM
           11522]
          Length = 731

 Score =  452 bits (1164), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 256/653 (39%), Positives = 362/653 (55%), Gaps = 49/653 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W E+A A A++RDVPIFLSIGYS CHWCHVME ESF 
Sbjct: 8   NRLDEEESPYLRQHADNPVNWQPWDEQALAAAKERDVPIFLSIGYSACHWCHVMEEESFA 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA++LN+ FV IKVDREERPDVD +YMT  Q + G GGWPLS +L+P+ KP   GTY
Sbjct: 68  DEAVAEVLNENFVPIKVDREERPDVDSIYMTVCQLVRGQGGWPLSAWLTPEGKPFFIGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRD------MLAQSGAFAIEQLSEALSASASSNKL 276
           FP E K G+PGF  +  ++ D+W+ + D         Q    A ++L E   ++      
Sbjct: 128 FPREGKRGQPGFLDLCERISDSWESEEDREEMQHRAQQWTDAATDRLEETPDSAGVDAGG 187

Query: 277 PDELP-QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
             E P  + L   A+ + +S D ++GGFG+  KFP+P  ++++   ++  + TG+     
Sbjct: 188 AAEPPSSDVLEAAADAVLRSADRQYGGFGTGQKFPQPSRLRVL---ARTYDRTGR----E 240

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
           E ++++  TL  MA GG+ DHVGGGFHRY VD  W VPHFEKMLYD  ++   +L  + L
Sbjct: 241 EYREVLAETLDAMAAGGLADHVGGGFHRYCVDRDWTVPHFEKMLYDNAEIPRAFLAGYQL 300

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           T +  Y+    D L ++ R++    G  FS  DA S + E   R +EGAFYVWT +EV D
Sbjct: 301 TGEDRYAETVADTLAFVDRELTHDEGGFFSTLDAQSEDPETGER-EEGAFYVWTPEEVHD 359

Query: 456 ILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
           ++ +   A LF   Y +  +GN            F+G+N    +   S  AS+  +   +
Sbjct: 360 VIADETDASLFCARYDITESGN------------FEGQNQPNRIARVSELASQFDLAESE 407

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
            L  L   R++LF+ R +RPRP  D+K++  WNGL+IS++A A+ +L             
Sbjct: 408 VLKRLDSARKRLFEAREERPRPDRDEKILAGWNGLMISTYAEAALVL------------- 454

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
            G D  EY E A  A  F+R  L+D+++ RL   ++ G  K  G+L+DYAFL  G LD Y
Sbjct: 455 -GED--EYAETAVDALEFVRDRLWDDESQRLSRRYKAGDVKVDGYLEDYAFLARGALDCY 511

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           +       L +A+EL    +  F D + G  + T     S++ R +E  D + PS   V+
Sbjct: 512 QATGEVDHLAFALELARVIETEFWDADRGTLYFTPESGESLVTRPQELGDQSTPSSTGVA 571

Query: 694 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 746
           V  L+ L    A    D     A   L     +L+  A+    +C AAD L+ 
Sbjct: 572 VETLLALDEFAASEFGDI----AATVLETHANKLEANALEHATLCLAADRLAA 620


>gi|254975197|ref|ZP_05271669.1| hypothetical protein CdifQC_07775 [Clostridium difficile QCD-66c26]
 gi|255092587|ref|ZP_05322065.1| hypothetical protein CdifC_07992 [Clostridium difficile CIP 107932]
 gi|255314324|ref|ZP_05355907.1| hypothetical protein CdifQCD-7_08235 [Clostridium difficile
           QCD-76w55]
 gi|255517004|ref|ZP_05384680.1| hypothetical protein CdifQCD-_07809 [Clostridium difficile
           QCD-97b34]
 gi|255650105|ref|ZP_05397007.1| hypothetical protein CdifQCD_07959 [Clostridium difficile
           QCD-37x79]
 gi|260683234|ref|YP_003214519.1| hypothetical protein CD196_1491 [Clostridium difficile CD196]
 gi|260686830|ref|YP_003217963.1| hypothetical protein CDR20291_1466 [Clostridium difficile R20291]
 gi|306520110|ref|ZP_07406457.1| hypothetical protein CdifQ_08874 [Clostridium difficile QCD-32g58]
 gi|384360839|ref|YP_006198691.1| hypothetical protein CDBI1_07695 [Clostridium difficile BI1]
 gi|260209397|emb|CBA62859.1| conserved hypothetical protein [Clostridium difficile CD196]
 gi|260212846|emb|CBE04045.1| conserved hypothetical protein [Clostridium difficile R20291]
          Length = 678

 Score =  452 bits (1164), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 242/615 (39%), Positives = 353/615 (57%), Gaps = 65/615 (10%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           N+  N L  E SPYLLQHA+NP++W++W +EAF +A++ D PIFLS+GYSTCHWCHVME 
Sbjct: 4   NRKPNNLINEKSPYLLQHAYNPINWYSWNDEAFKKAKEEDKPIFLSVGYSTCHWCHVMEK 63

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE VA+++N  FV+IKVD+EERPDVD VYMT  QA+ G GGWP+++ ++PD KP  
Sbjct: 64  ESFEDEEVAEIMNRNFVAIKVDKEERPDVDSVYMTVCQAMTGSGGWPMTIIMTPDKKPFF 123

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFP   +Y RPG   +L+ V + W+  RD+L +SG   IE L +      +   L  
Sbjct: 124 AGTYFPKYSRYNRPGVIDLLKNVSEKWNTSRDILIKSGDEIIEALKDDFGVKNTEGDLSK 183

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASE 336
           E+  +++R+        YD ++GGFG+APKFP P  +  ++  Y  +K +D         
Sbjct: 184 EMLSSSVRV----FKAIYDEKYGGFGNAPKFPSPQNLMFLMKYYSIEKDKDV-------- 231

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
             KMV  TL  M +GG+ DH+G GF RYS D++W  PHFEKMLYD   L   +LDA+ +T
Sbjct: 232 -LKMVEKTLDGMYRGGLFDHIGFGFSRYSTDKKWLAPHFEKMLYDNAMLTIAFLDAYKIT 290

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
           K   Y  I    +DY+ R+M    G  +SA+DADS   EG    +EG FY +   E+ ++
Sbjct: 291 KKELYKEIAIKTIDYVVREMKDKEGGFYSAQDADS---EG----EEGKFYTFNPLEIIEV 343

Query: 457 LGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGMPLEK 513
           LGE     F  ++ +  +GN            F+GK++  LI+               E+
Sbjct: 344 LGEEDGTFFNNYFDITSSGN------------FEGKSIPNLIK-----------NKEYER 380

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
           +   + +  +K+F+ R +R   H DDK++ SWN L+I +  +A   LK++          
Sbjct: 381 HNEKIADLSKKVFEYRKERTSLHKDDKILTSWNALMIVALTKAYSTLKNDI--------- 431

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
                  Y+E +    +FI  +L +E + RL   +R+G S    +LDDYAFLI   ++LY
Sbjct: 432 -------YLEYSNKCLNFINNNLVNE-SGRLLARYRDGSSDYLAYLDDYAFLIWAYIELY 483

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           E     K+L  A+ L  +   LF D E  G++    +  +++ R K+ +DGA PSGNSV 
Sbjct: 484 ESTFNMKYLEKALNLNESCINLFWDYEKSGFYIYGKDSENLIARPKDLYDGAIPSGNSVQ 543

Query: 694 VINLVRLASIVAGSK 708
           + NL+RLA I   ++
Sbjct: 544 LYNLIRLAKITGDNR 558


>gi|218887845|ref|YP_002437166.1| hypothetical protein DvMF_2759 [Desulfovibrio vulgaris str.
           'Miyazaki F']
 gi|218758799|gb|ACL09698.1| protein of unknown function DUF255 [Desulfovibrio vulgaris str.
           'Miyazaki F']
          Length = 756

 Score =  452 bits (1163), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 273/712 (38%), Positives = 379/712 (53%), Gaps = 67/712 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL+   SPYLLQHA NPV W  WG+EA   AR  D P+F+S+GYSTCHWCHVM  ESFE
Sbjct: 5   NRLSTSKSPYLLQHADNPVHWHPWGDEALQRARDEDRPLFVSVGYSTCHWCHVMAHESFE 64

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+ VA+LLND FV +KVDREERPD+D  YM   Q L G GGWPL++   PD +P    TY
Sbjct: 65  DDEVARLLNDAFVCVKVDREERPDIDAAYMAACQMLTGSGGWPLTIIALPDGRPFFAATY 124

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL---SEALSASASSNKLPDE 279
            P   + GR G   ++ +V + W  KRD +  S    +E +   +EA+    +  +LP  
Sbjct: 125 LPKHSRPGRIGLMDLVPRVLEVWRHKRDDVLDSADSIVEHVRRHAEAMLRPPADGRLPG- 183

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK---------LEDTGK 330
                L    E ++  +D+  GGFG+APKFP P  +  +L  +++         L   G 
Sbjct: 184 --AGTLHAACEAMASEFDAVNGGFGTAPKFPSPHNLLFLLRWARRNGHAAGQPGLAQAGT 241

Query: 331 --SGEASEGQK---MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
             +GE S G K   M   TL+ + +GGIHDHVG GFHRYS D RW +PHFEKMLYDQ  L
Sbjct: 242 VPTGEESGGAKALRMAAQTLRSIRRGGIHDHVGYGFHRYSTDARWLLPHFEKMLYDQAML 301

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
              Y +A+  T D  +     +   Y+ RD+  P G  +SAEDADS E +GA  + EG F
Sbjct: 302 MLAYAEAWLATGDGEFRRTAEETAAYVLRDLASPEGAFYSAEDADS-ELDGA--RGEGLF 358

Query: 446 YVWTSKEVEDILGEHAILFKEHYYLKPTGN------------CDLSRMS----------- 482
           Y +T  ++E+      +       ++P G+             DL+  +           
Sbjct: 359 YTFTLADIEEACAPLDVRPGVRPAVRPDGDGGGGVNPASLSEADLTARAFGCTAYGNYED 418

Query: 483 DPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVI 542
           +      G+NVL         A  LG+P  +    L   R  LFD+R++RPRPHLDDKV+
Sbjct: 419 EATRSRTGRNVLHLPRAPQELARDLGLPPREVEERLEAARAALFDLRARRPRPHLDDKVL 478

Query: 543 VSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTH 602
             WNGL I++ +R ++                  D     E A +AA F+   +   Q  
Sbjct: 479 ADWNGLAIAAMSRCAQAF----------------DAPHLAEAAAAAADFVLARMV-TQEG 521

Query: 603 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG 662
           RL H +R+G +  PG LDDYAF+I GL++LY      +WL  A+ LQ  QD  F D EGG
Sbjct: 522 RLLHRWRDGEAAVPGLLDDYAFMIWGLIELYGATGEVRWLRRALRLQEVQDTFFHDAEGG 581

Query: 663 GYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 722
           GY+ T  +  ++L+R KE HDGA PSGN+ ++ NL+RLA ++   +   Y + A   L  
Sbjct: 582 GYWMTPADGDALLVRRKEGHDGALPSGNAAALFNLLRLALLLGRPE---YGERARGVLRA 638

Query: 723 FETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASY 774
           F T+++   +   +  C  D  ++   + V++ G     D E MLAA   +Y
Sbjct: 639 FATQVRHHPVGSTMFLCGVD-FALSGGRSVIVAGEPDQPDTEAMLAAVRGTY 689


>gi|80978835|gb|ABB54669.1| SSP411 [Homo sapiens]
          Length = 521

 Score =  452 bits (1163), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 224/460 (48%), Positives = 300/460 (65%), Gaps = 27/460 (5%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYLLQHA+NPVDW+ WG+EAF +ARK + PIFLS+GYSTCHWCH+ME ESF+
Sbjct: 63  NRLIHEKSPYLLQHAYNPVDWYPWGQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQ 122

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +E + +LL++ FVS+KVDREERPDVDKVYMT+VQA   GGGWP++V+L+P+L+P +GGTY
Sbjct: 123 NEEIGRLLSEDFVSVKVDREERPDVDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTY 182

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPPED   R GF+T+L ++++ W + ++ L ++     ++++ AL A +  +    +LP 
Sbjct: 183 FPPEDGLTRVGFRTVLLRIREQWKQNKNTLLENS----QRVTTALLARSEISVGDRQLPP 238

Query: 283 NALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYH--SKKLEDTGKSGEASEG 337
           +A  +   C +QL + YD  +GGF  APKFP PV +  +  +  S +L   G     S  
Sbjct: 239 SAATVNNRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFLFSYWLSHRLTQDG-----SRA 293

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
           Q+M L TL+ MA GGI DHVG GFHRYS D +WHVPHFEKMLYDQ QLA  Y  AF L+ 
Sbjct: 294 QQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSG 353

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           D FYS + + IL Y+ R +    G  +SAEDADS    G  R KEGA+YVWT KEV+ +L
Sbjct: 354 DEFYSDVAKGILQYVARSLSHRSGGFYSAEDADSPPERG-QRPKEGAYYVWTVKEVQQLL 412

Query: 458 GEHAI----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 507
            E  +          L  +HY L   GN   S+  DP  E +G+NVL        +A++ 
Sbjct: 413 PEPVLGATEPLTSGQLLMKHYGLTEAGNISPSQ--DPKGELQGQNVLTVRYSLELTAARF 470

Query: 508 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNG 547
           G+ +E    +L     KLF  R  RP+PHLD K++ +WNG
Sbjct: 471 GLDVEAVRTLLNSGLEKLFQARKHRPKPHLDSKMLAAWNG 510


>gi|197121417|ref|YP_002133368.1| hypothetical protein AnaeK_1004 [Anaeromyxobacter sp. K]
 gi|196171266|gb|ACG72239.1| protein of unknown function DUF255 [Anaeromyxobacter sp. K]
          Length = 718

 Score =  452 bits (1162), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 276/660 (41%), Positives = 382/660 (57%), Gaps = 67/660 (10%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           + TNRLA E SPYLLQHAHNPV W+AWG+EAF EAR+   P+FLS+GYSTCHWCHVME E
Sbjct: 37  RFTNRLALERSPYLLQHAHNPVSWWAWGDEAFEEARRTGRPVFLSVGYSTCHWCHVMERE 96

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFEDE +A++LN+ +V+IKVDREERPDVD +YMT VQ L G GGWP+SV+L+PD +P  G
Sbjct: 97  SFEDEEIARVLNERYVAIKVDREERPDVDAIYMTAVQLLTGSGGWPMSVWLTPDREPFFG 156

Query: 220 GTYFPPEDKYGRP--GFKTILRKVKDAWDKKRDML-AQSGAFAIEQLSEALSASASSNKL 276
           GTYFPP D    P  GF +IL ++   W++  D + + +GA      +    A  ++ ++
Sbjct: 157 GTYFPPRDGVRGPARGFLSILHEIAGLWERDPDRIRSATGALVEAVRTALAPAGPAAAEV 216

Query: 277 PDELP-QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
           P   P ++A+ L    L +S+D R GG   APKFP  V ++++L H +      ++GE  
Sbjct: 217 PGPEPIEHAVAL----LERSFDERHGGLRRAPKFPSNVPVRLLLRHHR------RTGE-E 265

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
              +M   TL+ MA GG+HD VGGGFHRYS D  W VPHFEKMLYD   LA  Y +A+ L
Sbjct: 266 RSLRMATVTLERMAAGGLHDQVGGGFHRYSTDAEWLVPHFEKMLYDNALLALAYAEAWQL 325

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           T    ++ + R  LDYL R++  P G ++SA DADS   EG    +EG F+ WT  E+ +
Sbjct: 326 TGRRDFARVTRQTLDYLLRELTSPEGGLYSATDADS---EG----EEGRFFTWTEAELRE 378

Query: 456 ILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
            LG+ A  F   + ++P GN            F+G++VL            +  P E   
Sbjct: 379 ALGDRAEAFLRFHGVRPEGN------------FEGRSVL-----------HVPAPDEDAW 415

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
             L   R  L+ +R +RPRP  D+K++  WNGL IS+ A   + L               
Sbjct: 416 EALAPDRAALYALRERRPRPLRDEKILAGWNGLAISALAFGGRALAE------------- 462

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
                +++ A  AA F+   L  +   RLQ S+  G +  P +L+D+AFL+ GLLDL+E 
Sbjct: 463 ---PRWVDAAARAADFVLTRLVKDG--RLQRSWLAGRAGVPAYLEDHAFLVQGLLDLHEA 517

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
               +WL  A EL   QD LF D EGGG+F +  +   +L R K  HDGAEPSG SV+ +
Sbjct: 518 TFDPRWLAAAAELAGAQDRLFGDPEGGGWFQSATDHERLLAREKPTHDGAEPSGASVAAL 577

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 755
           N +RL +  +  +   +R+ A+ +L      L +  +A+  +  A D  S   R+ VVLV
Sbjct: 578 NALRLEAFTSDPR---WRRAADGALRHHARTLAEQPLAMSELLLALDCASDAVRE-VVLV 633


>gi|220916114|ref|YP_002491418.1| hypothetical protein A2cp1_1001 [Anaeromyxobacter dehalogenans
           2CP-1]
 gi|219953968|gb|ACL64352.1| protein of unknown function DUF255 [Anaeromyxobacter dehalogenans
           2CP-1]
          Length = 718

 Score =  452 bits (1162), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 273/659 (41%), Positives = 382/659 (57%), Gaps = 66/659 (10%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           + TNRLA E SPYLLQHAHNPV W+AWG+EAF EAR+   P+FLS+GYSTCHWCHVME E
Sbjct: 37  RFTNRLALERSPYLLQHAHNPVSWWAWGDEAFEEARRTGRPVFLSVGYSTCHWCHVMERE 96

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFEDE +A++LN+ +V+IKVDREERPDVD +YMT VQ L G GGWP+SV+L+PD +P  G
Sbjct: 97  SFEDEEIARVLNERYVAIKVDREERPDVDAIYMTAVQLLTGSGGWPMSVWLTPDREPFFG 156

Query: 220 GTYFPPEDKYGRP--GFKTILRKVKDAWDKKRDML-AQSGAFAIEQLSEALSASASSNKL 276
           GTYFPP D    P  GF +IL ++   W++  D + + +GA      +    A  ++ ++
Sbjct: 157 GTYFPPRDGVRGPARGFLSILHEIAGLWERDPDRIRSATGALVEAVRTALAPAGPAAAQV 216

Query: 277 PDELP-QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
           P   P ++A+ L    L +S+D R GG   APKFP  V ++++L H +      ++GEA 
Sbjct: 217 PGPEPIEHAVAL----LERSFDERHGGLRRAPKFPSNVPVRLLLRHHR------RTGEA- 265

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
              +M   TL+ MA GG+HD VGGGFHRYS D  W VPHFEKMLYD   LA  Y +A+ +
Sbjct: 266 RSLRMATVTLERMAAGGLHDQVGGGFHRYSTDAEWLVPHFEKMLYDNALLALAYAEAWQV 325

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           T    ++ + R  LDYL R++  P G ++SA DADS   EG    +EG F+ WT  E+ +
Sbjct: 326 TGRRDFARVTRQTLDYLLRELTSPEGGLYSATDADS---EG----EEGRFFTWTEAELRE 378

Query: 456 ILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
            LG+ A  F   + ++P GN            F+G++VL            +  P E   
Sbjct: 379 ALGDRAEAFLRFHGVRPEGN------------FEGRSVL-----------HVPAPDEDAW 415

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
             L   R  L+ +R +RPRP  D+K++  WNGL IS+ A   + L               
Sbjct: 416 EALAPDRAALYALRERRPRPLRDEKILAGWNGLAISALAFGGRALAE------------- 462

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
                +++ A  AA F+   L  +   RLQ S+  G +  P +L+D+AFL+ GLLDL+E 
Sbjct: 463 ---PRWVDAAARAADFVLTRLVKDG--RLQRSWLAGRAGVPAYLEDHAFLVQGLLDLHEA 517

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
               +WL  A EL   QD LF D EGGG+F +  +   +L R K  HDGAEPSG SV+ +
Sbjct: 518 TFDPRWLAAAAELAGAQDRLFGDPEGGGWFQSATDHERLLAREKPTHDGAEPSGASVAAL 577

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVL 754
           N +RL +  +  +   +R+ A+ +L      L +  +A+  +  A D  S   R+ V++
Sbjct: 578 NALRLEAFTSDPR---WRRAADGALRHHARTLAEQPLAMSELLLALDYASDAVREVVLI 633


>gi|126699171|ref|YP_001088068.1| hypothetical protein CD630_15680 [Clostridium difficile 630]
 gi|115250608|emb|CAJ68432.1| conserved hypothetical protein [Clostridium difficile 630]
          Length = 678

 Score =  452 bits (1162), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 242/615 (39%), Positives = 353/615 (57%), Gaps = 65/615 (10%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           N+  N L  E SPYLLQHA+NP++W++W +EAF +A++ D PIFLS+GYSTCHWCHVME 
Sbjct: 4   NRKPNNLINEKSPYLLQHAYNPINWYSWNDEAFKKAKEEDKPIFLSVGYSTCHWCHVMEK 63

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE VA+++N  FV+IKVD+EERPDVD VYMT  QA+ G GGWP+++ ++PD KP  
Sbjct: 64  ESFEDEEVAEIMNRNFVAIKVDKEERPDVDSVYMTVCQAMTGSGGWPMTIIMTPDKKPFF 123

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFP   +Y RPG   +L  V + W+  RD+L +SG   IE L +      +   L  
Sbjct: 124 AGTYFPKYSRYNRPGVIDLLENVSEKWNTSRDILIKSGDEIIEALKDDFGVKNTEGDLSK 183

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASE 336
           ++  +++R+        YD ++GGFG+APKFP P  +  ++  Y  +K +D         
Sbjct: 184 DMLSSSVRV----FKAIYDEKYGGFGNAPKFPSPQNLMFLMKYYSIEKDKDV-------- 231

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
             KMV  TL  M +GG+ DH+G GF RYS D++W  PHFEKMLYD   L   +LDA+ +T
Sbjct: 232 -LKMVEKTLDGMYRGGLFDHIGFGFSRYSTDKKWLAPHFEKMLYDNAMLTIAFLDAYKIT 290

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
           K   Y  I    +DY+ R+M    G  +SA+DADS   EG    +EG FY +   E+ ++
Sbjct: 291 KKELYKEIAIKTIDYVVREMKDKEGGFYSAQDADS---EG----EEGKFYTFNPLEIIEV 343

Query: 457 LGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGMPLEK 513
           LGE   I F  ++ +  +GN            F+GK++  LI+               E+
Sbjct: 344 LGEEDGIFFNNYFDITSSGN------------FEGKSIPNLIK-----------NKEYER 380

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
           +   + +  +K+F+ R +R   H DDK++ SWN L+I +  +A   LK++          
Sbjct: 381 HNEKIADLSKKVFEYRKERTSLHKDDKILTSWNALMIVALTKAYSTLKNDI--------- 431

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
                  Y+E +    +FI  +L +E + RL   +R+G S    +LDDYAFLI   ++LY
Sbjct: 432 -------YLEYSNKCLNFINNNLVNE-SGRLLARYRDGSSDYLAYLDDYAFLIWAYIELY 483

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           E     K+L  A+ L  +   LF D E  G++    +  +++ R K+ +DGA PSGNSV 
Sbjct: 484 ESTFNMKYLEKALNLNESCINLFWDYEKSGFYIYGKDSENLIARPKDLYDGAIPSGNSVQ 543

Query: 694 VINLVRLASIVAGSK 708
           + NL+RLA I   ++
Sbjct: 544 LYNLIRLAKITGDNR 558


>gi|423090012|ref|ZP_17078355.1| hypothetical protein HMPREF9945_01541 [Clostridium difficile
           70-100-2010]
 gi|357557317|gb|EHJ38868.1| hypothetical protein HMPREF9945_01541 [Clostridium difficile
           70-100-2010]
          Length = 678

 Score =  452 bits (1162), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 242/615 (39%), Positives = 352/615 (57%), Gaps = 65/615 (10%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           N+  N L  E SPYLLQHA+NP++W++W +EAF +A++ D PIFLS+GYSTCHWCHVME 
Sbjct: 4   NRKPNNLINEKSPYLLQHAYNPINWYSWNDEAFKKAKEEDKPIFLSVGYSTCHWCHVMEK 63

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE VA+++N  FV+IKVD+EERPDVD VYMT  QA+ G GGWP+++ ++PD KP  
Sbjct: 64  ESFEDEEVAEIMNRNFVAIKVDKEERPDVDSVYMTVCQAMTGSGGWPMTIIMTPDKKPFF 123

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFP   +Y RPG   +L  V + W+  RD+L +SG   IE L +      +   L  
Sbjct: 124 AGTYFPKYSRYNRPGVIDLLENVSEKWNTSRDILIKSGDEIIEALKDDFGVKNTEGDLSK 183

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASE 336
           E+  +++R+        YD ++GGFG+APKFP P  +  ++  Y  +K +D         
Sbjct: 184 EMLSSSVRV----FKAIYDEKYGGFGNAPKFPSPQNLMFLMKYYSIEKDKDV-------- 231

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
             KMV  TL  M +GG+ DH+G GF RYS D++W  PHFEKMLYD   L   +LDA+ +T
Sbjct: 232 -LKMVEKTLDGMYRGGLFDHIGFGFSRYSTDKKWLAPHFEKMLYDNAMLTIAFLDAYKIT 290

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
           K   Y  I    +DY+ R+M    G  +SA+DADS   EG    +EG FY +   E+ ++
Sbjct: 291 KKELYKEIAIKTIDYVVREMKDKEGGFYSAQDADS---EG----EEGKFYTFNPLEIIEV 343

Query: 457 LGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGMPLEK 513
           LGE     F  ++ +  +GN            F+GK++  LI+               E+
Sbjct: 344 LGEEDGTFFNNYFDITSSGN------------FEGKSIPNLIK-----------NKEYER 380

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
           +   + +  +K+F+ R +R   H DDK++ SWN L+I +  +A   LK++          
Sbjct: 381 HNEKIADLSKKVFEYRKERTSLHKDDKILTSWNALMIVALTKAYSTLKNDI--------- 431

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
                  Y+E +    +FI  +L +E + RL   +R+G S    +LDDYAFLI   ++LY
Sbjct: 432 -------YLEYSNKCLNFINNNLVNE-SGRLLARYRDGSSDYLAYLDDYAFLIWAYIELY 483

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           E     K+L  A+ L  +   LF D E  G++    +  +++ R K+ +DGA PSGNSV 
Sbjct: 484 ESTFNMKYLEKALNLNESCINLFWDYEKSGFYIYGKDSENLIARPKDLYDGAIPSGNSVQ 543

Query: 694 VINLVRLASIVAGSK 708
           + NL+RLA I   ++
Sbjct: 544 LYNLIRLAKITGDNR 558


>gi|149174989|ref|ZP_01853613.1| hypothetical protein PM8797T_11454 [Planctomyces maris DSM 8797]
 gi|148846326|gb|EDL60665.1| hypothetical protein PM8797T_11454 [Planctomyces maris DSM 8797]
          Length = 876

 Score =  452 bits (1162), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 260/689 (37%), Positives = 380/689 (55%), Gaps = 62/689 (8%)

Query: 93  STSHSRNKH----TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYS 148
           +T   + KH    TNRL+ E SPYLL H HNPVDW+ WG  AF +A++ +  IFLS+GYS
Sbjct: 44  ATESEKTKHKAMFTNRLSKETSPYLLLHQHNPVDWYPWGPAAFEKAKQENKIIFLSVGYS 103

Query: 149 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALY------GGG 202
           +C+WCHVME   FE+  +AK +N+ FV+IKVDREERPD+D +YMT +   +        G
Sbjct: 104 SCYWCHVMERLVFENPEIAKYMNENFVNIKVDREERPDIDDIYMTSLSVYFHLIGAPDNG 163

Query: 203 GWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQ 262
           GWPLS+FL+PD +P  GGTYFPP D+ G+  F  +L+KV + W   +  + QS     ++
Sbjct: 164 GWPLSMFLTPDREPFAGGTYFPPTDQGGQMSFPRVLQKVNELWSGDKAKVQQSATIIAKE 223

Query: 263 LSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFG------SAPKFPRPVEIQ 316
           ++       ++  +P E     ++     ++ S+DS +GG        + PKFP   ++ 
Sbjct: 224 VARLQKEEGATEAIPIE--DRLVKAGVRSINASFDSEYGGIDFSEVSPNGPKFPTSSKLV 281

Query: 317 MMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFE 376
           ++ Y  + ++    S E++   K++  TL  MA GGI+DH+GGGFHRYS D  WHVPHFE
Sbjct: 282 LLQYDIESMDAESTSAESA---KVLYQTLDAMANGGIYDHLGGGFHRYSTDRYWHVPHFE 338

Query: 377 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEG 436
           KMLYD GQLA++Y  A+  T +  Y  +   I+D++ R++    G  +SA D   AET+G
Sbjct: 339 KMLYDNGQLASLYAKAYGQTGNEQYKQVAAGIIDFVLRELTDTQGGFYSALD---AETDG 395

Query: 437 ATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 496
                EG  Y W+ +E+++IL E   LF E Y L           ++P   F+   VL  
Sbjct: 396 V----EGEHYAWSQEELKEILDEGYPLFAEFYGL-----------NEP-VRFEHGYVLHR 439

Query: 497 LNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARA 556
           +    A A K     E   + L   R+KL  VR++R     DDK++ SWNGL+I+  A A
Sbjct: 440 VTTLKALAEKQKTTPEALESQLAAMRKKLHTVRNQRQPLLKDDKILTSWNGLMITGMANA 499

Query: 557 SKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP 616
            +ILK                R +Y   AE AA FI   + D+Q H L  S+R   ++  
Sbjct: 500 GRILK----------------RPDYTAAAEKAAQFILDQMRDKQGH-LYRSYRADQARLN 542

Query: 617 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 676
            +LDDYAFL+ GLL LYE     +WL  A  L + Q +LF D++  G+F TT +   ++ 
Sbjct: 543 AYLDDYAFLVQGLLALYEATGKQQWLDQAQALTDLQIKLFWDQKEHGFFFTTHDHEQLIA 602

Query: 677 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMA-MAVP 735
           R K  +D A PSGNS+S  NL++L  +    K   YRQ+A+ +L +F   +K        
Sbjct: 603 RTKNAYDAAIPSGNSISTRNLIQLTQLTGDPK---YRQHADQTLQLFGRVIKRYPNRCAQ 659

Query: 736 LMCCAADMLSV-PSRKHVVLVGHKSSVDF 763
           L+    + L+  P++K   L+   S   F
Sbjct: 660 LVQAVGEFLTTPPAQKQSALLAPTSDAGF 688


>gi|226356002|ref|YP_002785742.1| hypothetical protein Deide_10920 [Deinococcus deserti VCD115]
 gi|226317992|gb|ACO45988.1| conserved hypothetical protein [Deinococcus deserti VCD115]
          Length = 696

 Score =  451 bits (1161), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 246/594 (41%), Positives = 340/594 (57%), Gaps = 43/594 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA+E SPYLLQH  NPV+W+ W  EAFAEAR+RD+P+ LS+GYSTCHWCHVM  ESFE
Sbjct: 17  NRLASESSPYLLQHKDNPVNWWPWSPEAFAEARQRDLPVLLSVGYSTCHWCHVMAHESFE 76

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE  A  +N+ FV +KVDREERPDVD VYMT  QA+ G GGWP++VFL+PD +P   GTY
Sbjct: 77  DEATAAQMNEHFVCVKVDREERPDVDAVYMTATQAMTGQGGWPMTVFLTPDGEPFYAGTY 136

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPP+D YG P F+ +L  + +AW   R+ L  +     + + EA     S   LP    Q
Sbjct: 137 FPPQDGYGLPSFRRLLASIANAWQNDREKLTGNARALTDHIREASRPRPSQGDLPAGFLQ 196

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
            A     ++L + +D+  GGFG APKFP P  ++ +L                EG+ M L
Sbjct: 197 QA----PDKLRRVFDADLGGFGGAPKFPAPTLLEFLLTR-------------PEGRDMAL 239

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL+ MA GGI+D +GGGFHRYSVDERW VPHFEKMLYD  QL  V + A+  T D  ++
Sbjct: 240 HTLRRMAAGGIYDQLGGGFHRYSVDERWLVPHFEKMLYDNAQLTRVLVQAYQHTDDEDFA 299

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            + R+ L YL R+M+ P G  +SA+DAD+    G     EG  + WT  E+  +LG  + 
Sbjct: 300 RLARETLTYLEREMLSPAGGFYSAQDADTPTDHGGV---EGLTFTWTPAEIRAVLGGDSA 356

Query: 463 LFKEHYYLKPTGNCDLSRMSDPH-NEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
           L +  Y +   GN       DPH  E+  +NVL         A  LG   + + + + + 
Sbjct: 357 LIERVYGVTDQGN-----FLDPHRREYGSRNVLHLPTPLEQLARDLGEDPQAFHSRVDQA 411

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           R +L + R +R +P  DDKV+ SWNGL +++FA A+++L              G  R  Y
Sbjct: 412 RARLLEAREQRTQPGTDDKVLTSWNGLALAAFADAARVL--------------GEPR--Y 455

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
           +E+A   A F+RR L       L+H+F++G ++  G L+D+A    GL+ L++ G     
Sbjct: 456 LEIARQNAEFVRRELRLPDG-TLRHTFKDGQARVEGLLEDHALYGLGLVALFQAGGDLGH 514

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
           L WA EL       F D + G + +T G+   +L R  +  D A  S N+ + +
Sbjct: 515 LEWARELWTLVRRDFWDEDAGVFHSTGGQAEPLLSRQVQGFDSAVLSDNAAAAL 568


>gi|124504310|gb|AAI28719.1| Spata20 protein [Rattus norvegicus]
          Length = 550

 Score =  451 bits (1161), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 228/515 (44%), Positives = 319/515 (61%), Gaps = 43/515 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
            K  NRL  E SPYLLQHAHNPVDW+ WG+EAF +A+K + PIFLS+GYSTCHWCH+ME 
Sbjct: 62  QKTANRLINEKSPYLLQHAHNPVDWYPWGQEAFDKAKKENKPIFLSVGYSTCHWCHMMEE 121

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESF++E +  LLN+ FVS+ VDREERPDVDKVYMT+VQA   GGGWP++V+L+P L+P +
Sbjct: 122 ESFQNEEIGHLLNENFVSVMVDREERPDVDKVYMTFVQATSSGGGWPMNVWLTPSLQPFV 181

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
           GGTYFPPED   R GF+T+L ++ D W + ++ L ++     ++++ AL A +  +    
Sbjct: 182 GGTYFPPEDGLTRVGFRTVLMRICDQWKQNKNTLLENS----QRVTTALLARSEISVGDR 237

Query: 279 ELPQNALRL---CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGE 333
           +LP +A  +   C +QL + YD  +GGF  APKFP PV +  +   + S ++   G    
Sbjct: 238 QLPPSAATMNSRCFQQLDEGYDEEYGGFAEAPKFPTPVILNFLFSYWLSHRVTQDG---- 293

Query: 334 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
            S  Q+M L TL+ MA GGI DHVG GFHRYS D +WH+PHFEKMLYDQ QL+ VY  AF
Sbjct: 294 -SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDRQWHIPHFEKMLYDQAQLSVVYCQAF 352

Query: 394 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
            ++ D F+S + + IL Y+ R++    G  +SAEDADS    G  + +EGA Y+WT KEV
Sbjct: 353 QISGDEFFSDVAKGILQYVTRNLSHRSGGFYSAEDADSPPERG-VKPQEGALYLWTVKEV 411

Query: 454 EDILGE----------HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 503
           + +L E             L  +HY L   GN + ++  D + E  G+NVL        +
Sbjct: 412 QQLLPEPVGGASEPLTSGQLLMKHYGLSEAGNINPTQ--DVNGEMHGQNVLTVRYSLELT 469

Query: 504 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 563
           A++ G+ +E    +L     KLF  R  R + HLD+K++ +WNGL++S FA A  +L  E
Sbjct: 470 AARYGLEVEAVRALLNTGLEKLFQARKHRLKAHLDNKMLAAWNGLMVSGFAVAGSVLGME 529

Query: 564 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD 598
                           + +  A + A F++RH++D
Sbjct: 530 ----------------KLVTQATNGAKFLKRHMFD 548


>gi|157690983|ref|YP_001485445.1| thioredoxin [Bacillus pumilus SAFR-032]
 gi|157679741|gb|ABV60885.1| possible thioredoxin [Bacillus pumilus SAFR-032]
          Length = 687

 Score =  451 bits (1161), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 252/613 (41%), Positives = 348/613 (56%), Gaps = 55/613 (8%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           S N+  N L  E SPYLLQHAHNPV W+ WG+EAF +A++ + P+ +SIGY+TCHWCHVM
Sbjct: 2   SNNQTPNPLITEKSPYLLQHAHNPVHWYPWGQEAFDKAKRENKPVLVSIGYATCHWCHVM 61

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
             ESFED+ VA +LN+ F+SIKVDREERPD+D +YM+  Q + G GGWPL+VF++PD KP
Sbjct: 62  AHESFEDQQVADILNEHFISIKVDREERPDIDSMYMSVCQMMTGQGGWPLNVFVTPDQKP 121

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS---ASS 273
              GTYFP    YGRPGF   L ++ DA+   RD         IE L+E  + +    ++
Sbjct: 122 FYAGTYFPKRSAYGRPGFIEALTQLLDAYHNDRD--------HIESLAEKATNNLRIKAA 173

Query: 274 NKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGE 333
            +  + L Q  +     QL  S+D+  GGFG+APKFP P    M+ +  +  E TG+   
Sbjct: 174 GQTENTLTQETIHKAYYQLMSSFDTLHGGFGTAPKFPAP---HMLSFLMRYYEWTGQENA 230

Query: 334 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
                K    TL  +A GGI+DHVG GF RYS DE+W VPHFEKMLYD   L   Y +A+
Sbjct: 231 LYAVTK----TLDGIANGGIYDHVGSGFSRYSTDEKWLVPHFEKMLYDNALLMEAYTEAY 286

Query: 394 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
            LT+   Y  +   ++ +++RDM+ P G  +SA DADS   EG    KEG FYVW+  E+
Sbjct: 287 QLTQQPTYEKLVHRLIHFIKRDMMNPDGSFYSAIDADS---EG----KEGQFYVWSKDEI 339

Query: 454 EDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 512
              LGE    LF   Y++   GN +   +  PH       +    +D  AS S     L+
Sbjct: 340 MTHLGEDLGALFCAVYHITDEGNFEGENI--PH------TISTSFDDIKASFSIDDQTLQ 391

Query: 513 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 572
             L    E R  L  VR +RP P +DDKV+ SWN L+IS+ A+  ++             
Sbjct: 392 SKLQ---EARYILQSVRQQRPAPLVDDKVLTSWNALMISALAKTGRVF------------ 436

Query: 573 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 632
               D +E + +A+ A SF+  HL   Q  RL   +R G  K  GF++DYA ++   + L
Sbjct: 437 ----DAEEAIRMAKQAISFLETHLV--QHDRLMVRYREGDVKHLGFIEDYAHMLKAYMSL 490

Query: 633 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 692
           YE      WL  A  +     ELF D+E GG+F +  +  ++L+R KE +DGA PSGNS 
Sbjct: 491 YEATFELAWLEKATAIAENMFELFWDKEKGGFFFSGSDAEALLVREKEVYDGAMPSGNST 550

Query: 693 SVINLVRLASIVA 705
           ++ +L+ L+ +  
Sbjct: 551 ALKHLLILSRLTG 563


>gi|255655589|ref|ZP_05400998.1| hypothetical protein CdifQCD-2_07782 [Clostridium difficile
           QCD-23m63]
 gi|296451580|ref|ZP_06893315.1| thymidylate kinase [Clostridium difficile NAP08]
 gi|296878837|ref|ZP_06902837.1| thymidylate kinase [Clostridium difficile NAP07]
 gi|296259645|gb|EFH06505.1| thymidylate kinase [Clostridium difficile NAP08]
 gi|296430109|gb|EFH15956.1| thymidylate kinase [Clostridium difficile NAP07]
          Length = 678

 Score =  451 bits (1159), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 241/615 (39%), Positives = 350/615 (56%), Gaps = 65/615 (10%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           N+  N L  E SPYLLQHA+NP++W++W EEAF +A++ D PIFLS+GYSTCHWCHVME 
Sbjct: 4   NRKPNNLINEKSPYLLQHAYNPINWYSWNEEAFKKAKEEDKPIFLSVGYSTCHWCHVMEK 63

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE VA+++N  FV+IKVD+EERPDVD VYMT  QA+ G GGWP+++ ++PD KP  
Sbjct: 64  ESFEDEEVAEIMNRNFVAIKVDKEERPDVDSVYMTVCQAMTGSGGWPMTIIMTPDKKPFF 123

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFP   +Y RPG   +L  V + W+  RD+L +SG   IE L +      +   L  
Sbjct: 124 AGTYFPKYSRYNRPGVIDLLENVSEKWNTSRDILIKSGDEIIEALKDDFGVKNTEGDLSK 183

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASE 336
           E+  +++R+        YD ++GGFG+APKFP P  +  ++  Y  +K +D         
Sbjct: 184 EMLSSSVRV----FKAIYDEKYGGFGNAPKFPSPQNLMFLMKYYSIEKDKDV-------- 231

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
             KMV  TL  M +GG+ DH+G GF RYS D++W  PHFEKMLYD   L   +LDA+ +T
Sbjct: 232 -LKMVEKTLDGMYRGGLFDHIGFGFSRYSTDKKWLAPHFEKMLYDNAMLTIAFLDAYKIT 290

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
               Y  I    +DY+ R+M    G  +SA+DADS   EG    +EG FY +   E+ ++
Sbjct: 291 NKELYKEIAMKTIDYVVREMQDKDGGFYSAQDADS---EG----EEGKFYTFNPLEIIEV 343

Query: 457 LGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGMPLEK 513
           LGE     F  ++ +  +GN            F+GK++  LI+               E+
Sbjct: 344 LGEEDGTFFNNYFDITSSGN------------FEGKSIPNLIK-----------NKEYER 380

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
           +   +    +K+F+ R +R   H DDK++ SWN L++ +  +A   LK++          
Sbjct: 381 HNEKIDNLSKKVFEYRKERTSLHKDDKILTSWNALMVVALTKAYSTLKNDM--------- 431

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
                  Y++ +     FI  +L +E + RL   +R+G S    +LDDYAFLI   ++LY
Sbjct: 432 -------YLDYSNKCLDFINNNLVNE-SGRLLARYRDGSSDYLAYLDDYAFLIWAYIELY 483

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           E     K+L  A+ L  +  +LF D E  G++    +  +++ R K+ +DGA PSGNSV 
Sbjct: 484 ESTFNMKYLEKALNLNESCIDLFWDYEKSGFYIYGKDSENLIARPKDLYDGAIPSGNSVQ 543

Query: 694 VINLVRLASIVAGSK 708
           + NL+RLA I   +K
Sbjct: 544 LYNLIRLAKITGDNK 558


>gi|329765558|ref|ZP_08257134.1| hypothetical protein Nlim_0902 [Candidatus Nitrosoarchaeum limnia
           SFB1]
 gi|329137996|gb|EGG42256.1| hypothetical protein Nlim_0902 [Candidatus Nitrosoarchaeum limnia
           SFB1]
          Length = 675

 Score =  451 bits (1159), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 247/598 (41%), Positives = 348/598 (58%), Gaps = 49/598 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYLLQH HNPVDW+AW EE+  +A+  + PIFLS+GYS CHWCHVM  ESFE
Sbjct: 4   NRLKNETSPYLLQHTHNPVDWYAWNEESLKKAKDENKPIFLSVGYSACHWCHVMAHESFE 63

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +E VAK +N+ F++IKVDREERPD+D +Y    Q   G GGWPLSVFL+PD KP   GTY
Sbjct: 64  NEDVAKFMNENFINIKVDREERPDLDDIYQKVCQIATGQGGWPLSVFLTPDQKPFYVGTY 123

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP  D YGRPGF +I R++  AW +K   + +S    I  L +       + K+P +L +
Sbjct: 124 FPVLDSYGRPGFGSICRQLAQAWKEKSKDIEKSADKFIVALQK-----TDTVKVPSKLDK 178

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L   A  L +  D+ +GGFGSAPKFP    +  +  ++K    TG     S+  +  L
Sbjct: 179 TILDEAAMNLFQLGDAAYGGFGSAPKFPNAANVSFLFRYAKL---TG----LSKFNEFAL 231

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  MA+GGI D +GGGFHRYS D +W VPHFEKMLYD   +   Y++A+ +T+D FY 
Sbjct: 232 KTLNKMARGGIFDQIGGGFHRYSTDAKWLVPHFEKMLYDNALIPVNYVEAYQITQDPFYL 291

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            +    LD++ R+M    G  +SA DADS   EG     EG FYVW   +++ ILG+ + 
Sbjct: 292 EVLNKTLDFVLREMTAKNGGFYSAYDADS---EGI----EGKFYVWKKSDIKVILGDDSD 344

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
           LF  +Y +   GN            ++G N+L    + SA +   GMP EK   IL  C 
Sbjct: 345 LFCLYYDVTDGGN------------WEGNNILCNNINISAVSFHFGMPEEKIKKILTMCS 392

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYM 582
           +KL   RS R  P LDDK++ SWN L+I++FA+   +                +D  +Y+
Sbjct: 393 QKLLKSRSMRVAPGLDDKILTSWNALMITAFAKGYGV----------------TDDLKYL 436

Query: 583 EVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWL 642
           + A++   FI   L  +   +L  + +NG +K  G+L+DY++  + LLD++E    +K+L
Sbjct: 437 DAAKNCIHFIETTLLVDD--KLLRTSKNGITKIDGYLEDYSYFANALLDVFEVEPDSKYL 494

Query: 643 VWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
             A++L N   + F D E   +F T+     +++R K ++D + PSGNSVS   ++RL
Sbjct: 495 DLALKLGNYLVDHFWDSESSSFFMTSDNHEKLIIRPKSNYDLSLPSGNSVSCSVMLRL 552


>gi|397775180|ref|YP_006542726.1| hypothetical protein NJ7G_3432 [Natrinema sp. J7-2]
 gi|397684273|gb|AFO58650.1| hypothetical protein NJ7G_3432 [Natrinema sp. J7-2]
          Length = 732

 Score =  451 bits (1159), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 252/654 (38%), Positives = 364/654 (55%), Gaps = 50/654 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W + A   AR+RDVPIFLSIGYS CHWCHVME ESF+
Sbjct: 8   NRLDEEESPYLRQHADNPVNWQPWDDRALEAARERDVPIFLSIGYSACHWCHVMEEESFQ 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA++LN+ FV IKVDREERPD+D +YMT  Q + G GGWPLS +L+P+ +P   GTY
Sbjct: 68  DEAVAEVLNENFVPIKVDREERPDIDSIYMTVCQLVRGQGGWPLSAWLTPEGEPFFIGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRD------MLAQSGAFAIEQLSEALSASASSN-K 275
           FP E + G+PGF+ + +++ D+W+   D         Q    A ++L E   A+     +
Sbjct: 128 FPREGQRGQPGFRELCKRISDSWESDADREEMENRAQQWTDAATDRLEETPDAAGGGTVE 187

Query: 276 LPDELPQNALRLCAEQLSKSYDSRFGGFGSA-PKFPRPVEIQMMLYHSKKLEDTGKSGEA 334
            P+    + L   A+ + +S D  +GGFGS+ PKFP+P  I+++   ++  + TG+    
Sbjct: 188 APEPPSSDVLETAADAVVRSADREYGGFGSSGPKFPQPSRIRVL---ARTYDRTGR---- 240

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
            E ++++  TL  MA GG++DHVGGGFHRY VD  W VPHFEKMLYD  ++   +L  + 
Sbjct: 241 DEYREVLEETLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKMLYDNAEIPRAFLSGYQ 300

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
           LT +  Y+ +  D L ++ R++    G  FS  DA SA  E   R +EGAFYVWT  EV 
Sbjct: 301 LTGEDRYAELVADTLSFVERELTHDDGGFFSTLDAQSASPETGER-EEGAFYVWTPAEVH 359

Query: 455 DILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 512
           D+L +   A LF   Y +   GN            F+G+N    +   S  A++  +   
Sbjct: 360 DVLEDETDAALFCARYDITEAGN------------FEGRNQPNRVARVSELAAQFDLAEH 407

Query: 513 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 572
           + L  L   R++LF+ R +RPRP+ D+K++  WNGL+IS++A A+ +L            
Sbjct: 408 EILKRLASARQRLFEARQERPRPNRDEKILAGWNGLMISTYAEAALVL------------ 455

Query: 573 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 632
             G+D  +Y + A  A  F+R  L+D+   RL   +++G  K  G+L+DYAFL  G LD 
Sbjct: 456 --GAD--DYADTAVDALEFVRDELWDDDEQRLSRRYKDGDVKVDGYLEDYAFLARGALDC 511

Query: 633 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 692
           Y+       L +A+EL       F D + G  + T     +++ R +E  D + PS   V
Sbjct: 512 YQATGEVDHLAFALELARVIKAEFWDADRGTLYFTPESGEALVTRPQELSDQSTPSATGV 571

Query: 693 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 746
           +V  L+ L    A    + +   A   L     +L+  A+    +C AAD L  
Sbjct: 572 AVETLLALDEFAA----EDFEPIAATVLETHANKLETNALEHATLCLAADRLEA 621


>gi|451982157|ref|ZP_21930485.1| conserved hypothetical protein, contains Thioredoxin domain
           [Nitrospina gracilis 3/211]
 gi|451760626|emb|CCQ91765.1| conserved hypothetical protein, contains Thioredoxin domain
           [Nitrospina gracilis 3/211]
          Length = 727

 Score =  450 bits (1157), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 255/686 (37%), Positives = 375/686 (54%), Gaps = 55/686 (8%)

Query: 101 HTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVES 160
           HTNRL  E SPYLLQHAHNPVDW+ WG EA  +A++ D PIFLSIGYS+CHWCHVM  ES
Sbjct: 6   HTNRLKDETSPYLLQHAHNPVDWYPWGPEALDKAKREDKPIFLSIGYSSCHWCHVMAHES 65

Query: 161 FEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGG 220
           FE E  AKL+N+ FV+IKVDREERPD+D +YM  V AL G GGWP+SVFL+P+ +P +GG
Sbjct: 66  FESEETAKLMNELFVNIKVDREERPDIDAIYMKSVIALNGHGGWPMSVFLTPEQEPYLGG 125

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TY+PPE K+ RPGF  +L++  D +  ++D +    A  +E+L+             D L
Sbjct: 126 TYYPPEPKFNRPGFPQVLQQAADIYRNQKDRMKSVSARLMEKLTTPPPIPQGQGAGTDAL 185

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
              A+ L  E+    +D  +GGFGS  KFP P+   ++L H +K ED       ++   M
Sbjct: 186 IPQAVELMKEK----FDETYGGFGSGMKFPEPMLYTLLLRHWQKRED-------NDAILM 234

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
              +L  MA+GG++D VGGGFHRYS D +W VPHFEKMLYD   LA ++++ F  TK   
Sbjct: 235 ADKSLTKMAEGGMYDQVGGGFHRYSTDRKWLVPHFEKMLYDNALLARLFVEMFQATKQEI 294

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-E 459
           Y  I R++  Y+ R+M  P    +S++DAD       T   EG F+ WT KEV DILG  
Sbjct: 295 YERIAREVFHYIGREMTSPEWAFYSSQDAD-------TDAGEGHFFTWTMKEVLDILGPR 347

Query: 460 HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
           H+ +F   Y +  TGN            F+ +NVL         +   G+P+ +  +I+ 
Sbjct: 348 HSKVFARVYGMTATGN------------FEKRNVLHIAETMEKVSESEGVPIFEVDHIIR 395

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
             R+ L + R KR  P  DDK++  WNG++I++FA  + + +                  
Sbjct: 396 NGRQTLLESRGKRQNPGRDDKILTGWNGMMIAAFAAGAVVFRDRV--------------- 440

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
            Y + A  AA F+   ++ +   +L   +++G  +  G L+DYA+ I GLL ++E     
Sbjct: 441 -YRDHAVQAARFLWDTMWKDG--KLFRVYKDGKVRVDGCLEDYAWFIEGLLGVFEATGEG 497

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
           +W+  A  + +   + F D +  G+F T  +   ++ R+K   D A PS N V+ + L +
Sbjct: 498 EWIDKAQAVADALIDRFWDDKDNGFFMTAADQEKLITRLKNPEDEAIPSANGVAALALAK 557

Query: 700 LASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML-SVPSRKHVVLVGHK 758
           L  +      D Y +    ++  F  R++    A   +  A D + S+P    V + G +
Sbjct: 558 LGRLTG---KDAYFEKGRDTVRAFADRIEHRPTAYTSLLAAMDFIESLPM--EVTISGPE 612

Query: 759 SSVDFENMLAAAHASYDLNKTVSKKS 784
               +  +L A +A Y  +K V + S
Sbjct: 613 GDPQYGKLLEAVYADYRPDKLVVRYS 638


>gi|225848123|ref|YP_002728286.1| thymidylate kinase [Sulfurihydrogenibium azorense Az-Fu1]
 gi|225644610|gb|ACN99660.1| thymidylate kinase [Sulfurihydrogenibium azorense Az-Fu1]
          Length = 684

 Score =  450 bits (1157), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 261/659 (39%), Positives = 367/659 (55%), Gaps = 55/659 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           +K  NRL  E SPYLLQHA+NPVDW+ W +EAF +A+K D PIFLSIGYS+CHWCHVME 
Sbjct: 2   SKKPNRLINEKSPYLLQHAYNPVDWYPWCDEAFEKAKKEDKPIFLSIGYSSCHWCHVMEK 61

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE VA++LN +FV IKVDREERPD+D VYM       G GGWPL++ ++PD KP  
Sbjct: 62  ESFEDEEVAEILNKYFVPIKVDREERPDIDAVYMNVCMLFNGSGGWPLTIIMTPDKKPFF 121

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAW-DKKRDMLAQSGAFAIEQLSEALSASASSNKLP 277
            GTYFP   +  R G   +L  V   W + K D++++S     E++   L     SN   
Sbjct: 122 AGTYFPKHSRPNRIGVVDLLLSVAKYWQENKEDLISRS-----EKVLGYLKEDNKSNY-- 174

Query: 278 DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---YHSKKLEDTGKSGEA 334
            EL ++ +      L   +D+ +GGF + PKFP P  I  +L   YH+K+          
Sbjct: 175 GELKKDYIHAGFYDLKGRFDNTYGGFSNKPKFPTPHNIMFLLRYYYHTKE---------- 224

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
            E  +MV  TL  M  GGI+DHVG GFHRYS D +W +PHFEKM YDQ  L   Y + + 
Sbjct: 225 EEALQMVEKTLTNMRLGGIYDHVGFGFHRYSTDRQWLLPHFEKMHYDQAMLLMAYTETYQ 284

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
           +TK   Y    ++I++Y+ RDM    G  FSAEDADS   EG    +EG FY WT +E++
Sbjct: 285 ITKKDLYKQTVQEIIEYVIRDMTNEEGVFFSAEDADS---EG----EEGKFYTWTFQEIK 337

Query: 455 DILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 514
           DIL E + L  + + +K  GN        P     G+N++         A  LG+     
Sbjct: 338 DILKEESDLAIKIFNIKEEGNYLEEATGHP----TGRNIIYLSKTLRDYAIDLGIDENTL 393

Query: 515 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 574
              L + R+KLF  R KR  P  DDKV+  WNGL+I++ ++A K   ++           
Sbjct: 394 KQKLEQIRKKLFKEREKRVHPLKDDKVLTDWNGLMIAALSKAGKAFSNQ----------- 442

Query: 575 GSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE 634
                +Y+  A+ AA FI  ++  +   +L H +++   K  G LDDYAFL+ GL++LY+
Sbjct: 443 -----DYISYAQKAADFIIHNMIIDG--KLYHLYKDKEVKIEGMLDDYAFLVWGLIELYQ 495

Query: 635 FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
                K+L  A++L N   +   D + GG+F +  +D  +++  KE  DGA PSGNSV  
Sbjct: 496 ATGELKYLKTAVDLTNKAIQPLYDEKNGGFFLSKSQD--LIVNPKESFDGAIPSGNSVMA 553

Query: 695 INLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVV 753
            NL RL  I A  + ++Y+++ E +L  F   +K +     +   A  M   P+ + V+
Sbjct: 554 YNLYRLYLITA--QEEFYKKSYE-TLTAFAGDIKRLPSYHTMFLIALMMHFFPTSEIVI 609


>gi|407465214|ref|YP_006776096.1| hypothetical protein NSED_06780 [Candidatus Nitrosopumilus sp. AR2]
 gi|407048402|gb|AFS83154.1| hypothetical protein NSED_06780 [Candidatus Nitrosopumilus sp. AR2]
          Length = 675

 Score =  450 bits (1157), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 246/598 (41%), Positives = 349/598 (58%), Gaps = 49/598 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N LA+E SPYLLQH +NPVDW+ W +E+  +A+  + PIFLSIGYS+CHWCHVM  ESFE
Sbjct: 4   NHLASETSPYLLQHVNNPVDWYGWNDESLKKAKDENKPIFLSIGYSSCHWCHVMAHESFE 63

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +E VAK +N+ F++IKVDREERPD+D +Y    Q   G GGWPLSVFL+PD KP   GTY
Sbjct: 64  NEDVAKFMNENFINIKVDREERPDIDDIYQKVCQIATGQGGWPLSVFLTPDQKPFYVGTY 123

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP  D YGRPGF +I R++  AW +K + +  S    I+ L++     A + ++P +L +
Sbjct: 124 FPVLDSYGRPGFGSICRQLSQAWKEKPNDIETSAKRFIDALTK-----AEAIQVPSKLER 178

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L   A  L +  D+ +GGFGSAPKFP    I   L+   KL    K  E        L
Sbjct: 179 ILLDEAAMNLFQLGDATYGGFGSAPKFPNAANIS-FLFRYAKLSGLTKFNE------FAL 231

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL+ MA GGI D +GGGF RYS D +W VPHFEKMLYD   ++  Y +AF +TKD FY 
Sbjct: 232 KTLKKMANGGIFDQIGGGFSRYSTDAKWLVPHFEKMLYDNALISVNYAEAFQITKDPFYL 291

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            + R  LD++ R+M  P G  +SA DADS   EG     EG +YVW   E+++ILG+ A 
Sbjct: 292 EVLRKTLDFVLREMTSPEGGFYSAYDADS---EGV----EGKYYVWKKSEIKEILGDDAD 344

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
           LF  +Y +   GN            ++G N+L    + S  A   G+   +   I+  C 
Sbjct: 345 LFCLYYDVTDGGN------------WEGNNILCNNLNISTVAFNFGISETEVKKIINLCS 392

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYM 582
           +KL  VRS R  P LDDK++VSWN L+I++ A+  ++                +    Y+
Sbjct: 393 KKLLKVRSSRIPPGLDDKILVSWNSLMITALAKGYRV----------------TGDILYL 436

Query: 583 EVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWL 642
             A++  SFI  +L      +L  +++NG +K  G+L+DY++ I+ LLD++E     K+L
Sbjct: 437 NAAKNCISFIENNLL--VNDKLLRTYKNGTAKIDGYLEDYSYFINALLDVFEIEPDEKYL 494

Query: 643 VWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
             +++L +     F D +   +F T+ +   +++R K ++D + PSGNSVS   L+RL
Sbjct: 495 KLSLKLAHHLVNHFWDSKNNNFFMTSDDHEKLIIRPKSNYDLSLPSGNSVSAFALLRL 552


>gi|404493392|ref|YP_006717498.1| thioredoxin domain-containing protein YyaL [Pelobacter carbinolicus
           DSM 2380]
 gi|77545446|gb|ABA89008.1| thioredoxin domain protein YyaL [Pelobacter carbinolicus DSM 2380]
          Length = 711

 Score =  450 bits (1157), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 262/679 (38%), Positives = 372/679 (54%), Gaps = 52/679 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYLLQHA NPVDW  WG++AF  AR+++ P+ +SIGYSTCHWCHVME ESFE
Sbjct: 31  NRLIFESSPYLLQHATNPVDWHPWGQQAFDLAREQNKPVLVSIGYSTCHWCHVMEQESFE 90

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  VA++LN  F+ IKVDREERPD+D +YMT  Q + GGGGWPL+VFL+PD  P    TY
Sbjct: 91  DREVAEVLNKLFIPIKVDREERPDIDNLYMTACQLVTGGGGWPLNVFLTPDKAPFYAATY 150

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
            P   +   PG   IL K+   W   RD L Q+G    E L   +   +S+  +   L +
Sbjct: 151 MPRRPRGQMPGIIAILTKIGAMWQSDRDQLLQTGREIGETL---IRLESSAAPVASSLTE 207

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L    E+   ++D   GGFG APKFP P  + ++ + +++       G+ +  + M +
Sbjct: 208 APLTEAFERFKANFDHERGGFGKAPKFPMPHNLSLLFHIAQRF------GQET-AEAMAI 260

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TLQ +  GG++DH+G G HRYSVD  W VPHFEKMLYDQ  +    LDA+ +T D F+ 
Sbjct: 261 KTLQHIRLGGMYDHIGFGMHRYSVDAFWRVPHFEKMLYDQALVTLAALDAYQVTHDTFFE 320

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHA 461
            +    + Y+ RD+  P G   S EDAD   TEGA    EG FY+WT ++VE++LG + A
Sbjct: 321 SLADQTMSYVLRDLSLPEGGFCSGEDAD---TEGA----EGTFYLWTPQQVEEVLGHQQA 373

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
            +F   Y +   GN            F+G N+     D    A   G   ++   +L + 
Sbjct: 374 TIFCTCYEISEAGN------------FEGSNIPRLEMDLKEWAQWFGTDTDELGAVLEDG 421

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           RRKL   R  R RPH DDKV+V+WNGL I++ AR ++++                   EY
Sbjct: 422 RRKLLQARKLRVRPHRDDKVLVAWNGLAIAAMARTARLIG----------------HPEY 465

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
           +E A  AA FI  ++ +E+   L+   R   +  P FL+DYA LI GL++LY+ G   ++
Sbjct: 466 LEGATRAADFILSNMRNEEGRLLRRWRRG-QAGIPAFLEDYAALILGLIELYQAGFNARY 524

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           L  A++L     E F     G Y++T  +   VL+R +  HDGA  SGNS++ + L+RL 
Sbjct: 525 LAEAVQLGRDMQERF-GTPDGVYYDTGTDAEEVLVRKRTLHDGAMISGNSMAAMALLRLG 583

Query: 702 SIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSV 761
           S+   +      ++AE  L     +  D   A   +  A D L++  R+ +V+   K   
Sbjct: 584 SL---TGEPALEEHAEKILLASSKQWTDAPTASGQLLMALD-LALSQREVLVIAAPKDDP 639

Query: 762 DFENMLAAAHASYDLNKTV 780
           +   M+ AAH  +  N  +
Sbjct: 640 EGTRMVKAAHTGFRPNLII 658


>gi|338733047|ref|YP_004671520.1| hypothetical protein SNE_A11520 [Simkania negevensis Z]
 gi|336482430|emb|CCB89029.1| uncharacterized protein yyaL [Simkania negevensis Z]
          Length = 676

 Score =  449 bits (1156), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 268/689 (38%), Positives = 371/689 (53%), Gaps = 78/689 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYLLQHAHNPVDW+ WG+EAF  A+K D PIFLSIGY+TCHWCHVM  ESF 
Sbjct: 5   NRLIKEKSPYLLQHAHNPVDWYPWGDEAFEAAKKLDKPIFLSIGYATCHWCHVMSRESFA 64

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPLSVFLSPDLKPLMGGT 221
           +  +A L+N+ F+++KVDREE P++D +YM + QAL   G GWPL++ L+P+LKP    T
Sbjct: 65  NSEIATLMNETFINVKVDREELPEIDSLYMEFAQALMASGSGWPLNLILTPELKPFYATT 124

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDK-KRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           Y PP  +    G K ++  +K  W   +R++L       ++    A S      +LP+E 
Sbjct: 125 YMPPTTRQELMGIKELVSHIKQLWKSAERELLLDQAEKLVDLF--ARSVQTRGEELPNE- 181

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
               L    EQ  ++ D  +GG   APKFP   +I   L H+++  D       S     
Sbjct: 182 --EHLDAAVEQFYEAVDPVYGGIKGAPKFPLGYQILFFLEHARREHD-------SRSLFF 232

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
              TL  M +GGI+D VGGGF RYSVDE+W +PHFEKMLYD   +A  +LDA+ LTK   
Sbjct: 233 AELTLSMMHRGGIYDQVGGGFSRYSVDEKWIIPHFEKMLYDNALMALAFLDAWKLTKKPL 292

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
           Y  +C +ILDYL RDM   GG  +SAED   AET+G    +EGA+Y W ++E++ +L   
Sbjct: 293 YRQVCEEILDYLLRDMQHQGGGFYSAED---AETDG----EEGAYYTWHAQEIQKLLPPA 345

Query: 461 AI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
            + LF E++ + P+GN            F GKNVL         A   G+        L 
Sbjct: 346 DLDLFCEYFDVTPSGN------------FGGKNVLYRTMTIQEFAELRGLDPLMIQTRLD 393

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
            C   LFD R  R RP  DDK++V+WN + I  F +A +  ++EA               
Sbjct: 394 SCLNLLFDARKGRKRPFKDDKILVTWNAMAIDVFIKAGRAFQNEA--------------- 438

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
            Y++   +AASFIR++L+  +  +L+  FR G +   G LDDYA+LI  L+ L E   G 
Sbjct: 439 -YLKSGLAAASFIRQNLW--KGGKLKRRFREGQTDYEGGLDDYAYLIRALITLSEADLGN 495

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
            WL WA+EL +  ++ F   EG   F  TG + S+LLR  E  D A+PSGN++   NL+R
Sbjct: 496 VWLQWALELADFLEKEFKADEGA--FYQTGPEYSILLRRPELFDSAQPSGNAIHAENLIR 553

Query: 700 LASI---------------VAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC-----C 739
           L+ +               VA S  + Y Q A + L   +  L   A+ + +        
Sbjct: 554 LSQLTQNRELRIQAEDILKVATSYIETYPQGACYHLIALQHYLDKEALTIVVALDEKESL 613

Query: 740 AADMLSVPSRK----HVVLVGHKSSVDFE 764
             ++L V S +    HVV     S  +FE
Sbjct: 614 KEEILEVLSTEFIPHHVVFWKRHSDKEFE 642


>gi|423083522|ref|ZP_17072052.1| hypothetical protein HMPREF1122_03047 [Clostridium difficile
           002-P50-2011]
 gi|423088427|ref|ZP_17076810.1| hypothetical protein HMPREF1123_03965 [Clostridium difficile
           050-P50-2011]
 gi|357542999|gb|EHJ25034.1| hypothetical protein HMPREF1123_03965 [Clostridium difficile
           050-P50-2011]
 gi|357544282|gb|EHJ26286.1| hypothetical protein HMPREF1122_03047 [Clostridium difficile
           002-P50-2011]
          Length = 678

 Score =  449 bits (1156), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 241/615 (39%), Positives = 350/615 (56%), Gaps = 65/615 (10%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           N+  N L  E SPYLLQHA+NP++W++W +EAF +A++ D PIFLS+GYSTCHWCHVME 
Sbjct: 4   NRKPNNLINEKSPYLLQHAYNPINWYSWNDEAFKKAKEEDKPIFLSVGYSTCHWCHVMEK 63

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE VA+++N  FV+IKVD+EERPDVD VYMT  QA+ G GGWP+++ ++PD KP  
Sbjct: 64  ESFEDEEVAEIMNRNFVAIKVDKEERPDVDSVYMTVCQAMTGSGGWPMTIIMTPDKKPFF 123

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFP   +Y RPG   +L  V + W+  RD+L +SG   I+ L +      +   L  
Sbjct: 124 AGTYFPKYSRYNRPGVIDLLENVSEKWNTSRDILIKSGDEIIKALKDDFDVKNTEGDLSK 183

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASE 336
           E+  +++R+        YD ++GGFG+APKFP P  +  ++  Y  +K +D         
Sbjct: 184 EMLSSSVRV----FKAIYDEKYGGFGNAPKFPSPQNLMFLMKYYSIEKDKDV-------- 231

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
             KMV  TL  M +GG+ DH+G GF RYS D++W  PHFEKMLYD   L   +LDA+ +T
Sbjct: 232 -LKMVEKTLDGMYRGGLFDHIGFGFSRYSTDKKWLAPHFEKMLYDNAMLTIAFLDAYKIT 290

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
           K   Y  I    +DY+ R+M    G  +SA+DADS   EG    +EG FY++   E+ ++
Sbjct: 291 KKELYKEIAIKTIDYVVREMKDKDGGFYSAQDADS---EG----EEGKFYIFNPLEIIEV 343

Query: 457 LGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGMPLEK 513
           LGE     F  ++ +  +GN            F+GK++  LI+               E+
Sbjct: 344 LGEEDGTFFNNYFDITSSGN------------FEGKSIPNLIK-----------NKEYER 380

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
           +   + +   K+F+ R +R   H DDK++ SWN L+I +  +A   L+++          
Sbjct: 381 HNEKIADLSEKVFEYRKERTSLHKDDKILTSWNALMIVALTKAYSTLENDI--------- 431

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
                  Y+E +     FI  +L +E + RL   +R+G S    +LDDYAFLI   ++LY
Sbjct: 432 -------YLEYSNKCLDFINNNLVNE-SGRLLARYRDGSSDYLAYLDDYAFLIWAYIELY 483

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           E     K+L  A+ L      LF D E  G++    +  +++ R K+ +DGA PSGNSV 
Sbjct: 484 ESTFNMKYLEKALNLNENCINLFWDYEKSGFYIYGKDSENLIARPKDLYDGAIPSGNSVQ 543

Query: 694 VINLVRLASIVAGSK 708
           + NL+RLA I   S+
Sbjct: 544 LYNLIRLAKITGDSR 558


>gi|440631885|gb|ELR01804.1| hypothetical protein GMDG_00904 [Geomyces destructans 20631-21]
          Length = 918

 Score =  449 bits (1156), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 259/647 (40%), Positives = 367/647 (56%), Gaps = 37/647 (5%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NR A   SPY+  H +NPV W  +G+EA   A++ +  +F+SIGYS CHWCHVME ESFE
Sbjct: 51  NRAAESRSPYVRGHMNNPVAWQLFGDEAIKLAKRENKLLFISIGYSACHWCHVMEKESFE 110

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           ++ VA +LN  F+ IK+DREERPD+D++YM +VQA  G GGWPL+VF++P L+P+ GGTY
Sbjct: 111 NDEVAAILNKDFIPIKIDREERPDIDRIYMNFVQATTGSGGWPLNVFVTPTLEPVFGGTY 170

Query: 223 F-------PPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS--EALSASASS 273
           +       P  +      F  IL K+  AW ++        A  ++QL    A      +
Sbjct: 171 WHGPHSNTPQLELEDHVDFLRILGKLSQAWREQESRCRLDSAQILQQLKVFAAEGTLGGA 230

Query: 274 NKLPDELPQNALRL-----CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---YHSKKL 325
            K   E P   L L       + L  ++D+   GF +APKFP P ++  +L   +  + +
Sbjct: 231 PKTGAEPPAGGLDLDIIDEAYQHLVSTFDTTNSGFSAAPKFPTPSKLAFLLRLPHFPQPV 290

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
            D   + E    Q M L TL+ MA+GGIHDH+G GF RYSV   W +PHFEKMLYD  QL
Sbjct: 291 LDVVGAEEVKSAQFMALSTLRAMARGGIHDHIGHGFSRYSVTADWSLPHFEKMLYDNAQL 350

Query: 386 ANVYLDAF-SLTK-DVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSAETEGATRKKE 442
            ++YLDAF  L K D     +  D+  YL    I  PGG  +S++DADS   +G    +E
Sbjct: 351 LSLYLDAFLGLPKPDPELLGVVYDLAAYLLSPPIAAPGGGFYSSQDADSFYRKGDKETRE 410

Query: 443 GAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 501
           GA+YVWT++E+E +L   A  +    + + P GN   S   D H+EF  +NVL   +  S
Sbjct: 411 GAYYVWTARELETLLPAGAYDIVAAFFGVNPDGNVAPSH--DVHDEFINQNVLRIASTPS 468

Query: 502 ASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFARASKIL 560
             AS+ G+   + +  +   +R L   R ++R  P+LDDK++ +WNG+ I + AR    L
Sbjct: 469 QLASQFGIAESEVVETIKSAKRTLLAHREAERVVPNLDDKIVCAWNGIAIGALARTGASL 528

Query: 561 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 620
           + E ++ M       S+R   ++ A  AA F+RR +YDE    L+  +R GP +  GF D
Sbjct: 529 R-EVDAQM-------SER--CLDAAIRAARFMRREMYDEDAKTLRRVWRGGPGETAGFAD 578

Query: 621 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 680
           DYAFL+ GLL+LYE     +W+ WA ELQ TQ+  FLD    G+F T    P  +LR+K+
Sbjct: 579 DYAFLVEGLLELYEATFADEWVRWADELQATQNSHFLDPTASGFFATAAAAPHTILRLKD 638

Query: 681 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 727
             D +EPS N VS  NL RLAS++     D Y   A+ ++  FE  +
Sbjct: 639 GMDASEPSTNGVSASNLFRLASLLG---DDKYEALAKETVGAFEAEI 682


>gi|373488750|ref|ZP_09579414.1| protein of unknown function DUF255 [Holophaga foetida DSM 6591]
 gi|372005695|gb|EHP06331.1| protein of unknown function DUF255 [Holophaga foetida DSM 6591]
          Length = 660

 Score =  449 bits (1155), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 265/601 (44%), Positives = 347/601 (57%), Gaps = 69/601 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL    SPYLLQHAHNPVDW  WG EA   AR+ D+PIFLS+GYS CHWCHVME ESFE
Sbjct: 3   NRLIEATSPYLLQHAHNPVDWHPWGPEALNLARELDLPIFLSVGYSACHWCHVMERESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +  VA  LN  FV IKVDREERPD+D++YM  VQ L G GGWP+SV+L+P+L+P  GGTY
Sbjct: 63  NADVAAFLNKHFVPIKVDREERPDLDELYMGAVQLLAGRGGWPMSVWLTPELEPFYGGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKR-DMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           FPP  + G PGF  +L  V   W ++R D+LAQ+G     +L  AL A       P    
Sbjct: 123 FPPVSRGGMPGFLDVLEGVARVWQERRQDVLAQAG-----ELVAALRAGRGIGGDPPG-- 175

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           +  L +    LS S+D+R+GGFG APKFP    + ++L                +   M 
Sbjct: 176 EGLLEVAIRHLSYSFDARWGGFGGAPKFPPIPALTLLLGRGD-----------PKALDMA 224

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
           + TL  MA GGI DH+GGGF RYSVDERW VPHFEKML D  QLA VYL+AF +T +V +
Sbjct: 225 IRTLDAMAAGGIRDHLGGGFARYSVDERWKVPHFEKMLCDNAQLAWVYLEAFRVTGEVRH 284

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHA 461
               R+ILDY   +M    G  FS+EDADS   EG    +EG FY ++  EV+++LG  A
Sbjct: 285 GERAREILDYFLGEMRDASGGFFSSEDADS---EG----EEGRFYTFSWGEVQEVLGPGA 337

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
            LF   Y + P GN +            G+++L  +       S+L +            
Sbjct: 338 DLFCRAYGVTPEGNFE-----------GGRSLLHRMEVGDFPESELAI-----------L 375

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           R ++   R +R RPH DDK++V+WNGL +S+ A+ S +L                    Y
Sbjct: 376 RERIRLYRDRRVRPHRDDKILVAWNGLALSALAKGSALLGE----------------PRY 419

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
           +E AE+ A F++R L+ + T  L  ++R G    PGFL+DY  LI GLLDLY+ G  ++W
Sbjct: 420 LEAAEACADFLQRELWRDGT--LLRTWRQGRGHTPGFLEDYGALILGLLDLYQTGFHSRW 477

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           L WA EL     E F + E GG+F T   D  V+LR     D A PSGN+++ + L+RL 
Sbjct: 478 LHWAQELGEALLERFHEAE-GGFFGTEALD--VILRQCPVFDHAIPSGNALAALALLRLG 534

Query: 702 S 702
           +
Sbjct: 535 N 535


>gi|189218169|ref|YP_001938811.1| Highly conserved protein containing a thioredoxin domain
           [Methylacidiphilum infernorum V4]
 gi|189185027|gb|ACD82212.1| Highly conserved protein containing a thioredoxin domain
           [Methylacidiphilum infernorum V4]
          Length = 724

 Score =  449 bits (1155), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 258/663 (38%), Positives = 371/663 (55%), Gaps = 34/663 (5%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L  E SPYLLQHA NPVDW  W EE+  +A+  D PIFLS+GYSTCHWCHVM  ESFE
Sbjct: 2   NALCKEKSPYLLQHADNPVDWHPWTEESLLKAKHLDRPIFLSVGYSTCHWCHVMAKESFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +  VA+LLN +F+ IKVDREERPD+D+ YM +VQA  G GGWP++V+L+P+L+P  GGTY
Sbjct: 62  NPIVAQLLNSFFIPIKVDREERPDIDQFYMEFVQAFTGQGGWPMNVWLTPNLEPFFGGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP E K+G+PGF  IL+K+ + W   R +L Q G     ++ E + +S      P+    
Sbjct: 122 FPLESKWGKPGFVDILKKIAELWQYNRSLLEQQGQEIFHKMREVIQSSFEPKSPPNL--A 179

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
            A R   EQL  S+D   GGF  +PKFPRP  +   L+ +  L D  +  +    Q M L
Sbjct: 180 IASRKAVEQLWGSFDRTHGGFSPSPKFPRP-SLFYFLFRAGSLADFSEDYKKKSLQ-MAL 237

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
           ++LQ M+ GGIHD + GGFHRYSVDE+W +PHFEKMLYDQ  L   YLDA+  T D  + 
Sbjct: 238 YSLQKMSGGGIHDQLEGGFHRYSVDEKWRLPHFEKMLYDQATLGLSYLDAYQATDDPLFK 297

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE----VEDILG 458
                +++YL   +  P G  +SAEDADS    G  +++EGA+Y+WT +E    +E I+G
Sbjct: 298 DTFESLVEYLLSHLHHPSGGFYSAEDADSLNASG--QEEEGAYYLWTFQELQQTLEPIVG 355

Query: 459 EHAILFKEHYY-LKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
           +       H++     GN     +S+       KN+L+     S  A +LG+ LE+   I
Sbjct: 356 KDRSKILAHFFGATEQGNLPGGLISE--EALAKKNILLMEKPLSDLAHELGISLEEAREI 413

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
           + + +  L   R KR +P LDDK+I +WNG  +S+ A+A              + V+G  
Sbjct: 414 VLKAKEGLKKERLKRSKPFLDDKIICAWNGYTLSALAKA--------------YMVIGDG 459

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGS 637
           R   +  A+  A+F+  +L+D  +  L   +RNG    PGF  DYA L   +L L+E   
Sbjct: 460 R--LINEAKKTATFLLENLWDPSSKTLYRIYRNG-RGTPGFSSDYASLALSMLHLFEADQ 516

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
             KWL  A   Q   +E F+D     Y     E  +  ++ +E++DGAEP+  S++  +L
Sbjct: 517 DEKWLSLAKLFQELLEEKFVDPYRHNYMVEAVEISAKSIQTREEYDGAEPATLSLAAHSL 576

Query: 698 VRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGH 757
           ++L ++    K   +R+  E   +     L+    A+P +         P  + ++LVG 
Sbjct: 577 LKLYTLTGEEK---WRKRLEELFSYAWPILERFPTALPYLLGVYCEYRAPLVE-IILVGE 632

Query: 758 KSS 760
           K +
Sbjct: 633 KKN 635


>gi|448343975|ref|ZP_21532892.1| hypothetical protein C486_20033 [Natrinema gari JCM 14663]
 gi|445622058|gb|ELY75523.1| hypothetical protein C486_20033 [Natrinema gari JCM 14663]
          Length = 732

 Score =  448 bits (1153), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 249/654 (38%), Positives = 364/654 (55%), Gaps = 50/654 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W + A   AR+RDVP+FLSIGYS CHWCHVME ESF+
Sbjct: 8   NRLDEEESPYLRQHADNPVNWQPWDDRALEAARERDVPVFLSIGYSACHWCHVMEAESFQ 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA++LN+ FV IKVDREERPD+D +YMT  Q + G GGWPLS +L+P+ +P   GTY
Sbjct: 68  DEAVAEVLNENFVPIKVDREERPDIDSIYMTVCQLVRGQGGWPLSAWLTPEGEPFFIGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRD------MLAQSGAFAIEQLSEALSASASSN-K 275
           FP E + G+PGF+ + +++ D+W+   D         Q    A ++L E   A+     +
Sbjct: 128 FPREGQRGQPGFRELCKRISDSWESDADREEMENRAQQWTDAATDRLEETPDAAGGGTVE 187

Query: 276 LPDELPQNALRLCAEQLSKSYDSRFGGFGSA-PKFPRPVEIQMMLYHSKKLEDTGKSGEA 334
            P+    + L   A+ + +S D  +GGFGS+ PKFP+P  I+++   ++  + TG+    
Sbjct: 188 APEPPSSDVLETAADAVVRSADREYGGFGSSGPKFPQPSRIRVL---ARTYDRTGR---- 240

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
            E ++++  TL  MA GG++DHVGGGFHRY VD  W VPHFEKMLYD  ++   +L  + 
Sbjct: 241 DEYREVLEETLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKMLYDNAEIPRAFLSGYQ 300

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
           LT +  Y+ +  D L ++ R++    G  FS  DA SA  E   R +EGAFYVWT  EV 
Sbjct: 301 LTGEDRYAELVADTLSFVERELTHDDGGFFSTLDAQSASPETGER-EEGAFYVWTPAEVH 359

Query: 455 DILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 512
           D+L +   A LF   + +   GN            F+G+N    +   S  A++  +   
Sbjct: 360 DVLEDETDAALFCARFDITEAGN------------FEGRNQPNRVARVSELAAQFDLAEH 407

Query: 513 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 572
           + L  L   R++LF+ R +RPRP+ D+K++  WNGL+IS++A A+ +L            
Sbjct: 408 EILKRLASARQRLFEARQERPRPNRDEKILAGWNGLMISTYAEAALVL------------ 455

Query: 573 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 632
             G+D  +Y + A  A  F+R  L+D+   RL   +++G  K  G+L+DYAFL  G LD 
Sbjct: 456 --GAD--DYADTAVDALEFVRDELWDDDEQRLSRRYKDGDVKVDGYLEDYAFLARGALDC 511

Query: 633 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 692
           Y+       L +A+EL    +  F D + G  + T     +++ R +E  D + PS   V
Sbjct: 512 YQATGEVDHLAFALELARVIEAEFWDADRGTLYFTPESGEALVTRPQELGDQSTPSATGV 571

Query: 693 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 746
           +V  L+ L    A    + +   A   L     +L+  A+    +C  AD L  
Sbjct: 572 AVETLLALDEFAA----EDFEPIAATVLETHANKLETNALEHATLCLVADRLEA 621


>gi|430745763|ref|YP_007204892.1| thioredoxin domain-containing protein [Singulisphaera acidiphila
           DSM 18658]
 gi|430017483|gb|AGA29197.1| thioredoxin domain protein [Singulisphaera acidiphila DSM 18658]
          Length = 811

 Score =  448 bits (1153), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 274/673 (40%), Positives = 365/673 (54%), Gaps = 60/673 (8%)

Query: 83  VVAMAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIF 142
           + A+A    A          NRLA E SPYLL HAHNPVDW+ WG EAFA+A+    PIF
Sbjct: 21  LAALASGPEAKADPEPKAPANRLAKETSPYLLLHAHNPVDWYPWGPEAFAKAKAEKKPIF 80

Query: 143 LSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGG 202
           LSIGYS+C+WCHVME E F+D  +AKL+N  FV IKVDREERPD+D++YM  +QA +G G
Sbjct: 81  LSIGYSSCYWCHVMERECFKDPQIAKLMNQKFVCIKVDREERPDIDQIYMAALQA-FGNG 139

Query: 203 GWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQ 262
           GWP+S+FL+PD +P  GGTYFPP+D+ G  GF T+L  V DAW  ++  + +S     + 
Sbjct: 140 GWPMSMFLTPDGRPFFGGTYFPPKDRNGIRGFPTVLAGVADAWRDEKAQIEESADRLTDL 199

Query: 263 LSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFG------SAPKFPRPVEIQ 316
           +  +L+ S      P  L +       E+L++ +D  +GGFG        PKFP PV + 
Sbjct: 200 VRRSLAKSNDKRHAP--LTRAVAAQGREELTEQFDPEYGGFGFNPENARRPKFPEPVNLV 257

Query: 317 MMLYHSKKLEDTGKSGEASEGQK-------MVLFTLQCMAKGGIHDHVGGGFHRYSVDER 369
            +L   ++    GK     EGQ+       MVL TL  MA+GGI D + GG+HRY+    
Sbjct: 258 FLLDEHRRGAAAGK----KEGQEASSNALAMVLKTLDQMARGGIRDQLAGGYHRYATSRY 313

Query: 370 WHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDA 429
           W VPHFEKMLYD  QLA+ +L AF LT D  +         ++ R M  P G  +SA D 
Sbjct: 314 WIVPHFEKMLYDNAQLASTHLLAFELTADPRWRLEAESTFAFIARSMTSPEGGFYSAID- 372

Query: 430 DSAETEGATRKKEGAFYVWTSKEVEDILGEHAI--LFKEHYYLKPTGNCDLSRMSDPHNE 487
             AET+G     EG +YVWT  EVE  LG       F + Y LK   N +          
Sbjct: 373 --AETDG----DEGQYYVWTRDEVEKTLGAGPDYEAFAQVYGLKREPNFE---------- 416

Query: 488 FKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNG 547
            K + VL+E    +  A+ L          +   R KL  VR +RP P LDDKV+ SWNG
Sbjct: 417 -KERYVLLEPRSRADQAATLKTTPAALEATMAPLRAKLLAVRERRPAPLLDDKVLTSWNG 475

Query: 548 LVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS 607
           L+I+++A   +IL                   +Y + A+ AA FI   L      RL  S
Sbjct: 476 LMIAAYADGFRILHD----------------AKYRQAADKAADFILAKLRSPD-GRLLRS 518

Query: 608 FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 667
           +R G +K  G+L+DYAFL+ GLL L+      K L  A EL +     F D E GG+F T
Sbjct: 519 YRLGQAKLAGYLEDYAFLVHGLLRLHAATGDPKRLTQARELTDRMIADFSDPEEGGFFYT 578

Query: 668 TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 727
                S+L R K+ +DGA PSGNSV++ NLV LAS    ++   Y   A+ +L  F + L
Sbjct: 579 ADGHESLLARPKDPYDGALPSGNSVAIRNLVALASATGEAR---YLDQAQKALDAFSSTL 635

Query: 728 KDMAMAVPLMCCA 740
                ++PL+  A
Sbjct: 636 AQNPGSLPLLVVA 648


>gi|320102044|ref|YP_004177635.1| hypothetical protein Isop_0491 [Isosphaera pallida ATCC 43644]
 gi|319749326|gb|ADV61086.1| protein of unknown function DUF255 [Isosphaera pallida ATCC 43644]
          Length = 723

 Score =  448 bits (1153), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 266/689 (38%), Positives = 383/689 (55%), Gaps = 81/689 (11%)

Query: 93  STSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHW 152
           S   + ++  NRLA E SPYLLQHAHNPVDWF WGEEAFA+A+  + PIFLS+GYS CHW
Sbjct: 6   SGFQATSRPANRLARETSPYLLQHAHNPVDWFPWGEEAFAKAKAENKPIFLSVGYSACHW 65

Query: 153 CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQAL-YGGGGWPLSVFLS 211
           CHVME ESFE   +A L+N WFV+IKVDREERPD+D++YM  VQAL  G GGWP+SVF++
Sbjct: 66  CHVMERESFESPTIAALMNQWFVNIKVDREERPDIDQIYMAAVQALNQGHGGWPMSVFMT 125

Query: 212 PDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE------ 265
           P+ +P  GGTY+PP D  G PGF  IL  +  AW ++   + ++ A  +E L +      
Sbjct: 126 PEGEPFFGGTYYPPHDARGMPGFPRILEGLATAWREREPEVREAAARLVEHLRKRNEPMP 185

Query: 266 ------ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML 319
                 AL   A+ ++  D L    +   A  L + +DSR+GGFGSAPKFP P++++++L
Sbjct: 186 PLIKGPALDHPAADDR--DGLDPGWIAEAARALGRVFDSRYGGFGSAPKFPHPMDLKLLL 243

Query: 320 YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 379
            H ++++D            MV+ TL  M++GGI+DH+GGGF RY+ DERW VPHFEKML
Sbjct: 244 RHHQRVQD-------PRALAMVIQTLDHMSRGGIYDHLGGGFARYATDERWLVPHFEKML 296

Query: 380 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGP--GGEIFSAEDADSAETEGA 437
           YD   L +   +      D   + +  + LDYL   M GP      F+ EDADS   EG 
Sbjct: 297 YDNALLISALAETIQCRPDPTLARVVVETLDYLAERMTGPPEAPGFFATEDADS---EGV 353

Query: 438 TRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 496
               EG +YVW+  E+ + LGE    LF E Y +   GN            ++G ++L  
Sbjct: 354 ----EGKYYVWSRDEMLETLGEPLGSLFAEVYDVTEAGN------------WEGHSILNL 397

Query: 497 LNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARA 556
                  A +LG P ++    L + R  L   R +R  P  D K++ SWNGL++++ A A
Sbjct: 398 PEPLDRVAQRLGRPTDQLAAELAQARALLKARRDRRIPPGKDTKILTSWNGLMLAAIAEA 457

Query: 557 SKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP 616
           + ++                DR +++E AE AA F+  HL  +   RL H F++G ++  
Sbjct: 458 AWVV----------------DRPDHLERAEKAAGFLLDHLR-QPDGRLFHVFKDGRARFN 500

Query: 617 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR--EGGGYFNTTG-EDPS 673
           G+L+DYA+LI GL  L +    T+W+  A +L     E F D   +G G F  TG    +
Sbjct: 501 GYLEDYAYLIDGLTRLGQVTGTTRWIREARDLSRLMIEEFGDEVIDGVGGFAFTGVRHET 560

Query: 674 VLLRVKEDHDGAEPSGNSVSVINLVRLASI----------VAGSKS-----DYYRQNAEH 718
           ++ R ++  D A PS  +++V  L+RLA++          +AG ++      +    A  
Sbjct: 561 LVARPRDLFDNATPSAAAMAVTALLRLAALTDDQALRGRGLAGLRALAPLMKHAPTAAAQ 620

Query: 719 SLAVFETRLKD--MAMAVPLMCCAADMLS 745
           SL   +  L+D  +A+ VP     +D L+
Sbjct: 621 SLIALDFALRDPEIALVVPGQLDPSDTLA 649


>gi|383458464|ref|YP_005372453.1| hypothetical protein COCOR_06500 [Corallococcus coralloides DSM
           2259]
 gi|380730954|gb|AFE06956.1| hypothetical protein COCOR_06500 [Corallococcus coralloides DSM
           2259]
          Length = 696

 Score =  448 bits (1152), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 262/681 (38%), Positives = 368/681 (54%), Gaps = 53/681 (7%)

Query: 96  HSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHV 155
           H  + HTNRLA E SPYL QHA NPVDW+ WG+EA A AR  + PI LS+GYS CHWCHV
Sbjct: 4   HPPSGHTNRLAQEPSPYLRQHATNPVDWYPWGDEALARARAENKPILLSVGYSACHWCHV 63

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           M  ESFE   +A+L+N+ F++IKVDREERPD+D++Y   VQ +  GGGWPL+VFL+PDL+
Sbjct: 64  MAHESFEHPDIARLMNEGFINIKVDREERPDLDQIYQGVVQLMGQGGGWPLTVFLTPDLR 123

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           P  GGTYFPP D+YGRPGF  +L  ++DAW+ K D + +      E L E   ++   + 
Sbjct: 124 PFYGGTYFPPSDRYGRPGFPRLLTALRDAWENKADEIEEQAKRFQEGLGEL--STHGLDA 181

Query: 276 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
            P  L    +    + + K  D   GGFG APKFP P+ + ++L   ++       G   
Sbjct: 182 APAHLSAEDIVAMGQSMLKRMDPVNGGFGGAPKFPNPMNVALLLRAWRR-------GGGE 234

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
             +  V  TL+ MA GGI+D +GGGFHRYSVDERW VPHFEKMLYD  QL ++Y +A  +
Sbjct: 235 PLKAAVFRTLERMALGGIYDQLGGGFHRYSVDERWLVPHFEKMLYDNAQLLHLYSEAEQV 294

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
                +  +  + ++Y+RR+M  P G  ++ +DADS   EG    +EG F+VW  +EV  
Sbjct: 295 ESRPLWRKVVEETVEYVRREMTDPAGGFYATQDADS---EG----EEGKFFVWHPEEVRA 347

Query: 456 IL--GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
            L  G+ A     H+ +KP GN +            G  VL  +      A + G P+E 
Sbjct: 348 ALSVGQQADTVLRHFGIKPGGNFE-----------HGATVLEVVVPVEQLAKEQGRPVEA 396

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
               L E RR LF +R +R +P  DDK++  WNGL+I   A AS++              
Sbjct: 397 VEKELAEARRVLFLLREQRVKPGRDDKILAGWNGLMIRGLALASRVF------------- 443

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
              DR ++ ++A  AA F+   ++D +  RL  S+++G  +  GFL+DY    SGL  LY
Sbjct: 444 ---DRPDWAKLAADAADFVLAKMWDGK--RLLRSYQHGQGRIDGFLEDYGDFASGLTALY 498

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           +     K+L  A  L +   ELF D E   Y +       +++      D A PSG S  
Sbjct: 499 QATFDAKYLDAADALAHRAVELFWDEEKQAYLSAPRGQKDLVVAAFSLFDNAFPSGASTL 558

Query: 694 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVV 753
               V L+++   +    +    EH +A    +L    M    +  AAD L V     V 
Sbjct: 559 TEAQVTLSAL---TGDVCHLDQPEHYVAKLHDQLVRNPMGYGHLGLAADSL-VDGASGVT 614

Query: 754 LVGHKSSVDFENMLAAAHASY 774
             G + +V    +LAAA+ +Y
Sbjct: 615 FAGTREAV--APLLAAANRTY 633


>gi|410462713|ref|ZP_11316275.1| thioredoxin domain containing protein [Desulfovibrio magneticus
           str. Maddingley MBC34]
 gi|409984165|gb|EKO40492.1| thioredoxin domain containing protein [Desulfovibrio magneticus
           str. Maddingley MBC34]
          Length = 697

 Score =  447 bits (1151), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 272/678 (40%), Positives = 369/678 (54%), Gaps = 45/678 (6%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           N+  NRL  E SPYLLQHAHNPVDWF WGEEAFA+AR  D P+ LSIGYSTCHWCHVME 
Sbjct: 3   NRAPNRLIREKSPYLLQHAHNPVDWFPWGEEAFAKARAEDKPVLLSIGYSTCHWCHVMER 62

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE +A L+N   VSIKVDREERPD+D +YM+   AL G GGWPL+VFL+PD +P  
Sbjct: 63  ESFEDEDIAALMNAVAVSIKVDREERPDLDTLYMSVCHALTGRGGWPLTVFLTPDKEPFF 122

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKL-P 277
            GTYFP E  YGR G + +L++V  +W   R  +  +    ++ + E L+A+A +    P
Sbjct: 123 AGTYFPKESAYGRTGLRELLQRVHMSWKGNRQAVVNNAGQIMDAVREQLTAAAGAASAEP 182

Query: 278 DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEG 337
            E   +A R    QLS  +D+R GGFG APKFP P  +  +L   +      ++G+AS  
Sbjct: 183 GEAVLDAAR---AQLSGIFDARNGGFGGAPKFPSPHNLLFLLREYR------RTGDAS-C 232

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
           + MV  TL  M +GG++DHVG G HRY+ D +W +PHFEKMLYDQ       ++A+  + 
Sbjct: 233 RDMVCRTLDAMRRGGVYDHVGFGLHRYATDAQWFLPHFEKMLYDQALTVMACVEAYQASG 292

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           D  +  +  +IL+Y+RRD+  P G   SAEDADS   EG     EG FYVW++ E+  +L
Sbjct: 293 DAAHKTMALEILEYVRRDLTSPEGLFHSAEDADS---EGV----EGKFYVWSAAELRRLL 345

Query: 458 GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
           G+ A L          GN       +   E  G N+L        +A++LG+ +E     
Sbjct: 346 GDEAALVMAAMGATEEGNAH----DEATGETTGSNILHLPRPLDETAAQLGLTVEALTTR 401

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
           L ECRR L   R KR RP  DDKV+   NGL++++ A+A++    E  +           
Sbjct: 402 LEECRRILLVEREKRVRPLCDDKVLTDNNGLMLAALAKAARAFDDEELAG---------- 451

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGS 637
               +  AES  + + R        RL H  R+G +   GFLDDY FL  GL++LY+   
Sbjct: 452 --RAVTAAESLLTRLTR-----PNGRLLHRLRDGEAAIDGFLDDYVFLAWGLVELYQTVF 504

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
            T +L  A+ L     + F D   GG+F T  +   +L+R K   D A PSGNSV+   L
Sbjct: 505 DTAYLHRAVALLRAVADHFADPAEGGFFVTPDDGEQLLVRQKVFFDAAVPSGNSVAYFVL 564

Query: 698 VRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA-ADMLSVPSRKHVVLVG 756
             L  +   +    +++ A         RL D A       C  + +L  PS   V L G
Sbjct: 565 TTLFRL---TGDPVFKEQATALARAMAPRLADHAAGHAFFLCGLSQVLGKPS--EVTLAG 619

Query: 757 HKSSVDFENMLAAAHASY 774
             +  D + +  A    Y
Sbjct: 620 DPAGPDTQALARAVFGRY 637


>gi|386875180|ref|ZP_10117368.1| lanthionine synthetase C-like protein, partial [Candidatus
           Nitrosopumilus salaria BD31]
 gi|386807022|gb|EIJ66453.1| lanthionine synthetase C-like protein, partial [Candidatus
           Nitrosopumilus salaria BD31]
          Length = 539

 Score =  447 bits (1150), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 245/585 (41%), Positives = 339/585 (57%), Gaps = 49/585 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L  E SPYLLQHAHNPVDW+AW +EA  +A+  + PIFLSIGYS+CHWCHVM  ESFE
Sbjct: 4   NNLIHETSPYLLQHAHNPVDWYAWNDEALKKAKDENKPIFLSIGYSSCHWCHVMAHESFE 63

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           ++ VAK +N+ FV+IKVDREERPD+D +Y    Q   G GGWPLS+FL+PD KP   GTY
Sbjct: 64  NDEVAKFMNENFVNIKVDREERPDIDDIYQKVCQIATGQGGWPLSIFLTPDQKPFYVGTY 123

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP  D YGRPGF +I R++  AW +K   + +S     E    AL  + + +  P +L +
Sbjct: 124 FPVLDSYGRPGFGSICRQLSQAWKEKPKDIEKSA----ENFLNALHKTETVHT-PSKLEK 178

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L   A  L +  D+ +GGFGSAPKFP    I  +  ++   E TG     S+  +  L
Sbjct: 179 IILDEAAMNLFQLGDATYGGFGSAPKFPNAANISFLFRYA---ELTG----LSKFNEFAL 231

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  MAKGGI D +GGGFHRYS D +W VPHFEKMLYD   +   Y++A+ +TKD FY 
Sbjct: 232 KTLNKMAKGGIFDQIGGGFHRYSTDAKWLVPHFEKMLYDNALIPVNYVEAYQITKDPFYL 291

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            + +  LD++ R+M  P G  +SA DADS   EG     EG FYVW   E+++ILG  A 
Sbjct: 292 EVLQKTLDFVLREMTTPEGGFYSAYDADS---EGV----EGKFYVWKKSEIKEILGSDAD 344

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
           +F   Y +   GN            ++G  +L    + S  A   G   ++  +IL  C 
Sbjct: 345 IFCLFYDVTDGGN------------WEGNTILCNNLNISTVAFNFGKSEQEIHDILNSCA 392

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYM 582
            KL  VRS R  P LDDK++VSWN L+I++FA+               + V G  R  Y+
Sbjct: 393 EKLLKVRSTRISPGLDDKILVSWNSLMITAFAKG--------------YRVTGDQR--YL 436

Query: 583 EVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWL 642
             A+   SFI ++L      +LQ +++N  +K  G+L+DY++ I+ LLD++E  S  K+L
Sbjct: 437 SAAKDCISFIEKNLL--VGEKLQRTYKNNTAKIDGYLEDYSYFINALLDVFEIESDQKYL 494

Query: 643 VWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 687
             ++ L N   E F D +   +F T+     +++R K ++D + P
Sbjct: 495 QLSLNLANYLLEHFWDSDANSFFMTSDNHEKLIIRPKSNYDLSLP 539


>gi|429217838|ref|YP_007179482.1| thioredoxin domain-containing protein [Deinococcus peraridilitoris
           DSM 19664]
 gi|429128701|gb|AFZ65716.1| thioredoxin domain protein [Deinococcus peraridilitoris DSM 19664]
          Length = 677

 Score =  447 bits (1150), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 265/667 (39%), Positives = 371/667 (55%), Gaps = 51/667 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL+ E SPYLLQH  NPVDWF WG EAF +A   + PI LSIGYSTCHWCHVM  ESFE
Sbjct: 2   NRLSHETSPYLLQHQDNPVDWFPWGPEAFQKALNENKPILLSIGYSTCHWCHVMAHESFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA  +N  FV+IKVDREERPDVD VYM+ VQA  G GGWP++VFL    +P   GTY
Sbjct: 62  DETVAGFMNTHFVNIKVDREERPDVDAVYMSAVQATTGSGGWPMTVFLDAQGRPFYAGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPP D +G P F  +L  V  AW+ +R  L Q+     E L++ L  SA   +  + LP 
Sbjct: 122 FPPRDAHGMPSFSRVLAGVAQAWNGRRQDLMQNA----ETLTQHLQ-SAGRREGSEALPA 176

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
           +       Q+ K +D+R GGFGSAPKFP P  +  +L                + + + L
Sbjct: 177 DFTARGLAQVRKLFDARHGGFGSAPKFPAPTTLAYLLTQ-------------PQARDISL 223

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TLQ MA GG++D +GGGFHRYSVDERW VPHFEKMLYD  QLA VYL A+ LT +  ++
Sbjct: 224 TTLQKMAAGGLYDQLGGGFHRYSVDERWLVPHFEKMLYDNAQLARVYLQAYQLTGEASFT 283

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
              R+ L+YL R+M+ P G  +SA+DADS   EG     EG F+VWT +E++ ILG+ A 
Sbjct: 284 QFARETLEYLEREMLSPEGGFYSAQDADS---EGI----EGKFFVWTPQELQAILGDDAA 336

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHN-EFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
           L    + +   GN       DPH+ +F  ++VL  +   +  A + G+        L   
Sbjct: 337 LAARFWGVTAEGN-----FMDPHHPDFGRRSVLSVVASPTELAEQFGLSEPDVRRRLEAA 391

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           RR+L++ R  R  P  D KV+ SWNGL + +FA A+++L+ E                 +
Sbjct: 392 RRRLWEERELRVHPGTDTKVLTSWNGLALGAFALAARVLREE----------------RF 435

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
           ++VA   A F+R HL  E    L+HS+++G ++  G L+D+A    GL++LY+       
Sbjct: 436 LDVARRNADFVRSHLRSEDA-TLRHSYKDGQARVQGLLEDHALYALGLIELYQASGHLPH 494

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           L WA EL N     F D+EGG +++T+    +++ R K+  D A  S N+ + +  + + 
Sbjct: 495 LEWARELWNVVATEFWDQEGGAFWSTSARAETLITRQKDAFDSAVMSDNAAAALLGLWMG 554

Query: 702 SIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSV 761
                 + +   + A  ++  F   +         +  A  +L+ P  +  VL   ++  
Sbjct: 555 RYYGDPRGE---ELATRTIGTFAADMLAAPSGFGGLWQAHALLTAPHVEVAVLGSSQARA 611

Query: 762 DFENMLA 768
            FE  LA
Sbjct: 612 PFEAELA 618


>gi|335427892|ref|ZP_08554812.1| hypothetical protein HLPCO_03015 [Haloplasma contractile SSD-17B]
 gi|334893818|gb|EGM32027.1| hypothetical protein HLPCO_03015 [Haloplasma contractile SSD-17B]
          Length = 682

 Score =  447 bits (1149), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 252/663 (38%), Positives = 371/663 (55%), Gaps = 61/663 (9%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           S   +TN+LA E SPYLLQHA+NPVDW+ W +EAF++AR+ D PIFLSIGYSTCHWCHVM
Sbjct: 2   SGQNYTNKLANEKSPYLLQHANNPVDWYPWCDEAFSKAREEDKPIFLSIGYSTCHWCHVM 61

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
           E ESFEDE +++LLN  F+SIKVDREERPD+D +YM   QAL G GGWPL++ ++ D KP
Sbjct: 62  ERESFEDEEISELLNKDFISIKVDREERPDIDHIYMEVCQALTGRGGWPLTIVMTADKKP 121

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASAS-SNK 275
              GTYFP      + G   +L  +   W   +D +  S     + L++      S   K
Sbjct: 122 FYAGTYFPKTTVGKQLGLTQLLPTITKQWKSNKDKILDSATEIYDVLNKYREEQESVRGK 181

Query: 276 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
           L  ++ +N  +     L  ++D+ +GGFG+APKFP P  +  +L++       G      
Sbjct: 182 LSLDVVENLFK----NLRGAFDNLYGGFGTAPKFPSPHNLLFLLHY-------GYINNNQ 230

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
           +   MV  TL+ M KGGI+DH+G GF RYSVD +W VPHFEKMLYD   L   Y++A+ L
Sbjct: 231 DAVFMVERTLEQMYKGGIYDHIGYGFSRYSVDRKWLVPHFEKMLYDNALLTLAYIEAYQL 290

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
             D  Y  +  + L+Y+ R M    G  ++AEDADS   EG    +EG FY +T  E+++
Sbjct: 291 KNDPLYKQVVEETLEYVSRVMTDKEGGFYTAEDADS---EG----EEGKFYTFTKNEIKE 343

Query: 456 ILG-EHAILFKEHYYLKPTGNCDLSRMSD-PHNEFKGKNVLIELNDSSASASKLGMPLEK 513
           +L  E A    E+Y +   GN + + + +  H ++      ++L+D              
Sbjct: 344 LLDKEDATFIIEYYNISEEGNFERTNILNLIHKDY------LDLDDKERER--------- 388

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
               L + + +LF+ R KR  PH DDK++ SWN ++I+++ARA ++L ++A         
Sbjct: 389 ----LNKIKERLFNYRDKRVHPHKDDKILTSWNAMMITAYARAGRVLNNDA--------- 435

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
                  Y+  A+    FI  HL DE   R+Q  +R+G +K  G++DDYA+L   L++L+
Sbjct: 436 -------YINKAKQGVQFISDHLIDENG-RIQARYRDGEAKFKGYIDDYAYLNWALIELF 487

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
              S   ++  A++L +   ELF D E  G++    +   +L+R KE +DGA PSGNS++
Sbjct: 488 LGTSDQTYIHQALKLTDDMIELFWDDEKDGFYYYGNDSEYLLMRNKEIYDGAIPSGNSIA 547

Query: 694 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVV 753
            +N ++L+ I    K   Y + A      F  ++K    +   M       S P  K VV
Sbjct: 548 TMNFIKLSEITDEIK---YEKYARKLFDAFAYKVKQSPSSHSYMLNTYLHASHPKTK-VV 603

Query: 754 LVG 756
           +VG
Sbjct: 604 IVG 606


>gi|442323509|ref|YP_007363530.1| hypothetical protein MYSTI_06573 [Myxococcus stipitatus DSM 14675]
 gi|441491151|gb|AGC47846.1| hypothetical protein MYSTI_06573 [Myxococcus stipitatus DSM 14675]
          Length = 697

 Score =  446 bits (1147), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 248/605 (40%), Positives = 339/605 (56%), Gaps = 46/605 (7%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           +NRLA E SPYL QHA NPVDWFAWG+EA A AR  D PI LS+GYS CHWCHVM  ESF
Sbjct: 11  SNRLAREPSPYLRQHASNPVDWFAWGDEALARARAEDKPILLSVGYSACHWCHVMAHESF 70

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           E    A+L+N+ F++IKVDREERPD+D++Y   VQ +  GGGWPL+VFL+PDLKP  GGT
Sbjct: 71  ESPDTARLMNEGFINIKVDREERPDLDQIYQGVVQLMGQGGGWPLTVFLTPDLKPFYGGT 130

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFPPED+YGRPGF  +L  ++DAW  KR+ + +  A   E L E   A+   +  P  L 
Sbjct: 131 YFPPEDRYGRPGFPRLLMALRDAWKNKREDIHRQAAQFEEGLGEL--AAYGLDAAPGVLS 188

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
              +    ++++   DS  GGFG APKFP P+   ++L   ++       G     +  V
Sbjct: 189 VEDVLSMGQRMALQVDSVHGGFGGAPKFPNPMNFSLLLRAWRR-------GGGDSLRDAV 241

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             TL+ MA GGI+D +GGGFHRYSVD RW VPHFEKMLYD  QL ++Y +A  +     +
Sbjct: 242 FLTLERMALGGIYDQLGGGFHRYSVDARWLVPHFEKMLYDNAQLMHLYSEAQQVAPRPLW 301

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL-GEH 460
             +  + ++Y+RR+M   GG  ++A+DADS   EG    +EG F+VW  +E++ +L  E 
Sbjct: 302 RKVVEETVEYVRREMTDAGGGFYAAQDADS---EG----EEGKFFVWRPEEIQAVLPPER 354

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
           A L   H+ + P GN +            G  VL  +  +   A +  + LE     L E
Sbjct: 355 AELVMRHFRVTPLGNFE-----------HGATVLEVVVPAETLARERSLSLEAVERELAE 403

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R+ LF  R +R +P  DDK++  WNGL+I   A A+++                 DR +
Sbjct: 404 TRQVLFQARERRVKPGRDDKILAGWNGLMIRGLALAARVF----------------DRPD 447

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           +  +A SAA F+   L+D    RL  S++ G ++  GFL+DY  L SGL  LY+     K
Sbjct: 448 WTRLAVSAADFVLAKLWD--GTRLARSYQEGQARIDGFLEDYGDLASGLTALYQATFDVK 505

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
           +L  A  L    +ELF D E   Y         +++      D A PSG S      V L
Sbjct: 506 YLEAAKALVKRAEELFWDAEKQAYLTAPRGQKDLVVATYGLFDNAFPSGASTLTEAQVAL 565

Query: 701 ASIVA 705
           A++  
Sbjct: 566 AALTG 570


>gi|15805870|ref|NP_294568.1| hypothetical protein DR_0844 [Deinococcus radiodurans R1]
 gi|6458560|gb|AAF10421.1|AE001938_7 conserved hypothetical protein [Deinococcus radiodurans R1]
          Length = 690

 Score =  446 bits (1147), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 249/594 (41%), Positives = 335/594 (56%), Gaps = 46/594 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYLLQH  NPVDW+ W  EAFAEAR+RDVP+ LS+GYSTCHWCHVM  ESFE
Sbjct: 17  NRLAQESSPYLLQHQDNPVDWWPWSPEAFAEARQRDVPVLLSVGYSTCHWCHVMAHESFE 76

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +E  A  +N  FV+IKVDREERPDVD VYM   QAL G GGWP++VFL+PD +P   GTY
Sbjct: 77  NERTAAFMNAHFVNIKVDREERPDVDAVYMAATQALTGQGGWPMTVFLTPDAEPFYAGTY 136

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPP++  G P F  +L  + D W  +RD    +     + L+E +  ++   +   ELP 
Sbjct: 137 FPPQEGMGMPSFMRVLASIDDVWQNRRDQALGNA----QALTEHVRGASQPTRREGELPG 192

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
            AL    E  ++ YD++FGGFG APKFP P  +  +L                +G++M L
Sbjct: 193 GALARAVENAARLYDAQFGGFGRAPKFPAPSTLDFLLTQ-------------PQGREMAL 239

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL+ M  GGI+D +GGGFHRYSVD +W VPHFEKMLYD  QL    L A+ LT +  ++
Sbjct: 240 HTLRMMGAGGIYDQLGGGFHRYSVDAQWLVPHFEKMLYDNAQLVRTLLRAYQLTGEDDFA 299

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            + R+ L YL R+M+ P G  +SA+DAD+    G     EG  + WT  E+  +LGE A 
Sbjct: 300 RLARETLAYLEREMLAPDGGFYSAQDADTPTEHGGV---EGLTFTWTPDEIRAVLGEDAD 356

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKG-KNVLIELNDSSASASKLGMPLEKYLNILGEC 521
           L    + +   GN       DPH    G +NVL       A A +LG   +     L   
Sbjct: 357 LALRSFNVTAQGN-----FRDPHQPAYGSRNVLHTPTPLPALARELG---DDAAQRLQAA 408

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           R KLF  R  RP+PH DDKV+ SWNGLV+++ A A++IL  E                +Y
Sbjct: 409 RAKLFAARQVRPQPHTDDKVLTSWNGLVLAALADAARILGEE----------------KY 452

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
           +++A   A F+ R L       L+H+F++G +   G L+D+A    GL+ L++ G     
Sbjct: 453 LDLARRNADFVHRELR-LPGGTLRHTFKDGRASVEGLLEDHALYGLGLVALFQAGGDLAH 511

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
           L WA EL N     F D   G ++++ G   ++L R     D A  S N+ + +
Sbjct: 512 LHWARELWNIVRRDFWDEGAGVFYSSGGHAETLLTRQASFFDSAILSDNAAAAL 565


>gi|433591712|ref|YP_007281208.1| thioredoxin domain protein [Natrinema pellirubrum DSM 15624]
 gi|448334040|ref|ZP_21523224.1| hypothetical protein C488_11564 [Natrinema pellirubrum DSM 15624]
 gi|433306492|gb|AGB32304.1| thioredoxin domain protein [Natrinema pellirubrum DSM 15624]
 gi|445620768|gb|ELY74256.1| hypothetical protein C488_11564 [Natrinema pellirubrum DSM 15624]
          Length = 731

 Score =  446 bits (1146), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 251/649 (38%), Positives = 360/649 (55%), Gaps = 49/649 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W E+A A A++RDVPIFLSIGYS CHWCHVME ESF 
Sbjct: 8   NRLDEEESPYLRQHADNPVNWQPWDEQALAAAKERDVPIFLSIGYSACHWCHVMEEESFA 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA++LN+ FV IKVDREERPDVD +YMT  Q + G GGWPLS +L+P+ KP   GTY
Sbjct: 68  DEAVAEILNENFVPIKVDREERPDVDSIYMTVCQLVRGQGGWPLSAWLTPEGKPFFIGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRD------MLAQSGAFAIEQLSEALSASASSNKL 276
           FP + + G+PGF  + +++ D+W+ + D         Q    A ++L E   ++     +
Sbjct: 128 FPRDGERGQPGFPDLCQRISDSWESEEDREEMQHRAQQWTDAAKDRLEETPDSAGVDAGV 187

Query: 277 PDELP-QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
             E P  + L   A+ + +S D ++GGFG+  KFP+P  ++++   ++  + TG+     
Sbjct: 188 AAEPPSSDVLETAADAVLRSADRQYGGFGTGQKFPQPSRLRVL---ARTYDRTGR----E 240

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
           E ++++  TL  MA GG+ DHVGGGFHRY VD  W VPHFEKMLYD  ++   +L  + L
Sbjct: 241 EYREVLEETLDAMAAGGLADHVGGGFHRYCVDRDWTVPHFEKMLYDNAEIPRAFLAGYQL 300

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           T +  Y+    D L ++ R++    G  FS  DA S + E   R +EGAFYVWT +EV D
Sbjct: 301 TGEDRYAETVADTLAFVDRELTHDEGGFFSTLDAQSEDPETGER-EEGAFYVWTPEEVHD 359

Query: 456 ILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
           ++ +   A LF   Y +  +GN            F+G+N    +   S  AS+  +   +
Sbjct: 360 VIADETDASLFCARYDITESGN------------FEGQNQPNRIARVSELASQFDLAESE 407

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
            L  L   R++LF+ R +RPRP  D+K++  WNGL+IS++A A+ +L             
Sbjct: 408 VLKRLDSARKRLFEAREERPRPDRDEKILAGWNGLMISTYAEAALVL------------- 454

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
            G D  EY E A  A  F+R  L+D ++ RL   ++ G  K  G+L+DYAFL  G LD Y
Sbjct: 455 -GED--EYAETAVDALEFVRDRLWDTESQRLSRRYKAGDVKVDGYLEDYAFLARGALDCY 511

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           +       L +A+EL    +  F D + G  + T     S++ R +E  D + PS   V+
Sbjct: 512 QATGDVDHLAFALELARVIEAEFWDADRGTLYFTPESGESLVTRPQELGDQSTPSSTGVA 571

Query: 694 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 742
           V  L+ L         D + + A   L      L+  A+    +C  AD
Sbjct: 572 VETLLALDEFA----DDDFSEIAATVLETHANELEANALEHATLCIGAD 616


>gi|150016393|ref|YP_001308647.1| hypothetical protein Cbei_1515 [Clostridium beijerinckii NCIMB
           8052]
 gi|149902858|gb|ABR33691.1| protein of unknown function DUF255 [Clostridium beijerinckii NCIMB
           8052]
          Length = 680

 Score =  445 bits (1145), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 251/607 (41%), Positives = 342/607 (56%), Gaps = 63/607 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L  E SPYLLQHA+NP++W++WG+EAFA+A++ D PIFLSIGYSTCHWCHVM  ESFE
Sbjct: 8   NNLINEKSPYLLQHANNPINWYSWGDEAFAKAKEEDKPIFLSIGYSTCHWCHVMAHESFE 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE +A ++ND F++IKVDREERPD+D VYMT  QAL G GGWPL+V ++PD KP   GTY
Sbjct: 68  DEEIAGIMNDSFIAIKVDREERPDIDSVYMTVCQALTGHGGWPLTVIMTPDQKPFFAGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP + KY  PG   IL  +   W   +D L  SG   + +L        S  KL  +  +
Sbjct: 128 FPKKAKYNMPGLMDILNSINKQWKDNKDKLISSGDSILSELGGYFDGETSKLKLTSKTLK 187

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
           N       Q+  +++ ++GGFG APKFP P  I M L    K     K+ E +E      
Sbjct: 188 NGYN----QILHAFEEKYGGFGDAPKFPTP-HITMFLLRYYKSHKEIKALEMAEK----- 237

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  M +GGI DH+G GF RYS D +W VPHFEKMLYD   L   YL+ + +TK+  Y 
Sbjct: 238 -TLISMYRGGIFDHIGFGFSRYSTDNKWLVPHFEKMLYDNALLVISYLEGYEVTKNEIYK 296

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-A 461
            +   +L+Y+ R++    G  + AEDADS   EG    +EG +YV+   E+  +LGE   
Sbjct: 297 EVATKVLEYVFRELTSKNGGFYCAEDADS---EG----EEGKYYVFEPLEILSVLGEEDG 349

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGMPLEKYLNILG 519
             F +++ +   GN            F+GK++  LI+  +   S  ++ +  E+ L    
Sbjct: 350 TYFNDYFDITSDGN------------FEGKSIPNLIKNKNFHKSDDRIKLLSEQILQ--- 394

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
                    RS R   H DDK++ SWNGL+I++  +A K+++ E                
Sbjct: 395 --------YRSDRTELHKDDKILTSWNGLMIAALGKAYKVIEDE---------------- 430

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
            Y E A+ A  FI  +L DE   RL   +R+  S+   +LDDYAFL  GL++LYE     
Sbjct: 431 RYFEYAKKAVEFIFNNLMDEN-KRLLARYRDKDSRHKAYLDDYAFLCFGLIELYESSYDI 489

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL-RVKEDHDGAEPSGNSVSVINLV 698
           ++L  AIE+      LF D E  G+F   GED   L+ R KE  DGA PSGNSV+  NL+
Sbjct: 490 EFLNKAIEINKDMINLFWDNEKDGFF-LYGEDSEKLIARPKELFDGAMPSGNSVAAYNLI 548

Query: 699 RLASIVA 705
           +LA +  
Sbjct: 549 KLARLTG 555


>gi|67517751|ref|XP_658661.1| hypothetical protein AN1057.2 [Aspergillus nidulans FGSC A4]
 gi|40747019|gb|EAA66175.1| hypothetical protein AN1057.2 [Aspergillus nidulans FGSC A4]
 gi|259488639|tpe|CBF88239.1| TPA: DUF255 domain protein (AFU_orthologue; AFUA_1G12370)
           [Aspergillus nidulans FGSC A4]
          Length = 774

 Score =  445 bits (1145), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 266/650 (40%), Positives = 360/650 (55%), Gaps = 37/650 (5%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K  NRL A  SPY+  H HNPV W  W  E+   AR+ +  IFLSIGYS CHWCHVME E
Sbjct: 18  KLVNRLEASKSPYVRAHRHNPVAWQLWDAESMELARRHNRLIFLSIGYSACHWCHVMEKE 77

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SF  + VA +LN+ F+ IKVDREERPDVD +YM YVQA  G GGWPL+VFL+PDL+P+ G
Sbjct: 78  SFMSQEVASILNESFIPIKVDREERPDVDDIYMNYVQATTGSGGWPLNVFLTPDLEPVFG 137

Query: 220 GTYFPPEDKYGRPG-----FKTILRKVKDAWDKKRDMLAQSGAFAIEQL---SEALSASA 271
           GTY+P  +     G     F  IL K++D W  +R    +S     +QL   +E  + + 
Sbjct: 138 GTYWPGPNAASLLGPETVSFIEILEKLRDVWQTQRQRCLESAKEITKQLREFAEEGTHTF 197

Query: 272 SSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---YHSKKLEDT 328
             ++  ++L    L    +  +  YD   GGF  APKFP P  +  +L    +   + D 
Sbjct: 198 QGDQSDEDLDVELLEEAYQHFASRYDINNGGFSRAPKFPTPANLSFLLRLGIYPSAVTDI 257

Query: 329 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 388
               E      M + TL  MA+GGI DH+G GF RYSV   W +PHFEKMLYDQ QL +V
Sbjct: 258 VGQEECENATAMAVSTLISMARGGIRDHIGHGFARYSVTADWSLPHFEKMLYDQAQLLDV 317

Query: 389 YLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSAETEGATRKKEGAFYV 447
           Y DAF +T +  +     D++ YL    I    G   S+EDADS  T   T K+EGAFYV
Sbjct: 318 YADAFKITHNPEFLGAVYDLITYLTSAPIQSTTGGFHSSEDADSLPTPNDTEKREGAFYV 377

Query: 448 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 506
           WT KE+  +LG   A +   H+ +   GN  ++  +DPH+EF  +NVL      S  A +
Sbjct: 378 WTLKELTQVLGPRDAGVCARHWGVLSDGN--IAPENDPHDEFMDQNVLSIKVTPSKLAKE 435

Query: 507 LGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 565
            G+  ++ + I+   R++L + R K R RP LDDK+IV+WNGL I + A+ S +L  E +
Sbjct: 436 FGLGEDEVVRIIKSGRQRLREYRDKNRVRPDLDDKIIVAWNGLAIGALAKCS-VLFEEID 494

Query: 566 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP-SKAPGFLDDYAF 624
           S         S   +  E A  A +FI+  LYD+ T +L   +R+G     PGF +DYAF
Sbjct: 495 S---------SKSAQCREAAAKAINFIKETLYDKATGQLWRIYRDGSKGTTPGFAEDYAF 545

Query: 625 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG---GGYFNTTGE----DPSVLLR 677
           L SGLLD+YE      +L +A +LQ   +E FL   G    GY+ T        P+ LLR
Sbjct: 546 LTSGLLDMYEATFDDSYLQFAEQLQRYLNENFLAYAGSSPAGYYTTPSTSAPGSPATLLR 605

Query: 678 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 727
           +K   + A PS N V   NL+RL+SI+   + + YR  A  +   F   +
Sbjct: 606 LKTGTESAVPSVNGVIARNLLRLSSIL---EENSYRVLARQTCQSFAVEI 652


>gi|328950404|ref|YP_004367739.1| hypothetical protein Marky_0883 [Marinithermus hydrothermalis DSM
           14884]
 gi|328450728|gb|AEB11629.1| protein of unknown function DUF255 [Marinithermus hydrothermalis
           DSM 14884]
          Length = 667

 Score =  445 bits (1144), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 257/601 (42%), Positives = 342/601 (56%), Gaps = 54/601 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL+ E SPYLLQHA NPVDW+ WGEEAFA A++   PIFLS+GY+TCHWCHVM  ESFE
Sbjct: 3   NRLSREASPYLLQHAENPVDWYPWGEEAFARAQQEGKPIFLSVGYATCHWCHVMARESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  VA+LLN  FV +KVDREERPDVD  YM  +QAL G GGWP+S+FL+P+ KP  GGTY
Sbjct: 63  DPEVARLLNAHFVPVKVDREERPDVDHAYMQALQALTGQGGWPMSLFLTPEGKPFYGGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPP D+YG P F+ +L  V +AW K+R+ +    A   +++++AL  +     LP +L  
Sbjct: 123 FPPTDRYGLPSFRRVLEAVAEAWTKRRNEIETHAAALAQRIAQAL--TNRPGDLPPQLHA 180

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
            AL    E   +++D + GGFG APKFP    ++ +L  +         GEA+ G+ M+ 
Sbjct: 181 KAL----EAYRQAFDPQHGGFGGAPKFPNAPALRYLLLQAWL-------GEAAAGE-MLR 228

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  M  GG++D VGGGFHRY+VD  W VPHFEKMLYD  QLA VYL AF L  D  Y 
Sbjct: 229 VTLDRMQAGGVYDQVGGGFHRYAVDAVWRVPHFEKMLYDNAQLARVYLGAFRLFGDARYR 288

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
              R+ LDYL R+M    G  ++A+D   AE+EG    +EG +YVW   E+  +LG    
Sbjct: 289 RTARETLDYLLREMQDAAGGFYAAQD---AESEG----EEGRYYVWRIPELRAVLGADFE 341

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
               ++ +   GN            ++GKN+L         A +LG+    +   L   +
Sbjct: 342 AAARYFGVSDAGN------------WEGKNILEARYPEPLLAQELGLDAAGFEAWLASVK 389

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYM 582
            +L + R +R RP  DDK++  WNGL +++FA A + L              G  R  Y+
Sbjct: 390 ARLLEARLRRVRPLTDDKILADWNGLALAAFAEAGRWL--------------GEAR--YL 433

Query: 583 EVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWL 642
           E A   A F+   LY  Q   L+H++R G      +L D A    GLL L+E     +WL
Sbjct: 434 EAARKNAEFVLGALY--QDGLLRHAWRRGRLGRHAYLSDQAHYGLGLLALFEATGEMRWL 491

Query: 643 VWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLAS 702
             A  L     E F D E GG+F+    +P  L R K+  DGA PSGN+ +   LVRLA 
Sbjct: 492 EAARVLAEGILEHFRDPE-GGFFDALEANP--LGRPKDVFDGAWPSGNAAAAELLVRLAR 548

Query: 703 I 703
           +
Sbjct: 549 L 549


>gi|225571461|ref|ZP_03780457.1| hypothetical protein CLOHYLEM_07559 [Clostridium hylemonae DSM
           15053]
 gi|225159937|gb|EEG72556.1| hypothetical protein CLOHYLEM_07559 [Clostridium hylemonae DSM
           15053]
          Length = 669

 Score =  444 bits (1143), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 260/666 (39%), Positives = 360/666 (54%), Gaps = 71/666 (10%)

Query: 98  RNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVME 157
           R   +N L  E SPYLLQH+ NPVDW+ W EEAF  A + D PIFLSIGYSTCHWCHVM 
Sbjct: 11  RTVMSNHLKNESSPYLLQHSENPVDWYPWCEEAFERAGREDKPIFLSIGYSTCHWCHVMA 70

Query: 158 VESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPL 217
            ESFED+  A +LN+ F+SIKVDREERPD+D VYM+  QAL G GGWP+S+F++ + KP 
Sbjct: 71  HESFEDKRTADILNENFISIKVDREERPDIDSVYMSVCQALTGSGGWPMSIFMTAEQKPF 130

Query: 218 MGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAI------EQLSEALSASA 271
              TY PP+++YG  GF+ +L ++   W  K+  L +S    +      E+ ++  +   
Sbjct: 131 YAATYIPPDNRYGMKGFRELLLEISGHWKYKKSELLESAEQILDHIDTKEERAKKKTLKR 190

Query: 272 SSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKS 331
                   LP+ A    AE  ++++D ++GGFG+APKFP P  +  ++ +S  L+D G S
Sbjct: 191 VGAGTDTTLPERA----AELFAQAFDEKYGGFGAAPKFPTPHNLLFLMIYS-SLQDAGMS 245

Query: 332 GEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLD 391
            EA +       TL+ M +GGI DH+G GF RYS D  + VPHFEKMLYD   L   Y  
Sbjct: 246 YEAEK-------TLEQMRRGGIFDHIGYGFSRYSTDRFYLVPHFEKMLYDNALLMIAYSA 298

Query: 392 AFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSK 451
           A+ ++    +        +Y+ R+M GP GE +SA+DADS   EG    +EG +YVW  +
Sbjct: 299 AYKVSGKTMFLETAEKTAEYILREMTGPDGEFYSAQDADS---EG----REGLYYVWDEE 351

Query: 452 EVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 510
           E+  ILG E    F  +Y +   GN            F+GKN+  EL+    +       
Sbjct: 352 EICGILGAERGTEFCRYYGITEEGN------------FEGKNIPNELDGKEIT------- 392

Query: 511 LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
                +   + R  L+D R +R R HLDDKV+ SWN L+IS+ A    +L          
Sbjct: 393 -----DRFHKERELLYDYRKRRARLHLDDKVLTSWNSLMISAMA----VL---------- 433

Query: 571 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLL 630
           + V G +R  Y+E AE A  FI  +L D  T R+  S R G     GFLDDYA+  + LL
Sbjct: 434 YRVTGKER--YLEAAERARRFIEHNLADGNTLRV--SCRGGSGSVKGFLDDYAYYTAALL 489

Query: 631 DLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGN 690
            LYE  S    L  A ++     + F D EGGG+F     + S++ R KE +DGA PSGN
Sbjct: 490 SLYEAVSDVDHLTRAEQICREARQQFADEEGGGFFLYGSRNDSLITRPKETYDGALPSGN 549

Query: 691 SVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRK 750
           S    +LVRL  I    +   Y+  A+  LA      ++      +   A  +   P +K
Sbjct: 550 STMAYDLVRLYQITGNEE---YKDAAKRQLAFMSGEAQEYPAGYSMFLTALLLYENPPQK 606

Query: 751 HVVLVG 756
             V++ 
Sbjct: 607 ITVVLA 612


>gi|388254779|gb|AFK24895.1| protein of unknown function DUF255 [uncultured archaeon]
          Length = 691

 Score =  443 bits (1139), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 254/602 (42%), Positives = 347/602 (57%), Gaps = 48/602 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYLLQHA+NPVDW++WGEEA   A+K D PIFLSIGYS CHWCHVM  ESFE
Sbjct: 10  NRLLQETSPYLLQHAYNPVDWYSWGEEALERAKKEDKPIFLSIGYSACHWCHVMAHESFE 69

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+ VAK++N+ F++IKVDREERPD+D +Y    Q   G GGWPLSVFL+ D KP   GTY
Sbjct: 70  DDEVAKIMNEHFINIKVDREERPDLDDIYQRVCQLATGTGGWPLSVFLTSDQKPFYVGTY 129

Query: 223 FPPE-DKYGRPGFKTILRKVKDAW-DKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           FP E  +Y  PGFKTIL ++  A+  KK+++ A SG F +  L++     AS       L
Sbjct: 130 FPKEGGRYNMPGFKTILLQLATAYKSKKQEIEAASGEF-MGALAQTAKDIASGMAEKASL 188

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
            ++ +   A  L +  D  +GGFG APKFP P  +  +L +         SG  +  +  
Sbjct: 189 ERSIIDEAAMGLLQMGDPIYGGFGQAPKFPNPTNLMFLLRYYN------LSG-LNRFKDF 241

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
           V FT   MA GGIHD +GGGF RY+ D++W +PHFEKMLYD   LA +Y + + +TK   
Sbjct: 242 VAFTADKMAAGGIHDQLGGGFARYATDQKWLIPHFEKMLYDNALLAQLYSELYQITKADK 301

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
           Y  I R  LD++ R+M+ P G  +SA DADS   EG    +EG FY+W  KE+  ILG+ 
Sbjct: 302 YVQITRKTLDFVSREMMHPEGGFYSALDADS---EG----EEGKFYIWQKKEIASILGDQ 354

Query: 461 AI--LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
               +F EHY +   GN            F+G+N+L      +    + G   E+   I+
Sbjct: 355 VATDIFCEHYGVTEGGN------------FEGQNILNVRVPLANVGLRYGKTPEQAAQII 402

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
            +   KLF  R KR RP  D+K++ SWNGL+IS FA+   I                +  
Sbjct: 403 ADASAKLFTAREKRVRPGRDEKILTSWNGLMISGFAKGYSI----------------TGD 446

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
            +Y++ A++A  FI   +      RL  +F++G SK   +LDDYAF +SGLLDL+   S 
Sbjct: 447 AKYLQAAKNAVDFIEAKI-AAGDGRLLRTFKDGHSKLNAYLDDYAFYVSGLLDLFAVDSK 505

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
             +L  AI   +   + F D + G  F T+ +   +++R K  +D A PSGNS++  +L+
Sbjct: 506 QAYLDKAIMHTDFMLKHFWDEKEGNLFFTSDDHEKLIVRTKSFYDLAIPSGNSMAAADLL 565

Query: 699 RL 700
           RL
Sbjct: 566 RL 567


>gi|312385290|gb|EFR29828.1| hypothetical protein AND_00943 [Anopheles darlingi]
          Length = 874

 Score =  442 bits (1138), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 258/630 (40%), Positives = 342/630 (54%), Gaps = 88/630 (13%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K TNRLA E SPYLLQHAHNPVD                                     
Sbjct: 165 KFTNRLAQEKSPYLLQHAHNPVD------------------------------------- 187

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
            F++E VA+++N+ F+++K+DREERPD+DK+YM ++  + G GGWP+SV+L+PDL P+ G
Sbjct: 188 CFQNEEVARIMNENFINVKLDREERPDIDKLYMMFILLINGSGGWPMSVWLTPDLAPITG 247

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           GTYFPP D++G PGF T+L K+   W   R+ L ++G   IE +   +     S    +E
Sbjct: 248 GTYFPPNDRWGMPGFTTVLTKLAAKWASDREDLVRTGRSVIEAIKRNVDQKQGSGNGDEE 307

Query: 280 LPQNALRLCAEQL-----------SKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 328
               A+    E L            ++YD  +GG   APKFP   ++ +M +H    E  
Sbjct: 308 DGAAAVAAAGETLEAKFRQAINLYQRNYDPVWGGSLGAPKFPEAAKLNLM-FHLHVQEPK 366

Query: 329 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 388
            K         +VL TL  MA GGIHDHV GGF RYSVD++WHVPHFEKMLYDQGQL ++
Sbjct: 367 HKI------LGVVLNTLDKMAAGGIHDHVFGGFARYSVDKKWHVPHFEKMLYDQGQLLSL 420

Query: 389 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 448
           Y + + LT    Y  +   I  YL +D+  PGG  +S EDADS  T  +  K EGAFY W
Sbjct: 421 YANGYRLTHKPLYLTVADAIYRYLCKDLRHPGGGFYSGEDADSLPTADSDVKVEGAFYAW 480

Query: 449 TSKEVEDILGEHAI-----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 497
           T  EV++ L   A            ++ EHY +K TGN + +  SDPH    GKN+ I  
Sbjct: 481 TYAEVKETLERGAAKFGDTTVSPIEVYAEHYDIKETGNVEPA--SDPHGHLLGKNIPIVY 538

Query: 498 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 557
                +A K G   E    +L      L +VR +RPRPHLD K+I +WNGLV+S  +  +
Sbjct: 539 GSVRETAEKCGTRPEIVERVLRVANELLHEVREQRPRPHLDTKIICAWNGLVLSGLSHLA 598

Query: 558 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS-FRNG----P 612
            +  +              DR +Y+  AE    F+R +LYD Q  +L  S + NG     
Sbjct: 599 CVHDA-------------PDRSKYLATAEELVKFVRANLYDVQARKLLRSCYGNGEETLA 645

Query: 613 SKAP--GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGE 670
           S+ P  GF+DDYAFLI GL+D Y        L WA ELQ+ QDELF D + G YF +   
Sbjct: 646 SERPIYGFIDDYAFLIRGLIDYYVASLDEHRLHWAKELQDIQDELFWDPKHGAYFYSEAN 705

Query: 671 DPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
            P V +R+KEDHDGAEP GNSV+  NL+ L
Sbjct: 706 SPHVAVRLKEDHDGAEPCGNSVAGHNLLLL 735


>gi|403389033|ref|ZP_10931090.1| hypothetical protein CJC12_14629 [Clostridium sp. JC122]
          Length = 593

 Score =  442 bits (1137), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 249/645 (38%), Positives = 364/645 (56%), Gaps = 60/645 (9%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           +  K  N L  E SPYLLQHA+NPV+W++W +EAF +A+  + PIFLSIGYSTCHWCHVM
Sbjct: 3   TNQKVPNNLINEKSPYLLQHAYNPVNWYSWCDEAFEKAKDENKPIFLSIGYSTCHWCHVM 62

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
             E FED+ VAK+LND F+SIKVDREERPDVD +YMT  QA  GGGGWPL++F++PD KP
Sbjct: 63  AHECFEDDEVAKILNDNFISIKVDREERPDVDSIYMTVCQAFTGGGGWPLNLFITPDQKP 122

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKL 276
              GTYFP   KY  PGF  IL  + D W   ++ +  +    I QL  A   + + +++
Sbjct: 123 FYAGTYFPKHAKYNVPGFMDILSSISDQWKSDKERIIDASEEVINQLENAFQPTTTDDEI 182

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
             ++ +     C E     +D   GGF  APKFP P ++  +L +  KLE+  K+ E   
Sbjct: 183 GKDIIEGGYLWCLE----FFDVVNGGFDKAPKFPTPHKLMFLLKYY-KLENEPKALE--- 234

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
              MV  TL  M +GGI DH+G GF RYS D++W VPHFEKMLYD   L   YL+ +S+T
Sbjct: 235 ---MVEKTLNQMYRGGIFDHIGYGFSRYSTDDKWLVPHFEKMLYDNALLTMAYLETYSIT 291

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
           K  FY  +    +DY+ R++    G  + A+DADS   EG     EG FYV+   E+ ++
Sbjct: 292 KKEFYKNVAIKTMDYVLRELTSDEGGFYCAQDADS---EG----DEGKFYVFNPLEICEV 344

Query: 457 LGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
           LGE     F  ++ +  +GN            F+GK++   L ++S          EK  
Sbjct: 345 LGEDDGKYFNNYFDITTSGN------------FEGKSIANLLKNNSFENDD-----EK-- 385

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
             + + R+K+F+ R +R   H D+K++ SWN L+I++FA+A  ILK E            
Sbjct: 386 --INDLRKKVFNYRLERTTLHKDEKILTSWNALMITAFAKAYSILKDE------------ 431

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
               +Y++V + A +FI  +L + + +RL   +++G      +L+DYAFLI   ++LYE 
Sbjct: 432 ----KYLKVCKDAIAFIENNLVN-KDNRLLARYKDGDVAYFSYLEDYAFLIWSFIELYEG 486

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
            +  ++L  AI L +   + F D    G+F    +   ++ R KE +DGA PSGNSV+  
Sbjct: 487 TNEKEYLEKAISLNSEMIDKFWDENSSGFFLYGKDSEKLIARPKEIYDGAIPSGNSVAAY 546

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 740
            LV+L+ I   +K    +    + L  F + +K+  ++  +   A
Sbjct: 547 VLVKLSKI---TKDKILKDITYNQLKYFSSTVKNSPISYTMYLIA 588


>gi|255937427|ref|XP_002559740.1| Pc13g13260 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211584360|emb|CAP92395.1| Pc13g13260 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 788

 Score =  442 bits (1136), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 276/670 (41%), Positives = 363/670 (54%), Gaps = 49/670 (7%)

Query: 92  ASTSHSRNKH---------TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIF 142
           AS +HS  +H          NRL    SPY+  H +NPV W  W  EA   A+K +  IF
Sbjct: 3   ASINHSHPRHDVPDTGPKMVNRLHQSKSPYVRGHMNNPVAWQVWDAEAMELAKKHNRLIF 62

Query: 143 LSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGG 202
           LSIGYS CHWCHVME ESF    VA +LN+ FV IKVDREERPD+D VYM YVQA  G G
Sbjct: 63  LSIGYSACHWCHVMEKESFMSSEVASILNESFVPIKVDREERPDIDDVYMNYVQATTGSG 122

Query: 203 GWPLSVFLSPDLKPLMGGTYF--PPEDKYGRP---GFKTILRKVKDAWDKKRDMLAQSGA 257
           GWPL+VFL+P L+P+ GGTY+  P    +  P   GF  IL K++D W  ++     S  
Sbjct: 123 GWPLNVFLTPSLEPVFGGTYWQGPNSTTFRGPEAIGFVEILEKLRDVWQTQQQRCLDSAK 182

Query: 258 FAIEQLSEALSASASS------NKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPR 311
              +QL E       +      N   +E+    L    +  +  YDS  GGFG APKFP 
Sbjct: 183 EITKQLREFAEEGTHTQQGDRDNDKDEEMDIELLEEAYQHFASRYDSVNGGFGRAPKFPT 242

Query: 312 PVEIQMML---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDE 368
           P  +  +L    +  ++ D     E  +   M + TL  MA+GGI DH+G GF RYSV  
Sbjct: 243 PSNLSFLLRLGAYPTQVMDVVGHDECEQATAMAVTTLVNMARGGIRDHIGHGFARYSVTA 302

Query: 369 RWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAE 427
            W +PHFEKMLYDQ QL +VY+DAF LT D        D+  YL    I  P G  FS+E
Sbjct: 303 DWGLPHFEKMLYDQAQLLDVYVDAFRLTHDPELLGAVYDLSAYLTSAPIQSPTGGFFSSE 362

Query: 428 DADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHN 486
           DADS      T K+EGAFYVW+ KE+  +LG   A +  +H+ + P GN  +    DPH+
Sbjct: 363 DADSYPHPNDTEKREGAFYVWSLKELTSVLGPRDAPVCAKHWGVLPDGN--VPPEYDPHD 420

Query: 487 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSW 545
           EF  +NVL      S  A   G+  E+ + I+   ++KL D R + R RP LDDK+IV+W
Sbjct: 421 EFMNQNVLSIRATPSKLAKDFGLSEEEVVKIIKSSKQKLHDHREQTRGRPDLDDKIIVAW 480

Query: 546 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 605
           NGL I + A+ S +L  E ES         S      E A  A  FI+  L+D+ T +L 
Sbjct: 481 NGLAIGALAKCS-VLFEEIES---------SKAVHCREAAARAIGFIKDKLFDKATGQLW 530

Query: 606 HSFRNGP-SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG--- 661
             +R+G     PGF DDYA+L SGLLD+Y+      +L +A  LQ   +E FL + G   
Sbjct: 531 RIYRDGNRGDTPGFADDYAYLASGLLDMYDATYDDSYLQFAERLQKYLNEYFLAQSGSTA 590

Query: 662 GGYFN----TTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAE 717
            GY++    TT   P  LLR+K   + A PS N V   NL+RL++++ G +S  YR  A 
Sbjct: 591 AGYYSTPSVTTPGMPGPLLRLKTGTESATPSVNGVIARNLLRLSALL-GDES--YRTLAR 647

Query: 718 HSLAVFETRL 727
            +   F   +
Sbjct: 648 QTCNTFAVEI 657


>gi|392865908|gb|EAS31753.2| hypothetical protein CIMG_06900 [Coccidioides immitis RS]
          Length = 799

 Score =  441 bits (1134), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 267/667 (40%), Positives = 368/667 (55%), Gaps = 49/667 (7%)

Query: 92  ASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCH 151
           A+ +   ++  NRL+   SPY+  H +NPV W  W   A   A++ +  IFLSIGYS CH
Sbjct: 13  ATETAGPSRLVNRLSESRSPYVRGHMNNPVAWQLWDSAAINLAKRLNRLIFLSIGYSACH 72

Query: 152 WCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLS 211
           WCHVME ESF    VA +LN  FV IK+DREERPD+D+VYM YVQA+ G GGWPL+VFL+
Sbjct: 73  WCHVMEKESFMSPEVAAILNKSFVPIKLDREERPDIDEVYMNYVQAITGSGGWPLNVFLT 132

Query: 212 PDLKPLMGGTYFPPEDKYGRP--------GFKTILRKVKDAWDKKRDMLAQSGAFAIEQL 263
           PDL+P+ GGTY+P       P         F  IL K++D W+ ++    +S      QL
Sbjct: 133 PDLEPVFGGTYWPGPYSSSMPRVGGEEPITFIDILEKLRDVWNSQQLRCMESAKEITRQL 192

Query: 264 SEALSASASSNKLPDELPQNALRLCA-----EQLSKSYDSRFGGFGSAPKFPRPVEIQMM 318
            E  +   +  + P+   +  L L       +     YD   GGF  APKFP P  +  +
Sbjct: 193 RE-FAEEGTHLRRPETESEEDLELELLEEAHQHFVSRYDPINGGFSRAPKFPTPANLSFL 251

Query: 319 L----YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPH 374
           L    Y    ++  G+  E +   +MV  TL  MA+GGIHD +G GF RYSV   W +PH
Sbjct: 252 LRLGRYPDVVMDIVGRE-ECARATEMVSKTLLQMARGGIHDQIGHGFARYSVTPDWSLPH 310

Query: 375 FEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAE 433
           FEKMLYDQ QL +VY+D F +T++        DI+ Y+    ++ P G   S+EDADS  
Sbjct: 311 FEKMLYDQAQLLDVYVDCFEITQEPKLLEAVYDIIAYITSPPILSPEGAFHSSEDADSFP 370

Query: 434 TEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKN 492
               T K+EGAFYVWT KE++ ILG+  A +   H+ + P GN  ++R +DPH+EF  +N
Sbjct: 371 NSNDTEKREGAFYVWTLKEMQQILGQRDAEVCAHHWGVLPDGN--VARGNDPHDEFINQN 428

Query: 493 VLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVIS 551
           VL         A   G+  ++ + ++   R+KL + R + R RP LDDK+IVSWNGL I 
Sbjct: 429 VLCIRASPRKIAKDFGLSEDEVVRVIKSSRKKLQEFRDEHRVRPDLDDKIIVSWNGLAIG 488

Query: 552 SFARASKIL-KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRN 610
           + A+ S +L K +AE A                VAE AA FIR +L+D +T +L   +R+
Sbjct: 489 ALAKCSLLLDKIDAERA-----------THCRRVAEKAAKFIRENLFDAETGQLWRVYRD 537

Query: 611 G-PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG-----GGY 664
           G   + PGF DDYA+L SGL+ LYE      +L +A  LQ   +  FL          GY
Sbjct: 538 GRRGETPGFGDDYAYLASGLISLYEATFDDSYLQFAENLQQYLNRYFLATASDGTTPAGY 597

Query: 665 F----NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 720
           +    N  G+ P  L R+K   D A PS N V   NL+RLAS++   + D Y+  A H+ 
Sbjct: 598 YMTPQNMPGDVPGPLFRLKTGTDAATPSTNGVIAQNLLRLASLL---EDDSYKALARHTC 654

Query: 721 AVFETRL 727
           + F   +
Sbjct: 655 SAFAAEM 661


>gi|448345120|ref|ZP_21534020.1| hypothetical protein C485_05016, partial [Natrinema altunense JCM
           12890]
 gi|445636069|gb|ELY89233.1| hypothetical protein C485_05016, partial [Natrinema altunense JCM
           12890]
          Length = 589

 Score =  441 bits (1134), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 243/613 (39%), Positives = 347/613 (56%), Gaps = 46/613 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W E A   A +RDVPIFLSIGYS CHWCHVME ESF+
Sbjct: 8   NRLDEEESPYLRQHADNPVNWQPWDERALEAATERDVPIFLSIGYSACHWCHVMEEESFQ 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA+++N+ FV IKVDREERPD+D +YMT  Q + G GGWPLS +L+P+ KP   GTY
Sbjct: 68  DEAVAEVINENFVPIKVDREERPDIDSIYMTVCQLVRGQGGWPLSAWLTPEGKPFFIGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRD------MLAQSGAFAIEQLSEALSASASSN-K 275
           FP E + G+PGF+ + +++ D+W+   D         Q    A ++L E   A+  S  +
Sbjct: 128 FPREGQRGQPGFRDLCQRISDSWESDADREEMENRAQQWTDAATDRLEETPDAAGGSPVE 187

Query: 276 LPDELPQNALRLCAEQLSKSYDSRFGGFGSA-PKFPRPVEIQMMLYHSKKLEDTGKSGEA 334
            P+    + L   A+ + +S D  +GGFGS+ PKFP+P  ++++   ++  + TG+    
Sbjct: 188 APEPPSSDVLETAADAVVQSADREYGGFGSSGPKFPQPSRLRVL---ARTYDRTGR---- 240

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
            E +++   TL  MA GG+ DHVGGGFHRY VD  W VPHFEKMLYD  ++   +L  + 
Sbjct: 241 EEYREVFEETLDAMAAGGLADHVGGGFHRYCVDRDWTVPHFEKMLYDNAEIPRAFLSGYQ 300

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
           LT +  Y+ +  D L ++ R++    G  FS  DA S   E   R +EGAFYVWT  EV 
Sbjct: 301 LTGEDRYAELVADTLSFVERELTHDDGGFFSTLDAQSDSPETGER-EEGAFYVWTPDEVH 359

Query: 455 DILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 512
           D+L +   A LF   Y +   GN            F+G+N    +   S  A++  +   
Sbjct: 360 DVLEDETDAALFCARYDITEAGN------------FEGRNQPNRVARVSELAAQFDLADH 407

Query: 513 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 572
           + L  L   R++LF+ R +RPRP+ D+K++  WNGL+IS++A A+ +L            
Sbjct: 408 EILKRLESARQRLFEARQERPRPNRDEKILAGWNGLMISTYAEAALVL------------ 455

Query: 573 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 632
             G+D  +Y + A  A  F+R  L+DE   RL   +++G  K  G+L+DYAFL  G LD 
Sbjct: 456 --GAD--DYADTAVDALGFVRDELWDEDEQRLSRRYKDGDVKIDGYLEDYAFLARGALDC 511

Query: 633 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 692
           Y+       L +A+EL    +  F D + G  + T     +++ R +E  D + PS   V
Sbjct: 512 YQATGEVDHLAFALELARVIEAEFWDADSGTLYFTPESGEALVTRPQELGDQSTPSATGV 571

Query: 693 SVINLVRLASIVA 705
           +V  L+ L    A
Sbjct: 572 AVETLLALDEFAA 584


>gi|119184130|ref|XP_001243004.1| hypothetical protein CIMG_06900 [Coccidioides immitis RS]
          Length = 797

 Score =  441 bits (1133), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 271/700 (38%), Positives = 379/700 (54%), Gaps = 49/700 (7%)

Query: 92  ASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCH 151
           A+ +   ++  NRL+   SPY+  H +NPV W  W   A   A++ +  IFLSIGYS CH
Sbjct: 13  ATETAGPSRLVNRLSESRSPYVRGHMNNPVAWQLWDSAAINLAKRLNRLIFLSIGYSACH 72

Query: 152 WCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLS 211
           WCHVME ESF    VA +LN  FV IK+DREERPD+D+VYM YVQA+ G GGWPL+VFL+
Sbjct: 73  WCHVMEKESFMSPEVAAILNKSFVPIKLDREERPDIDEVYMNYVQAITGSGGWPLNVFLT 132

Query: 212 PDLKPLMGGTYFPPEDKYGRP--------GFKTILRKVKDAWDKKRDMLAQSGAFAIEQL 263
           PDL+P+ GGTY+P       P         F  IL K++D W+ ++    +S      QL
Sbjct: 133 PDLEPVFGGTYWPGPYSSSMPRVGGEEPITFIDILEKLRDVWNSQQLRCMESAKEITRQL 192

Query: 264 SEALSASASSNKLPDELPQNALRLCA-----EQLSKSYDSRFGGFGSAPKFPRPVEIQMM 318
            E  +   +  + P+   +  L L       +     YD   GGF  APKFP P  +  +
Sbjct: 193 RE-FAEEGTHLRRPETESEEDLELELLEEAHQHFVSRYDPINGGFSRAPKFPTPANLSFL 251

Query: 319 L----YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPH 374
           L    Y    ++  G+  E +   +MV  TL  MA+GGIHD +G GF RYSV   W +PH
Sbjct: 252 LRLGRYPDVVMDIVGRE-ECARATEMVSKTLLQMARGGIHDQIGHGFARYSVTPDWSLPH 310

Query: 375 FEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAE 433
           FEKMLYDQ QL +VY+D F +T++        DI+ Y+    ++ P G   S+EDADS  
Sbjct: 311 FEKMLYDQAQLLDVYVDCFEITQEPKLLEAVYDIIAYITSPPILSPEGAFHSSEDADSFP 370

Query: 434 TEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKN 492
               T K+EGAFYVWT KE++ ILG+  A +   H+ + P GN  ++R +DPH+EF  +N
Sbjct: 371 NSNDTEKREGAFYVWTLKEMQQILGQRDAEVCAHHWGVLPDGN--VARGNDPHDEFINQN 428

Query: 493 VLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVIS 551
           VL         A   G+  ++ + ++   R+KL + R + R RP LDDK+IVSWNGL I 
Sbjct: 429 VLCIRASPRKIAKDFGLSEDEVVRVIKSSRKKLQEFRDEHRVRPDLDDKIIVSWNGLAIG 488

Query: 552 SFARASKIL-KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRN 610
           + A+ S +L K +AE A                VAE AA FIR +L+D +T +L   +R+
Sbjct: 489 ALAKCSLLLDKIDAERA-----------THCRRVAEKAAKFIRENLFDAETGQLWRVYRD 537

Query: 611 G-PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG-----GGY 664
           G   + PGF DDYA+L SGL+ LYE      +L +A  LQ   +  FL          GY
Sbjct: 538 GRRGETPGFGDDYAYLASGLISLYEATFDDSYLQFAENLQQYLNRYFLATASDGTTPAGY 597

Query: 665 F----NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 720
           +    N  G+ P  L R+K   D A PS N V   NL+RLAS++   + D Y+  A H+ 
Sbjct: 598 YMTPQNMPGDVPGPLFRLKTGTDAATPSTNGVIAQNLLRLASLL---EDDSYKALARHTC 654

Query: 721 AVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSS 760
           + F   +         +      L V  +  + ++GH ++
Sbjct: 655 SAFAAEMLQHPFLFVGLLDVVVGLEVGVKSVIGVLGHDTT 694


>gi|448365504|ref|ZP_21553884.1| hypothetical protein C480_03514 [Natrialba aegyptia DSM 13077]
 gi|445655043|gb|ELZ07890.1| hypothetical protein C480_03514 [Natrialba aegyptia DSM 13077]
          Length = 717

 Score =  440 bits (1132), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 258/649 (39%), Positives = 352/649 (54%), Gaps = 49/649 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYL QHA NPV+W  W E A   AR+ DVPIFLSIGYS CHWCHVM  ESF 
Sbjct: 8   NRLADEESPYLRQHADNPVNWQPWDERALETAREHDVPIFLSIGYSACHWCHVMADESFA 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA  LN+ FV IKVDREERPD+D +YMT  Q + G GGWPLS +L+PD KP   GTY
Sbjct: 68  DETVAAQLNEHFVPIKVDREERPDIDSIYMTVCQLVTGRGGWPLSAWLTPDGKPFYVGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLA----QSGAFAIEQLSEALSASASSNKLPD 278
           FP E K G+PGF  IL  V ++W+  R+ +     Q  A A ++L E   A  +S     
Sbjct: 128 FPREAKRGQPGFLDILENVTNSWESDREEIENRADQWTAAATDRLEETPDAVGASQP--- 184

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDTGKSGEASEG 337
               + L   A    +S D  FGGFGS  PKFP+P  ++++   ++  + TG+     E 
Sbjct: 185 -PSSDVLEAAANASLRSADREFGGFGSDGPKFPQPSRLRVL---ARAADRTGR----DEF 236

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
             +++ TL  MA GG++DHVGGGFHRY VD  W VPHFEKMLYD  ++   +L  +  T 
Sbjct: 237 SDVLVETLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKMLYDNAEIPRAFLLGYQQTG 296

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           D  Y+ +  + LD++ R++    G  FS  DA S + E   R +EGAFYVWT  +V D+L
Sbjct: 297 DERYAEVVAETLDFVERELTHEAGGFFSTLDAQSEDPETGER-EEGAFYVWTPDDVRDVL 355

Query: 458 GEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
            +   A LF   Y +  +GN            F+GKN    +       ++  +P ++  
Sbjct: 356 ADETDAELFCSRYDITESGN------------FEGKNQPNRVASIDDLTNRSELPADETR 403

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
             L   RR LF+ R +RPRP+ D+KV+  WNGL+I++ A A+ +L              G
Sbjct: 404 ERLESARRDLFEARERRPRPNRDEKVLAGWNGLMIATCAEAALVL--------------G 449

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
            D  +Y E+A  A +F+R  L+D    RL   +++      G+L+DYAFL  G L  YE 
Sbjct: 450 ED--DYAEMATDALAFVRDRLWDADEQRLSRRYKDHDVAIDGYLEDYAFLARGALGCYEA 507

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
                 L +A+EL    +  F D   G  + T     S++ R +E  D + PS   V+V 
Sbjct: 508 TGEVDHLAFALELARVIEAEFWDEAQGTLYFTPESGESLVTRPQELGDQSTPSAAGVAVE 567

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 744
            L+ L    AG   ++ R  A   L     RL+  ++    +C AAD L
Sbjct: 568 TLLELDGF-AGESGEFERI-ATTVLETHANRLETNSLEHATLCLAADRL 614


>gi|14548135|gb|AAK66792.1|U40238_13 Highly conserved protein containing a thioredoxin domain
           [uncultured crenarchaeote 4B7]
          Length = 674

 Score =  440 bits (1132), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 237/603 (39%), Positives = 346/603 (57%), Gaps = 51/603 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L+ E SPYLLQH  NPV+W++W +E+  +A+  + PIFLS+GYS+CHWCHVM  ESFE
Sbjct: 3   NNLSKETSPYLLQHKDNPVEWYSWNDESLKKAKDENKPIFLSVGYSSCHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           ++ VAK++N+ FV+IKVDREERPD+D +Y    Q   G GGWPLSVFL+P+ KP   GTY
Sbjct: 63  NDDVAKIMNENFVNIKVDREERPDLDDIYQKICQMSTGQGGWPLSVFLTPEQKPFYVGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP  D YGRPGF ++ R++  AW++K   +  S    +  L++    S        E+ +
Sbjct: 123 FPVLDSYGRPGFGSLCRQLAQAWNEKPKDVGTSAEQFMSNLTKLEKVSDGG-----EIEK 177

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
           + L   A  L +  D+ +GGFG APKFP    +  M  +SK       SG  ++ Q+  L
Sbjct: 178 SILDEAAVNLLQVADTNYGGFGQAPKFPNAANLSFMFRYSK------LSG-ITKFQEFAL 230

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL+ MAKGGI D +GGGFHRYS D RW VPHFEKMLYD   L  VY +A+ +TKD FY 
Sbjct: 231 MTLKKMAKGGIFDQIGGGFHRYSTDARWLVPHFEKMLYDNALLPPVYAEAYQITKDPFYL 290

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            +    LDY+ R+M    G  +SA+DAD+   EG T       +VW  +E+E+ILG+ + 
Sbjct: 291 DVVTKTLDYIMREMTSASGLFYSAQDADTNGEEGQT-------FVWKKREIENILGDDSE 343

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
           +F  +Y +   GN            F+G  +L    + S+ + K     ++   +L    
Sbjct: 344 IFCIYYDVTDGGN------------FEGNTILANNINISSLSFKFNKTEDEITKLLKRSS 391

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYM 582
           +KL DVRS R +P  DDK+I SWN ++IS+FA+  +I                S  ++Y+
Sbjct: 392 KKLLDVRSNRDQPGTDDKIITSWNSMMISAFAKGYRI----------------SGNEKYL 435

Query: 583 EVAESAASFIRRHLYDEQTHRLQH-SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
            VA +AA +          H   H +F+N   K  G+LDDY++L++ L+D++E  S   +
Sbjct: 436 NVAVNAAKYFSEQF---SKHGFIHRTFKNDTPKLNGYLDDYSYLVNSLIDVFEITSDAYF 492

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           L  A ++ +   E F +     ++ T     S+++R K  +D + PSGNSV+   L++L 
Sbjct: 493 LDIAQKITHYMIEHFWNETEKSFYFTADTHESLIVRPKNYYDLSVPSGNSVAANALLKLH 552

Query: 702 SIV 704
            +V
Sbjct: 553 HLV 555


>gi|448397958|ref|ZP_21569896.1| hypothetical protein C476_03843 [Haloterrigena limicola JCM 13563]
 gi|445672174|gb|ELZ24751.1| hypothetical protein C476_03843 [Haloterrigena limicola JCM 13563]
          Length = 731

 Score =  439 bits (1130), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 253/652 (38%), Positives = 351/652 (53%), Gaps = 50/652 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W E+A   A++RDVPIFLSIGYS CHWCHVME ESF 
Sbjct: 8   NRLDEEESPYLRQHADNPVNWQPWDEQALEAAKERDVPIFLSIGYSACHWCHVMEAESFA 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA++LN+ FV IKVDREERPD+D +YMT  Q + G GGWPLS +L+P+ KP   GTY
Sbjct: 68  DEAVAEVLNENFVPIKVDREERPDIDSIYMTVCQLVSGQGGWPLSAWLTPEGKPFFIGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRD------MLAQSGAFAIEQLSEALSASASSNKL 276
           FP E K G+PGF  +  ++ D+W    D         Q    A ++L E  +  A ++  
Sbjct: 128 FPREGKRGQPGFLDLCERISDSWASAEDRPEMESRAEQWTDAAKDRLEETPTEDADTDAS 187

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFGSA-PKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
                   L   A+ + +S D R GGFGS+ PKFP+P  ++++     + +D     E  
Sbjct: 188 AGPPSSEVLETAADAIVRSADRRCGGFGSSGPKFPQPSRLRVLARAHDRTDDETAYREVL 247

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
           E       TL  MA GG++DHVGGGFHRY VD  W VPHFEKMLYD  ++   +L  + L
Sbjct: 248 EE------TLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKMLYDNAEIPRAFLAGYQL 301

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           T +  Y+ +  D L+++ R++    G  FS  DA S   E   R KEGAFYVWT  EV D
Sbjct: 302 TGENRYAEVVGDTLEFVERELTHDDGGFFSTLDAQSESPETGER-KEGAFYVWTPDEVHD 360

Query: 456 ILGEH---AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 512
           ++ EH   A LF + Y +  +GN            F+G++    +   S  A    +   
Sbjct: 361 VI-EHEPDAALFCKRYDITESGN------------FEGRSQPNRVTPVSELAVGFDLEES 407

Query: 513 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 572
           + L  L   R++LF+ R +RPRP+ D+K++  WNGL+IS++A A+ +L            
Sbjct: 408 EVLKRLDAIRQRLFEAREERPRPNRDEKILAGWNGLMISTYAEAALVL------------ 455

Query: 573 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 632
             G D  +Y E A  A  F+R  L+D    RL   ++ G     G+L+DYAFL  G LD 
Sbjct: 456 --GED--DYAETAVDALEFVRDRLWDADEQRLSRRYKGGDVAIDGYLEDYAFLARGALDC 511

Query: 633 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 692
           Y+       L +A+EL    +  F D + G  + T     S++ R +E  D + PS   V
Sbjct: 512 YQATGEVDHLAFALELARVIEVEFWDADHGTLYFTPASGESLVTRPQELSDQSTPSAAGV 571

Query: 693 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 744
           +V  L+ L        ++ + + A   L      L+  A+    +C AAD L
Sbjct: 572 AVETLLSLDEFA----TEDFEEIAATVLETHANTLEANALEHATLCLAADRL 619


>gi|284045681|ref|YP_003396021.1| hypothetical protein Cwoe_4232 [Conexibacter woesei DSM 14684]
 gi|283949902|gb|ADB52646.1| protein of unknown function DUF255 [Conexibacter woesei DSM 14684]
          Length = 666

 Score =  439 bits (1129), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 260/662 (39%), Positives = 353/662 (53%), Gaps = 70/662 (10%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            N LA E SPYLLQH  NPVDW  WG +A A AR+RDVP+ +SIGYS CHWCHVME ESF
Sbjct: 2   ANALANETSPYLLQHKDNPVDWRPWGPDALAAARERDVPLLISIGYSACHWCHVMERESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           ED   A L+N+ FV IKVDREERPDVD +YM  VQA+ G GGWPL+ F +P+  P   GT
Sbjct: 62  EDPQTAALMNERFVCIKVDREERPDVDAIYMDAVQAMTGHGGWPLNAFATPEQVPFYAGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFPP+ ++G P ++ +L  + DAW  +RD +       +  LS     + S   +   L 
Sbjct: 122 YFPPQPRHGLPSWRQVLEAISDAWRARRDEILAQNDRIVAHLSAGARLAPSGAMVDPGLL 181

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
            +A+    + L  + D   GGFGSAPKFP+   I+++L          + GE    Q + 
Sbjct: 182 DDAV----DSLRMAADPVNGGFGSAPKFPQASVIELLL----------RRGE----QTVA 223

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
           L  L+ MA+GGIHD +GGGF RY+VD  W VPHFEKMLYD   LA  YL  + ++ D   
Sbjct: 224 LDALRAMARGGIHDQLGGGFSRYTVDAAWVVPHFEKMLYDNALLARAYLHGWQVSGDPLL 283

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHA 461
             +C D LD+  R+M GP G   SA DADS   EG     EG FYVW+  E+   LG+  
Sbjct: 284 RQVCEDTLDWALREMRGPEGGFHSALDADS---EGV----EGKFYVWSLAELRSALGDDE 336

Query: 462 I--LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
           +  +    Y     GN            F+G N+L+    +SA+      P E     L 
Sbjct: 337 LYDVAVAWYGATVAGN------------FEGLNILVRAGSASAAE-----PPE-----LP 374

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
           E RR+L   RS R RP LDDK + SWN L+I++ A A  +L                +R 
Sbjct: 375 EIRRRLLAARSTRVRPGLDDKRLTSWNALMIAALAEAGAVL----------------ERD 418

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
           +Y++ A   ASF+   L      RL  S+++G +  PG+L+D+A+ +  LL LYE     
Sbjct: 419 DYLDAARGTASFLLDSLATSDG-RLLRSWKDGRATLPGYLEDHAYALEALLTLYEATFEE 477

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
           +W   A  L +     F D E GG+F T  +   ++ R K+  D   PSGNS +   L+R
Sbjct: 478 RWFTAARALADATIAHFADAEHGGFFMTADDHEQLVARRKDLEDTPIPSGNSAAAFGLLR 537

Query: 700 LASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKS 759
           LA +     +DY R+ AE  +A+        AMA   +  A D   +     V +VG ++
Sbjct: 538 LARLT--GSADYERE-AERVIALLHPLAAGHAMAFAHLLAAID-FQLGEVHEVAIVGDRA 593

Query: 760 SV 761
           + 
Sbjct: 594 AA 595


>gi|121701517|ref|XP_001269023.1| DUF255 domain protein [Aspergillus clavatus NRRL 1]
 gi|119397166|gb|EAW07597.1| DUF255 domain protein [Aspergillus clavatus NRRL 1]
          Length = 788

 Score =  438 bits (1127), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 269/668 (40%), Positives = 363/668 (54%), Gaps = 43/668 (6%)

Query: 78  IHPYKVVAMAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKR 137
           IHP   +   +  P        K  NRL    SPY+  H +NPV W  W  EA   AR+ 
Sbjct: 7   IHPSTHIGGNDTEP--------KLVNRLRDSRSPYVRAHMNNPVAWQLWDAEAIGLARRH 58

Query: 138 DVPIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQA 197
           +  IFLSIGYS CHWCHV+E ESF  + VA LLN+ F+ IKVDREERPD+D VYM YVQA
Sbjct: 59  NRLIFLSIGYSACHWCHVIEKESFMSQEVASLLNESFIPIKVDREERPDIDDVYMNYVQA 118

Query: 198 LYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRP-----GFKTILRKVKDAWDKKRDML 252
             G GGWPLSVFL+PDL+P+ GGTY+P  +          GF  IL K++D W  ++   
Sbjct: 119 TTGSGGWPLSVFLTPDLEPVFGGTYWPGPNSSTLSGPHTIGFVDILEKLRDVWKTQQQRC 178

Query: 253 AQSGAFAIEQL---SEALSASASSNKLPDE-LPQNALRLCAEQLSKSYDSRFGGFGSAPK 308
            +S      QL   +E  + S   ++  DE L    L    +  +  YD+  GGF  APK
Sbjct: 179 RESAKEITRQLREFAEEGTHSQQGDREADEDLDIELLEEAYQHFASRYDAVNGGFSRAPK 238

Query: 309 FPRPVEIQMML---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYS 365
           FP P  +  +L    +   + D     E  +   M + TL  MA+GGI DH+G GF RYS
Sbjct: 239 FPTPANLSFLLRLKTYPSAVSDIVGQEECDKATTMAVSTLVSMARGGIRDHIGHGFARYS 298

Query: 366 VDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIF 424
           V   W +PHFEKMLYDQ QL +VY+DAF +T +        D+  YL    I    G   
Sbjct: 299 VTSDWSLPHFEKMLYDQAQLLDVYVDAFQITHNPELLGAVYDLATYLTTAPIQSSTGAFH 358

Query: 425 SAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSD 483
           S+EDADS      T K+EGAFYVWT KE+  +LG+  A +   H+ + P GN  ++   D
Sbjct: 359 SSEDADSLPAPNDTEKREGAFYVWTLKELTQVLGQRDAGVCARHWGVLPDGN--IAPEHD 416

Query: 484 PHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVI 542
           PH+EF  +NVL      S  A + G+  E+ + I+   ++KL + R K R RP LDDK+I
Sbjct: 417 PHDEFMNQNVLSIKVTPSKLAREFGLSEEEVVKIIKSAKQKLREYREKTRVRPDLDDKII 476

Query: 543 VSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTH 602
           V+WNGL I + A+ S + + E ES         S   E  E A  A SFI+ +L+++ T 
Sbjct: 477 VAWNGLAIGALAKCSALFE-EIES---------SKAVECREAAARAISFIKENLFEKVTG 526

Query: 603 RLQHSFRNGP-SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG 661
           +L   +R+G     PGF DDYA+L  GLLD+YE      +L +A +LQ   +  FL   G
Sbjct: 527 QLWRIYRDGSRGDTPGFADDYAYLTQGLLDMYEATFEDSYLQFAEQLQRYLNRNFLAYIG 586

Query: 662 ---GGYFNT----TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQ 714
               GY++T    T   P  LLR+K   + A PS N V   NL+RL++++   +     +
Sbjct: 587 STPAGYYSTPSTMTPGMPGPLLRLKTGTESATPSINGVIARNLLRLSALLEDEEYRTLAR 646

Query: 715 NAEHSLAV 722
              HS +V
Sbjct: 647 QTCHSFSV 654


>gi|448339114|ref|ZP_21528145.1| hypothetical protein C487_15484 [Natrinema pallidum DSM 3751]
 gi|445621085|gb|ELY74571.1| hypothetical protein C487_15484 [Natrinema pallidum DSM 3751]
          Length = 727

 Score =  438 bits (1127), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 248/652 (38%), Positives = 358/652 (54%), Gaps = 51/652 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W E A   AR+R+VPIFLSIGYS CHWCHVM  ESFE
Sbjct: 8   NRLDEEESPYLRQHADNPVNWQPWDETALEAARERNVPIFLSIGYSACHWCHVMAEESFE 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA+++N+ FV IKVDREERPD+D +YMT  Q + G GGWPLS +L+P+ KP   GTY
Sbjct: 68  DEAVAEVINENFVPIKVDREERPDIDSIYMTVCQLVRGQGGWPLSAWLTPEGKPFFIGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAW------DKKRDMLAQSGAFAIEQLSEALSASASSNKL 276
           FP E + G+PGF+ + +++ D+W      ++  +   Q    A +QL E    +    + 
Sbjct: 128 FPREGQRGQPGFRDLCQRISDSWESEEDREEMENRAQQWTDAAKDQLEETPDTAGVGAEP 187

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
           P     + L   A+ + +S D ++GGFGS  KFP+P  ++++   ++  + TG+     E
Sbjct: 188 PS---SDVLETAADMVLRSADRQYGGFGSGQKFPQPSRLRVL---ARAYDRTGR----EE 237

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
            +++   TL  MA GG++DHVGGGFHRY VD  W VPHFEKMLYD  ++   +L  + LT
Sbjct: 238 YREVFEETLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKMLYDNAEIPRAFLSGYQLT 297

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
            +  Y+ +  + L+++ R++    G  FS  DA S   E   R +EGAFYVWT  EV + 
Sbjct: 298 GEDRYATVVSETLEFVDRELTHDEGGFFSTLDAQSESPETGER-EEGAFYVWTPAEVHEA 356

Query: 457 LGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 514
           L +   A LF   + +  +GN            F+G+N    +   S  A +  +   + 
Sbjct: 357 LDDETDAALFCARFDISESGN------------FEGRNQPNRVATVSELADQFDLAEHEI 404

Query: 515 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 574
           L  L   R+ LF+ R +RPRP+ D+K++  WNGL+IS++A A+ +L              
Sbjct: 405 LKRLDSARQTLFEAREERPRPNRDEKILAGWNGLLISTYAEAALVL-------------- 450

Query: 575 GSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE 634
           G+D  +Y + A  A  F+R  L+DE   RL   +++G  K  G+L+DYAFL  G LD Y+
Sbjct: 451 GAD--DYADTAVDALEFVRDRLWDEDDQRLSRRYKDGDVKVDGYLEDYAFLARGALDCYQ 508

Query: 635 FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
                  L +A+EL    +  F D + G  + T     S++ R +E  D + PS   V+V
Sbjct: 509 ATGEVDHLAFALELARVIEAEFWDADRGTLYFTPESGESLVTRPQELGDQSTPSATGVAV 568

Query: 695 INLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 746
             L+ L    A    D     A   L      L+  A+    +C AAD L+ 
Sbjct: 569 ETLLALDEFAAEDFEDI----AATVLETHANELESNALEHATLCLAADRLAA 616


>gi|347733897|ref|ZP_08866951.1| hypothetical protein DA2_3260 [Desulfovibrio sp. A2]
 gi|347517453|gb|EGY24644.1| hypothetical protein DA2_3260 [Desulfovibrio sp. A2]
          Length = 781

 Score =  438 bits (1126), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 265/708 (37%), Positives = 365/708 (51%), Gaps = 67/708 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N LA   SPYLLQHA NPV W  WG+EA   AR  D P+F+SIGYSTCHWCHVM  ESFE
Sbjct: 38  NLLARAKSPYLLQHAANPVHWRPWGDEALQRARDEDRPLFVSIGYSTCHWCHVMAHESFE 97

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+ VA+LLND FV +KVDREERPD+D  YM   Q L G GGWPL++   PD +P    TY
Sbjct: 98  DDEVARLLNDAFVCVKVDREERPDIDAAYMAACQMLTGTGGWPLTIIALPDGRPFFAATY 157

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL---SEALSASASSNKLPDE 279
            P   + GR G   ++ +V   W  KR  +  S    +E +   +EA+    +  +LP  
Sbjct: 158 LPKHSRPGRIGLMDLVPRVLAVWRDKRGEVLDSAESIVEHVRRHAEAMLRPPADGRLPG- 216

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK--------------L 325
                L    E ++  +D+  GGFGSAPKFP P  +  +L  +++               
Sbjct: 217 --AGTLHAACEAMASEFDAANGGFGSAPKFPSPHNLLFLLRWARRNGYGAGSGASGAAAP 274

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
             T      ++  +M   TL+ + +GGIHDHVG GFHRYS D RW +PHFEKMLYDQ  L
Sbjct: 275 GATQDEPGGAKALRMAAQTLRAIRRGGIHDHVGYGFHRYSTDARWLLPHFEKMLYDQAML 334

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
              Y +A+  T D  +     +   Y+ RD+    G  +SAEDADS E +G   + EG F
Sbjct: 335 MLAYAEAWLATGDGEFRRTAEETAAYVLRDLTSSEGAFYSAEDADS-ELDGV--RGEGLF 391

Query: 446 YVWTSKEVEDILG-------------------EHAILFKEHYYLKPTGNCDLSRMSDPHN 486
           Y +T  ++E                         A L    +     GN +     +   
Sbjct: 392 YTFTLADLEAACAPLDVGSGGDGGAEAGEGAISDADLAARAFGCTAYGNYE----DEATR 447

Query: 487 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWN 546
              G+NVL       A A +LG+P  +    L   R  LFD+R+ RPRPHLDDKV+  WN
Sbjct: 448 SRTGRNVLHLPRSPEALARELGLPPREVEERLEAARAALFDLRTTRPRPHLDDKVLADWN 507

Query: 547 GLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQH 606
           GL I++ +R ++                  D     E A  AA F+   +   +  RL H
Sbjct: 508 GLAIAAMSRCAQAF----------------DAPHLAEAAAVAADFVLTRMVTPEG-RLLH 550

Query: 607 SFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFN 666
            +R+G +  PG LDDYAF+I GL++LY      +WL  A+ LQ  QD  F D EGGGY+ 
Sbjct: 551 RWRDGEAAVPGLLDDYAFMIWGLVELYGATGEVRWLRRALRLQEVQDTFFHDPEGGGYWM 610

Query: 667 TTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETR 726
           T  +  ++L+R KE HDGA PSGN+ ++ NL+RL+ ++   +   Y + A   L  F T+
Sbjct: 611 TPADGDALLVRRKEGHDGALPSGNAAALFNLLRLSLLLGRPE---YGERARGVLRAFATQ 667

Query: 727 LKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASY 774
           ++   +   +  C  D  ++   + V++ G     D E MLAA   +Y
Sbjct: 668 VRHHPIGSTMFLCGVD-FALSGGRSVIVAGEPDQPDTEAMLAAVRGTY 714


>gi|320031949|gb|EFW13906.1| DUF255 domain-containing protein [Coccidioides posadasii str.
           Silveira]
          Length = 799

 Score =  438 bits (1126), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 266/667 (39%), Positives = 367/667 (55%), Gaps = 49/667 (7%)

Query: 92  ASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCH 151
           A+ +   ++  NRL+   SPY+  H +NPV W  W   A   A++ +  IFLSIGYS CH
Sbjct: 13  ATETAGPSRLVNRLSESRSPYVRGHMNNPVAWQLWDSAAINLAKRLNRLIFLSIGYSACH 72

Query: 152 WCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLS 211
           WCHVME ESF    VA +LN  FV IK+DREERPD+D+VYM YVQA+ G GGWPL+VFL+
Sbjct: 73  WCHVMEKESFMSPEVAAILNKSFVPIKLDREERPDIDEVYMNYVQAITGSGGWPLNVFLT 132

Query: 212 PDLKPLMGGTYFPPEDKYGRP--------GFKTILRKVKDAWDKKRDMLAQSGAFAIEQL 263
           PDL+P+ GGTY+P       P         F  IL K++D W+ ++    +S      QL
Sbjct: 133 PDLEPVFGGTYWPGPYSSSMPRVGGEEPITFIDILEKLRDVWNSQQLRCMESAKEITRQL 192

Query: 264 SEALSASASSNKLPDELPQNALRLCA-----EQLSKSYDSRFGGFGSAPKFPRPVEIQMM 318
            E  +   +  + P+   +  L L       +     YD   GGF  APKFP P  +  +
Sbjct: 193 RE-FAEEGTHLRRPETESEEDLELELLEEAHQHFVSRYDPINGGFSRAPKFPTPANLSFL 251

Query: 319 L----YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPH 374
           L    Y    ++  G+  E +   +MV  TL  MA+GGIHD +G GF RYSV   W +PH
Sbjct: 252 LRLGRYPDVVMDIVGRE-ECARATEMVSKTLLQMARGGIHDQIGHGFARYSVTPDWSLPH 310

Query: 375 FEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAE 433
           FEKMLYDQ QL +VY+D F +T++        DI+ Y+    ++ P G   S+EDADS  
Sbjct: 311 FEKMLYDQAQLLDVYVDCFEITQEPKLLEAVYDIIAYITSPPILSPEGAFHSSEDADSFP 370

Query: 434 TEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKN 492
               T K+EGAFYVWT KE++ ILG+  A +   H+ + P GN  ++R +DPH+EF  +N
Sbjct: 371 NSNDTEKREGAFYVWTLKEMQQILGQRDAEVCARHWGVLPDGN--VARGNDPHDEFINQN 428

Query: 493 VLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVIS 551
           VL         A   G+  ++ + ++   R+KL + R + R RP LDDK+IVSWNGL I 
Sbjct: 429 VLCIRASPRKIAKDFGLSEDEVVRVIKSSRKKLQEFRDEHRVRPDLDDKIIVSWNGLAIG 488

Query: 552 SFARASKIL-KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRN 610
           + A+ S +L K +AE A                VAE AA FIR +L+D +T +L   +R+
Sbjct: 489 ALAKCSLLLDKIDAERA-----------THCRRVAEKAAKFIRENLFDAETGQLWRVYRD 537

Query: 611 G-PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG-----GGY 664
           G   + PGF DDYA+L SGL+ LYE      +L +A  LQ   +  FL          GY
Sbjct: 538 GRRGETPGFGDDYAYLASGLISLYEATFDDSYLQFAENLQQYLNRYFLATASDGTTPAGY 597

Query: 665 F----NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 720
           +    N   + P  L R+K   D A PS N V   NL+RLAS++   + D Y+  A H+ 
Sbjct: 598 YMTPQNMPEDVPGPLFRLKTGTDAATPSTNGVIAQNLLRLASLL---EDDSYKALARHTC 654

Query: 721 AVFETRL 727
           + F   +
Sbjct: 655 SAFAAEM 661


>gi|303320203|ref|XP_003070101.1| hypothetical protein CPC735_032920 [Coccidioides posadasii C735
           delta SOWgp]
 gi|240109787|gb|EER27956.1| hypothetical protein CPC735_032920 [Coccidioides posadasii C735
           delta SOWgp]
          Length = 799

 Score =  438 bits (1126), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 266/667 (39%), Positives = 367/667 (55%), Gaps = 49/667 (7%)

Query: 92  ASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCH 151
           A+ +   ++  NRL+   SPY+  H +NPV W  W   A   A++ +  IFLSIGYS CH
Sbjct: 13  ATETAGPSRLVNRLSESRSPYVRGHMNNPVAWQLWDSAAINLAKRLNRLIFLSIGYSACH 72

Query: 152 WCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLS 211
           WCHVME ESF    VA +LN  FV IK+DREERPD+D+VYM YVQA+ G GGWPL+VFL+
Sbjct: 73  WCHVMEKESFMSPEVAAILNKSFVPIKLDREERPDIDEVYMNYVQAITGSGGWPLNVFLT 132

Query: 212 PDLKPLMGGTYFPPEDKYGRP--------GFKTILRKVKDAWDKKRDMLAQSGAFAIEQL 263
           PDL+P+ GGTY+P       P         F  IL K++D W+ ++    +S      QL
Sbjct: 133 PDLEPVFGGTYWPGPYSSSMPRVGGEEPITFIDILEKLRDVWNSQQLRCMESAKEITRQL 192

Query: 264 SEALSASASSNKLPDELPQNALRLCA-----EQLSKSYDSRFGGFGSAPKFPRPVEIQMM 318
            E  +   +  + P+   +  L L       +     YD   GGF  APKFP P  +  +
Sbjct: 193 RE-FAEEGTHLRRPETESEEDLELELLEEAHQHFVSRYDPINGGFSRAPKFPTPANLSFL 251

Query: 319 L----YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPH 374
           L    Y    ++  G+  E +   +MV  TL  MA+GGIHD +G GF RYSV   W +PH
Sbjct: 252 LRLGRYPDVVMDIVGRE-ECARATEMVSKTLLQMARGGIHDQIGHGFARYSVTPDWSLPH 310

Query: 375 FEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAE 433
           FEKMLYDQ QL +VY+D F +T++        DI+ Y+    ++ P G   S+EDADS  
Sbjct: 311 FEKMLYDQAQLLDVYVDCFEITQEPKLLEAVYDIIAYITSPPILSPEGAFHSSEDADSFP 370

Query: 434 TEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKN 492
               T K+EGAFYVWT KE++ ILG+  A +   H+ + P GN  ++R +DPH+EF  +N
Sbjct: 371 NSNDTEKREGAFYVWTLKEMQQILGQRDAEVCARHWGVLPDGN--VARGNDPHDEFINQN 428

Query: 493 VLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVIS 551
           VL         A   G+  ++ + ++   R+KL + R + R RP LDDK+IVSWNGL I 
Sbjct: 429 VLCIRASPRKIAKDFGLSEDEVVRVIKSSRKKLQEFRDEHRVRPDLDDKIIVSWNGLAIG 488

Query: 552 SFARASKIL-KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRN 610
           + A+ S +L K +AE A                VAE AA FIR +L+D +T +L   +R+
Sbjct: 489 ALAKCSLLLDKIDAERA-----------THCRRVAEKAAKFIRENLFDAETGQLWRVYRD 537

Query: 611 G-PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG-----GGY 664
           G   + PGF DDYA+L SGL+ LYE      +L +A  LQ   +  FL          GY
Sbjct: 538 GRRGETPGFGDDYAYLASGLISLYEATFDDSYLQFAENLQQYLNRYFLATASDGTTPAGY 597

Query: 665 F----NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 720
           +    N   + P  L R+K   D A PS N V   NL+RLAS++   + D Y+  A H+ 
Sbjct: 598 YMTPQNMPEDVPGPLFRLKTGTDAATPSTNGVIAQNLLRLASLL---EDDSYKALARHTC 654

Query: 721 AVFETRL 727
           + F   +
Sbjct: 655 SAFAAEM 661


>gi|420158002|ref|ZP_14664826.1| PF03190 family protein [Clostridium sp. MSTE9]
 gi|394755349|gb|EJF38596.1| PF03190 family protein [Clostridium sp. MSTE9]
          Length = 685

 Score =  438 bits (1126), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 260/652 (39%), Positives = 358/652 (54%), Gaps = 60/652 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N LA E SPYLLQHA NPVDWF WGE+AF +A++ D PIFLSIGYSTCHWCHVM  ESFE
Sbjct: 9   NHLAKEKSPYLLQHAENPVDWFPWGEQAFEKAKREDKPIFLSIGYSTCHWCHVMAHESFE 68

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+ VA+ LN  FV IKVDREERPD+D VYMT  QA+ G GGWP+++ ++P+ +P   GTY
Sbjct: 69  DDEVAEALNQGFVCIKVDREERPDIDAVYMTVCQAMTGSGGWPMTILMTPEQRPFWAGTY 128

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
            P    +   G   +L  +++ W   R  L  +G      L E    S  S K   +L  
Sbjct: 129 LPKMSTFRSTGLLELLAFIREQWSTNRQQLLNAGEEITNYLREQSGPSLGSAKPELDL-- 186

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             LR    QLS SYDSR+GGFG APKFP P  +  +L +S  + +  KS      Q M  
Sbjct: 187 --LRGAVAQLSASYDSRWGGFGGAPKFPAPHNLLFLLRYS--VLEREKS-----AQSMAE 237

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
           +TL  M +GG+ DH+GGGF RYS D +W VPHFEKMLYD   LA  YL+A+++T    Y 
Sbjct: 238 YTLSQMFRGGLFDHIGGGFSRYSTDVKWLVPHFEKMLYDNALLAYTYLEAYAVTGRPLYR 297

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHA 461
            + +  LDY+ R++    G  +  +DADS   +G     EG +YV+T +EV+ +LG E  
Sbjct: 298 SVAKRTLDYVLRELTDEQGGFYCGQDADS---DGV----EGKYYVFTPQEVQGVLGKEDG 350

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
            LF   + +   GN            F+GK++   L+ S+          E+  +I   C
Sbjct: 351 ELFCSRFGVTEAGN------------FEGKSIPNLLDFSAYD--------EEDPHIAQLC 390

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           +R L++ R +R R H DDKV+ SWN L+I++ A+A  +L                D  EY
Sbjct: 391 QR-LYEYRLERTRLHRDDKVLTSWNALMIAALAKAGWLL----------------DEPEY 433

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
           ++ A+ A  F+   L DE+  RL   +R G +   G LDDYAF    LL+LY       +
Sbjct: 434 LQAAQKAQRFLEEKLVDERG-RLLLRWREGEAANDGQLDDYAFYAFSLLELYRSSFDCTY 492

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           L+ A ++     ELF D E GG + T  +   ++ R KE +DGA PSGNSV+    VRLA
Sbjct: 493 LLRAAQIAEQILELFSDAEQGGLYLTAKDSEQLISRPKEVYDGAIPSGNSVAGEVFVRLA 552

Query: 702 SIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVV 753
           ++    +   +RQ  E  +      +K+      +   A   +  PS++ V 
Sbjct: 553 ALTGEER---WRQAGERQIRFLTGWIKEYPAGYGMSLIALSSVLYPSQELVC 601


>gi|115491785|ref|XP_001210520.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114197380|gb|EAU39080.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 787

 Score =  437 bits (1125), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 262/623 (42%), Positives = 353/623 (56%), Gaps = 37/623 (5%)

Query: 94  TSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWC 153
           TS    K  NRL    SPY+  H +NPV W  W  EA   AR+ +  +FLSIGYS CHWC
Sbjct: 16  TSDLGPKLVNRLRESRSPYVRAHMNNPVAWQLWDAEAINLARRYNRLVFLSIGYSACHWC 75

Query: 154 HVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPD 213
           HVME ESF  + VA +LN+ F+ IKVDREERPD+D VYM YVQA  G GGWPL+VFL+PD
Sbjct: 76  HVMEKESFMSQEVASILNESFIPIKVDREERPDIDDVYMNYVQATTGSGGWPLNVFLTPD 135

Query: 214 LKPLMGGTYFPPEDKYGRPGFKT-----ILRKVKDAWDKKRDMLAQSGAFAIEQL---SE 265
           L+P+ GGTY+P  +    PG +T     IL K++D W  ++    +S     +QL   +E
Sbjct: 136 LEPVFGGTYWPGPNATTNPGHETIGFVDILEKLRDVWQTQQQRCRESAKDITKQLREFAE 195

Query: 266 ALSASASSNKLPDE-LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML----Y 320
             + S   ++  DE L    L    +     YD+  GGF  APKFP P  +  +L    Y
Sbjct: 196 EGTHSYQGDRAADEDLDIELLEEAYQHFVSRYDTAHGGFSKAPKFPTPANLSFLLRLGVY 255

Query: 321 HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 380
            S  ++  GK  E      M + TL  MA+GGIHDH+G GF RYSV   W +PHFEKMLY
Sbjct: 256 PSAVVDVVGKE-ECENATAMAVNTLINMARGGIHDHIGHGFARYSVTADWGLPHFEKMLY 314

Query: 381 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAETEGATR 439
           DQ QL +VY+DAF +T +        D++ YL    +    G   S+EDADS      T 
Sbjct: 315 DQAQLLDVYIDAFKITHNPELLGAVYDLVTYLTTAPLQSSTGAFHSSEDADSLPMPNDTE 374

Query: 440 KKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 498
           K+EGAFYVWT KE+  +LG   A +   H+ + P GN  +S  +DPH+EF  +NVL    
Sbjct: 375 KREGAFYVWTLKELTQVLGSRDAGVCARHWGVLPDGN--ISPANDPHDEFMNQNVLSIKV 432

Query: 499 DSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARAS 557
             S  A + G+  ++ + IL   ++KL + R K R RP LDDK+IV+WNGL I + A+AS
Sbjct: 433 TPSKLAREFGLGEDEVVRILRSAKQKLREYREKNRVRPDLDDKIIVAWNGLAIGALAKAS 492

Query: 558 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP-SKAP 616
            +   + +S+M +         +  E A  A SFI+  L+++ T +L   +R+G     P
Sbjct: 493 ALF-DQIDSSMAS---------KCREAAARAVSFIKETLFEKSTGQLWRIYRDGSRGDTP 542

Query: 617 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG---GGYFNT----TG 669
           GF DDYA+L SGLL++YE      +L +A +LQ   +E FL   G    GY++T    T 
Sbjct: 543 GFADDYAYLTSGLLEMYEATFDDSYLQFAEQLQKYLNEKFLAYVGSTPAGYYSTPSTMTP 602

Query: 670 EDPSVLLRVKEDHDGAEPSGNSV 692
             P  LLR+K   + A PS N V
Sbjct: 603 GMPGPLLRLKTGTESATPSINGV 625


>gi|417766154|ref|ZP_12414108.1| PF03190 family protein [Leptospira interrogans serovar Bulgarica
           str. Mallika]
 gi|400351608|gb|EJP03827.1| PF03190 family protein [Leptospira interrogans serovar Bulgarica
           str. Mallika]
          Length = 691

 Score =  437 bits (1125), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 272/689 (39%), Positives = 379/689 (55%), Gaps = 69/689 (10%)

Query: 95  SHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCH 154
           S+SRN   NRL+ E SPYL QH++NPVDWF WGEEA  +A+ +D  IFLSIGY+TCHWCH
Sbjct: 3   SNSRN--PNRLSKEKSPYLQQHSYNPVDWFPWGEEALTKAKDQDKLIFLSIGYATCHWCH 60

Query: 155 VMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDL 214
           VME ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++FL+P+ 
Sbjct: 61  VMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNMFLTPEG 120

Query: 215 KPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSN 274
           +P+ GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  + A   
Sbjct: 121 QPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGESRAKEK 180

Query: 275 KLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKKLEDTGK 330
           +  D  P+N            YDS+FGGF +    KFP  + +  +L  YHS        
Sbjct: 181 QEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-------- 232

Query: 331 SGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYL 390
           SG  +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD      +  
Sbjct: 233 SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSLFLEILA 291

Query: 391 DAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTS 450
           +   ++K +       DI+ YL RDM    G IFSAEDADS   EG    +EG FY+W  
Sbjct: 292 EYSLVSKKISAESFALDIVSYLHRDMRMDEGGIFSAEDADS---EG----EEGLFYIWDL 344

Query: 451 KEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 510
           +E  ++ GE + L ++ + +   GN            F+GKN+L E    S    +    
Sbjct: 345 EEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFTEEELKQ 392

Query: 511 LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
           L+K    L + + KL + RSKR RP  DDK++ SWNGL I +  +               
Sbjct: 393 LDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------------- 436

Query: 571 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLL 630
              +   R++++++AE   SFI ++L D    R+   FR G S   G+ +DYA +I+  +
Sbjct: 437 ---IAFQREDFLKLAEETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAEMIASSI 492

Query: 631 DLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSG 689
            L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +DG EPS 
Sbjct: 493 VLFEAGRGVRYLQNAVLWMEEAIRLF--RSPAGVFFDTGIDGEVLLRRSVDGYDGVEPSA 550

Query: 690 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR 749
           NS    +LVRL+ +  G  SDYYR+ AE     F   L   A++ P +  A       S 
Sbjct: 551 NSSLAHSLVRLSFL--GVNSDYYREIAESIFLYFRKELYSYALSYPFLLSA-----YWSY 603

Query: 750 KH----VVLVGHKSSVDFENMLAAAHASY 774
           KH    +VL+  K+S + ++MLA   + +
Sbjct: 604 KHHFREIVLI-RKNSEEGKDMLAWIQSRF 631


>gi|304314907|ref|YP_003850054.1| hypothetical protein MTBMA_c11480 [Methanothermobacter marburgensis
           str. Marburg]
 gi|302588366|gb|ADL58741.1| conserved hypothetical protein [Methanothermobacter marburgensis
           str. Marburg]
          Length = 677

 Score =  437 bits (1125), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 241/604 (39%), Positives = 353/604 (58%), Gaps = 53/604 (8%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TN L  E SPYLLQHAHNPV+W+ WG+EAF  A + + PIFLSIGYSTCHWCHVM  ESF
Sbjct: 7   TNSLINEKSPYLLQHAHNPVNWYPWGDEAFQLAGEEEKPIFLSIGYSTCHWCHVMARESF 66

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           ED  +A +LN+ FV++KVDREERPD+D +YM   Q + G GGWPL++ ++P+ +P   GT
Sbjct: 67  EDPEIADILNENFVAVKVDREERPDIDAIYMKVCQMMTGTGGWPLTIIMTPEGEPFFAGT 126

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFPP+D+ G PG +TIL +V   W    D + ++    +  L +++   A ++KL  E  
Sbjct: 127 YFPPDDRGGVPGLRTILERVVLLWKNDPDGIVKTARDVVSALKKSV---AKASKLKPETV 183

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHSKKLEDTGKSGEASEGQKM 340
             A     E L +++D+R GGFGS  KFP P  I  +L YH ++ +D        E  +M
Sbjct: 184 DAAY----EYLRRNFDTRNGGFGSYQKFPTPHNIYFLLRYHLRRGDD--------EALRM 231

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
           V  TL+ M  GGI+D +G GFHRY+V+  W VPHFEKMLYDQ  +   YL+AF +T D  
Sbjct: 232 VNLTLRRMRYGGIYDQLGYGFHRYAVEPTWTVPHFEKMLYDQALILKAYLEAFQVTCDDL 291

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
           Y     +I++Y+  ++  P G  +SAED   AE+EG     EG +Y+W + E+ ++LG+ 
Sbjct: 292 YKKTALEIVEYVLGNLQSPEGAFYSAED---AESEGV----EGKYYLWRASEIREVLGDD 344

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
           A +   ++ +   GN           + +G+N+L  +      A +  + L++   I+  
Sbjct: 345 ANVVMRYFNVLEDGNF--------AGDVRGENIL-HIGSPWRVADEFNLTLDELNEIIEN 395

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            RR L + R +RP P LDDK++  WNGL++ + A   +IL SE                E
Sbjct: 396 ARRHLLERRMERPTPALDDKILTDWNGLMLGALAACGRILDSE----------------E 439

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
            +  AE    FI  +L+ +    L H +R+  +   G LDDYAFLI GLL+L++      
Sbjct: 440 ALAAAERCLKFIMDNLHVDG--ELLHRYRDSEAGIDGKLDDYAFLIWGLLELHDATFREG 497

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
           ++  A+EL  + ++ F   +GG Y     +DP +++R  +  DGA PSGNSV ++NL+RL
Sbjct: 498 YVEMALELSESLEDRFGAPDGGFYLT---DDPKLIVRPMDATDGAIPSGNSVQMLNLLRL 554

Query: 701 ASIV 704
             I+
Sbjct: 555 GGIL 558


>gi|448363039|ref|ZP_21551643.1| hypothetical protein C481_13364 [Natrialba asiatica DSM 12278]
 gi|445647661|gb|ELZ00635.1| hypothetical protein C481_13364 [Natrialba asiatica DSM 12278]
          Length = 717

 Score =  437 bits (1124), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 255/649 (39%), Positives = 352/649 (54%), Gaps = 49/649 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W E A   AR+ DVPIFLSIGYS CHWCHVM  ESF 
Sbjct: 8   NRLEDEESPYLRQHADNPVNWQPWDERALETAREHDVPIFLSIGYSACHWCHVMADESFA 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA  LN+ FV IKVDREERPD+D +YMT  Q + G GGWPLS +L+P+ KP   GTY
Sbjct: 68  DEAVAAELNEHFVPIKVDREERPDIDSIYMTVCQLVTGRGGWPLSAWLTPEGKPFYVGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLA----QSGAFAIEQLSEALSASASSNKLPD 278
           FP E K G+PGF  +L  V ++W+  R+ +     Q  A A ++L E   A  +S     
Sbjct: 128 FPREAKRGQPGFLDVLENVTNSWESDREEIENRADQWTAAATDRLEETPDAVGASQP--- 184

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDTGKSGEASEG 337
               + L   A    +S D  FGGFGS  PKFP+P  ++++   ++  + TG+     E 
Sbjct: 185 -PSSDVLEAAANASLRSADREFGGFGSDGPKFPQPSRLRVL---ARATDRTGR----DEF 236

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
            ++++ TL  MA GG++DHVGGGFHRY VD  W VPHFEKMLYD  ++   +L  +  T 
Sbjct: 237 SEVLVETLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKMLYDNAEIPRAFLLGYQQTG 296

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           D  Y+ +  + LD++ R++    G  FS  DA S + E   R +EGAFYVWT  EVE  +
Sbjct: 297 DERYAEVVAETLDFVERELTHDAGGFFSTLDAQSEDPETGER-EEGAFYVWTPDEVEAAV 355

Query: 458 GEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
            +   A LF+  Y +  +GN            F+G N    +      A +  +P ++  
Sbjct: 356 TDETDAELFRSRYDITQSGN------------FEGTNQPNRVASIDELADRFDLPADEVE 403

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
           + L   RR LF  R +RPRP+ D+KV+  WNGL+I++ A A+ +L              G
Sbjct: 404 DRLESARRDLFQAREQRPRPNRDEKVLAGWNGLMIATCAEAALVL--------------G 449

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
            D  +Y E+A  A +F+R  L+D    RL   +++      G+L+DYAFL  G L  YE 
Sbjct: 450 ED--DYAEMATDALAFVRERLWDGDEKRLSRRYKDDDVAIDGYLEDYAFLARGALGCYEA 507

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
                 L +A+EL    +  F D   G  + T     S++ R +E  D + PS   V+V 
Sbjct: 508 TGEVDHLAFALELARVIEAEFWDEAQGTLYFTPESGESLVTRPQELGDQSTPSAAGVAVE 567

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 744
            L++L    AG   ++ R  A   L     RL+  ++    +C AAD L
Sbjct: 568 TLLQLDGF-AGESGEFERI-ATTVLETHANRLETNSLEHATLCLAADRL 614


>gi|357632813|ref|ZP_09130691.1| hypothetical protein DFW101_0683 [Desulfovibrio sp. FW1012B]
 gi|357581367|gb|EHJ46700.1| hypothetical protein DFW101_0683 [Desulfovibrio sp. FW1012B]
          Length = 737

 Score =  437 bits (1123), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 259/610 (42%), Positives = 338/610 (55%), Gaps = 42/610 (6%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRL  E SPYL QHAHNPVDW+ WGEEAFA AR  D PIFLSIGYSTCHWCHVME ESF
Sbjct: 34  ANRLITEKSPYLQQHAHNPVDWYPWGEEAFALARAEDKPIFLSIGYSTCHWCHVMEHESF 93

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           EDE +A L+    V++KVDREERPD+D +YMT+ QAL G GGWPL+VFL+PD +P   GT
Sbjct: 94  EDEDIAALMRATVVAVKVDREERPDLDNLYMTFCQALTGRGGWPLNVFLTPDGQPFFAGT 153

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA-SASSNKLPDEL 280
           YFP E  +GR G + +L++V  AW   R  +  +    ++ +   L A  A     P E 
Sbjct: 154 YFPKESGFGRTGMRELLQRVHMAWTSNRQAVIGNATQILDAVRSQLEARDAGETAEPGEA 213

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
             +A R    +L+ +YD+  GGFG APKFP P     +L+  ++   TG+     E   M
Sbjct: 214 QLDAAR---NELAAAYDAANGGFGGAPKFPSP---HNLLFLLREFRRTGR----EENLAM 263

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
           V  TL  M +GG+ D +G G HRYS D  W VPHFEKMLYDQ   A    +A+  T D  
Sbjct: 264 VTATLDAMRRGGVFDQIGLGLHRYSTDAHWFVPHFEKMLYDQALTAMAATEAYLATGDAE 323

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL-GE 459
           +  + RDI +Y+ RD+ GP G  +SAEDADS   EG     EG FYVWT  E+  +L G+
Sbjct: 324 WRRMARDIFEYVHRDLTGPDGAFYSAEDADS---EGV----EGKFYVWTESEIRAVLAGD 376

Query: 460 HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
            A LF + Y + P GN       +   +  G N+       +A A K G+   +  + L 
Sbjct: 377 EAGLFMDVYGIAPGGNFH----DEATGQATGANIPFLEEPIAAVAGKKGLGPAELASRLE 432

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
             R  L   R KR RP  DDKV+   NGL+I++ A+A++                  D +
Sbjct: 433 RSRELLLAARQKRVRPLCDDKVLTDMNGLMIAALAKAARAF----------------DDE 476

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
           E    A+ A+ F+   +    + RL H  R G +   G LDDYAFL  GLL+LY+     
Sbjct: 477 ELAGRAKRASDFLLAKMLLPDS-RLLHRLRLGEAAVTGMLDDYAFLAWGLLELYQTVFDP 535

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
            +L  A+ L       F D   GG F T  +  ++LLR K  +D A PSGNSV+ + L  
Sbjct: 536 AYLAQAVALAKAMVRHFGD-AAGGLFLTPDDGEALLLRQKTYYDAAIPSGNSVAFLVLTT 594

Query: 700 LASIVAGSKS 709
           L  +  G KS
Sbjct: 595 LYRLT-GEKS 603


>gi|448301393|ref|ZP_21491386.1| hypothetical protein C496_17562 [Natronorubrum tibetense GA33]
 gi|445584129|gb|ELY38453.1| hypothetical protein C496_17562 [Natronorubrum tibetense GA33]
          Length = 788

 Score =  437 bits (1123), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 248/647 (38%), Positives = 351/647 (54%), Gaps = 43/647 (6%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W ++A  EAR+RDVPIFLSIGYS CHWCHVME ESF 
Sbjct: 71  NRLDEEESPYLRQHADNPVNWQPWDDQALEEARERDVPIFLSIGYSACHWCHVMEDESFA 130

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA LLN+ FV IKVDREERPDVD +YMT  Q + G GGWPLS +L+P  KP   GTY
Sbjct: 131 DEEVADLLNENFVPIKVDREERPDVDSIYMTVAQLVTGRGGWPLSAWLTPQGKPFYVGTY 190

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP E K G+PGF  +L ++ ++W++ RD +        +   + L  +  S    +    
Sbjct: 191 FPKEAKRGQPGFLDVLEQLANSWEQDRDEVENRAQQWTDAAKDRLEETPDSVAQAEPPSS 250

Query: 283 NALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
             L   A+   +S D + GGFGS  PKFP+P  + ++   ++  + TG+     + ++++
Sbjct: 251 EVLTTAADAALRSADRQHGGFGSGGPKFPQPSRLHVL---ARAYDRTGR----EQFREVL 303

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             +L  MA GG++DHVGGGFHRY VD  W VPHFEKMLYD  ++   +L  + LT D  Y
Sbjct: 304 EESLDAMAAGGLYDHVGGGFHRYCVDADWTVPHFEKMLYDNAEIPRAFLAGYQLTGDDRY 363

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH- 460
           + +  + L+++ R++    G  FS  DA S   +G   K+EG FYVWT  E+ ++L E  
Sbjct: 364 AEVTAETLEFVDRELTHEEGGFFSTLDAQSKTEDG--EKEEGVFYVWTPDEISEVLEEET 421

Query: 461 -AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
            A LF   Y +  +GN            F+G N    +      A +  +  +     L 
Sbjct: 422 DAELFCARYDITESGN------------FEGTNQPNRVRSIPDLADEFDLAEDDTEQRLE 469

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
             R+ LF+ R +RPRP+ D+KV+ SWNGL+I++ A A+ +L              G D  
Sbjct: 470 SARKALFEARERRPRPNRDEKVLASWNGLLINTCAEAALVL--------------GED-- 513

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
           EY E+   A  F+R  L+D    RL   +++G  K  G+L+DYAFL  G L  YE     
Sbjct: 514 EYAEMGVDALDFVRERLWDADEGRLARRYKDGDVKVDGYLEDYAFLARGALRCYEATGDV 573

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
             L +A++L  T +  F D E G  + T     S++ R +E  D + PS   V++  L+ 
Sbjct: 574 DHLAFALDLARTIEAEFWDEERGTLYFTPESGESLVTRPQELDDQSTPSATGVALETLLA 633

Query: 700 LASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 746
           L    A      + + A   L     R++  ++    +C AAD L  
Sbjct: 634 LDGFAADEN---FEKIASTVLETHANRIEANSLQHASLCLAADRLEA 677


>gi|302390271|ref|YP_003826092.1| hypothetical protein Toce_1734 [Thermosediminibacter oceani DSM
           16646]
 gi|302200899|gb|ADL08469.1| conserved hypothetical protein [Thermosediminibacter oceani DSM
           16646]
          Length = 670

 Score =  436 bits (1122), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 258/671 (38%), Positives = 364/671 (54%), Gaps = 70/671 (10%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           +  NRL  E SPYLLQHA+NPVDW+ WG EAF +A+  +  IFLSIGYSTCHWCHVME E
Sbjct: 8   RKPNRLINEKSPYLLQHAYNPVDWYPWGTEAFEKAKTENKLIFLSIGYSTCHWCHVMEKE 67

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFEDE V  +LN ++VSIKVDREE PDVD  YM   QAL G GGWPL++ ++PD  P+  
Sbjct: 68  SFEDEEVGNILNRYYVSIKVDREEHPDVDNFYMEVCQALTGSGGWPLTIIMTPDKHPVFA 127

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
            TY P ED YGRPG KT+L K+ + W K R+ L  +G   +  + +             E
Sbjct: 128 ATYLPKEDSYGRPGLKTVLFKINELWQKDRERLITTGREIVSSIKKLERTGHG------E 181

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEG 337
           L    +    E L  SYD ++GGF  APKFP P  +  +L  YH +K           E 
Sbjct: 182 LDPGVIDKAFEILKASYDRKYGGFFGAPKFPMPGTLLFLLGYYHYRK---------DPEA 232

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
            +MV  TL+ M KGGI+DH+G G  RYS D RW VPHFEKMLYD   ++ V  +A+ + +
Sbjct: 233 LEMVENTLKNMYKGGIYDHIGFGLCRYSTDRRWLVPHFEKMLYDNALVSFVCAEAYKIAR 292

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           D F+     +I+DY+ R++  P G  ++AEDADS   EG    +EG FY WT +E+  +L
Sbjct: 293 DEFFKTFALEIIDYVLRNLRNPEGGFYTAEDADS---EG----EEGRFYTWTPQEIRHVL 345

Query: 458 GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
           G+ A  F E Y +   GN            F+GKN+           + +G  L   ++ 
Sbjct: 346 GDRADEFMESYNITERGN------------FEGKNI----------PNLIGRDLSCKMD- 382

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
             + R+KLF+ R +R +P  D+K++VS N L+I+S  R   I K+E              
Sbjct: 383 -EDTRKKLFEYREQRVKPFRDEKILVSGNSLMIASLFRVYGITKNE-------------- 427

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGS 637
              Y + AE A +FI  +       RL   +R G  KA    DDY+ L+  LL+ YE+  
Sbjct: 428 --NYRKEAEVALNFILENARGSDG-RLHVGYREGIMKAKATFDDYSHLLWALLEAYEYTL 484

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
            T +L  A  L +   +LF D+E GG++ T  +   +  R K+ +DGA PSGNS++  +L
Sbjct: 485 ETSYLKKAKSLADEMIDLFYDKEAGGFYLTGSDVDHLPARAKDAYDGAVPSGNSMAAFSL 544

Query: 698 VRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGH 757
            RL+ ++  S  +   + A +   VF   + +  +       +  + +V     V++ G 
Sbjct: 545 ARLSRLLFDSGME---ELARNQYRVFARTISENPVYHTFFLYSF-IYAVTGGTEVIIAGE 600

Query: 758 KSSVDFENMLA 768
           +  + F N LA
Sbjct: 601 RPEM-FTNYLA 610


>gi|407768088|ref|ZP_11115467.1| hypothetical protein TH3_01375 [Thalassospira xiamenensis M-5 = DSM
           17429]
 gi|407288801|gb|EKF14278.1| hypothetical protein TH3_01375 [Thalassospira xiamenensis M-5 = DSM
           17429]
          Length = 683

 Score =  436 bits (1122), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 256/685 (37%), Positives = 372/685 (54%), Gaps = 69/685 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L +E SPYLLQH  NPV W  W  E  A A+  + P+ LS+GY+ CHWCHVM  ESFE
Sbjct: 6   NNLGSETSPYLLQHRDNPVHWQPWSTEVLAAAKAANKPVLLSVGYAACHWCHVMAHESFE 65

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+G+A L+N+ FV+IK+DREERPD+D VY   +  L   GGWPL++FL+PD +P  GGTY
Sbjct: 66  DDGIAALMNELFVNIKLDREERPDLDSVYQNALALLGQQGGWPLTMFLTPDGEPFWGGTY 125

Query: 223 FPPEDKYGRPGFKTILRKVKDAW----DKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
           FP E +YGRPGF  +L+ V + +    D  R  +AQ G  A+ +++   + S  S  + D
Sbjct: 126 FPKEARYGRPGFGDVLKSVSEIYTQQPDNIRHNVAQIGQ-ALIKMNSGATGSMPSLAMID 184

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
           +        C     +  D   GG   APKFP+P  + ++     +  DT       + +
Sbjct: 185 Q--------CGHGCLQIMDGENGGTNGAPKFPQPSILALIWRVGVRTNDT-------DLK 229

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
           ++V  +L  M +GGI+DHVGGGF RY+VD++W VPHFEKMLYD  QL ++  D +  T +
Sbjct: 230 RIVRHSLDRMCQGGIYDHVGGGFARYAVDDQWLVPHFEKMLYDNAQLIDLLCDVWRETGN 289

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             Y     + +D++ RDM  PGG   ++ DADS   EG     EG FYVW   E+  ILG
Sbjct: 290 PLYEARISETIDWILRDMRVPGGAFAASLDADS---EGV----EGKFYVWDEAEINAILG 342

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
             A LFK+ Y + P+GN            ++ KN+L      + + S LG+        L
Sbjct: 343 NDAALFKDIYDVSPSGN------------WEHKNIL------NRTQSGLGLADRTTEKKL 384

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
            E R KL  VR+KR  P  DDK +  WN + I++ A A+ + K                R
Sbjct: 385 SETRTKLLAVRNKRIWPGWDDKALTDWNAMTIAALAEAAMVFK----------------R 428

Query: 579 KEYMEVAESAASFIRRHLYDEQTH--RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
            ++++ A+ A +F+   L   +++  R  HS+RNG ++  G L+DYA +I   L LYE  
Sbjct: 429 ADWLDYAKLAYNFVINSLMTGESNDRRFLHSYRNGKAQHAGMLEDYAHMIRAALRLYECF 488

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
               +L  A E     + LF D + GGYF +  +   +++R K   D A P+GNSV   N
Sbjct: 489 GEDAYLREATEWCEAVENLFADTK-GGYFQSASDADDLVVRQKPHMDNAVPAGNSVMAQN 547

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVG 756
           L RL ++   +K   YR  AE ++A F  RL +    +P +  AA+ML  P +  +VL+ 
Sbjct: 548 LARLYALTGDTK---YRDRAEITIAAFAGRLNEQFPNMPGLLLAAEMLQNPLQ--IVLIA 602

Query: 757 HKSSVDFENMLAAAHASYDLNKTVS 781
            + S  +  M  A  A+Y  N+ ++
Sbjct: 603 KERSQMYMEMRRAIFAAYLPNRAIT 627


>gi|239906990|ref|YP_002953731.1| hypothetical protein DMR_23540 [Desulfovibrio magneticus RS-1]
 gi|239796856|dbj|BAH75845.1| hypothetical protein [Desulfovibrio magneticus RS-1]
          Length = 697

 Score =  436 bits (1122), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 263/676 (38%), Positives = 355/676 (52%), Gaps = 41/676 (6%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           N+  NRL+ E SPYLLQHAHNPVDWF WGEEAFA+AR  D P+ LSIGYSTCHWCHVME 
Sbjct: 3   NRAPNRLSREKSPYLLQHAHNPVDWFPWGEEAFAKARAEDKPVLLSIGYSTCHWCHVMER 62

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE +A L+N   VS+KVDREERPD+D +YM+   AL G GGWPL+VFL+PD +P  
Sbjct: 63  ESFEDEDIAALMNAVVVSVKVDREERPDLDALYMSVCHALTGRGGWPLTVFLTPDKEPFF 122

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFP E  YGR G + +L++V   W   R  +  +    ++ + E L+A+A +     
Sbjct: 123 AGTYFPKESAYGRTGLRELLQRVHMFWKGNRQAVVNNAGQIMDAVREQLAAAAGTASA-- 180

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
           E  Q AL     QL+  +D+R GGFG APKFP P  +  +L   ++  D          +
Sbjct: 181 EPGQAALDAARTQLAGIFDARNGGFGGAPKFPSPHNLLFLLREYRRTGDV-------SCR 233

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
            M   TL  M +GG++D VG G HRY+ D  W +PHFEKMLYDQ       ++A+  + D
Sbjct: 234 DMACRTLVAMRRGGVYDQVGFGLHRYATDAHWFLPHFEKMLYDQALTVMACVEAYQASGD 293

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
           V +  +  +IL+Y+RRD+  P G  +SAEDADS   EG     EG FYVW++ E+  +LG
Sbjct: 294 VAHKTMALEILEYVRRDLTSPEGLFYSAEDADS---EGV----EGKFYVWSAAELRRLLG 346

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
           + A L          GN       +   E  G N+L        +A++LG+  E     L
Sbjct: 347 DEAALIMAAMGATEEGNAH----DEATGETTGANILHLPRPLDETAARLGLTAEILAERL 402

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
             CR  L   R KR RP  DDKV+   NGL++++ A+A++    E  +            
Sbjct: 403 EACRHVLLAEREKRVRPLCDDKVLTDNNGLMLAALAKAARAFDDEDLAG----------- 451

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
              +  AE+  S + R     Q  RL H  R+  +   G LDDY FL  GL++LY+    
Sbjct: 452 -RAVTAAEALLSRLAR-----QNGRLLHRLRDDEAAIDGLLDDYVFLAWGLVELYQTVFD 505

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
           T +L  A+EL     E F D   GGYF    +   +L+R K   D A PSGNSV+   L 
Sbjct: 506 TAYLRRAVELMKAVAEHFADPNEGGYFLAPDDGEQLLVRQKIFFDAAVPSGNSVAYFVLT 565

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHK 758
            L  +        +++ A         RL D A       C    + +     V L G  
Sbjct: 566 TLFRLTGDPA---FKEQATALARAMAPRLADHAAGYAFFLCGLSQV-LGQASEVTLAGDP 621

Query: 759 SSVDFENMLAAAHASY 774
           +  D + +  A    Y
Sbjct: 622 AGPDTQTLARAIFERY 637


>gi|70995702|ref|XP_752606.1| DUF255 domain protein [Aspergillus fumigatus Af293]
 gi|19309415|emb|CAD27314.1| hypothetical protein [Aspergillus fumigatus]
 gi|41581314|emb|CAE47963.1| hypothetical protein, conserved [Aspergillus fumigatus]
 gi|66850241|gb|EAL90568.1| DUF255 domain protein [Aspergillus fumigatus Af293]
          Length = 799

 Score =  436 bits (1121), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 268/682 (39%), Positives = 365/682 (53%), Gaps = 55/682 (8%)

Query: 86  MAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSI 145
           M  +T   ++    K  NRL    SPY+  H +NPV W  W  EA   AR+ +  IFLSI
Sbjct: 1   MHSQTHLGSADHEPKLVNRLRDSRSPYVRAHMNNPVAWQLWDAEAIELARRYNRLIFLSI 60

Query: 146 GYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWP 205
           GYS CHWCHVME ESF  + VA LLN+ F+ IKVDREERPD+D VYM YVQA  G GGWP
Sbjct: 61  GYSACHWCHVMEKESFMSQEVASLLNESFIPIKVDREERPDIDDVYMNYVQATTGSGGWP 120

Query: 206 LSVFLSPDLKPLMGGTYFPPED-----KYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAI 260
           LSVFL+P+L+P+ GGTY+P  +     +    GF  IL K++D W  ++     S     
Sbjct: 121 LSVFLTPNLEPVFGGTYWPGPNSSTLSRQDTVGFVDILEKLRDVWKTQQQRCLDSAKEIT 180

Query: 261 EQL----SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQ 316
            QL     E   +     +  ++L    L    +  +  YD+  GGF  APKFP P  + 
Sbjct: 181 RQLREFAEEGTHSQQGDRQAGEDLDIELLEEAYQHFASRYDTVNGGFSRAPKFPTPANLS 240

Query: 317 MML---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVP 373
            +L    +   + D     E      M + TL  MA+GGI DH+G GF RYSV   W +P
Sbjct: 241 FLLRLKTYPSAVSDIVGQEECDRAAAMAVSTLISMARGGIRDHIGHGFARYSVTADWSLP 300

Query: 374 HFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSA 432
           HFEKMLYDQ QL +VY+DAF +T +        D+  YL    I  P G   S+EDADS 
Sbjct: 301 HFEKMLYDQAQLLDVYVDAFKITHNPELLGAVYDLATYLTTAPIQSPVGAFHSSEDADSL 360

Query: 433 ETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGK 491
            T   T K+EGAFYVWT KE+  +LG+  A +   H+ + P GN  ++   DPH+EF  +
Sbjct: 361 PTPNDTEKREGAFYVWTLKELTQVLGQRDAGVCARHWGVLPDGN--IAPEHDPHDEFMNQ 418

Query: 492 NVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVI 550
           NVL      S  A + G+  E+ + I+   ++KL + R K R RP LDDKVIV+WNGL I
Sbjct: 419 NVLSIKVTPSKLAREFGLSEEEVVKIIKSAKQKLREYREKTRVRPDLDDKVIVAWNGLAI 478

Query: 551 SSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRN 610
            + A+ S + + E ES         S   +  E A  A +FI+ +L+++ T +L   +R+
Sbjct: 479 GALAKCSALFE-EIES---------SKAVQCREAAARAINFIKENLFEKATGQLWRIYRD 528

Query: 611 GP-SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQN-------------TQDELF 656
           G   + PGF DDYA+LI GLLD+YE      +L +A +LQ+             TQ E  
Sbjct: 529 GSRGETPGFADDYAYLIHGLLDMYEATYDDSYLQFAEQLQSMFHDRGSFGRTILTQAEYL 588

Query: 657 LDR-------EGGGYFNT----TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVA 705
            D           GY++T    T   P  LLR+K   + A PS N V   NL+RL++++ 
Sbjct: 589 NDNFLAYVGSTPAGYYSTPSTMTPGMPGPLLRLKTGTESATPSINGVIARNLLRLSALL- 647

Query: 706 GSKSDYYRQNAEHSLAVFETRL 727
             + + YR  A  +   F   +
Sbjct: 648 --EEEEYRTLARQTCLSFSVEI 667


>gi|404329401|ref|ZP_10969849.1| hypothetical protein SvinD2_04859 [Sporolactobacillus vineae DSM
           21990 = SL153]
          Length = 731

 Score =  436 bits (1121), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 255/607 (42%), Positives = 342/607 (56%), Gaps = 67/607 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L  E SPYLLQHA NPV+W  W   AF +A++   P+ +SIGYS CHWCHVM  ESFE
Sbjct: 49  NWLIKEKSPYLLQHATNPVNWLPWTPAAFQKAKREGKPVLVSIGYSACHWCHVMAGESFE 108

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+  A LLN+ +VSIKVDREERPD+D VYM   Q L G GGWPL+VFL+PD  P   GTY
Sbjct: 109 DQETAALLNENYVSIKVDREERPDIDAVYMKVCQTLTGQGGWPLNVFLTPDQTPFYAGTY 168

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS-ASASSNKLPDELP 281
           FP    YG P FK +LR++K  +D+  D +A  G+    Q+  AL+  S S  KL DE  
Sbjct: 169 FPLHAAYGHPAFKDVLRELKKQYDQNPDKIAAIGS----QIMTALAKQSRSGRKLTDE-- 222

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
              +R   E LS+++D RFGGFG APKFP P ++  +L        TGK     +   M 
Sbjct: 223 --TVRKAYEALSENFDPRFGGFGDAPKFPAPHQLIFLLRFGSL---TGK----KQAMDMA 273

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
           + TL+ +A+GGI DH+GGGF RY+ D +W VPHFEKMLYDQ  LA  + +A+  T +  +
Sbjct: 274 VRTLRALAEGGIRDHIGGGFCRYATDRQWQVPHFEKMLYDQAMLAAAFTEAYQATGEAAF 333

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHA 461
             +   I DY  RD++ P G  + +EDADS   EG    +EG +Y+W   EV  +LG  A
Sbjct: 334 RDVVATIFDYCERDLLSPAGGFYCSEDADS---EG----EEGKYYLWNPGEVRAVLGADA 386

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN-ILGE 520
            LF E Y++   GN      S PH    G ++        A A+ L +P    LN  L  
Sbjct: 387 GLFCEVYHITDAGN--FHGQSIPH--LSGSDL-----GRIAEANHLSLPA---LNQQLAA 434

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R KLF  R KR  P  DDK++ SWN L+I+  A A ++L +                K 
Sbjct: 435 SRHKLFAARQKRVHPFKDDKILTSWNALMIAVLAEAGRVLHN----------------KH 478

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           Y+ +A+S   FI  HL  + T  L   +R+  ++   +LDDYAFL      +YE      
Sbjct: 479 YVNLAKSCFHFIDTHLVQDST--LLARYRDEEARFSAYLDDYAFLTLACEAMYEATFDLT 536

Query: 641 WL----VWAIELQNTQDELFLDREGGGYFNTTGEDP--SVLLRVKEDHDGAEPSGNSVSV 694
           +L    VW   +       F+DRE GG+F    E+P  ++++R KE +D A PSGNS +V
Sbjct: 537 YLEKMKVWGDRMTGR----FMDREHGGFFM---EEPQSTLIIRNKEAYDSAVPSGNSAAV 589

Query: 695 INLVRLA 701
           + L+RL+
Sbjct: 590 LALLRLS 596


>gi|448352262|ref|ZP_21541053.1| hypothetical protein C484_22028 [Natrialba taiwanensis DSM 12281]
 gi|445631642|gb|ELY84871.1| hypothetical protein C484_22028 [Natrialba taiwanensis DSM 12281]
          Length = 717

 Score =  436 bits (1121), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 251/649 (38%), Positives = 347/649 (53%), Gaps = 49/649 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYL QHA NPV+W  W E A   AR+ DVPIFLSIGYS CHWCHVM  ESF 
Sbjct: 8   NRLADEESPYLRQHADNPVNWQPWDERALETAREHDVPIFLSIGYSACHWCHVMADESFA 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA  LN+ FV IKVDREERPD+D +YMT  Q + G GGWPLS +L+P+ KP   GTY
Sbjct: 68  DEAVAAQLNEHFVPIKVDREERPDIDSIYMTVCQLVTGRGGWPLSAWLTPEGKPFYVGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLA----QSGAFAIEQLSEALSASASSNKLPD 278
           FP E K G+PGF  IL  V ++W+  R+ +     Q  A A ++L E   A  +S     
Sbjct: 128 FPREAKRGQPGFLEILENVTNSWENDREEIETRADQWTAAATDRLEETPDAVGASQP--- 184

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDTGKSGEASEG 337
               + L   A    +S D  FGGFGS  PKFP+P  ++++   ++  + TG+     E 
Sbjct: 185 -PSSDVLEAAANASLRSADREFGGFGSDGPKFPQPSRLRVL---ARAADRTGR----DEF 236

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
             +++ TL  MA GG++DHVGGGFHRY VD  W VPHFEKMLYD  ++   +L  +  T 
Sbjct: 237 SDVLVETLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKMLYDNAEIPRAFLLGYQQTG 296

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           D  Y+ +  + LD++ R+++   G  FS  DA S   E   R +EGAFYVWT  +V D+L
Sbjct: 297 DERYAEVVAETLDFVERELMHEAGGFFSTLDAQSEAPETGER-EEGAFYVWTPDDVRDVL 355

Query: 458 GEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
            +   A LF   Y +  +GN            F+G N    +      A +  +P ++  
Sbjct: 356 ADETDAELFCSRYDITESGN------------FEGTNQPNRVASIDELADRFDLPTDEVE 403

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
             L   R   F  R +RPRP+ D+KV+  WNGL+I++ A A+ +L               
Sbjct: 404 ERLDSARETAFQAREQRPRPNRDEKVLAGWNGLMIATCAEAALVLG-------------- 449

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
             + +Y E+A  A +F+R  L+D    RL   +++      G+L+DYAFL  G L  YE 
Sbjct: 450 --KDDYAEMATDALAFVRDRLWDADEKRLSRRYKDDDVAIDGYLEDYAFLARGALGCYEA 507

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
                 L +A+EL    +  F D   G  + T     S++ R +E  D + PS   V+V 
Sbjct: 508 TGEVDHLAFALELARVIEAEFWDEAQGTLYFTPESGESLVTRPQELGDQSTPSAAGVAVE 567

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 744
            L+ L       ++D + + A   L     RL+  ++    +C AAD L
Sbjct: 568 TLLELDGFAG--ETDEFERIATTVLETHANRLETNSLEHATLCLAADRL 614


>gi|441505288|ref|ZP_20987276.1| Thymidylate kinase [Photobacterium sp. AK15]
 gi|441427143|gb|ELR64617.1| Thymidylate kinase [Photobacterium sp. AK15]
          Length = 732

 Score =  436 bits (1121), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 270/713 (37%), Positives = 383/713 (53%), Gaps = 60/713 (8%)

Query: 86  MAERTPASTSHSRNK--------HTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKR 137
           MAE  P   S    K        + NRL  E SPYLLQHA NPVDW+ W +EAF +A+  
Sbjct: 1   MAEHHPEIPSEDELKKLPPDGGGYWNRLVFEQSPYLLQHAANPVDWYPWSDEAFEKAKSE 60

Query: 138 DVPIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQA 197
           D PIFLSIGY+TCHWCHVME ESFED  VA LLN  FV+IKVDREERPD+D+++M   Q+
Sbjct: 61  DKPIFLSIGYATCHWCHVMERESFEDTEVAALLNRDFVAIKVDREERPDIDQLHMAACQS 120

Query: 198 LYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGA 257
           + GGGGWPL+  L+P+ +     TY P + +YGRPG   ++  +  AW K+RD+L  +GA
Sbjct: 121 MTGGGGWPLNCVLTPEGQVFYATTYLPKQGQYGRPGMMELIPTIALAWQKQRDVLL-NGA 179

Query: 258 FAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQM 317
             + +  +ALS  +++  L + +   A  L  EQ   ++D   GGFG APKFP P +   
Sbjct: 180 IQLNKQLQALSGVSAAGVLDENIEHQAY-LWFEQ---TFDPEHGGFGDAPKFPLPHQYFF 235

Query: 318 MLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEK 377
           +L +  +   TG+    S    MV  +LQ M  GG+ DH+G GFHRYS D  W VPHFEK
Sbjct: 236 LLRYWYR---TGQRQALS----MVEESLQAMRLGGLFDHIGYGFHRYSTDNCWLVPHFEK 288

Query: 378 MLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGA 437
           MLYDQ  L   Y +A++ T + FY     ++++YL+  M+ P G  FSAEDADS   EG 
Sbjct: 289 MLYDQSLLLMAYSEAYAATGNEFYKQTAEEVVEYLKSRMLHPDGGFFSAEDADS---EG- 344

Query: 438 TRKKEGAFYVWTSKEVEDILGEHAILF-KEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 496
              +EG FY+W  +E++ +L E  + + ++HY + P GN     + +      G N+L  
Sbjct: 345 ---EEGKFYIWRYEELKAVLEESELTWLEQHYCIFPQGN----YVDEVSGRMTGANILHL 397

Query: 497 LNDSSASASKLG------MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVI 550
                 SA K G         E + N     R+KL+  R +R  P LDDKV+  WNGL I
Sbjct: 398 SMHPLVSADKKGKVDHDKATPECWRNQWQLIRQKLYQHRERREHPLLDDKVLSDWNGLTI 457

Query: 551 SSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRN 610
           ++ AR S ++                D  + +E+A  A  FIR +L DE +H L   +RN
Sbjct: 458 AALARCSLLI----------------DSSDCLEMARKAFEFIRLNLVDENSH-LMKRYRN 500

Query: 611 GPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGE 670
           G +  P  LDDYA LI   L+L++      +L  A+       + F D +  G++ T   
Sbjct: 501 GNAGLPAHLDDYASLIWAALELHQATLNNDYLQQALNWTEMAVDKFWDSDNHGFYFTEA- 559

Query: 671 DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDM 730
           +  + +R KE +DGA PSGN+V   NL  L  +   S+   ++      +A F  +L   
Sbjct: 560 NTDLAVRAKEIYDGAIPSGNAVMARNLAFLYRLTGESR---WQTKFNKLIAAFAPQLNRY 616

Query: 731 AMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVSKK 783
                L+  A D+++ P  +H++  G   + D    L   +    L   V+ K
Sbjct: 617 PAGYTLLLTAVDLMNSPG-QHLLFSGAGVAEDILRPLKGKYLPNTLWLAVNDK 668


>gi|87310211|ref|ZP_01092343.1| hypothetical protein DSM3645_14105 [Blastopirellula marina DSM
           3645]
 gi|87287201|gb|EAQ79103.1| hypothetical protein DSM3645_14105 [Blastopirellula marina DSM
           3645]
          Length = 637

 Score =  436 bits (1120), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 253/629 (40%), Positives = 352/629 (55%), Gaps = 56/629 (8%)

Query: 83  VVAMAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIF 142
            +A  + +    +  +    N LA E SPYLL HAHNPVDW  WGEEA A A++ + PIF
Sbjct: 6   TLAACQSSAEEPAAGKQHPANHLAGETSPYLLAHAHNPVDWRPWGEEALALAKQENKPIF 65

Query: 143 LSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGG 202
           LSIGYS+CHWCHVME ESF DE +AK LN+ F+ IKVDREERPD+D VYMT VQ +  GG
Sbjct: 66  LSIGYSSCHWCHVMEHESFTDEEIAKFLNEHFICIKVDREERPDIDHVYMTAVQIMTRGG 125

Query: 203 GWPLSVFLSPDLKPLMGGTYFPPE--DKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAI 260
           GWPLSVFL+P+ KP  GGTY+P    D+  + GF T++ +V   W++K   L +SG    
Sbjct: 126 GWPLSVFLTPEGKPFYGGTYWPARDGDRDAQVGFLTVIDRVAQFWEEKEADLRKSGDGLS 185

Query: 261 EQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFG------SAPKFPRPVE 314
           + + EAL    +    P  L +  L      +++++D+  GGF       + PKFP P  
Sbjct: 186 DLVKEALRPRVTLQ--PLTLDEQLLATADAAIAETFDAEHGGFNFSADDPNQPKFPEPAT 243

Query: 315 IQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPH 374
           +Q +L  +       +SG A E QKM+  TL  +A GGI DH+GGG HRYSVD  W +PH
Sbjct: 244 LQYLLARA-------RSGSA-EAQKMLTTTLDGIAAGGIRDHIGGGLHRYSVDRFWRIPH 295

Query: 375 FEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET 434
           FEKMLYD  QLA++Y +A+ LT +  Y  +  +  D++ R+M GP G+ +SA DADS   
Sbjct: 296 FEKMLYDNAQLASLYAEAYQLTGNPQYRRVAAETCDFVLREMTGPDGQFYSAIDADS--- 352

Query: 435 EGATRKKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 493
           EG    +EG +Y W+  E+  IL    + L K  Y L  + N            F+    
Sbjct: 353 EG----EEGKYYRWSQAELTAILSPAQLELAKSVYGLGGSPN------------FEEVYF 396

Query: 494 LIELNDSSASASK-LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISS 552
           + EL    A   + L +  ++    L   R  L   R+KR  P +D K + +WNGL+I+ 
Sbjct: 397 VPELQAPIAELPQNLKLDADQLQTRLQTLRETLLAARAKRTPPAIDTKALTAWNGLMIAG 456

Query: 553 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP 612
            A A +IL+                R++Y++ A  +A FI  ++      RL  SF++G 
Sbjct: 457 LADAGRILQ----------------RQDYLDAAARSADFILANVTSADG-RLLRSFKDGQ 499

Query: 613 SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDP 672
           +K   ++DDYA L+ GL+ L+E     KWL  A  L   Q ELF D   GG++ T  +  
Sbjct: 500 AKITAYVDDYAMLVDGLIALHEATGEPKWLDAAERLTKQQIELFGDPRLGGFYFTAADAE 559

Query: 673 SVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
            V++R K   D A P+GNSV+  NL+ LA
Sbjct: 560 EVIVRGKIATDNAIPAGNSVAAGNLLYLA 588


>gi|119495483|ref|XP_001264525.1| hypothetical protein NFIA_013170 [Neosartorya fischeri NRRL 181]
 gi|119412687|gb|EAW22628.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
          Length = 805

 Score =  436 bits (1120), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 269/680 (39%), Positives = 369/680 (54%), Gaps = 52/680 (7%)

Query: 83  VVAMAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIF 142
             AM  +T   ++    K  NRL    SPY+  H +NPV W  W  EA   AR+ +  IF
Sbjct: 4   TAAMHPQTHLGSADHEPKLVNRLRDSRSPYVRAHMNNPVAWQLWDAEAIELARRYNRLIF 63

Query: 143 LSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGG 202
           LSIGYS CHWCHVME ESF  + VA LLN+ F+ IKVDREERPD+D VYM YVQA  G G
Sbjct: 64  LSIGYSACHWCHVMEKESFMSQEVASLLNESFIPIKVDREERPDIDDVYMNYVQATTGSG 123

Query: 203 GWPLSVFLSPDLKPLMGGTYFPPED-----KYGRPGFKTILRKVKDAWDKKRDMLAQSGA 257
           GWPLSVFL+P+L+P+ GGTY+P  +     +    GF  IL K++D W  ++     S  
Sbjct: 124 GWPLSVFLTPNLEPVFGGTYWPGPNSSTLSRQDTVGFVDILEKLRDVWKTQQQRCLDSAK 183

Query: 258 FAIEQL---SEALSASASSNKLPDE-LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPV 313
               QL   +E  + S   ++  DE L    L    +  +  YD+  GGF  APKFP P 
Sbjct: 184 EITRQLREFAEEGTHSQQGDRQTDEDLDIELLEEAYQHFASRYDTVNGGFSRAPKFPTPA 243

Query: 314 EIQMML---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERW 370
            +  +L    +   + D     E  +   M + TL  MA+GGI DH+G GF RYSV   W
Sbjct: 244 NLSFLLRLKTYPSAVSDIVGQEECDKAAAMAVSTLISMARGGIRDHIGHGFARYSVTADW 303

Query: 371 HVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDA 429
            +PHFEKMLYDQ QL +VY+DAF +T +        D+  YL    I  P G   S+EDA
Sbjct: 304 SLPHFEKMLYDQAQLLDVYVDAFKITHNPELLGAVYDLATYLTTAPIQSPVGAFHSSEDA 363

Query: 430 DSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEF 488
           DS  T   T K+EGAFYVWT KE+  +LG+  A +   H+ + P GN  ++   DPH+EF
Sbjct: 364 DSLPTPNDTEKREGAFYVWTLKELTQVLGQRDAGVCARHWGVLPDGN--IAPEHDPHDEF 421

Query: 489 KGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNG 547
             +NVL      S  A + G+  E+ + I+   ++KL + R + R RP LDDKVIV+WNG
Sbjct: 422 MNQNVLSIKVTPSKLAREFGLSEEEVVKIIKSAKQKLREYRETTRVRPDLDDKVIVAWNG 481

Query: 548 LVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS 607
           L I + A+ S + + E ES         S   +  E A  A +FI+ +L+++ T +L   
Sbjct: 482 LAIGALAKCSALFE-EIES---------SKAVQCREAAARAINFIKENLFEKATGQLWRI 531

Query: 608 FRNGP-SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNT--------------- 651
           +R+G   + PGF DDYA+LI GLLD+YE      +L +A +LQ+                
Sbjct: 532 YRDGSRGETPGFADDYAYLIHGLLDMYEATYDDSYLQFAEQLQSMFHDRGSFGRTILTHA 591

Query: 652 --QDELFLDREG---GGYFNT----TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLAS 702
              ++ FL   G    GY++T    T   P  LLR+K   + A PS N V   NL+RL++
Sbjct: 592 EYLNDNFLAYVGSTPAGYYSTPSTMTPGMPGPLLRLKTGTESATPSINGVIARNLLRLSA 651

Query: 703 IVAGSKSDYYRQNAEHSLAV 722
           ++   +     +   HS +V
Sbjct: 652 LLEEEEYRTLARQTCHSFSV 671


>gi|258569036|ref|XP_002585262.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237906708|gb|EEP81109.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 818

 Score =  436 bits (1120), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 267/673 (39%), Positives = 367/673 (54%), Gaps = 54/673 (8%)

Query: 86  MAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSI 145
           MA   PAS+     +  NRL+   SPY+  H +NPV W  W   A   A++ +  IFLSI
Sbjct: 1   MAAEPPASS-----QLVNRLSESRSPYVRGHMNNPVAWQLWDSAAIDLAKRLNRLIFLSI 55

Query: 146 GYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWP 205
           GYS CHWCHVME ESF  + VA +LN  F+ IK+DREERPD+D+VYM YVQA  G GGWP
Sbjct: 56  GYSACHWCHVMEKESFMSQEVAAILNKSFIPIKLDREERPDIDEVYMNYVQATTGSGGWP 115

Query: 206 LSVFLSPDLKPLMGGTYFPPEDKYGRP--------GFKTILRKVKDAWDKKRDMLAQSGA 257
           L+VFL+PDL+P+ GGTY+P       P         F  IL K++D W+ ++    +S  
Sbjct: 116 LNVFLTPDLEPVFGGTYWPGPHSSSVPRLGGEEPITFVDILEKLRDVWNSQQLRCMESAK 175

Query: 258 FAIEQLSEALSASASSNKLPDELPQNALRLCA-----EQLSKSYDSRFGGFGSAPKFPRP 312
               QL E  +   +  + PD   +  L +       +     YD   GGF  APKFP P
Sbjct: 176 EITRQLRE-FAEEGTHLRRPDSEGEEDLEVELLEEAYQHFVSRYDPVNGGFSRAPKFPTP 234

Query: 313 VEIQMML----YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDE 368
             +  +L    Y    ++  G+  E +   +MV  TL  M +GGIHD +G GF RYSV  
Sbjct: 235 ANLSFLLRLGRYPGAVMDIVGQE-ECARATEMVSKTLLQMVRGGIHDQIGHGFARYSVTA 293

Query: 369 RWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAE 427
            W +PHFEKMLYDQ QL +VY+D F  T+D        DI+ Y+    M+ P G   S+E
Sbjct: 294 DWSLPHFEKMLYDQAQLLDVYVDCFEATQDPELLGAVYDIVAYMTSPPMLSPEGAFHSSE 353

Query: 428 DADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHN 486
           DADS  T   T K+EGAFYVWT KE++ ILG+  A +   H+ + P GN  ++R  DPH+
Sbjct: 354 DADSLPTPKDTEKREGAFYVWTLKEMQQILGQRDAEVCARHWGVLPDGN--VARGYDPHD 411

Query: 487 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSW 545
           EF  +NVL         A  LG+  ++ + I+   R+KL + R ++R RP LDDKVIVSW
Sbjct: 412 EFINQNVLSIKATPRHIAKDLGLSEDEVVRIIKSSRKKLQEFRDTQRVRPDLDDKVIVSW 471

Query: 546 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYM-EVAESAASFIRRHLYDEQTHRL 604
           NGL I + A+ S +L             +  D+ E+    A +AA+FI+  L+D  T +L
Sbjct: 472 NGLAIGALAKCSVLLDR-----------IDPDKAEHCRRSAATAAAFIKEKLFDADTGQL 520

Query: 605 QHSFRNGP-SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG-- 661
              +R+G   + PGF DDYA+L +GL+ LYE      +L +A +LQ   +  FL      
Sbjct: 521 WRVYRDGVRGETPGFGDDYAYLTAGLIQLYEATFDDSYLRFAEQLQKYMNTHFLAMAADG 580

Query: 662 ---GGYF----NTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQ 714
               GY+    N  G+ P  L R+K   D A PS N V   NLVRL S++   + + Y  
Sbjct: 581 STPAGYYMTQENMPGDVPGPLFRLKTGTDAATPSTNGVIAQNLVRLGSLL---EDESYSV 637

Query: 715 NAEHSLAVFETRL 727
            A+ + + F   +
Sbjct: 638 LAKQTCSAFAAEI 650


>gi|317030461|ref|XP_001392621.2| hypothetical protein ANI_1_728074 [Aspergillus niger CBS 513.88]
          Length = 791

 Score =  436 bits (1120), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 260/628 (41%), Positives = 351/628 (55%), Gaps = 35/628 (5%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K  NRL    SPY+  H +NPV W  W  EA   A++ +  IFLSIGYS CHWCHVME E
Sbjct: 25  KLVNRLHESRSPYVRAHMNNPVGWQLWDAEAIDLAKRHNRLIFLSIGYSACHWCHVMEKE 84

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SF  + VA +LN  F+ IKVDREERPD+D VYM YVQA  G GGWPL+VFL+PDL+P+ G
Sbjct: 85  SFMSQEVASILNQSFIPIKVDREERPDIDDVYMNYVQATTGSGGWPLNVFLTPDLEPVFG 144

Query: 220 GTYFPPEDKY-----GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASS- 273
           GTY+P  +       G  GF  IL K+ D W  ++    +S     +QL E       S 
Sbjct: 145 GTYWPGPNSSTLTGNGTIGFVEILEKLSDVWQTQQLRCRESAKEITKQLREFAEEGTHSY 204

Query: 274 ---NKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---YHSKKLED 327
               +  ++L    L    +     YD   GGF +APKFP P  +  +L    +   + D
Sbjct: 205 QGDRQADEDLDLELLEEAYQHFVSRYDPLHGGFSTAPKFPTPSNLSFLLRLGIYPTAVAD 264

Query: 328 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 387
                E ++   M + TL  MA+GGI DH+G GF RYSV   W +PHFEKMLYDQ QL +
Sbjct: 265 IVGRDECAKATAMAVDTLISMARGGIRDHIGHGFARYSVTGDWGLPHFEKMLYDQAQLLD 324

Query: 388 VYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSAETEGATRKKEGAFY 446
           VY+DAF +T +        D+  YL    I  P G   S+EDADS  T   T K+EGAFY
Sbjct: 325 VYVDAFKITHNPELLGAVYDLATYLTTAPIQSPTGAFHSSEDADSLPTPNDTEKREGAFY 384

Query: 447 VWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 505
           VWT KE+  +LG+  A +   H+ + P GN  ++  +DPH+EF  +NVL      S  A 
Sbjct: 385 VWTLKELTQVLGQRDAGVCARHWGVLPDGN--IAPENDPHDEFMNQNVLSVKVTPSRLAK 442

Query: 506 KLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEA 564
             G+  E+ + I+   ++KL D R + R RP LDDK+IV+WNGL I + A+ S + + E 
Sbjct: 443 DFGLGEEEVVRIIRAAKQKLRDYRERTRVRPDLDDKIIVAWNGLAIGALAKCSALFE-EI 501

Query: 565 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP-SKAPGFLDDYA 623
           ES         S   +  E A  A +FI+ +L+++ T +L   +R+G     PGF DDYA
Sbjct: 502 ES---------SKAVQCREAAAKAINFIKENLFEKPTGQLWRIYRDGGRGNTPGFADDYA 552

Query: 624 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG---GGYFNT----TGEDPSVLL 676
           +LI GLLD+YE      +L +A +LQ   ++ FL   G    GY++T    T   P  LL
Sbjct: 553 YLIGGLLDMYEATFDDSYLQFAEQLQKYLNDNFLAYVGTTPAGYYSTPSTMTSGAPGPLL 612

Query: 677 RVKEDHDGAEPSGNSVSVINLVRLASIV 704
           R+K   + A P+ N V   NL+RL S++
Sbjct: 613 RLKTGTESATPAVNGVIARNLLRLGSLL 640


>gi|159131360|gb|EDP56473.1| DUF255 domain protein [Aspergillus fumigatus A1163]
          Length = 799

 Score =  436 bits (1120), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 268/682 (39%), Positives = 364/682 (53%), Gaps = 55/682 (8%)

Query: 86  MAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSI 145
           M  +T   ++    K  NRL    SPY+  H +NPV W  W  EA   AR+ +  IFLSI
Sbjct: 1   MHSQTHLGSADHEPKLVNRLRDSRSPYVRAHMNNPVAWQLWDAEAIELARRYNRLIFLSI 60

Query: 146 GYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWP 205
           GYS CHWCHVME ESF  + VA LLN+ F+ IKVDREERPD+D VYM YVQA  G GGWP
Sbjct: 61  GYSACHWCHVMEKESFMSQEVASLLNESFIPIKVDREERPDIDDVYMNYVQATTGSGGWP 120

Query: 206 LSVFLSPDLKPLMGGTYFPPED-----KYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAI 260
           LSVFL+P+L P+ GGTY+P  +     +    GF  IL K++D W  ++     S     
Sbjct: 121 LSVFLTPNLDPVFGGTYWPGPNSSTLSRQDTVGFVDILEKLRDVWKTQQQRCLDSAKEIT 180

Query: 261 EQL----SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQ 316
            QL     E   +     +  ++L    L    +  +  YD+  GGF  APKFP P  + 
Sbjct: 181 RQLREFAEEGTHSQQGDRQAGEDLDIELLEEAYQHFASRYDTVNGGFSRAPKFPTPANLS 240

Query: 317 MML---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVP 373
            +L    +   + D     E      M + TL  MA+GGI DH+G GF RYSV   W +P
Sbjct: 241 FLLRLKTYPSAVSDIVGQEECDRAAAMAVSTLISMARGGIRDHIGHGFARYSVTADWSLP 300

Query: 374 HFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSA 432
           HFEKMLYDQ QL +VY+DAF +T +        D+  YL    I  P G   S+EDADS 
Sbjct: 301 HFEKMLYDQAQLLDVYVDAFKITHNPELLGAVYDLATYLTTAPIQSPVGAFHSSEDADSL 360

Query: 433 ETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGK 491
            T   T K+EGAFYVWT KE+  +LG+  A +   H+ + P GN  ++   DPH+EF  +
Sbjct: 361 PTPNDTEKREGAFYVWTLKELTQVLGQRDAGVCARHWGVLPDGN--IAPEHDPHDEFMNQ 418

Query: 492 NVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVI 550
           NVL      S  A + G+  E+ + I+   ++KL + R K R RP LDDKVIV+WNGL I
Sbjct: 419 NVLSIKVTPSKLAREFGLSEEEVVKIIKSAKQKLREYREKTRVRPDLDDKVIVAWNGLAI 478

Query: 551 SSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRN 610
            + A+ S + + E ES         S   +  E A  A +FI+ +L+++ T +L   +R+
Sbjct: 479 GALAKCSALFE-EIES---------SKAVQCREAAARAINFIKENLFEKATGQLWRIYRD 528

Query: 611 GP-SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQN-------------TQDELF 656
           G   + PGF DDYA+LI GLLD+YE      +L +A +LQ+             TQ E  
Sbjct: 529 GSRGETPGFADDYAYLIHGLLDMYEATYDDSYLQFAEQLQSMFHDRGSFGRTILTQAEYL 588

Query: 657 LDR-------EGGGYFNT----TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVA 705
            D           GY++T    T   P  LLR+K   + A PS N V   NL+RL++++ 
Sbjct: 589 NDNFLAYVGSTPAGYYSTPSTMTPGMPGPLLRLKTGTESATPSINGVIARNLLRLSALL- 647

Query: 706 GSKSDYYRQNAEHSLAVFETRL 727
             + + YR  A  +   F   +
Sbjct: 648 --EEEEYRTLARQTCLSFSVEI 667


>gi|397690129|ref|YP_006527383.1| Thioredoxin domain protein [Melioribacter roseus P3M]
 gi|395811621|gb|AFN74370.1| Thioredoxin domain protein [Melioribacter roseus P3M]
          Length = 690

 Score =  435 bits (1119), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 261/667 (39%), Positives = 359/667 (53%), Gaps = 61/667 (9%)

Query: 98  RNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVME 157
           R    NRL  E SPYL QH++NPVDW  W +EAF  AR+ D P+FLSIGYSTCHWCHVM 
Sbjct: 16  RTYKINRLTNEKSPYLKQHSNNPVDWHPWCDEAFRIARREDKPVFLSIGYSTCHWCHVMA 75

Query: 158 VESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPL 217
            ESFEDE VA+LLN  F+SIKVDREERPD+D +YM   Q + G GGWPLS+FL+PD KP 
Sbjct: 76  HESFEDEEVAELLNKNFISIKVDREERPDIDSIYMASCQLITGRGGWPLSIFLTPDGKPF 135

Query: 218 MGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLP 277
             GTYFP    YGR GF  +L ++ D W+K R++L ++       +++   +SA      
Sbjct: 136 YAGTYFPKYSYYGRIGFVDLLNRIIDLWNKDRNVLLRTSDEITAAINKHFESSAKE-AFD 194

Query: 278 DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEG 337
           D +   A     E L  ++D  +GGFGSAPKFP P  +  +L  +    D          
Sbjct: 195 DSVVDKAF----ETLKLNFDPEYGGFGSAPKFPSPHNLLFLLDRNNPQAD---------- 240

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
            +MV  TL  M KGGI D +G GFHRYS D +W +PHFEKM+YDQ  L   Y  AF+ T 
Sbjct: 241 -EMVQKTLTEMRKGGIFDQLGFGFHRYSTDGKWFLPHFEKMIYDQASLIEAYAYAFAKTG 299

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           D  Y+    +I ++++ +M    G  +SA DADS   EG    +EG FY+WTS+E+  + 
Sbjct: 300 DALYADTINEIYEFIKNEMTSHEGAFYSALDADS---EG----EEGKFYLWTSEEIRSVA 352

Query: 458 GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
           G+   + KE +     GN      ++ +    GKN+L           K G    KY +I
Sbjct: 353 GDDYEIAKEIFNFTDEGN----HRNESNGNSTGKNILFLRKRPDKLYEKYGRS--KYDSI 406

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
               R  L + R KR  P  D+K++  WN +VISS A A  I++++   A          
Sbjct: 407 ----RINLLEARKKRIPPMRDEKILTDWNAMVISSLANAGSIIENDDMVAW--------- 453

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGS 637
                  AE A   + +H +      L H   N  +   GFLDDYA+LI   LDLY    
Sbjct: 454 -------AERAYQCLMKHAF--VNGELYHYPENNIT---GFLDDYAYLIKAALDLYRATL 501

Query: 638 GTKWLVWAIELQNTQDELFLDR-EGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
             ++L  A+EL +   E F D+ EGG +FN  G +    +RVK+ +DGA PSGNS+ + N
Sbjct: 502 NEEYLFNALELNDLLSENFEDKSEGGYFFNKAGANT---IRVKDAYDGAVPSGNSIQLSN 558

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVG 756
           L+ L   + G+ S  YR +AE+S+  F + L   ++           L       +++ G
Sbjct: 559 LIELY-FITGNNS--YRLSAENSIKTFSSGLNKSSIGYTYFLRGIKKLYSKDTSLLLIAG 615

Query: 757 HKSSVDF 763
            K+  +F
Sbjct: 616 KKTGREF 622


>gi|448359615|ref|ZP_21548265.1| hypothetical protein C482_16798 [Natrialba chahannaoensis JCM
           10990]
 gi|445642250|gb|ELY95319.1| hypothetical protein C482_16798 [Natrialba chahannaoensis JCM
           10990]
          Length = 811

 Score =  435 bits (1119), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 248/649 (38%), Positives = 350/649 (53%), Gaps = 43/649 (6%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W E+A   AR+ DVPIFLSIGYS CHWCHVME ESF 
Sbjct: 10  NRLDEEESPYLRQHADNPVNWQPWDEQALETAREHDVPIFLSIGYSACHWCHVMEDESFA 69

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA+ LN+ FV IKVDREERPDVD +YMT  Q + G GGWPLS +L+P+ KP   GTY
Sbjct: 70  DEQVAEALNENFVPIKVDREERPDVDSIYMTVCQLVTGRGGWPLSAWLTPEGKPFYVGTY 129

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDML---AQSGAFAIEQLSEALSASASSNKLPDE 279
           FP   K G+PGF  IL  V ++W++ RD +   A+    A +   E    + S+++ P  
Sbjct: 130 FPKNAKRGQPGFLDILENVTNSWERDRDEVENRAEQWTNAAKDRLEETPDTVSASQPPS- 188

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
              + L   A    +S D +FGGFGS  PKFP+P  ++++   + +        E  + Q
Sbjct: 189 --SDVLDAAANASFRSADRQFGGFGSDGPKFPQPSRLRVLARAADRT-------EREDFQ 239

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
            +++ TL  MA GG++DHVGGGFHRY VD  W VPHFEKMLYD   +   +L  +  T D
Sbjct: 240 DVLVETLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKMLYDNAAIPRAFLIGYQQTGD 299

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             Y+ +  + L ++ R++    G  FS  DA S + +   R +EG FYVWT  E+ D+L 
Sbjct: 300 ERYAEVVAETLAFVERELTHEEGGFFSTLDAQSEDPDTGER-EEGTFYVWTPDEIHDVLE 358

Query: 459 EH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
               A LF + Y +  +GN            F+G N    +   S  A++  +      +
Sbjct: 359 NETTADLFCDRYDITESGN------------FEGSNQPNRVRSVSDLAAEYDLEAPDVQD 406

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
            L   R +LF  R +RPRP+ D+KV+  WNGL+I++ A A+ +L              G 
Sbjct: 407 RLESAREELFAAREQRPRPNRDEKVLAGWNGLMIATCAEAALVLGG------------GE 454

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
           D  EY  +A  A  F+R  L+DE   RL   +++G     G+L+DYAFL    L  YE  
Sbjct: 455 DGDEYATMAVDALEFVRDRLWDEDEQRLSRRYKDGDVAIDGYLEDYAFLARAALGCYEAT 514

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
                L +A++L    ++ F D + G  + T     S++ R +E  D + PS   V+V  
Sbjct: 515 GEVDHLAFALDLARVIEDEFWDADRGTLYFTPESGESLVTRPQELGDQSTPSAAGVAVET 574

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 745
           L+ L       + D + + A   L     R++  ++    +C AAD L+
Sbjct: 575 LLALEGFA--DQGDEFEEIATTVLETHANRIETNSLEHATLCLAADRLA 621


>gi|405355793|ref|ZP_11024905.1| Thymidylate kinase [Chondromyces apiculatus DSM 436]
 gi|397091065|gb|EJJ21892.1| Thymidylate kinase [Myxococcus sp. (contaminant ex DSM 436)]
          Length = 696

 Score =  435 bits (1119), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 261/661 (39%), Positives = 353/661 (53%), Gaps = 50/661 (7%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           +NRLA E SPYL QHAHNPVDWF WGEEA A A+  + PI LS+GYS CHWCHVM  ESF
Sbjct: 11  SNRLAREPSPYLRQHAHNPVDWFPWGEEALARAKAENKPILLSVGYSACHWCHVMAHESF 70

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           E    A+L+N+ F++IKVDREERPD+D++Y   VQ +  GGGWPL+VFL+PDLKP  GGT
Sbjct: 71  ESPDTARLMNEGFINIKVDREERPDLDQIYQGVVQLMGQGGGWPLTVFLTPDLKPFYGGT 130

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFPP+DKYGRPGF  +L  ++DAW+ K+D + +  A   E L E   AS      P  L 
Sbjct: 131 YFPPQDKYGRPGFPRLLMALRDAWENKQDEVQRQSAQFEEGLGEL--ASYGLEAAPAVLT 188

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
              +    + ++K  D+  GGFG APKFP P+   +ML   ++       G  +  +  V
Sbjct: 189 VADVVAMGQGMAKQVDAVNGGFGGAPKFPNPMNFALMLRAWRR-------GGGAALKDAV 241

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             TL+ MA+GGI+D +GGGFHRYSVDERW VPHFEKMLYD  QL ++Y  A  +     +
Sbjct: 242 FLTLERMARGGIYDQLGGGFHRYSVDERWLVPHFEKMLYDNAQLLHLYAQAQQVEPRPLW 301

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE-H 460
             +  + ++Y+RR+M   GG  ++A+DADS   EG    +EG F+VW  +EV   L E  
Sbjct: 302 RKVVEETVEYVRREMTDAGGGFYAAQDADS---EG----EEGKFFVWKPEEVRAALPEAQ 354

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
           A L   H+ +KP GN +            G  VL  +    A A + G   +   + L  
Sbjct: 355 AELVLRHFGIKPGGNFE-----------HGATVLEVVVPVDALAKERGGAEDVVASELAA 403

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R+ LF  R +R +P  DDK +  WNGL+I   A AS++                 DR E
Sbjct: 404 ARKTLFAAREQRVKPGRDDKQLSGWNGLMIRGLALASRVF----------------DRPE 447

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           +   A  AA F+    +D    RL  S++ G ++  GFL+DY  L SGL  LY+     K
Sbjct: 448 WARWAADAADFVLEKAWD--GTRLARSYQEGQARIDGFLEDYGNLASGLTALYQATFDVK 505

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
           +L  A  L     +LF D E   Y         +++      D A PSG S      V L
Sbjct: 506 YLEAADALVRRAVDLFWDAEKAAYLTAPRGQKDLVVATYGLFDNAFPSGASTLTEAQVEL 565

Query: 701 ASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSS 760
           A++    +   + +  E  ++     L    M    +  AAD L +     V L G +  
Sbjct: 566 AALTGDKR---HLELPERYVSRMHDGLVRNPMGYGYLGLAADAL-LEGAAAVTLAGSRED 621

Query: 761 V 761
           V
Sbjct: 622 V 622


>gi|238498046|ref|XP_002380258.1| DUF255 domain protein [Aspergillus flavus NRRL3357]
 gi|317141806|ref|XP_003189401.1| hypothetical protein AOR_1_504164 [Aspergillus oryzae RIB40]
 gi|220693532|gb|EED49877.1| DUF255 domain protein [Aspergillus flavus NRRL3357]
          Length = 787

 Score =  435 bits (1118), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 257/616 (41%), Positives = 345/616 (56%), Gaps = 35/616 (5%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K  NRL    SPY+  H +NPV W  W  EA   AR+ +  +FLSIGYS CHWCHVME E
Sbjct: 21  KLVNRLRDSRSPYVRAHMNNPVAWQLWDAEAINLARRYNRLVFLSIGYSACHWCHVMEKE 80

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SF    VA +LN+ F+ IKVDREERPD+D +YM YVQA  G GGWPL+VFL+PDL+P+ G
Sbjct: 81  SFMSPEVATILNESFIPIKVDREERPDIDDIYMNYVQATTGSGGWPLNVFLTPDLEPVFG 140

Query: 220 GTYFPPEDKYG-----RPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL---SEALSASA 271
           GTY+P  +          GF  IL K+++ W  ++     S     +QL   +E  + S 
Sbjct: 141 GTYWPGPNSSTLLGNETIGFVDILEKLREVWQTQQQRCLDSAKEITKQLREFAEEGTHSY 200

Query: 272 SSNKLPDE-LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---YHSKKLED 327
             +K  DE L    L    +     YDS  GGF  APKFP P  +  +L    +   + D
Sbjct: 201 QGDKEADEDLDIELLEEAYQHFVSRYDSVHGGFSRAPKFPTPANLSFLLRLGAYPNAVSD 260

Query: 328 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 387
                E  +   M + TL  MA+GGI DH+G GF RYSV   W +PHFEKMLYDQ QL +
Sbjct: 261 IVGREECEKATAMAVHTLISMARGGIRDHIGHGFARYSVTADWSLPHFEKMLYDQAQLLD 320

Query: 388 VYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSAETEGATRKKEGAFY 446
           VY+DAF +T +        D+  YL    I  P G   S+EDADS  +   T K+EGAFY
Sbjct: 321 VYVDAFKITHNPELLGAVYDLATYLTTAPIQSPTGAFHSSEDADSLPSPKDTEKREGAFY 380

Query: 447 VWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 505
           VWT KE+  +LG+  A +   H+ + P GN  +S  +DPH+EF  +NVL      S  A 
Sbjct: 381 VWTLKELTQVLGQRDAGVCARHWGVHPDGN--ISPENDPHDEFMNQNVLSVKVTPSKLAR 438

Query: 506 KLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEA 564
           + G+  E+ + I+   +++L + R + R RP LDDK+IV+WNGLVI + A+ S + +   
Sbjct: 439 EFGLGEEEVVRIIRSAKQRLREYRERTRVRPDLDDKIIVAWNGLVIGALAKCSALFER-- 496

Query: 565 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP-SKAPGFLDDYA 623
                   +  S   +  E A  A SFI+ +L+D+ T +L   +R+G     PGF DDYA
Sbjct: 497 --------IESSKAVQCREAAAKAISFIKNNLFDKATGQLWRIYRDGGRGDTPGFADDYA 548

Query: 624 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG---GGYF----NTTGEDPSVLL 676
           +LISGLLD+YE      +L +A +LQ   +E FL   G    GY+    N T + P  LL
Sbjct: 549 YLISGLLDMYEATFDDSYLQFAEQLQKYLNENFLAYVGSTPAGYYSTPSNMTSDMPGPLL 608

Query: 677 RVKEDHDGAEPSGNSV 692
           R+K   + A PS N V
Sbjct: 609 RLKTGTESATPSVNGV 624


>gi|256419531|ref|YP_003120184.1| hypothetical protein Cpin_0485 [Chitinophaga pinensis DSM 2588]
 gi|256034439|gb|ACU57983.1| protein of unknown function DUF255 [Chitinophaga pinensis DSM 2588]
          Length = 680

 Score =  435 bits (1118), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 247/610 (40%), Positives = 336/610 (55%), Gaps = 55/610 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYLLQHAHNPVDW+ WGEEA   A+  D PI +SIGY+ CHWCHVME ESFE
Sbjct: 2   NRLAKETSPYLLQHAHNPVDWYPWGEEALQRAKTEDKPILVSIGYAACHWCHVMERESFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
            E  A+++N+ F++IK+DREERPD+D +YM  VQA+ G GGWPL+VFL+PD  P  GGTY
Sbjct: 62  HEETARIMNEHFINIKIDREERPDLDHIYMDAVQAMTGSGGWPLNVFLTPDKLPFYGGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLP--DEL 280
           FPP   + RP +  +L  +  A+ ++R+ L        + L   + AS  S K P  D +
Sbjct: 122 FPPVKAFNRPSWTDVLLALSQAFKERREDLETQAQNMRDHL---VQASGFSGKAPGQDLV 178

Query: 281 PQNALRLCAE------QLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHSKKLEDTGKSGE 333
           P   L   A+       + +  D  +GGFGSAPKFP    IQ +L YH         S  
Sbjct: 179 PHEELFTKAQCETIFNNMMQQGDKVWGGFGSAPKFPGTFIIQYLLRYH--------HSFN 230

Query: 334 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
             +  +  L +L  M +GGI+D +GGGF RYS D +W  PHFEKMLYD   L +V  +A+
Sbjct: 231 EPKALEQALLSLDKMIRGGIYDQLGGGFARYSTDAKWLAPHFEKMLYDNALLVDVLSEAY 290

Query: 394 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
            LT +  Y+    D L ++ R+M   GG  +SA DADS   EG     EG FY W+ +E+
Sbjct: 291 QLTGNELYARTIADTLGFVAREMTDAGGGFYSALDADS---EGV----EGKFYTWSKEEI 343

Query: 454 EDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
           E ILG  A LF   Y +   GN            ++  N+L     ++  A++ G+  E 
Sbjct: 344 EHILGTDAALFCAFYDVTEEGN------------WEETNILWVTKPAAVFAAEQGITEEA 391

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
               L   R KL  VR+KR RP LDDK+I+ WN L+I +  +A              +  
Sbjct: 392 LERSLAISREKLMAVRAKRIRPGLDDKIILGWNALMIHACCKA--------------YAA 437

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
           +G +R  Y E+  +A  F   HL +       H+F+ G +K P FLDDYA+++  L+ L 
Sbjct: 438 LGIER--YREMGVNAMKFCLEHLQNTDKQSFFHTFKGGVAKYPAFLDDYAWMVRALIALQ 495

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           E     +WL  A EL       F D  G  ++ T      V++R KE +DGA PSGN+V 
Sbjct: 496 EVSGEPEWLSKAKELTEYVVNNFSDEGGIYFYYTEAGQTDVIVRKKEVYDGATPSGNAVM 555

Query: 694 VINLVRLASI 703
             NL+ L+ +
Sbjct: 556 AANLLYLSVV 565


>gi|455791360|gb|EMF43176.1| PF03190 family protein [Leptospira interrogans serovar Lora str. TE
           1992]
          Length = 691

 Score =  435 bits (1118), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 271/689 (39%), Positives = 378/689 (54%), Gaps = 69/689 (10%)

Query: 95  SHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCH 154
           S+SRN   NRL+ E SPYL QH++NPVDWF WG EA  +A+ +D  IFLSIGY+TCHWCH
Sbjct: 3   SNSRN--PNRLSKEKSPYLQQHSYNPVDWFPWGAEALTKAKDQDKLIFLSIGYATCHWCH 60

Query: 155 VMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDL 214
           VME ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++FL+P+ 
Sbjct: 61  VMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNMFLTPEG 120

Query: 215 KPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSN 274
           +P+ GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  + A   
Sbjct: 121 QPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGESRAKEK 180

Query: 275 KLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKKLEDTGK 330
           +  D  P+N            YDS+FGGF +    KFP  + +  +L  YHS        
Sbjct: 181 QEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-------- 232

Query: 331 SGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYL 390
           SG  +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD      +  
Sbjct: 233 SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSLFLEILA 291

Query: 391 DAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTS 450
           +   ++K +       DI+ YL RDM    G IFSAEDADS   EG    +EG FY+W  
Sbjct: 292 EYSLVSKKISAESFALDIVSYLHRDMRMDEGGIFSAEDADS---EG----EEGLFYIWDL 344

Query: 451 KEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 510
           +E  ++ GE + L ++ + +   GN            F+GKN+L E    S    +    
Sbjct: 345 EEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFTEEELKQ 392

Query: 511 LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
           L+K    L + + KL + RSKR RP  DDK++ SWNGL I +  +               
Sbjct: 393 LDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------------- 436

Query: 571 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLL 630
              +   R++++++AE   SFI ++L D    R+   FR G S   G+ +DYA +I+  +
Sbjct: 437 ---IAFQREDFLKLAEETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAEMIASSI 492

Query: 631 DLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSG 689
            L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +DG EPS 
Sbjct: 493 VLFEAGRGVRYLQNAVLWMEEAISLF--RSPAGVFFDTGIDGEVLLRRSVDGYDGVEPSA 550

Query: 690 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR 749
           NS    +LVRL+ +  G  SDYYR+ AE     F   L   A++ P +  A       S 
Sbjct: 551 NSSLAHSLVRLSFL--GVNSDYYREIAESIFLYFRKELYSYALSYPFLLSA-----YWSY 603

Query: 750 KH----VVLVGHKSSVDFENMLAAAHASY 774
           KH    +VL+  K+S + ++MLA   + +
Sbjct: 604 KHHFREIVLI-RKNSEEGKDMLAWIQSRF 631


>gi|289582639|ref|YP_003481105.1| hypothetical protein Nmag_2991 [Natrialba magadii ATCC 43099]
 gi|448281932|ref|ZP_21473225.1| hypothetical protein C500_05433 [Natrialba magadii ATCC 43099]
 gi|289532192|gb|ADD06543.1| protein of unknown function DUF255 [Natrialba magadii ATCC 43099]
 gi|445577561|gb|ELY31994.1| hypothetical protein C500_05433 [Natrialba magadii ATCC 43099]
          Length = 722

 Score =  435 bits (1118), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 253/648 (39%), Positives = 351/648 (54%), Gaps = 43/648 (6%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W E+A   AR+ DVPIFLSIGYS CHWCHVME ESF 
Sbjct: 10  NRLDEEESPYLRQHADNPVNWQPWDEQALETAREHDVPIFLSIGYSACHWCHVMEDESFA 69

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA++LN+ FV IKVDREERPDVD +YMT  Q + G GGWPLS +L+P+ KP   GTY
Sbjct: 70  DEQVAEVLNENFVPIKVDREERPDVDSIYMTVCQLVTGRGGWPLSAWLTPEGKPFYVGTY 129

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDML---AQSGAFAIEQLSEALSASASSNKLPDE 279
           FP   K G+PGF  IL  V ++W+  RD +   A+    A +   E    S S+++ P  
Sbjct: 130 FPKNAKRGQPGFLDILENVTNSWEGDRDEVENRAEQWTDAAKDRLEETPDSVSASQPP-- 187

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
              + L   A    +S D +FGGFGS  PKFP+P  ++++   + +   TG+     + Q
Sbjct: 188 -SSDVLEAAANASLRSADRQFGGFGSDGPKFPQPSRLRVLARAAAR---TGR----DDFQ 239

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
            + + TL  MA GG++DHVGGGFHRY VD  W VPHFEKMLYD   +   +L  +  T D
Sbjct: 240 DVFVETLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKMLYDNAAIPRAFLVGYQQTGD 299

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             Y+ +  + L ++ R++    G  FS  DA S + +   R +EG+FYVWT  EV D+L 
Sbjct: 300 ERYAEVVAETLTFVERELTHEEGGFFSTLDAQSEDPDTGER-EEGSFYVWTPDEVHDVLE 358

Query: 459 EH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
               A LF + Y +  +GN            F+G N    +   S  A++  +       
Sbjct: 359 NETDADLFCDRYDITESGN------------FEGSNQPNRVASVSDLAAEYDLDATDVRE 406

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
            L   R KLF  R +RPRP+ D+KV+  WNGL+I++ A A+ +L              G 
Sbjct: 407 RLESAREKLFAAREQRPRPNRDEKVLAGWNGLMIATCAEAALVLGG------------GE 454

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
           D  EY  +A  A  F+R  L+DE   RL   +++      G+L+DYAFL  G L  YE  
Sbjct: 455 DGDEYATMAVDALEFVRDRLWDEDEQRLSRRYKDEDVAIDGYLEDYAFLARGALGCYEAT 514

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
                L +A++L    ++ F D + G  + T     S++ R +E  D + PS   V+V  
Sbjct: 515 GEVDHLAFALDLARVIEDEFWDADRGTLYFTPESGESLVTRPQELGDQSTPSAAGVAVET 574

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 744
           L+ L   V   + D + + A   L     R++  ++    +C AAD L
Sbjct: 575 LLALEGFV--DQGDEFEEIATTVLETHANRIETNSLEHATLCLAADRL 620


>gi|448307474|ref|ZP_21497369.1| hypothetical protein C494_07045 [Natronorubrum bangense JCM 10635]
 gi|445595646|gb|ELY49750.1| hypothetical protein C494_07045 [Natronorubrum bangense JCM 10635]
          Length = 727

 Score =  434 bits (1116), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 244/645 (37%), Positives = 353/645 (54%), Gaps = 41/645 (6%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W E+A   A++ DVPIFLSIGYS CHWCHVME ESF 
Sbjct: 8   NRLDEEESPYLRQHADNPVNWQPWDEQALETAKEHDVPIFLSIGYSACHWCHVMESESFA 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA++LN+ FV IKVDREERPDVD +YMT  Q +   GGWPLS +L+P+ KP   GTY
Sbjct: 68  DEEVAEMLNENFVPIKVDREERPDVDSIYMTVCQLVTSRGGWPLSAWLTPEGKPFHIGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP E K G+PGF  IL ++ + W+  RD +        +  ++ L  +  +    +    
Sbjct: 128 FPKESKRGQPGFLDILERLAETWETDRDEVENRAQQWTDAATDQLEETPDTVAAAEPPSS 187

Query: 283 NALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           +AL   A+   +S D ++GGFGS  PKFP+P  ++++   ++  + TG+     E  +++
Sbjct: 188 DALEAAADTAVRSADRQYGGFGSGGPKFPQPSRLRVL---ARAFDRTGR----EEYLEVL 240

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             +L  M  GG++DHVGGGFHRY VD  W VPHFEKMLYD  ++    L  + LT +  Y
Sbjct: 241 EESLDAMIDGGLYDHVGGGFHRYCVDRDWTVPHFEKMLYDNAEIPRALLAGYQLTDEERY 300

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH- 460
           +    + L+++ R++    G  FS  DA S ++E   R +EGAF+VWT +EV ++L +  
Sbjct: 301 AETVAETLEFVERELTHDEGGFFSTLDAQSEDSETGER-EEGAFFVWTPEEVSEVLADET 359

Query: 461 -AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
            A LF   Y +  +GN            F+G+N    +   S+ A +  +        L 
Sbjct: 360 DADLFCARYDITESGN------------FEGQNQPNRVQSISSLAGEFDLEESDVETRLE 407

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
             R +LF+ R +RPRP+ D+KV+ SWNGL+I+++A A+ +L              G D  
Sbjct: 408 AARERLFEAREQRPRPNRDEKVLASWNGLMIATYAEAALVL--------------GDD-- 451

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
           EY E A  A  F+R  L+D    RL   +++G     G+L+DYAFL    +  YE     
Sbjct: 452 EYAETAVDALEFVRDRLWDADEKRLSRRYKDGDVAVDGYLEDYAFLARAAVGCYEATGEV 511

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
             L +A+EL  + +  F D E G  + T     S++ R +E +D   PS   V+V  L+ 
Sbjct: 512 DHLAFALELARSIEAEFWDAEAGTLYFTPESGESLVTRPQELNDQPTPSAAGVAVETLLA 571

Query: 700 LASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 744
           L      S++  +   A   L     R++   +    +C AAD L
Sbjct: 572 LDGFAGDSEA--FEAIASTVLETHANRIEANPLQHASLCLAADRL 614


>gi|418670392|ref|ZP_13231763.1| PF03190 family protein [Leptospira interrogans serovar Pyrogenes
           str. 2006006960]
 gi|418689642|ref|ZP_13250763.1| PF03190 family protein [Leptospira interrogans str. FPW2026]
 gi|418725255|ref|ZP_13283931.1| PF03190 family protein [Leptospira interrogans str. UI 12621]
 gi|418729313|ref|ZP_13287860.1| PF03190 family protein [Leptospira interrogans str. UI 12758]
 gi|421118286|ref|ZP_15578631.1| PF03190 family protein [Leptospira interrogans serovar Canicola
           str. Fiocruz LV133]
 gi|421121658|ref|ZP_15581951.1| PF03190 family protein [Leptospira interrogans str. Brem 329]
 gi|400361321|gb|EJP17288.1| PF03190 family protein [Leptospira interrogans str. FPW2026]
 gi|409961637|gb|EKO25382.1| PF03190 family protein [Leptospira interrogans str. UI 12621]
 gi|410010134|gb|EKO68280.1| PF03190 family protein [Leptospira interrogans serovar Canicola
           str. Fiocruz LV133]
 gi|410345509|gb|EKO96605.1| PF03190 family protein [Leptospira interrogans str. Brem 329]
 gi|410753774|gb|EKR15432.1| PF03190 family protein [Leptospira interrogans serovar Pyrogenes
           str. 2006006960]
 gi|410775491|gb|EKR55482.1| PF03190 family protein [Leptospira interrogans str. UI 12758]
 gi|456824626|gb|EMF73052.1| PF03190 family protein [Leptospira interrogans serovar Canicola
           str. LT1962]
          Length = 691

 Score =  434 bits (1116), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 271/689 (39%), Positives = 378/689 (54%), Gaps = 69/689 (10%)

Query: 95  SHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCH 154
           S+SRN   NRL+ E SPYL QH++NPVDWF WGEEA  +A+ +D  IFLSIGY+TCHWCH
Sbjct: 3   SNSRN--PNRLSKEKSPYLQQHSYNPVDWFPWGEEALTKAKDQDKLIFLSIGYATCHWCH 60

Query: 155 VMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDL 214
           VME ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++FL+P+ 
Sbjct: 61  VMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNMFLTPEG 120

Query: 215 KPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSN 274
           +P+ GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  + A   
Sbjct: 121 QPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGESRAKEK 180

Query: 275 KLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKKLEDTGK 330
           +  D  P+N            YDS+FGGF +    KFP  + +  +L  YHS        
Sbjct: 181 QEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-------- 232

Query: 331 SGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYL 390
           SG  +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD      +  
Sbjct: 233 SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSLFLEILA 291

Query: 391 DAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTS 450
           +   ++K +       DI+ YL RDM    G I SAEDADS   EG    +EG FY+W  
Sbjct: 292 EYSLVSKKISAESFALDIVSYLHRDMRMDEGGICSAEDADS---EG----EEGLFYIWDL 344

Query: 451 KEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 510
           +E  ++ GE + L ++ + +   GN            F+GKN+L E    S    +    
Sbjct: 345 EEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFTEEELKQ 392

Query: 511 LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
           L+K    L + + KL + RSKR RP  DDK++ SWNGL I +  +               
Sbjct: 393 LDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------------- 436

Query: 571 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLL 630
              +   R++++++AE   SFI ++L D    R+   FR G S   G+ +DYA +I+  +
Sbjct: 437 ---IAFQREDFLKLAEETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAEMIASSI 492

Query: 631 DLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSG 689
            L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +DG EPS 
Sbjct: 493 VLFEAGRGVRYLQNAVLWMEEAIRLF--RSPAGVFFDTGIDGEVLLRRSVDGYDGVEPSA 550

Query: 690 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR 749
           NS    +LVRL+ +  G  SDYYR+ AE     F   L   A++ P +  A       S 
Sbjct: 551 NSSLAHSLVRLSFL--GVNSDYYREIAESIFLYFRKELYSYALSYPFLLSA-----YWSY 603

Query: 750 KH----VVLVGHKSSVDFENMLAAAHASY 774
           KH    +VL+  K+S + ++MLA   + +
Sbjct: 604 KHHFREIVLI-RKNSEEGKDMLAWIQSRF 631


>gi|154303146|ref|XP_001551981.1| hypothetical protein BC1G_09593 [Botryotinia fuckeliana B05.10]
          Length = 753

 Score =  434 bits (1116), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 238/587 (40%), Positives = 349/587 (59%), Gaps = 26/587 (4%)

Query: 153 CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 212
           CH+ME ESFE+E VA +LN  F+ IK+DREERPD+D++YM +VQA  G GGWPL+VFL+P
Sbjct: 17  CHIMERESFENEEVAAILNSSFIPIKIDREERPDIDRIYMNFVQATTGSGGWPLNVFLTP 76

Query: 213 DLKPLMGGTYF----PPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 268
            L+P+ GGTY+       D   +  F  IL K+   W ++     Q  A +++QL +  +
Sbjct: 77  SLEPVFGGTYWRGPSKTTDFEDQVDFLGILDKLSTVWSEQESRCRQDSAQSLQQLKDFAN 136

Query: 269 ASASSNKLP---DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---YHS 322
               SN+L    D +    L    E  + SYD   GGFGSAPKFP P +I  +L      
Sbjct: 137 EGTLSNRLGEGVDNIDLELLEEVTEHFASSYDKANGGFGSAPKFPTPSKIAFLLRLGQFP 196

Query: 323 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 382
           + + D     +    +++ + TL+ MA+GGIHDH+G GF RYS    W +PHFEKMLYD 
Sbjct: 197 QAVVDIVGLPDCQNAREIAITTLRKMARGGIHDHIGNGFARYSATADWSLPHFEKMLYDN 256

Query: 383 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 442
            QL ++YLD F L++D  +  +  DI +YL   +    G  +S+EDADS    G + K+E
Sbjct: 257 AQLLHLYLDGFLLSRDPEFLGVAYDIANYLTTTLSHSEGGFYSSEDADSYYKNGDSEKRE 316

Query: 443 GAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 502
           GA+YVWT +E E+ILG    L    ++   TG+ ++ + +DPH+EF  +NVL   +  SA
Sbjct: 317 GAYYVWTKREFENILGSERGLILSAFF-NVTGHGNVGQENDPHDEFMDQNVLAISSTPSA 375

Query: 503 SASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFARASKILK 561
            AS+ G+   + + ++ E + +L   R + R +P +DDKV+VSWNG+ + + AR S ++ 
Sbjct: 376 LASQFGIKESEIIKVIKEGKAQLRRRRETDRVKPAMDDKVVVSWNGIAVGALARLSSVIN 435

Query: 562 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 621
                  F+ PV     +EY++ A  AA+FI+++LYD++   L   +R G     GF DD
Sbjct: 436 G------FD-PVKA---QEYLDAALKAATFIKKNLYDDKAKILYRIWREGRGDTQGFADD 485

Query: 622 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG-GGYFNTTGEDPSVLLRVKE 680
           YAFLI GL+DLYE     KWL WA ELQ +Q  LF D+ G G +F+TT   P+V+LR+K+
Sbjct: 486 YAFLIEGLIDLYETTFDEKWLQWADELQQSQINLFYDKNGTGAFFSTTVSAPNVILRLKD 545

Query: 681 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 727
             D +EPS N +S  NL RL+S+      + Y + A+ ++  FE  +
Sbjct: 546 AMDSSEPSTNGISSSNLYRLSSMF---NDESYAKKAKETVKSFEAEM 589


>gi|418679291|ref|ZP_13240555.1| PF03190 family protein [Leptospira kirschneri serovar Grippotyphosa
           str. RM52]
 gi|400320416|gb|EJO68286.1| PF03190 family protein [Leptospira kirschneri serovar Grippotyphosa
           str. RM52]
          Length = 696

 Score =  434 bits (1116), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 263/682 (38%), Positives = 375/682 (54%), Gaps = 61/682 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           +++ NRL+ E SPYL QH++NPVDWF WGEEA  +A+ +D  IFLS+GY+TCHWCHVME 
Sbjct: 13  SRNPNRLSKEKSPYLQQHSYNPVDWFPWGEEALTKAKDQDKLIFLSVGYATCHWCHVMEK 72

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++FL+P+ +P+ 
Sbjct: 73  ESFENQSIADYLNSHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNMFLTPEGQPIT 132

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
           GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  + A   +  D
Sbjct: 133 GGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGESRAKEKQEAD 192

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKKLEDTGKSGEA 334
             P+N            YDS+FGGF +    KFP  + +  +L  YHS        SG  
Sbjct: 193 FPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS--------SGNP 244

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
           +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD      +  + F 
Sbjct: 245 N-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSLFLEILAEYFL 303

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
           ++K +       DI+ YL RDM   GG I SAEDADS   EG    +EG FY+W  +E  
Sbjct: 304 VSKKISAKSFALDIVSYLHRDMRMDGGGICSAEDADS---EG----EEGLFYIWDLEEFR 356

Query: 455 DILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 514
           ++ GE + L ++ + +   GN            F+GKN+L E    +   S       K+
Sbjct: 357 EVCGEDSSLLEKFWNVTKEGN------------FEGKNILHE----NFRGSNFTEEESKH 400

Query: 515 LN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
           L+  L   + KL + RSKR RP  DDK++ SWNGL I +  +                  
Sbjct: 401 LDGALTRGKAKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG---------------- 444

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
           +   R++++++AE   SFI ++L D +  R+   FR G S   G+ +DYA +I+  + L+
Sbjct: 445 IAFQREDFLKLAEETYSFIEKNLIDSKG-RILRRFREGESGILGYSNDYAEMIASSIVLF 503

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSGNSV 692
           E G G ++L  A+        LF  R   G F  TG D  VLLR   D +DG EPS NS 
Sbjct: 504 EAGRGVRYLQNAVLWMEETIRLF--RSTAGVFFDTGIDGEVLLRRSVDGYDGVEPSANSS 561

Query: 693 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHV 752
              +LV+L+ +  G  SD YR+ AE     F   L   A++ P +  A       SR+ V
Sbjct: 562 LAHSLVKLSFL--GVNSDRYREVAESIFLYFRKELYSYALSYPFLLSAYWSYKYHSREIV 619

Query: 753 VLVGHKSSVDFENMLAAAHASY 774
           ++   K+S    ++LA   + +
Sbjct: 620 LI--RKNSEAGRDLLAWIQSRF 639


>gi|417784564|ref|ZP_12432270.1| PF03190 family protein [Leptospira interrogans str. C10069]
 gi|421127859|ref|ZP_15588077.1| PF03190 family protein [Leptospira interrogans serovar
           Grippotyphosa str. 2006006986]
 gi|421133342|ref|ZP_15593490.1| PF03190 family protein [Leptospira interrogans serovar
           Grippotyphosa str. Andaman]
 gi|409952381|gb|EKO06894.1| PF03190 family protein [Leptospira interrogans str. C10069]
 gi|410022350|gb|EKO89127.1| PF03190 family protein [Leptospira interrogans serovar
           Grippotyphosa str. Andaman]
 gi|410434326|gb|EKP83464.1| PF03190 family protein [Leptospira interrogans serovar
           Grippotyphosa str. 2006006986]
          Length = 691

 Score =  434 bits (1115), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 271/689 (39%), Positives = 378/689 (54%), Gaps = 69/689 (10%)

Query: 95  SHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCH 154
           S+SRN   NRL+ E SPYL QH++NPVDWF WGEEA  +A+ +D  IFLSIGY+TCHWCH
Sbjct: 3   SNSRN--PNRLSKEKSPYLQQHSYNPVDWFPWGEEALTKAKDQDKLIFLSIGYATCHWCH 60

Query: 155 VMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDL 214
           VME ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++FL+P+ 
Sbjct: 61  VMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNMFLTPEG 120

Query: 215 KPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSN 274
           +P+ GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  + A   
Sbjct: 121 QPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGESRAKEK 180

Query: 275 KLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKKLEDTGK 330
           +  D  P+N            YDS+FGGF +    KFP  + +  +L  YHS        
Sbjct: 181 QEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-------- 232

Query: 331 SGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYL 390
           SG  +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD      +  
Sbjct: 233 SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSLFLEILA 291

Query: 391 DAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTS 450
           +   ++K +       DI+ YL RDM    G I SAEDADS   EG    +EG FY+W  
Sbjct: 292 EYSLVSKKISAESFALDIVSYLHRDMRMDEGGICSAEDADS---EG----EEGLFYIWDL 344

Query: 451 KEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 510
           +E  ++ GE + L ++ + +   GN            F+GKN+L E    S    +    
Sbjct: 345 EEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFTEEELKQ 392

Query: 511 LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
           L+K    L + + KL + RSKR RP  DDK++ SWNGL I +  +               
Sbjct: 393 LDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------------- 436

Query: 571 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLL 630
              +   R++++++AE   SFI ++L D    R+   FR G S   G+ +DYA +I+  +
Sbjct: 437 ---IAFQREDFLKLAEETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAEMIASSI 492

Query: 631 DLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSG 689
            L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +DG EPS 
Sbjct: 493 VLFEAGRGVRYLQNAVLWMEEAIRLF--RSPAGVFFDTGIDGEVLLRRSVDGYDGVEPSA 550

Query: 690 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR 749
           NS    +LVRL+ +  G  SDYYR+ AE     F   L   A++ P +  A       S 
Sbjct: 551 NSSLAHSLVRLSFL--GVNSDYYREIAESIFLYFRKELYSYALSYPFLLSA-----YWSY 603

Query: 750 KH----VVLVGHKSSVDFENMLAAAHASY 774
           KH    +VL+  K+S + ++MLA   + +
Sbjct: 604 KHHFREIVLI-RKNSEEGKDMLAWIQSRF 631


>gi|212538503|ref|XP_002149407.1| DUF255 domain protein [Talaromyces marneffei ATCC 18224]
 gi|210069149|gb|EEA23240.1| DUF255 domain protein [Talaromyces marneffei ATCC 18224]
          Length = 783

 Score =  434 bits (1115), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 261/660 (39%), Positives = 362/660 (54%), Gaps = 51/660 (7%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K  NRL+   SPY+  H HNPV W  W  ++   A+K +  IF+SIGYS CHWCHVME E
Sbjct: 20  KLVNRLSESRSPYVRGHMHNPVAWQLWDSKSIELAKKHNRLIFVSIGYSACHWCHVMEKE 79

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SF    VA +LND F+ IKVDREERPD+D VYM YVQA  G GGWPL+VFL+PDL+P+ G
Sbjct: 80  SFMSTEVATILNDSFIPIKVDREERPDIDDVYMNYVQATTGSGGWPLNVFLTPDLEPVFG 139

Query: 220 GTYFP-----PEDKYGRP---GFKTILRKVKDAW--------DKKRDMLAQSGAFAIEQL 263
           GTY+P      + ++G     GF  IL K++D W        D  +++  Q   FA E  
Sbjct: 140 GTYWPGPQASSQSQWGAEGPIGFVDILEKLRDVWQTQQARCLDSAKEITKQLREFAEEGT 199

Query: 264 SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---Y 320
                A      L  EL + A     +  +  YD  +GGFG APKF  P  +  ++    
Sbjct: 200 HTQQGAKGGGEDLEIELIEEAF----QHFASRYDPLYGGFGRAPKFHTPANLSFLIRLGM 255

Query: 321 HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 380
           +   + D     E      M   TL  +A+GGI DH+G G  RYSV   W +PHFEKMLY
Sbjct: 256 YPSAVSDIVGQDECVRATAMATNTLLNIARGGIRDHIGHGVARYSVTADWLLPHFEKMLY 315

Query: 381 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSAETEGATR 439
           DQ QL +VY+DAF  T +        D++ YL  + I    G  +S+EDADS  T   T 
Sbjct: 316 DQAQLLDVYVDAFRATHEPELLGAVYDLVSYLTSEPIQASTGGYYSSEDADSLPTPNDTE 375

Query: 440 KKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN 498
           K+EGAFYVWT KE++ +LG+  A +   H+ +   GN  ++  +DPH+EF  +NVL    
Sbjct: 376 KREGAFYVWTMKELKQVLGQRDAGVCARHWGVLADGN--IAPENDPHDEFMDQNVLSIKV 433

Query: 499 DSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARAS 557
             S  A + G+  E+ + I+   ++KL D R K R RP LDDK+IV+WNGL I + A+AS
Sbjct: 434 TPSKLAKEFGLSEEEVIKIIKSGKQKLRDYREKIRVRPDLDDKIIVAWNGLTIGALAKAS 493

Query: 558 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG-PSKAP 616
            +L+           +     ++  + A  A  FIR+ L++  + +L   +R+G     P
Sbjct: 494 VLLEE----------IDKVKAQQCRDSAHKAVEFIRKTLFEPSSGQLWRIYRDGHRGNTP 543

Query: 617 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG-----GGYFNTTGE- 670
           GF DDYAFL SGL+ +YE      +L +A +LQ   ++ F+   G      GY+ T+ E 
Sbjct: 544 GFADDYAFLTSGLIAMYEATFDDSYLQFAEQLQKHLNQYFMAPGGESGTSAGYYTTSSEP 603

Query: 671 ---DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 727
              +P  LLR+K   D A PS N +   NLVRL +++   + D YR+ A  + + F   L
Sbjct: 604 ISGEPGPLLRLKSGTDSATPSINGIIARNLVRLGTLL---EDDNYRRLARQTCSTFSVEL 660


>gi|294827769|ref|NP_711139.2| hypothetical protein LA_0958 [Leptospira interrogans serovar Lai
           str. 56601]
 gi|386073252|ref|YP_005987569.1| hypothetical protein LIF_A0779 [Leptospira interrogans serovar Lai
           str. IPAV]
 gi|293385614|gb|AAN48157.2| conserved protein containing a thioredoxin domain [Leptospira
           interrogans serovar Lai str. 56601]
 gi|353457041|gb|AER01586.1| conserved protein containing a thioredoxin domain [Leptospira
           interrogans serovar Lai str. IPAV]
          Length = 714

 Score =  434 bits (1115), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 268/691 (38%), Positives = 379/691 (54%), Gaps = 67/691 (9%)

Query: 93  STSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHW 152
           ++  S +++ NRL+ E SPYL QH++NPVDWF WGEEA  +A+ +D  IFLSIGY+TCHW
Sbjct: 22  NSMESNSRNPNRLSKEKSPYLQQHSYNPVDWFPWGEEALTKAKDQDKLIFLSIGYATCHW 81

Query: 153 CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 212
           CHVME ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++FL+P
Sbjct: 82  CHVMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNMFLTP 141

Query: 213 DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASAS 272
           + +P+ GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  + A 
Sbjct: 142 EGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGESRAK 201

Query: 273 SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKKLEDT 328
             +  D  P+N            YDS+FGGF +    KFP  + +  +L  YHS      
Sbjct: 202 EKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS------ 255

Query: 329 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 388
             SG  +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD      +
Sbjct: 256 --SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSLFLEI 312

Query: 389 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 448
             +   ++K +       DI+ YL RDM    G I SAEDADS   EG    +EG FY+W
Sbjct: 313 LAEYSLVSKKISAESFALDIVSYLHRDMRMDEGGICSAEDADS---EG----EEGLFYIW 365

Query: 449 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 508
             +E  ++ GE + L ++ + +   GN            F+GKN+L E    S    +  
Sbjct: 366 DLEEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFTEEEL 413

Query: 509 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 568
             L+K    L + + KL + RSKR RP  DDK++ SWNGL I +  +             
Sbjct: 414 KQLDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG----------- 459

Query: 569 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 628
                +   R++++++AE   SFI ++L D    R+   FR G S   G+ +DYA +I+ 
Sbjct: 460 -----IAFQREDFLKLAEETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAEMIAS 513

Query: 629 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEP 687
            + L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +DG EP
Sbjct: 514 SIVLFEAGRGVRYLQNAVLWMEEAIRLF--RSPAGVFFDTGIDGEVLLRRSVDGYDGVEP 571

Query: 688 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 747
           S NS    +LVRL+ +  G  SDYYR+ AE     F   L   A++ P +  A       
Sbjct: 572 SANSSLAHSLVRLSFL--GVNSDYYREIAESIFLYFRKELYSYALSYPFLLSA-----YW 624

Query: 748 SRKH----VVLVGHKSSVDFENMLAAAHASY 774
           S KH    +VL+  K+S + ++MLA   + +
Sbjct: 625 SYKHHFREIVLI-RKNSEEGKDMLAWIQSRF 654


>gi|456972139|gb|EMG12591.1| PF03190 family protein [Leptospira interrogans serovar
           Grippotyphosa str. LT2186]
          Length = 699

 Score =  434 bits (1115), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 267/691 (38%), Positives = 379/691 (54%), Gaps = 67/691 (9%)

Query: 93  STSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHW 152
           ++  S +++ NRL+ E SPYL QH++NPVDWF WGEEA  +A+ +D  IFLSIGY+TCHW
Sbjct: 7   NSMESNSRNPNRLSKEKSPYLQQHSYNPVDWFPWGEEALTKAKDQDKLIFLSIGYATCHW 66

Query: 153 CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 212
           CHVME ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++FL+P
Sbjct: 67  CHVMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNMFLTP 126

Query: 213 DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASAS 272
           + +P+ GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  + A 
Sbjct: 127 EGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGESRAK 186

Query: 273 SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKKLEDT 328
             +  D  P+N            YDS+FGGF +    KFP  + +  +L  YHS      
Sbjct: 187 EKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS------ 240

Query: 329 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 388
             SG  +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD      +
Sbjct: 241 --SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSLFLEI 297

Query: 389 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 448
             +   ++K +       DI+ YL RDM    G I SAEDADS   EG    +EG FY+W
Sbjct: 298 LAEYSLVSKKISAESFALDIVSYLHRDMRMDEGGICSAEDADS---EG----EEGLFYIW 350

Query: 449 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 508
             +E  ++ GE + L ++ + +   GN            F+GKN+L E    S    +  
Sbjct: 351 DLEEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFTEEEL 398

Query: 509 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 568
             L+K    L + + KL + RSKR RP  DDK++ SWNGL I +  +             
Sbjct: 399 KQLDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG----------- 444

Query: 569 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 628
                +   R++++++AE   SFI ++L D    R+   FR G S   G+ +DYA +I+ 
Sbjct: 445 -----IAFQREDFLKLAEETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAEMIAS 498

Query: 629 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEP 687
            + L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +DG EP
Sbjct: 499 SIVLFEAGRGVRYLQNAVLWMEEAIRLF--RSSAGVFFDTGIDGEVLLRRSVDGYDGVEP 556

Query: 688 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 747
           S NS    +LVRL+ +  G  S+YYR+ AE     F   L   A++ P +  A       
Sbjct: 557 SANSSLAHSLVRLSFL--GVNSNYYREIAESIFLYFRKELYSYALSYPFLLSA-----YW 609

Query: 748 SRKH----VVLVGHKSSVDFENMLAAAHASY 774
           S KH    +VL+  K+S + ++MLA   + +
Sbjct: 610 SYKHHFREIVLI-RKNSEEGKDMLAWIQSRF 639


>gi|448305439|ref|ZP_21495370.1| hypothetical protein C495_14092 [Natronorubrum sulfidifaciens JCM
           14089]
 gi|445588825|gb|ELY43066.1| hypothetical protein C495_14092 [Natronorubrum sulfidifaciens JCM
           14089]
          Length = 727

 Score =  434 bits (1115), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 246/645 (38%), Positives = 353/645 (54%), Gaps = 41/645 (6%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W E+A   AR+ DVPIFLSIGYS CHWCHVME ESF 
Sbjct: 8   NRLDEEESPYLRQHADNPVNWQPWDEQALETAREHDVPIFLSIGYSACHWCHVMEDESFA 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+ VA+LLN+ FV IKVDREERPDVD +YMT  Q +   GGWPLS +L+P+ KP   GTY
Sbjct: 68  DDEVAELLNENFVPIKVDREERPDVDSIYMTVCQLVTSRGGWPLSAWLTPEGKPFHIGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP E K G+PGF  IL ++ + W+  R+ +        +  ++ L  +  +    +    
Sbjct: 128 FPKESKRGQPGFLDILERLAETWETDREEVENRAQQWTDAATDQLEETPDTVAAAEPPSS 187

Query: 283 NALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           + L   A+   +S D ++GGFGS  PKFP+P  ++++   ++  + TG+    SE  +++
Sbjct: 188 DVLETAADTALRSADRQYGGFGSGGPKFPQPSRLRVL---ARAFDRTGQ----SEYLEVL 240

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             +L  M  GG++DHVGGGFHRY VD  W VPHFEKMLYD  ++    L  + LT +  Y
Sbjct: 241 EESLDAMIDGGLYDHVGGGFHRYCVDRDWTVPHFEKMLYDNAEIPRALLAGYQLTGEERY 300

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH- 460
           +    + L ++ R++    G  FS  DA S + E   R +EGAF+VWT +EV ++L +  
Sbjct: 301 AETVAETLAFVDRELTHDDGGFFSTLDAQSKDPETGER-EEGAFFVWTPEEVSEVLEDQT 359

Query: 461 -AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
            A LF E Y +  +GN            F+G+N    +   S+ A    +  ++    L 
Sbjct: 360 TAELFCERYDITESGN------------FEGQNQPNRVQSISSLAEAFDLEEQEVETRLE 407

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
             R +LF+ R +RPRP+ D+KV+ SWNGL+I+++A A+ +L              G D  
Sbjct: 408 AARERLFEAREQRPRPNRDEKVLASWNGLMIATYAEAALVL--------------GDD-- 451

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
           EY E A  A  F+R  L+D    RL   +++G     G+L+DYAFL    +  YE     
Sbjct: 452 EYAETAVDALEFVRDRLWDADEKRLSRRYKDGDVAVDGYLEDYAFLARAAVGCYEATGEV 511

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
             L +A+EL  T +  F D E G  + T     S++ R +E +D + PS   V+V  L+ 
Sbjct: 512 DHLAFALELARTIEAEFWDAEAGTLYFTPESGESLVTRPQELNDQSTPSAAGVAVETLLA 571

Query: 700 LASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 744
           L      S+   +   A   L     R++   +    +C AAD L
Sbjct: 572 LDRFAVDSEE--FEAIASTVLETHANRIEANPLQHASLCLAADRL 614


>gi|46579138|ref|YP_009946.1| hypothetical protein DVU0725 [Desulfovibrio vulgaris str.
           Hildenborough]
 gi|387152533|ref|YP_005701469.1| hypothetical protein Deval_0667 [Desulfovibrio vulgaris RCH1]
 gi|46448551|gb|AAS95205.1| conserved hypothetical protein [Desulfovibrio vulgaris str.
           Hildenborough]
 gi|311232977|gb|ADP85831.1| hypothetical protein Deval_0667 [Desulfovibrio vulgaris RCH1]
          Length = 715

 Score =  433 bits (1114), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 269/699 (38%), Positives = 378/699 (54%), Gaps = 53/699 (7%)

Query: 89  RTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYS 148
           RTP  T+       NRLA   SPYL QHAHNPVDW  WGE A A AR+RDVP+F+S+GYS
Sbjct: 5   RTPLQTTGP-----NRLATAPSPYLRQHAHNPVDWHPWGEAALALARERDVPLFVSVGYS 59

Query: 149 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 208
           TCHWCHVM  ESFED  V++ LN+ FV +KVDREERPD+D +YM   Q L G GGWPL++
Sbjct: 60  TCHWCHVMAHESFEDAEVSQALNEGFVCVKVDREERPDIDALYMNACQMLTGTGGWPLTI 119

Query: 209 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKR-DMLAQSG--AFAIEQLSE 265
           F  PD  P    TY P   + GR G   ++ +V+D +  +R D+ A +   A A+ + + 
Sbjct: 120 FALPDGTPFFAATYLPKRSRGGRAGLLDLIPRVRDIYATRRADVEASAADIAKAMRERAA 179

Query: 266 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 325
            L  S    + P       LR     L  ++D+  GGFG APKFP P  +  +L H ++ 
Sbjct: 180 ELLQSPPDGRTP---AAGTLRAAFNDLVANFDTAHGGFGGAPKFPSPHLLLFLLRHGRRT 236

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
            D       S  Q M L TL+ M +GG+ D +GGG HRYS D RW +PHFEKML+DQ   
Sbjct: 237 GD-------SRSQDMALATLRGMLRGGLWDRLGGGIHRYSTDARWLLPHFEKMLHDQAMF 289

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
                + +  T++           DY+ RDM   GG + +AEDADS   EG  +++EGAF
Sbjct: 290 MLATAETWLATREDDMREAALATADYILRDMALSGGGLAAAEDADSLTPEG--KRREGAF 347

Query: 446 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASA 504
           Y +T  EV +  G++A L    + +   GN       +     +G NVL + L D   +A
Sbjct: 348 YTFTFDEVREAAGDNADLAVRLFGITGEGNI----ADESTGRREGHNVLHLPLGDD--AA 401

Query: 505 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 564
           + LG+  E+      +    L  +R+ R RPH DDK++  WNGL I++ AR   +     
Sbjct: 402 TTLGIDAEELAFRHDDILAGLRSLRATRRRPHRDDKLLTDWNGLAIAALARCGHV----- 456

Query: 565 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT--HRLQHSFRNGPSKAPGFLDDY 622
               F+ P           + ++AAS     L  + T    L HS   G    PGFLDDY
Sbjct: 457 ----FDAP----------HLTDAAASLADAVLTLQHTPDGGLLHSRFEGTGSTPGFLDDY 502

Query: 623 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDP-SVLLRVKED 681
           AF+I GLL+LY   +  +WL  AI LQ+ QD+ FLD   GGY++T  + P +  LR+KE 
Sbjct: 503 AFVIWGLLELYTATNQPQWLEEAIRLQHAQDDRFLDPVDGGYWHTPADAPRTAALRLKEA 562

Query: 682 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 741
            DGA PSGN+ +++NL+RLA ++  +    Y + A   +  F ++++   +   +  C  
Sbjct: 563 RDGALPSGNAAALLNLLRLARLLGDAS---YEEKAHGLIRAFASQVRHNPLGAAMFLCGV 619

Query: 742 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           D  ++   + V++ G   + D E ML A   SY  N  +
Sbjct: 620 D-FALTGGRLVIIAGEAQAPDTEAMLDAVRRSYSPNTVM 657


>gi|242806544|ref|XP_002484765.1| DUF255 domain protein [Talaromyces stipitatus ATCC 10500]
 gi|218715390|gb|EED14812.1| DUF255 domain protein [Talaromyces stipitatus ATCC 10500]
          Length = 791

 Score =  433 bits (1114), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 268/678 (39%), Positives = 371/678 (54%), Gaps = 51/678 (7%)

Query: 82  KVVAMAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPI 141
           K  A +E   A+T   R K  NRL    SPY+  H +NPV W  W  +A   A+K +  I
Sbjct: 4   KANARSEEHHATTGAPRLKLVNRLNESRSPYVRGHMNNPVAWQLWDSKAIELAKKHNRLI 63

Query: 142 FLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGG 201
           F+SIGYS CHWCHVME ESF    VA +LN+ F+ IKVDREERPD+D VYM YVQA  G 
Sbjct: 64  FVSIGYSACHWCHVMEKESFMSTEVATILNESFIPIKVDREERPDIDDVYMNYVQATTGS 123

Query: 202 GGWPLSVFLSPDLKPLMGGTYFP-----PEDKYGRP---GFKTILRKVKDAW-------- 245
           GGWPL+VFL+PDL+P+ GGTY+P      + ++G     GF  IL K++D W        
Sbjct: 124 GGWPLNVFLTPDLEPVFGGTYWPGPHSSSQSQWGVEGPIGFVDILEKLRDVWQTQQARCL 183

Query: 246 DKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS 305
           D  +++  Q   FA E       A +    L  EL + A     +  +  YD  +GGFG 
Sbjct: 184 DSAKEITKQLREFAEEGTHVQQGAKSGGEDLEIELIEEAF----QHFASRYDPVYGGFGR 239

Query: 306 APKFPRPVEIQMML---YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFH 362
           APKFP P  +  ++    +   + D     E      M   TL  +A+GGI DH+G G  
Sbjct: 240 APKFPTPANLGFLIRLGMYPTAVSDIVGQDECVRATAMATKTLLNIARGGIRDHIGHGVA 299

Query: 363 RYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGG 421
           RYSV   W +PHFEKMLYDQ QL +VY+DAF  T +        D++ YL  + I    G
Sbjct: 300 RYSVTTDWLLPHFEKMLYDQAQLLDVYVDAFRATHEPELLGAVYDLVSYLTSEPIQASTG 359

Query: 422 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSR 480
             +S+EDADS  +   T K+EGAFYVWT KE++ +LG+  A +   H+ +   GN  ++ 
Sbjct: 360 GYYSSEDADSLPSPNDTEKREGAFYVWTLKELKQVLGQRDAGVCARHWGVLADGN--IAP 417

Query: 481 MSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDD 539
            +DPH+EF  +NVL      S  A + G+  E+ + I+   ++KL + R K R RP LDD
Sbjct: 418 ENDPHDEFMDQNVLSIKVTPSKLAKEFGLSEEEVIKIIKSGKQKLREYREKARVRPDLDD 477

Query: 540 KVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE 599
           K+I +WNGL I + A+AS IL  E ++            ++  + A+ A  FI+  L++ 
Sbjct: 478 KIIAAWNGLAIGALAKAS-ILLEEIDTI---------KAQQCRDSAQRAVEFIKTTLFEP 527

Query: 600 QTHRLQHSFRNGP-SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFL- 657
            T +L   +R+G     PGF DDYAFLISGL+ +YE      +L +A +LQ   ++ F+ 
Sbjct: 528 STGQLWRIYRDGSRGNTPGFADDYAFLISGLITMYEATFDDSYLQFAEQLQEHLNKYFIA 587

Query: 658 ----DREGGGYFNTTGE----DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKS 709
                    GY+ T+ E    +P  LLR+K   D A PS N +   NLVRL S++   + 
Sbjct: 588 PGDEPDTYAGYYTTSSEPIPDEPGPLLRLKSGTDSATPSINGIIARNLVRLGSLL---ED 644

Query: 710 DYYRQNAEHSLAVFETRL 727
           D YRQ A  + + F   L
Sbjct: 645 DTYRQLARQTCSTFSVEL 662


>gi|188475827|gb|ACD50089.1| hypothetical protein [uncultured crenarchaeote MCG]
          Length = 684

 Score =  433 bits (1113), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 254/629 (40%), Positives = 362/629 (57%), Gaps = 61/629 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N LA E+SPYLLQHA NPVDW  WGE+A A A++ + PIFLSIGY+ CHWCHVM  ESFE
Sbjct: 3   NYLAEENSPYLLQHASNPVDWHPWGEQALARAKQENKPIFLSIGYAACHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE  A +LN+ FV +KVDREERPD+D +YM    AL G GGWP+SVFL+PDL+P   GTY
Sbjct: 63  DELTASILNENFVCVKVDREERPDLDAIYMRATVALSGSGGWPMSVFLTPDLRPFYAGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL-- 280
           FPP  +Y  PGF  +LR +  AW  ++          I  ++  +  S S+  LP  L  
Sbjct: 123 FPPARRYNLPGFPELLRALAQAWGTRQQ--------EIHAVAARVDQSLSTPDLPSHLGV 174

Query: 281 -PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
             Q  L      L +  D + GG+G+APKFP+P+ I+++L     L+     G  ++G  
Sbjct: 175 VSQQLLEQAESWLVRHADRQHGGWGAAPKFPQPMAIELLL-----LQAAADPGAHADGLA 229

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           +   +LQ MA+GG++D +GGGF RYS D  WHVPHFEKMLYD  QLA  YL AF +T + 
Sbjct: 230 VATQSLQAMARGGMYDVLGGGFSRYSTDTTWHVPHFEKMLYDNAQLALAYLHAFLVTGET 289

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 459
            +  +  + LD++ R+M  P G  +S+ DADS   EG    +EG +YVWT  E+ +++G+
Sbjct: 290 SFRQVAAETLDFVAREMTHPEGGFYSSLDADS---EG----REGKYYVWTQAEIREVIGD 342

Query: 460 HAI--LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
            ++  LF   Y     G    S         +G+ +L    + +  +++      +   +
Sbjct: 343 PSMTELFLAAY---DAGTAPAS---------QGEIILQRAPNDANLSARFDKSASEIEEL 390

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
           L   R +LF  R  RPRP LDDKVIV+WNGL++ +FA+A++            F   GS 
Sbjct: 391 LQRARARLFRARQARPRPGLDDKVIVAWNGLMLQAFAQAARC-----------FGGAGSG 439

Query: 578 RKE-YMEVAESAASFIRRHLYDE-QTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
             + Y+EVA   A+F+  +L +  Q HR+   +R G +    FL+DYA LI GLLDLY+ 
Sbjct: 440 TGDMYLEVATRNAAFLLGNLRNHGQLHRI---WRRGKTGQHVFLEDYAALILGLLDLYQA 496

Query: 636 GSGTKWLVWAIELQNTQDELFLDREG--GGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
                W + A +L    DE+ L      GG+F+T  +    L+R  E  DGA P+G +++
Sbjct: 497 DFSNAWFIAARQL---ADEMLLRFAAPDGGFFDTPDDSKPPLIRPMELQDGATPAGGALA 553

Query: 694 VINLVRLASIVAGSKSDYYRQNAEHSLAV 722
              L++LA++   +    YR +AE +L +
Sbjct: 554 TEALLKLAALTGEAT---YRDHAERTLPL 579


>gi|418710447|ref|ZP_13271218.1| PF03190 family protein [Leptospira interrogans serovar
           Grippotyphosa str. UI 08368]
 gi|410769383|gb|EKR44625.1| PF03190 family protein [Leptospira interrogans serovar
           Grippotyphosa str. UI 08368]
          Length = 691

 Score =  433 bits (1113), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 270/689 (39%), Positives = 378/689 (54%), Gaps = 69/689 (10%)

Query: 95  SHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCH 154
           S+SRN   NRL+ E SPYL QH++NPVDWF WGEEA  +A+ +D  IFLSIGY+TCHWCH
Sbjct: 3   SNSRN--PNRLSKEKSPYLQQHSYNPVDWFPWGEEALTKAKDQDKLIFLSIGYATCHWCH 60

Query: 155 VMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDL 214
           VME ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++FL+P+ 
Sbjct: 61  VMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNMFLTPEG 120

Query: 215 KPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSN 274
           +P+ GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  + A   
Sbjct: 121 QPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGESRAKEK 180

Query: 275 KLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKKLEDTGK 330
           +  D  P+N            YDS+FGGF +    KFP  + +  +L  YHS        
Sbjct: 181 QEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-------- 232

Query: 331 SGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYL 390
           SG  +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD      +  
Sbjct: 233 SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSLFLEILA 291

Query: 391 DAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTS 450
           +   ++K +       DI+ YL RDM    G I SAEDADS   EG    +EG FY+W  
Sbjct: 292 EYSLVSKKISAESFALDIVSYLHRDMRMDEGGICSAEDADS---EG----EEGLFYIWDL 344

Query: 451 KEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 510
           +E  ++ GE + L ++ + +   GN            F+GKN+L E    S    +    
Sbjct: 345 EEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFTEEELKQ 392

Query: 511 LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
           L+K    L + + KL + RSKR RP  DDK++ SWNGL I +  +               
Sbjct: 393 LDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------------- 436

Query: 571 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLL 630
              +   R++++++AE   SFI ++L D    R+   FR G S   G+ +DYA +I+  +
Sbjct: 437 ---IAFQREDFLKLAEETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAEMIASSI 492

Query: 631 DLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSG 689
            L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +DG EPS 
Sbjct: 493 VLFEAGRGVRYLQNAVLWMEEAIRLF--RSSAGVFFDTGIDGEVLLRRSVDGYDGVEPSA 550

Query: 690 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR 749
           NS    +LVRL+ +  G  S+YYR+ AE     F   L   A++ P +  A       S 
Sbjct: 551 NSSLAHSLVRLSFL--GVNSNYYREIAESIFLYFRKELYSYALSYPFLLSA-----YWSY 603

Query: 750 KH----VVLVGHKSSVDFENMLAAAHASY 774
           KH    +VL+  K+S + ++MLA   + +
Sbjct: 604 KHHFREIVLI-RKNSEEGKDMLAWIQSRF 631


>gi|358371871|dbj|GAA88477.1| DUF255 domain protein [Aspergillus kawachii IFO 4308]
          Length = 784

 Score =  433 bits (1113), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 259/628 (41%), Positives = 349/628 (55%), Gaps = 35/628 (5%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K  NRL    SPY+  H +NPV W  W  EA   A++ +  IFLSIGYS CHWCHVME E
Sbjct: 18  KLVNRLHESRSPYVRAHMNNPVGWQLWDAEAIDLAKRHNRLIFLSIGYSACHWCHVMEKE 77

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SF  + VA +LN  F+ IKVDREERPD+D VYM YVQA  G GGWPL+VFL+PDL+P+ G
Sbjct: 78  SFMSQEVASILNQSFIPIKVDREERPDIDDVYMNYVQATTGSGGWPLNVFLTPDLEPVFG 137

Query: 220 GTYFPPEDKYGRP-----GFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASS- 273
           GTY+P  +          GF  IL K+ D W  ++    +S     +QL E       S 
Sbjct: 138 GTYWPGPNSSTLTGNETIGFVEILEKLSDVWQTQQLRCRESAKEITKQLREFAEEGTHSY 197

Query: 274 ---NKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---YHSKKLED 327
               +  ++L    L    +     YD   GGF +APKFP P  +  +L    +   + D
Sbjct: 198 QGDRQADEDLDLELLEEAYQHFVSRYDPLHGGFSTAPKFPTPSNLSFLLRLGIYPTAVAD 257

Query: 328 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 387
                E ++   M + TL  MA+GGI DH+G GF RYSV   W +PHFEKMLYDQ QL +
Sbjct: 258 IVGRDECAKATAMAVDTLISMARGGIRDHIGHGFARYSVTGDWGLPHFEKMLYDQAQLLD 317

Query: 388 VYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSAETEGATRKKEGAFY 446
           VY+DAF +T +        D+  YL    I  P G   S+EDADS  T   T K+EGAFY
Sbjct: 318 VYVDAFKITHNPELLGAVYDLATYLTTAPIQSPTGAFHSSEDADSLPTPNDTEKREGAFY 377

Query: 447 VWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 505
           VWT KE+  +LG+  A +   H+ + P GN  ++  +DPH+EF  +NVL      S  A 
Sbjct: 378 VWTLKELTQVLGQRDAGVCARHWGVLPDGN--IAPENDPHDEFMNQNVLSVKVTPSRLAK 435

Query: 506 KLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEA 564
             G+  E+ + I+   ++KL D R + R RP LDDK+IV+WNGL I + A+ S + + E 
Sbjct: 436 DFGLGEEEVVRIIRTAKQKLRDYRERTRVRPDLDDKIIVAWNGLAIGALAKCSALFE-EI 494

Query: 565 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP-SKAPGFLDDYA 623
           ES         S   +  E A  A SFI+ +L+++ T +L   +R+G     PGF DDYA
Sbjct: 495 ES---------SKAVQCREAAAKAISFIKENLFEKSTGQLWRIYRDGGRGNTPGFADDYA 545

Query: 624 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG---GGYFNT----TGEDPSVLL 676
           +LI GLLD+YE      +L +A +LQ   ++ FL   G    GY++T    T   P  LL
Sbjct: 546 YLIGGLLDMYEATFDDSYLQFAEQLQKYLNDNFLAYVGTTPAGYYSTPSTMTSGAPGPLL 605

Query: 677 RVKEDHDGAEPSGNSVSVINLVRLASIV 704
           R+K   +   P+ N V   NL+RL S++
Sbjct: 606 RLKTGTESVTPAVNGVIARNLLRLGSLL 633


>gi|383625377|ref|ZP_09949783.1| hypothetical protein HlacAJ_18680 [Halobiforma lacisalsi AJ5]
 gi|448700355|ref|ZP_21699463.1| hypothetical protein C445_15926 [Halobiforma lacisalsi AJ5]
 gi|445779895|gb|EMA30810.1| hypothetical protein C445_15926 [Halobiforma lacisalsi AJ5]
          Length = 746

 Score =  432 bits (1112), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 257/655 (39%), Positives = 350/655 (53%), Gaps = 52/655 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W E+A   AR+ DVPIFLSIGYS CHWCHVME ESF 
Sbjct: 12  NRLDEEESPYLRQHADNPVNWQPWDEQALETAREHDVPIFLSIGYSACHWCHVMEEESFA 71

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA LLND FV IKVDREERPDVD +YMT  Q + G GGWPLS +L+P+ KP   GTY
Sbjct: 72  DEDVADLLNDHFVPIKVDREERPDVDSIYMTVCQLVSGRGGWPLSAWLTPEGKPFYVGTY 131

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGA----FAIEQLSEAL------SASAS 272
           FP E K G+PGF  IL  V D+W+  R+ +          A ++L E         A+ +
Sbjct: 132 FPKESKRGQPGFVDILENVIDSWETDREEIENRAQKWTDAARDELEETPGTGGPGDAAVA 191

Query: 273 SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDTGKS 331
            +  P     + L   A+   +S D  +GGFGS  PKFP+P  ++++   S +   TG  
Sbjct: 192 ESTEPTPPSSDLLETTADAAVRSADRGYGGFGSDGPKFPQPSRLRVLARASDR---TG-- 246

Query: 332 GEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLD 391
           GE    ++++  TL  MA GG++DHVGGGFHRY VD  W VPHFEKMLYD  ++   +L 
Sbjct: 247 GETY--REVLEETLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKMLYDNAEIPRAFLT 304

Query: 392 AFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSK 451
            + LT D  Y+ +  + L ++ R++    G  F+  DA S + E   R +EGAFYVWT  
Sbjct: 305 GYRLTGDDRYAEVVEETLAFVDRELTHDEGGFFATLDAQSEDPETGER-EEGAFYVWTPD 363

Query: 452 EVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 509
           EV D+L +   A LF E Y +  +GN            F+G+N    +   +  A    +
Sbjct: 364 EVRDVLEDETDAELFCERYDITASGN------------FEGENQPNRVRSVADLAESFDL 411

Query: 510 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 569
              +    L + R +LF  R +RPRP+ D+KV+  WNGL+I++ A A+  L         
Sbjct: 412 EESEVRERLADARERLFAAREERPRPNRDEKVLAGWNGLMIATCAEAAMTL--------- 462

Query: 570 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 629
                G D  EY  +A  A  F+R  L+D    RL   +++      G+L+DYAFL  G 
Sbjct: 463 -----GED--EYATMAVDALEFVRERLWDADERRLSRRYKDDDVAIDGYLEDYAFLARGA 515

Query: 630 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 689
           L  Y+       L +A++L    +  F D E G  + T      ++ R +E  D + PS 
Sbjct: 516 LACYQATGDVDHLAFALDLAREIEGEFWDEEAGTLYFTPESGEDLVTRPQELGDQSTPSA 575

Query: 690 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 744
             V+V  L+ L S V  +    Y + AE  L     RL+   +    +C  AD L
Sbjct: 576 AGVAVETLLALESFVPDAD---YAELAETVLGTHVDRLEGSPLQHATLCLGADRL 627


>gi|120603287|ref|YP_967687.1| hypothetical protein Dvul_2244 [Desulfovibrio vulgaris DP4]
 gi|120563516|gb|ABM29260.1| protein of unknown function DUF255 [Desulfovibrio vulgaris DP4]
          Length = 715

 Score =  432 bits (1112), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 268/699 (38%), Positives = 376/699 (53%), Gaps = 53/699 (7%)

Query: 89  RTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYS 148
           RTP  T+       NRLA   SPYL QHAHNPVDW  WGE A A AR+RDVP+F+S+GYS
Sbjct: 5   RTPLQTTGP-----NRLATAPSPYLRQHAHNPVDWHPWGEAALALARERDVPLFVSVGYS 59

Query: 149 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 208
           TCHWCHVM  ESFED  VA+ LN+ FV +KVDREERPD+D +YM   Q L G GGWPL++
Sbjct: 60  TCHWCHVMAHESFEDAEVAQALNEGFVCVKVDREERPDIDALYMNACQMLTGTGGWPLTI 119

Query: 209 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSG---AFAIEQLSE 265
           F  PD  P    TY P   + GR G   ++ +V+D +  +R  +  S    A A+ + + 
Sbjct: 120 FALPDGTPFFAATYLPKRSRGGRAGLLDLIPRVRDIYATRRADVEASAADIAKAMRERAA 179

Query: 266 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 325
            L  S    + P       LR     L  ++D+  GGFG APKFP P  +  +L H ++ 
Sbjct: 180 ELLQSPPDGRTP---AAGTLRAAFNDLVANFDTAHGGFGGAPKFPSPHLLLFLLRHGRRT 236

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
            D       S  Q M L TL+ M +GG+ D +GGG HRYS D RW +PHFEKML+DQ   
Sbjct: 237 GD-------SRSQDMALATLRGMLRGGLWDRLGGGIHRYSTDARWLLPHFEKMLHDQAMF 289

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
                + +  T++           DY+ RDM   GG + +AEDADS   EG  +++EGAF
Sbjct: 290 MLATAETWLATREDDMREAALATADYILRDMALSGGGLAAAEDADSLTPEG--KRREGAF 347

Query: 446 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASA 504
           Y +T  EV +  G++A L    + +   GN       +     +G NVL + L D   +A
Sbjct: 348 YTFTFDEVREAAGDNADLAVRLFGITGEGNI----ADESTGRREGHNVLHLPLGDD--AA 401

Query: 505 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 564
           + LG+  ++      +    L  +R+ R RPH DDK++  WNGL I++ AR   +     
Sbjct: 402 TTLGIDADELAFRHDDILAGLRSLRATRRRPHRDDKLLTDWNGLAIAALARCGHV----- 456

Query: 565 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT--HRLQHSFRNGPSKAPGFLDDY 622
               F+ P           + ++AAS     L  + T    L HS   G    PGFLDDY
Sbjct: 457 ----FDAP----------HLTDAAASLADAVLTLQHTPDGGLLHSRFEGTGSTPGFLDDY 502

Query: 623 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDP-SVLLRVKED 681
           AF+I GLL+LY   +  +WL  AI LQ+ QD+ FLD   GGY++T  + P +  LR+KE 
Sbjct: 503 AFVIWGLLELYTATNQPQWLEEAIRLQHAQDDRFLDPVDGGYWHTPADAPRTAALRLKEA 562

Query: 682 HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 741
            DGA PSGN+ +++NL+RLA ++  +    Y + A   +  F ++++   +   +  C  
Sbjct: 563 RDGALPSGNAAALLNLLRLARLLGDAS---YEEKAHGLIRAFASQVRHNPLGAAMFLCGV 619

Query: 742 DMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           D  ++   + V++ G   + D E ML A   SY  N  +
Sbjct: 620 D-FALTGGRLVIIAGEAQAPDTEAMLDAVRRSYSPNTVM 657


>gi|418701443|ref|ZP_13262368.1| PF03190 family protein [Leptospira interrogans serovar Bataviae
           str. L1111]
 gi|410759525|gb|EKR25737.1| PF03190 family protein [Leptospira interrogans serovar Bataviae
           str. L1111]
          Length = 691

 Score =  432 bits (1112), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 271/689 (39%), Positives = 377/689 (54%), Gaps = 69/689 (10%)

Query: 95  SHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCH 154
           S+SRN   NRL+ E SPYL QH++NPVDWF WGEEA  +A+ +D  IFLSIGY+TCHWCH
Sbjct: 3   SNSRN--PNRLSKEKSPYLQQHSYNPVDWFPWGEEALTKAKDQDKLIFLSIGYATCHWCH 60

Query: 155 VMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDL 214
           VME ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++FL+P+ 
Sbjct: 61  VMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNMFLTPEG 120

Query: 215 KPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSN 274
           +P+ GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  + A   
Sbjct: 121 QPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASEFSQYLKDSGESRAKEK 180

Query: 275 KLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKKLEDTGK 330
           +  D  P+N            YDS+FGGF +    KFP  + +  +L  YHS        
Sbjct: 181 QEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-------- 232

Query: 331 SGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYL 390
           SG  +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD      +  
Sbjct: 233 SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSLFLEILA 291

Query: 391 DAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTS 450
           +   ++K +       DI+ YL RDM    G I SAEDADS   EG    +EG FY+W  
Sbjct: 292 EYSLVSKKISAESFALDIVSYLHRDMRMDEGGICSAEDADS---EG----EEGLFYIWDL 344

Query: 451 KEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 510
           +E  ++ GE + L ++ + +   GN            F+GKN+L E    S    +    
Sbjct: 345 EEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFTEEELKQ 392

Query: 511 LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
           L+K    L + + KL + RSKR RP  DDK++ SWNGL I +  +               
Sbjct: 393 LDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------------- 436

Query: 571 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLL 630
              +   R++++++AE   SFI ++L D    R+   FR G S   G+ +DYA +I+  +
Sbjct: 437 ---IAFQREDFLKLAEETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAEMIASSI 492

Query: 631 DLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSG 689
            L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +DG EPS 
Sbjct: 493 VLFEAGRGVRYLQNAVLWMEEAIRLF--RSPAGVFFDTGIDGEVLLRRSVDGYDGVEPSA 550

Query: 690 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR 749
           NS    +LVRL+ +  G  SDYYR+ AE     F   L   A+  P +  A       S 
Sbjct: 551 NSSLAHSLVRLSFL--GVNSDYYREIAESIFLYFRKELYSYALNYPFLLSA-----YWSY 603

Query: 750 KH----VVLVGHKSSVDFENMLAAAHASY 774
           KH    +VL+  K+S + ++MLA   + +
Sbjct: 604 KHHFREIVLI-RKNSEEGKDMLAWIQSRF 631


>gi|417761487|ref|ZP_12409496.1| PF03190 family protein [Leptospira interrogans str. 2002000624]
 gi|417772112|ref|ZP_12420002.1| PF03190 family protein [Leptospira interrogans serovar Pomona str.
           Pomona]
 gi|417776397|ref|ZP_12424235.1| PF03190 family protein [Leptospira interrogans str. 2002000621]
 gi|418671976|ref|ZP_13233322.1| PF03190 family protein [Leptospira interrogans str. 2002000623]
 gi|418680449|ref|ZP_13241698.1| PF03190 family protein [Leptospira interrogans serovar Pomona str.
           Kennewicki LC82-25]
 gi|418703630|ref|ZP_13264514.1| PF03190 family protein [Leptospira interrogans serovar Hebdomadis
           str. R499]
 gi|400327807|gb|EJO80047.1| PF03190 family protein [Leptospira interrogans serovar Pomona str.
           Kennewicki LC82-25]
 gi|409942568|gb|EKN88176.1| PF03190 family protein [Leptospira interrogans str. 2002000624]
 gi|409946069|gb|EKN96083.1| PF03190 family protein [Leptospira interrogans serovar Pomona str.
           Pomona]
 gi|410573764|gb|EKQ36808.1| PF03190 family protein [Leptospira interrogans str. 2002000621]
 gi|410581098|gb|EKQ48913.1| PF03190 family protein [Leptospira interrogans str. 2002000623]
 gi|410766766|gb|EKR37449.1| PF03190 family protein [Leptospira interrogans serovar Hebdomadis
           str. R499]
 gi|455668123|gb|EMF33372.1| PF03190 family protein [Leptospira interrogans serovar Pomona str.
           Fox 32256]
          Length = 691

 Score =  432 bits (1111), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 270/689 (39%), Positives = 378/689 (54%), Gaps = 69/689 (10%)

Query: 95  SHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCH 154
           S+SRN   NRL+ E SPYL QH++NPVDWF WGEEA  +A+ +D  IFLSIGY+TCHWCH
Sbjct: 3   SNSRN--PNRLSKEKSPYLQQHSYNPVDWFPWGEEALTKAKDQDKLIFLSIGYATCHWCH 60

Query: 155 VMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDL 214
           VME ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++FL+P+ 
Sbjct: 61  VMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNMFLTPEG 120

Query: 215 KPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSN 274
           +P+ GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  + A   
Sbjct: 121 QPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGESRAKEK 180

Query: 275 KLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKKLEDTGK 330
           +  D  P+N            YDS+FGGF +    KFP  + +  +L  YHS        
Sbjct: 181 QEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-------- 232

Query: 331 SGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYL 390
           SG  +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD      +  
Sbjct: 233 SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSLFLEILA 291

Query: 391 DAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTS 450
           +   ++K +       DI+ YL RDM    G I SAEDADS   EG    +EG FY+W  
Sbjct: 292 EYSLVSKKISAESFALDIVSYLHRDMRMDEGGICSAEDADS---EG----EEGLFYIWDL 344

Query: 451 KEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 510
           +E  ++ GE + L ++ + +   GN            F+GKN+L E    S    +    
Sbjct: 345 EEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFTEEELKQ 392

Query: 511 LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
           L+K    L + + KL + RSKR RP  DDK++ SWNGL I +  +               
Sbjct: 393 LDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------------- 436

Query: 571 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLL 630
              +   R++++++AE   SFI ++L D    R+   FR G S   G+ +DYA +I+  +
Sbjct: 437 ---IAFQREDFLKLAEETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAEMIASSI 492

Query: 631 DLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSG 689
            L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +DG EPS 
Sbjct: 493 VLFEAGRGVRYLQNAVLWMEEAIRLF--RSPAGVFFDTGIDGEVLLRRSVDGYDGVEPSA 550

Query: 690 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR 749
           NS    +LVRL+ +  G  S+YYR+ AE     F   L   A++ P +  A       S 
Sbjct: 551 NSSLAHSLVRLSFL--GVNSNYYREIAESIFLYFRKELYSYALSYPFLLSA-----YWSY 603

Query: 750 KH----VVLVGHKSSVDFENMLAAAHASY 774
           KH    +VL+  K+S + ++MLA   + +
Sbjct: 604 KHHFREIVLI-RKNSEEGKDMLAWIQSRF 631


>gi|418715817|ref|ZP_13275928.1| PF03190 family protein [Leptospira interrogans str. UI 08452]
 gi|410788318|gb|EKR82040.1| PF03190 family protein [Leptospira interrogans str. UI 08452]
          Length = 691

 Score =  432 bits (1111), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 270/689 (39%), Positives = 378/689 (54%), Gaps = 69/689 (10%)

Query: 95  SHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCH 154
           S+SRN   NRL+ E SPYL QH++NPVDWF WGEEA  +A+ +D  IFLSIGY+TCHWCH
Sbjct: 3   SNSRN--PNRLSKEKSPYLQQHSYNPVDWFPWGEEALTKAKDQDKLIFLSIGYATCHWCH 60

Query: 155 VMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDL 214
           VME ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++FL+P+ 
Sbjct: 61  VMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNMFLTPEG 120

Query: 215 KPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSN 274
           +P+ GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  + A   
Sbjct: 121 QPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGESRAKEK 180

Query: 275 KLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKKLEDTGK 330
           +  D  P+N            YDS+FGGF +    KFP  + +  +L  YHS        
Sbjct: 181 QEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-------- 232

Query: 331 SGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYL 390
           SG  +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD      +  
Sbjct: 233 SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSLFLEILA 291

Query: 391 DAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTS 450
           +   ++K +       DI+ YL RDM    G I SAEDADS   EG    +EG FY+W  
Sbjct: 292 EYSLVSKKISAESFALDIVSYLHRDMRMDEGGICSAEDADS---EG----EEGLFYIWDL 344

Query: 451 KEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 510
           +E  ++ GE + L ++ + +   GN            F+GKN+L E    S    +    
Sbjct: 345 EEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFTEEELKQ 392

Query: 511 LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
           L+K    L + + KL + RSKR RP  DDK++ SWNGL I +  +               
Sbjct: 393 LDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------------- 436

Query: 571 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLL 630
              +   R++++++A+   SFI ++L D    R+   FR G S   G+ +DYA +I+  +
Sbjct: 437 ---IAFQREDFLKLAKETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAEMIASSI 492

Query: 631 DLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSG 689
            L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +DG EPS 
Sbjct: 493 VLFEAGRGVRYLQNAVLWMEEAIRLF--RSPAGVFFDTGIDGEVLLRRSVDGYDGVEPSA 550

Query: 690 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR 749
           NS    +LVRL+ +  G  SDYYR+ AE     F   L   A++ P +  A       S 
Sbjct: 551 NSSLAHSLVRLSFL--GVNSDYYREIAESIFLYFRKELYSYALSYPFLLSA-----YWSY 603

Query: 750 KH----VVLVGHKSSVDFENMLAAAHASY 774
           KH    +VL+  K+S + ++MLA   + +
Sbjct: 604 KHHFREIVLI-RKNSEEGKDMLAWIQSRF 631


>gi|386392363|ref|ZP_10077144.1| thioredoxin domain-containing protein [Desulfovibrio sp. U5L]
 gi|385733241|gb|EIG53439.1| thioredoxin domain-containing protein [Desulfovibrio sp. U5L]
          Length = 704

 Score =  432 bits (1111), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 268/681 (39%), Positives = 352/681 (51%), Gaps = 59/681 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHAHNPVDW  WGEEAFA AR  D PIFLSIGYSTCHWCHVME ESFE
Sbjct: 6   NRLITEKSPYLQQHAHNPVDWHPWGEEAFALARTEDKPIFLSIGYSTCHWCHVMEHESFE 65

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE +A L+    V++KVDREERPD+D +YMT+ QAL G GGWPL+VFL+PD +P   GTY
Sbjct: 66  DEDIAALMRATVVAVKVDREERPDLDNLYMTFCQALTGRGGWPLNVFLTPDGRPFFAGTY 125

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP E  +GR G + +L++V  AW   R  +  +    ++ + + L A  +   +  E  Q
Sbjct: 126 FPKESGFGRTGMRELLQRVHMAWTSNRQAVIGNATQILDAVRDQLEARDAGEAV--EPGQ 183

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L     +L+ ++D+  GGFG APKFP P  +  +L   ++   TG+     +   MV 
Sbjct: 184 AQLGAARNELAAAFDTANGGFGGAPKFPSPHNLLFLLREYRR---TGQ----EDNLAMVT 236

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  M +GG+ D +G G HRYS D RW VPHFEKMLYDQ   A    +A+  T D    
Sbjct: 237 ATLDAMRRGGVFDQIGLGLHRYSTDARWFVPHFEKMLYDQALTAMAATEAYLATGDAGLR 296

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL-GEHA 461
            +  +I +Y+RRD+ GP G  +SAEDADS   EG     EG FYVWT  E+  +L G+ A
Sbjct: 297 RMAMEIFEYVRRDLTGPDGAFYSAEDADS---EGV----EGRFYVWTESEIRAVLPGDEA 349

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
            LF + Y + P GN       +   +  G N+       +A A K G    +    L   
Sbjct: 350 GLFMDVYGIAPGGNFH----DEATGQATGANIPFLEEPIAAVAGKRGQEPAELAARLERS 405

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           R  L   R KR RP  DDKV+   NGL+I++ A+A++                  D +E 
Sbjct: 406 RELLLAARQKRVRPLCDDKVLTDMNGLMIAALAKAARAF----------------DDEEL 449

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
              A+ A+ F+   +    + RL H  R G +   G LDDYAFL  GLL+LY+      +
Sbjct: 450 AGRAKRASDFLLGKMLLPDS-RLLHRLRLGEAAVSGMLDDYAFLAWGLLELYQTVFDPAY 508

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           L  A+ L       F D   GG F T  +  ++LLR K  +D A PSGNSV+ + L  L 
Sbjct: 509 LAQAVALAKAMVRHFGD-AAGGLFLTPDDGEALLLRQKTYYDAAIPSGNSVAFLVLTTL- 566

Query: 702 SIVAGSKSDYYRQNAEHSLAVFETRLKDMA--------MAVPLMCCAADMLSVPSRKHVV 753
                     YR   E S     TRL   A               C    +  PS   V 
Sbjct: 567 ----------YRLTGEKSFMEEATRLARAAGPWLAGHPSGFTFFLCGLSQMLAPS-AEVT 615

Query: 754 LVGHKSSVDFENMLAAAHASY 774
           + G   + D + +  A    Y
Sbjct: 616 IAGDPDAPDTQALARALFERY 636


>gi|327357546|gb|EGE86403.1| DUF255 domain-containing protein [Ajellomyces dermatitidis ATCC
           18188]
          Length = 833

 Score =  432 bits (1110), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 261/672 (38%), Positives = 363/672 (54%), Gaps = 61/672 (9%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRL+   SPY+  H +NPV W  W  EA   A+K +  +FLSIGYS CHWCHVME ESF
Sbjct: 25  VNRLSQSKSPYVRGHMNNPVAWQMWDSEAITLAKKLNRMVFLSIGYSACHWCHVMEKESF 84

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
               VA +LN  F+ IK+DREERPD+D+VYM YVQA  G GGWPL+VFL+PDL+P+ GGT
Sbjct: 85  MSPEVAAILNKSFIPIKLDREERPDIDEVYMNYVQATTGSGGWPLNVFLTPDLEPVFGGT 144

Query: 222 YFPPEDKYGRPG--------FKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE-ALSASAS 272
           Y+P       P         F  IL K++D W  ++    +S     +QL E A   + S
Sbjct: 145 YWPGPHSSTLPALGGEGHVTFIDILEKLRDVWQTQQLRCRESAKDITKQLREFAEEGTHS 204

Query: 273 SNKLPDELPQNALRLCA---EQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK---KLE 326
             K  D      + L     +  +  +D   GGF  APKF  P  +  ++  S+    + 
Sbjct: 205 KQKAADADEDLEVELLEESYQHFASRFDPVNGGFSRAPKFATPANLSFLINLSRYPSAVS 264

Query: 327 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 386
           D     E S   +M   TL  M++GGIHD +G GF RYSV   W +PHFEKMLYDQ QL 
Sbjct: 265 DIVGYDECSRALEMATKTLISMSRGGIHDQIGHGFARYSVTADWSLPHFEKMLYDQAQLL 324

Query: 387 NVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
           NVY+DAF    +        DI  Y+    ++ P G  +S+EDADS  T   T K+EGAF
Sbjct: 325 NVYVDAFDSAHNPELLGAIYDIATYITSPPILSPTGGFYSSEDADSLPTPSDTDKREGAF 384

Query: 446 YVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 504
           YVWT KE + ILG+  A +   H+ + P GN  ++R +DPH+EF  +NVL      +  A
Sbjct: 385 YVWTHKEFKQILGQRDADVCARHWGVLPDGN--VARGNDPHDEFINQNVLSIKVTPAKLA 442

Query: 505 SKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 563
            + G+  E+ + I+   R KL + R SKR RP LDDK+IVSWNGL I + A+ S +L++ 
Sbjct: 443 KEFGLSEEEVVKIIKASREKLREYRESKRVRPGLDDKIIVSWNGLAIGALAKCSVVLEN- 501

Query: 564 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP-SKAPGFLDDY 622
                    V  +  +E+   AE+AA FIR++L+D  + +L   +R+G     PGF DDY
Sbjct: 502 ---------VDRAKAQEFRLAAENAAKFIRQNLFDPASGQLWRIYRDGERGDTPGFADDY 552

Query: 623 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR---------------------EG 661
           ++L SGL+DLYE      +L +A +LQ   +  FL +                       
Sbjct: 553 SYLASGLIDLYEATFDDGYLQFAEQLQQYLNTYFLAQGPTPTPSPRTSITTESTPAPSSS 612

Query: 662 GGYFNT------TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQN 715
            GY+ T          P+ L R+K   D + PS N V   NL+RL++++   + D Y++ 
Sbjct: 613 TGYYTTPSTIHQASAHPAPLFRLKTGTDASTPSPNGVIAQNLLRLSTLL---EDDTYKRL 669

Query: 716 AEHSLAVFETRL 727
           A  ++  F   +
Sbjct: 670 ARETVNAFAVEI 681


>gi|448318308|ref|ZP_21507834.1| hypothetical protein C492_17600 [Natronococcus jeotgali DSM 18795]
 gi|445599332|gb|ELY53367.1| hypothetical protein C492_17600 [Natronococcus jeotgali DSM 18795]
          Length = 721

 Score =  432 bits (1110), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 258/661 (39%), Positives = 356/661 (53%), Gaps = 57/661 (8%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           +R    NRL  E SPYL QHA NPV+W  W E A   AR++D PIFLSIGYS CHWCHVM
Sbjct: 2   TRPTERNRLDEEESPYLRQHADNPVNWQPWDERALEAAREQDKPIFLSIGYSACHWCHVM 61

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
             ESF DE VA+LLN+ FV IKVDREERPDVD +YMT  Q + GGGGWPLSV+L+P+ KP
Sbjct: 62  ADESFADEEVAELLNEEFVPIKVDREERPDVDSIYMTVCQLVSGGGGWPLSVWLTPEGKP 121

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASS--N 274
              GTYFP   K G+PGF  +L  + D+W+  R+         IE  +E  +A+A     
Sbjct: 122 FYVGTYFPKRSKRGQPGFLDLLEGLADSWETDRE--------EIENRAEEWTAAARDRLE 173

Query: 275 KLPDEL------PQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLED 327
           + PD +          L   A+   +S D + GGFGS  PKFP+P  ++++   ++  + 
Sbjct: 174 ETPDSIGAAEPPSSEVLERAADAALRSADRQNGGFGSGGPKFPQPARLRVL---ARAFDR 230

Query: 328 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 387
           TG      E ++++  +L  M +GG++DHVGGGFHRY VD  W VPHFEKMLYD  ++  
Sbjct: 231 TGN----DEYREVLEGSLTAMIEGGLYDHVGGGFHRYCVDADWTVPHFEKMLYDNAEIPR 286

Query: 388 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 447
             L  + LT D  Y+   R+ L+++ R++    G  FS  DA S + E   R +EGAFYV
Sbjct: 287 ALLAGYRLTGDERYADYVRETLEFVSRELTHAEGGFFSTLDAQSEDPETGER-EEGAFYV 345

Query: 448 WTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 505
           WT  EV D+LG    A LF   Y +  +GN            F+G++        S  A 
Sbjct: 346 WTPAEVRDVLGSETDADLFCARYDITESGN------------FEGQSQPNLAASISELAD 393

Query: 506 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 565
           +  +   +    L   RR+LF+ R +RPRP+ D+KV+  WNGL+I++ A A+  L     
Sbjct: 394 RFDLEEREVEERLESARRELFEAREERPRPNRDEKVLAGWNGLMIATCAEAALAL----- 448

Query: 566 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 625
                    G DR  Y  +A  A  F+R  L++    RL   F++G     G+L+DYAFL
Sbjct: 449 ---------GEDR--YAGMAVDALEFVRDRLWNADEGRLSRRFKDGDVAVQGYLEDYAFL 497

Query: 626 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 685
             G L  YE       L +A+EL    +  F D E G  + T     S++ R +E +D +
Sbjct: 498 ARGALGCYEATGEVDHLAFALELARAIEAEFYDAERGTLYFTPESGESLVTRPQELNDQS 557

Query: 686 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 745
            PS   V+V  L+ L  +    + D + + A   L     RL+  A+    +C AAD L 
Sbjct: 558 TPSATGVAVETLLALGDVAG--EDDGFEEIATSVLRTHAGRLESNALEHATLCLAADRLE 615

Query: 746 V 746
            
Sbjct: 616 A 616


>gi|108757716|ref|YP_634091.1| hypothetical protein MXAN_5954 [Myxococcus xanthus DK 1622]
 gi|108461596|gb|ABF86781.1| conserved hypothetical protein [Myxococcus xanthus DK 1622]
          Length = 696

 Score =  432 bits (1110), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 250/606 (41%), Positives = 338/606 (55%), Gaps = 48/606 (7%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           +NRLA E SPYL QHAHNPVDWF WGEEA A+A+  + PI LS+GYS CHWCHVM  ESF
Sbjct: 11  SNRLAREPSPYLRQHAHNPVDWFPWGEEALAKAKAENKPILLSVGYSACHWCHVMAHESF 70

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           E    A+L+N+ F++IKVDREERPD+D++Y   VQ +  GGGWPL+VFL+PDLKP  GGT
Sbjct: 71  ESPETARLMNEGFINIKVDREERPDLDQIYQGVVQLMGQGGGWPLTVFLTPDLKPFYGGT 130

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLA-QSGAFAIEQLSEALSASASSNKLPDEL 280
           YFPP+D+YGRPGF  +L  ++DAW+ K+D +  QSG F  E L E   A+      P  L
Sbjct: 131 YFPPQDRYGRPGFPRLLMALRDAWENKQDEVQRQSGQFE-EGLGEL--ATYGLEAAPAVL 187

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
               +    ++++K  D+  GGFG APKFP P+   +ML   ++       G  +  +  
Sbjct: 188 TAADVVGMGQRMAKQVDAVHGGFGGAPKFPNPMNFALMLRAWRR-------GGGAPLKDA 240

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
           V  TL+ MA GGI+D +GGGFHRYSVDERW VPHFEKMLYD  QL ++Y  A  +     
Sbjct: 241 VFLTLERMALGGIYDQLGGGFHRYSVDERWLVPHFEKMLYDNAQLLHLYAQAQQVEPRQL 300

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE- 459
           +  +  + + Y+RR+M   GG  ++A+DADS   EG    +EG F+VW  +EV   L E 
Sbjct: 301 WRKVVEETVAYVRREMTDAGGGFYAAQDADS---EG----EEGKFFVWRPEEVRAALPEA 353

Query: 460 HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
            A L   H+ +KP GN +            G  VL  +   S  A + G+  +     L 
Sbjct: 354 QAELVLRHFGIKPGGNFE-----------HGATVLEVVVPVSELARERGVSEDAMERELA 402

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
             ++ LFD R +R +P  DDK++  WNGL+I   A AS++                  R 
Sbjct: 403 AAKQTLFDARERRVKPGRDDKLLSGWNGLMIRGLALASRVF----------------GRP 446

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
           E+ + A  AA F+    +D    RL  S++ G ++  GFL+DY  L SGL  LY+     
Sbjct: 447 EWAKWAADAADFVLEKAWD--GTRLARSYQEGQARIDGFLEDYGDLASGLTALYQATFDV 504

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
           K+L  A  L     +LF D E   Y         +++      D A PSG S      V 
Sbjct: 505 KYLEAADALVRRAVDLFWDAEKAAYLTAPRGQRDLVVATYGLFDNAFPSGASTLTEAQVE 564

Query: 700 LASIVA 705
           LA++  
Sbjct: 565 LAALTG 570


>gi|456984461|gb|EMG20516.1| PF03190 family protein [Leptospira interrogans serovar Copenhageni
           str. LT2050]
          Length = 699

 Score =  431 bits (1109), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 267/691 (38%), Positives = 379/691 (54%), Gaps = 67/691 (9%)

Query: 93  STSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHW 152
           ++  S +++ NRL+ E SPYL QH++NPVDWF WGEEA  +A+ +D  IFLSIGY+TCHW
Sbjct: 7   NSMESNSRNPNRLSKEKSPYLQQHSYNPVDWFPWGEEALTKAKDQDKLIFLSIGYATCHW 66

Query: 153 CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 212
           CHVME ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++FL+P
Sbjct: 67  CHVMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNMFLTP 126

Query: 213 DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASAS 272
           + +P+ GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  + A 
Sbjct: 127 EGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGESRAK 186

Query: 273 SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKKLEDT 328
             +  D  P+N            YDS+FGGF +    KFP  + +  +L  YHS      
Sbjct: 187 EKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS------ 240

Query: 329 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 388
             SG  +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD      +
Sbjct: 241 --SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSLFLEI 297

Query: 389 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 448
             +   ++K +       DI+ YL RDM    G I SAEDADS   EG    +EG FY+W
Sbjct: 298 LAEYSLVSKKISAESFALDIVSYLHRDMRMDEGGICSAEDADS---EG----EEGLFYIW 350

Query: 449 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 508
             +E  ++ GE + L ++ + +   GN            F+GKN+L E    S    +  
Sbjct: 351 DLEEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFTEEEL 398

Query: 509 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 568
             L+K    L + + KL + RSKR RP  DDK++ SWNGL I +  +             
Sbjct: 399 KQLDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG----------- 444

Query: 569 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 628
                +   R++++++AE   SFI ++L D    R+   FR G S   G+ +DYA +I+ 
Sbjct: 445 -----IAFQREDFLKLAEETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAEMIAS 498

Query: 629 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEP 687
            + L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +DG EP
Sbjct: 499 SIVLFEAGRGVRYLQNAVLWMEEAIRLF--RSPVGVFFDTGIDGEVLLRRSVDGYDGVEP 556

Query: 688 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 747
           S NS    +LVRL+ +  G  S+YYR+ AE     F   L   A++ P +  A       
Sbjct: 557 SANSSLAHSLVRLSFL--GVNSNYYREIAESIFLYFRKELYSYALSYPFLLSA-----YW 609

Query: 748 SRKH----VVLVGHKSSVDFENMLAAAHASY 774
           S KH    +VL+  K+S + ++MLA   + +
Sbjct: 610 SYKHHFREIVLI-RKNSEEGKDMLAWIQSRF 639


>gi|45658527|ref|YP_002613.1| hypothetical protein LIC12692 [Leptospira interrogans serovar
           Copenhageni str. Fiocruz L1-130]
 gi|45601770|gb|AAS71250.1| conserved hypothetical protein [Leptospira interrogans serovar
           Copenhageni str. Fiocruz L1-130]
          Length = 716

 Score =  431 bits (1109), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 267/691 (38%), Positives = 379/691 (54%), Gaps = 67/691 (9%)

Query: 93  STSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHW 152
           ++  S +++ NRL+ E SPYL QH++NPVDWF WGEEA  +A+ +D  IFLSIGY+TCHW
Sbjct: 24  NSMESNSRNPNRLSKEKSPYLQQHSYNPVDWFPWGEEALTKAKDQDKLIFLSIGYATCHW 83

Query: 153 CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 212
           CHVME ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++FL+P
Sbjct: 84  CHVMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNMFLTP 143

Query: 213 DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASAS 272
           + +P+ GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  + A 
Sbjct: 144 EGQPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGESRAK 203

Query: 273 SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKKLEDT 328
             +  D  P+N            YDS+FGGF +    KFP  + +  +L  YHS      
Sbjct: 204 EKQEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS------ 257

Query: 329 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 388
             SG  +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD      +
Sbjct: 258 --SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSLFLEI 314

Query: 389 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 448
             +   ++K +       DI+ YL RDM    G I SAEDADS   EG    +EG FY+W
Sbjct: 315 LAEYSLVSKKISAESFALDIVSYLHRDMRMDEGGICSAEDADS---EG----EEGLFYIW 367

Query: 449 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 508
             +E  ++ GE + L ++ + +   GN            F+GKN+L E    S    +  
Sbjct: 368 DLEEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFTEEEL 415

Query: 509 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 568
             L+K    L + + KL + RSKR RP  DDK++ SWNGL I +  +             
Sbjct: 416 KQLDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG----------- 461

Query: 569 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 628
                +   R++++++AE   SFI ++L D    R+   FR G S   G+ +DYA +I+ 
Sbjct: 462 -----IAFQREDFLKLAEETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAEMIAS 515

Query: 629 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEP 687
            + L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +DG EP
Sbjct: 516 SIVLFEAGRGVRYLQNAVLWMEEAIRLF--RSPVGVFFDTGIDGEVLLRRSVDGYDGVEP 573

Query: 688 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 747
           S NS    +LVRL+ +  G  S+YYR+ AE     F   L   A++ P +  A       
Sbjct: 574 SANSSLAHSLVRLSFL--GVNSNYYREIAESIFLYFRKELYSYALSYPFLLSA-----YW 626

Query: 748 SRKH----VVLVGHKSSVDFENMLAAAHASY 774
           S KH    +VL+  K+S + ++MLA   + +
Sbjct: 627 SYKHHFREIVLI-RKNSEEGKDMLAWIQSRF 656


>gi|421085457|ref|ZP_15546310.1| PF03190 family protein [Leptospira santarosai str. HAI1594]
 gi|421103567|ref|ZP_15564164.1| PF03190 family protein [Leptospira interrogans serovar
           Icterohaemorrhagiae str. Verdun LP]
 gi|410366530|gb|EKP21921.1| PF03190 family protein [Leptospira interrogans serovar
           Icterohaemorrhagiae str. Verdun LP]
 gi|410432093|gb|EKP76451.1| PF03190 family protein [Leptospira santarosai str. HAI1594]
          Length = 691

 Score =  431 bits (1108), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 270/689 (39%), Positives = 378/689 (54%), Gaps = 69/689 (10%)

Query: 95  SHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCH 154
           S+SRN   NRL+ E SPYL QH++NPVDWF WGEEA  +A+ +D  IFLSIGY+TCHWCH
Sbjct: 3   SNSRN--PNRLSKEKSPYLQQHSYNPVDWFPWGEEALTKAKDQDKLIFLSIGYATCHWCH 60

Query: 155 VMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDL 214
           VME ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++FL+P+ 
Sbjct: 61  VMEKESFENQSIADYLNFHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNMFLTPEG 120

Query: 215 KPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSN 274
           +P+ GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  + A   
Sbjct: 121 QPITGGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGESRAKEK 180

Query: 275 KLPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKKLEDTGK 330
           +  D  P+N            YDS+FGGF +    KFP  + +  +L  YHS        
Sbjct: 181 QEADFPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS-------- 232

Query: 331 SGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYL 390
           SG  +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD      +  
Sbjct: 233 SGNPN-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSLFLEILA 291

Query: 391 DAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTS 450
           +   ++K +       DI+ YL RDM    G I SAEDADS   EG    +EG FY+W  
Sbjct: 292 EYSLVSKKISAESFALDIVSYLHRDMRMDEGGICSAEDADS---EG----EEGLFYIWDL 344

Query: 451 KEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 510
           +E  ++ GE + L ++ + +   GN            F+GKN+L E    S    +    
Sbjct: 345 EEFREVCGEDSFLLEKFWNVTKEGN------------FEGKNILHENFRGSNFTEEELKQ 392

Query: 511 LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
           L+K    L + + KL + RSKR RP  DDK++ SWNGL I +  +               
Sbjct: 393 LDK---ALAKGKVKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG------------- 436

Query: 571 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLL 630
              +   R++++++AE   SFI ++L D    R+   FR G S   G+ +DYA +I+  +
Sbjct: 437 ---IAFQREDFLKLAEETYSFIEKNLID-SNGRILRRFREGESGILGYSNDYAEMIASSI 492

Query: 631 DLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSG 689
            L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +DG EPS 
Sbjct: 493 VLFEAGRGVRYLQNAVLWMEEAIRLF--RSPVGVFFDTGIDGEVLLRRSVDGYDGVEPSA 550

Query: 690 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR 749
           NS    +LVRL+ +  G  S+YYR+ AE     F   L   A++ P +  A       S 
Sbjct: 551 NSSLAHSLVRLSFL--GVNSNYYREIAESIFLYFRKELYSYALSYPFLLSA-----YWSY 603

Query: 750 KH----VVLVGHKSSVDFENMLAAAHASY 774
           KH    +VL+  K+S + ++MLA   + +
Sbjct: 604 KHHFREIVLI-RKNSEEGKDMLAWIQSRF 631


>gi|379010883|ref|YP_005268695.1| thymidylate kinase YyaL [Acetobacterium woodii DSM 1030]
 gi|375301672|gb|AFA47806.1| thymidylate kinase YyaL [Acetobacterium woodii DSM 1030]
          Length = 686

 Score =  431 bits (1108), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 259/666 (38%), Positives = 362/666 (54%), Gaps = 62/666 (9%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           ++ K +NRL  E SPYLLQHA+NPV+W+ W +EAF  A+++D PIFLSIGYSTCHWCHVM
Sbjct: 5   NKQKKSNRLVHEMSPYLLQHAYNPVNWYPWSDEAFNLAKRQDKPIFLSIGYSTCHWCHVM 64

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
           E ESFED  VA+ LN +F+SIKVDREERPD+D++YMT+ Q   G GGWPL+VFL+ + KP
Sbjct: 65  EKESFEDAEVAEYLNKYFISIKVDREERPDIDQIYMTFSQVSTGQGGWPLNVFLTAERKP 124

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKL 276
               TY P   +YG PG   +L  ++  W +  + +  S A  +  L   L      NKL
Sbjct: 125 FYVTTYLPKRSRYGHPGLMDVLVGIEGQWRQNNEEIIYS-ADKMTSLLNDLEIRKDENKL 183

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
              +  +A     E    S+D R+GGFG APKFP P       +H   L    ++    +
Sbjct: 184 KRTIFFDAYDFFDE----SFDDRYGGFGKAPKFPTP-------HHLFYLLRCYQAFNQPD 232

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
              MV  TL+ M +GG+ DH+G GF RYS DE+W VPHFEKMLYD   L  +Y + + +T
Sbjct: 233 ALVMVEKTLKQMYQGGLFDHIGFGFSRYSTDEQWLVPHFEKMLYDNALLVMIYAETYQVT 292

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
            +  Y  I +  + Y+ RD+    G  F AEDADS   EG    +EG FYVW+ ++VE I
Sbjct: 293 GNPLYKKIAQKTITYVNRDLRSEEGGFFCAEDADS---EG----EEGRFYVWSMEKVEKI 345

Query: 457 LG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGMPLEK 513
           LG + A +F + Y +   GN            F GKN+  +I ++     A+     LEK
Sbjct: 346 LGKKRAAVFFKFYPMTAKGN------------FDGKNIPNMIPVDLDLIEANP---ELEK 390

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
              +L E +  LF+ R KR  PH DDK++ +WNGL+I++ A A +I              
Sbjct: 391 ---VLDEMKADLFNQREKRIHPHKDDKILTAWNGLMITALAMAGRIF------------- 434

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
              D+ EY+  AE   +FI   +   +  RL   +R G +K   +LDDYA +I G L+LY
Sbjct: 435 ---DQPEYLIQAEETMAFIENKM-TRRNGRLYARYRLGEAKILAYLDDYASVIWGYLELY 490

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREG-GGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 692
           +    T++L  AI        +F D  G  G+F    +   ++ R KE +D A+PSGN++
Sbjct: 491 QATFKTEYLEKAILRAVDMINIFGDDFGMSGFFQYGNDAEKLIARPKEIYDNAQPSGNAL 550

Query: 693 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHV 752
           +   L++L  I    K   Y        A F   L    MA  +M CA      P+ + V
Sbjct: 551 AACCLLKLGKITGEQK---YIDIVNGMFAYFAGNLNQAPMASTMMLCAKLFHEQPTTE-V 606

Query: 753 VLVGHK 758
           V  G++
Sbjct: 607 VFAGYE 612


>gi|115372663|ref|ZP_01459970.1| thymidylate kinase [Stigmatella aurantiaca DW4/3-1]
 gi|310823874|ref|YP_003956232.1| hypothetical protein STAUR_6648 [Stigmatella aurantiaca DW4/3-1]
 gi|115370384|gb|EAU69312.1| thymidylate kinase [Stigmatella aurantiaca DW4/3-1]
 gi|309396946|gb|ADO74405.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1]
          Length = 694

 Score =  431 bits (1108), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 248/650 (38%), Positives = 344/650 (52%), Gaps = 49/650 (7%)

Query: 96  HSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHV 155
            + ++  NRLA EHSPYL QHA NPVDW+ WGEEA   AR  D PI LS+GYS CHWCHV
Sbjct: 5   QTPSRSGNRLAREHSPYLRQHASNPVDWYPWGEEALERARAEDKPILLSVGYSACHWCHV 64

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           M  ESFED  +A ++N  F++IKVDREERPD+D++Y   VQ +  GGGWPL+VFL+PDL+
Sbjct: 65  MAHESFEDPAIASVMNAHFINIKVDREERPDLDQIYQGVVQLMGQGGGWPLTVFLTPDLR 124

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           P  GGTYFPP+DKYGRPGF  +L  + DAW  +R+ +    A   E L E   A+     
Sbjct: 125 PFYGGTYFPPQDKYGRPGFPKVLESLHDAWMNQREKVLGQAADFREGLGEL--ATYGLEA 182

Query: 276 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
            P  L    +    E++ +  D   GGFG APKFP P+ +  +L   ++       G   
Sbjct: 183 APAALSVEDVLKMGERMLRHVDPVNGGFGGAPKFPNPMNVSFLLRAWRR-------GGPE 235

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
             +   L TL+ MA GG++D +GGGFHRY+VD+RW VPHFEKMLYD  QL ++Y +   +
Sbjct: 236 PLKDAALRTLERMALGGVYDQLGGGFHRYAVDDRWRVPHFEKMLYDNAQLLHLYAEGEQV 295

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
                +  +  +  +Y+RR+M    G  ++A+DADS   EG    +EG F+VWT  +V  
Sbjct: 296 ESRPLWRKVVEETAEYVRREMTDARGGFYAAQDADS---EG----EEGRFFVWTPAQVCS 348

Query: 456 ILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 514
           +L  EHA L   H+ + P GN +           +G  VL      +  A + G+  E  
Sbjct: 349 VLTPEHANLLLRHFRITPQGNFE-----------QGATVLEVAVPVAQIAHERGLSQEAL 397

Query: 515 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 574
              L   R  LF +R +R +P  DDK++  WNGL+I   A AS++               
Sbjct: 398 ERTLTAAREALFGIREQRVKPGRDDKILSGWNGLMIRGLAFASRVF-------------- 443

Query: 575 GSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE 634
              R E+ ++A  +A F+  H++D    RL  S+  G  +  GFL+DY     GL  LY+
Sbjct: 444 --GRPEWAQLAAGSADFVLTHMWD--GTRLSRSYEEGGGRIDGFLEDYGDFAVGLTALYQ 499

Query: 635 FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
                K+L  A  L      LF D E   Y +       +++      D A PSG S   
Sbjct: 500 ATFEAKYLEAASALVKRAVALFWDEEKQAYLSAPKGQKDLVVATYSLFDNAFPSGASTLT 559

Query: 695 INLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 744
              V LA++  G KS  + +  E  L+     L+D  +    +  AAD  
Sbjct: 560 EAQVALAALT-GDKS--HLELPERYLSRMRKALEDNPLGYGHLALAADTF 606


>gi|283778697|ref|YP_003369452.1| hypothetical protein Psta_0907 [Pirellula staleyi DSM 6068]
 gi|283437150|gb|ADB15592.1| protein of unknown function DUF255 [Pirellula staleyi DSM 6068]
          Length = 667

 Score =  431 bits (1107), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 271/677 (40%), Positives = 365/677 (53%), Gaps = 78/677 (11%)

Query: 85  AMAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLS 144
           AMAE  PA    ++   TNRLA E SPYLL HAHNPVDW+ WG EA   A+K + PIFLS
Sbjct: 22  AMAEE-PAPKQPTK---TNRLAQETSPYLLLHAHNPVDWYPWGNEALERAKKENKPIFLS 77

Query: 145 IGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYM----TYVQALYG 200
           +GYS+CHWCHVME ESF D  +AKLLN+ F+ IKVDREERPD+D +YM    TY+Q   G
Sbjct: 78  VGYSSCHWCHVMERESFLDPEIAKLLNENFICIKVDREERPDIDTIYMTAVQTYLQLTTG 137

Query: 201 --GGGWPLSVFLSPDLKPLMGGTYFPPE--DKYGRPGFKTILRKVKDAWDKKRDMLAQSG 256
             GGGWP++VFL+P+  P  GGTYFP    D+ G  GF T+  KV + W K+   L    
Sbjct: 138 RRGGGWPMTVFLTPEGNPFFGGTYFPARDGDREGMTGFLTLSSKVSEMWKKEPVKLGDDA 197

Query: 257 A----FAIEQLS--EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFG------ 304
                F  +QL   + L A     KL   + +         L+  +D R+GGFG      
Sbjct: 198 TTLARFIKDQLEGPKLLLAVVLDTKLTTSVEKG--------LAAQFDERYGGFGFDEIEW 249

Query: 305 SAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRY 364
             PKFP P  +Q +L   KK         ASE + M++ TL  MA GGI+DHVGGGFHRY
Sbjct: 250 QRPKFPEPSNLQFLLEIVKKT-------PASESRAMLVHTLDRMAMGGIYDHVGGGFHRY 302

Query: 365 SVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIF 424
           SVD  W +PHFEKMLYD GQL  VY +A++LT D  Y  I R+  +++ R+M    G  +
Sbjct: 303 SVDRMWRIPHFEKMLYDNGQLLTVYSEAYALTGDENYQRIARETAEFMLREMRDTSGGFY 362

Query: 425 SAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDP 484
           +A D   AETEG     EG FY W   EVE +L +         Y        LSR  + 
Sbjct: 363 AALD---AETEGV----EGKFYRWDKAEVEKLLTKEEFELYSAVY-------GLSRAPNF 408

Query: 485 HNEFKGKNVLIELNDSSASASKL-GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIV 543
              F     +I+L D+    +K   + +EK +N L     KL   R+ R RP  D K++ 
Sbjct: 409 EETF----YVIQLRDTLVDIAKTREITVEKLVNDLRPIHAKLLAARNARKRPLTDTKILA 464

Query: 544 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHR 603
             NGL I+  A A K+LK                   Y E A +AA+ +   +   +  R
Sbjct: 465 GENGLAITGLATAGKLLKE----------------PRYTEAAATAATLVLSKMTAPE-GR 507

Query: 604 LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG 663
           L  ++    +K   +L DY+ L+ GLL L+E     +WL  AI+L + Q ELF D   GG
Sbjct: 508 LFRTYSGEKAKLNAYLSDYSMLVEGLLALHEATGEQRWLDEAIKLTDQQVELFHDVPRGG 567

Query: 664 YFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 723
           ++ T+ +  S+L RVKE  D A P+GNSV+ +NLV+L  I   ++   Y + AE ++   
Sbjct: 568 FYFTSKDHESLLARVKETVDSAMPAGNSVAAVNLVKLVKITGKNE---YLKLAEGAIQSA 624

Query: 724 ETRLKDMAMAVPLMCCA 740
             ++++     P +  A
Sbjct: 625 AGQMQENPTVSPRLATA 641


>gi|345560346|gb|EGX43471.1| hypothetical protein AOL_s00215g207 [Arthrobotrys oligospora ATCC
           24927]
          Length = 758

 Score =  431 bits (1107), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 268/697 (38%), Positives = 376/697 (53%), Gaps = 43/697 (6%)

Query: 86  MAERTP---ASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIF 142
           MA   P     +  S+ K  N+LA   SPY+  HA+N   W  W  E+ A A+  +  IF
Sbjct: 1   MATSIPLQSGDSGKSKLKLVNQLANSTSPYVRSHANNLTAWQQWTPESLALAKSENRLIF 60

Query: 143 LSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGG 202
           LS GY+ CHWCHVME ESF+D  VAK+LND F+ IK+DREERPD+D++YM YVQA  G G
Sbjct: 61  LSSGYAACHWCHVMERESFQDAYVAKILNDNFIPIKIDREERPDIDRIYMNYVQATTGSG 120

Query: 203 GWPLSVFLSPDLKPLMGGTYFPPEDKYGRP------GFKTILRKVKDAWDKKRDMLAQSG 256
           GWPL+VFL+P+L+P+ GGTY+P  +    P      GF  +L K+   W +++D    S 
Sbjct: 121 GWPLNVFLTPNLEPVFGGTYWPGPNATDGPSMKDQIGFVEVLDKIVKVWKEQQDKCLASA 180

Query: 257 AFAIEQL----SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRP 312
              ++QL     E L     +    + L  + L    +     YD+  GGFG+ PKFP P
Sbjct: 181 KDILKQLKEFSDEGLKEQGGNQDGAEILEIDLLEEAYQHFLSRYDTTHGGFGTEPKFPTP 240

Query: 313 VEIQMMLYHSKKLEDTGKSGEASEGQK---MVLFTLQCMAKGGIHDHVGGGFHRYSVDER 369
             +  +L  S             E ++   M + TL+ M++GGIHDH+G GF RYSV   
Sbjct: 241 TNLAFLLRLSSLSSVVEDVVGDVECERAKFMAVTTLRHMSRGGIHDHIGNGFERYSVTAD 300

Query: 370 WHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGP----GGEIFS 425
           W +PHFEKMLYD  QL +VYLDA+ LTKD        D  DYL     GP     G  +S
Sbjct: 301 WSLPHFEKMLYDNAQLISVYLDAYLLTKDREMLDAALDAADYL---CSGPLSHKDGGFYS 357

Query: 426 AEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDP 484
           AEDADS   +G T K+EGAFYVW  KE   +LGE  A +  +++ ++  GN D +R  D 
Sbjct: 358 AEDADSYARKGDTEKREGAFYVWDKKEFIKVLGEQDAEVCSKYWGVRTDGNVDPAR--DI 415

Query: 485 HNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPH-LDDKVIV 543
           H+EF  +NVL      +   S LG+     +  +   R KL + R +      LDDK++ 
Sbjct: 416 HDEFLHQNVLQISQTPAQIGSMLGLSETAIVEKIKNGRAKLREYRERERPRPILDDKILT 475

Query: 544 SWNGLVISSFARASKILK-SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTH 602
            WNGL I++ +R +  L+  +AE + F           Y+  A  AA FIR++++D++T 
Sbjct: 476 GWNGLAIAALSRLAAALEIVDAEKSKF-----------YLNQAIRAAEFIRKNVFDQRTL 524

Query: 603 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG 662
            L+  +R  P     F DDYA+LI GL+ LYE      WL WA  LQ  Q +LF D   G
Sbjct: 525 GLKRVWRETPGATKAFADDYAYLIYGLISLYEATFDAGWLRWAHSLQAAQTKLFWDEAQG 584

Query: 663 GYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 722
           G+F+T  + P ++LR+K+  D AEPS N +S  NL +L S++  +   +    A  +   
Sbjct: 585 GFFSTERDAPDLILRLKDGLDSAEPSTNGISAANLYKLGSLLGDASFSFL---ASKTCNA 641

Query: 723 FETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKS 759
           F T L         M  +   L++ +   V++ G KS
Sbjct: 642 FSTELMQHPFLFSTMLPSVVALNLGTGT-VIIAGKKS 677


>gi|418686893|ref|ZP_13248057.1| PF03190 family protein [Leptospira kirschneri serovar Grippotyphosa
           str. Moskva]
 gi|410738600|gb|EKQ83334.1| PF03190 family protein [Leptospira kirschneri serovar Grippotyphosa
           str. Moskva]
          Length = 713

 Score =  430 bits (1106), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 261/682 (38%), Positives = 375/682 (54%), Gaps = 61/682 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           +++ NRL+ E SPYL QH++NPVDWF WGEEA  +A+ +D  IFLS+GY+TCHWCHVME 
Sbjct: 30  SRNPNRLSKEKSPYLQQHSYNPVDWFPWGEEALTKAKDQDKLIFLSVGYATCHWCHVMEK 89

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++FL+P+ +P+ 
Sbjct: 90  ESFENQSIADYLNSHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNMFLTPEGQPIT 149

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
           GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  + A   +  D
Sbjct: 150 GGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGESRAKEKQEAD 209

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKKLEDTGKSGEA 334
             P+N            YDS+FGGF +    KFP  + +  +L  YHS        SG  
Sbjct: 210 FPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS--------SGNP 261

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
           +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD      +  +   
Sbjct: 262 N-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSLFLEILAEYSL 320

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
           ++K +       DI+ YL RDM   GG I SAEDADS   EG    +EG FY+W  +E  
Sbjct: 321 VSKKISAKSFALDIVSYLHRDMRMDGGGICSAEDADS---EG----EEGLFYIWDLEEFR 373

Query: 455 DILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 514
           ++ G+ + L ++ + +   GN            F+GKN+L E    +   S       K+
Sbjct: 374 EVCGDDSSLLEKFWNVTKEGN------------FEGKNILHE----NFRGSNFTEEESKH 417

Query: 515 LN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
           L+ +L   + KL + RSKR RP  DDK++ SWNGL I +  +                  
Sbjct: 418 LDGVLTRGKAKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG---------------- 461

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
           +   R++++++AE   SFI ++L D +  R+   FR G S   G+ +DYA +I+  + L+
Sbjct: 462 IAFQREDFLKLAEETYSFIEKNLIDSKG-RILRRFREGESGILGYSNDYAEMIASSIVLF 520

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSGNSV 692
           E G G ++L  A+        LF  R   G F  TG D  VLLR   D +DG EPS NS 
Sbjct: 521 EAGRGVRYLQNAVLWMEETIRLF--RSTAGVFFDTGIDGEVLLRRSVDGYDGVEPSANSS 578

Query: 693 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHV 752
              +LV+L+ +  G  SD YR+ AE     F   L   A++ P +  A       SR+ V
Sbjct: 579 LAHSLVKLSFL--GVNSDRYREVAESIFLYFRKELYSYALSYPFLLSAYWSYKYHSREIV 636

Query: 753 VLVGHKSSVDFENMLAAAHASY 774
           ++   K+S    ++LA   + +
Sbjct: 637 LI--RKNSEAGRDLLAWIQSRF 656


>gi|418695562|ref|ZP_13256581.1| PF03190 family protein [Leptospira kirschneri str. H1]
 gi|409956647|gb|EKO15569.1| PF03190 family protein [Leptospira kirschneri str. H1]
          Length = 711

 Score =  430 bits (1106), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 262/682 (38%), Positives = 374/682 (54%), Gaps = 61/682 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           +++ NRL+ E SPYL QH++NPVDWF WGEEA   A+ +D  IFLS+GY+TCHWCHVME 
Sbjct: 28  SRNPNRLSKEKSPYLQQHSYNPVDWFPWGEEALTRAKDQDKLIFLSVGYATCHWCHVMEK 87

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++FL+P+ +P+ 
Sbjct: 88  ESFENQSIADYLNSHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNLFLTPEGQPIT 147

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
           GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  + A   +  D
Sbjct: 148 GGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGESRAKEKQEAD 207

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKKLEDTGKSGEA 334
             P+N            YDS+FGGF +    KFP  + +  +L  YHS        SG  
Sbjct: 208 FPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS--------SGNP 259

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
           +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD      +  +   
Sbjct: 260 N-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSLFLEILAEYSL 318

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
           ++K +       DI+ YL RDM   GG I SAEDADS   EG    +EG FY+W  +E  
Sbjct: 319 VSKKISAKSFALDIVSYLHRDMRMDGGGICSAEDADS---EG----EEGLFYIWDLEEFR 371

Query: 455 DILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 514
           ++ GE + L ++ + +   GN            F+GKN+L E    +   S       K+
Sbjct: 372 EVCGEDSSLLEKFWNVTKEGN------------FEGKNILHE----NFRGSNFTEEESKH 415

Query: 515 LN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
           L+  L   + KL + RSKR RP  DDK++ SWNGL I +  +                  
Sbjct: 416 LDGALTRGKAKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG---------------- 459

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
           +   R++++++AE   SFI ++L D +  R+   FR G S+  G+ +DYA +I+  + L+
Sbjct: 460 IAFQREDFLKLAEETYSFIEKNLIDSKG-RILRRFREGESRILGYSNDYAEMIASSIVLF 518

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSGNSV 692
           E G G ++L  A+        LF  R   G F  TG D  VLLR   D +DG EPS NS 
Sbjct: 519 EAGRGVRYLQNAVLWMEETIRLF--RSTAGVFFDTGIDGEVLLRRSVDGYDGVEPSANSS 576

Query: 693 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHV 752
              +LV+L+ +  G  SD YR+ AE     F   L   A++ P +  A       SR+ V
Sbjct: 577 LAHSLVKLSFL--GVNSDRYREVAESIFLYFRKELYSSALSYPFLLSAYWSYKHHSREIV 634

Query: 753 VLVGHKSSVDFENMLAAAHASY 774
           ++   K+S    ++LA   + +
Sbjct: 635 LI--RKNSEAGRDLLAWIQSRF 654


>gi|330508169|ref|YP_004384597.1| hypothetical protein MCON_2284 [Methanosaeta concilii GP6]
 gi|328928977|gb|AEB68779.1| protein of unknown function (DUF255) [Methanosaeta concilii GP6]
          Length = 710

 Score =  430 bits (1106), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 281/709 (39%), Positives = 375/709 (52%), Gaps = 65/709 (9%)

Query: 86  MAERTPASTSHSRNK-HTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLS 144
           M E   +    SR+    NRL  E SPYLLQHA NPVDW+ WGEEAF  AR+ D PIFLS
Sbjct: 1   MTEDPSSGIDPSRSSCQQNRLCKEKSPYLLQHACNPVDWYPWGEEAFEAARREDKPIFLS 60

Query: 145 IGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGW 204
           +GYSTCHWCHVM  ESFED  VA+LLN  F+ IKVDREERPD+D++YM    A+ G GGW
Sbjct: 61  VGYSTCHWCHVMAHESFEDPNVARLLNQSFICIKVDREERPDIDQIYMAAAIAVSGRGGW 120

Query: 205 PLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS 264
           PL+V ++PD KP    TY P +   G  G   ++ +VK+ WD  R+ L  S    ++ L 
Sbjct: 121 PLTVMMTPDKKPFFAATYIPKKGHMGLTGLMELIAQVKEMWDNDRESLMSSANIIVDHLK 180

Query: 265 EALS---ASASSNKLPDELP-----QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQ 316
              S   A        D L       + L      LS  YD   GGFG+APKFP P  I 
Sbjct: 181 GRQSGRGAGVQKEAHKDSLSGSPFDSSLLSRGYSALSSIYDPENGGFGTAPKFPTPHHIL 240

Query: 317 MMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFE 376
            +L   K+ ++           +M   TLQ M  GGI+DHVG GFHRYS D  W VPHFE
Sbjct: 241 FLLRCWKRTKNILP-------LEMAKTTLQGMRMGGIYDHVGFGFHRYSTDPEWFVPHFE 293

Query: 377 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEG 436
           KMLYDQ  LA  Y +A+  T +  Y+   R+IL+Y+ RDM  P G  +SAEDADS   EG
Sbjct: 294 KMLYDQALLAMAYAEAYQATGEEEYAQTVREILEYILRDMTSPEGGFYSAEDADS---EG 350

Query: 437 ATRKKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 495
               +EG FY WT+ E+++ LGE    L    + +  +GN +  R           N+L 
Sbjct: 351 ----EEGKFYTWTAVELKESLGEEDFRLLIRLFDVYESGNYEGER-----------NILR 395

Query: 496 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 555
           + +  S +AS L +P E+  +   +   +L+  R KR  P  DDK++  WNGL+I++ AR
Sbjct: 396 QRSSFSDAASVLKIPEEELYHRSSDMISRLYLAREKRVHPLKDDKILTDWNGLMIAALAR 455

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA 615
           A+  L+                  +    A  AA F+   +   +  RL H +R G +  
Sbjct: 456 AAGALQD----------------PDLATAASRAADFLLEVMRTPEG-RLMHRYRQG-ADI 497

Query: 616 PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL 675
              LDDYAFLI GL++LYE     K+L  A+ L    D+ F D E GG+F T  +   +L
Sbjct: 498 QANLDDYAFLIWGLIELYEATFDVKYLKAAVHLNEIMDKHFWDGEAGGFFFTADDGEELL 557

Query: 676 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 735
           +R KE +DGA PSGNS++++NL+RL  +   +       + E   A+          A P
Sbjct: 558 VRKKEYYDGALPSGNSIALLNLLRLLHLTGDT-------SLEEKAALLARSALPAVSAQP 610

Query: 736 L----MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           L    + CA D    P+ + V LVG       + MLAA    +  NK V
Sbjct: 611 LGYTMLLCALDYALGPTYE-VALVGSLEDGGLKEMLAAIRIRFLPNKAV 658


>gi|418030673|ref|ZP_12669158.1| hypothetical protein BSSC8_01020 [Bacillus subtilis subsp. subtilis
           str. SC-8]
 gi|351471732|gb|EHA31845.1| hypothetical protein BSSC8_01020 [Bacillus subtilis subsp. subtilis
           str. SC-8]
          Length = 664

 Score =  430 bits (1106), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 237/609 (38%), Positives = 346/609 (56%), Gaps = 54/609 (8%)

Query: 121 VDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVD 180
           +DWF WGEEAF +A++ + P+ +SIGYSTCHWCHVM  ESFEDE +A+LLN+ FV+IKVD
Sbjct: 1   MDWFPWGEEAFEKAKRENKPVLVSIGYSTCHWCHVMAHESFEDEEIARLLNERFVAIKVD 60

Query: 181 REERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRK 240
           REERPDVD VYM   Q + G GGWPL+VF++PD KP   GTYFP   K+ RPGF  +L  
Sbjct: 61  REERPDVDSVYMRICQLMTGQGGWPLNVFITPDQKPFYAGTYFPKTSKFNRPGFVDVLEH 120

Query: 241 VKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRF 300
           + + +   R+ +      A + L    +A +        L ++A+    +QL+  +D+ +
Sbjct: 121 LSETFANDREHVEDIAENAAKHLQTKTAAKSGEG-----LSESAISRTFQQLASGFDTIY 175

Query: 301 GGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGG 360
           GGFG APKFP P    M++Y  +   +TG+        K    TL  MA GGI+DH+G G
Sbjct: 176 GGFGQAPKFPMP---HMLMYLLRYHHNTGQDNALYNVTK----TLDSMANGGIYDHIGYG 228

Query: 361 FHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPG 420
           F RYS D+ W VPHFEKMLYD   L   Y +A+ +T++  Y  IC  I+ +++R+M    
Sbjct: 229 FARYSTDDEWLVPHFEKMLYDNALLLTAYTEAYQVTQNSRYKEICEQIITFIQREMTHED 288

Query: 421 GEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLS 479
           G  FSA DAD   TEG    +EG +YVW+ +E+   LG+    L+ + Y +   GN    
Sbjct: 289 GSFFSALDAD---TEG----EEGKYYVWSKEEILKTLGDDLGTLYCQVYDITEEGN---- 337

Query: 480 RMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI-LGECRRKLFDVRSKRPRPHLD 538
                   F+GKN+   ++       +     EK L++ L + R++L   R +R  PH+D
Sbjct: 338 --------FEGKNIPNLIHTKREQIKEDAGLTEKELSLKLEDARQQLLKTREERTYPHVD 389

Query: 539 DKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD 598
           DKV+ SWN L+I+  A+A+K+ +                  +Y+ +A+ A +FI   L  
Sbjct: 390 DKVLTSWNALMIAGLAKAAKVYQ----------------EPKYLSLAKDAITFIENKLII 433

Query: 599 EQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLD 658
           +   R+   +R+G  K  GF+DDYAFL+   LDLYE      +L  A +L +    LF D
Sbjct: 434 DG--RVMVRYRDGEVKNKGFIDDYAFLLWAYLDLYEASFDLSFLQKAKKLTDDMISLFWD 491

Query: 659 REGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEH 718
            E GG++ T  +  ++++R KE +DGA PSGNSV+ + L+RL  +   S      + AE 
Sbjct: 492 EEHGGFYFTGHDAEALIVREKEVYDGAVPSGNSVAAVQLLRLGQVTGDSS---LIEKAET 548

Query: 719 SLAVFETRL 727
             +VF+  +
Sbjct: 549 MFSVFKQHI 557


>gi|392955811|ref|ZP_10321341.1| hypothetical protein A374_03694 [Bacillus macauensis ZFHKF-1]
 gi|391878053|gb|EIT86643.1| hypothetical protein A374_03694 [Bacillus macauensis ZFHKF-1]
          Length = 679

 Score =  430 bits (1106), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 254/661 (38%), Positives = 353/661 (53%), Gaps = 56/661 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYLLQHAH PVDW+ WGEEAF +AR+   P+FLSIGYSTCHWCHVM+ ESF+
Sbjct: 4   NRLIHEKSPYLLQHAHQPVDWYPWGEEAFEKARREKKPVFLSIGYSTCHWCHVMKKESFD 63

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  VA LLN+ FV+IKVDREERPD+D+VYM   Q L G GGWPL+VFL+ D +P   G Y
Sbjct: 64  DHEVAALLNERFVAIKVDREERPDLDQVYMAVCQGLTGQGGWPLNVFLTADQRPFYAGVY 123

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP ED+YG PGFK+++ ++ + + ++ + +        ++L+E+L         P  L +
Sbjct: 124 FPKEDRYGSPGFKSVITQLSEKYTERHEEIHDYS----KRLTESLQRKMKQE--PTALQE 177

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L  C  QL + +DS +GGF  APKFP P  +  +L +       G+        +MV 
Sbjct: 178 TILHTCFNQLGQMFDSIYGGFSQAPKFPAPTILTYLLRY-------GQWQGNDLALQMVE 230

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  MA GGI+D +G GF RY+VD+ W VPHFEKMLYD   L   Y++A+ +TK   Y 
Sbjct: 231 RTLDAMADGGIYDQIGYGFSRYAVDQMWLVPHFEKMLYDNALLLIAYVEAYQVTKKPRYQ 290

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            I  +I+ Y+   M    G  + AEDADS   EG    +EG +YV++  E+E  L +   
Sbjct: 291 QIAAEIIQYVTTVMRDEQGGFYCAEDADS---EG----EEGKYYVFSKTEIERQLPQE-- 341

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGMPLEKYLNILGE 520
                   + +  C L  ++D  N F+G NV  LI        A  LG+  EK   ++ +
Sbjct: 342 --------QASAFCALYDITDEGN-FEGNNVPNLIHQRKERI-AQTLGITEEKLSTLVEQ 391

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R+ L+  R  R  PH DDK++ SWN L+I   A+A+                   D   
Sbjct: 392 ARQTLYRYRETRIPPHKDDKILTSWNALMIVGLAKAA----------------AAWDEPA 435

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           Y E A+SA SFI + L      R+   +R G  +  GF+DDYAFL    L++YE     +
Sbjct: 436 YREHAKSALSFIEKELVIHD--RVMVRYREGDVQGKGFIDDYAFLAWAYLEMYEATFDDR 493

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
           ++  A  L      LF D   GG++    +   +++  KE +DGA PSGN V+   L +L
Sbjct: 494 YISKAQTLTQDMLSLFWDESHGGFYYAGNDAEQLIVTGKEAYDGAMPSGNGVAAYVLWKL 553

Query: 701 ASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSS 760
             + A  +   Y +  E    VF + L         +     ML+      VVLV  +  
Sbjct: 554 GKLTADPQ---YDEKLEALFDVFSSDLSHYPTGHTQLLQVW-MLTQMKTAEVVLVAEQEQ 609

Query: 761 V 761
           V
Sbjct: 610 V 610


>gi|418746293|ref|ZP_13302623.1| PF03190 family protein [Leptospira santarosai str. CBC379]
 gi|410792840|gb|EKR90765.1| PF03190 family protein [Leptospira santarosai str. CBC379]
          Length = 699

 Score =  430 bits (1106), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 271/690 (39%), Positives = 376/690 (54%), Gaps = 63/690 (9%)

Query: 93  STSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHW 152
           ++  S +++ NRL+ E SPYL QHA+NPVDWF WGEEAF +A+++D  IFLSIGY+TCHW
Sbjct: 7   NSMQSGSRNPNRLSKEKSPYLQQHAYNPVDWFPWGEEAFTKAKEQDKLIFLSIGYATCHW 66

Query: 153 CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 212
           CHVME ESFE+  VA  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL+VFL+P
Sbjct: 67  CHVMERESFENPTVADYLNSHFVSIKVDREERPDIDRIYMDALHAMNQQGGWPLNVFLTP 126

Query: 213 DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASAS 272
           D KP+ GGTYFPPE  YGR  F  +L  ++  W++KR  L      A  +LS+ L  S  
Sbjct: 127 DGKPITGGTYFPPEPGYGRKSFLEVLNILRKIWNEKRQEL----VVASSELSQYLKDSGE 182

Query: 273 SNKLPDE---LPQNALRLCAEQLSKS-YDSRFGGFGS--APKFPRPVEIQMML-YHSKKL 325
              +  +   LP       A  L +S YDS FGGF +    KFP  + +  +L YH    
Sbjct: 183 GRAVEKQEGNLPSENCFDSAFSLYESYYDSEFGGFKTNHVNKFPPSMGLSFLLRYH---- 238

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
               +S    +  +M   TL  M +GGI+D VGGG  RYS D RW VPHFEKMLYD    
Sbjct: 239 ----RSSGNPKALEMAENTLLAMKQGGIYDQVGGGLCRYSTDPRWTVPHFEKMLYDNSLF 294

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
               ++  S++K +       D++ YL RDM    G I SAEDADS   EG    +EG F
Sbjct: 295 LETLVECSSVSKKISAKSFALDVISYLHRDMRNEDGGICSAEDADS---EG----EEGLF 347

Query: 446 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 505
           YVW  +E  ++ GE + + ++ + +   GN            F+GKN+L E +  S +A 
Sbjct: 348 YVWDLEEFREVCGEDSRILEKFWNVTEKGN------------FEGKNILRE-SYPSGAAK 394

Query: 506 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 565
                  +  ++L   R KL + RSKR RP  DDK++ SWNGL   +  +A         
Sbjct: 395 FSEEEWNRIDSVLERGRAKLLERRSKRIRPLRDDKILTSWNGLYTKALTKAG-------- 446

Query: 566 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 625
                   V   +++++++AE   SFI R+L D    R+   FR+G S   G+ +DYA +
Sbjct: 447 --------VAFQKEDFLKLAEETYSFIERNLID-SNGRILRRFRDGESGILGYSNDYAEM 497

Query: 626 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDG 684
           I+  + L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +DG
Sbjct: 498 IASSIALFEAGRGIRYLKNAVLWMEEAIRLF--RSPAGVFFDTGNDGEVLLRRSVDGYDG 555

Query: 685 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 744
            EPS NS  V +LV+L+  + G  S  YR+ AE   + F   L   ++  P +  A    
Sbjct: 556 VEPSANSSLVYSLVKLS--LFGVDSARYRKFAESIFSYFTKELSSYSLGYPHLLSAYWTY 613

Query: 745 SVPSRKHVVLVGHKSSVDFENMLAAAHASY 774
              S K +VL+  K +   +++LA     +
Sbjct: 614 RFHS-KEIVLI-RKDADSGKDLLAEIQTKF 641


>gi|421092713|ref|ZP_15553445.1| PF03190 family protein [Leptospira borgpetersenii str. 200801926]
 gi|410364564|gb|EKP15585.1| PF03190 family protein [Leptospira borgpetersenii str. 200801926]
 gi|456889958|gb|EMG00828.1| PF03190 family protein [Leptospira borgpetersenii str. 200701203]
          Length = 700

 Score =  430 bits (1106), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 267/682 (39%), Positives = 371/682 (54%), Gaps = 58/682 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           ++  NRL+ E SPYL QHA+NPVDWF WGEEA  +AR++D  IFLSIGY+TCHWCHVME 
Sbjct: 13  SRSPNRLSKEKSPYLQQHAYNPVDWFPWGEEALTKAREQDKLIFLSIGYATCHWCHVMEK 72

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFE++ VA  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++FL+PD KP+ 
Sbjct: 73  ESFENQMVADYLNSHFVSIKVDREERPDIDRIYMDALHAMDQQGGWPLNIFLTPDGKPIT 132

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
           GGTYFPPE  YGR  F  +L  ++  W +KR  L  + +     L ++    A   +   
Sbjct: 133 GGTYFPPEPGYGRKSFLEVLNILRKVWSEKRQELIVASSELSRYLKDSGEGRAIEKQEEG 192

Query: 279 ELP-QNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML-YHSKKLEDTGKSGEA 334
            LP ++            YD+ FGGF +    KFP  + +  +L YH         S   
Sbjct: 193 SLPSKDCFNFGFSLYESYYDAEFGGFKTNHVNKFPPSMGLSFLLRYH--------HSSGN 244

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
            +  +MV  TL  M +GGI+D VGGG  RYS D RW VPHFEKMLYD        ++   
Sbjct: 245 PKALEMVENTLLAMKRGGIYDQVGGGLCRYSTDHRWMVPHFEKMLYDNSLFLETLVECSQ 304

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
           ++K +       D++ YL RDM   GG I SAEDADS   EG    +EG FY+W  +E  
Sbjct: 305 VSKKISAESFALDVISYLHRDMRIVGGGICSAEDADS---EG----EEGLFYIWDFEEFR 357

Query: 455 DILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 514
           ++ GE + + ++ + +   GN            F+GKN+L E       A+KL     K 
Sbjct: 358 EVCGEDSRILEKFWNVTNKGN------------FEGKNILHE--SYGGEATKLSEEEWKR 403

Query: 515 LN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
           ++ +L   R KL + RSKR RP  DDK++ SWNGL I + A+A                 
Sbjct: 404 IDSVLERARAKLLERRSKRVRPLRDDKILTSWNGLYIKALAKAG---------------- 447

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
           +   R++++++AE   SFI R+L D    R+   FR+G S   G+ +DYA +IS  + L+
Sbjct: 448 IAFRREDFLKLAEETYSFIERNLIDPDG-RILRRFRDGESGILGYSNDYAEMISSSIVLF 506

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSGNSV 692
           E G G ++L  A+        LF  R   G F  TG D  VLLR   D +DG EPS NS 
Sbjct: 507 EAGCGIRYLKNAVLWMEEAIRLF--RSPAGVFFDTGNDGEVLLRRSVDGYDGVEPSANSS 564

Query: 693 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHV 752
              +LV+L+  + G  S  YR+ AE   + F   L   +++ P +  A       S K +
Sbjct: 565 LAYSLVKLS--LLGIDSVRYRKFAELIFSYFTKELSTHSLSYPHLLSAYWTYRYHS-KEI 621

Query: 753 VLVGHKSSVDFENMLAAAHASY 774
           VL+  K +   +++LAA    +
Sbjct: 622 VLI-RKDANSGKDLLAAIQTRF 642


>gi|168703256|ref|ZP_02735533.1| hypothetical protein GobsU_27241 [Gemmata obscuriglobus UQM 2246]
          Length = 698

 Score =  430 bits (1106), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 268/686 (39%), Positives = 368/686 (53%), Gaps = 54/686 (7%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           +  NRLA E S YL QHA+NPVDW+ WG EA A AR  D PIFLS+GYS CHWCHVME E
Sbjct: 5   RQPNRLATETSLYLRQHANNPVDWYPWGPEALARARDLDRPIFLSVGYSACHWCHVMEHE 64

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALY-GGGGWPLSVFLSPDLKPLM 218
           SFEDE  A ++N+ FV IKVDREERPD+D +YMT +Q +   GGGWPLSVFL+PDLKP  
Sbjct: 65  SFEDEATAAIMNEHFVCIKVDREERPDLDTIYMTALQVMTREGGGWPLSVFLAPDLKPFF 124

Query: 219 GGTYFPPEDKY---GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
            GTY+PP+D+Y   GRPGFK +L  + +AW  +RD + + G   +  L    +   +   
Sbjct: 125 AGTYYPPDDRYAAQGRPGFKKLLLGIHNAWQTQRDRVHEIGTSVVGDLQRMGALGDADGP 184

Query: 276 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
           +  EL   A       L +SYD RFGGFGS PKFP  +E++++L  S +  D        
Sbjct: 185 VAPELLAGA----LAALRRSYDPRFGGFGSQPKFPHALELKLLLRLSDRFND-------P 233

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
               MV  TL  MA+GGI+D +GGGF RYSVD +W VPHFEKMLYD   LA+   +A+  
Sbjct: 234 VALDMVKHTLTTMARGGIYDQLGGGFARYSVDAKWLVPHFEKMLYDNALLASALAEAYQR 293

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           T D F+  I R+ LDY+ R+M   GG  FS +DADS   EG    +EG FYVW+  E+  
Sbjct: 294 TGDPFFQQIGRETLDYVVREMWAEGGAFFSTQDADS---EG----EEGKFYVWSLDELRA 346

Query: 456 ILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
           +LG     F    +    G             F+G+N+L      +      G   E + 
Sbjct: 347 VLGAEDAEFACKVWGATRG-----------GNFEGRNILFRTLSDADEGKAHGTSEEAFR 395

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
             L   +  L+  R+KR  P  D+K++ +WNGL+I++FA+             F     G
Sbjct: 396 ARLRAVKDTLYAARAKRVWPGRDEKILTAWNGLMIAAFAQ-------------FGMATGG 442

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
            D       A+     I R +        + +    P K  G+L+DYAFL   L+ LYE 
Sbjct: 443 EDAACAAVAADH----ILRTMRTADGRLYRTAGVGQPPKLSGYLEDYAFLADALVTLYEA 498

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
               KWL  A+EL     + F D  G G+F T  +   ++ R K+ HDG+ PSGN+V+V 
Sbjct: 499 TFEVKWLRAALELAEALLKHFADPNGPGFFFTADDHEELIARTKDLHDGSTPSGNAVAVT 558

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 755
            L+RLA++    + D   + AE +L  +   + +   A   M  A D    P ++ V +V
Sbjct: 559 VLLRLAALT--GRRDLA-EPAERTLRGYRETMAEHPAASGQMLIALDFHLGPVQQ-VAIV 614

Query: 756 GHKSSVDFENMLAAAHASYDLNKTVS 781
           G +        + A  A++   + V+
Sbjct: 615 GPEHDQATRRAIEAVRATFGPRRVVA 640


>gi|326474295|gb|EGD98304.1| hypothetical protein TESG_05683 [Trichophyton tonsurans CBS 112818]
 gi|326479253|gb|EGE03263.1| DUF255 domain-containing protein [Trichophyton equinum CBS 127.97]
          Length = 774

 Score =  430 bits (1106), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 248/651 (38%), Positives = 363/651 (55%), Gaps = 42/651 (6%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRL+   SPY+  H +NPV W  W   A  +A++ +  IFLSIGYS CHWCHVME ESF
Sbjct: 23  VNRLSESRSPYVRGHMNNPVAWQLWDSTAINKAKQLNRLIFLSIGYSACHWCHVMEKESF 82

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
               VA +LN  F+ IK+DREERPD+D VYM YVQA  G GGWPL+VFL+PDL+P+ GGT
Sbjct: 83  MSAEVAAILNKSFIPIKLDREERPDIDDVYMNYVQATTGSGGWPLNVFLTPDLEPVFGGT 142

Query: 222 YFPPEDKYGRP--------GFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS----- 268
           Y+P  +    P        GF  +L K++D W+ ++    +S      QL E        
Sbjct: 143 YWPGPNATPLPKLGGEEPVGFIDVLEKLRDVWNTQQLRCRESAKEITRQLREFAEEGIHL 202

Query: 269 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK---KL 325
           +  + ++  ++L  + L       +  YD+  GGF  +PKFP PV +  +L  S+   ++
Sbjct: 203 SQVNKSEQEEDLEVDLLEEAFTHFAARYDATNGGFSGSPKFPTPVNLSFLLRLSRYPEEV 262

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
            D     E ++  +M + T+  +A+GGI D +G GF RYSV   W +PHFEKMLYDQ QL
Sbjct: 263 MDIVGREECAKATEMAVNTMIKVARGGIRDQIGYGFSRYSVTPDWSLPHFEKMLYDQAQL 322

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAETEGATRKKEGA 444
            +V++D F  + +        D++ Y+    ++ P G  +S+EDADS  +   T K+EGA
Sbjct: 323 LDVFIDGFEASHEPELLGAIYDLVTYITSPPILSPMGCFYSSEDADSQPSPEDTEKREGA 382

Query: 445 FYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 503
           +YVWT KE++ ILG+  A +   H+ + P GN  ++R++DPH+EF  +NVL      +  
Sbjct: 383 YYVWTLKELKQILGQRDADVCARHWGVLPDGN--VARVNDPHDEFMNRNVLRIATTPAQV 440

Query: 504 ASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFARASKILKS 562
           A + G+  E+ + IL   R KL + R +KR RP LDDK+IV+WNGLVI + A+ + +L+ 
Sbjct: 441 AKEFGLNEEETIRILKTSRVKLREYRETKRVRPELDDKIIVAWNGLVIGALAKCAILLED 500

Query: 563 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFR-NGPSKAPGFLDD 621
                     +     K    +A +A  FI+ +L+D ++ +L   +R +     PGF DD
Sbjct: 501 ----------IDAEKSKHCRLMAGNAVKFIKENLFDAESGQLWRIYRADSRGDTPGFADD 550

Query: 622 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG------GYFNTTGE----D 671
           YA+LISGLL LYE       L +A +LQ   ++ F+           G++ T  E     
Sbjct: 551 YAYLISGLLQLYEATFDDAHLQFADKLQQYLNKYFISVSASDSSICTGFYMTPSEAVTDT 610

Query: 672 PSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 722
           PS L R+K   D A PS N V   NL+RL+S++         +   H+ AV
Sbjct: 611 PSALFRLKTGTDSATPSTNGVIAQNLLRLSSLLEDESYKLKARQTCHAFAV 661


>gi|302342409|ref|YP_003806938.1| hypothetical protein Deba_0974 [Desulfarculus baarsii DSM 2075]
 gi|301639022|gb|ADK84344.1| protein of unknown function DUF255 [Desulfarculus baarsii DSM 2075]
          Length = 681

 Score =  430 bits (1105), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 267/674 (39%), Positives = 366/674 (54%), Gaps = 55/674 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N LAAE SPYL QHA NPVDW  WG  A A+AR +  PIFLSIGY+TCHWCHVM  ESFE
Sbjct: 3   NALAAEQSPYLRQHADNPVDWLPWGPAALAKARDQQKPIFLSIGYATCHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+ VA LLN  +V++KVDREERPD+D +YMT  QAL G GGWPL+  L+PD  P + GTY
Sbjct: 63  DQAVADLLNQHYVAVKVDREERPDLDAIYMTACQALSGAGGWPLTALLTPDGLPFIAGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWD-KKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           FP   + GRPG   IL +V   W+  +R  + Q+G    ++++ A+   A       +L 
Sbjct: 123 FPKTARLGRPGLLEILAEVARRWNGPERARMIQAG----QEVARAIQPQAGPKT---DLD 175

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
             AL +   QL +S+D +FGGFG APKFP P  +  +L    +          S+   MV
Sbjct: 176 PRALGMAYSQLRQSFDDQFGGFGQAPKFPTPHNLLFLLRWQAR-------NPGSDALAMV 228

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             TL  MA GG+ D VG GFHRYSVD  W  PHFEKMLYDQ  LA  YL+A  LT    +
Sbjct: 229 EKTLTAMADGGLFDQVGFGFHRYSVDRPWLTPHFEKMLYDQALLAMAYLEAHQLTGREDF 288

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE-H 460
           +   R +  Y+   M GP G  ++AEDADS   EG     EG +YVWT +EV    G+  
Sbjct: 289 AATARQVFTYVLTRMTGPEGGFYAAEDADS---EGV----EGKYYVWTPQEVLAAAGQAD 341

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
             LF + + +   GN +    S PH     +  L +       A++ G+  ++    L  
Sbjct: 342 GRLFNDFHGITADGNFEHG-TSIPHR----RQSLADF------ATQHGLDADQAAQALER 390

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R  L   R +R  P  DDK+I +WNGL+I++ A+A + L  EA +A             
Sbjct: 391 ARLALLAARQQRIPPLKDDKIITAWNGLMIAALAKAGQALADEALTAAAA-----RAATF 445

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
            ++ A +               RL  S R+G +  PGFL+DYAF+I GL++L+E      
Sbjct: 446 ILQTARATGG------------RLARSQRDGQASGPGFLEDYAFMIWGLIELFEATFELD 493

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
            L  A+EL +   ELF D   GGYF +  +   +++R K+D+DGA P+GNS   +NL+RL
Sbjct: 494 HLEAALELTDKCCELFWDEADGGYFFSPADGEKLIMRDKDDYDGATPAGNSTMTLNLLRL 553

Query: 701 ASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSS 760
           A +    + +   Q    ++A    RL    MA  ++  A D    P+ K +V+ G K+ 
Sbjct: 554 ARLTGRRQLEDMAQQLMQTMAAQTMRLP---MAHTMLLMALDFAQGPT-KEIVICGAKND 609

Query: 761 VDFENMLAAAHASY 774
              + M+A A   +
Sbjct: 610 PAAQAMIAKAQQKF 623


>gi|448310353|ref|ZP_21500197.1| hypothetical protein C493_01015 [Natronolimnobius innermongolicus
           JCM 12255]
 gi|445608208|gb|ELY62067.1| hypothetical protein C493_01015 [Natronolimnobius innermongolicus
           JCM 12255]
          Length = 729

 Score =  430 bits (1105), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 251/653 (38%), Positives = 355/653 (54%), Gaps = 53/653 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W E+A   AR+ DVPIFLSIGYS CHWCHVME ESF 
Sbjct: 8   NRLEEEESPYLRQHADNPVNWQPWDEQALETAREHDVPIFLSIGYSACHWCHVMEEESFA 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA +LN+ FV IKVDREERPDVD +YMT  Q + G GGWPLS +L+P+ KP   GTY
Sbjct: 68  DEAVADVLNEHFVPIKVDREERPDVDSIYMTVCQLVSGRGGWPLSAWLTPEGKPFFVGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRD------MLAQSGAFAIEQLSEALSASASSNKL 276
           FP E+K G+PGF  + R++ D+W    D         Q    A ++L E   + A +   
Sbjct: 128 FPKEEKRGQPGFLDLCRRISDSWSSPEDRPEMENRAEQWTDAAKDRLEETPDSVAGAEPP 187

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
             E+    L   A+   +S D + GGFGS  PKFP+P  ++++   ++  + TG+     
Sbjct: 188 TSEV----LTAAADAAVRSADHQHGGFGSGGPKFPQPSRLRVL---ARAYDRTGE----G 236

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
           E + ++  +L  MA GG++DHVGGGFHRY VD  W VPHFEKMLYD  ++   +L  + L
Sbjct: 237 EYRAVLEESLDAMAAGGLYDHVGGGFHRYCVDADWTVPHFEKMLYDNAEIPRAFLAGYQL 296

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           T D  Y+ +  + L+++ R++   GG  FS  DA S + E   R +EGAF+VWT  E+ D
Sbjct: 297 TGDERYAEVVAETLEFVDRELTHEGGGFFSTLDAQSEDPETGER-EEGAFFVWTPDEIRD 355

Query: 456 ILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
           IL +   A LF E Y +  +GN            F+G+N    +    + A    +  ++
Sbjct: 356 ILDDETTAELFCERYDVTESGN------------FEGQNQPNRVRSIDSLAEAYDLAEDE 403

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
               L + R ++F+ R +RPRP+ D+KV+ SWNGL+I++ A A+ +L  +A         
Sbjct: 404 LRERLEDAREQVFEAREERPRPNRDEKVLASWNGLMIATCAEAALVLGEDA--------- 454

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
                  Y E+   A  F+R  L+D    RL+  +++G     G+L+DYAFL  G L  Y
Sbjct: 455 -------YAEMGVDALEFVRDRLWDADEGRLRRRYKDGDVAIQGYLEDYAFLARGALGCY 507

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           E       L +A+EL  + +  F D + G  + T     S++ R +E  D + PS   V+
Sbjct: 508 EATGDVDHLAFALELARSIEAEFWDADAGTLYFTPESGESLVTRPQELDDQSTPSATGVA 567

Query: 694 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 746
           V  L+ L     G   D     A   L      ++  A+    +C AAD L  
Sbjct: 568 VETLLAL----DGFADDDLESIAVGVLRTHANEIQTNALQHASLCLAADRLEA 616


>gi|448328363|ref|ZP_21517675.1| hypothetical protein C489_04491 [Natrinema versiforme JCM 10478]
 gi|445615887|gb|ELY69525.1| hypothetical protein C489_04491 [Natrinema versiforme JCM 10478]
          Length = 729

 Score =  429 bits (1104), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 249/655 (38%), Positives = 355/655 (54%), Gaps = 55/655 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W E A A A++R+VPIFLSIGYS CHWCHVME ESFE
Sbjct: 8   NRLDEEESPYLRQHADNPVNWQPWDEAALAAAKERNVPIFLSIGYSACHWCHVMEDESFE 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA++LN+ FV IKVDREERPDVD +YMT  Q + G GGWPLS +L+P+ KP   GTY
Sbjct: 68  DEAVAEVLNENFVPIKVDREERPDVDSIYMTVCQLVTGRGGWPLSAWLTPEGKPFFVGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL-- 280
           FP E K G+PGF  +  ++ D+W+ + D          EQ ++A  A     + PD    
Sbjct: 128 FPREGKQGQPGFLDLCERISDSWESEEDRAEMEN--RAEQWTDA--AKDQLEETPDAAGA 183

Query: 281 -------PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGE 333
                    + L   A+ + +S D + GGFGS  KFP+P  ++++   ++  + TG+   
Sbjct: 184 GTGAAPPSSDVLETAADMVLRSADRQHGGFGSGQKFPQPSRLRVL---ARAYDRTGR--- 237

Query: 334 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
             E  ++   TL  MA GG++DHVGGGFHRY VD  W VPHFEKMLYD  ++   +L  +
Sbjct: 238 -EEYLEVFEETLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKMLYDNAEIPRAFLSGY 296

Query: 394 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
            LT +  Y+ +  + L+++ R++    G  FS  DA S E+      +EGAFYVWT ++V
Sbjct: 297 QLTGEDRYATVVSETLEFVDRELTHDEGGFFSTLDAQS-ESPETGEHEEGAFYVWTPEDV 355

Query: 454 EDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPL 511
            + L     A LF   + +  +GN            F+G+N    +   S  A +  +  
Sbjct: 356 HEALESETDAALFCARFDISESGN------------FEGRNQPNRVATVSELADQFDLEE 403

Query: 512 EKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNF 571
            + L  L   R+ LF+ R +RPRP  D+KV+  WNGL+IS++A A+ +L           
Sbjct: 404 SEILKRLDSARQTLFEAREERPRPARDEKVLAGWNGLLISTYAEAALVL----------- 452

Query: 572 PVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLD 631
              G+D  +Y   A  A  F+R  L++E   RL   +++G  K  G+L+DYAFL  G LD
Sbjct: 453 ---GAD--DYAATAVDALEFVRDRLWNEADQRLSRRYKDGDVKVDGYLEDYAFLARGALD 507

Query: 632 LYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNS 691
            Y+       L +A+EL    +  F D + G  + T     S++ R +E  D + PS   
Sbjct: 508 CYQATGEVAHLAFALELARVIEAEFWDEDRGTLYFTPESGESLVTRPQELGDQSTPSATG 567

Query: 692 VSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 746
           V+V  L+ L         + +   A   L     +L+  A+    +C AAD L+ 
Sbjct: 568 VAVEVLLALDEFA----DEDFEDIAATVLETHANKLESSALEHATLCLAADRLAA 618


>gi|398339915|ref|ZP_10524618.1| hypothetical protein LkirsB1_10954 [Leptospira kirschneri serovar
           Bim str. 1051]
          Length = 696

 Score =  429 bits (1104), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 262/682 (38%), Positives = 373/682 (54%), Gaps = 61/682 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           +++ NRL+ E SPYL QH++NPVDWF WGEEA  +A+ +D  IFLS+GY+TCHWCHVME 
Sbjct: 13  SRNPNRLSKEKSPYLQQHSYNPVDWFPWGEEALTKAKDQDKLIFLSVGYATCHWCHVMEK 72

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++FL+P+ +P+ 
Sbjct: 73  ESFENQSIADYLNSHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNMFLTPEGQPIT 132

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
           GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  + A   +  D
Sbjct: 133 GGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGESRAKEKQEAD 192

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKKLEDTGKSGEA 334
             P+N            YDS+FGGF +    KFP  + +  +L  YHS        SG  
Sbjct: 193 FPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS--------SGNP 244

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
           +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD      +  +   
Sbjct: 245 N-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSLFLEILAEYSL 303

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
           ++K +       DI+ YL RDM   GG I SAEDADS   EG    +EG FY+W  +E  
Sbjct: 304 VSKKISAKSFALDIVSYLHRDMRMDGGGICSAEDADS---EG----EEGLFYIWDLEEFR 356

Query: 455 DILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 514
           ++ GE + L ++ + +   GN            F+GKN+L E    +   S       K+
Sbjct: 357 EVCGEDSSLLEKFWNVTKEGN------------FEGKNILHE----NFRGSNFTEEESKH 400

Query: 515 LN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
           L+  L   + KL + RSKR RP  DDK++ SWNGL I +  +                  
Sbjct: 401 LDGALTRGKAKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG---------------- 444

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
           +   R++++++AE   SFI ++L D +  R+   FR G S   G+ +DYA +I+  + L+
Sbjct: 445 IAFQREDFLKLAEETYSFIEKNLIDSKG-RILRRFREGESGILGYSNDYAEMIASSIVLF 503

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSGNSV 692
           E G G ++L  A+        LF  R   G F  TG D  VLLR   D +DG EPS NS 
Sbjct: 504 EAGRGVRYLQNAVLWMEETIRLF--RSTAGVFFDTGIDGEVLLRRSVDGYDGVEPSANSS 561

Query: 693 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHV 752
              +LV+L+ +  G  SD YR+ AE     F   L   A+  P +  A       SR+ V
Sbjct: 562 LAHSLVKLSFL--GVNSDRYREVAESIFLYFRKELYSYALNYPFLLSAYWSYKYHSREIV 619

Query: 753 VLVGHKSSVDFENMLAAAHASY 774
           ++   K+S    ++LA   + +
Sbjct: 620 LI--RKNSEAGRDLLAWIQSRF 639


>gi|418741789|ref|ZP_13298163.1| PF03190 family protein [Leptospira kirschneri serovar Valbuzzi str.
           200702274]
 gi|410751237|gb|EKR08216.1| PF03190 family protein [Leptospira kirschneri serovar Valbuzzi str.
           200702274]
          Length = 688

 Score =  429 bits (1104), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 261/682 (38%), Positives = 375/682 (54%), Gaps = 61/682 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           +++ NRL+ E SPYL QH++NPVDWF WGEEA  +A+ +D  IFLS+GY+TCHWCHVME 
Sbjct: 5   SRNPNRLSKEKSPYLQQHSYNPVDWFPWGEEALTKAKDQDKLIFLSVGYATCHWCHVMEK 64

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++FL+P+ +P+ 
Sbjct: 65  ESFENQSIADYLNSHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNMFLTPEGQPIT 124

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
           GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  + A   +  D
Sbjct: 125 GGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGESRAKEKQEAD 184

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKKLEDTGKSGEA 334
             P+N            YDS+FGGF +    KFP  + +  +L  YHS        SG  
Sbjct: 185 FPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS--------SGNP 236

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
           +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD      +  +   
Sbjct: 237 N-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSLFLEILAEYSL 295

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
           ++K +       DI+ YL RDM   GG I SAEDADS   EG    +EG FY+W  +E  
Sbjct: 296 VSKKISAKSFALDIVSYLHRDMRMDGGGICSAEDADS---EG----EEGLFYIWDLEEFR 348

Query: 455 DILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 514
           ++ G+ + L ++ + +   GN            F+GKN+L E    +   S       K+
Sbjct: 349 EVCGDDSSLLEKFWNVTKEGN------------FEGKNILHE----NFRGSNFTEEESKH 392

Query: 515 LN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
           L+ +L   + KL + RSKR RP  DDK++ SWNGL I +  +                  
Sbjct: 393 LDGVLTRGKAKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG---------------- 436

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
           +   R++++++AE   SFI ++L D +  R+   FR G S   G+ +DYA +I+  + L+
Sbjct: 437 IAFQREDFLKLAEETYSFIEKNLIDSKG-RILRRFREGESGILGYSNDYAEMIASSIVLF 495

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSGNSV 692
           E G G ++L  A+        LF  R   G F  TG D  VLLR   D +DG EPS NS 
Sbjct: 496 EAGRGVRYLQNAVLWMEETIRLF--RSTAGVFFDTGIDGEVLLRRSVDGYDGVEPSANSS 553

Query: 693 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHV 752
              +LV+L+ +  G  SD YR+ AE     F   L   A++ P +  A       SR+ V
Sbjct: 554 LAHSLVKLSFL--GVNSDRYREVAESIFLYFRKELYSYALSYPFLLSAYWSYKYHSREIV 611

Query: 753 VLVGHKSSVDFENMLAAAHASY 774
           ++   K+S    ++LA   + +
Sbjct: 612 LI--RKNSEAGRDLLAWIQSRF 631


>gi|74318745|ref|YP_316485.1| hypothetical protein Tbd_2727 [Thiobacillus denitrificans ATCC
           25259]
 gi|74058240|gb|AAZ98680.1| conserved hypothetical protein [Thiobacillus denitrificans ATCC
           25259]
          Length = 673

 Score =  429 bits (1104), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 259/663 (39%), Positives = 365/663 (55%), Gaps = 56/663 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA+E SPYLLQHA NPVDW+ WG+EA  +AR+ D PI LSIGYS CHWCHVM  + FE
Sbjct: 3   NRLASEQSPYLLQHADNPVDWYPWGDEALEKARREDKPILLSIGYSACHWCHVMAHDCFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPLSVFLSPDLKPLMGGT 221
           D  V  ++N  FV+IKVDREERPD+D++Y T  Q L   GGGWPL+VFL+PD  P   GT
Sbjct: 63  DAEVGAVMNRLFVNIKVDREERPDLDQIYQTAHQLLAQRGGGWPLTVFLTPDQTPFFAGT 122

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKR-DMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           YFP   +Y  PGF  ++  V  AW  +R ++LAQ+ A     L+++ S  A+S   P  L
Sbjct: 123 YFPKTARYQLPGFPELMENVAHAWHARRGEVLAQNDAVRA-ALAQSQSQPAASASTP--L 179

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
               L      L++++D  +GGF  APKFPRP E+  +L  ++        G  ++ ++M
Sbjct: 180 TAAPLEQGVRDLAQAFDPVWGGFSRAPKFPRPGELFFLLRRAQ--------GGDAKAREM 231

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
            LFTL+ MA GG+ D +GGGF RYSVDE W +PHFEKMLYD G L ++Y DA++L  +  
Sbjct: 232 ALFTLRKMASGGVVDQLGGGFCRYSVDEEWAIPHFEKMLYDNGPLLHLYADAWALRGETL 291

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-- 458
           +      I+ +L R+M  P G  +SA DADS   EG     EG FYVW+ +EV+ +L   
Sbjct: 292 FRETAEGIVAWLLREMRAPEGGFYSALDADS---EG----HEGKFYVWSREEVKSLLTPD 344

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
           E+A+    + +  P           P+ E    N L         A+ LG+        +
Sbjct: 345 EYAVAAPFYGFDAP-----------PNFENTSWNPL-RARPLEEIAAALGLFPTDAEARV 392

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
              RRKLF  R  R RP  DDK + SWN L+I   A A +++                 R
Sbjct: 393 AAARRKLFAARESRIRPGRDDKQLTSWNALMIGGLAHAGRVMA----------------R 436

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
            E++  A +A  F+RR+L+  +  RL+ +F+ G ++   +LDDYAFL+  LL+  +    
Sbjct: 437 PEWVAEAHAAIDFLRRNLW--RDGRLRATFKRGEARLNAYLDDYAFLVDALLETMQAAYR 494

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
              + WA EL +     F DRE GG+F T+ +  ++L R K  +D A PSGN V+   L 
Sbjct: 495 EADMAWAQELADALLAHFEDREAGGFFFTSHDHEALLTRPKPGYDNATPSGNGVAAFALQ 554

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHK 758
           RL  ++  ++   Y   +   L +F  ++    +A P +    D    P R  +VL G  
Sbjct: 555 RLGHLLGETR---YLDASARCLRLFLPQVVQQPIAHPTLLAVLDEALRPPRV-IVLRGPD 610

Query: 759 SSV 761
           + V
Sbjct: 611 TPV 613


>gi|327293790|ref|XP_003231591.1| hypothetical protein TERG_07891 [Trichophyton rubrum CBS 118892]
 gi|326466219|gb|EGD91672.1| hypothetical protein TERG_07891 [Trichophyton rubrum CBS 118892]
          Length = 774

 Score =  429 bits (1104), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 247/651 (37%), Positives = 362/651 (55%), Gaps = 42/651 (6%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRL+   SPY+  H +NPV W  W   A  +A++ +  IFLSIGYS CHWCHVME ESF
Sbjct: 23  VNRLSESRSPYVRSHMNNPVAWQLWDSTAINKAKQLNRLIFLSIGYSACHWCHVMEKESF 82

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
               VA +LN  F+ IK+DREERPD+D VYM YVQA  G GGWPL+VFL+PDL+P+ GGT
Sbjct: 83  MSAEVAAILNKSFIPIKLDREERPDIDDVYMNYVQATTGSGGWPLNVFLTPDLEPVFGGT 142

Query: 222 YFPPEDKYGRP--------GFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS----- 268
           Y+P  +    P        GF  +L K++D W+ ++    +S      QL E        
Sbjct: 143 YWPGPNATPLPKLGGEDPVGFIDVLEKLRDVWNTQQLRCRESAKEITRQLREFAEEGIHL 202

Query: 269 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK---KL 325
           +  + ++  ++L  + L       +  YD+  GGF  +PKFP PV +  +L  S+   ++
Sbjct: 203 SQVNKSEQEEDLEVDLLEEAFTHFAARYDATNGGFSGSPKFPTPVNLSFLLRLSRYPEEV 262

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
            D     E ++  +M + T+  +A+GGI D +G GF RYSV   W +PHFEKMLYDQ QL
Sbjct: 263 MDIVGREECAKATEMAVNTMIKVARGGIRDQIGYGFSRYSVTPDWSLPHFEKMLYDQAQL 322

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAETEGATRKKEGA 444
            +V++D F  + +        D++ Y+    ++ P G  +S+EDADS  +   T K+EGA
Sbjct: 323 LDVFIDGFEASHEPELLGAIYDLVTYITSPPILSPKGCFYSSEDADSQPSPEDTEKREGA 382

Query: 445 FYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 503
           +YVWT KE++ ILG+  A +   H+ + P GN  ++R++DPH+EF  +NVL      +  
Sbjct: 383 YYVWTLKELKQILGQRDADVCARHWGVLPDGN--VARVNDPHDEFMNRNVLRIATTPAQV 440

Query: 504 ASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFARASKILKS 562
           A + G+  E+ + IL   R KL + R +KR RP LDDK+IV+WNGLVI + A+ + +L+ 
Sbjct: 441 AKEFGLNEEETIRILKTSRVKLREYRETKRVRPELDDKIIVAWNGLVIGALAKCAILLED 500

Query: 563 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFR-NGPSKAPGFLDD 621
                     +     K    +A +A  FI+ +L+D ++ +L   +R +     PGF DD
Sbjct: 501 ----------IDAEKSKHCRLMAGNAVKFIKENLFDAESGQLWRIYRADSRGDTPGFADD 550

Query: 622 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG------GYFNTTGE----D 671
           YA+LISGLL LYE       L +A +LQ   ++ F+           G++ T  E     
Sbjct: 551 YAYLISGLLQLYEATFDDAHLQYADKLQQYLNKYFISVSASDSSICTGFYMTPSEAVTDT 610

Query: 672 PSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 722
           P  L R+K   D A PS N V   NL+RL+S++         +   H+ AV
Sbjct: 611 PGALFRLKTGTDSATPSTNGVIAQNLLRLSSLLEDESYKLKARQTCHAFAV 661


>gi|421111206|ref|ZP_15571685.1| PF03190 family protein [Leptospira santarosai str. JET]
 gi|410803388|gb|EKS09527.1| PF03190 family protein [Leptospira santarosai str. JET]
          Length = 699

 Score =  429 bits (1103), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 271/690 (39%), Positives = 376/690 (54%), Gaps = 63/690 (9%)

Query: 93  STSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHW 152
           ++  S +++ NRL+ E SPYL QHA+NPVDWF WGEEAF +A+++D  IFLSIGY+TCHW
Sbjct: 7   NSMQSGSRNPNRLSKEKSPYLQQHAYNPVDWFPWGEEAFTKAKEQDKLIFLSIGYATCHW 66

Query: 153 CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 212
           CHVME ESFE+  VA  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL+VFL+P
Sbjct: 67  CHVMERESFENPTVADYLNSHFVSIKVDREERPDIDRIYMDALHAMNQQGGWPLNVFLTP 126

Query: 213 DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASAS 272
           D KP+ GGTYFPPE  YGR  F  +L  ++  W++KR  L      A  +LS+ L  S  
Sbjct: 127 DGKPITGGTYFPPEPGYGRKSFLEVLNILRKIWNEKRQEL----VVASSELSQYLKDSGE 182

Query: 273 SNKLPDE---LPQNALRLCAEQLSKS-YDSRFGGFGS--APKFPRPVEIQMML-YHSKKL 325
              +  +   LP       A  L +S YDS FGGF +    KFP  + +  +L YH    
Sbjct: 183 GRAVEKQEGNLPSENCFDSAFSLYESYYDSEFGGFKTNHVNKFPPSMGLSFLLRYH---- 238

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
               +S    +  +M   TL  M +GGI+D VGGG  RYS D RW VPHFEKMLYD    
Sbjct: 239 ----RSSGNPKALEMAENTLLAMKQGGIYDQVGGGLCRYSTDPRWTVPHFEKMLYDNSLF 294

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
               ++  S++K +       D++ YL RDM    G I SAEDADS   EG    +EG F
Sbjct: 295 LETLVECSSVSKKISAKSFALDVISYLHRDMRNEDGGICSAEDADS---EG----EEGLF 347

Query: 446 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 505
           YVW  +E  ++ GE + + ++ + +   GN            F+GKN+L E +  S +A 
Sbjct: 348 YVWDLEEFREVCGEDSRILEKFWNVTEKGN------------FEGKNILRE-SYPSGAAK 394

Query: 506 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 565
                  +  ++L   R KL + RSKR RP  DDK++ SWNGL   +  +A         
Sbjct: 395 FSEEEWNRIDSVLERGRAKLLERRSKRIRPLRDDKILTSWNGLYTKALTKAG-------- 446

Query: 566 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 625
                   V   +++++++AE   SFI R+L D    R+   FR+G S   G+ +DYA +
Sbjct: 447 --------VAFQKEDFLKLAEETYSFIERNLID-PNGRILRRFRDGESGILGYSNDYAEM 497

Query: 626 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDG 684
           I+  + L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +DG
Sbjct: 498 IASSIALFEAGRGIRYLKNAVLWMEEAIRLF--RSPAGVFFDTGNDGEVLLRRSVDGYDG 555

Query: 685 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 744
            EPS NS  V +LV+L+  + G  S  YR+ AE   + F   L   ++  P +  A    
Sbjct: 556 VEPSANSSLVYSLVKLS--LFGIDSARYRKFAESIFSYFTKELSSYSLGYPHLLSAYWTY 613

Query: 745 SVPSRKHVVLVGHKSSVDFENMLAAAHASY 774
              S K +VL+  K +   +++LA     +
Sbjct: 614 RFHS-KEIVLI-RKDADSGKDLLAEIQTKF 641


>gi|359683227|ref|ZP_09253228.1| hypothetical protein Lsan2_00420 [Leptospira santarosai str.
           2000030832]
          Length = 691

 Score =  429 bits (1103), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 265/683 (38%), Positives = 369/683 (54%), Gaps = 55/683 (8%)

Query: 96  HSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHV 155
            S +++ NRL+ E SPYL QHA+NPVDWF WGEEAF +A+++D  IFLSIGY+TCHWCHV
Sbjct: 2   QSGSRNPNRLSKEKSPYLQQHAYNPVDWFPWGEEAFTKAKEQDKLIFLSIGYATCHWCHV 61

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           ME ESFE+  VA  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL+VFL+PD K
Sbjct: 62  MERESFENPTVADYLNSHFVSIKVDREERPDIDRIYMDALHAMNQQGGWPLNVFLTPDGK 121

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           P+ GGTYFPPE  YGR  F  +L  ++  W +KR  L  + +   + L ++    A   +
Sbjct: 122 PITGGTYFPPEPGYGRKSFLEVLNILRKIWSEKRQELVVASSELSQYLKDSGEGRAVEKQ 181

Query: 276 LPDELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML-YHSKKLEDTGKSG 332
             D   +N            YDS FGGF +    KFP  + +  +L YH        +S 
Sbjct: 182 EGDLPSENCFDSAFSLYESYYDSEFGGFKTNHVNKFPPSMGLSFLLRYH--------RSS 233

Query: 333 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 392
              +  +M   TL  M +GGI+D VGGG  RYS D RW VPHFEKMLYD        ++ 
Sbjct: 234 GNPKALEMAENTLLAMKQGGIYDQVGGGLCRYSTDPRWTVPHFEKMLYDNSLFLETLVEC 293

Query: 393 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 452
            S++K +       D++ YL RDM    G I SAEDADS   EG    +EG FYVW  +E
Sbjct: 294 SSVSKKISAKSFALDVISYLHRDMRNEDGGICSAEDADS---EG----EEGLFYVWDLEE 346

Query: 453 VEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 512
             ++ GE + + ++ + +   GN            F+GKN+L E +  S +A        
Sbjct: 347 FREVCGEDSRILEKFWNVTEKGN------------FEGKNILRE-SYPSGAAKFSEEEWN 393

Query: 513 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 572
           +  ++L   R KL + RSKR RP  DDK++ SWNGL   +  +A                
Sbjct: 394 RIDSVLERGRAKLLERRSKRIRPLRDDKILTSWNGLYTKALTKAG--------------- 438

Query: 573 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 632
            V   +++++++AE   SFI R+L D    R+   FR+G S   G+ +DYA +I+  + L
Sbjct: 439 -VAFQKEDFLKLAEETYSFIERNLID-PNGRILRRFRDGESGILGYSNDYAEMIASSIAL 496

Query: 633 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSGNS 691
           +E G G ++L  A+        LF  R   G F  TG D  VLLR   D +DG EPS NS
Sbjct: 497 FEAGRGIRYLKNAVLWMEEAIRLF--RSPAGVFFDTGNDGEVLLRRSVDGYDGVEPSANS 554

Query: 692 VSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKH 751
             V +LV+L+  + G  S  YR+ AE   + F   L   ++  P +  A       S K 
Sbjct: 555 SLVYSLVKLS--LFGVDSARYRKFAESIFSYFTKELSSYSLGYPHLLSAYWTYRFHS-KE 611

Query: 752 VVLVGHKSSVDFENMLAAAHASY 774
           +VL+  K +   +++LA     +
Sbjct: 612 IVLI-RKDADSGKDLLAEIQTKF 633


>gi|429193250|ref|YP_007178928.1| thioredoxin domain-containing protein [Natronobacterium gregoryi
           SP2]
 gi|448324467|ref|ZP_21513897.1| hypothetical protein C490_03868 [Natronobacterium gregoryi SP2]
 gi|429137468|gb|AFZ74479.1| thioredoxin domain protein [Natronobacterium gregoryi SP2]
 gi|445618899|gb|ELY72451.1| hypothetical protein C490_03868 [Natronobacterium gregoryi SP2]
          Length = 741

 Score =  429 bits (1103), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 255/660 (38%), Positives = 356/660 (53%), Gaps = 56/660 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  + SPYL QHA NPV+W  W E+A   AR+ D PIFLSIGYS CHWCHVME ESF 
Sbjct: 8   NRLDEQESPYLRQHADNPVNWQPWDEQALETAREHDRPIFLSIGYSACHWCHVMEEESFA 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA++LN+ FV IKVDREERPDVD +YMT    + G GGWPLS +L+P+ KP   GTY
Sbjct: 68  DEAVAEVLNENFVPIKVDREERPDVDSIYMTVCNLVTGRGGWPLSAWLTPEGKPFYVGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLA----QSGAFAIEQLSEALSASASSNKLPD 278
           FP E K G+PGF  +L  + ++W+  R+ +     Q    A +QL E  +  A S    D
Sbjct: 128 FPTEAKRGQPGFLDVLENITNSWENDREEVENRADQWTEAARDQLEE--TPGAPSPGAAD 185

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDTGKSGEASEG 337
               + L   A+   +S D ++GGFGS  PKFP+P  +Q++   ++  + TG      E 
Sbjct: 186 PPSSDLLERAADASLRSADRQYGGFGSDGPKFPQPSRLQVL---ARAYDRTGD----EEY 238

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
           ++++  TL  MA GG++DHVGGGFHRY VD  W VPHFEKMLYD  ++   +L  + LT 
Sbjct: 239 RQVLEETLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKMLYDNAEIPRAFLAGYQLTG 298

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           +  Y+ +  + L ++ R++    G  FS  DA S + E   R +EG FYVWT  EV D+L
Sbjct: 299 EERYAEVVHETLAFVDRELTHEDGGFFSTLDAQSEDPETGER-EEGTFYVWTPAEVHDVL 357

Query: 458 GEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
            +   A LF  HY +  +GN            F+G N    +   +  A +  +   +  
Sbjct: 358 ADETDADLFCAHYDITASGN------------FEGANQPNRVRSIADLAGEFDLAEHEVK 405

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
             L + R++LF+ R KRPRP+ D+KV+  WNGL+I++ A A+  L  E            
Sbjct: 406 QRLEDARQQLFETREKRPRPNRDEKVLAGWNGLMIATCAEAALTLGEE------------ 453

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
                Y E+A  A  F+R  L+D++  RL   ++       G+L+DYAFL  G L  YE 
Sbjct: 454 ----RYAEMAVDALEFVRDRLWDDEEGRLSRRYKGEDVAIEGYLEDYAFLARGALGCYEA 509

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
                 L +A+EL    +E F D + G  + T     S++ R +E  D + PS   V+V 
Sbjct: 510 TGEVDHLAFALELGRAIEEEFWDADRGTLYFTPESGESLVTRPQELGDQSTPSSAGVAVE 569

Query: 696 NLVRLASIVA--GSKSDY---------YRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 744
            L+ L       GSKS           Y + A   L+    RL+  ++    +C AAD L
Sbjct: 570 ILLALEKFAGSEGSKSPRGDGEVADADYEEIAATVLSTHANRLEANSLQHATLCLAADHL 629


>gi|421131211|ref|ZP_15591395.1| PF03190 family protein [Leptospira kirschneri str. 2008720114]
 gi|410357462|gb|EKP04717.1| PF03190 family protein [Leptospira kirschneri str. 2008720114]
          Length = 696

 Score =  429 bits (1103), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 261/682 (38%), Positives = 374/682 (54%), Gaps = 61/682 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           +++ NRL+ E SPYL QH++NPVDWF WGEEA  +A+ +D  IFLS+GY+TCHWCHVME 
Sbjct: 13  SRNPNRLSKEKSPYLQQHSYNPVDWFPWGEEALTKAKDQDKLIFLSVGYATCHWCHVMEK 72

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++FL+P+ +P+ 
Sbjct: 73  ESFENQSIADYLNSHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNMFLTPEGQPIT 132

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
           GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  + A   +  D
Sbjct: 133 GGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGESRAKEKQEAD 192

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKKLEDTGKSGEA 334
             P+N            YDS+FGGF +    KFP  + +  +L  YHS        SG  
Sbjct: 193 FPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS--------SGNP 244

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
           +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD      +  +   
Sbjct: 245 N-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSLFLEILAEYSL 303

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
           ++K +       DI+ YL RDM   GG I SAEDADS   EG    +EG FY+W  +E  
Sbjct: 304 VSKKISAKSFALDIVSYLHRDMRMDGGGICSAEDADS---EG----EEGLFYIWDLEEFR 356

Query: 455 DILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 514
           ++ G+ + L ++ + +   GN            F+GKN+L E    +   S       K+
Sbjct: 357 EVCGDDSSLLEKFWNVTKEGN------------FEGKNILHE----NFRGSNFTEEESKH 400

Query: 515 LN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
           L+  L   + KL + RSKR RP  DDK++ SWNGL I +  +                  
Sbjct: 401 LDGALTRGKAKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG---------------- 444

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
           +   R++++++AE   SFI ++L D +  R+   FR G S   G+ +DYA +I+  + L+
Sbjct: 445 IAFQREDFLKLAEETYSFIEKNLIDSKG-RILRRFREGESGILGYSNDYAEMIASSIVLF 503

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSGNSV 692
           E G G ++L  A+        LF  R   G F  TG D  VLLR   D +DG EPS NS 
Sbjct: 504 EAGRGVRYLQNAVLWMEETIRLF--RSTAGVFFDTGIDGEVLLRRSVDGYDGVEPSANSS 561

Query: 693 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHV 752
              +LV+L+ +  G  SD YR+ AE     F   L   A++ P +  A       SR+ V
Sbjct: 562 LAHSLVKLSFL--GVNSDRYREVAESIFLYFRKELYSYALSYPFLLSAYWSYKYHSREIV 619

Query: 753 VLVGHKSSVDFENMLAAAHASY 774
           ++   K+S    ++LA   + +
Sbjct: 620 LI--RKNSEAGRDLLAWIQSRF 639


>gi|308513297|ref|NP_952224.2| thioredoxin domain-containing protein YyaL [Geobacter
           sulfurreducens PCA]
 gi|409911713|ref|YP_006890178.1| thioredoxin domain-containing protein YyaL [Geobacter
           sulfurreducens KN400]
 gi|41152670|gb|AAR34547.2| thioredoxin domain protein YyaL [Geobacter sulfurreducens PCA]
 gi|298505285|gb|ADI84008.1| thioredoxin domain protein YyaL [Geobacter sulfurreducens KN400]
          Length = 710

 Score =  429 bits (1103), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 252/654 (38%), Positives = 352/654 (53%), Gaps = 60/654 (9%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
             H NRL    SPYLLQHA NPV+W+ WGE+AFA AR  D P+FLSIGY+TCHWCHVM  
Sbjct: 29  GPHFNRLIFATSPYLLQHADNPVEWYPWGEDAFARARAEDRPVFLSIGYATCHWCHVMAA 88

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESF+D+ VA +LN  +V +KVDREERPD+D  +M   Q + G GGWPL++ ++PD +P  
Sbjct: 89  ESFDDDEVAAVLNREYVPVKVDREERPDIDDTFMRVAQMMNGSGGWPLTIIMTPDRQPFF 148

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
             TY P   + G PG   +L K+ + W ++RD++ Q+ +  ++ LS   S   ++ +  D
Sbjct: 149 AATYIPRRSRGGMPGLIDLLEKIAEVWRQRRDVVRQNCSAIMDALSRFNSVRPAAAE--D 206

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
           E P +  R   +QL+  YD  FGGFG APKFP  + +  +L + ++  D        E  
Sbjct: 207 EAPLHGAR---QQLADIYDKEFGGFGGAPKFPMAMNLSFLLRYGQRYGD-------GEAV 256

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
            M   TL  MA+GGI DH+GGGFHRY+VD RW VPHFEKMLYDQ       ++A  +T +
Sbjct: 257 AMATDTLTAMAQGGIWDHLGGGFHRYTVDGRWLVPHFEKMLYDQALCTLALVEAAQVTGN 316

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             +  + ++   ++ R++  P G  +SA DADS   EG    +EGA Y+WT  +V DILG
Sbjct: 317 SVFRELAKETCGFVLRELSAPAGGFYSALDADS---EG----REGACYLWTPAQVRDILG 369

Query: 459 -EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
                LF   Y +   GN            F+G NVL       A A   G+   +    
Sbjct: 370 VADGELFCRLYAVTAWGN------------FEGANVLHLPLAPDAFARDEGVDPLRLQEK 417

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
           + +    L + R +RPRP  D+K+I  WNGL+I++ AR   I   E              
Sbjct: 418 IAQWHILLLEARERRPRPFRDEKIITGWNGLMIAALARTFLICGDEL------------- 464

Query: 578 RKEYMEVAESAASFIRRHLYDEQT--HRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
               +E AE A   +RR   D +T   RL  S   G +  PGFL+DYAF I GLL+L+E 
Sbjct: 465 ---LLEGAERA---VRRVCIDLRTPAGRLVRSCHRGEASGPGFLEDYAFFIRGLLELHEA 518

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
               + L  A  L +    LF D  GGG F+T  +  ++L+R K   DGA PSGN+++  
Sbjct: 519 TLDPRHLALARSLAHDMLRLFGD-SGGGLFDTGSDAETILVRGKGALDGAIPSGNAMAAS 577

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSL--AVFETRLKDMAMAVPLMCCAADMLSVP 747
            L+RL  I      D   + A   +  A      +  A  + L+C   ++L+ P
Sbjct: 578 VLIRLGRIT----GDGVFEEAGRGIIRAFLAGAARQPAAHIHLLCALGELLADP 627


>gi|435846903|ref|YP_007309153.1| thioredoxin domain protein [Natronococcus occultus SP4]
 gi|433673171|gb|AGB37363.1| thioredoxin domain protein [Natronococcus occultus SP4]
          Length = 732

 Score =  429 bits (1103), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 259/661 (39%), Positives = 361/661 (54%), Gaps = 64/661 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W E A   AR++D PIFLSIGYS CHWCHVME ESF 
Sbjct: 8   NRLDEEESPYLRQHADNPVNWQPWDERALETAREQDKPIFLSIGYSACHWCHVMEEESFA 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA++LN+ FV IKVDREERPDVD +YMT  Q + G GGWPLS +L+P+ KP   GTY
Sbjct: 68  DEEVAEVLNEEFVPIKVDREERPDVDSIYMTVCQLVSGRGGWPLSAWLTPEGKPFYVGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASS--NKLPDEL 280
           FP   K G+PGF  ++  + D+W   R+         IE  +E  +A+A+    + PD +
Sbjct: 128 FPKHSKRGQPGFLDLIEGLADSWKTDRE--------EIENRAEEWTAAATDRLEETPDSI 179

Query: 281 ------PQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDTGKSGE 333
                   + L   A+   +S D + GGFGS  PKFP+P  ++++   ++  + TG+   
Sbjct: 180 GAAEPPSSDVLERAADAALRSADRQNGGFGSGGPKFPQPARLRVL---ARAYDRTGR--- 233

Query: 334 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
             E ++++  +L  M +GG++DHVGGGFHRY VDE W VPHFEKMLYD  ++    L  +
Sbjct: 234 -DEYREVLEGSLTAMIEGGLYDHVGGGFHRYCVDEDWTVPHFEKMLYDNAEIPRALLAGY 292

Query: 394 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
            LT D  Y+   RD L+++ R++    G  FS  DA S E      ++EGAF+VWT  EV
Sbjct: 293 QLTGDERYADSVRDTLEFVSRELTHAEGGFFSTLDAQS-EDPATGEREEGAFFVWTPAEV 351

Query: 454 EDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPL 511
            ++LG+   A LF   Y +  +GN            F G+N    +   S  A +  +  
Sbjct: 352 REVLGDETDAELFCARYDITESGN------------FGGQNQPNVVASISELAERFDLAA 399

Query: 512 EKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNF 571
           E     L + R +LF+ R +RPRP+ D+KV+ SWNGL+I++ A A   L           
Sbjct: 400 ETVEQRLEDARAELFEAREERPRPNRDEKVLASWNGLMIATCAEAGLAL----------- 448

Query: 572 PVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLD 631
              G DR  Y  +A  A  F+R  L+D +  RL   F++G     G+L+DYAFL  G L 
Sbjct: 449 ---GEDR--YAGMAVDALEFVRDRLWDAEEGRLSRRFKDGDVAVQGYLEDYAFLARGALG 503

Query: 632 LYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNS 691
            YE     + L +A+EL    +  F D E    + T     S++ R +E +D + PS   
Sbjct: 504 CYEATGEVEHLAFALELARVIEAEFYDAERETIYFTPESGESLVTRPQELNDQSTPSATG 563

Query: 692 VSVINLVRLASIVAGSKSDYYRQNA-----EHSLAVFET---RLKDMAMAVPLMCCAADM 743
           V+V  L+ L    AG  S   R++      E + +V  T   RL+  A+    +C AAD 
Sbjct: 564 VAVETLLALDGF-AGEGSTSPREDGDAEFEEIAASVLRTHAGRLESNALQHATLCLAADR 622

Query: 744 L 744
           L
Sbjct: 623 L 623


>gi|422002946|ref|ZP_16350180.1| hypothetical protein LSS_05548 [Leptospira santarosai serovar
           Shermani str. LT 821]
 gi|417258416|gb|EKT87804.1| hypothetical protein LSS_05548 [Leptospira santarosai serovar
           Shermani str. LT 821]
          Length = 691

 Score =  429 bits (1102), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 271/687 (39%), Positives = 374/687 (54%), Gaps = 63/687 (9%)

Query: 96  HSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHV 155
            S +++ NRL+ E SPYL QHA+NPVDWF WGEEAF +A+++D  IFLSIGY+TCHWCHV
Sbjct: 2   QSGSRNPNRLSKEKSPYLQQHAYNPVDWFPWGEEAFTKAKEQDKLIFLSIGYATCHWCHV 61

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           ME ESFE+  VA  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL+VFL+PD K
Sbjct: 62  MERESFENPTVADYLNSHFVSIKVDREERPDIDRIYMDALHAMNQQGGWPLNVFLTPDGK 121

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           P+ GGTYFPPE  YGR  F  +L  ++  W++KR  L      A  +LS+ L  S     
Sbjct: 122 PITGGTYFPPEPGYGRKSFLEVLNILRKIWNEKRQEL----VVASSELSQYLKDSGEGRA 177

Query: 276 LPDE---LPQNALRLCAEQLSKS-YDSRFGGFGS--APKFPRPVEIQMML-YHSKKLEDT 328
           +  +   LP       A  L +S YDS FGGF +    KFP  + +  +L YH       
Sbjct: 178 VEKQEGNLPSENCFDSAFSLYESYYDSEFGGFKTNHVNKFPPSMGLSFLLRYH------- 230

Query: 329 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 388
            +S    +  +M   TL  M +GGI+D VGGG  RYS D RW VPHFEKMLYD       
Sbjct: 231 -RSSGNPKALEMAENTLLAMKQGGIYDQVGGGLCRYSTDPRWTVPHFEKMLYDNSLFLET 289

Query: 389 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 448
            ++  S++K +       D++ YL RDM    G I SAEDADS   EG    +EG FYVW
Sbjct: 290 LVECSSVSKKISAKSFALDVISYLHRDMRNEDGGICSAEDADS---EG----EEGLFYVW 342

Query: 449 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 508
             +E  ++ GE + + ++ + +   GN            F+GKN+L E +  S +A    
Sbjct: 343 DLEEFREVCGEDSRILEKFWNVTEKGN------------FEGKNILRE-SYPSGAAKFSE 389

Query: 509 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 568
               +  ++L   R KL + RSKR RP  DDK++ SWNGL   +  +A            
Sbjct: 390 EEWNRIDSVLERGRAKLLERRSKRIRPLRDDKILTSWNGLYTKALTKAG----------- 438

Query: 569 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 628
                V   +++++++AE   SFI R+L D    R+   FR+G S   G+ +DYA +I+ 
Sbjct: 439 -----VAFQKEDFLKLAEETYSFIERNLID-SNGRILRRFRDGESGILGYSNDYAEMIAS 492

Query: 629 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEP 687
            + L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +DG EP
Sbjct: 493 SIALFEAGRGIRYLKNAVLWMEEAIRLF--RSPAGVFFDTGNDGEVLLRRSVDGYDGVEP 550

Query: 688 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 747
           S NS  V +LV+L+  + G  S  YR+ AE   + F   L   ++  P +  A       
Sbjct: 551 SANSSLVYSLVKLS--LFGIDSARYRKFAESIFSYFTKELSSYSLGYPHLLSAYWTYRFH 608

Query: 748 SRKHVVLVGHKSSVDFENMLAAAHASY 774
           S K +VL+  K +   +++LA     +
Sbjct: 609 S-KEIVLI-RKDADSGKDLLAEIQTKF 633


>gi|350629727|gb|EHA18100.1| hypothetical protein ASPNIDRAFT_47529 [Aspergillus niger ATCC 1015]
          Length = 769

 Score =  429 bits (1102), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 261/639 (40%), Positives = 352/639 (55%), Gaps = 46/639 (7%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K  NRL    SPY+  H +NPV W  W  EA   A++ +  IFLSIGYS CHWCHVME E
Sbjct: 12  KLVNRLHESRSPYVRAHMNNPVGWQLWDAEAIDLAKRHNRLIFLSIGYSACHWCHVMEKE 71

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SF  + VA +LN  F+ IKVDREERPD+D VYM YVQA  G GGWPL+VFL+PDL+P+ G
Sbjct: 72  SFMSQEVASILNQSFIPIKVDREERPDIDDVYMNYVQATTGSGGWPLNVFLTPDLEPVFG 131

Query: 220 GTYFPPEDKY-----GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASS- 273
           GTY+P  +       G  GF  IL K+ D W  ++    +S     +QL E       S 
Sbjct: 132 GTYWPGPNSSTLTGNGTIGFVEILEKLSDVWQTQQLRCRESAKEITKQLREFAEEGTHSY 191

Query: 274 ---NKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML---YHSKKLED 327
               +  ++L    L    +     YD   GGF +APKFP P  +  +L    +   + D
Sbjct: 192 QGDRQADEDLDLELLEEAYQHFVSRYDPLHGGFSTAPKFPTPSNLSFLLRLGIYPTAVAD 251

Query: 328 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 387
                E ++   M + TL  MA+GGI DH+G GF RYSV   W +PHFEKMLYDQ QL +
Sbjct: 252 IVGRDECAKATAMAVDTLISMARGGIRDHIGHGFARYSVTGDWGLPHFEKMLYDQAQLLD 311

Query: 388 VYLDAFSLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSAETEGATRKKEGAFY 446
           VY+DAF +T +        D+  YL    I  P G   S+EDADS  T   T K+EGAFY
Sbjct: 312 VYVDAFKITHNPELLGAVYDLATYLTTAPIQSPTGAFHSSEDADSLPTPNDTEKREGAFY 371

Query: 447 VWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 505
           VWT KE+  +LG+  A +   H+ + P GN  ++  +DPH+EF  +NVL      S  A 
Sbjct: 372 VWTLKELTQVLGQRDAGVCARHWGVLPDGN--IAPENDPHDEFMNQNVLSVKVTPSRLAK 429

Query: 506 KLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEA 564
             G+  E+ + I+   ++KL D R + R RP LDDK+IV+WNGL I + A+ S + + E 
Sbjct: 430 DFGLGEEEVVRIIRAAKQKLRDYRERTRVRPDLDDKIIVAWNGLAIGALAKCSALFE-EI 488

Query: 565 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP-SKAPGFLDDYA 623
           ES         S   +  E A  A +FI+ +L+++ T +L   +R+G     PGF DDYA
Sbjct: 489 ES---------SKAVQCREAAAKAINFIKENLFEKPTGQLWRIYRDGGRGNTPGFADDYA 539

Query: 624 FLISGLLDLYEFGSGTKWLVWAIELQNTQDEL-----------FLDREG---GGYFNT-- 667
           +LI GLLD+YE      +L +A +LQ+ +  L           FL   G    GY++T  
Sbjct: 540 YLIGGLLDMYEATFDDSYLQFAEQLQSKRLALLTFLLEYLNDNFLAYVGTTPAGYYSTPS 599

Query: 668 --TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIV 704
             T   P  LLR+K   + A P+ N V   NL+RL S++
Sbjct: 600 TMTSGAPGPLLRLKTGTESATPAVNGVIARNLLRLGSLL 638


>gi|448393368|ref|ZP_21567693.1| hypothetical protein C477_15875 [Haloterrigena salina JCM 13891]
 gi|445663783|gb|ELZ16525.1| hypothetical protein C477_15875 [Haloterrigena salina JCM 13891]
          Length = 730

 Score =  429 bits (1102), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 246/655 (37%), Positives = 354/655 (54%), Gaps = 54/655 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W E+A   A++RDVPIFLSIGYS CHWCHVME ESFE
Sbjct: 8   NRLEDEESPYLRQHADNPVNWQPWDEQALEAAKERDVPIFLSIGYSACHWCHVMEDESFE 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+ VA++LN+ FV IKVDREERPD+D +YMT  Q + G GGWPLS +L+P+ KP   GTY
Sbjct: 68  DDDVAEVLNENFVPIKVDREERPDIDSIYMTVAQLVSGRGGWPLSAWLTPEGKPFFVGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDK--------KRDMLAQSGAFAIEQLSEALSASASSN 274
           FP E +  +PGF  + +++ D+W+         + D   ++    +E+  +   A+  + 
Sbjct: 128 FPKESQRNQPGFLELCQRISDSWESEDREEMEHRADQWTEAAKDRLEETPDGAGAAGGAA 187

Query: 275 KLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDTGKSGE 333
           + P       L   A  + +S D ++GGFGS  PKFP+P  + ++   ++  + TG+   
Sbjct: 188 EPPS---SEVLETAANAVLRSADRQYGGFGSGGPKFPQPSRLHVL---ARAYDRTGR--- 238

Query: 334 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
             E  +++  TL  MA GG+ DHVGGGFHRY VD+ W VPHFEKMLYD  ++   +L  +
Sbjct: 239 -EEYLEVIEETLDAMAAGGLSDHVGGGFHRYCVDKDWTVPHFEKMLYDNAEIPRAFLAGY 297

Query: 394 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
            LT D  Y+ +  + LD+L R++    G  FS  DA S E      ++EGAFYVWT  EV
Sbjct: 298 QLTGDERYAEVVEETLDFLERELTHDEGGFFSTLDAQS-EDPATGEREEGAFYVWTPGEV 356

Query: 454 EDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPL 511
            ++L +   A LF   Y +  +GN            F+G+N    +    + A +  +  
Sbjct: 357 SEVLEDETTADLFCARYDITESGN------------FEGRNQPNRVRSLESLAEEYDLEQ 404

Query: 512 EKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNF 571
            +    L + R  LF+ R +RPRP+ D+KV+  WNGL+I++ A A+ +L           
Sbjct: 405 SEIEERLEDARETLFEAREERPRPNRDEKVLAGWNGLMINACAEAALVL----------- 453

Query: 572 PVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLD 631
              G DR  Y E A  A  F+R  L+D    RL   F++G  K  G+L+DYAFL  G L 
Sbjct: 454 ---GEDR--YAEQAVDALEFVRDRLWDADEQRLSRRFKDGDVKVDGYLEDYAFLARGALG 508

Query: 632 LYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNS 691
            Y+       L +A++L  T +  F D E G  + T      ++ R +E  D + PS   
Sbjct: 509 CYQATGDVDHLAFALDLARTIEAEFWDEEQGTIYFTPESGEPLVTRPQELTDQSTPSAAG 568

Query: 692 VSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 746
           V+V  L+ L         D   + A   L     +++  ++    +C AAD L  
Sbjct: 569 VAVETLLALDEFA----EDDLERIAATVLETHANKIEANSLEHASLCLAADRLEA 619


>gi|452913203|ref|ZP_21961831.1| hypothetical protein BS732_1003 [Bacillus subtilis MB73/2]
 gi|452118231|gb|EME08625.1| hypothetical protein BS732_1003 [Bacillus subtilis MB73/2]
          Length = 664

 Score =  428 bits (1101), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 232/587 (39%), Positives = 339/587 (57%), Gaps = 51/587 (8%)

Query: 121 VDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVD 180
           +DWF WGEEAF +A++ + P+ +SIGYSTCHWCHVM  ESFEDE +A+LLN+ FV+IKVD
Sbjct: 1   MDWFPWGEEAFEKAKRENKPVLVSIGYSTCHWCHVMAHESFEDEEIARLLNERFVAIKVD 60

Query: 181 REERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRK 240
           REERPDVD VYM   Q + G GGWPL+VF++PD KP   GTYFP   K+ RPGF  +L  
Sbjct: 61  REERPDVDSVYMRICQLMTGQGGWPLNVFITPDQKPFYAGTYFPKTSKFNRPGFVDVLEH 120

Query: 241 VKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRF 300
           + + +   R+ +      A + L      + ++ K  + L ++A+    +QL+  +D+ +
Sbjct: 121 LSETFANDREHVEDIAENAAKHLQ-----TKTAAKTGEGLSESAIHRTFQQLASGFDTIY 175

Query: 301 GGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGG 360
           GGFG APKFP P    M++Y  +   +TG+        K    TL  MA GGI+DH+G G
Sbjct: 176 GGFGQAPKFPMP---HMLMYLLRYDHNTGQENALYNVTK----TLDSMANGGIYDHIGYG 228

Query: 361 FHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPG 420
           F RYS D+ W VPHFEKMLYD   L   Y +A+ +T++  Y  IC  I+ +++R+M    
Sbjct: 229 FARYSTDDEWLVPHFEKMLYDNALLLTAYTEAYQVTQNSRYKEICEQIITFIQREMTHED 288

Query: 421 GEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLS 479
           G  FSA DAD   TEG    +EG +YVW+ +E+   LG+    L+ + Y +   GN    
Sbjct: 289 GSFFSALDAD---TEG----EEGKYYVWSKEEILKTLGDDLGTLYCQVYDITEEGN---- 337

Query: 480 RMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI-LGECRRKLFDVRSKRPRPHLD 538
                   F+GKN+   ++       +     EK L++ L + R++L   R +R  PH+D
Sbjct: 338 --------FEGKNIPNLIHTKREQIKEDAGLTEKELSLKLEDARQQLLKTREERTYPHVD 389

Query: 539 DKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD 598
           DKV+ SWN L+I+  A+A+K+ +                  +Y+ +A+ A +FI   L  
Sbjct: 390 DKVLTSWNALMIAGLAKAAKVYQ----------------EPKYLSLAKDAITFIENKLII 433

Query: 599 EQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLD 658
           +   R+   +R+G  K  GF+DDYAFL+   LDLYE      +L  A +L +    LF D
Sbjct: 434 DG--RVMVRYRDGEVKNKGFIDDYAFLLWAYLDLYEASFDLSYLQKAKKLTDDMISLFWD 491

Query: 659 REGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVA 705
            E GG++ T  +  ++++R KE +DGA PSGNSV+ + L+RL  +  
Sbjct: 492 EEHGGFYFTGHDAEALIVREKEVYDGAVPSGNSVAAVQLLRLGQVTG 538


>gi|325283375|ref|YP_004255916.1| hypothetical protein Deipr_1147 [Deinococcus proteolyticus MRP]
 gi|324315184|gb|ADY26299.1| hypothetical protein Deipr_1147 [Deinococcus proteolyticus MRP]
          Length = 679

 Score =  428 bits (1101), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 246/609 (40%), Positives = 340/609 (55%), Gaps = 64/609 (10%)

Query: 86  MAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSI 145
           M   TPAS  H      NRL AE SPYL QHA NPV W+ W +EAFAEA +R VP+ LSI
Sbjct: 1   MTNATPASGGH------NRLGAESSPYLRQHADNPVHWWPWSDEAFAEAERRGVPVLLSI 54

Query: 146 GYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWP 205
           GYSTCHWCHVM  ESFE+E  A L+N+ FV+IKVDREERPDVD +YM   QA+ G GGWP
Sbjct: 55  GYSTCHWCHVMAHESFENEATAGLMNERFVNIKVDREERPDVDGIYMAATQAMTGQGGWP 114

Query: 206 LSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 265
           ++VFL    +P   GTY+PP +  G P F+ ++  V DAW  +R  L ++ A A+ +  +
Sbjct: 115 MTVFLDHQRRPFHAGTYYPPHEGLGLPSFRRVMTAVSDAWQNRRADL-EANAQALTEHIQ 173

Query: 266 ALSA--SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK 323
           A+S   SA   + P EL Q  L L    L + +D   GGFG APKFP P  +  +L    
Sbjct: 174 AMSEPRSAGGQEWPAELLQAPLDL----LPQVFDPVHGGFGGAPKFPAPTTLDFLL---- 225

Query: 324 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 383
                 KSG+  +GQ+M L TL+ M +GGI+D +GGGFHRYSVD +W VPHFEKMLYD  
Sbjct: 226 ------KSGD-EQGQQMALHTLRQMGRGGIYDQLGGGFHRYSVDAQWLVPHFEKMLYDNA 278

Query: 384 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 443
           QL    L A+ ++ D  ++   R+ L YL R+M  P G  +SA+DAD+   EG T     
Sbjct: 279 QLTRTLLAAYQVSGDPAFAEAARETLRYLEREMRHPSGSFYSAQDADTEGVEGLT----- 333

Query: 444 AFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 502
             + WT  E++ +LG E A      Y +   GN +     DPH    G+  ++       
Sbjct: 334 --FTWTPAELQAVLGAEDAEWLARFYGVTEGGNFE-----DPHRRDAGRRTVL------- 379

Query: 503 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 562
             S++G    +  + L E R +L   R +RP+PH DDKV+ SWNGLV+++ A AS+IL  
Sbjct: 380 --SRVGELTPEQRSRLPELRARLLTAREERPQPHRDDKVLTSWNGLVLAALADASRILGE 437

Query: 563 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDD 621
                             ++E+A   A+++R  +  +    L H++ +G + +  G L+D
Sbjct: 438 ----------------PHWLELARQNAAWVRETMR-QPDGTLWHTWLDGHAPSVEGLLED 480

Query: 622 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 681
           +A    GL+ LY+     ++L WA EL       F D   G + ++ G+  ++L R    
Sbjct: 481 HALYGLGLVALYQASGELEYLTWARELWTVVQRDFWDDAAGLFRSSGGKAEALLTRQSSA 540

Query: 682 HDGAEPSGN 690
            D A  S N
Sbjct: 541 FDSAIISDN 549


>gi|410450937|ref|ZP_11304964.1| PF03190 family protein [Leptospira sp. Fiocruz LV3954]
 gi|410015249|gb|EKO77354.1| PF03190 family protein [Leptospira sp. Fiocruz LV3954]
          Length = 691

 Score =  428 bits (1101), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 271/687 (39%), Positives = 373/687 (54%), Gaps = 63/687 (9%)

Query: 96  HSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHV 155
            S +++ NRL+ E SPYL QHA+NPVDWF WGEEAF +A+++D  IFLSIGY+TCHWCHV
Sbjct: 2   QSGSRNPNRLSKEKSPYLQQHAYNPVDWFPWGEEAFTKAKEQDKLIFLSIGYATCHWCHV 61

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           ME ESFE+  VA  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL+VFL+PD K
Sbjct: 62  MERESFENPTVADYLNSHFVSIKVDREERPDIDRIYMDALHAMNQQGGWPLNVFLTPDGK 121

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           P+ GGTYFPPE  YGR  F  +L  ++  W++KR  L      A  +LS+ L  S     
Sbjct: 122 PITGGTYFPPEPGYGRKSFLEVLNILRKIWNEKRQEL----VVASSELSQYLKDSGEGRA 177

Query: 276 LPDE---LPQNALRLCAEQLSKS-YDSRFGGFGS--APKFPRPVEIQMML-YHSKKLEDT 328
           +  +   LP       A  L +S YDS FGGF +    KFP  + +  +L YH       
Sbjct: 178 VEKQEGNLPSENCFDSAFSLYESYYDSEFGGFKTNHVNKFPPSMGLSFLLRYH------- 230

Query: 329 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 388
            +S    +  +M   TL  M +GGI+D VGGG  RYS D RW VPHFEKMLYD       
Sbjct: 231 -RSSGNPKALEMAENTLLAMKQGGIYDQVGGGLCRYSTDPRWTVPHFEKMLYDNSLFLET 289

Query: 389 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 448
             +  S++K +       D++ YL RDM    G I SAEDADS   EG    +EG FYVW
Sbjct: 290 LAECSSVSKKISAKSFALDVISYLHRDMRNEDGGICSAEDADS---EG----EEGLFYVW 342

Query: 449 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 508
             +E  ++ GE + + ++ + +   GN            F+GKN+L E +  S +A    
Sbjct: 343 DLEEFREVCGEDSRILEKFWNVTEKGN------------FEGKNILRE-SYPSGAAKFSE 389

Query: 509 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 568
               +  ++L   R KL + RSKR RP  DDK++ SWNGL   +  +A            
Sbjct: 390 EEWNRIDSVLERGRAKLLERRSKRIRPLRDDKILTSWNGLYTKALTKAG----------- 438

Query: 569 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 628
                V   +++++++AE   SFI R+L D    R+   FR+G S   G+ +DYA +I+ 
Sbjct: 439 -----VAFQKEDFLKLAEETYSFIERNLID-SNGRILRRFRDGESGILGYSNDYAEMIAS 492

Query: 629 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEP 687
            + L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +DG EP
Sbjct: 493 SIALFEAGRGIRYLKNAVLWMEEAIRLF--RSPAGVFFDTGNDGEVLLRRSVDGYDGVEP 550

Query: 688 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 747
           S NS  V +LV+L+  + G  S  YR+ AE   + F   L   ++  P +  A       
Sbjct: 551 SANSSLVYSLVKLS--LFGVDSARYRKFAESIFSYFTKELSSYSLGYPHLLSAYWTYRFH 608

Query: 748 SRKHVVLVGHKSSVDFENMLAAAHASY 774
           S K +VL+  K +   +++LA     +
Sbjct: 609 S-KEIVLI-RKDADSGKDLLAEIQTKF 633


>gi|284164956|ref|YP_003403235.1| hypothetical protein Htur_1677 [Haloterrigena turkmenica DSM 5511]
 gi|284014611|gb|ADB60562.1| protein of unknown function DUF255 [Haloterrigena turkmenica DSM
           5511]
          Length = 733

 Score =  428 bits (1100), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 247/653 (37%), Positives = 357/653 (54%), Gaps = 49/653 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W E+A   A++RDVPIFLSIGYS CHWCHVME ESFE
Sbjct: 8   NRLEDEESPYLRQHADNPVNWQPWDEDALEAAKERDVPIFLSIGYSACHWCHVMEDESFE 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+ VA +LN+ FV IKVDREERPD+D +YMT  Q + G GGWPLS +L+P+ KP   GTY
Sbjct: 68  DDEVAAVLNENFVPIKVDREERPDIDSIYMTVAQLVSGRGGWPLSAWLTPEGKPFFVGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRD------MLAQSGAFAIEQLSEALSASASSNKL 276
           FP E +  +PGF  + +++ D+W+   D         Q    A ++L E    + ++   
Sbjct: 128 FPKESQRNQPGFLELCQRISDSWESGEDREEMEHRADQWTEAAKDRLEETPDDAGTAGGA 187

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
            +      L   A+   +S D ++GGFGS  PKFP+P  + ++   ++  + TG+     
Sbjct: 188 AEPPSSEVLETAADAALRSADRQYGGFGSGGPKFPQPSRLHVL---ARAYDRTGR----E 240

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
           E  ++V  +L  MA GG++DHVGGGFHRY VD+ W VPHFEKMLYD  ++   +L  + L
Sbjct: 241 EYLEVVEESLDAMAAGGLYDHVGGGFHRYCVDKDWTVPHFEKMLYDNAEIPRAFLAGYQL 300

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           T +  Y+ +  + L +L R++    G  FS  DA S + E   R +EG FYVWT  EV +
Sbjct: 301 TGEERYAEVVDETLAFLERELTHDEGGFFSTLDAQSEDPETGER-EEGVFYVWTPDEVSE 359

Query: 456 ILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
           +L +   A LF   Y +  +GN            F+G+N    +    + A +  +   +
Sbjct: 360 VLEDETTADLFCARYDITESGN------------FEGRNQPNRVRSLESLADEYDLAEAE 407

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
             + L + R +LF+ R +RPRP+ D+KV+  WNGL+I++ A A+               V
Sbjct: 408 IEDRLEDAREQLFEAREQRPRPNRDEKVLAGWNGLMINACAEAAL--------------V 453

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
           VG+D  EY + A  A  F+R  L+DE   RL   F++G  K  G+L+DYAFL  G L  Y
Sbjct: 454 VGND--EYADQAVDALEFVRDRLWDEDEQRLSRRFKDGNVKVDGYLEDYAFLARGALGCY 511

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           +       L +A++L  T +  F D E G  + T     S++ R +E  D + PS   V+
Sbjct: 512 QATGDVDHLGFALDLARTIEAEFWDEEQGTIYFTPESGESLVTRPQELTDQSTPSAAGVA 571

Query: 694 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 746
           V  L+ L         D + + A   L     +++  ++    +C AAD L  
Sbjct: 572 VETLLALDEFA----EDDFGEIAATVLETHANKIEANSLEHASLCLAADRLEA 620


>gi|456873671|gb|EMF89033.1| PF03190 family protein [Leptospira santarosai str. ST188]
          Length = 691

 Score =  428 bits (1100), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 271/687 (39%), Positives = 372/687 (54%), Gaps = 63/687 (9%)

Query: 96  HSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHV 155
            S +++ NRL+ E SPYL QHA+NPVDWF WGEEAF +A+++D  IFLSIGY+TCHWCHV
Sbjct: 2   QSGSRNPNRLSKEKSPYLQQHAYNPVDWFPWGEEAFTKAKEQDKLIFLSIGYATCHWCHV 61

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           ME ESFE+  VA  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL+VFL+PD K
Sbjct: 62  MERESFENPTVADYLNSHFVSIKVDREERPDIDRIYMDALHAMNQQGGWPLNVFLTPDGK 121

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           P+ GGTYFPPE  YGR  F  +L  ++  W +KR  L      A  +LS+ L  S     
Sbjct: 122 PITGGTYFPPEPGYGRKSFLEVLNILRKIWSEKRQEL----VVASSELSQYLKDSGEGRA 177

Query: 276 LPDE---LPQNALRLCAEQLSKS-YDSRFGGFGS--APKFPRPVEIQMML-YHSKKLEDT 328
           +  +   LP       A  L +S YDS FGGF +    KFP  + +  +L YH       
Sbjct: 178 VEKQEGNLPSENCFDSAFSLYESYYDSEFGGFKTNHVNKFPPSMGLSFLLRYH------- 230

Query: 329 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 388
            +S    +  +M   TL  M +GGI+D VGGG  RYS D RW VPHFEKMLYD       
Sbjct: 231 -RSSGNPKALEMAENTLLAMKQGGIYDQVGGGLCRYSTDPRWTVPHFEKMLYDNSLFLET 289

Query: 389 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 448
             +  S++K +       D++ YL RDM    G I SAEDADS   EG    +EG FYVW
Sbjct: 290 LAECSSVSKKISAKSFALDVISYLHRDMRNEDGGICSAEDADS---EG----EEGLFYVW 342

Query: 449 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 508
             +E  ++ GE + + ++ + +   GN            F+GKN+L E +  S +A    
Sbjct: 343 DLEEFREVCGEDSRILEKFWNVTEKGN------------FEGKNILRE-SYPSGAAKFSE 389

Query: 509 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 568
               +  ++L   R KL + RSKR RP  DDK++ SWNGL   +  +A            
Sbjct: 390 EEWNRIDSVLERGRAKLLERRSKRIRPLRDDKILTSWNGLYTKALTKAG----------- 438

Query: 569 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 628
                V   +++++++AE   SFI R+L D    R+   FR+G S   G+ +DYA +I+ 
Sbjct: 439 -----VAFQKEDFLKLAEETYSFIERNLID-SNGRILRRFRDGESGILGYSNDYAEMIAS 492

Query: 629 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEP 687
            + L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +DG EP
Sbjct: 493 SIALFEAGRGIRYLKNAVLWMEEAIRLF--RSPAGVFFDTGNDGEVLLRRSVDGYDGVEP 550

Query: 688 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 747
           S NS  V +LV+L+  + G  S  YR+ AE   + F   L   ++  P +  A       
Sbjct: 551 SANSSLVYSLVKLS--LFGVDSARYRKFAESIFSYFTKELSSYSLGYPHLLSAYWTYRFH 608

Query: 748 SRKHVVLVGHKSSVDFENMLAAAHASY 774
           S K +VL+  K +   +++LA     +
Sbjct: 609 S-KEIVLI-RKDADSGKDLLAEIQTKF 633


>gi|425767540|gb|EKV06109.1| hypothetical protein PDIG_78870 [Penicillium digitatum PHI26]
 gi|425780454|gb|EKV18461.1| hypothetical protein PDIP_27280 [Penicillium digitatum Pd1]
          Length = 752

 Score =  427 bits (1099), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 262/636 (41%), Positives = 351/636 (55%), Gaps = 42/636 (6%)

Query: 118 HNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSI 177
           +NPV W  W  EA   A+K +  IFLSIGYS CHWCHVME ESF    VA +LN+ FV I
Sbjct: 2   NNPVAWQVWDAEAMELAKKHNRLIFLSIGYSACHWCHVMEKESFMSSEVASILNESFVPI 61

Query: 178 KVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYF--PPEDKYGRP--- 232
           KVDREERPD+D +YM YVQA  G GGWPL+VFL+PDL+P+ GGTY+  P    +  P   
Sbjct: 62  KVDREERPDIDDIYMNYVQATTGSGGWPLNVFLTPDLEPVFGGTYWQGPNSTTFTGPEAI 121

Query: 233 GFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK------LPDELPQNALR 286
           GF  IL K++D W  ++     S     +QL E       S +        +++    L 
Sbjct: 122 GFVEILEKLRDVWQTQQQRCLDSAKEITKQLREFAEEGTHSQQGDRDDDNDEDMDIELLE 181

Query: 287 LCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML----YHSKKLEDTGKSGEASEGQKMVL 342
              +  +  YDS  GGFG APKFP P  +  +L    Y ++ ++  G   E  +   M +
Sbjct: 182 EAYQHFASRYDSVNGGFGRAPKFPTPSNLSFLLRLGAYPTQVMDVVGHD-ECEQATAMAV 240

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  MA+GGI DH+G GF RYSV   W +PHFEKMLYDQ QL +VY+DAF LT D    
Sbjct: 241 TTLVNMARGGIRDHIGHGFARYSVTTDWGLPHFEKMLYDQAQLLDVYVDAFRLTHDPELL 300

Query: 403 YICRDILDYLRRDMI-GPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH- 460
               D+  YL    I  P G  FS+EDADS      T K+EGAFYVW+ KE+  +LG   
Sbjct: 301 GAVYDLAAYLTSAPIQSPTGGFFSSEDADSYPHPNDTEKREGAFYVWSLKELTSVLGPRD 360

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
           A +  +H+ + P GN  +    DPH+EF  +NVL      S  A   G+  E+ + I+  
Sbjct: 361 APVCAKHWGVLPDGN--VPPEYDPHDEFMNQNVLSIRATPSKLAKDFGLSEEEVVKIIKS 418

Query: 521 CRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
            ++KL D R + R RP LDDK+IV+WNGL I + A+ S +L  E ES+   +        
Sbjct: 419 SKQKLHDYRERSRGRPDLDDKIIVAWNGLAIGALAKCS-VLFEEIESSKAVY-------- 469

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP-SKAPGFLDDYAFLISGLLDLYEFGSG 638
              E A  A SFI+  L+D+ T +L   +R G     PGF DDYA+L SGLLD+Y+    
Sbjct: 470 -CREAAARAISFIKDKLFDKTTGQLWRIYRGGNRGDTPGFADDYAYLASGLLDMYDATYD 528

Query: 639 TKWLVWAIELQNTQDELFLDREGG---GYFNT----TGEDPSVLLRVKEDHDGAEPSGNS 691
             +L +A  LQ   +E FL + G    GY++T    T   P  LLR+K   + A PS N 
Sbjct: 529 DSYLQFAERLQKYLNEYFLAQSGSTATGYYSTPSVITPGMPGPLLRLKTGTESATPSVNG 588

Query: 692 VSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 727
           V   NL+RL++++   + + YR  A  +   F   +
Sbjct: 589 VIARNLLRLSALL---EDESYRTLARQTCNTFAVEI 621


>gi|53803351|ref|YP_114889.1| hypothetical protein MCA2477 [Methylococcus capsulatus str. Bath]
 gi|53757112|gb|AAU91403.1| conserved hypothetical protein [Methylococcus capsulatus str. Bath]
          Length = 679

 Score =  427 bits (1098), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 250/624 (40%), Positives = 345/624 (55%), Gaps = 59/624 (9%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           + +NRLA E SPYLLQHAHNPVDW+ WG EA  EAR+ D PI LSIGYS CHWCHVM  E
Sbjct: 5   QRSNRLAGETSPYLLQHAHNPVDWYPWGPEALEEARRSDRPILLSIGYSACHWCHVMAHE 64

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQAL-YGGGGWPLSVFLSP-DLKPL 217
           SFEDE  A+++N  FV+IKVDREERPD+D++Y T  Q L   GGGWPL+V L+P DL P 
Sbjct: 65  SFEDEATAEVMNRLFVNIKVDREERPDLDRIYQTVHQLLSRRGGGWPLTVCLNPHDLVPF 124

Query: 218 MGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLP 277
             GTYFP E +YG P F ++L  +   + + R  LA++G    E L EA+        +P
Sbjct: 125 FTGTYFPKEPRYGMPAFVSVLHHLAAFYAEHRGDLARNGQVLREAL-EAMGREGDGALMP 183

Query: 278 DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEG 337
           D      L    + L  S+D+  GGFG APKFPR  +++++L                EG
Sbjct: 184 D---AGLLARATQALRTSFDASHGGFGGAPKFPRTADLELLLRSD------------GEG 228

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
            +M+  TL  MA+GGI+DH+GGGF RYSVDERW +PHFEKMLYD G L  +Y    + T 
Sbjct: 229 VEMLRTTLDGMARGGIYDHLGGGFARYSVDERWEIPHFEKMLYDNGPLLELYARMAAQTG 288

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           D  Y+ +     +++ R+M  P G  ++A DADS   EG     EG FY+W  +EV+ +L
Sbjct: 289 DPAYAVVATGTAEWVIREMQSPEGGYYAALDADS---EGG----EGRFYLWDRQEVQGLL 341

Query: 458 -GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
             +  ++F   Y L    N            F+G   L       A A+  G   ++   
Sbjct: 342 SADEYLVFSLRYGLDGPPN------------FEGHWHLRVARSLEAVAAATGKGGDEVTR 389

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
           +L   R +L   R +R RP  DDKVI +WNGL++     A ++L                
Sbjct: 390 LLESARTRLRRAREQRVRPGRDDKVIAAWNGLMVRGMTVAGRLLG--------------- 434

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
            R ++ME A+ A  F+RR +  +   RL   +R+G ++   +LDD+AFL+   L++ +  
Sbjct: 435 -RADFMESADRALGFVRRTM--DAGGRLMSVYRDGRARFDAYLDDHAFLLDAALEILQTR 491

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
             T  L WA+ L +   E F D E GG+F T  +  +++ R K   D + PSGN V++  
Sbjct: 492 WSTDDLEWAVSLADRLLERFEDAEHGGFFFTAADHETLIQRPKPWMDESMPSGNGVAIRA 551

Query: 697 LVRLASIVAGSKSDYYRQNAEHSL 720
           L+RLA +   S+   Y   AE  L
Sbjct: 552 LIRLAGLTGESR---YADAAERGL 572


>gi|358063474|ref|ZP_09150085.1| hypothetical protein HMPREF9473_02147 [Clostridium hathewayi
           WAL-18680]
 gi|356698267|gb|EHI59816.1| hypothetical protein HMPREF9473_02147 [Clostridium hathewayi
           WAL-18680]
          Length = 682

 Score =  427 bits (1098), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 242/611 (39%), Positives = 337/611 (55%), Gaps = 61/611 (9%)

Query: 96  HSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHV 155
           + + +  NRL  E SPYLLQHA+NPV+W+ WG+E+F +A + D PIFLSIGYSTCHWCHV
Sbjct: 5   NGKERKPNRLIGEKSPYLLQHAYNPVEWYPWGKESFEKAEREDKPIFLSIGYSTCHWCHV 64

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           ME ESFE+EG+A ++N  FV +KVDREERPDVD VYM+  QA+ G GGWPL++ ++P+ +
Sbjct: 65  MEEESFENEGIAGIMNREFVCVKVDREERPDVDSVYMSVCQAMTGQGGWPLTIIMTPECR 124

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           P   GTY PP  +YGR G   +L  V   W + R  L +S     EQ+ +A     +   
Sbjct: 125 PFFAGTYLPPVRRYGRMGLAELLNSVAKQWKENRQQLFRSA----EQI-QAFLRQQTEMD 179

Query: 276 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
           +  E+ +  +    +QL +S+D   GGFG APKFP P       +H   L D G   +  
Sbjct: 180 VEGEVSKALVSQGYQQLERSFDEIHGGFGGAPKFPTP-------HHLLFLMDYGVRRDVP 232

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
           E   MV  TL  M +GGI DH+GGGF RYS DERW VPHFEKMLYD   L   Y  A+ +
Sbjct: 233 EAFYMVDRTLVQMYRGGIFDHIGGGFSRYSTDERWLVPHFEKMLYDNALLTLAYAKAYGI 292

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           T    Y+ +   IL Y++ ++   GG  +  +DADS          EG +YV+T +E+  
Sbjct: 293 TGKKLYAEVAGRILGYVKAELTDEGGGFYCGQDADSDGV-------EGKYYVFTPEEIRA 345

Query: 456 ILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 514
           +LG      F   Y +  +GN            F+GK +   L D      ++  P    
Sbjct: 346 VLGNADGERFLARYGMTGSGN------------FEGKWI-PNLLDYQGDLEEM-QP---- 387

Query: 515 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 574
                E  R+L++ R  R R H DDK++VSWNG +I++  RA  +L+ +A          
Sbjct: 388 -----EKDRRLYEYRLARARLHKDDKILVSWNGWMITACGRAGAVLEEDA---------- 432

Query: 575 GSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE 634
                 Y+E+A  A +F+R  L  +   RL   +R+G +   G LDDYA     L++LYE
Sbjct: 433 ------YVEMAVRAEAFLREKLVKD--GRLMVRYRDGEAAGEGKLDDYACYCQALVELYE 484

Query: 635 FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
               T +L  A EL +   E F D E GG++    +   +++R KE +DGA PSGNSV+ 
Sbjct: 485 VTYETDYLRRARELADVMVEQFFDGERGGFYLYAKDGEELIVRTKETYDGAMPSGNSVAA 544

Query: 695 INLVRLASIVA 705
           + L +L  I  
Sbjct: 545 LVLEQLGRITG 555


>gi|418738150|ref|ZP_13294546.1| PF03190 family protein [Leptospira borgpetersenii serovar
           Castellonis str. 200801910]
 gi|410746324|gb|EKQ99231.1| PF03190 family protein [Leptospira borgpetersenii serovar
           Castellonis str. 200801910]
          Length = 692

 Score =  427 bits (1098), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 261/648 (40%), Positives = 357/648 (55%), Gaps = 56/648 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           ++  NRL+ E SPYL QHA+NPVDWF WGEEA  +AR++D  IFLSIGY+TCHWCHVME 
Sbjct: 5   SRSPNRLSKEKSPYLQQHAYNPVDWFPWGEEALTKAREQDKLIFLSIGYATCHWCHVMEK 64

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFE++ VA  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++FL+PD KP+ 
Sbjct: 65  ESFENQMVADYLNSHFVSIKVDREERPDIDRIYMDALHAMDQQGGWPLNIFLTPDGKPIT 124

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
           GGTYFPPE  YGR  F  +L  ++  W +KR  L  + +     L ++    A   +   
Sbjct: 125 GGTYFPPEPGYGRKSFLEVLNILRKVWSEKRQELIVASSELSRYLKDSGEGRAIEKQEEG 184

Query: 279 ELPQNALRLCAEQLSKS-YDSRFGGFGS--APKFPRPVEIQMML-YHSKKLEDTGKSGEA 334
            LP          L +S YD+ FGGF +    KFP  + +  +L YH         S   
Sbjct: 185 SLPSKDCFNSGFSLYESYYDAEFGGFKTNHVNKFPPSMGLSFLLRYH--------HSSGN 236

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
            +  +MV  TL  M +GGI+D VGGG  RYS D RW VPHFEKMLYD        ++   
Sbjct: 237 PKALEMVENTLLAMKRGGIYDQVGGGLCRYSTDHRWMVPHFEKMLYDNSLFLETLVECSQ 296

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
           ++K +       D++ YL RDM   GG I SAEDADS   EG    +EG FY+W  +E  
Sbjct: 297 VSKKISAESFALDVISYLHRDMRIVGGGICSAEDADS---EG----EEGLFYIWDFEEFR 349

Query: 455 DILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 514
           ++ GE + + ++ + +   GN            F+GKN+L E       A+KL     K 
Sbjct: 350 EVCGEDSRILEKFWNVTNKGN------------FEGKNILHE--SYGGEATKLSEEEWKR 395

Query: 515 LN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
           ++ +L   R KL + RSKR RP  DDK++ SWNGL I + A+A                 
Sbjct: 396 IDSVLERARAKLLERRSKRVRPLRDDKILTSWNGLYIKALAKAG---------------- 439

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
           +   R++++++AE   SFI R+L D    R+   FR+G S   G+ +DYA +IS  + L+
Sbjct: 440 IAFRREDFLKLAEETYSFIERNLIDPDG-RILRRFRDGESGILGYSNDYAEMISSSIVLF 498

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSGNSV 692
           E G G ++L  A+        LF  R   G F  TG D  VLLR   D +DG EPS NS 
Sbjct: 499 EAGCGIRYLKNAVLWMEEAIRLF--RSPAGVFFDTGNDGEVLLRRSVDGYDGVEPSANSS 556

Query: 693 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 740
              +LV+L+  + G  S  YR+ AE   + F   L   +++ P +  A
Sbjct: 557 LAYSLVKLS--LLGIDSVRYRKFAELIFSYFTKELSTHSLSYPHLLSA 602


>gi|116327565|ref|YP_797285.1| hypothetical protein LBL_0795 [Leptospira borgpetersenii serovar
           Hardjo-bovis str. L550]
 gi|116120309|gb|ABJ78352.1| Conserved hypothetical protein containing a thioredoxin domain
           [Leptospira borgpetersenii serovar Hardjo-bovis str.
           L550]
          Length = 692

 Score =  427 bits (1098), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 268/682 (39%), Positives = 371/682 (54%), Gaps = 58/682 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           ++  NRL+ E SPYL QHA+NPVDWF WGEEA  +AR++D  IFLSIGY+TCHWCHVME 
Sbjct: 5   SRSPNRLSKEKSPYLQQHAYNPVDWFPWGEEALTKAREQDKLIFLSIGYATCHWCHVMEK 64

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFE++ VA  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++FL+PD KP+ 
Sbjct: 65  ESFENQMVADYLNSHFVSIKVDREERPDIDRIYMDALHAMDQQGGWPLNIFLTPDGKPIA 124

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
           GGTYFPPE  YGR  F  +L  ++  W +KR  L  + +     L ++    A   +   
Sbjct: 125 GGTYFPPEPVYGRKSFLEVLNILRKVWSEKRQELIVASSELSRYLKDSGEGRAIEKQEEG 184

Query: 279 ELPQNALRLCAEQLSKS-YDSRFGGFGS--APKFPRPVEIQMML-YHSKKLEDTGKSGEA 334
            LP          L +S YD+ FGGF +    KFP  + +  +L YH         S   
Sbjct: 185 SLPSKDCFNSGFSLYESYYDAEFGGFRTNHVNKFPPSMGLSFLLRYH--------HSSGN 236

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
            +  +MV  TL  M +GGI+D VGGG  RYS D RW VPHFEKMLYD        ++   
Sbjct: 237 PKALEMVENTLLAMKRGGIYDQVGGGLCRYSTDHRWMVPHFEKMLYDNSLFLETLVECSQ 296

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
           ++K +       D++ YL RDM   GG I SAEDADS   EG    +EG FY+W  +E  
Sbjct: 297 VSKKISAESFALDVISYLHRDMRIVGGGICSAEDADS---EG----EEGLFYIWDFEEFR 349

Query: 455 DILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 514
           ++ GE + + ++ + +   GN            F+GKN+L E       A+KL     K 
Sbjct: 350 EVCGEDSRILEKFWNVTNKGN------------FEGKNILHE--SYGGEATKLSEEEWKR 395

Query: 515 LN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
           ++ +L   R KL + RSKR RP  DDK++ SWNGL I + A+A                 
Sbjct: 396 IDSVLERARAKLLERRSKRVRPLRDDKILTSWNGLYIKALAKAG---------------- 439

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
           +   R++++++AE   SFI R+L D    R+   FR+  S   G+ +DYA +IS  + L+
Sbjct: 440 IAFQREDFLKLAEETYSFIERNLIDPDG-RILRRFRDSESGILGYSNDYAEMISSSIVLF 498

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSGNSV 692
           E G G ++L  A+        LF  R   G F  TG D  VLLR   D +DG EPS NS 
Sbjct: 499 EAGCGIRYLKNAVLWMEEAIRLF--RSPAGVFFDTGNDGEVLLRRSVDGYDGVEPSANSS 556

Query: 693 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHV 752
              +LV+L+  + G  S  YR+ AE   + F   L   +++ P +  A       S K +
Sbjct: 557 LAYSLVKLS--LLGIDSVRYRKFAELIFSYFTKELSTHSLSYPHLLSAYWTYKYHS-KEI 613

Query: 753 VLVGHKSSVDFENMLAAAHASY 774
           VL+  K +   +++LAA    +
Sbjct: 614 VLI-RKDANSGKDLLAAIQTRF 634


>gi|452209206|ref|YP_007489320.1| hypothetical protein MmTuc01_0632 [Methanosarcina mazei Tuc01]
 gi|452099108|gb|AGF96048.1| hypothetical protein MmTuc01_0632 [Methanosarcina mazei Tuc01]
          Length = 690

 Score =  427 bits (1097), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 247/683 (36%), Positives = 361/683 (52%), Gaps = 55/683 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
            K  NRL  E SPYLLQHA+NPVDW+ WGEEAF +ARK + P           WCH+M  
Sbjct: 8   QKEPNRLIKEKSPYLLQHAYNPVDWYPWGEEAFEKARKENKP----------DWCHMMAH 57

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE VA L+N+ FVSIKVDREERPD+D +YMT  Q + G GGWPL++ ++P  KP  
Sbjct: 58  ESFEDEEVAGLMNEAFVSIKVDREERPDIDNIYMTVCQIILGRGGWPLNIIMTPGKKPFF 117

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTY P   ++ + G   ++ ++K+ W+++ + +  S       + E +  S+       
Sbjct: 118 AGTYIPKNTRFNQIGMLELVPRIKEIWEQQHEEVLDSAEKITSTIQEMIKESSGEG---- 173

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
            L +  +    E+L  S+D+ +GGF  APKFP P +I  +L + ++  +        E  
Sbjct: 174 -LGEEVIEEVYEELLSSFDTEYGGFSGAPKFPTPHKISFLLRYWRRSRN-------PEAL 225

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
            M  +TL  M +GGI+DH+G GFHRYS D  W +PHFEKMLYDQ   A  Y +A+ +T  
Sbjct: 226 HMAEYTLDKMRRGGIYDHLGSGFHRYSTDSMWLLPHFEKMLYDQALTAIAYTEAYQVTGK 285

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             Y      ILDY+ RD+  P G  +  EDAD         ++EG +Y+WT +E+  IL 
Sbjct: 286 DLYKETAEGILDYVLRDLTSPEGGFYCGEDAD-------VEREEGKYYLWTLEEIRSILD 338

Query: 459 -EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
            E + L  + + L+  GN +     +      G N+        + A+K+ +P+E+    
Sbjct: 339 PEDSELIIKMFNLREEGNFE----EEIRGRETGTNLFYMARSPGSLAAKMKIPVEEVEKK 394

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
           +   R KL   R +R RP LDDK++  WNGL+I++FA+               + V G  
Sbjct: 395 VKAAREKLLKARYERKRPSLDDKILTDWNGLMIAAFAKG--------------YQVFGEQ 440

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGS 637
           R  Y++ AE AA FI   LY      L H +R+G +   G  DDYAFLI GLL+LYE G 
Sbjct: 441 R--YLKAAEKAADFILMALYS-PGDGLLHRYRDGVAGISGTSDDYAFLIHGLLELYEAGF 497

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
             ++L  A+ L +   E F D   GG + T  +  +++ R KE  D A P+GNS  ++NL
Sbjct: 498 KMRYLKAAVSLNSELLECFWDPVNGGLYFTANDSEALIFRKKEFMDSAIPTGNSFEMLNL 557

Query: 698 VRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGH 757
           +RL+ I+A    +   + A+     F  ++            A D    PS + V++ G 
Sbjct: 558 LRLSRIIADPGLE---ETADKLERAFSKQIMKAPSGYTQFLSAFDFRLGPSYE-VIISGK 613

Query: 758 KSSVDFENMLAAAHASYDLNKTV 780
             + D E ML    + +  NK +
Sbjct: 614 AEASDTEQMLKELWSYFVPNKVL 636


>gi|337293410|emb|CCB91399.1| uncharacterized protein yyaL [Waddlia chondrophila 2032/99]
          Length = 691

 Score =  426 bits (1096), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 248/606 (40%), Positives = 340/606 (56%), Gaps = 59/606 (9%)

Query: 101 HTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVES 160
           +TNRL  + SPYLLQHAHNPVDW  WGEEAF +A++ + PIFLSIGY+TCHWCHVME ES
Sbjct: 7   YTNRLITQKSPYLLQHAHNPVDWHPWGEEAFEKAKELNKPIFLSIGYATCHWCHVMEEES 66

Query: 161 FEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALY-GGGGWPLSVFLSPDLKPLMG 219
           F++  VA+ LN  F++IKVDREE P+VD++YM + QAL     GWPL+VFL+PDL P   
Sbjct: 67  FQNLEVAEQLNRAFINIKVDREELPEVDQLYMDFAQALMPNSAGWPLNVFLTPDLLPFFA 126

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAW-DKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            TY PP +  G PG   +++ + + W  K  D +       ++   + +        LPD
Sbjct: 127 TTYLPPRNASGLPGMIDLIQHIHELWIGKGHDQILMQAQQIVDLFQQNIQVYGID--LPD 184

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
              +  + L  + L +  D  +GG   APKFP   +  + L H   LE  G+        
Sbjct: 185 ---RKCVPLAVDTLLQISDPVWGGVKGAPKFPIGYQY-VFLMHYSALEKDGRP------M 234

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
            +V  TL+ M +GGI+DH+G GF RYS+DE+W +PHFEKMLYD   LA  Y +A+  TK 
Sbjct: 235 FLVEKTLELMYRGGIYDHLGSGFSRYSIDEQWQIPHFEKMLYDNALLAECYCEAWKATKR 294

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             +  +C +++DY+   + G  G   SAEDADS   EG     EG FY WT  E++D+LG
Sbjct: 295 SLHRRVCCEVIDYVLSKLTGEQGAFLSAEDADS---EGV----EGKFYTWTMDEIDDVLG 347

Query: 459 -EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLG-MPLEKY 514
            + + LF   Y    TGN            F+GKN+  L  L +  AS +++    LE  
Sbjct: 348 SDDSELFCSVYGATATGN------------FEGKNILHLPALLEHYASDNQMDHFELEAR 395

Query: 515 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 574
              + E + KL+ VR KR  P  DDKV+ SWNGL+I S   A K  +             
Sbjct: 396 ---IAELKEKLYKVREKRGHPLKDDKVLSSWNGLMIHSIVEAGKAFEI------------ 440

Query: 575 GSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE 634
                 Y++    AA FI  HL+  +  RL   +R G     G LDDYAF+I   L L+E
Sbjct: 441 ----SRYVDAGRRAARFIYGHLW--KNGRLLRRYREGKVDFSGGLDDYAFMIRASLTLFE 494

Query: 635 FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
            G GT+WL WA  ++    + F   EGG ++ T G+DP++++R     DGAEPSGN+V  
Sbjct: 495 AGCGTEWLEWAFSMERVLRDAF-KAEGGAFYQTDGKDPNLIIRQCLFADGAEPSGNAVHC 553

Query: 695 INLVRL 700
            NL+R+
Sbjct: 554 ENLLRI 559


>gi|296816653|ref|XP_002848663.1| DUF255 domain-containing protein [Arthroderma otae CBS 113480]
 gi|238839116|gb|EEQ28778.1| DUF255 domain-containing protein [Arthroderma otae CBS 113480]
          Length = 781

 Score =  426 bits (1096), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 255/657 (38%), Positives = 366/657 (55%), Gaps = 47/657 (7%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRL+   SPY+  H +NPV W  W   A   A+  +  IFLSIGYS CHWCHVME ESF
Sbjct: 23  VNRLSESRSPYVRGHMNNPVAWQLWDSTAMNLAKDFNRLIFLSIGYSACHWCHVMEKESF 82

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
               VA +LN  F+ IK+DREERPD+D VYM YVQA  G GGWPL+VFL+PDL+P+ GGT
Sbjct: 83  MSLEVAAILNKSFIPIKLDREERPDIDDVYMNYVQATTGSGGWPLNVFLTPDLEPVFGGT 142

Query: 222 YFPPEDKYGRP--------GFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS----- 268
           Y+P  +    P        GF  +L K++D W+ ++    +S      QL E        
Sbjct: 143 YWPGPNATPLPKLGGEEPVGFIDVLEKLRDVWNTQQLRCRESAKEITRQLREFAEEGTHL 202

Query: 269 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK---KL 325
           A A+  +  ++L    L       +  YD+  GGF ++PKFP PV +  +L  S+   ++
Sbjct: 203 AQANKKEQMEDLEIELLEEAFVHFAARYDATNGGFSTSPKFPTPVNLSFLLRLSRYPEEV 262

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
            D     E ++  +M + TL  +A+GGI D +G GF RYSV   W +PHFEKMLYDQ QL
Sbjct: 263 MDIVGREECTKATEMAVNTLIKVARGGIRDQIGYGFSRYSVTPDWSLPHFEKMLYDQAQL 322

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAETEGATRKKEGA 444
            +VY+D F  + +        D++ Y+    ++ P G  +S+EDADS  +   T K+EGA
Sbjct: 323 LDVYIDGFEASHEPELLGAIYDLVTYITSPPILSPMGCFYSSEDADSQPSPDDTDKREGA 382

Query: 445 FYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 503
           +YVWT KE++ ILG   A +   H+ + P GN  ++R++DPH+EF  +NVL      +  
Sbjct: 383 YYVWTLKELKQILGHRDADVCARHWGVLPDGN--VARVNDPHDEFMNRNVLRIATTPAQV 440

Query: 504 ASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFARASKILKS 562
           A + G+  E+ + IL   R KL + R +KR RP LDDK+IVSWNGLVI + A+ + +L+ 
Sbjct: 441 AKEFGLHEEETIRILKNSRVKLREYRETKRVRPELDDKIIVSWNGLVIGALAKCAILLED 500

Query: 563 -EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFR-NGPSKAPGFLD 620
            +AE +           K    +A +A  FI+ +L D ++ +L   +R +     PGF D
Sbjct: 501 IDAEKS-----------KHCKLMASNAVKFIKENLLDAESGQLWRIYRADSRGNTPGFAD 549

Query: 621 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG------GYFNTTGE---- 670
           DYA+LISGL+ LYE      +L +A +LQ   ++ F+           GY+ T  E    
Sbjct: 550 DYAYLISGLIQLYEATFDDSYLQFADKLQQYLNKYFISVSTSDSSICTGYYMTPSEAVTN 609

Query: 671 DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 727
            PS L R+K   D A PS N V   NL+RL+S++   + + Y+  A  +   F   +
Sbjct: 610 TPSALFRLKTGTDSATPSTNGVIAQNLLRLSSLL---EDESYKVKARQTCNAFAVEI 663


>gi|116331824|ref|YP_801542.1| hypothetical protein LBJ_2312 [Leptospira borgpetersenii serovar
           Hardjo-bovis str. JB197]
 gi|116125513|gb|ABJ76784.1| Conserved hypothetical protein containing a thioredoxin domain
           [Leptospira borgpetersenii serovar Hardjo-bovis str.
           JB197]
          Length = 692

 Score =  426 bits (1096), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 267/682 (39%), Positives = 371/682 (54%), Gaps = 58/682 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           ++  NRL+ E SPYL QHA+NPVDWF WGEEA  +AR++D  IFLSIGY+TCHWCHVME 
Sbjct: 5   SRSPNRLSKEKSPYLQQHAYNPVDWFPWGEEALTKAREQDKLIFLSIGYATCHWCHVMEK 64

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFE++ VA  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++FL+PD +P+ 
Sbjct: 65  ESFENQMVADYLNSHFVSIKVDREERPDIDRIYMDALHAMDQQGGWPLNIFLTPDGRPIA 124

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
           GGTYFPPE  YGR  F  +L  ++  W +KR  L  + +     L ++    A   +   
Sbjct: 125 GGTYFPPEPVYGRKSFLEVLNILRKVWSEKRQELIVASSELSRYLKDSGEGRAIEKQEEG 184

Query: 279 ELPQNALRLCAEQLSKS-YDSRFGGFGS--APKFPRPVEIQMML-YHSKKLEDTGKSGEA 334
            LP          L +S YD+ FGGF +    KFP  + +  +L YH         S   
Sbjct: 185 SLPSKDCFNSGFSLYESYYDAEFGGFRTNHVNKFPPSMGLSFLLRYH--------HSSGN 236

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
            +  +MV  TL  M +GGI+D VGGG  RYS D RW VPHFEKMLYD        ++   
Sbjct: 237 PKALEMVENTLLAMKRGGIYDQVGGGLCRYSTDHRWMVPHFEKMLYDNSLFLETLVECSQ 296

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
           ++K +       D++ YL RDM   GG I SAEDADS   EG    +EG FY+W  +E  
Sbjct: 297 VSKKISAESFALDVISYLHRDMRIVGGGICSAEDADS---EG----EEGLFYIWDFEEFR 349

Query: 455 DILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 514
           ++ GE + + ++ + +   GN            F+GKN+L E       A+KL     K 
Sbjct: 350 EVCGEDSRILEKFWNVTNKGN------------FEGKNILHE--SYGGEATKLSEEEWKR 395

Query: 515 LN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
           ++ +L   R KL + RSKR RP  DDK++ SWNGL I + A+A                 
Sbjct: 396 IDSVLERARAKLLERRSKRVRPLRDDKILTSWNGLYIKALAKAG---------------- 439

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
           +   R++++++AE   SFI R+L D    R+   FR+  S   G+ +DYA +IS  + L+
Sbjct: 440 IAFQREDFLKLAEETYSFIERNLIDPDG-RILRRFRDSESGILGYSNDYAEMISSSIVLF 498

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSGNSV 692
           E G G ++L  A+        LF  R   G F  TG D  VLLR   D +DG EPS NS 
Sbjct: 499 EAGCGIRYLKNAVLWMEEAIRLF--RSPAGVFFDTGNDGEVLLRRSVDGYDGVEPSANSS 556

Query: 693 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHV 752
              +LV+L+  + G  S  YR+ AE   + F   L   +++ P +  A       S K +
Sbjct: 557 LAYSLVKLS--LLGIDSVRYRKFAELIFSYFTKELSTHSLSYPHLLSAYWTYKYHS-KEI 613

Query: 753 VLVGHKSSVDFENMLAAAHASY 774
           VL+  K +   +++LAA    +
Sbjct: 614 VLI-RKDANSGKDLLAAIQTRF 634


>gi|421108799|ref|ZP_15569331.1| PF03190 family protein [Leptospira kirschneri str. H2]
 gi|410006082|gb|EKO59855.1| PF03190 family protein [Leptospira kirschneri str. H2]
          Length = 688

 Score =  426 bits (1095), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 261/682 (38%), Positives = 372/682 (54%), Gaps = 61/682 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           +++ NRL+ E SPYL QH++NPVDWF WGEEA   A+ +D  IFLS+GY+TCHWCHVME 
Sbjct: 5   SRNPNRLSKEKSPYLQQHSYNPVDWFPWGEEALTRAKDQDKLIFLSVGYATCHWCHVMEK 64

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++FL+P+ +P+ 
Sbjct: 65  ESFENQSIADYLNSHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNMFLTPEGQPIT 124

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
           GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  + A   +  D
Sbjct: 125 GGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGESRAKEKQEAD 184

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKKLEDTGKSGEA 334
             P+N            YDS+FGGF +    KFP  + +  +L  YHS        SG  
Sbjct: 185 FPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS--------SGNP 236

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
           +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD      +  +   
Sbjct: 237 N-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSLFLEILAEYSL 295

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
           ++K +       DI+ YL RDM   GG I SAED+DS   EG    +EG FY+W  +E  
Sbjct: 296 VSKKISAKSFALDIVSYLHRDMRMDGGGICSAEDSDS---EG----EEGLFYIWDLEEFR 348

Query: 455 DILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 514
           ++ GE + L ++ + +   GN            F+GKN+L E    +   S       K+
Sbjct: 349 EVCGEDSSLLEKFWNVTKEGN------------FEGKNILHE----NFRGSNFTEEESKH 392

Query: 515 LN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
           L+  L   + KL + RSKR RP  DDK++ SWNGL I +  +                  
Sbjct: 393 LDGALTRGKAKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG---------------- 436

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
           +   R++++++AE   SFI ++L D +  R+   FR G S   G+ +DYA +I+  + L+
Sbjct: 437 IAFQREDFLKLAEETYSFIEKNLIDSKG-RILRRFREGESGILGYSNDYAEMIASSIVLF 495

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSGNSV 692
           E G G ++L  A+        LF  R   G F  TG D  VLLR   D +DG EPS NS 
Sbjct: 496 EAGRGVRYLQNAVLWMEETIRLF--RSTAGVFFDTGIDGEVLLRRSVDGYDGVEPSANSS 553

Query: 693 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHV 752
              +LV+L+ +  G  SD YR+ AE     F   L   A+  P +  A       SR+ V
Sbjct: 554 LAHSLVKLSFL--GVNSDRYREVAESIFLYFRKELYSSALIYPFLLSAYWSYKHHSREIV 611

Query: 753 VLVGHKSSVDFENMLAAAHASY 774
           ++   K+S    ++LA   + +
Sbjct: 612 LI--RKNSEAGRDLLAWIQSRF 631


>gi|381206676|ref|ZP_09913747.1| hypothetical protein SclubJA_13745 [SAR324 cluster bacterium
           JCVI-SC AAA005]
          Length = 693

 Score =  426 bits (1095), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 255/678 (37%), Positives = 370/678 (54%), Gaps = 58/678 (8%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRL  + SPYLLQHAHNPVDWF W +EAF +A+     I +SIGY+TCHWCHVME ESF
Sbjct: 5   TNRLIDQKSPYLLQHAHNPVDWFPWCQEAFDKAKSEQKLILVSIGYATCHWCHVMERESF 64

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           ED   A  LN  FV++KVDREERPD+D+V+M  + AL   GGWPL++F +PD +P  GGT
Sbjct: 65  EDLETADYLNRNFVAVKVDREERPDIDQVFMDALHALGEQGGWPLNMFATPDGRPFTGGT 124

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFPP+  YGR  F+ IL  ++  W +++  + ++     +Q++  L  + +   L + LP
Sbjct: 125 YFPPKPMYGRQSFRQILESLRYYWQEEKAKIHETA----DQVTAYLRRAPAPQPLDEPLP 180

Query: 282 Q-NALRLCAEQLSKSYDSRFGGFG--SAPKFPRPVEIQMML-YHSKKLEDTGKSGEASEG 337
           Q N +    +   +++DS  GGF      KFP  + +Q++L YH +              
Sbjct: 181 QWNCVEETVQAYRQAFDSEDGGFALQRPNKFPPSMGLQLLLRYHLRT--------RIPSD 232

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
             MV  TL  M  GGI+D VGGG  RYS D RW VPHFEKMLYD    A   L+ F +T 
Sbjct: 233 LFMVELTLFKMRNGGIYDQVGGGLCRYSTDYRWLVPHFEKMLYDNALFAQTSLECFQVTS 292

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           + FY  I  DI  Y+ RDM+       SAEDADS   EG     EG FY+WT+ E +  +
Sbjct: 293 NPFYREIAEDIFQYVTRDMMAESSAFCSAEDADS---EG----HEGLFYLWTADEFKKTV 345

Query: 458 -GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
             +++     ++ + P GN            F+G+N+L     +     +LG+   ++  
Sbjct: 346 EDKYSDSLANYWNVTPQGN------------FEGRNILNVSQSTKVFGEQLGLEENEWQT 393

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
           I+   R  L DVR++R RP  DDK++VSWN L+ISSFA+A++IL                
Sbjct: 394 IIKSARSNLQDVRAQRIRPLKDDKILVSWNALMISSFAQAARIL---------------- 437

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
           +  EY   A +A +FI  HL + Q  RL   +R+G +K P +L DYA L    LD+Y + 
Sbjct: 438 EHNEYGITANNALAFIEEHLIN-QEGRLLRRYRDGDAKFPAYLSDYAQLGLACLDIYAWN 496

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
              ++++ A    N  + LFL+ + G YF T  +   VL+R  + +DG EPSGN+ + + 
Sbjct: 497 YEPQYVLKAHHWANEINRLFLNPD-GAYFETGFDAEEVLVRKADGYDGVEPSGNTSTALL 555

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVG 756
            ++LAS   GS      ++AE  L  F   L    +    M  A  + +      +V+ G
Sbjct: 556 FLKLASFGMGSG---LLRDAERILHSFSPHLHQAGVNFSAMLNAL-IWARKGGTEIVVSG 611

Query: 757 HKSSVDFENMLAAAHASY 774
            +S+++ + +L     S+
Sbjct: 612 DESNLETKEVLQWLRQSF 629


>gi|338532946|ref|YP_004666280.1| hypothetical protein LILAB_16495 [Myxococcus fulvus HW-1]
 gi|337259042|gb|AEI65202.1| hypothetical protein LILAB_16495 [Myxococcus fulvus HW-1]
          Length = 696

 Score =  426 bits (1095), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 256/653 (39%), Positives = 350/653 (53%), Gaps = 49/653 (7%)

Query: 93  STSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHW 152
           +T  +    +NRLA E SPYL QHAHNPVDWF WGEEA A A+  + PI LS+GYS CHW
Sbjct: 2   ATPPASPDTSNRLAREPSPYLRQHAHNPVDWFPWGEEALARAKAENKPILLSVGYSACHW 61

Query: 153 CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 212
           CHVM  ESFE    A+L+N+ F++IKVDREERPD+D++Y   VQ +  GGGWPL+VFL+P
Sbjct: 62  CHVMAHESFESPETARLMNEGFINIKVDREERPDLDQIYQGVVQLMGQGGGWPLTVFLTP 121

Query: 213 DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASAS 272
           DLKP  GGTYFPP+D+YGRPGF  +L  ++DAW+ K+D + +  A   E L E   A+  
Sbjct: 122 DLKPFYGGTYFPPQDRYGRPGFPRLLGALRDAWENKQDEVQRQAAQFEEGLGEL--ATYG 179

Query: 273 SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSG 332
            +  P  L    +    + ++K  D   GGFG APKFP P+   +ML   ++       G
Sbjct: 180 LDAAPSALTAADVVAMGQGMAKQVDPAHGGFGGAPKFPNPMNFALMLRAWRR-------G 232

Query: 333 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 392
             +  +  V  TL+ MA GGI+D +GGGFHRYSVD RW VPHFEKMLYD  QL ++Y  A
Sbjct: 233 GGAPLKDAVFLTLERMALGGIYDQLGGGFHRYSVDARWRVPHFEKMLYDNAQLLHLYAQA 292

Query: 393 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 452
             +     +  +  + + Y+RR+M   GG  ++A+DADS   EG    +EG F+VW  +E
Sbjct: 293 QQVEPRPLWRKVVEETVAYVRREMTDAGGGFYAAQDADS---EG----EEGKFFVWRPEE 345

Query: 453 VEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPL 511
           V   L E  A L   H+ +KP GN +            G  VL  +   +  A + G+  
Sbjct: 346 VRAALPEAQAELVLRHFGIKPEGNFE-----------HGATVLEVVVPVAELARERGLSE 394

Query: 512 EKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNF 571
           +     L   R+ LF+ R +R +P  DDK++  WNGL+I   A A+++            
Sbjct: 395 DAVARALAAARQTLFEARERRVKPGRDDKLLSGWNGLMIRGLALAARVF----------- 443

Query: 572 PVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLD 631
                +R E+   A  AA F+    +D    RL  S++ G ++  GFL+DY  L SGL  
Sbjct: 444 -----ERPEWATWAAEAADFVLAKAWD--GTRLARSYQEGQARIDGFLEDYGDLASGLTA 496

Query: 632 LYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNS 691
           LY+     K+L  A  L      LF D E   Y         +++      D A PSG S
Sbjct: 497 LYQATFDVKYLEAADALVRRAVALFWDAEKAAYLTAPRGQKDLVVATYGLFDNASPSGAS 556

Query: 692 VSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 744
                 V LA++  G K   + +  E  +A     L   AM    +  AAD L
Sbjct: 557 TLTEAQVELAALT-GDKQ--HLELPERYVARMREGLVRNAMGYGYLGLAADAL 606


>gi|320160551|ref|YP_004173775.1| hypothetical protein ANT_11410 [Anaerolinea thermophila UNI-1]
 gi|319994404|dbj|BAJ63175.1| hypothetical protein ANT_11410 [Anaerolinea thermophila UNI-1]
          Length = 684

 Score =  426 bits (1094), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 260/656 (39%), Positives = 355/656 (54%), Gaps = 58/656 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYLLQHA NPVDW+ WG+EAF +AR+ + P+FLSIGY+ CHWCHVM  ESFE
Sbjct: 3   NRLIHETSPYLLQHATNPVDWYPWGDEAFEKARRENKPVFLSIGYAACHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  +A++LN  FVSIKVDREERPDVD +YM  V AL G GGWPLSVFL+P+ KP  GGTY
Sbjct: 63  DPQIAEILNQHFVSIKVDREERPDVDGIYMNAVIALTGQGGWPLSVFLTPEGKPFYGGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD-ELP 281
           FPP  ++G P F+ +L     AW+  RD L ++G    EQL++ + A      +P   L 
Sbjct: 123 FPPTPRHGLPAFRDVLHAALQAWENDRDDLFKAG----EQLAQHIHAMNDWGSVPGLVLR 178

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
            N L      L  SYD R+GG+G+AP+FP+P+ ++ +L    +  +        +  K V
Sbjct: 179 ANLLEQVTHALLASYDRRYGGWGNAPRFPQPMALEFLLLQVTRGNE--------DALKPV 230

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
              LQ M++GG++D +GGGF RYS D  W VPHFEKMLYD  Q+++VYL A  L K+ ++
Sbjct: 231 EHNLQVMSRGGLYDIIGGGFARYSTDNHWLVPHFEKMLYDNAQISSVYLHAGMLEKNPWF 290

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHA 461
             I    LD+L  +M  P G  FS+ DADS   EG    +EG FY+W   E+  I     
Sbjct: 291 LRIATQTLDFLLEEMRHPLGGFFSSLDADS---EG----EEGKFYLWDFDELRQI----- 338

Query: 462 ILFKEHYYLKPTGNCDLS--RMSDPHN-EFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
                   L+P G  D S    + P N  F+GK +L    D      K G+    +L  +
Sbjct: 339 --------LEPAGQWDFSCQVFNLPRNGNFEGKIILQIQEDWERLPEKTGLSETDFLKQM 390

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
              R  L+  RS R RP  DDKVIVSWNG  + + A A++ L                +R
Sbjct: 391 DTVRALLYQKRSLRVRPSTDDKVIVSWNGFALRALAEAARYL----------------NR 434

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
            +Y+  A+  A F+  +LY  +   L  ++R G  +    L+DYA LI GLL LY+    
Sbjct: 435 PDYLHAAQQNAHFLLENLYTPRG--LMRTWREGSPRQIALLEDYASLIIGLLALYQSDDN 492

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
             W  WA++L       + D   GG+++T  +   +++R K+  D A P GNS++   L+
Sbjct: 493 IVWYEWAVKLGEEMISRYRD-PAGGFYDTRDDQQDLIIRPKDFQDNATPCGNSLASYALL 551

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVL 754
            L      S  D   Q A     + +  L     A      A D    PSR+  +L
Sbjct: 552 LLYEF---SGDDSIYQLATRVFPLLQDSLVKYPTAFGFWLQAIDWAMGPSRQVALL 604


>gi|320334089|ref|YP_004170800.1| hypothetical protein [Deinococcus maricopensis DSM 21211]
 gi|319755378|gb|ADV67135.1| hypothetical protein Deima_1486 [Deinococcus maricopensis DSM
           21211]
          Length = 674

 Score =  425 bits (1093), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 244/597 (40%), Positives = 322/597 (53%), Gaps = 55/597 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL    SPYL QHA NPVDW+ WG+EAF  AR+RDVPI LS+GY+TCHWCHVM  ESFE
Sbjct: 2   NRLGNATSPYLQQHADNPVDWYEWGDEAFRAARERDVPILLSVGYATCHWCHVMAHESFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D   A  +N+ FV++KVDRE+RPDVD VYM  VQA+ G GGWP++VFL+PD +P   GTY
Sbjct: 62  DAQTAAFMNEHFVNVKVDREQRPDVDAVYMRAVQAMTGAGGWPMTVFLAPDRRPFYAGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA--SASSNKLPDEL 280
           FPP D YG P F+T+L  V +AW  +RD L    A A+ +   A+SA   A+   LP++ 
Sbjct: 122 FPPRDAYGMPSFRTVLASVANAWADRRDQL-LGNADALTEHVRAMSAPKPAADGALPEDF 180

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
               L    +   +++D+R GGFGSAPKFP P  +  +L                +G+ M
Sbjct: 181 APRGL----DNARRTFDARHGGFGSAPKFPAPTFLTYLLTQ-------------PDGRDM 223

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
            + TL  M +GG+ D +GGGFHRYSVDERW VPHFEKMLYD  QL   YL A  +T    
Sbjct: 224 AVRTLDAMMRGGLMDQLGGGFHRYSVDERWLVPHFEKMLYDNAQLVRAYLRAHVVTGRAD 283

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
           +    R  L Y+ R+++ P G    A+DAD    EG     EG F+VWT +E  D+LG  
Sbjct: 284 FLDTARATLAYMERELLTPEGGFACAQDADQ---EGI----EGKFFVWTPQEFRDLLGAD 336

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHN-EFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
           A L   HY +   GN       DPH+  F  ++VL  + D    A    +  +     LG
Sbjct: 337 ADLALRHYGVTDAGN-----FQDPHHPAFGRRSVLSVVTDVPELARAFSLGEDDVRARLG 391

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
             R  LF  R  R  P LDDKV+ SWNGL + +FA A ++                +   
Sbjct: 392 RARETLFSARRARAHPGLDDKVLTSWNGLALMAFADAYRL----------------TGET 435

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
            Y++VA   A F+R  L       L H++R   +   G L+D A    GL+ LY      
Sbjct: 436 HYLDVARRNADFVRARLTAPDGAPL-HAYR---ADVRGLLEDAALYGLGLVALYAAAGNL 491

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR-VKEDHDGAEPSGNSVSVI 695
           + L WA  L +       D +  G F ++G D   L+    E  D A  S N+ + +
Sbjct: 492 EHLQWARALWDRARRDHWD-DAAGVFYSSGPDAEALVAPTTETFDAAIMSDNAAACL 547


>gi|282889930|ref|ZP_06298465.1| hypothetical protein pah_c008o011 [Parachlamydia acanthamoebae str.
           Hall's coccus]
 gi|338175432|ref|YP_004652242.1| hypothetical protein PUV_14380 [Parachlamydia acanthamoebae UV-7]
 gi|281500123|gb|EFB42407.1| hypothetical protein pah_c008o011 [Parachlamydia acanthamoebae str.
           Hall's coccus]
 gi|336479790|emb|CCB86388.1| uncharacterized protein yyaL [Parachlamydia acanthamoebae UV-7]
          Length = 692

 Score =  424 bits (1091), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 246/611 (40%), Positives = 343/611 (56%), Gaps = 60/611 (9%)

Query: 101 HTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVES 160
           +TNRL  + SPYLLQHAHNPVDW+ WG+EAF  A++ D PIFLS+GY+TCHWCHVME ES
Sbjct: 7   YTNRLIHQKSPYLLQHAHNPVDWYPWGDEAFLAAKEADKPIFLSVGYATCHWCHVMEQES 66

Query: 161 FEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALY-GGGGWPLSVFLSPDLKPLMG 219
           FE+  VA+ LN+ F++IKVDREE P+VD +YM + Q++  G  GWPL+V L+PDL P   
Sbjct: 67  FENLEVAQALNEAFINIKVDREELPEVDSLYMEFAQSMMSGAAGWPLNVILTPDLYPFFA 126

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAW--DKKRDMLAQSGAFAIEQLSEALS--ASASSNK 275
            TY PP + +G  G   ++ ++ +AW  D++  +L QS     E++ E        S   
Sbjct: 127 ATYLPPVNSHGLIGMLELVERIHEAWQGDERERILMQS-----EKIVEVFEQHVHTSGEL 181

Query: 276 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
           LP   P   +    E L K  D   GG   APKFP   +   +L +S + +D       S
Sbjct: 182 LP---PPEVIEKTIEMLIKLADPVNGGMKGAPKFPIAYQSVFLLRYSMEKKD-------S 231

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
               +V  TL+ M +GGI+DH+GGGF RYSVDE W +PHFEKMLYD   LA+ Y +A+  
Sbjct: 232 RPLFLVERTLEMMRRGGIYDHLGGGFSRYSVDEAWQIPHFEKMLYDNALLADCYFEAWQA 291

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT--SKEV 453
           T++  Y  +C +IL Y+ RDM    G  +SAEDADS   EG     EG FY WT    E 
Sbjct: 292 TQNPQYKKVCEEILHYVLRDMSHFRGGFYSAEDADS---EG----HEGRFYTWTLEEVEE 344

Query: 454 EDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
                  + LF  ++ + P GN            F+G+NVL         A K+GM  ++
Sbjct: 345 LLGGENESELFVHYFDITPEGN------------FEGRNVLHTPLSLEEFAKKMGMDAQQ 392

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
              +  E +  L+  R KR  P  DDK++ +WNGL+I + A A                 
Sbjct: 393 LDLLFTEQKHILWKAREKRVHPFKDDKILTAWNGLMIQAMAEAG---------------C 437

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
              D++ ++  A+++A FI+  L++E  H L   +R+  +     LD+YAFLI  LL L+
Sbjct: 438 AFCDQR-FLSAAQNSAKFIKAKLWNE--HGLLRRWRDDEAMFSAGLDEYAFLIRSLLTLF 494

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           E G GT+WL WA+EL       F     G Y+ T G+D S+++R  +  DGAEPSGN++ 
Sbjct: 495 EAGCGTEWLQWALELNEILKNQF-KALNGAYYQTNGQDLSLVIRKCQFSDGAEPSGNAIQ 553

Query: 694 VINLVRLASIV 704
             NL+RL  + 
Sbjct: 554 CENLLRLYQLT 564


>gi|398331059|ref|ZP_10515764.1| hypothetical protein LalesM3_03040 [Leptospira alexanderi serovar
           Manhao 3 str. L 60]
          Length = 699

 Score =  424 bits (1091), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 271/686 (39%), Positives = 372/686 (54%), Gaps = 67/686 (9%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           ++  NRL+ E SPYL QHA+NPVDWF WGEEA  +AR++D  IFLSIGY+TCHWCHVME 
Sbjct: 13  SRSPNRLSKEKSPYLQQHAYNPVDWFPWGEEALTKAREQDKLIFLSIGYATCHWCHVMEK 72

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFE++ VA  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++FL+PD KP+ 
Sbjct: 73  ESFENQMVADYLNSHFVSIKVDREERPDIDRIYMDALHAMDQQGGWPLNIFLTPDGKPIT 132

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
           GGTYFPPE +YGR  F  IL  ++  W +KR  L      A  +LS  L  S     +  
Sbjct: 133 GGTYFPPEPRYGRKSFLEILNILRKVWKEKRQEL----IVASSELSRYLKDSGEGRAIEK 188

Query: 279 E---LP-QNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKKLEDTGK 330
           +   LP +N            YD+ FGGF +    KFP  + +  +L  YHS        
Sbjct: 189 QEGSLPSENCFDSGFSLYESYYDAEFGGFKTNHVNKFPPSMGLSFLLRYYHS-------- 240

Query: 331 SGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYL 390
           SG  S   +MV  TL  M +GGI+D +GGG  RYS D  W VPHFEKMLYD        +
Sbjct: 241 SGNPS-ALEMVENTLLAMKQGGIYDQIGGGLCRYSTDHHWMVPHFEKMLYDNSLFLETLV 299

Query: 391 DAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTS 450
           +   ++K +       D++ YL RDM   GG I SAEDADS   EG    +EG FY+W  
Sbjct: 300 ECSQVSKKISAKSFALDVISYLHRDMRIVGGGICSAEDADS---EG----EEGLFYIWDF 352

Query: 451 KEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 510
           +E  ++ GE + + ++ + +   GN            F+GKN+L E     + A+K    
Sbjct: 353 EEFREVCGEDSRILEKFWNVTKKGN------------FEGKNILHE--SYRSEATKFSEE 398

Query: 511 LEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 569
             K ++ +L   R KL + R+KR RP  DDK++ SWNGL I + A+A             
Sbjct: 399 EWKRIDSVLERGRAKLLERRNKRVRPLRDDKILTSWNGLYIKALAKAG------------ 446

Query: 570 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 629
               V   R++++++AE   SFI R+L D  + R+   FR+  S   G+ +DYA +IS  
Sbjct: 447 ----VAFQREDFLKLAEETYSFIERNLID-PSGRILRRFRDKESGILGYSNDYAEMISSS 501

Query: 630 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPS 688
           + L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +DG EPS
Sbjct: 502 IALFEAGCGIRYLKNAVLWMEEAIRLF--RSPAGVFFDTGNDGEVLLRRSVDSYDGVEPS 559

Query: 689 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPS 748
            NS    +LV+L+  + G  S  YR+ AE     F   L   +++ P +  A       S
Sbjct: 560 ANSSLAYSLVKLS--LFGIDSVRYREFAESIFLYFTKELSTYSLSYPHLLSAYWTYRHHS 617

Query: 749 RKHVVLVGHKSSVDFENMLAAAHASY 774
            K +VL+  K +   + +LAA    +
Sbjct: 618 -KEIVLI-RKDTDSGKELLAAIQTRF 641


>gi|297621186|ref|YP_003709323.1| thymidylate kinase [Waddlia chondrophila WSU 86-1044]
 gi|297376487|gb|ADI38317.1| putative thymidylate kinase [Waddlia chondrophila WSU 86-1044]
          Length = 691

 Score =  424 bits (1091), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 247/606 (40%), Positives = 339/606 (55%), Gaps = 59/606 (9%)

Query: 101 HTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVES 160
           +TNRL  + SPYLLQHAHNPVDW  WGEEAF +A++ + PIFLSIGY+TCHWCHVME ES
Sbjct: 7   YTNRLITQKSPYLLQHAHNPVDWHPWGEEAFEKAKELNKPIFLSIGYATCHWCHVMEEES 66

Query: 161 FEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALY-GGGGWPLSVFLSPDLKPLMG 219
           F++  VA+ LN  F++IKVDREE P+VD++YM + QAL     GWPL+VFL+PDL P   
Sbjct: 67  FQNLEVAEQLNRAFINIKVDREELPEVDQLYMDFAQALMPNSAGWPLNVFLTPDLLPFFA 126

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAW-DKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            TY PP +  G PG   +++ + + W  K  D +       ++   + +        LPD
Sbjct: 127 TTYLPPRNASGLPGMIDLIQHIHELWIGKGHDQILMQAQQIVDLFQQNIQVYGID--LPD 184

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
              +  + L  + L +  D  +GG   APKFP   +  + L H   LE  G+        
Sbjct: 185 ---RKCVPLAVDTLLQISDPVWGGVKGAPKFPIGYQY-VFLMHYSALEKDGRP------M 234

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
            +V  TL+ M +GGI+DH+G GF RYS+DE+W +PHFEKMLYD   LA  Y +A+  TK 
Sbjct: 235 FLVEKTLELMYRGGIYDHLGSGFSRYSIDEQWQIPHFEKMLYDNALLAECYCEAWKATKR 294

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             +  +C +++DY+   + G  G   SAEDADS   EG     EG FY WT  E++D+LG
Sbjct: 295 SLHRRVCCEVIDYVLSKLTGEQGAFLSAEDADS---EGV----EGKFYTWTMDEIDDVLG 347

Query: 459 -EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLG-MPLEKY 514
            + + LF   Y     GN            F+GKN+  L  L +  AS +++    LE  
Sbjct: 348 SDDSELFCSVYGATAIGN------------FEGKNILHLPALLEHYASDNQMDHFELEAR 395

Query: 515 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 574
              + E + KL+ VR KR  P  DDKV+ SWNGL+I S   A K  +             
Sbjct: 396 ---IAELKEKLYKVREKRGHPLKDDKVLSSWNGLMIHSIVEAGKAFEI------------ 440

Query: 575 GSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE 634
                 Y++    AA FI  HL+  +  RL   +R G     G LDDYAF+I   L L+E
Sbjct: 441 ----SRYVDAGRRAARFIYGHLW--KNGRLLRRYREGKVDFSGGLDDYAFMIRASLTLFE 494

Query: 635 FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
            G GT+WL WA  ++    + F   EGG ++ T G+DP++++R     DGAEPSGN+V  
Sbjct: 495 AGCGTEWLEWAFSMERVLRDAF-KAEGGAFYQTDGKDPNLIIRQCLFADGAEPSGNAVHC 553

Query: 695 INLVRL 700
            NL+R+
Sbjct: 554 ENLLRI 559


>gi|359728137|ref|ZP_09266833.1| hypothetical protein Lwei2_14957 [Leptospira weilii str.
           2006001855]
          Length = 724

 Score =  424 bits (1089), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 272/716 (37%), Positives = 385/716 (53%), Gaps = 69/716 (9%)

Query: 70  LAVISHRPIHPYKVVAMAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEE 129
           + ++  R I   + +       +++    ++  NRL+ E SPYL QHA+NPVDWF WGEE
Sbjct: 9   MDMVGIRKIFRNRKIDFMSLKESNSMQFSSRGPNRLSKEKSPYLQQHAYNPVDWFPWGEE 68

Query: 130 AFAEARKRDVPIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDK 189
           A  +AR+++  IFLSIGY+TCHWCHVME ESFE++ VA  LN  FVSIKVDREERPD+D+
Sbjct: 69  ALTKAREQNKLIFLSIGYATCHWCHVMEKESFENQMVADYLNSHFVSIKVDREERPDIDR 128

Query: 190 VYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKR 249
           +YM  + A+   GGWPL++FL+PD KP+ GGTYFPPE +YGR  F  IL  ++  W++KR
Sbjct: 129 IYMDALHAMDQQGGWPLNIFLTPDGKPITGGTYFPPEPRYGRKSFLEILNILRKVWNEKR 188

Query: 250 DMLAQSGAFAIEQLSEALSASASSNKLPDE---LP-QNALRLCAEQLSKSYDSRFGGFGS 305
               Q    A  +LS  L  S     +  +   LP +N            YD+ FGGF +
Sbjct: 189 ----QELIVASSELSRYLKDSGEGRAIEKQEGSLPSENCFDSGFSLYESYYDAEFGGFKT 244

Query: 306 --APKFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGF 361
               KFP  + +  +L  YHS        SG      +MV  TL  M +GGI+D +GGG 
Sbjct: 245 NHVNKFPPSMGLSFLLRYYHS--------SGNP-RALEMVENTLLAMKQGGIYDQIGGGL 295

Query: 362 HRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGG 421
            RYS D  W VPHFEKMLYD        ++   ++K +       D++ YL RDM   GG
Sbjct: 296 CRYSTDHHWMVPHFEKMLYDNSLFLETLVECSQVSKKISAKSFALDVISYLHRDMRIVGG 355

Query: 422 EIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRM 481
            I SAEDADS   EG    +EG FY+W  +E  ++ GE + + ++ + +   GN      
Sbjct: 356 GICSAEDADS---EG----EEGLFYIWDFEEFREVCGEDSQILEKFWNVTKKGN------ 402

Query: 482 SDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDK 540
                 F+GKN+L E     + A+K      K ++ +L   R KL + RSKR RP  DDK
Sbjct: 403 ------FEGKNILHE--SYRSEATKFSEEEWKRIDSVLERGRAKLLERRSKRVRPLRDDK 454

Query: 541 VIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQ 600
           ++ SWNGL I + A+A                 V   R++++++AE   SFI ++L D  
Sbjct: 455 ILTSWNGLYIKALAKAG----------------VAFQREDFLKLAEETYSFIEKNLIDPN 498

Query: 601 THRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDRE 660
             R+   FR+G S   G+ +DYA +IS  + L+E G G ++L  A+     +D + L R 
Sbjct: 499 G-RILRRFRDGESGILGYSNDYAEMISSSIALFEAGCGIRYLKNAVLWM--EDAIRLFRS 555

Query: 661 GGGYFNTTGEDPSVLLRVKED-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 719
             G F  TG D  VLLR   D +DG EPS NS    +LV+L+  + G  S  Y + AE  
Sbjct: 556 PAGVFFDTGNDGEVLLRRSVDGYDGVEPSANSSLAYSLVKLS--LLGIDSARYGEFAESI 613

Query: 720 LAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDF-ENMLAAAHASY 774
              F   L   +++ P +  A       S K +VL+  +   DF +++LAA    +
Sbjct: 614 FLYFTKELSTNSLSYPHLLSAYWTYRRHS-KEIVLI--RKDTDFGKDLLAAIQTRF 666


>gi|296121436|ref|YP_003629214.1| hypothetical protein Plim_1180 [Planctomyces limnophilus DSM 3776]
 gi|296013776|gb|ADG67015.1| protein of unknown function DUF255 [Planctomyces limnophilus DSM
           3776]
          Length = 707

 Score =  423 bits (1087), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 253/673 (37%), Positives = 359/673 (53%), Gaps = 66/673 (9%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRLAAE S YL QHA NPV W  W +EA+  AR+ D P+FLSIGYS CHWCHVME ESF
Sbjct: 4   VNRLAAETSLYLNQHAQNPVAWQPWDDEAWRLARELDRPVFLSIGYSACHWCHVMEHESF 63

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           E+  +A+LLN WFVSIKVDREERPD+D++YM  V A+   GGWP+SVFL+P   P  GGT
Sbjct: 64  ENPRIAELLNQWFVSIKVDREERPDLDQIYMAAVIAMTQQGGWPMSVFLTPQGHPFYGGT 123

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFPP  +YGRPGF  +L  + DAW+ +R+++ +  +    QL+  +    S  + P  L 
Sbjct: 124 YFPPTSRYGRPGFAEVLAAIHDAWENRREVVTEQAS----QLTMTVHDQLSERQEPTTLH 179

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           +N L      L +  D   GGFG APKFP  +++++ +  + +  DT ++ E +E     
Sbjct: 180 ENLLEKAGRTLVRVCDRVNGGFGHAPKFPHAMDLRLAMRLAHRF-DTTETAEVAE----- 233

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
              L  MAKGGIHDH+GGGF RYS DE W VPHFEKMLYD   L   YLD +   K  FY
Sbjct: 234 -LGLTAMAKGGIHDHLGGGFARYSTDEIWLVPHFEKMLYDNALLLQAYLDGWQFNKTDFY 292

Query: 402 SYICRDILDYLRRDMIGPGGEI----FSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
               + I+ Y+ R+M  P  E+     +A+DADS   EG    +EG F+VW+  E+ D+L
Sbjct: 293 RRTAQSIVHYVLREMQVPRAELPGGFCAAQDADS---EG----EEGRFFVWSQSEIRDVL 345

Query: 458 ------GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM-- 509
                  + + LF+  Y +   GN            ++G N+L      +A   +LGM  
Sbjct: 346 SGSELGNDDSRLFERAYGVTSGGN------------WEGHNILNLPKTIAALGRELGMAE 393

Query: 510 -PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 568
             LE+ L++L   R KLF+ R  R  P  D+K+IV+WNGL+IS+ ARA  +L  +     
Sbjct: 394 TALEQKLSLL---RTKLFEHRKNRIAPGRDEKLIVAWNGLMISALARAGLVLDDQEALQA 450

Query: 569 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 628
                     +  +++AES              + L HS + G  K   +LDDY   +  
Sbjct: 451 AQ-----RAARVILDMAESL------------PYGLPHSIQKGQPKHGAYLDDYGCFLEA 493

Query: 629 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 688
           L++L+       WL  A+ L +     F D E GG++ T+ +   ++ R ++  D   PS
Sbjct: 494 LIELFLADGDPSWLSRAVPLIDRLVNEFHDDEQGGFYFTSSQAEKLISRSRDFQDNVTPS 553

Query: 689 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPS 748
           GN+     L++   I   ++S+   + A   L      ++   MA      A D    PS
Sbjct: 554 GNAAVANALLKFGRITGDARSE---ELAHEVLQAASGLMQQSTMATAHSLAALDWWLGPS 610

Query: 749 RKHVVLVGHKSSV 761
            + V +    +S 
Sbjct: 611 YECVYVPAETTST 623


>gi|432330863|ref|YP_007249006.1| thioredoxin domain protein [Methanoregula formicicum SMSP]
 gi|432137572|gb|AGB02499.1| thioredoxin domain protein [Methanoregula formicicum SMSP]
          Length = 708

 Score =  423 bits (1087), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 270/683 (39%), Positives = 369/683 (54%), Gaps = 49/683 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL+ E SPYLLQHA NPVDWF WGEEAF  A + D P+FLSIGY+TCHWCHVM  ESFE
Sbjct: 14  NRLSREKSPYLLQHAENPVDWFPWGEEAFLRAAREDKPVFLSIGYATCHWCHVMAHESFE 73

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  VA+LLN  F+++KVDREERPD+D  YM   Q L G GGWPL++ ++P+ KP    TY
Sbjct: 74  DLEVAELLNRDFIAVKVDREERPDIDSTYMQVCQMLSGQGGWPLTIVMTPEKKPFFAATY 133

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
            P E ++  PG   +L ++  AW ++R  L QS     E +++AL    ++   P+  P 
Sbjct: 134 LPKERRFAVPGLLDLLPRIAKAWREQRGELLQSA----ESITQALETRDAAPAGPE--PD 187

Query: 283 NA-LRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
            A L    E L   +D  +GGF  APKFP P  +  +L + K+   TGK         MV
Sbjct: 188 AALLDEGYEDLLLRFDPGYGGFSGAPKFPTPHTLLFLLRYWKR---TGK----KRALDMV 240

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
           + TL     GGIHDH+GGGFHRYS D +W VPHFEKMLYDQ  L   Y +AF  T++  Y
Sbjct: 241 VKTLDAFRDGGIHDHIGGGFHRYSTDAQWRVPHFEKMLYDQALLVIAYTEAFQATRNYRY 300

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL-GEH 460
                  + Y+ RD+  P G  FSAEDADS       R  EGAFY+WT  E+E +L  + 
Sbjct: 301 RETAMSTVRYVLRDLTDPEGAFFSAEDADS-------RGGEGAFYLWTMGELEAVLEKDD 353

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
           A +    + ++  GN        P +    +N+L       A  S  G+  E+    +  
Sbjct: 354 AAIAGRVFNVRDEGN-----FLSPEST-GAENILFRTRTDEALVSVTGIHQEELDERIAS 407

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R +LF  R KR RP  DDKV++ WNGL+I++ A+A++   +            G  R  
Sbjct: 408 IRERLFAAREKRERPRRDDKVLLDWNGLMIAALAKAARAFGN------------GECRTA 455

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
                E   S +R         RL H +R+G    PGF DDYAFL   L++LYE     +
Sbjct: 456 AERAMECILSRMR-----TGDGRLYHRYRDGERAIPGFADDYAFLGLALIELYECTFDPR 510

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
           +L  A+ +  T  + FLDRE GG+F T G+  ++L+R K  +DGA PS NSV+   L+RL
Sbjct: 511 YLAEALAIMKTFRDHFLDRENGGFFFTAGDAEALLVRDKVIYDGAVPSANSVACEVLLRL 570

Query: 701 ASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSS 760
           + +   ++ +        S   F  R+++   A     CA +    PS + +V+ G   S
Sbjct: 571 SRLTGTTEHEDLAAALARS---FAGRVRESPSAFCWFLCAIERAVGPS-QDIVIAGDSGS 626

Query: 761 VDFENMLAAAHASYDLNKTVSKK 783
              +  LAA  + Y  + TV  K
Sbjct: 627 PAVQEFLAAVRSRYLPHCTVIHK 649


>gi|384170788|ref|YP_005552166.1| hypothetical protein BAXH7_04212 [Bacillus amyloliquefaciens XH7]
 gi|341830067|gb|AEK91318.1| hypothetical protein BAXH7_04212 [Bacillus amyloliquefaciens XH7]
          Length = 664

 Score =  422 bits (1086), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 246/647 (38%), Positives = 353/647 (54%), Gaps = 62/647 (9%)

Query: 121 VDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVD 180
           +DWF WG+EAF +A++ + P+ +SIGYSTCHWCHVM  ESFEDE +A +LND F++IKVD
Sbjct: 1   MDWFPWGDEAFEKAKRENKPVLISIGYSTCHWCHVMAHESFEDEEIAGMLNDKFIAIKVD 60

Query: 181 REERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRK 240
           REERPDVD VYM   Q + G GGWPL+VF++PD KP   GTYFP   K+ RPGF  +L  
Sbjct: 61  REERPDVDSVYMRICQLMTGQGGWPLNVFVTPDQKPFYAGTYFPKTSKFNRPGFIDVLEH 120

Query: 241 VKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE--LPQNALRLCAEQLSKSYDS 298
           + + +   R          +E ++E  +A       P E  L + A+     QL+  +D+
Sbjct: 121 LSETFANDRQ--------HVEDIAENAAAHLEVKVHPTEGMLGEQAVHDTYRQLAGGFDT 172

Query: 299 RFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVG 358
            +GGFG APKFP P    M+L+  +    TGK  +A  G   V  TL  MA GGI DH+G
Sbjct: 173 VYGGFGQAPKFPMP---HMLLFLLRYYSYTGKE-QALAG---VTKTLDGMANGGIFDHIG 225

Query: 359 GGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIG 418
            GF RYS D  W VPHFEKMLYD   L + Y +A+ +T +  Y  I   I+ +++R+M+ 
Sbjct: 226 FGFARYSTDNEWLVPHFEKMLYDNALLLSAYTEAYQVTNNERYKQIATQIVTFIQREMMH 285

Query: 419 PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCD 477
             G  FSA DAD   TEG    +EG +Y+W+ KE+ ++LG+    L+ + Y +   GN  
Sbjct: 286 EDGSFFSALDAD---TEG----REGKYYIWSKKEIMNLLGDQLGSLYCKVYNITEQGN-- 336

Query: 478 LSRMSDPHNEFKGKNV--LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRP 535
                     F+G+N+  LI      A   + G+   +    L   R+KL + R  R  P
Sbjct: 337 ----------FEGENIPNLI-FTRREAILEETGLTEHELTERLEGARKKLLEARENRSYP 385

Query: 536 HLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRH 595
           H DDKV+ SWN L+I+  A+A+K+                     ++ +AE+A  F+ RH
Sbjct: 386 HTDDKVLTSWNALMIAGLAKAAKVFHEPG----------------FLSMAETAIRFLERH 429

Query: 596 LYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDEL 655
           L  +   R+   +R G  K  GF+DDYAFLI   L+LYE G    +L  A  L  +  +L
Sbjct: 430 LIPDG--RVMVRYREGEVKNKGFIDDYAFLIWAYLELYEAGFNPSYLKKAKTLCTSMLDL 487

Query: 656 FLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQN 715
           F D   GG+F T  +  ++L+R KE +DGA PSGNS + + L+RL  +          + 
Sbjct: 488 FWDERHGGFFFTGNDAETLLVREKEVYDGAVPSGNSAAAVQLLRLGRLTGDVS---LIEK 544

Query: 716 AEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD 762
           AE   +VF+  ++    +      +  +  +  +K +V+ G K   D
Sbjct: 545 AEAMFSVFKREIEAYPSSSAFFMQSV-LAHIMPQKEIVVFGSKDDPD 590


>gi|448355570|ref|ZP_21544321.1| hypothetical protein C483_16206 [Natrialba hulunbeirensis JCM
           10989]
 gi|445635098|gb|ELY88270.1| hypothetical protein C483_16206 [Natrialba hulunbeirensis JCM
           10989]
          Length = 722

 Score =  422 bits (1086), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 250/652 (38%), Positives = 350/652 (53%), Gaps = 51/652 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W E+A   AR+ DVPIFLSIGYS CHWCHVME ESF 
Sbjct: 10  NRLDEEESPYLRQHADNPVNWQPWDEQALETAREHDVPIFLSIGYSACHWCHVMEDESFA 69

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA++LN+ FV IKVDREERPDVD +YMT  Q + G GGWPLS +L+P+ KP   GTY
Sbjct: 70  DEQVAEVLNENFVPIKVDREERPDVDSIYMTVCQLVTGRGGWPLSAWLTPEGKPFYVGTY 129

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDML-------AQSGAFAIEQLSEALSASASSNK 275
           FP   K G+PGF  IL  + ++W   RD +         +    +E+  +A+SAS   + 
Sbjct: 130 FPKNAKRGQPGFLDILENLTNSWAGDRDEIENRAEQWTDAAKDRLEETPDAVSASQPPSS 189

Query: 276 LPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDTGKSGEA 334
                  + L   A    +S D +FGGFGS  PKFP+P  ++++   ++  + TG+    
Sbjct: 190 -------DVLEAAANASLRSADRQFGGFGSDGPKFPQPSRLRVL---ARAADRTGR---- 235

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
            E Q +++ TL  MA GG++DHVGGGFHRY VD  W VPHFEKMLYD  ++   +L  + 
Sbjct: 236 DEFQDVLVETLDAMAAGGLYDHVGGGFHRYCVDRDWTVPHFEKMLYDNAEIPRAFLIGYQ 295

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
            T D  Y+ +  + L ++ R++    G  FS  DA S E E    ++EGAFYVWT  E+ 
Sbjct: 296 QTGDERYAEVVAETLAFVARELTHEEGGFFSTLDAQSEEPE-TGEREEGAFYVWTPDEIH 354

Query: 455 DILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 512
           D+L     A LF + Y +  +GN            F+G      +   S  A++  +   
Sbjct: 355 DVLENETTADLFCDRYDITESGN------------FEGSTQPNRVRSVSDLAAEYDLEAA 402

Query: 513 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 572
                L   R KLF  R +RPRP+ D+KV+  WNGL+I++ A A+ +L            
Sbjct: 403 DVRARLESAREKLFAAREQRPRPNRDEKVLAGWNGLMIATCAEAALVLGG---------- 452

Query: 573 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 632
               D  EY  +A  A  F+R  L+DE   RL   +++G     G+L+DYAFL    L  
Sbjct: 453 --SEDGDEYATMAVDALEFVRDRLWDEDEQRLSRRYKDGDVAIDGYLEDYAFLARAALGC 510

Query: 633 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 692
           YE       L +A++L    ++ F D + G  + T     S++ R +E  D + PS   V
Sbjct: 511 YEATGEVDHLAFALDLARIIEDEFWDADRGTLYFTPESGESLVTRPQELGDQSTPSAAGV 570

Query: 693 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 744
           +V  L+ L       + D + + A   L     R++  ++    +C AAD L
Sbjct: 571 AVETLLALEGF--ADQDDEFEEIATTVLETHANRIETNSLEHATLCLAADRL 620


>gi|436836357|ref|YP_007321573.1| protein of unknown function DUF255 [Fibrella aestuarina BUZ 2]
 gi|384067770|emb|CCH00980.1| protein of unknown function DUF255 [Fibrella aestuarina BUZ 2]
          Length = 682

 Score =  422 bits (1085), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 244/608 (40%), Positives = 337/608 (55%), Gaps = 48/608 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA+E SPYLLQHAHNPVDWF WG+EA A+AR  D PI +SIGYS CHWCHVME ESFE
Sbjct: 2   NRLASETSPYLLQHAHNPVDWFPWGDEALAKARDEDKPILVSIGYSACHWCHVMERESFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +E +AK++N+ FV IKVDREERPDVD VYM  VQA+   GGWPL+VFL PD +P  G TY
Sbjct: 62  NEQIAKIMNERFVCIKVDREERPDVDAVYMEAVQAMGVQGGWPLNVFLMPDARPFYGLTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
            PP++      +  ++  V+ A+D+ RD L +S     E L+ + S             Q
Sbjct: 122 APPQN------WANLMVGVRQAFDENRDELLRSAEGFAEHLNTSESTRFQLQTAEPVYAQ 175

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             +     +L+  +D+  GG G APKFP P     +L ++        +G+ S  Q++ L
Sbjct: 176 ETVETMYRKLATRFDTELGGTGRAPKFPMPSIYTFLLRYAD------LTGDPSAFQQLTL 229

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  MA GGI+D +GGGF RYS D+ W  PHFEKMLYD  QL  +Y +AF++T    Y 
Sbjct: 230 -TLNRMALGGIYDQLGGGFARYSTDKHWFAPHFEKMLYDNAQLLTLYSEAFAMTGSALYR 288

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
           +     +++L R+++ P G  +SA DADS   EG     EG FY W++ E++ ILG+   
Sbjct: 289 FTVYHTIEFLERELLSPDGGFYSALDADS---EGI----EGKFYTWSADELQSILGDDYD 341

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
            F + Y + P GN D+      H   +  N+L     + A A +LG    +    L   +
Sbjct: 342 WFAQLYTITPEGNWDIG-----HGHGR-TNILHRTETNPAFADQLGWTAAELNERLTTAK 395

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYM 582
            KL  VRS+R RP LDDK++ SWNGL +     A ++         FN P       E++
Sbjct: 396 EKLLAVRSQRVRPGLDDKLLCSWNGLALKGLVSAYRV---------FNEP-------EFL 439

Query: 583 EVAESAASFIRRHLYDEQT-HRLQHSFRNGP-----SKAPGFLDDYAFLISGLLDLYEFG 636
            +A   A FI++ L D +   RL HS++ GP     ++  GFL+DYA +I G + LY+  
Sbjct: 440 SMALRLAFFIKQKLTDGRNGGRLWHSYKTGPDGVGRARQLGFLEDYAAVIDGYVALYQAT 499

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
              +WL  A  L       F D +    F T      ++ R KE  D   P+ NS+   N
Sbjct: 500 FADEWLTEADRLTQYVLAHFNDPDEPLLFFTDKSGEELIARKKELFDNVIPASNSIMAQN 559

Query: 697 LVRLASIV 704
           L  L+ ++
Sbjct: 560 LYTLSLLL 567


>gi|134077135|emb|CAK45476.1| unnamed protein product [Aspergillus niger]
          Length = 765

 Score =  422 bits (1085), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 252/609 (41%), Positives = 342/609 (56%), Gaps = 39/609 (6%)

Query: 116 HAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFV 175
           H +NPV W  W  EA   A++ +  IFLSIGYS CHWCHVME ESF  + VA +LN  F+
Sbjct: 25  HMNNPVGWQLWDAEAIDLAKRHNRLIFLSIGYSACHWCHVMEKESFMSQEVASILNQSFI 84

Query: 176 SIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKY-----G 230
            IKVDREERPD+D VYM YVQA  G GGWPL+VFL+PDL+P+ GGTY+P  +       G
Sbjct: 85  PIKVDREERPDIDDVYMNYVQATTGSGGWPLNVFLTPDLEPVFGGTYWPGPNSSTLTGNG 144

Query: 231 RPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASS----NKLPDELPQNALR 286
             GF  IL K+ D W  ++    +S     +QL E       S     +  ++L    L 
Sbjct: 145 TIGFVEILEKLSDVWQTQQLRCRESAKEITKQLREFAEEGTHSYQGDRQADEDLDLELLE 204

Query: 287 LCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQ 346
              +     YD   GGF +APKFP P  +  +L+   +        E ++   M + TL 
Sbjct: 205 EAYQHFVSRYDPLHGGFSTAPKFPTPSNLSFLLHIVGR-------DECAKATAMAVDTLI 257

Query: 347 CMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICR 406
            MA+GGI DH+G GF RYSV   W +PHFEKMLYDQ QL +VY+DAF +T +        
Sbjct: 258 SMARGGIRDHIGHGFARYSVTGDWGLPHFEKMLYDQAQLLDVYVDAFKITHNPELLGAVY 317

Query: 407 DILDYLRRDMI-GPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILF 464
           D+  YL    I  P G   S+EDADS  T   T K+EGAFYVWT KE+  +LG+  A + 
Sbjct: 318 DLATYLTTAPIQSPTGAFHSSEDADSLPTPNDTEKREGAFYVWTLKELTQVLGQRDAGVC 377

Query: 465 KEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRK 524
             H+ + P GN  ++  +DPH+EF  +NVL      S  A   G+  E+ + I+   ++K
Sbjct: 378 ARHWGVLPDGN--IAPENDPHDEFMNQNVLSVKVTPSRLAKDFGLGEEEVVRIIRAAKQK 435

Query: 525 LFDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYME 583
           L D R + R RP LDDK+IV+WNGL I + A+ S + + E ES         S   +  E
Sbjct: 436 LRDYRERTRVRPDLDDKIIVAWNGLAIGALAKCSALFE-EIES---------SKAVQCRE 485

Query: 584 VAESAASFIRRHLYDEQTHRLQHSFRNGP-SKAPGFLDDYAFLISGLLDLYEFGSGTKWL 642
            A  A +FI+ +L+++ T +L   +R+G     PGF DDYA+LI GLLD+YE      +L
Sbjct: 486 AAAKAINFIKENLFEKPTGQLWRIYRDGGRGNTPGFADDYAYLIGGLLDMYEATFDDSYL 545

Query: 643 VWAIELQNTQDELFLDREG---GGYFNT----TGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
            +A +LQ   ++ FL   G    GY++T    T   P  LLR+K   + A P+ N V   
Sbjct: 546 QFAEQLQKYLNDNFLAYVGTTPAGYYSTPSTMTSGAPGPLLRLKTGTESATPAVNGVIAR 605

Query: 696 NLVRLASIV 704
           NL+RL S++
Sbjct: 606 NLLRLGSLL 614


>gi|374376399|ref|ZP_09634057.1| protein of unknown function DUF255 [Niabella soli DSM 19437]
 gi|373233239|gb|EHP53034.1| protein of unknown function DUF255 [Niabella soli DSM 19437]
          Length = 687

 Score =  422 bits (1084), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 238/603 (39%), Positives = 333/603 (55%), Gaps = 44/603 (7%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           +N L  E SPYLLQHAHNPVDW+ WGE+A  +A   D PI +SIGY+ CHWCHVME ESF
Sbjct: 2   SNHLIHETSPYLLQHAHNPVDWYPWGEKALQKAINEDKPILVSIGYAACHWCHVMERESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           ED   A L+N+ F++IKVDREERPD+D +YM  VQ + G GGWPL+VFL+PD KP  GGT
Sbjct: 62  EDAATAALMNEHFINIKVDREERPDIDHIYMDAVQTMTGSGGWPLNVFLTPDKKPFYGGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           Y+PP     RP +K +L  V DA+  KR  + Q      +QL +A S         D L 
Sbjct: 122 YYPPVSYANRPSWKDVLTAVSDAFQNKRTAIQQQAEGLTQQLVDANSFGIGDGSGADFLR 181

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
                 C+  L ++ D+ +GGFG APKFP+   I+ +L +    +D   S  A    +  
Sbjct: 182 DEVDAACSAILKQA-DTSWGGFGRAPKFPQTQTIRFLLRYHYAEKDRPDSF-ADNALQQA 239

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
           L +L  M +GGI+D VGGGF RY+ D  W  PHFEKMLYD   L     +A+ +T+D  Y
Sbjct: 240 LLSLDKMMEGGIYDQVGGGFARYATDTEWLAPHFEKMLYDNALLVVTLSEAYQVTRDERY 299

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHA 461
                  + ++ R++    G  ++A DADS   EG    +EG FYVW+ KE+E++L E A
Sbjct: 300 RGCIEQTIAFIERELTDASGGFYAALDADS---EG----EEGKFYVWSKKEIEELLREDA 352

Query: 462 ILFKEHYYLKPTGNCD---LSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
            LF  +Y +  +GN +   + R+  P  EF   N   E+N++   A            +L
Sbjct: 353 DLFCRYYDITESGNWEGKNILRILTPLKEFAATN---EINETLLEA------------LL 397

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
            + R +L   R+ R RP LDDK+I+ WN L+ +++++A +   +EA              
Sbjct: 398 EKGRLQLLVARAHRIRPALDDKIILGWNALMNTAYSKAFEATGNEA-------------- 443

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
             Y++ A     F+  + ++       H ++ G +K P FLDDYA+LI  LL L    + 
Sbjct: 444 --YLQRATDNMRFL-LNAFENTDGSFAHVWKAGVAKYPAFLDDYAYLIEALLQLARVTAD 500

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
             +L  A  L     E F + E G +F T      V+LR KE +DGA PSGN+V   NL+
Sbjct: 501 YSYLEKARALCQGIQEHFAESETGYFFYTPQNQGDVILRKKEVYDGATPSGNAVMAANLL 560

Query: 699 RLA 701
            L+
Sbjct: 561 HLS 563


>gi|394990058|ref|ZP_10382890.1| hypothetical protein SCD_02483 [Sulfuricella denitrificans skB26]
 gi|393790323|dbj|GAB72529.1| hypothetical protein SCD_02483 [Sulfuricella denitrificans skB26]
          Length = 681

 Score =  422 bits (1084), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 247/625 (39%), Positives = 350/625 (56%), Gaps = 54/625 (8%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            N L +E SPYL QHA NPV+W  W E+A A AR++D PI LS+GYSTCHWCHVM  ESF
Sbjct: 2   ANHLVSESSPYLQQHADNPVNWHPWCEQALALAREQDKPILLSVGYSTCHWCHVMAHESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPLSVFLSPDLKPLMGG 220
           ED+  A L+N  +++IKVDREERPD+D++Y +    L G  GGWPL++FL+PD  P  GG
Sbjct: 62  EDQTTADLINRDYIAIKVDREERPDLDQIYQSAHNLLTGKSGGWPLTLFLTPDQTPFYGG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFPPE +Y RPGFK +L KV  A+ ++R  +AQ        L E+L++     +   E 
Sbjct: 122 TYFPPEARYNRPGFKDLLPKVAQAYRERRHDIAQQNI----SLRESLASGGPVPQAGIEP 177

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
               L     QL K++D   GGFG APKFPRP EI   L      E+       ++  +M
Sbjct: 178 NPAPLAGAQSQLEKNFDPVHGGFGGAPKFPRPSEIAFCLRRYAAEEN-------AQALEM 230

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
              TL+ +A GGI+D +GGGF RYSVDERW +PHFEKMLYD G L  +Y +A+  + D  
Sbjct: 231 ARQTLRKIADGGINDQLGGGFCRYSVDERWLIPHFEKMLYDNGPLLELYANAWCCSGDER 290

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-- 458
           +  +  + + +L R+M  P G  +SA DADS          EG FYVWT +EV   L   
Sbjct: 291 FRRVAEETVAWLEREMRAPQGGFYSALDADSEHV-------EGKFYVWTPQEVAATLSAD 343

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
           E+A+L + HY L    N + S     H  F   + L ++      A +L + L+    +L
Sbjct: 344 EYAVLSR-HYGLDQPANFEGS-----HWHFYVAHPLDQV------ARELSVELDDAWRLL 391

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
              R KL  +R++R RP  D+K++ SWN L+I   A A +                   R
Sbjct: 392 ESARTKLIALRAQRVRPGRDEKILTSWNALMIKGLAHAGRTF----------------GR 435

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
           ++++ +A+ A  FI   L+  + +RL  S+++G S   G+LDDYAFL+  L++L +    
Sbjct: 436 EDWIALAQQATDFIHAELW--RNNRLLASWKDGKSNLGGYLDDYAFLLDALVELLQARFR 493

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
           T  L +A EL       F D + GG++ T  +  +++ R K   D A PSGN+V+   L 
Sbjct: 494 TADLTFACELAEALLVRFEDCDQGGFYFTAHDHETLIFRPKTGFDNATPSGNAVAAFALQ 553

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVF 723
           RL  ++  ++   Y   AE +L +F
Sbjct: 554 RLGHLLGETR---YLAAAERALKLF 575


>gi|325262773|ref|ZP_08129509.1| dTMP kinase [Clostridium sp. D5]
 gi|324031867|gb|EGB93146.1| dTMP kinase [Clostridium sp. D5]
          Length = 668

 Score =  422 bits (1084), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 242/606 (39%), Positives = 339/606 (55%), Gaps = 69/606 (11%)

Query: 101 HTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVES 160
           + N L  E SPYLLQHA NPVDW+ WG EAF +A++ D P+FLSIGYSTCHWCHVM  ES
Sbjct: 2   YMNHLKNEKSPYLLQHAENPVDWYPWGPEAFQKAKQEDRPVFLSIGYSTCHWCHVMAHES 61

Query: 161 FEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGG 220
           FEDE VA++LN  ++ IKVDREERPD+D VYM+  QA+ G GGWPL+  L+P+ +P   G
Sbjct: 62  FEDEQVAEVLNSQYICIKVDREERPDIDSVYMSACQAVTGAGGWPLTAILTPEQQPFFLG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS-ASASSNKLPDE 279
           TYFP   +YG PG   +L ++   W + R+ L ++G    +Q++E +S    +S  +PD 
Sbjct: 122 TYFPKHPRYGHPGLIELLEEIGSLWRENRNKLIEAG----QQITEFISIPDHASGSIPD- 176

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGE-ASEGQ 338
             +  L+   E   + YDSR+GGFG APKFP P        H+          E   E  
Sbjct: 177 --KKGLKRAFELYRRQYDSRWGGFGKAPKFPAP--------HNLLFLLHYSLLENEQEAL 226

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
           +M   TL  MA GG++D +GGGF RYS DE+W VPHFEKMLYD   LA  YL+A+ + K 
Sbjct: 227 EMAEHTLTAMAHGGMNDQIGGGFSRYSTDEKWLVPHFEKMLYDNALLAIAYLEAYHIKKR 286

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             Y+   R  LDY+ R++ GP G+ +  +DADS   EG     EG +Y ++ +E+  +LG
Sbjct: 287 ELYADTARRTLDYVLRELTGPSGQFYCGQDADS---EGI----EGKYYFFSPEEIMSVLG 339

Query: 459 E-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGMPLEKYL 515
           +     F   Y +  +GN            F+G+++  LI  ++    A  + +      
Sbjct: 340 DGDGEEFCRIYDITASGN------------FEGRSIPNLIGQSELPWRADDIRL------ 381

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
                   ++++ R  R   H DDKVI+SWN  ++ + A+A++IL              G
Sbjct: 382 -------NRIYNYRRNRTLLHRDDKVILSWNSWMMIAMAKAAQIL--------------G 420

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
             R  Y + A +   FI+ H+ D+ + RL H +R G +   G LDDYA     LL+LY  
Sbjct: 421 DTR--YKDAAIAVHRFIQAHMTDD-SRRLYHRWREGEAAIEGQLDDYAVYGLALLELYRT 477

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
                +L  A        ELF DRE GGYF T  +  +++ R KE +DGA PSGNS + +
Sbjct: 478 AYEPVYLEEAAFFAGQMAELFEDRENGGYFLTASDTEALITRPKETYDGAVPSGNSAAAV 537

Query: 696 NLVRLA 701
            L +LA
Sbjct: 538 LLSQLA 543


>gi|83649209|ref|YP_437644.1| hypothetical protein HCH_06582 [Hahella chejuensis KCTC 2396]
 gi|83637252|gb|ABC33219.1| Highly conserved protein containing a thioredoxin domain [Hahella
           chejuensis KCTC 2396]
          Length = 762

 Score =  421 bits (1083), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 256/659 (38%), Positives = 359/659 (54%), Gaps = 72/659 (10%)

Query: 91  PASTSHSRNK----HTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIG 146
           P  T + R       TN L  E SPYLLQHAHNPV+W AW ++ FA A+  + PIFLSIG
Sbjct: 19  PVRTRYRRQDGSPVFTNHLILESSPYLLQHAHNPVNWRAWNDDTFALAKAENKPIFLSIG 78

Query: 147 YSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPL 206
           YSTCHWCHVME ESF++E VA+ LN +F+ IKVDRE+RPD+D++YMT VQ + G GGWP+
Sbjct: 79  YSTCHWCHVMEEESFDNEEVAQTLNGYFIPIKVDREQRPDLDEIYMTAVQIITGHGGWPM 138

Query: 207 SVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 266
           S FL+P+  P  G TYFP      RP F  +LRKV + W+++++ L + G     +LSEA
Sbjct: 139 SSFLTPEGNPFFGATYFP------RPRFINLLRKVHELWEEQQENLLEQG----RRLSEA 188

Query: 267 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 326
           +S       + + L +N +    E+L    D  +GGFGS PKFP+   +  +L     +E
Sbjct: 189 VSVYLRPKPISETLAENLIETAMEKLIGYSDREWGGFGSEPKFPQEPNLLFLL---DIIE 245

Query: 327 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 386
              +  +      +V   L  +  GG++D  GGGFHRY+VD+RW VPHFEKMLY+Q QLA
Sbjct: 246 RDSRPLDRQPAWTVVKTALDALLAGGVYDQAGGGFHRYAVDQRWLVPHFEKMLYNQAQLA 305

Query: 387 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 446
             ++ A+ L++D  Y  ICR+ LDY+ R+M  P G  +SA DADS   EG    +EG ++
Sbjct: 306 RCFIRAYKLSQDPEYLRICRETLDYVLREMRSPEGVFYSATDADS---EG----EEGKYF 358

Query: 447 VWTSKEVEDILGEHAILFKEHYY-LKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 505
           VW  +E+  +L    +   E  Y +   GN            F+G N+L        SA+
Sbjct: 359 VWAYQELSQLLDTPGLALAEQVYGVTRKGN------------FEGANILYLPRPLQKSAA 406

Query: 506 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 565
            LG+  E+ L  L + +  L   RS+R  P  DDKVI  WNG++I++ A  + I    A 
Sbjct: 407 TLGLTYEELLQQLADLKAILLQTRSQRVPPLRDDKVITEWNGMMIAALAETAAITGISA- 465

Query: 566 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT--HRLQHSFRNGPSKAPGFLDDYA 623
                          Y + A  AA+ + R    E    HR+  S  N PS     L+DY 
Sbjct: 466 ---------------YGDAAVIAANQLWRSQRGEDGLFHRI--SLDNLPSDD-ALLEDYV 507

Query: 624 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT--TGEDPSVLLRVKED 681
             + GLL LY++     WL     L  T +E FLD E GG+F T  + + P +L+R K  
Sbjct: 508 HYMEGLLQLYDYTHDHLWLERLEALTTTLEEQFLDAEQGGFFITPQSAQGP-LLVRSKHC 566

Query: 682 HDGAEPSGNSVSVINLVRLASIVAGSK---SDYYRQN-AEHSLAVFETRLKDMAMAVPL 736
            D A  SGNS       +LAS++A  +    D   Q  AE+ +A F  ++    ++ P+
Sbjct: 567 SDNATISGNS-------QLASVLAALRLRTGDLNVQRMAENQIAAFTGQINRHPLSAPV 618


>gi|257092092|ref|YP_003165733.1| hypothetical protein CAP2UW1_0453 [Candidatus Accumulibacter
           phosphatis clade IIA str. UW-1]
 gi|257044616|gb|ACV33804.1| protein of unknown function DUF255 [Candidatus Accumulibacter
           phosphatis clade IIA str. UW-1]
          Length = 734

 Score =  421 bits (1083), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 265/687 (38%), Positives = 376/687 (54%), Gaps = 75/687 (10%)

Query: 85  AMAERTPA---STSH----SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKR 137
           A+A R PA    T H     R    NRLA E SPYLLQHAHNPV+WF WG+EAFAEAR+ 
Sbjct: 23  AIALRGPAYVPRTHHLDADGRPLFINRLALETSPYLLQHAHNPVNWFPWGDEAFAEARRL 82

Query: 138 DVPIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQA 197
             P+FLSIGYSTCHWCHVME ESFEDE +A+ LN  +V+IKVDREERPD+D VYM+ VQ 
Sbjct: 83  GRPVFLSIGYSTCHWCHVMEAESFEDEAIARFLNRHYVAIKVDREERPDIDAVYMSAVQQ 142

Query: 198 LYGGGGWPLSVFLSPDLKPLMGGTYFPPED--KYGRPGFKTILRKVKDAWDKKRDMLAQS 255
           L G GGWP+SV+L+   +P  GGTYFPP D  + G+ GF  +L  + D + +  + + Q+
Sbjct: 143 LTGAGGWPMSVWLTAAREPFFGGTYFPPRDGGRDGQRGFLPLLGALSDTFHRDPERVGQA 202

Query: 256 GAFAIEQLSEALSASASSNKLPDE--LPQ-NALRLCAEQLSKSYDSRFGGFGSAPKFPRP 312
               +E +   +  +  +        LP  + +        +S+D+R GG   APKFP  
Sbjct: 203 CTALVEAIRHDMQGAYGTGGADAAIGLPAGDVIDATVAHYRQSFDARHGGLSRAPKFPSH 262

Query: 313 VEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHV 372
           + ++++L + ++  D       ++  +M   TL+ MA GG++D +GGGFHRYS D RW V
Sbjct: 263 IPVRLLLRYHQRTGD-------ADALRMATLTLEKMAAGGLYDQLGGGFHRYSTDVRWLV 315

Query: 373 PHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSA 432
           PHFEKMLYD   L   Y +AF +T    ++ + R+  DY+ R+M   GG  +SA DADS 
Sbjct: 316 PHFEKMLYDNALLVVAYAEAFQVTDRADFARVARETCDYILREMTDAGGGFYSATDADS- 374

Query: 433 ETEGATRKKEGAFYVWTSKEVE---DILGEHAIL--FKEHYYLKPTGNCDLSRMSDPHNE 487
             EG    +EG F+VW   E+    D LG+      F  HY + P GN            
Sbjct: 375 --EG----EEGRFFVWREDEIRRELDALGDGDTTEHFLAHYDVHPGGN------------ 416

Query: 488 FKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNG 547
           ++G  +L            +  P E     L   R +L+ VR++R  P  D+K++  WNG
Sbjct: 417 WEGHTIL-----------NVPRPDEAAWEALAAARARLYAVRARRTPPLRDEKILAGWNG 465

Query: 548 LVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS 607
           L+IS+ A A ++L                D   Y+  A  AA F+  HL       L+ S
Sbjct: 466 LMISALAVAGRVL----------------DAPRYVAAAVRAADFVLTHLRGADGG-LRRS 508

Query: 608 FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 667
           F++G ++   FLDD+AFL +GL+DLYE     + L  A+ L  T + LF D   G +F +
Sbjct: 509 FKDGQARQAAFLDDHAFLAAGLIDLYEATFDVRHLRDALALAETTEHLFAD-PAGAWFMS 567

Query: 668 TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 727
           +    S++ R K  +DGAEPSG SV+++N +RL  +   +  + +RQ AE  L      L
Sbjct: 568 SEAHESLIAREKPAYDGAEPSGTSVALLNALRLGVL---TDDERWRQIAERGLRAHARVL 624

Query: 728 KDMAMAVPLMCCAADMLSVPSRKHVVL 754
            +  +A+     A D L+   R+  V+
Sbjct: 625 GERPIAMTEALLAVDFLATTPRQIAVV 651


>gi|448321193|ref|ZP_21510673.1| hypothetical protein C491_09424 [Natronococcus amylolyticus DSM
           10524]
 gi|445604053|gb|ELY58004.1| hypothetical protein C491_09424 [Natronococcus amylolyticus DSM
           10524]
          Length = 724

 Score =  421 bits (1082), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 249/645 (38%), Positives = 346/645 (53%), Gaps = 41/645 (6%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W E A   AR++D PIFLSIGYS CHWCHVME ESF 
Sbjct: 8   NRLDEEESPYLRQHADNPVNWQPWDERALESAREQDKPIFLSIGYSACHWCHVMEEESFA 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA LLN+ F+ IKVDREERPDVD +YMT  Q + GGGGWPLS +L+P+ KP   GTY
Sbjct: 68  DEEVADLLNEEFIPIKVDREERPDVDSIYMTVCQLVSGGGGWPLSAWLTPEGKPFYVGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP   K G+PGF  +L  + D+W+  R+ +            + L  +  S    +    
Sbjct: 128 FPKRSKRGQPGFLDLLEGLADSWETDREEIESRADEWTAAARDQLEETPDSIGAAEPPSS 187

Query: 283 NALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           + L   A+   +S D + GGFGS  PKFP+P  ++++   ++  + TG+     E ++++
Sbjct: 188 DVLERAADAALRSADRQNGGFGSGGPKFPQPARLRVL---ARAYDRTGR----DEYREVL 240

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             +L  M +GG++DHVGGGFHRY VD  W VPHFEKMLYD  ++    L  + LT D  Y
Sbjct: 241 EGSLTAMIEGGLYDHVGGGFHRYCVDADWTVPHFEKMLYDNAEIPRALLAGYRLTGDERY 300

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH- 460
           +   R+ L+++ R++    G  FS  DA S + E   R +EGAF+VWT  EV ++LG+  
Sbjct: 301 AGYVRETLEFVSRELTHDEGGFFSTLDAQSEDPETGER-EEGAFFVWTPAEVREVLGDET 359

Query: 461 -AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
            A LF   Y +  +GN            F+G++        S  A +  +   +    L 
Sbjct: 360 DADLFCARYDITESGN------------FEGQSQPNLAASISELADRFDLEEREVEERLE 407

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
             R+KLF+ R +RPRP+ D+KV+  WNGL+IS+ A A+  L              G DR 
Sbjct: 408 SARQKLFEAREERPRPNRDEKVLAGWNGLMISTCAEAALAL--------------GEDR- 452

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
            Y E+A  A  F+R  L+D    RL   +++G     G L+DYAFL  G L  YE     
Sbjct: 453 -YAEMATDALEFVRDRLWDADEGRLSRRYKDGDVAVQGNLEDYAFLARGALGCYEATGEV 511

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
             L +A+EL    +  F D E    + T     S++ R +E  D + P+   V+V  L+ 
Sbjct: 512 DHLAFALELARGIEAEFYDAERETLYFTPESGESLVTRPQELTDQSTPAAAGVAVETLLA 571

Query: 700 LASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 744
           L       + D +   A   L     RL+  A+    +C AAD L
Sbjct: 572 LEGFA--DEDDEFEGIAASVLGTHAGRLESNALQHVTLCLAADRL 614


>gi|456865795|gb|EMF84112.1| PF03190 family protein [Leptospira weilii serovar Topaz str.
           LT2116]
          Length = 716

 Score =  421 bits (1081), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 267/712 (37%), Positives = 378/712 (53%), Gaps = 61/712 (8%)

Query: 70  LAVISHRPIHPYKVVAMAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEE 129
           + ++  R I   + +       +++    ++  NRL+ E S YL QHAHNPVDWF WGEE
Sbjct: 1   MDMVGIRKIFRNRKIDFMSLKESNSMQFSSRSPNRLSKEKSLYLQQHAHNPVDWFPWGEE 60

Query: 130 AFAEARKRDVPIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDK 189
           A  +AR++D  IFLSIGY+TCHWCHVME ESFE++ VA  LN  FVSIKVDREERPD+D+
Sbjct: 61  ALTKAREQDKLIFLSIGYATCHWCHVMEKESFENQMVADYLNSHFVSIKVDREERPDIDR 120

Query: 190 VYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKR 249
           +YM  + A+   GGWPL++FL+PD KP+ GGTYFPPE +YGR  F  IL  ++  W +KR
Sbjct: 121 IYMDALHAMDQQGGWPLNMFLTPDGKPITGGTYFPPEPRYGRKSFLEILNILRKVWSEKR 180

Query: 250 DMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS--AP 307
             L  + +     L ++    A   ++     +N            YD+ FGGF +    
Sbjct: 181 QELIVASSELSRYLKDSGEGRAIEKQVGSLPSENCFDSGFSLYESYYDAEFGGFKTNHVN 240

Query: 308 KFPRPVEIQMML--YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYS 365
           KFP  + +  +L  YHS        SG      +MV  TL  M +GGI+D +GGG  RYS
Sbjct: 241 KFPPSMGLSFLLRYYHS--------SGNP-RALEMVENTLLAMKQGGIYDQIGGGLCRYS 291

Query: 366 VDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFS 425
            D  W VPHFEKMLYD        ++   ++K +       D++ YL RDM   GG I S
Sbjct: 292 TDHHWMVPHFEKMLYDNSLFLETLVECSQVSKKISAKSFALDVISYLHRDMRIVGGGICS 351

Query: 426 AEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPH 485
           AEDADS   EG    +EG FY+W  +E  ++ GE + + ++ + +   GN          
Sbjct: 352 AEDADS---EG----EEGLFYIWDFEEFREVCGEDSQILEKFWNVTKKGN---------- 394

Query: 486 NEFKGKNVLIELNDSSASASKLGMPLEKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVS 544
             F+GKN+L E     + A+K      K ++ +L   R KL + RSKR RP  DDK++ S
Sbjct: 395 --FEGKNILHE--SYRSEATKFSEEEWKRIDSVLERGRAKLLERRSKRVRPLRDDKILTS 450

Query: 545 WNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRL 604
           WNGL I + A+A                 V   R++++++AE   SFI ++L D    R+
Sbjct: 451 WNGLYIKALAKAG----------------VAFQREDFLKLAEETYSFIEKNLIDPNG-RI 493

Query: 605 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 664
              FR+  S   G+ +DYA +IS  + L+E G G ++L  A+        LF  R   G 
Sbjct: 494 LRRFRDNESGILGYSNDYAEMISSSIALFEAGCGIRYLKNAVLWMEEAIRLF--RSPAGV 551

Query: 665 FNTTGEDPSVLLRVKED-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 723
           F  TG D  VLLR   D +DG EPS NS    +LV+L+  + G  S  Y + AE     F
Sbjct: 552 FFDTGNDGEVLLRRSVDGYDGVEPSANSSLAYSLVKLS--LLGIDSARYGEFAESIFLYF 609

Query: 724 ETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDF-ENMLAAAHASY 774
              L   +++ P +  A       S K +VL+  +   DF +++LAA    +
Sbjct: 610 TKELSTNSLSYPHLLSAYWTYRRHS-KEIVLI--RKDTDFGKDLLAAIQTRF 658


>gi|386856660|ref|YP_006260837.1| hypothetical protein DGo_CA1452 [Deinococcus gobiensis I-0]
 gi|380000189|gb|AFD25379.1| hypothetical protein DGo_CA1452 [Deinococcus gobiensis I-0]
          Length = 680

 Score =  421 bits (1081), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 240/590 (40%), Positives = 317/590 (53%), Gaps = 46/590 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYL QHA NPVDW+ W  EAF EAR+RDVP+ LS+GYSTCHWCHVM  ESFE
Sbjct: 2   NRLAQESSPYLRQHAENPVDWWPWSPEAFEEARRRDVPVLLSVGYSTCHWCHVMAHESFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE  A  +N  FV+IKVDREERPD+D VYM   QAL G GGWP++VFL+PD +P   GTY
Sbjct: 62  DEATAAQMNAGFVNIKVDREERPDIDAVYMAATQALTGQGGWPMTVFLTPDAEPFYAGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRD-MLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           FPP +  G P F  +L  V  AW  +RD ML  +     + L+  +  +++  +  D LP
Sbjct: 122 FPPREGLGMPSFGRVLGSVSGAWTTQRDKMLGNA-----QALTAHIQEASAPRRGEDPLP 176

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
             A  L  E L + YD+  GGFG APKFP P  +  +L  S              G+ M 
Sbjct: 177 DGATGLAVEHLRRVYDADLGGFGGAPKFPSPATLDFLLTQSA-------------GRDMA 223

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
           L TL+ M  GGIHD +GGGFHRYSVD +W VPHFEKMLYD  QLA   L AF ++ D  +
Sbjct: 224 LHTLRRMGAGGIHDQLGGGFHRYSVDAQWLVPHFEKMLYDNAQLARTLLRAFQVSGDGAF 283

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHA 461
           + + R  L YL R+M+   G  FSA+DAD+    G     EG  + WT  E+ ++LG   
Sbjct: 284 ADLARTTLGYLEREMLSAEGGFFSAQDADTPTDHGGV---EGLTFTWTPAEIREVLGAGG 340

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHN-EFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
                   L+  G  +     DPH  E+  +NVL      S     LG  +   L     
Sbjct: 341 ---DTDLALRAYGVTEEGNFLDPHRPEYGRRNVLHLPTPVSQLTRDLGPDVPTRLEAARA 397

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
                   R++   P  DDKV+ SWNGL +++FA A+++L                   +
Sbjct: 398 HLLAARQARTQ---PGTDDKVLTSWNGLALAAFADAARVLGD----------------TQ 438

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
            +EVA   A F+RR L       L+H++++G ++  G L+D+     GL+ L++ G    
Sbjct: 439 LLEVARRNADFVRRELRLPDG-TLRHTYKDGQARVEGLLEDHVLYALGLVALFQAGGDLA 497

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGN 690
            L WA EL       F D E G + +  G   ++L R  +  D A  S N
Sbjct: 498 HLHWARELWTVVRRDFWDAEAGVFHSAGGRAETLLTRQAQGFDSAILSDN 547


>gi|114778919|ref|ZP_01453713.1| hypothetical protein SPV1_12250 [Mariprofundus ferrooxydans PV-1]
 gi|114550835|gb|EAU53402.1| hypothetical protein SPV1_12250 [Mariprofundus ferrooxydans PV-1]
          Length = 685

 Score =  421 bits (1081), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 260/685 (37%), Positives = 350/685 (51%), Gaps = 65/685 (9%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
            + +N L  E SPYLLQHAHNPV+W  WGEEAFA AR +D PIFLSIGYSTCHWCHVME 
Sbjct: 13  TEKSNALIHESSPYLLQHAHNPVNWLPWGEEAFALARMQDKPIFLSIGYSTCHWCHVMEH 72

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFED  VA++LN +F++IKVDREERPD+D VYM   Q +   GGWPL++ L+PD KP  
Sbjct: 73  ESFEDPQVAEVLNRYFIAIKVDREERPDIDAVYMHAAQLMNVSGGWPLNLLLTPDKKPFY 132

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
             TY P E ++GR G   + ++V   W + R  +  S       L++++ A A +  +  
Sbjct: 133 AATYLPKEGRFGRMGLIELAQRVGVMWKQDRQRIEASANSISSALTDSI-AVAKTGAMDM 191

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
            L   A R  A++    +D   GGFG AP FP P  +  +L +       G   +  +  
Sbjct: 192 ALVDAAYRDTAQR----FDKGSGGFGGAPLFPSPQRLLFLLRY-------GILKDQPQAL 240

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
            MV  +L  M +GGIHD +GGGFHRYS D  W +PHFEKML DQ  L   Y + +  T D
Sbjct: 241 TMVKESLTAMQRGGIHDQLGGGFHRYSTDAHWLLPHFEKMLSDQAMLMMAYAEGWKATGD 300

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             ++   RD  +YL RDM       ++AEDADS   EG    +EG FY+W++ E+   LG
Sbjct: 301 ASFAATARDTAEYLLRDMRDKQDGFYTAEDADS---EG----EEGRFYLWSADEIRHALG 353

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
             A  F + Y ++  GN       +  +E  G N+L    +   +A              
Sbjct: 354 RRADAFMQAYGVEADGNFS----DEASHEKTGANILHRTGEMDPAA-------------F 396

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
              R KL   R+KR RP  DDKV+  WNGL I++ A   +IL                D 
Sbjct: 397 AAEREKLLASRAKRVRPFRDDKVLADWNGLTIAALAITGRIL----------------DE 440

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
             Y+E A  AA FI  +L  +    L H +R G +   G LDDY  ++ GL +LYE    
Sbjct: 441 PRYIEAATKAADFILHNLRRDDGS-LLHRWRRGEAGIAGQLDDYTDMVWGLTELYEATFD 499

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
            +WL  A+ L +     F   EGGG++     D  ++ R  +  DGA PSGN+V++ NL+
Sbjct: 500 ARWLKQALALNHIMLSRF-KAEGGGFYQVERSD-DLIARPMQGFDGALPSGNAVAMHNLL 557

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR---KHVVLV 755
           RL+ +   +             A       DMA   P          + +    K VVLV
Sbjct: 558 RLSRLTGDAAL-------AKQAAAVAGHFSDMAEQAPSGLLHLLSAELLAESPGKEVVLV 610

Query: 756 GHKSSVDFENMLAAAHASYDLNKTV 780
           G +SS     MLA  H  Y  N  V
Sbjct: 611 GDRSSAGAGAMLAVLHERYRPNTVV 635


>gi|373849972|ref|ZP_09592773.1| hypothetical protein Opit5DRAFT_0827 [Opitutaceae bacterium TAV5]
 gi|372476137|gb|EHP36146.1| hypothetical protein Opit5DRAFT_0827 [Opitutaceae bacterium TAV5]
          Length = 785

 Score =  420 bits (1080), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 252/704 (35%), Positives = 371/704 (52%), Gaps = 66/704 (9%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRLA   SPYL QHA +PVDW  W ++  A AR+ + P+FLS GYSTCHWCHVM  E+F
Sbjct: 66  ANRLADAASPYLRQHADDPVDWQPWNDDTLARARRENRPVFLSSGYSTCHWCHVMRRETF 125

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
               VA  LN+ F+ +K+DREERPD+D++Y+ +V    G GGWPL+V+L+PDLKP +GGT
Sbjct: 126 SRADVAAFLNEHFIPVKLDREERPDIDRIYLAFVAGTTGRGGWPLNVWLTPDLKPFLGGT 185

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE-- 279
           Y+PPED+ G+PGF T+ R   + W + R+ +A            A  AS +    PD+  
Sbjct: 186 YYPPEDQPGQPGFLTVARVAAEGWARDREKVAAH-----ADRIAAALASLAGAAGPDQRS 240

Query: 280 -------LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSG 332
                  +   A    A QL + +D   GGFG   KFP   +I+ +   +  ++    +G
Sbjct: 241 GRSGAATIDNAAWSAAAAQLFEEFDPEHGGFGRDAKFPHASKIRFLFRFA--VQPGVPAG 298

Query: 333 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 392
           EA+  +++   +L+ +  GG+ DH+GGGFHRY+VD  W +PHFEKMLYDQ  +A + +DA
Sbjct: 299 EAARAREVAFASLEALTGGGLRDHLGGGFHRYTVDRGWRLPHFEKMLYDQALVAGLLVDA 358

Query: 393 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGA-TRKKEGAFYVWTSK 451
           + L+ D     + R+ L ++   +  P G  ++A DA+SA    A   K EGAFY W+  
Sbjct: 359 YQLSGDTRRFDLLRETLAFVEAALTSPDGAFYAALDAESALPGAAEGDKAEGAFYTWSLD 418

Query: 452 EVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL--------IELNDSSA 502
           E+   L  + A L    Y     GN   + + +       +NVL          +  +  
Sbjct: 419 EITAALPPDEAALVIARYGFTAEGNA--TSLEERAGVLHNRNVLVPASSAAATAVTKAPG 476

Query: 503 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 562
           +A KL   L+           +L  +RS R  P  D+K+I +WNG +IS+ ARA +    
Sbjct: 477 AAEKLSRALD-----------RLRAIRSTRQPPARDEKIITAWNGYMISALARAHQ---- 521

Query: 563 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 622
                     V G  R  ++++A  AA+ + +  ++ +T  L+      P    GF +DY
Sbjct: 522 ----------VTGESR--WLDLATRAATHLWQTAWNGKTATLRRI--AAPGGGDGFAEDY 567

Query: 623 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDRE-----GGGYFNTTGEDPSVLLR 677
           A  I GLLDLYE G   +WL  A+ LQ T D  F D       GGGYF T      VL+R
Sbjct: 568 AAFIQGLLDLYEAGFDPRWLDRALALQATLDTRFADPAPASAGGGGYFGTAAGASGVLVR 627

Query: 678 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 737
           +KED DGAEP+ +S++  NL RLA     +    Y   A   LA F  + +    A+P++
Sbjct: 628 MKEDFDGAEPAASSLAADNLRRLAVFTGDAA---YEHRARAVLAAFAPQHRRAPAAMPVL 684

Query: 738 CCAADMLSVPSR-KHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
             AA  L+  ++ + +V+ G   + D   +LA A   +    T+
Sbjct: 685 LAAAFGLAEGAKPRQIVIAGRAGADDTRALLAEARRRFQPFATI 728


>gi|291295832|ref|YP_003507230.1| hypothetical protein [Meiothermus ruber DSM 1279]
 gi|290470791|gb|ADD28210.1| protein of unknown function DUF255 [Meiothermus ruber DSM 1279]
          Length = 672

 Score =  420 bits (1079), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 255/643 (39%), Positives = 350/643 (54%), Gaps = 62/643 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYLLQHAHNPVDW+ WGEEAFA+AR  + PIFLS+GY+TCHWCHVME ESFE
Sbjct: 3   NRLAKESSPYLLQHAHNPVDWYPWGEEAFAKARAENKPIFLSVGYATCHWCHVMERESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  VA+ LN  FV IKVDREERPDVD+VYM+ +QA+ G GGWP+++FL PDL+P  GGTY
Sbjct: 63  DPEVAQFLNAHFVPIKVDREERPDVDQVYMSALQAMTGSGGWPMNMFLMPDLRPFFGGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           +PPED+ G P F+ +L  V +AW  ++  + ++       L + L     +  LPD+L  
Sbjct: 123 WPPEDRQGFPSFRRVLAGVHNAWLHQQKEVLENAEQLTTYLQDQLKPRGGA--LPDDLHS 180

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
            AL      LS+ +D   GGFG APKFP+   +  +L  +    +           K + 
Sbjct: 181 TAL----AGLSRIFDPAHGGFGGAPKFPQSPALGYLLTQAWLGHEA--------AWKHLQ 228

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA-----FSLTK 397
            TL  MA+GG++D VGGGFHRY+VD  W VPHFEKMLYD  QLA +Y  A      SL +
Sbjct: 229 LTLDRMAEGGLYDQVGGGFHRYTVDHIWRVPHFEKMLYDNAQLARLYAAASRMPQASLEQ 288

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
              Y  I ++ LDY+ R++ GP G  +SA+DADS   EG     EG FYVW ++E   +L
Sbjct: 289 ARRYQRIAQETLDYVLRELTGPEGGFWSAQDADS---EGV----EGKFYVWQAEEFRRVL 341

Query: 458 GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
           G  A      + +   GN            ++  NVL      +A    LG+  E +   
Sbjct: 342 GAEAEAAMLLFGVSEAGN------------WEHTNVLERRIPDAALMQHLGLGPEAFERW 389

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
           +   R +L+  R +R  P  DDKV+  WNGL++ + A   + L                +
Sbjct: 390 VQSVRHRLYAARQQRTPPLTDDKVLADWNGLMLRALADVGRWL----------------E 433

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGS 637
              Y+E A   A+F+ + +Y +    L+HS+R G  K   +L D A    GLL L+E   
Sbjct: 434 EPRYIEAARKNAAFVMQEMYRDGL--LRHSWRQGQLKPQAYLSDQAHYGLGLLALFEATG 491

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
              WL  A +L       F  +E  G F  +  D ++ +   + +DG  PSGN+V+   L
Sbjct: 492 EVGWLEGARQLAEAILTHF--KEPTGAFRDS-LDQTLPVVALDAYDGPYPSGNAVAAELL 548

Query: 698 VRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 740
            RLA++    + D++ Q A  ++     RL   A   P M  A
Sbjct: 549 FRLAALY--ERPDWH-QAALTTVESNAQRLLHNAFGFPAMLQA 588


>gi|110638981|ref|YP_679190.1| hypothetical protein CHU_2595 [Cytophaga hutchinsonii ATCC 33406]
 gi|110281662|gb|ABG59848.1| conserved hypothetical protein; thioredoxin domain [Cytophaga
           hutchinsonii ATCC 33406]
          Length = 681

 Score =  420 bits (1079), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 237/610 (38%), Positives = 338/610 (55%), Gaps = 49/610 (8%)

Query: 95  SHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCH 154
           S++++ HTNRLA+E SPYLLQHAHNPV+WF WGEEA  +A+  D PI +SIGYS CHWCH
Sbjct: 3   SYTKHTHTNRLASESSPYLLQHAHNPVEWFPWGEEALQKAKAEDKPILVSIGYSACHWCH 62

Query: 155 VMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDL 214
           VME E FE E VA ++ND F++IK+DREERPD+D++YM  V A+   GGWPL+VFL+PD 
Sbjct: 63  VMEHECFEKEEVAAVMNDLFINIKIDREERPDLDQIYMDAVSAMGLRGGWPLNVFLTPDA 122

Query: 215 KPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSN 274
           KP  GGTYFP +       +  +L ++ +A+   R+ + +S     E L+++        
Sbjct: 123 KPFYGGTYFPQDH------WLNLLGQISNAYLNHREDILKSAESFTESLNQSDVFKYGLV 176

Query: 275 KLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEA 334
              +   ++ L L  +++S+ +D+  GG   APKFP P    + LY  +    TG+ G  
Sbjct: 177 DDAETFHKDELDLAYDRISQQFDTDMGGMNKAPKFPMP---SIYLYLLRDYALTGRQGSL 233

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
               + V  TL  MA GGI+D +GGGF RYSVD  W  PHFEKMLYD GQL ++Y +A++
Sbjct: 234 ----QHVELTLDKMAMGGIYDTIGGGFARYSVDGAWFAPHFEKMLYDNGQLLSLYSEAYT 289

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
           +TK   Y  +  +   +L+R+M+ P G  +SA DADS   EG     EG FY W  +E+ 
Sbjct: 290 VTKKPLYKEVIEETYTWLKREMLSPEGGFYSALDADS---EGV----EGKFYCWQYEELA 342

Query: 455 DILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 514
            ++ E   LF  +Y +   GN +            G N+L +     A A+   +  E  
Sbjct: 343 QLIQEDFALFCAYYAITENGNWE-----------HGMNILYKRMSDEAFAAAHSISAEAL 391

Query: 515 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 574
              +   +  LF  R  R  P LDDK++ SWNG+++     A +IL    ++A+ N  ++
Sbjct: 392 RESVSRWKNILFSERDPREHPGLDDKILASWNGIMLKGLCDAYRIL---GDAAILNTALM 448

Query: 575 GSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE 634
                         A FI   LYD +T  L HS++N  +  PGFL+DY  +I G L LYE
Sbjct: 449 N-------------AEFILTKLYDGKT--LFHSYKNKKATIPGFLEDYTHVIDGYLALYE 493

Query: 635 FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
                +WL  AI L N   + F D + G +F T+     ++ R KE  D   P+ NS   
Sbjct: 494 VSLDEQWLRQAITLVNHVIDHFYDDDEGLFFYTSRTSEKLIARKKEIFDNVIPASNSSLA 553

Query: 695 INLVRLASIV 704
            NL  L  ++
Sbjct: 554 RNLYHLGKLL 563


>gi|150400057|ref|YP_001323824.1| hypothetical protein Mevan_1315 [Methanococcus vannielii SB]
 gi|150012760|gb|ABR55212.1| protein of unknown function DUF255 [Methanococcus vannielii SB]
          Length = 687

 Score =  419 bits (1077), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 238/603 (39%), Positives = 341/603 (56%), Gaps = 45/603 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPVDW+ WGEEAF +A+  + PIFLSIGYSTCHWCHVM  +SFE
Sbjct: 4   NRLINEKSPYLKQHAKNPVDWYPWGEEAFKKAKLENKPIFLSIGYSTCHWCHVMAKDSFE 63

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  VA  LN  F+SIKVDREERPD+D +Y+   Q + G GGWPL++ ++PD KP    T+
Sbjct: 64  DFDVADTLNKNFISIKVDREERPDLDDIYLKTCQLMTGSGGWPLTIIMTPDKKPFFAATF 123

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
              E ++G PG   +L  + + W  K D + +     +  L E +S + S  KL ++L +
Sbjct: 124 ISKEPRFGSPGIIDLLEGISELWAIKHDEIVKRSDEILIHL-ENISKTTSKGKLDEKLLE 182

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
            A      QL + YD  +GGFG  PKFP    I  ++ + KK   TG      E  +M +
Sbjct: 183 KAFL----QLKEIYDKNYGGFG-VPKFPTAHLIIFLIKYWKK---TGN----DEALEMAI 230

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  M  GGI+DH+  GFHRY+VDE W +PHFEKMLYDQ  ++  YL+++  T++  + 
Sbjct: 231 KTLDKMKMGGIYDHISYGFHRYAVDEMWKLPHFEKMLYDQALISMAYLESYRATRNEEHK 290

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL-GEHA 461
            I  ++ +Y+ + +  P    +SAE+   AE+EG     EG FY W   E++ IL     
Sbjct: 291 KIVSEVFEYVLKVLKSPEKAFYSAEN---AESEGI----EGKFYTWNITEIDQILRNSEN 343

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
            +FK+ Y +KP GN  L   ++  N   G N+L         AS++ M  E+   IL + 
Sbjct: 344 NIFKKVYNIKPEGNY-LGESTEATN---GTNILYMERSIQEIASEMEMWPEEVDQILEKA 399

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           R+KL D    R RP  D K++  WNGL+I+S ++A +I K+E                EY
Sbjct: 400 RKKLLDALENRKRPSKDYKILADWNGLMIASLSKAGRIFKNE----------------EY 443

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
           ++ +E A SF+   +   +  +L HS+     K PGFLDDYAF+  GL++LY      ++
Sbjct: 444 IKASEDAMSFLLSKMVINE--KLYHSYIENELKVPGFLDDYAFITWGLIELYFATFNIEY 501

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           L  A +      ELF   E GG+   + E    + +V+  +DGA PSG S+  +NL++L+
Sbjct: 502 LKKARDFAEKTLELFW--EDGGFNFASKEVNDNIFKVRNIYDGAIPSGTSIMALNLLKLS 559

Query: 702 SIV 704
            I+
Sbjct: 560 HIL 562


>gi|448373972|ref|ZP_21557857.1| hypothetical protein C479_01326 [Halovivax asiaticus JCM 14624]
 gi|445660649|gb|ELZ13444.1| hypothetical protein C479_01326 [Halovivax asiaticus JCM 14624]
          Length = 760

 Score =  419 bits (1077), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 245/635 (38%), Positives = 332/635 (52%), Gaps = 53/635 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W E A + A++RD PIFLSIGYS CHWCHVME ESF 
Sbjct: 8   NRLDEEASPYLRQHADNPVNWQPWDERARSAAQERDRPIFLSIGYSACHWCHVMEAESFA 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA +LN+ FV IKVDREERPDVD +YMT  QA+ G GGWPLS +L+PD +P   GTY
Sbjct: 68  DETVATVLNEGFVPIKVDREERPDVDSIYMTVCQAVTGRGGWPLSAWLTPDGRPFYVGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSG----AFAIEQLSEALSASASSNKLP- 277
           FP E + G PGF  + R+++ +W + RD +        A A ++L  A +A   S+  P 
Sbjct: 128 FPREAQRGTPGFLELCRQIRVSWSENRDEIESRADEWTAMAADRLDSAAAAGNESSSTPA 187

Query: 278 --------------DELPQNALRLCAEQLSKSYDSRFGGFG-SAPKFPRPVEIQMMLYHS 322
                         D    +AL    E   ++ D   GGFG   PKFP+P  ++ +L   
Sbjct: 188 PISADTGSPIDGGLDADGPDALERVGEAALRASDDEHGGFGRGGPKFPQPRRVESLL--- 244

Query: 323 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 382
            +L+    + +    ++     L  M  GG++DHVGGGFHRY VDE W VPHFEKMLYD 
Sbjct: 245 -RLD---AAHDRPNARETATRALDAMCSGGLYDHVGGGFHRYCVDEDWTVPHFEKMLYDN 300

Query: 383 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 442
             +    L  + +T D  Y+   R+ +D+L R++  P G  +S  DA S ETE   R +E
Sbjct: 301 AAIPRALLAGYQVTGDDRYARTVRETVDFLERELRHPEGGFYSTLDAQS-ETESGER-EE 358

Query: 443 GAFYVWTSKEVEDILGEHAI------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE 496
           GAFYVWT  E+E  + E  +      LF   + +  +GN            F+G  VL  
Sbjct: 359 GAFYVWTPAEIESAVAEAGLSDESGALFCNRFGVTDSGN------------FEGSTVLTV 406

Query: 497 LNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARA 556
                  A+  G+      + L   R  +F+ R+ RPRP  D+K++  WNGL I   A A
Sbjct: 407 EASIEDLATDYGLAPSTVEDRLDAARTAVFEARATRPRPPRDEKILAGWNGLAIDMLAEA 466

Query: 557 SKILKSEAESAMFNFPVVG------SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRN 610
           S +L +    A  N    G      S    Y ++A  A +F+R +L+D+ T RL    R+
Sbjct: 467 SIVLGTSGREAATNAASAGGASDGPSGDDRYAQLATDALAFVRTNLWDDDTGRLARRVRD 526

Query: 611 GPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGE 670
           G     G+L+DYAFL  G L  YE     + L +A++L       F D      + T   
Sbjct: 527 GDVGIDGYLEDYAFLARGALTCYEATGEVEPLAFALDLARAIRRDFWDESAETLYFTPER 586

Query: 671 DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVA 705
             S+L+R +E  D + PS   V+V  L  L    A
Sbjct: 587 GESLLVRPQELGDQSTPSPTGVAVEILAMLDPFTA 621


>gi|375150037|ref|YP_005012478.1| hypothetical protein [Niastella koreensis GR20-10]
 gi|361064083|gb|AEW03075.1| hypothetical protein Niako_6853 [Niastella koreensis GR20-10]
          Length = 685

 Score =  419 bits (1077), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 235/619 (37%), Positives = 332/619 (53%), Gaps = 69/619 (11%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           KHTNRLA E SPYLLQHAHNPVDW+ WG EA   A+K D P+ +SIGY+ CHWCHVME E
Sbjct: 3   KHTNRLAEETSPYLLQHAHNPVDWYPWGNEALDRAKKEDKPLLVSIGYAACHWCHVMEKE 62

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFE+E  A ++N  F+++K+DREERPD+D +YM  VQA+ G GGWPL++FL+PD +P  G
Sbjct: 63  SFENEETASMMNAHFINVKIDREERPDLDHIYMDAVQAMTGSGGWPLNIFLTPDGRPFYG 122

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRD-----------MLAQSGAFAIEQLSEALS 268
           GTYFPP+  Y RP +  +L  V +AW +KRD            + QS +F  + +   ++
Sbjct: 123 GTYFPPKAIYNRPSWHDVLTGVANAWTEKRDDIDAQATNLTGHIVQSNSFGQQAVEGDIN 182

Query: 269 ASAS-SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 327
             A  S ++ D +  N +         + D   GGFGSAPKFP+   I  +L +  K  +
Sbjct: 183 MDALFSKEIADTMFNNIM--------GTADKEEGGFGSAPKFPQTFTIGYLLRYYHKTGN 234

Query: 328 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 387
                +A         +L  M +GG++DH+GGGF RYS D  W VPHFEKMLYD   L +
Sbjct: 235 EQALAQAC-------LSLDKMIRGGLYDHLGGGFARYSTDREWLVPHFEKMLYDNALLVS 287

Query: 388 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 447
           V  DA+ LT+   Y     + L ++ R++  P    +SA DADS   EG     EG FYV
Sbjct: 288 VLCDAWQLTQQPLYKQAVEETLAFVERELHSPEKGFYSALDADS---EGV----EGKFYV 340

Query: 448 WTSKEVEDILGEHAILFKEHYYLKPTGN---CDLSRMSDPHNEFKGKNVLIELNDSSASA 504
           W+  E+E IL + A +F   Y +   GN    ++  +  P  +F   N            
Sbjct: 341 WSKPEIEAILQQDAAVFCAFYDVTEGGNWEHTNILNIRKPLKQFAADN------------ 388

Query: 505 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 564
               +P  +   +L + R KL   R+ R RP LDDK+++ WN L+ +++++A  +     
Sbjct: 389 ---NIPEARLQELLQQGREKLLQHRAGRIRPQLDDKILLGWNALMNTAYSKAYSV----- 440

Query: 565 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 624
               F  P       +Y EVAE    FI    +        H+++   ++ P FLDDYA+
Sbjct: 441 ----FGNP-------QYAEVAEENMKFIMNR-FTRDGLEFFHTYKKEIARYPAFLDDYAY 488

Query: 625 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 684
           LI  L+ L E      +L  A  L     + F +   G +F T      V++R KE +DG
Sbjct: 489 LIQALIHLQEITGKAAYLYKAKALTQQVIDQFSEEGTGYFFYTHQGQQDVIVRKKEVYDG 548

Query: 685 AEPSGNSVSVINLVRLASI 703
           A PSGN++   NL  L  +
Sbjct: 549 AIPSGNAIMAFNLQYLGVV 567


>gi|394994118|ref|ZP_10386849.1| YyaL, partial [Bacillus sp. 916]
 gi|393805058|gb|EJD66446.1| YyaL, partial [Bacillus sp. 916]
          Length = 607

 Score =  419 bits (1077), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 252/657 (38%), Positives = 362/657 (55%), Gaps = 58/657 (8%)

Query: 127 GEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPD 186
           GEEAF +A++ + P+ +SIGYSTCHWCHVM  ESFEDE +A +LND F++IKVDREERPD
Sbjct: 2   GEEAFEKAKRENKPVLISIGYSTCHWCHVMAHESFEDEEIADMLNDKFIAIKVDREERPD 61

Query: 187 VDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWD 246
           VD VYM   Q + G GGWPL+VF++PD KP   GTYFP   KY RPGF  +L  + + + 
Sbjct: 62  VDSVYMRICQLMTGQGGWPLNVFVTPDQKPFYAGTYFPKTSKYNRPGFIDVLEHLSETFA 121

Query: 247 KKRDMLAQSGAFAIEQLSEALSASASSNKLPDE--LPQNALRLCAEQLSKSYDSRFGGFG 304
             R          +E ++E  +A       P E  L + A+     QL+  +D+ +GGFG
Sbjct: 122 NDRQ--------HVEDIAENAAAHLEVKVHPAEGMLGEQAVHDTYRQLAGGFDTVYGGFG 173

Query: 305 SAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRY 364
            APKFP P    M+++  +    TGK  +A  G   V  TL  MA GGI DH+G GF RY
Sbjct: 174 QAPKFPMP---HMLMFLLRYYSYTGKE-QALAG---VTKTLDGMANGGIFDHIGFGFARY 226

Query: 365 SVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIF 424
           S D  W VPHFEKMLYD   L   Y +A+ +T +  Y  I   I+ +++R+M+   G  F
Sbjct: 227 STDNEWLVPHFEKMLYDNALLLTAYTEAYQVTGNERYKQIAMQIVTFIQREMMHEDGSFF 286

Query: 425 SAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSD 483
           SA DAD   TEG    +EG +Y+W+ KE+ ++LG E   L+ + Y +   GN +   +  
Sbjct: 287 SALDAD---TEG----REGKYYIWSKKEIMNLLGDELGPLYCKVYNITDQGNFEGENI-- 337

Query: 484 PHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIV 543
           PH  F  +  ++E  ++  +  +L   LE       E R KL + R  R  PH DDKV+ 
Sbjct: 338 PHLIFTRREAILE--ETGLTGHELAERLE-------EARTKLLEARENRSYPHTDDKVLT 388

Query: 544 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHR 603
           SWN L+I+  A+A+K+         F+ P       +++ +AE+A  F+ RHL  +   R
Sbjct: 389 SWNALMIAGLAKAAKV---------FHEP-------DFLSMAETAIRFLERHLMPDA--R 430

Query: 604 LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG 663
           +   +R G  K  GF+DDYAFLI   L+LYE G    +L  A  L  +  ELF D   GG
Sbjct: 431 VMVRYREGEVKNKGFIDDYAFLIWAYLELYEAGFHPSYLQKAKTLCTSMLELFWDERHGG 490

Query: 664 YFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 723
           +F T  +  ++L+R KE +DGA PSGNS + + L+RL  +  G  S    + AE   +VF
Sbjct: 491 FFFTGNDAETLLVREKEVYDGAVPSGNSAAAVQLLRLGRLT-GDIS--LIEKAEAMFSVF 547

Query: 724 ETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
           +  ++    +      +    ++P +K +V+ G K   D +  + A    +    T+
Sbjct: 548 KREIEAYPSSNAFFMQSVLAHTMP-QKEIVVFGRKDDPDRKRFIEALQEHFTPAYTI 603


>gi|300087365|ref|YP_003757887.1| hypothetical protein Dehly_0239 [Dehalogenimonas
           lykanthroporepellens BL-DC-9]
 gi|299527098|gb|ADJ25566.1| protein of unknown function DUF255 [Dehalogenimonas
           lykanthroporepellens BL-DC-9]
          Length = 669

 Score =  418 bits (1075), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 242/593 (40%), Positives = 334/593 (56%), Gaps = 66/593 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L    SPYL QHA NPV+W+ W +EA A A+K + PI LS+GYS CHWCHVM  ESFE
Sbjct: 3   NHLKDAVSPYLRQHADNPVEWYPWADEALARAKKENKPILLSVGYSACHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE  A ++N  F++IKVDREERPD+D +YM  VQA+ G GGWP++VFL+PD KP  GGTY
Sbjct: 63  DEATAAVMNRHFINIKVDREERPDIDSIYMAAVQAMTGHGGWPMTVFLTPDGKPFYGGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           +PPED++G P F  IL  V +A+ ++ D +A +    +  +++     A  + L  EL  
Sbjct: 123 YPPEDRHGLPAFTRILEAVAEAYRERPDEVAATATRLVTAVADKPVGDAGESSLTVELLD 182

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHSKKLEDTGKSGEASEGQKMV 341
            A     + L++ +D    GFG APKFP+P+ +  +L YH +          ++   +MV
Sbjct: 183 RAF----QALTRDFDENHAGFGGAPKFPQPLVLDFLLRYHYRT--------SSARALEMV 230

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             TL+ M +GG++DH+GGGFHRYSVD+ W VPHFEKMLYD   LA VYL AF +T    Y
Sbjct: 231 EKTLEAMYRGGMYDHLGGGFHRYSVDDAWQVPHFEKMLYDNALLARVYLHAFQITGKAQY 290

Query: 402 SYICRDILDYLRRDMIGPGGEIF-SAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-E 459
             +  DILDY+  +M  P    F SA+DADS   EG    +EG +Y+WT  E+E +LG E
Sbjct: 291 RLVTEDILDYVLEEMTDPATSGFYSAQDADS---EG----EEGRYYIWTPDEIESVLGRE 343

Query: 460 HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
            A +F   Y +   GN            F+G+N+L    + S  AS  G+  +       
Sbjct: 344 SAEIFGRRYGVTQAGN------------FEGRNILHLTGEFSVEASA-GVSAD------- 383

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
             R +L   R KR  P  D K++VSWN +   + A A                 V  DR 
Sbjct: 384 --RARLLAERRKRVPPGTDTKILVSWNAMTQLALASAG----------------VALDRP 425

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
           +Y+  AE+ A+F+  +L D  + RL+H+     S A GFL+DYA L   LL L++     
Sbjct: 426 DYLAAAEANAAFLLDNLLD--SGRLRHTV----SVAEGFLEDYALLTESLLALHKATLTP 479

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 692
           +WL  A+ L     ELF D + G +++T  +   +  R +   DGA PSG SV
Sbjct: 480 RWLRQAMALGAAMVELFWDEDEGVFYDTPADAGQLFQRPRNFQDGAVPSGASV 532


>gi|433638443|ref|YP_007284203.1| thioredoxin domain protein [Halovivax ruber XH-70]
 gi|433290247|gb|AGB16070.1| thioredoxin domain protein [Halovivax ruber XH-70]
          Length = 759

 Score =  418 bits (1075), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 250/629 (39%), Positives = 333/629 (52%), Gaps = 42/629 (6%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W E A + A++RD PIFLSIGYS CHWCHVME ESF 
Sbjct: 8   NRLGEEASPYLRQHADNPVNWQPWDERARSAAQERDRPIFLSIGYSACHWCHVMEAESFA 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA +LN+ FV IKVDREERPDVD +YMT  QA+ G GGWPLS +L+PD +P   GTY
Sbjct: 68  DETVAAVLNEGFVPIKVDREERPDVDSIYMTVCQAVTGRGGWPLSAWLTPDGRPFYVGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLA----QSGAFAIEQLSEALSASASSNKLPD 278
           FP E + G PGF  + R+++ +W + RD +     +  A A ++L  A      S   P+
Sbjct: 128 FPREAQRGTPGFVELCRQIRVSWSENRDEIEARANEWAAMATDRLDSA-DGGGESASTPE 186

Query: 279 ELPQ---------------NALRLCAEQLSKSYDSRFGGFG-SAPKFPRPVEIQMMLYHS 322
            +                 + L    E   ++ D   GGFG   PKFP+P  ++ +    
Sbjct: 187 PISADTDSPIDVGLDADGPDGLERVGEAALRASDDEHGGFGRGGPKFPQPRRVEALF--- 243

Query: 323 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 382
            +L+ T     A E        L  M  GG++DHVGGGFHRY VDE W VPHFEKMLYD 
Sbjct: 244 -RLDATHDRPTAHE---TATRALDAMCTGGLYDHVGGGFHRYCVDEDWTVPHFEKMLYDN 299

Query: 383 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 442
             +  V L  + +T D  Y+   R+ +D+L R++  P G  +S  DA S ETE   R +E
Sbjct: 300 AAIPRVLLAGYQVTGDDRYARTVRETVDFLERELRHPEGGFYSTLDAQS-ETESGER-EE 357

Query: 443 GAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 502
           GAFYVWT  E+E  + E A L  E   L     CD   ++D  N F+G  VL        
Sbjct: 358 GAFYVWTPAEIESAVAE-AGLSDESGAL----FCDRFGVTDSGN-FEGSTVLTVEASIED 411

Query: 503 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 562
            A+  G+      + L   R  +F+ R+ RPRP  D+K++  WNGL I   A AS +L +
Sbjct: 412 LATDYGLAPSTVEDRLDAARTAVFEARATRPRPPRDEKILAGWNGLAIDMLAEASIVLGT 471

Query: 563 EAESAMFNFP--VVGSDR----KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP 616
               A  +    V  SD       Y ++A  A +F+R HL+D+ T RL    R+G     
Sbjct: 472 SGREAAIDAASDVASSDEPSGDDRYAQLATDALAFVRTHLWDDDTGRLARRVRDGDVGID 531

Query: 617 GFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 676
           G+L+DYAFL  G L  YE     ++L +A++L       F D      + T     S+L+
Sbjct: 532 GYLEDYAFLARGALTCYEATGEVEFLAFALDLARAIRRDFWDESAETLYFTPERGESLLV 591

Query: 677 RVKEDHDGAEPSGNSVSVINLVRLASIVA 705
           R +E  D + PS   V+V  L  L    A
Sbjct: 592 RPQELGDQSTPSPTGVAVEILALLDPFTA 620


>gi|397780504|ref|YP_006544977.1| hypothetical protein BN140_1338 [Methanoculleus bourgensis MS2]
 gi|396939006|emb|CCJ36261.1| putative protein yyaL [Methanoculleus bourgensis MS2]
          Length = 719

 Score =  418 bits (1074), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 257/696 (36%), Positives = 370/696 (53%), Gaps = 54/696 (7%)

Query: 95  SHSRNKHT--------NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIG 146
           +H R++ T        NRL  E SPYLLQHA+NPVDW+ WGEEAF  A++   PIFLSIG
Sbjct: 4   AHGRDQETSVREESPPNRLIHEQSPYLLQHAYNPVDWYPWGEEAFLRAKEEAKPIFLSIG 63

Query: 147 YSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWP 205
           YS CHWCHVME ESF D  VAKLLND FV IKVDREERPD+D++Y+     L G   GWP
Sbjct: 64  YSACHWCHVMEEESFADPMVAKLLNDVFVCIKVDREERPDIDQIYIDAAHVLSGVAVGWP 123

Query: 206 LSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE 265
           L++F++ D +P    +Y P E +YG  G   ++ ++   W  +R  L Q+G+    ++ E
Sbjct: 124 LTIFMTHDGRPFFAASYIPKESRYGMTGLVDLIPRISRIWQTRRQELEQTGS----RVLE 179

Query: 266 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 325
           AL ++A +     EL +  L    + L + +D   GGFG APKFP P  +  +L +  + 
Sbjct: 180 ALQSAARTPPGESELSEATLDDAYDTLFRLFDGENGGFGDAPKFPAPHNLIFLLRYGHR- 238

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
             TGK    +    MV  TL  M +GGI DH+G GFHRY+ D  W VPHFEKMLYDQ  L
Sbjct: 239 --TGK----TPAYTMVEKTLHAMRRGGIFDHIGWGFHRYTTDAEWLVPHFEKMLYDQALL 292

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
              Y +A+  T    ++   R+ + Y+ R+M  P G  +SAEDADS   EG     EG F
Sbjct: 293 IMAYTEAYLATGREEFARTARETIAYVLREMTDPDGGFYSAEDADS---EGV----EGKF 345

Query: 446 YVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 504
           Y+WT   +  +LGE     F   + +   GN     +  P     G+NVL      ++ A
Sbjct: 346 YIWTKAGILQVLGEEDGERFSRIFGVTEPGNY----LEQPGARRTGQNVLRLRRPLASWA 401

Query: 505 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 564
            +  MP E     + + R++LF  R +R RP  DDK++  WNGL+I++ A A++      
Sbjct: 402 HEFSMPEEDLAWFVEDARQRLFAAREERARPAKDDKILTDWNGLMIAALATAARAF---- 457

Query: 565 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 624
                       D  EY+  AE AA+F+   L      RL H +RNG +     LDDYAF
Sbjct: 458 ------------DDPEYLAAAEKAAAFVLTRLRGPDG-RLLHRYRNGEAGITATLDDYAF 504

Query: 625 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 684
           ++  L+++YE      +L  A++L       + D + GG+F T  +D  + +R K   DG
Sbjct: 505 MLWALIEVYEASFAPGYLRTAVKLARDLSARYWDCDHGGFFFTP-DDVEIAVRQKPVFDG 563

Query: 685 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 744
           A PSGNSV++  L  L  + A  +   + + A     VF   +++  +A        + +
Sbjct: 564 ATPSGNSVAMYALFLLGRMTANLE---FEEMANRIRRVFADTVRESPIAYSYFLTGLEFM 620

Query: 745 SVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
             P+ + V++ G + + D   M+ A  + Y  +  V
Sbjct: 621 LGPNVE-VIISGVRDAEDTRAMIQAIRSRYTPDAVV 655


>gi|410941737|ref|ZP_11373531.1| PF03190 family protein [Leptospira noguchii str. 2006001870]
 gi|410783286|gb|EKR72283.1| PF03190 family protein [Leptospira noguchii str. 2006001870]
          Length = 698

 Score =  417 bits (1073), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 253/651 (38%), Positives = 355/651 (54%), Gaps = 66/651 (10%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           ++  NRL  E SPYL QH++NPVDWF WGEEAF +A+ +D  IFLSIGY+TCHWCHVME 
Sbjct: 13  SRKPNRLLKEKSPYLQQHSYNPVDWFPWGEEAFTKAKDQDKLIFLSIGYATCHWCHVMEK 72

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFE++ +A  LN  FVSIKVDREERPD+D++YM  +  +   GGWPL++FL+P+ KP+ 
Sbjct: 73  ESFENQSIADYLNSHFVSIKVDREERPDIDRIYMDALHEMEQQGGWPLNMFLTPEGKPIT 132

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
           GGTYFPPE KYGR GF  +L  ++  W +KR  L  + +    +LS+ L  SA S     
Sbjct: 133 GGTYFPPESKYGRKGFLEVLNIIQKVWTEKRSELIAAAS----ELSQYLKDSAESKSRAQ 188

Query: 279 E---LPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMMLYHSKKLEDTGKSGE 333
           E      N            YDS+FGGF +    KFP  + +  +L +         S +
Sbjct: 189 ETDFTSANCFDSGFLLYENYYDSQFGGFKTNQVNKFPPNMGLGFLLRYY-------LSSK 241

Query: 334 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
                +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD      +  +  
Sbjct: 242 NPRALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSLFLEILAEYS 301

Query: 394 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
            ++K +       DI+ YL RDM   GG I SAEDADS   EG    +EG FY+W  +E 
Sbjct: 302 LVSKKISAESFALDIVSYLHRDMRMDGGGICSAEDADS---EG----EEGLFYIWDLEEF 354

Query: 454 EDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
            ++ GE + L ++ + +   GN            F+GKN+L E N   ++ ++     E+
Sbjct: 355 REVCGEDSFLLEKFWNVSKEGN------------FEGKNILHE-NFRGSNFTE-----EE 396

Query: 514 YLNILGECRR---KLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
           +  + G   R   KL + RSKR RP  DDK++ SWNGL I +  +               
Sbjct: 397 FKQLDGALLRGKAKLLERRSKRIRPFRDDKILTSWNGLYIKALVKTG------------- 443

Query: 571 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLL 630
              +   R++++++AE   SFI ++L D +  R+   FR G S   G+ +DY+ +I+  +
Sbjct: 444 ---IAFQREDFLKLAEETYSFIEKNLIDSKG-RMLRRFREGESGILGYSNDYSEMIASSI 499

Query: 631 DLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSG 689
            L+E G G ++L  A+        LF  R   G F  TG D  VLLR   D +DG EPS 
Sbjct: 500 VLFEAGRGIRYLRNAVLWMEEVIRLF--RSSAGVFFDTGIDGEVLLRRSVDGYDGVEPSA 557

Query: 690 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 740
           NS    +L++L+ +  G  S+ Y + AE     F   L   A++ P +  A
Sbjct: 558 NSSLAHSLIKLSFL--GVNSERYLEIAESIFVYFRKELYSYALSYPYLLSA 606


>gi|168702337|ref|ZP_02734614.1| hypothetical protein GobsU_22617 [Gemmata obscuriglobus UQM 2246]
          Length = 793

 Score =  417 bits (1073), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 263/657 (40%), Positives = 349/657 (53%), Gaps = 63/657 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYLLQHAHNPVDW+ WG EAF  A+K    IFLSIGYS CHWCHVME ESF 
Sbjct: 40  NRLAKESSPYLLQHAHNPVDWYPWGPEAFERAKKEKKLIFLSIGYSACHWCHVMERESFS 99

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
              VAK+LN  FV IKVDREERPDVD +YMT +      GGWPL++FL+PD KP+ G TY
Sbjct: 100 RADVAKILNANFVCIKVDREERPDVDDIYMTALNTTGEQGGWPLNMFLTPDGKPIFGATY 159

Query: 223 FPPED-KYGR---PGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
           FPP+D K G    PGFKT+L KV + +DK R  L +      +   EAL A++ +  L  
Sbjct: 160 FPPDDRKIGDDTVPGFKTVLNKVME-FDKDRADLEKQADRVAKATVEALDANSRAIAL-- 216

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGS------APKFPRPVEIQMMLYHSKKLEDTGKSG 332
            +P     +     +   D   GG GS        KFPRP     +L  +KK    G   
Sbjct: 217 -VPLKRDLVSDGLDAFDIDPEHGGTGSKKRDYKGTKFPRPPVWGFVLTQTKK---PGNER 272

Query: 333 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 392
            A    K+   TL  + +GGI+DH+GGGFHRYS +  W VPHFEKMLYD  QL  +Y +A
Sbjct: 273 LA----KLTHNTLAKILEGGIYDHLGGGFHRYSTERTWTVPHFEKMLYDNAQLVELYSEA 328

Query: 393 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 452
           ++L     Y  +  + L+++RR+M  P    +SA DADS +       KEG FYVWT+ E
Sbjct: 329 YALAPRPEYKRVVAETLEFVRREMTAPEKGFYSALDADSND-------KEGEFYVWTADE 381

Query: 453 VEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 512
           V  +LG  A    +   +K           D  +  +    L E+      A +L +  +
Sbjct: 382 VAKVLGTDA----DTAIVKAVYGVTAPNFEDKFHILRLPKPLAEI------AKELKLTED 431

Query: 513 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 572
             L  L   ++KLFD R+KR RP LD KVI +WNG +I+ +ARA  + K  A        
Sbjct: 432 ALLTKLEPLKKKLFDHRAKRERPFLDTKVITAWNGQMIAGYARAGGVFKEPA-------- 483

Query: 573 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP-----GFLDDYAFLIS 627
                   Y+  A  AA F+   L D+   RL   +   P   P      FLDDYA+LI 
Sbjct: 484 --------YVRAAADAADFLLTKLRDKD-GRLYRMYAAAPGGKPAPKGAAFLDDYAYLIH 534

Query: 628 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 687
           GLL+L++     KWL  A  L +   + + D   GG++ T  +   +  R K+ +DG +P
Sbjct: 535 GLLNLHDATGEPKWLDAAKGLTDLAVKHYADPVNGGFYFTAADGEKLFARAKDSYDGVQP 594

Query: 688 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 744
           SGNS    NL+RL +    +K + YR     ++  F   L+    ++PLM    D L
Sbjct: 595 SGNSQMARNLLRLGT---KTKDEGYRDRGIRTVKAFSFALRTAPTSMPLMLRTLDEL 648


>gi|421098293|ref|ZP_15558964.1| PF03190 family protein [Leptospira borgpetersenii str. 200901122]
 gi|410798561|gb|EKS00650.1| PF03190 family protein [Leptospira borgpetersenii str. 200901122]
          Length = 691

 Score =  417 bits (1072), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 263/685 (38%), Positives = 367/685 (53%), Gaps = 65/685 (9%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
            +  NRL+ E SPYL QHA+NPVDWF WGEEA  +A+++D  IFLSIGY+TCHWCHVME 
Sbjct: 5   TRSPNRLSKEKSPYLQQHAYNPVDWFPWGEEALTKAKEQDKLIFLSIGYATCHWCHVMEK 64

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFE++ VA  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++FL+PD KP+ 
Sbjct: 65  ESFENQMVADYLNSHFVSIKVDREERPDIDRIYMDALHAMDQQGGWPLNIFLTPDGKPIT 124

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
           GGTYFPPE  YGR  F  +L  ++  W++KR  L  + +    +LS+ L  S     +  
Sbjct: 125 GGTYFPPEPMYGRKSFLEVLNILRKVWNEKRQELIAASS----ELSQYLKDSGERRTIEK 180

Query: 279 E----LPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML-YHSKKLEDTGKS 331
           +      +N            YD+ FGGF +    KFP  + +  +L YH        +S
Sbjct: 181 QEGGLSSENCFDSGFSLYESYYDAEFGGFKTNHVNKFPPSMGLSFLLRYH--------RS 232

Query: 332 GEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLD 391
                  +MV  TL  M +GGI+D VGGG  RYS D  W VPHFEKMLYD        ++
Sbjct: 233 SGNPRALEMVENTLLAMKQGGIYDQVGGGLCRYSTDFYWMVPHFEKMLYDNSLFLETLVE 292

Query: 392 AFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSK 451
              ++K +       D++ YL RDM    G I SAEDADS   EG    KEG FY+W  +
Sbjct: 293 CSQVSKKISAKSFALDVISYLHRDMRIVDGGICSAEDADS---EG----KEGLFYIWGLE 345

Query: 452 EVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPL 511
           E  ++ GE + + ++ + +   GN            F+GKN+L E     + A+KL    
Sbjct: 346 EFREVCGEDSRILEKFWNVTEKGN------------FEGKNILYE--SYRSEATKLSEEE 391

Query: 512 EKYLN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
            K ++ +L   R KL + R+KR RP  DDK++ SWNGL I +  +A              
Sbjct: 392 WKQIDSVLERGRAKLLERRNKRVRPLRDDKILTSWNGLYIKALTKAG------------- 438

Query: 571 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLL 630
              V   R++++ +AE   SFI R+L D  + R+   FR+G S   G+ +DYA +I+  +
Sbjct: 439 ---VAFQREDFLRLAEETYSFIERNLID-PSGRMLRRFRDGESGILGYSNDYAEMITSSI 494

Query: 631 DLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSG 689
            L+E G G ++L  A+        LF  R   G F   G D  VLLR   D +DG EPS 
Sbjct: 495 ALFEAGRGIRYLKNAVLWMEEAIRLF--RSPAGVFFDAGSDGEVLLRRSVDGYDGVEPSA 552

Query: 690 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR 749
           NS    +LV+L+  + G  S  YR+ AE     F   L   +++ P +  A       S 
Sbjct: 553 NSSLAYSLVKLS--LFGIDSVRYRKFAESIFLYFTKELSTNSLSYPHLLSAYWTYRHHS- 609

Query: 750 KHVVLVGHKSSVDFENMLAAAHASY 774
           K +VL+  K S   +++LA     +
Sbjct: 610 KEIVLI-RKDSDSGKDLLAEIQTKF 633


>gi|291614213|ref|YP_003524370.1| hypothetical protein Slit_1752 [Sideroxydans lithotrophicus ES-1]
 gi|291584325|gb|ADE11983.1| protein of unknown function DUF255 [Sideroxydans lithotrophicus
           ES-1]
          Length = 676

 Score =  417 bits (1072), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 247/664 (37%), Positives = 362/664 (54%), Gaps = 66/664 (9%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TN LA E SPYLLQHA NPVDW  W       AR    PI LSIGYS CHWCHVM  ESF
Sbjct: 2   TNHLAHETSPYLLQHADNPVDWHPWSAATLQLARDLGKPILLSIGYSACHWCHVMAHESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQAL-YGGGGWPLSVFLSPDLKPLMGG 220
           EDE VA ++N+ F++IKVDREERPD+D++Y    Q L    GGWPL++FL+PD  P   G
Sbjct: 62  EDEAVAAVMNELFINIKVDREERPDLDQIYQNAHQLLSRRSGGWPLTMFLAPDGTPFYSG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE- 279
           TYFP + +YG PGF  +++ +  A+ ++R  LA+ G    +Q+  AL+A        D  
Sbjct: 122 TYFPKQARYGLPGFPALIQDIAHAYKEQRGELAEQG----KQIVAALAAWQPEKSATDST 177

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
           L  + +     Q S+++D   GGFG APKF  P E+ ++L  +    D       ++ + 
Sbjct: 178 LDASPIATSIRQHSENFDRVNGGFGGAPKFLHPAELDLLLQQTHATHD-------AQTRH 230

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           +VLFTLQ MA+GG++D +GGGF RYSVD  W +PHFEKMLYD G L  +Y DA+  + D 
Sbjct: 231 IVLFTLQQMAQGGLYDQLGGGFCRYSVDAEWDIPHFEKMLYDNGLLLGLYSDAWLSSSDP 290

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL-- 457
           F++ I      ++ R+M  P G  +++ DADS         +EG FYVW   ++ D+L  
Sbjct: 291 FFARIVEQTAAWVMREMQSPQGGYYASLDADS-------EHEEGKFYVWQRNDIRDLLSA 343

Query: 458 GEHAILFKEHYYLKPTGNCDLS----RMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
            E+A L + HY L  T N +      R+S P  E                A KLG+  E+
Sbjct: 344 AEYA-LIQPHYGLDSTPNFENHAWNLRVSQPLGEI---------------AQKLGLGEEQ 387

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
              +L   + KLF  R +R RP  D+K++ SWNGL+I+  A+A++I              
Sbjct: 388 AAMLLAAAKTKLFAAREQRIRPGRDEKILGSWNGLMIAGMAKAARIFG------------ 435

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
               R++++  A+ A  F+R  L+  Q  RL  + ++G +    +LDD+A+L++  L+L 
Sbjct: 436 ----REDWLHSAQQAMDFVRTTLW--QDGRLLATHKDGKTHLNAYLDDHAYLLNAALELL 489

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           +    +  L +A+++ +     F D   GG+F T+ +  +++ R K   D A PSGN ++
Sbjct: 490 QAEFRSPDLSFAVQIADALLARFEDVRNGGFFFTSHDHEALIQRNKTAQDNATPSGNGIA 549

Query: 694 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA-ADMLSVPSRKHV 752
              L+RLA +    +   Y   AE  L +F   ++  A     +C A  + L  PS   +
Sbjct: 550 TQGLLRLAELTGDIR---YTDAAERCLKLFFPIMQRAAGQFSSLCTALGEALQPPSM--L 604

Query: 753 VLVG 756
           VL G
Sbjct: 605 VLCG 608


>gi|313126304|ref|YP_004036574.1| hypothetical protein Hbor_15590 [Halogeometricum borinquense DSM
           11551]
 gi|448286147|ref|ZP_21477382.1| hypothetical protein C499_05218 [Halogeometricum borinquense DSM
           11551]
 gi|312292669|gb|ADQ67129.1| hypothetical protein containing a thioredoxin domain
           [Halogeometricum borinquense DSM 11551]
 gi|445575198|gb|ELY29677.1| hypothetical protein C499_05218 [Halogeometricum borinquense DSM
           11551]
          Length = 725

 Score =  416 bits (1070), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 239/605 (39%), Positives = 325/605 (53%), Gaps = 52/605 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYL QHA NPV+W  W E A   AR++D PIFLS+GYS CHWCHVM  ESFE
Sbjct: 8   NRLADEQSPYLQQHADNPVNWQPWDETAIEAAREKDRPIFLSVGYSACHWCHVMADESFE 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+ VA +LN+ FV +KVDREERPD+D++Y T  Q + GGGGWPLSV+L+P  KP   GTY
Sbjct: 68  DDDVAAVLNESFVPVKVDREERPDLDRIYQTICQLVTGGGGWPLSVWLTPQGKPFYVGTY 127

Query: 223 FPPEDKYGR---PGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           FP E++  R   PGF  + R   +AW+  R+ +          + + L A+      P E
Sbjct: 128 FPKEERRDRGNVPGFLDLCRSFAEAWENDREEIENRAQQWTAAIQDQLEATPDD---PGE 184

Query: 280 LP-QNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDTGKSGEASEG 337
            P    L   A+   +  D  +GGFGS  PKFP+P  ++ +L           SGE  E 
Sbjct: 185 SPGTEILGEVAKAALRGADREYGGFGSGGPKFPQPGRVEALLRSYV------HSGE-DEP 237

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
             + + TL  MA GG++DHVGGGFHRY+ D +W VPHFEKMLYD  ++  VYL A  LT 
Sbjct: 238 LTVAMETLDAMAGGGMYDHVGGGFHRYATDRQWTVPHFEKMLYDNAEIPRVYLAAHRLTG 297

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
              Y+ + R+  D++ R++  P G  FS  DA S         +EG FYVWT ++V + L
Sbjct: 298 RADYAEVARETFDFVARELRHPDGGFFSTLDAQSG-------GEEGTFYVWTPEQVHEAL 350

Query: 458 GEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
            +   A +F ++Y +   GN +            G  VL       + A + G+  ++  
Sbjct: 351 ADETRAEVFCDYYGVTSGGNFE-----------NGTTVLTVSATVDSVADEHGLTTDEVT 399

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
           + L   R  LFD R  R RP  D+KV+  WNGL+ISS A+ + +L               
Sbjct: 400 DHLDAARETLFDTRESRTRPPRDEKVLAGWNGLMISSLAQGALVLGD------------- 446

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
               EY E+A  A  F R HL+DE   RL   F++G  K  G+L+DYAFL  G  DLY+ 
Sbjct: 447 ----EYAELAADALGFAREHLWDESEGRLSRRFKDGDVKGEGYLEDYAFLARGAFDLYQA 502

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
                 L +A+EL       F D   G  + T  +  +++ R +E  D + PS   V+  
Sbjct: 503 TGDVDHLAFAVELAREIVASFYDDAAGTLYFTPDDGEALVTRPQELQDQSTPSSVGVATS 562

Query: 696 NLVRL 700
            L+ L
Sbjct: 563 LLLDL 567


>gi|336254491|ref|YP_004597598.1| hypothetical protein Halxa_3105 [Halopiger xanaduensis SH-6]
 gi|335338480|gb|AEH37719.1| protein of unknown function DUF255 [Halopiger xanaduensis SH-6]
          Length = 730

 Score =  416 bits (1069), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 248/660 (37%), Positives = 351/660 (53%), Gaps = 57/660 (8%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           S     NRL  E SPYL QHA NPV+W  W E+A   AR+RDVPIFLSIGYS CHWCHVM
Sbjct: 2   SEPTERNRLEDEGSPYLRQHADNPVNWQPWDEQALEAARERDVPIFLSIGYSACHWCHVM 61

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
           E ESF+DEGVA++LN+ FV IKVDREERPD+D +YMT  Q + G GGWPLS +L+P+ KP
Sbjct: 62  EEESFQDEGVAEVLNENFVPIKVDREERPDIDSIYMTVCQLVSGRGGWPLSAWLTPEGKP 121

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWDK--------KRDMLAQSGAFAIEQLSEALS 268
              GTYFP E + G+PGF  +  ++ D+W+         + D   ++    +E   E   
Sbjct: 122 FFIGTYFPREGQRGQPGFLDLCERISDSWNSEDREEMEHRADQWTEAAKDRLEDTPEGAG 181

Query: 269 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLED 327
           A  ++     E+    L   A    +S D  +GGFGS  PKFP+P  +Q +   ++  + 
Sbjct: 182 AGGAAEPPSSEV----LETAASAALRSADREYGGFGSDGPKFPQPARLQAL---ARAYDR 234

Query: 328 TGKSGEASEGQKMVL-FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 386
           TG+     E  + VL  TL  MA GG++DHVG GFHRY VD  W VPHFEKMLYD  ++ 
Sbjct: 235 TGR-----EAYREVLEETLDAMAAGGLYDHVGSGFHRYCVDRDWTVPHFEKMLYDNAEIP 289

Query: 387 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 446
             +L  + LT D  Y+ +  + L ++ R++    G  FS  DA S + E   R +EGAFY
Sbjct: 290 RAFLTGYQLTGDERYAEVVAETLAFVDRELTHEEGGFFSTLDAQSEDPETGER-EEGAFY 348

Query: 447 VWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 504
           VWT  EV + L +   A LF + Y +  +GN            F+G+N    +      A
Sbjct: 349 VWTPDEVREALEDETTADLFCDRYDITESGN------------FEGRNQPNRVRPIDDLA 396

Query: 505 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 564
            +  +   +    L   R +LF  R  RPRP+ D+KV+  WNGL+I++ A A+ +L    
Sbjct: 397 DEYDLEESEVQKRLETAREQLFAAREGRPRPNRDEKVLAGWNGLMIATCAEAALVL---- 452

Query: 565 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 624
                     G D  +Y ++A  A  F+R  L++E   RL   +++G  K  G+L+DYAF
Sbjct: 453 ----------GDD--QYADMAVDALDFVRDRLWNESEQRLNRRYKDGDVKVDGYLEDYAF 500

Query: 625 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 684
           L  G L  YE       L +A+EL    +  F D + G  + T     S++ R +E  D 
Sbjct: 501 LARGALGCYEATGEVDHLRFALELARVVEAEFWDADRGTLYFTPESGESLVTRPQELGDQ 560

Query: 685 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADML 744
           + P+   V+V  L+ L         + +   A   L     +++  ++    +C AAD L
Sbjct: 561 STPAATGVAVEVLLALDEFT----DEDFEGIAATVLETHANKIEANSLEHTTLCLAADRL 616


>gi|295667924|ref|XP_002794511.1| spermatogenesis-associated protein [Paracoccidioides sp. 'lutzii'
           Pb01]
 gi|226285927|gb|EEH41493.1| spermatogenesis-associated protein [Paracoccidioides sp. 'lutzii'
           Pb01]
          Length = 791

 Score =  416 bits (1068), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 241/569 (42%), Positives = 329/569 (57%), Gaps = 33/569 (5%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRL    SPY+L H +NPV W  W  EA A A+K +  IFLSIGYS CHWCHVME ESF
Sbjct: 24  VNRLYQSKSPYVLVHMNNPVAWQLWDSEAIALAKKLNRLIFLSIGYSACHWCHVMEKESF 83

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
               +A +LN  F+ IK+DREERPD+D+VYM YVQA  G GGWPL+VFL+PDL+P+ GG+
Sbjct: 84  MSPEIAAILNKSFIPIKLDREERPDIDEVYMNYVQATTGSGGWPLNVFLTPDLEPVFGGS 143

Query: 222 YFP-PEDKY-------GRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASS 273
           Y+P P           G+  F  IL K++D W  ++    +S     +QL E  +   + 
Sbjct: 144 YWPGPHSNALPTLGGEGQITFVDILEKLRDVWHTQQLRCRESAKDITKQLRE-FAEEGTH 202

Query: 274 NKLPD-----ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK---KL 325
           +K  D     +L    L    +  +  YD+  GGF  APKFP PV +  +++ S+    +
Sbjct: 203 SKQSDVETEEDLEIELLEEAYQHFASRYDAVNGGFSEAPKFPTPVNLSFLVHLSRYPSAV 262

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
            D     E S   ++ + TL  M++GGIHD +G GF RYSV   W +PHFEKMLYDQ QL
Sbjct: 263 ADIVGYEECSRAIEIAVKTLIAMSRGGIHDQIGHGFARYSVTADWSLPHFEKMLYDQAQL 322

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAETEGATRKKEGA 444
            +VY+DAF    D        DI  Y+    M+ P G   S+EDADS  +   T K+EGA
Sbjct: 323 LDVYVDAFDSAYDPELLGAMYDIATYITSPPMLSPTGGFHSSEDADSRPSPNDTEKREGA 382

Query: 445 FYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 503
           FYVWT KE++ ILG+  A +   H+ +   GN  ++R++DPH+EF  +NVL      S  
Sbjct: 383 FYVWTLKELKQILGQRDADVCARHWGVLADGN--VARINDPHDEFINQNVLSIQVTPSKL 440

Query: 504 ASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFARASKILKS 562
           A + G+  ++ + I+   R KL + R SKR RP LDDK+IV+WNGL I + A+ S +L++
Sbjct: 441 AKEFGLGEDEVVRIIKRSREKLREYRESKRVRPDLDDKIIVAWNGLAIGALAKCSVVLEN 500

Query: 563 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP-SKAPGFLDD 621
                 + F             AE A  FI+ +L+DEQT +L   +R G     PGF DD
Sbjct: 501 LDRDKAYQF----------RRAAEEAVRFIKHNLFDEQTGQLWRIYRGGVRGDTPGFADD 550

Query: 622 YAFLISGLLDLYEFGSGTKWLVWAIELQN 650
           YA+LISGL++LYE       L +A +LQ+
Sbjct: 551 YAYLISGLINLYEATFDDSHLQFAEQLQH 579


>gi|448688002|ref|ZP_21693970.1| thioredoxin [Haloarcula japonica DSM 6131]
 gi|445779793|gb|EMA30709.1| thioredoxin [Haloarcula japonica DSM 6131]
          Length = 717

 Score =  416 bits (1068), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 242/665 (36%), Positives = 361/665 (54%), Gaps = 53/665 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL    SPYL QHA NPV+W  W E A   A++RDVPIFLSIGY+ CHWCHVME ESFE
Sbjct: 11  NRLDEAESPYLRQHADNPVNWQPWDETALEAAKERDVPIFLSIGYAACHWCHVMEEESFE 70

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE +A+ LN+ FV IKVDREERPD+D VYM+  Q + GGGGWPLS +L+P+ +P   GTY
Sbjct: 71  DEAIAEQLNEDFVPIKVDREERPDLDSVYMSICQQVTGGGGWPLSAWLTPEGEPFYVGTY 130

Query: 223 FPPEDKYGRPGFKTILRKVKDAWD--KKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           FPPE+K G+PGF  +L+++ D+W   ++R+ +        E +   L A+ +    P++ 
Sbjct: 131 FPPEEKRGQPGFGDLLQRLADSWSDPEQREEMENRARQWTEAIESDLEATPAD---PEDP 187

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
            ++ ++       +  D + GG+GS  PKFP+   +  +L   +   D G+     +   
Sbjct: 188 AEDIIQTAGTIAHRGADRQDGGWGSGGPKFPQNGRLHALL---RAHADGGQ----EDYLN 240

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           +V  TL  MA  G++DHVGGGFHRY+ D++W VPHFEKMLYD  ++   +L  +      
Sbjct: 241 VVEETLDVMADRGLYDHVGGGFHRYATDQQWAVPHFEKMLYDNAEIPRAFLAGYQAIGSE 300

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSA---ETEGATRKKEGAFYVWTSKEVEDI 456
            Y+ + R+  ++++R++  P G  FS  DA+SA   E EG T  +EG FYVWT ++V D 
Sbjct: 301 RYASVVRETFEFVQRELQHPDGGFFSTLDAESAPIDEPEGET--EEGLFYVWTPEQVRDA 358

Query: 457 LGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 514
           + +   A +F +++ +   GN            F+G  VL      S  A +     +K 
Sbjct: 359 VDDETDAEIFCDYFGVTARGN------------FEGATVLAVRKPVSVLAEEYDQSEDKI 406

Query: 515 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 574
              L     + F+ R++RPRP  D+KV+  WNGL+I + A  + +L              
Sbjct: 407 TASLQRALNQTFEARTERPRPARDEKVLAGWNGLMIRTLAEGAIVLDD------------ 454

Query: 575 GSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE 634
                +Y +VA  A SF+R HL++E  +RL   +++G     G+L+DYAFL  G L L+E
Sbjct: 455 -----QYADVAADALSFVREHLWNEDENRLNRRYKDGDVAIDGYLEDYAFLGRGALTLFE 509

Query: 635 FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
                + L +A++L     E F D E G  F T     S++ R +E  D + PS   V+V
Sbjct: 510 ATGDVEHLAFAMDLGQAITEAFWDDEQGTLFFTPTGGESLVARPQELTDQSTPSSTGVAV 569

Query: 695 INLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVL 754
             L+ L+     S  D + + AE  +     R+    +    +  A D     + + + L
Sbjct: 570 DLLLSLSHF---SDDDRFEEVAERVIRTHADRVSSNPLQHASLTLATDTYEQGALE-LTL 625

Query: 755 VGHKS 759
           VG +S
Sbjct: 626 VGDRS 630


>gi|345864005|ref|ZP_08816211.1| uncharacterized protein YyaL [endosymbiont of Tevnia jerichonana
           (vent Tica)]
 gi|345124912|gb|EGW54786.1| uncharacterized protein YyaL [endosymbiont of Tevnia jerichonana
           (vent Tica)]
          Length = 799

 Score =  416 bits (1068), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 253/692 (36%), Positives = 360/692 (52%), Gaps = 60/692 (8%)

Query: 64  YPFRRPLAVISH-RPIHPYKVVAMAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVD 122
           Y   RP+ +       +  K V    RT    +    ++ NRL  E SPYLLQHAHNPVD
Sbjct: 27  YQVTRPMQIQQQLEAAYLAKGVGYRPRTEHLEADGSPRYLNRLILEDSPYLLQHAHNPVD 86

Query: 123 WFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDRE 182
           W+ WGE AFA+A++ + PIFLSIGYSTCHWCHVME ESFE+E +A+ LN+ F++IKVDRE
Sbjct: 87  WYPWGEAAFAKAKRENKPIFLSIGYSTCHWCHVMERESFENESIARFLNEHFIAIKVDRE 146

Query: 183 ERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVK 242
             PD+D+ YMT V  + G GGWP+S  L+P+ KP  GGTYFPP+       F ++L++++
Sbjct: 147 SHPDIDETYMTAVMLMTGSGGWPMSSLLTPEGKPFFGGTYFPPQQ------FASVLQQIQ 200

Query: 243 DAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGG 302
             W+++ +   Q      E++++A+ A+ S       L   A      Q+ +S+D   GG
Sbjct: 201 TIWEERPEDTRQQA----ERVAKAVEAANSQRGKAKALDSQAADKAVAQMLRSFDELQGG 256

Query: 303 FGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFH 362
           F  APKFP    + ++L       D  +     E  + +  TL  MA+GGI+D  GGGFH
Sbjct: 257 FSQAPKFPHEPWLFLLL-------DQLQRQPHPEALQALEVTLDAMARGGIYDQAGGGFH 309

Query: 363 RYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGE 422
           RYS D  W VPHFEKMLY+Q QLA +YL A+ LT    Y  +    LDY+ R+M  P G 
Sbjct: 310 RYSTDNEWLVPHFEKMLYNQAQLARIYLLAWRLTGKEQYRRVVTQTLDYVLREMTAPSGG 369

Query: 423 IFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRM 481
            +SA DADSA        +EG F+ W   E+ D L    A L  E Y +   GN      
Sbjct: 370 FYSATDADSA-------GEEGLFFTWIPAEIRDALEPRDAGLAIELYAISERGN------ 416

Query: 482 SDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKV 541
                 F+G+N+L         A    M LE     +    + L  +R +R  P  DDK+
Sbjct: 417 ------FEGRNILHLPQSLEEYAETKSMNLEALHQRIDHINQVLRQIREQREHPLRDDKI 470

Query: 542 IVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT 601
           + +WNG++I++FA+A+ +L S++                Y + AE AA F+ +H   +  
Sbjct: 471 VTAWNGMMITAFAQAADLLDSDS----------------YRQAAERAAEFLWQH-NRKGA 513

Query: 602 HRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG 661
            +L     +G S      +DYA+L  GL  LY+     KWL  + EL +     F +++G
Sbjct: 514 GQLWRVHLDGKSSISANQEDYAYLGEGLSYLYDLTGDPKWLSRSRELADAMLARFQEKDG 573

Query: 662 GGYFNTTGEDPSVLLRVKED--HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 719
           G Y +  GED    +    D   D A  SG+SV++  L RL  + +G     Y+  AE  
Sbjct: 574 GFYMSEAGEDHFNAMGRPRDGGSDNAIASGSSVALHLLQRLW-LRSGHLD--YKTAAESL 630

Query: 720 LAVFETRLKDMAMAVPLMCCAADMLSVPSRKH 751
           +A F   ++        M  A D L+   R H
Sbjct: 631 IAYFAANIERQPNGYTYMLSAVDNLNQGERTH 662


>gi|372487318|ref|YP_005026883.1| thioredoxin domain-containing protein [Dechlorosoma suillum PS]
 gi|359353871|gb|AEV25042.1| thioredoxin domain-containing protein [Dechlorosoma suillum PS]
          Length = 682

 Score =  415 bits (1067), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 257/652 (39%), Positives = 353/652 (54%), Gaps = 58/652 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLAAE SPYLLQHA NPVDW+ WGEEA A AR  + PI LSIGYS CHWCHVM  E F 
Sbjct: 3   NRLAAETSPYLLQHADNPVDWYPWGEEALARARAENRPILLSIGYSACHWCHVMAHECFA 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPLSVFLSPDLKPLMGGT 221
           D  VA  +N  F++IKVDREERPD+D+VY T  Q L G  GGWPL++FL+PD  P  GGT
Sbjct: 63  DATVAAEMNRLFINIKVDREERPDLDQVYQTAHQMLVGRPGGWPLTMFLTPDAMPFFGGT 122

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP E ++G P F  +L  V  A+ +K+  +A+ G    E     L  +     L +  P
Sbjct: 123 YFPREPRHGLPAFVEVLHSVARAFTEKQSEIAEQGRTMREAFGSTLPRAVRGEPLFNADP 182

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
              L     +L  +YD R GGFG APKFPRP  +  +L       D    G       M 
Sbjct: 183 ---LAQAVAELDTNYDRRRGGFGGAPKFPRPAALDFLLRRHAATGDPHARG-------MA 232

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
           L TL+ MA+GGIHDH+GGGF+RYSVD +W +PHFEKMLYD  QL ++Y +A++L++   +
Sbjct: 233 LTTLERMAEGGIHDHLGGGFYRYSVDAQWSIPHFEKMLYDNAQLLHLYAEAWALSRKQVF 292

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHA 461
                 I+ +L+ +M  PGG   +A DADS   EG    +EG FY+WT++EV      HA
Sbjct: 293 RQAAEGIVAWLQHEMALPGGAFAAALDADS---EG----EEGRFYLWTAREV------HA 339

Query: 462 ILFKEHYYLKPTGNCDLSR----MSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
           +L        P    D++     +  P N    +  L ++      A +L +   +    
Sbjct: 340 LL--------PPQQWDVASIHWGLDGPPNFEDAEWHLRQVQPLEQVAERLRLTPGEARQQ 391

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
           L   R  L   R++R RP  DDKV+   N L I   ARA++                   
Sbjct: 392 LEGARHTLLAARNERIRPGRDDKVLTGCNALAIKGLARAARAF----------------G 435

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGS 637
           R E++ +A  AA F++R L+  +  RL  ++++G ++ P +LDD+AFL+  +L+L + G 
Sbjct: 436 RPEWLGLACGAADFLQRELW--RDGRLLAAWKDGRARLPAYLDDHAFLLEAMLELLQAGW 493

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
                  A+ L +   + F DRE GG+F T  +  +++ R K   D A PSGN V+   L
Sbjct: 494 RDADYRCAVALADALLQHFEDREEGGFFFTAHDHETLIYRTKPVEDHATPSGNGVAAFAL 553

Query: 698 VRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP-LMCCAADMLSVPS 748
            RLA +   S    Y   A  +LA+F   L+    A P L+    D LS P+
Sbjct: 554 GRLALL---SGEPRYAAAARRALALFLPDLRQHPGAHPGLLNVLGDELSPPA 602


>gi|448414488|ref|ZP_21577557.1| hypothetical protein C474_02196 [Halosarcina pallida JCM 14848]
 gi|445682054|gb|ELZ34478.1| hypothetical protein C474_02196 [Halosarcina pallida JCM 14848]
          Length = 725

 Score =  415 bits (1066), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 239/612 (39%), Positives = 329/612 (53%), Gaps = 58/612 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV W  W E A   AR+ D PIFLS+GYS CHWCHVM  ESFE
Sbjct: 8   NRLGEEQSPYLRQHADNPVHWQPWDEAALETAREEDKPIFLSVGYSACHWCHVMAEESFE 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA++LN+ FV +KVDREERPD+D++Y T  Q + GGGGWPLSV+L+P+ KP   GTY
Sbjct: 68  DEAVARVLNESFVPVKVDREERPDLDRIYQTICQLVSGGGGWPLSVWLTPEGKPFYVGTY 127

Query: 223 FPPEDKYGR---PGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           FP E++  R   PGF  +     +AW+  R+ +        EQ ++AL       + PDE
Sbjct: 128 FPKEERRDRGNVPGFLDLCESFANAWETDREEIENRA----EQWTDALKDQL--EETPDE 181

Query: 280 LPQNALRLCAEQLSKS----YDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDTGKSGEA 334
           + +        +++K+     D  +GGFGS  PKFP+P  I+ +L           SGE 
Sbjct: 182 VGEAPGTEVLGEVTKAALRGADREYGGFGSGGPKFPQPGRIEALLRSYV------HSGE- 234

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
            E   + +  L  MA GG++DHVGGGFHRY+ D +W VPHFEKMLYD  ++  VYL A  
Sbjct: 235 EEPLDVAMEALDAMAGGGMYDHVGGGFHRYATDRQWTVPHFEKMLYDNAEIPRVYLAAHR 294

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
           LT    Y+ + R+  D++ R++  P G  +S  DA S         +EG FYVWT +EV 
Sbjct: 295 LTGREAYADVARETFDFVARELRHPDGGFYSTLDAQS-------DGEEGTFYVWTPEEVR 347

Query: 455 DILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 512
           + L +   A +F ++Y +   GN +            G  VL         A + G+  E
Sbjct: 348 ETLDDETRADVFCDYYGVTADGNFE-----------NGTTVLTVSAPIDEVAEERGLTTE 396

Query: 513 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 572
           + ++ L   R  LF+ R  R RP  D+KV+  WNGL++SS A+ S +L            
Sbjct: 397 EAVDHLDAARETLFEARESRTRPPRDEKVLAGWNGLMVSSLAQGSLVLGD---------- 446

Query: 573 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 632
                  EY E+A  A  F+R HL+D    RL   F++G  K  G+L+DYAFL  G  DL
Sbjct: 447 -------EYAELAADALGFVREHLWDSDEKRLSRRFKDGDVKGDGYLEDYAFLARGAFDL 499

Query: 633 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 692
           Y+       L +A++L     E F D   G  + T  +  +++ R +E  D + PS   V
Sbjct: 500 YQATGDVDHLAFAVDLSRALVESFYDESAGTLYFTPADGETLVTRPQELQDQSTPSSVGV 559

Query: 693 SVINLVRLASIV 704
           +   L+ L S  
Sbjct: 560 AASLLLDLDSFA 571


>gi|336477876|ref|YP_004617017.1| hypothetical protein [Methanosalsum zhilinae DSM 4017]
 gi|335931257|gb|AEH61798.1| protein of unknown function DUF255 [Methanosalsum zhilinae DSM
           4017]
          Length = 704

 Score =  414 bits (1065), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 237/613 (38%), Positives = 336/613 (54%), Gaps = 41/613 (6%)

Query: 95  SHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCH 154
           S S NK  NRL  E+SPYLLQHA+NPVDW+ WG+EAF  AR++++P+FLSIGYSTCHWCH
Sbjct: 3   SGSSNK-PNRLIHENSPYLLQHAYNPVDWYPWGKEAFQTARQKNIPVFLSIGYSTCHWCH 61

Query: 155 VMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDL 214
           VME ESFED  +A ++N  F+ IKVDREERPD+D +YM   Q +    GWP++V ++P  
Sbjct: 62  VMEEESFEDPKIADMMNRTFICIKVDREERPDIDSMYMKICQQMTERCGWPMTVIMTPGK 121

Query: 215 KPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSN 274
            P    TY P +      G   ++ ++ + W  ++D +        ++L+   +A   + 
Sbjct: 122 VPFFISTYVPKKSGLAGIGMADLIPQIAEIWKTRQDEIVNKTEEIKQRLNRITAAPEGAE 181

Query: 275 KLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEA 334
            +    P++ ++     L+  YD  +GGFG APKFP P  I  +L H     +T      
Sbjct: 182 YIS---PKDVIQKGYHLLAHYYDQNYGGFGRAPKFPAPHNIMFLLRHWNYTGNT------ 232

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
            +  KM   TL  M  GGI DHVG GFHRYS DE+W +PHFEKML DQ  LA  Y +A+ 
Sbjct: 233 -DALKMAETTLTSMQLGGIFDHVGYGFHRYSTDEKWKLPHFEKMLNDQALLALAYTEAYQ 291

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
            T    Y    R IL Y+ RDM    G  +SAEDADS   EG     EG FY+WT  E+ 
Sbjct: 292 ATGKKVYENTARKILRYVLRDMRSEKGGFYSAEDADS---EGV----EGKFYLWTEDEIR 344

Query: 455 DILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
            IL  E A L    + +K  GN       +   +  G N+L    ++S          E+
Sbjct: 345 YILTPEEADLVCRVFNVKREGNF----AEESTGKLTGNNILYMKGETSEIVEPTEKENEE 400

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
              +L +   KL++VRS R  P  DDK++  WNGL+I++ A+A         S  F  P 
Sbjct: 401 IQKLLNQALDKLYEVRSARVHPLKDDKILTDWNGLMIAALAKA---------SGAFQEP- 450

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
                 EY+E A++   FI  ++YD  + +L H +    +   GF+DDYA  + GL++LY
Sbjct: 451 ------EYVEYAKTCTKFILDNMYD-GSGKLLHRYHRENAGIDGFVDDYAAFVWGLIELY 503

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGG-YFNTTGEDPSVLLRVKEDHDGAEPSGNSV 692
           E     K+L  A+E+ +     F D +G G YF +      +++R  E  D + PSGNS+
Sbjct: 504 EATFEEKYLQKALEINDYFISHFQDEKGRGFYFTSNDRSGDLIVRSMEICDTSMPSGNSM 563

Query: 693 SVINLVRLASIVA 705
           +V+N++RLA +  
Sbjct: 564 AVLNILRLAKMTG 576


>gi|441496345|ref|ZP_20978578.1| Thymidylate kinase [Fulvivirga imtechensis AK7]
 gi|441439862|gb|ELR73159.1| Thymidylate kinase [Fulvivirga imtechensis AK7]
          Length = 680

 Score =  414 bits (1065), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 239/614 (38%), Positives = 334/614 (54%), Gaps = 57/614 (9%)

Query: 94  TSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWC 153
           T+  +    NRL    SPYLLQHA+NPV+W+ WGEEA  +A+K D PI +SIGYS+CHWC
Sbjct: 4   TTEPKKGEANRLINATSPYLLQHAYNPVNWYPWGEEALEKAKKEDKPILVSIGYSSCHWC 63

Query: 154 HVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPD 213
           HVME ESFE++ +A ++N+ F+SIK+DREERPDVD++YM  VQA+   GGWPL+VFL+ D
Sbjct: 64  HVMERESFENDSIAAIMNEHFISIKIDREERPDVDQIYMDAVQAMGQSGGWPLNVFLTSD 123

Query: 214 LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASS 273
            KP  GGTYFPPE       +  +L++V   +++KR  + +S     +QL+ A++ S   
Sbjct: 124 QKPFYGGTYFPPE------SWAQLLKQVARVYNEKRSEVEESA----DQLTNAIATSEVI 173

Query: 274 N-KLPD---ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 329
             +L D   E     L    E+LS  +D   GGF  APKFP P     +L +     D  
Sbjct: 174 KFRLKDNGTEYTTTTLEKMYEKLSMKFDGNKGGFKGAPKFPMPGNWLFLLRYYNATND-- 231

Query: 330 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 389
                 E  + +  TL  +A+GGI+D +GGGF RYSVD  W VPHFEKMLYD GQL ++Y
Sbjct: 232 -----QEALRQLEVTLSEIARGGIYDQIGGGFARYSVDADWLVPHFEKMLYDNGQLVSLY 286

Query: 390 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 449
            +A++ TK   Y  +    +D+L R+M    G  +SA DADS   EG    +EG FYVWT
Sbjct: 287 AEAYTATKLELYKEVVYQTIDWLEREMTSKEGGFYSALDADS---EG----EEGKFYVWT 339

Query: 450 SKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 509
             EVE +LG  A L   +Y ++  GN +           +GKN+L         A +  +
Sbjct: 340 KDEVEHVLGAEANLIMSYYNIEKEGNWE-----------EGKNILHMHVSDEEFAKRHDL 388

Query: 510 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 569
            + +    + +    L + RSKR RP LDDKV+  WNGL+      A             
Sbjct: 389 GVAELKEKVWKADELLLEERSKRVRPGLDDKVLAGWNGLMQKGLVDA------------- 435

Query: 570 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 629
               V     +++++A   A F+ +H+  +   RL  SF++G +   G+L+DYAF+I   
Sbjct: 436 ---YVAFGEPKFLDLALRNAHFLDQHMIHD--FRLNRSFKSGKASIDGYLEDYAFVIDAY 490

Query: 630 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 689
             LYE     +WL  A  L +   E F D     +F T      ++ R KE  D   P+ 
Sbjct: 491 TALYEATFDEQWLKKAKGLMDYTIEHFYDNSEKLFFFTDDRSEKLIARKKEVFDNVIPAS 550

Query: 690 NSVSVINLVRLASI 703
           NS   +NL RL  I
Sbjct: 551 NSQMALNLYRLGKI 564


>gi|126180264|ref|YP_001048229.1| hypothetical protein Memar_2324 [Methanoculleus marisnigri JR1]
 gi|125863058|gb|ABN58247.1| protein of unknown function DUF255 [Methanoculleus marisnigri JR1]
          Length = 721

 Score =  414 bits (1065), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 255/673 (37%), Positives = 357/673 (53%), Gaps = 45/673 (6%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYLLQHA NPVDW+ WGEEAF+ AR+   PIFLSIGYS CHWCHVME ESF 
Sbjct: 23  NRLINEQSPYLLQHARNPVDWYPWGEEAFSRAREEGKPIFLSIGYSACHWCHVMEEESFA 82

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+ VAKLLND FV IKVDREERPD+D+VYM    AL G GGWPL++ ++ D KP    +Y
Sbjct: 83  DQQVAKLLNDVFVCIKVDREERPDIDQVYMAAAHALTGAGGWPLTILMTADKKPFFAASY 142

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
            P E +YG  G   ++ ++   W  +R  L  +G    +Q+ +AL ++A +     EL +
Sbjct: 143 IPKESRYGMTGLLDLIPRISKVWQTQRQGLENAG----DQVLQALQSAARTPPEEGELAE 198

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L        + +D   GGFG AP+FP P  +  +L +  +   TGK         MV 
Sbjct: 199 AVLDEAYNMFFRVFDGENGGFGDAPRFPTPHNLIFLLRYGNR---TGK----EPAYTMVE 251

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  M +GGI D VG GFHRYS D  W VPHFEKMLYDQ  L   Y +A+  T    ++
Sbjct: 252 KTLHAMRRGGIFDQVGYGFHRYSTDAEWFVPHFEKMLYDQALLVMAYTEAYLATGREEFA 311

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-A 461
              R+ + Y+ R+M  P G  +SAEDADS   EG    +EG FY+WT  E+  +LGE   
Sbjct: 312 RTARETIAYVLREMTDPDGGFYSAEDADS---EG----EEGKFYLWTKDEILGVLGEEDG 364

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
             F   + +   GN        P  +  G+N+L      ++ A +   P +     + E 
Sbjct: 365 ERFSRIFNVTEPGNY----REQPGGKRTGRNILRLRRPLASWAHEFETPEDDLAWSVEEG 420

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           R+KL   R +R RP  DDK++  WN L+I++ A+A++                  D  +Y
Sbjct: 421 RQKLLAARKQRVRPGRDDKILTDWNALMIAALAKAARAF----------------DEPDY 464

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
           +  AE AA+F+  +L  E   RL H +R G +     LDDYAF+I  L+++YE      +
Sbjct: 465 LAAAERAAAFVLANLRREDG-RLLHRYRGGEAGLAATLDDYAFMIWALIEVYEASFAPGY 523

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           L  A++L       + D   GG+F    +D  V +R K  +DGA PSGNSV++  L  L 
Sbjct: 524 LKTAVDLSRDLIARYWDCNEGGFFFVP-DDGDVPVRQKPVYDGAIPSGNSVAMYALFVLG 582

Query: 702 SIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSV 761
            + A  + +   + AE    VF   + +   A        + +  P+ + V++ G   + 
Sbjct: 583 RMTANLELE---ETAERIRRVFAGTVSESPTACSHFLTGLEFMLGPNFE-VIISGVPDAE 638

Query: 762 DFENMLAAAHASY 774
           D   M+ A  + Y
Sbjct: 639 DTRAMIGAIRSHY 651


>gi|322371783|ref|ZP_08046326.1| hypothetical protein ZOD2009_19818 [Haladaptatus paucihalophilus
           DX253]
 gi|320548668|gb|EFW90339.1| hypothetical protein ZOD2009_19818 [Haladaptatus paucihalophilus
           DX253]
          Length = 713

 Score =  414 bits (1064), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 236/614 (38%), Positives = 329/614 (53%), Gaps = 54/614 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV W  W + A   A++R+VPIFLSIGYS CHWCHVME ESFE
Sbjct: 8   NRLDEEESPYLRQHADNPVHWQPWDDAALEAAKERNVPIFLSIGYSACHWCHVMEEESFE 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA+LLN+ FV IKVDREERPD+D +YM+  Q + GGGGWPLS +L+PD KP   GTY
Sbjct: 68  DEDVAELLNEHFVPIKVDREERPDIDAIYMSICQQVTGGGGWPLSAWLTPDGKPFYVGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP- 281
           FP   + GRPGF  +L  VK+ W +  + +   G    EQ ++A+     S    D+ P 
Sbjct: 128 FPKRSQQGRPGFIDLLENVKNTWQENPEEMKNRG----EQWTDAIEGELESTPEADDAPG 183

Query: 282 QNALRLCAEQLSKSYDSRFGGFG-SAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
              L   AEQ  ++ D  +GGFG   PKFP+P  + ++L   +  + TG    A++ + +
Sbjct: 184 PELLGSAAEQTVRTADREYGGFGRGGPKFPQPARLHLLL---RAYDRTG----ATQYRDV 236

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
            +  L  MA GG++DH+GGGFHRY+ D +W VPHFEKMLYD  +L   YL  + LT D  
Sbjct: 237 AVEALDAMADGGMYDHIGGGFHRYATDRKWTVPHFEKMLYDNAELPRAYLAGYQLTGDER 296

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV------- 453
           Y+ + R+    L R+M  P G  +S  DA S +  G    +EG FYVWT  +V       
Sbjct: 297 YAELVRETFASLEREMRHPEGGFYSTLDARSEDEAG--NYEEGPFYVWTPSDVYEAVEDE 354

Query: 454 --EDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 510
             +DI  E  A +  E Y +  +GN            F+GK VL    D    A K  + 
Sbjct: 355 RDDDIDTETRADIVCERYGVTQSGN------------FEGKTVLTLTTDVPDLAEKYDVS 402

Query: 511 LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
            ++  ++L + R  +F+ R +R RP  D+K++  WNGL+I++ A    +L          
Sbjct: 403 EDEVRDVLADARHSMFEAREERERPPRDEKILAGWNGLLIAALAEGGFVLD--------- 453

Query: 571 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLL 630
                   + Y ++A  A  F+R  L+DE   +L   F++      G+L+DYAFL  G  
Sbjct: 454 --------EHYTDLAADALDFVREKLWDEADAKLSRRFKDEDVAIDGYLEDYAFLARGAF 505

Query: 631 DLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGN 690
            LYE       L +A++L    +  F D E    + T      ++ R +E  D + PS  
Sbjct: 506 ALYESTGNPDHLEFALDLARAIEREFWDAERETLYFTPESGERLVARPQELADQSTPSSL 565

Query: 691 SVSVINLVRLASIV 704
            V+   L  L+   
Sbjct: 566 GVATDVLAVLSEFA 579


>gi|374852688|dbj|BAL55616.1| hypothetical conserved protein [uncultured gamma proteobacterium]
          Length = 723

 Score =  414 bits (1064), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 248/598 (41%), Positives = 341/598 (57%), Gaps = 60/598 (10%)

Query: 95  SHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCH 154
           + +  K TNRL  E+SPYLLQHAHNPVDW+ WGEEAFA+AR+   PIFLS GYS+CHWCH
Sbjct: 2   ARAEKKFTNRLILENSPYLLQHAHNPVDWYPWGEEAFAKARREAKPIFLSSGYSSCHWCH 61

Query: 155 VMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDL 214
           VME ESFEDE +A +LN  FV +K+DRE+RPDVD VYM  VQ L G GGWPLS FL+PD 
Sbjct: 62  VMERESFEDEEIAAILNRDFVPVKLDREQRPDVDAVYMHAVQLLTGHGGWPLSAFLTPDG 121

Query: 215 KPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKR-DMLAQSGAFAIEQLSEALSASASS 273
           +P  GGTYFPP+       FK +L++V +AW  +R ++ AQ+     E+L +AL    S+
Sbjct: 122 RPFFGGTYFPPQ------AFKRLLQQVAEAWRSRRAEIEAQA-----ERLKQALLELEST 170

Query: 274 NKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGE 333
           +  P E+    +     ++   +D R GGFG+APKFP    + +++       D    G+
Sbjct: 171 H--PGEIGPETVEAAIAEILAPFDPRHGGFGAAPKFPNEPWLALLI-------DELWRGD 221

Query: 334 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
             +  ++V  TL  MA+GG+ D +G GFHRY VD  + +PHFEKMLY+Q QL  +Y  A 
Sbjct: 222 DPKVLEVVRKTLDAMARGGLCDQIGDGFHRYCVDAAFQIPHFEKMLYNQAQLGRLYARAA 281

Query: 394 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
           +LTKD  ++Y  R   D++ R++  P G  ++A DADS   EG    +EG FY+WT +E+
Sbjct: 282 ALTKDALFAYAARCTFDFVLRELTAPEGGFYAAIDADS---EG----EEGKFYLWTPEEI 334

Query: 454 EDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 512
              L  + A L  E + +  +GN            F+GKNVL      +  A   GM  E
Sbjct: 335 RAALPKDDAELAIELFGVSASGN------------FEGKNVLHLPRPLAEIAQAKGMTEE 382

Query: 513 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 572
           + L  L   R++L+ VR +R  P  DDK++ +WNG++I++ A A++         +F+ P
Sbjct: 383 ELLACLDRIRQRLYQVRRRRVPPLRDDKIVTAWNGMMIAALAEAAR---------LFHEP 433

Query: 573 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 632
                  +Y+  A  AA F+ RH    Q  RL  + RNG     G  +DYAFL  G L L
Sbjct: 434 -------KYLLAARRAAEFLSRHHL--QGERLLRASRNGRPAGEGLQEDYAFLAEGFLAL 484

Query: 633 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGN 690
           Y+  +   WL  A  L       F D   G  F     D  + +R K+  DGA PSGN
Sbjct: 485 YDVSADPVWLQEAEALTAAMLAQFWDEARGACFMNRA-DERLAVRPKDLFDGAYPSGN 541


>gi|399574327|ref|ZP_10768086.1| hypothetical protein HSB1_01250 [Halogranum salarium B-1]
 gi|399240159|gb|EJN61084.1| hypothetical protein HSB1_01250 [Halogranum salarium B-1]
          Length = 723

 Score =  414 bits (1063), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 246/617 (39%), Positives = 333/617 (53%), Gaps = 56/617 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W ++A AEA++RDVPIFLSIGYS CHWCHVM  ESFE
Sbjct: 8   NRLGDEQSPYLRQHADNPVNWQPWDDQALAEAKERDVPIFLSIGYSACHWCHVMADESFE 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA +LND FV IKVDREERPD+D+VY T  Q + G GGWPLSV+L+P+ KP   GTY
Sbjct: 68  DEAVADVLNDEFVPIKVDREERPDLDRVYQTICQLVSGRGGWPLSVWLTPEGKPFYVGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLP-DELP 281
           FPP+ + G PGF  +LR + ++WD + D          +Q + AL    +    P DE P
Sbjct: 128 FPPQARQGAPGFLDLLRNISNSWDSEEDRAEMEN--RADQWTTALDDQLADTPDPADETP 185

Query: 282 Q-NALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
             + L   A+   +  D   GGFGS   PKFP P  I ++L   +  + +G+     E  
Sbjct: 186 DVDVLGTAAQAALRGADREHGGFGSGEGPKFPHPGRIDLLL---RTYDRSGR----GETL 238

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
            +   TL  MA GG++D VGGGFHRY+VD  W VPHFEKMLYD  +L   YL  + +T +
Sbjct: 239 NVATETLDAMANGGLYDQVGGGFHRYTVDRSWTVPHFEKMLYDNAELPKSYLAGYQVTGE 298

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDA-------DSAETEGA-------TRKKEGA 444
             Y+ I ++   ++ R++  P G  FS  DA       +SAE+            ++EGA
Sbjct: 299 PRYARIAQETFAFVERELTHPDGGFFSTLDAQSEGFDDESAESADGDDSEGGEAEREEGA 358

Query: 445 FYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 503
           FYVWT ++V ++L E  A LF + Y +   GN +            G +VL         
Sbjct: 359 FYVWTPEQVHEVLDEEDAELFCDRYGITKRGNFE-----------HGTSVLNISTPVEEL 407

Query: 504 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 563
           A +  +        L   R  LF+ R +RPRP  D+KV+  WNGL+ISSFA  +++L   
Sbjct: 408 AEEYDIDRADVSERLTNARVALFEAREERPRPPRDEKVLAGWNGLMISSFAMGARVLDPA 467

Query: 564 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 623
              A                 AE A SF+R HL+D+   RL   F++   K  G+L+DYA
Sbjct: 468 LAGA-----------------AERALSFVREHLWDDDAKRLSRRFKDQDVKGDGYLEDYA 510

Query: 624 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 683
           FL  G  +LY+       L +A++L    +  F D E G  + T      ++ R +E  D
Sbjct: 511 FLARGAFELYQATGDVDHLAFALDLARVIEAEFWDDEKGTLYFTPASGEQLVTRPQELTD 570

Query: 684 GAEPSGNSVSVINLVRL 700
            + PS   V+   LV L
Sbjct: 571 SSTPSSLGVATDLLVDL 587


>gi|392966241|ref|ZP_10331660.1| protein of unknown function DUF255 [Fibrisoma limi BUZ 3]
 gi|387845305|emb|CCH53706.1| protein of unknown function DUF255 [Fibrisoma limi BUZ 3]
          Length = 677

 Score =  413 bits (1062), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 243/606 (40%), Positives = 332/606 (54%), Gaps = 50/606 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYLLQHAHNPVDW+ WGEEA  +A++ D PI +SIGYS CHWCHVME ESFE
Sbjct: 3   NRLANETSPYLLQHAHNPVDWYPWGEEALTKAQQEDKPIIVSIGYSACHWCHVMERESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
            E VA+++N+ FV IKVDREERPDVD +YM  VQA+   GGWPL+VFL PD KP  G TY
Sbjct: 63  KEPVARVMNENFVCIKVDREERPDVDAIYMEAVQAMGVQGGWPLNVFLMPDAKPFYGVTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSG-AFAIEQLSEALSASASSNKLPDE-- 279
            PP++      +  +L  ++DA+D+ R  LAQS   FA E     LS S      P +  
Sbjct: 123 LPPQN------WVNLLGNIRDAFDEHRADLAQSAEGFATEL---NLSDSERFGLQPADPL 173

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHSKKLEDTGKSGEASEGQ 338
                L +   ++    D   GG   APKFP P   Q +L Y+   +  T ++  A    
Sbjct: 174 FSAETLDVLYRKVHVKADDEKGGMRRAPKFPMPSIWQFLLRYYDSTVASTTENETA---L 230

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
           ++V  TL  MA GGI+D +GGGF RYS D  W  PHFEKMLYD GQL  +Y +A+SLTK 
Sbjct: 231 RLVTLTLDRMALGGIYDQLGGGFARYSTDADWFAPHFEKMLYDNGQLLTLYSEAYSLTKS 290

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             Y ++    + + +R+++ P G  +SA DADS   EG     EG FY +T+ E+ D LG
Sbjct: 291 PLYKHVVYQTIAFAQRELLSPEGGFYSALDADS---EGV----EGKFYTFTTSELRDALG 343

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
           +    F E Y L   GN +            G+N+L       + A ++G         L
Sbjct: 344 DEFDWFAELYNLSEDGNWE-----------HGRNILHRTESDESFAERMGWSAADLSVRL 392

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
                +L  +R++R RP LDDK++ SWNGL++   A A ++         F  P      
Sbjct: 393 DATHLRLLKIRNERIRPGLDDKILCSWNGLMLKGLATAYRV---------FGEP------ 437

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
            E++ +A   A F+ + + D +  RL H+++ G ++ PGFL+DYA +I GLL LY+    
Sbjct: 438 -EFLTLALRNAYFLLQKMRDNRNGRLWHTYKEGRARQPGFLEDYATVIDGLLALYQATFT 496

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
             WL  A  L     + F D     +F T      ++ R KE  D   PS NS+   NL 
Sbjct: 497 ESWLTEADRLTQYVFDSFSDPNDDLFFFTDKNGEELIARRKELFDNVIPSSNSIMAGNLY 556

Query: 699 RLASIV 704
            ++ ++
Sbjct: 557 AMSLLL 562


>gi|118575698|ref|YP_875441.1| thioredoxin [Cenarchaeum symbiosum A]
 gi|118194219|gb|ABK77137.1| thioredoxin [Cenarchaeum symbiosum A]
          Length = 676

 Score =  413 bits (1062), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 238/601 (39%), Positives = 330/601 (54%), Gaps = 56/601 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L  E SPYLLQHA NPV+W+AW +EA   A   D PIFLSIGYS CHWCHVM  ESFE
Sbjct: 7   NSLIHETSPYLLQHAQNPVEWYAWNKEALGRAVDEDKPIFLSIGYSACHWCHVMAHESFE 66

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +E +A ++N+ F++IKVDREERPD+D +Y    Q   G GGWPLS FL+PD KP   GTY
Sbjct: 67  NENIADIMNENFINIKVDREERPDIDDIYQKGCQLATGQGGWPLSAFLTPDRKPFYIGTY 126

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
            PP   +GR GF++ILR++  AW +K   +  +    +E L     A+A     P E  +
Sbjct: 127 IPPSSSHGRNGFESILRQLSQAWKEKPGDIKGTAEKFLETLRGGERATA-----PAEPDR 181

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
           + L   A  L +  D+  GGFG APKFP    I  +  +       GK    S+  +  L
Sbjct: 182 SVLDEAAVNLLQMADTTHGGFGRAPKFPGSANISFLFRY-------GKLSGISKFTRFAL 234

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  MA+GGI D VGGGFHRYS DERW  PHFEKMLYD   +   Y +A+ +T    Y 
Sbjct: 235 LTLDRMARGGIFDQVGGGFHRYSTDERWLAPHFEKMLYDNALIPVNYAEAYQVTGSPAYL 294

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            I    LDY+ R++  P G  +S++DAD   TEG    +EG +YVW+ KEV++ILG  A 
Sbjct: 295 RIMEKTLDYVLRELSSPEGGFYSSQDAD---TEG----EEGRYYVWSKKEVKEILGADAD 347

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
            F   Y +   GN            ++GK +L      SA A + G+ + +   I+    
Sbjct: 348 AFCMFYDVTDGGN------------WEGKTILYNGAAPSAVAFQCGITVGELDGIIERSA 395

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYM 582
            KL + RS R  P LDDKV+ SWN L++++ AR  +                 S    Y+
Sbjct: 396 AKLLEARSGRVPPGLDDKVLASWNSLMVTALARGYR----------------ASGEARYL 439

Query: 583 EVAESAASFIRRHLYDEQTHR---LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
           + A     FI     D + HR   L  +++ G ++ PG+LDD+A+    LLD +E  +  
Sbjct: 440 DAARRCLGFI-----DAKMHRDGALMRTYK-GEARIPGYLDDHAYYGCALLDAFEVDAEE 493

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
           ++L  A E+ +   + F D E GG+F T+     +++R +  +D + PSGNS +   ++R
Sbjct: 494 RYLRRASEIGSHLVQNFWDEERGGFFMTSDVHEGLIVRPRSGYDLSLPSGNSAAAHLMLR 553

Query: 700 L 700
           L
Sbjct: 554 L 554


>gi|395645901|ref|ZP_10433761.1| hypothetical protein Metli_1447 [Methanofollis liminatans DSM 4140]
 gi|395442641|gb|EJG07398.1| hypothetical protein Metli_1447 [Methanofollis liminatans DSM 4140]
          Length = 690

 Score =  413 bits (1062), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 240/608 (39%), Positives = 334/608 (54%), Gaps = 55/608 (9%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRL  E SPYL QHAHNPVDW+ WGEEAF +AR  D P+FLSIGYSTCHWCHVM  ESF
Sbjct: 9   ANRLVGEKSPYLRQHAHNPVDWYPWGEEAFKKARDEDKPVFLSIGYSTCHWCHVMAEESF 68

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           ED GVA++LN+ FV++KVDREERPD+D VYM    AL G GGWPL++ ++PD  P    T
Sbjct: 69  EDAGVAEVLNEGFVAVKVDREERPDIDAVYMQVCLALTGRGGWPLTIVMTPDRLPFFAAT 128

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           Y P E + G  G   +L+K++  W+ +RD L  S      ++ + L A AS   L  +  
Sbjct: 129 YLPKETRLGVTGLIDVLKKIRHLWETRRDDLVGSA----REIVDDLGAGAS---LRGKAE 181

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
              LR    ++ + YD  +GGF  +PKFP P    M+++  +    TG     +  ++  
Sbjct: 182 TALLREGYAEMKRRYDPSYGGFDRSPKFPSP---HMIIFLIRYWHWTGDPMALAMAEQ-- 236

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             TL+ +  GGI D +G G HRY+ D +W VPHFEKMLYDQ  LA  + +A   T D FY
Sbjct: 237 --TLREVRGGGIFDQIGFGVHRYATDRKWLVPHFEKMLYDQAMLALAFTEAHMATGDAFY 294

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL-GEH 460
                +I  Y++RD+  P G  ++AEDADS   EG     EG FY+WT++EV   + GE 
Sbjct: 295 LSAADEIFTYVQRDLASPEGAFYTAEDADS---EGV----EGKFYLWTAEEVRSAVGGED 347

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
           A LF E Y +   G+ D+     PH     + +          +   G+P ++    L  
Sbjct: 348 AALFIEAYGIG-EGSGDI-----PHRAVSPQVL----------SRTTGIPEDEIRRRLEA 391

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R KL  VR  R RPH D+K+++ WN L++++ ARA +                 S R  
Sbjct: 392 VREKLLSVRKGRARPHRDEKILLDWNALMVAALARAGRY----------------SGRTG 435

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           Y+  A+ AA  +   L       L H + +G +   G L DYA+L+  L ++YE     +
Sbjct: 436 YVAAAQGAAGVLLDRLRRPDGG-LLHRYMDGEAAVSGMLADYAYLVWALAEVYEASFDPE 494

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
            L  A  L +   E F D  GGG++  + +   ++LR KE HDGA PSGNS+++  LV L
Sbjct: 495 ILREACRLADAMIERFGDPSGGGFYTVSADGEQLILRQKEIHDGALPSGNSMALFALVTL 554

Query: 701 ASIVAGSK 708
             +   S+
Sbjct: 555 FRLTGLSR 562


>gi|448469568|ref|ZP_21600250.1| hypothetical protein C468_14982 [Halorubrum kocurii JCM 14978]
 gi|445808905|gb|EMA58956.1| hypothetical protein C468_14982 [Halorubrum kocurii JCM 14978]
          Length = 740

 Score =  412 bits (1059), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 246/637 (38%), Positives = 333/637 (52%), Gaps = 71/637 (11%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           S+    NRL  E SPYL QHA NPV+W  WGEEAF  AR+ DVP+F+SIGYS+CHWCHVM
Sbjct: 2   SQPTERNRLDGEASPYLQQHADNPVNWQPWGEEAFERAREHDVPVFVSIGYSSCHWCHVM 61

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
             ESFEDE +A +LND FV +KVDREERPDVD  +MT  Q + GGGGWPLS + +P+ +P
Sbjct: 62  AEESFEDESIAAVLNDEFVPVKVDREERPDVDSTFMTVSQLVTGGGGWPLSAWCTPEGEP 121

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAW---------DKKRDMLAQSGAFAIEQLSE-A 266
              GTYFPPE +  +PGF+ +  ++ D+W         +++ D    S    +E + + +
Sbjct: 122 FYVGTYFPPEPRRNQPGFRDLCERIADSWADPEQREEMERRADQWTTSARDELESVPDPS 181

Query: 267 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKL 325
           L+  A  ++ P     N L   A    + YD  +GGFGS   KFP P  I +++    + 
Sbjct: 182 LAGDAGGSEAPG---PNLLDEAAAAAVRGYDDEYGGFGSGGAKFPMPGRIDVLMRAYAR- 237

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
             TG+    +        TL  MA+GG++D +GGGFHRY+VD +W VPHFEKMLYD  +L
Sbjct: 238 --TGRDAALT----AATGTLDGMARGGMYDQIGGGFHRYAVDRQWTVPHFEKMLYDNAEL 291

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADS---------AETEG 436
              YLDA  LT D  Y+ +  + L ++ R++    G  FS  DA S         A ++G
Sbjct: 292 PMAYLDAHRLTGDASYARVASETLGFIDRELRHDDGGFFSTLDARSRPPESRRGNAGSDG 351

Query: 437 ATRKK-----EGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKG 490
           +   +     EGAFYVWT  EV+  L E A  L KE Y +   GN +           +G
Sbjct: 352 SDAAEDVADVEGAFYVWTPGEVDAALDEPAASLAKERYGIASGGNFE-----------RG 400

Query: 491 KNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVI 550
             V          A +  M        L   R  LF+ R  RPRP  D+KV+ SWNG  I
Sbjct: 401 TTVPTIAASVPELADQRDMSTADVREALTAARVALFEARESRPRPARDEKVLASWNGRAI 460

Query: 551 SSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRN 610
           S+FA A ++L                  K Y ++A  A +F R  LYDE+T  L   + +
Sbjct: 461 SAFAAAGQVLG-----------------KPYADIASDALAFCRERLYDEETGGLARRWLD 503

Query: 611 GPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG-YFN--- 666
           G  + PG+LDD+AFL  G LD Y        L +A++L  T    F D + G  YF    
Sbjct: 504 GDVRGPGYLDDHAFLARGALDAYSATGDPAALGFALDLAETVVSDFYDADDGTIYFTRDP 563

Query: 667 ---TTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
              T   D ++  R +E  D + PS   V+   L  L
Sbjct: 564 DEETEQGDDTLFARPQEFTDRSTPSSLGVAAETLALL 600


>gi|294102620|ref|YP_003554478.1| hypothetical protein [Aminobacterium colombiense DSM 12261]
 gi|293617600|gb|ADE57754.1| protein of unknown function DUF255 [Aminobacterium colombiense DSM
           12261]
          Length = 595

 Score =  412 bits (1059), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 243/607 (40%), Positives = 338/607 (55%), Gaps = 61/607 (10%)

Query: 98  RNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVME 157
           +NK  NRL  E SPYLLQHAHNPVDW  WG+EAF +A++ + PIFLSIGYSTCHWCHVME
Sbjct: 2   KNKE-NRLITEKSPYLLQHAHNPVDWHPWGKEAFTKAQEENKPIFLSIGYSTCHWCHVME 60

Query: 158 VESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPL 217
            E F DE VA+LLND  VSIKVDREERPD+D V M     + G GGWPL++FL+P+ KP 
Sbjct: 61  KECFSDEEVAQLLNDACVSIKVDREERPDIDHVCMAVSLIMNGSGGWPLNLFLTPNGKPF 120

Query: 218 MGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK-- 275
              +Y P E     PG   ++ +VK  W  +++ + +S     E +  AL    ++ K  
Sbjct: 121 FAASYIPKETSGRIPGLMDMVPRVKWLWLMQKEDVLKSA----ESIMNALEKEMTNQKGT 176

Query: 276 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
            PD   +N  +   ++LS+++D  +GGF  APKFP P  +  +L       + GK  +  
Sbjct: 177 CPD---KNLAKKAFQELSRNFDPLWGGFSKAPKFPMPPVLLFLL-------EYGKIFKEE 226

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
           +  KMV  TL CMA GGI DH+GGGF RYS D  W +PHFEKMLYDQ  L   Y  A+ +
Sbjct: 227 KAIKMVEKTLDCMAMGGIRDHLGGGFARYSTDREWKIPHFEKMLYDQALLLKAYTAAWEM 286

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           T    Y  I  +I  Y+ RD+  P G  F+AEDADS   EG     EG FYVWT +E+  
Sbjct: 287 TGRDIYKKIAFEIAAYVLRDLRSPEGVFFAAEDADS---EGV----EGRFYVWTEEEIRR 339

Query: 456 IL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 514
           ++  E   LF + Y +   GN     ++ P +       L EL      A+   + L+K 
Sbjct: 340 LVPSEDRQLFLQAYGIHGEGNV----LALPAS-------LEEL------AATYNVELQKL 382

Query: 515 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 574
              L + R  LF+ R++R RPH D K++  WN L+I + A A +I               
Sbjct: 383 DQSLQKSRALLFEARNRRVRPHCDRKILTDWNALMIEALAFAGRIF-------------- 428

Query: 575 GSDRKEYMEVAESAASF-IRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
             + ++++E A +A  F + + +Y E+   + HS  +G    PG L+DY+F I  LL+L 
Sbjct: 429 --EERQFIEAARNAVDFLLEKAVYQEK--EVYHSVADGKGHIPGLLNDYSFFIRALLELE 484

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           E      +    + L  + +++F D + GGYF  +G D  +  R     DG   SGNSV+
Sbjct: 485 EATGEEDYGEKGMGLLRSMNDIFYDPKRGGYFMNSGLDELLFFRPWSGEDGVMVSGNSVA 544

Query: 694 VINLVRL 700
           ++NL+R 
Sbjct: 545 MMNLLRF 551


>gi|421090081|ref|ZP_15550882.1| PF03190 family protein [Leptospira kirschneri str. 200802841]
 gi|410001344|gb|EKO51958.1| PF03190 family protein [Leptospira kirschneri str. 200802841]
          Length = 711

 Score =  412 bits (1058), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 253/682 (37%), Positives = 368/682 (53%), Gaps = 61/682 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           +++ NRL+ E SPYL QH++NPVDWF WGEEA  +A+ +D  IFLS+GY+TCHWCHVME 
Sbjct: 28  SRNPNRLSKEKSPYLQQHSYNPVDWFPWGEEALTKAKDQDKLIFLSVGYATCHWCHVMEK 87

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFE++ +A  LN  FVSIKVDREERPD+D++YM  + A+   GGWPL++FL+P+ +P+ 
Sbjct: 88  ESFENQSIADYLNSHFVSIKVDREERPDIDRIYMDALHAMEQQGGWPLNMFLTPEGQPIT 147

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
           GGTYFPPE +YGR GF  +L  ++  W +KR  L  + +   + L ++  + A   +  D
Sbjct: 148 GGTYFPPESRYGRKGFLEVLNIIQKVWTEKRSELIAAASELSQYLKDSGESRAKEKQEAD 207

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGS--APKFPRPVEIQMML--YHSKKLEDTGKSGEA 334
             P+N            YDS+FGGF +    KFP  + +  +L  YHS        SG  
Sbjct: 208 FPPENCFDSGFLLYENYYDSQFGGFKTNQVNKFPPSMGLGFLLRYYHS--------SGNP 259

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
           +   +MV  TL  M +GGI+D +GGG  RYS D RW VPHFEKMLYD      +  +   
Sbjct: 260 N-ALEMVENTLLAMKRGGIYDQIGGGLCRYSTDPRWLVPHFEKMLYDNSLFLEILAEYSL 318

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
           ++K +       DI+ YL RDM   GG I        +  +  + ++EG FY+W  +E  
Sbjct: 319 VSKKISAKSFALDIVSYLHRDMRMDGGGI-------CSAEDADSEEEEGLFYIWDLEEFR 371

Query: 455 DILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 514
           ++ GE + L ++ + +   GN            F+GKN+L E    +   S       K+
Sbjct: 372 EVCGEDSSLLEKFWNVTKEGN------------FEGKNILHE----NFRGSNFTEEESKH 415

Query: 515 LN-ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
           L+  L   + KL + RSKR RP  DDK++ SWNGL I +  +                  
Sbjct: 416 LDGALTRGKAKLLERRSKRIRPLRDDKILTSWNGLYIKALVKTG---------------- 459

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
           +   R++++++AE   SFI ++L D +  R+   FR G S   G+ +DYA +I+  + L+
Sbjct: 460 IAFQREDFLKLAEETYSFIEKNLIDSKG-RILRRFREGESGILGYSNDYAEMIASSIVLF 518

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSGNSV 692
           E G G ++L  A+        LF  R   G F  TG D  VLLR   D +DG EPS NS 
Sbjct: 519 EAGRGVRYLQNAVFWMEETIRLF--RSTAGVFFDTGIDGEVLLRRSVDGYDGVEPSANSS 576

Query: 693 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHV 752
              +LV+L+ +  G  SD YR+ AE     F   L   A+  P +  A       SR+ V
Sbjct: 577 LAHSLVKLSFL--GVNSDRYREVAESIFLYFRKELYSYALNYPFLLSAYWSYKYHSREIV 634

Query: 753 VLVGHKSSVDFENMLAAAHASY 774
           ++   K+S    ++LA   + +
Sbjct: 635 LI--RKNSEAGRDLLAWIQSRF 654


>gi|222479721|ref|YP_002565958.1| hypothetical protein Hlac_1296 [Halorubrum lacusprofundi ATCC
           49239]
 gi|222452623|gb|ACM56888.1| protein of unknown function DUF255 [Halorubrum lacusprofundi ATCC
           49239]
          Length = 744

 Score =  411 bits (1056), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 255/674 (37%), Positives = 345/674 (51%), Gaps = 76/674 (11%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           S+    NRL  E SPYL QHA NPV+W  WGEEAF  AR+ DVP+F+SIGYS+CHWCHVM
Sbjct: 2   SQPTERNRLDGEASPYLQQHADNPVNWQPWGEEAFERAREHDVPVFVSIGYSSCHWCHVM 61

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
             ESFEDE +A +LN+ FV +KVDREERPDVD  +MT  Q + GGGGWPLS + +P  KP
Sbjct: 62  AEESFEDESIAAVLNEKFVPVKVDREERPDVDSTFMTVSQLVTGGGGWPLSAWCTPKGKP 121

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAW---------DKKRDMLAQSGAFAIEQLSEAL 267
              GTYFPPE +  +PGF+ +  ++ D+W          ++ D    S    +E + E  
Sbjct: 122 FYVGTYFPPEPRRNQPGFRDLCERIADSWADPEQREEMKRRADQWTTSARDELESVPEPD 181

Query: 268 SAS-ASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKL 325
           +A  AS          + L   A    + YD  +GGFGS   KFP P  I ++L    + 
Sbjct: 182 AAGDASGTGGAGPPGPDLLDEAAAAAIRGYDDEYGGFGSGGAKFPMPGRIDVLLRAYAR- 240

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
                 G+A+        TL  MA+GG++D +GGGFHRY+VD +W VPHFEKMLYD  +L
Sbjct: 241 ----SGGDAA--LTAATGTLDGMARGGMYDQIGGGFHRYAVDRQWTVPHFEKMLYDNAEL 294

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEG--------- 436
              YLD + LT D  Y+ +  + L +L R++    G  FS  DA S   E          
Sbjct: 295 PMAYLDGYRLTGDASYARVASETLGFLDRELRHDDGGFFSTLDARSRPPENRRGNAGSDE 354

Query: 437 -----ATRKKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKG 490
                     EGAFYVWT  EV+ +L E A  L K+ Y ++  GN +           +G
Sbjct: 355 SDDADDVADVEGAFYVWTPAEVDAVLDEPAASLAKDRYGIRSGGNFE-----------RG 403

Query: 491 KNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVI 550
             V       +  A +  M  E     L   R  LF+ R  RPRP  D+KV+ SWNG  I
Sbjct: 404 TTVPTIAASIAELADEHDMSTEAVREALTAARVALFEARESRPRPARDEKVLASWNGRAI 463

Query: 551 SSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRN 610
           S+FA A ++L                  + Y ++A  A SF R  LYDE+T  L   + +
Sbjct: 464 SAFATAGQVLG-----------------EPYADIASDALSFCRERLYDEETETLARRWLD 506

Query: 611 GPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTT-- 668
           G  + PG+LDD+AFL  G LD+Y      + L +A++L  T    F D   G  + T   
Sbjct: 507 GDVRGPGYLDDHAFLARGALDVYSVTGDPEALGFALDLAATVVSDFYDEADGTIYFTRDP 566

Query: 669 ------GEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 722
                 G D ++  R +E  D + PS   V+   L    +++ G ++D  R+ AE +  V
Sbjct: 567 DGNAGHGGDDTLFARPQEFTDQSTPSSLGVAAETL----ALLDGFRTD--REFAEVAETV 620

Query: 723 FETRLKDMAMAVPL 736
             T   D   A PL
Sbjct: 621 VTTH-ADRIRASPL 633


>gi|374585294|ref|ZP_09658386.1| hypothetical protein Lepil_1460 [Leptonema illini DSM 21528]
 gi|373874155|gb|EHQ06149.1| hypothetical protein Lepil_1460 [Leptonema illini DSM 21528]
          Length = 685

 Score =  411 bits (1056), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 250/665 (37%), Positives = 357/665 (53%), Gaps = 65/665 (9%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           + TNRL  E SPYLLQHAHNPVDW+AWGEEAF +AR  D  I +SIGY+TCHWCHVME E
Sbjct: 2   QKTNRLIHEKSPYLLQHAHNPVDWYAWGEEAFTKARNEDKLILISIGYATCHWCHVMERE 61

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFED+  A LLN+ +V+IKVDREE PDVD +YM  + A+   GGWPL++FL+PD +P+ G
Sbjct: 62  SFEDQSTADLLNEHYVAIKVDREELPDVDSIYMKALHAMGQPGGWPLNLFLTPDRRPITG 121

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           GTYFPP+  +GRP FK +L  +   W   R  L ++ +   E L+E    +A ++ LPD 
Sbjct: 122 GTYFPPQPAHGRPSFKQMLGTLAQMWKNDRPRLLEAASSITEFLNE---QNALASDLPD- 177

Query: 280 LPQNALRLCAEQLSKSYDSRFGGF-GSAP-KFPRPVEIQMMLYHSKKLEDTGKSGEASEG 337
            P    R   E + +++D + GGF G+ P KFP  + + ++L    +L +  + G +S  
Sbjct: 178 -PSIFARFIGE-MEQAFDVQRGGFYGNGPNKFPPSMALMLLL----RLHERDRQGSSSV- 230

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
             MV  TL+ M++GGI+D +GGG  RYS D  W VPHFEKMLYD         +A+ +T 
Sbjct: 231 LVMVEKTLEAMSRGGIYDQLGGGLCRYSTDPAWLVPHFEKMLYDNALFLQALTEAYRITG 290

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           + FY  +  D++ YLRRD++ P G  + AEDADS   EG     EG FYVW++ E  + L
Sbjct: 291 NDFYRRMAYDVIAYLRRDLMSPEGAFYCAEDADS---EGV----EGKFYVWSAAEFRETL 343

Query: 458 GEHAI------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPL 511
               +      L   ++ +   GN            F+GKN+L         AS+  + L
Sbjct: 344 RSSGLSDDEIRLLSLYWNVTEAGN------------FEGKNILHLTGSDEDFASQHSLTL 391

Query: 512 EKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNF 571
                +  + R+ LF VR +R RP  DDK++ SWN L+IS+ +RAS +    + + M   
Sbjct: 392 TSLNEMTQKARQALFAVRERRIRPLRDDKILTSWNALMISALSRASIVFGDASLADM--- 448

Query: 572 PVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLD 631
                        A + A F+  HL   Q  +L   +R+G ++    L D+A L   L+D
Sbjct: 449 -------------AVACADFVESHLM--QDGQLMRRYRDGEARFKATLTDHALLGCALID 493

Query: 632 LYEFGSGTKWLVWAIE-LQNTQDELFLDREGGGYFNTTGEDPS--VLLRVKEDHDGAEPS 688
           L+     + ++  A+E  +      F D    G    T ED S  + LR  + +DG  PS
Sbjct: 494 LFRVTGKSVYMRRALERAEAIMSSFFAD----GRLYETAEDDSDDLFLRPIDSYDGVMPS 549

Query: 689 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPS 748
           G S ++   V L+    G  +  Y + A+  L  F       A A P M  A    S  +
Sbjct: 550 GPSAALRLFVTLSRY--GESARIYEETAKVILRQFSPEWAQAARAYPAMVSAFLTFSDEA 607

Query: 749 RKHVV 753
           R+  +
Sbjct: 608 REIAI 612


>gi|257076883|ref|ZP_05571244.1| thymidylate kinase [Ferroplasma acidarmanus fer1]
          Length = 638

 Score =  411 bits (1056), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 244/618 (39%), Positives = 340/618 (55%), Gaps = 63/618 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N+LA E+SPYLL+H++NPVDW  W +EAF  A+K D P+FLSIGYS+CHWCHVME ESF 
Sbjct: 2   NKLANENSPYLLEHSNNPVDWNPWSDEAFNLAKKEDKPVFLSIGYSSCHWCHVMEQESFT 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  VAK +N  FV IKVDREE PDVD +YMT+ Q + G GGWPL+V L+PD KP+   TY
Sbjct: 62  DPEVAKRMNSTFVCIKVDREEMPDVDSLYMTFSQVMTGTGGWPLNVILTPDRKPIFAFTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
            P   +    G   +   +   W  KR  + ++G  AI +L          N  P +  +
Sbjct: 122 IPRVSRNNMIGIMELAENIDYLWKNKRGEMEKNGDEAISRLRNM--ERKEENNSPVDYKK 179

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
            A+    E L ++YDS +GGFG+APKFP    I  +L + K     GK     E  +MV 
Sbjct: 180 -AIEATYESLKRNYDSEYGGFGNAPKFPSFHNIIFLLNYYKA---HGK----EEALEMVK 231

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            +L+ M  GG++DHVGGGFHRYS D  + +PHFEKM YDQ      Y  A+ +T D FY 
Sbjct: 232 HSLRMMYIGGMYDHVGGGFHRYSTDPFFRIPHFEKMTYDQAMAIIAYSYAYDVTGDTFYK 291

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            +  +I  +L+++M   G   ++A DADS   EG    +EG +Y WT +E+ +  G+   
Sbjct: 292 NVVYEIYKFLKQEMFSRG--FYTAMDADS---EG----QEGKYYTWTYEELVENAGKK-- 340

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
            F   + + P GN       D ++   G+N+L    D        G P   Y N L   +
Sbjct: 341 -FVYDFNILPEGN-----FYDANSRQTGRNILYMGRDIQ------GDPTTLYKNELEALK 388

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYM 582
           +     R KR +P  DDK++   NGLVI + + AS I                 + K+ +
Sbjct: 389 KS----REKRIKPLTDDKILTDINGLVIKALSIASMIF----------------NDKDML 428

Query: 583 EVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWL 642
             AE +A FI   +Y ++  +L HS+RNG S   G LDDY+F++SGLL LYE      +L
Sbjct: 429 NTAEGSADFIMNDMYTDK--KLMHSYRNGKSSINGMLDDYSFMVSGLLSLYEASLNDIYL 486

Query: 643 VWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLAS 702
            +A +LQ T  + F D+  GG++N  G   ++L+R+KE +D A PSG S  + N++    
Sbjct: 487 DYARDLQKTIMDTFYDKTSGGFYNGMG---NLLVRLKESYDNAIPSGFSFEIGNMIVFNY 543

Query: 703 IVAGSKSDYYRQNAEHSL 720
           I      D YR   E S+
Sbjct: 544 I-----DDKYRVELEKSI 556


>gi|448562484|ref|ZP_21635442.1| thioredoxin domain containing protein [Haloferax prahovense DSM
           18310]
 gi|445718802|gb|ELZ70486.1| thioredoxin domain containing protein [Haloferax prahovense DSM
           18310]
          Length = 709

 Score =  410 bits (1055), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 252/666 (37%), Positives = 353/666 (53%), Gaps = 76/666 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W E A   AR+ D PIFLS+GYS CHWCHVM  ESF 
Sbjct: 8   NRLDEEQSPYLRQHADNPVNWQPWDETALDAAREADKPIFLSVGYSACHWCHVMADESFS 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  +A++LN+ FV +KVDREERPD+D++Y T  Q + GGGGWPLSV+L+P+ KP   GTY
Sbjct: 68  DPDIAEVLNEHFVPVKVDREERPDLDRIYQTICQLVTGGGGWPLSVWLTPEGKPFFVGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS-ASSNKLPDELP 281
           FPPE + G PGF+ ++    ++W   RD +A       EQ + A++     +  +P E P
Sbjct: 128 FPPEPRRGAPGFRDLVESFAESWRTDRDEIANRA----EQWTSAITDRLEETPDVPGEAP 183

Query: 282 -QNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
             + L    +   +  D   GGFG   PKFP+P  I  +L            G A  G++
Sbjct: 184 GSDVLDSTVQAALRGADRDHGGFGGDGPKFPQPGRIDALL-----------RGYAVSGRR 232

Query: 340 MVL----FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
             L     +L  MA GG+ DH+GGGFHRY VD  W VPHFEKMLYDQ  LA+ YLDA  L
Sbjct: 233 EALDVARQSLDAMANGGLRDHLGGGFHRYCVDREWTVPHFEKMLYDQAGLASRYLDAARL 292

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           T +  Y+ +  +  +++RR++    G  F+  DA S         +EG FYVWT  +V D
Sbjct: 293 TGNESYATVAAETFEFVRRELTHDDGGFFATLDAQSG-------GEEGTFYVWTPDDVRD 345

Query: 456 ILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS-ASKLGMPLEK 513
           +L E  A LF + Y + P GN            F+ K  ++ ++ ++A  A +  +   +
Sbjct: 346 LLPELDADLFCDRYGVTPGGN------------FENKTTVLNVSATTAELADEYDLDESE 393

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
             + L + R+ LF  R  R RP  D+KV+  WNGL+IS+FA+ S +L+ ++         
Sbjct: 394 VEDRLEKARKALFAAREGRERPARDEKVLAGWNGLMISAFAQGSVVLEDDS--------- 444

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
           + SD       A  A  F+R  L+D++T  L     NG  K  G+L+DYAFL  G  DLY
Sbjct: 445 LASD-------ARRALDFVRERLWDDETETLSRRVMNGEVKGDGYLEDYAFLARGAFDLY 497

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           +       L +A++L       F D + G  + T     S++ R +E  D + PS   V+
Sbjct: 498 QATGDLAPLSFALDLARATRREFYDADAGTLYFTPESGESLVTRPQEPTDQSTPSSLGVA 557

Query: 694 VINLVRL------------ASIVAGSKSDYYRQNA-EH-SLAVFETRLKDMAMAVPLMCC 739
               + L            A  V GS ++  R +  EH SLA+   +    A  VP +  
Sbjct: 558 TSLFLDLEQFAPDADFGDVADAVLGSFANRVRGSPLEHVSLALAAEK---AASGVPELTI 614

Query: 740 AADMLS 745
           AAD +S
Sbjct: 615 AADEVS 620


>gi|262197654|ref|YP_003268863.1| hypothetical protein [Haliangium ochraceum DSM 14365]
 gi|262081001|gb|ACY16970.1| protein of unknown function DUF255 [Haliangium ochraceum DSM 14365]
          Length = 681

 Score =  410 bits (1054), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 243/613 (39%), Positives = 344/613 (56%), Gaps = 67/613 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYLLQH  NPVDW+ WGEEAFA A+++  P+F+SIGY+ CHWCHVM  ESFE
Sbjct: 3   NRLAHESSPYLLQHKDNPVDWYPWGEEAFAAAQEQGKPVFVSIGYAACHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  +A ++N+ FV++K+DREERPDVD VYM  +Q L  GGGWPLS F +PD KP   GTY
Sbjct: 63  DAEIAAVMNELFVNVKIDREERPDVDAVYMNALQILGEGGGWPLSAFCTPDGKPYFLGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL---SEALSASASSNKLPDE 279
           FPP+D+YGRPGF ++LR +   ++ +RD + Q+    ++ L    E     A S ++   
Sbjct: 123 FPPQDRYGRPGFASVLRTMAKVFEDQRDKVDQNTEAIVDGLRRVDEHFRRGALSGEV-GA 181

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
           L  + L     QL++  D + GG GS PKFP      +       L   G+    +  ++
Sbjct: 182 LRADLLITAGRQLAQRSDPQHGGLGSKPKFPSSTTHAL-------LARAGRLAFGAPARE 234

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
             L   + MA+GGI+DH+GGGF RYSVDERW VPHFEKMLYD GQL  +Y DA+++ +D 
Sbjct: 235 AFLKQARSMARGGIYDHLGGGFARYSVDERWLVPHFEKMLYDNGQLLGIYGDAYAMDQDP 294

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 459
            ++ +  + + +L  +M  P G +++++DADS   EG    +EG +YVWT +E+  +LG 
Sbjct: 295 AFARVIDETITWLEDEMQHPSGALYASQDADS---EG----EEGKYYVWTPEEIRAVLGP 347

Query: 460 -HAILFKEHYYLKPTGNCD-----LSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
             AI F+  Y +  TGN +     LSR+SDP  +          +D +A AS        
Sbjct: 348 VDAIFFERAYGVSETGNFEHGTTVLSRVSDPGGD----------SDEAALASAR------ 391

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
                     +L   R +R  P  D KV+  WNGL +    RA              +  
Sbjct: 392 ---------ARLLAARKQRVAPETDTKVLAGWNGLAVRGAVRA--------------WET 428

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
            G+ R   + +A   A F+  H+  E   RL   F++G +K  G LDDYAF+  G L L 
Sbjct: 429 TGNARA--LALAVRVAEFLAGHMLHEGGTRLWRVFKDGSTKLDGTLDDYAFVAHGFLHLA 486

Query: 634 EFGSGTKWLVWAIELQNTQDELFL-DREGGG-YFNTTGEDPSVLLRVKEDHDGAEPSGNS 691
           E     +W      L +T  E F  +R+G G ++ T G+D  ++ R + + D A P+G S
Sbjct: 487 EATGDARWWRHGAALIDTILERFYEERDGVGIFYMTPGDDTLLVHRPESNSDHAIPAGAS 546

Query: 692 VSVINLVRLASIV 704
           V+V  L+RLA + 
Sbjct: 547 VAVACLLRLAQVA 559


>gi|335436727|ref|ZP_08559519.1| hypothetical protein HLRTI_06517 [Halorhabdus tiamatea SARL4B]
 gi|335437369|ref|ZP_08560149.1| hypothetical protein HLRTI_09692 [Halorhabdus tiamatea SARL4B]
 gi|334896155|gb|EGM34310.1| hypothetical protein HLRTI_09692 [Halorhabdus tiamatea SARL4B]
 gi|334897442|gb|EGM35575.1| hypothetical protein HLRTI_06517 [Halorhabdus tiamatea SARL4B]
          Length = 715

 Score =  410 bits (1053), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 240/615 (39%), Positives = 336/615 (54%), Gaps = 52/615 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYL  HA NPV W  W E A + A + D PIFLSIGY+ CHWCHVM  ESFE
Sbjct: 8   NRLAEEGSPYLQAHADNPVHWQPWDETALSAAEREDKPIFLSIGYAACHWCHVMAEESFE 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+  A +LN+ FV IKVDREERPDVD++Y T  Q L   GGWPLSV+L+PD +P   GTY
Sbjct: 68  DDETAAVLNENFVPIKVDREERPDVDRIYQTLAQLLDQQGGWPLSVWLTPDGRPFYVGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS--ASSNKLPDEL 280
           FPP+ + GRPGF  +L  ++  W+  R+ + Q      + +S  L  +  A+ +   DEL
Sbjct: 128 FPPDSRGGRPGFAELLEDLQATWENDREGIEQRADQWADAISGELEGTPDAARDTAGDEL 187

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDT----GKSGEAS 335
               LR  A+   ++ D   GGFGS  PKFP+P  +Q++L    +  D     G++ EA+
Sbjct: 188 ----LRSGADAAVRTADREQGGFGSGGPKFPQPGRLQLLLRADARFGDARREEGENAEAT 243

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
           E + ++  TL  M  GG++DHVGGGFHRY+ D  W VPHFEKMLYD  ++  V L+A+  
Sbjct: 244 EYRSILTETLDAMVDGGLYDHVGGGFHRYATDRSWTVPHFEKMLYDNAEIPRVLLEAYRA 303

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           T D  Y+ + R+  D+L R++  P G  +S  DA S   EG    +EG FYVWT  +V +
Sbjct: 304 TGDERYARVARETFDFLDRELGHPEGGFYSTLDARS---EG----EEGKFYVWTPAQVRE 356

Query: 456 ILGEHA--ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
           ++ +     L  E Y +   GN +            G+ VL         A++ G+   +
Sbjct: 357 VIDDETDVSLVCERYGITEEGNFE-----------DGQTVLTIAASVDELAARSGLGAGE 405

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
               L   R +LFD RS+R RP  D+K++  WNGL IS+ A  S  L             
Sbjct: 406 VRERLDRAREELFDARSERTRPPRDEKILAGWNGLAISALAEGSLTL------------- 452

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
            G+D   +++ A  A  F+R  L+D+    L+  + +G  +  G+L+DYAFL  G LD Y
Sbjct: 453 -GND---FLDRAVDALEFVRETLWDDDAGLLKRRYIDGDVRVDGYLEDYAFLARGALDCY 508

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTT--GE--DPSVLLRVKEDHDGAEPSG 689
                   L +A++L    +  F D++ G  + T   GE  +  +L R +E  D + PS 
Sbjct: 509 GASGDLDHLAFALDLAREIETRFFDKDVGTLYFTEAPGESRETDLLARPQELTDRSTPSS 568

Query: 690 NSVSVINLVRLASIV 704
             V+V  LV L   V
Sbjct: 569 AGVAVDVLVTLDEFV 583


>gi|448570870|ref|ZP_21639381.1| thioredoxin domain containing protein [Haloferax lucentense DSM
           14919]
 gi|448595768|ref|ZP_21653215.1| thioredoxin domain containing protein [Haloferax alexandrinus JCM
           10717]
 gi|445722788|gb|ELZ74439.1| thioredoxin domain containing protein [Haloferax lucentense DSM
           14919]
 gi|445742222|gb|ELZ93717.1| thioredoxin domain containing protein [Haloferax alexandrinus JCM
           10717]
          Length = 703

 Score =  409 bits (1052), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 251/675 (37%), Positives = 351/675 (52%), Gaps = 76/675 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W E A   AR+ D PIFLSIGYS CHWCHVM  ESF 
Sbjct: 8   NRLDEEQSPYLRQHADNPVNWQPWDETALEAAREADKPIFLSIGYSACHWCHVMADESFS 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  +A++LN+ FV +KVDREERPD+D++Y T  Q + GGGGWPLSV+L+P+ KP   GTY
Sbjct: 68  DPDIAEVLNEEFVPVKVDREERPDLDRIYQTICQQVTGGGGWPLSVWLTPEGKPFFVGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP- 281
           FPPE + G PGF+ ++    ++W   RD +          +++ L  +  +   P E P 
Sbjct: 128 FPPEPRRGAPGFRDVVESFAESWRTDRDEIENRADQWTSAITDRLEETPDT---PGEAPG 184

Query: 282 QNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
            + L    +   +  D   GGFG   PKFP+P  I  +L            G A  G++ 
Sbjct: 185 SDILDTTVQAALRGADRDHGGFGGDGPKFPQPGRIDALL-----------RGYAVSGRRE 233

Query: 341 VL----FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
            L     +L  MA GG+ DH+GGGFHRY VD  W VPHFEKMLYDQ  LA+ YLDA  LT
Sbjct: 234 ALDVARQSLDAMANGGLRDHLGGGFHRYCVDREWTVPHFEKMLYDQAGLASRYLDAARLT 293

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
            +  Y+ +  +  +++RR++    G  F+  DA S         +EG FYVWT  +V D+
Sbjct: 294 GNDSYATVAAETFEFVRRELTHDDGGFFATLDAQSG-------GEEGTFYVWTPADVRDL 346

Query: 457 LGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS-ASKLGMPLEKY 514
           L E  A LF + Y + P GN            F+ K  ++ ++ ++A  A +  +   + 
Sbjct: 347 LPELDADLFCDRYGVTPGGN------------FEDKTTVLNVSATTADLADEYDLDESEV 394

Query: 515 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 574
            + L + R+ LF  R  R RP  D+KV+  WNGL+IS+FA+ S +L+ ++ +A       
Sbjct: 395 EDRLEKARKALFAAREGRERPARDEKVLAGWNGLMISAFAQGSVVLEDDSLAAD------ 448

Query: 575 GSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE 634
                     A  A  F+R  L+D++T  L     NG  K  G+L+DYAFL+ G  DLY+
Sbjct: 449 ----------ARRALDFVRERLWDDETETLSRRVMNGEVKGDGYLEDYAFLVRGAFDLYQ 498

Query: 635 FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
                  L +A++L       F D + G  + T     S++ R +E  D + PS   V+ 
Sbjct: 499 ATGDLAPLSFALDLARATRREFYDADAGTLYFTPESGESLVTRPQEPTDQSTPSSLGVAT 558

Query: 695 INLVRL------------ASIVAGSKSDYYRQNA-EH-SLAVFETRLKDMAMAVPLMCCA 740
              + L            A  V GS ++  R +  EH SLA+   +    A  VP +  A
Sbjct: 559 SLFLDLKQFAPDAGFGEVADAVLGSFANRVRGSPLEHVSLALAAEK---AASGVPELTVA 615

Query: 741 ADMLSVPSRKHVVLV 755
           AD   VP      L 
Sbjct: 616 AD--EVPDEWRATLA 628


>gi|344940058|ref|ZP_08779346.1| hypothetical protein Mettu_0287 [Methylobacter tundripaludum SV96]
 gi|344261250|gb|EGW21521.1| hypothetical protein Mettu_0287 [Methylobacter tundripaludum SV96]
          Length = 754

 Score =  409 bits (1052), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 250/634 (39%), Positives = 349/634 (55%), Gaps = 58/634 (9%)

Query: 95  SHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCH 154
           S S +   NRL    SPYLLQHAHNPVDW+ WGEEAFA+ARK + PI LSIGYSTC+WCH
Sbjct: 4   SLSTHASANRLIDSSSPYLLQHAHNPVDWYPWGEEAFAKARKENKPILLSIGYSTCYWCH 63

Query: 155 VMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDL 214
           VME E FE+  +AKL+N+  VSIK+DRE+RPDVD +YMT  Q +   GGWP +VF++PDL
Sbjct: 64  VMEREIFENPEIAKLMNESIVSIKIDREQRPDVDDLYMTATQMMTHSGGWPNNVFVTPDL 123

Query: 215 KPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDML---AQSGAFAIEQLSEALSASA 271
           KP   GTYFPP        F ++++++   W + +  L   A+  A AI ++ +    +A
Sbjct: 124 KPFYAGTYFPP------AAFSSLIQQIHYIWMQDQVPLKAQAERLASAIIRIKQQ-ENNA 176

Query: 272 SSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKS 331
            S+ LP      AL       S  YD+R GGF  APKFP   +  + L  + +L      
Sbjct: 177 QSSSLPGSRLVEAL---ISHFSDYYDNRLGGFYQAPKFPNE-DALLFLLEAYRLTSNNTC 232

Query: 332 GEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLD 391
            E + G      TL+ MA+GGIHDHVGGGFHRY+ D +W +PHFEKMLY+Q  L   Y +
Sbjct: 233 LEMARG------TLEKMAEGGIHDHVGGGFHRYATDAQWRIPHFEKMLYNQALLGRAYTE 286

Query: 392 AFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSK 451
            ++L+       +   I D+  R M    G  +SA DA+       T   EGA+Y WT  
Sbjct: 287 LYALSNKPDDRVVAEGIFDFTLRQMTHKDGGFYSALDAE-------TDAVEGAYYAWTDA 339

Query: 452 EVEDIL--GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 509
           E++D L    +A L K HY     G  ++ ++   H    G+ VL  +   S SA+  G+
Sbjct: 340 ELQDALDTDSYAWLMK-HY-----GLAEIPKIPG-HKHVDGR-VLYLIQPLSESATAEGL 391

Query: 510 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 569
             E  +         L + R KR  PHLD+K+I SWNGL+I +FARA   ++        
Sbjct: 392 SYEDAVKKQQAVMTSLRESRDKRKLPHLDNKIITSWNGLMIDAFARAGLCMR-------- 443

Query: 570 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 629
                   + EY E +  AA FI  +L  +Q   L  ++R+G ++   + +DYAF+I GL
Sbjct: 444 --------KLEYTEASRRAADFILANL-RKQDGSLYRTWRDGQAEISAYFEDYAFMIQGL 494

Query: 630 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 689
           + +Y      ++L  A EL     +LF D + GGY+ T G +  +L+R+K   D A PSG
Sbjct: 495 VSIYRAAKDNRYLQAAKELAAKAKQLFWDEKHGGYYFTDGSE-LLLVRMKNAVDSAIPSG 553

Query: 690 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 723
           N+V    L+ L  I   ++   ++Q AE  L  F
Sbjct: 554 NAVMAQALLDLYEITGDAE---WKQQAEALLIAF 584


>gi|448627283|ref|ZP_21671896.1| thioredoxin [Haloarcula vallismortis ATCC 29715]
 gi|445759112|gb|EMA10399.1| thioredoxin [Haloarcula vallismortis ATCC 29715]
          Length = 733

 Score =  409 bits (1051), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 245/683 (35%), Positives = 359/683 (52%), Gaps = 71/683 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL    SPYL QHA NPV+W  W E A   A++RDVPIFLSIGY+ CHWCHVME ESFE
Sbjct: 11  NRLDEAESPYLRQHADNPVNWQPWDETALEAAKERDVPIFLSIGYAACHWCHVMEEESFE 70

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +E +A+ LN+ FV IKVDREERPD+D VYM+  Q + GGGGWPLS +L+PD +P   GTY
Sbjct: 71  NEAIAEQLNEHFVPIKVDREERPDLDSVYMSICQQVTGGGGWPLSAWLTPDGEPFYVGTY 130

Query: 223 FPPEDKYGRPGFKTILRKVKDAW---DKKRDM--LAQSGAFAIEQLSEALSASASSNKLP 277
           FPPE+K G+PGF  +L+++ D+W   +++ +M   AQ    AIE   EA  A       P
Sbjct: 131 FPPEEKRGQPGFGDLLQRLADSWSDPEQREEMENRAQQWTEAIESDLEATPAD------P 184

Query: 278 DELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
           ++  ++ ++       +  D + GG+GS  PKFP+   +  +L   +   D G+     +
Sbjct: 185 EDPAEDIIQTAGTIAHRGADRQDGGWGSGGPKFPQNGRLHALL---RAHADGGQ----ED 237

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
              +V  TL  MA  G++DHVGGGFHRY+ D++W VPHFEKMLYD  ++   +L  +   
Sbjct: 238 YLNVVEETLDVMADRGLYDHVGGGFHRYATDQQWAVPHFEKMLYDNAEIPRAFLAGYQAI 297

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSA-----------------ETEGATR 439
               Y+ + R+  ++++R++  P G  FS  DA+SA                   E    
Sbjct: 298 GSERYASVVRETFEFVQRELQHPDGGFFSTLDAESAPHSESRSDSEQSSGESPRDEPGGE 357

Query: 440 KKEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 497
            +EG FYVWT ++V D + +   A +F ++Y +   GN            F+G  VL   
Sbjct: 358 TEEGLFYVWTPEQVHDAVDDETDAEVFCDYYGVTERGN------------FEGATVLAVR 405

Query: 498 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 557
              +  A +     ++    L     + F+ R  RPRP  D+KV+  WNGL+I + A  +
Sbjct: 406 KPVAVLAEEYEQSEDEITASLQRALNQTFEARKDRPRPARDEKVLAGWNGLMIRTLAEGA 465

Query: 558 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 617
            +L                  ++Y +VA  A SF+R HL+DE   RL   +++G     G
Sbjct: 466 IVLD-----------------EQYADVAADALSFVREHLWDEDERRLNRRYKDGDVAIDG 508

Query: 618 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 677
           +L+DYAFL  G L L+E     + L +A++L     E F D E G  F T     S++ R
Sbjct: 509 YLEDYAFLGRGALTLFEATGDVEHLAFAMDLGQAITEAFWDDEQGTLFFTPTGGESLVAR 568

Query: 678 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 737
            +E  D + PS   V+V  L+ L+     S +D +   AE  L     R+    +    +
Sbjct: 569 PQELTDQSTPSSTGVAVDLLLSLSHF---SDNDRFESVAERVLRTHADRVSSNPLQHASL 625

Query: 738 CCAADMLSVPSRKHVVLVGHKSS 760
             A D     + + + LVG +S+
Sbjct: 626 TLATDTYEQGALE-LTLVGDQSA 647


>gi|448731719|ref|ZP_21714012.1| hypothetical protein C450_00645, partial [Halococcus salifodinae
           DSM 8989]
 gi|445805618|gb|EMA55820.1| hypothetical protein C450_00645, partial [Halococcus salifodinae
           DSM 8989]
          Length = 580

 Score =  409 bits (1051), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 231/610 (37%), Positives = 329/610 (53%), Gaps = 43/610 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W ++A   AR+RDVPIFLSIGYS CHWCHVME ESFE
Sbjct: 7   NRLDEEQSPYLRQHADNPVNWQPWDDDALDAARERDVPIFLSIGYSACHWCHVMEDESFE 66

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA+ LND FV IKVDREERPD+D++Y T    + G GGWPLSV+L+PD +P   GTY
Sbjct: 67  DERVAERLNDEFVPIKVDREERPDLDRLYQTICGMVSGQGGWPLSVWLTPDGRPFYVGTY 126

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKR-DMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           FP ++K G+PGF  +L  + ++W+  R D+  ++  +A     E  +      ++PD   
Sbjct: 127 FPRDEKRGQPGFLDLLDSIAESWENDREDIEGRADQWAGAMAGELEATPEQPGEVPD--- 183

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
            + L   A+Q  ++ D  +GGFG   KFP+   + +++   +  E TG+        ++ 
Sbjct: 184 SDLLETAAQQAVENADREYGGFGHGQKFPQTGRLHLLM---RAAERTGRES----FDEVA 236

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
              L  M++GG+ DH GGGFHRY+ D  W VPHFEKMLYD  +L   YL  +  T    Y
Sbjct: 237 HEALDAMSEGGLRDHAGGGFHRYTTDREWTVPHFEKMLYDNAELTRAYLAGYRRTGAERY 296

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH- 460
           + + R+ L ++ R++  P G  FS  DA S +  G   ++EGAFYVWT   V D + +  
Sbjct: 297 AEVARETLGFVERELRHPDGGFFSTLDAQSEDESG--EREEGAFYVWTPNGVHDAVDDEF 354

Query: 461 -AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
            A LF E Y +   GN +            GK VL    +    A +     E+    L 
Sbjct: 355 AADLFCERYGVTEAGNFE-----------DGKTVLTVSTEIEDLADEHDTTTEEVSAELE 403

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
             R  +F  R++R RP  D+KV+  WNGL+IS+FA A   L +                 
Sbjct: 404 RAREAVFAARAERERPERDEKVLAGWNGLMISAFAEAGLALDA----------------- 446

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
            + + A +   F+  HL++++  RLQ  +++G  K  G+L+DYAFL  G L+ YE     
Sbjct: 447 RFADTAVAGIEFVHEHLWNDEKRRLQRRYKDGDVKIEGYLEDYAFLARGALNCYEATGEV 506

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
             L +A++L    +  F D +    + T     S++ R +E  D + PS   V+V  L+ 
Sbjct: 507 DHLAFALDLARAIETEFWDSDEETLYFTPQTGESLVARPQELDDQSTPSSTGVAVDVLLA 566

Query: 700 LASIVAGSKS 709
           L    A   S
Sbjct: 567 LDHFAADRPS 576


>gi|407772664|ref|ZP_11119966.1| hypothetical protein TH2_02165 [Thalassospira profundimaris WP0211]
 gi|407284617|gb|EKF10133.1| hypothetical protein TH2_02165 [Thalassospira profundimaris WP0211]
          Length = 679

 Score =  409 bits (1051), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 242/681 (35%), Positives = 366/681 (53%), Gaps = 65/681 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L +E SPYL+QH  NPV W  W  +  A+A++ + PI LS+GY+ CHWCHVM  ESFE
Sbjct: 6   NNLGSETSPYLVQHRDNPVHWQPWSTDILAKAKELNKPILLSVGYAACHWCHVMAHESFE 65

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DEG+A L+N+ F++IK+DREERPD+D +Y   +  L   GGWPL++FL+PD +P  GGTY
Sbjct: 66  DEGIAALMNELFINIKLDREERPDLDALYQNALALLGQQGGWPLTMFLTPDGEPFWGGTY 125

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA--SASSNKLPDEL 280
           FP E +YGRPGF  +L+ V   + +K D +  +    + Q+S AL    SA+   +P   
Sbjct: 126 FPKEARYGRPGFGDVLKTVAKIYAEKPDDVRHN----VSQISNALIKMNSAAVGAVPS-- 179

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
               +  C     +  D   GG   APKFP+P  +  +     + +D G        +++
Sbjct: 180 -LEMIDRCGHGCLQIMDGENGGTSGAPKFPQPSLLSYIWRTGVRTDDDGL-------KRI 231

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
           V  +L  M +GGI+DH+GGG  RY+VD++W VPHFEKMLYD  QL ++  D + +  +  
Sbjct: 232 VKHSLDRMCQGGIYDHLGGGLARYAVDDQWLVPHFEKMLYDNAQLIDLLCDVWRVDPNPL 291

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
           Y+    + + ++ R+M  PGG   ++ DADS   EG     EG FYVW+  E++ ILG +
Sbjct: 292 YAKRVEETIGWILREMRIPGGAFTASLDADS---EGV----EGKFYVWSEDEIDQILGAN 344

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
           A LFK+ Y +   GN            ++G  +L      + +AS L +  +     L E
Sbjct: 345 ADLFKKFYDVSKDGN------------WEGHTIL------NRTASGLELADDATEEKLAE 386

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R KL   R+KR RP  DDK +  WN + I++FA A+                    R +
Sbjct: 387 LRAKLLAERAKRIRPGWDDKALTDWNAMTIAAFAEAAMTFH----------------RAD 430

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           +++ A+ A  F+   L   +  R  HS+R+G  +  G L+DYA +I   L LYE      
Sbjct: 431 WLDYAKLAYGFVINTLM--KGDRFLHSYRDGRVQHAGMLEDYAHMIRAALRLYECFGEDA 488

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
           +L  AI      + LF D + GGYF +  +   +++R K   D A PSGN++   NL +L
Sbjct: 489 YLNEAIRWSAAVETLFADAK-GGYFQSASDASDLVVRQKPFMDNAVPSGNAIMAQNLAKL 547

Query: 701 ASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSS 760
            ++   ++   YR  AE +LA F  R+ +    +P +  AA+ML  P +  +VL+    S
Sbjct: 548 YALTGDTQ---YRDQAEITLAAFGGRIGEQFPNMPGLMMAAEMLQNPVQ--IVLIAKDRS 602

Query: 761 VDFENMLAAAHASYDLNKTVS 781
             + +M  A   +Y  N+ ++
Sbjct: 603 QTYLDMRRAIFGAYLPNRAIT 623


>gi|448455362|ref|ZP_21594542.1| hypothetical protein C469_02259 [Halorubrum lipolyticum DSM 21995]
 gi|445813964|gb|EMA63937.1| hypothetical protein C469_02259 [Halorubrum lipolyticum DSM 21995]
          Length = 747

 Score =  409 bits (1050), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 258/692 (37%), Positives = 357/692 (51%), Gaps = 87/692 (12%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           S+    NRL  E SPYL QHA NPV+W  WGEEAF  AR+ DVP+F+SIGYS+CHWCHVM
Sbjct: 2   SQPTERNRLDGEASPYLQQHADNPVNWQPWGEEAFERAREHDVPVFVSIGYSSCHWCHVM 61

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
             ESFEDE VA +LN+ FV +KVDREERPDVD  +MT  Q + GGGGWPLS + +P+ +P
Sbjct: 62  AEESFEDESVAAVLNESFVPVKVDREERPDVDSTFMTVSQLVTGGGGWPLSAWCTPEGEP 121

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAW---------DKKRDMLAQSGAFAIEQL---- 263
              GTYFPPE +  +PGF+ +  ++ D+W          ++ D    S    +E +    
Sbjct: 122 FYVGTYFPPEPRRNQPGFRDLCERIADSWADPEQREEMKRRADQWTTSARDELESVPDSG 181

Query: 264 -----SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQM 317
                 +A   S +    PD L + A         + YD  +GGFGS   KFP P  I +
Sbjct: 182 PVGGAGDAGDMSGAEAPGPDLLDEAAAAAI-----RGYDDEYGGFGSGGAKFPMPGRIDV 236

Query: 318 MLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEK 377
           +L    K   TG++   +        TL  MA+GG++D VGGGFHRY+VD +W VPHFEK
Sbjct: 237 LLRAYAK---TGRNAALT----AATGTLDGMARGGMYDQVGGGFHRYAVDRQWTVPHFEK 289

Query: 378 MLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADS------ 431
           MLYD  +L   YLDA  LT D  Y+ +  + L +L R++    G  FS  DA S      
Sbjct: 290 MLYDNAELPMAYLDAHRLTGDASYARVANETLGFLDRELRHDEGGFFSTLDARSRPPASR 349

Query: 432 ---AETEGATRKK-----EGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMS 482
              A ++G+ R       EGAFYVWT  EV+ +L E A  L K+ Y ++  GN +     
Sbjct: 350 RGDAGSDGSGRDDDANDVEGAFYVWTPGEVDAVLDEPAASLAKDRYGIESGGNFE----- 404

Query: 483 DPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVI 542
                 +G  V       +  A    M  +     L   R  LF+ R  RPRP  D+KV+
Sbjct: 405 ------RGTTVPTIAASVAELAEAHDMSTDDVRETLTAARVALFEARESRPRPARDEKVL 458

Query: 543 VSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTH 602
            SWNG  IS+FA A ++L                  + Y ++A  A +F R  LYDE+T 
Sbjct: 459 ASWNGRAISAFAAAGRVLG-----------------EPYADIASDALAFCRERLYDEETG 501

Query: 603 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG 662
            L   + +G  + PG+LDD+AFL  G LD Y      + L +A++L  T    F D E G
Sbjct: 502 ALARRWLDGDVRGPGYLDDHAFLARGALDAYSATGDPEALGFALDLAETIVSDFYDEEDG 561

Query: 663 G-YFN-----TTG--EDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDY-YR 713
             YF      T G   D ++  R +E  D + PS   V+   L    +++ G ++D  + 
Sbjct: 562 TIYFTRDPDETAGGDGDDTLFARPQEFTDRSTPSSLGVAAETL----ALLDGFRTDREFA 617

Query: 714 QNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 745
           + AE  +     R++   +    +  AAD ++
Sbjct: 618 EVAERVVTTHADRIRASPLEHVSLVRAADRVA 649


>gi|448439398|ref|ZP_21588039.1| hypothetical protein C471_00950 [Halorubrum saccharovorum DSM 1137]
 gi|445691449|gb|ELZ43640.1| hypothetical protein C471_00950 [Halorubrum saccharovorum DSM 1137]
          Length = 751

 Score =  409 bits (1050), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 260/683 (38%), Positives = 352/683 (51%), Gaps = 87/683 (12%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           S+    NRL  E SPYL QHA NPV+W  WGE AF  AR+ DVP+F+SIGYS+CHWCHVM
Sbjct: 2   SQPTERNRLDGEASPYLQQHADNPVNWQPWGEAAFERAREHDVPVFVSIGYSSCHWCHVM 61

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
             ESFEDE VA +LN+ FV +KVDREERPDVD  +MT  Q + GGGGWPLS + +P+ +P
Sbjct: 62  AEESFEDESVAAVLNEEFVPVKVDREERPDVDSAFMTVSQLVTGGGGWPLSAWCTPEGEP 121

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAW---------DKKRDMLAQSGAFAIEQLSEAL 267
              GTYFPPE +  +PGF+ +  ++ D+W          ++ D    S    +E + +A 
Sbjct: 122 FYVGTYFPPEPRRNQPGFRDLCERIADSWADPEQREEMQRRADQWTTSARDELESVPDAE 181

Query: 268 SASA-------SSNKLPDELP-QNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMM 318
           +  A        ++    E P  + L   A    + YD  +GGFGS   KFP P  I ++
Sbjct: 182 AGPAGGADDAGGTDGADGEAPGPDLLDEAAAAAIRGYDDEYGGFGSGGAKFPMPGRIDVL 241

Query: 319 LYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKM 378
           +    +   TG+    +        TL  MA+GG++D +GGGFHRY+VD +W VPHFEKM
Sbjct: 242 MRAYAR---TGRDAALT----AATGTLDGMARGGMYDQIGGGFHRYAVDRQWTVPHFEKM 294

Query: 379 LYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT 438
           LYD  +L   +LDA  LT D  Y+ +  + L +L R++    G  FS  DA S   E  T
Sbjct: 295 LYDNAELPMAFLDAARLTGDASYARVASETLGFLDRELRHDDGGFFSTLDARSRPPE--T 352

Query: 439 RKK----------------EGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRM 481
           R+                 EGAFYVWT  EV+ +L E A  L KE Y ++  GN +    
Sbjct: 353 RRGGVGSDGSDGSGHAADVEGAFYVWTPGEVDAVLDEPAASLAKERYGIESGGNFE---- 408

Query: 482 SDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKV 541
                  +G  V          A    M  E     L E R  LF+ R  RPRP  D+KV
Sbjct: 409 -------RGTTVPTVAASIEELADDHDMSPEAVREALTEARVALFEARESRPRPARDEKV 461

Query: 542 IVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT 601
           + SWNG  IS+FA A ++L                  + Y ++A  A +F R +LYDE T
Sbjct: 462 LASWNGRAISAFAAAGQVLG-----------------EPYADIAGDALAFCRENLYDEST 504

Query: 602 HRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG 661
             L   + +G  + PG+LDD+AFL  G LD+Y        L +A++L  T    F D E 
Sbjct: 505 GDLARRWLDGDVRGPGYLDDHAFLARGALDVYAATGDPDALGFALDLAETVVADFYDDED 564

Query: 662 GGYFNT------TGE--DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYR 713
           G  + T       GE  D ++  R +E  D + PS   V+   LV    ++ G ++D  R
Sbjct: 565 GTIYFTRDPDEAAGEDGDDTLFARPQEFTDRSTPSSLGVAAETLV----LLDGFRTD--R 618

Query: 714 QNAEHSLAVFETRLKDMAMAVPL 736
           + AE + AV  T   D   A PL
Sbjct: 619 EFAEVAEAVVTTH-ADRIRASPL 640


>gi|448529052|ref|ZP_21620367.1| hypothetical protein C467_01076 [Halorubrum hochstenium ATCC
           700873]
 gi|445709758|gb|ELZ61582.1| hypothetical protein C467_01076 [Halorubrum hochstenium ATCC
           700873]
          Length = 744

 Score =  409 bits (1050), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 252/684 (36%), Positives = 348/684 (50%), Gaps = 75/684 (10%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           S+    NRL  E SPYL QHA NPV+W  WG+EAF  AR+ DVP+F+SIGYS+CHWCHVM
Sbjct: 2   SQPTERNRLDGEASPYLRQHADNPVNWQPWGDEAFERAREHDVPVFVSIGYSSCHWCHVM 61

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
             ESFEDE VA ++ND FV IKVDREERPDVD  +MT  Q + GGGGWPLS + +P+ KP
Sbjct: 62  AEESFEDESVAGVINDSFVPIKVDREERPDVDSTFMTVCQLVTGGGGWPLSAWCTPEGKP 121

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWD---------KKRDMLAQSGAFAIEQLSEAL 267
              GTYFP E +  +PGF+ +  ++ D+W          ++ D  A+S    +E +    
Sbjct: 122 FYVGTYFPLEARRNQPGFRDLCERIADSWSDPEQREEMRRRADQWAESARDELESVPTPD 181

Query: 268 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLY-HSKKL 325
           +A               L   A    + YD  +GGFGS   KFP P  I +++  +++  
Sbjct: 182 AADPDGEGDASPPGDGLLESAAASALRGYDDEYGGFGSGGAKFPMPGRIDLLMRAYARSG 241

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
            D   S  A         TL  MA+GG++D +GGGFHRY+VD  W VPHFEKMLYD  +L
Sbjct: 242 RDALLSAAAG--------TLDGMARGGMYDQIGGGFHRYAVDREWTVPHFEKMLYDNAEL 293

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDA---------DSAETEG 436
              YLD + LT D  Y+ +  + L +L R++    G  FS  DA         D  E+E 
Sbjct: 294 PMAYLDGYRLTGDPAYARVASESLAFLDRELRRDDGGFFSTLDARSRPPESRRDGNESE- 352

Query: 437 ATRKKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 495
                EGAFYVWT +EV+ +L E A  L KE Y ++P GN +           +G  V  
Sbjct: 353 EGEDVEGAFYVWTPEEVDAVLDEPAASLVKERYGIRPGGNFE-----------RGTTVPT 401

Query: 496 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 555
                   A+   +  E+    L E R  LFD R  RPRP  D+KV+ SWNG  IS+FA 
Sbjct: 402 LAASVDELAADRDLSPEEVREALTEARTALFDARESRPRPARDEKVLASWNGRAISAFAD 461

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD--EQTHRLQHSFRNGPS 613
           A+  L                  + Y ++A  A  F R  LYD   +T  L   + +G  
Sbjct: 462 AAGTLG-----------------EPYADIAREALDFCRDRLYDPEAETGALARRWLDGDV 504

Query: 614 KAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT------ 667
           + PG+LDDYAFL  G LD+Y      + L +A+EL       F D + G  + T      
Sbjct: 505 RGPGYLDDYAFLARGALDVYAATGDLEPLGFALELAEALVAEFYDADDGTIYFTRSLDGR 564

Query: 668 ----TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSD-YYRQNAEHSLAV 722
                G+   ++ R +E  D + PS   V+   L    +++ G ++D  +R  A   +  
Sbjct: 565 ESGGDGDAGPLMARPQEFTDRSTPSSLGVAAETL----ALLDGFRTDGRFRDVARRVVTT 620

Query: 723 FETRLKDMAMAVPLMCCAADMLSV 746
              R++   +    +  AAD++  
Sbjct: 621 HADRIRGGPLEHASLVRAADLVET 644


>gi|409730794|ref|ZP_11272353.1| hypothetical protein Hham1_16314 [Halococcus hamelinensis 100A6]
 gi|448723490|ref|ZP_21706008.1| hypothetical protein C447_10082 [Halococcus hamelinensis 100A6]
 gi|445787756|gb|EMA38495.1| hypothetical protein C447_10082 [Halococcus hamelinensis 100A6]
          Length = 719

 Score =  408 bits (1049), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 234/601 (38%), Positives = 331/601 (55%), Gaps = 44/601 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W ++A   AR+ DVPIFLSIGYS+CHWCHVM  ESFE
Sbjct: 7   NRLDNERSPYLRQHADNPVNWQPWDDDALEAAREHDVPIFLSIGYSSCHWCHVMADESFE 66

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA+ LN+ FV IKVDREERPD+D++Y T +  + G GGWPLSV+L+PD +P   GTY
Sbjct: 67  DERVAERLNEDFVPIKVDREERPDLDRLYQTVIGMVSGRGGWPLSVWLTPDGRPFYIGTY 126

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPPE K G+PGF  +L  + +AW+ +R+ +        +Q ++A++    +   P + P 
Sbjct: 127 FPPEAKRGQPGFLDLLDSITEAWETEREDIEGRA----DQWADAMTGELEATPEPGDPPG 182

Query: 283 NA-LRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           +  L   A    ++ D  +GG G   KFP+   +++++  + +++D      A E     
Sbjct: 183 SELLETAARSAVRNADREYGGSGRGQKFPQTGRLRLLMEAADRIDDEEFGTVARE----- 237

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
              L  MA GG+ DHVGGGFHRY+ D  W VPHFEKMLYD  +L   YLD + L  D  Y
Sbjct: 238 --ALDAMADGGLRDHVGGGFHRYTTDREWTVPHFEKMLYDNAELVRAYLDGYRLFGDERY 295

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH- 460
           + + R+ L ++ R++  P G  FS  DA S +  G   ++EGAFYVWT  EV D +G+  
Sbjct: 296 AEVARETLGFVERELTSPEGGFFSTLDAQSVDESG--EREEGAFYVWTPDEVHDAVGDDR 353

Query: 461 -AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
            A LF E Y +  +GN +            G  VL    D    A +    +E+    L 
Sbjct: 354 AAELFCERYGISESGNFE-----------NGTTVLTLAADVQGLADEYDTTVEEVEADLE 402

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
             R  +F  R++R RP  D+KV+  WNGL++++FA A   L                   
Sbjct: 403 RAREAVFAARAERSRPDRDEKVLAGWNGLMVAAFAEAGLALD-----------------P 445

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
            + E A +A  F+R  L++E+  RL   +++G  K  G+L+DYAFL  G L  YE     
Sbjct: 446 RFAETAVAALDFVREELWNEEEERLSRRYKDGEVKIDGYLEDYAFLARGALACYEATGDV 505

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
             L +A++L    +  F D E G  + T     S++ R +E  D + PS   V+V  L+ 
Sbjct: 506 HHLGFALDLARAIESEFWDPEEGTLYFTPSSGESLVARPQELDDQSTPSSTGVAVETLLA 565

Query: 700 L 700
           L
Sbjct: 566 L 566


>gi|240276138|gb|EER39650.1| DUF255 domain-containing protein [Ajellomyces capsulatus H143]
 gi|325089996|gb|EGC43306.1| DUF255 domain-containing protein [Ajellomyces capsulatus H88]
          Length = 766

 Score =  408 bits (1048), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 253/649 (38%), Positives = 344/649 (53%), Gaps = 73/649 (11%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRL    SPY+  H +NPV W  W  EA A A+K +  IFLSIGYS CHWCHVME ESF
Sbjct: 23  VNRLNQSKSPYVRGHMNNPVAWQMWDAEAIALAKKLNRMIFLSIGYSACHWCHVMEKESF 82

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
               VA +LN  F+ IK+DREERPD+D VYM YVQA  G GGWPL+VFL+PDL+P+ GGT
Sbjct: 83  MSPEVAAILNKAFIPIKLDREERPDIDDVYMNYVQATTGSGGWPLNVFLTPDLEPVFGGT 142

Query: 222 YFP-PEDKY-------GRPGFKTILRKVKDAWDKK--------RDMLAQSGAFAIEQLSE 265
           Y+P P           G+  F  IL K++D W  +        +D+  Q   FA E    
Sbjct: 143 YWPGPHSSASSTLGGEGQVTFIDILEKLRDVWQTQQLRCRESAKDITRQLQEFAEEGTYS 202

Query: 266 ALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK-- 323
             S + +  +   +L    L    +  +  YD   GGF  APKFP P  +  ++  S+  
Sbjct: 203 KQSGAGADGEE--DLEVELLEEAYKHFASRYDPVNGGFSRAPKFPTPANLSFLVNLSRFS 260

Query: 324 -KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 382
             + D     E +   +M + TL  +++GGIHDH+G GF RYSV   W +PHFEKMLYDQ
Sbjct: 261 NAVADIVGYEECAHALEMAIKTLISISRGGIHDHIGHGFARYSVTADWSLPHFEKMLYDQ 320

Query: 383 GQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAETEGATRKK 441
            QL  VY DAF    D        DI  Y+    ++ P     S+EDADS  T   T K+
Sbjct: 321 AQLLRVYTDAFDSAHDPELLGAMYDIAAYITSPPVLSPTSGFHSSEDADSLPTPSDTDKR 380

Query: 442 EGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 500
           EGAFYVWT KE + ILG+  A +   H+ + P GN +  R++DPH+EF  +NVL      
Sbjct: 381 EGAFYVWTHKEFKQILGQRDADVCARHWGVLPDGNVE--RVNDPHDEFINQNVLHIQTTP 438

Query: 501 SASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFARASKI 559
              A + G+  E+ + I+     KL + R SKR RP LDDK+IV+WNGL I + A+ S +
Sbjct: 439 GKLAKEFGLSEEEVVRIIKASTEKLREYRESKRVRPALDDKIIVAWNGLAIGALAKCSVV 498

Query: 560 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP-SKAPGF 618
           L +          V     +E+   AE+AA FIR+ L+D  + +L   +R       PGF
Sbjct: 499 LDN----------VDRIKAQEFRLAAENAAKFIRQSLFDPASGQLWRIYRGEERGDTPGF 548

Query: 619 LDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 678
            DDYA+LISGL+DLYE      +L +A +LQ+                            
Sbjct: 549 ADDYAYLISGLIDLYEATFDDSYLQFAEQLQH---------------------------- 580

Query: 679 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 727
                 + PS N V   NL+RL++++   + D YR+ A  +++ F   +
Sbjct: 581 -----ASTPSPNGVIARNLLRLSTLL---EDDTYRRLARDTVSAFAVEI 621


>gi|433424873|ref|ZP_20406585.1| thioredoxin domain containing protein [Haloferax sp. BAB2207]
 gi|432197957|gb|ELK54295.1| thioredoxin domain containing protein [Haloferax sp. BAB2207]
          Length = 703

 Score =  407 bits (1047), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 248/664 (37%), Positives = 348/664 (52%), Gaps = 74/664 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W E A   AR+ D PIFLSIGYS CHWCHVM  ESF 
Sbjct: 8   NRLDEEQSPYLRQHADNPVNWQPWDETALEAAREADKPIFLSIGYSACHWCHVMADESFS 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  +A++LN+ FV +KVDREERPD+D++Y T  Q + GGGGWPLSV+L+P+ KP   GTY
Sbjct: 68  DPDIAEVLNEEFVPVKVDREERPDLDRIYQTICQQVTGGGGWPLSVWLTPEGKPFFVGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP- 281
           FPPE + G PGF+ ++    ++W   RD +          +++ L  +  +   P E P 
Sbjct: 128 FPPEPRRGAPGFRDVVESFAESWRTDRDEIENRADQWTSAITDRLEETPDT---PGEAPG 184

Query: 282 QNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
            + L    +   +  D   GGFG   PKFP+P  I  +L            G A  G++ 
Sbjct: 185 SDILDTTVQAALRGADRDHGGFGGDGPKFPQPGRIDALL-----------RGYAVSGRRE 233

Query: 341 VL----FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
            L     +L  MA GG+ DH+GGGFHRY VD  W VPHFEKMLYDQ  LA+ YLDA  LT
Sbjct: 234 ALDVARQSLDAMANGGLRDHLGGGFHRYCVDREWTVPHFEKMLYDQAGLASRYLDAARLT 293

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
            +  Y+ +  +  +++RR++    G  F+  DA S         +EG FYVWT  +V D+
Sbjct: 294 GNDSYATVAAETFEFVRRELTHDDGGFFATLDAQSG-------GEEGTFYVWTPADVRDL 346

Query: 457 LGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS-ASKLGMPLEKY 514
           L E  A LF + Y + P GN            F+ K  ++ ++ ++A  A +  +   + 
Sbjct: 347 LPELDADLFCDRYGVTPGGN------------FEDKTTVLNVSATTADLADEYDLDESEV 394

Query: 515 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 574
            + L + R+ LF  R  R RP  D+KV+  WNGL+IS+FA+ S +L+ ++ +A       
Sbjct: 395 EDRLEKARKALFAAREGRERPARDEKVLAGWNGLMISAFAQGSVVLEDDSLAAD------ 448

Query: 575 GSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE 634
                     A  A  F+R  L+D++T  L     NG  K  G+L+DYAFL  G  DLY+
Sbjct: 449 ----------ARRALDFVRERLWDDETETLSRRVMNGEVKGDGYLEDYAFLARGAFDLYQ 498

Query: 635 FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
                  L +A++L       F D + G  + T     S++ R +E  D + PS   V+ 
Sbjct: 499 ATGDLAPLSFALDLARATRREFYDADAGTLYFTPESGESLVTRPQEPTDQSTPSSLGVAT 558

Query: 695 INLVRL------------ASIVAGSKSDYYRQNA-EH-SLAVFETRLKDMAMAVPLMCCA 740
              + L            A  V GS ++  R +  EH SLA+   +    A  VP +  A
Sbjct: 559 SLFLDLEQFAPDAGFGEVADAVLGSFANRVRGSPLEHVSLALAAEK---AASGVPELTVA 615

Query: 741 ADML 744
           AD +
Sbjct: 616 ADEI 619


>gi|404447779|ref|ZP_11012773.1| hypothetical protein A33Q_00490 [Indibacter alkaliphilus LW1]
 gi|403766365|gb|EJZ27237.1| hypothetical protein A33Q_00490 [Indibacter alkaliphilus LW1]
          Length = 674

 Score =  407 bits (1047), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 252/665 (37%), Positives = 356/665 (53%), Gaps = 73/665 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL    SPYLLQHA+NPVDWF WG+EA  ++++ D PI +SIGYS CHWCHVME ESFE
Sbjct: 2   NRLKDSQSPYLLQHANNPVDWFPWGDEALEKSKREDKPIIVSIGYSACHWCHVMEKESFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE  A+L+N +FV IK+DREERPD+D +YM  VQA+   GGWPL+VFL P+ KP  GGTY
Sbjct: 62  DEATAQLMNQYFVCIKIDREERPDLDNIYMDAVQAMGLQGGWPLNVFLMPNQKPFYGGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSG-AFAIE-QLSEALSASASSNKL---P 277
           FP         +K +L+ + +A+ +  D LA+S   F    Q SE L    S       P
Sbjct: 122 FP------NAQWKALLQNIGEAYQEHYDQLAKSAEEFGNSLQTSEFLKYGLSHGTFQLDP 175

Query: 278 DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEG 337
            EL + A++L   Q    +D  +GG    PKFP P     ++ ++       KS E    
Sbjct: 176 KELAE-AIKLLENQ----FDLDWGGMNRKPKFPMPAIWSFVMDYA-----LAKSDEVLLA 225

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
           +  V FTL+ +  GGI+DH+ GGF RYSVD  W  PHFEKMLYD GQL ++Y  A++++ 
Sbjct: 226 K--VFFTLKKIGMGGIYDHLRGGFARYSVDGEWFAPHFEKMLYDNGQLLDLYSKAYAVSG 283

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           + FY     + + +L+ +M+   G  ++A+DADS   EG     EG FY WT +E+E I+
Sbjct: 284 EYFYKEKILETIAWLKSEMLHKEGGFYAAQDADS---EGV----EGKFYTWTYEELESIV 336

Query: 458 GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
           GE    F + Y LK  GN +            G N+L +       A    +  E Y+  
Sbjct: 337 GEDLHWFAKLYNLKYQGNWE-----------DGVNILFQTESYEKLAESSELSEEGYIQR 385

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
           L E + KL  VR++R  P LDDK++  WNGL+IS    A   L  E              
Sbjct: 386 LNEIKAKLLSVRNQRIFPGLDDKILSGWNGLMISGLVSAYTSLGDE-------------- 431

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGS 637
             E +E++ + A+FI   +Y ++   L  S++NG +  P FL+DYA +I G + LY+   
Sbjct: 432 --EALELSLNNATFILDKMYKDKV--LYRSYKNGHAYTPAFLEDYAAVIRGFISLYQATL 487

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
            +KWL+ A EL +   E F D E G ++    +   ++   KE  D   P+ NS+   NL
Sbjct: 488 DSKWLLKAKELSDKVIEAFYDEEEGFFYFNNPQAEKLIANKKELFDNVIPASNSIMARNL 547

Query: 698 VRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA-----ADMLSVPSRKHV 752
           + L+        D Y   A++ L      +K + +  P   C       DML +P +  V
Sbjct: 548 LDLSMFFY---EDNYAAIAKNMLGT----MKKLIIKEPGFLCNWASLYLDML-LP-KAEV 598

Query: 753 VLVGH 757
            +VG 
Sbjct: 599 AIVGE 603


>gi|419820995|ref|ZP_14344599.1| hypothetical protein UY9_06334, partial [Bacillus atrophaeus C89]
 gi|388474906|gb|EIM11625.1| hypothetical protein UY9_06334, partial [Bacillus atrophaeus C89]
          Length = 645

 Score =  407 bits (1046), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 243/647 (37%), Positives = 358/647 (55%), Gaps = 64/647 (9%)

Query: 140 PIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALY 199
           P+ +SIGYSTCHWCHVM  ESFEDE +A+LLN+ FV+IKVDREERPDVD VYM   Q + 
Sbjct: 3   PVLVSIGYSTCHWCHVMAHESFEDEEIARLLNERFVAIKVDREERPDVDSVYMRICQLMT 62

Query: 200 GGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFA 259
           G GGWPL+VF++PD KP   GTYFP   K+ RPGF  +L  + + +   R+         
Sbjct: 63  GQGGWPLNVFITPDQKPFYAGTYFPKTSKFNRPGFIDVLEHLSNTFANDREH-------- 114

Query: 260 IEQLSEALSASASSNKLPD---ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQ 316
           +E+++E  S S    K P+    L + AL    +QL   +D+ +GGFG APKFP P    
Sbjct: 115 VEEIAENAS-SHLQIKTPEGNGTLTKEALHRTFQQLMSGFDTVYGGFGQAPKFPMP---H 170

Query: 317 MMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFE 376
           M++Y  +  + TG+        K    TL  MA GGI+DHVG GF RYS D+ W VPHFE
Sbjct: 171 MLMYLLRYHQYTGQENALYNVTK----TLDSMANGGIYDHVGYGFARYSTDDEWLVPHFE 226

Query: 377 KMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEG 436
           KMLYD   L   Y +A+ +T+D  Y +I   I+ +++R+M    G  +SA DAD   TEG
Sbjct: 227 KMLYDNALLLTAYTEAYQVTQDSRYQHIVEQIITFIQREMTHEDGSFYSALDAD---TEG 283

Query: 437 ATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV-- 493
                EG +YVW+  E+ + LG E   L+   Y +  +GN            F+G N+  
Sbjct: 284 V----EGKYYVWSKDEIIETLGDELGELYCAIYNITSSGN------------FEGHNIPN 327

Query: 494 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSF 553
           LI        A +  +  ++    LGE R+KL   R  R  PH+DDKV+ SWN L+I+  
Sbjct: 328 LIHTKLDKVKA-EFDLNEQEINKQLGEARQKLLKKRETRTYPHVDDKVLTSWNALMIAGL 386

Query: 554 ARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPS 613
           A+A+K+ ++                 EY+ +A++AA+FI + L  +   R+   +R+G  
Sbjct: 387 AKAAKVFQA----------------PEYLNMAQAAAAFIEKKLIIDG--RVMVRYRDGEV 428

Query: 614 KAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPS 673
           K  GF+DDYAFL+   ++LYE G    +L  A +L     +LF D++ GG++ T  +  +
Sbjct: 429 KNKGFIDDYAFLLWAYIELYEAGYDLAYLQKAKDLSAKMLDLFWDQKHGGFYFTGHDAEA 488

Query: 674 VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA 733
           +L+R KE +DGA PSGNSV+ + L+RL  +  G  S    + AE   + F+  ++     
Sbjct: 489 LLVREKEVYDGAVPSGNSVAAVQLLRLGQLT-GELS--LIEKAEKMFSAFKRDVEAYPSG 545

Query: 734 VPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
                 +     +P +K +V+ G K     +++++A   ++  N +V
Sbjct: 546 HSFFMQSVLTHMMP-KKEIVIFGRKDDSQRQHIISALQQAFQPNFSV 591


>gi|312143535|ref|YP_003994981.1| glutamate--cysteine ligase [Halanaerobium hydrogeniformans]
 gi|311904186|gb|ADQ14627.1| putative glutamate--cysteine ligase/putative amino acid ligase
           [Halanaerobium hydrogeniformans]
          Length = 647

 Score =  407 bits (1045), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 223/625 (35%), Positives = 343/625 (54%), Gaps = 68/625 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N+L  E+SPYL QHA NPV+W+ WGEEAF  A+ +++PIFLSIGYSTCHWCHVME ESFE
Sbjct: 5   NKLKDENSPYLKQHADNPVNWYPWGEEAFKLAKDKNLPIFLSIGYSTCHWCHVMEKESFE 64

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA++LN +F+SIKVDREERP++D +YM   Q + G GGWPLS+F++ D KP    TY
Sbjct: 65  DEEVAQMLNQFFISIKVDREERPEIDSLYMDVCQTMTGSGGWPLSIFMTADKKPFYAATY 124

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
            P E+KYGR G  TIL ++   W ++R  L Q+    +  LS+      +      EL  
Sbjct: 125 IPKENKYGRKGLLTILPEIHYLWTEERKKLLQASENIVSHLSKINQNQKA------ELAS 178

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
           N      E +  +YD ++GGFGS+PKFP    +  +L++ KK   TG+    S    ++ 
Sbjct: 179 NIFEKTVEAIESNYDHQYGGFGSSPKFPMYQYLLFLLHYWKK---TGEDKYLS----ILE 231

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TLQ M  GGI+D +  GFHRYS D  W +PHFEKMLYDQ  +  +Y  A+  T    Y+
Sbjct: 232 TTLQQMRAGGIYDQLAFGFHRYSTDREWKMPHFEKMLYDQALMIYIYTAAYQATAKEIYA 291

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            + ++I+ +L  +M+   G  F+A DADS         +EG +Y+W   E++ IL E   
Sbjct: 292 DVVKEIVSFLESEMLAKEGAFFTAIDADSG-------GEEGKYYLWEKSELKSILNE--- 341

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
                           +R++   +    KN+ + L +           ++ Y N L E +
Sbjct: 342 -------------AQFNRLNKIFDIQANKNINLSLKN-----------VQDY-NQLAELK 376

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYM 582
            KL   R +R  P  D K++  WNGL+I++ A+A  +LK               DR  Y+
Sbjct: 377 DKLLKHRKERIHPSKDKKILTDWNGLLIAALAKAGFVLK--------------EDR--YL 420

Query: 583 EVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWL 642
           ++A+    FI  ++   +  RL HS+  G       L+DY+FL+ GL++LY+     ++L
Sbjct: 421 KLADDVEKFIHNNMKTNKG-RLAHSYYEGEKSKIDNLNDYSFLLWGLIELYQATLKDEYL 479

Query: 643 VWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLAS 702
           + A +      E F D++   ++ +  ++  + ++    +D + PS NS++  N ++LA 
Sbjct: 480 IKAEKTAKIMKEYFWDQKEEAFYFSAKDNEDLFIKQINANDHSLPSANSIAAFNFLKLAH 539

Query: 703 IVAGSKSDYYRQNAEHSLAVFETRL 727
           +        Y+++A+  +A F  ++
Sbjct: 540 LKDNLA---YQKDAQKIIAAFSDQI 561


>gi|448435859|ref|ZP_21586927.1| hypothetical protein C472_11724 [Halorubrum tebenquichense DSM
           14210]
 gi|445683294|gb|ELZ35694.1| hypothetical protein C472_11724 [Halorubrum tebenquichense DSM
           14210]
          Length = 739

 Score =  407 bits (1045), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 252/683 (36%), Positives = 352/683 (51%), Gaps = 75/683 (10%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           S+    NRL  E SPYL QHA NPV+W  WG+EAF  AR+ DVP+F+SIGYS+CHWCHVM
Sbjct: 2   SQPTERNRLDGEASPYLRQHADNPVNWQPWGDEAFERAREHDVPVFVSIGYSSCHWCHVM 61

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
             ESFEDE VA ++ND FV IKVDREERPDVD  +MT  Q + GGGGWPLS + +P+ KP
Sbjct: 62  AEESFEDESVAGVINDSFVPIKVDREERPDVDSTFMTVCQLVTGGGGWPLSAWCTPEGKP 121

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAW---DKKRDMLAQSGAFAIEQLSEALSASASS 273
              GTYFPPE +  +PGF+ +  ++ D+W   +++ +M  ++  +A     E  S     
Sbjct: 122 FYVGTYFPPEARQNQPGFRDLCERIADSWSDPEQREEMKRRADQWAESARDELESVPTPD 181

Query: 274 NKLPDELPQ------NALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMML-YHSKKL 325
              PD          + L   A    +SYD  +GGFGS   KFP P  I +++  +++  
Sbjct: 182 APGPDGEGDASPPGGDLLESAAASALRSYDDEYGGFGSGGAKFPMPGRIDLLMRAYARSG 241

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
            D   S  A         TL  M++GG++D +GGGFHRY+VD  W VPHFEKMLYD  +L
Sbjct: 242 RDALLSAAAG--------TLDGMSRGGMYDQIGGGFHRYAVDREWTVPHFEKMLYDNAEL 293

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK---- 441
              YLD + L  D  Y+ +  + L +L R++    G  FS  DA S   E  +R+     
Sbjct: 294 PMAYLDGYRLAGDPAYARVASESLAFLDRELRHDDGGFFSTLDARSRPPE--SRRDDDGH 351

Query: 442 -----EGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI 495
                EGAFYVWT +EV+ +L E A  L  E Y ++  GN +           +G  V  
Sbjct: 352 EAGDVEGAFYVWTPEEVDAVLDEPAASLAAERYGIRSGGNFE-----------RGTTVPT 400

Query: 496 ELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR 555
                   A+   +  E     L E R  LFD R  RPRP  D+KV+ SWNG  IS+FA 
Sbjct: 401 TAASVEELAADRDLSPEAVRQALTEARTALFDARESRPRPARDEKVLASWNGRAISAFAD 460

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY--DEQTHRLQHSFRNGPS 613
           A+  L                  + Y ++A  A  F R  LY  D +T  L   + +G  
Sbjct: 461 AAGTLG-----------------EPYADIAREALGFCRDRLYDADAETGALARRWLDGDV 503

Query: 614 KAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT------ 667
           + PG+LDDYAFL  G LD Y      + L +A+EL     + F D + G  + T      
Sbjct: 504 RGPGYLDDYAFLARGALDTYAATGDLEPLGFALELAEALVDEFYDADDGTIYFTRDPEGD 563

Query: 668 ---TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSD-YYRQNAEHSLAVF 723
              T +   ++ R +E  D + PS   V+   L    +++ G ++D  +R+ A   +   
Sbjct: 564 GGQTDDAGPLIARPQEFTDRSTPSSLGVAAETL----ALLDGFRTDGRFREIARRVVTTH 619

Query: 724 ETRLKDMAMAVPLMCCAADMLSV 746
             R++   +A   +  AAD++  
Sbjct: 620 ADRIRGGPLAHASLVRAADLVET 642


>gi|257051594|ref|YP_003129427.1| hypothetical protein Huta_0507 [Halorhabdus utahensis DSM 12940]
 gi|256690357|gb|ACV10694.1| protein of unknown function DUF255 [Halorhabdus utahensis DSM
           12940]
          Length = 717

 Score =  407 bits (1045), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 234/614 (38%), Positives = 329/614 (53%), Gaps = 48/614 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLAAE SPYL  HA NPV W  W E A + A   D PIFLSIGY+ CHWCHVM  ESFE
Sbjct: 8   NRLAAEGSPYLQAHADNPVHWQPWDETALSTAEDEDKPIFLSIGYAACHWCHVMAEESFE 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE  A +LN+ FV IKVDREERPDVD++Y T  Q L   GGWPLSV+L+PD +P   GTY
Sbjct: 68  DEATAAVLNENFVPIKVDREERPDVDRIYQTLAQLLGQQGGWPLSVWLTPDGRPFYVGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           F P+ + GRPGF  +L  +K+ W+  RD + Q      + +S  L  + +     D    
Sbjct: 128 FAPDSRGGRPGFADLLEDLKETWENDRDGIEQRADQWADAISGELEGTPTPADPSDVRSD 187

Query: 283 NALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMML-----YHSKKLEDTGKSGEASE 336
             LR  A+   ++ D   GGFGS  PKFP+P  +Q++L     + S++  D G   +  E
Sbjct: 188 ELLRAGADAAVRTADREQGGFGSGGPKFPQPGRLQLLLRADARFGSERSAD-GDGADPGE 246

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
            + ++  +L  M  GG++DHVGGGFHRY+ D  W VPHFEKMLYD  ++    ++ + +T
Sbjct: 247 YRAVLTESLDAMVDGGLYDHVGGGFHRYATDRSWTVPHFEKMLYDNAEIPRALIEGYRVT 306

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
            D  Y+ +  +  ++L R++  P G  +S  DA S   EG    +EG FYVWT +EV   
Sbjct: 307 GDERYARVAGETFEFLDRELGHPEGGFYSTLDARS---EG----EEGKFYVWTPEEVRAA 359

Query: 457 LGEHA--ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 514
           +G+     L  + Y +   GN +            G+ VL         A++ G+ ++  
Sbjct: 360 VGDETDVSLVLDRYGITEDGNFE-----------DGQTVLTIAASVDELAAQSGLEVDDV 408

Query: 515 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 574
            + L   R +LFD RS+R RP  D+K++  WNGL IS+ A  S  L+             
Sbjct: 409 QDRLDRAREQLFDARSERTRPPRDEKILAGWNGLAISALAEGSLALED------------ 456

Query: 575 GSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE 634
                + ++ A  A  F+R  L+DE +  L+  F +G  +  G+L+DYAFL  G LD Y+
Sbjct: 457 -----DILDRAVDALEFVRETLWDEDSGLLKRRFIDGDVRVEGYLEDYAFLARGALDCYQ 511

Query: 635 FGSGTKWLVWAIELQNTQDELFLDREGGGYFNT--TGEDPS--VLLRVKEDHDGAEPSGN 690
                  L +A++L    +  F D + G  + T   G D    +L R +E  D + PS  
Sbjct: 512 ASGDPDQLAFALDLAEEIESRFFDEDAGTLYFTEEAGSDAGTDLLARPQELTDRSTPSSA 571

Query: 691 SVSVINLVRLASIV 704
            V+V  LV L   V
Sbjct: 572 GVAVDVLVTLDEFV 585


>gi|303245350|ref|ZP_07331634.1| protein of unknown function DUF255 [Desulfovibrio fructosovorans
           JJ]
 gi|302493199|gb|EFL53061.1| protein of unknown function DUF255 [Desulfovibrio fructosovorans
           JJ]
          Length = 702

 Score =  406 bits (1044), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 259/665 (38%), Positives = 342/665 (51%), Gaps = 43/665 (6%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           ++  NRL  E SPYL QHAHNPVDW+ WGEEAFA A+  D PIFLSIGYSTCHWCHVME 
Sbjct: 2   SRKANRLINEKSPYLQQHAHNPVDWYPWGEEAFALAKAEDKPIFLSIGYSTCHWCHVMER 61

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE +A L+    V+IKVDREERPD+D +YMT+ QAL G GGWPL+VFL+PD +P  
Sbjct: 62  ESFEDEDIAALMRAIVVAIKVDREERPDLDTLYMTFCQALTGRGGWPLNVFLTPDGEPFF 121

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFP E  +GR G + +L++V  AW   R  +  + A  +  + + ++A   +     
Sbjct: 122 AGTYFPKESGFGRTGMRELLQRVHMAWKSNRQAVIGNAAQLLGAVRDQITARDGTGAA-- 179

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
           E     L     +L+ S+D   GGFGSAPKFP P     +L   ++   TG      +  
Sbjct: 180 EPGTVELEAATGELAASFDVENGGFGSAPKFPAP---HNLLLLLREYRRTGN----KDLL 232

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
            MV  TL  M +GG++DHVG GFHRYS D  W VPHFEKMLYDQ       ++A+  T +
Sbjct: 233 AMVTATLSAMRRGGVYDHVGFGFHRYSTDAGWLVPHFEKMLYDQALCVMACVEAWQATGE 292

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL- 457
           V+      + L+Y+RRD+  P G  +SAEDADS   EG     EG FYVWT  E+ + L 
Sbjct: 293 VWLKDTALEALEYVRRDLTSPDGVFYSAEDADS---EGV----EGKFYVWTEAEIREALP 345

Query: 458 GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
            E A L  + Y ++ TGN       +      G N+L        +A+  G  +      
Sbjct: 346 PEDAQLVVDVYGVEATGNF----RDEATGVATGTNILHLPRSLEDAAAGRGTSVAALAAR 401

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
           L  CR  L  VR KR RP  DDKV+   NG           +      +  FN      D
Sbjct: 402 LETCRAALLAVREKRARPLCDDKVLTDNNG---------LMLAALAKAARAFN------D 446

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGS 637
                    +A   + +    E   RL H  R G +   G LDDYAF   GL++LY+   
Sbjct: 447 EALAARAVAAADFLLEKMALPED--RLLHRLRQGEAAVAGMLDDYAFFAWGLVELYQTVF 504

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
             ++L  A  L       F D   GG+F +  +  S+LLR K  +D A PSGNSV+   L
Sbjct: 505 APRYLERAAALAKAMIAHFGD-GAGGFFLSPDDGESLLLRQKTFYDAAVPSGNSVAFFVL 563

Query: 698 VRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGH 757
             L  +  G KS  +R+ A         R+ +         C+   +  P+   V L G 
Sbjct: 564 TTLFRLT-GEKS--FREEAAKLAKAAGGRVAEHPSGYAFFLCSLSQMLAPA-AEVTLAGD 619

Query: 758 KSSVD 762
             + D
Sbjct: 620 PDAAD 624


>gi|448585374|ref|ZP_21647767.1| thioredoxin domain containing protein [Haloferax gibbonsii ATCC
           33959]
 gi|445726074|gb|ELZ77691.1| thioredoxin domain containing protein [Haloferax gibbonsii ATCC
           33959]
          Length = 709

 Score =  406 bits (1044), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 249/659 (37%), Positives = 350/659 (53%), Gaps = 68/659 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W E A   AR+ D PIFLS+GYS CHWCHVM  ESF 
Sbjct: 8   NRLDEEQSPYLRQHADNPVNWQPWDETALDAAREADKPIFLSVGYSACHWCHVMADESFS 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  +A++LN+ FV +KVDREERPD+D++Y T  Q + GGGGWPLSV+L+P+ KP   GTY
Sbjct: 68  DPDIAEVLNEHFVPVKVDREERPDLDRIYQTICQLVTGGGGWPLSVWLTPEGKPFFVGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS-ASSNKLPDELP 281
           FPPE + G PGF+ ++    ++W   RD +        EQ + A++     +  +P E P
Sbjct: 128 FPPEPRRGAPGFRDLVESFAESWRTDRDEIENRA----EQWTSAITDRLEETPDVPGEAP 183

Query: 282 -QNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
             + L    +   +  D   GGFG   PKFP+P  I  +L   +    TG+     E   
Sbjct: 184 GSDVLDSTVQAALRGADRDHGGFGGDGPKFPQPGRIDALL---RGYAVTGR----REALD 236

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           +   +L  MA GG+ DH+GGGFHRY VD  W VPHFEKMLYDQ  LA+ YLDA  LT + 
Sbjct: 237 VARQSLDAMANGGLRDHLGGGFHRYCVDREWTVPHFEKMLYDQAGLASRYLDAARLTGNE 296

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 459
            Y+ +  +  +++RR++    G  F+  DA S         +EG FYVWT  +V D+L E
Sbjct: 297 SYATVAAETFEFVRRELTHDDGGFFATLDAQSG-------GEEGTFYVWTPDDVRDLLPE 349

Query: 460 -HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS-ASKLGMPLEKYLNI 517
             A LF + Y + P GN            F+ K  ++ ++ ++A  A +  +   +  + 
Sbjct: 350 LDADLFCDRYGVTPGGN------------FERKTTVLNVSATTAELAEEYELDESEVEDR 397

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
           L + R+ LF  R  R RP  D+KV+  WNGL+IS+FA+ S +L+ ++         + SD
Sbjct: 398 LEKARKALFAAREGRERPARDEKVLAGWNGLMISAFAQGSVVLEDDS---------LASD 448

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGS 637
                  A  A  F+R  L+D++T  L     NG  K  G+L+DYAFL  G  DLY+   
Sbjct: 449 -------ARRALDFVRERLWDDETETLSRRVMNGEVKGDGYLEDYAFLARGAFDLYQATG 501

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
               L +A++L       F D + G  + T     S++ R +E  D + PS   V+    
Sbjct: 502 DLAPLSFALDLARATRREFYDADAGTLYFTPESGESLVTRPQEPTDQSTPSSLGVATSLF 561

Query: 698 VRL------------ASIVAGSKSDYYRQNA-EH-SLAVFETRLKDMAMAVPLMCCAAD 742
           + L            A  V GS ++  R +  EH SLA+   +    A  VP +  AAD
Sbjct: 562 LDLEQFAPDADFGGVADAVLGSFANRVRGSPLEHVSLALAAEK---AASGVPELTIAAD 617


>gi|418053652|ref|ZP_12691708.1| protein of unknown function DUF255 [Hyphomicrobium denitrificans
           1NES1]
 gi|353211277|gb|EHB76677.1| protein of unknown function DUF255 [Hyphomicrobium denitrificans
           1NES1]
          Length = 677

 Score =  406 bits (1044), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 238/643 (37%), Positives = 349/643 (54%), Gaps = 72/643 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYLLQH  NPV W+AWG EA AEA++   PI LS+GY+ CHWCHVM  ESFE
Sbjct: 4   NRLQYETSPYLLQHKDNPVHWWAWGPEALAEAKRTGKPILLSVGYAACHWCHVMAHESFE 63

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D G A+++N+ FV+IKVDREERPD+D +YM  +  L   GGWPL++FL  D KP  GGTY
Sbjct: 64  DSGTAEVMNELFVNIKVDREERPDIDAIYMGALHRLGEQGGWPLTMFLDSDAKPFWGGTY 123

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP E +YGRP F T+L ++ +A+  + D         I + +EAL A+   +  P+E   
Sbjct: 124 FPREARYGRPAFVTVLLRIAEAYQNQPDN--------IRKNTEALLAALKES--PNETSA 173

Query: 283 NALRLCAEQ----LSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
           +A R   +     ++++ D   GG   APKFP+     ++   + + +D          Q
Sbjct: 174 DASRPMTKDVVAAIARAVDREHGGLSGAPKFPQWSVFWLLWRGAIRYDD-------PNAQ 226

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
           + V+ TL+ + +GGI+DH+GGGF RYSVDE W VPHFEKMLYD   L ++  + +  T+D
Sbjct: 227 EAVVTTLRHICQGGIYDHLGGGFARYSVDEFWLVPHFEKMLYDNALLIDLLTEVWRETQD 286

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             +     + + +L+R+MIG  G   ++ DADS   EG    +EG FYVW++ E+ED+LG
Sbjct: 287 PIFKTRIAETVTWLKREMIGEAGGFAASLDADS---EG----EEGKFYVWSAAEIEDVLG 339

Query: 459 -EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
            E A  F   Y + P GN            F+G  +L  LN        L +   +    
Sbjct: 340 AEDAAFFSRVYGVTPEGN------------FEGHTILNRLN-------SLALLTNEEEAH 380

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
           L + R KL + R+ R RP  DDK++  WNGL+I++ +RA+ + +                
Sbjct: 381 LAKLRAKLLERRASRIRPGWDDKILADWNGLMIAALSRAAVVFEC--------------- 425

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGS 637
             +++ +AE A   I   L      RL H++R G +KAP    DYA + S  L L+    
Sbjct: 426 -SDWLALAERAFDCIVTKLAAPDG-RLFHAYRKGLAKAPAIASDYANMTSAALRLFAATG 483

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
             ++L  A +     D+ + D + GGYF    +   V++R+K   D A PS N++ + NL
Sbjct: 484 SERYLEHARQWTRILDKHYWDVQRGGYFTAADDTGDVVVRLKVASDDAAPSANAIQLSNL 543

Query: 698 VRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 740
           + LA++          Q+ E +  + E     MA+  P+  CA
Sbjct: 544 IALAAVTGDV------QHHERARQLLEAFAPAMALG-PIGHCA 579


>gi|448729708|ref|ZP_21712022.1| hypothetical protein C449_08002 [Halococcus saccharolyticus DSM
           5350]
 gi|445794670|gb|EMA45214.1| hypothetical protein C449_08002 [Halococcus saccharolyticus DSM
           5350]
          Length = 721

 Score =  406 bits (1044), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 240/601 (39%), Positives = 331/601 (55%), Gaps = 43/601 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W ++A A AR+RDVPIFLSIGYS CHWCHVME ESFE
Sbjct: 7   NRLEEEGSPYLRQHADNPVNWQPWDDDALAAARERDVPIFLSIGYSACHWCHVMEDESFE 66

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA+ LND FV IKVDREERPD+D++Y T    + G GGWPLSV+L+PD +P   GTY
Sbjct: 67  DEAVAERLNDDFVPIKVDREERPDLDRLYQTICGMVSGQGGWPLSVWLTPDGRPFYVGTY 126

Query: 223 FPPEDKYGRPGFKTILRKVKDAW-DKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           FP + K G+PGF  +L  + ++W D + D+  ++  +A     E     A+  +  D   
Sbjct: 127 FPRDAKRGQPGFLDLLDSIAESWEDDREDVEGRADQWAGAMAGE---LEATPEQPGDPPG 183

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
            + L   A+Q  +S D  +GGFG   KFP+   + +++   +  E TG++       ++ 
Sbjct: 184 SDLLETAAQQAVESADREYGGFGRGQKFPQTGRLHLLM---RAAERTGRAV----FDEVA 236

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             TL  MA GG+ DHVGGGFHRY+ D  W VPHFEKMLYD  +L   YL  +  T+   Y
Sbjct: 237 RETLDAMADGGLRDHVGGGFHRYTTDREWTVPHFEKMLYDNAELVRAYLAGYRRTEAERY 296

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH- 460
           + + R+ L ++ R++  P G  FS  DA S +  G    +EGAFYVWT  EV D + +  
Sbjct: 297 AEVARETLGFVERELHHPDGGFFSTLDAQSEDESG--EHEEGAFYVWTPDEVHDAVDDEF 354

Query: 461 -AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
            A LF E Y +  TGN +            G  VL    D    A +     E+    L 
Sbjct: 355 AADLFCERYGVTETGNFE-----------DGTTVLTLSADIEDLADEHDTTAEEIEAELE 403

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
             R  +F  R++R RP  D+K++  WNGL+IS+FA A   L +                 
Sbjct: 404 RARETVFAARAERARPARDEKILAGWNGLMISAFAEAGLTLDA----------------- 446

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
            + + A +A  FIR HL+D++  RLQ  +++   K  G+L+DYAFL  G L+ YE     
Sbjct: 447 RFADTAVTALDFIREHLWDDEEKRLQRRYKDEDVKIDGYLEDYAFLARGALNCYEATGDV 506

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
             L +A++L  T +  F D E    + T     S++ R +E  D + PS   V+V  L+ 
Sbjct: 507 DHLAFALDLARTIETEFWDSEEETLYFTPQTGESLVARPQELDDQSTPSSTGVAVDVLLA 566

Query: 700 L 700
           L
Sbjct: 567 L 567


>gi|162450797|ref|YP_001613164.1| hypothetical protein sce2525 [Sorangium cellulosum So ce56]
 gi|161161379|emb|CAN92684.1| hypothetical protein sce2525 [Sorangium cellulosum So ce56]
          Length = 716

 Score =  406 bits (1044), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 262/709 (36%), Positives = 365/709 (51%), Gaps = 78/709 (11%)

Query: 101 HTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVES 160
           H NRLA+E SPYLLQHAHNPV W+ WG EA   AR+ D PI LSIGY+ CHWCHVME ES
Sbjct: 4   HKNRLASESSPYLLQHAHNPVAWYPWGAEALDLARREDKPILLSIGYAACHWCHVMERES 63

Query: 161 FEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGG 220
           FEDE +A+ +ND FV+IKVDREERPD+D +Y   VQ +   GGWPL+VFL+PD +P   G
Sbjct: 64  FEDEAIARHMNDLFVNIKVDREERPDLDHIYQLVVQLMGRSGGWPLTVFLTPDQRPFFAG 123

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFPP+D  G PGF  +L K+ DA+  +RD + Q      E +  A  A A +  +    
Sbjct: 124 TYFPPKDALGMPGFPKVLDKIADAFRNRRDDVEQQAQEITEAIERAQRAPARAAGVAAPA 183

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
             + LR  + QL    D R GG GS PKFP  + + ++L       D      A+EG   
Sbjct: 184 SSDLLRRASRQLLARLDPRHGGIGSRPKFPNTMALDVLLRRGVLESDR----VAAEG--- 236

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
           V  TL  M  GGI DH+ GGFHRYS DERW VPHFEKMLYD   L  +Y D F   K   
Sbjct: 237 VELTLDRMRDGGIWDHLRGGFHRYSTDERWLVPHFEKMLYDNALLLRLYADGFRAFKKPI 296

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
           Y+   R+I+ YL  +M  P G  ++++DADS   EG    +EG F+VWT +++ D +GE 
Sbjct: 297 YAETAREIVGYLFAEMRDPEGGFYASQDADS---EG----REGKFFVWTLEQLRDAVGED 349

Query: 461 AILFKEHYYLKPTGNCDLSRM----SDPHN-EFKGKNVLIELNDSSASASKL-----GMP 510
            + +            D++R+    S+  N E  G  VL +      +A+ +     G P
Sbjct: 350 QLAY------------DMARLVFGISEEGNFEDSGATVLSQHRTLEQAAAVIDDGAGGGP 397

Query: 511 ---LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 567
              L++  + L   R  +   R  RPRP  DDKV+ SWNGL+I + A A + L       
Sbjct: 398 STHLDRCRDALARARVAMLAARDARPRPARDDKVLASWNGLLIGALADAGRAL------- 450

Query: 568 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG-PSKA----------- 615
                    D   +++ A  A + + R L   +  R+    ++G P+ A           
Sbjct: 451 ---------DEPAWVDAAARAFALLERKLL--RGGRVGRYLKDGAPAGANREHGGSGAAV 499

Query: 616 ----PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGED 671
               PGFLDD A+L +  LDLYE  S  +++  A  + +       D    G+F T  + 
Sbjct: 500 GDVRPGFLDDQAYLGNAALDLYEATSDPRYVDVARAIADAMIAHHWDEAAPGFFFTPDDG 559

Query: 672 PSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMA 731
            +++ R ++ +D A PS  S++ +  +RL+ I      + Y   AE  L V      + A
Sbjct: 560 DALIARTQDIYDQAAPSAASMAALLCLRLSEIA----DERYLSPAERQLDVLAPTALENA 615

Query: 732 MAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
             +    C  D L+  +   VV+VG   S     +   A   Y  N+ +
Sbjct: 616 FGLGQTVCVLDRLTRGA-VTVVVVGEAGSASAAELTREAFKVYLPNRAI 663


>gi|448666501|ref|ZP_21685146.1| thioredoxin domain-containing protein [Haloarcula amylolytica JCM
           13557]
 gi|445771632|gb|EMA22688.1| thioredoxin domain-containing protein [Haloarcula amylolytica JCM
           13557]
          Length = 717

 Score =  406 bits (1044), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 238/663 (35%), Positives = 355/663 (53%), Gaps = 49/663 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL    SPYL QHA NPV+W  W E A   A++R VPIFLSIGY+ CHWCHVME ESFE
Sbjct: 11  NRLDEAESPYLRQHADNPVNWQPWDETALEAAKERGVPIFLSIGYAACHWCHVMEEESFE 70

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +E +A+ LN+ FV IKVDREERPD+D VYM+  Q + GGGGWPLS +L+P+ +P   GTY
Sbjct: 71  NEAIAEQLNENFVPIKVDREERPDLDSVYMSICQQVTGGGGWPLSAWLTPEGEPFYVGTY 130

Query: 223 FPPEDKYGRPGFKTILRKVKDAW--DKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           FPPE+K G+PGF  +L+++ D+W   ++R+ +        E +   L A+ ++   P++ 
Sbjct: 131 FPPEEKRGQPGFGDLLQRLADSWADPEQREEMENRARQWTEAIESDLEATPAN---PEDP 187

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
            ++ ++       +  D + GG+GS  PKFP+   +  +L   +   D G+    +    
Sbjct: 188 AEDIIQTAGTIAHRGADRQDGGWGSGGPKFPQNGRLHALL---RAYSDGGQQDHLN---- 240

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           +V  TL  MA  G++DHVGGGFHRY+ D++W VPHFEKMLYD  ++   +L  +      
Sbjct: 241 VVQETLDVMADRGLYDHVGGGFHRYATDQQWAVPHFEKMLYDNAEIPRAFLAGYQAIGSE 300

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT-RKKEGAFYVWTSKEVEDILG 458
            Y+ + R+  ++++R++  P G  FS  DA+S   E      +EG FYVWT ++V D + 
Sbjct: 301 RYASVVRETFEFVQRELQHPDGGFFSTLDAESIPPEDPDGDSEEGLFYVWTPEQVHDAVD 360

Query: 459 EH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
           +   A +F           CD   +++P N F+G  VL      S  A +     ++   
Sbjct: 361 DETDADIF-----------CDYYGVTEPGN-FEGATVLAVRKPVSVLAEEYERSEDEITA 408

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
            L     + F+ R +RPRP  D+K++  WNGL+I + A  + +L                
Sbjct: 409 GLQRALNETFEARKERPRPARDEKILAGWNGLMIRALAEGAIVLDD-------------- 454

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
              EY +VA  A SF+R HL+DE   RL   +++G     G+L+DYAFL  G L L+E  
Sbjct: 455 ---EYADVAADALSFVREHLWDETEQRLNRRYKDGDVAIDGYLEDYAFLGRGALTLFEAT 511

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
                L +A++L     E F D + G  F T     S++ R +E  D + PS   V+V  
Sbjct: 512 GDVDHLAFAMDLGQAITEAFWDDDEGTLFFTPTGGESLVARPQELTDQSTPSSTGVAVDL 571

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVG 756
           L+ L+     S  D + + AE  L     R+    +    +  A D     + + + LVG
Sbjct: 572 LLSLSHF---SDDDRFEEVAERVLRTHADRVSSNPLQHASLTLATDTYEQGALE-LTLVG 627

Query: 757 HKS 759
            +S
Sbjct: 628 DQS 630


>gi|431930442|ref|YP_007243488.1| thioredoxin domain-containing protein [Thioflavicoccus mobilis
           8321]
 gi|431828745|gb|AGA89858.1| thioredoxin domain protein [Thioflavicoccus mobilis 8321]
          Length = 683

 Score =  406 bits (1043), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 241/623 (38%), Positives = 341/623 (54%), Gaps = 49/623 (7%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRLAA  SPYL QHA NPVDW+ W + A AEAR +D PI LSIGYS CHWCHVM  ESF
Sbjct: 8   ANRLAATASPYLRQHARNPVDWWPWCDAALAEARAQDRPILLSIGYSACHWCHVMAHESF 67

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPLSVFLSPD-LKPLMG 219
           ED   A L+N  FV+IKVDREERPD+D++Y T  Q L    GGWPL+VFL+P+ L+P   
Sbjct: 68  EDPATAALMNRLFVNIKVDREERPDLDRIYQTAHQLLSSRAGGWPLTVFLTPETLEPFFC 127

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           GTYFP E ++G P F+ +L  V+ A+ ++R+ + +     +  L+E    +  +  +PD 
Sbjct: 128 GTYFPREPRHGLPAFRQLLEGVERAFREQREAIREQSQGLMAALAE---LAPRAGAIPDS 184

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
            P    R    QL+ S+D+  GGFG APKFPR  +++++L H    +  G+    +    
Sbjct: 185 APLEGAR---RQLAASFDAARGGFGGAPKFPRVPDLELLLRHWAATDAAGQPD--ARALA 239

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           MV FTL+ M  GGI+D VGGGF+RYSVD+ W +PHFEKMLYD  QL  +  DA+  T + 
Sbjct: 240 MVTFTLERMIAGGINDQVGGGFYRYSVDDAWMIPHFEKMLYDNAQLLALCCDAWQATSEP 299

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 459
            +        D++  +M    G  +SA DADS   EG    +EG +YVWT +E+E  L  
Sbjct: 300 VFRAAAEATADWVIGEMQSDEGGYYSALDADS---EG----QEGRYYVWTREELEGTLAP 352

Query: 460 HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
                    Y           +  P N F+G+  L      +  A +LG+ + +   ++ 
Sbjct: 353 EEFAAFAARY----------GLDGPAN-FEGRWHLHAQAMPAEVAGRLGLTVAQVEGLID 401

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
             RRKL +VR  R RP  D+KV+ +WN L+I   ARA+++L                 R 
Sbjct: 402 GARRKLLEVRRARVRPACDEKVLTAWNALMIKGMARAARVLA----------------RP 445

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
           +Y+  AE A   +R  L+  +  RL  S+ +G +  P +LDD+A LI  LL+L +     
Sbjct: 446 DYLASAERALGLVRSTLW--RDGRLLASYMDGTAHLPAYLDDHAMLIDALLELLQVRWRR 503

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
             L +AIEL       F D   GG+F T  +  +++ R K   D + P+GN+V+     R
Sbjct: 504 DDLRFAIELAEILLARFEDSGEGGFFFTASDHETLIHRPKPLADESLPAGNAVAARVFQR 563

Query: 700 LASIVAGSKSDYYRQNAEHSLAV 722
           L  ++   +   Y + A   LAV
Sbjct: 564 LGHLLGEPR---YLEAAARVLAV 583


>gi|292655805|ref|YP_003535702.1| thioredoxin domain containing protein [Haloferax volcanii DS2]
 gi|448289792|ref|ZP_21480955.1| thioredoxin domain containing protein [Haloferax volcanii DS2]
 gi|291370452|gb|ADE02679.1| thioredoxin domain containing protein [Haloferax volcanii DS2]
 gi|445581309|gb|ELY35670.1| thioredoxin domain containing protein [Haloferax volcanii DS2]
          Length = 703

 Score =  405 bits (1042), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 252/675 (37%), Positives = 349/675 (51%), Gaps = 76/675 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W E A   AR+ D PIFLSIGYS CHWCHVM  ESF 
Sbjct: 8   NRLDEEQSPYLRQHADNPVNWQPWDETALDAAREADKPIFLSIGYSACHWCHVMADESFS 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  +A++LN+ FV +KVDREERPD+D++Y T  Q + GGGGWPLSV+L+P+ KP   GTY
Sbjct: 68  DPDIAEVLNEEFVPVKVDREERPDLDRIYQTICQQVTGGGGWPLSVWLTPEGKPFFVGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP- 281
           FPPE + G PGF+ I+    ++W   R+ +          +++ L  +  +   P E P 
Sbjct: 128 FPPEPRRGAPGFRDIVESFAESWLTDREEIENRAEQWTSAITDRLEETPDT---PGEAPG 184

Query: 282 QNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
            + L    +   +  D   GGFG   PKFP+P  I  ML            G A  G++ 
Sbjct: 185 SDILDTTVQAALRGADRDHGGFGGDGPKFPQPGRIDAML-----------RGYAVSGRRE 233

Query: 341 VL----FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
            L     +L  MA GG+ DH+GGGFHRY VD  W VPHFEKMLYDQ  LA+ YLDA  LT
Sbjct: 234 ALDVARQSLDAMANGGLRDHLGGGFHRYCVDREWTVPHFEKMLYDQAGLASRYLDAARLT 293

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
            +  Y+ +  +  +++RR++    G  F+  DA S         +EG FYVWT  +V D+
Sbjct: 294 GNDSYATVAAETFEFVRRELTHDDGGFFATLDAQSG-------GEEGTFYVWTPDDVRDL 346

Query: 457 LGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS-ASKLGMPLEKY 514
           L E  A LF + Y + P GN            F+ K  ++ ++ ++A  A +  +   + 
Sbjct: 347 LPELDADLFCDRYGVTPGGN------------FEDKTTVLNVSATTADLADEYDLDESEV 394

Query: 515 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 574
            + L + R+ LF  R  R RP  D+KV+  WNGL+IS+FA+ S +L+ ++ +A       
Sbjct: 395 EDRLEKARKALFAAREGRERPARDEKVLAGWNGLMISAFAQGSVVLEDDSLAAD------ 448

Query: 575 GSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE 634
                     A  A  F+R  L+D +T  L     NG  K  G+L+DYAFL  G  DLY+
Sbjct: 449 ----------ARRALDFVRERLWDAETATLSRRVMNGEVKGDGYLEDYAFLARGAFDLYQ 498

Query: 635 FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
                  L +A++L       F D + G  + T     S++ R +E  D + PS   V+ 
Sbjct: 499 ATGDLAPLSFALDLARATRREFYDADAGTLYFTPESGESLVTRPQEPTDQSTPSSLGVAT 558

Query: 695 INLVRL------------ASIVAGSKSDYYRQNA-EH-SLAVFETRLKDMAMAVPLMCCA 740
              + L            A  V GS ++  R +  EH SLA+   +    A  VP +  A
Sbjct: 559 SLFLDLEQFAPDAGFGEVADAVLGSFANRVRGSPLEHVSLALAAEK---AASGVPELTVA 615

Query: 741 ADMLSVPSRKHVVLV 755
           AD   VP      L 
Sbjct: 616 AD--EVPDEWRATLA 628


>gi|338741363|ref|YP_004678325.1| hypothetical protein HYPMC_4552 [Hyphomicrobium sp. MC1]
 gi|337761926|emb|CCB67761.1| conserved protein of unknown function [Hyphomicrobium sp. MC1]
          Length = 682

 Score =  405 bits (1042), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 229/604 (37%), Positives = 332/604 (54%), Gaps = 45/604 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYLLQH  NPV W+AWG EA AEA++   PI LS+GY+ CHWCHVM  ESFE
Sbjct: 4   NRLKYETSPYLLQHQDNPVHWWAWGPEALAEAKRTGKPILLSVGYAACHWCHVMAHESFE 63

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D   A+++ND FV+IKVDREERPD+D +YM  +  L   GGWPL++FL  + KP  GGTY
Sbjct: 64  DPETARVMNDLFVNIKVDREERPDIDAIYMGALHRLGEQGGWPLTMFLDSEAKPFWGGTY 123

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP E +YGRP F T+L ++ +A+  + + +A++    +  L E  S +      PD +P 
Sbjct: 124 FPRESRYGRPSFVTVLLRIAEAYQSQPENVAKNTEALVAALKEEASTTDRVEAGPD-VPD 182

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
              R     ++++ D   GG   APKFP+     ++   + +  D        + ++ V+
Sbjct: 183 LVAR-----ITRAVDRDHGGINGAPKFPQWNIFWLLWRGAMRFGD-------EDAKQAVI 230

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL+ + +GGI+DH+GGGF RYSVD  W VPHFEKMLYD   L ++  + +  T+D  + 
Sbjct: 231 TTLRNICQGGIYDHLGGGFARYSVDPFWLVPHFEKMLYDNALLIDLITEVWRETQDPLFK 290

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHA 461
               + + +L+R+MIG  G   ++ DADS   EG    +EG FYVW  KE+ D+LG E A
Sbjct: 291 IRIAETVAWLKREMIGEAGGFAASLDADS---EG----EEGKFYVWHKKEIVDVLGPEDA 343

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
            +F + Y +   GN             +G  +L  L   S S+ +    L        E 
Sbjct: 344 AIFGKVYGVTRDGNFSEHAAITASGRIEGPTILNRLESQSFSSDEAEARLS-------EM 396

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           R KL   R+ R RP  DDK++  WNGL+I++ +RA+ +                 D+ E+
Sbjct: 397 RAKLLTRRAGRVRPGWDDKILADWNGLMIAAMSRAAIVF----------------DQPEW 440

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
           + +AE+A + +   L      RL HS+R G +KAP    DYA +I   L LYE  S  ++
Sbjct: 441 LGMAEAAFTCVATKL-SAGGDRLYHSYRGGLAKAPATASDYANMIWAALRLYEATSSDRY 499

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           L  A       D  + D + GGYF    +   V++R+K   D A PS N++ + NL+ LA
Sbjct: 500 LSQAQRWAAVLDTHYWDGDSGGYFTAADDTSDVVVRLKSASDDATPSANAIQLSNLITLA 559

Query: 702 SIVA 705
           ++  
Sbjct: 560 AMTG 563


>gi|448540737|ref|ZP_21623658.1| thioredoxin domain containing protein [Haloferax sp. ATCC BAA-646]
 gi|448549039|ref|ZP_21627815.1| thioredoxin domain containing protein [Haloferax sp. ATCC BAA-645]
 gi|448555786|ref|ZP_21631715.1| thioredoxin domain containing protein [Haloferax sp. ATCC BAA-644]
 gi|445708890|gb|ELZ60725.1| thioredoxin domain containing protein [Haloferax sp. ATCC BAA-646]
 gi|445713728|gb|ELZ65503.1| thioredoxin domain containing protein [Haloferax sp. ATCC BAA-645]
 gi|445717309|gb|ELZ69027.1| thioredoxin domain containing protein [Haloferax sp. ATCC BAA-644]
          Length = 703

 Score =  405 bits (1042), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 250/675 (37%), Positives = 349/675 (51%), Gaps = 76/675 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W E A   AR+ D PIFLSIGYS CHWCHVM  ESF 
Sbjct: 8   NRLDEEQSPYLRQHADNPVNWQPWDETALEAAREADKPIFLSIGYSACHWCHVMADESFS 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  +A++LN+ FV +KVDREERPD+D++Y    Q + GGGGWPLSV+L+P+ KP   GTY
Sbjct: 68  DPDIAEVLNEEFVPVKVDREERPDLDRIYQNICQQVTGGGGWPLSVWLTPEGKPFFVGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP- 281
           FPPE + G PGF+ I+    ++W   RD +          +++ L  +  +   P E P 
Sbjct: 128 FPPEPRRGAPGFRDIVESFAESWRTDRDEIENRADQWTSAITDRLEETPDT---PGEAPG 184

Query: 282 QNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
            + L    +   +  D   GGFG   PKFP+P  I  +L            G A  G++ 
Sbjct: 185 SDILDTTVQAALRGADRDHGGFGGDGPKFPQPGRIDALL-----------RGYAVSGRRE 233

Query: 341 VL----FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
            L     +L  MA GG+ DH+GGGFHRY VD  W VPHFEKMLYDQ  LA+ YLDA  LT
Sbjct: 234 ALDVARQSLDAMANGGLRDHLGGGFHRYCVDREWTVPHFEKMLYDQAGLASRYLDAARLT 293

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
            +  Y+ +  +  +++RR++    G  F+  DA S         +EG FYVWT  +V D+
Sbjct: 294 GNDSYATVAAETFEFVRRELTHDDGGFFATLDAQSG-------GEEGTFYVWTPDDVRDL 346

Query: 457 LGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS-ASKLGMPLEKY 514
           L E  A LF + Y + P GN            F+ K  ++ ++ ++A    +  +   + 
Sbjct: 347 LPELDADLFCDRYGVTPGGN------------FENKTTVLNVSATTAELVDEYDLDESEV 394

Query: 515 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 574
            + L + R+ LF  R  R RP  D+KV+  WNGL+IS+FA+ S +L+ ++         +
Sbjct: 395 EDRLEKARKALFAAREGRERPARDEKVLAGWNGLMISAFAQGSVVLEDDS---------L 445

Query: 575 GSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE 634
            SD       A  A  F+R  L+D++T  L     NG  K  G+L+DYAFL  G  DLY+
Sbjct: 446 ASD-------ARRALDFVRERLWDDETETLSRRAMNGEVKGDGYLEDYAFLARGAFDLYQ 498

Query: 635 FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
                  L +A++L       F D + G  + T     S++ R +E  D + PS   V+ 
Sbjct: 499 ATGDLAPLSFALDLARATRREFYDADAGTLYFTPESGESLVTRPQEPTDQSTPSSLGVAT 558

Query: 695 ------------INLVRLASIVAGSKSDYYRQNA-EH-SLAVFETRLKDMAMAVPLMCCA 740
                        +   +A  V GS ++  R +  EH SLA+   +    A  VP +  A
Sbjct: 559 SLFLDLEQFAPNADFGEVADAVLGSFANRVRGSPLEHVSLALAAEK---AASGVPELTVA 615

Query: 741 ADMLSVPSRKHVVLV 755
           AD   VP      L 
Sbjct: 616 AD--EVPDEWRATLA 628


>gi|385803931|ref|YP_005840331.1| hypothetical protein Hqrw_2868 [Haloquadratum walsbyi C23]
 gi|339729423|emb|CCC40679.1| YyaL family protein [Haloquadratum walsbyi C23]
          Length = 768

 Score =  405 bits (1042), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 234/628 (37%), Positives = 331/628 (52%), Gaps = 75/628 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W + A   A   D PIFLS+GY+ CHWCHVM  ESFE
Sbjct: 8   NRLDNEASPYLTQHAENPVNWQPWDDRALEYAESADKPIFLSVGYAACHWCHVMAEESFE 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+ VA +LND FV IKVDREERPD+D++Y T  Q + GGGGWPLSV+L+PD KP   GTY
Sbjct: 68  DDTVATILNDSFVPIKVDREERPDLDRIYQTICQLVTGGGGWPLSVWLTPDGKPFYVGTY 127

Query: 223 FPPEDKYGR---PGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           FP  ++  R   PGF  I +    AW+  R  L        + L + L    +++   D 
Sbjct: 128 FPKTERSDRGDTPGFLEICQSFATAWENDRSELESRANQWADTLQDRLEVDTNADTSIDV 187

Query: 280 L------------PQNA-----------LRLCAEQLSKSYDSRFGGFGS-APKFPRPVEI 315
                        PQ             L   +    ++ D+ +GGFGS  PKFP+P  I
Sbjct: 188 DDDDDVPAPDIASPQTDSDADDDSTMDLLTSVSTAAIRATDNEYGGFGSRGPKFPQPGRI 247

Query: 316 QMML-YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPH 374
           + ++  H++   +T      +        TL  MA GGI+DHVGGGFHRY+ D +W VPH
Sbjct: 248 EALIRAHAETNRETALDAATA--------TLDAMAAGGIYDHVGGGFHRYATDRKWTVPH 299

Query: 375 FEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET 434
           FEKMLYD  +L+ VYL A+  T    Y+ +  +   +L R++  P G  +S  D   A++
Sbjct: 300 FEKMLYDNAELSRVYLSAYQHTGRDRYARVAHETFAFLSRELQHPEGGFYSTLD---AQS 356

Query: 435 EGATRKKEGAFYVWTSKEVEDILGEHAI--LFKEHYYLKPTGNCDLSRMSDPHNEFKGKN 492
           EG    +EG FYVWT + + + + +  I  +  + + +   GN            F+G  
Sbjct: 357 EG----EEGRFYVWTPETIRNAITDQQIADIAIDRFGVTEGGN------------FEGST 400

Query: 493 VLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISS 552
           VL      S  A+K  +  ++ ++ L + R  LFD R  R RP+ D+K++ +WNGL ISS
Sbjct: 401 VLTATASVSQLATKYSLTTDEIMSQLADARDSLFDARMDRERPNRDEKILTAWNGLAISS 460

Query: 553 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP 612
            AR   IL++E                +Y E+A  A SFIR HL+D  + RL   +++G 
Sbjct: 461 LARGGLILETE----------------QYTELANDALSFIRTHLWDSDSGRLSRRYKDGD 504

Query: 613 SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDP 672
               G+LDDYAFL  G  DLY+     + L +A+ L  +  ELF D  G   + T  +  
Sbjct: 505 VDETGYLDDYAFLARGAFDLYQTTGAVEHLSFAVTLAESIVELFYDTAGETLYLTPEDAE 564

Query: 673 SVLLRVKE--DHDGAEPSGNSVSVINLV 698
           S++ R ++  D   +  +G +V  +N V
Sbjct: 565 SLVARPQDLRDQSTSSSAGIAVQTLNAV 592


>gi|345856701|ref|ZP_08809173.1| hypothetical protein DOT_0529 [Desulfosporosinus sp. OT]
 gi|344330213|gb|EGW41519.1| hypothetical protein DOT_0529 [Desulfosporosinus sp. OT]
          Length = 652

 Score =  405 bits (1042), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 236/634 (37%), Positives = 343/634 (54%), Gaps = 70/634 (11%)

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           ME ESFE++ VA +LN +F+SIKVDREERPDVD +YM + Q L G GGWPL++ ++PD K
Sbjct: 1   MERESFENDEVAGILNRYFISIKVDREERPDVDHLYMAFCQTLTGSGGWPLTIIMTPDKK 60

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           P   GTYFP  ++YGRPG   +  +V   W      L +S    +  +    +  + S+ 
Sbjct: 61  PFFAGTYFPKTERYGRPGLMELAEQVGTLWKTNEGKLRESSDEIVAAVHSQRTVPSKSSP 120

Query: 276 LPDELPQNA-------------LRLCAEQL--------SKSYDSRFGGFGSAPKFPRPVE 314
           LP  +  +               +  +EQL        ++S+D+R+GGFG APKFP P  
Sbjct: 121 LPSAVTNDPSLKDGNGPTSSEDFQTWSEQLIDKAYQVFAQSFDARYGGFGRAPKFPTPHT 180

Query: 315 IQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPH 374
           I  +L ++       +    S+  +MV  TL  MA+GGI+DHVG GF RYS DE+W VPH
Sbjct: 181 ISFLLRYA-------QDHPQSKALEMVRKTLDGMAQGGIYDHVGFGFARYSTDEKWLVPH 233

Query: 375 FEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET 434
           FEKMLYD   LA+ YL+++        +   ++I  Y+ RDM  P G  +SAEDAD+   
Sbjct: 234 FEKMLYDNALLASTYLESYQANHQPDDAQKAKEIFTYVLRDMTSPEGGFYSAEDADA--- 290

Query: 435 EGATRKKEGAFYVWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 493
           EG     EG F+VWT  E+E +LG + A ++   Y + P GN            F+GKN+
Sbjct: 291 EGV----EGKFHVWTRAEIETLLGKDTAAMYCAVYDITPEGN------------FEGKNI 334

Query: 494 L-IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISS 552
             + L +    A    +   + L IL + R+ LF  R KR  PH DDK++ +WNGL+I++
Sbjct: 335 PNLLLGNLEKIARNNSLAAAEVLQILEKARQTLFTAREKRIHPHKDDKILTAWNGLMIAA 394

Query: 553 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP 612
           FA+ +++L   A                Y+E AE+AA F+  HL      RL   +R G 
Sbjct: 395 FAKGAQVLGIPA----------------YLEAAENAADFVLTHL-KRNDGRLLARYREGH 437

Query: 613 SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDP 672
           S   G+LDDYAF I GLL+LY       +L  A++LQ  Q+ LFLD E GGY+ T  +  
Sbjct: 438 SAYLGYLDDYAFFIGGLLELYSVSGKPHYLQVALQLQEEQERLFLDEEDGGYYLTGSDGE 497

Query: 673 SVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAM 732
            +L R KE +DGA P+GNS++ +NL +LA +    +   + + AE  L VF + L++   
Sbjct: 498 ELLFRPKESYDGAIPAGNSITALNLFKLARLTGDER---WERKAEQQLLVFRSVLEEHPS 554

Query: 733 AVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENM 766
                  A      PS++ ++L G  ++ +   M
Sbjct: 555 GYTAFLQALQFAVHPSQE-LILAGALNATELPEM 587


>gi|355673311|ref|ZP_09058908.1| hypothetical protein HMPREF9469_01945 [Clostridium citroniae
           WAL-17108]
 gi|354814777|gb|EHE99376.1| hypothetical protein HMPREF9469_01945 [Clostridium citroniae
           WAL-17108]
          Length = 688

 Score =  405 bits (1042), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 252/677 (37%), Positives = 360/677 (53%), Gaps = 97/677 (14%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L +E SPYLLQH+ NPVDW+ W ++AF +A+    P+FLSIGYSTCHWCHVM  ESFE
Sbjct: 3   NHLYSEKSPYLLQHSENPVDWYPWSDQAFLKAQSEGKPVFLSIGYSTCHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+ +A++LN  FV +KVDREERP++D VYM+  QA+ G GGWPL++ ++PD KP   GTY
Sbjct: 63  DKEIARILNTHFVPVKVDREERPEIDMVYMSVCQAMTGRGGWPLTIIMTPDKKPFFAGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQ--------------SGAFAIEQLSEALS 268
            PP  +YG  G   +L KV   W+  R+ L Q              +GA  +    + + 
Sbjct: 123 LPPRSRYGMTGLTELLEKVSGLWETDREQLLQMSRQVMSLIHGREGNGADGMGTAGDGMD 182

Query: 269 ASASS-NKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQ-MMLYHSKKLE 326
            + ++ ++  D +         ++LS  +D + GGFG APKFP P  +  +M+Y++ + E
Sbjct: 183 GTGTAGDRTEDSVSWELAHEGFKELSAMFDKKHGGFGRAPKFPAPHNLLFLMMYYAARDE 242

Query: 327 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 386
           D            M   TL  MA+GGIHD +GGGF RYS DE W VPHFEKMLYD   LA
Sbjct: 243 D--------HAMDMAEQTLTAMARGGIHDQIGGGFSRYSTDEAWLVPHFEKMLYDNALLA 294

Query: 387 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 446
             YL+ + LT + +Y  I   IL Y+ R++    G  +  +DADS   EG     EG FY
Sbjct: 295 LAYLEGYRLTDNPYYRQIAERILIYVERELSDSDGGFYCGQDADS---EGV----EGKFY 347

Query: 447 VWTSKEVEDILGEHAIL--FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 504
           V++  E+  IL        F + + +   GN            F+GKN+   L++     
Sbjct: 348 VFSKDEIRQILDTPREYDDFCQWFGITEKGN------------FEGKNIPNLLHNPGYKD 395

Query: 505 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 564
           +            +G   +K++D R KR   H DDK++ SWN ++I+++A+A  +L    
Sbjct: 396 T---------FPFMGPVCKKVYDHRIKRMALHRDDKILTSWNSMMITAYAKAGLLL---- 442

Query: 565 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 624
                       D+K Y + A +A  F+ +HL DE  HR+   +R+G    PG LDDYA+
Sbjct: 443 ------------DQKAYEKKARNAQMFVEQHLVDE-NHRMFVRYRDGERAFPGNLDDYAY 489

Query: 625 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLD-REGGGYFNTTGEDPSVLL-RVKEDH 682
              GLL LYE      +L  A++      +LF D R+GG YF   G D   L+ R KE +
Sbjct: 490 YCLGLLALYEATLEVDYLELALKRAAQMADLFWDSRQGGFYF--YGRDVQELIHRPKEIY 547

Query: 683 DGAEPSGNSVSVINLV-----------------RLASIVAGSKSDYYRQNAEHSLAVFET 725
           DGA PSGNS +   L+                 +LA + AG+K   Y      SL  F  
Sbjct: 548 DGAVPSGNSAAAHVLLALASLTAEPRWQEFADRQLAFLAAGAKG--YPSAHCFSLMAF-- 603

Query: 726 RLKDMAMAVPLMCCAAD 742
            +K ++++  L+C +AD
Sbjct: 604 -MKALSISRELVCVSAD 619


>gi|256005004|ref|ZP_05429976.1| protein of unknown function DUF255 [Clostridium thermocellum DSM
           2360]
 gi|255991073|gb|EEU01183.1| protein of unknown function DUF255 [Clostridium thermocellum DSM
           2360]
          Length = 482

 Score =  404 bits (1039), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 222/518 (42%), Positives = 302/518 (58%), Gaps = 59/518 (11%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           S  K  NRL  E SPYLLQHA+NPVDW+ W +EAF +A++ + PIFLSIGYSTCHWCHVM
Sbjct: 2   SAYKQANRLIHEKSPYLLQHAYNPVDWYPWCDEAFEKAKRENKPIFLSIGYSTCHWCHVM 61

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
           E ESFEDE VA++LN  FVSIKVDREERPD+D +YMT  QAL G GGWPL++ ++PD KP
Sbjct: 62  ESESFEDEEVAEILNKNFVSIKVDREERPDIDSIYMTACQALTGHGGWPLTIIMTPDKKP 121

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKL 276
              GTYFP +D+ G PG  +IL+ V + W  ++D LA+  +  +  +SE++      +  
Sbjct: 122 FFAGTYFPKKDRMGMPGLISILKSVHNTWVNEKDSLAKYSSKVVSVISESIDDDYYYS-- 179

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
            DE+ ++       Q    +D+ +GGFG+APKFP P  +  +L +  K         A E
Sbjct: 180 VDEITEDIFEDAFSQFKYDFDNIYGGFGNAPKFPMPHNLYFLLRYWHK---------AKE 230

Query: 337 GQKMVLF--TLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
              +V+   TL  M  GGI+DH+G GF RYS DE+W VPHFEKMLYD   LA  YL+ + 
Sbjct: 231 EYALVMVEKTLDSMYSGGIYDHIGFGFCRYSTDEKWLVPHFEKMLYDNALLAIAYLETYQ 290

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
            TK+  Y+ I ++I  Y+ RDM  P G  +SAEDADS   EG    +EG FY+W+  E++
Sbjct: 291 ATKNKKYADIAKEIFTYVLRDMTSPEGGFYSAEDADS---EG----EEGKFYIWSPTEIK 343

Query: 455 DILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
           ++LGE     F ++Y +   GN            F+G N+   +N +     K  + L  
Sbjct: 344 EVLGESDGEKFCKYYNITEEGN------------FEGLNIPNLINSTIPDEDKEFVEL-- 389

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
                  CR+KLFD R KR  PH DDK++ +WNGL+I++ A   ++L  E          
Sbjct: 390 -------CRKKLFDHREKRVHPHKDDKILTAWNGLMIAALAIGGRVLGIE---------- 432

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG 611
                 +Y   AE A+ FI   L      RL   +R+G
Sbjct: 433 ------KYTLAAEKASEFIFSKLV-RPDGRLLARYRDG 463


>gi|448448658|ref|ZP_21591316.1| hypothetical protein C470_01183 [Halorubrum litoreum JCM 13561]
 gi|445814276|gb|EMA64242.1| hypothetical protein C470_01183 [Halorubrum litoreum JCM 13561]
          Length = 740

 Score =  404 bits (1039), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 258/724 (35%), Positives = 358/724 (49%), Gaps = 96/724 (13%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           S+    NRL  E SPYL QHA NPV+W  WG+EAF  AR+ DVP+F+SIGYS+CHWCHVM
Sbjct: 2   SQPTERNRLDGEASPYLQQHADNPVNWQPWGDEAFERAREHDVPVFVSIGYSSCHWCHVM 61

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
             ESFEDE VA ++N+ FV IKVDREERPDVD  +MT  Q + GGGGWPLS + +P+ KP
Sbjct: 62  AEESFEDESVAGVVNESFVPIKVDREERPDVDSTFMTVCQLVTGGGGWPLSAWCTPEGKP 121

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWD---------KKRDMLAQSGAFAIEQLSEAL 267
              GTYFPPE +   PGF+ +  ++ D+W          ++ D  A+S    +E +    
Sbjct: 122 FYVGTYFPPEPRQNHPGFRGLCERIADSWSDPEQREEMKRRADQWAESARDELESVPTPE 181

Query: 268 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLE 326
           +  +           + L   A    + YD   GGFGS   KFP P  I +++       
Sbjct: 182 AVGSDGEDTASPPGDDLLDTAAAAALRGYDEEHGGFGSGGAKFPMPGRIDLLM------- 234

Query: 327 DTGKSGEASEGQKMVLF----TLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 382
                  A  G+  +L     TL  MA GG++D +GGGFHRY+VD +W VPHFEKMLYD 
Sbjct: 235 ----RAYAGRGRDALLSAATGTLDGMANGGMYDQIGGGFHRYAVDRQWTVPHFEKMLYDN 290

Query: 383 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEG------ 436
            +L   YLD + L  D  Y+ +  + L +L R++   GG  FS  DA S   EG      
Sbjct: 291 AELPMAYLDGYRLAGDPAYARVASESLAFLDRELRHEGGAFFSTLDARSRPPEGRRGDDT 350

Query: 437 --ATRKKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 493
             +    EGAFYVWT +EV+ +L E A  L KE Y ++  GN +           +G  V
Sbjct: 351 GDSDEDVEGAFYVWTPEEVDAVLDEPAASLAKERYGIRSGGNFE-----------RGTTV 399

Query: 494 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSF 553
                     A+      ++    L   R  LFD R +RPRP  D+KV+ +WNG  IS+F
Sbjct: 400 PTIAASVEELAADRDRSPDEVREALTAARTALFDAREERPRPARDEKVLAAWNGRAISAF 459

Query: 554 ARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD--EQTHRLQHSFRNG 611
           ARA   L                  + Y E+A  A  F R  LYD   +T  L   + +G
Sbjct: 460 ARAGDTLG-----------------EPYAEIAREALDFCRERLYDAESETGALARRWLDG 502

Query: 612 PSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGED 671
             + PG+LDDYAF+  G LD+Y      + L +A+EL +   + F D + G  + T   D
Sbjct: 503 DVRGPGYLDDYAFVARGALDVYAATGDPEPLGFALELADALVDEFYDADDGTIYFTRDRD 562

Query: 672 PS---------VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 722
                      ++ R +E  D + PS   V+   L  L         D +R + E     
Sbjct: 563 ADGTPDDDAGPLIARPQEFTDRSTPSSLGVAAETLALL---------DGFRTDGE----- 608

Query: 723 FETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD---FENMLAAAHASYDLNKT 779
               L+++A  V  +   AD +     +H  LV   + V+    E  +AA     D  +T
Sbjct: 609 ----LREIAERV--VTTHADRIRGSPLEHASLVRAANVVETGGIEVTIAADEVPDDWRET 662

Query: 780 VSKK 783
           + ++
Sbjct: 663 LGER 666


>gi|118579433|ref|YP_900683.1| hypothetical protein Ppro_0998 [Pelobacter propionicus DSM 2379]
 gi|118502143|gb|ABK98625.1| protein of unknown function DUF255 [Pelobacter propionicus DSM
           2379]
          Length = 705

 Score =  404 bits (1039), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 236/674 (35%), Positives = 348/674 (51%), Gaps = 60/674 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL    SPYLLQHA NPVDW+ WGEEAF  A + D P+ +SIGY+TCHWCHVM  ESFE
Sbjct: 34  NRLIFAASPYLLQHADNPVDWYPWGEEAFETAAREDKPLMVSIGYATCHWCHVMARESFE 93

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPLSVFLSPDLKPLMGGT 221
           D  VA ++N   + +KVDREERPD+D +YMT  + L G G GWPL++FL+P+ KP    T
Sbjct: 94  DPEVAAIINRHLIPVKVDREERPDIDSLYMTAARILTGSGAGWPLTIFLTPERKPFYCAT 153

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA---LSASASSNKLPD 278
           Y P     G  G    + K+ + W+  RD++ ++    +  L E    +SA     ++ D
Sbjct: 154 YIPKTGSNGVLGIVETVEKISEIWNTNRDLINENSDTVVRALREIVAPVSADTDFGRVLD 213

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
           E            L   YD   GGFG   KFP P  +  +L   ++ ++        + +
Sbjct: 214 E--------AQASLQGMYDYLNGGFGGGAKFPLPHNLSFLLRMWRRTQN-------QDIE 258

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
           +MV +TL+ M  GGI+D +G GFHRY+VD  W VPHFEKMLYDQ  +A   L+AF    D
Sbjct: 259 EMVAYTLRMMRDGGIYDQLGFGFHRYAVDPEWRVPHFEKMLYDQALIAITCLEAFQAYGD 318

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE-DIL 457
            F   +  +I  ++  ++  P G   S   ADS          EG +Y+W+  E++ ++ 
Sbjct: 319 EFLKDMAMEIFSFVFDELTSPDGGFCSGLGADSG-------GGEGYYYLWSRGEIDRNLD 371

Query: 458 GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
           GE + LF E + +  TGN            F+G N+L +    +  A + G+   +    
Sbjct: 372 GETSRLFCEAFGVTDTGN------------FEGGNILYQPRSVALLARENGLDAGELDRR 419

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
           L   R KL +VR++R RP  D+K++V+WNGL++++ AR + +                S 
Sbjct: 420 LETARAKLLEVRAERVRPFRDEKILVAWNGLMVAALARGAAV----------------SG 463

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGS 637
            +  +E A SA  FI R+L+     RL  S+    +  P FL+DYAFL  G+++LY+   
Sbjct: 464 EQRLLEAARSAVRFIARNLH-TPAGRLLRSYHQSVASVPAFLEDYAFLCWGMVELYQVDG 522

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
               L  A+ L     +LF D   G +++T  E   VL+R+K  HDGA PSGNS++ + L
Sbjct: 523 DPVMLQGALGLARGMLDLFSDAVTGAFYDTASEAEQVLVRMKNAHDGAIPSGNSIACLCL 582

Query: 698 VRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGH 757
           ++L  I      +      E  L  +   L +  +A   M  A D    P  + + L+G 
Sbjct: 583 LKLGKICG---DEALTHAGERCLVSWMGSLAEQPIAHIQMVTALDFFLGPDVE-ITLIGD 638

Query: 758 KSSVDFENMLAAAH 771
           +       +L   H
Sbjct: 639 RDKPGVRELLNVIH 652


>gi|113867298|ref|YP_725787.1| hypothetical protein H16_A1279 [Ralstonia eutropha H16]
 gi|113526074|emb|CAJ92419.1| highly conserved protein containing a thioredoxin domain [Ralstonia
           eutropha H16]
          Length = 673

 Score =  404 bits (1038), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 255/627 (40%), Positives = 338/627 (53%), Gaps = 72/627 (11%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRLA E SPYL QHA NPVDW+ W EEAF  AR  D P+ LS+GY+TCHWCHVM  ESF
Sbjct: 3   TNRLATETSPYLRQHAENPVDWYPWCEEAFRRARDDDKPVLLSVGYATCHWCHVMAHESF 62

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           E+  +A L+ND F+SIKVDR+ERPD+D +Y    Q +  GGGWPL+VFL+P  +P  GGT
Sbjct: 63  ENPRIAGLMNDRFISIKVDRQERPDLDDIYQKVPQMMGQGGGWPLTVFLTPQGEPFYGGT 122

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA---LSASASSNKLPD 278
           YFPP+D+YGRPG   +L  + +AW  +R+ L  +    IEQ  +    L  +  S +  +
Sbjct: 123 YFPPDDRYGRPGLARVLLSLSEAWTHRREALRDT----IEQFQQGFRQLDDTVLSREDAE 178

Query: 279 ELP--QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
           E    Q+     A  L+++ D   GG G APKFP      ++L   ++  +         
Sbjct: 179 EAAEVQDLPAQTALALARNTDPTHGGLGGAPKFPNASAYDLVLRICQRTHEPALLDALER 238

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
                  TL  MA GGIHD +GGGF RYSVDERW VPHFEKMLYD GQL  +Y +A+ LT
Sbjct: 239 -------TLDGMAAGGIHDQLGGGFARYSVDERWAVPHFEKMLYDNGQLVTLYANAYRLT 291

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
               +  +    + Y+ RDM  P G  ++ EDADS   EG    +EG FYVWT+ EV+ +
Sbjct: 292 GKQAWRRVFEGTIAYIVRDMTHPDGGFYAGEDADS---EG----EEGRFYVWTAPEVKAV 344

Query: 457 LGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
           LGE    L    Y +   GN +            G++VL         A  L  PLE+  
Sbjct: 345 LGESEGALACRAYGVTEGGNFE-----------PGRSVL-------QRAVTL-TPLEE-- 383

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
             L   R +L   R++R RP  DD ++  WNGL+I     A +   + A           
Sbjct: 384 ARLEGWRERLLAARAQRVRPGRDDNILAGWNGLMIQGLCAAYQATGNPA----------- 432

Query: 576 SDRKEYMEVAESAASFIRRHLY--DEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
                ++  A  AASFI+  L   D   +R    +++G  K PGFL+DYAFL + L+DLY
Sbjct: 433 -----HLAAARRAASFIQDKLTMPDGGVYRY---WKDGTVKVPGFLEDYAFLANALIDLY 484

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           E     ++L  A EL     + F D   G YF     +P ++ R +  HDGA PSG S S
Sbjct: 485 ESCFDRRYLDRAAELVALIIDNFWD--DGLYFTPNDGEP-LIHRPRAPHDGAWPSGISAS 541

Query: 694 VINLVRLASIVAGSKSDYYRQNAEHSL 720
           V + +RL  +   S  D YR  AEH  
Sbjct: 542 VFSFLRLHEL---SGEDRYRDLAEHEF 565


>gi|317470765|ref|ZP_07930149.1| hypothetical protein HMPREF1011_00496 [Anaerostipes sp. 3_2_56FAA]
 gi|316901754|gb|EFV23684.1| hypothetical protein HMPREF1011_00496 [Anaerostipes sp. 3_2_56FAA]
          Length = 679

 Score =  404 bits (1038), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 248/661 (37%), Positives = 353/661 (53%), Gaps = 67/661 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L  E SPYLLQHAHNPV W+ WG EAF +AR  D P+FLSIGY++CHWCHVME ESFE
Sbjct: 7   NLLIHEKSPYLLQHAHNPVRWYPWGSEAFEKARAEDKPVFLSIGYASCHWCHVMEEESFE 66

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  VA+LLN  F+SIKVDREERPD+D VYM+  QA+ G GGWP+SVF++PD KP    TY
Sbjct: 67  DHEVAELLNKHFISIKVDREERPDIDSVYMSVCQAMTGSGGWPMSVFMTPDQKPFFAATY 126

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
            P   +Y   G   +L ++   W + R+ L + G    + L+     S + + L +++P 
Sbjct: 127 LPKTSRYHLTGLMDLLPRISLLWKQDRERLLKIGNEITDHLNTDQRPSETVS-LSEDVPA 185

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
            AL      L+ S+D+  GGFG+APKFP P  +  ++   K   D        +   M  
Sbjct: 186 QAL----ADLNASFDNVNGGFGTAPKFPTPAVLLFLIQQYKLCGD-------KDSLAMAE 234

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  M +GGI DH+GGGF RYS D+RW VPHFEKMLYD   L   Y +A++  ++  + 
Sbjct: 235 HTLLRMYRGGIFDHIGGGFSRYSTDDRWLVPHFEKMLYDNALLLEAYAEAYACCENPLFP 294

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHA 461
            I   ++  +  ++  P G  + ++DADS   EG    +EG +Y +T  EV  +LG E+ 
Sbjct: 295 EIADAVVSCVLNELSHPDGGFYCSQDADS---EG----EEGKYYTFTRDEVLHVLGEENG 347

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
            LF           C L  ++D  N F+GK++   L  S       G         L   
Sbjct: 348 SLF-----------CSLYDITDRGN-FEGKSIPNLLKQSPFPNDHEG---------LKRM 386

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           +R L+  R KR     D K++ SWN L+IS+  +AS+I                  R+++
Sbjct: 387 KRTLYLYRKKRTSLSTDKKILTSWNCLMISALTKASRIF----------------GREKF 430

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
           +  A+ A SF+ +HL  +   RL   + +G +   G L+DYAF    +L LY      ++
Sbjct: 431 LAAAQKAESFLDKHLRKDDG-RLFLRWCDGEAAYDGQLEDYAFYSLSMLSLYRSTFLEEY 489

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           L  A++  +    LF DRE GG+F  + E  +++L+ KE +DGA PSGNS ++  L  L+
Sbjct: 490 LEKAVQAADLMISLFFDREHGGFFLYSSESEALILKPKELYDGAMPSGNSAALHVLFILS 549

Query: 702 SIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV---PSRKHVVLVGHK 758
            I   S    YR   + + + F   L     A    C A  +LS    PSR+ V+    +
Sbjct: 550 KITGKS---IYRDCMDQTFSYFSPELSVHPSAY---CYALSVLSSQFHPSRQLVITTKKE 603

Query: 759 S 759
           S
Sbjct: 604 S 604


>gi|398893990|ref|ZP_10646420.1| thioredoxin domain-containing protein [Pseudomonas sp. GM55]
 gi|398183122|gb|EJM70617.1| thioredoxin domain-containing protein [Pseudomonas sp. GM55]
          Length = 662

 Score =  404 bits (1037), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 252/665 (37%), Positives = 348/665 (52%), Gaps = 71/665 (10%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           +NRLA E SPYL QHA NPVDW+ WGEEAF  AR  D P+ LS+GY+ CHWCHVM  ESF
Sbjct: 2   SNRLAKETSPYLRQHAENPVDWYPWGEEAFQHARDEDKPVHLSLGYAACHWCHVMAHESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           E+  +A+L+N+ F++IKVDR+ERPD+D +Y   VQ +  GGGWPL+VFL+P  +P  GGT
Sbjct: 62  ENPEIARLMNERFINIKVDRQERPDLDDIYQKIVQMMGQGGGWPLTVFLTPRREPFFGGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFPP++ YGR GF  +LR + +AW   R  L Q+ A  + Q   A+         P E  
Sbjct: 122 YFPPQESYGRAGFPQLLRGLSEAWQNNRAALEQNVAQFL-QGYRAMDTQMLEGDTPLEQD 180

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPV--EIQMMLYHSKKLEDTGKSGEASEGQK 339
           Q A    A   +++ D   GG G+APKFP     ++ + LY      D  +S E      
Sbjct: 181 QPA--AAARLFARNTDPVHGGLGNAPKFPNVACHDLVLRLYQRLHEPDLLRSLE------ 232

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
               TL  +A GG++DH+GGGF RY VDE W VPHFEKMLYD GQL  +Y DA+  T + 
Sbjct: 233 ---LTLDQVAAGGLYDHLGGGFARYCVDEHWAVPHFEKMLYDNGQLVKLYADAWRATGEP 289

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 459
            +  +  + +DY+ RDM  P G  +++EDADS   EG    +EG FYVWT  +V+ +LG+
Sbjct: 290 AWRRVFEETIDYILRDMTHPEGGFYASEDADS---EG----EEGKFYVWTPAQVQAVLGD 342

Query: 460 -HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
             A L  + Y +  +GN +            G  VL         A+ L    E  L  L
Sbjct: 343 PDAALACQAYGVTASGNFE-----------HGTTVL-------HRAATLDTAQEAQLAGL 384

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
              R KL   R++R RP  D+ ++ SWN L+I     A +                 +  
Sbjct: 385 ---RDKLLVARAQRIRPGRDENILTSWNALMIQGLCAAYQ----------------ATGT 425

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
             +++ A  AA FI   L       L  ++R   +K PGFL+DYAFL + LLDLYE    
Sbjct: 426 ATHLDAARRAADFILDRLSTPDGG-LYRAWREDTAKVPGFLEDYAFLANALLDLYECEFD 484

Query: 639 TKWLVWAIELQNTQDELFLDR--EGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
             +L  A  L     EL L++  E G YF     +P ++ R +   D A PSG S SV  
Sbjct: 485 QLYLERATRLV----ELILEKFWEDGLYFTPKDGEP-LVHRPRAPQDNAWPSGTSTSVFA 539

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVG 756
            +RL  +   +  + YR+ AE  L ++             +  A D +       +V+ G
Sbjct: 540 FLRLFEL---TGRELYRERAEQVLTMYRAAAAQNPFGFAHLLAAQDFVQR-GPISIVIAG 595

Query: 757 HKSSV 761
            +S+ 
Sbjct: 596 ERSAA 600


>gi|239608009|gb|EEQ84996.1| DUF255 domain-containing protein [Ajellomyces dermatitidis ER-3]
          Length = 823

 Score =  403 bits (1036), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 253/672 (37%), Positives = 355/672 (52%), Gaps = 69/672 (10%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRL+   SPY+  H +NPV W  W  EA   A+K +  +FL         CHVME ESF
Sbjct: 23  VNRLSQSKSPYVRGHMNNPVAWQMWDSEAITLAKKLNRMVFLR--------CHVMEKESF 74

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
               VA +LN  F+ IK+DREERPD+D+VYM YVQA  G GGWPL+VFL+PDL+P+ GGT
Sbjct: 75  MSPEVAAILNKSFIPIKLDREERPDIDEVYMNYVQATTGSGGWPLNVFLTPDLEPVFGGT 134

Query: 222 YFPPEDKYGRPG--------FKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE-ALSASAS 272
           Y+P       P         F  IL K++D W  ++    +S     +QL E A   + S
Sbjct: 135 YWPGPHSSTLPALGGEGHVTFIDILEKLRDVWQTQQLRCRESAKDITKQLREFAEEGTHS 194

Query: 273 SNKLPDELPQNALRLCA---EQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK---KLE 326
             K  D      + L     +  +  +D   GGF  APKF  P  +  ++  S+    + 
Sbjct: 195 KQKAADADEDLEVELLEESYQHFASRFDPVNGGFSRAPKFATPANLSFLINLSRYPSAVS 254

Query: 327 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 386
           D     E S   +M   TL  M++GGIHD +G GF RYSV   W +PHFEKMLYDQ QL 
Sbjct: 255 DIVGYDECSRALEMATKTLISMSRGGIHDQIGHGFARYSVTADWSLPHFEKMLYDQAQLL 314

Query: 387 NVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
           NVY+DAF    +        DI  Y+    ++ P G  +S+EDADS  T   T K+EGAF
Sbjct: 315 NVYVDAFDSAHNPELLGAIYDIATYITSPPILSPTGGFYSSEDADSLPTPSDTDKREGAF 374

Query: 446 YVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 504
           YVWT KE + ILG+  A +   H+ + P GN  ++R +DPH+EF  +NVL      +  A
Sbjct: 375 YVWTHKEFKQILGQRDADVCARHWGVLPDGN--VARGNDPHDEFINQNVLSIKVTPAKLA 432

Query: 505 SKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 563
            + G+  E+ + I+   R KL + R SKR RP LDDK+IVSWNGL I + A+ S +L++ 
Sbjct: 433 KEFGLSEEEVVKIIKASREKLREYRESKRVRPGLDDKIIVSWNGLAIGALAKCSVVLEN- 491

Query: 564 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP-SKAPGFLDDY 622
                    V  +  +E+   AE+AA FIR++L+D  + +L   +R+G     PGF DDY
Sbjct: 492 ---------VDRAKAQEFRLAAENAAKFIRQNLFDPASGQLWRIYRDGERGDTPGFADDY 542

Query: 623 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR---------------------EG 661
           ++L SGL+DLYE      +L +A +LQ   +  FL +                       
Sbjct: 543 SYLASGLIDLYEATFDDGYLQFAEQLQQYLNTYFLAQGPTPTPSPRTSITTESTPAPSSS 602

Query: 662 GGYFNT------TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQN 715
            GY+ T          P+ L R+K   D + PS N V   NL+RL++++   + D Y++ 
Sbjct: 603 TGYYTTPSTIHQASAHPAPLFRLKTGTDASTPSPNGVIAQNLLRLSTLL---EDDTYKRL 659

Query: 716 AEHSLAVFETRL 727
           A  ++  F   +
Sbjct: 660 ARETVNAFAVEI 671


>gi|448424193|ref|ZP_21582319.1| hypothetical protein C473_04874 [Halorubrum terrestre JCM 10247]
 gi|445682858|gb|ELZ35271.1| hypothetical protein C473_04874 [Halorubrum terrestre JCM 10247]
          Length = 742

 Score =  403 bits (1036), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 258/726 (35%), Positives = 358/726 (49%), Gaps = 98/726 (13%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           S+    NRL  E SPYL QHA NPV+W  WG+EAF  AR+ DVP+F+SIGYS+CHWCHVM
Sbjct: 2   SQPTERNRLDGEASPYLQQHADNPVNWQPWGDEAFERAREHDVPVFVSIGYSSCHWCHVM 61

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
             ESFEDE VA ++N+ FV IKVDREERPDVD  +MT  Q + GGGGWPLS + +P+ KP
Sbjct: 62  AEESFEDESVAGVVNESFVPIKVDREERPDVDSTFMTVCQLVTGGGGWPLSAWCTPEGKP 121

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWD---------KKRDMLAQSGAFAIEQLSEAL 267
              GTYFPPE +   PGF+ +  ++ D+W          ++ D  A+S    +E +    
Sbjct: 122 FYVGTYFPPEPRQNHPGFRGLCERIADSWSDPEQREEMKRRADQWAESARDELESVPTPE 181

Query: 268 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLE 326
           +  +   +       + L   A    + YD   GGFGS   KFP P  I +++       
Sbjct: 182 AVGSDGEETASPPGDDLLDTAAAAALRGYDEEHGGFGSGGAKFPMPGRIDLLM------- 234

Query: 327 DTGKSGEASEGQKMVLF----TLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 382
                  A  G+  +L     TL  MA GG++D +GGGFHRY+VD +W VPHFEKMLYD 
Sbjct: 235 ----RAYAGRGRDALLSAATGTLDGMANGGMYDQIGGGFHRYAVDRQWTVPHFEKMLYDN 290

Query: 383 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEG------ 436
            +L   YLD + L  D  Y+ +  + L +L R++   GG  FS  DA S   EG      
Sbjct: 291 AELPMAYLDGYRLAGDPAYARVASESLAFLDRELRHEGGAFFSTLDARSRPPEGRRGDDT 350

Query: 437 ----ATRKKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGK 491
                    EGAFYVWT +EV+ +L E A  L KE Y ++  GN +           +G 
Sbjct: 351 GDSDEDEDVEGAFYVWTPEEVDAVLDEPAASLAKERYGIRSGGNFE-----------RGT 399

Query: 492 NVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVIS 551
            V          A+      ++    L   R  LFD R +RPRP  D+KV+ +WNG  IS
Sbjct: 400 TVPTIAASVEELAADRDRSPDEVREALTAARTALFDAREERPRPARDEKVLAAWNGRAIS 459

Query: 552 SFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD--EQTHRLQHSFR 609
           +FARA   L                  + Y E+A  A  F R  LYD   +T  L   + 
Sbjct: 460 AFARAGDTLG-----------------EPYAEIAREALDFCRERLYDAESETGALARRWL 502

Query: 610 NGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTG 669
           +G  + PG+LDDYAF+  G LD+Y      + L +A+EL +   + F D + G  + T  
Sbjct: 503 DGDVRGPGYLDDYAFVARGALDVYAATGDPEPLGFALELADALVDEFYDADDGTIYFTRD 562

Query: 670 EDPS---------VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 720
            D           ++ R +E  D + PS   V+   L  L         D +R + E   
Sbjct: 563 RDADGTPDDDAGPLIARPQEFTDRSTPSSLGVAAETLALL---------DGFRTDGE--- 610

Query: 721 AVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD---FENMLAAAHASYDLN 777
                 L+++A  V  +   AD +     +H  LV   + V+    E  +AA     D  
Sbjct: 611 ------LREIAERV--VTTHADRIRGSPLEHASLVRAANVVETGGIEVTIAADEVPDDWR 662

Query: 778 KTVSKK 783
           +T+ ++
Sbjct: 663 ETLGER 668


>gi|448639421|ref|ZP_21676747.1| thioredoxin [Haloarcula sinaiiensis ATCC 33800]
 gi|445762700|gb|EMA13918.1| thioredoxin [Haloarcula sinaiiensis ATCC 33800]
          Length = 717

 Score =  403 bits (1035), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 240/666 (36%), Positives = 354/666 (53%), Gaps = 55/666 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL    SPYL QHA NPV+W  W E A   AR+RDVPIFLSIGY+ CHWCHVME ESFE
Sbjct: 11  NRLDEAESPYLRQHADNPVNWQPWDETALEAARERDVPIFLSIGYAACHWCHVMEEESFE 70

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE +A+ LN+ FV IKVDREERPD+D VYM+  Q + GGGGWPLS +L+P+ +P   GTY
Sbjct: 71  DEAIAEQLNENFVPIKVDREERPDLDSVYMSICQQVTGGGGWPLSAWLTPEGEPFYVGTY 130

Query: 223 FPPEDKYGRPGFKTILRKVKDAW---DKKRDM--LAQSGAFAIEQLSEALSASASSNKLP 277
           FPPE+K G+PGF  +L+++ ++W   +++ +M   AQ    AIE   EA  A       P
Sbjct: 131 FPPEEKRGQPGFGDLLQRLANSWSDPEQREEMENRAQQWTEAIESDLEATPAD------P 184

Query: 278 DELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
           ++  ++ ++       +  D + GG+GS  PKFP+   +  +L   +   D G+     +
Sbjct: 185 EDPAEDIIQTAGTIAHRGADRQDGGWGSGGPKFPQNGRLHALL---RAYSDGGQ----ED 237

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
              +V  TL  MA  G++DHVGGGFHRY+ D++W VPHFEKMLYD  ++   +L  +   
Sbjct: 238 YLNVVEETLDVMADRGLYDHVGGGFHRYATDQQWAVPHFEKMLYDNAEIPRAFLAGYQAI 297

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT-RKKEGAFYVWTSKEVED 455
               Y+ + R+  ++++R++  P G  FS  DA+SA  +      +EG FYVWT +EV +
Sbjct: 298 GSERYASVVRETFEFVQRELQHPDGGFFSTLDAESAPPDDPDGDSEEGLFYVWTPEEVHE 357

Query: 456 ILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
            + +   A +F +++ +   GN            F+G  VL      +  A +     + 
Sbjct: 358 AVDDETDAEVFCDYFGVTERGN------------FEGATVLAVRKPVAVLAEEYDRSEDD 405

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
               L     + F  R  RPRP  D+KV+  WNGL+I + A  + +L             
Sbjct: 406 ITASLQRALNETFKARKSRPRPARDEKVLAGWNGLMIRALAEGAIVLDD----------- 454

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
                 +Y +VA  A SF+R+HL+D    RL   +++      G+L+DYAFL  G L L+
Sbjct: 455 ------QYADVAADALSFVRKHLWDADAGRLNRRYKDDDVAIDGYLEDYAFLGRGALTLF 508

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           E     + L +A++L     E F D E G  F T     S++ R +E  D + PS   V+
Sbjct: 509 EATGDVEHLAFAMDLGQAITEAFWDDEQGTLFFTPTGGESLVARPQELTDQSTPSSTGVA 568

Query: 694 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVV 753
           V  L+ L+     S+ D +   AE  +     R+    +    +  A D     + + + 
Sbjct: 569 VDLLLSLSHF---SEDDRFESVAERVIRTHADRVSSNPLQHASLTLATDTYEQGALE-LT 624

Query: 754 LVGHKS 759
           LVG +S
Sbjct: 625 LVGDQS 630


>gi|448502781|ref|ZP_21612730.1| hypothetical protein C464_11620 [Halorubrum coriense DSM 10284]
 gi|445693844|gb|ELZ45985.1| hypothetical protein C464_11620 [Halorubrum coriense DSM 10284]
          Length = 745

 Score =  403 bits (1035), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 261/697 (37%), Positives = 347/697 (49%), Gaps = 97/697 (13%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           S+    NRL  E SPYL QHA NPV+W  WG+EAF  AR+ DVP+F+SIGYS+CHWCHVM
Sbjct: 2   SQPTERNRLDGEASPYLRQHADNPVNWQPWGDEAFERAREHDVPVFVSIGYSSCHWCHVM 61

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
             ESFEDE VA ++ND FV IKVDREERPDVD  +MT  Q + GGGGWPLS + +P+ KP
Sbjct: 62  AEESFEDESVAAVVNDSFVPIKVDREERPDVDSTFMTVCQLVTGGGGWPLSAWCTPEGKP 121

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWD---------KKRDMLAQSGAFAIEQLSEAL 267
              GTYFPPE +  +PGF+ +  ++ D+W          ++ D   QS    +E +    
Sbjct: 122 FYVGTYFPPEPRRNQPGFRGLCERIADSWSDPEQREEMKRRADQWTQSARDELESVPTPA 181

Query: 268 SASAS--SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKK 324
              AS   + L D     ALR         YD  +GGFGS   KFP P  I +++     
Sbjct: 182 EGDASPPGSDLLDTAAAAALR--------GYDEEYGGFGSGGAKFPMPGRIDLLM----- 228

Query: 325 LEDTGKSGEASEGQKMVLF----TLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 380
                    A  G+  +L     TL  MA GG++D VGGGFHRY+VD +W VPHFEKMLY
Sbjct: 229 ------RAYAGRGRDALLSAATGTLDGMADGGMYDQVGGGFHRYAVDRQWTVPHFEKMLY 282

Query: 381 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDA----------D 430
           D  +L   YLD + LT D  Y+ +  + L +L R++   GG  FS  DA          D
Sbjct: 283 DNAELPMAYLDGYRLTGDPRYARVASESLAFLDRELRHEGGGFFSTLDARSRRPASRGSD 342

Query: 431 SAETEGATRKK--------EGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRM 481
           S   E A            EGAFYVWT +EV+ +L E A  L K+ Y ++  GN +    
Sbjct: 343 SEADEEADVDAGNVGGDDVEGAFYVWTPEEVDAVLDEPAASLAKDRYGIRSGGNFE---- 398

Query: 482 SDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKV 541
                  +G  V          A+   +  E     L E R  LFD R  RPRP  D+KV
Sbjct: 399 -------RGTTVPTIAASVEGLAADRDLSPEAVRETLVEARTALFDARESRPRPARDEKV 451

Query: 542 IVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQT 601
           + SWNG  IS+FARA   L                  + Y E+A  A  F R  LYD   
Sbjct: 452 LASWNGRAISAFARAGDSLG-----------------EPYAEIAREALDFCRERLYDADA 494

Query: 602 HRLQHSFR--NGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR 659
                + R  +G  + PG+LDDYAFL  G LD Y      + L +A++L     E F D 
Sbjct: 495 DAGALARRWLDGDVRGPGYLDDYAFLARGALDTYAATGDPEPLGFALDLAGALVEEFYDA 554

Query: 660 EGGGYFNT------TGEDPS----VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKS 709
           + G  + T      T +D +    ++ R +E  D + PS   V+   L  L    A  + 
Sbjct: 555 DDGTIYFTRDLDDGTADDRADAGPLIARPQEFTDRSTPSSLGVAAETLALLDGFRADGE- 613

Query: 710 DYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 746
             +R+ AE  +     R++   +    +  AAD++  
Sbjct: 614 --FREIAERVVTTHGDRIRGSPLEHASLVRAADLVET 648


>gi|448658484|ref|ZP_21682884.1| thioredoxin [Haloarcula californiae ATCC 33799]
 gi|445761209|gb|EMA12458.1| thioredoxin [Haloarcula californiae ATCC 33799]
          Length = 717

 Score =  402 bits (1034), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 240/666 (36%), Positives = 354/666 (53%), Gaps = 55/666 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL    SPYL QHA NPV+W  W E A   AR+RDVPIFLSIGY+ CHWCHVME ESFE
Sbjct: 11  NRLDEAESPYLRQHADNPVNWQPWDETALEAARERDVPIFLSIGYAACHWCHVMEEESFE 70

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE +A+ LN+ FV IKVDREERPD+D VYM+  Q + GGGGWPLS +L+P+ +P   GTY
Sbjct: 71  DEAIAEQLNENFVPIKVDREERPDLDSVYMSICQQVTGGGGWPLSAWLTPEGEPFYVGTY 130

Query: 223 FPPEDKYGRPGFKTILRKVKDAW---DKKRDM--LAQSGAFAIEQLSEALSASASSNKLP 277
           FPPE+K G+PGF  +L+++  +W   +++ +M   AQ    AIE   EA  A       P
Sbjct: 131 FPPEEKRGQPGFGDLLQRLSGSWSDPEQREEMENRAQQWTEAIESDLEATPAD------P 184

Query: 278 DELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
           ++  ++ ++       +  D + GG+GS  PKFP+   +  +L   +   D G+     +
Sbjct: 185 EDPAEDIIQTAGTIAHRGADRQDGGWGSGGPKFPQNGRLHALL---RAYADGGQ----ED 237

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
              +V  TL  MA  G++DHVGGGFHRY+ D++W VPHFEKMLYD  ++   +L  +   
Sbjct: 238 YLNVVEETLDVMADRGLYDHVGGGFHRYATDQQWAVPHFEKMLYDNAEIPRAFLAGYQAI 297

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT-RKKEGAFYVWTSKEVED 455
               Y+ + R+  ++++R++  P G  FS  DA+SA  +      +EG FYVWT +EV +
Sbjct: 298 GSERYASVVRETFEFVQRELQHPDGGFFSTLDAESAPPDDPDGDSEEGLFYVWTPEEVHE 357

Query: 456 ILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
            + +   A +F +++ +   GN            F+G  VL      +  A +     + 
Sbjct: 358 AVDDETDAEVFCDYFGVTERGN------------FEGATVLAVRKPVAVLAEEYDRSEDD 405

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
               L     + F+ R  RPRP  D+KV+  WNGL+I + A  + +L             
Sbjct: 406 ITASLQRALNETFEARKSRPRPARDEKVLAGWNGLMIRALAEGAIVLDD----------- 454

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
                 +Y +VA  A SF+R+HL+D    RL   +++      G+L+DYAFL  G L L+
Sbjct: 455 ------QYADVAADALSFVRKHLWDADAGRLNRRYKDDDVAIDGYLEDYAFLGRGALTLF 508

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           E     + L +A++L     E F D E G  F T     S++ R +E  D + PS   V+
Sbjct: 509 EATGDVEHLAFAMDLGQAITEAFWDDEQGTLFFTPTGGESLVARPQELTDQSTPSSTGVA 568

Query: 694 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVV 753
           V  L+ L+     S+ D +   AE  +     R+    +    +  A D     + + + 
Sbjct: 569 VDLLLSLSHF---SEDDRFESVAERVIRTHADRVSSNPLQHASLTLATDTYEQGALE-LT 624

Query: 754 LVGHKS 759
           LVG +S
Sbjct: 625 LVGDQS 630


>gi|451980948|ref|ZP_21929330.1| conserved hypothetical protein, contains Thioredoxin domain
           [Nitrospina gracilis 3/211]
 gi|451761870|emb|CCQ90575.1| conserved hypothetical protein, contains Thioredoxin domain
           [Nitrospina gracilis 3/211]
          Length = 697

 Score =  402 bits (1034), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 232/598 (38%), Positives = 325/598 (54%), Gaps = 48/598 (8%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           + +K+TN+L  E SPYLLQHAHNPVDW  WG EAF  A+K + P+ +SIGY+TCHWCHVM
Sbjct: 2   TEHKYTNKLIHEKSPYLLQHAHNPVDWHPWGPEAFELAKKANKPLLVSIGYATCHWCHVM 61

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
           E ESFED  +A+ LN  FV IKVDREERPDVD +YM  VQA    GGWPL+VF++PD  P
Sbjct: 62  ERESFEDPEIAEYLNAHFVPIKVDREERPDVDSIYMKSVQAFGQQGGWPLNVFVTPDGVP 121

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKL 276
             GGTY+P   +YG P F  +L  +   W ++ + + +     I  L +      ++   
Sbjct: 122 FYGGTYYPSVGRYGLPSFLEVLTFLDKTWREEPEKVEKQSTALINYLKDVSKQEQNTEGT 181

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGG--FGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEA 334
            D+L  +      E  ++SYD    G  F    KFP  + + ++L H  +  D       
Sbjct: 182 VDDLGFHGENKTREFYTQSYDRLHHGFLFQQQNKFPPSMGLSLLLRHHHRTGD------- 234

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
           +   +MV  TL+ M +GGI+D +GGG  RYS D +W VPHFEKMLYD G      ++ + 
Sbjct: 235 ALSLEMVENTLRAMKQGGIYDQIGGGLARYSTDHQWLVPHFEKMLYDNGLFVTALIETYQ 294

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
           +T    ++    D+L Y+ RDM    G  +SAEDADS   EG     EG FYVWT +E+E
Sbjct: 295 VTGKREFADYANDVLQYIDRDMTSAEGAFYSAEDADS---EGV----EGKFYVWTQEEIE 347

Query: 455 DILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
            +LG E A +   +Y + P GN            ++GKN+L         A  LG+PL+ 
Sbjct: 348 KVLGRETASIAIPYYNVLPNGN------------WEGKNILHVKRPPEQIAKDLGLPLDH 395

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
               + E R KL  VRS+R RP LDDK++ SWNGL+I + A+  ++L             
Sbjct: 396 VEAKIAEAREKLLAVRSQRIRPLLDDKILTSWNGLMIRAMAQVGRVL------------- 442

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
              D  + +  AE A  FI  +L   +  +L   +R G ++  G+L DY  +     DLY
Sbjct: 443 ---DDADRIAKAEKALHFIWNNLRTPEG-KLLRRWREGEARYDGYLCDYTSIALACCDLY 498

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNS 691
           E      ++  A  L  T +E F ++  G Y+ T  +   +++R    +DG EPSGNS
Sbjct: 499 EATYNPDYINKAEALMKTVEEKFGNQ--GAYYETASDAEELIVRQVSGYDGVEPSGNS 554


>gi|300710941|ref|YP_003736755.1| hypothetical protein HacjB3_07890 [Halalkalicoccus jeotgali B3]
 gi|448296966|ref|ZP_21487016.1| hypothetical protein C497_14832 [Halalkalicoccus jeotgali B3]
 gi|299124624|gb|ADJ14963.1| hypothetical protein HacjB3_07890 [Halalkalicoccus jeotgali B3]
 gi|445580643|gb|ELY35021.1| hypothetical protein C497_14832 [Halalkalicoccus jeotgali B3]
          Length = 709

 Score =  402 bits (1034), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 253/669 (37%), Positives = 355/669 (53%), Gaps = 59/669 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           N   NRL  E SPYL QHA NPV+W  W + A AEA +RDVPIFLS+GYS CHWCHVME 
Sbjct: 2   NTDRNRLDEEASPYLRQHADNPVNWQPWDDAALAEAEERDVPIFLSVGYSACHWCHVMEE 61

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE +AK LN+ FV IKVDREERPD+D +Y T  Q +   GGWPLSV+L+PD +P  
Sbjct: 62  ESFEDEDIAKQLNENFVPIKVDREERPDLDSIYQTICQLVTRRGGWPLSVWLTPDGRPFY 121

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFP E + G PGF  +L  + ++W+  R+ +        +Q + A++          
Sbjct: 122 VGTYFPRESRRGTPGFGDLLGNLAESWEGDREEIENRA----DQWTRAITDQLEEVPEAG 177

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFG-SAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEG 337
           E P+  L   A+   +  D   GGFG + PKFP+   ++++L   +  + TG+       
Sbjct: 178 ERPEGVLIEAADAALRGADREHGGFGQNGPKFPQTARLEVLL---RAYDRTGR----GPY 230

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
            ++V  TL  M   G++D +GGGFHRY+ D  W VPHFEKMLYD  +L   YL  + +T 
Sbjct: 231 DEVVRETLDAMGSRGMYDQLGGGFHRYATDREWVVPHFEKMLYDNAELPRSYLAGYRVTG 290

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
              Y+ I R+ L ++ R++  P G  +S  DA S + E   R +EGAFYVWT   VE++L
Sbjct: 291 QERYARIVRETLAFVERELGHPDGGFYSTLDAQSEDPETGER-EEGAFYVWTPAAVEEVL 349

Query: 458 GEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
            E  A LF E Y +   GN            F+GK VL       + A + G+  ++  +
Sbjct: 350 DEERAALFCERYGVDKRGN------------FEGKTVLTLARSVGSLAEEYGLDEDEVED 397

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
            L E  R+LF+ R +RPRP  D+KV+  WNGL+ISSFA A   L              GS
Sbjct: 398 RLVEAERRLFEAREERPRPRRDEKVLAGWNGLMISSFAEAGLTLD-------------GS 444

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
               Y + A  A  F+R  L+D +  RL   F++   K  G+L+DYAFL  G  D Y+  
Sbjct: 445 ----YAKRAAEALEFVREQLWDTEGKRLSRRFKDREVKIDGYLEDYAFLARGAFDTYQAT 500

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
              + L +A++L    +  F D E    + T      ++ R +E +D + PS   V+   
Sbjct: 501 GDVEHLKFALDLARAIEREFWDEERETLYFTPEAGEELVARPQELNDQSTPSSLGVACDV 560

Query: 697 LVRLASI-----------VAGSKSDYYRQNA-EH-SLAVFETRLKDMAMAVPLMCCAADM 743
           L+ L+             V     D  R N  EH +LA+   R ++ ++ V     AAD+
Sbjct: 561 LLSLSQFADADFEGIVERVLARHGDRIRGNPLEHATLALVADRFENGSLEV---TVAADV 617

Query: 744 LSVPSRKHV 752
           L    R+ +
Sbjct: 618 LPTEWRERL 626


>gi|448506299|ref|ZP_21614409.1| hypothetical protein C465_02621 [Halorubrum distributum JCM 9100]
 gi|448525080|ref|ZP_21619498.1| hypothetical protein C466_12493 [Halorubrum distributum JCM 10118]
 gi|445699949|gb|ELZ51967.1| hypothetical protein C465_02621 [Halorubrum distributum JCM 9100]
 gi|445700052|gb|ELZ52067.1| hypothetical protein C466_12493 [Halorubrum distributum JCM 10118]
          Length = 742

 Score =  402 bits (1034), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 258/726 (35%), Positives = 357/726 (49%), Gaps = 98/726 (13%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           S+    NRL  E SPYL QHA NPV+W  WG+EAF  AR+ DVP+F+SIGYS+CHWCHVM
Sbjct: 2   SQPTERNRLDGEASPYLQQHADNPVNWQPWGDEAFERAREHDVPVFVSIGYSSCHWCHVM 61

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
             ESFEDE VA ++N+ FV IKVDREERPDVD  +MT  Q + GGGGWPLS + +P+ KP
Sbjct: 62  AEESFEDESVAGVVNESFVPIKVDREERPDVDSTFMTVCQLVTGGGGWPLSAWCTPEGKP 121

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWD---------KKRDMLAQSGAFAIEQLSEAL 267
              GTYFPPE +   PGF+ +  ++ D+W          ++ D  A+S    +E +    
Sbjct: 122 FYVGTYFPPEPRQNHPGFRGLCERIADSWSDPEQREEMKRRADQWAESARDELESVPTPE 181

Query: 268 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLE 326
           +  +           + L   A    + YD   GGFGS   KFP P  I +++       
Sbjct: 182 TVGSDGEDTASPPGDDLLDTAAAAALRGYDEEHGGFGSGGAKFPMPGRIDLLM------- 234

Query: 327 DTGKSGEASEGQKMVLF----TLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 382
                  A  G+  +L     TL  MA GG++D +GGGFHRY+VD +W VPHFEKMLYD 
Sbjct: 235 ----RAYAGRGRDALLSAATGTLDGMANGGMYDQIGGGFHRYAVDRQWTVPHFEKMLYDN 290

Query: 383 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEG------ 436
            +L   YLD + L  D  Y+ +  + L +L R++   GG  FS  DA S   EG      
Sbjct: 291 AELPMAYLDGYRLAGDPAYARVASESLAFLDRELRHEGGAFFSTLDARSRPPEGRRGDDT 350

Query: 437 ----ATRKKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGK 491
                    EGAFYVWT +EV+ +L E A  L KE Y ++  GN +           +G 
Sbjct: 351 GDSDEDEDVEGAFYVWTPEEVDAVLDEPAASLAKERYGIRSGGNFE-----------RGT 399

Query: 492 NVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVIS 551
            V          A+      ++    L   R  LFD R +RPRP  D+KV+ +WNG  IS
Sbjct: 400 TVPTIAASVEELAADRDRSPDEVREALTAARTALFDAREERPRPARDEKVLAAWNGRAIS 459

Query: 552 SFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY--DEQTHRLQHSFR 609
           +FARA   L                  + Y E+A  A  F R  LY  D +T  L   + 
Sbjct: 460 AFARAGDTLG-----------------EPYAEIAREALEFCRERLYDADRETGALARRWL 502

Query: 610 NGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTG 669
           +G  + PG+LDDYAF+  G LD+Y      + L +A+EL +   + F D + G  + T  
Sbjct: 503 DGDVRGPGYLDDYAFVARGALDVYAATGDPEPLGFALELADALVDEFYDADDGTIYFTRD 562

Query: 670 EDPS---------VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 720
            D           ++ R +E  D + PS   V+   L  L         D +R + E   
Sbjct: 563 RDADGTPDDDAGPLIARPQEFTDRSTPSSLGVAAETLALL---------DGFRTDGE--- 610

Query: 721 AVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD---FENMLAAAHASYDLN 777
                 L+++A  V  +   AD +     +H  LV   + V+    E  +AA     D  
Sbjct: 611 ------LREIAERV--VTTHADRIRGSPLEHASLVRAANVVETGGIEVTIAADEVPDDWR 662

Query: 778 KTVSKK 783
           +T+ ++
Sbjct: 663 ETLGER 668


>gi|448479213|ref|ZP_21604065.1| hypothetical protein C462_01682 [Halorubrum arcis JCM 13916]
 gi|445822491|gb|EMA72255.1| hypothetical protein C462_01682 [Halorubrum arcis JCM 13916]
          Length = 742

 Score =  402 bits (1034), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 258/726 (35%), Positives = 357/726 (49%), Gaps = 98/726 (13%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           S+    NRL  E SPYL QHA NPV+W  WG+EAF  AR+ DVP+F+SIGYS+CHWCHVM
Sbjct: 2   SQPTERNRLDGEASPYLQQHADNPVNWQPWGDEAFERAREHDVPVFVSIGYSSCHWCHVM 61

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
             ESFEDE VA ++N+ FV IKVDREERPDVD  +MT  Q + GGGGWPLS + +P+ KP
Sbjct: 62  AEESFEDESVAGVVNESFVPIKVDREERPDVDSTFMTVCQLVTGGGGWPLSAWCTPEGKP 121

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWD---------KKRDMLAQSGAFAIEQLSEAL 267
              GTYFPPE +   PGF+ +  ++ D+W          ++ D  A+S    +E +    
Sbjct: 122 FYVGTYFPPEPRQNHPGFRGLCERIADSWSDPEQREEMKRRADQWAESARDELESVPTPE 181

Query: 268 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLE 326
           +  +           + L   A    + YD   GGFGS   KFP P  I +++       
Sbjct: 182 AVGSDGEDTASPPGDDLLDTAAAAALRGYDEEHGGFGSGGAKFPMPGRIDLLM------- 234

Query: 327 DTGKSGEASEGQKMVLF----TLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 382
                  A  G+  +L     TL  MA GG++D +GGGFHRY+VD +W VPHFEKMLYD 
Sbjct: 235 ----RAYAGRGRDALLSAATGTLDGMANGGMYDQIGGGFHRYAVDRQWTVPHFEKMLYDN 290

Query: 383 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEG------ 436
            +L   YLD + L  D  Y+ +  + L +L R++   GG  FS  DA S   EG      
Sbjct: 291 AELPMAYLDGYRLAGDPAYARVASESLAFLDRELRHEGGAFFSTLDARSRPPEGRRGDDT 350

Query: 437 ----ATRKKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGK 491
                    EGAFYVWT +EV+ +L E A  L KE Y ++  GN +           +G 
Sbjct: 351 GDSDEDEDVEGAFYVWTPEEVDAVLDEPAASLAKERYGIRSGGNFE-----------RGT 399

Query: 492 NVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVIS 551
            V          A+      ++    L   R  LFD R +RPRP  D+KV+ +WNG  IS
Sbjct: 400 TVPTIAASVEELAADRDRSPDEVREALTAARTALFDAREERPRPARDEKVLAAWNGRAIS 459

Query: 552 SFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD--EQTHRLQHSFR 609
           +FARA   L                  + Y E+A  A  F R  LYD   +T  L   + 
Sbjct: 460 AFARAGDTLG-----------------EPYAEIAREALDFCRERLYDAESETGALARRWL 502

Query: 610 NGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTG 669
           +G  + PG+LDDYAF+  G LD+Y      + L +A+EL +   + F D + G  + T  
Sbjct: 503 DGDVRGPGYLDDYAFVACGALDVYAATGDPEPLGFALELADALVDEFYDADDGTIYFTRD 562

Query: 670 EDPS---------VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 720
            D           ++ R +E  D + PS   V+   L  L         D +R + E   
Sbjct: 563 RDADGTPDDDAGPLIARPQEFTDRSTPSSLGVAAETLALL---------DGFRTDGE--- 610

Query: 721 AVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVD---FENMLAAAHASYDLN 777
                 L+++A  V  +   AD +     +H  LV   + V+    E  +AA     D  
Sbjct: 611 ------LREIAERV--VTTHADRIRGSPLEHASLVRAANVVETGGIEVTIAADEVPDDWR 662

Query: 778 KTVSKK 783
           +T+ ++
Sbjct: 663 ETLGER 668


>gi|110668468|ref|YP_658279.1| thioredoxin domain-containing protein [Haloquadratum walsbyi DSM
           16790]
 gi|109626215|emb|CAJ52671.1| YyaL family protein [Haloquadratum walsbyi DSM 16790]
          Length = 768

 Score =  402 bits (1033), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 232/625 (37%), Positives = 327/625 (52%), Gaps = 73/625 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W + A   A   D PIFLS+GY+ CHWCHVM  ESFE
Sbjct: 8   NRLDNEASPYLTQHAENPVNWQPWDDRALEYAESADKPIFLSVGYAACHWCHVMAEESFE 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+ VA +LND FV IKVDREERPD+D++Y T  Q + GGGGWPLSV+L+PD KP   GTY
Sbjct: 68  DDTVATILNDSFVPIKVDREERPDLDRIYQTICQLVTGGGGWPLSVWLTPDGKPFYVGTY 127

Query: 223 FPPEDKYGR---PGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           FP  ++  R   PGF  I +    AW+  R  L        + L + L    + +   D 
Sbjct: 128 FPKTERSDRGDTPGFLEICQSFATAWENDRSELESRANQWADTLQDRLEVDTNVDTNIDV 187

Query: 280 L------------PQNA-----------LRLCAEQLSKSYDSRFGGFGS-APKFPRPVEI 315
                        PQ             L   +    ++ D+ +GGFGS  PKFP+   I
Sbjct: 188 DDDDDVPAPDIASPQTDSDADDDSTMDLLTSVSTAAIRATDNEYGGFGSRGPKFPQTGRI 247

Query: 316 QMML-YHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPH 374
           + ++  H++   +T      +        TL  MA GGI+DHVGGGFHRY+ D +W VPH
Sbjct: 248 EALIRAHAETNRETALDAATA--------TLDAMAAGGIYDHVGGGFHRYATDRKWTVPH 299

Query: 375 FEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET 434
           FEKMLYD  +L+ VYL A+  T    Y+ +  +   +L R++  P G  +S  D   A++
Sbjct: 300 FEKMLYDNAELSRVYLSAYQHTGRDRYARVAHETFAFLSRELQHPEGGFYSTLD---AQS 356

Query: 435 EGATRKKEGAFYVWTSKEVEDILGEHAI--LFKEHYYLKPTGNCDLSRMSDPHNEFKGKN 492
           EG    +EG FYVWT + + + + +  I  +  + + +   GN            F+G  
Sbjct: 357 EG----EEGRFYVWTPETIRNAITDQQIADIAIDRFGVTEGGN------------FEGST 400

Query: 493 VLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISS 552
           VL      S  A+K  +  ++ ++ L + R  LFD R  R RP+ D+K++ +WNGL ISS
Sbjct: 401 VLTATASVSQLATKYSLTTDEIMSQLADARDSLFDARMDRERPNRDEKILTAWNGLAISS 460

Query: 553 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP 612
            AR   IL++E                +Y E+A  A SFIR HL+D  + RL   +++G 
Sbjct: 461 LARGGLILETE----------------QYTELANDALSFIRTHLWDSDSGRLSRRYKDGD 504

Query: 613 SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDP 672
               G+LDDYAFL  G  DLY+     + L +A+ L  +  ELF D  G   +    +  
Sbjct: 505 VDETGYLDDYAFLARGAFDLYQTTGAVEHLCFAVTLAESIVELFYDAAGETLYLAPEDAE 564

Query: 673 SVLLRVKEDHDGAEPSGNSVSVINL 697
           S++ R ++  D + PS   ++V  L
Sbjct: 565 SLVARPQDLRDQSTPSSAGIAVQTL 589


>gi|392380898|ref|YP_005030094.1| conserved protein of unknown function; putative Thioredoxin and
           glycosidase domains [Azospirillum brasilense Sp245]
 gi|356875862|emb|CCC96610.1| conserved protein of unknown function; putative Thioredoxin and
           glycosidase domains [Azospirillum brasilense Sp245]
          Length = 672

 Score =  402 bits (1032), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 245/665 (36%), Positives = 352/665 (52%), Gaps = 71/665 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L  E SPYLLQH  NPV W AWG +AF  A++ + P+ LS+GY+ CHWCHVM  ESFE
Sbjct: 4   NLLGRETSPYLLQHKDNPVHWMAWGRDAFERAKRENKPVLLSVGYAACHWCHVMAHESFE 63

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +  +A L+N+ FV+IKVDREERPDVD++Y + +  L   GGWPL++FL+P+ +P  GGTY
Sbjct: 64  NPEIAGLMNELFVNIKVDREERPDVDQIYQSALAMLGQQGGWPLTMFLTPEAEPFWGGTY 123

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPP  +YGRPGF  +LR V + +  K + + ++    +  L +AL   A  N+   E+  
Sbjct: 124 FPPASRYGRPGFPDVLRGVAETYRNKPENVTRN----VAALKDALGKLA-ENRAAGEVDL 178

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L   A++L +  D   GG G APKFP+ V I  +L+  +    TGK       ++ V 
Sbjct: 179 AMLDQIADRLVREVDPFHGGIGHAPKFPQ-VPIFTLLW--RAWLRTGK----EPYREAVT 231

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  M++GGI+DH+GGGF RYSVDE W VPHFEKMLYD  QL ++    +   ++  + 
Sbjct: 232 NTLAHMSQGGIYDHLGGGFARYSVDEMWLVPHFEKMLYDNAQLLDLMTLVWQAEREPLFE 291

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
              R+ + ++ R+MI  GG   + +DADS   EG    +EG FY+W  +E++ +LG  A 
Sbjct: 292 TRIRETVGWVLREMIAEGGGFAATQDADS---EG----EEGLFYIWNEEEIDRLLGPGAE 344

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-----IELNDSSASASKLGMPLEKYLNI 517
           +FK  Y + P GN            ++G  +L     IE  D+   A+            
Sbjct: 345 VFKRAYGVTPQGN------------WEGATILNRLHRIEALDAETEAT------------ 380

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
           L E R  L+  R KR +P  DDKV+  WNGL+I++ A+A  +                 D
Sbjct: 381 LAEQRAILWREREKRIKPGWDDKVLADWNGLMIAALAQAGMVF----------------D 424

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGS 637
              ++  A+SA +F+R  + ++   RL HS+R G  K    LDDYA +    L L+E   
Sbjct: 425 EPAWIAAAQSAYAFVRDRMTEDG--RLLHSWRAGQLKHRATLDDYAHMARAALALHEATG 482

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
               L  A       D  F D + GGYF T  +   +++R K   D A PSGN      L
Sbjct: 483 DAGALEQARAWVRVLDAHFWDAQAGGYFYTADDADDLIVRTKSAGDAATPSGNGTM---L 539

Query: 698 VRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGH 757
             LA++   +    YR+ A+   A F   L      +P    AA++L       +V+VG 
Sbjct: 540 AVLATLHHRTGEAAYRERADALAAAFSGELSRNFFPLPTYLNAAELLQ--KALQIVIVGD 597

Query: 758 KSSVD 762
             + D
Sbjct: 598 PQASD 602


>gi|120434573|ref|YP_860266.1| hypothetical protein GFO_0204 [Gramella forsetii KT0803]
 gi|117576723|emb|CAL65192.1| protein containing DUF255 [Gramella forsetii KT0803]
          Length = 682

 Score =  402 bits (1032), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 231/634 (36%), Positives = 338/634 (53%), Gaps = 52/634 (8%)

Query: 96  HSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHV 155
           +++ KHTN L  E SPYLLQHAHNPVDW  W +E   +A+K +  + +S+GYS CHWCHV
Sbjct: 3   NNQEKHTNDLIHESSPYLLQHAHNPVDWKPWNDENLDQAQKENKLLLISVGYSACHWCHV 62

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           ME ESFEDE VA+L+N  ++ IKVDREERPDVD+VYM  VQ + G GGWP+++   PD +
Sbjct: 63  MEHESFEDEAVAELMNVNYICIKVDREERPDVDQVYMNAVQIMTGMGGWPMNIVALPDGR 122

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           P+ GGTYF  E       +   L+++   ++ + + L +      E+L + L        
Sbjct: 123 PVWGGTYFRKEQ------WMEALQQISHLFNSQPEKLLEYA----EKLEQGLKQIQIIEP 172

Query: 276 LPDE-LPQNALRL-CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGE 333
           + ++  P     +   E+  +S+D + GG+  +PKF  P   + +L ++ +  D      
Sbjct: 173 VKEQNKPHKDFFIPIIEKWKRSFDPKNGGYQRSPKFMMPNNYEFLLRYAFQNSD------ 226

Query: 334 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
             E +   L TL  ++ GG+ D + GGF RYSVDE+WHVPHFEKMLYD  QL  +Y   +
Sbjct: 227 -KELKSHCLLTLNRISWGGVFDPIEGGFSRYSVDEKWHVPHFEKMLYDNAQLVQLYSKTY 285

Query: 394 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
            +TK+ +Y  + +  L ++  +M    G  +SA DADSA   G  +K+EGA+YVWT + +
Sbjct: 286 KITKNNWYKEVVKQTLQFISAEMTDESGAFYSALDADSANENG--KKEEGAYYVWTKENL 343

Query: 454 EDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
           + ILG    +F E+Y +   G  +               VLI        +  L +P E 
Sbjct: 344 KSILGNEFEIFSEYYNINNYGKWEADNY-----------VLIRTKSLDQLSQDLDIPRED 392

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
               + +C  KL   +SKR +P LDDK + SWN L+IS +  A K  ++           
Sbjct: 393 LQQRIAQCNLKLKKAKSKREKPGLDDKSLTSWNALMISGYTEAYKAFRN----------- 441

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
                 EY+E AE  A+FI  +   E   RL HS++NG S   G+L+DYAF IS  LDLY
Sbjct: 442 -----GEYLEAAEKNAAFILENQLQE-NGRLYHSYKNGKSTINGYLEDYAFSISAFLDLY 495

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           E     ++L  A  L +  D+ F D   G YF T+ +D  ++ +  E  D   P+ NS  
Sbjct: 496 ECTFEQEYLGRARNLIDVTDKDFTDSVSGLYFFTSDKDRELVTKTIEISDNVIPASNSEM 555

Query: 694 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 727
             N+ R   +    K   Y   AE  L +   ++
Sbjct: 556 AKNIFRFGKLTGDMK---YVGKAEKMLQIVMDKI 586


>gi|116754985|ref|YP_844103.1| hypothetical protein Mthe_1697 [Methanosaeta thermophila PT]
 gi|116666436|gb|ABK15463.1| protein of unknown function DUF255 [Methanosaeta thermophila PT]
          Length = 669

 Score =  402 bits (1032), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 252/685 (36%), Positives = 361/685 (52%), Gaps = 77/685 (11%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           ++  NRLA E SPYLLQHA+NPVDW+ W  EAF  AR  D PIFLSIGYSTCHWCHVM  
Sbjct: 2   DRKPNRLAGESSPYLLQHAYNPVDWYPWSPEAFERARAEDRPIFLSIGYSTCHWCHVMAR 61

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE +A++LN  FV +KVDREERPD+D +YM   Q + G GGWPL++ +SPD  P  
Sbjct: 62  ESFEDERIAEMLNRAFVCVKVDREERPDIDAIYMEACQIITGRGGWPLTIIMSPDGIPFF 121

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
             TY P + + G  G + ++  V++ W  +R  L   G   +  + +A +   +SN    
Sbjct: 122 AATYIPKDGRLGMMGLRELIPLVEELWRNRRSELTSLGFKVLNAMRKADTHLQASNADES 181

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
            L +  L     +LS  +D   GGFG APKFP     Q +L+  +    TG+     +  
Sbjct: 182 TLSRAYL-----ELSGIFDWTSGGFGRAPKFPLA---QNLLFLLRYWHRTGE----MKAL 229

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
           +MV  TL+ M  GGI+D +  GFHRYS D  W VPHFEKMLYDQ  ++ VYL+A+  T  
Sbjct: 230 EMVELTLREMRCGGIYDQLAYGFHRYSTDSSWGVPHFEKMLYDQALMSVVYLEAYQATGK 289

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             Y+ +  +IL ++  D+  P G   SA DA+S          EG +Y+WT  ++ D LG
Sbjct: 290 RDYAIVADEILGFVAEDLRSPDGAFCSALDAESDNI-------EGGYYLWTMDQLRDALG 342

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASASKLGMPLEKYLNI 517
           +      E + L+P G  D            GKNVL I L    +       P+      
Sbjct: 343 DDLKKALEVFVLEPIGGSD------------GKNVLRISLKGELSEFKHTSEPI------ 384

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
               RRKL D RS R +P  D+KV+  WNGL+I++F+R +++L  E              
Sbjct: 385 ----RRKLLDARSLRRKPFRDEKVLADWNGLMIAAFSRGAQVLGDE-------------- 426

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGS 637
              ++ +A  AA F+   ++ +    L HS++         LDDYAFLI GL++LY+ G 
Sbjct: 427 --RWLRIASEAADFVLSSMHRDGM--LMHSYKGSRVS---ILDDYAFLIFGLIELYQAGF 479

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
             ++L  A  L +     F D +GG Y+ T  E   ++L+ KE  DGA PSG S++ +++
Sbjct: 480 DGRYLERAEILCDEMVSHFSDPDGGFYY-TMKEQSDIILQRKEIRDGAIPSGYSMATMDM 538

Query: 698 VRLASIVAGSKSDYYRQNAEH--SLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 755
           + L  I+        R + E   S+++    +  +   V L+  A D+   PS + + +V
Sbjct: 539 LLLGKILG-------RPDLEEIASMSLRHISMASLPAQVGLL-IALDLALGPSHE-IAIV 589

Query: 756 GHKSSVDFENMLAAAHASYDLNKTV 780
           G   +     ML A  + Y   K V
Sbjct: 590 GDADNT--RTMLRALWSVYAPRKVV 612


>gi|163786447|ref|ZP_02180895.1| hypothetical protein FBALC1_14717 [Flavobacteriales bacterium
           ALC-1]
 gi|159878307|gb|EDP72363.1| hypothetical protein FBALC1_14717 [Flavobacteriales bacterium
           ALC-1]
          Length = 705

 Score =  401 bits (1031), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 223/620 (35%), Positives = 347/620 (55%), Gaps = 59/620 (9%)

Query: 89  RTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYS 148
           +T  S + + +   N L  E SPYLLQHA+NPVDW AW +E+   A++++  I +S+GYS
Sbjct: 20  QTNTSVTKNEDNKANDLIKETSPYLLQHAYNPVDWKAWNKESLELAKEQNKLIVISVGYS 79

Query: 149 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 208
            CHWCHVME ESFE++ VA+L+N+ F+SIKVDREERPDVD++YM+ VQ + G GGWPL+ 
Sbjct: 80  ACHWCHVMEEESFENDSVARLMNENFISIKVDREERPDVDQIYMSAVQLMTGSGGWPLNC 139

Query: 209 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 268
              PD +P+ GGTYF       +P +  IL  +   +    + +    A+A E+L+E + 
Sbjct: 140 ITLPDGRPVFGGTYFT------KPQWTKILEDMSSLYKTNPEKVI---AYA-EKLTEGVK 189

Query: 269 ASASSNKLPDELPQNALRL--CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 326
            +   N   + +  N L++    ++L KS D + GG  +APKFP P  +  +L +S + +
Sbjct: 190 NADLINVNKEGIQFNKLQIESTVDELKKSLDFKLGGQKNAPKFPMPSNLDFLLRYSFQND 249

Query: 327 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 386
           D        + Q+ V+ +L  MA GGI+D +GGGF RYSVD+RWH+PHFEKMLYD  QL 
Sbjct: 250 D-------KDLQQFVMTSLNKMANGGIYDQIGGGFSRYSVDDRWHIPHFEKMLYDNAQLV 302

Query: 387 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 446
           ++Y  A+  TK+  +  I  + L+++ R++    G  +S+ DADS   EG    +EG FY
Sbjct: 303 SLYSKAYQFTKNEDFKTIVTETLNFIDRELTQEEGAFYSSLDADSKTKEGEL--EEGVFY 360

Query: 447 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRM----SDPHNEF-KGKNVLIELNDSS 501
            WT  +++  LGE   LFK +Y +  TG  +  +     +   NEF K  N+ I+     
Sbjct: 361 TWTKDDLKTELGEDFDLFKSYYNINATGKWEKDQFILYKTKTDNEFIKTNNITIK----- 415

Query: 502 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 561
                     E +  +L   ++KL++VR+KR RP LDDK + SWN L++ ++  A ++  
Sbjct: 416 ----------ELHSKVLA-WKKKLYEVRAKRERPRLDDKALTSWNALMLKAYVDAYRVF- 463

Query: 562 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 621
                          +++ Y++ A   A FI+ +   +    L H+++N  S   GF +D
Sbjct: 464 ---------------NKQSYLDKAIDNAKFIKENQI-QNNGSLFHNYKNKKSTIEGFSED 507

Query: 622 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 681
           YA  I+  ++LY+     +WL  A EL +     F ++E   ++ T+  + +++ R  E 
Sbjct: 508 YAHTITAYIELYQATFNEQWLNTAKELMDYAIAHFSNKETSMFYFTSDNETNLITRKTEV 567

Query: 682 HDGAEPSGNSVSVINLVRLA 701
            D   PS NSV    L +L 
Sbjct: 568 FDNVIPSSNSVLADCLFKLG 587


>gi|359690220|ref|ZP_09260221.1| hypothetical protein LlicsVM_17604 [Leptospira licerasiae serovar
           Varillal str. MMD0835]
 gi|418751442|ref|ZP_13307728.1| PF03190 family protein [Leptospira licerasiae str. MMD4847]
 gi|418758573|ref|ZP_13314755.1| PF03190 family protein [Leptospira licerasiae serovar Varillal str.
           VAR 010]
 gi|384114475|gb|EIE00738.1| PF03190 family protein [Leptospira licerasiae serovar Varillal str.
           VAR 010]
 gi|404274045|gb|EJZ41365.1| PF03190 family protein [Leptospira licerasiae str. MMD4847]
          Length = 695

 Score =  401 bits (1031), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 255/666 (38%), Positives = 354/666 (53%), Gaps = 63/666 (9%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           +K  NRLA+E SPYLLQH+ NPVDWF W EEAF +A+  +  IFLSIGY+TCHWCHVME 
Sbjct: 5   DKKLNRLASEKSPYLLQHSANPVDWFPWSEEAFVKAKSENKMIFLSIGYATCHWCHVMEK 64

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE  A++LN  +VSIKVDREERPDVD++YM  + A+   GGWPL++FL+P+ KP+ 
Sbjct: 65  ESFEDETTAEVLNRDYVSIKVDREERPDVDRIYMDALHAMGQQGGWPLNMFLTPEGKPIT 124

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE-----ALSASASS 273
           GGTYFPP  KYGR  F  +L  +   W  K++ L ++     + L E     AL+ +A  
Sbjct: 125 GGTYFPPVPKYGRKSFTEVLGILTGLWKDKKEELLEASEDLTKHLKESEETRALAGTADI 184

Query: 274 NKLPDELPQNALRLCAEQLSKSYDSRFGGF--GSAPKFPRPVEIQMML-YHSKKLEDTGK 330
           +    E+ +N   L      + YD  + GF   S  KFP  + +  +L YH        K
Sbjct: 185 SSPGSEVFENGFLL----YDRLYDPEYAGFKSNSVNKFPPSMGLSFLLRYH--------K 232

Query: 331 SGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYL 390
           S    +  +MV  TL  M KGGI+D +GGG  RYS D  W VPHFEKMLYD        +
Sbjct: 233 STGEPKALEMVEETLTAMKKGGIYDQIGGGLCRYSTDHHWLVPHFEKMLYDNSLFLEALV 292

Query: 391 DAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTS 450
           + +    +  Y     D+++YL RDM  PGG I SAEDADS   EG    +EG FY+WT 
Sbjct: 293 ECYQAVGEEKYKDYAYDVIEYLHRDMRLPGGGIASAEDADS---EG----EEGLFYLWTK 345

Query: 451 KEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL-GM 509
           +EV ++ G+ + L  E + +   GN            F+ KN+L E      + S+L G+
Sbjct: 346 EEVREVCGQDSSLLDEFWNITEKGN------------FEEKNILHE--SFRMNFSRLHGL 391

Query: 510 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 569
              +   I+   R+KL + RS R RP  DDK++ SWN L I +  +A+            
Sbjct: 392 EPSELEEIVSRNRKKLLEKRSTRIRPLRDDKILFSWNCLYIKALTKAAMAFGD------- 444

Query: 570 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 629
                     + +  AE    F+ ++L  E   RL   FR G +K   +  DYA  +   
Sbjct: 445 ---------GDLLREAEETYKFLEKNLIREDG-RLLRRFREGEAKILAYSTDYAEFVLAS 494

Query: 630 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPS 688
           L L++ G G ++L  +I  + T++ + L R   G F  +G D   LLR   D +DG EPS
Sbjct: 495 LYLFQAGKGFRYLENSI--RYTEEAIRLFRSPAGVFFDSGIDGEALLRRTVDGYDGVEPS 552

Query: 689 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPS 748
            NS      V L S +    S+ Y Q A+   + F+  L+   M+ P M  A  +   P 
Sbjct: 553 ANSSFATAFV-LLSKLGVVDSEKYLQYADSIFSYFKPELEAYPMSYPYMLSALWLRKSPG 611

Query: 749 RKHVVL 754
           R+  V+
Sbjct: 612 RELAVV 617


>gi|149369679|ref|ZP_01889531.1| hypothetical protein SCB49_07627 [unidentified eubacterium SCB49]
 gi|149357106|gb|EDM45661.1| hypothetical protein SCB49_07627 [unidentified eubacterium SCB49]
          Length = 703

 Score =  401 bits (1031), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 233/621 (37%), Positives = 338/621 (54%), Gaps = 49/621 (7%)

Query: 83  VVAMAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIF 142
           ++++   T +   ++ + +TN L+ E SPYLLQHA+NPVDW AW  E  A A+K +  + 
Sbjct: 13  ILSVLACTSSEQKNNTSLYTNSLSKETSPYLLQHANNPVDWRAWNNETLAMAKKENKLMI 72

Query: 143 LSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGG 202
           +SIGY+ CHWCHVME ESFED  VA  +N+ F+S+KVDREERPD+D++Y+  VQ + G  
Sbjct: 73  ISIGYAACHWCHVMEHESFEDSLVAATMNENFISVKVDREERPDLDQIYINAVQLMTGSA 132

Query: 203 GWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQ 262
           GWPL+V   PD +P+ GGTYF  ED      + T+L+K++    +  + L +       Q
Sbjct: 133 GWPLNVVTLPDGRPVWGGTYFKKED------WITVLQKIQKINTENPEKLNEIAG----Q 182

Query: 263 LSEALSA--SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLY 320
           L E +      + N    +L    L         S+D RFGG+  APKF  P   + +L 
Sbjct: 183 LEEGIKNLDLVALNTEDVDLKNYNLDEVIHTWKSSFDHRFGGYKRAPKFMMPSNYEYLLR 242

Query: 321 HSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLY 380
           ++ + +D        E Q  VLFTL  MA GGI+D +GGGF RYSVDE+WHVPHFEKMLY
Sbjct: 243 YAVQDKD-------QELQDYVLFTLDQMAYGGIYDAIGGGFSRYSVDEKWHVPHFEKMLY 295

Query: 381 DQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRK 440
           D  QL ++Y +A+ LTK   Y  I  + L ++  +M    G  +S+ DADS   +G    
Sbjct: 296 DNAQLVSLYSNAYKLTKKPLYKEIITETLAFIFEEMTTEEGAFYSSLDADSLTEDGTL-- 353

Query: 441 KEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 500
           +EGAFYV+T++E++  LG    LF  +Y +   G  +            GK VLI   D 
Sbjct: 354 EEGAFYVYTAQELKSQLGTDFDLFAAYYNVNNFGKWE-----------DGKYVLIRDEDD 402

Query: 501 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 560
           ++ A  LG+  E     +   +  L   R  R +P LDDK + SWNGL++  +       
Sbjct: 403 ASIAKDLGISTEALQRKVANWKAILKAYRGFRSKPRLDDKTLTSWNGLMLKGYV------ 456

Query: 561 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLD 620
             +A +A+ N        KEY++ A   A FI+     E    L H+++ G S   G+L+
Sbjct: 457 --DAYTALGN--------KEYLDAALKNAVFIKDKQLKEDG-SLYHNYKEGRSTINGYLE 505

Query: 621 DYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKE 680
           DYA +ISG + LYE  +  +WL  A +L +     F D E G ++ T+ EDP ++ R  E
Sbjct: 506 DYASVISGFISLYEVTADVQWLDLAKKLTDYTFTKFYDTESGMFYFTSSEDPKLVARSVE 565

Query: 681 DHDGAEPSGNSVSVINLVRLA 701
             D    S N++   N+  L 
Sbjct: 566 YRDNVIASSNAIMAQNIFVLG 586


>gi|448604533|ref|ZP_21657700.1| thioredoxin domain containing protein [Haloferax sulfurifontis ATCC
           BAA-897]
 gi|445743942|gb|ELZ95422.1| thioredoxin domain containing protein [Haloferax sulfurifontis ATCC
           BAA-897]
          Length = 708

 Score =  401 bits (1031), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 243/663 (36%), Positives = 347/663 (52%), Gaps = 76/663 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W E A   AR+ D PIFLS+GYS CHWCHVM  ESF 
Sbjct: 8   NRLDEEQSPYLRQHADNPVNWQPWDETALDAAREADKPIFLSVGYSACHWCHVMADESFS 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  +A++LN+ FV +KVDREERPD+D++Y T  Q + GGGGWPLSV+L+P+ KP   GTY
Sbjct: 68  DPDIAEVLNEQFVPVKVDREERPDLDRIYQTICQLVTGGGGWPLSVWLTPEGKPFFVGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDML---AQSGAFAI-EQLSEA--LSASASSNKL 276
           FPPE + G PGF+ ++    ++W   RD +   A+    AI ++L E   ++  A  +++
Sbjct: 128 FPPEPRRGAPGFRDLVESFAESWRTDRDEIENRAEQWTSAITDRLEETPDVAGEAPGSEV 187

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
            D   Q ALR          D   GGFG   PKFP+P  I  +L   +    +G+     
Sbjct: 188 LDTTVQAALR--------GADRDHGGFGGDGPKFPQPGRIDALL---RGYAVSGR----H 232

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
           E   +   +L  MA GG+ DH+GGGFHRY VD  W VPHFEKMLYDQ  LA  YLDA  L
Sbjct: 233 EALDVARQSLDAMANGGLRDHLGGGFHRYCVDREWTVPHFEKMLYDQAGLAARYLDAARL 292

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           T +  Y+ +  +  +++RR++    G +F+  DA S         +EG FYVWT  +V  
Sbjct: 293 TGNESYATVAAETFEFVRRELTHDDGGLFATLDAQSG-------GEEGTFYVWTPDDVRG 345

Query: 456 ILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS-ASKLGMPLEK 513
           +L E  A LF + Y + P GN            F+ K  ++ ++ ++A  A +  +   +
Sbjct: 346 LLPELDADLFCDRYGVTPGGN------------FENKTTVLNVSATTADLADEYDLDESE 393

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
             + L + R+ LF  R  R RP  D+KV+  WNGL+IS+FA+ + +L+ ++         
Sbjct: 394 VEDRLEKARKALFAAREGRERPARDEKVLAGWNGLMISAFAQGAVVLEDDS--------- 444

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
                    + A  A  F+R  L+D++T  L     NG  K  G+L+DYAFL  G  DLY
Sbjct: 445 -------LADDARRALDFVRERLWDDETATLSRRVMNGEVKGDGYLEDYAFLARGAFDLY 497

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           +       L +A++L       F D + G  + T     S++ R +E  D + PS   V+
Sbjct: 498 QATGDLAPLSFALDLARATRREFYDADAGTLYFTPESGESLVTRPQEPTDQSTPSSLGVA 557

Query: 694 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLK--------------DMAMAVPLMCC 739
               + L      +  D + + A+  L  F  R++                A  VP +  
Sbjct: 558 TSLFLDLEQF---APEDGFGEVADAVLGSFANRVRGSPLEHVSLALAAEKAASGVPELTI 614

Query: 740 AAD 742
           AAD
Sbjct: 615 AAD 617


>gi|332663431|ref|YP_004446219.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
 gi|332332245|gb|AEE49346.1| protein of unknown function DUF255 [Haliscomenobacter hydrossis DSM
           1100]
          Length = 686

 Score =  401 bits (1031), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 234/610 (38%), Positives = 334/610 (54%), Gaps = 54/610 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYLLQHAHNPVDW+AW  EAF  A+K D PI +SIGYSTCHWCHVME ESFE
Sbjct: 2   NRLQFETSPYLLQHAHNPVDWYAWKPEAFERAKKEDKPILVSIGYSTCHWCHVMERESFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +  VA ++N+ F++IKVDREERPDVD +YM     + G GGWPL+ FL+PD +P + GTY
Sbjct: 62  NADVAAIMNENFINIKVDREERPDVDHIYMEACVIMTGSGGWPLNCFLTPDGRPFLAGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSN--KLPDEL 280
           +PP   + RP +  +L  V D +  +R  + +  +  I  + +  S   + N  +L    
Sbjct: 122 YPPLAAFNRPSWPQLLHHVTDVYRNRRKDVEEQASRLIGNIEQTNSYFLAKNEAELSGIN 181

Query: 281 PQNALRL--CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHSKKLEDTGKSGEASEG 337
           P N + L    + L K++D + GGFG+APKFP  + +Q +L YH         +GE  E 
Sbjct: 182 PFNPVVLHNVFQTLKKNFDLQDGGFGAAPKFPGSMALQFLLDYHH-------FTGE-KEA 233

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
            +  +F+L  M +GGI+D +GGGF RY+ D  W VPHFEKMLYD   L  +  D + +T+
Sbjct: 234 LEHTVFSLDRMIRGGIYDQLGGGFARYATDRAWLVPHFEKMLYDNALLVGLLSDTYKVTQ 293

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
              +     + L ++ R+M    G  +SA DADS   EG    +EG FYVW+++E+  + 
Sbjct: 294 QPIFRRAIEETLGWIEREMTSADGGFYSALDADS---EG----EEGKFYVWSAEEIAAVC 346

Query: 458 G--EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
              E A LF  +Y ++P GN            ++G N+L      +A A + G   E   
Sbjct: 347 PSVEDAALFSSYYGVEPLGN------------WEGHNILWCPLPLAAFAVEAGQSPEALE 394

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
                 R +L  VR +R RP LDDK+++SWN L+ S++A+A   L +E            
Sbjct: 395 ARFAPIRTQLMAVRDERIRPGLDDKILLSWNALMASAYAKAYTALGNET----------- 443

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFR----NGPSKAPGFLDDYAFLISGLLD 631
                Y   A     F+      ++   L H+++       ++   FLDDYA+ I+ L+D
Sbjct: 444 -----YKVAALRNVDFLLEKFKRDEIGGLYHTYKKVKDQDQAQYAAFLDDYAYFIAALID 498

Query: 632 LYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNS 691
           +YE    T++L  A +L       FLD     ++ T+ +   V+LR  E +D A PSGNS
Sbjct: 499 VYEISLETRYLRQAADLTEYTLAHFLDDTRNLFYFTSKDQQDVVLRKIELYDNALPSGNS 558

Query: 692 VSVINLVRLA 701
             V NL RL 
Sbjct: 559 SMVQNLQRLG 568


>gi|448726262|ref|ZP_21708672.1| hypothetical protein C448_06453 [Halococcus morrhuae DSM 1307]
 gi|445795880|gb|EMA46400.1| hypothetical protein C448_06453 [Halococcus morrhuae DSM 1307]
          Length = 709

 Score =  401 bits (1030), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 234/608 (38%), Positives = 321/608 (52%), Gaps = 42/608 (6%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV W  W ++A   AR+RDVPIFLSIGYS CHWCHVM  ESF+
Sbjct: 6   NRLDEEASPYLRQHADNPVHWQPWDDDALDAARERDVPIFLSIGYSACHWCHVMADESFD 65

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  VA+ LN  FV IKVDREERPD+D++Y T    + G GGWPLSV+L+PD +P   GTY
Sbjct: 66  DPVVAERLNKDFVPIKVDREERPDLDRLYQTVAAMVSGQGGWPLSVWLTPDGRPFYVGTY 125

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP + K G+PGF  +L  + D+WD +R+ +        + ++  L  +  S   P E+  
Sbjct: 126 FPRKAKRGQPGFLDLLDSIADSWDDEREDIEGRADQWADAMAGELEGTPDS---PGEVSP 182

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L   A++     D   GGFG   KFP+   + +++   +  E TG+       +++ +
Sbjct: 183 GLLETAAQRAVSDADREHGGFGRGQKFPQTGRLHLLM---QAYERTGRDA----FREVAV 235

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
             L  MA GG+ DH GGGFHRY  D  W VPHFEKMLYD  +L   Y+  + LT +  Y+
Sbjct: 236 EALDAMADGGLRDHAGGGFHRYVTDREWTVPHFEKMLYDNAELVRAYIAGYRLTGEERYA 295

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-- 460
            I R+ L ++ R++  P G  FS  DA S     +   +EGAFYVWT  EV + + +   
Sbjct: 296 EIARETLGFVERELRHPDGGFFSTLDAQSEGE--SGEHEEGAFYVWTPPEVHEAIDDEFA 353

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
           A LF E Y +   GN +            GK VL         A + G   E+    L  
Sbjct: 354 ADLFCERYGITEAGNFE-----------DGKTVLTLDTAIDGLADEHGTTTEEIEADLER 402

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R  +F  R+ R RP  D+KV+  WNGL+IS+FA A   L                  + 
Sbjct: 403 AREAIFAARTDRDRPARDEKVLAGWNGLMISAFAEAGLALD-----------------ET 445

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           Y E A +A  F+R  L+DE   +L   F+ G  K  G+L+DYAFL  G L+ YE     +
Sbjct: 446 YGETAVAALDFVREQLWDEDEQQLARRFKGGEVKIDGYLEDYAFLARGALNCYEATGEVE 505

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
           +L +A++L       F D E G  + T     S++ R +E  D + PS   V+V  L+ L
Sbjct: 506 YLTFALDLGRAVVREFFDAEEGTLYFTPQSGESLVARPQELDDQSTPSSTGVAVDTLLAL 565

Query: 701 ASIVAGSK 708
           +    G +
Sbjct: 566 SQFAPGEE 573


>gi|317122770|ref|YP_004102773.1| hypothetical protein [Thermaerobacter marianensis DSM 12885]
 gi|315592750|gb|ADU52046.1| hypothetical protein Tmar_1963 [Thermaerobacter marianensis DSM
           12885]
          Length = 738

 Score =  401 bits (1030), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 270/703 (38%), Positives = 367/703 (52%), Gaps = 82/703 (11%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           ++  NRL  E SPYL QHA+NPVDW+ WG+EA   AR  D PI LSIGY+ CHWCHVME 
Sbjct: 5   DRQPNRLIREASPYLQQHAYNPVDWYPWGQEAIERARAEDRPILLSIGYAACHWCHVMER 64

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           E FED  +A+ +N  FV++KVDREERPD+D+VY T  Q L  GGGWPL+VFL+PDLKP  
Sbjct: 65  ECFEDPAIAEQMNRGFVNVKVDREERPDLDQVYQTAAQILGSGGGWPLTVFLTPDLKPFF 124

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFPPED++G PGF  +L  V DA+  +RD + +     +E L  +     ++ +   
Sbjct: 125 AGTYFPPEDRHGLPGFPKVLDAVLDAYRHRRDDVERVANRVVEILRRSAGGPGAAEEPAG 184

Query: 279 ELPQNA-----LRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-------------- 319
             P        ++  A ++++ YD ++GGFG APKFP    + ++L              
Sbjct: 185 AAPAREAARQWIQRAATRIARRYDPQYGGFGRAPKFPHATGLAVLLRAGVARTPGGPGPS 244

Query: 320 ----YHSKKLEDTGKSGEAS-------EGQK----MVLFTLQCMAKGGIHDHVGGGFHRY 364
                 S     T +SG A        E  +    M L TLQ MA GG+ DH+ GGFHRY
Sbjct: 245 GTTGSGSSGSPGTARSGTADLVAGDVPENPRRHLDMALHTLQAMALGGLFDHLAGGFHRY 304

Query: 365 SVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIF 424
           + D  W +PHFEKMLYDQ QL  +YLDA+ LT D FY+ + R  L ++  +M  P G   
Sbjct: 305 ATDRAWLIPHFEKMLYDQAQLVPLYLDAYRLTGDPFYAGVARQTLHFVLDEMTAPEGGFI 364

Query: 425 SAEDADSAETEGATRKKEGAFYVWTSKEVEDILG--EHAILFKEHYYLKPTGNCDLSRMS 482
           S  DADS   EG    +EGA+YVWT  ++ + LG  + A L    + +   GN +     
Sbjct: 365 STLDADS---EG----REGAYYVWTPDQLREALGDPDEAALAARWFGVTEEGNFE----- 412

Query: 483 DPHNEFKGKNVL---IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDD 539
                  G  VL   +   D  A A + G   ++    L   RR+L D R +R  P  DD
Sbjct: 413 ------DGTTVLYRAVADQDLPALAREWGTNRDELQRRLESIRRRLLDARRRRTPPGRDD 466

Query: 540 KVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE 599
           K++V WNGL+I++FA+A+ +L                D   Y   A  AA FI   L   
Sbjct: 467 KILVGWNGLMIAAFAQAAPVL----------------DEPGYAAAARRAAEFILGTL--R 508

Query: 600 QTH-RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLD 658
           + H RL H++R  P   PGFL DYAFLI GLL L+      +WL  A  L     E F D
Sbjct: 509 RPHGRLLHAYRGRPLDVPGFLPDYAFLIGGLLALHAADGDPRWLEEADRLARPMIETFWD 568

Query: 659 REGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEH 718
              G +++   E  + L+R  E  D A P+G++ +   L RLA I   +  + YR+ AE 
Sbjct: 569 DAAGVFYDAPEEAGTPLVRPVELFDQALPAGSAAAATVLARLAVI---TGDEEYRRIAEA 625

Query: 719 SLAVFETRLKDMAMAVP-LMCCAADMLSVPSRKHVVLVGHKSS 760
            L        +  +A+   +   AD L       V LVG  ++
Sbjct: 626 YLRRAAALAAEQPLAMASTVLLQADQLE--GYTEVTLVGDPAA 666


>gi|148264330|ref|YP_001231036.1| hypothetical protein Gura_2283 [Geobacter uraniireducens Rf4]
 gi|146397830|gb|ABQ26463.1| protein of unknown function DUF255 [Geobacter uraniireducens Rf4]
          Length = 700

 Score =  401 bits (1030), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 259/678 (38%), Positives = 360/678 (53%), Gaps = 60/678 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL    SPYLLQHA NPVDW+ WGE+AFA+A   D PIFLSIGY+TCHWCHVME E+FE
Sbjct: 33  NRLIFAMSPYLLQHATNPVDWYPWGEDAFAKAAADDKPIFLSIGYATCHWCHVMEHEAFE 92

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  VA + N +F+ IKVDREERPD+D+ YM   Q + G GGWPL++F++P+ KP    TY
Sbjct: 93  DREVAAVFNRFFICIKVDREERPDIDEQYMAVAQMMTGSGGWPLNIFMTPEKKPFFAATY 152

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE-LP 281
            P   + G PG   IL +V + W  +R  L Q     IE L+        S  LPD  L 
Sbjct: 153 MPRTPRMGMPGIIQILERVAELWRTERQKLEQDSDVTIEALTHHFQPHPGS--LPDMVLV 210

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           QNA     +QL++ YD  +GGFG+ PKFP P+ +  +L   K      +SG  +    MV
Sbjct: 211 QNAY----QQLTEMYDDLWGGFGNVPKFPMPLYLTFLLRFWK------RSGNGAS-LAMV 259

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             TL+ + +GGI+D +G GFHRY+VD +W VPHFEKMLYDQ  +A  YLDAF  T   FY
Sbjct: 260 EHTLRMLRQGGIYDQIGFGFHRYAVDRQWLVPHFEKMLYDQALIAIGYLDAFQATAVPFY 319

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EH 460
             +  ++  Y+  +M  P G  F+ +DAD   TEG    +EG +Y+WT  E+   +G + 
Sbjct: 320 RQVAEEVFAYVLGEMTSPEGGFFAGQDAD---TEG----EEGNYYIWTPAEIAAAIGHDE 372

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
           A +F           C L  +++  N F+G+N+L         A++  +  E     L  
Sbjct: 373 AQVF-----------CRLFDVTEKGN-FEGRNILHLPVPPETFAAREAILTEVLTADLER 420

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R  L  VR  R RP  D+KV+ +WNGL+I++ AR   +                S  + 
Sbjct: 421 WRHTLLKVRGNRIRPFRDEKVLTAWNGLMIAALARGYAL----------------SGEER 464

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           ++  A+ AA+FI   L      RL  SF  G +  P FLDDYAF + GL++L++     +
Sbjct: 465 FLAAAKRAAAFIGTRL-TSPGGRLMRSFHLGEASVPAFLDDYAFFVWGLIELHQVTLEPE 523

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSGNSVSVINLVR 699
           +L  A  L +    LF   +GG Y   TG D   L  +++   DG  PSGNSV+  +L R
Sbjct: 524 FLDSARFLADEMLRLFHSGKGGLY--ETGLDSEQLPVIRQSARDGVLPSGNSVAAFDLFR 581

Query: 700 LASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKS 759
           L  I    +   + ++ E  +  F   +    +A      A+D    P    V L G++ 
Sbjct: 582 LGRITGDGR---FLESGEAVVRTFMGDVTRQPLASLNFLSASDYHLGPEVT-VTLAGNRE 637

Query: 760 SVDFENMLAAAHASYDLN 777
            +    ML A H  +  N
Sbjct: 638 ELG--GMLDAVHRRFIPN 653


>gi|336113948|ref|YP_004568715.1| hypothetical protein BCO26_1270 [Bacillus coagulans 2-6]
 gi|335367378|gb|AEH53329.1| protein of unknown function DUF255 [Bacillus coagulans 2-6]
          Length = 629

 Score =  400 bits (1029), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 240/613 (39%), Positives = 337/613 (54%), Gaps = 58/613 (9%)

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           ME ESFE+E VA++LN+ FV+IKVDREERPD+D +YM   Q + G GGWPLSVFL+P+  
Sbjct: 1   MERESFENEEVARILNEKFVAIKVDREERPDIDAIYMLVCQMMTGQGGWPLSVFLTPEKV 60

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           P   GTYFP E +YG PGFK +L  +   + +  D +   G     Q+ +AL AS    +
Sbjct: 61  PFYAGTYFPRESRYGMPGFKEVLHYLSQQYTENPDRIKDVGT----QVKQALEASREKGE 116

Query: 276 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
               L +       +   +++D R+GGFG APKFP P  +  +L ++K  E+      A+
Sbjct: 117 -QTALTKETTGRAFQTYKQAFDPRYGGFGKAPKFPMPHSLVFLLMYAKFYENRDALAMAT 175

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
           +       TL  +A+GGI+DH+G GF RYSVDE++ VPHFEKMLYD   LA  Y DAF +
Sbjct: 176 K-------TLDGLARGGIYDHIGYGFSRYSVDEKFLVPHFEKMLYDNALLALAYTDAFRM 228

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           TK+  Y  I  +I+ Y+ RDM  P G  +SAEDADS   EG    +EG FYVWT KEV+D
Sbjct: 229 TKNARYKKITEEIIKYVLRDMAHPDGGFYSAEDADS---EG----EEGKFYVWTPKEVKD 281

Query: 456 ILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS-ASKLGMPLEK 513
           +LGE    LF + Y +   GN            F+GKN+  ++     + A K G     
Sbjct: 282 VLGEQLGTLFCQAYGITGQGN------------FEGKNIPNQITTHLETIAKKEGFSPAA 329

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
               L   R+ LF  R KR RP  DDK++ +WNGL+I++ A+A ++    +         
Sbjct: 330 LAEKLETARQSLFQHREKRVRPFRDDKILTAWNGLMIAALAKAGRVFYQPS--------- 380

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
                  Y++ AE A SFIR +L   Q  R+   +R+G  K  GF+D+YAFL+ G ++LY
Sbjct: 381 -------YVQAAEKAVSFIRDNLI--QNGRIMVRYRDGEVKNKGFIDEYAFLLWGYMELY 431

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           E      +L  A  L     +LF D  GGG+F +  +D  +L+R KE +DGA PSGNSV+
Sbjct: 432 ESTFAPFYLAEAKRLAGNMIDLFWDEHGGGFFFSGNDDEPLLVRQKESYDGALPSGNSVA 491

Query: 694 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVV 753
              L+RLA +      +   +  +     F   + D   A  +M  A  M +  + K VV
Sbjct: 492 ACQLLRLAKLTGDFTLE---EKVQQMFQAFSKVIHDDPNAHAMMMQAV-MYAQQATKEVV 547

Query: 754 LV---GHKSSVDF 763
           +V     + +VDF
Sbjct: 548 IVMDDETEKAVDF 560


>gi|219852761|ref|YP_002467193.1| hypothetical protein Mpal_2172 [Methanosphaerula palustris E1-9c]
 gi|219547020|gb|ACL17470.1| protein of unknown function DUF255 [Methanosphaerula palustris
           E1-9c]
          Length = 714

 Score =  400 bits (1029), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 258/683 (37%), Positives = 354/683 (51%), Gaps = 56/683 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  + SPYLL HAH PV WF WGEEAFA A     P+FLSIGY+TCHWCHVM  ESF 
Sbjct: 28  NRLIDQKSPYLLAHAHQPVAWFPWGEEAFARAAAEQKPVFLSIGYATCHWCHVMAEESFM 87

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  VA LLND++++IKVDREERPD+D+VYM   Q + G GGWPL++ ++PD +P    TY
Sbjct: 88  DLKVAALLNDYYIAIKVDREERPDIDQVYMAVCQMMTGSGGWPLTIIMTPDRRPFFAATY 147

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
            P   ++   G   +L  V   W +K   L +     +E L +   A A      D L  
Sbjct: 148 IPKMSRFRGTGMLDLLPMVAQVWREKPGDLIEVATQVVEALHQPARAGAGPEPTIDLLIA 207

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
               L A     ++D   GGFG APKFP P  +  +L + +      +SGE      MV 
Sbjct: 208 GYRGLAA-----TFDPVRGGFGDAPKFPAPHNLLFLLRYWR------RSGEPV-ALAMVE 255

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TLQ M  GGI+DH+ GGFHRYS D  W VPHFEKMLYDQ  L   Y +AF  T +  Y 
Sbjct: 256 QTLQAMRHGGIYDHLAGGFHRYSTDGGWKVPHFEKMLYDQAMLVMAYTEAFLATGNREYR 315

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE-HA 461
                 + Y+ RD++   G   +A+DADS   EG    +EG +Y+WT  EV  +L +  A
Sbjct: 316 KTAEATIQYVLRDLVTREGGFAAAQDADS---EG----EEGRYYLWTLAEVRGLLTQDEA 368

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHN-EFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
             F   Y +   GN      +DP N +  G+NVL    D+         PL+     L  
Sbjct: 369 ATFTTAYQMTERGN-----FTDPSNPKLTGRNVLYRSPDA---------PLQDPDLHLVA 414

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
              KL   R +R  P  DDKV+  WNGL+I++ ARA +                     +
Sbjct: 415 ADAKLAAARRERVPPLTDDKVLTGWNGLMIAALARAGRAFGV----------------AD 458

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           Y++VA  AA F+   + D Q  RL H +R+G     G  +DYA LI GLLDLY+     +
Sbjct: 459 YIDVAGRAADFLLGTMRD-QGGRLLHRYRDGEVAISGQAEDYAALIWGLLDLYQATFTVR 517

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
           +L  A+E+         D  GGG+F+   +   +++R KE +DGA PS NSV+ ++L+ L
Sbjct: 518 YLADAVEVMKEFTARCWDPAGGGFFSAAEDATDLIVRQKEQYDGAMPSANSVAFMDLLLL 577

Query: 701 ASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSS 760
           A +   +    Y + AE  L  F T + + +  +     A    ++   + VV+VG + +
Sbjct: 578 ARL---TGEPAYEEQAEE-LGRFMTGVVEQSPLIATFFLAGLDFALGPAQEVVIVGDEGA 633

Query: 761 VDFENMLAAAHASYDLNKTVSKK 783
           VD   M+ A    +  + TV  K
Sbjct: 634 VDTTAMVRALAERFLPSTTVQFK 656


>gi|448474014|ref|ZP_21601982.1| hypothetical protein C461_06214 [Halorubrum aidingense JCM 13560]
 gi|445818294|gb|EMA68153.1| hypothetical protein C461_06214 [Halorubrum aidingense JCM 13560]
          Length = 735

 Score =  400 bits (1028), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 242/663 (36%), Positives = 346/663 (52%), Gaps = 72/663 (10%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           S+    NRL  E SPYL QHA NPV W  WGE+AF  AR+ DVP+F+SIGYS+CHWCHVM
Sbjct: 2   SQPTDRNRLDGEASPYLQQHADNPVHWQPWGEDAFERAREHDVPVFVSIGYSSCHWCHVM 61

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
             ESFED+ +A +LND FV +KVDREERPDVD  +MT  Q + GGGGWPLS + +P+ KP
Sbjct: 62  AEESFEDDSIAAVLNDQFVPVKVDREERPDVDSTFMTVCQLVTGGGGWPLSAWCTPEGKP 121

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAW---------DKKRDMLAQSGAFAIEQLSEAL 267
              GTYFPPE +  +PGF+ +  ++ D+W          ++ +    S    +E + E  
Sbjct: 122 FYVGTYFPPEPRRNQPGFRDLCERIADSWADPEQREEMKRRAEQWTTSARDELESVPEPG 181

Query: 268 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLE 326
            A  + +  P     + L   A    + YD  +GGFGS   KFP P  I +++  + +  
Sbjct: 182 DADDADDTGPSG--SDLLEEAAAAAIRGYDDEYGGFGSGGAKFPMPGRIDLLMRAAARSG 239

Query: 327 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 386
            +     A+        TL  MA+GG++D +GGGFHRY+VD +W +PHFEKMLYD  +L 
Sbjct: 240 RSAALTAATG-------TLDGMARGGVYDQIGGGFHRYAVDRQWTIPHFEKMLYDNAELP 292

Query: 387 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADS--------------- 431
            VYLD + LT D  Y+ +  + L +L R++    G  FS  DA S               
Sbjct: 293 MVYLDGYRLTGDPSYARVASESLGFLDRELRHADGGFFSTLDARSRPPAGRGGGRGNDEG 352

Query: 432 AETEGATRKKEGAFYVWTSKEVEDILGEHA-ILFKEHYYLKPTGNCDLSRMSDPHNEFKG 490
            + EG     EGA+YVWT +EV+ +L E A  L K  + ++  GN +           +G
Sbjct: 353 GDGEGDAPAVEGAYYVWTPEEVDAVLDEPASSLAKARFGIRSGGNFE-----------RG 401

Query: 491 KNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVI 550
             V          A +   P ++   IL + R  LF+ R  RPRP  D+KV+ SWNG  I
Sbjct: 402 TTVPTVAASIEELADEYDRPADEVREILTDARVALFEARETRPRPARDEKVLASWNGRAI 461

Query: 551 SSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRN 610
           S+FARA  +L                    Y  +A  A +F R  LYDE T  L   + +
Sbjct: 462 SAFARAGDVLG-----------------DSYAAIASDALAFCRDRLYDEDTGELARRWLD 504

Query: 611 GPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNT--- 667
           G  + PG+LDDYAFL  G LD+Y      + L +A++L  +  + F +   G  + T   
Sbjct: 505 GDVRGPGYLDDYAFLARGALDVYAATGDPEPLGFALDLAESLVDAFYEAADGTIYFTRDP 564

Query: 668 -TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDY-YRQNAEHSLAVFET 725
              +D ++  R +E  D + PS   V+   L    +++ G ++D  +R+ AE  +     
Sbjct: 565 DASDDDTLFARPQEFTDRSTPSSLGVAAETL----ALLDGFRTDREFREIAEAVVTTHAD 620

Query: 726 RLK 728
           R++
Sbjct: 621 RIR 623


>gi|389847202|ref|YP_006349441.1| hypothetical protein HFX_1748 [Haloferax mediterranei ATCC 33500]
 gi|448614853|ref|ZP_21663881.1| hypothetical protein C439_01752 [Haloferax mediterranei ATCC 33500]
 gi|388244508|gb|AFK19454.1| highly conserved protein containing a thioredoxin domain [Haloferax
           mediterranei ATCC 33500]
 gi|445752940|gb|EMA04359.1| hypothetical protein C439_01752 [Haloferax mediterranei ATCC 33500]
          Length = 703

 Score =  400 bits (1028), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 246/674 (36%), Positives = 351/674 (52%), Gaps = 76/674 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W E A   AR++D PIFLSIGYS CHWCHVM  ESF 
Sbjct: 8   NRLDEEQSPYLCQHADNPVNWQPWDETALEAAREQDKPIFLSIGYSACHWCHVMADESFS 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  +A++LN+ FV +KVDREERPD+D++Y T  Q + GGGGWPLSV+L+P  KP   GTY
Sbjct: 68  DPEIAEVLNEHFVPVKVDREERPDLDRIYQTICQLVTGGGGWPLSVWLTPQGKPFFVGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDML---AQSGAFAI-EQLSEALSASASS--NKL 276
           FPPE + G PGF+ ++    ++W   RD +   A+    AI ++L E    +  +  +++
Sbjct: 128 FPPEPRRGAPGFRDLVESFAESWRTDRDEIENRAEQWTHAITDRLEETPDTTGETPGSEI 187

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
            D+  Q ALR        + D   GGFGS  PKFP+P  I  +L   +    TG+     
Sbjct: 188 LDQTVQAALR--------AADRDHGGFGSGGPKFPQPGRIDALL---RGYAITGR----R 232

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
           +   + +  L  MA GG+ DH+GGGFHRY VD +W VPHFEKMLYDQ  LA+ YLDA+ L
Sbjct: 233 QALDVAVEALDAMANGGLRDHLGGGFHRYCVDRQWTVPHFEKMLYDQAGLASRYLDAYRL 292

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           T +  Y+ + R+  +++RR++    G  F+  DA S         +EG FYVWT ++V  
Sbjct: 293 TGNESYATVARETFEFVRRELSHDDGGFFATLDAQSG-------GEEGTFYVWTPEDVRS 345

Query: 456 ILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS-ASKLGMPLEK 513
            L E  A LF + Y + P GN            F+ K  ++ ++ ++A  A +  +   +
Sbjct: 346 HLPELEADLFCDRYGVTPGGN------------FENKTTVLNVSATTADLAEEYDLTESE 393

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
               L E   +LF  R+ R RP  D+KV+  WNGL+IS+FA+ +  L  ++         
Sbjct: 394 VEERLEEAHEELFAARTDRERPARDEKVLAGWNGLMISAFAQGAVALTDDS--------- 444

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
                    + A  A  F+R HL+DE +  L     NG  K  G+L+DYAFL  G  DLY
Sbjct: 445 -------LADDARRALDFVREHLWDEASETLSRRVMNGEVKGDGYLEDYAFLARGAFDLY 497

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           +     + L +AI+L       F D   G  + T     +++ R +E  D + PS   V+
Sbjct: 498 QATGDLEPLSFAIDLARATHREFYDDAAGTLYFTPESGEALVTRPQEATDQSTPSSLGVA 557

Query: 694 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLK--------------DMAMAVPLMCC 739
               + L      +    +   A+  L  F  R++                A  VP +  
Sbjct: 558 TSLFLDLEHFAPDAG---FGDAADAVLESFANRVRGSPLEHVSLVLAAEKAASGVPELTV 614

Query: 740 AADMLSVPSRKHVV 753
           AAD +    R+ + 
Sbjct: 615 AADEMPDEWRETIA 628


>gi|261200020|ref|XP_002626411.1| DUF255 domain-containing protein [Ajellomyces dermatitidis
           SLH14081]
 gi|239594619|gb|EEQ77200.1| DUF255 domain-containing protein [Ajellomyces dermatitidis
           SLH14081]
          Length = 823

 Score =  400 bits (1028), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 252/672 (37%), Positives = 355/672 (52%), Gaps = 69/672 (10%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRL+   SPY+  H +NPV W  W  EA   A+K +  +FL         CHVME ESF
Sbjct: 23  VNRLSQSKSPYVRGHMNNPVAWQMWDSEAITLAKKLNRMVFLR--------CHVMEKESF 74

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
               VA +LN  F+ IK+DREERPD+D+VYM YVQA  G GGWPL+VFL+PDL+P+ GGT
Sbjct: 75  MSPEVAAILNKSFIPIKLDREERPDIDEVYMNYVQATTGSGGWPLNVFLTPDLEPVFGGT 134

Query: 222 YFPPEDKYGRPG--------FKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE-ALSASAS 272
           Y+P       P         F  IL K++D W  ++    +S     +QL E A   + S
Sbjct: 135 YWPGPHSSTLPALGGEGHVTFIDILEKLRDVWQTQQLRCRESAKDITKQLREFAEEGTHS 194

Query: 273 SNKLPDELPQNALRLCA---EQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK---KLE 326
             K  D      + L     +  +  +D   GGF  APKF  P  +  ++  S+    + 
Sbjct: 195 KQKAADADEDLEVELLEESYQHFASRFDPVNGGFSRAPKFATPANLSFLINLSRYPSAVS 254

Query: 327 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 386
           D     E +   +M   TL  M++GGIHD +G GF RYSV   W +PHFEKMLYDQ QL 
Sbjct: 255 DIVGYDECARALEMATKTLIYMSRGGIHDQIGHGFARYSVTADWSLPHFEKMLYDQAQLL 314

Query: 387 NVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
           NVY+DAF    +        DI  Y+    ++ P G  +S+EDADS  T   T K+EGAF
Sbjct: 315 NVYVDAFDSAHNPELLGAIYDIATYITSPPILSPTGGFYSSEDADSLPTPSDTDKREGAF 374

Query: 446 YVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 504
           YVWT KE + ILG+  A +   H+ + P GN  ++R +DPH+EF  +NVL      +  A
Sbjct: 375 YVWTHKEFKQILGQRDADVCARHWGVLPDGN--VARGNDPHDEFINQNVLSIKVTPAKLA 432

Query: 505 SKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 563
            + G+  E+ + I+   R KL + R SKR RP LDDK+IVSWNGL I + A+ S +L++ 
Sbjct: 433 KEFGLSEEEVVKIIKASREKLREYRESKRVRPGLDDKIIVSWNGLAIGALAKCSVVLEN- 491

Query: 564 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP-SKAPGFLDDY 622
                    V  +  +E+   AE+AA FIR++L+D  + +L   +R+G     PGF DDY
Sbjct: 492 ---------VDRAKAQEFRLAAENAAKFIRQNLFDPASGQLWRIYRDGERGDTPGFADDY 542

Query: 623 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR---------------------EG 661
           ++L SGL+DLYE      +L +A +LQ   +  FL +                       
Sbjct: 543 SYLASGLIDLYEATFDDGYLQFAEQLQQYLNTYFLAQGPTPTPSPRTSTTTESTPAPSSS 602

Query: 662 GGYFNT------TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQN 715
            GY+ T          P+ L R+K   D + PS N V   NL+RL++++   + D Y++ 
Sbjct: 603 TGYYTTPSTIHQASAHPAPLFRLKTGTDASTPSPNGVIAQNLLRLSTLL---EDDTYKRL 659

Query: 716 AEHSLAVFETRL 727
           A  ++  F   +
Sbjct: 660 ARETVNAFAVEI 671


>gi|76802617|ref|YP_327625.1| hypothetical protein NP3966A [Natronomonas pharaonis DSM 2160]
 gi|76558482|emb|CAI50074.1| YyaL family protein [Natronomonas pharaonis DSM 2160]
          Length = 698

 Score =  400 bits (1028), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 237/608 (38%), Positives = 316/608 (51%), Gaps = 49/608 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL    SPYL QHA NPV W  W E A   A +RDVPIFLSIGY+ CHWCHVM  ESF+
Sbjct: 3   NRLDEASSPYLRQHADNPVAWQPWDETALETAAERDVPIFLSIGYAACHWCHVMADESFD 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D   A +LN+ FV IKVDREERPDVD VYM   Q + G GGWPLSV+L+P+ KP   GTY
Sbjct: 63  DPDTADVLNEHFVPIKVDREERPDVDNVYMQVCQMVRGSGGWPLSVWLTPEGKPFHVGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWD--KKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           FPPE     PGFK++L  + +AWD  ++R  L Q      +Q + ++S+       P   
Sbjct: 123 FPPEPTKNTPGFKSVLEDIAEAWDDTERRQQLEQQA----DQWATSISSELEDTPEPVAE 178

Query: 281 P--QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
           P  +  L   A     + D   GG+G   KFP P  I ++L   ++ +       A E  
Sbjct: 179 PPGEEFLDTAANAAVGNADREHGGWGRGQKFPHPGRIHLLLCAYQQTDRETYRDVAVE-- 236

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
                TL  MA GG++DHVGGGFHRY VD  W VPHFEKMLYD  ++   +L  + +T D
Sbjct: 237 -----TLDAMASGGLYDHVGGGFHRYCVDREWTVPHFEKMLYDNAEIPRAFLAGYQVTGD 291

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             Y+ I  +   ++ R++  P G  +S  DA+S ++ G   ++EGAFYVWT + V   + 
Sbjct: 292 DRYAEIVAETFAFVDRELTHPDGGFYSTLDAESEDSTGT--REEGAFYVWTPEVVAAAVD 349

Query: 459 EH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
               A LF E Y +   GN +               VL E       A++  M       
Sbjct: 350 NETDAELFCERYGVTDAGNFE-----------NATTVLTESRPPEELAAERVMDTATVEE 398

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
            +   R +LF+ R++R RP  D+KV+  WNGL+IS+ A  + +L                
Sbjct: 399 RIERAREQLFESRAERSRPPRDEKVLAGWNGLMISALAEGALVLD--------------- 443

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
              EY + A +A SF R  L+DE    L   F  G     G+L DYAFL  G LDLY+  
Sbjct: 444 --PEYADDAAAALSFCREQLWDETEEVLNRRFEGGTVGIDGYLQDYAFLGRGALDLYQAT 501

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGG-YFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
              + L +A+ L       F D + G  YF   G D S+L R ++  D + PS   V+V 
Sbjct: 502 GDVEQLSFALSLGRVIQSEFYDADAGTLYFTAEGGD-SLLARPQQLADSSTPSSTGVAVE 560

Query: 696 NLVRLASI 703
            L RLA+ 
Sbjct: 561 LLSRLAAF 568


>gi|330465851|ref|YP_004403594.1| n-acylglucosamine 2-epimerase [Verrucosispora maris AB-18-032]
 gi|328808822|gb|AEB42994.1| n-acylglucosamine 2-epimerase [Verrucosispora maris AB-18-032]
          Length = 679

 Score =  400 bits (1028), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 235/627 (37%), Positives = 340/627 (54%), Gaps = 59/627 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   SPYLLQHA NPVDW+ W +EAFAEA++RDVP+ +S+GYS CHWCHVM  ESFE
Sbjct: 2   NRLAHATSPYLLQHADNPVDWWPWCDEAFAEAKRRDVPVLISVGYSACHWCHVMAHESFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +EGV +LLN+ FVSIKVDREERPDVD VYMT  QA+ G GGWP++VF +PD  P   GTY
Sbjct: 62  NEGVGRLLNEGFVSIKVDREERPDVDAVYMTATQAMTGQGGWPMTVFATPDGTPFYCGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP      R  F  +L  V  AW ++RD + + GA  +E +  A +    +  L  +L  
Sbjct: 122 FP------RQNFVRLLESVGTAWREQRDAVLRQGAAVVEAVGGAQAVGGPTAPLTADL-- 173

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L   A QL+  YD   GGFG APKFP  + +  +L H ++   TG    + +  +MV 
Sbjct: 174 --LDAAATQLAGEYDETNGGFGGAPKFPPHLNLLFLLRHHQR---TG----SPQSLEMVR 224

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            T + MA+GGIHD + GGF RYSVD  W VPHFEKMLYD   L  VY   + LT D    
Sbjct: 225 HTCEAMARGGIHDQLAGGFARYSVDGHWTVPHFEKMLYDNALLLRVYTQLWRLTGDALAL 284

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            + RDI  +L  ++  PG    SA DAD+   EG T       YVWT  ++ ++LG+   
Sbjct: 285 RVARDIARFLADELHRPGQGFASALDADTEGVEGLT-------YVWTPAQLVEVLGDEDG 337

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
            +            DL  +++      G +VL    D   +   +    E++ +++    
Sbjct: 338 RWA----------ADLFAVTESGTFEHGTSVLKLARDVDDADPAV---RERWQDVV---- 380

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK------SEAESAMFNFPVVGS 576
           R+L   R  RP+P  DDKV+ +WNGL +++ A   ++++      +E E+ +     + +
Sbjct: 381 RRLLAARDTRPQPARDDKVVAAWNGLAVTALAEFVRLVETSGRIGTEGEANLLEGVTIVA 440

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP-GFLDDYAFLISGLLDLYEF 635
           D      + ++A    R H+ D    RL+ + R+G    P G L+DY  +      +++ 
Sbjct: 441 DGA----MRDTAEYLARVHMVD---GRLRRASRDGRVGEPAGVLEDYGCVAEAFCAMHQV 493

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
               +WL WA +L +T    F    GG +++T  +   ++ R  +  D A PSG S    
Sbjct: 494 TGEGRWLEWAGQLLDTALAHFA-APGGAFYDTADDAEQLVARPADPTDNATPSGRSAIAA 552

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAV 722
            LV  +++   +   +YR+ AE +L+ 
Sbjct: 553 ALVAYSAL---TGQTHYREVAEAALST 576


>gi|55377924|ref|YP_135774.1| thioredoxin [Haloarcula marismortui ATCC 43049]
 gi|55230649|gb|AAV46068.1| thioredoxin domain containing protein [Haloarcula marismortui ATCC
           43049]
          Length = 733

 Score =  400 bits (1028), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 243/682 (35%), Positives = 357/682 (52%), Gaps = 71/682 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL    SPYL QHA NPV+W  W E A   AR+RDVPIFLSIGY+ CHWCHVME ESFE
Sbjct: 11  NRLDEAESPYLRQHADNPVNWQPWDETALEAARERDVPIFLSIGYAACHWCHVMEEESFE 70

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE +A+ LN+ FV IKVDREERPD+D VYM+  Q + GGGGWPLS +L+P+ +P   GTY
Sbjct: 71  DEAIAEQLNENFVPIKVDREERPDLDSVYMSICQQVTGGGGWPLSAWLTPEGEPFYVGTY 130

Query: 223 FPPEDKYGRPGFKTILRKVKDAW---DKKRDM--LAQSGAFAIEQLSEALSASASSNKLP 277
           FPPE+K G+PGF  +L+++  +W   +++ +M   AQ    AIE   EA  A       P
Sbjct: 131 FPPEEKRGQPGFGDLLQRLSGSWSDPEQRAEMENRAQQWTEAIESDLEATPAD------P 184

Query: 278 DELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
           ++  ++ ++       +  D + GG+GS  PKFP+   +  +L   +   D G+     +
Sbjct: 185 EDPAEDIIQTAGTIAHRGADRQDGGWGSGGPKFPQNGRLHALL---RAYADGGQ----ED 237

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
              +V  TL  MA  G++DHVGGGFHRY+ D++W VPHFEKMLYD  ++   +L  +   
Sbjct: 238 YLNVVEETLDVMADRGLYDHVGGGFHRYATDQQWAVPHFEKMLYDNAEIPRAFLAGYQAI 297

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSA----------ETEGATRK------ 440
               Y+ + R+  ++++R++  P G  FS  DA+SA          ++ G + +      
Sbjct: 298 GSERYASVVRETFEFVQRELQHPDGGFFSTLDAESAPHSESRSDSEQSSGESPRDDPDGE 357

Query: 441 -KEGAFYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 497
            +EG FYVWT ++V D + +   A +F ++Y +   GN            F+G  VL   
Sbjct: 358 TEEGLFYVWTPEQVHDAVDDETDADIFCDYYGVTEQGN------------FEGATVLAVR 405

Query: 498 NDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 557
                 A +     ++    L     + F+ R  RPRP  D+KV+  WNGL+I + A  +
Sbjct: 406 KPVPVLAEEYERSEDEITASLQRALNETFEARKDRPRPARDEKVLAGWNGLMIRALAEGA 465

Query: 558 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 617
            +L                   +Y +VA  A SF+R HL+D    RL   +++      G
Sbjct: 466 IVLDD-----------------QYADVAADALSFVREHLWDADAGRLNRRYKDDDVAIDG 508

Query: 618 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 677
           +L+DYAFL  G L L+E     + L +A++L     E F D E G  F T     S++ R
Sbjct: 509 YLEDYAFLGRGALTLFEATGDVEHLAFAMDLGQAITEAFWDDEQGTLFFTPTGGESLVAR 568

Query: 678 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 737
            +E  D + PS   V+V  L+ L+     S+ D +   AE  +     R+    +    +
Sbjct: 569 PQELTDQSTPSSTGVAVDLLLSLSHF---SEDDRFESVAERVIRTHADRVSSNPLQHASL 625

Query: 738 CCAADMLSVPSRKHVVLVGHKS 759
             A D     + + V LVG +S
Sbjct: 626 TLATDTYEQGALE-VTLVGDQS 646


>gi|448491519|ref|ZP_21608359.1| hypothetical protein C463_07017 [Halorubrum californiensis DSM
           19288]
 gi|445692519|gb|ELZ44690.1| hypothetical protein C463_07017 [Halorubrum californiensis DSM
           19288]
          Length = 746

 Score =  400 bits (1027), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 256/693 (36%), Positives = 351/693 (50%), Gaps = 88/693 (12%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           S+    NRL  E SPYL QHA NPV+W  WG+EAF  AR+ DVP+F+SIGYS+CHWCHVM
Sbjct: 2   SQPTERNRLDGEASPYLQQHADNPVNWQPWGDEAFELAREHDVPVFVSIGYSSCHWCHVM 61

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
             ESFEDE VA ++ND FV +KVDREERPDVD  +MT  Q + GGGGWPLS + +P+ KP
Sbjct: 62  AEESFEDESVAGVVNDSFVPVKVDREERPDVDSTFMTVCQLVTGGGGWPLSAWCTPEGKP 121

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWD---------KKRDMLAQSGAFAIEQLSEAL 267
              GTYFPPE +   PGF+ +  ++ D+W          ++ D   QS    +E +    
Sbjct: 122 FYVGTYFPPEPRQNHPGFRGLCERIADSWSDPEQREEMKRRADQWTQSARDELESVPNP- 180

Query: 268 SASASSNKLPDELPQNALRLCAEQLSKSYDSRF-GGFGSAPKFPRPVEIQMMLYHSKKLE 326
               S  +       + L   A    + YD  + G  G   KFP P  I +++       
Sbjct: 181 DTPGSDGEAASPPGDDLLDTAAAAALRGYDEEYGGFGGGGAKFPMPGRIDLLM------- 233

Query: 327 DTGKSGEASEGQKMVLF----TLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 382
                  A  G+  +L     TL  MA GG++D +GGGFHRY+VD +W VPHFEKMLYD 
Sbjct: 234 ----RAYAGRGRDALLSAATGTLDGMANGGMYDQIGGGFHRYAVDRQWTVPHFEKMLYDN 289

Query: 383 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDA------------D 430
            +L   YLD + L+ D  Y+ +  + L +L R++   GG  FS  DA            D
Sbjct: 290 AELPMAYLDGYRLSGDPAYARVAGESLAFLDRELRHEGGAFFSTLDARSRPPESRRDGSD 349

Query: 431 SAETEGATRKKEGAFYVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFK 489
           S E +G     EGAFYVWT +EV+ +L E A  L K+ Y ++  GN +           +
Sbjct: 350 SDEGDGEG-DVEGAFYVWTPEEVDAVLDEPAASLAKKRYGIRSGGNFE-----------R 397

Query: 490 GKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLV 549
           G  V          A+   +  EK   IL E R  LFD R  RPRP  D+KV+ SWNG  
Sbjct: 398 GTTVPTLAASVEELAADRDLSPEKVREILTEARTTLFDARESRPRPARDEKVLASWNGRA 457

Query: 550 ISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYD--EQTHRLQHS 607
           IS+FARA   L                  +EY E+A  A  F    LYD   +T  L   
Sbjct: 458 ISAFARAGDTLG-----------------EEYAEIAREALDFCHERLYDAENETGALARR 500

Query: 608 FRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNT-QDELF---------- 656
           + +G  + PG+LDDYAFL  G LD+Y      + L +A+EL +   DE +          
Sbjct: 501 WLDGDVRGPGYLDDYAFLARGALDVYAATGDPEPLGFALELADALVDEFYDADDGTIYFT 560

Query: 657 --LDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSD-YYR 713
             LD EG G  +   +   ++ R +E  D + PS   V+   L    +++ G ++D  +R
Sbjct: 561 RDLDGEGAGGGSRNADSGPLIARPQEFTDRSTPSSLGVAAETL----ALLDGFRTDGEFR 616

Query: 714 QNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 746
           + AE  L     R++   +    +  AAD++  
Sbjct: 617 EIAERVLTTHADRIRGSPLEHASLVRAADVVET 649


>gi|408680345|ref|YP_006880172.1| Thymidylate kinase [Streptomyces venezuelae ATCC 10712]
 gi|328884674|emb|CCA57913.1| Thymidylate kinase [Streptomyces venezuelae ATCC 10712]
          Length = 676

 Score =  400 bits (1027), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 242/629 (38%), Positives = 338/629 (53%), Gaps = 65/629 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYLLQHA NPVDW+ W  EAF EAR+RDVP+ LS+GYS+CHWCHVM  ESFE
Sbjct: 6   NRLAHETSPYLLQHADNPVDWWPWSAEAFEEARRRDVPVLLSVGYSSCHWCHVMAHESFE 65

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+ +A L+N+ FV++KVDREERPDVD VYM  VQA  G GGWP++VFL+PD  P   GTY
Sbjct: 66  DDAIAGLVNEHFVAVKVDREERPDVDAVYMEAVQAATGQGGWPMTVFLTPDAAPFYFGTY 125

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EALSASASSNKLPDELP 281
           FPPE ++G P F  +L  VKDAW  +RD + +     ++ L+  +L+         +EL 
Sbjct: 126 FPPEPRHGMPSFPEVLEGVKDAWADRRDEVGEVAERIVKDLAGRSLAYGGEGVPGEEELA 185

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           Q  L      L++ YD+  GGFG APKFP  + ++ +L H  +   TG  G      +M 
Sbjct: 186 QALL-----GLTREYDATRGGFGGAPKFPPSMTLEFLLRHHAR---TGAEG----ALQMA 233

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             T + MA+GGI+D +GGGF RY+VD  W VPHFEKMLYD   L   Y   +  T     
Sbjct: 234 ADTCEAMARGGIYDQLGGGFARYAVDRAWVVPHFEKMLYDNALLCRAYAHLWKATGSDLA 293

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EH 460
             +  +  D++ R++  P G   SA DADS   +G  R  EGA+YVWT  ++ ++LG E 
Sbjct: 294 RRVALETADFMVRELRTPEGGFASALDADS--DDGTGRHVEGAYYVWTPAQLTEVLGAED 351

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
           A L   HY +   G             F+  + +++L   +  A           + +  
Sbjct: 352 AALAAAHYGVTEAGT------------FEHGSSVLQLPQQAGPAEA---------DRIAS 390

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
              +L   R +R RP  DDKV+ +WNGL I++ A    +                 DR +
Sbjct: 391 IAARLLAAREERERPGRDDKVVAAWNGLAIAALAETGALF----------------DRPD 434

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAFLISGLLDLYEFGSGT 639
            +E A  AA  + R   DE   RL  + ++G +    G L+DYA +  G L L       
Sbjct: 435 LVERATEAADLLVRVHMDESA-RLTRTSKDGRAGTNAGVLEDYADVAEGFLALAAVTGEG 493

Query: 640 KWLVWAIELQNTQDELFLDR---EGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
            WL +A  L +    + LDR   EGG  ++T  +  +++ R ++  D A PSG + +   
Sbjct: 494 AWLEFAGFLLD----IVLDRFTAEGGALYDTAHDAEALIRRPQDPTDNATPSGWTAAAGA 549

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAVFET 725
           L+   S  A + SD +R  AE +L V + 
Sbjct: 550 LL---SYAAHTGSDAHRAAAEGALGVVKA 575


>gi|416351321|ref|ZP_11681110.1| thymidylate kinase [Clostridium botulinum C str. Stockholm]
 gi|338196028|gb|EGO88249.1| thymidylate kinase [Clostridium botulinum C str. Stockholm]
          Length = 611

 Score =  400 bits (1027), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 229/619 (36%), Positives = 336/619 (54%), Gaps = 63/619 (10%)

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           ME ESFEDE VAK+LND ++SIKVDREERPDVD  YMT+ QA+ G GGWPL++ ++P+ K
Sbjct: 1   MEKESFEDEEVAKILNDKYISIKVDREERPDVDNTYMTFCQAVTGSGGWPLTIIMTPEQK 60

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           P   GTYFP +  YGRPG   IL+++ D W   +D +  +    +  + E +S   S   
Sbjct: 61  PFFAGTYFPKKSMYGRPGIIQILKQISDEWKNNKDKIINTSNKLLNTMKERVSQDKS--- 117

Query: 276 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
             +E+  + L     +++  YD+++GGFG APKFP P ++ ++L + K   D    G   
Sbjct: 118 --EEINGSILHDAIMEMNYYYDNKYGGFGIAPKFPTPHKLMLLLIYYKVYNDKSALG--- 172

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
               MV  TL+CM KGGI DH+G GF RYS DE+W VPHFEKMLYD   LA VY +A+ +
Sbjct: 173 ----MVENTLKCMYKGGIFDHIGFGFSRYSTDEKWLVPHFEKMLYDNALLAYVYTEAYQV 228

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           T   FY  +   I  Y+ RDM  P G  +SAEDADS   EG     EG FYVW+ +E++ 
Sbjct: 229 TGKSFYKEVAEKIFTYILRDMTSPEGGFYSAEDADS---EGV----EGKFYVWSLEEIQS 281

Query: 456 ILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
           ILGE A  F   Y +   GN            F+GKN+           + +G  LE  +
Sbjct: 282 ILGEDAKEFCNTYDITEKGN------------FEGKNI----------PNLIGKDLEN-I 318

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
           + L E R KLF VR KR  P  DDK++ +WN L+I S + A ++                
Sbjct: 319 DKLEELRNKLFKVREKRVHPFKDDKILTAWNALMIVSLSYAGRVF--------------- 363

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
            + KEY+  A+ A  FI  +L   +  RL   FR+G +    +L+DY+FL+  L++LYE 
Sbjct: 364 -ENKEYINRAKKAYDFIENNLI-RKDGRLLARFRHGEAAYIAYLEDYSFLVWALMELYEA 421

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
              + +L  A+   +   +LF D E  G+F++  +   ++L +K+ +D A PSGNSV+ +
Sbjct: 422 TFESNYLKQALNFTDKMIKLFWDEESYGFFHSGRDGEKLILNLKDSYDTAIPSGNSVTAM 481

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 755
           NL++L+ I   +      + A      F   +K+   +  +   +      PSR+ +V+ 
Sbjct: 482 NLIKLSKITGDNS---LGEKAYKMFQGFGGNIKESLQSHSIFLISYMNYIKPSRQ-IVIA 537

Query: 756 GHKSSVDFENMLAAAHASY 774
             K    F+ M+   +  +
Sbjct: 538 SEKEDRLFKEMIKKVNKRF 556


>gi|329935309|ref|ZP_08285275.1| hypothetical protein SGM_6792 [Streptomyces griseoaurantiacus M045]
 gi|329305132|gb|EGG48991.1| hypothetical protein SGM_6792 [Streptomyces griseoaurantiacus M045]
          Length = 675

 Score =  400 bits (1027), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 242/626 (38%), Positives = 333/626 (53%), Gaps = 60/626 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   SPYLLQHA NPVDW+ W  EAF EAR+RDVP+FLS+GYS CHWCHVM  ESFE
Sbjct: 3   NRLAQATSPYLLQHADNPVDWWPWEAEAFEEARRRDVPVFLSVGYSACHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE  A  LN+ FVS+KVDREERPDVD VYM  VQA  G GGWP+SVFL+P+ +P   GTY
Sbjct: 63  DEATAAYLNEHFVSVKVDREERPDVDAVYMEAVQAATGHGGWPMSVFLTPEAEPFYFGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQ-SGAFAIEQLSEALSASASSNKLPDELP 281
           FPPE ++G P F+ IL+ V  AW ++R+ +A  +G    +     L+   +      E+ 
Sbjct: 123 FPPEPRHGSPSFRQILQGVHQAWTERREEVADVAGKITRDLAGRELAHGGAQVPGEQEMA 182

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           Q  L      L++ YD+R GGFG APKFP  + ++ +L H  +   TG  G      +M 
Sbjct: 183 QALL-----GLTREYDARRGGFGGAPKFPPSMVLEFLLRHHAR---TGSEG----ALQMA 230

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             T + MA+GG++D +GGGF RYSVD  W VPHFEKMLYD   L  VY   +  T     
Sbjct: 231 ADTCERMARGGLYDQLGGGFARYSVDRDWVVPHFEKMLYDNALLCRVYAHLWRATGSDLA 290

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHA 461
             +  +  +++ R++    G   SA DADS   +G  R  EGA+YVWT +++ ++LGE A
Sbjct: 291 RRVALETAEFMVRELGTAEGGFASALDADS--DDGTGRHVEGAYYVWTPEQLAEVLGEDA 348

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASASKLGMPLEKYLNILGE 520
            L   ++ +   G  +            G++VL +   D    A +           +  
Sbjct: 349 GLAARYFGVTEEGTFE-----------HGQSVLQLPQTDGVFDAER-----------VAS 386

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R +L   RS RP P  DDKV+ +WNGL I++ A                      DR +
Sbjct: 387 VRERLLGARSARPAPGRDDKVVAAWNGLAIAALAETGAYF----------------DRPD 430

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAFLISGLLDLYEFGSGT 639
            ++ A  AA  + R   DE   RL  + ++G + A  G L+DYA +  G L L +     
Sbjct: 431 LVDAAVRAADLLVRLHLDEHG-RLTRTSKDGRAGAHAGVLEDYADVAEGFLALAQVTGEG 489

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
            WL +A  L       F   E G  F+T  +   ++ R ++  D A PSG + +   L+ 
Sbjct: 490 VWLEFAGLLLGHVRTRFTGEE-GTLFDTASDAEKLIRRPQDPTDNATPSGWTAAAGALL- 547

Query: 700 LASIVAGSKSDYYRQNAEHSLAVFET 725
             S  A + S+ +R  AE +L V  T
Sbjct: 548 --SYAAHTGSEAHRTAAEQALGVVRT 571


>gi|29829838|ref|NP_824472.1| hypothetical protein SAV_3296 [Streptomyces avermitilis MA-4680]
 gi|29606947|dbj|BAC71007.1| hypothetical protein SAV_3296 [Streptomyces avermitilis MA-4680]
          Length = 675

 Score =  400 bits (1027), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 245/627 (39%), Positives = 335/627 (53%), Gaps = 60/627 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYLLQHA NPVDW+ W  EAF EARKR VP+ LS+GYS+CHWCHVM  ESFE
Sbjct: 2   NRLAHETSPYLLQHADNPVDWWPWSPEAFEEARKRGVPLLLSVGYSSCHWCHVMAHESFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE  A  LN+ FV++KVDREERPDVD VYM  VQA  G GGWP++VFL+PD +P   GTY
Sbjct: 62  DETTAAYLNEHFVNVKVDREERPDVDAVYMEAVQAATGQGGWPMTVFLTPDAEPFYFGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EALSASASSNKLPDELP 281
           FPPE ++G P F+ +L  V+ AW  +RD +A+     +  L+   +S   SS    +EL 
Sbjct: 122 FPPEPRHGMPSFRQVLEGVRSAWTDRRDEVAEVAGKIVRDLAGREISYGDSSTPGEEELA 181

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           Q  L      L++ YD+R GGFG APKFP  + ++ +L H  +   TG  G      +M 
Sbjct: 182 QALL-----GLTRDYDARRGGFGGAPKFPPSMVVEFLLRHHAR---TGSEG----ALQMA 229

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  VY   +  T     
Sbjct: 230 QDTCERMARGGIYDQLGGGFARYSVDRDWVVPHFEKMLYDNALLCRVYAHLWRATGSELA 289

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EH 460
             +  +  D++ R++    G   SA DADS   +G+ R  EGA+YVWT +++E  LG E 
Sbjct: 290 RRVALETADFMVRELRTGEGGFASALDADS--DDGSGRHVEGAYYVWTPEQLEQALGRED 347

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASASKLGMPLEKYLNILG 519
           A L    + +   G  +           +G +VL +   D    A +           + 
Sbjct: 348 AELAARCFGVTRDGTFE-----------EGASVLQLPQQDVVFDAER-----------IA 385

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
             R +L   R++RP P  DDKV+ +WNGL I++ A                      DR 
Sbjct: 386 SVRARLLGRRAERPAPGRDDKVVAAWNGLAIAALAETGAYF----------------DRP 429

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAFLISGLLDLYEFGSG 638
           + +E A  AA  + R   DE   RL  + ++G + A  G L+DY  +  G L L      
Sbjct: 430 DLVEAAIGAADLLVRLHLDEHA-RLARTSKDGRAGAHAGVLEDYGDVAEGFLALASVTGE 488

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
             WL +A  L +     F D E G  ++T  +   ++ R ++  D A PSG S +   L+
Sbjct: 489 GVWLEFAGFLLDHVLAQFTDPESGALYDTAADAEKLIRRPQDPTDNATPSGWSAAAGALL 548

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFET 725
              S  A + ++ +R  AE +L V + 
Sbjct: 549 ---SYAAHTGAEPHRTAAERALGVVKA 572


>gi|398343191|ref|ZP_10527894.1| hypothetical protein LinasL1_09021 [Leptospira inadai serovar Lyme
           str. 10]
          Length = 692

 Score =  399 bits (1025), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 259/666 (38%), Positives = 352/666 (52%), Gaps = 66/666 (9%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K  NRLA+E SPYLLQHA NPVDWF W +EAF +A++ D  IFLSIGY+TCHWCHVME E
Sbjct: 6   KKQNRLASEKSPYLLQHAMNPVDWFPWAKEAFLKAKEEDKMIFLSIGYATCHWCHVMEKE 65

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFEDE  A +LN +FVSIKVDREERPDVD++YM  + A+   GGWPL++FL+ + KP+ G
Sbjct: 66  SFEDEATAAVLNQYFVSIKVDREERPDVDRIYMDALHAMNQQGGWPLNMFLTSEGKPITG 125

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           GTYFPP  KYGR  F  IL  +   W +K++ L      A E+L++ L  S  S  L + 
Sbjct: 126 GTYFPPVAKYGRKSFTDILNILATLWKEKKEELID----ASEELAQYLKESEESKALSE- 180

Query: 280 LPQNALRLCAEQL--------SKSYDSRFGGFGS--APKFPRPVEIQMMLYHSKKLEDTG 329
             Q+AL+L ++ +         + YD  F GF S    KFP  + +  +L   K      
Sbjct: 181 --QSALQLPSKTVFENAFGMYDRFYDPEFAGFKSNVTNKFPPSMGLSFLLRFYK------ 232

Query: 330 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 389
            +GE  +  +MV  TL  M KGGI+D +GGG  RYS D +W VPHFEKMLYD        
Sbjct: 233 STGE-PKALEMVEETLVAMKKGGIYDQIGGGISRYSTDHKWLVPHFEKMLYDNSLFLEAL 291

Query: 390 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 449
           ++ F  T  + Y     D+L+Y+ RDM   GG I SAEDADS   EG    +EG FY+W 
Sbjct: 292 VECFQTTGHLKYKEAAYDVLEYISRDMRLQGGGIASAEDADS---EG----EEGLFYLWK 344

Query: 450 SKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 509
             E  ++    AIL +  + +   GN            F+G N+L E +  +  A   G+
Sbjct: 345 RNEFHEVCDSDAILLEAFWNVTEIGN------------FEGSNILHE-SFRTNFARLHGL 391

Query: 510 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 569
             E+ + I+   ++KL   RS R RP  DDKV++SWN L + +  +A+            
Sbjct: 392 EEEELIEIVNRNKKKLLARRSDRIRPLRDDKVLLSWNCLYVKAATKAAMAFGD------- 444

Query: 570 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 629
                     E + +AE    FI  +L  E   RL   FR G ++   +  DYA  I   
Sbjct: 445 ---------GELLRLAEETFRFIENNLVREDG-RLLRRFREGEARFLAYSGDYAEFILAS 494

Query: 630 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK-EDHDGAEPS 688
           L L++ G G ++L  AI        LF  R   G F  TG D   LLR   E +DG EPS
Sbjct: 495 LWLFQAGKGIRYLTLAIRYAEEAVRLF--RSPAGVFFDTGSDAEDLLRRNVEGYDGVEPS 552

Query: 689 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPS 748
            NS   +    L+ +  G +S  Y   A+   + F+  L+   M  P M  A  + +  S
Sbjct: 553 ANSSFALAFTILSRL--GVESGRYSDFADAIFSYFKVELETHPMNYPYMLSAYWLKNSDS 610

Query: 749 RKHVVL 754
           ++  V+
Sbjct: 611 KELAVV 616


>gi|154150757|ref|YP_001404375.1| hypothetical protein Mboo_1214 [Methanoregula boonei 6A8]
 gi|153999309|gb|ABS55732.1| protein of unknown function DUF255 [Methanoregula boonei 6A8]
          Length = 723

 Score =  399 bits (1025), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 244/684 (35%), Positives = 352/684 (51%), Gaps = 54/684 (7%)

Query: 93  STSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHW 152
           S +    + +NRLA E SPYLLQHA NPVDW+ WG EAF+ A++ D P+FLSIGYS CHW
Sbjct: 20  SGTMQTRRSSNRLARETSPYLLQHASNPVDWYPWGGEAFSRAKREDRPLFLSIGYSACHW 79

Query: 153 CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 212
           CHVM  ESFE+  VA +LN  FV IKVDREERPDVD VYM   Q L G GGWPL++ ++P
Sbjct: 80  CHVMARESFENNEVAGILNKHFVCIKVDREERPDVDSVYMGICQQLTGQGGWPLTIIMTP 139

Query: 213 DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASAS 272
           + KP   GTYFP   + G PG   IL  + + W+ +RD L    A A + LS+A     S
Sbjct: 140 EKKPFFAGTYFPKTGRAGMPGLTDILITIANLWETRRDELY---AAAEQILSDAHLLHKS 196

Query: 273 SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSG 332
            +  PD   ++ L     +L+  +DS  GGFG APKFP P  I  +L + +       +G
Sbjct: 197 PSGDPD---RHLLDKGFRELAAQFDSANGGFGRAPKFPAPHNILFLLRYWQ------MTG 247

Query: 333 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 392
           E +    M   TL  + +GGI DHVGGG HRY+ D RW VPHFEKML DQ  L     +A
Sbjct: 248 E-NRALDMAEQTLDAIRQGGIWDHVGGGMHRYATDARWLVPHFEKMLSDQAMLVLASTEA 306

Query: 393 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 452
           ++ T  + Y  I  + + Y+ R++  PGG  ++AEDADS          EGA+Y+WT +E
Sbjct: 307 YAATGKIRYRTIAEECIAYVLRELRDPGGGFYTAEDADSP-------AGEGAYYLWTEEE 359

Query: 453 VEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 512
           +  ILG  A      + L P           P +E K  +++            LG+  +
Sbjct: 360 IARILGLDAAFASILFSLTPL----------PGSE-KHASIISAAGPDPVLLKNLGITEQ 408

Query: 513 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 572
           + ++      R+L   R KRP+P  D K++   N L  ++ ARA ++L + +        
Sbjct: 409 ELISRRAGILRRLAHEREKRPKPARDTKILTDTNALFCTALARAGRVLGNPS-------- 460

Query: 573 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 632
                   Y + A     F+ +++ + +   L HS   G    PGF DDYA L++  ++L
Sbjct: 461 --------YTDAAACTLRFLLQNMRNGEGRILHHS-GGGEHAVPGFADDYAHLVAAHIEL 511

Query: 633 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 692
           Y+  S    +  A+ +       + D+EGGG+F T      + ++ KE +DGA PS N+ 
Sbjct: 512 YKATSDIACIKEAVTINALLLTHYRDKEGGGFFTTADTAVDLPVQKKEWYDGAVPSANTT 571

Query: 693 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP--LMCCAADMLSVPSRK 750
           +  NL  L  +     +D + + A                AV   L   A   L+  + +
Sbjct: 572 AFENLTALYRLTG---NDVFNEAALECARFITGAASRAPHAVTGFLAALACSPLT-GNTQ 627

Query: 751 HVVLVGHKSSVDFENMLAAAHASY 774
            +V+ G  ++   + +LA A   Y
Sbjct: 628 DLVIAGDPANAGTQTLLAVARRQY 651


>gi|357391644|ref|YP_004906485.1| hypothetical protein KSE_47490 [Kitasatospora setae KM-6054]
 gi|311898121|dbj|BAJ30529.1| hypothetical protein KSE_47490 [Kitasatospora setae KM-6054]
          Length = 687

 Score =  399 bits (1024), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 248/626 (39%), Positives = 336/626 (53%), Gaps = 55/626 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   SPYLLQHA NPVDW+ W  EAFAEA +R VP+ LS+GY+ CHWCHVM  ESFE
Sbjct: 3   NRLADATSPYLLQHADNPVDWWEWSPEAFAEAERRGVPVLLSVGYAACHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DEG A  LN+ FV++KVDREERPDVD VYM  VQA  G GGWP++VFL+P+ +P   GTY
Sbjct: 63  DEGTAGFLNERFVAVKVDREERPDVDAVYMEAVQAATGQGGWPMTVFLTPEKEPFYFGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPPE ++G P F+ +L  V  AW  +R  + +        L+E  S  A  + +     +
Sbjct: 123 FPPEPRHGMPSFRQVLEGVDKAWTGRRAEVGEVAGRISRDLAERASVYAVGSGVAGVPGE 182

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L     +L+KSYD R GGFG APKFP  + ++ +L H        ++G A+   +M  
Sbjct: 183 GELGAAVAELAKSYDERRGGFGGAPKFPPSMVLEFLLRHHA------RTGSAA-ALRMAG 235

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            T + MA+GGIHD +GGGF RY+VD  W VPHFEKM YD   L  VYL  +  T +    
Sbjct: 236 RTCEAMARGGIHDQLGGGFARYAVDATWTVPHFEKMCYDNALLLRVYLHLWRATGEERAR 295

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHA 461
            +     D+L R++  P G   SA DADS + E   R  EGA+Y WT +++E +LG   A
Sbjct: 296 RVALSTADFLLRELRTPEGGFASALDADSLD-EATGRTAEGAYYAWTPEQLERVLGAADA 354

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
               E + +   G  +            G +VL  L D            ++Y ++    
Sbjct: 355 GYAAELFGVTANGTFE-----------HGSSVLQLLADPEDR--------DRYESV---- 391

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           R KLF+ RS RP P  DDKV+ +WNGL I++ A A  +L+                R E 
Sbjct: 392 RAKLFEARSHRPAPARDDKVVAAWNGLAIAALAEAGALLE----------------RPEL 435

Query: 582 MEVAESAAS-FIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAFLISGLLDLYEFGSGT 639
           +E AE AA   I  HL  +   RL  + R+G + A  G L+DYA    G L LY     +
Sbjct: 436 VEAAERAADLLIAVHLTPDG--RLLRTSRDGRAGANAGVLEDYADTAEGFLALYAVTGES 493

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
            WL  A EL +     F D   G  ++T  +   ++ R ++  D A PSG + +   L+ 
Sbjct: 494 SWLQLAGELLDLVLRHFTDEASGALYDTADDAEQLIRRPQDPTDNATPSGWTAAAGALLT 553

Query: 700 LASIVAGSKSDYYRQNAEHSLAVFET 725
            A+    + SD +R  AE +L +  T
Sbjct: 554 YAAY---TGSDRHRTAAERALGIVST 576


>gi|75674298|ref|YP_316719.1| hypothetical protein Nwi_0099 [Nitrobacter winogradskyi Nb-255]
 gi|74419168|gb|ABA03367.1| Protein of unknown function DUF255 [Nitrobacter winogradskyi
           Nb-255]
          Length = 676

 Score =  399 bits (1024), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 232/612 (37%), Positives = 329/612 (53%), Gaps = 56/612 (9%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           S  +  NRL+AE SPYLLQH HNPVDW+ WG EA AEA++ + PI LSIGY+ CHWCHVM
Sbjct: 7   SSGRLANRLSAETSPYLLQHQHNPVDWWPWGPEALAEAQRSNRPILLSIGYAACHWCHVM 66

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
             ESFED+ VA ++N+ FV IKVDREERPD+D++YM+ +  L   GGWPL++FLSPD  P
Sbjct: 67  AHESFEDDDVAAVMNELFVCIKVDREERPDIDQIYMSALHHLGEQGGWPLTMFLSPDGSP 126

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKL 276
             GGTYFP    +GRP F  +L+ V   +  + D +A+     I +LSE      ++ K 
Sbjct: 127 FWGGTYFPKLPDFGRPAFTDVLQSVARVFRDQPDQIARHRDTLIARLSE-----RATTKS 181

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
           P  L    L   A  + +S D   GG   APKFP+   ++++     +  D       + 
Sbjct: 182 PANLGVAELNNAAVAIMRSTDPVNGGLRGAPKFPQCSVLELLWRAGARTRDDRFFAATT- 240

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
                  TL  M++GGI+DH+GGG+ RYSVD+RW VPHFEKMLYD  Q+ ++    ++ +
Sbjct: 241 ------LTLTRMSQGGIYDHIGGGYARYSVDDRWLVPHFEKMLYDNAQILDLLALDYARS 294

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
           K+  Y     + +D+LRR+M+   G   S+ DADS   EG    +EG FYVW+  E++D+
Sbjct: 295 KNPLYRERAIETVDWLRREMLTAEGGFASSLDADS---EG----EEGRFYVWSLSEIDDV 347

Query: 457 LGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
           LG          Y   T N +  R + P N  K  +V    ND SA    L         
Sbjct: 348 LGAADAADFAARY-DITANGNFERRNIP-NRLKSIDV---ANDDSAHMRAL--------- 393

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
                R+KL   R  R RP LDDK++  WNGL+I++    + +                 
Sbjct: 394 -----RKKLLVRRESRVRPGLDDKILADWNGLMIAALVHGACVF---------------- 432

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
           D+ +++ +A +A  FIR  +   +  RL HS+R G    P    DYA +    L L+E  
Sbjct: 433 DKPDWLRIARAAYDFIRTMM--TRDGRLGHSWREGRLLIPALASDYATMARAALALFEAT 490

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
               +L  A+  Q+T D  + D   GGY+ T  +   +++R     D A P+ + V   N
Sbjct: 491 GDGTFLEQALRWQSTLDTHYADAAHGGYYLTADDAEGLIVRPHSSEDDAIPNHDGVIAQN 550

Query: 697 LVRLASIVAGSK 708
           LVRLA++   +K
Sbjct: 551 LVRLAALTGDAK 562


>gi|448624555|ref|ZP_21670503.1| thioredoxin domain containing protein [Haloferax denitrificans ATCC
           35960]
 gi|445749760|gb|EMA01202.1| thioredoxin domain containing protein [Haloferax denitrificans ATCC
           35960]
          Length = 703

 Score =  398 bits (1023), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 245/667 (36%), Positives = 344/667 (51%), Gaps = 84/667 (12%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W E A   AR+ D PIFLS+GYS CHWCHVM  ESF 
Sbjct: 8   NRLDDEQSPYLRQHADNPVNWQPWDETALDAAREADKPIFLSVGYSACHWCHVMADESFS 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  +A++LN+ FV +KVDREERPD+D++Y T  Q + GGGGWPLSV+L+P+ KP   GTY
Sbjct: 68  DPDIAEVLNEHFVPVKVDREERPDLDRIYQTICQLVTGGGGWPLSVWLTPEGKPFFVGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDML---AQSGAFAI-EQLSEA--LSASASSNKL 276
           FPPE + G PGF+ ++    ++W   R+ +   A+    AI ++L E   ++  A  +++
Sbjct: 128 FPPEPRRGAPGFRDLVESFAESWRTDREEIENRAEQWTSAITDRLEETPDVAGEAPGSEV 187

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
            D   Q ALR          D   GGFG   PKFP+P  I  +L            G A 
Sbjct: 188 LDTTVQAALR--------GADRDHGGFGGDGPKFPQPGRIDALL-----------RGYAV 228

Query: 336 EGQKMVL----FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLD 391
            G++  L     +L  MA GG+ DH+GGGFHRY VD  W VPHFEKMLYDQ  LA  YLD
Sbjct: 229 SGRREALDVARQSLDAMANGGLRDHLGGGFHRYCVDREWTVPHFEKMLYDQAGLAARYLD 288

Query: 392 AFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSK 451
           A  LT +  Y+ +  +   ++RR++    G  F+  DA S         +EG FYVWT  
Sbjct: 289 AARLTGNESYATVAAETFAFVRRELTHDDGGFFATLDAQSG-------GEEGTFYVWTPD 341

Query: 452 EVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS-ASKLGM 509
           +V ++L E  A LF + Y + P GN            F+ K  ++ ++ ++A  A +  +
Sbjct: 342 DVRELLPELDADLFCDRYGVTPGGN------------FENKTTVLNVSATTADLAEEYDL 389

Query: 510 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 569
              +    L + R+ LF  R  R RP  D+KV+  WNGL+IS+FA+ S +L+ ++     
Sbjct: 390 AESEVEARLEKARKALFAAREGRDRPARDEKVLAGWNGLMISAFAQGSVVLEDDS----- 444

Query: 570 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 629
                        + A  A  F+R  L+D++T  L     NG  K  G+L+DYAFL  G 
Sbjct: 445 -----------LADDARRALDFVRERLWDDETETLSRRVMNGEVKGDGYLEDYAFLARGA 493

Query: 630 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 689
            DLY+       L +A++L       F D + G  + T     S++ R +E  D + PS 
Sbjct: 494 FDLYQATGDLAPLSFALDLARATRREFYDADAGTLYFTPESGESLVTRPQEPTDQSTPSS 553

Query: 690 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLK--------------DMAMAVP 735
             V+    + L      +  D +   A+  L  F  R++                A  VP
Sbjct: 554 LGVATSLFLDLEQF---APEDGFGDVADAVLGSFANRVRGSPLEHVSLALAAEKAASGVP 610

Query: 736 LMCCAAD 742
            +  AAD
Sbjct: 611 ELTVAAD 617


>gi|344211988|ref|YP_004796308.1| thioredoxin domain-containing protein [Haloarcula hispanica ATCC
           33960]
 gi|343783343|gb|AEM57320.1| thioredoxin domain-containing protein [Haloarcula hispanica ATCC
           33960]
          Length = 717

 Score =  398 bits (1023), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 239/666 (35%), Positives = 354/666 (53%), Gaps = 55/666 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL    SPYL QHA NPV+W  W E+A   A++RDVPIFLSIGY+ CHWCHVME ESFE
Sbjct: 11  NRLDEAESPYLRQHADNPVNWQPWDEQALEAAKERDVPIFLSIGYAACHWCHVMEEESFE 70

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +E +A+ LN+ FV IKVDREERPD+D VYM+  Q + GGGGWPLS +L+P+ +P   GTY
Sbjct: 71  NEAIAEQLNEHFVPIKVDREERPDLDSVYMSICQQVTGGGGWPLSAWLTPEGEPFYVGTY 130

Query: 223 FPPEDKYGRPGFKTILRKVKDAW---DKKRDM--LAQSGAFAIEQLSEALSASASSNKLP 277
           FPPE+K G+PGF  +L+++ D+W   +++ +M   AQ    AIE   EA  A+      P
Sbjct: 131 FPPEEKRGQPGFGDLLQRLADSWSDPEQREEMENRAQQWTEAIESDLEATPAN------P 184

Query: 278 DELPQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
           ++  ++ ++       +  D + GG+GS  PKFP+   +  +L   +   D G+    + 
Sbjct: 185 EDPAEDIIQTAGTIAHRGADRQDGGWGSGGPKFPQNGRLHALL---RAHADGGQEDYLT- 240

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
              +V  TL  MA  G++DHVGGGFHRY+ D++W VPHFEKMLYD  ++   +L  +   
Sbjct: 241 ---VVEETLDVMADRGLYDHVGGGFHRYATDQQWAVPHFEKMLYDNAEIPRAFLAGYQAI 297

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGAT-RKKEGAFYVWTSKEVED 455
               Y+ + R+  ++++R++  P G  FS  DA+S   E      +EG FYVWT ++V D
Sbjct: 298 GSERYASVVRETFEFVQRELQHPDGGFFSTLDAESVPPEDPDGDSEEGLFYVWTPEQVHD 357

Query: 456 ILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
            + +   A +F           CD   +++P N F+G  VL      S  A +     ++
Sbjct: 358 AVDDETDADIF-----------CDYYGVTEPGN-FEGATVLAVRKPVSVLAEEYEQSEDE 405

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
               L     + F+ R +RPRP  D+KV+  WNGL+I + A  + +L             
Sbjct: 406 ITASLQRALNETFEAREERPRPARDEKVLAGWNGLMIRALAEGAIVLDDAYADVA----- 460

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
                         A SF+R HL+D    RL   +++G     G+L+DYAFL  G L L+
Sbjct: 461 ------------ADALSFVREHLWDADAERLNRRYKDGDVAIDGYLEDYAFLGRGALTLF 508

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           E     + L +A++L     E+F D + G  F T     S++ R +E  D + PS   V+
Sbjct: 509 EATGNVEHLAFAMDLGQAITEVFWDDDEGTLFFTPTGGESLVARPQELTDQSTPSSTGVA 568

Query: 694 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVV 753
           V  L+ L+     S  D +   AE  +     R+    +    +  A D     + + + 
Sbjct: 569 VDLLLSLSHF---SDDDRFETVAERVIRTHADRVSSNPLQHASLTLATDTYEQGALE-LT 624

Query: 754 LVGHKS 759
           LVG +S
Sbjct: 625 LVGDQS 630


>gi|313667030|gb|ADR72969.1| DUF255 family protein [Streptomyces sp. OH-4156]
          Length = 673

 Score =  398 bits (1023), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 243/631 (38%), Positives = 338/631 (53%), Gaps = 67/631 (10%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRLA E SPYLLQHA NPVDW+ W  EAF EAR+RDVP+ LS+GYS+CHWCHVM  ESF
Sbjct: 2   ANRLAHETSPYLLQHADNPVDWWPWSAEAFDEARRRDVPVLLSVGYSSCHWCHVMAHESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           ED+  A L+N+ FV++KVDREERPDVD VYM  VQA  G GGWP++VFL+PD  P   GT
Sbjct: 62  EDDATAALVNENFVAVKVDREERPDVDAVYMEAVQAATGQGGWPMTVFLTPDAAPFYFGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLP--DE 279
           YFPPE ++G P F  +L  VK AW  +RD + +     ++ L+   S +   + +P  +E
Sbjct: 122 YFPPEPRHGMPSFPEVLEGVKGAWSDRRDEVGEVAERIVKDLA-GRSLAYGGDGVPGEEE 180

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
           L Q  L      L++ YD+  GGFG APKFP  + ++ +L H  +   TG  G      +
Sbjct: 181 LAQALL-----GLTREYDATHGGFGGAPKFPPSMTLEFLLRHHAR---TGSEG----ALQ 228

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           M   T + MA+GGI+D +GGGF RY+VD  W VPHFEKMLYD   L   Y   +  T   
Sbjct: 229 MAADTCEAMARGGIYDQLGGGFARYAVDRAWVVPHFEKMLYDNALLCRAYAHLWKATGSD 288

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG- 458
               +  +  D+L R++  P G   SA DADS   +G  R  EGA+YVWT  ++ ++LG 
Sbjct: 289 LARRVALETADFLVRELRTPEGGFASALDADS--DDGTGRHVEGAYYVWTPAQLTEVLGA 346

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
           E A L   HY +   G             F+  + +++L   + +A             +
Sbjct: 347 EDAALAAAHYGVTEDGT------------FEHGSSVLQLPREAGTADA---------GRI 385

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
                +L   R +R RP  DDKV+ +WNGL I++ A    +                 DR
Sbjct: 386 ASIAARLLAAREERERPGRDDKVVAAWNGLAIAALAETGALF----------------DR 429

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAFLISGLLDLYEFGS 637
            + +E A  AA  + R   DE   RL  + ++G +    G L+DYA +  G L L     
Sbjct: 430 PDLVERATEAADLLVRVHMDESA-RLTRTSKDGRAGTNDGVLEDYADVAEGFLALAAVTG 488

Query: 638 GTKWLVWAIELQNTQDELFLDR---EGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
              WL +A  L +    L +DR   EGG  ++T  +  +++ R ++  D A PSG + + 
Sbjct: 489 EGAWLDFAGFLLD----LVIDRFTAEGGALYDTAHDAEALIRRPQDPTDNATPSGWTAAA 544

Query: 695 INLVRLASIVAGSKSDYYRQNAEHSLAVFET 725
             L+   S  A + SD +R  AE +L V + 
Sbjct: 545 GALL---SYAAHTGSDAHRAAAEGALGVVKA 572


>gi|124002212|ref|ZP_01687066.1| thymidylate kinase [Microscilla marina ATCC 23134]
 gi|123992678|gb|EAY32023.1| thymidylate kinase [Microscilla marina ATCC 23134]
          Length = 681

 Score =  398 bits (1023), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 237/634 (37%), Positives = 334/634 (52%), Gaps = 67/634 (10%)

Query: 95  SHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCH 154
           SH   +  NRLA   SPYLLQHA+NPVDW+ WGEEA  +A+  D PI +SIGYS CHWCH
Sbjct: 2   SHQNTQTPNRLAKATSPYLLQHAYNPVDWYPWGEEALQKAKDEDKPIIVSIGYSACHWCH 61

Query: 155 VMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDL 214
           VME ESFED+ VA ++N +F+ IKVDREERPDVD +YM  VQA+   GGWPL+  L+P+ 
Sbjct: 62  VMERESFEDDEVAAIMNRYFICIKVDREERPDVDAIYMDAVQAMGQRGGWPLNALLTPEA 121

Query: 215 KPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSN 274
           KP    TY P E       +  +L+ V + +  KRD L QS     E   EA++ S +  
Sbjct: 122 KPFYALTYLPKE------SWVQLLQNVAEVYQTKRDELEQSA----EAYREAIATSEAKK 171

Query: 275 KLPDELPQNALRLCAEQLSKSYDSRF-------GGFGSAPKFPRPVEIQMMLYHSKKLED 327
               +L  N +R   E L K + S +       GG   APKFP P   Q +L++      
Sbjct: 172 Y---DLKPNDIRYAREDLDKMFQSVYNDVDHTRGGTNRAPKFPMPSIWQFLLHYY----- 223

Query: 328 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 387
             +  +  E  + V  TL  MAKGGI+D +GGGF RYSVD  W  PHFEKMLYD GQL +
Sbjct: 224 --QITKKEEALRTVEVTLNEMAKGGIYDQIGGGFARYSVDADWFAPHFEKMLYDNGQLLS 281

Query: 388 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 447
           +Y DA+++T++  Y  +    +D++ R++    G  FSA DADS   EG     EG FYV
Sbjct: 282 LYADAYNVTQNPLYQQVVMQTVDFVARELTSEEGGFFSALDADS---EGV----EGKFYV 334

Query: 448 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 506
           W     ++++G E A +  ++Y +    N            ++  N+L       A A K
Sbjct: 335 WEKTAFDEVIGVEDAAIAADYYQVTSQAN------------WEEGNILHRSIGDLAFAEK 382

Query: 507 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 566
             + +E     + +   +L   RSKR RP LDDK++ SWNGL++     A ++       
Sbjct: 383 HQIDVESLKQKVTQWNERLLTARSKRIRPGLDDKILTSWNGLMLKGLVDAYRVF------ 436

Query: 567 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 626
                     D  + + +A + A FI   L  E  ++L HS++NG +    +L+DYA ++
Sbjct: 437 ----------DSPKLLNLALANAQFIAEKLTTE-NYQLYHSYKNGKASINAYLEDYAAVV 485

Query: 627 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 686
              + LY+     +WL  A  L +     F D+E G +F T      ++ R KE  D   
Sbjct: 486 DAYIALYQATFDEQWLTKAKSLTDYALANFYDKEEGLFFFTDVNAEKLIARKKELFDNVI 545

Query: 687 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 720
           P+ NS+   NL  L   +   +SD Y+Q A   L
Sbjct: 546 PASNSMMAKNLYWLG--LYYEQSD-YQQKASQML 576


>gi|440749562|ref|ZP_20928808.1| Thymidylate kinase [Mariniradius saccharolyticus AK6]
 gi|436481848|gb|ELP37994.1| Thymidylate kinase [Mariniradius saccharolyticus AK6]
          Length = 674

 Score =  398 bits (1022), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 240/657 (36%), Positives = 346/657 (52%), Gaps = 59/657 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL+   SPYLLQH HNPVDW+ WGEEA  +A++ D PI +SIGYS CHWCHVME ESFE
Sbjct: 2   NRLSQSKSPYLLQHQHNPVDWYPWGEEALNKAQQEDKPILVSIGYSACHWCHVMERESFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE  A L+N  FV IK+DREERPD+D +YM  +QA+   GGWPL+VFL P+ KP  GGTY
Sbjct: 62  DEETADLMNAHFVCIKIDREERPDLDNIYMEALQAMGVQGGWPLNVFLMPNQKPFYGGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP +       +K +L  + +A+      L +S       +  +             L +
Sbjct: 122 FPNKQ------WKNLLGSIANAYKNHHGQLLESAEGFGRSIGRSELEKYGLKAAETGLEK 175

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             + L  ++L+  +D  +GG    PKFP P     +L       D    G+  E  + V 
Sbjct: 176 ADIELVLDKLTAQFDLEWGGMNRKPKFPMPAVWLFVL-------DAALLGKDQELLEKVF 228

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
           FTL+ +  GGI+DH+ GG+ RYSVD  W  PHFEKMLYD GQL ++Y  A+ ++ D F+ 
Sbjct: 229 FTLKKIGMGGIYDHLRGGWARYSVDGEWFAPHFEKMLYDNGQLLDLYAKAYQVSGDEFFK 288

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
               + +D++  +M+   G  F+A+DADS   EG     EG FY W  +E+E ILGE   
Sbjct: 289 EKVLETVDWIEAEMLLSEGGFFAAQDADS---EGV----EGKFYTWKYEELEAILGEDLS 341

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
            FK+ Y LK  GN +            G N+L +    +  A+++G+  + Y   L + +
Sbjct: 342 WFKKLYNLKYQGNWE-----------DGVNILFQTEPYADLAAEIGLSEKAYRERLQQIK 390

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYM 582
            KL  VR++R  P LDDKV+  WNGL I+  A+               F   GS++   +
Sbjct: 391 TKLLTVRNRRIYPGLDDKVLSGWNGLAIAGLAQV--------------FLATGSEKA--L 434

Query: 583 EVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWL 642
            +A+    F+   ++  Q   L  S+++G +  P FL+DYA +I G + LY+    T+WL
Sbjct: 435 SLAKRNGKFLWEKMFKGQV--LYRSYKDGQAYTPAFLEDYAAVIRGYISLYQASFETEWL 492

Query: 643 VWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLAS 702
           + A EL +   E + D   G +F    +   ++   KE  D   P+ NSV   NL  L  
Sbjct: 493 LKAKELTDLVLEQYYDEGDGFFFFNNPKAEKLIANKKELFDNVIPASNSVMARNLQDLGL 552

Query: 703 IVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC--AADML-SVPSRKHVVLVG 756
                  + Y+  AEH LA     +K + +  P   C  A+ ML ++  +  V +VG
Sbjct: 553 YFY---QEEYQAIAEHMLA----SVKRLILTEPGFLCNWASLMLHTLVPKAEVAVVG 602


>gi|452958537|gb|EME63890.1| hypothetical protein H074_04714 [Amycolatopsis decaplanina DSM
           44594]
          Length = 688

 Score =  397 bits (1021), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 243/627 (38%), Positives = 329/627 (52%), Gaps = 78/627 (12%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           +NRL A  SPYLLQHA NPVDW+ WGEEA AEA++R+VPI LS+GY+ CHWCHVM  ESF
Sbjct: 22  SNRLKAATSPYLLQHAGNPVDWWPWGEEALAEAKRRNVPILLSVGYAACHWCHVMAHESF 81

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           EDE  A L+N  FV+IKVDREERPD+D VYM   QA+ G GGWP++ FL+P+ +P   GT
Sbjct: 82  EDEATATLMNANFVNIKVDREERPDIDSVYMAATQAMTGQGGWPMTCFLTPEGEPFHCGT 141

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           Y+PP  + G P F  +L  V +AWD++   L       I  L+E       S  LP+ + 
Sbjct: 142 YYPPSPRPGMPSFSQLLVAVAEAWDERPGELRSGARQIIAHLTE------KSGPLPESVV 195

Query: 282 QNA-LRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
             A L      L K YD+  GGFG APKFP  + +  +L H ++   TG       G  M
Sbjct: 196 DGAVLESAVASLRKEYDAENGGFGGAPKFPPTMALNFLLRHHER---TGS------GLSM 246

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
           V  T + MA GG++D + GGF RYSVD RW VPHFEKMLYD G L   Y     +T   +
Sbjct: 247 VEHTAEAMALGGLNDQLAGGFARYSVDARWEVPHFEKMLYDNGLLLRFYARFHGVTGYEY 306

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
                 +  ++L RD+    G   ++ DAD+   EG T       YVWT  ++ ++LGE 
Sbjct: 307 ARRTVEETAEFLLRDLGTAEGGFAASLDADTDGVEGLT-------YVWTPAQLAEVLGEE 359

Query: 461 -AILFKEHYYLKPTGN----CDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
                 E + +   GN        R+ +PH E                        E+Y 
Sbjct: 360 DGAWAAELFQVAEPGNFEHGASTLRLREPHPEDA----------------------ERYE 397

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
            +    RR L   R +RP+P  DDKVI +WNGL I +FA A   L               
Sbjct: 398 RV----RRALLAARGQRPQPARDDKVIAAWNGLAIGAFANAGSRLG-------------- 439

Query: 576 SDRKEYMEVAESAASFIR-RHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLISGLLDLY 633
             R ++++ A  AA+F+  +H  D    RL+ + R+G      G L+DYA L  GLL+L+
Sbjct: 440 --RPQWIDAATRAAAFLMDKHFVD---GRLRRTSRDGVVGTTAGVLEDYACLAEGLLELH 494

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSGNSV 692
           +     +WL  AI L +     F   +  G +  T +D  VL++   D  D A PSG S 
Sbjct: 495 QSTGEPRWLADAITLLDLALAHFGVPDSPGAYYDTADDAEVLVQRPSDPTDNASPSGAS- 553

Query: 693 SVINLVRLASIVAG-SKSDYYRQNAEH 718
           ++ N +  AS++AG  +   YR+ AE 
Sbjct: 554 ALANALLTASVLAGHDQVGRYREAAEQ 580


>gi|344340301|ref|ZP_08771227.1| hypothetical protein ThimaDRAFT_2966 [Thiocapsa marina 5811]
 gi|343799959|gb|EGV17907.1| hypothetical protein ThimaDRAFT_2966 [Thiocapsa marina 5811]
          Length = 691

 Score =  397 bits (1020), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 260/669 (38%), Positives = 365/669 (54%), Gaps = 72/669 (10%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRLA   SPYL QHAHNPVDW+ W EEA A AR+ D PI LSIGYS CHWCHVM  ESF
Sbjct: 12  VNRLAETTSPYLRQHAHNPVDWWPWCEEALALARETDRPILLSIGYSACHWCHVMAHESF 71

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPLSVFLSP-DLKPLMG 219
           ED G A+L+N  FV+IKVDREERPD+DK+Y T  Q L    GGWPL+VFL P D KP   
Sbjct: 72  EDPGTAELMNRLFVNIKVDREERPDLDKIYQTAHQLLAQRPGGWPLTVFLMPDDQKPFFA 131

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASA------SS 273
           GTYFP E ++G P FK +++ V+ A+ +++         AIE  +E+L A+       +S
Sbjct: 132 GTYFPREPRHGLPAFKQLMQGVERAYREQKT--------AIESQNESLMAALAELEPHAS 183

Query: 274 NKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGE 333
           + LP+   ++A+    +QL  S+D   GGFG APKFP P  + ++L H+     TG    
Sbjct: 184 DALPE---RSAIDAALQQLDTSFDPEHGGFGDAPKFPHPTNLDLLLRHATDAPQTGAPDR 240

Query: 334 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
           ++  +   ++TL+ M +GG+ D +GGGF+RYSVD  W +PHFEKMLYD G L  +  DAF
Sbjct: 241 SALAK--AVWTLERMVRGGLTDQLGGGFYRYSVDALWMIPHFEKMLYDNGPLLALCCDAF 298

Query: 394 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
           ++T+D  +        D++ R+M  P G  +S+ DADS   EG    +EG FYVW  +E+
Sbjct: 299 AVTEDPVFRDAAVMTADWVLREMQSPEGGYWSSLDADS---EG----EEGKFYVWDREEI 351

Query: 454 EDIL--GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPL 511
             +L   E+A  F   Y L    NC+            G+  L       A A  LG+  
Sbjct: 352 RALLAPAEYAP-FAAVYRLDRPANCE------------GRWHLHGYRTPEAVAVDLGLEP 398

Query: 512 EKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNF 571
            +   +L   R  L+  R +R RP  D+KV+ +WN L+I   ARA++             
Sbjct: 399 ARVQALLAAARATLYVARERRVRPGRDEKVLTAWNALMIKGLARAARTF----------- 447

Query: 572 PVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLD 631
                DR +Y+E AE A +FIR  L+ E   RL  ++++G +    +LDDYA L+  LL+
Sbjct: 448 -----DRPDYLESAEQALAFIRGTLWREG--RLLATYKDGTAHLNAYLDDYANLLDALLE 500

Query: 632 LYEFGSGTKW----LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 687
           L +    T+W    L +A+ L     + F D  GGG++ T  +  +++ R K   D A P
Sbjct: 501 LLQ----TRWSRADLDFALALAEVLLDQFEDPIGGGFWFTGRDHETLIHRTKPLGDEAIP 556

Query: 688 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 747
           SGN V+ + L RL  +V   +   Y   AE +L +    ++ M  A   +  A D    P
Sbjct: 557 SGNGVAALALERLGHLVGEPR---YLAAAERTLKLAAESIRRMPYAHATLLFALDEWLDP 613

Query: 748 SRKHVVLVG 756
               V+  G
Sbjct: 614 PETLVIRAG 622


>gi|320101644|ref|YP_004177235.1| N-acylglucosamine 2-epimerase [Isosphaera pallida ATCC 43644]
 gi|319748926|gb|ADV60686.1| N-acylglucosamine 2-epimerase [Isosphaera pallida ATCC 43644]
          Length = 909

 Score =  397 bits (1019), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 254/677 (37%), Positives = 343/677 (50%), Gaps = 75/677 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N LA E S +L +HA  PVDW+ WG+EAFA AR  D P+FLS GY  CHWCHVME E F 
Sbjct: 67  NHLAGETSAHLRRHADTPVDWWPWGDEAFARARAEDKPVFLSSGYLACHWCHVMERECFR 126

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  +A  LN  FV IK+DREERPDVD+ Y+T ++  +G GGWP+S+FL+P+ KP  GGTY
Sbjct: 127 DPAIAARLNRDFVCIKLDREERPDVDQTYLTALRT-FGTGGWPMSIFLTPEGKPFYGGTY 185

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKL--PDEL 280
           FPPED+ G  GF T+L +V  AW + RD + +        +   L   A+S+ L  P  L
Sbjct: 186 FPPEDRPGLTGFSTVLDRVARAWREDRDRIERVAGELDAMVGRILVRRAASSVLGPPPVL 245

Query: 281 PQNALRLCAEQLSKSYDSRFGGFG------SAPKFPRPVEIQMMLYHSKKLEDTGK---- 330
             +    C   L   +D  +GGFG        PKFP P  +  +L     L++  +    
Sbjct: 246 SSDLTDACYLILCGEFDPEYGGFGFDRTNPRRPKFPEPSRLLFLLERHAALKERPRPVKT 305

Query: 331 ---------SGEAS------EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHF 375
                     G A+          M LFTL  +A+GG+ DHVGGG+HRY V   W VPHF
Sbjct: 306 PARSLLMLDPGPAAAPLIRRAPLDMALFTLDRIARGGLRDHVGGGYHRYCVSRFWIVPHF 365

Query: 376 EKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETE 435
           EK LYD  QLA V++ AF LT D  +      I D++ R+M  P G   SA DA+S + +
Sbjct: 366 EKTLYDNAQLARVFVRAFELTGDPRWRDEAEAIFDFVAREMTLPEGGFLSALDAESRDED 425

Query: 436 GATRKKEGAFYVWTSKEVEDILG---EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKN 492
           G      G +Y+WT  +VE  L    E  I+ + +  L+           DP+ E  G+ 
Sbjct: 426 G------GEYYLWTRPQVEQALANPEESRIVLQVYGMLR-----------DPNFE-GGRY 467

Query: 493 VLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISS 552
           VL+E  + S  A  LG+ L +    L   RR+L  VR +RP P  DDK I  WNGL+I++
Sbjct: 468 VLLEPRERSEHARALGLELPELTRRLDAARRRLHQVRDQRPAPRKDDKAIAGWNGLMIAA 527

Query: 553 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP 612
            A A +              V   +R  Y++ A+ AA F       EQ  RL  ++R G 
Sbjct: 528 LAEAGR--------------VCDHNRDRYLKAAQRAAEFAWTQFRREQ-DRLARTWRQGV 572

Query: 613 SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG--GGYFNTTGE 670
           +K  GF +DYAFL  GLL LY      +WL  A  L       F D +   GG F  +  
Sbjct: 573 AKGEGFAEDYAFLAEGLLRLYRADGDPRWLERARRLTERMRHDFGDPDPNRGGLFFASRR 632

Query: 671 DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDM 730
           D  +  R K+  D   PS N+V+   L+ L  +      D   Q  + + A+    L D+
Sbjct: 633 DARLPARFKDPLDSVLPSANAVAARVLIELGRL------DDDPQRYDQAEAILREFLPDL 686

Query: 731 AM---AVPLMCCAADML 744
           A      P+M  A + L
Sbjct: 687 ARRPGVWPMMMVALEEL 703


>gi|85714094|ref|ZP_01045083.1| hypothetical protein NB311A_08058 [Nitrobacter sp. Nb-311A]
 gi|85699220|gb|EAQ37088.1| hypothetical protein NB311A_08058 [Nitrobacter sp. Nb-311A]
          Length = 714

 Score =  397 bits (1019), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 231/608 (37%), Positives = 329/608 (54%), Gaps = 58/608 (9%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRLAAE SPYLLQH HNPV+W+ W  EA AEA++ + PI LSIGY+ CHWCHVM  ESF
Sbjct: 47  ANRLAAETSPYLLQHKHNPVNWWPWVPEALAEAQRSNRPILLSIGYAACHWCHVMAHESF 106

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           EDE VA ++N+ FV IKVDREERPD+D++YM  +  L   GGWPL++FL PD  P  GGT
Sbjct: 107 EDEDVAAVMNELFVCIKVDREERPDIDQIYMNALHHLGEQGGWPLTMFLFPDGSPFWGGT 166

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP    +GRP F  +L+ V   + ++ D +A+     I +LSE   A   +N    EL 
Sbjct: 167 YFPKLPDFGRPAFTDVLQSVARVFREQPDKIARHRDALIARLSERARADNPANIGLAEL- 225

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
            NA  L A+    S D   GG   APKFP+   ++ +     +  D             V
Sbjct: 226 DNAAALIAQ----STDPVHGGLRGAPKFPQCSVLEFLWRAGARTHD-------DHFFAAV 274

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             T+  M++GGI+DH+GGG+ RYSVD++W VPHFEKMLYD  Q+ ++     + +K+  Y
Sbjct: 275 TLTMTRMSQGGIYDHLGGGYARYSVDDKWLVPHFEKMLYDNAQILDLLALDHARSKNPLY 334

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EH 460
                + +D+LRR+M+ P G   S+ DADS   EG    +EG FY+W+ KE+E++LG   
Sbjct: 335 RERATETVDWLRREMLTPAGGFASSLDADS---EG----EEGRFYIWSLKEIEEVLGTTD 387

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
           A  F   Y +   GN            F+G+N+   L     ++         ++  L  
Sbjct: 388 AADFAARYDITANGN------------FEGRNIPNRLRSIEVASDD-----SAHMRAL-- 428

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R KL   R  R RP LDDK++  WNGL+I++   A+ +                 DR +
Sbjct: 429 -REKLLARRESRVRPGLDDKILADWNGLMIAALVHAACVF----------------DRPD 471

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           ++++A +   F+R  +   +  RL HS+R G    P    DYA +    L L+E      
Sbjct: 472 WLQIARAVYDFVRTTM--TRDGRLGHSWREGRLLVPALASDYAAMGRAALALFEATGDND 529

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
            LV A+  Q+T D  + D E GGY+ T  +   +++R     D A P+ + +   NLVRL
Sbjct: 530 CLVQALRWQSTLDTHYADVEHGGYYLTAADAEGLIVRPHSSDDDATPNHDGLIAQNLVRL 589

Query: 701 ASIVAGSK 708
           A++   +K
Sbjct: 590 AALTGDTK 597


>gi|77166007|ref|YP_344532.1| hypothetical protein Noc_2549 [Nitrosococcus oceani ATCC 19707]
 gi|254436399|ref|ZP_05049905.1| conserved hypothetical protein [Nitrosococcus oceani AFC27]
 gi|76884321|gb|ABA59002.1| Protein of unknown function DUF255 [Nitrosococcus oceani ATCC
           19707]
 gi|207088089|gb|EDZ65362.1| conserved hypothetical protein [Nitrosococcus oceani AFC27]
          Length = 694

 Score =  397 bits (1019), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 241/620 (38%), Positives = 342/620 (55%), Gaps = 45/620 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L  + SPYLLQH  NPVDW+ W EEA A A++ D PI LSIGYS CHWCHVM  ESFE
Sbjct: 8   NHLQGQTSPYLLQHVDNPVDWYPWDEEALARAQEEDKPILLSIGYSACHWCHVMAHESFE 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPLSVFLSP-DLKPLMGG 220
           D   A ++N +F++IKVDREERPD+D++Y    Q L G  GGWPL++FL P    P  GG
Sbjct: 68  DSETAAVMNQYFINIKVDREERPDLDQIYQLAQQMLTGRPGGWPLTMFLEPIKQAPFFGG 127

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFPPE+++G PGFK +L++V + +  +R+ +       ++   + L A   + ++ + L
Sbjct: 128 TYFPPEERHGLPGFKDLLQRVAEYFHTRREAIQSQNERLLDAFGD-LDARLPAAEV-EGL 185

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
            +  L+    QL++++DSR GGF  APKFP P  I+  L  ++    T    E  +   M
Sbjct: 186 NRAPLQAAHRQLAQAFDSRHGGFRGAPKFPNPSSIERCLRDARGEHLT--EDEKQQALTM 243

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
              TL+ MA+GGI+D +GGGF RYSVDE W +PHFEKMLYD GQL  +Y DA+ L     
Sbjct: 244 ARLTLEQMAQGGIYDQLGGGFCRYSVDEEWRIPHFEKMLYDNGQLLVLYRDAYRLWGSGL 303

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
           +  I  +   +  R+M  P G  +S+ DADS   EG     EG FYVWT ++V  +LGE 
Sbjct: 304 FRRILEETGHWAVREMQSPEGGYYSSLDADS---EG----HEGKFYVWTREQVRALLGEE 356

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
                  Y+           +  P N F+G   L       A A ++ +P       L  
Sbjct: 357 EYALAARYF----------GLDQPAN-FEGYWHLYAATVPEALAQEMKVPAPGLQEQLTA 405

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            ++KLF  R  R RP  DDK++ +WNGL+I   A A + L           PV       
Sbjct: 406 AKQKLFAAREARIRPGRDDKILTAWNGLMIKGMAAAGQALAQ---------PV------- 449

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           ++  AE A  F+R HL+  Q  RL  S+++G ++  G+LDDYAFL+  LL+L +      
Sbjct: 450 FIASAERAVDFVRAHLW--QKGRLLVSYKDGRAQHRGYLDDYAFLLDALLELLQVRWRDG 507

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
            L +A++L     E F D+  GG++ T  +   ++ R     D A P+GN V   +L+RL
Sbjct: 508 DLSFAVDLAEAVLERFEDKAQGGFYFTADDHEILIHRPVPLMDDATPAGNGVLAWSLLRL 567

Query: 701 ASIVAGSKSDYYRQNAEHSL 720
             ++   +   Y + AE +L
Sbjct: 568 GHLLGEVR---YLKAAESTL 584


>gi|312115384|ref|YP_004012980.1| hypothetical protein Rvan_2669 [Rhodomicrobium vannielii ATCC
           17100]
 gi|311220513|gb|ADP71881.1| hypothetical protein Rvan_2669 [Rhodomicrobium vannielii ATCC
           17100]
          Length = 685

 Score =  397 bits (1019), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 238/667 (35%), Positives = 356/667 (53%), Gaps = 74/667 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL+ E SPYL QH HNPV+W+ W +EAF EA++ D P+ LS+GY+ CHWCHVM  ESFE
Sbjct: 4   NRLSEETSPYLQQHKHNPVEWWPWCQEAFEEAQRLDKPVLLSVGYAACHWCHVMAHESFE 63

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
            E  A+L+N  F++IKVDREERPDVD +YMT +Q L   GGWPL++FL+PD  P  GGTY
Sbjct: 64  KEDTAELMNRLFINIKVDREERPDVDTLYMTALQELGEQGGWPLTMFLTPDGMPFFGGTY 123

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP + ++G+P FK +L  V   + ++++ +AQ+ A+  ++L+  L+  A+      E  +
Sbjct: 124 FPDKSRFGKPSFKDVLVNVARVYAQEKETIAQNTAYLKQRLTPRLNYGAAP-----EFSE 178

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-----YHSKKLEDTGKSGEASEG 337
             L   A +   + D   GG   APKFP     Q +      Y+ K   +  K+      
Sbjct: 179 EQLAAIAAKFIGAIDPTNGGLRGAPKFPNTTIFQFLWRAGLRYNLKTCIEEVKN------ 232

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
                 TL  + +GGI+DH+GGGF RY+VDERW VPHFEKMLYD   L     + +  T+
Sbjct: 233 ------TLLHICQGGIYDHLGGGFSRYTVDERWLVPHFEKMLYDNALLIEFMTEVWKETQ 286

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
                    + + +L+RDMI PGG   ++ DADS   EG    +EG FYVWT++E+ DIL
Sbjct: 287 SDRLKTRVAETIGWLKRDMIVPGGAFAASYDADS---EG----EEGKFYVWTAREITDIL 339

Query: 458 --GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
             GE A +F + Y +   GN            ++GK +L  L     + + L    E+ +
Sbjct: 340 GHGEEAAIFAQTYDVTEGGN------------WEGKTILNRLK----ALALLNGGEERAM 383

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
           +   ECR KLF  R +R +P  DDKV+  WNGL I + ARA                   
Sbjct: 384 D---ECRAKLFAERERRVKPGWDDKVLADWNGLAIRALARAGDAFA-------------- 426

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
             + +++ +A  A  F++  +   +  RL HS+R+G  K P    DYA +IS  L L++ 
Sbjct: 427 --QPDWIVLAADAYGFVKSRMI--ENGRLFHSWRDGKLKGPATAADYANIISAALVLHQV 482

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
               ++L  A+E     +  + D E GGY+    +   ++LR     D A P+ N+  + 
Sbjct: 483 TGEPRYLDDAVEWTAIMNRHY-DAEQGGYYFAADDTSDLILRPLSASDDAVPNANATMLQ 541

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 755
           NL  L ++   +    Y + A+  L  F+   + MA+    +   A  L++ S + + + 
Sbjct: 542 NLADLYTLTGDAA---YLKRADGLLTAFQGAAQTMAIGYTGLLSGA--LTLISPQSIAIA 596

Query: 756 GHKSSVD 762
           G ++  D
Sbjct: 597 GDRAGPD 603


>gi|347735180|ref|ZP_08868108.1| hypothetical protein AZA_58766 [Azospirillum amazonense Y2]
 gi|346921671|gb|EGY02301.1| hypothetical protein AZA_58766 [Azospirillum amazonense Y2]
          Length = 686

 Score =  396 bits (1018), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 245/682 (35%), Positives = 348/682 (51%), Gaps = 61/682 (8%)

Query: 93  STSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHW 152
           + S +     N L  E SPYLLQH  NPV W AWG EAFAEA+    PI LS+GY+ CHW
Sbjct: 2   AASDTTQAAENLLVHETSPYLLQHKDNPVHWRAWGPEAFAEAQAAGKPILLSVGYAACHW 61

Query: 153 CHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP 212
           CHVM  ESFE++ ++ L+ND F++IKVDREERPDVD+VY   +  L   GGWPL++FL+P
Sbjct: 62  CHVMAHESFENQAISSLMNDLFINIKVDREERPDVDQVYQQALSLLGQQGGWPLTMFLTP 121

Query: 213 DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASAS 272
             +P  GGTYFPP  +YGRPGF  +L+ V + + +    ++++    ++ L +AL+  + 
Sbjct: 122 KGEPFWGGTYFPPATRYGRPGFPDVLQGVAETYAQDPGKVSRN----VKALGDALARLSR 177

Query: 273 SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSG 332
            N   D +   +L   A++L +  D   GG   APKFP+P    ++     +   T    
Sbjct: 178 GNP-GDAVTVGSLNAVADRLVREVDPFLGGINGAPKFPQPSIFDLLWRAHLRTART---- 232

Query: 333 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 392
              + +  V+ TL  MA GGI+DH+ GGF RYS DE+W VPHFEKMLYD  QL  +    
Sbjct: 233 ---DLRDAVITTLTHMANGGIYDHLAGGFARYSTDEQWLVPHFEKMLYDNAQLVALMTQV 289

Query: 393 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 452
           +  T+D       R+ + ++  +M  PGG   +  DADS   EG    +EG FYVWT  E
Sbjct: 290 WQGTRDPLLEVRVRETVGWVLNEMKVPGGAFGATLDADS---EG----EEGRFYVWTKAE 342

Query: 453 VEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 512
           ++ +LGE A LF  HY +   GN            ++G  +   LN  +  A     P  
Sbjct: 343 IDRLLGEDAELFCAHYDVTELGN------------WEGHTI---LNRRTPLA-----PGS 382

Query: 513 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA--ESAMFN 570
              N L   R +L   R+ R RP  DDKV+  WNGL+I++ ARA  + +     E+A+  
Sbjct: 383 AEENRLAHARARLLKARALRIRPGWDDKVLADWNGLMIAALARAGFVFEQPGWIEAAI-- 440

Query: 571 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLL 630
                     Y  V  S       H   +   RL HS R G ++  G L+DYA +    L
Sbjct: 441 --------DAYRHVVTSLG-----HTGRDGLDRLYHSGRGGRARHAGLLEDYANMGKAAL 487

Query: 631 DLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGN 690
            L+E      +L  A    +T D  F D   GGY+ T  +   +L+R +   D A P+GN
Sbjct: 488 TLHEITGDVAFLDQAARWTDTLDRHFWDAADGGYYTTADDVGDLLVRPRHAQDNAVPAGN 547

Query: 691 SVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRK 750
              + NL RL  +   +  D YR  A+  ++ F   L      +      A+ L   +  
Sbjct: 548 GTQLGNLTRLWLL---TGQDRYRAQADTLMSAFSGELGRNFFPLSTFLNMAETLL--NGM 602

Query: 751 HVVLVGHKSSVDFENMLAAAHA 772
           H VLVG    ++  N +  A +
Sbjct: 603 HAVLVGEGDDLEPFNAVLRAQS 624


>gi|300024782|ref|YP_003757393.1| hypothetical protein Hden_3279 [Hyphomicrobium denitrificans ATCC
           51888]
 gi|299526603|gb|ADJ25072.1| protein of unknown function DUF255 [Hyphomicrobium denitrificans
           ATCC 51888]
          Length = 678

 Score =  396 bits (1018), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 232/609 (38%), Positives = 337/609 (55%), Gaps = 57/609 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYLLQH  NPV W+AWG EA AEA++   PI LS+GY+ CHWCHVM  ESFE
Sbjct: 4   NRLQYETSPYLLQHKDNPVHWWAWGPEALAEAKRTGKPILLSVGYAACHWCHVMAHESFE 63

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D G A+++N++F++IKVDREERPD+D +YM  +  L   GGWPL++FL  D KP  GGTY
Sbjct: 64  DPGTAEVMNEFFINIKVDREERPDIDAIYMGALHQLGEQGGWPLTMFLDSDAKPFWGGTY 123

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP E +YGRP F T+L ++ +A+  +RD +  +     E L  AL  +   N  P + P+
Sbjct: 124 FPREARYGRPAFVTVLLRIAEAYANQRDDVRNN----TEALLAALKTAPGDNA-PRQ-PR 177

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
            A    A  +S++ D  +GG   APKFP+   I  +L+        G   + ++ +  V+
Sbjct: 178 PATEDVAAAISRAVDREYGGLSGAPKFPQ-WSIFWLLWR------VGIRDDNADAKNGVI 230

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL+ + +GGI+DH+GGGF RYSVDE W VPHFEKMLYD   L ++  + +  T+D  + 
Sbjct: 231 TTLRHICQGGIYDHLGGGFSRYSVDEYWLVPHFEKMLYDNALLIDLMTEVWRETQDPLFK 290

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHA 461
               + + ++ R+MIG  G   ++ DADS   EG    +EG FYVW + E+ED+LG E A
Sbjct: 291 TRVAETIAWIEREMIGEAGGFAASLDADS---EG----EEGKFYVWNADEIEDVLGAEDA 343

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
             F   Y + P GN            F+G  +L  L         L    E+    L   
Sbjct: 344 AFFSRVYGVVPGGN------------FEGHTILNRLG-------SLAFLSEEDEARLTSL 384

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           R KL + R+ R RP  DDK++  WNGL I++ +RA+ +L+  A                +
Sbjct: 385 RAKLLERRASRIRPGWDDKILADWNGLAIAAISRAAIVLEQPA----------------W 428

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
           + +AE A S I   L      RL H++R+G +KAP    DYA +    + L+      ++
Sbjct: 429 LALAERAFSAITTKLA-ASDGRLFHAYRSGLAKAPATASDYANMTWAAIRLFTATGSERY 487

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           L  A +     D+ + D + GGYF    +   V++R+K   D A P+ N++ + NL+ LA
Sbjct: 488 LDQAQQWTRILDKHYWDEDRGGYFTAADDTLDVVVRLKSATDDAAPNANAIQLSNLIALA 547

Query: 702 SIVAGSKSD 710
           ++   +  D
Sbjct: 548 ALTGDAAYD 556


>gi|448738600|ref|ZP_21720623.1| hypothetical protein C451_13731 [Halococcus thailandensis JCM
           13552]
 gi|445801484|gb|EMA51818.1| hypothetical protein C451_13731 [Halococcus thailandensis JCM
           13552]
          Length = 709

 Score =  396 bits (1018), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 231/605 (38%), Positives = 321/605 (53%), Gaps = 44/605 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV W  W ++A   AR+RDVPIFLSIGYS CHWCHVM  ESF+
Sbjct: 6   NRLDEEASPYLRQHADNPVHWQPWDDDALDAARERDVPIFLSIGYSACHWCHVMADESFD 65

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  VA+ LN+ FV IKVDREERPD+D++Y T    + G GGWPLSV+L+PD +P   GTY
Sbjct: 66  DPAVAEQLNEEFVPIKVDREERPDLDRLYQTVAAMVSGRGGWPLSVWLTPDGRPFYVGTY 125

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS-ASSNKLPDELP 281
           FP E K G+PGF  +L  + D+W+ +R+ +        +Q ++A++     +   P E+ 
Sbjct: 126 FPREAKRGQPGFLDLLDSIADSWNDEREDIESRA----DQWADAMAGELEGTPDTPGEVS 181

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
              L   A++     D   GGFG   KFP+   + +++   +  E TG+       +++ 
Sbjct: 182 PGLLETAAQRAVSEADREHGGFGRGQKFPQTGRLHLLM---QAHERTGRDA----FREVA 234

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
           +  L  +A GG+ DH GGGFHRY  D  W VPHFEKMLYD  +L   YL  + LT +  Y
Sbjct: 235 VEALDAIADGGLRDHAGGGFHRYVTDREWTVPHFEKMLYDNAELVRAYLAGYRLTGEERY 294

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH- 460
           + I R+ L ++ R++  P G  FS  DA S     +   +EGAFYVWT +EV + + +  
Sbjct: 295 AEIARETLGFVERELRHPDGGFFSTLDAQSEGE--SGEHEEGAFYVWTPQEVHEAVDDEF 352

Query: 461 -AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
            A LF E Y +   GN +            GK VL         A + G   E+    L 
Sbjct: 353 AADLFCERYGITEAGNFE-----------NGKTVLTIDTTIDGLADEHGTTTEEIEADLE 401

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
             R  +F  R+ R RP  D+K++  WNGL+IS+FA A   L                  +
Sbjct: 402 RAREAIFAARADRERPARDEKILAGWNGLMISAFAEAGLALD-----------------E 444

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
            Y E A +A  F+   L+DE   +L   F++G  K  G+L+DYAFL  G L+ YE     
Sbjct: 445 TYSETAVAALGFVHEQLWDEDEQQLARRFKDGEVKIDGYLEDYAFLARGALNCYEATGEV 504

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
             L +A++L       F D E G  + T     S++ R +E  D + PS   V+V  L+ 
Sbjct: 505 AQLEFALDLGRAIVREFFDGEEGTLYFTPRSGESLVARPQELDDQSTPSSTGVAVDTLLA 564

Query: 700 LASIV 704
           L+   
Sbjct: 565 LSQFA 569


>gi|339325405|ref|YP_004685098.1| hypothetical protein CNE_1c12630 [Cupriavidus necator N-1]
 gi|338165562|gb|AEI76617.1| hypothetical protein CNE_1c12630 [Cupriavidus necator N-1]
          Length = 666

 Score =  396 bits (1018), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 258/687 (37%), Positives = 353/687 (51%), Gaps = 87/687 (12%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRLA E SPYL QHA NPVDW+ W EEAF  AR  D P+ LS+GY+TCHWCHVM  ESF
Sbjct: 2   TNRLATETSPYLRQHADNPVDWYPWCEEAFRRARDDDKPVLLSVGYATCHWCHVMAHESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           E+  +A L+N+ F+SIKVDR+ERPD+D +Y    Q +  GGGWPL+VFL+P  +P  GGT
Sbjct: 62  ENPRIAALMNERFISIKVDRQERPDLDDIYQKVPQLMGQGGGWPLTVFLTPQGEPFYGGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL----------SASA 271
           YFPP+D+YGRPG   +L  + +AW  +R  L  +    IEQ  +              + 
Sbjct: 122 YFPPDDRYGRPGLPRVLLSLSEAWRHRRQELRDT----IEQFQQGFRHLDEGVLSREDAE 177

Query: 272 SSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKS 331
            + ++ D   Q AL      L+++ D   GG G APKFP      ++L   ++  +    
Sbjct: 178 QAAEVQDLPAQTAL-----ALARNTDPTHGGLGGAPKFPNASAYDLVLRICQRTHEPALL 232

Query: 332 GEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLD 391
                       TL  MA GGIHD +GGGF RYSVDERW VPHFEKMLYD GQL  +Y +
Sbjct: 233 DALER-------TLDGMAAGGIHDQLGGGFSRYSVDERWAVPHFEKMLYDNGQLVTLYAN 285

Query: 392 AFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSK 451
           A+ LT    +  +    + Y+ RDM  P G   + EDADS   EG    +EG FYVWT+ 
Sbjct: 286 AYRLTGKQAWRRVFEGTIAYILRDMTHPDGGFHAGEDADS---EG----EEGRFYVWTAA 338

Query: 452 EVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 510
           EV+ +LGE    L    Y +   GN +            G++VL         A  L  P
Sbjct: 339 EVKAVLGESEGALACRAYGVTEGGNFE-----------PGRSVL-------HRAVTL-TP 379

Query: 511 LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
           LE+    L   R +L   R++R RP  DD ++  WNGL+I     A +   + A      
Sbjct: 380 LEE--ARLEGWRERLLAARARRVRPGRDDNILAGWNGLMIQGLCAAYQATGNPA------ 431

Query: 571 FPVVGSDRKEYMEVAESAASFIRRHLY--DEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 628
                     ++  A  AASF++  L   D   +R    ++NG  K PGFL+DYAFL + 
Sbjct: 432 ----------HLAAARRAASFVQDKLTMPDGGVYRY---WKNGTVKVPGFLEDYAFLANA 478

Query: 629 LLDLYEFGSGTKWLVWAIELQNTQDELFLDR-EGGGYFNTTGEDPSVLLRVKEDHDGAEP 687
           L+DLYE     ++L  A EL      L +DR  G G + T  +   ++ R +  +DGA P
Sbjct: 479 LIDLYESCFDRRYLDRAAELVT----LIIDRFRGDGLYFTPNDGEPLIHRPRGPYDGAWP 534

Query: 688 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 747
           SG S SV   +RL  +   +  D YR  AE     +             +  AAD     
Sbjct: 535 SGISASVFAFLRLHEL---TGEDRYRDLAEQEFQRYRAAATAAPAGFVHLLAAADFAQRG 591

Query: 748 SRKHVVLVGHKSSVDFENMLAAAHASY 774
           +   ++L G K++     ++ + H +Y
Sbjct: 592 AFG-IILAGDKAAA--AALVESVHRTY 615


>gi|386826330|ref|ZP_10113437.1| thioredoxin domain-containing protein [Beggiatoa alba B18LD]
 gi|386427214|gb|EIJ41042.1| thioredoxin domain-containing protein [Beggiatoa alba B18LD]
          Length = 700

 Score =  396 bits (1017), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 239/677 (35%), Positives = 355/677 (52%), Gaps = 51/677 (7%)

Query: 94  TSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWC 153
           ++ S   H+N L  E SPYL QHA+NPV W+ WGEEA   AR++D PI LS+GYS CHWC
Sbjct: 2   SATSETVHSNALIHETSPYLQQHANNPVHWYPWGEEALRLAREQDKPILLSVGYSACHWC 61

Query: 154 HVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALY-GGGGWPLSVFLSP 212
           HVM  ESFED   A+++N+ F++IKVDREERPD+DK+Y    Q L    GGWPL++FL+P
Sbjct: 62  HVMAHESFEDPETAQVMNELFINIKVDREERPDLDKIYQMAHQILTRRAGGWPLTMFLTP 121

Query: 213 DLK-PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLA---QSGAFAIEQLSEALS 268
           D   P  GGTYFP E ++  P FK IL +V + + + R  +    Q  A AIE      +
Sbjct: 122 DAHYPFFGGTYFPKEPRFNLPAFKNILYRVAEFYRQNRHGIVEQCQQLAQAIEYHDTPRT 181

Query: 269 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 328
              S   +  EL    L    +Q+ +S+DS +GGF  APKFP    ++ + +H       
Sbjct: 182 EGVSITTISPEL----LNTARQQIEQSFDSEWGGFSKAPKFPHLTNVERLFHHYHITAHQ 237

Query: 329 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 388
               E  +G ++ + TL  MA GGI+D VGGGF RYSVD+ W +PHFEKMLYD      +
Sbjct: 238 ENPDE--DGLQIAMHTLTRMALGGIYDQVGGGFCRYSVDDYWMIPHFEKMLYDNAPFLTI 295

Query: 389 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 448
           Y +A+ L K   Y  + +   D++ R+M    G  +S  DADS   EG     EG FYVW
Sbjct: 296 YSEAWQLAKIPLYKQVAQATADWVLREMQLSEGGFYSTLDADS---EGV----EGKFYVW 348

Query: 449 TSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 507
           T +E++ +L  E    F   + L    N + +              L   +D  A A K 
Sbjct: 349 TPEEIKGLLSPELYAPFAYQFGLNRPANFEETHWH-----------LFGWHDREAVAVKF 397

Query: 508 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 567
            + LE+    L +    LF  R +R  P  D+K++ +WNG++I + A A +I K      
Sbjct: 398 DLSLEEVNARLDKALAILFQAREQRVHPQRDEKILTAWNGMMIKALATAGRIFK------ 451

Query: 568 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 627
                     R +Y+  AE + +FIR  L+  +  +L  ++++G +    +LDDYAFLI 
Sbjct: 452 ----------RTDYIHAAEQSLNFIRSTLW--KNGKLLATYKDGKAHLNAYLDDYAFLIE 499

Query: 628 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 687
           G+L L +         + +EL +     F D+E GG+F T      ++ R+K   D A P
Sbjct: 500 GILTLLQCRWNNSDYAFMLELVDVLLHEFEDKEKGGFFFTGNHHEQLIARLKPLADEAIP 559

Query: 688 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 747
           SGN V+ + L RL  ++    +D Y + A  ++ +    ++ +A A   +  A +    P
Sbjct: 560 SGNGVAAVVLGRLGHLLG---NDEYLRAAARTVNIALPAIEQIAYAHNTLLLAVEDYLFP 616

Query: 748 SRKHVVLVGHKSSVDFE 764
            +  ++    K   +++
Sbjct: 617 PQLIIIRADAKHLAEWQ 633


>gi|258405434|ref|YP_003198176.1| hypothetical protein Dret_1310 [Desulfohalobium retbaense DSM 5692]
 gi|257797661|gb|ACV68598.1| protein of unknown function DUF255 [Desulfohalobium retbaense DSM
           5692]
          Length = 615

 Score =  396 bits (1017), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 233/616 (37%), Positives = 331/616 (53%), Gaps = 45/616 (7%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRLA   SPYL QHA NPV W  W ++A A A +   PIFLSIGY+TCHWCHVME E F
Sbjct: 6   VNRLAESGSPYLEQHAGNPVAWQPWDDQALATAHRLQRPIFLSIGYATCHWCHVMERECF 65

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           ED  VA +LN   V IKVDREERPD+D  YM+  QAL G GGWPL++FL+PD +P    T
Sbjct: 66  EDTEVAHILNTVCVPIKVDREERPDLDTFYMSCCQALSGRGGWPLNLFLTPDGRPFFAAT 125

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           Y P + ++ +PG   +L  V++ W + R+ + QS    +  + +  S S+        LP
Sbjct: 126 YIPKQSRFSQPGLLDLLVSVQEDWVRNREQIEQSATRLVSHIHDLFSDSSGP------LP 179

Query: 282 QNAL-RLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
           +NA+     ++L +++D  FGGFG APKFP P  +  +L      +D            M
Sbjct: 180 ENAIFEQAVQELRQNHDDDFGGFGKAPKFPTPHVLLFLLRLYDLSQDRSLL-------NM 232

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
           V  TL+ + +GGI DH+GGGFHRYS D  WH+PHFEKMLYDQ  L     +  + T+   
Sbjct: 233 VDSTLEAICRGGIRDHIGGGFHRYSTDRAWHLPHFEKMLYDQALLLMALAEGHARTRRDL 292

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
           +      + +Y+   +    G ++  EDAD   TEG    +EGAFY WT  E+E  L   
Sbjct: 293 FRREAVAVAEYMLERLHDGDGGLYCGEDAD---TEG----EEGAFYQWTETELEAALPPD 345

Query: 461 AI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
              + +    ++  GN     + +   +  GKNVL  + D++ +A +LG+  E+      
Sbjct: 346 TFRVVQTVAGIRSDGNI----LDEATRQRTGKNVLARVADTADAAERLGLSEEQVRLEWH 401

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
                L  +R++RP+P LDDK + SWNGL +++ AR+  +L  E                
Sbjct: 402 RAMATLGGLRAQRPQPFLDDKQLTSWNGLAVAALARSGILLGEE---------------- 445

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
             +  A   A ++   +  E   RL H  RN  +  PGFL+DYA+ I GLL+L +   G 
Sbjct: 446 HLIAAARETADWVLETMQPEPG-RLWHRARNRHAGIPGFLEDYAYFIWGLLELVQTSEGQ 504

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
            +   A+ L +T    F D + GG+F T       LLR+K+  D A PS N+V + NLVR
Sbjct: 505 DYRRIALRLADTVLSEFADLKEGGFFQTHAAAQEPLLRLKKVFDDALPSENAVMLYNLVR 564

Query: 700 LASIVAGSKSDYYRQN 715
           L    +G  +D  R++
Sbjct: 565 LYG--SGPTNDCARKH 578


>gi|189424638|ref|YP_001951815.1| hypothetical protein Glov_1579 [Geobacter lovleyi SZ]
 gi|189420897|gb|ACD95295.1| protein of unknown function DUF255 [Geobacter lovleyi SZ]
          Length = 610

 Score =  395 bits (1016), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 236/606 (38%), Positives = 329/606 (54%), Gaps = 66/606 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL    SPYLLQH+ NPVDW  WG  A  EA++R++P+F+SIGY+TCHWCHVM  ESFE
Sbjct: 26  NRLIFSRSPYLLQHSRNPVDWREWGPAAQKEAQERNLPLFVSIGYATCHWCHVMAHESFE 85

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+ VA +LN  FV +KVDREERPD+D+  M   Q+L   GGWPL+ FL PD  P    TY
Sbjct: 86  DDEVADILNHAFVPVKVDREERPDLDEFCMAACQSLTNSGGWPLNCFLKPDGTPFYALTY 145

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD--EL 280
            P E K G PGF  +L  +   W  K++ + ++    +E L + ++A+      PD  EL
Sbjct: 146 LPKEPKRGMPGFLELLENIARVWQHKQEAVERNARSLMEALGQ-MAAAPVQTTAPDLKEL 204

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
             +A+      L K +D R+ GFG APKFP P  +  +L    ++E           Q++
Sbjct: 205 ADSAV----ATLRKIHDPRYHGFGKAPKFPMPPYLLFLLGRDNRIE-----------QEL 249

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
            L TLQ M +GGI D +GGG HRYS D+ W VPHFEKMLYDQ  +A   L A++LTK+  
Sbjct: 250 ALNTLQAMRQGGIWDQLGGGIHRYSTDQHWLVPHFEKMLYDQALVAYTALKAYALTKENR 309

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
           Y  +  ++L+++  ++  P G  +   DADS   EG    +EGA YVW  +E+E ILG+ 
Sbjct: 310 YLEMADNLLEFVLAELTAPEGGFYCGLDADS---EG----REGACYVWKKQELEQILGDQ 362

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
           A  F ++Y +   GN           E  G+NVL +   ++   + +             
Sbjct: 363 AAFFCQYYGVTEQGNF----------EEPGENVLFQALPAAEEPAAIKA----------- 401

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
             +KL  VR+ R +P  D KV+  WNGL+I++ AR + +                ++ + 
Sbjct: 402 AGQKLLQVRAMRQQPLRDLKVLSGWNGLMIAALARGAAL----------------TNNRR 445

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           ++E A  AA+FI   L      RL  S+   PS   GFL+DYAFL  G L+L++ G    
Sbjct: 446 WLEAARRAATFISSAL-TRADGRLLRSWCGTPSTIAGFLEDYAFLGWGYLELFKAGGDAA 504

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVR 699
            L  A +L   +D L L R       T G D   L L + ++HDG  PSG +  V+NLV 
Sbjct: 505 DLATAEQL--CRDALHLFRTEDERLVTAGNDQEQLPLALSDNHDGVIPSGPAALVMNLVA 562

Query: 700 LASIVA 705
           LA   A
Sbjct: 563 LAKCTA 568


>gi|448608928|ref|ZP_21660207.1| hypothetical protein C440_00355 [Haloferax mucosum ATCC BAA-1512]
 gi|445747305|gb|ELZ98761.1| hypothetical protein C440_00355 [Haloferax mucosum ATCC BAA-1512]
          Length = 702

 Score =  395 bits (1016), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 244/663 (36%), Positives = 345/663 (52%), Gaps = 73/663 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QH  NPV+W  W E A   AR++D PIFLSIGYS CHWCHVM  ESF 
Sbjct: 8   NRLDEEQSPYLRQHVDNPVNWQPWDEAALDAAREQDKPIFLSIGYSACHWCHVMADESFS 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  +A++LN+ F+ +KVDREERPD+D++Y T  Q + GGGGWPLSV+L+P  KP   GTY
Sbjct: 68  DPEIAEVLNEHFIPVKVDREERPDLDRIYQTICQLVTGGGGWPLSVWLTPQGKPFFVGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDML---AQSGAFAI-EQLSEA--LSASASSNKL 276
           FPPE + G PGF+ ++    + W   RD +   A+    AI ++L E       A  +++
Sbjct: 128 FPPEPRRGAPGFRDLVESFAETWQTDRDEIENRAEQWTHAITDRLEETPDTPGEAPGSEI 187

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
            D+  Q ALR                    PKFP+P  I  +L   +    TG+     E
Sbjct: 188 LDQTVQAALRAADRDDGGFG--------GGPKFPQPGRIDAIL---RGYAITGR----RE 232

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
              + +  L  MA GG+ DH+GGGFHRY VD+ W VPHFEKMLYDQ  LA  YLDA+ LT
Sbjct: 233 ALDVAVEALDAMANGGLRDHLGGGFHRYCVDKDWTVPHFEKMLYDQAGLAARYLDAYRLT 292

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
            +  Y+ + R+  +++RR++    G  F+  DA S         +EG FYVWT + V   
Sbjct: 293 GNESYAAVARETFEFVRRELSHDDGGFFATLDAQS-------DGEEGTFYVWTPEAVRSH 345

Query: 457 LGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS-SASASKLGMPLEKY 514
           L E  A LF + Y + P GN            F+ K  ++ ++ + S  A++  +  ++ 
Sbjct: 346 LPELEADLFCDRYGVTPGGN------------FENKTTVLNVSATLSDLAAEYDLSEDEV 393

Query: 515 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 574
            + L E ++ LF  R+ R RP  D+KV+  WNGL+IS+FA+ +  L+ ++ +A       
Sbjct: 394 EDHLEEAKKTLFAARADRERPARDEKVLAGWNGLMISAFAQGAVALEDDSLAAD------ 447

Query: 575 GSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE 634
                     A  A  F+R HL+DE +  L     NG  K  G+L+DYAFL  G  DLY+
Sbjct: 448 ----------ARRALDFVREHLWDEASETLSRRVMNGEVKGDGYLEDYAFLARGAFDLYQ 497

Query: 635 FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
                + L +AI+L    +  F D   G  + T     +++ R +E  D + PS   V+ 
Sbjct: 498 ATGDLEPLSFAIDLARATNREFYDAAAGTLYFTPESGEALVTRPQEATDQSTPSSLGVAT 557

Query: 695 INLVRL------------ASIVAGSKSDYYRQNA-EHSLAVFETRLKDMAMAVPLMCCAA 741
              + L            A  V  S ++  R +  EH   V  T  +  A  VP +  AA
Sbjct: 558 SLFLDLEHFAPDAGFGEAADAVLESYANRIRGSPLEHVSLVLAT--EKAASGVPELTAAA 615

Query: 742 DML 744
           D +
Sbjct: 616 DEM 618


>gi|288941778|ref|YP_003444018.1| hypothetical protein Alvin_2064 [Allochromatium vinosum DSM 180]
 gi|288897150|gb|ADC62986.1| protein of unknown function DUF255 [Allochromatium vinosum DSM 180]
          Length = 688

 Score =  395 bits (1016), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 257/665 (38%), Positives = 354/665 (53%), Gaps = 57/665 (8%)

Query: 90  TPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYST 149
           +P+  +H   + TNRLA+  SPYL QHAHNPVDW+ W  EA A AR+ D PI LSIGYS 
Sbjct: 2   SPSIHAHDVQR-TNRLASATSPYLQQHAHNPVDWWPWCAEALALARELDRPILLSIGYSA 60

Query: 150 CHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQAL-YGGGGWPLSV 208
           CHWCHVM  ESFED   A+ +N  FV+IKVDREERPD+DKVY T  Q L    GGWPL+V
Sbjct: 61  CHWCHVMAHESFEDPATAERMNRLFVNIKVDREERPDLDKVYQTAHQLLSQRAGGWPLTV 120

Query: 209 FLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQ---LS 264
           FL+P D  P   GTYFP E ++G P F  +L  V+ A+ ++       GA   EQ   L 
Sbjct: 121 FLTPDDHTPFFAGTYFPREPRHGLPSFTQLLVGVERAYREQ-------GAAIREQNRSLL 173

Query: 265 EALSASASSNKLPDELPQNALRLCA-EQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK 323
           EAL+          ELP+  L   A  QL+ S+D+  GGFG APKFP   +++++L    
Sbjct: 174 EALAGLEPQGGA--ELPEAGLLEAAFHQLALSFDAEHGGFGRAPKFPHATDLELLLRRQA 231

Query: 324 KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQG 383
           +L   G   +      M  FTL+ M +GG+ D +GGGF RYSVD+ W +PHFEKMLYD G
Sbjct: 232 RLAANGGDPDPRP-LHMAGFTLERMIRGGLTDQLGGGFCRYSVDDEWMIPHFEKMLYDNG 290

Query: 384 QLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEG 443
            L  +  DAFS T +  +        D++ R+M  P G  +S  DADS   EG     EG
Sbjct: 291 PLLALCCDAFSATGESIFRDAALATADWVMREMQSPEGGYYSTLDADS---EG----HEG 343

Query: 444 AFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 503
            FYVW    V      HA L    Y L       +  +  P N F+G+  L      + +
Sbjct: 344 TFYVWDRDAV------HARLSAAEYPLFAA----VYGLDRPPN-FEGRWHLHGYRTPTQA 392

Query: 504 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 563
           A  LG+ L +   +L   R  LF  R +R  P  D+K++ +WN L+I   ARA+++L   
Sbjct: 393 AESLGLNLPQAEALLASARATLFSAREQRVHPGRDEKILTAWNALMIKGMARAARVL--- 449

Query: 564 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 623
                        DR +Y+E AE A +FIR  L+ +   RL  + ++G +    +LDDYA
Sbjct: 450 -------------DRPDYLESAEQALAFIRSTLWHDG--RLLATCKDGVAHLNAYLDDYA 494

Query: 624 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 683
            LI  LL+L +    +  L +A+EL     + F D E GG++ T      ++ R K   D
Sbjct: 495 NLIDALLELLQVRWSSADLAFAVELAEVLLDEFHDAERGGFWFTGRSHEPLIHRAKPLGD 554

Query: 684 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA-VPLMCCAAD 742
            + P+GN V+ + L RL  ++   +   Y + A+ +L +    ++ M  A   L+    D
Sbjct: 555 DSMPAGNGVAALALQRLGHLIGEVR---YLEAADGTLRLAAESMRRMPHAHASLLMALDD 611

Query: 743 MLSVP 747
            L  P
Sbjct: 612 WLDPP 616


>gi|354612894|ref|ZP_09030833.1| thioredoxin domain protein [Saccharomonospora paurometabolica YIM
           90007]
 gi|353222771|gb|EHB87069.1| thioredoxin domain protein [Saccharomonospora paurometabolica YIM
           90007]
          Length = 667

 Score =  395 bits (1015), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 251/678 (37%), Positives = 349/678 (51%), Gaps = 76/678 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   SPYLLQHA NPVDW+ W  EA AEAR+RDVPI LSIGY+ CHWCHVM  ESF 
Sbjct: 2   NRLATATSPYLLQHADNPVDWWPWCPEALAEARQRDVPILLSIGYAACHWCHVMAHESFS 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D   A  +N+ FV+IKVDREERPD+D VYMT  QA+ G GGWP++ FL+PD +P   GTY
Sbjct: 62  DADTAAYMNEHFVNIKVDREERPDIDAVYMTATQAMTGQGGWPMTCFLTPDGEPFHCGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           +PP  K+G P F  +L  V  AW ++RD L +     +  ++E       S    DE   
Sbjct: 122 YPPVSKHGLPSFVQVLTAVTQAWTERRDELVEGAGRIVTHIAE--QTGPLSEHPVDE--- 176

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
            AL     +L +  D   GGFG+APKFP  + ++ +L H ++   TG    ++E   +V 
Sbjct: 177 QALSSAVAKLRQEADPANGGFGTAPKFPPSMVLEFLLRHHER---TG----SAEALSLVE 229

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L   Y      T     +
Sbjct: 230 LTAERMARGGIYDQLGGGFARYSVDVAWVVPHFEKMLYDNALLLRAYAHLARRTGSAIAT 289

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHA 461
            +  +  ++L RD+    G   ++ DAD+   EG T       YVWT +++ ++LG E  
Sbjct: 290 RVAGETAEFLLRDLRTAEGGFAASLDADTDGVEGLT-------YVWTPEQLVEVLGPEDG 342

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
               E + +   G  +           KG + L   +D    A        ++L +    
Sbjct: 343 AWAAELFGVTEEGTFE-----------KGASTLRLPHDPDDPA--------RWLRV---- 379

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
              LF  R  RP+P  DDKVI +WNGL I++ A A   L+                R E+
Sbjct: 380 STALFQARGTRPQPARDDKVIAAWNGLAITALAEAGTALR----------------RPEW 423

Query: 582 MEVAESAASF-IRRHLYDEQTHRLQHSFRNGP-SKAPGFLDDYAFLISGLLDLYEFGSGT 639
           ++ A SA ++ + RHL D    RL+ S RNG    A G L+D+  L  GLL L++    +
Sbjct: 424 VDAAVSAGAYLLDRHLVD---GRLRRSSRNGEVGAANGVLEDHGCLADGLLALHQATGES 480

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL-RVKEDHDGAEPSGNSVSVINLV 698
            WL+ A  L +   E F   +  G F+ T +D   L+ R  +  D A PSG S     L+
Sbjct: 481 VWLLEATRLLDIARERFAVADTPGAFHDTADDAEALVHRPSDPTDNASPSGASTVAGALL 540

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP-----LMCCAADMLSVPSRKHVV 753
             +++V   K+  YR  AE ++    +R   +   VP      +  A  M + P +  V 
Sbjct: 541 TASALVGPEKASDYRAAAEQAV----SRAGALVAQVPRFAGHWLSVAEAMAAGPVQ--VA 594

Query: 754 LVGHKSSVDFENMLAAAH 771
           +VG  +    E +  AAH
Sbjct: 595 VVGPDAEARSELLSTAAH 612


>gi|226291405|gb|EEH46833.1| DUF255 domain-containing protein [Paracoccidioides brasiliensis
           Pb18]
          Length = 804

 Score =  395 bits (1015), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 260/653 (39%), Positives = 355/653 (54%), Gaps = 51/653 (7%)

Query: 85  AMAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLS 144
           +  ER  AST     +  NRL    SPY+L H +NPV W  W  EA A A+K +  IFL 
Sbjct: 10  SQTERGAASTG---PELVNRLYQSKSPYVLGHMNNPVAWQLWDSEAIALAKKLNRLIFLR 66

Query: 145 IGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGW 204
                   CHVME ESF    +A +LN  F+ IK+DREERPD+D+VYM YVQA  G GGW
Sbjct: 67  --------CHVMEKESFMSPEIAAILNKSFIPIKLDREERPDIDEVYMNYVQATTGSGGW 118

Query: 205 PLSVFLSPDLKPLMGGTYFP-PEDKY-------GRPGFKTILRKVKDAWDKKRDMLAQSG 256
           PL+VFL+PDL+P+ GG+Y+P P           G+  F  IL K++D W  ++    +S 
Sbjct: 119 PLNVFLTPDLEPVFGGSYWPGPHSNALPTLGGEGQITFVDILEKLRDVWHTQQLRCRESA 178

Query: 257 AFAIEQLSEALSASASSNKLPD-----ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPR 311
               +QL E  +   + +K  D     +L    L    +  +  YD+  GGF  APKFP 
Sbjct: 179 KDITKQLRE-FAEEGTHSKQSDVEAEEDLEIELLEEAYQHFASRYDAVNGGFSEAPKFPT 237

Query: 312 PVEIQMMLYHSK---KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDE 368
           PV +  +++ S+    + D     E S   ++ + TL  M++GGIHD +G GF RYSV  
Sbjct: 238 PVNLSFLVHLSRYPGAVADIVGYEECSRAIEIAVKTLIAMSRGGIHDQIGHGFARYSVTA 297

Query: 369 RWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAE 427
            W +PHFEKMLYDQ QL +VY+DAF    D        DI  Y+    M+ P G   S+E
Sbjct: 298 DWSLPHFEKMLYDQAQLLDVYVDAFDSAYDPELLGAMYDIATYITSPPMLSPTGGFHSSE 357

Query: 428 DADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHN 486
           DADS  +   T K+EGAFYVWT KE++ ILG+  A +   H+ +   GN  +SR++DPH+
Sbjct: 358 DADSRPSPNDTEKREGAFYVWTLKELKQILGQRDADVCARHWGVLADGN--VSRINDPHD 415

Query: 487 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSW 545
           EF  +NVL      S  A + G+  ++ + I+   R KL + R SKR RP LDDK+IV+W
Sbjct: 416 EFINQNVLSIQVTPSKLAKEFGLGEDEVVRIIKGSREKLREYRESKRVRPDLDDKIIVAW 475

Query: 546 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 605
           NGL I + A+ S +L++      + F             AE A  FI+ +L+DEQT +L 
Sbjct: 476 NGLAIGALAKCSVVLENLDRDKAYQF----------RRAAEEAVRFIKHNLFDEQTGQLW 525

Query: 606 HSFRNGP-SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQ---NTQDELFLDREG 661
             +R G     PGF DDYA+LISGL++LYE       L +A +LQ    T   LF     
Sbjct: 526 RIYRGGVRGDTPGFADDYAYLISGLINLYEATFDDSHLQFAEQLQRYYTTPSTLFYSPSS 585

Query: 662 GGY----FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSD 710
             +       T   P  LLR+K   D A PS N V   NL+RL++++ G   D
Sbjct: 586 SDFSTPTSPNTPTLPPPLLRLKPGTDAATPSPNGVIARNLLRLSALLDGGDVD 638


>gi|392399485|ref|YP_006436086.1| thioredoxin domain-containing protein [Flexibacter litoralis DSM
           6794]
 gi|390530563|gb|AFM06293.1| thioredoxin domain protein [Flexibacter litoralis DSM 6794]
          Length = 712

 Score =  395 bits (1015), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 228/645 (35%), Positives = 339/645 (52%), Gaps = 65/645 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N+L+   SPYLLQHA NPV W  W  E   +A++ + PI +SIGYS CHWCHVME ESFE
Sbjct: 2   NQLSKSRSPYLLQHAQNPVHWQMWNNETLQKAKQENKPILVSIGYSACHWCHVMEHESFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +E VAK +N+ F+ IKVDREERPDVD +YM  VQ +   GGWPL+VFL+ D KP  GGTY
Sbjct: 62  NEDVAKAMNENFICIKVDREERPDVDAIYMEAVQMMGVSGGWPLNVFLTSDAKPFWGGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP ++      +  I+ ++   +  KR+ + +S     + LS +     +   + D    
Sbjct: 122 FPAKE------WIDIVEQIGKTYKNKRNEVEESANKVTKVLSISTLERYNLKDVSD-FDD 174

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK--- 339
           + L    + L K +D+ FGG G APKFP P     +L +   L+   +    +   K   
Sbjct: 175 SILAKAFQSLEKKFDTEFGGIGEAPKFPMPSYYLFLLRYYDYLDKNNQDQNITNPTKNKI 234

Query: 340 --MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
              +  TL  M +GGI+D +GGGF RYSVD+ W  PHFEKMLYD  QL ++Y +A+++T+
Sbjct: 235 LSQIHLTLNKMDQGGIYDQIGGGFARYSVDKEWFAPHFEKMLYDNAQLLSLYAEAYTITE 294

Query: 398 DVFYSYICRDIL----DYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
           D    ++ ++I+    ++L R++    G  ++A DADS   EG    KEG FY WT  E+
Sbjct: 295 DKVQKHVYKEIIEQTTEFLTRELQDKNGGFYAALDADS---EG----KEGKFYTWTIDEI 347

Query: 454 EDILGEHAI-----------LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 502
           E +   H             LFK++Y +   GN        PH   +G N+L   N    
Sbjct: 348 EQVFTNHTFSTSINQEEDLQLFKKYYSITAIGN-----WQSPHAT-EGANILYRNNTDEE 401

Query: 503 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 562
            A +  + L      + E +  L ++R  +  P LDDK++ SWN L+I  F  +   L  
Sbjct: 402 FAQENNIELNNLKCKVKEWQNYLLEIRKTKVSPSLDDKILTSWNALLIKGFCNSYSSL-- 459

Query: 563 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTH-----RLQHSFRNGPSKAPG 617
                         + K+Y+ +A   A FI ++L+D+Q       +L H+F++G ++  G
Sbjct: 460 --------------NDKKYLNLALQTAEFIEKNLFDKQNTKNNKLKLHHTFKDGTAEIDG 505

Query: 618 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGG-GYFNTTGEDPSVLL 676
           FL+DYA LI   + LY+     KWL+ A EL       F D+E    YF    E   ++ 
Sbjct: 506 FLEDYALLIESYIALYQVCFDEKWLLRADELTKYVFTNFYDKEEKLFYFTNQNESEKLVA 565

Query: 677 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 721
           + KE  D    S NSV   NL  L  ++   +++ Y++ ++  L+
Sbjct: 566 QKKELFDNVISSSNSVMATNLYFLGILL---ENNLYKETSKEMLS 607


>gi|257388360|ref|YP_003178133.1| hypothetical protein Hmuk_2314 [Halomicrobium mukohataei DSM 12286]
 gi|257170667|gb|ACV48426.1| protein of unknown function DUF255 [Halomicrobium mukohataei DSM
           12286]
          Length = 715

 Score =  395 bits (1015), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 229/660 (34%), Positives = 335/660 (50%), Gaps = 55/660 (8%)

Query: 94  TSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWC 153
           +S S     NRL    SPYL QHA NPV+W  W E+A   AR+ D PIFLSIGYS CHWC
Sbjct: 2   SSDSGPTDRNRLDEAESPYLRQHADNPVNWQPWDEQALETAREHDAPIFLSIGYSACHWC 61

Query: 154 HVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPD 213
           HVME ESF D   A LLN+ FV IKVDREERPD+D +YM+  Q + G GGWPLS +L+PD
Sbjct: 62  HVMEDESFSDPETATLLNEHFVPIKVDREERPDLDAIYMSICQQVTGRGGWPLSAWLTPD 121

Query: 214 LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAW---DKKRDMLAQSGAFAIEQLSEALSAS 270
            +P   GTYFPPE++ G P F  +L  +  +W   +++ +M  ++      Q ++A+ + 
Sbjct: 122 GEPFYVGTYFPPEERRGMPAFGQLLEDIAGSWSDSEQREEMYNRA-----RQWTDAIESD 176

Query: 271 ASSNKLPDELPQN-ALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 329
                 P ++P + AL+   +   ++ D   GG+G+ PKFP+P  +  ++    +     
Sbjct: 177 VGDVGQPGDVPDDEALQAAVDAAIRAADREHGGWGNGPKFPQPGRLHYLMREVAR----- 231

Query: 330 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 389
              +  + + +V  TL  MA GG+ DHVGGGFHRY  D  W VPHFEKMLYD   L   Y
Sbjct: 232 --SDRDDVRSVVTETLDAMADGGLFDHVGGGFHRYCTDREWVVPHFEKMLYDNATLPRAY 289

Query: 390 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGA-----TRKKEGA 444
           L  + LT D  Y+ + R+   ++ R++    G  FS  DA S    G         +EGA
Sbjct: 290 LAGYQLTGDERYAEVARETFAFVERELTHEDGGFFSTLDAQSVPPAGRREDADAEPEEGA 349

Query: 445 FYVWTSKEVEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 502
           ++VW   EV   +     A L  + + +  +GN            F+GK VL       A
Sbjct: 350 YFVWIPDEVRAAVDSETAADLLCDRFGITESGN------------FEGKTVLTVDASIEA 397

Query: 503 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 562
            +   G+        L   R ++F+ R +RPRP  D+KV+  WNGL+I++ A  + +L  
Sbjct: 398 LSESSGLEASDVERTLASAREQVFEAREERPRPARDEKVLAGWNGLMITAIAEGAIVLDD 457

Query: 563 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 622
                                    A +F+R HL+DE   RL   +++G     G+L+DY
Sbjct: 458 VDPDPA-----------------ADALAFVREHLWDESEQRLARRYKDGDVAIDGYLEDY 500

Query: 623 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 682
           AFL  G L L+E     + L +A++L +  +  F D + G  + T     S++ R +E  
Sbjct: 501 AFLARGALTLFEATGEVEHLAFALDLAHAIEREFWDADDGTLYFTPTSGESLVARPQELT 560

Query: 683 DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 742
           D + PS   V+V  L+ L++ V     D +   A   L     +++   M    +  AAD
Sbjct: 561 DQSTPSSTGVAVQALLSLSAFV---PHDRFETIAAGVLETHANKIEANPMQHASLVVAAD 617


>gi|409122619|ref|ZP_11222014.1| thioredoxin domain-containing protein [Gillisia sp. CBA3202]
          Length = 620

 Score =  395 bits (1014), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 223/610 (36%), Positives = 339/610 (55%), Gaps = 63/610 (10%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           KHTN L  E SPYLLQHAHNPV+W+ WG +   +A   +  I +S+GY+ CHWCHVME E
Sbjct: 5   KHTNSLINESSPYLLQHAHNPVNWYPWGSDILEKAVADNKLIIISVGYAACHWCHVMEHE 64

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFEDE VA+++N  + +IKVDREERPDVD VYM+ VQ + G GGWP+++   PD +P+ G
Sbjct: 65  SFEDEDVAEIMNTHYYNIKVDREERPDVDMVYMSAVQIMTGSGGWPMNIVALPDGRPVWG 124

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS--EALSASASSNKLP 277
           GTYF  ED      +K  L ++   + +  + L +      E L   + +++S S N + 
Sbjct: 125 GTYFRKED------WKNSLLQIAKLYKENPEKLYEYADKLNEGLKNIQLIASSKSENDID 178

Query: 278 DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMM-----LYHSKKLEDTGKSG 332
                  L L +E+L K++D ++GG    PKF  P   + +     LY+ K ++D     
Sbjct: 179 -------LNLISEKLEKNFDWQYGGTKQTPKFVIPSNFEFLLKYSQLYNHKNIKD----- 226

Query: 333 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 392
                   V  +L  ++ GGI+DH+ GGF RYSVDE+WH+PHFEKMLYD  Q+ ++Y  A
Sbjct: 227 -------FVKLSLTKISFGGIYDHIEGGFSRYSVDEKWHIPHFEKMLYDNAQMVSLYSKA 279

Query: 393 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 452
           +++TK  +Y  +    L+++  ++    G  +S+ DADS +  G  R  EGAFY W   E
Sbjct: 280 YAVTKIGWYREVVEQTLEFIENNLKTKEGSFYSSLDADSIDKNGKLR--EGAFYTWEVDE 337

Query: 453 VEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 512
           ++++L +   LFKE+Y +   G  +        NE+    VLI   D ++  +K  +   
Sbjct: 338 LKELLKDEFSLFKEYYNVNSYGKWE-------DNEY----VLIRTEDEASFLNKNQLDSM 386

Query: 513 KYLNILGECRRKL-FDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNF 571
           ++  I       L  + R+KR +P LDDK + SWN L++S +  A KI            
Sbjct: 387 EFKAIKAHWLEVLSSEERNKREKPRLDDKQLTSWNALMLSGYVDAYKI------------ 434

Query: 572 PVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLD 631
               +  K+Y+  A   A+FI+ HLY  + + L  SF+NG S   G+L+DYAF I   + 
Sbjct: 435 ----TQNKDYLATALQNATFIQEHLYKSEGN-LHRSFKNGISSINGYLEDYAFTIEAFIK 489

Query: 632 LYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNS 691
           LYE     +WL ++ +L +   ++F + E G ++ T+ +D  ++ R  E  D   P+ NS
Sbjct: 490 LYEITLDFEWLHFSKKLMDYSIQIFYEPETGLFYFTSKQDKPLITRNYELSDNVIPASNS 549

Query: 692 VSVINLVRLA 701
           V   NL +L+
Sbjct: 550 VMAQNLFKLS 559


>gi|297202044|ref|ZP_06919441.1| transmembrane protein [Streptomyces sviceus ATCC 29083]
 gi|297148022|gb|EDY58354.2| transmembrane protein [Streptomyces sviceus ATCC 29083]
          Length = 570

 Score =  394 bits (1013), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 242/621 (38%), Positives = 332/621 (53%), Gaps = 59/621 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYLLQHA NPVDW+ W  EAF EARK + P+ LS+GYS+CHWCHVM  ESFE
Sbjct: 6   NRLAHETSPYLLQHADNPVDWWPWSAEAFEEARKTNKPVLLSVGYSSCHWCHVMAQESFE 65

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+  A LLN+ FVS+KVDREERPDVD VYM  VQA  G GGWP++VFL+PD +P   GTY
Sbjct: 66  DQATADLLNEHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMTVFLTPDAEPFYFGTY 125

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPP  + G P F+ +L  V+ AW  +RD +A+     +  L+     S   ++ P E   
Sbjct: 126 FPPSPRQGMPSFRQVLEGVRAAWTDRRDEVAEVAGKIVRDLA-GREISYGDSQAPGEEQL 184

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
            A  L    L++ YD++ GGFG APKFP  + ++ +L H  +   TG  G      +M  
Sbjct: 185 AAALLG---LTREYDAQRGGFGGAPKFPPSMVVEFLLRHHAR---TGAEG----ALQMAQ 234

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            T + MA+GGIHD +GGGF RYSVD  W VPHFEKMLYD   L  VY   +  T      
Sbjct: 235 DTCERMARGGIHDQLGGGFARYSVDRDWIVPHFEKMLYDNALLCRVYAHLWRATGSDLAR 294

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-A 461
            +  D  D++ R++    G   SA DADS   +G  R  EGA+YVWT +++ ++LGE  A
Sbjct: 295 RVALDTADFMVRELRTAEGGFASALDADS--DDGTGRHVEGAYYVWTPEQLREVLGEQDA 352

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASASKLGMPLEKYLNILGE 520
            L  +++ +   G  +            G++VL +   D+   A K           +  
Sbjct: 353 ELAAQYFGVTEEGTFE-----------HGQSVLQLPQQDTVFDAEK-----------VES 390

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            RR+L D R++RP P  DDKV+ +WNGL I++ A                      DR +
Sbjct: 391 IRRRLLDARAQRPAPGRDDKVVAAWNGLAIAALAETGAYF----------------DRPD 434

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAFLISGLLDLYEFGSGT 639
            ++ A  AA  + R   DEQ  RL  + ++G   A  G L+DYA +  G L L       
Sbjct: 435 LVDAALGAADLLVRLHLDEQA-RLSRTSKDGQVGANAGVLEDYADVAEGFLALASVTGEG 493

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
            WL +A  L +     F   E G  F+T  +   ++   +   D A PSG + +    + 
Sbjct: 494 VWLDFAGFLLDHVLTRFTGPE-GALFDTAADAERLIPPPQNPTDNAVPSGWTAAAPAPL- 551

Query: 700 LASIVAGSKSDYYRQNAEHSL 720
             S  A + S+ +R+ AE +L
Sbjct: 552 --SYAAQTGSENHREGAEKAL 570


>gi|431797737|ref|YP_007224641.1| thioredoxin domain-containing protein [Echinicola vietnamensis DSM
           17526]
 gi|430788502|gb|AGA78631.1| thioredoxin domain protein [Echinicola vietnamensis DSM 17526]
          Length = 678

 Score =  394 bits (1013), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 230/601 (38%), Positives = 322/601 (53%), Gaps = 51/601 (8%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            N L    SPYLLQHA+NPV W+ WG EA  +A+  + PI +SIGYS CHWCHVME ESF
Sbjct: 5   ANHLIDSQSPYLLQHAYNPVQWYPWGPEALDKAKLENKPIIVSIGYSACHWCHVMEHESF 64

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           EDE  AK++N  FV IK+DREERPD+D +YM  VQ++   GGWPL+VFL P+ KP  GGT
Sbjct: 65  EDEATAKIMNAHFVCIKIDREERPDLDNIYMDAVQSMGLQGGWPLNVFLMPNQKPFYGGT 124

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP       P +K +L+ + +A+    D LA+S       +             P  L 
Sbjct: 125 YFP------NPNWKGLLQNIAEAYATHHDELAKSAEGFGNSIKLKEREKYRLADDPSRLT 178

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
              L   A++++   D ++GGF  +PKFP P     +L ++         G+AS  +K V
Sbjct: 179 AEDLTHMAQKIASQMDPQWGGFNRSPKFPMPAVWDFLLRYA------ALKGDASLIEK-V 231

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
           LFTL  +  GGI+DH+ GGF RYSVD  W  PHFEKMLYD GQL ++Y  AF L+ D  +
Sbjct: 232 LFTLTKIGMGGIYDHLRGGFARYSVDSEWFAPHFEKMLYDNGQLLSLYAKAFQLSGDALF 291

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHA 461
                + +++L+ +M+   G  ++A DADS   EG    +EG FY WT  E+E +L +  
Sbjct: 292 KEKINETVNWLQAEMLQEEGGFYAALDADS---EG----EEGKFYTWTHDELESMLDDED 344

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
             F E + +   GN +           KG N+L + +     A K G+  E+    L E 
Sbjct: 345 AWFYECFNISEKGNWE-----------KGVNILFQTHTYEEIAHKHGLEEEQLAQNLNEV 393

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           + +L  +R+ R  P LDDKVI  WNGL IS  A+A     +         P+  S     
Sbjct: 394 KERLLKIRNLRTPPGLDDKVIAGWNGLTISGLAQAYWATAN---------PLAKS----- 439

Query: 582 MEVAESAASFIRRH-LYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
             +A    +FI  H L  EQ +R   S++NG +  P FL+DYA +I G + LY+  S  +
Sbjct: 440 --LAIQNGTFILDHMLKGEQLYR---SYKNGEAYTPAFLEDYAAIIQGFIHLYQLTSEPR 494

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
           WL+ A  L     E F D + G ++    +  +++   KE  D   PS N++   NL +L
Sbjct: 495 WLLVAKRLTAFVLEHFFDEDDGLFYFNNPDSETLIANKKEIFDNVIPSSNALMATNLHQL 554

Query: 701 A 701
            
Sbjct: 555 G 555


>gi|114319387|ref|YP_741070.1| hypothetical protein Mlg_0225 [Alkalilimnicola ehrlichii MLHE-1]
 gi|114225781|gb|ABI55580.1| protein of unknown function DUF255 [Alkalilimnicola ehrlichii
           MLHE-1]
          Length = 697

 Score =  394 bits (1013), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 237/605 (39%), Positives = 333/605 (55%), Gaps = 40/605 (6%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL    SPYLLQHA NPV W  W + A A AR++  PI LSIGYS CHWCHVM  ESFE
Sbjct: 6   NRLGDATSPYLLQHADNPVHWQPWDDRALALAREQGKPILLSIGYSACHWCHVMAHESFE 65

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALY-GGGGWPLSVFLSPDLK-PLMGG 220
           D  +A+L+N+ F++IKVDREERPD+D++Y T  Q L    GGWPL++ L+PD + P+  G
Sbjct: 66  DPAIARLMNERFINIKVDREERPDLDRIYQTAHQLLTRRPGGWPLTLVLTPDDQTPVFAG 125

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFPP+ + G PGF  +LR+V +A   +   +A         L     A A        L
Sbjct: 126 TYFPPDTRGGMPGFADVLRQVDEAIRSQPQAVADQNRALRHALGRLAHAPADGGDA--AL 183

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
               LR   + L+ S+D   GGFG+APKFP P  I+ +L H      TG  G   +   M
Sbjct: 184 GNAPLRAARDALADSFDRVHGGFGAAPKFPHPGGIERLLRHYALTLVTG-DGPDRDALHM 242

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
              TL+ MA GGI+D VGGGF RYSVDE W +PHFEKML D   L  +Y DA+  T D  
Sbjct: 243 ACHTLRRMALGGIYDQVGGGFARYSVDEYWMIPHFEKMLCDNALLLGLYADAWHATGDGL 302

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
           Y+ + ++  +++R +M  P G   ++ DADS   EG     EG +Y+WT  EV ++L E 
Sbjct: 303 YARVVQETAEWVRAEMERPEGGYCTSLDADS---EGG----EGRYYLWTPDEVRELLDED 355

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
                EH +           + +P N F+G+  L      S SA +LG P E+ + +   
Sbjct: 356 EWRLVEHRF----------GLDEPAN-FEGRWHLHVQASFSESARRLGRPREQVVALWQS 404

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R+KL   R +R RP  DDKV+ +WNGL+I++ ARA ++L                D   
Sbjct: 405 ARQKLQRARGQRVRPGRDDKVLTAWNGLMIAALARAGRLL----------------DEPA 448

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           +   A  A  F+R  L D+Q  RL  S+R G +     L+DYA+L+ G+L+  +      
Sbjct: 449 WTASALRALGFLRERLADDQG-RLYASWRAGRAAHQACLEDYAYLLEGVLECLQSEWSDD 507

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
            L +A+ L +T  E F D++ GG++ T  +   ++ R +   D + PSGN+V++  L RL
Sbjct: 508 RLGFALHLADTLLERFQDKDEGGFWMTADDHEPLIHRPRPLADDSLPSGNAVALRALQRL 567

Query: 701 ASIVA 705
             ++ 
Sbjct: 568 GHLLG 572


>gi|448677622|ref|ZP_21688812.1| thioredoxin [Haloarcula argentinensis DSM 12282]
 gi|445773297|gb|EMA24330.1| thioredoxin [Haloarcula argentinensis DSM 12282]
          Length = 717

 Score =  394 bits (1012), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 236/669 (35%), Positives = 350/669 (52%), Gaps = 61/669 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL    SPYL QHA NPV+W  W E A   A++RDVPIFLSIGY+ CHWCHVME ESFE
Sbjct: 11  NRLDEAESPYLRQHADNPVNWQPWDETALEAAKERDVPIFLSIGYAACHWCHVMEEESFE 70

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +E +A+ LN+ FV IKVDREERPD+D VYM+  Q + GGGGWPLS +L+P+ +P   GTY
Sbjct: 71  NEAIAEQLNENFVPIKVDREERPDLDSVYMSICQQVTGGGGWPLSAWLTPEGEPFYVGTY 130

Query: 223 FPPEDKYGRPGFKTILRKVKDAWD--KKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           FPPE+K G+PGF  +L+++  +W   ++R+ +        E +   L A+ +    P++ 
Sbjct: 131 FPPEEKRGQPGFGDLLQRLSGSWSDPEQREEMENRARQWTEAIESDLEATPAD---PEDP 187

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
            ++ ++       +  D + GG+GS  PKFP+   +  +L              A  GQ+
Sbjct: 188 AEDIIQTAGTIAHRGADRQDGGWGSGGPKFPQNGRLHALL-----------RAHAGGGQE 236

Query: 340 ----MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
               +V  TL  MA  G++DHVGGGFHRY+ D++W VPHFEKMLYD  ++   +L  +  
Sbjct: 237 DYLNVVEETLDVMADRGLYDHVGGGFHRYATDQQWAVPHFEKMLYDNAEIPRAFLAGYQA 296

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSA---ETEGATRKKEGAFYVWTSKE 452
                Y+ + R+  ++++R+M  P G  FS  DA+SA   E EG T  +EG FYVWT ++
Sbjct: 297 IGSERYASVVRETFEFVQREMQHPEGGFFSTLDAESAPIDEPEGET--EEGLFYVWTPEQ 354

Query: 453 VEDILGEH--AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 510
           V + + +   A +F +++ +   GN            F+G  VL      S  A +    
Sbjct: 355 VHEAVDDETDAEIFCDYFGVTERGN------------FEGATVLAVRKPVSVLAEEYDQS 402

Query: 511 LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
            ++    L     + F+ R  RPRP  D+KV+  WNGL+I + A  + +L          
Sbjct: 403 EDEITGSLQRALNEAFEARENRPRPARDEKVLAGWNGLMIRTLAEGAIVLDDAYADVA-- 460

Query: 571 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLL 630
                            A SF+R +L+D+   RL   +++G     G+L+DYAFL  G L
Sbjct: 461 ---------------ADALSFVREYLWDDDAGRLNRRYKDGDVAIDGYLEDYAFLGRGAL 505

Query: 631 DLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGN 690
            L+E     + L +A++L     E F D E G  F T     S++ R +E  D + PS  
Sbjct: 506 TLFEATGDVEHLAFAMDLGQAITEAFWDDEQGTLFFTPTGGESLVARPQELTDQSTPSST 565

Query: 691 SVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRK 750
            V+V  L+ L+     S  D +   AE  +     R+    +    +  A D     + +
Sbjct: 566 GVAVDLLLSLSHF---SDDDRFESVAERVIRTHADRVSSNPLQHASLTLATDTYEQGALE 622

Query: 751 HVVLVGHKS 759
            + LVG +S
Sbjct: 623 -LTLVGDQS 630


>gi|363583054|ref|ZP_09315864.1| hypothetical protein FbacHQ_16672 [Flavobacteriaceae bacterium
           HQM9]
          Length = 705

 Score =  394 bits (1012), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 219/611 (35%), Positives = 338/611 (55%), Gaps = 51/611 (8%)

Query: 94  TSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWC 153
           T+  +++ TN L  E SPYLLQHAHNPV+W AW  E   EA+++   + +S+GY+ CHWC
Sbjct: 24  TTMEKHEFTNDLIHETSPYLLQHAHNPVNWKAWHPETLNEAKEKKKLLLISVGYAACHWC 83

Query: 154 HVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPD 213
           HVME ESFED  VA ++N  FV+IK+DREERPD+D+VYM+ VQ + G GGWPL+V   PD
Sbjct: 84  HVMEHESFEDSTVAAVMNTNFVNIKIDREERPDIDQVYMSAVQLMTGRGGWPLNVIALPD 143

Query: 214 LKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASS 273
            +P+ GGTYFP ++  G       L++++  ++     L +       +L+E + + +  
Sbjct: 144 GRPVWGGTYFPKDEWMGA------LKQIQKIYEDNPAKLEEYAT----KLTEGIQSVSLV 193

Query: 274 NKLPDEL--PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKS 331
              P+ L   ++ +       +K +D + GG   APKF  P     +L ++ +       
Sbjct: 194 KPNPNTLIFEKDTIENAVANWAKKFDYKKGGLDYAPKFMMPNNYHFLLRYAHQ------- 246

Query: 332 GEASEGQK-MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYL 390
             A+E  K  V+ TL  ++ GG++DHVGGGF RYS DE+WHVPHFEKMLYD  QL ++Y 
Sbjct: 247 -SANEKLKEYVITTLNQISYGGVYDHVGGGFARYSTDEKWHVPHFEKMLYDNAQLVSLYS 305

Query: 391 DAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTS 450
           DA+ +TK+ +Y  +  + LD++ R++    G  +S+ DADS    G  + +EGAFYVW  
Sbjct: 306 DAYLITKNDWYKQVVYETLDFVARELTNDEGAFYSSLDADSLTPSG--KLEEGAFYVWQK 363

Query: 451 KEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 510
             +E  LGE   LFK++Y +   G  +       HN +    VLI     +    K  M 
Sbjct: 364 PALETALGEDFPLFKDYYNINTYGLWE-------HNNY----VLIRKESDANFVEKHEME 412

Query: 511 LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
           ++ +L    + ++ L  +RSKR RP LDDK + SWN L++  +A A ++           
Sbjct: 413 MDAFLQKQKKWKQLLLGIRSKRERPRLDDKTLTSWNALMLKGYADAYRVF---------- 462

Query: 571 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLL 630
                 D  ++++ A + A FI+     + + +L H+++NG S   G+L+DYA  I   +
Sbjct: 463 ------DNAKFLKAALANAEFIKTKQL-KGSGQLMHNYKNGKSTINGYLEDYAATIEAFI 515

Query: 631 DLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGN 690
            LY+     +WL  + ++ +     F D     YF T+ ED +++ R  E  D   P+ N
Sbjct: 516 ALYQVTFDQQWLDLSKKMIDYVHTHFYDSASEMYFFTSDEDAALVTRNIESSDNVIPASN 575

Query: 691 SVSVINLVRLA 701
           S+   NL  L+
Sbjct: 576 SIMAKNLYHLS 586


>gi|452943278|ref|YP_007499443.1| thymidylate kinase [Hydrogenobaculum sp. HO]
 gi|452881696|gb|AGG14400.1| thymidylate kinase [Hydrogenobaculum sp. HO]
          Length = 634

 Score =  394 bits (1011), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 243/639 (38%), Positives = 337/639 (52%), Gaps = 82/639 (12%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K  NRL  E SPYL  HA+NPVDW+ W EEAF +A K + P+FLSIGYS+CHWCHVME E
Sbjct: 2   KTPNRLINEKSPYLRMHAYNPVDWYPWSEEAFDKAIKENKPVFLSIGYSSCHWCHVMEKE 61

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFEDE VA  LN +FVSIKVD+EERPD+D +YM Y   L   GGWPLS FL+P  +P   
Sbjct: 62  SFEDEEVASFLNKYFVSIKVDKEERPDIDSLYMEYCVLLNNSGGWPLSAFLTPTKEPFFA 121

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           GTYFP      +  F  +L+++KD WDK    + +     +EQL + +++         E
Sbjct: 122 GTYFP------KASFLKLLQQIKDLWDKDSKNIIEKSKRLVEQLKQFMNSFEKR-----E 170

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
           L ++ +      L+  YD  FGGF  APKFP    + ++L   K+             Q 
Sbjct: 171 LNESFIDKALFGLANRYDEEFGGFSEAPKFPSLHNVLLLLKSQKQ-----------PFQD 219

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           M L TL  M +GGI DHVGGGFHRYS D  W +PHFEKMLYDQ      Y +A+ LTK+ 
Sbjct: 220 MALSTLLNMRRGGIWDHVGGGFHRYSTDRYWLLPHFEKMLYDQAMAILAYSEAYRLTKNE 279

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 459
            +       +++++ ++    G  +++ DAD   TEG    +EG FY+WT +E++DIL E
Sbjct: 280 IFKDTVYKTINFVKENLY-ENGFFYTSMDAD---TEG----EEGGFYLWTYQEIKDILKE 331

Query: 460 HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
            A  F E + +K  GN     + +    + GKNVL         A +  +  E+ L IL 
Sbjct: 332 KADKFIEFFNIKKEGNF----LDEAKRVYTGKNVLY--------AKEPSLAFEEELKIL- 378

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
               K F  R KR +P +DDK+++  N ++  +   A  +                 D K
Sbjct: 379 ----KAF--REKRKKPLIDDKILLDQNAMMDFALIEAYLVF----------------DDK 416

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
           +++++A        ++L +   H LQH+  +     P  LDDYA+LI   L LY+     
Sbjct: 417 DFLDMA-------TKNLNNISKHPLQHALNHNKLIEP-MLDDYAYLIKAYLSLYKATFSK 468

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
             L  AI L     E   D+  GG++ + G+D  VL+  K  +DGA PSGNSV  +NLV 
Sbjct: 469 DALEKAISLTEETIEKLWDKNAGGFYLSVGKD--VLIPQKTLYDGAIPSGNSVMGLNLVE 526

Query: 700 LASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 738
           L  I   +K D Y    E+   +  +   DM    P  C
Sbjct: 527 LFFI---TKEDTY----ENRYQILSSIYSDMLSRNPTAC 558


>gi|313203107|ref|YP_004041764.1| hypothetical protein Palpr_0623 [Paludibacter propionicigenes WB4]
 gi|312442423|gb|ADQ78779.1| hypothetical protein Palpr_0623 [Paludibacter propionicigenes WB4]
          Length = 680

 Score =  394 bits (1011), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 232/620 (37%), Positives = 329/620 (53%), Gaps = 75/620 (12%)

Query: 95  SHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCH 154
           S S +K+TN L  E SPYLLQHAHNPVDW+ W +EA  +A+K +  + +SIGY+ CHWCH
Sbjct: 2   STSEHKYTNHLIHESSPYLLQHAHNPVDWYPWSQEALNKAKKENKNLLISIGYAACHWCH 61

Query: 155 VMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDL 214
           VME E FEDE VA+ +N+ FV+IKVDREERPD+D++YMT VQ L   GGWPL+    PD 
Sbjct: 62  VMERECFEDEEVARYMNEHFVAIKVDREERPDIDQIYMTAVQLLTERGGWPLNCVALPDG 121

Query: 215 KPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAF------AIEQLSEALS 268
           +P+ GGTYFP                 K  W    DML Q   F        E  + AL+
Sbjct: 122 RPIYGGTYFP-----------------KAQW---LDMLNQVSGFIQLHPDKTENQARALT 161

Query: 269 ASASSNK------LPD-ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYH 321
               +N+      LP  E   N        +    D+  GG+G+APKFP P  +Q +L H
Sbjct: 162 EGVQNNEMIYRADLPGLEATVNDQEDIFYHIQAGIDTVNGGYGTAPKFPMPSSLQFLL-H 220

Query: 322 SKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYD 381
              L     SG  ++  K +  TL  MA GGI+D +GGGF RY+ DE W +PHFEKMLYD
Sbjct: 221 FHHL-----SGN-NDALKALTTTLDRMAFGGIYDQIGGGFARYATDEAWKIPHFEKMLYD 274

Query: 382 QGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK 441
              L +VY  AF   ++  Y  +  + L+++  ++  P G  +S+ DADS   EG     
Sbjct: 275 NALLVSVYASAFQYNRNPHYEKVLHETLEFVSSELTSPDGGFYSSLDADS---EGV---- 327

Query: 442 EGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS 501
           EG FYVWT  E++ ILG++A L  +++ +   GN + S           +N+L    +  
Sbjct: 328 EGKFYVWTFDELQTILGKNAGLIMDYFQVTAAGNWEES-----------QNILYRKGNDE 376

Query: 502 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 561
             A K  +   +    + + R  L  VR+KR +P LDDK++ SWN L++  +  A ++  
Sbjct: 377 EIARKHNLSTVELSESIAQARELLQTVRAKRQKPMLDDKILTSWNALMLKGYCDAYRV-- 434

Query: 562 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDD 621
                         + + EY++ A   A+FI R++     + L  +++NG +  P FLDD
Sbjct: 435 --------------TAKAEYLQAALRNANFILRYM-KSADNGLFRNYKNGKASIPAFLDD 479

Query: 622 YAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 681
           YAF+I   + LY+     +WLV A EL       F D E G ++ T+  +P+++ R  E 
Sbjct: 480 YAFIIQAFISLYQNTFDEQWLVEASELTEYTVSHFYDPESGMFYYTSDTEPALIARKMEI 539

Query: 682 HDGAEPSGNSVSVINLVRLA 701
            D   PS NS    NL  L 
Sbjct: 540 SDNVIPSSNSEMGKNLFVLG 559


>gi|134097521|ref|YP_001103182.1| hypothetical protein SACE_0923 [Saccharopolyspora erythraea NRRL
           2338]
 gi|133910144|emb|CAM00257.1| protein of unknown function DUF255 [Saccharopolyspora erythraea
           NRRL 2338]
          Length = 681

 Score =  393 bits (1010), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 241/631 (38%), Positives = 327/631 (51%), Gaps = 77/631 (12%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           +RLA   SPYLLQHA NPVDW+ W  EAF EAR+RDVP+ LSIGY+ CHWCHVM  ESFE
Sbjct: 3   HRLADATSPYLLQHADNPVDWWQWSPEAFEEARRRDVPVLLSIGYAACHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE  A ++N+ FV+IKVDREERPDVD VYM   QA+ G GGWP++ FL+PD +P   GTY
Sbjct: 63  DEATAAVMNENFVNIKVDREERPDVDAVYMEATQAMTGQGGWPMTCFLTPDAEPFHCGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE-LP 281
           +P    +G P F+ +L  V  AW ++   + Q+    +EQL      SA    LP+  L 
Sbjct: 123 YPSAPLHGMPSFRQLLDAVASAWRERGGEVRQAATRVVEQL------SAQRTALPESFLD 176

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
              +     +L    D    GFG APKFP  + ++ +L H ++    G    A E   M 
Sbjct: 177 DEVIATAVSRLHAESDPDHAGFGGAPKFPPSMVLEFLLRHQERQSAPGSGHTALE---MA 233

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             T + MA+GGI+D + GGF RYSVD  W VPHFEKMLYD   L  VY       +    
Sbjct: 234 EATCEAMARGGIYDQLAGGFARYSVDSAWVVPHFEKMLYDNALLLRVYAHLARRRESPLA 293

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE-- 459
             + R+   +L RD+  P G   ++ DAD   TEG     EG  YVWT +++ ++LGE  
Sbjct: 294 ERVARETAAFLLRDLRTPEGGFAASLDAD---TEGV----EGLTYVWTPEQLAEVLGEAD 346

Query: 460 ---HAILF---KEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
               A LF   +   + + T    L R  DP +  + + V                    
Sbjct: 347 GAWAAELFEVTESGTFEQGTSTLQLKR--DPDDPARWRRV-------------------- 384

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
                   R  L++ RS+RP+P  DDKV+ SWNG+ I++   AS  L             
Sbjct: 385 --------RDALYEARSRRPQPGKDDKVVTSWNGMAITALVEASTALGE----------- 425

Query: 574 VGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLISGLLD 631
                 E++  AE AA   + RHL D+   RL+ S R+G    A G L+DY  L  GLL 
Sbjct: 426 -----PEWLAAAEQAAKLLVERHLVDQ---RLRRSSRDGVVGAAAGVLEDYGCLADGLLS 477

Query: 632 LYEFGSGTKWLVWAIELQNTQDELFLDREG-GGYFNTTGEDPSVLLRVKEDHDGAEPSGN 690
           L++     +WL  A  L +T  E F D +  G YF+T  +   ++ R  +  D A PSG 
Sbjct: 478 LHQATGEPRWLDVACSLLDTALEQFADSDNPGAYFDTAADSEELVRRPSDPTDNASPSGA 537

Query: 691 SVSVINLVRLASIVAGSKSDYYRQNAEHSLA 721
           S     L+  +++  GS +  YR  AE +L+
Sbjct: 538 SSLTSALLTASALAGGSAAQRYRHAAEQALS 568


>gi|387790403|ref|YP_006255468.1| protein containing a thioredoxin domain [Solitalea canadensis DSM
           3403]
 gi|379653236|gb|AFD06292.1| protein containing a thioredoxin domain [Solitalea canadensis DSM
           3403]
          Length = 674

 Score =  393 bits (1010), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 222/621 (35%), Positives = 329/621 (52%), Gaps = 73/621 (11%)

Query: 101 HTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVES 160
           HTN L  E SPYLLQHAHNPV+W+ WG EA  +A+  +  I +S+GYS CHWCHVME ES
Sbjct: 4   HTNSLIHETSPYLLQHAHNPVNWYPWGAEALQKAKDENKLILVSVGYSACHWCHVMEHES 63

Query: 161 FEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGG 220
           FEDE VA ++N+ FV IKVDREERPD+D+VYM  VQ + GGGGWPL+ F  PD +P  GG
Sbjct: 64  FEDEQVASIMNEHFVCIKVDREERPDIDQVYMNAVQLMTGGGGWPLNCFCLPDQRPFYGG 123

Query: 221 TYFPPEDKYG-----RPGFKTILRKVKDAWDKKRDMLAQSG--AFAIEQLSEALSASASS 273
           TYF  +D        +  F    ++ ++  D+    + QS    F  EQ           
Sbjct: 124 TYFRKQDWMRLLNDLQAFFVNKPKEAEEYADRLHKGIKQSDVVGFVAEQ----------- 172

Query: 274 NKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGE 333
                E   N L+   +  ++ +D   GG+  APKFP P   Q +L +++  +D   +  
Sbjct: 173 ----KEYSVNTLKEIVDPWTRYFDYSDGGYNRAPKFPLPNNFQFLLRYARLAKDQASN-- 226

Query: 334 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
                 +   TL  MA GGI+D +GGGF RYSVD  W VPHFEKMLYD GQL ++Y +A+
Sbjct: 227 -----VITRLTLDKMAYGGIYDQLGGGFARYSVDSVWLVPHFEKMLYDNGQLVSLYAEAY 281

Query: 394 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
             +  + Y  +  + L+++RR++  P G  +SA DADS   EG     EG FY WT  E+
Sbjct: 282 QYSGSLLYKNVVAETLEFIRRELTSPEGGFYSALDADS---EGV----EGKFYCWTRDEL 334

Query: 454 EDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
           + IL +   +F  +Y +   GN            ++  N+L    D    A+  G+  ++
Sbjct: 335 KGILSDDEEIFSTYYNVTEEGN------------WEETNILHRKEDDKVIANAHGLSEDE 382

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
              I+  C+ KL  VR  R RP LDDK++ SWNG+++  +  A ++ + +          
Sbjct: 383 LTVIIDRCKAKLMKVREHRVRPGLDDKILTSWNGIMLKGYIDAYRVFRVD---------- 432

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
                 EY++ A + ASF+  +L  +     + +++NG +    FLDDY  +    ++LY
Sbjct: 433 ------EYLQTALTNASFLLENL-KQADGSWKRNYKNGNATINAFLDDYVLVAEAFIELY 485

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           +     +WL  A  + +   E F D++ G ++ T+  D  ++ R  E  D   PS NSV 
Sbjct: 486 QATFDEQWLAEAKAIVDYCIEHFYDQQSGMFYYTSNTDEQLITRKFELMDSVIPSSNSVL 545

Query: 694 VINLVRLASIVAGSKSDYYRQ 714
              L+++ +        YY+Q
Sbjct: 546 ARVLLKIGT--------YYQQ 558


>gi|402848267|ref|ZP_10896531.1| Thymidylate kinase [Rhodovulum sp. PH10]
 gi|402501421|gb|EJW13069.1| Thymidylate kinase [Rhodovulum sp. PH10]
          Length = 710

 Score =  393 bits (1009), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 242/640 (37%), Positives = 348/640 (54%), Gaps = 57/640 (8%)

Query: 101 HTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVES 160
           H NRLA E SPYLLQH HNPVDW+ WG EA AEA +   PI LS+GY+ CHWCHVM  ES
Sbjct: 9   HDNRLAHETSPYLLQHRHNPVDWWPWGPEALAEAERTGKPILLSVGYAACHWCHVMAHES 68

Query: 161 FEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGG 220
           FED   A ++N+ FV IKVDREERPD+D++YM  +  L   GGWPL++FL+P  +P+ GG
Sbjct: 69  FEDPATAAVMNELFVPIKVDREERPDIDQIYMAALHHLGDQGGWPLTMFLTPSGEPVWGG 128

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFP   ++G+P F  +LR+V   + ++ + + Q+    + +L+    A+        EL
Sbjct: 129 TYFPRVSRFGKPAFVDVLREVSRLFREEPEKIEQNRRALMGRLAHRAQAAGRPVIGLAEL 188

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED--TGKSGEASEGQ 338
            +      A Q++ + D   GG   APKFP+P  ++  ++ + + ED  TG +   +   
Sbjct: 189 DR-----MAAQIAGAIDLVNGGLRGAPKFPQPTMLE-TIWRAGEREDARTGFAHPTNLFY 242

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
            +V  TL+ M +GGI DH+GGGF RYSVD+RW VPHFEKMLYD  QL  +   A + T  
Sbjct: 243 DLVALTLERMCEGGIFDHLGGGFARYSVDDRWLVPHFEKMLYDNAQLLELLALAHARTGH 302

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             +     + + +L R+M  P G   ++ DADS   EG    +EG FYVWT +E+  +LG
Sbjct: 303 ELFRQRAEETVGWLLREMTTPEGAFCASLDADS---EG----EEGKFYVWTLEEIVGVLG 355

Query: 459 -EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELND-SSASASKLGMP--LEKY 514
            E A  F  HY ++P GN            F+GK +L  L     A+ ++ G+P  L KY
Sbjct: 356 PEDAARFAAHYDVEPAGN------------FEGKTILDRLPGLDQAAQARTGLPFALHKY 403

Query: 515 LNI-----LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 569
            +      L   R++LFD RS R RP  DDK++  WNGL I++ A A  +L         
Sbjct: 404 ADARIEADLAAMRQRLFDARSTRVRPGTDDKILADWNGLTIAALANAGTLL--------- 454

Query: 570 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 629
                  D    +++A  A +F+   +   +  RL HS+R+G    PG   DYA +I   
Sbjct: 455 -------DVPASIDLARRAFAFVATEM--TRHGRLGHSWRDGRLLFPGLASDYAAMIRAA 505

Query: 630 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 689
           L L+E     ++L  A+  Q   D    D E G Y+ +  +   +++R     D A P+ 
Sbjct: 506 LALHEATGEKEFLDRAVAWQEAFDHHHQDVETGTYYLSADDAEGLVVRPSATTDDAIPNP 565

Query: 690 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKD 729
           N ++  NLVRLA +   +  D +R+ A+  L     R  D
Sbjct: 566 NGLAAQNLVRLAVL---TGDDRWRERADALLEGLLPRAAD 602


>gi|291009338|ref|ZP_06567311.1| hypothetical protein SeryN2_32865 [Saccharopolyspora erythraea NRRL
           2338]
          Length = 683

 Score =  393 bits (1009), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 241/631 (38%), Positives = 327/631 (51%), Gaps = 77/631 (12%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           +RLA   SPYLLQHA NPVDW+ W  EAF EAR+RDVP+ LSIGY+ CHWCHVM  ESFE
Sbjct: 5   HRLADATSPYLLQHADNPVDWWQWSPEAFEEARRRDVPVLLSIGYAACHWCHVMAHESFE 64

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE  A ++N+ FV+IKVDREERPDVD VYM   QA+ G GGWP++ FL+PD +P   GTY
Sbjct: 65  DEATAAVMNENFVNIKVDREERPDVDAVYMEATQAMTGQGGWPMTCFLTPDAEPFHCGTY 124

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE-LP 281
           +P    +G P F+ +L  V  AW ++   + Q+    +EQL      SA    LP+  L 
Sbjct: 125 YPSAPLHGMPSFRQLLDAVASAWRERGGEVRQAATRVVEQL------SAQRTALPESFLD 178

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
              +     +L    D    GFG APKFP  + ++ +L H ++    G    A E   M 
Sbjct: 179 DEVIATAVSRLHAESDPDHAGFGGAPKFPPSMVLEFLLRHQERQSAPGSGHTALE---MA 235

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             T + MA+GGI+D + GGF RYSVD  W VPHFEKMLYD   L  VY       +    
Sbjct: 236 EATCEAMARGGIYDQLAGGFARYSVDSAWVVPHFEKMLYDNALLLRVYAHLARRRESPLA 295

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE-- 459
             + R+   +L RD+  P G   ++ DAD   TEG     EG  YVWT +++ ++LGE  
Sbjct: 296 ERVARETAAFLLRDLRTPEGGFAASLDAD---TEGV----EGLTYVWTPEQLAEVLGEAD 348

Query: 460 ---HAILF---KEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
               A LF   +   + + T    L R  DP +  + + V                    
Sbjct: 349 GAWAAELFEVTESGTFEQGTSTLQLKR--DPDDPARWRRV-------------------- 386

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
                   R  L++ RS+RP+P  DDKV+ SWNG+ I++   AS  L             
Sbjct: 387 --------RDALYEARSRRPQPGKDDKVVTSWNGMAITALVEASTALG------------ 426

Query: 574 VGSDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLISGLLD 631
                 E++  AE AA   + RHL D+   RL+ S R+G    A G L+DY  L  GLL 
Sbjct: 427 ----EPEWLAAAEQAAKLLVERHLVDQ---RLRRSSRDGVVGAAAGVLEDYGCLADGLLS 479

Query: 632 LYEFGSGTKWLVWAIELQNTQDELFLDREG-GGYFNTTGEDPSVLLRVKEDHDGAEPSGN 690
           L++     +WL  A  L +T  E F D +  G YF+T  +   ++ R  +  D A PSG 
Sbjct: 480 LHQATGEPRWLDVACSLLDTALEQFADSDNPGAYFDTAADSEELVRRPSDPTDNASPSGA 539

Query: 691 SVSVINLVRLASIVAGSKSDYYRQNAEHSLA 721
           S     L+  +++  GS +  YR  AE +L+
Sbjct: 540 SSLTSALLTASALAGGSAAQRYRHAAEQALS 570


>gi|114326678|ref|YP_743835.1| thymidylate kinase [Granulibacter bethesdensis CGDNIH1]
 gi|114314852|gb|ABI60912.1| thymidylate kinase [Granulibacter bethesdensis CGDNIH1]
          Length = 679

 Score =  393 bits (1009), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 243/649 (37%), Positives = 333/649 (51%), Gaps = 70/649 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L+   SPYLLQHA NPV W  WG +A   ARK D PI LSIGY+ CHWCHVM  ESFE
Sbjct: 15  NHLSEALSPYLLQHADNPVHWLPWGTQALEHARKTDRPILLSIGYAACHWCHVMAHESFE 74

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+  A  +N+ F+ IKVDREERPD+D +YM+ + A+   GGWPL++FL+P+ +P  GGTY
Sbjct: 75  DQATADEMNNAFICIKVDREERPDIDHIYMSALHAMGQQGGWPLTMFLTPEGQPFWGGTY 134

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLP--DEL 280
           FPPE ++GRP F+ +L  ++DAW  +R  + Q+    + QL+ A++  + +   P  D L
Sbjct: 135 FPPEPRFGRPSFRQVLAAIRDAWATRRSAIEQN----LGQLTRAMNRLSETAAGPEVDVL 190

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
             NA+      L ++ D   GGF  APKFP      +  +  ++   TG+     E    
Sbjct: 191 LLNAVDAA---LLRNLDPEKGGFTGAPKFP---NAPVFRFFWQEFHRTGR----PELSDA 240

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
           V   L  MA+GGI+DH+GGGF RYS D  W VPHFEKM YD GQ+  +    ++      
Sbjct: 241 VHAVLSHMARGGIYDHLGGGFARYSTDAEWLVPHFEKMAYDNGQILELLSLGYAQNPTPL 300

Query: 401 YSYICRDILDYLRRDMIGP---GGEIFSA-EDADSAETEGATRKKEGAFYVWTSKEVEDI 456
           Y+    + + +L RDM  P   GG  F+A EDADS   EG    +EG FY+W   E++ +
Sbjct: 301 YARCIEETVGWLIRDMSVPVEGGGTAFAASEDADS---EG----EEGRFYIWHEDEIDAL 353

Query: 457 LGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
           LGE A  FK+ + +   GN            ++G  +L  L  S           E    
Sbjct: 354 LGEAATGFKQAFDVTREGN------------WEGHTILRRLTISP----------EADAE 391

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
              + RR LF  R  RPRP  DDKV+  WNGLVI    RA+  L                
Sbjct: 392 SWAQERRILFQSRENRPRPGRDDKVLADWNGLVIVGLVRAAIAL---------------- 435

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
           DR +++  AESA   +R  L  E   R+ H++R G   A G LDD A +I   L LYE  
Sbjct: 436 DRADWLSAAESAYEAVRAALGSEDG-RIAHAWRLGRITAAGLLDDQASMIRAALSLYEAT 494

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
              ++L  A+ L  +    F    G  Y      D   L R     D A PSGN +    
Sbjct: 495 GQERYLSDAVTLAQSARSFFSSETGAFYTTAHDADDVPLTRPCTASDNAVPSGNGMMADA 554

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLS 745
           L RL  +    +   + + A   +  F  R + +A + P +  AAD+L+
Sbjct: 555 LARLYHLTGEQR---WYEAASGLIRAFTGRPQSLA-SSPYLLMAADLLT 599


>gi|374987022|ref|YP_004962517.1| hypothetical protein SBI_04265 [Streptomyces bingchenggensis BCW-1]
 gi|297157674|gb|ADI07386.1| hypothetical protein SBI_04265 [Streptomyces bingchenggensis BCW-1]
          Length = 677

 Score =  393 bits (1009), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 239/627 (38%), Positives = 333/627 (53%), Gaps = 58/627 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYLLQHA NPVDW  W +EAF EAR+R VP+ LS+GYS+CHWCHVM  ESFE
Sbjct: 3   NRLAHETSPYLLQHADNPVDWRPWSDEAFEEARRRGVPVLLSVGYSSCHWCHVMARESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE  A  LN  FVS+KVDREERPDVD VYM  VQA  G GGWP++VFL+P+ +P   GTY
Sbjct: 63  DEATADYLNAHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMTVFLTPEAEPFYFGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPP  ++G P F+ +L  V+ AW  +RD +       +  L+E   AS +        P+
Sbjct: 123 FPPAPRHGMPSFQQVLEGVQAAWADRRDEVKDVAERIVRDLAERGGASLAYGAAQPPGPE 182

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
           + L      L++ +D+  GGFG APKFP  + ++ +L H  +   TG         ++V 
Sbjct: 183 D-LHTALMTLTREFDAVHGGFGGAPKFPPSMVLEFLLRHHAR---TGSQA----ALQIVQ 234

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            T + MA+GGI+D +GGGF RY+VD  W VPHFEKMLYD   L  VY   +  T      
Sbjct: 235 ATCEAMARGGIYDQLGGGFARYAVDATWTVPHFEKMLYDNALLCRVYAHLWRATGSDLAR 294

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-A 461
            +  +  ++L R++    G   SA DADS + +G     EGA+YVWT +++ + LGE  A
Sbjct: 295 RVAVETAEFLVRELRTEQGGFASALDADSDDGKGG--HAEGAYYVWTPEQLSEALGEKDA 352

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
            L  E++ +   G             F+  + ++ L D  A A       E+  ++    
Sbjct: 353 ELAAEYFGVTEEGT------------FEQSSSVLRLPDREALADA-----ERIASV---- 391

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           R +L   R +RPRP  DDKV+ +WNGL +++ A                      DR + 
Sbjct: 392 RERLLAARGQRPRPGRDDKVVAAWNGLAVAALAETGAYF----------------DRPDL 435

Query: 582 MEVAESAAS-FIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAFLISGLLDLYEFGSGT 639
           +E A +AA   +R HL D    RL  +  +G + A  G L+DYA +  G L L       
Sbjct: 436 VEAATAAADLLVRVHLDDRG--RLARTSLDGTAGAHAGVLEDYADVAEGFLALSSVTGEG 493

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSGNSVSVINLV 698
            W+  A  L +T    F   +G  Y   T +D   L+R  +D  D A PSG + +   L+
Sbjct: 494 AWVGLAGLLLDTVQRHFAAEDGMLY--DTADDAEALIRRPQDPTDNAAPSGWTAAAGALL 551

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFET 725
             A++   +  D  R+ AE +L V + 
Sbjct: 552 SYAAV---TGEDRPREAAERALGVVQA 575


>gi|423720021|ref|ZP_17694203.1| thioredoxin domain protein [Geobacillus thermoglucosidans
           TNO-09.020]
 gi|383366783|gb|EID44068.1| thioredoxin domain protein [Geobacillus thermoglucosidans
           TNO-09.020]
          Length = 637

 Score =  393 bits (1009), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 230/561 (40%), Positives = 320/561 (57%), Gaps = 53/561 (9%)

Query: 148 STCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 207
           STCHWCHVM  ESFEDE VAK+LN+ +VSIKVDREERPD+D VYM   Q + G GGWPLS
Sbjct: 4   STCHWCHVMAHESFEDEEVAKILNEKYVSIKVDREERPDIDSVYMRVCQMMTGQGGWPLS 63

Query: 208 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 267
           VFL+P+ KP   GTYFP + +YGRPGF  +L ++ D + +  D +        EQ++EAL
Sbjct: 64  VFLTPEGKPFYAGTYFPKQSRYGRPGFIELLTRLYDKYKENPDEIVHVA----EQVTEAL 119

Query: 268 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRP-VEIQMMLYHSKKLE 326
             SA ++   + LP  A+     QL   +D+ +GGFG APKFP P + + +M Y+  K +
Sbjct: 120 RQSARASG-TERLPFAAIEKAYRQLLNGFDAVYGGFGGAPKFPIPHMLMFLMRYYQWKRD 178

Query: 327 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 386
           D            MV  TL  MA GGI+DH+G GF RYS D  W VPHFEKMLYD   L 
Sbjct: 179 D--------RALLMVEKTLNGMANGGIYDHIGYGFARYSTDAMWLVPHFEKMLYDNALLV 230

Query: 387 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 446
             Y +A+ LTK   Y  I   I+++++R+M    G  +SA DADS   EG     EG +Y
Sbjct: 231 IAYTEAYQLTKKERYKEIAEQIIEFVKREMTSQDGAFYSAVDADS---EGV----EGKYY 283

Query: 447 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASA 504
           VWT  EV ++LG       E Y       C +  ++D  N F GKNV  LI        A
Sbjct: 284 VWTPDEVVNVLGAE---LGELY-------CRVYDITDEGN-FAGKNVPNLIHAR-MERLA 331

Query: 505 SKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEA 564
            +  +  E+    L E R++L   RS R RPH+DDK++ +WN L+I++ A+A+K+     
Sbjct: 332 RRYRLTEEELRERLEEARKQLLAERSSRVRPHVDDKILTAWNALMIAALAKAAKVY---- 387

Query: 565 ESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 624
                       +R++Y+++A+ A SFI  HL+  Q  RL   +R G  K  G +DDYA+
Sbjct: 388 ------------ERRDYLQMAKQALSFIETHLW--QNGRLMVRYRGGEVKHLGIIDDYAY 433

Query: 625 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 684
           L+   +++YE      +L  A         LF D + G +F T  +  ++++R KE +DG
Sbjct: 434 LVWAYVEMYEATLDLAYLQKAKTCAERMISLFWDEKHGAFFMTGNDAEALIIREKEIYDG 493

Query: 685 AEPSGNSVSVINLVRLASIVA 705
           A PSGNSV+ + ++RLA +  
Sbjct: 494 ALPSGNSVAAVQMIRLARLTG 514


>gi|448591505|ref|ZP_21650993.1| hypothetical protein C453_10720 [Haloferax elongans ATCC BAA-1513]
 gi|445733479|gb|ELZ85048.1| hypothetical protein C453_10720 [Haloferax elongans ATCC BAA-1513]
          Length = 702

 Score =  392 bits (1008), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 230/600 (38%), Positives = 316/600 (52%), Gaps = 58/600 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W E A   AR+ D PIFLSIGYS CHWCHVM  ESF 
Sbjct: 8   NRLDDEQSPYLRQHADNPVNWQPWDETALEAAREADKPIFLSIGYSACHWCHVMADESFS 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  +A+ LN+ FV +KVDREERPD+D++Y T  Q + GGGGWPLSV+L+P  KP   GTY
Sbjct: 68  DPDIAETLNEHFVPVKVDREERPDLDRIYQTICQLVTGGGGWPLSVWLTPQGKPFFVGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDML---AQSGAFAI-EQLSEA--LSASASSNKL 276
           FPPE + G PGF+ ++    ++W   RD +   AQ    AI +QL +       A  +++
Sbjct: 128 FPPEPRRGAPGFRDLVESFAESWQTDRDEIENRAQQWTSAIHDQLEDTPDTPGEAPGSEI 187

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
            D+  Q ALR                    PKFP+P  I  +L   +    TG+     E
Sbjct: 188 LDQTVQAALRAADRDDGGFG--------GGPKFPQPGRIDSLL---RGYAITGR----RE 232

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
              + + +L  MA GG+ DH+GGGFHRY VD+ W VPHFEKMLYDQ  L   YLD + LT
Sbjct: 233 ALDVAVESLDAMANGGLRDHLGGGFHRYCVDKDWTVPHFEKMLYDQAGLVPRYLDTYRLT 292

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
               Y+ +  +  +++RR++    G  F+  DA S         +EG FYVWT  EV  +
Sbjct: 293 GTEAYADVAVETFEFVRRELSHDDGGFFATLDAQSG-------GEEGTFYVWTPDEVRSL 345

Query: 457 LGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS-SASASKLGMPLEKY 514
           L E  A LF + Y + P GN            F+ K  ++ ++ + S  A +  +  ++ 
Sbjct: 346 LPELEADLFCDRYGITPGGN------------FENKTTVLNVSATVSDLAEEYDLSEDEV 393

Query: 515 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 574
            + L E R+ LF  RS R RP  D+K+I  WNGL+IS+FA+ +  L+ ++          
Sbjct: 394 EDKLAEARKALFAARSGRERPARDEKIIAGWNGLMISAFAQGAVALEDDS---------- 443

Query: 575 GSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE 634
                   + A  A  FIR HL+D     L     NG  K  G+L+DYAFL  G  DLY+
Sbjct: 444 ------LADDARRALDFIREHLWDADAEHLSRRVMNGEVKGDGYLEDYAFLARGAFDLYQ 497

Query: 635 FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
                + L +A++L       F D   G  + T     +++ R +E  D + PS   V+ 
Sbjct: 498 ATGDVEPLAFALDLGRAIHREFYDDAAGTLYFTPESGEALVTRPQEATDQSTPSSLGVAT 557


>gi|255531347|ref|YP_003091719.1| hypothetical protein Phep_1443 [Pedobacter heparinus DSM 2366]
 gi|255344331|gb|ACU03657.1| protein of unknown function DUF255 [Pedobacter heparinus DSM 2366]
          Length = 670

 Score =  392 bits (1008), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 235/634 (37%), Positives = 331/634 (52%), Gaps = 60/634 (9%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           N   N L    SPYLLQHA+NPV+W+ WG EA  +A   +  I +SIGYS CHWCHVME 
Sbjct: 2   NTEPNSLIKASSPYLLQHAYNPVNWYEWGAEALQKASAENKLILVSIGYSACHWCHVMER 61

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFE+  VA+++N  FV IKVDREERPD+D++YM  +Q + G GGWPL+    PD +P+ 
Sbjct: 62  ESFENHEVAEVMNRHFVCIKVDREERPDIDQIYMLAIQLMTGSGGWPLNCICLPDQRPIY 121

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLP- 277
           GGTYF   D      +  +L  V   W  + D   ++ A+A ++L++ +    +   +P 
Sbjct: 122 GGTYFRKAD------WVNVLESVAAMWANEPD---KAIAYA-DRLTDGI--QNAEKIIPQ 169

Query: 278 ---DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEA 334
              DE  +  L    E   + +D   GG+  APKFP P   Q ML +S  ++D      A
Sbjct: 170 IKVDEYTKAHLTAITEPWKRYFDMAEGGYNRAPKFPLPNNWQFMLRYSHLMQDDATHVSA 229

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
                  L TL+ MA GGI+DHV GGF RYSVD  WHVPHFEKMLYD GQL ++Y +A+ 
Sbjct: 230 -------LLTLEKMAMGGIYDHVAGGFSRYSVDGDWHVPHFEKMLYDNGQLISLYAEAYQ 282

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
            ++ + +  +  + +++L R+M+ P G  ++A DADS   EG     EG FYVW   + E
Sbjct: 283 YSRSLLFKEVAEESIEWLEREMMSPEGLFYAALDADS---EGV----EGKFYVWDKPDFE 335

Query: 455 DILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 514
            +LG+ A L  +++ +   GN           E +  N+L+        A   G+ + + 
Sbjct: 336 AVLGDDADLLSDYFNVTDEGNW----------EEEQTNILLRKFTEEEYAEVKGISVVEL 385

Query: 515 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 574
           L  +   + KL   RSKR RP LDDK + +WN + I   A +++I               
Sbjct: 386 LQKIKTAKIKLLQERSKRIRPGLDDKCLTAWNAMAIKGLAESAEIF-------------- 431

Query: 575 GSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE 634
             D   Y E+A+ AASFI  H+ +     L  +F+N  +  PGFLDDYAF I  L+ LYE
Sbjct: 432 --DHPHYYEMAKKAASFILAHV-NTADGGLYRNFKNDKASIPGFLDDYAFFIEALIALYE 488

Query: 635 FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
                 WL  A  L +     F D      F T+    +++ R  E  D   P+ NSV  
Sbjct: 489 ADFDENWLKEAKRLCDYVLLNFEDEHSPMLFYTSAAGETLIARKHEIMDNVVPASNSVMA 548

Query: 695 INLVRLASIVAGSKSDYYRQNAEHSLAVFETRLK 728
            NL +L  +      D Y   AE  LA    ++K
Sbjct: 549 QNLHKLGLLF---DEDVYSIKAEEMLAAVLPQIK 579


>gi|395774413|ref|ZP_10454928.1| hypothetical protein Saci8_31786 [Streptomyces acidiscabies 84-104]
          Length = 682

 Score =  392 bits (1008), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 230/589 (39%), Positives = 311/589 (52%), Gaps = 54/589 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYLLQHA NPVDW+ W  EAF EAR+ + P+ LS+GYS+CHWCHVM  ESFE
Sbjct: 3   NRLAHETSPYLLQHADNPVDWWPWSAEAFEEARRSERPVLLSVGYSSCHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+  A  LN+ FVS+KVDREERPDVD VYM  VQA  G GGWP++VFL+PD +P   GTY
Sbjct: 63  DQHTADYLNEHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMTVFLTPDAEPFYFGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE-ALSASASSNKLPDELP 281
           FPPE ++G P F+ +L  V+ AW  +RD +A+     +  L E  LS   +     +EL 
Sbjct: 123 FPPEPRHGSPSFRQVLEGVRQAWTGRRDEVAEVAGKIVRDLGERELSFGDAQPPGEEELA 182

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
              L      L++ YD + GGFG APKFP  + I+ +L H  +   TG  G      +M 
Sbjct: 183 AALL-----GLTREYDPQRGGFGGAPKFPPSMVIEFLLRHHAR---TGSEG----ALQMA 230

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  VY   +  T     
Sbjct: 231 ADTCERMARGGIYDQLGGGFARYSVDRDWIVPHFEKMLYDNALLCRVYAHLWRSTGSELA 290

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHA 461
             I  +  D++ R++  P G   SA DADS   +G  +  EGA+YVWT  E+ D LGE A
Sbjct: 291 RRIALETADFMVRELRTPEGGFASALDADS--DDGTGKHVEGAYYVWTMAELRDTLGEDA 348

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASASKLGMPLEKYLNILGE 520
            L   ++ +   G  +           +G +VL +   +    A K           +  
Sbjct: 349 DLAAHYFGVTEDGTFE-----------EGASVLQLPQTEGVFDADK-----------IAS 386

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
              +L   R++RP P  DDK++ +WNGL I++ A                      DR +
Sbjct: 387 IHARLLAKRAERPAPGRDDKIVAAWNGLAIAALAETGAYF----------------DRPD 430

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
            +E A +AA  + R   D+  H  + S    P    G L+DY  +  G L L    +   
Sbjct: 431 LIEAALTAADLVVRIHLDDHAHLSRTSKDGQPGANAGVLEDYGDVAEGFLALAAVTAEGV 490

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 689
           WL +A  L +     F D E G  ++T  +   ++ R ++  D A PSG
Sbjct: 491 WLDFAGLLLDHVLARFTDPESGALYDTASDAEQLIRRPQDPMDNATPSG 539


>gi|282899862|ref|ZP_06307823.1| protein of unknown function DUF255 [Cylindrospermopsis raciborskii
           CS-505]
 gi|281195132|gb|EFA70068.1| protein of unknown function DUF255 [Cylindrospermopsis raciborskii
           CS-505]
          Length = 689

 Score =  392 bits (1007), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 254/710 (35%), Positives = 371/710 (52%), Gaps = 88/710 (12%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRLA   S YL +HA NP+DW+ W  EA   A+  D PIFLSIGYS+CHWC VME E+F
Sbjct: 2   TNRLAKTRSLYLRKHADNPIDWWTWCNEALLMAQTEDKPIFLSIGYSSCHWCTVMEGEAF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGG 220
            D  +A+ +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL+ FLSP DL P   G
Sbjct: 62  SDLAIAEYMNANFIPIKVDREERPDIDSIYMQSLQMMTGQGGWPLNAFLSPDDLVPFYAG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFP   +YGRPGF  +L+ ++  +D +++   Q  A  +E L   LS++   N   D+ 
Sbjct: 122 TYFPVAPRYGRPGFLEVLQAIRHYYDHQKEDFRQRKASILEAL---LSSTVLQNHDLDQF 178

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPK-----FPRPVEIQMMLYHSKKLEDTGKSGEAS 335
             +        L + +++  G     PK     FP     Q++L  ++          A+
Sbjct: 179 AHSQFH---RFLKQGWETAIGVI--TPKQMGNSFPMIPYCQLVLQGTRF-----NYPSAN 228

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
           +G +M       +A GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ+     + +S 
Sbjct: 229 DGLQMATQRGLDLALGGIYDHVGGGFHRYTVDATWTVPHFEKMLYDNGQIVEYLANLWSA 288

Query: 396 -TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
             ++  +       + +L R+MI P G  ++A+DADS         +EGAFYVW+ +E++
Sbjct: 289 GVEEPAFKRAVAGTVSWLEREMISPTGYFYAAQDADSFNCSTDMEPEEGAFYVWSYRELQ 348

Query: 455 DILGEHAIL-FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
           ++L +  +L  KEH+ L   GN            F+GKNVL  L     SA +L   LE 
Sbjct: 349 ELLSDQELLEVKEHFSLSLEGN------------FEGKNVLQRL-----SAGELSSSLEL 391

Query: 514 YLNILGECR--------------RKLFDVRSK----RPRPHLDDKVIVSWNGLVISSFAR 555
            L  L  CR              R   + ++     R  P  D K+IV+WN L+IS  AR
Sbjct: 392 ILGRLFLCRYGQTAETLTIFPPARNNHEAKTNPWHGRIPPVTDTKMIVAWNSLMISGLAR 451

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSFRNGPSK 614
           AS++ +                +  Y+++A  A  FI  H + D + HRL +   +G   
Sbjct: 452 ASEVFQ----------------QPSYLQLAVQATRFILDHQFVDGRFHRLNY---DGEPT 492

Query: 615 APGFLDDYAFLISGLLDLYEFGSG-TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPS 673
                +DYA  I  LLDL++  SG + WL  AI LQ+  +E  L  E GGYFNT+ ++  
Sbjct: 493 VLAQSEDYALFIKALLDLHQADSGSSNWLEQAITLQDEFNEFLLSVELGGYFNTSSDNSQ 552

Query: 674 -VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAM 732
            +++R +   D A PS N V++ NL++L  +   + + YY   AE +L  F T ++    
Sbjct: 553 DLIIRERNFVDNATPSANGVAIANLIKLCLL---TDNLYYLDLAESALKAFSTIIEKSPQ 609

Query: 733 AVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVSK 782
           + P +  A D       ++  LV  +SS+D   +LA  +    +   +SK
Sbjct: 610 SCPSLLIAIDWY-----RNSTLV--RSSIDNIKILAGKYLPTTIFDVISK 652


>gi|332292243|ref|YP_004430852.1| N-acylglucosamine 2-epimerase [Krokinobacter sp. 4H-3-7-5]
 gi|332170329|gb|AEE19584.1| N-acylglucosamine 2-epimerase [Krokinobacter sp. 4H-3-7-5]
          Length = 679

 Score =  392 bits (1007), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 226/605 (37%), Positives = 332/605 (54%), Gaps = 52/605 (8%)

Query: 101 HTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVES 160
           +TN L  E SPYLLQHAHNPVDW  W E+  A+A+K +  + +SIGYS+CHWCHVME ES
Sbjct: 5   YTNDLIQETSPYLLQHAHNPVDWKPWNEQTLAQAKKENKLLLISIGYSSCHWCHVMEHES 64

Query: 161 FEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGG 220
           FE+  VA+L+N  F +IKVDREERPDVD VYM  VQ +   GGWPL+    PD +P+ GG
Sbjct: 65  FENTEVAQLMNAHFKNIKVDREERPDVDNVYMNAVQLMTSRGGWPLNAIALPDGRPVWGG 124

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD-- 278
           TYFP E+      + + L ++   +    + L +  A  +EQ  + + A   ++  PD  
Sbjct: 125 TYFPKEE------WTSALEQIAKLYQTAPEKLIEY-AEKLEQGMQEMDAIIPNDSSPDFK 177

Query: 279 -ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEG 337
            E  QNA+     Q S+ +D+R GG   APKF  P     +L ++ + +D        E 
Sbjct: 178 LETLQNAI----SQWSRQWDTRQGGLNRAPKFMMPNNYLFLLRYAHQNQD-------QEI 226

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
            + V  TL+ +A GGI+DHVGGGF RYSVD +WHVPHFEKMLYD  QL ++Y  A++ TK
Sbjct: 227 LEYVNTTLEQIAFGGINDHVGGGFARYSVDTKWHVPHFEKMLYDNAQLVSLYALAYTKTK 286

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           +  Y       L ++ R+M    G  +SA DADS   +G    +EGA+YVWT KE++ ++
Sbjct: 287 NPLYKQTVYQTLTFIAREMTTEDGAFYSAIDADSLTADGIL--EEGAYYVWTEKELQTLV 344

Query: 458 GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
           G+   LFKE+Y +   G  +           K   VLI  +     + +  + +E+ ++ 
Sbjct: 345 GDDFDLFKEYYNINSYGKWE-----------KDNYVLIRQDTDQDFSKECDISVEEIISK 393

Query: 518 LGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
             +    L   R S + +P LDDK++ SWNGL+I  +  A +    +A            
Sbjct: 394 KNKWHEDLLRFRESNKEKPRLDDKILTSWNGLMIKGYVDAYRAFNEDA------------ 441

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
               ++  A   A+F+  +L  E    L  +F+NG S   G+L+DYA ++   + LYE  
Sbjct: 442 ----FLTAALKNATFLSTNLMREDG-GLNRTFKNGKSTINGYLEDYAAIVDAFIALYEVT 496

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
           +  +WL  A EL +   + F + +   +F  + +DPS+  R  E +D   PS NS+   N
Sbjct: 497 ADNQWLNKAKELTDYTFQHFQNPKNDLFFFKSNQDPSLASRNTEFYDNVIPSSNSIMAKN 556

Query: 697 LVRLA 701
           +  L+
Sbjct: 557 IFTLS 561


>gi|448410530|ref|ZP_21575235.1| hypothetical protein C475_12927 [Halosimplex carlsbadense 2-9-1]
 gi|445671566|gb|ELZ24153.1| hypothetical protein C475_12927 [Halosimplex carlsbadense 2-9-1]
          Length = 719

 Score =  392 bits (1007), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 238/656 (36%), Positives = 339/656 (51%), Gaps = 58/656 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W E A A A ++D PIFLSIGY+ CHWCHVME ESF 
Sbjct: 10  NRLDEEESPYLRQHADNPVNWQPWDEAALAAAEEQDKPIFLSIGYAACHWCHVMEEESFA 69

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE +A+LLN+ FV IKVDREERPD+D +YM+  Q + G GGWPL+ +L+PD  P   GTY
Sbjct: 70  DEDIAELLNENFVPIKVDREERPDIDSIYMSICQQVSGRGGWPLNAWLTPDGDPFYVGTY 129

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS-ASSNKLPDELP 281
           FPPE K G PGF+ +L  + ++W    D           Q ++A++    ++   P + P
Sbjct: 130 FPPEPKRGAPGFRQLLDDISESWADSEDRAEMED--RARQWTDAIANDLETTPDQPGDAP 187

Query: 282 -QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
            ++ L   A    +  D  FGG+G   KFP+P  +++++          +SG     +++
Sbjct: 188 GEDVLDTTASAALRGADREFGGWGKGQKFPQPGRLRVLMRAH-------RSGGRDAYREV 240

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
           V  TL  M  GG++DHVGGGFHRY+ D  W VPHFEKMLYD  +LA V+L  +  T    
Sbjct: 241 VGETLDAMGDGGLYDHVGGGFHRYTTDREWVVPHFEKMLYDNAELARVFLTGYQFTGRER 300

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
           Y    R+ L+++ R++  P G  +S  DA+S        ++EGAFY WT   V+D + E+
Sbjct: 301 YRETARETLEFVERELTHPDGGFYSTLDAESEGE--EGEREEGAFYAWTPDGVDDAVAEY 358

Query: 461 --------------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 506
                         A +F+E Y +  TGN +            G+ VL       + A  
Sbjct: 359 GPEHGVPGEQASLAAEIFRERYGVTATGNFE-----------GGETVLTRSASVESLADD 407

Query: 507 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 566
            G+ L    ++L      +F  R +RPRP  D+KV+  WNGL++S+FA A+ +       
Sbjct: 408 YGLSLGDAEDLLDAATTAVFAAREERPRPPRDEKVLAGWNGLMVSAFAEAAVV------- 460

Query: 567 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 626
                     D + +   A  A  F R HL+D  + RL   F++G     G+L+DYAFL 
Sbjct: 461 ----------DDESWAGTATEALDFARDHLWDADSGRLSRRFKDGDVDIRGYLEDYAFLA 510

Query: 627 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 686
            G  D Y+     + L +A+EL  T +  F D E    + T     S++ R +E  D + 
Sbjct: 511 RGAFDTYQATGEVEHLAFALELARTIETEFWDAEEETLYFTPQSGESLVARPQELADQST 570

Query: 687 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 742
           PS   V+   L+ L   V     D +   A   LA    R++      P +  AAD
Sbjct: 571 PSSAGVAAELLLALDHFV---DHDRFETVASGVLATHGGRVESNPQQHPSLALAAD 623


>gi|288956849|ref|YP_003447190.1| hypothetical protein AZL_000080 [Azospirillum sp. B510]
 gi|288909157|dbj|BAI70646.1| hypothetical protein AZL_000080 [Azospirillum sp. B510]
          Length = 685

 Score =  392 bits (1006), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 246/675 (36%), Positives = 347/675 (51%), Gaps = 75/675 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L  E SPYLLQH  NPV W  WG EAFA AR  + P+ LS+GY+ CHWCHVM  ESFE
Sbjct: 4   NLLGRETSPYLLQHKDNPVHWMPWGPEAFARARAENKPVLLSVGYAACHWCHVMAHESFE 63

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +  +A L+N+ F++IKVDREERPD+D +Y + +  L   GGWPL++FL+PD +P  GGTY
Sbjct: 64  NPEIAGLMNELFINIKVDREERPDLDTIYQSALALLGQQGGWPLTMFLTPDAEPFWGGTY 123

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASAS--SNKLPDEL 280
           FPP  +YGR GF  +LR +   +  + D + ++    +E L  AL+      S      +
Sbjct: 124 FPPAQRYGRAGFPDVLRGIAGTYTDEPDKVGKN----VEALRSALAGIGENRSAGAAGTI 179

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
               L   A++L +  D   GG GSAPKFP+ V +  +L+ + +   TG+       +  
Sbjct: 180 DAGMLDQVAQRLLREVDPIHGGIGSAPKFPQ-VPLFELLWRAWR--RTGR----EPFRDA 232

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
           V  TL  MA+GGI+DH+GGGF RYSVDERW VPHFEKMLYD  +L ++    +  T+D  
Sbjct: 233 VTHTLANMAQGGIYDHLGGGFARYSVDERWLVPHFEKMLYDNAELLDLMTLVWQETRDPL 292

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-- 458
                R+ + +L R+MI  GG   +  DADS   EG    +EG FY+W  +EV+ +LG  
Sbjct: 293 LETRIRETVGWLLREMIAEGGGFAATLDADS---EG----EEGLFYIWREEEVDRLLGPA 345

Query: 459 ---EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
              +    FK  Y + P GN            ++G  +L  L   + +        E   
Sbjct: 346 LGADGLATFKRVYEVLPQGN------------WEGVTILNRLGGLTPAD-------ESTE 386

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
            +L + R  L   R+KR RP  DDKV+  WNGL+I++   A+                  
Sbjct: 387 AMLAKGREALSRARAKRVRPGWDDKVLADWNGLMIAALTHAALA---------------- 430

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
            D  E+++ A  A +F+R  +  +   RL HS+R+G  K  G LDDYA +    L L+E 
Sbjct: 431 LDEPEWLDAAGRAFAFVRDRM--DSGGRLCHSWRHGQGKHAGMLDDYAHMARAALALHEA 488

Query: 636 GSGTKWL----VWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNS 691
                 L    VWA  L    D  F D   GGYF T  +   +++R K  +D A PSGN 
Sbjct: 489 TGDPAALDQAKVWAAAL----DAHFWDDANGGYFFTADDAEGLIVRTKTAYDNATPSGNG 544

Query: 692 VSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKH 751
                L  L  +   +  D YR  AE     F   L      +P    A ++++ P    
Sbjct: 545 TM---LAVLTILFQRTGEDAYRDRAEALATAFSGELTRNFFPLPTFLNAVELMTAP--LQ 599

Query: 752 VVLVGHKSSVDFENM 766
           +V+VG   + + E +
Sbjct: 600 IVIVGPPRTAETEAL 614


>gi|345008957|ref|YP_004811311.1| hypothetical protein [Streptomyces violaceusniger Tu 4113]
 gi|344035306|gb|AEM81031.1| hypothetical protein Strvi_1280 [Streptomyces violaceusniger Tu
           4113]
          Length = 678

 Score =  392 bits (1006), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 229/621 (36%), Positives = 326/621 (52%), Gaps = 52/621 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYLLQHA NPVDW+ W ++AF +AR+R VP+ LS+GYS+CHWCHVM  ESFE
Sbjct: 3   NRLAHETSPYLLQHAENPVDWWPWSDKAFEDARRRGVPVLLSVGYSSCHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+  A  LN  FVS+KVDREERPDVD VYM  VQA  G GGWP++VFL+P+ +P   GTY
Sbjct: 63  DKATADYLNAHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMTVFLTPEAQPFYFGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPP  + G   F+ +L  V  AW  +R+ +       +E L++    +  S+  P    +
Sbjct: 123 FPPRPRPGMASFRQVLEGVSAAWTDRREEVVDVAGRIVEDLAQRTGIALGSDA-PAPPGE 181

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L      L++ +D+  GGFG APKFP  + ++ +L H  +   TG  G      +MV 
Sbjct: 182 EDLHAALMGLTREFDATRGGFGGAPKFPPSMALEFLLRHHAR---TGSEG----ALQMVS 234

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  VY   +  T      
Sbjct: 235 ATCEAMARGGIYDQLGGGFARYSVDAGWTVPHFEKMLYDNALLCRVYAHLWRATGSDLAR 294

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            +  +  D++ R++    G   SA DADS   +G  R  EGA+YVWT + + ++LGE   
Sbjct: 295 RVALETADFMVRELRTAQGGFASALDADS--DDGTGRHVEGAYYVWTPERLREVLGEADA 352

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
            F   Y+                  F+    +++L D    A             +   R
Sbjct: 353 EFAAGYF-----------GVTQEGTFEQGASVLQLPDGKRPADA---------GRVASVR 392

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYM 582
            +L   R +R RP  DDK++ +WNGL +++ A                      DR + +
Sbjct: 393 ERLLAARERRARPGRDDKIVAAWNGLAVAALAETGAYF----------------DRPDLV 436

Query: 583 EVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAFLISGLLDLYEFGSGTKW 641
           +VA  AA  + R L+ +Q  RL  +  +G +    G L+DYA +  G L L        W
Sbjct: 437 DVATEAAELLMR-LHMDQRGRLARTSLDGTAGGHAGVLEDYADVAEGFLALSAVTGDGAW 495

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           + +A  L +T    F   E G  F+T  +  +++ R ++  D A PSG + +   L+  A
Sbjct: 496 VDFAGLLLDTVLTRFT-AEDGTLFDTADDAEALIRRPQDPTDNAAPSGWTAAAGALLSYA 554

Query: 702 SIVAGSKSDYYRQNAEHSLAV 722
           +I   S+   +R+ AE +LAV
Sbjct: 555 AITGSSR---HRETAERALAV 572


>gi|302553816|ref|ZP_07306158.1| spermatogenesis-associated protein 20 [Streptomyces
           viridochromogenes DSM 40736]
 gi|302471434|gb|EFL34527.1| spermatogenesis-associated protein 20 [Streptomyces
           viridochromogenes DSM 40736]
          Length = 677

 Score =  392 bits (1006), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 256/679 (37%), Positives = 357/679 (52%), Gaps = 71/679 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYLLQHA NPVDW+ W  EAF EARKR+VP+ LSIGYS+CHWCHVM  ESFE
Sbjct: 3   NRLAHETSPYLLQHADNPVDWWPWSAEAFEEARKRNVPVLLSIGYSSCHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+  A+ LN+ +VS+KVDREERPDVD VYM  VQA  G GGWP++VFL+P+ +P   GTY
Sbjct: 63  DQQTAEYLNEHYVSVKVDREERPDVDAVYMEAVQAATGHGGWPMTVFLTPEAEPFYFGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLP--DEL 280
           FPP  + G P F+ +L  V+ AWD++RD + +     +  L+     S   ++ P   EL
Sbjct: 123 FPPAPRQGMPSFRQVLEGVRQAWDERRDEVTEVAGKIVRDLA-GREISYGDDQAPGEQEL 181

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
            Q  L      L++ YD + GGFG APKFP  + ++ +L H  +   TG  G      +M
Sbjct: 182 AQALL-----ALTREYDPQRGGFGGAPKFPPSMALEFLLRHHAR---TGAEG----ALQM 229

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
              T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  VY   +  T    
Sbjct: 230 ARDTCERMARGGIYDQLGGGFARYSVDRDWIVPHFEKMLYDNALLCRVYAHLWRATGSEL 289

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
              +  +  D++ R++    G   SA DADS   +G  +  EGA+YVWT  ++ ++LGE 
Sbjct: 290 ARRVALETADFMVRELRTTEGGFASALDADS--DDGTGKHVEGAYYVWTPGQLREVLGEQ 347

Query: 461 -AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASASKLGMPLEKYLNIL 518
            A L  +++ +   G  +            G++VL +   DS   A K           +
Sbjct: 348 DAELAAQYFGVTEEGTFE-----------HGQSVLQLPQQDSLFDAGK-----------I 385

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
              R +L   R++RP P  DDKV+ +WNGL I++ A            A F+ P      
Sbjct: 386 ASVRERLLAKRAERPAPGRDDKVVAAWNGLAIAALAET---------GAYFDRP------ 430

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAFLISGLLDLYEFGS 637
                   +A   +R HL DEQ  RL  + ++G + A  G L+DYA +  G L L     
Sbjct: 431 DLVEAAVAAADLLVRLHL-DEQA-RLTRTSKDGHAGANAGVLEDYADVAEGFLALASVTG 488

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
              WL +A  L +     F D E G  F+T  +   ++ R ++  D A PSG + +   L
Sbjct: 489 EGVWLQFAGFLLDHVLVRFTDAESGALFDTAADAERLIRRPQDPTDNAAPSGWTAAAGAL 548

Query: 698 VRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC---CAADMLSVPSRKHVVL 754
           +   S  A + S+ +R  A  +L V    +K +   VP       AA   ++   + V +
Sbjct: 549 L---SYAAHTGSEPHRTAARKALGV----VKALGPRVPRFIGWGLAAAEAALDGPREVAI 601

Query: 755 VGHKSSVDFENMLAAAHAS 773
           VG   S+D E   A  H +
Sbjct: 602 VG--PSLDHEGTRALHHTA 618


>gi|373956291|ref|ZP_09616251.1| protein of unknown function DUF255 [Mucilaginibacter paludis DSM
           18603]
 gi|373892891|gb|EHQ28788.1| protein of unknown function DUF255 [Mucilaginibacter paludis DSM
           18603]
          Length = 718

 Score =  392 bits (1006), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 229/605 (37%), Positives = 329/605 (54%), Gaps = 57/605 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N+L+A  SPYLLQHA+NPV+WF WG EA  +AR  +  I +SIGYS CHWCHVME ESFE
Sbjct: 47  NKLSASTSPYLLQHANNPVNWFPWGAEALQKARDENKLILVSIGYSACHWCHVMENESFE 106

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA+++N+ FV IKVDREERPD+D++YM+ VQ + G GGWPL+    PD +P+ GGTY
Sbjct: 107 DEQVAEIMNEHFVCIKVDREERPDIDQIYMSAVQLMTGRGGWPLNCVCLPDQRPIYGGTY 166

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           F   D      +  +L  + + W++K D   ++  +A+ +L+E +    +   + +++  
Sbjct: 167 FRKTD------WMALLFNLANFWEQKPD---EAKEYAV-KLTEGIHQYENIGFVNEQMEN 216

Query: 283 NA--LRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
               L    +   +SYD + GG   APKFP P   Q ++ ++  ++D        E   +
Sbjct: 217 TPADLEAIVKPWKQSYDFKEGGLNRAPKFPMPNNWQFLMRYAYLMQD-------EETNVI 269

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
           V  TL+ MAKGGI+DH+GGGF RYSVD  WHVPHFEKMLYD  QL  +Y +AF+   D  
Sbjct: 270 VRLTLEKMAKGGIYDHIGGGFARYSVDGHWHVPHFEKMLYDNAQLIGLYSEAFTWCGDEL 329

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
           Y  +  + + +++R++  P    +SA DADS   EG     EG FY +T  EVE ILG+ 
Sbjct: 330 YKKVVAETIAFIQRELTSPENGFYSALDADS---EGV----EGKFYTFTLAEVEAILGDD 382

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
           A LF  +Y +   GN           E +  N+    +D +  A KLG+P +  ++ +  
Sbjct: 383 AGLFAIYYNVTNEGNW----------EEEHTNIFFRRDDDAVLAEKLGIPADALVDKIAG 432

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R ++ + R+KR  P LD K++ SWN L++     A +                  D   
Sbjct: 433 LRNQVLEARAKRVLPGLDYKILTSWNALMLKGLCDAYRAF----------------DEPA 476

Query: 581 YMEVAESAASFIRRHLYDE--QTHRLQHSFRNGPSK--APGFLDDYAFLISGLLDLYEFG 636
           Y+E+A   A FI+ +L ++  Q  R+ ++   G  K  A  FLDDYA LI   + LYE  
Sbjct: 477 YLELALKNAHFIKDNLINKNNQLSRV-YAKPTGDEKLDAIAFLDDYALLIDAFIALYEVT 535

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
               WL  A  L     + F D   G +F T      ++ R  E  D   PS NSV   N
Sbjct: 536 FDEAWLHQAKALTEHTLDHFYDNATGMFFYTPDYGEQLIARKFEVMDNVMPSSNSVMARN 595

Query: 697 LVRLA 701
             +L+
Sbjct: 596 FKKLS 600


>gi|344344146|ref|ZP_08775011.1| hypothetical protein MarpuDRAFT_1824 [Marichromatium purpuratum
           984]
 gi|343804430|gb|EGV22331.1| hypothetical protein MarpuDRAFT_1824 [Marichromatium purpuratum
           984]
          Length = 683

 Score =  391 bits (1004), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 251/658 (38%), Positives = 355/658 (53%), Gaps = 58/658 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL    SPYL QHA NPVDW+ W +EA A+AR+RD PI LSIGYS CHWCHVM  ESF 
Sbjct: 13  NRLDGATSPYLQQHADNPVDWWPWCDEALAQARERDRPILLSIGYSACHWCHVMAHESFA 72

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPLSVFLSP-DLKPLMGG 220
           D  VA L+N  FV+IKVDREERPD+D +Y    Q L G GGGWPL+VFLSP DL+P   G
Sbjct: 73  DPEVATLMNRAFVNIKVDREERPDLDGLYQRAHQLLNGRGGGWPLTVFLSPHDLRPFFAG 132

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFPP  ++G P F  +L  V+ A+ ++ D + Q G    E L EA  A           
Sbjct: 133 TYFPPTPRHGLPAFTQLLAGVERAYREQHDKILQQG----ENLIEAF-AGLEPEPGERPP 187

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
            +N +     QL+ S+D R GGFG APKFP   E+ ++L  + + +  G+  +A E  +M
Sbjct: 188 ERNLIGAALNQLAVSFDPRHGGFGGAPKFPHAPELALLLRCAARGDRPGE--DAPEPLEM 245

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
              +L+ M + G++D +GGGF RY+VD +W +PHFEKMLYD   L  +  D  + T +  
Sbjct: 246 ARVSLERMIRSGLNDQLGGGFCRYAVDAQWMIPHFEKMLYDNAALLALCCDLHACTGEQL 305

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
           +        D++ R+M  P G  +S+ DADS   EG    +EG FY+W  ++V  +L E 
Sbjct: 306 FRSAAESTADWVLREMQSPEGGYYSSLDADS---EG----EEGRFYLWEREQVRALLPEA 358

Query: 461 AIL-FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
               F   Y L    N            F+G+  L      +A A+  G+ LE+  ++LG
Sbjct: 359 EYRPFAAVYGLDRPPN------------FEGRWHLHGHLTPAAVAAAQGLTLEQVQSLLG 406

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
             R  LF  R +R RP  DDKV+ +WN L+I + ARA+++L                +R 
Sbjct: 407 AARATLFAERERRVRPGRDDKVLGAWNALMIGAMARAARVL----------------ERD 450

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
           +Y+E AE A   +R  L+  +  RL  S R+G      +LDD+A L++ +L+L +    T
Sbjct: 451 DYLESAEQALGCVRERLW--RDGRLLASCRDGRVAFDAYLDDHALLLATVLELLQ----T 504

Query: 640 KW----LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
           +W    L +AIEL  T    F D E GG++ T  +   ++ R K   D   P+GN V+ +
Sbjct: 505 RWSSADLAFAIELAETLLARFHDPEAGGFWFTAHDHERLIHRTKPLADETLPAGNGVAAL 564

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVV 753
            L RL  +V   +   Y    E +L +  T ++ +  A   + CA D    P  + V+
Sbjct: 565 ALQRLGHLVGEPR---YLAAVESTLRLAATAMRRLPHAHATLLCALDEWLDPPEQLVI 619


>gi|375097065|ref|ZP_09743330.1| thioredoxin domain containing protein [Saccharomonospora marina
           XMU15]
 gi|374657798|gb|EHR52631.1| thioredoxin domain containing protein [Saccharomonospora marina
           XMU15]
          Length = 673

 Score =  390 bits (1003), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 234/621 (37%), Positives = 324/621 (52%), Gaps = 63/621 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA+  SPYLLQHA NPVDW+ W  +A  EA++RDVPI LSIGY+ CHWCHVM  ESFE
Sbjct: 2   NRLASATSPYLLQHADNPVDWWPWSAQALDEAKRRDVPILLSIGYAACHWCHVMAHESFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+  A  +N  FV+IKVDREERPD+D VYMT  QA+ G GGWP++ FL+PD KP   GTY
Sbjct: 62  DDETAAFMNAHFVNIKVDREERPDIDAVYMTATQAMTGQGGWPMTCFLTPDGKPFHCGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           +PP  ++G P F+ +L  V  AW ++ D L Q     +  + E  +  A        + +
Sbjct: 122 YPPTPRHGMPSFRQVLTAVARAWSERADELRQGATKIVSHIQEQTAPLAQR-----PVDE 176

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
            A+      L    D   GGFG APKFP  + ++ +L H    E TG    ++E   +V 
Sbjct: 177 EAIATAVSTLRGQIDPGHGGFGGAPKFPPAMVMEFLLRH---YERTG----SAEALSVVE 229

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            T + MA+GGI+D + GGF RYSVD  W VPHFEKMLYD   L   Y      T     +
Sbjct: 230 LTAEGMARGGIYDQLAGGFARYSVDAAWVVPHFEKMLYDNALLLRCYAHLARRTSSALAT 289

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            +  +  ++L RD+    G   ++ DAD   TEG     EG  YVWT  ++ ++LG    
Sbjct: 290 RVAAETAEFLLRDLRTQEGGFAASLDAD---TEGV----EGLTYVWTPAQLVEVLGPEDG 342

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
            +    +          R+++      G + L    D   +A        ++L +     
Sbjct: 343 SWAAEVF----------RVTEEGTFEHGASTLQLPRDPDETA--------RWLRV----S 380

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYM 582
             L + R+ RP+P  DDKV+ +WNGL I++ A A   L                +R +++
Sbjct: 381 TALLEARNGRPQPSRDDKVVTAWNGLAITALAEAGVAL----------------ERPDWV 424

Query: 583 EVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           E A SAA   + RHL D    RL+ S R G   +A G L+DYA L  GLL +++    + 
Sbjct: 425 EAAVSAAELLLDRHLVDA---RLRRSSRGGVVGEAAGVLEDYACLAEGLLAVHQASGESV 481

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL-RVKEDHDGAEPSGNSVSVINLVR 699
           WL  A  L +T  ELF D E  G F+ T  D   L+ R  +  D A PSG S     L+ 
Sbjct: 482 WLTQATLLLDTALELFSDDELPGAFHDTAADAEALVHRPSDPTDNATPSGASALAGALLT 541

Query: 700 LASIVAGSKSDYYRQNAEHSL 720
            +++    ++  YRQ  E +L
Sbjct: 542 ASALAGPDRAGEYRQACERAL 562


>gi|374293368|ref|YP_005040403.1| hypothetical protein AZOLI_3026 [Azospirillum lipoferum 4B]
 gi|357425307|emb|CBS88194.1| conserved protein of unknown function; putative Thioredoxin and
           glycosidase domains [Azospirillum lipoferum 4B]
          Length = 683

 Score =  390 bits (1003), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 241/669 (36%), Positives = 351/669 (52%), Gaps = 66/669 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L  E SPYLLQH  NPV W  WG +AFA A+  + P+ LS+GY+ CHWCHVM  ESFE
Sbjct: 9   NLLGRETSPYLLQHKDNPVHWMPWGHDAFARAKAENKPVLLSVGYAACHWCHVMAHESFE 68

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +  +A L+N+ FV+IKVDREERPD+D +Y + +  L   GGWPL++FL+PD +P  GGTY
Sbjct: 69  NPEIAGLMNELFVNIKVDREERPDLDTIYQSALALLGQQGGWPLTMFLTPDAEPFWGGTY 128

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPP  +YGR GF  +LR +   +  ++D + ++    ++ L  ALS     N+    +  
Sbjct: 129 FPPAPRYGRAGFPDVLRGIAGTYANEQDKVGKN----VDALKSALS-GMGENRSAGAVDA 183

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L   A++L +  D   GG G+APKFP+ V +  +L+  +  + TG+       ++ V 
Sbjct: 184 GVLDQVAQRLLREVDPIHGGIGTAPKFPQ-VPLFELLW--RAWQRTGR----EPFREAVT 236

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  MA+GGI+DH+GGGF RYSVDERW VPHFEKMLYD  +L ++    +  T+D    
Sbjct: 237 HTLANMAQGGIYDHLGGGFARYSVDERWLVPHFEKMLYDNAELLDLMTLVWQETRDPLLE 296

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL----- 457
              R+ + +L R+MI  GG   +  DADS   EG    +EG FY+W  +EV+ +L     
Sbjct: 297 TRIRETVGWLLREMIADGGGFAATLDADS---EG----EEGLFYIWNEEEVDRLLTPALG 349

Query: 458 GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
            +    FK  Y + P GN +   +    N   G    + L D +  A+            
Sbjct: 350 ADGLATFKHVYEVLPQGNWEGVTIL---NRLGG----LSLADDATEAT------------ 390

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
           L + R  L   R+KR RP  DDKV+  WNGL+I++   A+                   D
Sbjct: 391 LAKGREILLRARAKRVRPGWDDKVLADWNGLMIAALTHAALA----------------LD 434

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGS 637
             E+++ A  A +F+R  +  ++  RL HS+R+G  K  G LDDYA +    L L+E   
Sbjct: 435 EPEWLDAAGRAFAFVRDRM--DKNGRLCHSWRHGQGKHTGMLDDYAHMARAALALHEATG 492

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
               L  A     T D  F D   GGYF T  +   +++R K   D A PSGN      L
Sbjct: 493 DPAALDQAKLWVATLDAHFWDGANGGYFFTADDAEGLIVRTKTAFDNATPSGNGTM---L 549

Query: 698 VRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGH 757
             LA++   +  D YR+ A+   A F   L      +     + ++++ P +  +V+VG 
Sbjct: 550 AVLATLFQRTGEDAYRERADALAAAFSGELTRNFFPLTTFLNSVELMTAPLQ--IVVVGP 607

Query: 758 KSSVDFENM 766
             + + E +
Sbjct: 608 PKAAETEAL 616


>gi|367469960|ref|ZP_09469682.1| Thymidylate kinase [Patulibacter sp. I11]
 gi|365814937|gb|EHN10113.1| Thymidylate kinase [Patulibacter sp. I11]
          Length = 685

 Score =  390 bits (1003), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 246/661 (37%), Positives = 331/661 (50%), Gaps = 57/661 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N LAAE SPYLLQHA NPVDW  WG EA   AR+ D P+ +SIGYS CHWCHVM  ESFE
Sbjct: 3   NALAAETSPYLLQHAENPVDWLPWGPEALERARREDKPLLVSIGYSACHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D   A ++N  FV +KVDREERPDVD + M  VQA+ G GGWPL+VFL+P+ +P+ GGTY
Sbjct: 63  DPATASVMNAHFVCVKVDREERPDVDAICMEAVQAITGQGGWPLNVFLTPEQQPIHGGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPP+ + G P ++ +L  V +AW ++   + +  +   ++LS A   + +      EL  
Sbjct: 123 FPPQPRQGMPSWRMVLDAVAEAWRERSGEIREQLSDVADRLSGASRLTPADAVPGPELLD 182

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
            A+R     L + YDS  GGFG APKFP    +  +L  +        SG A     M  
Sbjct: 183 AAVR----GLGERYDSVQGGFGGAPKFPPHPSLLFLLQRAADERPGEDSGTAGRAAAMAR 238

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL+ MA GGI+D +GGGF RY+VD  W VPHFEKMLYD   LA  Y++ F L  D    
Sbjct: 239 HTLRSMASGGINDQIGGGFARYAVDGTWTVPHFEKMLYDNALLARAYVEGFRLWGDERLR 298

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL----G 458
                 L +L  ++ GP G   SA DADS   EG     EG FYVWT ++V   L     
Sbjct: 299 ETAERTLAFLADELRGPEGGFLSALDADS---EGV----EGRFYVWTPEQVRAALSSADA 351

Query: 459 EHAILF---KEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
           E AI +    EH   +        R   P +E                            
Sbjct: 352 EAAIAWLGVTEHGNFEDGATVLEDRGERPDDE---------------------------- 383

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
             +   R  L   RS+R RP  DDK +  WNGL I +FA AS +L  E      +   V 
Sbjct: 384 -TVARIRAGLLAARSQRIRPGTDDKRVAGWNGLAIHAFAEASAVLGRE------DLLEVA 436

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
                ++    +    +RR   D +T     S   G ++    L+D+ FL+   + L+E 
Sbjct: 437 RRAAAFVRRDLTVDGRLRRTWSDRETAGADTSGHGGRARHAAVLEDHGFLLEAAVALFEA 496

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
           G   + L WA EL +T    F D E G +F T  +  ++L+R KE  D   PSG + +  
Sbjct: 497 GGDPEDLAWARELADTILNRFADPERGAFFATADDAEALLVRRKELDDAPIPSGGASASR 556

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLV 755
            L+RLA++   ++   Y   A+  L +  T  + +  AV     A D    P R+ V +V
Sbjct: 557 GLLRLAALTGEAR---YADAADGWLRLAATVAERIPQAVAYALLALDERHRPPRE-VAIV 612

Query: 756 G 756
           G
Sbjct: 613 G 613


>gi|375012491|ref|YP_004989479.1| thioredoxin domain-containing protein [Owenweeksia hongkongensis
           DSM 17368]
 gi|359348415|gb|AEV32834.1| thioredoxin domain-containing protein [Owenweeksia hongkongensis
           DSM 17368]
          Length = 675

 Score =  390 bits (1002), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 237/635 (37%), Positives = 339/635 (53%), Gaps = 68/635 (10%)

Query: 89  RTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYS 148
           + P +   S   +TN+L  E SPYLLQHAHNPVDW  WGE+AFA+A K +  + +SIGYS
Sbjct: 5   KGPDAQQKSLKMNTNQLINETSPYLLQHAHNPVDWNPWGEDAFAKAEKENKLVIVSIGYS 64

Query: 149 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 208
            CHWCHVME +SFED   A L+N+ F+SIKVDREERPDVD+VYMT VQ + G GGWPL+V
Sbjct: 65  ACHWCHVMEHQSFEDSAAAALMNEHFISIKVDREERPDVDQVYMTAVQLMTGRGGWPLNV 124

Query: 209 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 268
              PD +P+ GGTYFP      + G+   L+ + + +    + + +      E+L+E + 
Sbjct: 125 ITLPDGRPIWGGTYFP------KDGWMQSLQSIVEVYHDDPEKVLEYA----EKLTEGVV 174

Query: 269 AS--ASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 326
            S   S N+ P +  +  + L  +  SK++D + GG   APKFP PV  + +L       
Sbjct: 175 QSELVSPNETPGDYSKEEIDLLFKNWSKNFDKKEGGSAGAPKFPMPVGYEFLL------- 227

Query: 327 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 386
           + G      E  + +  TL+ MA GGI+D VGGGF RYSVD+ W VPHFEKMLYD GQL 
Sbjct: 228 EYGSLTGNEEAMQQLNLTLRKMAFGGIYDQVGGGFSRYSVDDEWKVPHFEKMLYDNGQLV 287

Query: 387 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 446
           ++Y  A+  TK+  Y  I    +++L RDM+GP GE +SA DADS   EG    +EG +Y
Sbjct: 288 SLYSRAYQKTKNPLYKSIVIQTIEWLERDMLGPDGEFYSALDADS---EG----EEGKYY 340

Query: 447 VWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK 506
           VW   E+++I+G+       +Y+       DL +      +++G+ VL+  +DS  + S 
Sbjct: 341 VWPEVELKEIIGDSDWEDFTNYF-------DLKK-----GKWEGRIVLMRSDDSENTDSA 388

Query: 507 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 566
                E          ++L  VR  R  P LDDK + SWN L+I+    A K        
Sbjct: 389 KVKAWE----------QELLKVRENRVPPGLDDKSLTSWNALMITGLVDAYKAFGD---- 434

Query: 567 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 626
                         Y+++A+    ++ ++    +   L HS++ G S   G ++DY F +
Sbjct: 435 ------------SHYLDLAKKNGEWLLKNQV-RKDESLFHSYKKGKSSIDGLIEDYTFAV 481

Query: 627 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 686
            G LDLYE     K+L  A          F D   G +F  +     ++ +  E HD   
Sbjct: 482 QGFLDLYEATFDVKYLEQANAWMKYAKANFEDEGTGLFFTRSKNAKQLIAKSMEVHDNVI 541

Query: 687 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 721
           P+ NSV   NL  L  +  G++S  Y   +E  LA
Sbjct: 542 PAANSVMAHNLFHLYHLT-GNES--YLAQSEKMLA 573


>gi|182436351|ref|YP_001824070.1| hypothetical protein SGR_2558 [Streptomyces griseus subsp. griseus
           NBRC 13350]
 gi|178464867|dbj|BAG19387.1| conserved hypothetical protein [Streptomyces griseus subsp. griseus
           NBRC 13350]
          Length = 672

 Score =  390 bits (1002), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 242/626 (38%), Positives = 335/626 (53%), Gaps = 61/626 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYLLQHA NPVDW+ W  EAFAEAR+R VP+ LS+GYS+CHWCHVM  ESFE
Sbjct: 2   NRLADETSPYLLQHADNPVDWWPWSPEAFAEARERGVPVLLSVGYSSCHWCHVMAHESFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA  LN  FV +KVDREERPD+D VYM  VQA  G GGWP++VFL+PD +P   GTY
Sbjct: 62  DETVATYLNAHFVPVKVDREERPDIDAVYMEAVQAATGHGGWPMTVFLTPDAEPFYFGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLP--DEL 280
           FPPE ++G P F+ +L  V  AW  +R+ +A+     +  L+   S     + +P   E+
Sbjct: 122 FPPEARHGSPSFQQVLEGVVAAWTDRREEVAEVAERIVADLA-GRSLVHGGDGVPGESEI 180

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
            Q  L      L++ YD + GGFG APKFP  + ++ +L H  +   TG  G      +M
Sbjct: 181 AQALL-----GLTREYDEQHGGFGGAPKFPPSMVVEFLLRHYAR---TGSEG----ALQM 228

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
              T   MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  VY   +  T    
Sbjct: 229 AADTCSAMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALLCRVYAHLWRTTGSDE 288

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
              I  +  D++ R++    G   SA DADS + +G  R  EGA+YVWT  ++ ++LGE 
Sbjct: 289 ARRIALETADFMVRELRTAEGGFASALDADSEDADG--RHVEGAYYVWTPAQLREVLGED 346

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
              F   Y+           +++     +G +VL    D+         P++     + +
Sbjct: 347 DAAFAAAYF----------GVTEKGTFEEGASVLRLPGDTG--------PVDA--ARVAD 386

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R +L   R +RPRP LDDKV+ +WNGL I++ A                      DR +
Sbjct: 387 VRGRLLAAREERPRPGLDDKVVAAWNGLAIAALAETGAYF----------------DRPD 430

Query: 581 YMEVAESAAS-FIRRHLYDEQTHRLQHSFRNGPS-KAPGFLDDYAFLISGLLDLYEFGSG 638
            +E A  AA   +R HL   +  RL  + ++G +    G L+DY  +  G L L      
Sbjct: 431 LVERATEAADLLVRVHL--GEVARLARTSKDGQAGDNAGVLEDYGDVAEGFLTLAAVTGE 488

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
             WL +A  L +   E F   EGG  ++T  +   ++ R ++  D A PSG + +   L+
Sbjct: 489 GAWLEFAGFLLDIVLEQFTG-EGGQLYDTAHDAEQLIRRPQDPTDSATPSGWTAAAGALL 547

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFE 724
              S  A + S+ +R  AE +L V +
Sbjct: 548 ---SYAAYTGSEAHRTAAEGALGVVK 570


>gi|390953615|ref|YP_006417373.1| thioredoxin domain-containing protein [Aequorivita sublithincola
           DSM 14238]
 gi|390419601|gb|AFL80358.1| thioredoxin domain-containing protein [Aequorivita sublithincola
           DSM 14238]
          Length = 704

 Score =  390 bits (1002), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 218/603 (36%), Positives = 328/603 (54%), Gaps = 49/603 (8%)

Query: 101 HTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVES 160
           +TN L  E SPYLLQHAHNPV+W  +GE +  +A+K    + +SIGY+ CHWCHVME ES
Sbjct: 29  YTNDLIHESSPYLLQHAHNPVNWKPYGEASLQQAKKEKKLLIISIGYAACHWCHVMEHES 88

Query: 161 FEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGG 220
           FED  VA ++N  F+S+KVDREERPDVD+ Y+  VQ + G  GWPL+V   PD +P+ GG
Sbjct: 89  FEDSTVAAVMNKNFISVKVDREERPDVDQTYINAVQLMTGSAGWPLNVVTLPDGRPVWGG 148

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS--ASSNKLPD 278
           TYF   D      +   L +++  ++++ + L    A+A  +L E + +      N    
Sbjct: 149 TYFRKND------WIDALEQIQKVYNEEPEKLM---AYA-NRLEEGIKSMDLVHLNTEDV 198

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
           +  +       E LS+++D++ GGF  APKF  P  ++ +L  + +  +    G      
Sbjct: 199 DFAKYPTSEIVENLSQNFDAKNGGFKGAPKFMMPNNLEFLLRQAVQENNADLLG------ 252

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
             V  TL  MA GG++D +GGGF RYS DE+WHVPHFEKMLYD  QL ++Y +A+ +TK 
Sbjct: 253 -YVTLTLDKMAYGGLYDQIGGGFARYSTDEKWHVPHFEKMLYDNAQLVSLYSNAYLVTKK 311

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             Y  +  + LD++ RDM    G  +S+ DADS +  G  + +EGAFYV+TS+E++ IL 
Sbjct: 312 PLYKEVVEETLDFIARDMTNDEGGFYSSLDADSKDENG--KLEEGAFYVFTSEELQKILK 369

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
           +   +FKE+Y +   G  +           K   VLI          + G+  E +    
Sbjct: 370 DDFDIFKEYYNVNSYGKWE-----------KNHYVLIRKKTDDEIEKEFGITSEAFQQKK 418

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
            + +  L   R+KRP+P LDDK + SWN +++  +  A K                   +
Sbjct: 419 EDWKNTLLAYRNKRPKPRLDDKTLTSWNAMMLKGYVDAYKTF----------------GK 462

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
           +EY++ A   A+FI      ++   L H++++G S   GFL+DYAF I   +DLY+    
Sbjct: 463 REYLDAALKNAAFISEKQL-QKNGALFHNYKDGKSSINGFLEDYAFTIEAFIDLYQATLD 521

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
            KWL  + ++ +     F D E   ++ T+ ED +++ R  E  D   P+ NSV   NL 
Sbjct: 522 EKWLTLSKKMADYAKTNFFDEEKQMFYFTSKEDAAIVTRNFEYRDNVIPASNSVMAKNLF 581

Query: 699 RLA 701
            L+
Sbjct: 582 VLS 584


>gi|302530109|ref|ZP_07282451.1| transcriptional regulator [Streptomyces sp. AA4]
 gi|302439004|gb|EFL10820.1| transcriptional regulator [Streptomyces sp. AA4]
          Length = 663

 Score =  390 bits (1002), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 237/643 (36%), Positives = 335/643 (52%), Gaps = 81/643 (12%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           +NRLA   SPYLLQHA NPVDW+ WG EA AEAR+R VPI LS+GY+ CHWCHVM  ESF
Sbjct: 2   SNRLAEATSPYLLQHAENPVDWWEWGPEALAEARRRGVPILLSVGYAACHWCHVMAHESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           E EG A L+N  FV+IKVDREERPD+D VYM   QA+ G GGWP++ FL+P+ +P   GT
Sbjct: 62  EHEGTAALMNAHFVNIKVDREERPDIDAVYMAATQAMTGQGGWPMTCFLTPEGEPFHCGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD-EL 280
           Y+PP  + G P F  +L  V +AW+++ D L +     +  L+E       S  L +  +
Sbjct: 122 YYPPAPRPGIPSFTQLLLAVAEAWEERPDDLREGAKQIVGHLAE------QSGPLKEAAV 175

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
             +AL     +L++  D   GGFG APKFP  + ++ +L H ++   TG    +++   +
Sbjct: 176 DADALAEAVTKLAQEADPVHGGFGGAPKFPPSMVLEFLLRHHER---TG----SAQAYAL 228

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
                + MA+GGIHD +GGGF RYSVD  W VPHFEKMLYD   L  VY    +      
Sbjct: 229 AESAAEAMARGGIHDQLGGGFARYSVDAEWIVPHFEKMLYDNALLLRVYAH-LARRGSAS 287

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
              +   I+ +L  D++ P G   ++ DAD+   EG T       YVWT  ++ ++LGE 
Sbjct: 288 ARRVAEGIVRFLEHDLLTPQGGFAASLDADTEGVEGLT-------YVWTPAQLNEVLGED 340

Query: 461 AILFKEHYYLKPTGNCD-----LSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
                E + +   G  +     L   +DP +  + + V                      
Sbjct: 341 GPWAAELFSVTEEGTFEEGASTLQLRADPDDFARFERV---------------------- 378

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
                 R+ L + R+ RP+P  DDKV+ +WNGL IS+ A A   L               
Sbjct: 379 ------RQALLEARAARPQPGRDDKVVAAWNGLAISALAEAGVAL--------------- 417

Query: 576 SDRKEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNGPSKAP-GFLDDYAFLISGLLDLY 633
            +R +++E+A +AAS  +  HL D    RL+ S R+G   AP G L+DYA L  GLL L+
Sbjct: 418 -ERPQWIELARNAASLLLDLHLVD---GRLRRSSRDGAVGAPVGVLEDYACLADGLLALH 473

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSGNSV 692
           +     +WL  A  L +     F      G ++ T +D  VL++   D  D A PSG S 
Sbjct: 474 QATGEPRWLTEATRLLDVALTHFASDSAPGAYHDTADDAEVLVQRPSDPTDNASPSGASA 533

Query: 693 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 735
               L+  +++    ++  YR  AE +L     R+  +A  VP
Sbjct: 534 LAGALLTASALAGSDQAARYRDAAELAL----RRVGLLAARVP 572


>gi|455649958|gb|EMF28748.1| hypothetical protein H114_12956 [Streptomyces gancidicus BKS 13-15]
          Length = 679

 Score =  390 bits (1001), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 232/591 (39%), Positives = 316/591 (53%), Gaps = 57/591 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   SPYLLQHA NPVDW+ W EEAF EAR+RDVP+ LS+GYS+CHWCHVM  ESFE
Sbjct: 3   NRLAQATSPYLLQHADNPVDWWTWSEEAFVEARRRDVPVLLSVGYSSCHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+  A  +N  FVSIKVDREERPDVD VYM  VQA  G GGWP++VFL+PD +P   GTY
Sbjct: 63  DQATADEMNAHFVSIKVDREERPDVDAVYMEAVQAATGQGGWPMTVFLTPDAEPFYFGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQ-SGAFAIEQLSEALSASASSNKLPDELP 281
           FPP  ++G P F+ +L  V  AW ++RD + + +G    +     LS          EL 
Sbjct: 123 FPPAPRHGMPSFRQVLEGVAQAWAERRDEVGEVAGKITRDLAGRELSVGGDEVPGEQELA 182

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           Q  L      L++ YD++ GGFG APKFP  + ++ +L H  +   TG  G      +M 
Sbjct: 183 QALL-----GLTREYDAQRGGFGGAPKFPPSMVLEFLLRHHAR---TGAEG----ALQMA 230

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  VY   +  T     
Sbjct: 231 ADTCERMARGGIYDQLGGGFARYSVDRDWVVPHFEKMLYDNALLCRVYTHLWRTTGSELA 290

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHA 461
             +  +  D++ R++  P G   SA DADS   +G  R  EGA+YVWT  ++ ++LG+  
Sbjct: 291 RRVALETADFMVRELRTPEGGFASALDADS--DDGTGRHVEGAYYVWTPAQLREVLGDAD 348

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASASKLGMPLEKYLNILGE 520
                 Y+           +++     +G +VL +   D  A A++           +  
Sbjct: 349 AEPAARYF----------GVTEEGTFEEGASVLQLPQRDEVADAAR-----------IDG 387

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R +L   R +RP P  DDKV+ +WNGL I++ A            A F        R +
Sbjct: 388 IRERLLAARDRRPAPGRDDKVVAAWNGLAIAALAET---------GACFG-------RPD 431

Query: 581 YMEVAESAAS-FIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAFLISGLLDLYEFGSG 638
            +E A +A    +R HL D    R+  + ++G   A  G L+DYA +  G L L      
Sbjct: 432 LVEAAVAAGDLLVRVHLDDHA--RIARTSKDGQVGANAGVLEDYADVAEGFLALASVTGE 489

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 689
             WL +A  L +     FLD E G  ++T  +   ++ R ++  D A PSG
Sbjct: 490 GVWLDFAGLLVDHILARFLDAESGALYDTASDAERLIRRPQDPTDNAAPSG 540


>gi|326776975|ref|ZP_08236240.1| hypothetical protein SACT1_2812 [Streptomyces griseus XylebKG-1]
 gi|326657308|gb|EGE42154.1| hypothetical protein SACT1_2812 [Streptomyces griseus XylebKG-1]
          Length = 672

 Score =  390 bits (1001), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 242/626 (38%), Positives = 334/626 (53%), Gaps = 61/626 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYLLQHA NPVDW+ W  EAFAEAR+R VP+ LS+GYS+CHWCHVM  ESFE
Sbjct: 2   NRLADETSPYLLQHADNPVDWWPWSPEAFAEARERGVPVLLSVGYSSCHWCHVMAHESFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA  LN  FV +KVDREERPD+D VYM  VQA  G GGWP++VFL+PD +P   GTY
Sbjct: 62  DETVATYLNAHFVPVKVDREERPDIDAVYMEAVQAATGHGGWPMTVFLTPDAEPFYFGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLP--DEL 280
           FPPE ++G P F+ +L  V  AW  +R+ +A+     +  L    S     + +P   E+
Sbjct: 122 FPPEARHGSPSFQQVLEGVVAAWTDRREEVAEVAERIVADLG-GRSLVHGGDGVPGESEI 180

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
            Q  L      L++ YD + GGFG APKFP  + ++ +L H  +   TG  G      +M
Sbjct: 181 AQALL-----GLTREYDEQHGGFGGAPKFPPSMVVEFLLRHYAR---TGSEG----ALQM 228

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
              T   MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  VY   +  T    
Sbjct: 229 AADTCSAMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALLCRVYAHLWRTTGSDE 288

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
              I  +  D++ R++    G   SA DADS + +G  R  EGA+YVWT  ++ ++LGE 
Sbjct: 289 ARRIALETADFMVRELRTAEGGFASALDADSEDADG--RHVEGAYYVWTPAQLREVLGED 346

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
              F   Y+           +++     +G +VL    D+         P++     + +
Sbjct: 347 DAAFAAAYF----------GVTEKGTFEEGASVLRLPGDTG--------PVDA--ARVAD 386

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R +L   R +RPRP LDDKV+ +WNGL I++ A                      DR +
Sbjct: 387 VRGRLLAAREERPRPGLDDKVVAAWNGLAIAALAETGAYF----------------DRPD 430

Query: 581 YMEVAESAAS-FIRRHLYDEQTHRLQHSFRNGPS-KAPGFLDDYAFLISGLLDLYEFGSG 638
            +E A  AA   +R HL   +  RL  + ++G +    G L+DY  +  G L L      
Sbjct: 431 LVERATEAADLLVRVHL--GEVARLARTSKDGQAGDNAGVLEDYGDVAEGFLTLAAVTGE 488

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
             WL +A  L +   E F   EGG  ++T  +   ++ R ++  D A PSG + +   L+
Sbjct: 489 GAWLEFAGFLLDIVLEQFTG-EGGQLYDTAHDAEQLIRRPQDPTDSATPSGWTAAAGALL 547

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFE 724
              S  A + S+ +R  AE +L V +
Sbjct: 548 ---SYAAYTGSEAHRTAAEGALGVVK 570


>gi|225679668|gb|EEH17952.1| DUF255 domain-containing protein [Paracoccidioides brasiliensis
           Pb03]
          Length = 865

 Score =  390 bits (1001), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 240/593 (40%), Positives = 331/593 (55%), Gaps = 44/593 (7%)

Query: 85  AMAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLS 144
           +  ER  AST     +  NRL    SPY+L H +NPV W  W  EA A A+K +  IFL 
Sbjct: 10  SQTERGAASTG---PELVNRLYQSKSPYVLGHMNNPVAWQLWDSEAIALAKKLNRLIFLR 66

Query: 145 IGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGW 204
                   CHVME ESF    +A +LN  F+ IK+DREERPD+D+VYM YVQA  G GGW
Sbjct: 67  --------CHVMEKESFMAPEIAAILNKSFIPIKLDREERPDIDEVYMNYVQATTGSGGW 118

Query: 205 PLSVFLSPDLKPLMGGTYFP-PEDKY-------GRPGFKTILRKVKDAWDKKRDMLAQSG 256
           PL+VFL+PDL+P+ GG+Y+P P           G+  F  IL K++D W  ++    +S 
Sbjct: 119 PLNVFLTPDLEPVFGGSYWPGPHSNALPTLGGEGQITFVDILEKLRDVWHTQQLRCRESA 178

Query: 257 AFAIEQLSEALSASASSNKLPD-----ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPR 311
               +QL E  +   + +K  D     +L    L    +  +  YD+  GGF  APKFP 
Sbjct: 179 KDITKQLRE-FAEEGTHSKQSDVEAEEDLEIELLEEAYQHFASRYDAVNGGFSEAPKFPT 237

Query: 312 PVEIQMMLYHSK---KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDE 368
           PV +  +++ S+    + D     E S   ++ + TL  M++GGIHD +G GF RYSV  
Sbjct: 238 PVNLSFLVHLSRYPGAVADIVGYEECSRAIEIAVKTLIAMSRGGIHDQIGHGFARYSVTA 297

Query: 369 RWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAE 427
            W +PHFEKMLYDQ QL +VY+DAF    D        DI  Y+    M+ P G   S+E
Sbjct: 298 DWSLPHFEKMLYDQAQLLDVYVDAFDSAYDPELLGAMYDIATYITSPPMLSPTGGFHSSE 357

Query: 428 DADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHN 486
           DADS  +   T K+EGAFYVWT KE++ ILG+  A +   H+ +   GN  ++R++DPH+
Sbjct: 358 DADSRPSPNDTEKREGAFYVWTLKELKQILGQRDAEVCARHWGVLADGN--VARINDPHD 415

Query: 487 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSW 545
           EF  +NVL      S  A + G+  ++ + I+   R KL + R SKR RP LDDK+IV+W
Sbjct: 416 EFINQNVLSIQVTPSKLAKEFGLGEDEVVRIIKGSREKLREYRESKRVRPDLDDKIIVAW 475

Query: 546 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 605
           NGL I + A+ S +L++      + F             AE A  FI+ +L+DEQT +L 
Sbjct: 476 NGLAIGALAKCSVVLENLDREKAYQF----------RRAAEEAVRFIKHNLFDEQTGQLW 525

Query: 606 HSFRNGP-SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFL 657
             +R G     PGF DDYA+LISGL++LYE       L +A +LQ   ++ FL
Sbjct: 526 RIYRGGVRGDTPGFADDYAYLISGLINLYEATFDDSHLQFAEQLQQYLNKHFL 578


>gi|313675015|ref|YP_004053011.1| hypothetical protein Ftrac_0901 [Marivirga tractuosa DSM 4126]
 gi|312941713|gb|ADR20903.1| hypothetical protein Ftrac_0901 [Marivirga tractuosa DSM 4126]
          Length = 675

 Score =  389 bits (1000), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 237/618 (38%), Positives = 330/618 (53%), Gaps = 69/618 (11%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            N+L  E SPYLLQHAHNPV+W AWGEEA  +A+K D PI LSIGY+ CHWCHVME ESF
Sbjct: 4   VNKLIHESSPYLLQHAHNPVNWQAWGEEALNQAQKEDKPIILSIGYAACHWCHVMEHESF 63

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           EDE VAK++N+ ++ IK+DREERPD+D++YM  +Q +   GGWPL+VFL P+ KP  GGT
Sbjct: 64  EDEEVAKVMNENYICIKLDREERPDIDQIYMDAIQTMGLHGGWPLNVFLIPNQKPFYGGT 123

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP      +  +  IL KV  A+   R+ L +S      + ++AL+A+         L 
Sbjct: 124 YFP------KNKWLEILDKVAIAFQSSRNQLEESA----NKFAQALNAADGEKLSLGAL- 172

Query: 282 QNALRLCAEQLSKSY-------DSRFGGFGSAPKFPRPVEIQMML---YHSKKLEDTGKS 331
            NA    ++ LS++Y       D   GG   APKFP PV  Q ++   +HS+        
Sbjct: 173 -NAENFNSKILSEAYQKLGSFLDWDNGGTLGAPKFPMPVIWQFLMKYAFHSQN------- 224

Query: 332 GEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLD 391
               E +K + FTL  +A GGI+D +GGGF RYSVD  W  PHFEKMLYD GQL ++Y D
Sbjct: 225 ---PEAKKALEFTLTSLADGGIYDQIGGGFARYSVDAEWFAPHFEKMLYDNGQLISLYAD 281

Query: 392 AFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSK 451
           AF  TK+ ++  I  D + +  R+++ P    +SA DADS   EG    +EG FY WT  
Sbjct: 282 AFRFTKNPYFKEIFEDSIRFSAREIMDPYCRFYSALDADS---EG----EEGKFYTWTYT 334

Query: 452 EVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPL 511
           E+E ILG+ A    + Y     GN +            G+N+L   +          +  
Sbjct: 335 ELEQILGDKAEPILKFYNATEKGNWE-----------NGRNILFRHSSIEDFCKAEKIDQ 383

Query: 512 EKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNF 571
           EK+   L E +  L D R  R RP +DDK++  WN L +     A K  +          
Sbjct: 384 EKFKAQLIEAKDSLLDAREDRVRPAMDDKILTGWNALQMKGICDAYKAYQD--------- 434

Query: 572 PVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLD 631
                  K+Y  +A+    F+   ++D   ++L  SF+N   K   +L+DYA  I   + 
Sbjct: 435 -------KKYKAIAQDNFVFLSEFVWD--GNQLFRSFKNEQPKIKAYLEDYALAIQASIS 485

Query: 632 LYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNS 691
           L+E  S +K L +A +L N   + F D +   +F T      ++ R KE  D   P+ NS
Sbjct: 486 LFEISSDSKALDFAEKLTNYAIQNFYDEKEKLFFYTDKSSEKLIARKKEIFDNVIPASNS 545

Query: 692 VSVINLVRLASIVAGSKS 709
           V + NL  L  I+ G+ S
Sbjct: 546 VMIENLHWLG-ILKGNSS 562


>gi|345006662|ref|YP_004809515.1| hypothetical protein [halophilic archaeon DL31]
 gi|344322288|gb|AEN07142.1| hypothetical protein Halar_3548 [halophilic archaeon DL31]
          Length = 727

 Score =  389 bits (999), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 233/625 (37%), Positives = 337/625 (53%), Gaps = 57/625 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W + A AEA++R+VPIFLS+GYS CHWCHVM  ESFE
Sbjct: 5   NRLDTEPSPYLQQHADNPVNWQPWDDAALAEAKEREVPIFLSVGYSACHWCHVMAEESFE 64

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  VA+ +N+ FV +KVDREERPD+D+VY T  Q + GGGGWPLS +L+P+ KP   GTY
Sbjct: 65  DPAVAETINENFVPVKVDREERPDLDRVYQTVCQLVTGGGGWPLSAWLTPEGKPFYIGTY 124

Query: 223 FPPEDKYGR--PGFKTILRKVKDAW---DKKRDM---LAQSGAFAIEQLSEALSASASSN 274
           FPPE    R  PGF+ + R++ D+W   +++++M     Q  A A ++L  A +   + +
Sbjct: 125 FPPEPHPQRNAPGFQDLCRQIADSWSDPEQRQEMENRAEQWTAAARDRLEPASTGRNTES 184

Query: 275 KLPDELPQNALRL--CAEQLSKSYDSRFGGFGS-APKFPRPVEIQMMLYHSKKLEDTGKS 331
           +   E   +   L   A  + +  D   GGFGS  PKFP P  ++++L    ++   G  
Sbjct: 185 ETATETLSSTELLDDAAAAVVRGADRTNGGFGSGGPKFPHPGRVELLL----RVAALGDD 240

Query: 332 GEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLD 391
           GE      +    L  M  GG++DH+GGGFHRY VD  W VPHFEKM YD G +   +L 
Sbjct: 241 GEP---LSVARNALNAMGSGGLYDHLGGGFHRYCVDAEWTVPHFEKMAYDNGTIPAAFLA 297

Query: 392 AFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGA-------TRKKEGA 444
            +        + + R+ L+++ R++  P G  +S  DA S ET  +         ++EGA
Sbjct: 298 GYRAMGRERDAEVVRETLEFVSRELRHPDGGFYSTLDARS-ETPASRLEDDEEPEREEGA 356

Query: 445 FYVWTSKEVEDILGE-HAILFKEHYYLKPTGN----CDLSRMSDPHNEFKGKNVLIELND 499
           FYVWT  E+  ++ E  A LF   Y +   GN      +   + P  E  G     E ++
Sbjct: 357 FYVWTPAEIRAVVDEPAATLFCRRYGVISGGNFEGGTSVLNETVPIAELVGA----EFDE 412

Query: 500 SSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 559
            +A  S+     E    +L    ++LF+ R +RPRP  D+KV+  WNGL+IS+FA A  +
Sbjct: 413 GTAPDSE-----EAVEELLQTATQELFEARGERPRPLRDEKVLAGWNGLLISTFAEAGLV 467

Query: 560 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFL 619
           L                   +Y E A++A SF+R HL+D    RL   F++G     G+L
Sbjct: 468 LDD-----------------QYTEDAQAALSFVREHLWDADARRLSRRFKDGDVAVSGYL 510

Query: 620 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK 679
           +DYAFL  G  + Y+     + L +A+EL     + F D + G  + T  +   ++ R +
Sbjct: 511 EDYAFLGRGAFETYQATGNVEPLSFALELAEVIADAFYDADDGTLYFTANDAEELVARPQ 570

Query: 680 EDHDGAEPSGNSVSVINLVRLASIV 704
           E  D + PS    +V  L+ L S  
Sbjct: 571 ELTDQSTPSSVGAAVSLLLELDSFT 595


>gi|113474681|ref|YP_720742.1| hypothetical protein Tery_0863 [Trichodesmium erythraeum IMS101]
 gi|110165729|gb|ABG50269.1| protein of unknown function DUF255 [Trichodesmium erythraeum
           IMS101]
          Length = 693

 Score =  389 bits (999), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 248/678 (36%), Positives = 358/678 (52%), Gaps = 93/678 (13%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRLA   S YL +HA NPVDW+ W EEA   A+++D PIFLSIGYS+CHWC VME E+F
Sbjct: 2   TNRLAKSQSLYLRKHAENPVDWWPWSEEALETAKQQDKPIFLSIGYSSCHWCTVMEGEAF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGG 220
            DE +A+ LN+ F+ IKVDREERPDVD +YM  +Q L G GGWPL++FL+P DL P +GG
Sbjct: 62  SDEKIAQYLNEKFLPIKVDREERPDVDSIYMQALQMLTGQGGWPLNIFLTPDDLIPFVGG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFP E +YGRPGF  +L+K++  +D +++ L       +E L +++    + + L +E+
Sbjct: 122 TYFPIEPRYGRPGFLEVLQKIRSFYDLEKNKLDTLKVEMLEGLRKSVLLPEAED-LKEEI 180

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
            Q  L +  + +   Y        S   FP     Q  L   KKL    ++       K+
Sbjct: 181 LQQGLEVITKIIGDRY--------SQQSFPMIPYAQAAL-QGKKLNFKSQNN----SNKV 227

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ----LANVYLDAFSLT 396
            L     +A GGI+DHV GGFHRY+VD  W VPHFEKMLYD GQ    LAN++   +   
Sbjct: 228 CLERGLNLALGGIYDHVAGGFHRYTVDPNWTVPHFEKMLYDNGQIVEYLANLWSAGYH-- 285

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
           K  F   I   + ++L+R+M  P G  ++A+DADS  T      +EGAFY+W+ KE+E++
Sbjct: 286 KPAFKRGIIGTV-NWLKREMTAPTGFFYAAQDADSFTTPDEVEPEEGAFYIWSYKELENL 344

Query: 457 LGEHAIL-FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
           L +  +    + ++++P GN            F+GK VL         A +L   +E  L
Sbjct: 345 LTKEELSELSKQFFIEPNGN------------FEGKIVL-----QRKQAEELSKTVENSL 387

Query: 516 NILGECRRKL--FDVRSKRPRPH----------------LDDKVIVSWNGLVISSFARAS 557
           + L + R  +  F++ +  P  +                 D K+IV+WN L+IS  AR +
Sbjct: 388 SKLFKLRYGVQPFNIETFPPATNNKEAKNNNWPGKIPAVTDTKMIVAWNSLMISGLARTA 447

Query: 558 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASF-IRRHLYDEQTHRLQHSFRNGPSKAP 616
            +  S                 EY+E+A +AA F I     D + HRL +    G     
Sbjct: 448 TVFNS----------------LEYLELAMNAAHFIITNQQIDGRFHRLNYE---GKPAVT 488

Query: 617 GFLDDYAFLISGLLDLYE----------FGSGTK-WLVWAIELQNTQDELFLDREGGGYF 665
              +DYA  I  LLDL +            + T  WL  AI+LQ+  DE    +E  GY+
Sbjct: 489 AQSEDYALFIKALLDLQQASISLETLSKLNTNTNFWLETAIKLQDEFDEFLWSQETAGYY 548

Query: 666 NTTGE-DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 724
           NT+ E    ++LR +   D A P+ N +++ NLVRL+ +   ++  YY   AE +L  F 
Sbjct: 549 NTSYEVTGELILRERNYIDNATPAANGIAIANLVRLSLL---TEELYYLDRAESALTAFS 605

Query: 725 TRLKDMAMAVPLMCCAAD 742
           + +K    A P +  A D
Sbjct: 606 SIMKKSPQACPSLFVALD 623


>gi|448576201|ref|ZP_21642244.1| hypothetical protein C455_04761 [Haloferax larsenii JCM 13917]
 gi|445729881|gb|ELZ81475.1| hypothetical protein C455_04761 [Haloferax larsenii JCM 13917]
          Length = 702

 Score =  389 bits (999), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 226/600 (37%), Positives = 316/600 (52%), Gaps = 58/600 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPV+W  W + A   AR+ D PIFLSIGYS CHWCHVM  ESF 
Sbjct: 8   NRLDNEQSPYLRQHADNPVNWQPWDDTALEAAREADKPIFLSIGYSACHWCHVMADESFS 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  +A+ LN+ FV +KVDREERPD+D++Y T  Q + GGGGWPLSV+L+P  KP   GTY
Sbjct: 68  DPDIAETLNEHFVPVKVDREERPDLDRIYQTICQLVTGGGGWPLSVWLTPQGKPFFVGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDML---AQSGAFAI-EQLSEA--LSASASSNKL 276
           FPPE + G PGF+ ++    ++W   RD +   AQ    AI +QL +       A  +++
Sbjct: 128 FPPEPRRGAPGFRDLVESFAESWQTDRDEIENRAQQWTSAIHDQLEDTPDTPGEAPGSEI 187

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
            D+  Q ALR                    PKFP+P  I  +L   +    TG+     +
Sbjct: 188 LDQTVQAALRAADRDDGGFG--------GGPKFPQPGRIDALL---RGYAITGR----RQ 232

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
              + + +L  MA GG+ DH+GGGFHRY VD+ W VPHFEKMLYDQ  L + YLD + LT
Sbjct: 233 ALDVAVESLDAMANGGLRDHLGGGFHRYCVDKDWTVPHFEKMLYDQAGLVSRYLDTYRLT 292

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
               Y+ +  +  +++RR++    G  F+  DA S         +EG FYVWT  EV  +
Sbjct: 293 GTEAYADVAAETFEFVRRELSHDDGGFFATLDAQSG-------GEEGTFYVWTPDEVRSL 345

Query: 457 LGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS-SASASKLGMPLEKY 514
           L E  A LF + Y + P GN            F+ K  ++ ++ + S  A +  +  ++ 
Sbjct: 346 LPELEADLFCDRYGVTPGGN------------FENKTTVLNVSATLSDLAEEYDISEDEV 393

Query: 515 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 574
            + L E R+ LF  RS R RP  D+K++  WNGL+IS+FA+ +  L+ ++          
Sbjct: 394 EDKLAEARKALFAARSGRERPARDEKILAGWNGLMISAFAQGAVALEDDS---------- 443

Query: 575 GSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE 634
                   + A  A  F+R HL+D     L     NG  K  G+L+DYAFL  G  DLY+
Sbjct: 444 ------LADDARRALDFVREHLWDADAGHLSRRVMNGEVKGDGYLEDYAFLARGAFDLYQ 497

Query: 635 FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
                  L +A++L       F D   G  + T     +++ R +E  D + PS   V+ 
Sbjct: 498 ATGDVDPLAFALDLARAIHREFYDDAAGTLYFTPESGEALVTRPQEATDQSTPSSLGVAT 557


>gi|225418720|ref|ZP_03761909.1| hypothetical protein CLOSTASPAR_05944, partial [Clostridium
           asparagiforme DSM 15981]
 gi|225041746|gb|EEG51992.1| hypothetical protein CLOSTASPAR_05944 [Clostridium asparagiforme
           DSM 15981]
          Length = 506

 Score =  389 bits (999), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 228/565 (40%), Positives = 297/565 (52%), Gaps = 64/565 (11%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           +N L  E SPYLLQHA NPVDW+ W  EAF +A   D PIFLSIGYSTCHWCHVM  ESF
Sbjct: 2   SNHLLREKSPYLLQHAENPVDWYPWSHEAFEKAALEDKPIFLSIGYSTCHWCHVMAHESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           ED  VAK LN  +V +KVDREERP++D VYM+  QA+ G GGWPL++ ++PD KP   GT
Sbjct: 62  EDREVAKRLNADYVPVKVDREERPEIDMVYMSVCQAMTGQGGWPLTIIMTPDKKPFFAGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           Y P   +    G   +L  V + W   R  L       +  L  A  AS+ ++      P
Sbjct: 122 YLPKTSRRNMTGLLELLSAVSEIWKSDRKRLLNMSDQILAVLRRAPDASSPAD------P 175

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           +   R   E+L  ++D  +GGFG APKFP P  +  ++ +            A E Q + 
Sbjct: 176 ETLARRGYEELRAAFDRTYGGFGRAPKFPAPHNLLFLMRY---------RAWADEPQALA 226

Query: 342 LF--TLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           +   TL  MA+GGIHDH+GGGF RYS D+ W VPHFEKMLYD   LA  YL+ + LT + 
Sbjct: 227 MAEKTLSSMARGGIHDHLGGGFSRYSTDQMWLVPHFEKMLYDNALLALAYLEGYRLTGNR 286

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 459
           FY    R ILDY+RR++ GP G  +  +DADS          EG +YV++ +E+  +LG 
Sbjct: 287 FYQRTARQILDYVRRELTGPEGGFYCGQDADSQGV-------EGKYYVFSEEEIGRVLGS 339

Query: 460 HAIL--FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
                 F   Y +   GN            F+G N+   +++       L M        
Sbjct: 340 RKDQEKFCRRYGITKEGN------------FEGANIPNLIHNPDYEQRDLEMD------- 380

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
              CRR L++ R KR   H DDK++ SWN L+I + ARA  +L                D
Sbjct: 381 -ALCRR-LYEYRLKRLPLHRDDKILASWNALMIIACARAGFLL----------------D 422

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGS 637
              Y+E+A  A  F+ + L+DE   RL   +R G S  PG LDDYAF    LL LYE   
Sbjct: 423 DPGYLEMAGRAQMFVEQKLFDENG-RLLVRYRQGESAFPGNLDDYAFYCLALLTLYEVTL 481

Query: 638 GTKWLVWAIELQNTQDELFLDREGG 662
              +L  A+       ELF D E G
Sbjct: 482 DASYLELAVNRAEQMVELFWDEERG 506


>gi|92115739|ref|YP_575468.1| hypothetical protein Nham_0107 [Nitrobacter hamburgensis X14]
 gi|91798633|gb|ABE61008.1| protein of unknown function DUF255 [Nitrobacter hamburgensis X14]
          Length = 682

 Score =  389 bits (998), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 235/611 (38%), Positives = 324/611 (53%), Gaps = 58/611 (9%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
            +  NRLAAE SPYLLQH HNPVDW+ WG  A AEA++ + PI LSIGY+ CHWCHVM  
Sbjct: 9   GRPANRLAAETSPYLLQHQHNPVDWWPWGPAALAEAQRTNRPILLSIGYAACHWCHVMAH 68

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFED+ VA ++N+ FV IKVDREERPD+D++YM  +  L   GGWPL++FLSPD  P  
Sbjct: 69  ESFEDDEVAAVMNELFVCIKVDREERPDIDQIYMNALHLLGEQGGWPLTMFLSPDGSPFW 128

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
           GGTYFP    +GRP F  +L+ V   +  K + +  +    I +LSE     + +N    
Sbjct: 129 GGTYFPKLPDFGRPAFTDVLQSVARVFHDKPERVTLNRDAVIARLSERAKVGSPAN---- 184

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
            L    L   A  +++S D   GG   APKFP+   ++        L   G    +    
Sbjct: 185 -LGVAELNTAAVSIARSTDPVNGGLHGAPKFPQCSVLEF-------LWRAGARTGSDRFY 236

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
                TL  M++GGI+DH+GGG+ RYSVD+RW VPHFEKMLYD  Q+ ++    ++ +K+
Sbjct: 237 AATTLTLTQMSQGGIYDHLGGGYARYSVDDRWLVPHFEKMLYDNAQILDLLALDYARSKN 296

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             Y     + + +L R+M+   G   S+ DADS   EG    KEG FYVW+  E+E++LG
Sbjct: 297 PLYRERAIETVAWLLREMLTGEGGFASSLDADS---EG----KEGKFYVWSLSEIEEVLG 349

Query: 459 -EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
              A  F   Y +   GN            F+G+N+   L  SS   S  G  +      
Sbjct: 350 ATDAADFAARYDITANGN------------FEGRNIPNRLK-SSDLVSDDGAHMRT---- 392

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
               R KL   R+ R RP LDDKV+  WNGL+I++             +  F  P     
Sbjct: 393 ---LRAKLLARRAGRVRPGLDDKVLADWNGLMIAALVHG---------ACAFGLP----- 435

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGS 637
             +++E A +A  FIR+ +   +  RL HS+R G    P    DYA ++   L L E   
Sbjct: 436 --DWLETARTAFEFIRKTM--TRGDRLGHSWREGRLLVPALACDYAAMVRAALALSEATG 491

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
            T +L  A+  Q T D  + D E GGY+ T  +   +++R     D A P+ N +   NL
Sbjct: 492 DTAYLEQALRWQATLDTHYADVEHGGYYLTADDAEGLIVRPHSTIDDAIPNYNGLIAQNL 551

Query: 698 VRLASIVAGSK 708
           VRLA++   SK
Sbjct: 552 VRLAALTGDSK 562


>gi|298206807|ref|YP_003714986.1| hypothetical protein CA2559_01090 [Croceibacter atlanticus
           HTCC2559]
 gi|83849439|gb|EAP87307.1| hypothetical protein CA2559_01090 [Croceibacter atlanticus
           HTCC2559]
          Length = 681

 Score =  388 bits (997), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 221/635 (34%), Positives = 346/635 (54%), Gaps = 55/635 (8%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           S+    N L+ E SPYLLQHA+NPV+W  W  +   +A++ +  I +SIGY+ CHWCHVM
Sbjct: 3   SKINTNNLLSKETSPYLLQHANNPVNWVGWSSKVLNKAKEDNKLILISIGYAACHWCHVM 62

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
           E ESFED  +A+++N  F++IKVDREERPDVD+VYM  +Q + G GGWPL++   PD +P
Sbjct: 63  EHESFEDISIAEVMNANFINIKVDREERPDVDQVYMKALQLMTGQGGWPLNIVALPDGRP 122

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKL 276
           + G TY P      +  +K  L ++ D +    + +        E+LS+ ++  +   K 
Sbjct: 123 IWGATYLP------KKQWKGSLHQLADLYRSNSEHMITYA----EKLSKGMAQVSLVTKT 172

Query: 277 PD--ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEA 334
               ++ +  L+   +  S  +D  +GG   +PKF  P   Q +L ++ + +D       
Sbjct: 173 DSNTDISKAFLKDSLQTWSNQFDYTYGGTQRSPKFMMPNNYQFLLRYAHQTKDKSL---- 228

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
                 V+ TL  ++ GG++DH+GGGF RY+VD +WHVPHFEKMLYD  QL ++Y  A++
Sbjct: 229 ---LDYVILTLNKISYGGVYDHIGGGFSRYAVDSKWHVPHFEKMLYDNAQLVSLYSKAYT 285

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
           LTKD +Y  +  + L+++  ++    G  +S+ DADS  TEG  + +EGAFYVWT  E++
Sbjct: 286 LTKDPWYKTVVTNTLNFIETELTRDNGSFYSSLDADSLNTEG--KLEEGAFYVWTKAELK 343

Query: 455 DILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 514
            +L E   LF+ +Y +   G+ +       HN +    VLI    +S  A+   +P+   
Sbjct: 344 SLLNEDYPLFEAYYNINEYGHWE-------HNNY----VLIRTKSNSEIANDFSIPISTL 392

Query: 515 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 574
              L   +  L + R KR +P LDDK + SWN L+I+ +  A K  +             
Sbjct: 393 DKKLTSWKALLNNNRQKRAQPRLDDKSLTSWNALMINGYIDAYKAFQI------------ 440

Query: 575 GSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE 634
                +Y+E+A  A++FI   +  ++   L HS+    +K  G+L+DYAF I   + L+E
Sbjct: 441 ----NDYLEIALKASNFILDKML-QKDGSLTHSYNKNEAKINGYLEDYAFTIEAFISLFE 495

Query: 635 FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
               +KWL  A EL     + F D E   ++  +  D +++ R  E  D   P+ NS   
Sbjct: 496 VTFNSKWLSKAEELTTYALKHFYDEEQHIFYFNSNLDDALVTRPIEQQDNVIPASNSTMA 555

Query: 695 INLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKD 729
            NL +L+ ++ G KS  Y++ AE  L   +T L+D
Sbjct: 556 KNLFKLSHLL-GIKS--YKEIAEQQL---KTVLQD 584


>gi|429201724|ref|ZP_19193171.1| hypothetical protein STRIP9103_06317 [Streptomyces ipomoeae 91-03]
 gi|428662694|gb|EKX62103.1| hypothetical protein STRIP9103_06317 [Streptomyces ipomoeae 91-03]
          Length = 687

 Score =  388 bits (997), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 244/663 (36%), Positives = 345/663 (52%), Gaps = 69/663 (10%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRLA E SPYLLQHA NPVDW+ W EEAFAEAR+R VP+ LS+GYS+CHWCHVM  ESF
Sbjct: 6   TNRLAHETSPYLLQHADNPVDWWPWSEEAFAEARERGVPVLLSVGYSSCHWCHVMAHESF 65

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           ED   A  LN  FVS+KVDREERPDVD VYM  VQA  G GGWP++VFL+PD +P   GT
Sbjct: 66  EDRETADYLNAHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMTVFLTPDAEPFYFGT 125

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EALSASASSNKLPDEL 280
           YFPP  ++G P F+ +L  V+ AW  +RD + +     +  L+   L  +A      ++L
Sbjct: 126 YFPPAPRHGMPSFRQVLEGVRAAWADRRDEVTEVAGKIVRDLAGRELQFAAVEVPGEEDL 185

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
            +  L      L++ YD+  GGFG APKFP  + I+ +L H  +   TG  G      +M
Sbjct: 186 ARALL-----GLTREYDAVHGGFGGAPKFPPSMVIEFLLRHYAR---TGSEG----ALQM 233

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
              T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  VY   +  T    
Sbjct: 234 AQDTCERMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALLCRVYAHLWRATGSEL 293

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
              +  +  D++ R++    G   SA DADS   +G  +  EGA+YVWT  ++ ++LG+ 
Sbjct: 294 ARRVALETADFMVRELGTGEGGFASALDADS--DDGTGKHVEGAYYVWTPAQLREVLGDQ 351

Query: 461 -AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASASKLGMPLEKYLNIL 518
            A L  + + +   G  +            G++VL +  ++    A K           +
Sbjct: 352 DADLAAQFFGVTEEGTFE-----------HGQSVLRLPQHEGVFDAEK-----------I 389

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
              + +L   R++RP P  DDKV+ +WNGL +++ A                      DR
Sbjct: 390 ASIKDRLNRARAQRPAPGRDDKVVAAWNGLAVAALAETGAYF----------------DR 433

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
            + +E A +AA  + R   DE+    + S         G L+DYA +  G L L      
Sbjct: 434 PDLVEAAIAAADLLVRLHLDEKAQLARTSKDGRVGANAGVLEDYADVAEGFLALASVTGE 493

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
             WL +A  L +     F+D E G  ++T  +   ++ R ++  D A PSG S +   L+
Sbjct: 494 GVWLEFAGFLLDHVLVRFVDEESGALYDTAADAEKLIRRPQDPTDNATPSGWSAAAGALL 553

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----MCCAADMLSVPSRKHVV 753
              S  A + S+ +R  AE +L +    +K +   VP      +  A  +L  P  + V 
Sbjct: 554 ---SYTAHTGSEPHRAAAERALGI----VKALGPRVPRFIGWGLATAEALLDGP--REVA 604

Query: 754 LVG 756
           +VG
Sbjct: 605 VVG 607


>gi|82701479|ref|YP_411045.1| hypothetical protein Nmul_A0345 [Nitrosospira multiformis ATCC
           25196]
 gi|82409544|gb|ABB73653.1| Protein of unknown function DUF255 [Nitrosospira multiformis ATCC
           25196]
          Length = 700

 Score =  388 bits (997), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 236/647 (36%), Positives = 340/647 (52%), Gaps = 61/647 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N LA E SPYLLQHA NPVDW+ WGEEA   AR +D PI LS+GYS CHWCHVM  E FE
Sbjct: 3   NHLAGETSPYLLQHADNPVDWYPWGEEALTLARAQDRPILLSVGYSACHWCHVMAHECFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPLSVFLSPDLKPLMGGT 221
           D  VA+++N +F++IKVDREERPD+D++Y T +  L    GGWPL++FL+PD KP  GGT
Sbjct: 63  DAEVAEVMNRYFINIKVDREERPDIDQIYQTALYMLTQRSGGWPLTLFLTPDQKPFFGGT 122

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP   ++  PGF  +L +V + +  +R  + +  A  ++  +  L + A    +  E P
Sbjct: 123 YFPKTPRHSLPGFLDLLPRVAETYRVRRPEIERQSASLLKSFANMLPSKAPEAPVFSERP 182

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
              L     +L   +DS  GGFG  PKF    E+   L   ++    G     SE   M 
Sbjct: 183 ---LEQALAELKNRFDSENGGFGEPPKFLHLTELDFCL---RRYFTAGN----SEALHMA 232

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             TL+ MA+GGI+D VGGGF+RYS D++W +PHFEKMLYD G L ++Y DA+  + +  +
Sbjct: 233 TLTLEKMAEGGIYDQVGGGFYRYSTDKQWQIPHFEKMLYDNGPLLHLYADAWIASGNPLF 292

Query: 402 SYICRDILDYLRRDMIG--------PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
           + I  +   ++ R+M           G   +S  DADS          EG FYVW   E 
Sbjct: 293 ARIVEETATWVMREMQPEYEENEKRTGAGYWSTLDADSENV-------EGKFYVWDRSEA 345

Query: 454 EDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
             IL     +    +Y        LS+ ++  N +    V   L +    A   G+   +
Sbjct: 346 SHILSRREYVVAASHY-------GLSQPANFGNRYWHLAVAQSLPE---IAENFGVTYAE 395

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
               L   R+KL   R  R RP  D+K++ SWNGL+I   ARA ++              
Sbjct: 396 ARQWLESGRKKLLAQRQCRVRPGRDEKILTSWNGLMIKGMARAGRVF------------- 442

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
               R +++  A  A  FIR  L+  +  RL  ++++G ++   +LDDYAFL+ GLL+L 
Sbjct: 443 ---GRDDWVRSAICAVDFIRSTLW--KNGRLLATWKDGNARLNAYLDDYAFLLDGLLELM 497

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           +       L +AI L     + F D+E GG+F T+ +  +++ R K  +D A PSGN V+
Sbjct: 498 QTTFRPVDLDFAIALAEVLLDQFEDKEAGGFFFTSHDHENLIHRPKPGYDNATPSGNGVA 557

Query: 694 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 740
              L R+  ++   +   Y Q AE +L +F   L    +  P  CC+
Sbjct: 558 AHTLQRMGYLLGEFR---YLQAAERALRLFYPAL----LRHPDSCCS 597


>gi|46198930|ref|YP_004597.1| hypothetical protein TTC0622 [Thermus thermophilus HB27]
 gi|46196554|gb|AAS80970.1| hypothetical conserved protein [Thermus thermophilus HB27]
          Length = 642

 Score =  388 bits (997), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 245/623 (39%), Positives = 332/623 (53%), Gaps = 83/623 (13%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL A  SPYLL HA +PVDW+ +GEEAF +A+  D PIFLS+GY++CHWCHVM  ESF+
Sbjct: 3   NRLKAARSPYLLAHAEDPVDWYPFGEEAFRKAQAEDKPIFLSVGYASCHWCHVMHRESFQ 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA+LLN  FV +KVDREERPDVD  YM  + +L G GGWP+S+FL+P+ KP  GGTY
Sbjct: 63  DEEVARLLNAHFVPVKVDREERPDVDAAYMRALVSLTGQGGWPMSLFLTPEGKPFFGGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP ED+ G PGFK +L  V +AW  KR+ + +      E+L+ AL  S +    P  LP+
Sbjct: 123 FPKEDRMGLPGFKRVLVAVAEAWAGKREAILEEA----ERLTRALWKSLTPP--PGPLPE 176

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
            A     + L +++D  +GGF  APKFP+   +  +L  + + E+           +++ 
Sbjct: 177 GAEEEALDHLERAFDPEWGGFLPAPKFPQGPLLLYLLARAWEGEERAA--------RLLR 228

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL+ MA GG++D VGGGFHRYSVD  W +PHFEKMLYD   LA VYL A+ L  +  + 
Sbjct: 229 PTLRAMALGGVYDQVGGGFHRYSVDRFWRLPHFEKMLYDNALLARVYLGAYKLFGEDLFL 288

Query: 403 YICRDILDYL----RRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
            + R+ LD+L    RR+     G   +A D   AE+EG    +EG +Y W   E+ + LG
Sbjct: 289 RVARETLDWLLSMQRRE-----GGFHTALD---AESEG----EEGRYYTWAEVELREALG 336

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
           E   L + ++ L      DL            ++VL    ++ A    LG   E +    
Sbjct: 337 EDFPLARRYFAL----GEDLGE----------RSVLTAWGEAEAR-KVLG---EGFFAWR 378

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
              R KL   R +R  P LDDKV+  W+ L + + A A ++   E               
Sbjct: 379 EGVRAKLQGARRRRMPPALDDKVLADWSALAVRALAEAGRLFGEE--------------- 423

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
             Y+E A   A F+  H+Y E    L+H++R G      +L D AF     L+LY     
Sbjct: 424 -RYLEAARRGARFLLAHMYREGL--LRHTWR-GSLGEEAYLSDQAFAALAFLELYAATGE 479

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
             +L WA  L      LF  REG          PS+ L  KE  +GA PSG S     LV
Sbjct: 480 WPYLDWAQRLAEAGWRLF--REG----------PSLPLPAKEVEEGALPSGESALAEALV 527

Query: 699 RLASIVAGSKSDYYRQNAEHSLA 721
           RL ++  G     YR+ AE  LA
Sbjct: 528 RLGAVFGGD----YRERAEEVLA 546


>gi|289548374|ref|YP_003473362.1| hypothetical protein Thal_0601 [Thermocrinis albus DSM 14484]
 gi|289181991|gb|ADC89235.1| protein of unknown function DUF255 [Thermocrinis albus DSM 14484]
          Length = 655

 Score =  388 bits (996), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 231/636 (36%), Positives = 346/636 (54%), Gaps = 56/636 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL + AH PVDW+ W EEAF +A++ D PI LS+G   CHWCHVM  E FE
Sbjct: 11  NRLIKERSPYLKKSAHQPVDWYPWCEEAFRKAKEEDKPILLSVGAVWCHWCHVMAKECFE 70

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +  +A+++N+ FV+IKVDR+ERPD+D+ Y   V +L G GGWPL+VFL+PD K   GGTY
Sbjct: 71  NPEIAQIINENFVAIKVDRDERPDIDRRYQEVVVSLTGSGGWPLTVFLTPDGKAFFGGTY 130

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPPED++GRPGFK++L ++   W + RD + +S     E L    + S+SS+K  D + +
Sbjct: 131 FPPEDRWGRPGFKSLLLRIAQLWKEDRDRVIRSAEHIFELLR---NYSSSSHK--DNVGE 185

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L      L  S D ++GG G+APKF      +++LYH      TG++       + V 
Sbjct: 186 ELLNRGIANLLASVDYQYGGIGTAPKFHHARAFELLLYHHFF---TGQTLPV----EAVE 238

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  MA+GGI+DH+GGGF RYS D+RW VPHFEKML D  +L  VY  AF +TK   Y 
Sbjct: 239 ITLDSMARGGIYDHLGGGFFRYSTDDRWIVPHFEKMLSDNAELLLVYSLAFQVTKKDLYR 298

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
           Y+   IL+Y +R     GG  ++++DAD  + +      EG +Y ++ +E+  IL E  +
Sbjct: 299 YVVEGILNYYQRFGFDEGGGFYASQDADIGDLD------EGGYYTFSLEELRGILTEEEL 352

Query: 463 LFKEHYY-LKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
                Y+ + P G        DP      KNVL         A+  G+PLE+   +L   
Sbjct: 353 KVTSLYFDIHPKGEMH----HDP-----SKNVLFIAMSEEEVATATGIPLERVRQLLESA 403

Query: 522 RRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
           RRK+   R S R +P +D  +  +WNGL++ + +   K+         F  P V S    
Sbjct: 404 RRKMLSYRESTRQQPFIDKTIYTNWNGLMLEALSTCYKV---------FRIPWVLSS--- 451

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
               AE  A  + + ++ +   +L H++        G  +DY FL  GLL L+E     +
Sbjct: 452 ----AEKTADRLMKEMWKDG--QLMHTY-----GVKGMAEDYIFLARGLLSLFEVTQKRE 500

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVR 699
           +L  ++ L +   + F D +G G+F+T  +D  +L +R+K   D    S N  +    + 
Sbjct: 501 YLEASVMLAHEAIKKFWDPQGWGFFDTEEKDEGLLRIRLKTLQDTPTQSVNGAAPYLYLV 560

Query: 700 LASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 735
           L S+   ++   + + AE +L  F   ++++ +  P
Sbjct: 561 LGSVTPYTE---FLEYAEKNLQAFARMVREIPLISP 593


>gi|408529633|emb|CCK27807.1| hypothetical protein BN159_3428 [Streptomyces davawensis JCM 4913]
          Length = 682

 Score =  388 bits (996), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 230/626 (36%), Positives = 324/626 (51%), Gaps = 59/626 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYLLQHA NPVDW+ W +EAF EAR    P+ LS+GY++CHWCHVM  ESFE
Sbjct: 9   NRLAHETSPYLLQHADNPVDWWPWSQEAFEEARGSGKPVLLSVGYASCHWCHVMAHESFE 68

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE  A  LN+ FV++KVDREERPDVD VYM  VQA  G GGWP++VFL+PD +P   GTY
Sbjct: 69  DEATAAYLNEHFVNVKVDREERPDVDAVYMEAVQAATGQGGWPMTVFLTPDAEPFYFGTY 128

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPP  ++G P F+ +L  V+ AW  +RD +A+     +  L+E   +   S    +E   
Sbjct: 129 FPPAPRHGMPSFRQVLEGVQQAWTGRRDEVAEVAGKIVRDLAEREISYGDSQAPGEEELA 188

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
            AL      L++ YD++ GGFG APKFP  + I+ +L H  +   TG  G      +M  
Sbjct: 189 GALL----GLTREYDAQRGGFGGAPKFPPSMVIEFLLRHHAR---TGSEG----ALQMAA 237

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  VY   +  T      
Sbjct: 238 DTCERMARGGIYDQLGGGFARYSVDRDWVVPHFEKMLYDNALLCRVYAHLWRSTGSELAR 297

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            +  +  D++ R++    G   SA DADS   +G  +  EGA+YVWT ++  ++LG+ A 
Sbjct: 298 RVALETADFMVRELRTNEGGFASALDADS--DDGTGKHVEGAYYVWTPQQFREVLGDDAE 355

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI---LG 519
              +++ +   G  +                          AS L +P  + L +   + 
Sbjct: 356 RAAQYFGVTEEGTFE------------------------EGASVLQLPQHEGLFVAEKVA 391

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
             R +L   R++RP P  DDKV+ +WNGL I++ A                      DR 
Sbjct: 392 SVRERLLAARAERPAPGRDDKVVAAWNGLAIAALAETGAYF----------------DRP 435

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
           + +E A  AA  + R   DE     + S         G L+DYA +  G L L       
Sbjct: 436 DLVEAAVCAADLLVRLHLDEHVQIARTSKDGQVGANAGVLEDYADVAEGFLALASVTGEG 495

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
            WL +A  L +     F+D   G  ++T  +   ++ R ++  D A PSG + +   L+ 
Sbjct: 496 VWLEFAGFLLDHVLARFVDERSGALYDTAVDAERLIRRPQDPTDNAAPSGWTAAAGALL- 554

Query: 700 LASIVAGSKSDYYRQNAEHSLAVFET 725
             S  A + ++ +R  AE +L V + 
Sbjct: 555 --SYAAQTGAEPHRAAAERALGVVKA 578


>gi|386360498|ref|YP_006058743.1| thioredoxin domain-containing protein [Thermus thermophilus JL-18]
 gi|383509525|gb|AFH38957.1| thioredoxin domain-containing protein [Thermus thermophilus JL-18]
          Length = 639

 Score =  388 bits (996), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 245/623 (39%), Positives = 332/623 (53%), Gaps = 83/623 (13%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL    SPYLL HA +PVDW+ +GEEAF +AR  D PIFLS+GY TCHWCHVM  ESF+
Sbjct: 2   NRLKDAKSPYLLAHAKDPVDWYPFGEEAFQKARAEDKPIFLSVGYHTCHWCHVMHRESFQ 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA+LLN  FV +KVDREERPDVD  YM  + +L G GGWP+S+FL+P+ KP  GGTY
Sbjct: 62  DEEVARLLNAHFVPVKVDREERPDVDAAYMRALVSLTGQGGWPMSLFLTPEGKPFFGGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP ED+ G PGFK +L  V +AW  KR+ + +      E+L+ AL  S +    P  LP+
Sbjct: 122 FPKEDRMGLPGFKRVLVAVAEAWTGKREAVLEEA----ERLTRALWKSLTPP--PGPLPE 175

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
            A     + L +++D  +GGF  APKFP+   +  +L  + + E+           +++ 
Sbjct: 176 GAEEEALDHLERAFDPEWGGFLPAPKFPQGPLLLYLLARAWEGEE--------RAARLLR 227

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL+ MA GG++D VGGGFHRYSVD  W +PHFEKMLYD   LA VYL A+ L  +  + 
Sbjct: 228 PTLRAMALGGVYDQVGGGFHRYSVDRFWRLPHFEKMLYDNALLARVYLGAYKLFGEDLFL 287

Query: 403 YICRDILDYL----RRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
            + R+ LD+L    RR+     G   +A D   AE+EG    +EG +Y WT  E+ + LG
Sbjct: 288 RVARETLDWLLSMQRRE-----GGFHTALD---AESEG----EEGRYYTWTEAELREALG 335

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
           E   L + ++ L      DL            ++VL    ++    + LG   E +    
Sbjct: 336 EDFPLARRYFAL----GEDLGE----------RSVLTAWGEAEVREA-LG---EGFFAWR 377

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
              R KL   R +R  P LDDKV+  W+ L + + A A ++   EA              
Sbjct: 378 EGVRAKLQGARRRRMPPALDDKVLADWSALAVRALAEAGRLFGEEA-------------- 423

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
             Y+E A+  A F+  H+Y  +   L+H++R G      +L D AF     L+LY     
Sbjct: 424 --YLEAAKRGARFLLAHMY--RGGLLRHTWR-GSLGEEAYLSDQAFAALAFLELYAATGE 478

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
             +L WA         LF  REG          PS+ L  KE  +GA PSG S     LV
Sbjct: 479 WPYLDWAQRFAEAGWRLF--REG----------PSLPLPAKEVEEGALPSGESALAEALV 526

Query: 699 RLASIVAGSKSDYYRQNAEHSLA 721
           RL ++  G     YR+ AE  LA
Sbjct: 527 RLGAVFGGD----YRERAEEVLA 545


>gi|357411497|ref|YP_004923233.1| hypothetical protein Sfla_2286 [Streptomyces flavogriseus ATCC
           33331]
 gi|320008866|gb|ADW03716.1| hypothetical protein Sfla_2286 [Streptomyces flavogriseus ATCC
           33331]
          Length = 675

 Score =  387 bits (995), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 239/625 (38%), Positives = 329/625 (52%), Gaps = 59/625 (9%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRLA   SPYLLQHA NPVDW+ W  EAF EAR+R+VP+ LS+GY++CHWCHVM  ESF
Sbjct: 2   VNRLADAMSPYLLQHADNPVDWWQWSPEAFEEARRRNVPVLLSVGYASCHWCHVMAHESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           ED  VA  LN  FV +KVDREERPDVD VYM  VQA  G GGWP++VFL+ + +P   GT
Sbjct: 62  EDPSVADYLNAHFVPVKVDREERPDVDAVYMEAVQAATGQGGWPMTVFLTAEAEPFYFGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE-- 279
           YFPPE ++G P F+ +L  V  AW  +R+ +A+     +  L+   S +A+   LP E  
Sbjct: 122 YFPPESRHGMPSFQQVLEGVAAAWTDRREEVAEVAGRIVRDLA-GRSLAAAEGGLPGEPE 180

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
           L Q  LRL     ++ YD R GGFG APKFP  + I+ +L H  +   TG  G      +
Sbjct: 181 LAQALLRL-----TRDYDERHGGFGGAPKFPPSMVIEFLLRHHAR---TGAEG----ALQ 228

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           M   +   MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  VY   +  T   
Sbjct: 229 MAADSCAAMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALLCRVYAHLWRATGSD 288

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 459
               +  +  D++ R++    G   SA DADS + +G  R  EGAFYVWT  ++ ++LGE
Sbjct: 289 LARRVALETADFMVRELRTAEGGFASALDADSEDAQG--RHVEGAFYVWTPAQLREVLGE 346

Query: 460 HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
               F   Y+           +++     +G +VL  +    A  +      E+   +  
Sbjct: 347 DDAAFAAEYF----------GVTEEGTFEEGSSVLRLVPAGEAEPADD----ERIAGV-- 390

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
             R +L   R  RPRP  DDKV+ +WNGL I++ A                      DR 
Sbjct: 391 --RGRLLAARELRPRPERDDKVVAAWNGLAIAALAETGAYF----------------DRP 432

Query: 580 EYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLISGLLDLYEFGS 637
           + +E A  AA   +R H+ D    RL  + ++G      G L+DY  +  G L L     
Sbjct: 433 DLVERATEAADLLVRVHMGD--VARLCRTSKDGRAGDNSGVLEDYGDVAEGFLALASVTG 490

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
              WL +A  L +   + F   E G  F+T  +   ++ R ++  D A P+G + +   L
Sbjct: 491 EGAWLEFAGFLLDIVLQHFTG-EKGQLFDTADDAEQLIRRPQDPTDNATPAGWTAAAGAL 549

Query: 698 VRLASIVAGSKSDYYRQNAEHSLAV 722
           +   S  A + S+ +R  AE +L V
Sbjct: 550 L---SYAAHTGSEAHRAAAEGALGV 571


>gi|256389916|ref|YP_003111480.1| hypothetical protein Caci_0704 [Catenulispora acidiphila DSM 44928]
 gi|256356142|gb|ACU69639.1| protein of unknown function DUF255 [Catenulispora acidiphila DSM
           44928]
          Length = 710

 Score =  387 bits (995), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 233/624 (37%), Positives = 335/624 (53%), Gaps = 61/624 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA+  SPYLLQHA NPVDW+ WGEEAFAEAR+RDVP+ LSIGY+ CHWCHVM  ESFE
Sbjct: 2   NRLASATSPYLLQHADNPVDWWPWGEEAFAEARRRDVPVLLSIGYAACHWCHVMAHESFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE  A L+N+ +V +KVDREERPDVD VYM   QA+ GGGGWP++VF +P+ KP   GTY
Sbjct: 62  DEATAALMNEKYVCVKVDREERPDVDAVYMAATQAMTGGGGWPMTVFATPEGKPFQAGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           +PP  ++G P F+ +L  V  AW   R+ + ++G   + +L+      A +  +PD    
Sbjct: 122 YPPVARHGLPSFRQLLVAVDRAWGDIREDVLRAGDGLVAELAHHARVVAGAEGVPD---A 178

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
            AL      L + +D   GGFG APKFP  + ++ +L H  +  D       ++   MV 
Sbjct: 179 GALATAVGVLRREFDGVRGGFGGAPKFPPSMTLEQLLRHHARTGD-------ADALAMVR 231

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            T + MA+GG++D +GGGF RY+VD+ W VPHFEKMLYD   L   YL  +  T D    
Sbjct: 232 QTCEAMARGGMYDQLGGGFARYAVDDAWVVPHFEKMLYDNALLLRAYLHLWRATGDALAL 291

Query: 403 YICRDILDYLRRDMI--GPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
            +  +  D++ R++   G GG   S+ DAD       T   EG FY W ++++ D +GE 
Sbjct: 292 RVVNETADWMLRELWLDGAGG-FASSLDAD-------TDGVEGKFYAWDAEQIADAVGE- 342

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFK-GKNVLIELNDSSASASKLGMPLEKYLNILG 519
               KE                     F+ G +VL  L D           L+++  I  
Sbjct: 343 ----KEAGDAGDAAWAAAVFNVTAQGTFEHGLSVLQLLQDPD--------DLDRFQRI-- 388

Query: 520 ECRRKLFDV-RSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
             R  LF+  R +R  P  DDK + +WNGL +++ A A  +                + R
Sbjct: 389 --RDSLFEARRDQRTAPGRDDKAVAAWNGLAVAALAEAGAL----------------TGR 430

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA--PGFLDDYAFLISGLLDLYEFG 636
           +E +  A   A  + R  +D +T RL  + R+G + A  PG L+DYA +  GLL LY   
Sbjct: 431 QELVSAARQTAEMLERIHWDGKTMRLTRTSRDGVAGAQNPGVLEDYADVAEGLLALYAVT 490

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
             T+W  +A  L +   + F D + G +++T  +  +++ R  +  D A P G S +   
Sbjct: 491 GETRWFAFAGRLLDVVLDNFRD-DSGLFYDTADDAEALIFRPADPTDNATPGGTSAAAGA 549

Query: 697 LVRLASIVAGSKSDYYRQNAEHSL 720
           L+  A++   + S  +R+ AE +L
Sbjct: 550 LLTYAAL---TGSGRHREAAEQAL 570


>gi|381190578|ref|ZP_09898097.1| hypothetical protein RLTM_06066 [Thermus sp. RL]
 gi|384431187|ref|YP_005640547.1| tmk1; thymidylate kinase [Thermus thermophilus SG0.5JP17-16]
 gi|333966655|gb|AEG33420.1| tmk1; thymidylate kinase [Thermus thermophilus SG0.5JP17-16]
 gi|380451573|gb|EIA39178.1| hypothetical protein RLTM_06066 [Thermus sp. RL]
          Length = 642

 Score =  387 bits (995), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 244/623 (39%), Positives = 334/623 (53%), Gaps = 83/623 (13%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL A  SPYLL HA +PVDW+ +GEEAF +A+  D PIFLS+GY++CHWCHVM  ESF+
Sbjct: 3   NRLKAARSPYLLAHAEDPVDWYPFGEEAFRKAQAEDKPIFLSVGYASCHWCHVMHRESFQ 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA+LLN  FV +KVDREERPDVD  YM  + +L G GGWP+S+FL+P+ KP  GGTY
Sbjct: 63  DEEVARLLNAHFVPVKVDREERPDVDAAYMRALVSLTGQGGWPMSLFLTPEGKPFFGGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP ED+ G PGFK +L  V +AW  KR+ + +      E+L+ AL  S +    P  LP+
Sbjct: 123 FPKEDRMGLPGFKRVLVAVAEAWAGKREAVLEEA----ERLTRALWKSLTPP--PGPLPE 176

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
            A     + L +++D  +GGF  APKFP+   +  +L  + + E+           +++ 
Sbjct: 177 GAEEEALDHLERAFDPEWGGFLPAPKFPQGPLLLYLLARAWEGEE--------RAARLLR 228

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL+ MA GG++D VGGGFHRYSVD  W +PHFEKMLYD   LA VYL A+ L  +  + 
Sbjct: 229 PTLRAMALGGVYDQVGGGFHRYSVDRFWRLPHFEKMLYDNALLARVYLGAYKLFGEDLFL 288

Query: 403 YICRDILDYL----RRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
            + R+ LD+L    RR+     G   +A D   AE+EG    +EG +Y WT  E+ + LG
Sbjct: 289 RVARETLDWLLSMQRRE-----GGFHTALD---AESEG----EEGRYYTWTEAELREALG 336

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
           E   L + ++ L      DL            ++VL    ++    + LG   E +    
Sbjct: 337 EDFPLARRYFAL----GEDLGE----------RSVLTAWGEAEVREA-LG---EGFFAWR 378

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
              R KL   R +R  P LDDKV+  W+ L + + A A ++   EA              
Sbjct: 379 EGVRAKLQGARRRRMPPALDDKVLADWSALAVRALAEAGRLFGEEA-------------- 424

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
             Y+E A+  A F+  H+Y  +   L+H++R G      +L D AF     L+LY     
Sbjct: 425 --YLEAAKRGARFLLAHMY--RGGLLRHTWR-GSLGEEAYLSDQAFAALAFLELYAATGE 479

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
             +L WA         LF  REG          PS+ L  KE  +GA PSG S     LV
Sbjct: 480 WPYLDWAQRFAEAGWRLF--REG----------PSLPLPAKEVEEGALPSGESALAEALV 527

Query: 699 RLASIVAGSKSDYYRQNAEHSLA 721
           RL ++  G     YR+ AE  LA
Sbjct: 528 RLGAVFGGD----YRERAEEVLA 546


>gi|398348235|ref|ZP_10532938.1| hypothetical protein Lbro5_13624 [Leptospira broomii str. 5399]
          Length = 669

 Score =  387 bits (995), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 249/647 (38%), Positives = 344/647 (53%), Gaps = 66/647 (10%)

Query: 119 NPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIK 178
           NPVDWF WG++AF +A++ D  IFLSIGY+TCHWCHVME ESFEDE  A +LN +FVSIK
Sbjct: 2   NPVDWFPWGKDAFLKAKEEDKMIFLSIGYATCHWCHVMEKESFEDEATAAVLNQYFVSIK 61

Query: 179 VDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTIL 238
           VDREERPDVD++YM  + A+   GGWPL++FL+ + KP+ GGTYFPP  KYGR  F  +L
Sbjct: 62  VDREERPDVDRIYMDALHAMNQQGGWPLNMFLTSEGKPITGGTYFPPVAKYGRKSFVEVL 121

Query: 239 RKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQL------ 292
             + + W +K+  L      A E+L++ L  S  S  L +   Q+A +L ++++      
Sbjct: 122 NILANLWKEKKGELID----ASEELTQYLKESEESKALNE---QSAFQLPSKKVFENAFG 174

Query: 293 --SKSYDSRFGGFGS--APKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCM 348
              + YD  F GF S    KFP  + +  +L   K       +GE  +  +MV  TL  M
Sbjct: 175 MYDRFYDPEFAGFKSNVTNKFPPSMGLFFLLRFYK------STGE-PKALEMVEETLVAM 227

Query: 349 AKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDI 408
            KGGI+D +GGG  RYS D +W VPHFEKMLYD        ++ F  T  V Y     D+
Sbjct: 228 RKGGIYDQIGGGISRYSTDHKWLVPHFEKMLYDNSLFLEALVECFQTTGHVKYKEAAYDV 287

Query: 409 LDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHY 468
           L+YL RDM   GG I SAEDADS   EG    +EG FY+W   E  ++ G  AIL +E +
Sbjct: 288 LEYLSRDMRLQGGGIASAEDADS---EG----EEGLFYLWKRNEFHEVCGSDAILLEEFW 340

Query: 469 YLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDV 528
            +   GN            F+G N+L E +  +  A   G+  E+ + I+   R+KL   
Sbjct: 341 NVTEIGN------------FEGSNILHE-SFRTNFARLHGLEQEELIEIVDRNRKKLLAR 387

Query: 529 RSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESA 588
           RS R RP  DDKV++SWN L + +  +A+                      E + +AE  
Sbjct: 388 RSDRIRPLRDDKVLLSWNCLYVKAATKAAMAFGD----------------GELLRLAEET 431

Query: 589 ASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIEL 648
             FI  +L  E   RL   FR+G ++   +  DYA  I   L L++ G G ++L  AI  
Sbjct: 432 FRFIENNLVREDG-RLLRRFRDGEARFLAYSGDYAEFILASLWLFQAGKGIRYLTLAI-- 488

Query: 649 QNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSGNSVSVINLVRLASIVAGS 707
           +  +D + L R   G F  TG D   LLR   D +DG EPS NS        L+ +  G 
Sbjct: 489 RYAEDAVRLFRSPAGVFFDTGSDADDLLRRNVDGYDGVEPSANSSFAFAFTILSRL--GV 546

Query: 708 KSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVL 754
           +SD Y   A+   + F+  L+   M  P M  A  + +  S++  V+
Sbjct: 547 ESDKYSDFADAIFSYFKVELETHPMNYPYMLSAYWLKNSASKELAVV 593


>gi|452207570|ref|YP_007487692.1| YyaL family protein [Natronomonas moolapensis 8.8.11]
 gi|452083670|emb|CCQ36982.1| YyaL family protein [Natronomonas moolapensis 8.8.11]
          Length = 709

 Score =  387 bits (995), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 232/614 (37%), Positives = 320/614 (52%), Gaps = 55/614 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL    SPYL QHA NPV W  W E A   AR+RD PIFLSIGY+ CHWCHVM  ESFE
Sbjct: 3   NRLDEAASPYLRQHADNPVAWQPWDEAALELARERDAPIFLSIGYAACHWCHVMADESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  +A+ LN+ FV IKVDREERPDVD +YM   Q + G GGWPLSV+L+P+ KP   GTY
Sbjct: 63  DPEIAETLNEAFVPIKVDREERPDVDTLYMNVCQMVRGSGGWPLSVWLTPEGKPFHVGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDK---KRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           FPPE     P F ++L  + D+W+    +  + +Q+  +A     E       S + P E
Sbjct: 123 FPPEATANMPSFGSVLGDIADSWNDPEGRSRLESQADQWASSTKGELEGTPDRSGEAPGE 182

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHSKKLEDTGKSGEASEGQ 338
                L   A    +  D   GG+G   KFP P  I ++L  +     DT +        
Sbjct: 183 ---GFLDTAANAAVRGADREAGGWGQGQKFPHPGRIHLLLRAYDATDRDTYR-------- 231

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
            + L TL  MA GG++DHVGGGFHRY VD  W VPHFEKMLYD  ++   +L  + LT +
Sbjct: 232 DVALETLDAMASGGLYDHVGGGFHRYCVDREWTVPHFEKMLYDNAEIPRAFLAGYRLTGE 291

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             Y+ I  +   +L R++  P G  +S  DA+S ++ G+  ++EGAFYVWT + V + + 
Sbjct: 292 ERYAEIASETFAFLERELTHPDGGFYSTLDAESEDSTGS--REEGAFYVWTPETVREAVD 349

Query: 459 E--HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
           +   A LF E Y +  +GN +            G  VL E       A+   M  +    
Sbjct: 350 DPTAAELFCERYGVTDSGNFE-----------NGTTVLTESTPIGELAADAVMDTDSVEA 398

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
           +L   R +LF+ R  RPRP  D KV+  WNGL+IS+ A  +  L                
Sbjct: 399 LLETARSQLFEARESRPRPPRDGKVLAGWNGLMISALAEGALALN--------------- 443

Query: 577 DRKEYMEVAESAASFIRRHLY-DEQTH-----RLQHSFRNGPSKAPGFLDDYAFLISGLL 630
               Y ++AE+A  F R  L+ DE T      RL   F  G     G+L+DYA+L  G  
Sbjct: 444 --PTYADLAEAALEFCRDRLWEDEGTQDGDVGRLNRRFERGEVGISGYLEDYAYLGRGAF 501

Query: 631 DLYEFGSGTKWLVWAIEL-QNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 689
           DLY+     + L +A++L +  +   + + EG  YF  TG +  ++ R ++  D + PS 
Sbjct: 502 DLYQATGDVEHLQFALQLGRAIRASFYEESEGTLYFTPTGGE-ELIARPQQLADSSTPSS 560

Query: 690 NSVSVINLVRLASI 703
             V+V  L  L++ 
Sbjct: 561 TGVAVQLLAALSAF 574


>gi|195952439|ref|YP_002120729.1| hypothetical protein HY04AAS1_0059 [Hydrogenobaculum sp. Y04AAS1]
 gi|195932051|gb|ACG56751.1| protein of unknown function DUF255 [Hydrogenobaculum sp. Y04AAS1]
          Length = 634

 Score =  387 bits (994), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 241/639 (37%), Positives = 334/639 (52%), Gaps = 82/639 (12%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K  NRL  E SPYL  HA+NPVDW+ W EEAF +A K + P+FLSIGYS+CHWCHVME E
Sbjct: 2   KTPNRLINEKSPYLKMHAYNPVDWYPWSEEAFDKAIKENKPVFLSIGYSSCHWCHVMEKE 61

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFEDE VA  LN  FVSIKVD+EERPD+D +Y+ Y   L   GGWPLSVFL+P  +P   
Sbjct: 62  SFEDEEVASFLNKCFVSIKVDKEERPDIDSLYIEYCVLLNNSGGWPLSVFLTPTKEPFFA 121

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           GTYFP      +  F  +L ++KD WDK    + +     +EQL + +++         E
Sbjct: 122 GTYFP------KASFLKLLNQIKDLWDKDSKNIIEKSKRMVEQLKQFMNSFEKR-----E 170

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
           L ++ +      L+  YD  FGGF  APKFP    + ++L   K+             Q 
Sbjct: 171 LNESFIDKALFGLANRYDEEFGGFSEAPKFPSLHNVLLLLKSQKQ-----------PFQD 219

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           M L TL  M +GGI DHVGGGFHRYS D  W +PHFEKMLYDQ      Y +A+ LTK+ 
Sbjct: 220 MALSTLLNMRRGGIWDHVGGGFHRYSTDRYWLLPHFEKMLYDQAMAILAYSEAYRLTKNE 279

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 459
            +       +++++ ++    G  +++ DAD   TEG    +EG FY+WT +E++DIL E
Sbjct: 280 IFKDTVYKTINFVKENLY-ENGFFYTSMDAD---TEG----EEGGFYLWTYQEIKDILKE 331

Query: 460 HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
               F E + +K  GN     + +    + GKNVL         A +  M  E  L +L 
Sbjct: 332 KTDKFIEFFNIKKEGNF----LDEAKRVYTGKNVLY--------AKEPTMLFENELQVL- 378

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
               K F  R KR +P +DDK+++  N ++  +   A  + +                 K
Sbjct: 379 ----KAF--REKRKKPLIDDKILLDQNAMMDWALIEAYLVFED----------------K 416

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
           +++++A        ++L +   H LQH+  +     P  LDDYA+LI   L LY+     
Sbjct: 417 DFLDMA-------TKNLNNISKHPLQHALNHNKLIEP-MLDDYAYLIKAYLSLYKATFSK 468

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
             L  AI L     E   D+  GG++ + G+D  VL+  K  +DGA PSGNSV  +NLV 
Sbjct: 469 DALEKAISLTEEAIEKLWDKNAGGFYLSVGKD--VLIPQKTLYDGAIPSGNSVMGLNLVE 526

Query: 700 LASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 738
           L  I   +K D Y    E+   +  +   DM    P  C
Sbjct: 527 LFFI---TKEDTY----ENRYQILSSIYSDMLSRNPTAC 558


>gi|291447326|ref|ZP_06586716.1| conserved hypothetical protein [Streptomyces roseosporus NRRL
           15998]
 gi|291350273|gb|EFE77177.1| conserved hypothetical protein [Streptomyces roseosporus NRRL
           15998]
          Length = 679

 Score =  387 bits (994), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 239/627 (38%), Positives = 329/627 (52%), Gaps = 59/627 (9%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRLA   SPYLLQHA NPVDW+ W  EAF EARKRDVP+ LS+GY++CHWCHVM  ESF
Sbjct: 8   ANRLAQTTSPYLLQHADNPVDWWPWSPEAFEEARKRDVPVLLSVGYASCHWCHVMAHESF 67

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           ED+  A  LN  FV +KVDREERPDVD VYM  VQA  G GGWP++VFL+PD +P   GT
Sbjct: 68  EDDDTAAYLNAHFVPVKVDREERPDVDAVYMEAVQAATGHGGWPMTVFLTPDAEPFYFGT 127

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQ-SGAFAIEQLSEALSASASSNKLPDEL 280
           YFPPE ++G P F+ +L  V  AW  +RD +A+ +G    +    +L           E+
Sbjct: 128 YFPPEPRHGSPSFQQVLEGVTAAWTDRRDEVAEVAGRIVADLAGRSLVHGGDGVPGESEV 187

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
            Q  L      L++ YD + GGFG APKFP  + ++ +L H  +   TG  G      +M
Sbjct: 188 AQALL-----GLTREYDEQHGGFGGAPKFPPAMVVEFLLRHYAR---TGAEG----ALQM 235

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
              T   MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  VY   +  T    
Sbjct: 236 AADTCTAMARGGIYDQLGGGFARYSVDREWIVPHFEKMLYDNALLCRVYAHLWRTTGSDE 295

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
              I  +  D++ R++    G   SA DADS + +G  +  EGA+YVWT  ++ ++LGE 
Sbjct: 296 ARRIALETADFMVRELRTAEGGFASALDADSEDADG--KHVEGAYYVWTPAQLREVLGED 353

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
              F   Y+           +++     +G +VL    D+         P++    + G 
Sbjct: 354 DGAFAAAYF----------GVTEDGTFEEGASVLRLPGDAG--------PVDA-ARVAG- 393

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R +L   R +RPRP  DDKV+ +WNGL I++ A                      DR +
Sbjct: 394 VRARLLAARDERPRPGRDDKVVAAWNGLAIAALAETGAYF----------------DRPD 437

Query: 581 YMEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLISGLLDLYEFGSG 638
            +E A  AA   +R HL   +  RL  + ++G      G L+DY  +  G L L      
Sbjct: 438 LVERATEAADLLVRVHL--GEVARLTRTSKDGRAGDNAGVLEDYGDVAEGFLALAAVTGE 495

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
             WL +A  L +   E F   EGG  ++T  +   ++ R ++  D A PSG + +   L+
Sbjct: 496 GAWLEFAGFLLDIVLEQFTG-EGGQLYDTAHDAEQLIRRPQDPTDSATPSGWTAAAGALL 554

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFET 725
              S  A + S+ +R  AE +L V + 
Sbjct: 555 ---SYAAYTGSEAHRTAAEGALGVVKA 578


>gi|118579500|ref|YP_900750.1| hypothetical protein Ppro_1067 [Pelobacter propionicus DSM 2379]
 gi|118502210|gb|ABK98692.1| protein of unknown function DUF255 [Pelobacter propionicus DSM
           2379]
          Length = 687

 Score =  387 bits (994), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 236/600 (39%), Positives = 314/600 (52%), Gaps = 58/600 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL    SPYLLQHA NPV W+ WG+EAFA AR  ++PI LSIGY+TCHWCHVM  + FE
Sbjct: 30  NRLIFARSPYLLQHAENPVAWYEWGDEAFATARSGNLPILLSIGYATCHWCHVMAHDGFE 89

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+ VA LLN  FV IKVDREERPD+D  YMT  Q L G GGWPL++F++PD +P    TY
Sbjct: 90  DDQVADLLNRHFVCIKVDREERPDIDDFYMTASQVLTGSGGWPLNIFMTPDRRPFFAMTY 149

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
            P      R  F  +L  +   W +    + ++ +  +E +      +     +  EL  
Sbjct: 150 LP------RQRFMELLAGIVTLWQQHPGEVEKNCSAIMEGIERLSRGNDHECPVLAELDS 203

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
            A     EQLS  +D  +GGFG APKFP P+ +         L   G +G   E  +M  
Sbjct: 204 LAF----EQLSAIHDRTWGGFGPAPKFPLPLSLGW-------LAGQGMNGN-QEALEMAQ 251

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  + +GGI D +GGG HRYSVDERW VPHFEKMLYDQ  LA   LD      D  + 
Sbjct: 252 KTLGMIRQGGIWDQLGGGVHRYSVDERWLVPHFEKMLYDQALLAMACLDVCLAGNDPAFL 311

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            +  DI  ++ R++    G  FSA DADS         +EGA+Y+WT  ++E+ILG    
Sbjct: 312 TMAEDIFRFVGRELTSTEGAFFSALDADSG-------GEEGAYYLWTRDDIEEILGRDGE 364

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
           LF   + +   GN            F+G+N+L    D     +  G   E+   IL +CR
Sbjct: 365 LFCRFFDVGEKGN------------FQGQNILHMPVDLETFCT--GEDPERTGEILDDCR 410

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYM 582
            +L + R +R  P  D+K+I SWNGL+I++ AR   +                   +EY+
Sbjct: 411 ERLLEYREERSYPLRDEKIITSWNGLMIAALARGGAL----------------GGEQEYI 454

Query: 583 EVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWL 642
           E A  AA FI ++L   Q  RL  S+  GPS  P FL+DYAFL  GL++L+E    + W 
Sbjct: 455 ESASRAARFILKNLR-RQDGRLLRSYLAGPSSTPAFLEDYAFLCCGLIELFEATLDSFWQ 513

Query: 643 VWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL-RVKEDHDGAEPSGNSVSVINLVRLA 701
             A+ L +    LF D      F T G D   +  +   D DG  PS  S +    +RL 
Sbjct: 514 EQALLLADEMLRLFRD-PVRCVFVTVGLDAEQMAGQSPRDSDGVLPSPFSRAAHCFIRLG 572


>gi|374324300|ref|YP_005077429.1| hypothetical protein HPL003_22410 [Paenibacillus terrae HPL-003]
 gi|357203309|gb|AET61206.1| hypothetical protein HPL003_22410 [Paenibacillus terrae HPL-003]
          Length = 631

 Score =  387 bits (994), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 239/634 (37%), Positives = 337/634 (53%), Gaps = 61/634 (9%)

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           ME ESFEDE VA+LLN  +VSIKVDREERPDVD +YM+  Q + G GGWPL++ ++PD K
Sbjct: 1   MERESFEDEEVAELLNRDYVSIKVDREERPDVDHIYMSICQTMTGHGGWPLTILMTPDHK 60

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           P   GTY P E K+GR G   +L KV   W ++ D L       +E   + L+     +K
Sbjct: 61  PFFAGTYLPKEQKFGRVGLMELLPKVAARWKEQPDEL-------VELSEQVLTEHERHDK 113

Query: 276 LPD---ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSG 332
           L     EL +++L     Q S ++D  +GGFG APKFP P  +  +L +++    TG   
Sbjct: 114 LASYQGELDEHSLNKAFHQFSYAFDKDYGGFGEAPKFPSPHNLSFLLRYAQH---TGN-- 168

Query: 333 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 392
              +  +M   TL  M +GGI+DHVG GF RY+VDE+W VPHFEKMLYD   LA  Y +A
Sbjct: 169 --QQALEMAEKTLDAMYRGGIYDHVGMGFSRYAVDEKWLVPHFEKMLYDNALLAIAYTEA 226

Query: 393 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 452
           + +T    Y  I   I  Y+ RDM   GG  +SAEDADS   EG    +EG FYVW   E
Sbjct: 227 WQVTGKELYRRIAEQIFTYIARDMTDAGGAFYSAEDADS---EG----EEGKFYVWDESE 279

Query: 453 VEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGM 509
           V  ILG+  A  F + Y + P GN            F+G N+  LI++N   A   K  +
Sbjct: 280 VRAILGDKDAAFFNDLYGITPYGN------------FEGHNIPNLIDIN-LEAYGIKHDL 326

Query: 510 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 569
             ++      E R KLF  R +R  PH DDK++ SWNGL+I++ A+A +           
Sbjct: 327 TEQELEQRASELRAKLFTTREQRTHPHKDDKILTSWNGLMIAALAKAGQAFGE------- 379

Query: 570 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 629
                     +Y E A+ A SF+  HL  +   RL   FR+G +  PG++DDYAF + GL
Sbjct: 380 ---------AQYTEQAQRAESFLWNHLRRDDG-RLLARFRDGDAAYPGYVDDYAFYVWGL 429

Query: 630 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 689
           ++LY+     ++L  A+ L     +LF D E GG F    +   ++ + KE +DGA PSG
Sbjct: 430 IELYQATFDVQYLQRALTLNQDMIDLFWDEERGGLFFYGPDGEQLIAKPKEVYDGAIPSG 489

Query: 690 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR 749
           NS++  NLVRLA ++  S+ + Y   +     VF   +         +  +  + +  + 
Sbjct: 490 NSIAAHNLVRLARLMGESRLEDY---SAKQFKVFGGLVVQYPTGYSALLSSL-LYATGTT 545

Query: 750 KHVVLVGHKSSVDFENMLAAAHASYDLNKTVSKK 783
           K +V+VGH+ +      + A  A +  N  V  K
Sbjct: 546 KEIVIVGHRDAPQTVQFIRAVQAGFRPNTVVILK 579


>gi|284989523|ref|YP_003408077.1| hypothetical protein Gobs_0945 [Geodermatophilus obscurus DSM
           43160]
 gi|284062768|gb|ADB73706.1| protein of unknown function DUF255 [Geodermatophilus obscurus DSM
           43160]
          Length = 665

 Score =  387 bits (993), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 250/679 (36%), Positives = 336/679 (49%), Gaps = 67/679 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   SPYLLQHA NPVDW+ WGEEAFAEAR+RDVP+ LS+GY+ CHWCHVM  ESFE
Sbjct: 3   NRLATATSPYLLQHAGNPVDWWEWGEEAFAEARRRDVPVLLSVGYAACHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE  A  +N  FV +KVDREERPDVD VYM   QAL G GGWP++VF +PD +P   GTY
Sbjct: 63  DEATAGQMNADFVCVKVDREERPDVDSVYMAATQALTGHGGWPMTVFTTPDGRPFYCGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPP   +G P F+ +L  V DAW  +R+ L  +G    E +S  L         P  L  
Sbjct: 123 FPPRPAHGMPSFRQLLSAVSDAWRSRREDLETAGTRIAEGISSRLDLGP-----PAPLAA 177

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L      L+  YD R+GGFG APKFP  + ++ +L H+ +  D           +M  
Sbjct: 178 EVLDHAVAALAGEYDERWGGFGGAPKFPPSMVLEFLLRHAARTGD-------DRALRMAR 230

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  MA+GGIHD + GGF RYSVD RW VPHFEKMLYD   L  +YL  +  T D +  
Sbjct: 231 GTLGAMARGGIHDQLAGGFARYSVDARWVVPHFEKMLYDNALLLRLYLHLWRATGDEWAR 290

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            +      +L RD+  P G   SA DAD+   EG T       YVWT  E+ ++LGE   
Sbjct: 291 RVADATAAFLVRDLDTPEGGFASALDADAEGVEGLT-------YVWTPAELVEVLGEDDG 343

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
            +    +           ++D      G + L  L D    A             L   R
Sbjct: 344 RWAAAVF----------EVTDAGTFEHGTSTLQLLRDPGDPAR------------LASVR 381

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYM 582
            +L   R++RP+P  DDKV+ +WNGL I++ A    +  S +     +        +   
Sbjct: 382 ERLGAARARRPQPARDDKVVTAWNGLAIAALAEHGVLTGSPS-----SVDAARRAAELLA 436

Query: 583 EVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP-GFLDDYAFLISGLLDLYEFGSGTKW 641
           +V          H  D    RL+ + RNG + AP G L+DY  L  GLL L++     +W
Sbjct: 437 DV----------HWGD---GRLRRASRNGVAGAPSGVLEDYGDLAEGLLALHQATGEGRW 483

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           L  A +L +     F+D +  G+ +T  +  +++ R  +  DG  PSG +      V  A
Sbjct: 484 LELAGDLLDVVAGQFIDAD--GWHDTAADAEALVHRPFDPADGPTPSGLAAVAGAAVTYA 541

Query: 702 SIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSV 761
           ++    +     + A  SLA    R          M     +L+ P     V V   +  
Sbjct: 542 ALAGAPRHRELGEAAVGSLARLAERAPQAVGWA--MAVGEALLAGPLE---VAVSGPAGP 596

Query: 762 DFENMLAAAHASYDLNKTV 780
           D + ++AAA AS      V
Sbjct: 597 DRDALVAAARASTSPGAVV 615


>gi|312194562|ref|YP_004014623.1| N-acylglucosamine 2-epimerase [Frankia sp. EuI1c]
 gi|311225898|gb|ADP78753.1| N-acylglucosamine 2-epimerase [Frankia sp. EuI1c]
          Length = 686

 Score =  387 bits (993), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 245/670 (36%), Positives = 347/670 (51%), Gaps = 71/670 (10%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRLA + SPYLLQHA NPVDW+ W   AF EA +R VP+ LS+GY++CHWCHVM  ESF
Sbjct: 2   ANRLADQTSPYLLQHADNPVDWWPWEPAAFDEAARRGVPVLLSVGYASCHWCHVMAHESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           EDE  A  +N+ FV+IKVDREERPDVD VYM    AL G GGWP++VFL+P  +P   GT
Sbjct: 62  EDEATAAFMNEHFVNIKVDREERPDVDAVYMDVTVALTGHGGWPMTVFLTPAGEPFFAGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFPP+ + G P F  +L+ + +AW  +RD +  SGA    +L+EA + S    +    L 
Sbjct: 122 YFPPQGRPGMPAFSQVLQALSEAWVTRRDEIESSGADIARKLAEA-AESPVGGRAGTRLD 180

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
            + L    +QL+  +D R GGFG+APKFP  +  +++L H        +SG+A     +V
Sbjct: 181 ADLLDRAVDQLAGRFDPRNGGFGAAPKFPPSMVAELLLRHHA------RSGDA-RALDLV 233

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             T + MA+GGI+D + GGF RYSVD  W VPHFEKMLYD  QL  VYL  +  T     
Sbjct: 234 ALTCERMARGGIYDQLAGGFARYSVDATWTVPHFEKMLYDNAQLLRVYLHLWRATGSGLA 293

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADS-----------AET----EGATRKKEGAFY 446
           + + R+  ++L  D+    G   SA DAD+           AE+    E  +   EGA Y
Sbjct: 294 ARVVRETAEFLLADLRTAEGGFASALDADAVPPAAPDGPGGAESGPGDEHGSHPVEGASY 353

Query: 447 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 505
           VWT  ++  +L  + A    E + + P G             F+  + +++L    A  +
Sbjct: 354 VWTPAQLAAVLAPDDAAWAAELFAVTPEGT------------FEHGSSVLQLPADPADPA 401

Query: 506 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 565
           +           L   R +L   R+ RP+P  DDKV+ SWN            I      
Sbjct: 402 R-----------LARVRDELAAARALRPQPARDDKVVASWN---------GLAIAALAEA 441

Query: 566 SAMFNFPVVGSDRKEYMEVAESAASFIRR-HLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 624
            A+F  P        ++E AE AAS +R  HL D +  R     + GP+   G LDDY  
Sbjct: 442 GALFEVPA-------WIEAAERAASLLRDVHLVDGRLRRTSRHGKVGPNA--GVLDDYGN 492

Query: 625 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDG 684
           +  GLL LY+      WL  A EL +     F   + GG+++T  +  ++L R +E  D 
Sbjct: 493 VAEGLLALYQVTGELAWLELARELLDVARARFRAPD-GGFYDTADDAETLLRRPREISDS 551

Query: 685 AEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA-VFETRLKDMAMAVPLMCCAADM 743
             PSG S     L+  A++   + S  +R++AE ++  +     +D + A      A  +
Sbjct: 552 PTPSGQSAFAGALLTYAAL---TGSADHREDAEATVGLLAALLARDASFAGYAGAVAEAL 608

Query: 744 LSVPSRKHVV 753
           L+ P+   VV
Sbjct: 609 LAGPAEVAVV 618


>gi|239990319|ref|ZP_04710983.1| hypothetical protein SrosN1_23633 [Streptomyces roseosporus NRRL
           11379]
          Length = 673

 Score =  387 bits (993), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 239/627 (38%), Positives = 329/627 (52%), Gaps = 59/627 (9%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRLA   SPYLLQHA NPVDW+ W  EAF EARKRDVP+ LS+GY++CHWCHVM  ESF
Sbjct: 2   ANRLAQTTSPYLLQHADNPVDWWPWSPEAFEEARKRDVPVLLSVGYASCHWCHVMAHESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           ED+  A  LN  FV +KVDREERPDVD VYM  VQA  G GGWP++VFL+PD +P   GT
Sbjct: 62  EDDDTAAYLNAHFVPVKVDREERPDVDAVYMEAVQAATGHGGWPMTVFLTPDAEPFYFGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQ-SGAFAIEQLSEALSASASSNKLPDEL 280
           YFPPE ++G P F+ +L  V  AW  +RD +A+ +G    +    +L           E+
Sbjct: 122 YFPPEPRHGSPSFQQVLEGVTAAWTDRRDEVAEVAGRIVADLAGRSLVHGGDGVPGESEV 181

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
            Q  L      L++ YD + GGFG APKFP  + ++ +L H  +   TG  G      +M
Sbjct: 182 AQALL-----GLTREYDEQHGGFGGAPKFPPAMVVEFLLRHYAR---TGAEG----ALQM 229

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
              T   MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  VY   +  T    
Sbjct: 230 AADTCTAMARGGIYDQLGGGFARYSVDREWIVPHFEKMLYDNALLCRVYAHLWRTTGSDE 289

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
              I  +  D++ R++    G   SA DADS + +G  +  EGA+YVWT  ++ ++LGE 
Sbjct: 290 ARRIALETADFMVRELRTAEGGFASALDADSEDADG--KHVEGAYYVWTPAQLREVLGED 347

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
              F   Y+           +++     +G +VL    D+         P++    + G 
Sbjct: 348 DGAFAAAYF----------GVTEDGTFEEGASVLRLPGDAG--------PVDA-ARVAG- 387

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R +L   R +RPRP  DDKV+ +WNGL I++ A                      DR +
Sbjct: 388 VRARLLAARDERPRPGRDDKVVAAWNGLAIAALAETGAYF----------------DRPD 431

Query: 581 YMEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLISGLLDLYEFGSG 638
            +E A  AA   +R HL   +  RL  + ++G      G L+DY  +  G L L      
Sbjct: 432 LVERATEAADLLVRVHL--GEVARLTRTSKDGRAGDNAGVLEDYGDVAEGFLALAAVTGE 489

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
             WL +A  L +   E F   EGG  ++T  +   ++ R ++  D A PSG + +   L+
Sbjct: 490 GAWLEFAGFLLDIVLEQFTG-EGGQLYDTAHDAEQLIRRPQDPTDSATPSGWTAAAGALL 548

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFET 725
              S  A + S+ +R  AE +L V + 
Sbjct: 549 ---SYAAYTGSEAHRTAAEGALGVVKA 572


>gi|427427562|ref|ZP_18917606.1| Thymidylate kinase [Caenispirillum salinarum AK4]
 gi|425883488|gb|EKV32164.1| Thymidylate kinase [Caenispirillum salinarum AK4]
          Length = 678

 Score =  386 bits (992), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 235/620 (37%), Positives = 321/620 (51%), Gaps = 64/620 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N+L  E SPYLLQHA NPV W  W + A  EA+    P+ LS+GY+ CHWCHVM  ESFE
Sbjct: 5   NQLGQETSPYLLQHADNPVHWRPWSQAALDEAKAAGKPVLLSVGYAACHWCHVMAHESFE 64

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D   A ++ND F++IKVDREERPDVD +YM+ +Q +   GGWPL++FL+PD +P  GGTY
Sbjct: 65  DAETAAVMNDLFINIKVDREERPDVDAIYMSALQLMGQRGGWPLTMFLTPDGEPFWGGTY 124

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP +  +GRPGFK +LR+V DA+ +  + ++ +    ++ L + L+   SS   P  L  
Sbjct: 125 FPKDSAFGRPGFKDVLRQVADAYHQSPEKVSNNTGALVDALRKGLNLPQSSEP-PAALAL 183

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             +   AE L+   D  +GG   APKFP       +    +    TG+     E    VL
Sbjct: 184 PVVDQLAESLAGHVDPEWGGLRGAPKFPVVFAFDALW---RSWHRTGR----QELHDAVL 236

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  + +GGI+DH+GGGF RYS D +W VPHFEKMLYD  QL ++    +  T+     
Sbjct: 237 LTLDRLCQGGIYDHLGGGFARYSTDAQWLVPHFEKMLYDNAQLIDLMTSVWQETRSPLLQ 296

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE--H 460
               + +D+L R+MI   G   S+ DAD   TEG    +EG FYVWT  E++ +LG    
Sbjct: 297 ARVEETVDWLEREMIAENGAFASSLDAD---TEG----EEGRFYVWTKDEIDRVLGTDAD 349

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL----IELNDSSASASKLGMPLEKYLN 516
           A LFK  Y ++P GN            ++GK VL     ++ D  A  +K          
Sbjct: 350 AALFKRAYDVRPGGN------------WEGKTVLNRNFSDVGDEPALETK---------- 387

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
            L   R  L   R KR  P  DDKV+  WNGL+I + ARA          A F  P    
Sbjct: 388 -LYRARMLLLRERDKRVMPGRDDKVLADWNGLMIHALARA---------GAAFGRP---- 433

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
              E++++A SA   IR  +      RL HSFR G  +    LDDYA +    L L++  
Sbjct: 434 ---EWVDLARSAYDGIRDTM-SRPGDRLGHSFRKGRLQDVAMLDDYANMARAALTLHQVT 489

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
               ++  A       D  + D   GGYF T  +   ++LR K   D A PSGN    + 
Sbjct: 490 GVADFIDHASRWVAVLDAEYWDDAAGGYFLTAADATDLILRTKSAQDNATPSGNGTMAVV 549

Query: 697 LVRLASIVAGSKSDYYRQNA 716
           L  L  +   +  + YR+ A
Sbjct: 550 LATLWHL---TGEERYRRRA 566


>gi|358457848|ref|ZP_09168063.1| N-acylglucosamine 2-epimerase [Frankia sp. CN3]
 gi|357078866|gb|EHI88310.1| N-acylglucosamine 2-epimerase [Frankia sp. CN3]
          Length = 673

 Score =  386 bits (991), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 250/661 (37%), Positives = 343/661 (51%), Gaps = 62/661 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA + SPYLLQHA NPVDW+ W   AFAEA  R VP+ LS+GY++CHWCHVM  ESFE
Sbjct: 3   NRLADQTSPYLLQHADNPVDWWPWEPAAFAEAASRQVPVLLSVGYASCHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+  A  +N+ FV+IKVDREERPDVD VYM    AL G GGWP++VFL+P  +P   GTY
Sbjct: 63  DDTTAAYMNEHFVNIKVDREERPDVDSVYMDVTMALTGHGGWPMTVFLTPTGEPFFAGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD-ELP 281
           FPP  + G   F+ +L  V  AWD +R+ +  SGA    +L+EA  A  +  + P   L 
Sbjct: 123 FPPTPRPGMGSFRQVLSAVSSAWDTRREEIESSGADIARKLAEAAEAPVAGGRGPAIRLD 182

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
              L    +QL+  +D R GGFG APKFP  +  +++L H  +   TG   E S G  MV
Sbjct: 183 GELLDTAVDQLAARFDPRHGGFGGAPKFPPSMVAELLLRHHAR---TGN--ERSLG--MV 235

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             T + MA+GGI+D + GGF RYSVD  W VPHFEKMLYD  QL  VYL  +  T D   
Sbjct: 236 ALTCERMARGGIYDQLTGGFARYSVDATWTVPHFEKMLYDNAQLLRVYLHLWRTTGDALA 295

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADS-----AETEGATRKK-EGAFYVWTSKEVED 455
           + + R+   +L  D+  P G   SA DAD+     ++T+G   +  EGA YVWT  ++ D
Sbjct: 296 ARVVRETAAFLLTDLRTPQGGFASALDADAVPPSDSDTDGHPHQPVEGASYVWTPGQLAD 355

Query: 456 ILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 514
            LG + A      + +  TG  +            G +VL    D   +           
Sbjct: 356 ALGPDDAAWAANLFEVTATGTFE-----------HGSSVLALPADPDDA----------- 393

Query: 515 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 574
            +     R  L   R+ RP+P  DDKV+ SWN            +       A+F  P  
Sbjct: 394 -DRFARVRATLAATRAARPQPARDDKVVASWN---------GLAVAALAEAGALFEEP-- 441

Query: 575 GSDRKEYMEVAESAASFIRR-HLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
                E++  AE AA  +R  HL D +  R     R GP+   G LDDY  +  G L L+
Sbjct: 442 -----EWVTAAERAAVLLRDVHLVDGRLRRTSRDGRVGPNV--GVLDDYGNVADGFLALH 494

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           +     +WL  A +L +     F   + GG+++T  + P++L R +E  D A PSG S  
Sbjct: 495 QVTGAVEWLELAGQLLDVARARFRAAD-GGFYDTADDAPTLLRRPREVSDSATPSGQSAF 553

Query: 694 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL-KDMAMAVPLMCCAADMLSVPSRKHV 752
              L+  A++   + S  +R++AE ++ +    L +D   A      A  +L+ P    V
Sbjct: 554 AGALLTYAAL---TGSAGHREDAEATIGLLAPLLARDARFAGHAGTVAEALLAGPPEVAV 610

Query: 753 V 753
           V
Sbjct: 611 V 611


>gi|383649966|ref|ZP_09960372.1| hypothetical protein SchaN1_31668 [Streptomyces chartreusis NRRL
           12338]
          Length = 677

 Score =  386 bits (991), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 250/663 (37%), Positives = 344/663 (51%), Gaps = 71/663 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYLLQHA NPVDW+ W  EAF EARKR+VP+ LSIGYS+CHWCHVM  ESFE
Sbjct: 3   NRLAHETSPYLLQHADNPVDWWPWSAEAFEEARKRNVPVLLSIGYSSCHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+  A+ LN  +VS+KVDREERPDVD VYM  VQA  G GGWP++VFL+PD +P   GTY
Sbjct: 63  DQQTAEYLNAHYVSVKVDREERPDVDAVYMEAVQAATGQGGWPMTVFLTPDAEPFYFGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EALSASASSNKLPDELP 281
           FPP  + G P F+ +L+ V  AW+++RD + +     +  L+   +S   +      EL 
Sbjct: 123 FPPAPRQGMPSFRQVLQGVHQAWEERRDEVTEVAGKIVRDLAGREISYGDAQTPGEQELA 182

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           Q  L      L++ YD + GGFG APKFP  + ++ +L H  +   TG  G      +M 
Sbjct: 183 QALL-----ALTREYDPQRGGFGGAPKFPPSMVLEFLLRHHAR---TGAEG----ALQMA 230

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  VY   +  T     
Sbjct: 231 QDTCERMARGGIYDQIGGGFARYSVDRDWIVPHFEKMLYDNALLCRVYAHLWRATGSEPA 290

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH- 460
             +  +  D++ R++    G   SA DADS   +G  +  EGA+YVWT  ++ ++LGE  
Sbjct: 291 RRVALETADFMVRELRTAEGGFASALDADS--DDGTGKHVEGAYYVWTPAQLREVLGEQD 348

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL---NI 517
           A L   ++ +   G  +  R                        S L +P +  L   + 
Sbjct: 349 AELAARYFGVTEEGTFEHGR------------------------SVLQLPQQDGLFDADR 384

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
           +   R +L   RS RP P  DDKV+ +WNGL I++ A            A F+ P     
Sbjct: 385 IASIRERLLAARSGRPAPGRDDKVVAAWNGLAIAALAET---------GAYFDRP----- 430

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAFLISGLLDLYEFG 636
                    +A   +R HL DEQ  RL  + ++G + A  G L+DYA +  G L L    
Sbjct: 431 -DLVEAALAAADLLVRLHL-DEQA-RLTRTSKDGHAGANAGVLEDYADVAEGFLALASVT 487

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
               WL +A  L +     F D E G  F+T  +   ++ R ++  D A PSG + +   
Sbjct: 488 GEGVWLEFAGFLLDHVLARFTDEESGALFDTAADAERLIRRPQDPTDNAAPSGWTAAAGA 547

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC---CAADMLSVPSRKHVV 753
           L+   S  A + S  +R  AE +L V    +K +   VP       AA   ++   + V 
Sbjct: 548 LL---SYAAHTGSQPHRTAAEKALGV----VKALGPRVPRFIGWGLAAAEAALDGPREVA 600

Query: 754 LVG 756
           +VG
Sbjct: 601 VVG 603


>gi|88604224|ref|YP_504402.1| hypothetical protein Mhun_2996 [Methanospirillum hungatei JF-1]
 gi|88189686|gb|ABD42683.1| protein of unknown function DUF255 [Methanospirillum hungatei JF-1]
          Length = 700

 Score =  386 bits (991), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 231/607 (38%), Positives = 310/607 (51%), Gaps = 53/607 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  EHSPYL  HAHNPVDW+ WG+EAFA A + D+P+F+SIGY+ CHWCHVME   FE
Sbjct: 6   NRLVKEHSPYLRHHAHNPVDWYPWGDEAFARALENDMPVFVSIGYAACHWCHVMETVCFE 65

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA LLN  FVS+KVDREERPD+D+VYM   QA+ G GGWPL VFL+PD +P    T+
Sbjct: 66  DEVVASLLNTHFVSVKVDREERPDIDQVYMAVCQAMTGSGGWPLHVFLTPDKRPFYAATF 125

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKL--PDEL 280
            P       PG   +L  +   W  +R+ ++       +Q+  A+        L  PDEL
Sbjct: 126 IPKMSSPNMPGMLDLLPYLASVWRDEREKVSDLS----DQIMSAIQEQTRRGTLHDPDEL 181

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
              A R    +L+  YD ++GGF  APKFP    +  +L ++   +D            M
Sbjct: 182 IHTAAR----RLTALYDKKYGGFSPAPKFPSVPVLLFLLRYAVIHQDRSI-------LDM 230

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
           +  TL  MA GG+ DH+ GGFHRY+ D  W +PHFEKMLYDQ   A +Y + + +TK   
Sbjct: 231 ITTTLNRMAWGGMRDHLDGGFHRYATDTAWKLPHFEKMLYDQAMCAIIYTEIWQVTKQDR 290

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
           Y  + R +L+Y+   +    G   S+EDADS          EGA+Y+W+  E+E I GE 
Sbjct: 291 YRRLARSVLEYMTTVLSDAPGGFSSSEDADSP-------GGEGAYYLWSYDEIEKIFGEE 343

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM--PLEKYLNIL 518
           A L    + +   GN     +S  H    G NVL    D     S  G+  P + Y +IL
Sbjct: 344 ARLVCTMFGITREGN-----VSGMHGMKPGDNVLFPERDPLEILSAAGVRDPEKTYASIL 398

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
                 L + R +R RP LDDKV+  WN L I + A A  +   E+              
Sbjct: 399 N----TLTNARKERERPPLDDKVLTDWNALAIQALAFAGMVFHDESLCTR---------- 444

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
                 A SAA F+  ++       L H +RNG     G   DY  L    + LY+    
Sbjct: 445 ------AISAAEFLFSNMVRPDGSVL-HRWRNGQGGIEGTAGDYVHLAWACVTLYQTTGN 497

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
           + WL  AI L+ +  + F D   GGYF    E   + +R+KE  DG   S N  + + L 
Sbjct: 498 SLWLRRAISLEKSASDRFYDSVHGGYFQVPSET-DLPVRMKEMTDGPTFSTNGAAYLLLC 556

Query: 699 RLASIVA 705
            L +I  
Sbjct: 557 ALFTITG 563


>gi|381163013|ref|ZP_09872243.1| thioredoxin domain-containing protein [Saccharomonospora azurea
           NA-128]
 gi|379254918|gb|EHY88844.1| thioredoxin domain-containing protein [Saccharomonospora azurea
           NA-128]
          Length = 667

 Score =  385 bits (990), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 246/681 (36%), Positives = 341/681 (50%), Gaps = 84/681 (12%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   SPYLLQHA NPVDW+ WG EA AEA++RDVPI LSIGY+ CHWCHVM  ESF 
Sbjct: 2   NRLATATSPYLLQHADNPVDWWPWGPEALAEAQRRDVPILLSIGYAACHWCHVMAHESFS 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA L+N+ FV+IKVDREERPD+D VYMT  QA+ G GGWP++ FL+PD KP   GTY
Sbjct: 62  DEDVAALMNEHFVNIKVDREERPDIDAVYMTATQAMTGQGGWPMTCFLTPDGKPFHCGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           +PP   +G P F+ +L  V  AW ++RD L +     ++ + E      +    P  +  
Sbjct: 122 YPPVPAHGMPSFRQLLDAVAQAWRERRDELVEGAGRIVDHIVE-----QTKPLGPHPVTA 176

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             +     +L    D   GGFG APKFP  + ++ +L H    E TG    + E   +V 
Sbjct: 177 ETVASAVSKLRTETDPGHGGFGGAPKFPPSMVLEFLLRH---YERTG----SVEALSIVD 229

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            T + MA+GGI+D + GGF RYSVD  W VPHFEKMLYD   L   Y      T      
Sbjct: 230 MTAEGMARGGIYDQLAGGFSRYSVDAGWVVPHFEKMLYDNALLLRFYAHLARRTGSALAH 289

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            +  +  ++L RD+  P G   S+ DAD+   EG T       YVWT +++ D+LG    
Sbjct: 290 RVAGETAEFLLRDLRTPQGAFASSLDADTEGVEGLT-------YVWTPQQLVDVLGPDDG 342

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE-----KYLNI 517
            +    +                       V +E       AS L +P +     +++ +
Sbjct: 343 AWAAATF----------------------GVTVE-GTFERGASTLRLPRDPDDPSRWMRV 379

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
                  L + R+ RP+P  DDKVI +WNGL I++ A A   L+                
Sbjct: 380 TA----TLLEARNARPQPARDDKVIAAWNGLAITALAEAGVALQ---------------- 419

Query: 578 RKEYMEVAESAASFI-RRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLISGLLDLYEF 635
           R E++E A +A +F+   H+ D    R   S R+G   +A G L+DYA L  GLL L++ 
Sbjct: 420 RPEWVEAAVAAGAFVLDAHVSDGTVLR---SSRDGVVGEAAGVLEDYACLADGLLSLHQA 476

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL-RVKEDHDGAEPSGNSVSV 694
               +WLV A  L +T    F      G F+ T  D   L+ R  +  D A PSG S   
Sbjct: 477 TGEPRWLVEATALLDTAMRRFGVEGAPGAFHDTASDAEELVHRPSDPTDNASPSGASALA 536

Query: 695 INLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP-----LMCCAADMLSVPSR 749
             L+  +++     +  YR   E ++    +R   +   VP      +  A  ML+ P +
Sbjct: 537 DALLTASALAGPEHAGTYRAACEEAV----SRAGALIAQVPRFAGHWLSVAEAMLAGPVQ 592

Query: 750 KHVVLVGHKSSVDFENMLAAA 770
             V +VG  +    E ++ AA
Sbjct: 593 --VAVVGEDAQARHELVVEAA 611


>gi|418461665|ref|ZP_13032732.1| thioredoxin domain-containing protein [Saccharomonospora azurea
           SZMC 14600]
 gi|359738246|gb|EHK87140.1| thioredoxin domain-containing protein [Saccharomonospora azurea
           SZMC 14600]
          Length = 667

 Score =  385 bits (990), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 246/681 (36%), Positives = 341/681 (50%), Gaps = 84/681 (12%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   SPYLLQHA NPVDW+ WG EA AEA++RDVPI LSIGY+ CHWCHVM  ESF 
Sbjct: 2   NRLATATSPYLLQHADNPVDWWPWGPEALAEAQRRDVPILLSIGYAACHWCHVMAHESFS 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA L+N+ FV+IKVDREERPD+D VYMT  QA+ G GGWP++ FL+PD KP   GTY
Sbjct: 62  DEDVAALMNEHFVNIKVDREERPDIDAVYMTATQAMTGQGGWPMTCFLTPDGKPFHCGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           +PP   +G P F+ +L  V  AW ++RD L +     ++ + E      +    P  +  
Sbjct: 122 YPPVPAHGMPSFRQLLDAVAQAWRERRDELVEGAGRIVDHIVE-----QTKPLGPHPVTA 176

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             +     +L    D   GGFG APKFP  + ++ +L H    E TG    + E   +V 
Sbjct: 177 ETVASAVSKLRTETDPGHGGFGGAPKFPPSMVLEFLLRH---YERTG----SVEALSIVD 229

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            T + MA+GGI+D + GGF RYSVD  W VPHFEKMLYD   L   Y      T      
Sbjct: 230 MTAEGMARGGIYDQLAGGFSRYSVDAGWVVPHFEKMLYDNALLLRFYAHLARRTGSALAH 289

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            +  +  ++L RD+  P G   S+ DAD+   EG T       YVWT +++ D+LG    
Sbjct: 290 RVAGETAEFLLRDLRTPQGAFASSLDADTEGVEGLT-------YVWTPQQLVDVLGPDDG 342

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE-----KYLNI 517
            +    +                       V +E       AS L +P +     +++ +
Sbjct: 343 AWAAATF----------------------GVTVE-GTFERGASTLRLPRDPDDPSRWMRV 379

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
                  L + R+ RP+P  DDKVI +WNGL I++ A A   L+                
Sbjct: 380 TA----TLLEARNARPQPARDDKVIAAWNGLAITALAEAGVALQ---------------- 419

Query: 578 RKEYMEVAESAASFI-RRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLISGLLDLYEF 635
           R E++E A +A +F+   H+ D    R   S R+G   +A G L+DYA L  GLL L++ 
Sbjct: 420 RPEWVEAAVAAGAFVLDAHVSDGTVLR---SSRDGVVGEAAGVLEDYACLADGLLSLHQA 476

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL-RVKEDHDGAEPSGNSVSV 694
               +WLV A  L +T    F      G F+ T  D   L+ R  +  D A PSG S   
Sbjct: 477 TGEPRWLVEATALLDTAMRRFGVEGAPGAFHDTASDAEELVHRPSDPTDNASPSGASALA 536

Query: 695 INLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP-----LMCCAADMLSVPSR 749
             L+  +++     +  YR   E ++    +R   +   VP      +  A  ML+ P +
Sbjct: 537 GALLTASALAGPEHAGTYRAACEEAV----SRAGALIAQVPRFAGHWLSVAEAMLAGPVQ 592

Query: 750 KHVVLVGHKSSVDFENMLAAA 770
             V +VG  +    E ++ AA
Sbjct: 593 --VAVVGEDAQARHELVVEAA 611


>gi|255033843|ref|YP_003084464.1| hypothetical protein Dfer_0027 [Dyadobacter fermentans DSM 18053]
 gi|254946599|gb|ACT91299.1| protein of unknown function DUF255 [Dyadobacter fermentans DSM
           18053]
          Length = 671

 Score =  385 bits (990), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 231/609 (37%), Positives = 321/609 (52%), Gaps = 51/609 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL+ + SPYLLQHAHNPVDW+ WGEEA ++A+  + PI +SIGYS CHWCHVME E FE
Sbjct: 2   NRLSEQTSPYLLQHAHNPVDWYPWGEEALSKAKNENKPILVSIGYSACHWCHVMERECFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
            E +A+++N +FV IKVDREERPDVD VYM  VQA+   GGWPL+VFL PD KP  G TY
Sbjct: 62  KEPIAEVMNAYFVCIKVDREERPDVDAVYMDAVQAMGVRGGWPLNVFLLPDSKPFYGVTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
            PP++      +  +L+ +  A+    D LA S    ++ +  + S      +       
Sbjct: 122 LPPQN------WVQLLKSINQAFTNHFDELADSAEGFVQNMIASESQKYGLVEGTVHFNA 175

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
           + L +  EQ+ + +D++ GG   APKF  P   + +L    +  D  ++ EA      V 
Sbjct: 176 DDLDVMFEQIQRHFDTQKGGMDRAPKFMMPSIYKFLL----RYFDVSQNPEA---LAQVE 228

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            +L  +A GGI+DHVGGG+ RYSVDE W +PHFEKMLYD  QL +VY +A+SLT++  Y+
Sbjct: 229 LSLNRIALGGIYDHVGGGWARYSVDEDWFIPHFEKMLYDNAQLLSVYAEAYSLTQNPLYA 288

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
                 + +L  +M    G  FSA DADS   EG     EG FY+WT +E++ +LGE   
Sbjct: 289 SRIEQTIQWLSAEMRSADGGFFSALDADS---EGI----EGKFYIWTQQELQSVLGEDFD 341

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
            F + Y +   GN +            G N L        +A   G+  + +        
Sbjct: 342 WFSKLYNISAQGNWE-----------HGYNHLHLTEPVEHAAKTAGILTDDFAGRYENAV 390

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYM 582
            KL + R +R RP LDDK++ SWNGL+I       + L  E                E  
Sbjct: 391 TKLAEKRRERVRPGLDDKILASWNGLLIKGLTDCYRALGHE----------------EIR 434

Query: 583 EVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWL 642
           E+A     FI   +      +L HSF+NG +   GFL+DYA +I G L LY+      WL
Sbjct: 435 ELAIGTGHFIAGKM--TTGSKLNHSFKNGVATVTGFLEDYAAVIEGYLGLYQITFEEDWL 492

Query: 643 VWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLAS 702
             A +L       F D+  G +  T     +++ R KE  D   P+ NS+   NL  L  
Sbjct: 493 QKAQQLTEYALSNFYDQSEGFFHFTDAYGEALIARKKELFDNVIPASNSIMAQNLYTLGK 552

Query: 703 IVAGSKSDY 711
           ++   + DY
Sbjct: 553 ML--DRDDY 559


>gi|30248134|ref|NP_840204.1| hypothetical protein NE0103 [Nitrosomonas europaea ATCC 19718]
 gi|30180019|emb|CAD84014.1| putative similar to unknown proteins [Nitrosomonas europaea ATCC
           19718]
          Length = 689

 Score =  385 bits (990), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 245/682 (35%), Positives = 355/682 (52%), Gaps = 56/682 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N LA E SPYLLQHA NPVDW+ WGEEA   AR  D PI LSIGYS CHWCHVM  ESFE
Sbjct: 3   NHLAGETSPYLLQHAENPVDWYPWGEEALEIARMLDKPILLSIGYSACHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQAL-YGGGGWPLSVFLSPDLKPLMGGT 221
           D  VA  +N+ FV+IKVDREERPD+D++Y +    L +  GGWPL++FL+P+ KP  GGT
Sbjct: 63  DAQVATAMNEHFVNIKVDREERPDIDQIYQSAHYTLNHRSGGWPLTMFLTPEQKPFFGGT 122

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP E +Y  PGF  +L KV + +  ++  + +  A  ++ L+++L A  +       L 
Sbjct: 123 YFPKEARYSMPGFLELLPKVAELYRTRKTDIEKQNAVLLKLLAQSLPAPDTR---ASALS 179

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           +  +    EQL++ +D   GGFG APKF  P E+Q  L       DT           +V
Sbjct: 180 RQPIDRAWEQLNRLFDETDGGFGDAPKFLHPAELQFCLRRYVTDNDT-------RALHVV 232

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             TL+ MA+GG++D +GGGF RYS D  W +PHFEKMLYD   +  +Y + + +T +  +
Sbjct: 233 THTLEKMAQGGLYDQLGGGFCRYSTDHSWQIPHFEKMLYDNALMLPLYAETWLVTGNPLF 292

Query: 402 SYICRDILDYLRRDM---IGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             +  +   ++ R+M   I   G  FS+ DADS         +EG FYVW  + V  IL 
Sbjct: 293 KQVVEETAAWVIREMQSGIDGEGGYFSSLDADS-------EHEEGKFYVWDRQAVSAILT 345

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
                    YY       D S   + H+        IE       A++  +  E    ++
Sbjct: 346 PEEYRVTAAYY-----GLDRSPNFENHHWHLAVTESIE-----TVAARHQISQEAVQQLI 395

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
              RRKL + R +R RP  D+K++ SWN L+I    RA +I                 +R
Sbjct: 396 DSARRKLLNEREQRIRPGRDEKILTSWNALMIKGMTRAGQIF----------------ER 439

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
           +E++  A  A  FIR  L+  Q  RL  +F++  +    +LDD+AFL+  LL L +    
Sbjct: 440 EEWISSAVRALDFIRSRLW--QNDRLLATFKDDKAHLNAYLDDHAFLLDSLLTLLQADFR 497

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
              L +AI L +     F D+  GG+F T+ +  +++ R K  HDGA P+GN ++   L 
Sbjct: 498 QTDLDFAITLADVLLTRFEDKTSGGFFFTSHDHETLIHRPKTGHDGAIPAGNGIAATTLQ 557

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHK 758
           RL  ++   +   Y + AE +L VF + L   A +   +    +    P+ K V+L G++
Sbjct: 558 RLGHLLNEQR---YLEAAERTLNVFSSGLSLHASSHCSLLITLEEFLEPT-KTVILHGNR 613

Query: 759 SSVDFENMLAAAHASYDLNKTV 780
             +    +   A   Y L+K V
Sbjct: 614 PEL---QIWLKALLPYSLDKIV 632


>gi|302542885|ref|ZP_07295227.1| conserved hypothetical protein [Streptomyces hygroscopicus ATCC
           53653]
 gi|302460503|gb|EFL23596.1| conserved hypothetical protein [Streptomyces himastatinicus ATCC
           53653]
          Length = 678

 Score =  385 bits (989), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 245/637 (38%), Positives = 330/637 (51%), Gaps = 62/637 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYLLQHA NPVDW  W +EAF EAR R VP+ LS+GYS+CHWCHVM  ESFE
Sbjct: 3   NRLAHETSPYLLQHADNPVDWRPWSDEAFEEARNRGVPVLLSVGYSSCHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D   A+ LN  FVS+KVDREERPDVD VYM  VQA  G GGWP++VFL+PD +P   GTY
Sbjct: 63  DAETAEYLNAHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMTVFLTPDAQPFYFGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPP  + G P F+ +L  V+ AW  +RD +       +E L+     +  S       P 
Sbjct: 123 FPPRPRPGMPSFRQVLEGVRAAWADRRDEVRDVAGKIVEDLAGRTGIALGSGA---PQPP 179

Query: 283 NALRLCAE--QLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
            A  L A    L++ +D+  GGFG APKFP  + ++ +L H  +   TG  G      +M
Sbjct: 180 GAEDLAAGLMGLTREFDAVRGGFGGAPKFPPSMALEFLLRHHAR---TGSEG----ALQM 232

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
           V  T + MA+GGI+D +GGGF RY+VD  W VPHFEKMLYD   L  VY   +  T    
Sbjct: 233 VQATCEAMARGGIYDQLGGGFARYAVDAEWIVPHFEKMLYDNALLCRVYAHLWRATGSDL 292

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
              +  +  D+L R+M    G   SA DADS   +G  R  EGA+YVWT +++ + LGE 
Sbjct: 293 ARRVALETADFLVREMRTEQGGFASALDADS--DDGTGRHVEGAYYVWTPEQLREALGEA 350

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
                  Y+           +++     KG +VL +L D +  A             L  
Sbjct: 351 DAEQAAAYF----------GVTEEGTFEKGASVL-QLPDGARPADA---------AQLAS 390

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R +L   R +R RP  DDK++ +WNGL I++ A                      DR +
Sbjct: 391 VRERLLAARERRERPGRDDKIVAAWNGLAIAALAETGAYF----------------DRPD 434

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAFLISGLLDLYEFGSGT 639
            +E A  AA  + R L+ +   RL  +   G   A  G L+DYA +  G L L       
Sbjct: 435 LVEAATEAADLLVR-LHMDNGGRLARTSLGGAVGAHAGVLEDYADVAEGFLALSAVSGEG 493

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSGNSVSVINLV 698
            W+ +A  L +T    F   +G  Y   T +D   L+R  +D  D A PSG + +   L+
Sbjct: 494 VWVDFAGLLLDTVLHHFAAEDGTLY--DTADDAEALIRRPQDPTDNAVPSGWTAAAGALL 551

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 735
             A++   S S  +R+ AE +L V    ++ +A  VP
Sbjct: 552 SYAAV---SGSGRHREAAERALGV----VRALAGRVP 581


>gi|453051421|gb|EME98928.1| hypothetical protein H340_19073 [Streptomyces mobaraensis NBRC
           13819 = DSM 40847]
          Length = 680

 Score =  385 bits (988), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 238/623 (38%), Positives = 333/623 (53%), Gaps = 58/623 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYLLQHA NPVDW+ W  EAFAEAR+RDVP+ LS+GYS+CHWCHVM  ESFE
Sbjct: 3   NRLAHETSPYLLQHADNPVDWWPWSPEAFAEARRRDVPVLLSVGYSSCHWCHVMAGESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE  A  LN+ FVS+KVDREERPD+D VYM  VQA  G GGWP++VFL+PD +P   GTY
Sbjct: 63  DEETAAYLNEHFVSVKVDREERPDIDAVYMEAVQAATGQGGWPMTVFLTPDAEPFYFGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPP  ++G P F+ +L  V  AW  +R+ + +     ++ L+     +A   + P     
Sbjct: 123 FPPAPRHGMPSFRQVLEGVAAAWRDRREEVGEVAGRIVQDLARRPLTAAVGGQPP---AA 179

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
           + L +    L++ +D+  GGFG APKFP  + ++ +L H  +   TG +        MV 
Sbjct: 180 DELHMALMALTREFDAVRGGFGGAPKFPPSMVLEFLLRHHVR---TGSAA----ALDMVT 232

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            T + MA+GGIHD +GGGF RYSVD  W VPHFEKMLYD   L  VY   +  T      
Sbjct: 233 ATCEAMARGGIHDQLGGGFARYSVDNGWVVPHFEKMLYDNALLCRVYAHLWRATGSGLAR 292

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE-HA 461
            +  D  D+L R+M    G   SA DADS + +G  R +EGA+YVWT ++  ++LGE  A
Sbjct: 293 RVALDTADFLVREMRTDQGGFASALDADSDDGQG--RHREGAYYVWTPEQFREVLGEADA 350

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI--LG 519
            L  +++ +   G             F+    +++L DS           E+ ++   + 
Sbjct: 351 ELAADYFGVTEEGT------------FEEGASVLQLPDS-----------ERLVDAERIA 387

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
             R +L   R++RPRP  DDKV+  WNGL I++ A                      DR 
Sbjct: 388 SVRERLLAARARRPRPGRDDKVVAGWNGLAIAALAETGAYF----------------DRP 431

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
           + ++ A  AA  + R   D      + S         G L+DYA +  G L L       
Sbjct: 432 DLVQAATDAADLLVRTHMDWNARLFRTSLDGVAGGHAGVLEDYADVAEGFLALSAVTGEG 491

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
            W+ +A  L +T    F D E G  F+T  +  +++ R ++  D A PSG S +   L+ 
Sbjct: 492 VWVDFAGLLLDTVLIRFRDEE-GALFDTADDAETLIRRPQDPTDNATPSGWSAAAGALLT 550

Query: 700 LASIVAGSKSDYYRQNAEHSLAV 722
            A++   + S  +R+ AE +L V
Sbjct: 551 YAAL---TGSAPHREAAERALGV 570


>gi|149279373|ref|ZP_01885504.1| hypothetical protein PBAL39_13682 [Pedobacter sp. BAL39]
 gi|149229899|gb|EDM35287.1| hypothetical protein PBAL39_13682 [Pedobacter sp. BAL39]
          Length = 674

 Score =  385 bits (988), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 228/630 (36%), Positives = 324/630 (51%), Gaps = 52/630 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           N   N+L    SPYLLQHA+NPV W  WG EA  +A++ +  I +SIGYS CHWCHVME 
Sbjct: 2   NPQPNKLINASSPYLLQHAYNPVQWQEWGLEALEQAKRENKLILVSIGYSACHWCHVMER 61

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFE+  VA ++N  +V IKVDREERPD+D++YM  +Q + G GGWPL+    PD +P+ 
Sbjct: 62  ESFENHEVAAVMNQHYVCIKVDREERPDIDQIYMLAIQLMTGSGGWPLNCICLPDQRPVY 121

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
           GGTYF  +D      + +IL  V   W  + D   Q      + +  A     +  K P 
Sbjct: 122 GGTYFKKDD------WTSILENVAALWLHEPDKALQYADRLTDGIRNAEKIIPNEKKEPY 175

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
                 LR   +   +  D   GG+  APKFP P   Q +L +S    D           
Sbjct: 176 NYTH--LREITDPWKRELDMTDGGYNRAPKFPMPNNWQFLLRYSLLTGDNAT-------H 226

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
              L +L+ MA GGI+D +GGGF RYSVD RWHVPHFEKMLYD  Q+  +Y +A+  T+ 
Sbjct: 227 VATLLSLEKMALGGIYDQIGGGFARYSVDGRWHVPHFEKMLYDNAQMIALYAEAYQYTQL 286

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             ++ +  + + ++ R+M  P G  ++A DADS   EG     EG FYVW  +E E +  
Sbjct: 287 PLFNSVVAETIGWMAREMRSPEGLFYAALDADS---EGV----EGKFYVWDEEEFEVVTQ 339

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
              +L K +Y +  +GN           E +  N+L+        A++ G+ LE+    +
Sbjct: 340 GDHLLMKAYYQVTSSGNW----------EEEETNILMRRFADEDFAAQQGITLEELDLKV 389

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
              R KL + RSKR  P LDDK +++WN + I   A  + +                  R
Sbjct: 390 SAAREKLLEHRSKRVTPALDDKCLLAWNAMAIKGLASCASVF----------------GR 433

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
           ++Y E+A +AA FI + +  EQ  RL  +F+NG +   GFLDDYAF I  L+ LY++   
Sbjct: 434 QDYYEMARTAADFILQPM-QEQDGRLYRNFKNGKATISGFLDDYAFFIDALIALYQYDFD 492

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
            +WL+ A +   T    F D +   +F T     S++ R  E  D   P+ NSV   NL 
Sbjct: 493 EQWLLEARKYAETVLGQFADPDSPMFFYTPSGAESLIARKHELMDNVIPASNSVMAQNLH 552

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFETRLK 728
            L  +      D Y + A   LA  + ++K
Sbjct: 553 LLGLLF---DDDSYTERASAMLAAIQPQIK 579


>gi|354559793|ref|ZP_08979037.1| hypothetical protein DesmeDRAFT_2750 [Desulfitobacterium
           metallireducens DSM 15288]
 gi|353540319|gb|EHC09795.1| hypothetical protein DesmeDRAFT_2750 [Desulfitobacterium
           metallireducens DSM 15288]
          Length = 653

 Score =  385 bits (988), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 240/628 (38%), Positives = 341/628 (54%), Gaps = 79/628 (12%)

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           ME ESFED  VA+LLN  F++IKVDREERPD+D +YM + QAL G GGWPL++ ++P+ +
Sbjct: 1   MERESFEDTEVAELLNRSFLAIKVDREERPDIDHLYMEFCQALTGSGGWPLTILMTPEKQ 60

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL-------SEALS 268
           P   GTYFP    YGRPG   +L ++ + WDK  + L +S    ++ +       SE ++
Sbjct: 61  PFFTGTYFPKSSHYGRPGLIDLLSQISELWDKDENKLRKSAEEIVKAITSHQKRSSEEVN 120

Query: 269 ------------------ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFP 310
                             ASA      +EL + +     + L +++DSR+GGFG APKFP
Sbjct: 121 PVEVHALQGFLNVQNGGDASADFQSWANELIEQSY----QALIQNFDSRYGGFGQAPKFP 176

Query: 311 RPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERW 370
            P  +  +L ++K   D       S+ + M+   L  M +GGI+DH+G GF RYS D++W
Sbjct: 177 SPHNLTFLLRYAKDHPD-------SQAEAMIRKNLDTMGQGGIYDHIGFGFARYSTDQQW 229

Query: 371 HVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDAD 430
            VPHFEKMLYD   LA  Y++A+   K+   +   ++IL Y+ RDM  P G  +SAEDAD
Sbjct: 230 LVPHFEKMLYDNALLAIAYIEAYQSQKEPRDAQKAQEILTYVLRDMTSPEGGFYSAEDAD 289

Query: 431 SAETEGATRKKEGAFYVWTSKEVEDILGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFK 489
           S   EG     EG FYVWT +E+  +LGE  + LF + + + P GN            F+
Sbjct: 290 S---EGI----EGKFYVWTPEEITSVLGEKRSALFCDVFNITPEGN------------FE 330

Query: 490 GKNVLIELN-DSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGL 548
           GK++   L+ D    A K  +  E    IL E R KL+  R  R  PH DDK++ SWNGL
Sbjct: 331 GKSIPNRLSGDIGELARKHHLNPETLNYILEEDRLKLWQSREHRIHPHKDDKILTSWNGL 390

Query: 549 VISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSF 608
           +I + A+  ++         FN      D K Y+  AE AA F+  +LY  +  RL   F
Sbjct: 391 MIVALAKGGQV---------FN------DNK-YILAAEQAAHFVLENLYPNE--RLLARF 432

Query: 609 RNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTT 668
           R+G +   G+LDDYAF I GLL+LY     + +L  A+ LQ   + LF D E GGY+ T 
Sbjct: 433 RDGNAAYLGYLDDYAFFIWGLLELYTASGKSDYLKSALSLQEQLETLFKDEEAGGYYLTG 492

Query: 669 GEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLK 728
            +   +LLR KE +DGA PSGNS++ +NL+ LA +    +   ++  AE  L  F + L 
Sbjct: 493 SDGEELLLRPKEIYDGALPSGNSITALNLLHLARLTGDER---WKLQAEKQLLSFRSTLT 549

Query: 729 DMAMAVPLMCCAADMLSVPSRKHVVLVG 756
                      A      PS++ ++LVG
Sbjct: 550 SNPAGYTAFLQALQYALHPSQE-LLLVG 576


>gi|209966075|ref|YP_002298990.1| hypothetical protein RC1_2806 [Rhodospirillum centenum SW]
 gi|209959541|gb|ACJ00178.1| conserved hypothetical protein [Rhodospirillum centenum SW]
          Length = 688

 Score =  385 bits (988), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 242/679 (35%), Positives = 349/679 (51%), Gaps = 71/679 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L  E SPYLLQH  NPV W  WG  AFA AR    P+ LS+GY+ CHWCHVM  ESFE
Sbjct: 6   NLLGQETSPYLLQHKDNPVHWMPWGPAAFARARAEGKPVLLSVGYAACHWCHVMAHESFE 65

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  +A ++ND FV++KVDREERPDVD++Y + +  L   GGWPL++FL+P+ +P  GGTY
Sbjct: 66  DPTIAAMMNDLFVNVKVDREERPDVDQIYQSALGLLGQQGGWPLTMFLTPEGEPFWGGTY 125

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAF---AIEQLSEALSASASSNKLPDE 279
           FPPE ++GRPGF  +L  V   + ++ D + ++      A+ +L++    +     L DE
Sbjct: 126 FPPERRWGRPGFPDVLLGVSTTYRQEPDKVVRNTTALKDALHRLAQNRPGAGVDVDLLDE 185

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
           +        A +L +  D   GG GSAPKFP+   ++++    K+   TG+     + + 
Sbjct: 186 V--------AARLVQEVDPVHGGIGSAPKFPQTGIVELLWRAWKR---TGR----EDCRA 230

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
            V+ TL  M++GGI+DH+GGG+ RYS D+ W VPHFEKMLYD  QL ++    +  T+D 
Sbjct: 231 AVVTTLTQMSQGGIYDHLGGGYARYSTDQEWLVPHFEKMLYDNAQLIDLLTTVWQDTRDP 290

Query: 400 FYSYICRDILDYLRRDMIG----PGGEIFSAE-DADSAETEGATRKKEGAFYVWTSKEVE 454
            +    R+ + ++ R+M+     P G  F+A  DADS   EG    +EG FYVWT  EV+
Sbjct: 291 LFEARVRETVGWVLREMVSEPGRPVGGGFAATLDADS---EG----EEGRFYVWTWAEVD 343

Query: 455 DILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 514
            +LG+ A  F   Y +   GN            ++G  +L  L          G P E+ 
Sbjct: 344 RLLGDRAETFARAYDVTERGN------------WEGTTILNRLKRPEP-----GTPAEE- 385

Query: 515 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 574
              L E R  LF  R  R RP  DDKV+  WNGL+I++ ARA  +               
Sbjct: 386 -GALAEMRAVLFQARGARVRPGWDDKVLADWNGLMIAALARAGAVF-------------- 430

Query: 575 GSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE 634
             D  +++  A  A  F+R H+ D    RL HS+R G  +  G LDD A +    L L+E
Sbjct: 431 --DEPDWIAAARRAYDFVRTHMQDAD-GRLWHSWRAGTLRHRGTLDDQAAMARAALALFE 487

Query: 635 FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
                  +  A       D  F D E GGYF T  +   +++R +   D A PSGN   +
Sbjct: 488 VTGDGTCVEQARRWAAVADAQFWDTESGGYFLTAADATDLIVRPRNAQDNAVPSGNGTML 547

Query: 695 INLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVL 754
             L RL  I   +  + +R+ A+  +  F    +      PL     ++  +     VV+
Sbjct: 548 GVLARLWLI---TGEEGWRRRADALVTAFGG--EPGRNFFPLATFLNNVELLHRAVQVVV 602

Query: 755 VGHKSSVDFENMLAAAHAS 773
            G  ++ D   +L A H +
Sbjct: 603 AGDPAAADTGALLRAVHGA 621


>gi|411002310|ref|ZP_11378639.1| hypothetical protein SgloC_05852 [Streptomyces globisporus C-1027]
          Length = 673

 Score =  384 bits (987), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 240/627 (38%), Positives = 328/627 (52%), Gaps = 59/627 (9%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRLA   SPYLLQHA NPVDW+ W  EAF EARKRDVP+ LS+GY++CHWCHVM  ESF
Sbjct: 2   ANRLAQTTSPYLLQHADNPVDWWPWSPEAFEEARKRDVPVLLSVGYASCHWCHVMAHESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           ED+  A  LN  FV +KVDREERPDVD VYM  VQA  G GGWP++VFL+PD +P   GT
Sbjct: 62  EDDDTAAYLNAHFVPVKVDREERPDVDAVYMEAVQAATGHGGWPMTVFLTPDAEPFYFGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQ-SGAFAIEQLSEALSASASSNKLPDEL 280
           YFPPE ++G P F+ +L  V  AW  +R+ +A+ +G    +    +L           E+
Sbjct: 122 YFPPEPRHGSPSFQQVLEGVTTAWTDRREEVAEVAGRIVADLAGRSLVHGGDGVPGESEV 181

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
            Q  L      L++ YD + GGFG APKFP  + ++ +L H  +   TG  G      +M
Sbjct: 182 AQALL-----GLTREYDEQHGGFGGAPKFPPAMAVEFLLRHYAR---TGAEG----ALQM 229

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
              T   MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  VY   +  T    
Sbjct: 230 AADTCAAMARGGIYDQLGGGFARYSVDREWIVPHFEKMLYDNALLCRVYAHLWRATGSDE 289

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
              I     D++ R++    G   SA DADS + EG  R  EGAFYVWT +++ ++LGE 
Sbjct: 290 ARRIALKTADFMVRELRTAEGGFASALDADSEDAEG--RHVEGAFYVWTPEQLREVLGED 347

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
              F   Y+           +++     +G +VL    D+         P++    + G 
Sbjct: 348 DAAFAAAYF----------GVTEEGTFEEGASVLRLPGDTG--------PVDA-ARVAG- 387

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R +L   R +RP P  DDKV+ +WNGL I++ A                      DR +
Sbjct: 388 VRARLLAARDERPHPGRDDKVVAAWNGLAIAALAETGAYF----------------DRPD 431

Query: 581 YMEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLISGLLDLYEFGSG 638
            +E A  AA   +R HL   +  RL  + ++G      G L+DY  +  G L L      
Sbjct: 432 LVERATEAADLLVRVHL--GEVARLTRTSKDGRAGDNAGVLEDYGDVAEGFLALAAVTGE 489

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
             WL +A  L +   E F   EGG  ++T  +   ++ R ++  D A PSG + +   L+
Sbjct: 490 GAWLEFAGFLLDIVLEQFTG-EGGQLYDTAHDAEQLIRRPQDPTDSATPSGWTAAAGALL 548

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFET 725
              S  A + S+ +R  AE +L V + 
Sbjct: 549 ---SYAAYTGSEAHRTAAEGALGVVKA 572


>gi|302519353|ref|ZP_07271695.1| transmembrane protein [Streptomyces sp. SPB78]
 gi|302428248|gb|EFL00064.1| transmembrane protein [Streptomyces sp. SPB78]
          Length = 578

 Score =  384 bits (986), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 244/626 (38%), Positives = 331/626 (52%), Gaps = 60/626 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYLLQHA NPVDW+ W ++A  EA +RD PI LS+GYS+CHWCHVM  ESFE
Sbjct: 2   NRLAHEQSPYLLQHASNPVDWWPWSQQAKEEAERRDTPILLSVGYSSCHWCHVMARESFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D   A  +N  FV +KVDREERPDVD VYM  VQA  G GGWP++VFL+P  +P   GTY
Sbjct: 62  DAETAAYMNAHFVCVKVDREERPDVDAVYMEAVQAATGHGGWPMTVFLTPGGEPFYFGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EALSASA-SSNKLPDEL 280
           FPP   +G P F+ +L  V+ AW  +R+ +A   A     L+  AL   A +S   PD L
Sbjct: 122 FPPRPLHGTPAFRQVLEGVRAAWADRREEVADVAARVTADLTGRALGLPADASPPGPDAL 181

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
               L      L++ YDSR GGFG APKFP  + ++ +L H  +   TG  G      +M
Sbjct: 182 GAALL-----GLTRDYDSRHGGFGGAPKFPPVMVLEFLLRHHAR---TGAEG----ALQM 229

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
              T + MA+GGI+D +GGGF RY+VD  W VPHFEKML D   L   Y   +  T    
Sbjct: 230 AADTAEHMARGGIYDQLGGGFARYAVDREWIVPHFEKMLSDNALLCRFYAHLWRATGSAL 289

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE- 459
              +  +  D+L R++  P G   SA DADS   +G  R  EGA YVWT +++ ++LGE 
Sbjct: 290 ARRVALETADFLVRELRTPEGGFASALDADS--DDGTGRHVEGASYVWTPEQLREVLGED 347

Query: 460 HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
            A L   HY + P G             F+  + ++ L  +  S S    P++     L 
Sbjct: 348 DAALAAAHYGVTPEGT------------FEHGSSVLRLPRTDGSDSP---PVDAAR--LD 390

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
             RR L   R +RP P  DDKV+ +WNGL I++ A                      DR 
Sbjct: 391 RIRRALLAARDERPAPGRDDKVVAAWNGLAIAALAETGAYF----------------DRP 434

Query: 580 EYMEVAESAAS-FIRRHLYDEQTH-RLQHSFRNGPSKA-PGFLDDYAFLISGLLDLYEFG 636
           + +E A  AA   +R HL    TH RL  + R+G +    G L+DYA +  G L L    
Sbjct: 435 DLVEAALGAADLLVRVHL---DTHGRLSRTSRDGRTGTNTGVLEDYADVAEGFLTLASVT 491

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
               W  +A  L +   + F D + G  ++T  +  +++ R ++  D A PSG + +   
Sbjct: 492 GEGVWTDFAGLLLDHVLDRFRD-DSGALYDTAADAETLIHRPQDPTDNATPSGWNAAAGA 550

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAV 722
           L+  A++ AGS    +R  +E  L+V
Sbjct: 551 LLTYAAL-AGSTP--HRAASEQGLSV 573


>gi|269125325|ref|YP_003298695.1| hypothetical protein Tcur_1071 [Thermomonospora curvata DSM 43183]
 gi|268310283|gb|ACY96657.1| protein of unknown function DUF255 [Thermomonospora curvata DSM
           43183]
          Length = 662

 Score =  384 bits (986), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 239/611 (39%), Positives = 318/611 (52%), Gaps = 74/611 (12%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL    SPYLLQHA NPVDW+ WGE AFAEAR+RDVPI LS+GY+ CHWCHVM  ESFE
Sbjct: 2   NRLKNATSPYLLQHADNPVDWWEWGEAAFAEARRRDVPILLSVGYAACHWCHVMAHESFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE  A+L+ND FV+IKVDREERPDVD VYM   QA+ G GGWP++VF +PD +P   GTY
Sbjct: 62  DEATARLMNDLFVNIKVDREERPDVDAVYMEATQAMTGQGGWPMTVFATPDGEPFYCGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP      R  F+ +L  V  AW ++R+ + + G   +E L+    A   +     E   
Sbjct: 122 FP------RQQFRALLMAVARAWREEREDVLKQGRKVVEALTARGPAPGETEPPSPERLS 175

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
            A+R     L+ SYD+ +GGFG APKFP  + ++ +L H  + +D       ++   M  
Sbjct: 176 AAVR----SLAASYDTAYGGFGGAPKFPPSMVLEFLLRHYARTQD-------AQALAMAT 224

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL+ MA+GGI+D +GGGF RYSVDE W VPHFEKMLYD   LA VY   + LT      
Sbjct: 225 GTLEAMARGGIYDQLGGGFARYSVDEAWVVPHFEKMLYDNALLARVYAHWWRLTGSPLAK 284

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            I  +  +++ RD+  P G + SA DADS   EG    +EG +YVWT +++  +LGE   
Sbjct: 285 RIALETCEWMLRDLRTPQGGLASALDADS---EG----QEGKYYVWTPEQLRRVLGEA-- 335

Query: 463 LFKEHYYLKPTGN--CDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
                      GN   +L  +++      G +VL    D                     
Sbjct: 336 ----------DGNAAAELLGVTESGTFEHGTSVLRLPGDPGDQ------------EWWSR 373

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R +L   R++R  P  DDKV+ +WNGL I++ A    +L                 R +
Sbjct: 374 VRARLLAARAERVPPARDDKVVTAWNGLAIAALAECGALLG----------------RPD 417

Query: 581 YMEVAESAASFIRR-HLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLISGLLDLYEFGSG 638
            +  AE  A  +R  HL D    RL  + R+G P    G L+DYA    GLL L+     
Sbjct: 418 LVGAAEEIARLLREVHLRD---GRLTRTSRDGVPGANAGVLEDYADFAEGLLALHAVTGD 474

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSGNSVSVINL 697
              +  A  L  T    F D  GG  F  T +D   L R  +D  D A PSG   +   L
Sbjct: 475 PAHVRLAGTLLETVLTHFPDDRGG--FYDTADDAERLFRRPQDPTDNATPSGQFAAAGAL 532

Query: 698 VRLASIVAGSK 708
           +  A++   S+
Sbjct: 533 LSYAALTGSSR 543


>gi|294631112|ref|ZP_06709672.1| conserved hypothetical protein [Streptomyces sp. e14]
 gi|292834445|gb|EFF92794.1| conserved hypothetical protein [Streptomyces sp. e14]
          Length = 676

 Score =  384 bits (986), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 237/629 (37%), Positives = 323/629 (51%), Gaps = 64/629 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   SPYLLQHA NPVDW+ W  EAF EAR+RDVP+ LS+GYS CHWCHVM  ESFE
Sbjct: 2   NRLAGVTSPYLLQHADNPVDWWPWSPEAFEEARRRDVPVLLSVGYSACHWCHVMAHESFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+  A  LN+ FVS+KVDREERPDVD VYM  VQA  G GGWP++VFL+PD +P   GTY
Sbjct: 62  DQATAGYLNEHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMTVFLTPDAEPFYFGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLP--DEL 280
           FPP  ++G P F+ +L  V+ AW  +RD + +     +  L++         +LP  +EL
Sbjct: 122 FPPAPRHGMPSFRQVLEGVRQAWATRRDEVTEVAGKIVRDLAQ-REIGYGGVQLPGEEEL 180

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
            Q  L      L++ YD + GGFG APKFP  + ++ +L H  +   TG  G      +M
Sbjct: 181 AQALL-----GLTREYDPQRGGFGGAPKFPPSMVLEFLLRHHAR---TGSEG----ALQM 228

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
              T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  VY   +  T    
Sbjct: 229 ARDTCERMARGGIYDQLGGGFARYSVDRDWIVPHFEKMLYDNALLCRVYAHLWRATGSEL 288

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
              +  +  D++ R++    G   SA DADS   +G  R  EGA+YVWT +++ D LGE 
Sbjct: 289 ARRVALETADFMVRELRTGEGGFASALDADS--DDGTGRHVEGAYYVWTPEQLRDALGEE 346

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL---NI 517
                  Y+                        + E       +S L +P ++ +     
Sbjct: 347 DAQLAAQYF-----------------------GVTEEGTFEHGSSVLQLPQQEGVFDAER 383

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
           +   RR L + R+ RP P  DDK++ +WNGL I++ A                      D
Sbjct: 384 IESVRRLLLERRAGRPAPGRDDKIVAAWNGLAIAALAETGAYF----------------D 427

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAFLISGLLDLYEFG 636
           R + +E A  AA  + R   DE    L  + R+G   A  G L+DYA +  G L L    
Sbjct: 428 RPDLVEAALGAADLLVRLHMDEHAG-LARTSRDGQVGANAGVLEDYADVAEGFLALASVT 486

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
               WL +A  L       F D + G  ++T  +   ++ R ++  D A PSG S +   
Sbjct: 487 GEGVWLDFAGLLLGHVLTRFTDPDSGALYDTAADAEQLIRRPQDPTDNATPSGWSAAAGA 546

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAVFET 725
              L    A + S+ +R  AE +L V + 
Sbjct: 547 ---LLGYAAHTGSEAHRTAAEKALGVVKA 572


>gi|375102437|ref|ZP_09748700.1| thioredoxin domain containing protein [Saccharomonospora cyanea
           NA-134]
 gi|374663169|gb|EHR63047.1| thioredoxin domain containing protein [Saccharomonospora cyanea
           NA-134]
          Length = 670

 Score =  384 bits (986), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 241/677 (35%), Positives = 342/677 (50%), Gaps = 73/677 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   SPYLLQHA NPVDW+ WG EA AEAR+RDVPI LSIGY+ CHWCHVM  ESF 
Sbjct: 2   NRLATATSPYLLQHADNPVDWWPWGPEALAEARRRDVPILLSIGYAACHWCHVMAHESFA 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+ VA  +N+ FV+IKVDREERPD+D VYMT  QA+ G GGWP++ FL+PD +P   GTY
Sbjct: 62  DDDVAAFMNEHFVNIKVDREERPDIDAVYMTATQAMTGQGGWPMTCFLTPDAEPFHCGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           +PP   +G P FK +L  V  AW ++RD L +     ++ ++E      +    P  +  
Sbjct: 122 YPPVPAHGIPAFKQLLTAVDQAWRERRDELVEGAGRIVDHIAE-----QTGPLSPHPVTG 176

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
           + +     +L    D   GGFG APKFP  + ++ +L H    E TG    + E   +V 
Sbjct: 177 DTVASAVSKLRTETDPGHGGFGGAPKFPPSMVLEFLLRH---YERTG----SVEALSIVD 229

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            T + MA+GGI+D + GGF RYSVD  W VPHFEKMLYD   L   Y      T      
Sbjct: 230 MTAEGMARGGIYDQLAGGFARYSVDSGWVVPHFEKMLYDNALLLRFYAHLARRTDSPLAH 289

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHA 461
            +  +  ++L RD+  P G   ++ DAD+   EG T       YVWT +++ ++LG +  
Sbjct: 290 RVAGETAEFLLRDLRTPQGAFAASLDADTEGVEGLT-------YVWTPQQLVEVLGPDDG 342

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
               E + +   G             F+     ++L      AS       +++ +    
Sbjct: 343 AWAAETFGVTEEGT------------FEHGASTLQLRRDPDDAS-------RWMRVT--- 380

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
              L   R+ RP+P  DDKVI +WNGL I++ A A   L+                R E+
Sbjct: 381 -SALLQARNARPQPARDDKVIAAWNGLAITALAEAGVALQ----------------RPEW 423

Query: 582 MEVAESAASFIRR-HLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLISGLLDLYEFGSGT 639
           +E A +A +F+   H   +    L+ + R+G    A G L+DY  L  GLL L++    +
Sbjct: 424 VEAAVAAGAFVLDVHAGGDTAGGLRRTSRDGVVGTAAGVLEDYGCLADGLLALHQATGES 483

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL-RVKEDHDGAEPSGNSVSVINLV 698
            WLV A  L +T    F      G F+ T  D   L+ R  +  D A PSG S     L+
Sbjct: 484 VWLVEATTLLDTALRRFGVEGAPGAFHDTAADAEALVHRPSDPTDNASPSGASALAGALL 543

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP-----LMCCAADMLSVPSRKHVV 753
             +++    ++  YR   E +L    +R   +   VP      +  A  +LS P +  V 
Sbjct: 544 PASALAGPERAGTYRAACEEAL----SRAGALVAQVPRFAGHWLSVAEALLSGPVQ--VA 597

Query: 754 LVGHKSSVDFENMLAAA 770
           +VG  ++   E ++ AA
Sbjct: 598 VVGTDAADRAELVVEAA 614


>gi|415885100|ref|ZP_11547028.1| hypothetical protein MGA3_07690 [Bacillus methanolicus MGA3]
 gi|387590769|gb|EIJ83088.1| hypothetical protein MGA3_07690 [Bacillus methanolicus MGA3]
          Length = 625

 Score =  384 bits (986), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 228/555 (41%), Positives = 318/555 (57%), Gaps = 51/555 (9%)

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           ME ESFEDE VAKLLN+ FVSIKVDREERPD+D +YM   Q + G GGWPLSVF++PD K
Sbjct: 1   MERESFEDEEVAKLLNERFVSIKVDREERPDIDSIYMNICQLMNGHGGWPLSVFMTPDQK 60

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           P   GTYFP E +YG PGFK ++ ++ D + K R  + +  + A E L +  SA  SS +
Sbjct: 61  PFFAGTYFPKESRYGVPGFKDVITQLYDQYMKNRSHIEKIASDAAEALKQ--SARESSAE 118

Query: 276 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
           LP     + L    +QL+ S++S +GGFG APKFP P  +  +L + K    TG      
Sbjct: 119 LP---SVDVLHKTYQQLAGSFNSVYGGFGDAPKFPIPHHLMFLLKYYKW---TG----TE 168

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
              KMV  TL  MA GGI+DH+G GF RYSVD  W VPHFEKMLYD   L   Y +A+ +
Sbjct: 169 MALKMVEKTLVSMANGGIYDHIGFGFARYSVDAMWLVPHFEKMLYDNALLLYTYSEAYQV 228

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           TK+  Y  I   I++++ R+M    G  FSA DADS   EG    +EG +YVW+ +E+ D
Sbjct: 229 TKNSKYKEIAEQIIEFITREMTNEEGAFFSAIDADS---EG----EEGKYYVWSKEEILD 281

Query: 456 ILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGMPLEK 513
           +LGE    F           C +  ++   N F+GKN+  LI  N    + ++ G+ LE+
Sbjct: 282 VLGEKDGEF----------YCKVYDITSGGN-FEGKNIPNLIHTN-MVKTFAEAGLKLEE 329

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
               L E R+KLF+ R +R  PHLDDK++ SWN L+I+  A+A +  +++          
Sbjct: 330 GKAKLEESRQKLFEKRQERVYPHLDDKILTSWNALMIAGLAKAGQAFQNQ---------- 379

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
                 +Y+E AE A  FI   L       L   +R+G SK   +LDD+AFL+   L+LY
Sbjct: 380 ------DYVEKAEKALRFIEEKLM--VNGELMARYRDGESKYSAYLDDWAFLLWAYLELY 431

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           E     ++L  A        +LF D + GG++ T  +  ++++R K+ +DGA PSGNSV+
Sbjct: 432 EATFSMEYLDKAQNTAEKMKKLFWDEQDGGFYFTRSDGEALIVREKQVYDGALPSGNSVA 491

Query: 694 VINLVRLASIVAGSK 708
            +N +RL      +K
Sbjct: 492 AVNFLRLGHFTGETK 506


>gi|88813137|ref|ZP_01128378.1| hypothetical protein NB231_12691 [Nitrococcus mobilis Nb-231]
 gi|88789621|gb|EAR20747.1| hypothetical protein NB231_12691 [Nitrococcus mobilis Nb-231]
          Length = 689

 Score =  384 bits (985), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 229/596 (38%), Positives = 332/596 (55%), Gaps = 56/596 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLAA  SPYLLQHA NPVDW+ WG+EA   AR+ D PI LSIGYS CHWCHVM  ESFE
Sbjct: 9   NRLAATTSPYLLQHADNPVDWYPWGQEALERARREDRPILLSIGYSACHWCHVMAHESFE 68

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPLSVFLSPDLKPLMGGT 221
           DE +A+ +N+ F++IKVDREERPD+D++Y T  Q L    GGWPL+VFL+P+  P   GT
Sbjct: 69  DETIARAMNEHFINIKVDREERPDLDRIYQTAHQLLNNRPGGWPLTVFLTPEQMPFFCGT 128

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLA---QSGAFAIEQLSEALSASASSNKLPD 278
           YFPP+  YG PGF  IL ++  A+ ++ + +    Q+   A+ +LSE     A +     
Sbjct: 129 YFPPKSHYGLPGFHEILLQIAQAYRQQHEAIKKQNQAVLDALNRLSEPPPNRAGA----- 183

Query: 279 ELPQNALRLCAEQ-LSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEG 337
             P+ AL   A   L++ +DS FGGFG APKFP+P  I+ +L H  +           + 
Sbjct: 184 --PKAALFDNARSALAREFDSTFGGFGPAPKFPQPSSIERLLRHYART--AANDVPDYDA 239

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
            +M   TL+ MA GGI+D +GGGF RYSVD  W +PHFEKMLYD GQL  +Y DA+  T 
Sbjct: 240 LRMAQLTLRKMALGGIYDQIGGGFARYSVDNYWIIPHFEKMLYDNGQLLALYADAWRATG 299

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           +  +  +  +  ++  R+M  P G  +++ DADS   EG     EGAFY+WT +E+ ++L
Sbjct: 300 EELFQRVANETAEWALREMRHPDGAFYASLDADS---EGG----EGAFYLWTPEEIRNVL 352

Query: 458 GE---HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 514
            E     +L +          C L+   +    F+G+  L      +  A+    P ++ 
Sbjct: 353 REDEAEVVLAR----------CGLNNQPN----FEGRWHLYVRLTFTDLANNQHRPRQEL 398

Query: 515 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 574
           + +    R +L + R +RPRP  D+KV+ SWN L++S  ARA +   + A +A       
Sbjct: 399 IALWRSARERLREAREQRPRPPRDEKVLTSWNALMVSGLARAGRRFGNTALTA------- 451

Query: 575 GSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE 634
                      +    F+  +L+  +  RL   +++G +  P +LDD+A+L++ LL+  E
Sbjct: 452 ---------AGDQTLHFLHSNLW--RNGRLLTVWKDGQADLPAYLDDHAYLLAALLEQLE 500

Query: 635 FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGN 690
                 WL WA  + +     F D+  GG+F T  +   ++ R +   D A PSGN
Sbjct: 501 ARWEPHWLQWARAIADLLLARFEDKTHGGFFFTADDHEPLVQRPRPLGDDACPSGN 556


>gi|295838670|ref|ZP_06825603.1| conserved hypothetical protein [Streptomyces sp. SPB74]
 gi|197699107|gb|EDY46040.1| conserved hypothetical protein [Streptomyces sp. SPB74]
          Length = 683

 Score =  384 bits (985), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 239/624 (38%), Positives = 324/624 (51%), Gaps = 56/624 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   SPYLLQHA NPVDW+ W  EAF EA +RDVP+ LS+GYS CHWCHVM  ESFE
Sbjct: 2   NRLAGATSPYLLQHADNPVDWWPWSPEAFEEAARRDVPVLLSVGYSACHWCHVMARESFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D G A  +N+ FV++KVDREERPDVD VYM  VQA  G GGWP++VFL+P  +P   GTY
Sbjct: 62  DVGTAAYVNEHFVAVKVDREERPDVDAVYMEAVQAATGQGGWPMTVFLTPGGEPFYFGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD-ELP 281
           FPP   +G P F+ +L  V+ AW  +R  + +  A     L      +     LPD   P
Sbjct: 122 FPPRPLHGTPAFRQVLEGVRAAWADRRAEVDEVAARVTADL------TGRGLGLPDGAAP 175

Query: 282 QNALRLCAE--QLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
             A  L A    L++ YDSR GGFG APKFP  + ++ +L H  +   TG  G      +
Sbjct: 176 PGADALGAALLGLTRDYDSRHGGFGGAPKFPPVMVLEFLLRHHAR---TGAEG----ALQ 228

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           M   T + MA+GGI+D +GGGF RY+VD  W VPHFEKML D   L   Y   +  T   
Sbjct: 229 MAADTAEHMARGGIYDQLGGGFARYAVDREWTVPHFEKMLSDNALLCRFYAHLWRATGSA 288

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 459
               +  +  D+L R++  P G   SA DADS   +G  R  EGA YVWT +++ ++LGE
Sbjct: 289 LARRVALETADFLVRELRTPEGGFASALDADS--DDGTGRHVEGASYVWTPEQLREVLGE 346

Query: 460 -HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
             A L   HY + P G             F+  + ++ L  +    S    P++     L
Sbjct: 347 ADAALAAAHYGVTPEGT------------FEHGSSVLRLPRTDGFDSP---PVDA--ARL 389

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
              RR L   R +RP P  DDKV+ +WNGLVI++ A            A F        R
Sbjct: 390 DRIRRALLAAREERPAPGRDDKVVAAWNGLVIAALAET---------GAYFG-------R 433

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
            + +  A  AA  + R   D + H  + S    P    G L+DYA +  G L L      
Sbjct: 434 PDLVAAATGAADLLVRVHLDTRGHLTRTSRDGRPGGNAGVLEDYADVAEGFLTLASVTGE 493

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
             W  +A  L +     F D + G  ++T  +  +++ R ++  D A PSG + +   L+
Sbjct: 494 GVWTDFAGLLLDQVLARFRD-DTGALYDTAADAEALIHRPQDPTDNATPSGWNAAAGALL 552

Query: 699 RLASIVAGSKSDYYRQNAEHSLAV 722
             A++   + S  +R  AE +L+V
Sbjct: 553 TYAAL---TGSTAHRAAAEQALSV 573


>gi|404497256|ref|YP_006721362.1| thioredoxin domain-containing protein YyaL [Geobacter
           metallireducens GS-15]
 gi|418065852|ref|ZP_12703222.1| protein of unknown function DUF255 [Geobacter metallireducens RCH3]
 gi|78194859|gb|ABB32626.1| thioredoxin domain protein YyaL [Geobacter metallireducens GS-15]
 gi|373561650|gb|EHP87881.1| protein of unknown function DUF255 [Geobacter metallireducens RCH3]
          Length = 706

 Score =  384 bits (985), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 245/668 (36%), Positives = 344/668 (51%), Gaps = 57/668 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL    SPYLLQHA NPV W+ WG+EAFA AR  D P+FLSIGY+TCHWCHVM  ESF 
Sbjct: 33  NRLVFASSPYLLQHADNPVAWYEWGDEAFARARAEDKPVFLSIGYATCHWCHVMAHESFG 92

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  VA +LN  FV+IKVDREERPD+D  YM   Q + G GGWPL+V ++PD +P    TY
Sbjct: 93  DHEVAAVLNRDFVAIKVDREERPDIDDTYMRVAQLMNGSGGWPLTVCMTPDREPFFVATY 152

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP- 281
            P   + G PG   IL ++ + W  +R+++ Q+    ++ L     A       P E+P 
Sbjct: 153 IPKHSRGGMPGLVEILGRIAEVWKTRRELVHQNCTAILDSLRNLSVAK------PGEIPG 206

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
              LR    QL+  +D    GFG APKFP P+ +  +L + ++  D G +        MV
Sbjct: 207 AEPLRAARSQLAGMFDPVNAGFGQAPKFPMPLNLSFLLRYGRRFGDPGAT-------VMV 259

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
           + TL+ + +GGI D +G G HRYSVD RW VPHFEKMLYDQ  +A   ++AF  T     
Sbjct: 260 VATLEALRRGGIFDQLGFGLHRYSVDSRWLVPHFEKMLYDQALVAMAAVEAFQATGQESL 319

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE-H 460
             +   + D++ R++  P G  +SA DAD   TEG    +EG +Y+WT  +V  +LGE  
Sbjct: 320 REMAEQLCDFVLRELAAPEGGFYSALDAD---TEG----EEGRYYLWTPAQVRSVLGETE 372

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
             LF   + +   GN            F+G N+L         A + GM  E     +  
Sbjct: 373 GELFCRLFDVTGKGN------------FEGANILNLPVLLHEFAQREGMSPENLEEKVEG 420

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R  L   R+KR RP  D+K++ +WNGL+I++ AR               F   G +R  
Sbjct: 421 WRLLLLAERAKRERPFRDEKIVTAWNGLMIAALARL--------------FLAGGGER-- 464

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           ++  AE+A   I R L      RL  S   G  + P FL+DYA L+ GLL L++     +
Sbjct: 465 FLVAAEAALVRILRDLR-RADGRLLRSIHRGEGEVPAFLEDYAALLHGLLALHDATLDPR 523

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
           +   A  L      LF   E  G ++T  +  +VL+R + D+DG  PSGN ++   LVRL
Sbjct: 524 YREEACSLARDMLRLF-SGEDRGLYDTGNDAETVLMRSRVDYDGVMPSGNGLAATGLVRL 582

Query: 701 ASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSS 760
             +   +  + + +  E  +  F        +A      A D+L  P  +  +  G +  
Sbjct: 583 GRM---ADEERFVEAGEEIIRAFMAGAGRQPVAHLQTLMALDLLRGPQVEVAISGGSRGK 639

Query: 761 VDFENMLA 768
           V  + MLA
Sbjct: 640 V--QGMLA 645


>gi|23100033|ref|NP_693499.1| hypothetical protein OB2578 [Oceanobacillus iheyensis HTE831]
 gi|22778264|dbj|BAC14534.1| hypothetical conserved protein [Oceanobacillus iheyensis HTE831]
          Length = 691

 Score =  383 bits (984), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 231/670 (34%), Positives = 345/670 (51%), Gaps = 55/670 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           ++H N L  E SPYLLQH +NPVDW+ WGE+AF +ARK   PIFLSIGYS+C WCH M  
Sbjct: 4   SRHHNHLINETSPYLLQHVNNPVDWYPWGEKAFNKARKEQKPIFLSIGYSSCTWCHNMNR 63

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESF D+ VA LLN ++VSIKVDREERPD+D +YM   Q + G GGWPL++ ++ D  P  
Sbjct: 64  ESFMDQEVAALLNQYYVSIKVDREERPDIDGLYMKACQMMTGHGGWPLTIIMTDDQVPFF 123

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFP    YG PG   IL  +   + +    +A+     ++++ +AL  + S      
Sbjct: 124 AGTYFPKHQNYGLPGLMDILPTIAKKYAEDPQQIAE----YMKKVEDALQDTLSKKSNES 179

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
              ++++R   +QL++ +D  +GGF   PKFP P  +  ++++  K  D           
Sbjct: 180 LTSEDSVR-TYQQLNELFDYPYGGFYKEPKFPSPHNLSFLIHYYYKTGD-------KNAL 231

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
           KMV  TL+ + +    DHVG G  RY+ D +W  PHFEKMLYDQ  L +V +D F +TKD
Sbjct: 232 KMVDMTLKSIFQSSTWDHVGFGVFRYATDRKWMFPHFEKMLYDQAFLLDVSVDMFLITKD 291

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
            FY     +I+ +++R+M    G  +++  ADS         +EGA+Y+W+ +E+  ILG
Sbjct: 292 PFYQLKVNEIIQFVKREMTAENGCFYASLSADS-------NGEEGAYYLWSLEEIYSILG 344

Query: 459 E-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS-ASKLGMPLEKYLN 516
           E    LF E Y + P G              +GKN+      S  S AS  G+ +EK   
Sbjct: 345 EDEGDLFAEAYGIVPVG------------VHQGKNLPYRSGISLESLASTYGIQVEKVKT 392

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
            L +   KL   R  R  P  DDK++ SWNG +I++ A+A  + + E             
Sbjct: 393 TLTKSVDKLQKARLLRTAPATDDKILTSWNGYMIAALAKAGSVFQEE------------- 439

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
               ++  A +    +   L  +  +R   ++R G +   GFLDDYA ++ G ++L++  
Sbjct: 440 ---NWINHAINTMKNLSDILIKD--NRWFANYRQGKTNTKGFLDDYAAILWGYIELHQAT 494

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
                L  A  + N   +LF D   GG+F    +   ++ R KE +D   PSGNS++ I 
Sbjct: 495 MEIDHLKKAKTIANDMIKLFWDSNDGGFFFVANDAEQLISREKEIYDSPIPSGNSLASIQ 554

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVG 756
           L RLA++  G  S  Y    +  +  F   L+D              L     K V+++G
Sbjct: 555 LSRLANLT-GEMS--YYSYVDTMMYTFYRELQDEPSGASFFMRNL-FLQQDQTKQVIIIG 610

Query: 757 HKSSVDFENM 766
             +   F ++
Sbjct: 611 ENTEAFFNHI 620


>gi|209883527|ref|YP_002287384.1| thioredoxin domain-containing protein [Oligotropha carboxidovorans
           OM5]
 gi|337739402|ref|YP_004631130.1| hypothetical protein OCA5_c01570 [Oligotropha carboxidovorans OM5]
 gi|386028421|ref|YP_005949196.1| hypothetical protein OCA4_c01570 [Oligotropha carboxidovorans OM4]
 gi|209871723|gb|ACI91519.1| highly conserved protein contAining a thioredoxin domain
           [Oligotropha carboxidovorans OM5]
 gi|336093489|gb|AEI01315.1| hypothetical protein OCA4_c01570 [Oligotropha carboxidovorans OM4]
 gi|336097066|gb|AEI04889.1| hypothetical protein OCA5_c01570 [Oligotropha carboxidovorans OM5]
          Length = 684

 Score =  383 bits (984), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 227/612 (37%), Positives = 322/612 (52%), Gaps = 65/612 (10%)

Query: 101 HTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVES 160
           HTNRLA E SPYLLQH HNPVDW+ WG EA AEA+K   PI LS+GY+ CHWCHVM  ES
Sbjct: 7   HTNRLAGETSPYLLQHQHNPVDWWPWGTEALAEAQKTGKPILLSVGYAACHWCHVMAHES 66

Query: 161 FEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGG 220
           FED   A+++N+ FV IKVDREERPD+D++YM  +  L   GGWP+++FLSPD  P+ GG
Sbjct: 67  FEDAATAEVMNELFVCIKVDREERPDIDQIYMRALHLLGQQGGWPMTMFLSPDGAPIWGG 126

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFP   +YGRP F  I+R+    +  + D +A +       L+E      +S  L    
Sbjct: 127 TYFPNTPQYGRPSFVGIMREFIRIYRDEPDKIAANKTAIERSLAERSPTDTASIGL---- 182

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
             N L   A  +++S D   GG   APKFP+             LE   ++G  +   + 
Sbjct: 183 --NELDNVAGSIARSTDPDNGGLRGAPKFPQ----------CSMLEFLWRAGARTGDDRF 230

Query: 341 VLFT---LQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
            + T   L  M++GGI+DH+GGG+ RY+VD++W VPHFEKMLYD  Q+ ++     +   
Sbjct: 231 FITTNLALTRMSQGGIYDHLGGGYARYTVDDKWLVPHFEKMLYDNAQILDLLALEHARAP 290

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           +  Y     + + +L+R+M+   G   S+ DADS   EG    +EG FY+W+  E+E++L
Sbjct: 291 NALYHQRAEETVGWLKREMLTREGGFASSLDADS---EG----EEGRFYIWSQSEIEELL 343

Query: 458 G-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
           G + A  F   Y +   GN            F+G+N+L  L D S +A++          
Sbjct: 344 GKDDATFFAAKYGVTADGN------------FEGRNILNRLGDDSDTATE--------AE 383

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
            L   R  LF  R KR RP LDDKV+  WNGL I++   A++                  
Sbjct: 384 QLAAMRAILFRAREKRVRPGLDDKVLADWNGLTIAALVHAAQAFA--------------- 428

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
            R +++ +A +A  FI   +   +  RL HS+R G    P    D A +I   L L+E  
Sbjct: 429 -RPDWLTLAATAFGFITTTM--SRHGRLGHSWRAGKLLQPALASDNAAMIRAALALHEAT 485

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
               +L  A+  Q   D  + D   GGYF T+ +   ++LR     D A P+   ++  N
Sbjct: 486 GDHLFLDQAVLWQADLDTHYGDPRHGGYFLTSDDAEGLILRPHSSVDDATPNHIGLTAQN 545

Query: 697 LVRLASIVAGSK 708
           L RLA +    +
Sbjct: 546 LARLAVLTGDDR 557


>gi|271969730|ref|YP_003343926.1| hypothetical protein [Streptosporangium roseum DSM 43021]
 gi|270512905|gb|ACZ91183.1| conserved hypothetical protein [Streptosporangium roseum DSM 43021]
          Length = 682

 Score =  383 bits (983), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 237/625 (37%), Positives = 319/625 (51%), Gaps = 88/625 (14%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL    SPYLLQHA NPV+WF WGE+AFAEA +R+VP+ +S+GYS CHWCHVM  ESFE
Sbjct: 2   NRLKDATSPYLLQHADNPVEWFEWGEDAFAEAARRNVPLLISVGYSACHWCHVMAHESFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DEG A L+N+ FV++KVDREERPDVD VYM   QA+ G GGWP++VF +P   P   GTY
Sbjct: 62  DEGTAALMNEHFVNVKVDREERPDVDAVYMAATQAMTGQGGWPMTVFATPGGHPFYTGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP      RP F+ +L  V +AW+  R+ + +  +  +E L+E  +  +     PD L +
Sbjct: 122 FP------RPQFQRLLAGVSNAWNGDREAVLEQSSKIVEALNERSALPSGPLPTPDTLAR 175

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED-TGKSGEASEGQK-- 339
                  + LS+S+D   GGFG APKFP  + ++ +L +    E  TG  G   E ++  
Sbjct: 176 -----AVQSLSRSFDQVRGGFGGAPKFPPSMALEFLLRYGAAAEPRTGAEGGEPEDRREP 230

Query: 340 ---------------MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 384
                          M   TL+ MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   
Sbjct: 231 GAGAGAGAGAPTATAMAGRTLEAMARGGIYDQLGGGFARYSVDADWVVPHFEKMLYDNAL 290

Query: 385 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 444
           L  VY   + LT       +  +  D+L  +M  P G   SA DADS   EG     EG 
Sbjct: 291 LLRVYAHWWRLTGSALGRRVALETADWLLAEMRTPEGGFASALDADS---EGV----EGK 343

Query: 445 FYVWTSKEVEDILGEH----AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS 500
           FY WT +E+ ++LGE     A+   E       G   L  +SDP             +D+
Sbjct: 344 FYAWTPEEIHEVLGEEDGAWAVALYEVTGTFEHGTSVLQLLSDP-------------DDA 390

Query: 501 SASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 560
             SA                 R +L   R+ R RP  DDKV+ +WNGL I++ A    + 
Sbjct: 391 ERSA---------------RVRAELLAARAHRVRPGRDDKVVAAWNGLAIAALAETGALF 435

Query: 561 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFL 619
                           DR + +E A +AA  +     D    RL  + R+G + A  G L
Sbjct: 436 ----------------DRPDLVEAARAAAVLLDGSHMD--GDRLLRTSRDGRAGANAGVL 477

Query: 620 DDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK 679
           +DYA L  GLL LY      +W   A  L  T  + F D   GG+F+T  +   +  R +
Sbjct: 478 EDYADLAEGLLTLYGVTGEVRWFHRAGALLETVLDRFADGS-GGFFDTADDAERLFQRPQ 536

Query: 680 EDHDGAEPSGNSVSVINLVRLASIV 704
           +  D A PSG   +   L+  A++ 
Sbjct: 537 DPTDNATPSGQFAAAGALLSYAALT 561


>gi|386842157|ref|YP_006247215.1| hypothetical protein SHJG_6075 [Streptomyces hygroscopicus subsp.
           jinggangensis 5008]
 gi|374102458|gb|AEY91342.1| hypothetical protein SHJG_6075 [Streptomyces hygroscopicus subsp.
           jinggangensis 5008]
 gi|451795451|gb|AGF65500.1| hypothetical protein SHJGH_5837 [Streptomyces hygroscopicus subsp.
           jinggangensis TL01]
          Length = 677

 Score =  383 bits (983), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 236/626 (37%), Positives = 326/626 (52%), Gaps = 59/626 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYLLQHA NPVDW+ W  EAF EAR+   P+ LS+GYS+CHWCHVM  ESFE
Sbjct: 3   NRLAHETSPYLLQHADNPVDWWPWSGEAFDEARRTGRPVLLSVGYSSCHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D   A  LN+ FVS+KVDREERPDVD VYM  VQA  G GGWP++VFL+PD +P   GTY
Sbjct: 63  DRATADYLNEHFVSVKVDREERPDVDAVYMEAVQAATGHGGWPMTVFLTPDAEPFYFGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE-ALSASASSNKLPDELP 281
           FPP  ++G P F+ +L  V+ AW  +RD +A      +  L++  +   A+      EL 
Sbjct: 123 FPPAPRHGMPSFRQVLEGVQQAWTTRRDEVADVAGKIVRDLAQREIVRQAAEAPGEQELA 182

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           Q  L      L++ YD + GGFG APKFP  + ++ +L H  +   TG  G      +M 
Sbjct: 183 QALL-----GLTREYDPQRGGFGGAPKFPPSMVLEFLLRHHAR---TGAEG----ALQMA 230

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  VY   +  T     
Sbjct: 231 QDTCERMARGGIYDQLGGGFARYSVDRDWVVPHFEKMLYDNALLCRVYTHLWRATGSDLA 290

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHA 461
             +  D   +L R++    G   SA DADS   +G+ R  EGA+YVW   ++ + LG+ A
Sbjct: 291 RRVALDTAQFLLRELRTAEGGFASALDADS--DDGSGRHVEGAYYVWRPDQLREALGDDA 348

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
            L  +++ +   G             F+    +++L  +           EK  ++    
Sbjct: 349 ELAAQYFGVTDEGT------------FEHGQSVLQLPQTEGV-----FEAEKIASV---- 387

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           + +L   R++RP P  DDKV+ +WNGL I++ A                      DR + 
Sbjct: 388 KDRLLAARARRPAPGRDDKVVAAWNGLAIAALAETGACF----------------DRPDL 431

Query: 582 MEVAESAASFIRRHLYDEQTH--RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
            E A +AA  + R   DE     R     R GP+   G L+DYA +  G L L       
Sbjct: 432 TEAAVAAADLLVRVHLDEHGRLARTSKDGRVGPNA--GVLEDYADVAEGFLALASVTGEG 489

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
            WL +A  L +     F D E G  ++T  +   ++ R ++  D A PSG + +   L+ 
Sbjct: 490 VWLDFAGLLLDHVLARFTDTETGALYDTASDAEQLIRRPQDPTDNAAPSGWTAAAGALL- 548

Query: 700 LASIVAGSKSDYYRQNAEHSLAVFET 725
             S  A + S+ +R  AE +L V +T
Sbjct: 549 --SYAAHTGSEPHRAAAERALGVVKT 572


>gi|72160855|ref|YP_288512.1| hypothetical protein Tfu_0451 [Thermobifida fusca YX]
 gi|71914587|gb|AAZ54489.1| conserved hypothetical protein [Thermobifida fusca YX]
          Length = 665

 Score =  383 bits (983), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 252/667 (37%), Positives = 345/667 (51%), Gaps = 86/667 (12%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   SPYLLQHA NPVDW+ WGEEAFAEAR+RDVPI LSIGY+ CHWCHVM  ESF 
Sbjct: 3   NRLAHATSPYLLQHADNPVDWYPWGEEAFAEARRRDVPILLSIGYAACHWCHVMARESFA 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE  A+++N  FV++KVDREERPDVD VYM   QA+ G GGWP++VF +PD +P   GTY
Sbjct: 63  DEQTAQIMNANFVNVKVDREERPDVDAVYMEATQAMTGHGGWPMTVFATPDGEPFYCGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP      R  F+ +L  +  AW   R  +   G    ++++EALSA      LP   P 
Sbjct: 123 FP------REHFQRLLLGISHAWRTDRTGVVGQG----KRVAEALSA---PRTLPSGPPP 169

Query: 283 NA--LRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
           +A  L     +L+  YD+  GG+G+APKFP    ++ +L H  ++ D    G  +E  +M
Sbjct: 170 SAQVLEQAVARLAAEYDTVNGGYGTAPKFPPSPVMEFLLRHHARVSD----GAETEALRM 225

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
           V  T + MA+GGI+D + GGF RY+VD  W VPHFEKMLYD   L   Y   +  T D  
Sbjct: 226 VRHTAEAMARGGIYDQLAGGFARYAVDATWTVPHFEKMLYDNALLLRCYTHLWRQTGDEL 285

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
              +  +  D++  ++    G   SA DADS   EG    +EG +YVWT  ++ D+LGE 
Sbjct: 286 ARRVAVETADWMVAELRTAEGGFASALDADS---EG----EEGRYYVWTPAQLRDVLGEE 338

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
              +            +L  +++     +G +VL    D            E+Y  +   
Sbjct: 339 DGAWA----------AELFGVTEQGTFERGTSVLQLRADPDDR--------ERYAYV--- 377

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R +L   R+ R  P  DDKV+  WNGL I+  A A  +L                DR +
Sbjct: 378 -RDRLRKARANRVPPARDDKVVTGWNGLAIAGLAEAGALL----------------DRPD 420

Query: 581 YMEVAESAASF-IRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLISGLLDLYEFGSG 638
            +E A  AA   + RH  D    RL    R+G P  + G L+DYA L  GLL L+     
Sbjct: 421 LVERAREAARLVVERHYAD---GRLVRVSRDGVPGTSAGVLEDYANLAEGLLALHAVTGE 477

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
            +W+    EL  T    F D   GG+++T  +  ++  R +E  D A PSG S +   L+
Sbjct: 478 IRWVGVCGELLETVLTRFTDGS-GGFYDTADDAEALFNRPREFTDDATPSGWSAAAGALL 536

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFET------RLKDMAMAVPLMCCAADMLSVPSRKHV 752
             A++   + S  +R+ AE +L V  T      R     MAV     A  +L+ P    +
Sbjct: 537 SYAAL---TGSFRHREAAEAALGVVSTLAEKTPRFAGWGMAV-----AEALLAGPV--EI 586

Query: 753 VLVGHKS 759
            +VG K 
Sbjct: 587 AVVGPKG 593


>gi|407781159|ref|ZP_11128379.1| hypothetical protein P24_03046 [Oceanibaculum indicum P24]
 gi|407208585|gb|EKE78503.1| hypothetical protein P24_03046 [Oceanibaculum indicum P24]
          Length = 680

 Score =  382 bits (982), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 238/671 (35%), Positives = 342/671 (50%), Gaps = 65/671 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N LA E SPYLLQH  NPV W +WG EA   AR    PI LS+GY+ CHWCHVM  ESFE
Sbjct: 4   NLLAQEASPYLLQHKDNPVHWMSWGREALDRARAEGKPILLSVGYAACHWCHVMAHESFE 63

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+  A L+N  FV++KVDREERPD+D +Y + +  L   GGWPL++FL+PD  P  GGTY
Sbjct: 64  DDETAALMNRLFVNVKVDREERPDIDHIYQSALAILGEQGGWPLTMFLTPDGDPFWGGTY 123

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP E +YGRPGFK +L+ + DA  +  D ++++ +   + L +    +A  N  P  L +
Sbjct: 124 FPKEARYGRPGFKAVLQAIADAHAEGSDKVSRNASALRQALRQLAEPAAGENIEPALLDR 183

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
                 AE+L +  D   GG G APKFP+P  + ++  H        +SG   + +  VL
Sbjct: 184 -----IAERLHREIDPIHGGIGGAPKFPQPGMLMLLWRHWL------RSGN-QDSRDYVL 231

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL+ M +GGI+DH+GGGF RYS D +W  PHFEKMLYD  QL  +   A   T    + 
Sbjct: 232 LTLERMCQGGIYDHLGGGFARYSTDAQWLAPHFEKMLYDNAQLIEMLTHAALETGRPLFR 291

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL----G 458
               + + ++ R+MI   G   S+ DADS   EG    +EG FYVW   E++ +L    G
Sbjct: 292 QRLEETIGWVLREMITDEGGFASSLDADS---EG----EEGKFYVWREAEIDQLLAHLPG 344

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
           E    FK  Y + P GN +   +         +N   +L + +A +             L
Sbjct: 345 EALESFKRAYDVTPEGNWEGVTILH-------RNRRPDLGNGAAESQ------------L 385

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
            + R+ LF+ R +R RP  DDKV+  WNGL+I + A+AS           F F       
Sbjct: 386 AQVRQLLFEHREQRERPGWDDKVLADWNGLMIRALAQAS-----------FAFA-----H 429

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
            +++  A  A  ++   +  +   RL+HS R    + P  L+DYA + S  L L++    
Sbjct: 430 ADWLRAAIRAFDYVVEKMTLDG--RLRHSRRGDILRHPATLEDYANMASAALALFQITRH 487

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
            ++L  AI   +  D  + D EGGGYF T  +   V+LR K   D A P+GN   +  L 
Sbjct: 488 QRFLGQAIAWVDVLDRHYWDHEGGGYFTTADDTNDVVLRAKNAQDNAVPAGNGTMLQVLT 547

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHK 758
            L  +   +  D YR  A+  +  F   +      +       D+   P +  + L G  
Sbjct: 548 TLYHL---TGDDSYRGKADLLIPRFAGEIGRNFFPLATFLNGCDIAQRPLQ--ITLTGDP 602

Query: 759 SSVDFENMLAA 769
           ++  +  +L A
Sbjct: 603 TTPTYVGLLRA 613


>gi|225559995|gb|EEH08277.1| DUF255 domain-containing protein [Ajellomyces capsulatus G186AR]
          Length = 804

 Score =  382 bits (982), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 247/659 (37%), Positives = 342/659 (51%), Gaps = 81/659 (12%)

Query: 88  ERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGY 147
           E   A  + + ++  NRL    SPY+  H +NPV W  W  EA A A+K +  +FL    
Sbjct: 55  ETESAIATGTSHELVNRLNQSKSPYVRGHMNNPVAWQMWDAEAIALAKKLNRMVFLR--- 111

Query: 148 STCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 207
                CHVME ESF    VA +LN  F+ IK+DREERPD+D VYM YVQA  G GGWPL+
Sbjct: 112 -----CHVMEKESFMSPEVAAILNKAFIPIKLDREERPDIDDVYMNYVQATTGSGGWPLN 166

Query: 208 VFLSPDLKPLMGGTYFP-PEDKY-------GRPGFKTILRKVKDAWDKK--------RDM 251
           VFL+PDL+P+ GGTY+P P           G+  F  IL K++D W  +        +D+
Sbjct: 167 VFLTPDLEPVFGGTYWPGPHSSASSTLGGEGQVTFIDILEKLRDVWQTQQLRCRESAKDI 226

Query: 252 LAQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPR 311
             Q   FA E     L  + +  +   +L    L    +  +  YD   GGF  APKFP 
Sbjct: 227 TRQLQEFAEEGTYSKLRGAGADEEE--DLEVELLEEAYKHFASRYDPVNGGFSRAPKFPT 284

Query: 312 PVEIQMMLYHSK---KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDE 368
           P  +  ++  S+    + D     E +   +M + TL  +++GGIHDH+G GF RYSV  
Sbjct: 285 PANLSFLVNLSRFPSAVADIVGYEECAHALEMAIKTLISISRGGIHDHIGHGFARYSVTT 344

Query: 369 RWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRR-DMIGPGGEIFSAE 427
            W +PHFEKMLYDQ QL  VY DAF    D        DI  Y+    ++ P G   S+E
Sbjct: 345 DWSLPHFEKMLYDQAQLLGVYTDAFDSAHDPELLGAMYDIAAYITSPPVLSPTGGFHSSE 404

Query: 428 DADSAETEGATRKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHN 486
           DADS  T   T K+EGAFYVWT KE + ILG+  A +   H+ + P GN +  R++DPH+
Sbjct: 405 DADSLPTPSDTDKREGAFYVWTHKEFKQILGQRDADVCARHWGVLPDGNVE--RVNDPHD 462

Query: 487 EFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSW 545
           EF  +NVL         A + G+  E+ + I+     KL + R SKR RP LDDK+IV+W
Sbjct: 463 EFINQNVLNIQTTPGKLAKEFGLSEEEVVRIIKASTEKLREYRESKRVRPALDDKIIVAW 522

Query: 546 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQ 605
           NGL I + A+ S +L +          V     +E+   AE+AA FIR+ L+D  + +L 
Sbjct: 523 NGLAIGALAKCSVVLDN----------VDRIKAQEFRLAAENAAKFIRQSLFDPASGQLW 572

Query: 606 HSFRNGP-SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGY 664
             +R       PGF DDYA+LISGL+DLYE      +L +A +LQ+              
Sbjct: 573 RIYRGEERGDTPGFADDYAYLISGLIDLYEATFDDSYLQFAEQLQH-------------- 618

Query: 665 FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 723
                               + PS N V   NL+RL++++   + D YR+ A  +++ F
Sbjct: 619 -------------------ASTPSPNGVIARNLLRLSTLL---EDDTYRRLARDTVSAF 655


>gi|299133196|ref|ZP_07026391.1| protein of unknown function DUF255 [Afipia sp. 1NLS2]
 gi|298593333|gb|EFI53533.1| protein of unknown function DUF255 [Afipia sp. 1NLS2]
          Length = 683

 Score =  382 bits (982), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 231/612 (37%), Positives = 326/612 (53%), Gaps = 65/612 (10%)

Query: 101 HTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVES 160
           HTNRLA E SPYLLQH HNPVDW+ WG EA AEA++   PI LS+GY+ CHWCHVM  ES
Sbjct: 7   HTNRLAGETSPYLLQHQHNPVDWWPWGPEALAEAQRTGKPILLSVGYAACHWCHVMAHES 66

Query: 161 FEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGG 220
           FEDE  A ++N+ FV IKVDREERPD+D++YM  +  L   GGWPL++FL+PD  P+ GG
Sbjct: 67  FEDETTAAVMNELFVPIKVDREERPDIDQIYMNALHLLGEQGGWPLTMFLTPDGAPVWGG 126

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFP   +YGR  F  +LR++   +  + D +A + A   + LS+  SA A+S  L    
Sbjct: 127 TYFPKTAQYGRAAFVEVLRELARIFRDEPDKIAANKAAIEKSLSQRSSADAASIGL---- 182

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
             N L   A  ++++ D   GG   APKFP+             LE   ++G  +  ++ 
Sbjct: 183 --NELDNAAGSIARATDPTNGGLRGAPKFPQ----------CSMLEFLWRAGARTGDERY 230

Query: 341 VLFT---LQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
            + T   L  M++GGI+DH+GGG+ RYSVD RW VPHFEKMLYD  Q+ ++     +   
Sbjct: 231 FITTNLALTQMSQGGIYDHLGGGYARYSVDARWLVPHFEKMLYDNAQILDMLALEHARAP 290

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           +  Y     + + +L+R+M+   G   S+ DADS   EG    +EG FYVW+  ++  +L
Sbjct: 291 NELYRQRAEETVGWLKREMLTKEGGFASSLDADS---EG----EEGKFYVWSQADIAHLL 343

Query: 458 G-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
           G + A  F   Y +   GN            F+G N+L  L+D S +A++          
Sbjct: 344 GPDDATFFAAKYGVSAEGN------------FEGHNILNRLDDGSETATE--------AE 383

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
            L   R  LF  R KR  P LDDKV+  WNGL I++             +  FN      
Sbjct: 384 QLAALRAILFRAREKRVHPGLDDKVLADWNGLTIAA---------LAHAANAFN------ 428

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
            R +++ +A +A  F+   +   +  RL HS+R G    P    D+A +I   L LYE  
Sbjct: 429 -RPDWLTLATTAFGFVTTTM--SRRDRLGHSWRAGKLLQPALASDHAAMIRAALALYEAT 485

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
               +L  AI  Q   D  + D + GGYF T+ +   ++LR     D A P+   ++  N
Sbjct: 486 GDHLFLDQAILWQADLDTHYGDPQHGGYFLTSDDAEGLILRPHSTVDDAIPNHVGLTAQN 545

Query: 697 LVRLASIVAGSK 708
           L RLA +    +
Sbjct: 546 LARLAVLTGDER 557


>gi|167043802|gb|ABZ08492.1| hypothetical protein ALOHA_HF4000APKG3D24ctg2g4 [uncultured marine
           crenarchaeote HF4000_APKG3D24]
          Length = 620

 Score =  382 bits (981), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 215/552 (38%), Positives = 312/552 (56%), Gaps = 55/552 (9%)

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           M  ESFEDE +AK++N+ FV+IKVDREERPD+D +Y    Q   G GGWPLSVFL+P+ +
Sbjct: 1   MAHESFEDEEIAKIMNENFVNIKVDREERPDLDDIYQKVCQMSTGQGGWPLSVFLTPEQR 60

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAW-DKKRDMLAQSGAFA--IEQLSEALSASAS 272
           P   GTYFP  D YGRPGF ++ R++  +W +K +D+   +  F   +++L +  + S  
Sbjct: 61  PFYVGTYFPAIDSYGRPGFGSLCRQMAQSWKEKPKDIEKAADNFMQNLDKLKQFPTPSEI 120

Query: 273 SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSG 332
              + DE   N L++         D  +GGFG APKFP    +  M  +SK       SG
Sbjct: 121 DKSILDEAAINLLQIA--------DITYGGFGQAPKFPNASNLSFMFRYSKL------SG 166

Query: 333 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 392
             S+ +K  L TL+ MAKGGI D +GGGFHRYS D RW VPHFEKMLYD   L  VY +A
Sbjct: 167 -ISKFEKFALLTLKKMAKGGIFDQIGGGFHRYSTDARWLVPHFEKMLYDNALLPIVYSEA 225

Query: 393 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 452
           + +TKD F+  + R  LDY+ R+M    G  FSA+DAD+   EG T       +VW  +E
Sbjct: 226 YQITKDPFFENVVRKTLDYIIREMTSSDGMFFSAQDADTNGEEGQT-------FVWKKRE 278

Query: 453 VEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 512
           +E ILGE + +F  +Y +   GN            F+G  +L    ++S+   K G    
Sbjct: 279 IEKILGEDSEIFCIYYDVTDGGN------------FEGNTILANNINASSLGFKFGKSES 326

Query: 513 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 572
           +  NI+ +C  KL +VR+KR +P  DDKVI SWNGL+IS+F    +I             
Sbjct: 327 EIQNIILKCSDKLLEVRNKREQPGKDDKVITSWNGLMISAFLSGYQI------------- 373

Query: 573 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 632
              +D  +Y+++A+ +  F   +   ++ H L  +F+NG  K  G+LDDYA++ +  +D+
Sbjct: 374 ---TDNSKYLDMAKKSIDFFESNF--KENHILHRTFKNGEPKLNGYLDDYAYMANASIDM 428

Query: 633 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 692
           +E  S  K+L++A  L N     F D    G+F T+     +++R K ++D + PSGNSV
Sbjct: 429 FENTSDPKYLLFATNLANYLVTHFWDDSTHGFFFTSDNHEKLIIRPKNNYDLSMPSGNSV 488

Query: 693 SVINLVRLASIV 704
           +   L++L  I 
Sbjct: 489 AACVLLKLYHIT 500


>gi|85817359|gb|EAQ38539.1| conserved hypothetical protein [Dokdonia donghaensis MED134]
          Length = 705

 Score =  382 bits (981), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 222/629 (35%), Positives = 338/629 (53%), Gaps = 47/629 (7%)

Query: 75  HRPIHPYKVVAMAERTPASTSHS-RNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAE 133
           H P+  +    +       T  + ++ +TN L  E SPYLLQHAHNPVDW AW +E  A+
Sbjct: 4   HIPVLAFITAILITSCEGKTDTTMQHDYTNDLIKETSPYLLQHAHNPVDWKAWNDETLAQ 63

Query: 134 ARKRDVPIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMT 193
           A+K +  I +SIGYS+CHWCHVME ESFED  VA+ +N+ F++IKVDREERPDVD VYM 
Sbjct: 64  AKKENKLILVSIGYSSCHWCHVMEHESFEDTLVAQFMNENFINIKVDREERPDVDNVYMN 123

Query: 194 YVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLA 253
            VQ + G GGWPL+    PD +P+ GGTYF  ED      +   L +V D +    + L 
Sbjct: 124 AVQLMTGRGGWPLNAVALPDGRPVWGGTYFSKED------WLNALGQVADIYTSDPNKLV 177

Query: 254 QSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPV 313
           +        L++    + + NK       + L+   E+ S+ +D+R GG   APKF  P 
Sbjct: 178 EYADKLGTGLAQMDLVTPNPNK--PSFVIDTLQTSIEKWSRQWDTRQGGLNRAPKFMMPN 235

Query: 314 EIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVP 373
             + +L ++ +  D        E  + V  TL+ +A GG++D VGGGF RYSVD +WH+P
Sbjct: 236 NYEFLLRYAHQNND-------DEILEYVNTTLEQIAFGGVNDQVGGGFARYSVDTKWHIP 288

Query: 374 HFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAE 433
           HFEKMLYD  QL ++Y +A+  TK+  Y     + L++++R+M    G  +SA DADS  
Sbjct: 289 HFEKMLYDNAQLVSLYSNAYLKTKNPLYKETVYETLEFIKREMTTSQGGFYSALDADSLT 348

Query: 434 TEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 493
            +G    +EGA+YVWT +E+++++G+   LF  +Y +      D  +  + H       V
Sbjct: 349 PDGEL--EEGAYYVWTEEELKNLVGDDFKLFSAYYNIN-----DYGKWENDH------YV 395

Query: 494 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISS 552
           LI  +  +    +  + LE+      + R  L   R SK+ +P LDDK++ SWNGL+   
Sbjct: 396 LIRQDLDTDFVKEHQISLEELTTKKSKWREDLLRFRESKKEKPRLDDKILTSWNGLMTKG 455

Query: 553 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP 612
           +  A ++                 D KE+++ A   A+F+  +L   +   L  ++++G 
Sbjct: 456 YVDAYRVF----------------DEKEFLDAALKNANFVVDNLL-RKDGGLNRTYKDGK 498

Query: 613 SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDP 672
           S    +L+DYA  I   + L+E     +WL  A  L +     F + E   ++ T+ EDP
Sbjct: 499 STINAYLEDYAATIDAFIALFEVTMDEQWLEKAKSLTDYTFTHFQNAENKLFYFTSNEDP 558

Query: 673 SVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           ++  R  E +D   PS NS+   N+  L+
Sbjct: 559 TLSSRNTEFYDNVIPSSNSIMAKNIFTLS 587


>gi|456389199|gb|EMF54639.1| hypothetical protein SBD_4307 [Streptomyces bottropensis ATCC
           25435]
          Length = 686

 Score =  382 bits (980), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 239/627 (38%), Positives = 329/627 (52%), Gaps = 60/627 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYLLQHA NPVDW+ W  EAF EAR+R VP+ LS+GYS+CHWCHVM  ESFE
Sbjct: 7   NRLAHETSPYLLQHADNPVDWWPWSAEAFEEARRRGVPVLLSVGYSSCHWCHVMAHESFE 66

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D   A+ LN  FV+IKVDREERPDVD VYM  VQA  G GGWP++VFL+PD +P   GTY
Sbjct: 67  DGETAEYLNAHFVNIKVDREERPDVDAVYMEAVQAATGQGGWPMTVFLTPDGEPFYFGTY 126

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EALSASASSNKLPDELP 281
           FPP  ++G P F+ +L  V+ AW  +RD +A+     +  L+   L  +A      DEL 
Sbjct: 127 FPPAPRHGMPSFRQVLEGVRAAWADRRDEVAEVAGKIVRDLAGRELKFAAVDVPGEDELA 186

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           Q  L      L++ YD+  GGFG APKFP  + I+ +L H+ +   TG  G      +M 
Sbjct: 187 QALL-----GLTREYDAARGGFGRAPKFPPSMVIEFLLRHAAR---TGSEG----ALQMA 234

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  VY   +  T     
Sbjct: 235 RDTCERMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALLCRVYAHLWRATGSELA 294

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHA 461
             +  +  D++ R++    G   SA DADS +  G  +  EGA+YVWT +++ ++LGE  
Sbjct: 295 RRVALETADFMVRELRTNEGGFASALDADSDDGTGTGKHVEGAYYVWTPEQLTEVLGEED 354

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL---NIL 518
                H++                        + E       AS L +P  + +   + +
Sbjct: 355 ARLAAHHF-----------------------GVTEEGTFEEGASVLQLPQREGVFDADKI 391

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
              R +L   R +RP P  DDKV+ +WNGL +++ A            A F+ P      
Sbjct: 392 ESIRERLLAARVRRPAPGRDDKVVAAWNGLAVAALAET---------GAYFDRP------ 436

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAFLISGLLDLYEFGS 637
                   +A   +R HL DE+  RL  + ++G   A  G L+DYA +  G L L     
Sbjct: 437 DLVDAAIAAADLLVRLHL-DERA-RLARTSKDGRVGANAGVLEDYADVAEGFLALASVTG 494

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
              WL +A  L +     F+D E G  ++T  +   ++ R ++  D A PSG S +    
Sbjct: 495 EGVWLEFAGFLLDHVLVRFVDEESGALYDTASDAEKLIRRPQDPTDNATPSGWSAAAGA- 553

Query: 698 VRLASIVAGSKSDYYRQNAEHSLAVFE 724
             L    A + S+ +R  AE +L V +
Sbjct: 554 --LLGYAAHTGSEPHRTAAERALGVVK 578


>gi|338213486|ref|YP_004657541.1| hypothetical protein [Runella slithyformis DSM 19594]
 gi|336307307|gb|AEI50409.1| protein of unknown function DUF255 [Runella slithyformis DSM 19594]
          Length = 700

 Score =  382 bits (980), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 230/626 (36%), Positives = 326/626 (52%), Gaps = 71/626 (11%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           +NRL  E SPYLLQHAHNPVDW+ WGEEA  +AR  + PI +SIGYS CHWCHVME ESF
Sbjct: 2   SNRLINETSPYLLQHAHNPVDWYPWGEEALTKARTENKPIIVSIGYSACHWCHVMERESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           E E VA ++N  FV IKVDREERPDVD +YM  + A+   GGWPL+VFL PD KP  G T
Sbjct: 62  EKEQVAAVMNADFVCIKVDREERPDVDAIYMDAIHAMGARGGWPLNVFLLPDAKPFYGVT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL-----------------S 264
           Y P ++      +  +L  VK+A+    + L +S     + +                  
Sbjct: 122 YLPAQN------WVQLLGSVKNAFVNHHEELVKSAEGFTDNMLIKETDKYNLHATSPQGD 175

Query: 265 EALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKK 324
           EA  A AS     D+L +       E++   +D+  GG   APKFP P   + +L +   
Sbjct: 176 EADRAEASPAPTLDDLHE-----MFEKIKGHFDTEKGGMDRAPKFPMPSIYKFLLRYYAL 230

Query: 325 LEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ 384
            ++        E  + +  +L  +A GGI+DHVGGG+ RYSVD+ W +PHFEKMLYD GQ
Sbjct: 231 TQN-------PEALRHIELSLNRIALGGIYDHVGGGWARYSVDDEWFIPHFEKMLYDNGQ 283

Query: 385 LANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGA 444
           L ++Y +A++LTK+  Y     + +D+L R+M    G  +SA DADS   EG     EG 
Sbjct: 284 LLSIYSEAYTLTKNELYKSRVYETIDWLEREMTSTEGGFYSALDADS---EGV----EGK 336

Query: 445 FYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 504
           FYVWT  E+  +LG+    F + Y ++ +GN +       +N      +         S 
Sbjct: 337 FYVWTQAELRSVLGDDFEWFSKLYNIRASGNWEHG-----YNHLHLTTISFVPETVEKSQ 391

Query: 505 SKLGMPLEKYLNILGE-------CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARAS 557
            ++G PL   +  L E         +KLF  R  R RP LDDK++ SWNGL++     A 
Sbjct: 392 WRVGPPLNYLMKGLFEKNSTYQAALQKLFVARESRIRPGLDDKILASWNGLMLKGLTDAY 451

Query: 558 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 617
           +    E                ++  +A  +A F++  +     H+L HS++NG +   G
Sbjct: 452 RAFGEE----------------KFKTLALQSAHFLKDKM-TAPNHQLWHSYKNGKASIVG 494

Query: 618 FLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLR 677
           FL+DYA ++ G L LY+     +WL  A++L     E   D E   ++ T      ++ R
Sbjct: 495 FLEDYAAVVDGYLGLYQATFEEQWLDEALKLTAYAIENLYDPEEELFYFTDANAEELIAR 554

Query: 678 VKEDHDGAEPSGNSVSVINLVRLASI 703
            KE  D   P+ NS+   NL  L ++
Sbjct: 555 KKEIFDNVIPASNSLMAHNLFTLGTL 580


>gi|117929090|ref|YP_873641.1| hypothetical protein Acel_1883 [Acidothermus cellulolyticus 11B]
 gi|117649553|gb|ABK53655.1| protein of unknown function DUF255 [Acidothermus cellulolyticus
           11B]
          Length = 658

 Score =  382 bits (980), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 242/615 (39%), Positives = 324/615 (52%), Gaps = 80/615 (13%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   SPYLLQH  NPV+W+ W EEAFAEAR+R+VPI LSIGYS+CHWCHVM  ESFE
Sbjct: 3   NRLATATSPYLLQHKDNPVEWWPWCEEAFAEARRRNVPILLSIGYSSCHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D   A  +N+ FV +KVDREERPD+D VYM   QA+ G GGWPL+ FL+PD +P   GTY
Sbjct: 63  DPATAAFMNEHFVCVKVDREERPDIDAVYMEATQAMTGRGGWPLTCFLTPDGEPFFTGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL-- 280
           FP E + G P F+ +L  V  AW  +   L  +    +  L +        ++L D+L  
Sbjct: 123 FPKEPRAGMPAFRQVLEAVWTAWQSRSADLVAAARRVVAVLQQ-------GSRLTDDLGA 175

Query: 281 -PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
              + L     +L + YD   GGFGSAPKFP    ++ +L +       G  G      +
Sbjct: 176 IDADLLDAAVGELRRQYDPVHGGFGSAPKFPSATTLEFLLRY-------GSLG----AME 224

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           MV  T + MA+GGI+D + GGFHRYSVD  W VPHFEKMLYD  QL  VYL  +  T+  
Sbjct: 225 MVAVTCEHMARGGIYDQLAGGFHRYSVDAAWTVPHFEKMLYDNAQLLGVYLHWWRRTQHQ 284

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG- 458
               I  ++ ++L RD+  P G   +A DAD+   EG T       YVWT  E+ D LG 
Sbjct: 285 LARRIVEEVAEFLLRDLCTPAGGFAAALDADAGGVEGGT-------YVWTLAELRDALGS 337

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
           + A    E + +   GN +            G++VL    D+          LE++  I 
Sbjct: 338 DDAAYAAELFGVTEHGNTE-----------DGRSVLQLAVDAP--------DLERWRRI- 377

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
              R++L  VRS+R +P  DDK+I SWNGL ++S A A  +L                DR
Sbjct: 378 ---RQRLLAVRSRRAQPARDDKIIASWNGLAVASLAEAGFLL----------------DR 418

Query: 579 KEYMEVA-ESAASFIRRHLYDEQTHRLQHSFRNGP-SKAPGFLDDYAFLISGLLDLYEFG 636
              ++ A  SA   I  HL D    RL  S R+G  +   G LDDYA +  GLL L +  
Sbjct: 419 DALVDAAVRSAEYLIDVHLRD---GRLCRSSRDGERNPVDGALDDYANVAQGLLTLAQIR 475

Query: 637 SGTKWLVWAIELQNTQDELFLDR---EGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           S  ++L    EL     E  L     E GG+++T  +   ++ R +   D A PSGNS +
Sbjct: 476 SEARYL----ELAGALLEAILTHFRAEDGGFYDTADDAERLVRRPRTFTDDATPSGNSAA 531

Query: 694 VINLVRLASIVAGSK 708
              L+  A++    +
Sbjct: 532 AHALLTYAALTGSQR 546


>gi|110635801|ref|YP_676009.1| hypothetical protein Meso_3473 [Chelativorans sp. BNC1]
 gi|110286785|gb|ABG64844.1| protein of unknown function DUF255 [Chelativorans sp. BNC1]
          Length = 676

 Score =  381 bits (979), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 248/657 (37%), Positives = 334/657 (50%), Gaps = 79/657 (12%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L  + SPYLLQH  NPV W  W  EA  EAR+ + PI LS+GY+ CHWCHVM  E FE
Sbjct: 7   NLLGEQASPYLLQHRDNPVHWRPWSREALDEARELNRPILLSVGYAACHWCHVMAHECFE 66

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  VA+L+N  FV+IKVDREERPD+D++YMT + A+   GGWPL++FL+P+ KP  GGTY
Sbjct: 67  DNEVAELMNSLFVNIKVDREERPDIDQIYMTALSAMGEQGGWPLTMFLTPEAKPFWGGTY 126

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS--ASASSNKLPDEL 280
           FP   +YGRPGF  +L+ V  AW  K D L +S       +   L+     +SN++P   
Sbjct: 127 FPKRSRYGRPGFIDVLKAVHSAWQTKEDELLRSADTLSIHVRTHLAPMQGTTSNEVP--- 183

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
               LR  AE++   +D + GG   APKFP    + ++  +   LE+  +S      +  
Sbjct: 184 ----LRALAEKIRAVFDPQLGGLRGAPKFPNAPFLDLLWLN--WLENGAESD-----RDT 232

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
           VL TL+ M  GGI+DHVGGG  RYSVD +W VPHFEKMLYD  QL  +   A+  T D  
Sbjct: 233 VLLTLRSMLAGGIYDHVGGGLARYSVDAQWLVPHFEKMLYDNAQLIRLCSYAYGGTHDRL 292

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-- 458
           +     D + +L R+M   GG   S+ DADS   EG    +EG FY+WT  E+ED+LG  
Sbjct: 293 FRVRIEDTVKWLLREMTVEGGGFASSLDADS---EG----EEGKFYLWTRAEIEDVLGVG 345

Query: 459 EHAILFKEHYYLKPT---GNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
           +   L   +    P    GN  L R   P            L+DSS          E+ L
Sbjct: 346 DARELLAIYDLANPEEWEGNPILHRRRHPEV----------LDDSS----------EQRL 385

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
             L +   +L   R  R RP  DDKV+V WNGL I++ A A +                 
Sbjct: 386 RTLLD---RLMAAREARTRPGRDDKVLVDWNGLAIAAIAVAGRQFA-------------- 428

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
             R E++E A  A  F+   L   +  RL HS R      P    DYA +IS  + LY  
Sbjct: 429 --RPEWIEAAARAFRFV---LESMEEGRLPHSIRGEKRLFPALSSDYAAMISAAIALYGA 483

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
                ++  A +  +  D  +LD  G GYF T  +     +R++ D D   PS  +  V 
Sbjct: 484 THDDSYVDQARQWLDKLDAWYLDDAGSGYFLTASDSADTPMRIRGDMDDPIPSATAQIVT 543

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAVFE---TRLKDMAMAVPLMCCAADMLSVPSR 749
            LV LA+ V+GS   Y     +H + V E    R ++ A     + CAA +   P +
Sbjct: 544 ALVHLAA-VSGSHELY-----QHGVRVSEAALARAQNQAYGQLGIICAAALAQRPMK 594


>gi|206603590|gb|EDZ40070.1| Protein of unknown function [Leptospirillum sp. Group II '5-way
           CG']
          Length = 689

 Score =  381 bits (979), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 232/667 (34%), Positives = 350/667 (52%), Gaps = 53/667 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPVDW+ WG+EAF +AR  + P+ LSIGY+ CHWCHVM  ESFE
Sbjct: 3   NRLKEETSPYLRQHAENPVDWYPWGKEAFEKARLEEKPVLLSIGYAACHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVY-MTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
              +AK++N++FV+IKVDREERPD+D++Y M +       GGWPL++FL+P   P  GGT
Sbjct: 63  RPDIAKVMNEFFVNIKVDREERPDLDQIYQMAHTMITRRNGGWPLTMFLTPSQVPFAGGT 122

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP + ++G PGF  +L +++D +   R+ L +     ++ L +    + S+    D  P
Sbjct: 123 YFPAQPRFGLPGFVQVLEQIRDFYRDHREGLEKEDHPILQYLGQTNPVADSTGFELDLSP 182

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
             AL      L   +D  FGGFG APKFP  +++  +    ++    G S  A     M 
Sbjct: 183 SEAL---VNNLKSRFDPEFGGFGGAPKFPHAMDLSYLF---RRFHRKGDSTAA----HMA 232

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             TL  M +GGI DHVGGGF RYSVDERW +PHFEKMLYD   L        S++++  Y
Sbjct: 233 TLTLSAMKRGGIWDHVGGGFARYSVDERWLIPHFEKMLYDNALLLEALALGASVSRNPVY 292

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EH 460
           S    +++ +L R+M    G  +S+ DADS   EG    +EG FYV+ ++EV  IL  E 
Sbjct: 293 SRTAEELVGWLFREMRSEHGVYYSSLDADS---EG----EEGRFYVFQAEEVRSILSDEE 345

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
             +  +HY L           S+P N       L E       + +  +P     + +  
Sbjct: 346 YRVVSKHYGL-----------SEPPNFESHAWHLYEARSIGELSKEFHLPESDIESRIDS 394

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R+KLF  RS R RP LDDK++ SWN L+              A++ +F+  ++G  ++E
Sbjct: 395 ARQKLFTYRSLRVRPGLDDKILASWNALM--------------AKALLFSGRILG--KQE 438

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           +M        ++ R+++      L   +       P +LDDYAFL+  +L+        +
Sbjct: 439 WMTAGRKTIDYMHRNMWKNGV--LMAVYSKKEPFLPAYLDDYAFLLLAVLESIRIDFRPE 496

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
            L +A  + +     F D E GG++ T     +++ R K  HDGA PSGN+ +V  L+ L
Sbjct: 497 DLSFATAIADVLLTEFYDPESGGFYFTGKNHEALIHRPKNGHDGALPSGNAAAVQGLLWL 556

Query: 701 ASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSS 760
            ++        Y   A+ +L ++  ++K+       M  A +  S    + V+L+    +
Sbjct: 557 GTLTGHLP---YTSAADQTLRLYFAQMKEQPAGYTTMISALETYS--DSQPVILLAGPQA 611

Query: 761 VDFENML 767
            D++N +
Sbjct: 612 EDWKNTI 618


>gi|409096974|ref|ZP_11216998.1| hypothetical protein PagrP_00615 [Pedobacter agri PB92]
          Length = 686

 Score =  381 bits (978), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 236/659 (35%), Positives = 332/659 (50%), Gaps = 65/659 (9%)

Query: 77  PIHPYKVVAMAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARK 136
           PIH Y ++AM      S  HS     N L    SPYLLQHA+NPV W+ WG EA  +A+ 
Sbjct: 5   PIHFYTLIAM------SNVHSE---PNSLINASSPYLLQHAYNPVQWYEWGVEALEKAKA 55

Query: 137 RDVPIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQ 196
            +  I +SIGYS CHWCHVME ESFE+  VA+++N  FV IKVDREERPD+D++YM  +Q
Sbjct: 56  ENKLILVSIGYSACHWCHVMERESFENFEVAEVMNKHFVCIKVDREERPDIDQIYMYAIQ 115

Query: 197 ALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSG 256
            + G GGWPL+    PD +P+ GGTYF   D      +  IL  V   W  + +   Q  
Sbjct: 116 LMTGSGGWPLNCICLPDQRPIYGGTYFRKND------WVNILENVAALWSNEPEKAIQYA 169

Query: 257 AFAIEQL--SEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVE 314
                 +  SE +  S +     DE     L    E   + +D  FGG+  APKFP P  
Sbjct: 170 ERLTSGIRDSEKIIPSVTKEDYTDE----HLTEIIEPWKRHFDISFGGYNRAPKFPLPNN 225

Query: 315 IQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPH 374
              +L +    +D             V  TL+ M++GGI+D +GGGF RYSVD++WHVPH
Sbjct: 226 WVFLLRYGYLKDDESVF-------TAVCHTLEEMSRGGIYDQIGGGFARYSVDDKWHVPH 278

Query: 375 FEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET 434
           FEKMLYD  QL ++Y +A+  TK   +     + ++++  +M  P G  +SA DADS   
Sbjct: 279 FEKMLYDNAQLISLYAEAYQCTKFNSFKQTAVESINWVFNEMTSPEGLFYSALDADS--- 335

Query: 435 EGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL 494
           EG     EG FYVW   E  D+LG+ A L  E++ +   GN           E +  N+L
Sbjct: 336 EGI----EGKFYVWDKTEFYDLLGDDAQLLGEYFNITEEGNW----------EEEQTNIL 381

Query: 495 IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFA 554
            ++       SK  +  E     +   + KL ++R++R RP LDDK + +WNG++I + A
Sbjct: 382 RKILSDDDILSKHNIDAETLYTKVESAKAKLLNIRNQRIRPGLDDKCLTAWNGMMIKALA 441

Query: 555 RASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSK 614
            A+ +L  +                 Y + A +AA FI  +L    +  L  + +NG + 
Sbjct: 442 DAATVLSHDL----------------YYQKAAAAARFILVNL-KTASGGLYRNCKNGKAS 484

Query: 615 APGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSV 674
              FLDDYAFLI  L+ LYE+     WL  A    +   E F D E   +F T+    S+
Sbjct: 485 ITAFLDDYAFLIEALIALYEYDFDENWLNEAKSFTDYVLENFSDSESPMFFYTSATGESL 544

Query: 675 LLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA 733
           + R  E  D   P+ NS    NL +L  +      + Y   A   LA  + ++K    A
Sbjct: 545 IARKHEVMDNVIPASNSTMAQNLTKLGLLF---DLEGYNNKAAEMLAAVQPKIKTYGSA 600


>gi|350269357|ref|YP_004880665.1| hypothetical protein OBV_09610 [Oscillibacter valericigenes
           Sjm18-20]
 gi|348594199|dbj|BAK98159.1| hypothetical protein OBV_09610 [Oscillibacter valericigenes
           Sjm18-20]
          Length = 642

 Score =  380 bits (977), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 227/603 (37%), Positives = 319/603 (52%), Gaps = 78/603 (12%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYLLQHA+NPVDW+ W +EAF +A + + P+FLSIGYS+CHWCHVM  ESFE
Sbjct: 22  NRLIHEKSPYLLQHAYNPVDWYPWCQEAFKKATRENKPVFLSIGYSSCHWCHVMAKESFE 81

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA +LN  FVS+KVDREERPD+D +YM   Q   GGGGWP SVF++PD KP   GTY
Sbjct: 82  DETVAGVLNKSFVSVKVDREERPDIDNIYMRVCQTFTGGGGWPTSVFMTPDQKPFFAGTY 141

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP      +  F  +L  +++ W + +  L   G     Q++E L+ S  S + P   P 
Sbjct: 142 FP------KAPFLDLLEVIREKWAEDKQALLNQG----NQITETLTHSTHSPQTPQTAP- 190

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             ++     L +++D+ FGGFG APKFP P  + ++L  +  + +               
Sbjct: 191 --IKAAVSALKETFDNEFGGFGRAPKFPTPHILYLLLKTAPDMAEK-------------- 234

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  M KGGI D +G GF RYS D  W VPHFEKMLYD   LA  YL AF  T    Y 
Sbjct: 235 -TLIQMYKGGIFDQIGFGFSRYSTDRFWLVPHFEKMLYDNALLATAYLMAFEQTGRELYR 293

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE-HA 461
            +    L Y+ RD+  P G  FSA+DADS         +EG +YV+  +E+  +LGE   
Sbjct: 294 TVAEKTLLYMERDLGSPEGGFFSAQDADS-------DGEEGKYYVFKPEELTALLGEAEG 346

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
             F  ++ +   GN            F+G ++   +N+SS   S     ++K+L      
Sbjct: 347 RRFNAYFGITQNGN------------FEGYSIPNLINNSSMDDS-----VDKFL------ 383

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
             K+++ R  R     D KV+ SWN L +++ A A +I+                 ++ Y
Sbjct: 384 -PKVYEYRKSRTSLRTDQKVLTSWNALALAACANAYRII----------------GKRAY 426

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
           ++ A     F+ R + D  T  +     +G     GFLDDYAF I  L+ L++      +
Sbjct: 427 LDTALKTFGFMEREVTDGDT--VFCGVTDGVRGGVGFLDDYAFYIYALICLHQATQDPAF 484

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           L+ A +LQ      + D + GG+F +   +  ++   KE +DGA PSGNSV   NL RL 
Sbjct: 485 LIRAQDLQIKAISEYFDDQNGGFFFSGKSNEKLIFNPKETYDGAIPSGNSVMAYNLARLY 544

Query: 702 SIV 704
           ++ 
Sbjct: 545 ALT 547


>gi|300113281|ref|YP_003759856.1| hypothetical protein Nwat_0572 [Nitrosococcus watsonii C-113]
 gi|299539218|gb|ADJ27535.1| protein of unknown function DUF255 [Nitrosococcus watsonii C-113]
          Length = 694

 Score =  380 bits (977), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 245/663 (36%), Positives = 364/663 (54%), Gaps = 48/663 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L  + SPYLLQH  NPV W+ WGEEA   A+  D PI LSIGYS CHWCHVM  ESFE
Sbjct: 8   NHLQGQTSPYLLQHVDNPVAWYPWGEEALVRAQGEDKPILLSIGYSACHWCHVMAHESFE 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPLSVFLSP-DLKPLMGG 220
           +   A ++N+ F++IKVDREERPD+D++Y    Q L G  GGWPL++FL P    P  GG
Sbjct: 68  NPETAAVMNEHFINIKVDREERPDLDQIYQLAQQMLTGRPGGWPLTMFLEPVKQAPFFGG 127

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFPPE+++G PGFK +L++V + +  +R+++ QS    +    E L   +S+ ++ + L
Sbjct: 128 TYFPPEERHGLPGFKDLLQRVAEYFHTRREVI-QSQNERLLDAFEKLDGRSSAAEV-EGL 185

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
            +  L+   +QL++++DSR+GGF  APKFP P  I+  L  +     T    E  +   M
Sbjct: 186 NRAPLQAAHQQLAQAFDSRYGGFRGAPKFPNPSIIERCLRDAHGEHIT--EDEKQQALTM 243

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
              TL+ MA+GGI+D +GGGF RYSVDE+W +PHFEKMLYD GQL  +Y DA+ L  +  
Sbjct: 244 ARLTLEQMAQGGIYDQLGGGFCRYSVDEKWRIPHFEKMLYDNGQLLVLYRDAYRLWGNGI 303

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
           +  I  +   ++ R+M  P G  +S+ DADS   EG     EG FYVWT ++V  +L + 
Sbjct: 304 FRRILEETGHWVVREMQSPEGGYYSSLDADS---EG----HEGKFYVWTREQVRALLDDE 356

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
                  Y+           +  P N F+G   L       A A ++ +P       L  
Sbjct: 357 KYTLAVRYF----------SLDQPAN-FEGHWHLYAAMTPEALAEEMKVPAPGLQEQLTA 405

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            ++KLF  R  R RP  DDK++ +WN L+I   A A + L           PV       
Sbjct: 406 AKQKLFAAREARIRPGRDDKILTAWNSLMIKGMAAAGQALAQ---------PV------- 449

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           ++  AE A  F+R HL+  Q  RL  S+++G ++  G+LDDYAFL+  LL+L +      
Sbjct: 450 FIASAEKAVDFVRAHLW--QKGRLLVSYKDGRAQHQGYLDDYAFLLDALLELLQVRWRDG 507

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
            L +A++L       F D+  GG++ T  +  +++ R     D A P+GN +   +L+RL
Sbjct: 508 DLAFAVDLAEAVLGHFEDKAQGGFYFTADDHETLIHRPVPLMDNATPAGNGILAWSLLRL 567

Query: 701 ASIVAGSKSDYYRQNAEHSL-AVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKS 759
             ++   +   Y + AE++L A +E+  +       L+    + L+ P  + V+L G   
Sbjct: 568 GHLLGEMR---YLKAAENTLKAAWESLQQTPHAHCSLLKALEEWLTPP--QIVILRGSGE 622

Query: 760 SVD 762
            ++
Sbjct: 623 ELE 625


>gi|344203206|ref|YP_004788349.1| hypothetical protein [Muricauda ruestringensis DSM 13258]
 gi|343955128|gb|AEM70927.1| hypothetical protein Murru_1888 [Muricauda ruestringensis DSM
           13258]
          Length = 699

 Score =  380 bits (977), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 218/615 (35%), Positives = 322/615 (52%), Gaps = 52/615 (8%)

Query: 88  ERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGY 147
           ++ P   +H   +HTN L  E SPYLLQHAHNPV+W AW  +    A+K D  + +SIGY
Sbjct: 18  KQKPKEVTH---EHTNALIHETSPYLLQHAHNPVNWEAWHPDVLERAKKEDKLLLISIGY 74

Query: 148 STCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 207
           + CHWCHVME E FED  VA+++N  FV+IK+DREERPDVD++YM  +Q + G GGWPL+
Sbjct: 75  AACHWCHVMEKECFEDAEVAEVMNKNFVNIKIDREERPDVDQIYMDAIQMISGQGGWPLN 134

Query: 208 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 267
           +   PD +P  G TY P      +  +   L ++ + + K +  + Q  A     L+  L
Sbjct: 135 IVALPDGRPFWGATYVP------KDNWIKSLEQLAELYKKDKPRVTQYAA----DLANGL 184

Query: 268 SAS--ASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKL 325
            A     ++K  D    + L +  +  ++ +D+  GG   APKF  P     +L+++  +
Sbjct: 185 HAINLVENDKDSDLYSLDQLDVAIQNWTQYFDTFLGGHKRAPKFMMPNNWDFLLHYATAV 244

Query: 326 EDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQL 385
                  +  E  + V  TL  MA GG++DHVGGGF RY+VD +WHVPHFEKMLYD GQL
Sbjct: 245 -------DKPEIMEFVDTTLTRMAYGGVYDHVGGGFSRYAVDTKWHVPHFEKMLYDNGQL 297

Query: 386 ANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
            ++Y  A++ TK+  Y  +  + +++++ + +   G  +S+ DADS +        EGA+
Sbjct: 298 TSLYAKAYAATKNELYKNVVEETINFVQEEFLDRSGGFYSSLDADSLDENAELV--EGAY 355

Query: 446 YVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 505
           YVWT KE+  +LG+   LF+E++ +   G  +           +   VLI        A 
Sbjct: 356 YVWTKKELSGLLGDDFELFQEYFNINSYGYWE-----------EENYVLIRDKSDEEVAD 404

Query: 506 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 565
           K  + + +    + E   KL   R KRP+P LDDK++ SWNGL++     A + L  E  
Sbjct: 405 KFNITIPELKTTITESLAKLKGEREKRPKPRLDDKILTSWNGLMLKGLVDAYRYLGEE-- 462

Query: 566 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 625
                         +Y+ +A   A FI R +  +    L  + + G S   GFL+DYA +
Sbjct: 463 --------------DYLNLALKNAEFIEREMI-KSDGSLYRNHKEGKSTINGFLEDYATV 507

Query: 626 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 685
           I     LYE     KWL  A  L     + F D   G +F T+ ED S++ R  E  D  
Sbjct: 508 IDAYFSLYEATFDEKWLDLAKNLLEYSKKHFWDETSGMFFYTSDEDQSLIRRTIEVDDNV 567

Query: 686 EPSGNSVSVINLVRL 700
             S NS+  INL + 
Sbjct: 568 ISSSNSIMAINLYKF 582


>gi|345001747|ref|YP_004804601.1| hypothetical protein SACTE_4222 [Streptomyces sp. SirexAA-E]
 gi|344317373|gb|AEN12061.1| protein of unknown function DUF255 [Streptomyces sp. SirexAA-E]
          Length = 673

 Score =  380 bits (977), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 236/626 (37%), Positives = 329/626 (52%), Gaps = 57/626 (9%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRL    SPYLLQHA NPVDW+ W  EAF EAR+R+VP+ LS+GYS CHWCHVM  ESF
Sbjct: 2   ANRLTQTTSPYLLQHADNPVDWWPWSPEAFEEARRRNVPVLLSVGYSACHWCHVMAHESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           ED  +A  LN+ FV +KVDREERPDVD VYM  VQA  G GGWP++VFL+ D +P   GT
Sbjct: 62  EDAALAAYLNEHFVPVKVDREERPDVDAVYMEAVQAATGQGGWPMTVFLTADAEPFYFGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFPPE ++G P F+ +L  V  AW  +R  +A+     +  L+   S +   + +P E P
Sbjct: 122 YFPPEPRHGMPSFRQVLEGVTAAWTGRRGEVAEVAGRIVTDLA-GRSLAHGGDGVPGE-P 179

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           + A  L A  LS+ YD + GGFG APKFP  + ++ +L H  +   TG  G      +M 
Sbjct: 180 ELAQALLA--LSREYDEKHGGFGGAPKFPPSMAVEFLLRHHAR---TGAEG----ALEMA 230

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             T   MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  VY   +  T     
Sbjct: 231 ADTCAAMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALLCRVYAHLWRATGSDLA 290

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHA 461
             +  +  D++ R++    G   SA DADS +  G  R  EGA+YVWT +++ ++LGE  
Sbjct: 291 RRVALETADFMVRELRTTEGGFASALDADSEDARG--RHVEGAYYVWTPEQLREVLGEDD 348

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
             F   Y+           +S+     +G +VL          ++ G P E    +  + 
Sbjct: 349 AAFAAAYF----------GVSEEGTFEEGSSVL--------RLARTG-PDEDPARV-ADV 388

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           R +L   R  R RP  DDK++ +WNGL +++ A                      DR + 
Sbjct: 389 RARLLAARGDRVRPERDDKIVAAWNGLAVAALAETGAYF----------------DRPDL 432

Query: 582 MEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLISGLLDLYEFGSGT 639
           +E A  AA   +R H+ D  T RL  + ++G      G L+DY  +  G L L       
Sbjct: 433 IERATEAADLLVRVHMGD--TARLCRTSKDGRAGDNAGVLEDYGDVAEGFLALASVTGEG 490

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
            WL +A  L +   E F   E G  ++T  +   ++ R ++  D A P+G + +   L+ 
Sbjct: 491 AWLDFAGFLLDIVLERFTG-ENGQLYDTADDAEQLIRRPQDPTDSATPAGWTAAAGALL- 548

Query: 700 LASIVAGSKSDYYRQNAEHSLAVFET 725
             S  A + S+ +R  AE +L V + 
Sbjct: 549 --SYAAHTGSEAHRTAAEGALGVVKA 572


>gi|238062793|ref|ZP_04607502.1| hypothetical protein MCAG_03759 [Micromonospora sp. ATCC 39149]
 gi|237884604|gb|EEP73432.1| hypothetical protein MCAG_03759 [Micromonospora sp. ATCC 39149]
          Length = 703

 Score =  380 bits (977), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 232/635 (36%), Positives = 337/635 (53%), Gaps = 56/635 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA+  SPYLLQHA NPVDW+ W +EAFAEAR+RDVP+ +S+GY+ CHWCHVM  ESFE
Sbjct: 2   NRLASATSPYLLQHADNPVDWWPWCDEAFAEARRRDVPVLVSVGYAACHWCHVMAHESFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D GV KLLND FV+IKVDREERPDVD VYMT  QA+ G GGWP++VF +PD  P   GTY
Sbjct: 62  DAGVGKLLNDGFVAIKVDREERPDVDAVYMTATQAMTGQGGWPMTVFATPDGTPFFCGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP      +P F  +L  V  AW ++R+ + + G+  +E +  A +    +         
Sbjct: 122 FP------KPNFVRLLESVGTAWREQREAVLRQGSAVVEAIGGAQAVGGPTAP----FTA 171

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L   A +L++ YD   GGFG APKFP  + +  +L H ++   TG    ++E  ++  
Sbjct: 172 ELLDAAAARLAREYDRDNGGFGGAPKFPPHLNLLFLLRHHQR---TG----SAESLEIAR 224

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            T + MA+GGIHD + GGF RYSVD  W VPHFEKMLYD   L  VY   + LT D    
Sbjct: 225 HTAEAMARGGIHDQLAGGFARYSVDAHWTVPHFEKMLYDNALLLRVYTHLWRLTGDPLAR 284

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-A 461
            + RD   +L  ++  PG    SA DAD+   EG T       Y WT  ++ ++LGE   
Sbjct: 285 RVARDTARFLADELHRPGEGFASALDADTEGVEGLT-------YAWTPAQLVEVLGESDG 337

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE--------K 513
               + + + P+G       S P      +   +E      S  +L   ++        +
Sbjct: 338 RWAADLFAVTPSGTFAPHSASAPQGGTPDRRKGVE---HGTSVLRLARDVDDADPAIRGR 394

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS------EAESA 567
           + +++G    +L   R  RP+P  DDKV+ +WNGL I++ A   +++++      +A++ 
Sbjct: 395 WRDVVG----RLLAARDTRPQPARDDKVVAAWNGLAITALAEFVRLVEAVGTGDEQADAN 450

Query: 568 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 627
           +     + +D     + AE  A+    HL D +  R+      G  +  G L+DY  +  
Sbjct: 451 LLEGVTIVAD-GALRDAAEHLAAV---HLVDGRLRRVSRDRVVG--EPAGVLEDYGCVAE 504

Query: 628 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 687
               +++     +WL  A +L +T    F    GGG+++T  +   ++ R  +  D A P
Sbjct: 505 AFCAMHQLTGEGRWLELAGDLLDTALARFA-APGGGFYDTADDAERLVTRPADPTDNATP 563

Query: 688 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 722
           SG S  V  LV  A++   S    YR+ AE +LA 
Sbjct: 564 SGRSAIVAALVTYAAL---SGQPRYREVAEAALAT 595


>gi|326800931|ref|YP_004318750.1| hypothetical protein [Sphingobacterium sp. 21]
 gi|326551695|gb|ADZ80080.1| protein of unknown function DUF255 [Sphingobacterium sp. 21]
          Length = 672

 Score =  380 bits (976), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 219/624 (35%), Positives = 334/624 (53%), Gaps = 62/624 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L  E SPYL QH HNPVDW+ WG+EA ++A+  +  + +SIGYS CHWCHVME ESFE
Sbjct: 3   NHLQNESSPYLKQHQHNPVDWYPWGDEALSKAKAENKLLIVSIGYSACHWCHVMERESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           ++ VA+++N  ++SIKVDREERPD+D++YMT VQ +   GGWPL+    PD +P+ GGTY
Sbjct: 63  NKEVAQVMNRHYISIKVDREERPDIDQIYMTAVQLMTNSGGWPLNCICLPDGRPVYGGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASAS--SNKLPDEL 280
           F P D      +  +L +V+  W  + +   +      E+L++ ++ S +   +K+P++ 
Sbjct: 123 FRPAD------WVNVLNQVQALWANEPETAIEYA----EKLAQGITESETFKISKIPEKY 172

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
            ++ L+   +   +++D   GG+  APKFP P      L +       G     ++  + 
Sbjct: 173 SEDDLKEIVKPWQQTFDPIDGGYKRAPKFPLPNNWLFFLRY-------GHLANDADILEH 225

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
             FTLQ +A GG++D VGGGF RY+VD +WH+PHFEKMLYD  QL ++Y +A+    +  
Sbjct: 226 THFTLQHIAAGGLYDQVGGGFARYAVDGQWHIPHFEKMLYDNAQLISLYAEAYLQKPEPL 285

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
           Y  +  + L ++ R+M    G  +SA DADS   EG     EG +Y +   E++++LG+ 
Sbjct: 286 YKRVVEETLQWVDREMTSAEGAFYSALDADS---EGV----EGKYYTFQQDEIDNLLGKD 338

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
           A LF  ++ +   GN    +           NVL    D+   A + G   E++   L +
Sbjct: 339 ADLFISYFSITAAGNWPEEKT----------NVLKTRLDADKLAEQAGYSKEEWETYLKD 388

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            ++K+   R +R RP LD+K++ SWN +++ ++  A +                  ++KE
Sbjct: 389 IKKKIRHYREQRIRPGLDNKILTSWNAMMLKAYIDAYRTF----------------NKKE 432

Query: 581 YMEVAESAASFIRRHLYDEQ---THRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGS 637
           Y+ VAE  A FI R L  E+    H+ Q  F+        FLDDYAF+I   + LYE   
Sbjct: 433 YLTVAERNAHFILRKLITEEGTLLHQPQTPFKT----ITAFLDDYAFVIEAFIALYEVTF 488

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
              WL  A  L +     F DR+ G ++ T+     ++ R  E  D   PS NSV    L
Sbjct: 489 NKAWLDQAKSLADYTLAQFYDRQAGAFYYTSDLTEVLITRKFEIMDNVIPSSNSVMAHQL 548

Query: 698 VRLASIVAGSKSDYYRQNAEHSLA 721
            +L  I   S    Y++ A   LA
Sbjct: 549 NKLGVIFEDST---YKEIAAQLLA 569


>gi|153953760|ref|YP_001394525.1| hypothetical protein CKL_1135 [Clostridium kluyveri DSM 555]
 gi|219854377|ref|YP_002471499.1| hypothetical protein CKR_1034 [Clostridium kluyveri NBRC 12016]
 gi|146346641|gb|EDK33177.1| Conserved hypothetical protein [Clostridium kluyveri DSM 555]
 gi|219568101|dbj|BAH06085.1| hypothetical protein [Clostridium kluyveri NBRC 12016]
          Length = 633

 Score =  380 bits (976), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 214/610 (35%), Positives = 329/610 (53%), Gaps = 62/610 (10%)

Query: 149 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 208
           TCHWCHVM  ESF+D  VA++LN +F+S+KVDREERPDVD +YM   Q++ G GGWPL++
Sbjct: 8   TCHWCHVMAKESFQDNEVAEILNKYFISVKVDREERPDVDSIYMKVCQSITGSGGWPLTI 67

Query: 209 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 268
            ++P+ KP   GTYFP  +     G   IL  ++ AW   +  L + G  ++  +   L+
Sbjct: 68  IMTPEQKPFFAGTYFPKNNVGEALGLIAILEYIQKAWKDNKAQLLKEGD-SLLDIINTLN 126

Query: 269 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 328
            ++S      EL Q+ L+    +  +++D+ +GGFG  PKFP    +  +L +  K +D 
Sbjct: 127 KNSSG-----ELSQDILKKAFLEFKQNFDTLYGGFGGYPKFPSAHNLLFLLRYFHKTKD- 180

Query: 329 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 388
                 +   +MV  TL+ M +GG++DH+G GF RYSVD +W +PHFEKMLYD   +A  
Sbjct: 181 ------AFALEMVEKTLESMYRGGMYDHIGYGFSRYSVDRKWLIPHFEKMLYDNALIAMA 234

Query: 389 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 448
           YL+ F +T +  Y+ +  +I +Y+ RDM    G  +SAEDADS   EG    +EG FY+W
Sbjct: 235 YLETFQVTGNKKYAKVAEEIFEYVLRDMTSKEGGFYSAEDADS---EG----EEGKFYMW 287

Query: 449 TSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 507
           + +E++DILG E    F  ++ +   GN            F+GKN+   + +S       
Sbjct: 288 SQEEIKDILGQEQGSKFCCYFNVTSQGN------------FRGKNIPNLIGNS------- 328

Query: 508 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 567
              LE+ +  +  CR KLF  R KR  PH DDK++ SWNGL+I++ A A ++L       
Sbjct: 329 --ILEEDVQFIKNCREKLFKYREKRVHPHKDDKILTSWNGLMIAAMALAGRVL------- 379

Query: 568 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 627
                    +  +Y   A+ +  FI ++L   +  RL   +R G S   G+ DDYAFLI 
Sbjct: 380 ---------NNSKYTLAAKKSVDFIYKNLI-RKDGRLLARYREGDSSFLGYADDYAFLIW 429

Query: 628 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 687
           GL++LYE     ++L  A+EL     E+F D E GG+F    +   +++R KE +DG  P
Sbjct: 430 GLIELYETTYNPEYLKNALELNQNFLEIFWDSENGGFFLYGKDSEKLIIRPKEIYDGPTP 489

Query: 688 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 747
            GNS + +NL+RL+ +    +   +    +     F   ++   ++      A      P
Sbjct: 490 CGNSAAALNLLRLSYLATSYE---FEDKVKQLFENFADEIESSPISCSFSLVALLFSKYP 546

Query: 748 SRKHVVLVGH 757
            R+ ++  G 
Sbjct: 547 VRQIIISAGE 556


>gi|418471574|ref|ZP_13041379.1| hypothetical protein SMCF_4347 [Streptomyces coelicoflavus ZG0656]
 gi|371547815|gb|EHN76170.1| hypothetical protein SMCF_4347 [Streptomyces coelicoflavus ZG0656]
          Length = 680

 Score =  380 bits (976), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 251/666 (37%), Positives = 345/666 (51%), Gaps = 78/666 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   SPYLLQHA NPVDW+ W  +AF EAR+RDVP+ LS+GYS CHWCHVM  ESFE
Sbjct: 3   NRLAQATSPYLLQHAENPVDWWPWETDAFEEARRRDVPVLLSVGYSACHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D   A+ LN  FVS+KVDREERPDVD VYM  VQA  G GGWP++VFL+PD +P   GTY
Sbjct: 63  DGPTAEYLNSHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMTVFLTPDAEPFYFGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EALSASASSNKLPDELP 281
           FPPE ++G P F+ +L+ V+ AW ++RD +++     +  L+   +S   +     ++L 
Sbjct: 123 FPPEPRHGMPSFRQVLQGVQQAWAERRDEVSEVAGKIVRDLAGREISYGDAEAPGEEQLG 182

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           Q  L      L++ YD++ GGFG APKFP  + I+ +L H  +   TG  G      +M 
Sbjct: 183 QALL-----GLTREYDAQRGGFGGAPKFPPSMAIEFLLRHHAR---TGAEG----ALQMA 230

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             T + MA+GG++D +GGGF RYSVD  W VPHFEKMLYD   L  VY   +  T     
Sbjct: 231 ADTCERMARGGLYDQLGGGFARYSVDRDWVVPHFEKMLYDNALLCRVYAHLWRATGSDLA 290

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EH 460
             +  +  D++ R++    G   SA DADS   +G  +  EGA+YVWT  ++ ++LG E 
Sbjct: 291 RRVALETADFMVRELRTAEGGFASALDADS--DDGTGKHVEGAYYVWTPAQLTEVLGAED 348

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL---NI 517
           A L  +++ +   G  +       H                  AS L +P ++ +     
Sbjct: 349 AELAAQYFGVTEEGTFE-------HG-----------------ASVLQLPQQEGVFDAAR 384

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
           +   R +L   R  RP P  DDKV+ +WNGL I++ A            A F  P     
Sbjct: 385 IASVRERLLAARDGRPAPGRDDKVVAAWNGLAIAALAET---------GAYFERP----- 430

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLISGLLDLYEFG 636
                    +A   +R HL DEQ  R+  + ++G P    G L+DYA    G L L    
Sbjct: 431 -DLVEAAVAAADLLVRLHL-DEQV-RITRTSKDGRPGANAGVLEDYADAAEGFLALASVT 487

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSGNSVSVI 695
               WL +A  L +     F D  G G    T  D   L+R  +D  D A PSG S +  
Sbjct: 488 GEGVWLDFAGFLLDHVLTRFTD--GSGSLYDTAADAEQLIRRPQDPTDNATPSGWSAAAG 545

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL-----MCCAADMLSVPSRK 750
            L+  A   A + S+ +R  AEH+L V    +K +   VP      +  A  +L  P  +
Sbjct: 546 ALLTYA---AHTGSEPHRTAAEHALGV----VKALGPRVPRFIGWGLAAAEALLDGP--R 596

Query: 751 HVVLVG 756
            V +VG
Sbjct: 597 EVAVVG 602


>gi|340619141|ref|YP_004737594.1| hypothetical protein zobellia_3176 [Zobellia galactanivorans]
 gi|339733938|emb|CAZ97315.1| Conserved hypothetical membrane protein [Zobellia galactanivorans]
          Length = 703

 Score =  380 bits (976), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 229/610 (37%), Positives = 329/610 (53%), Gaps = 65/610 (10%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K+TN LA E SPYLLQHAHNPV+W AW +EA  +A+K +  + +SIGYS+CHWCHVME E
Sbjct: 36  KYTNALANETSPYLLQHAHNPVNWRAWSQEALDDAKKENKLVLVSIGYSSCHWCHVMEDE 95

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           +FE+E VAK++N+ F++IKVDREERPDVD+VYMT +Q + G GGWPL+V   P+ KPL G
Sbjct: 96  TFENEEVAKIMNENFINIKVDREERPDVDQVYMTALQLISGSGGWPLNVITLPNGKPLYG 155

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLP-- 277
           GTY      + R  +  +L K+ +        L ++     E+ S+ ++A  +   L   
Sbjct: 156 GTY------HTREQWMQVLTKISE--------LYKNDPKKAEEYSDMVAAGIAEANLVEP 201

Query: 278 ----DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGE 333
               + + + AL+      S ++D   GG     KF  P  +  +L ++    D      
Sbjct: 202 AKGFESITKEALKTSVANWSPNWDLEEGGEKGVQKFMIPSNLSFLLDYAVLTGD------ 255

Query: 334 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
             + ++ V  TL  MA GG++D +GGGF+RYS D  W VPHFEKMLYD  Q+ ++Y  A+
Sbjct: 256 -DKAKRHVRNTLDKMALGGVYDQIGGGFYRYSTDAFWKVPHFEKMLYDNAQVLSLYSKAY 314

Query: 394 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
           +L KD  Y  +  + +D+L R+M    G   +A DADS   EG    +EG FYVW  +E+
Sbjct: 315 TLFKDDAYKNVVWETIDFLDREMKDTNGGYHAALDADS---EG----EEGKFYVWKEEEL 367

Query: 454 EDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
           + +LGE   LF  +Y +      +            GK VL    D +    +  +   K
Sbjct: 368 KSVLGEGFELFSAYYNINKEAVWE-----------DGKYVLHRKVDDAEFVKEHDIEQGK 416

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
              I  E  +KL   R+KR  P  DDK+I SWN L+++ F  A K               
Sbjct: 417 LNFIKSEWNKKLLAERNKRVFPRSDDKIITSWNALLVNGFVDAYKAF------------- 463

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
               +K ++E AES  SFIR + Y  Q  +L H+F+ G  +  GF++DYAF+I   L+LY
Sbjct: 464 ---GQKRFLEKAESVFSFIRSNAY--QNGKLVHTFKKGSKRKEGFIEDYAFMIDASLELY 518

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
                T++L +A EL    +  F D   G Y    G D  ++ R+ +  DG  PS N+V 
Sbjct: 519 GLTLNTEYLDFAKELNAKAEAGFADEASGMYHYNEGND--LIARIIKTDDGVLPSPNAVM 576

Query: 694 VINLVRLASI 703
             NL RL  +
Sbjct: 577 AHNLFRLGHL 586


>gi|227537485|ref|ZP_03967534.1| possible thioredoxin [Sphingobacterium spiritivorum ATCC 33300]
 gi|227242622|gb|EEI92637.1| possible thioredoxin [Sphingobacterium spiritivorum ATCC 33300]
          Length = 672

 Score =  380 bits (975), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 218/613 (35%), Positives = 315/613 (51%), Gaps = 57/613 (9%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           +N+L  EHSPYL QHAHNPV W  WGEEA  +A+  +  I +SIGYS CHWCHVME ESF
Sbjct: 2   SNQLQFEHSPYLKQHAHNPVHWMPWGEEALTKAKTENKLIIISIGYSACHWCHVMERESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           E++ +A+ +N ++V +K+DREERPD+D++YMT VQ +   GGWPL+    PD +P+ GGT
Sbjct: 62  ENDAIAQTMNKFYVPVKIDREERPDIDQIYMTAVQLMTNAGGWPLNCICLPDGRPIYGGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YF P D      ++ IL ++   W+++  +  +        + +  S     N +PD+  
Sbjct: 122 YFKPHD------WQNILLQIAQMWEEQPQVAIEYATKLTNGIQQ--SERLPINPIPDQYD 173

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM- 340
            + L          +D++ GG+  APKFP P     +L          + G  +  +K+ 
Sbjct: 174 SSDLSAIITPWVALFDTKDGGYNRAPKFPLPNNWIFLL----------RYGVLAGDEKII 223

Query: 341 --VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
             V FTLQ MA GGI+D +GGGF RYSVD  WH+PHFEKMLYD GQL +++ +A+     
Sbjct: 224 DHVHFTLQKMASGGIYDQIGGGFARYSVDPYWHIPHFEKMLYDNGQLLSLFSEAYQQRPS 283

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
            FY  I ++ + +  R+M+ P    + A DADS   EG     EG +Y ++  E+EDILG
Sbjct: 284 PFYKRIVQETIQWANREMLAPNNGFYCALDADS---EGV----EGKYYSFSKSEIEDILG 336

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
           E A LF  ++ +   GN             +  N+ I   D+   A   G   E++   L
Sbjct: 337 EDAPLFISYFNITEEGNW----------AEESTNIPILDPDADQMALDAGYSAEEWETCL 386

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
            E + KL+  R  R RP LD K + +WN L++     A +I                 D 
Sbjct: 387 AEAKEKLYSYRETRIRPGLDHKQLATWNALMLKGLTDAYRIF----------------DN 430

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
             Y++ A   A FI   L  +   R+ H  ++   +  GFLDDYAF     + LYE    
Sbjct: 431 SSYLDTAIKNAHFIIDELI-KSDGRILHQPKDANREIFGFLDDYAFTTEAFIALYEATFD 489

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
            KWL  A +L +   ELF D     ++ T      ++ R  E  D   P+  S  V+ L 
Sbjct: 490 EKWLDLARQLADKALELFYDSNQKTFYYTADSSGELIARKSEIMDNVIPASTSTIVLQLK 549

Query: 699 RLASIVAGSKSDY 711
           +L  +    K DY
Sbjct: 550 KLGLLF--DKEDY 560


>gi|385681202|ref|ZP_10055130.1| highly conserved protein containing a thioredoxin domain-containing
           protein [Amycolatopsis sp. ATCC 39116]
          Length = 675

 Score =  380 bits (975), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 230/609 (37%), Positives = 321/609 (52%), Gaps = 67/609 (11%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRLAA  SPYLLQHA NPVDW+ W  EA AEA++RDVPI LSIGY+ CHWCHVM  ESF
Sbjct: 2   ANRLAAATSPYLLQHAENPVDWWPWSAEALAEAKRRDVPILLSIGYAACHWCHVMAHESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           ED   A+L+N+ FV+IKVDREERPD+D VYMT  QA+ G GGWP++ FL+PD +P   GT
Sbjct: 62  EDAETARLMNEHFVNIKVDREERPDIDAVYMTATQAMTGQGGWPMTCFLTPDGEPFHCGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           Y+PPE + G P F+ +L  V  AW ++RD L +     +E L+  L         P  + 
Sbjct: 122 YYPPEPRPGMPSFQHLLVAVAQAWQERRDELREGAGKIVEHLAGQLGPLP-----PAPVD 176

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
              L     +L+   D   GGFG APKFP  + ++ +L H ++   TG    ++E   +V
Sbjct: 177 AGVLDAALLKLTGEADRARGGFGGAPKFPPSMVLEFLLRHHER---TG----SAEALSLV 229

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
               + MA+GGIHD + GGF RYSVD  W VPHFEKMLYD   L  VY      T     
Sbjct: 230 ESCAEAMARGGIHDQLAGGFARYSVDASWVVPHFEKMLYDNALLLRVYAHLARRTGSALA 289

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EH 460
           + + R   ++L   +    G   ++ DAD       T  +EG  YVWT  ++ ++LG + 
Sbjct: 290 AEVARMTGEFLLARLRTEQGGFAASLDAD-------TLGEEGLTYVWTPAQLREVLGDDD 342

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
                E + +  +G             F+    +++L D            E++  +   
Sbjct: 343 GAWAAELFSVTESGT------------FEHGASVLQLRDPDDR--------ERFERV--- 379

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R  L   R +RP+P  DDKVI +WNGL I++   A   L                D   
Sbjct: 380 -RSALLAARDERPQPGRDDKVIAAWNGLAITALCEAGVAL----------------DEPH 422

Query: 581 YMEVAESAASFIRR-HLYDEQTHRLQHSFRNGPS-KAPGFLDDYAFLISGLLDLYEFGSG 638
           ++  A+ AAS +   HL D   +RL+ S R+G +  A G L+DY  L  GLL L++    
Sbjct: 423 WVTAAQEAASAVLGIHLRD---NRLRRSSRDGTAGDAAGVLEDYGCLAEGLLALHQATGD 479

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL-RVKEDHDGAEPSGNSVSVINL 697
            +WL  A+ L +T    F   +  G ++ T +D  VL+ R  +  D A PSG S ++ N 
Sbjct: 480 PRWLTEAVNLLDTALANFAVADTPGAYHDTADDAEVLVHRPSDPTDNASPSGAS-ALTNA 538

Query: 698 VRLASIVAG 706
           +  AS++ G
Sbjct: 539 LVTASVLVG 547


>gi|389690661|ref|ZP_10179554.1| thioredoxin domain containing protein [Microvirga sp. WSM3557]
 gi|388588904|gb|EIM29193.1| thioredoxin domain containing protein [Microvirga sp. WSM3557]
          Length = 676

 Score =  380 bits (975), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 241/638 (37%), Positives = 346/638 (54%), Gaps = 72/638 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL    SPYLLQH  NPV W+ WG +A AEA++ D PI +SIGY+ CHWCHVM  ESFE
Sbjct: 2   NRLNEASSPYLLQHRANPVHWWEWGPDALAEAKRLDKPILISIGYAACHWCHVMAHESFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  VA ++N+ FV+IKVDREERPDVD VYM+ +  L   GGWPL++FL+P+ +P  GGTY
Sbjct: 62  DADVAAVMNELFVNIKVDREERPDVDHVYMSALHLLGEPGGWPLTMFLTPEGEPFWGGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP E ++GRPGF  +LR++   +  + + + ++     + L+ +      +  L D    
Sbjct: 122 FPKEPRFGRPGFVGVLREISRLYRSEPERILKNRDAIKQHLARSDRGDGGTLGLVD---- 177

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L     +L++  D+  GG   APKFP P  ++ +  ++      G++G+  E ++  L
Sbjct: 178 --LDRLGARLAELIDTENGGLQGAPKFPNPPILECLYRYA------GRTGDG-EAKRRFL 228

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL+ MA GGIHDH+GGGF RYSVDERW VPHFEKMLYD  QL  +Y  A++ T    + 
Sbjct: 229 LTLERMALGGIHDHLGGGFARYSVDERWLVPHFEKMLYDNAQLLELYGLAYAETGRALFR 288

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-A 461
                I+ +L R+M  P G   S+ DADS   EG    +EG FYVW+  E+ ++LGE  A
Sbjct: 289 DAAEGIVIWLGREMTTPEGGFASSLDADS---EG----EEGLFYVWSLAEIREVLGEEDA 341

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
             F + Y +   GN            F+G+N+   L    A      + +E+ L  L   
Sbjct: 342 AFFGQVYDITEEGN------------FEGRNIPNRLLSGVAP-----LAIEERLAAL--- 381

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           R KL + RS R RP LDDKV+  WNGL+I++  RAS +L                DR ++
Sbjct: 382 RAKLLERRSARVRPGLDDKVLADWNGLMIAALVRASPLL----------------DRPDW 425

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
           + +A+ A  F+   +   +  RL HS+R G    PGF  D+A ++   L L+E  +   +
Sbjct: 426 IALAQRAYRFVTEAM--TRDGRLGHSWRGGALIVPGFALDHAAMMRAALALFEVTADQAY 483

Query: 642 LVWAIELQNTQDELFLD---REGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
           L    + Q  +D L  D    + G    T      +++R +   D A P+ N V    LV
Sbjct: 484 LR---DAQTWRDRLMSDYRIEDTGALAMTARNADPLVVRPQPTQDDAVPNANGVCAEALV 540

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 736
           RLA +   ++ D   + A   L    T+L  +A + PL
Sbjct: 541 RLAQL---TEMDGDLRQASEVL----TKLGGIARSSPL 571


>gi|354611184|ref|ZP_09029140.1| hypothetical protein HalDL1DRAFT_1849 [Halobacterium sp. DL1]
 gi|353196004|gb|EHB61506.1| hypothetical protein HalDL1DRAFT_1849 [Halobacterium sp. DL1]
          Length = 724

 Score =  380 bits (975), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 245/683 (35%), Positives = 339/683 (49%), Gaps = 49/683 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL    SPYL QHA NPV+W  W E AFA AR+RDVPIFLSIGYS CHWCHVME ESF 
Sbjct: 8   NRLDEAASPYLRQHADNPVNWQPWDETAFAAARERDVPIFLSIGYSACHWCHVMEEESFS 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+GVA  LN+ FV +KVDREERPDVD +YM   Q + GGGGWPLS FL+PD KP   GTY
Sbjct: 68  DDGVAAALNENFVPVKVDREERPDVDSLYMKVCQVVRGGGGWPLSAFLTPDRKPFFVGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP E K  +PGF  +L  V D+W  +R  L       +      L     +  L D+ P 
Sbjct: 128 FPKEPKRNQPGFTQLLDDVADSWQTERGDLEDRAEQWLSAAKGELEDLPDATDLGDDSP- 186

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L   A  L+++ D   GGFG APKFP+   +  +L      +D  + G+      +V 
Sbjct: 187 --LDEAANALARTADRDNGGFGRAPKFPQAGRVDALLRAHDASDDGKQYGD------IVR 238

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
             L  MA GG++DH+GGGFHRY  D  W VPHFEKMLYDQ  L   Y+D +    +  Y+
Sbjct: 239 EALDAMAGGGLYDHLGGGFHRYCTDADWTVPHFEKMLYDQATLVRTYVDGYRSFGEERYA 298

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKK-EGAFYVWTSKEVEDILGEHA 461
               + L ++ R++  P G  ++  DA S   +    ++ EGAFYVWT ++VE+ + ++A
Sbjct: 299 DEVGETLAFVDRELGHPDGGFYATLDARSPPIDDPEGERVEGAFYVWTPEQVENAVADYA 358

Query: 462 -------------ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 508
                         LF+  Y +   GN +            G+ VL         A + G
Sbjct: 359 DEAPADVDPGDLVDLFRARYGVDEAGNFE-----------HGQTVLTVSASREELADEFG 407

Query: 509 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 568
              ++   +L     +L   R  RPRP  DDKV+  WNGL+  ++A A            
Sbjct: 408 YQEDEVAELLAAAETRLRAARDDRPRPARDDKVLAGWNGLMARAYAEA---------GLA 458

Query: 569 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 628
           F+     +D   Y E A  A   +R  L+D +  RL     +G     G+ +DYA+L +G
Sbjct: 459 FDGAEARADEDSYAERAAEAIDHVRSELWDGE--RLARRVIDGDVAGIGYAEDYAYLAAG 516

Query: 629 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 688
            L  YE       L +A++L +   +   D E G  + T      V +R +    G  PS
Sbjct: 517 ALATYEATGDHAHLGFALDLADALLDACYDAETGALYQTPASVQDVDVRSQAVDGGPTPS 576

Query: 689 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPS 748
              V+   L+ L +    ++   Y   AE  L  +  R++    A P +  AADML V  
Sbjct: 577 PVGVAAETLLALDAFDPDAE---YANAAEAMLERYGERVQRSPAAHPTLVLAADML-VTG 632

Query: 749 RKHVVLVGHKSSVDFENMLAAAH 771
            + V +      V++   +  A+
Sbjct: 633 HREVTVAADSLPVEWRRTVGTAY 655


>gi|420252291|ref|ZP_14755426.1| thioredoxin domain protein [Burkholderia sp. BT03]
 gi|398055929|gb|EJL47977.1| thioredoxin domain protein [Burkholderia sp. BT03]
          Length = 664

 Score =  379 bits (974), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 260/673 (38%), Positives = 351/673 (52%), Gaps = 88/673 (13%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRLA E SPYL QHA NPVDW+ W +EAF  AR+ + PI LS+GY+ CHWCHVM  ESF
Sbjct: 2   TNRLATESSPYLRQHADNPVDWYPWSDEAFRRAREENRPILLSVGYAACHWCHVMAHESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           E+  +A L+N+ +VSIKVDR+ERPD+D++Y    Q +  GGGWPL+VFL+P  +P  GGT
Sbjct: 62  ENPRIASLMNERYVSIKVDRQERPDIDEIYQQVSQMMGQGGGWPLTVFLTPQGEPFFGGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFPP+D+YGRP F  +L  + +AW  + D L  +    I Q+ +       + + P    
Sbjct: 122 YFPPDDRYGRPAFARVLIALSEAWRHRHDELRDT----IVQIQQGFRQLDQAQQGPTAAV 177

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           ++     A  L++  D   GG G APKFP P    +ML   ++             ++  
Sbjct: 178 EDLPAQTARALTRDTDPAHGGLGGAPKFPNPSCYDLMLRVYER------------SREPT 225

Query: 342 LF-----TLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
           LF     TL  MA GGI+D VGGGF RYSVD  W VPHFEKMLYD GQL  +Y DA+ LT
Sbjct: 226 LFDALERTLDHMAAGGIYDQVGGGFARYSVDAHWAVPHFEKMLYDNGQLVKLYADAYRLT 285

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
               +  I  + L Y+ RDM  P G  +++EDADS   EG    +EG FY W   E++ +
Sbjct: 286 GKRTWRRIFEETLAYILRDMTHPEGGFYASEDADS---EG----QEGKFYCWMPAEIKAV 338

Query: 457 LGE-HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL---IELNDSSASASKLGMPLE 512
           LGE    L    Y +   GN +            G  VL   +EL+            LE
Sbjct: 339 LGESEGALACRAYGVTERGNFE-----------HGATVLHRAVELD-----------ALE 376

Query: 513 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 572
           +    L   R +L   R++R RP  DD ++  WNGL+I+    A              F 
Sbjct: 377 E--TQLAGWRERLLAARARRVRPARDDNILTGWNGLMIAGLCAA--------------FQ 420

Query: 573 VVGSDRKEYMEVAESAASFIRRHLY--DEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLL 630
             G    EY+  A+ AA+FI   L   D    R+   +++G +K PGFL+DYAFL + LL
Sbjct: 421 ATGV--PEYLSAAKRAANFIGNELTLADGGVFRV---WKDGVAKVPGFLEDYAFLCNALL 475

Query: 631 DLYEFGSGTKWLVWAIELQNTQDELFLDR--EGGGYFNTTGEDPSVLLRVKEDHDGAEPS 688
           DLYE     ++L  AIEL      L LD+  E G YF     +P ++ R +  +D A PS
Sbjct: 476 DLYESCFDRRYLDRAIELAT----LILDKFWEDGLYFTPCDGEP-LVHRPRAPYDSASPS 530

Query: 689 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPS 748
           G S S    VRL ++   +  D Y   AEH    +ET    +  A   +  A D +    
Sbjct: 531 GISSSAFAFVRLHAL---TGRDLYLDRAEHEFRRYETAAGSVPSAFAHLIAARDFVQRGP 587

Query: 749 RKHVVLVGHKSSV 761
            + +V  G K S 
Sbjct: 588 LE-IVFAGEKYSA 599


>gi|311746315|ref|ZP_07720100.1| dTMP kinase [Algoriphagus sp. PR1]
 gi|126576550|gb|EAZ80828.1| dTMP kinase [Algoriphagus sp. PR1]
          Length = 678

 Score =  379 bits (974), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 236/613 (38%), Positives = 318/613 (51%), Gaps = 69/613 (11%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           +N+L    SPYLLQHAHNPVDW+ WGEEA  +A+  + PI +SIGYS CHWCHVME ESF
Sbjct: 5   SNKLIESQSPYLLQHAHNPVDWYPWGEEALNKAKIENKPILVSIGYSACHWCHVMERESF 64

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           ED+  A L+N+ FV IK+DREERPD+D +YM  VQA+   GGWPL+VFL P+ KP  GGT
Sbjct: 65  EDKLTADLMNESFVCIKIDREERPDIDNIYMDAVQAMGLQGGWPLNVFLMPNQKPFYGGT 124

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQS----GAFAIEQLSEALSASASSNKL- 276
           YFP +       +K +L  + DA+    D LA+S    G       +E     +   +L 
Sbjct: 125 YFPNQQ------WKNLLANIADAFANHEDKLAESAEGFGRSIARNETEKYGIRSGKIELD 178

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYH-----SKKLEDTGKS 331
           PDEL +  L     QLS   DS +GG    PKFP P     +L +     S+ LED    
Sbjct: 179 PDELAEAVL-----QLSSQIDSEWGGMNRIPKFPMPAIWNFILDYALLSKSQNLEDK--- 230

Query: 332 GEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLD 391
                    VLFTL+ M  GGI+D + GGF RYSVD  W  PHFEKMLYD GQL  +Y  
Sbjct: 231 ---------VLFTLKKMGMGGIYDQLKGGFARYSVDGEWFAPHFEKMLYDNGQLLELYAK 281

Query: 392 AFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSK 451
           A+  + D F+    ++   +L  +M+   G   +A+DADS   EG     EG FY WT +
Sbjct: 282 AYQTSHDDFFLEKIQETYTWLLDEMLQEEGGFHAAQDADS---EGV----EGKFYTWTYE 334

Query: 452 EVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPL 511
           E+  I+ E    F E Y LKP GN +            G N+L +    S  A+   +  
Sbjct: 335 ELSSIIPEEMPWFAELYNLKPQGNWE-----------DGINILFQTKSYSEVAAAHNLSE 383

Query: 512 EKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNF 571
           E     L E +  L  +R++R  P  DDKV+  WN L+IS   +A               
Sbjct: 384 EVLNQKLKEVKATLLSIRNQRIYPGKDDKVLCGWNALMISGLVQAY-------------- 429

Query: 572 PVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLD 631
               SD+K ++++A S   FI + +  ++  RL  S++NG +  P FL+DYA LI   + 
Sbjct: 430 -FATSDQK-FLDLALSNRDFISKKVTVDR--RLYRSYKNGVAYTPAFLEDYAALIKADIM 485

Query: 632 LYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNS 691
           L+E  S    L  A  L     + F D   G +F        ++   KE  D   PS NS
Sbjct: 486 LFEATSEASHLKSAERLTKIVLDEFYDENDGFFFFNNPSSEKLIANKKELFDNVIPSSNS 545

Query: 692 VSVINLVRLASIV 704
           +   NL +L+ + 
Sbjct: 546 LMARNLHQLSILT 558


>gi|257057143|ref|YP_003134975.1| highly conserved protein containing a thioredoxin domain-containing
           protein [Saccharomonospora viridis DSM 43017]
 gi|256587015|gb|ACU98148.1| highly conserved protein containing a thioredoxin domain protein
           [Saccharomonospora viridis DSM 43017]
          Length = 667

 Score =  379 bits (974), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 234/660 (35%), Positives = 334/660 (50%), Gaps = 74/660 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   SPYLLQHA NPVDW+ W  EA AEAR+RDVPI LS+GY+ CHWCHVM  ESF 
Sbjct: 2   NRLATATSPYLLQHADNPVDWWPWSPEALAEARRRDVPILLSVGYAACHWCHVMAHESFA 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  VA  +N+ FV+IKVDREERPD+D VYMT  QA+ G GGWP++ FL+PD KP   GTY
Sbjct: 62  DADVAAFMNEHFVNIKVDREERPDIDAVYMTATQAMTGQGGWPMTCFLTPDGKPFHCGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           +PP    G P FK +L  V  AWD++RD L +     ++ ++E      +    P  +  
Sbjct: 122 YPPVPTQGMPSFKQVLTAVAQAWDERRDELVEGAGRIVDHIAE-----QTRPLSPQPVTA 176

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
           + +     +L    D   GGFG APKFP  + ++ +L H ++        ++ E   +V 
Sbjct: 177 DTIASAVAKLRTEVDPENGGFGGAPKFPPSMVLEFLLRHYERT-------DSMEVLSIVD 229

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            T + MA+GG++D + GGF RYSVD  W VPHFEKMLYD   L   Y      T      
Sbjct: 230 MTAEGMARGGVYDQLAGGFARYSVDAEWVVPHFEKMLYDNALLLRCYAHLARRTGSPLAH 289

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHA 461
            +  +  ++L RD+  P G   S+ DAD+   EG T       YVWT +++ D+LG +  
Sbjct: 290 RVAGETAEFLLRDLRTPQGGFASSLDADAEGVEGLT-------YVWTREQLVDVLGPDDG 342

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
               E + +   G  +           +G + L    D    A        +++ +    
Sbjct: 343 AWAAETFGVTEEGTFE-----------RGASTLRLPQDPDDPA--------RWMRVTS-- 381

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
              L D R++RP+P  DDKVI +WNGL I++ A A   L+                R ++
Sbjct: 382 --TLLDARNERPQPARDDKVIAAWNGLAITALAEAGVALQ----------------RPDW 423

Query: 582 MEVAESAASFIRR-HLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLISGLLDLYEFGSGT 639
           +E A +A SF+   H  D+    L+ S R+G   +A   L+DY     GLL L++     
Sbjct: 424 IEAAVAAGSFVLDVHKTDDG---LRRSSRDGVVGEADAVLEDYGCFADGLLALHQATGEP 480

Query: 640 KWLVWAIELQNTQDELF-LDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
           +WL  AI L +     F ++   G Y +T  +   ++ R  +  D A PSG S     L+
Sbjct: 481 RWLEEAIALLDIALRRFGVEGMPGAYHDTAVDAEELVHRPSDPTDNASPSGASALAGALL 540

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP-----LMCCAADMLSVPSRKHVV 753
             +++    ++  YR   E +LA    R   +   VP      +  A  ML+ P +  VV
Sbjct: 541 TASALAGPERASAYRAACEEALA----RAGALIAQVPRFAGHWLSVAEAMLAGPVQVAVV 596


>gi|443288943|ref|ZP_21028037.1| conserved hypothetical protein [Micromonospora lupini str. Lupac
           08]
 gi|385888344|emb|CCH16111.1| conserved hypothetical protein [Micromonospora lupini str. Lupac
           08]
          Length = 680

 Score =  379 bits (974), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 235/617 (38%), Positives = 318/617 (51%), Gaps = 56/617 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL    SPYLLQHA NPVDW+ W +EAFAEA++RDVP+ +S+GY+ CHWCHVM  ESFE
Sbjct: 2   NRLVDATSPYLLQHADNPVDWWPWCDEAFAEAKRRDVPVLISVGYAACHWCHVMAHESFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +E VA LLND FVSIKVDREERPDVD VYMT  QA+ G GGWP++VF +PD  P   GTY
Sbjct: 62  NEQVAALLNDNFVSIKVDREERPDVDAVYMTATQAMTGQGGWPMTVFATPDGTPFFCGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP      R  F  +L+ V  AW  +R  + + GA  +E +  A +    +  L   L  
Sbjct: 122 FP------RANFVRLLQSVTTAWADQRAEVLRQGAAVVEAIGGAQAVGGPTAPLDGPL-- 173

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L   A  L+  YD+  GGFG APKFP  + +  +L H ++  D           ++V 
Sbjct: 174 --LDAAAGNLASGYDATNGGFGGAPKFPPHMNLLFLLRHHQRTGD-------PRSLEIVR 224

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            T + MA+GGI+D + GGF RYSVD  W VPHFEKMLYD   L  VY   + LT D    
Sbjct: 225 HTAEAMARGGIYDQLAGGFARYSVDAHWTVPHFEKMLYDNALLLRVYAQLWRLTGDPLAR 284

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            + RD   +L  ++  PG    SA DAD+   EG T       Y WT  ++ + LGE   
Sbjct: 285 RVARDTARFLADELHRPGEGFASALDADTEGVEGLT-------YAWTPAQLVEALGEDDG 337

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
            F            DL  ++D      G +VL    D    A ++     ++  ++G+  
Sbjct: 338 RFA----------ADLFTVTDEGTFEHGMSVLRLARDVDDVAPEV---RARWQRVVGQ-- 382

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR----ASKILKSEAESAMFNFPVVGSDR 578
             L   R  RP+P  DDKV+ +WNGL I++ A     A+     E E A     V     
Sbjct: 383 --LLAARDTRPQPARDDKVVAAWNGLAITAIAEFLQVAALYASPEDEDANLMEGVTIVAD 440

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP-GFLDDYAFLISGLLDLYEFGS 637
               + AE  A+    H+ D    RL+   R+G   AP G L+DY  +      L++   
Sbjct: 441 GAMRDAAEHLATV---HVVD---GRLRRVSRDGRVGAPAGVLEDYGCVAEAFCALHQLTG 494

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
             +WL  A +L +   E F    GG Y++T  +   ++ R  +  D A PSG S  V  L
Sbjct: 495 EGRWLTVAGQLLDAALEHFA-APGGAYYDTADDAEQLVARPADPTDNATPSGRSALVAGL 553

Query: 698 VRLASIVAGSKSDYYRQ 714
           V  A++   ++   YR+
Sbjct: 554 VSYAALTGETR---YRE 567


>gi|55980955|ref|YP_144252.1| hypothetical protein TTHA0986 [Thermus thermophilus HB8]
 gi|55772368|dbj|BAD70809.1| conserved hypothetical protein [Thermus thermophilus HB8]
          Length = 642

 Score =  379 bits (974), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 245/619 (39%), Positives = 332/619 (53%), Gaps = 75/619 (12%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL A  SPYLL HA +PVDW+ +GEEAF +A+  D PIFLS+GY++CHWCHVM  ESF+
Sbjct: 3   NRLKAARSPYLLAHAEDPVDWYPFGEEAFRKAQAEDKPIFLSVGYASCHWCHVMHRESFQ 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA+LLN  FV +KVDREERPDVD  YM  + +L G GGWP+S+FL+P+ KP  GGTY
Sbjct: 63  DEEVARLLNAHFVPVKVDREERPDVDAAYMRALVSLTGQGGWPMSLFLTPEGKPFFGGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP ED+ G PGFK +L  V +AW  KR+ + +      E+L+ AL  S S       LP+
Sbjct: 123 FPKEDRMGLPGFKRVLVAVAEAWAGKREAILEEA----ERLTRALWKSLSPPP--GPLPE 176

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
            A     + L +++D  +GGF  APKFP+   +  +L  + + E+           +++ 
Sbjct: 177 GAEEEALDHLERAFDPEWGGFLPAPKFPQGPLLLYLLARAWEGEERAA--------RLLR 228

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL+ MA GG++D VGGGFHRYSVD  W +PHFEKMLYD   LA VYL A+ L  +  + 
Sbjct: 229 PTLRAMALGGVYDQVGGGFHRYSVDRFWRLPHFEKMLYDNALLARVYLGAYKLFGEDLFL 288

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            + R+ LD+L       GG   +A D   AE+EG    +EG +Y WT  E+ + LGE   
Sbjct: 289 RVARETLDWLLSMQRREGG-FHTALD---AESEG----EEGRYYTWTEAELREALGEDFP 340

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
           L + ++ L      DL            ++VL    ++ A  + LG   E +       R
Sbjct: 341 LARRYFAL----GEDLGE----------RSVLTAWGEAEARKA-LG---EGFFAWREGVR 382

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYM 582
            KL   R +R  P LDDKV+  W+ L + + A A ++   E                 Y+
Sbjct: 383 AKLQGARRRRMPPALDDKVLADWSALAVRALAEAGRLFGEE----------------RYL 426

Query: 583 EVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWL 642
           E A+  A F+  H+Y E    L+H++R G      +L D AF     L+LY       +L
Sbjct: 427 EAAKRGARFLLAHMYREGL--LRHTWR-GSLGEEAYLSDQAFAALAFLELYAATGEWPYL 483

Query: 643 VWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLAS 702
            WA  L      LF  REG          PS+ L  KE  +GA PSG S     LVRL +
Sbjct: 484 DWAQRLAEAGWRLF--REG----------PSLPLPAKEVEEGALPSGESALAEALVRLGA 531

Query: 703 IVAGSKSDYYRQNAEHSLA 721
           +  G     YR+ AE  LA
Sbjct: 532 VFGGD----YRERAEEVLA 546


>gi|443624623|ref|ZP_21109091.1| putative Spermatogenesis-associated protein 20 [Streptomyces
           viridochromogenes Tue57]
 gi|443341889|gb|ELS56063.1| putative Spermatogenesis-associated protein 20 [Streptomyces
           viridochromogenes Tue57]
          Length = 680

 Score =  379 bits (973), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 237/627 (37%), Positives = 324/627 (51%), Gaps = 62/627 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYLLQHA NPVDW+ W  EAF EARKR+VP+ LS+GYS+CHWCHVM  ESFE
Sbjct: 6   NRLAHETSPYLLQHADNPVDWWPWSGEAFEEARKRNVPVLLSVGYSSCHWCHVMAHESFE 65

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+  A  LN  FV++KVDREERPDVD VYM  VQA  G GGWP++VFL+PD +P   GTY
Sbjct: 66  DQETADYLNAHFVNVKVDREERPDVDAVYMEAVQAATGQGGWPMTVFLTPDAEPFYFGTY 125

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EALSASASSNKLPDELP 281
           FPP  ++G P F+ +L  V  AW  +RD +A+     +  L+   +S   +      EL 
Sbjct: 126 FPPAPRHGMPSFRQVLEGVHSAWADRRDEVAEVAGKIVRDLAGREISFGGTEAPGEQELA 185

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           Q  L      L++ YD + GGFG APKFP  + I+ +L H  +   TG  G      +M 
Sbjct: 186 QALL-----GLTREYDPQRGGFGGAPKFPPSMVIEFLLRHHAR---TGSEG----ALQMA 233

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L   Y   +  T     
Sbjct: 234 QDTCERMARGGIYDQLGGGFARYSVDRDWIVPHFEKMLYDNALLCRGYAHLWRATGSELA 293

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHA 461
             +  +  D++ R++    G   SA DADS   +G  R  EGA+YVWT +++ + LG+  
Sbjct: 294 RRVALETADFMVRELRTNEGGFSSALDADS--DDGTGRHVEGAYYVWTPRQLRETLGDDD 351

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL---NIL 518
                 Y+                        + E       +S L +P +  L   + +
Sbjct: 352 AELAARYF-----------------------GVTEEGTFEHGSSVLQLPQQDELFDADRV 388

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
              R++L D RS+RP P  DDK++ +WNGL I++ A            A F+ P      
Sbjct: 389 ASIRQRLLDRRSERPAPGRDDKIVAAWNGLAIAALAET---------GAYFDRP------ 433

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAFLISGLLDLYEFGS 637
                   +A   +R HL D    RL  + ++G   A  G L+DY  +  G L L     
Sbjct: 434 DLVDAALAAADLLVRLHLDD--AARLARTSKDGQVGANAGVLEDYGDVAEGFLALASVTG 491

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
              WL +A  L +     F D E G  ++T  +   ++ R ++  D A PSG S +   L
Sbjct: 492 EGVWLDFAGFLLDHVLARFTDEESGALYDTAADAEQLIRRPQDPTDNAAPSGWSAAAGAL 551

Query: 698 VRLASIVAGSKSDYYRQNAEHSLAVFE 724
           +   S  A + S  +R  AE +L V +
Sbjct: 552 L---SYAAQTGSAPHRAAAEKALGVVK 575


>gi|284037137|ref|YP_003387067.1| hypothetical protein Slin_2247 [Spirosoma linguale DSM 74]
 gi|283816430|gb|ADB38268.1| protein of unknown function DUF255 [Spirosoma linguale DSM 74]
          Length = 700

 Score =  379 bits (973), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 235/620 (37%), Positives = 332/620 (53%), Gaps = 60/620 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N+L  E SPYLLQHA NPV+W+ WG+EA   A + D PI +SIGYS CHWCHVME ESFE
Sbjct: 3   NQLQYETSPYLLQHAENPVNWYPWGDEALTRAIEEDKPIIVSIGYSACHWCHVMERESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
            E VA+++N  FV IKVDREERPDVD +YM  VQA+   GGWPL+VFL PD KP  G TY
Sbjct: 63  KEAVAQVMNKHFVCIKVDREERPDVDAIYMDAVQAMGVQGGWPLNVFLMPDAKPFYGVTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSG-AFAIE-QLSEALSASASSNKLPDEL 280
            P ++      +  +L  + +A+++ R  LAQS   FA E  LS+A     + N  P   
Sbjct: 123 LPQKN------WVNLLESIDNAFNEHRADLAQSAEGFARELNLSDAERYGLTQND-PLFA 175

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS----E 336
           P+  L +   +++   D   GG   APKFP P   + +L +      + +  EA+    +
Sbjct: 176 PET-LAVLYRKVAVKADDEKGGMRRAPKFPMPSVWRFLLRYYAVASSSRQIAEAADTSDQ 234

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
              +V  TL  MA GGI+D +GGGF RYS D  W  PHFEKMLYD GQL  +Y +A+SLT
Sbjct: 235 ALNLVRITLDRMALGGIYDQLGGGFARYSTDADWFAPHFEKMLYDNGQLLTLYSEAYSLT 294

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
           K   Y ++    + + +R+++ P G  +SA DADS   EG     EG FY +T+ E+++I
Sbjct: 295 KSKLYKHVVYQTIAFAQRELLSPEGGFYSALDADS---EGV----EGKFYTFTTPELKEI 347

Query: 457 LGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
           LG     F + Y +   GN +            G+N+L  +      A+++G  +     
Sbjct: 348 LGADFDWFADLYSISENGNWE-----------HGRNILHRIEADDEFAARMGWSVADLNV 396

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
            L     +L  VR++R RP LDDK++ SWNGL++     A ++         F  P    
Sbjct: 397 RLDATHTRLLRVRNERIRPGLDDKILCSWNGLMLKGLVTAYRV---------FGEP---- 443

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFR-----NGPSKAPGFLDDYAFLISGLLD 631
              E++ +A   A F+ + + D +  RL H+++      G ++  GFLDDYA +I GLL 
Sbjct: 444 ---EFLTLALRLAYFLLKKMRDSRNGRLWHTYKVSEGGTGRARQAGFLDDYAAVIDGLLA 500

Query: 632 LYEFGSGTKWLVWAIELQ----NTQDELFLDREGGG---YFNTTGEDPSVLLRVKEDHDG 684
           LY+      WL  A +L         +L +D   G     F T      ++ R KE  D 
Sbjct: 501 LYQATFTRNWLTEADQLMQYVLTNFADLSVDELTGPEPLLFFTDKNSEELIARRKELFDN 560

Query: 685 AEPSGNSVSVINLVRLASIV 704
             PS NS+   NL  L+ ++
Sbjct: 561 VIPSSNSMMAENLYVLSLLL 580


>gi|288932323|ref|YP_003436383.1| hypothetical protein Ferp_1971 [Ferroglobus placidus DSM 10642]
 gi|288894571|gb|ADC66108.1| protein of unknown function DUF255 [Ferroglobus placidus DSM 10642]
          Length = 628

 Score =  379 bits (973), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 229/657 (34%), Positives = 343/657 (52%), Gaps = 71/657 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL    SPYL + A+ PVDWF W EEAF +A++ D PI LS+G   CHWCHVM  + FE
Sbjct: 3   NRLEKARSPYLRKAANQPVDWFEWSEEAFKKAKEEDKPILLSVGGVWCHWCHVMAKKCFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +E +AK++N+ FV++KVDR+ERPD+D+ Y  +V A  G GGWPL+VFL+PD +P  GGTY
Sbjct: 63  NEDIAKIINENFVAVKVDRDERPDIDRRYQEFVFATTGTGGWPLTVFLTPDGEPFFGGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPPED +G  GFKT+L K+ + W+K R+ L +S    +E L +      SSN     L +
Sbjct: 123 FPPEDGFGMIGFKTLLLKISEMWEKDRESLLKSAKQIVESLKKFSERDFSSN-FDFTLIE 181

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKF--PRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
             ++   + +    D   GG G APKF   +  E+ +  Y+  K ED  K+ E       
Sbjct: 182 KGIKAVLDNM----DYVNGGIGRAPKFHHAKAFELLLTHYYFTKDEDLIKAVE------- 230

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
              TL  MAKGG++D + GGF RYS D+RWHVPHFEKMLYD  +L  +Y  A+ +TK   
Sbjct: 231 --LTLDAMAKGGVYDQLIGGFFRYSTDDRWHVPHFEKMLYDNAELLKLYTIAYQITKKEL 288

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
           Y  + + I+DY R+  +   G  ++++DAD  E E      EG +Y+++ +E++++L + 
Sbjct: 289 YRKVAKGIVDYYRKFGVDERGGFYASQDADIGELE------EGGYYIFSLEEIKEVLNDE 342

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
                  Y+                   +GKNVL    D +  +  LG+P+ +   I+  
Sbjct: 343 EFRIASLYF----------------GLREGKNVLHVSLDENEISEILGIPVRRVKEIIES 386

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            + KL +VR +R  P +D  +  +WNGL+I +     K          FN P        
Sbjct: 387 AKEKLLEVRERRETPFIDKTIYTNWNGLMIEAMCDYYK---------SFNDPWA------ 431

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
            +EVAE +     R L       L H+         GF +DY F   GL+ L+E     K
Sbjct: 432 -VEVAEKSGE---RLLKFWDGDVLLHT-----DDVEGFSEDYIFFAKGLIALFEITQKGK 482

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL-LRVKEDHDGAEPSGNSVSVINLVR 699
           +L  A+E+     +LF D + GG+F+       +L L+VK+  D  + S N ++ + L  
Sbjct: 483 YLNAAVEITKRAVDLFWDHKRGGFFDRKSSGNGLLSLKVKDIQDSPQQSVNGIAPLLLTT 542

Query: 700 LASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP-----LMCCAADMLSVPSRKH 751
           L+S+     ++ +   A+ SL  F   L+   +  P     L      +  V +R+H
Sbjct: 543 LSSVTG---TEEFGALAKKSLRAFAGILEKYPLISPSYMISLYAYIRGIYLVKTRRH 596


>gi|408794723|ref|ZP_11206328.1| PF03190 family protein [Leptospira meyeri serovar Hardjo str. Went
           5]
 gi|408461958|gb|EKJ85688.1| PF03190 family protein [Leptospira meyeri serovar Hardjo str. Went
           5]
          Length = 689

 Score =  379 bits (972), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 247/672 (36%), Positives = 353/672 (52%), Gaps = 76/672 (11%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           +K  NRL  E SPYLLQHAHNPVDWF WG EAF  A+K D  I LSIGYSTCHWCHVME 
Sbjct: 5   SKKPNRLVHEKSPYLLQHAHNPVDWFPWGAEAFENAQKEDKIILLSIGYSTCHWCHVMER 64

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFED+  A++LN  FV IK+DREERPD+DK+YM  + A+   GGWPL++FL+P  +P++
Sbjct: 65  ESFEDDSTAEVLNRDFVCIKLDREERPDIDKIYMDALHAMGTQGGWPLNMFLTPTKEPIL 124

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
           GGTYFPPE++YG+  FK +LR V DAW  +R+ L  + A  + Q         +  K+P 
Sbjct: 125 GGTYFPPENRYGKRSFKEVLRLVSDAWKNQREELI-TAATDLTQYLRDNETRPNEGKVP- 182

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGF--GSAPKFPRPVEIQMM--LYHSKKLEDTGKSGEA 334
              +  +    E+  + YD  F GF   S  KFP  + +  +   Y  KK          
Sbjct: 183 --AKEIIEKNFERYVQVYDKEFFGFKTNSVNKFPPSMALSFLTEFYLLKK---------D 231

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
               +M   T   M  GGI+D VGGG  RY+ D  W VPHFEKMLYD     ++Y++A +
Sbjct: 232 PRALEMAFNTAYAMKSGGIYDQVGGGICRYATDHEWLVPHFEKMLYDN----SLYVEALA 287

Query: 395 L----TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTS 450
           L    T++ F+  + R+I+ Y+RRDM    G I SAEDADS   EG    +EG FY+W  
Sbjct: 288 LLYKATEEPFFLEVIREIVTYIRRDMTLGSGGIASAEDADS---EG----EEGKFYIWNH 340

Query: 451 KEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 510
            E   I+ E  I      +   T   +    +  H  +KGKN  ++           G+ 
Sbjct: 341 SEFNQIVPEEEI----QGFWNVTEEGNFEHQNILHVYWKGKNPFVD-----------GIQ 385

Query: 511 LE-KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 569
            + +++N + + + KL   RS+R RP  DDKV+ SWN L I +   A ++          
Sbjct: 386 FKPEFINKIEKTKEKLLAHRSQRIRPLRDDKVLTSWNCLWIRALLSAYEV---------- 435

Query: 570 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 629
                 S   EY+  A+    FI + L  +    L+  FR G +K  G L DY   I   
Sbjct: 436 ------SGDTEYLNDAKKIYRFITKQLVGDDGSILRR-FREGEAKYFGTLPDYTEFIWVS 488

Query: 630 LDLYEFGSGTKWLVWAIEL-QNTQDELFLDREG--GGYFNTTGEDPSVLLRVKEDHDGAE 686
           + L++     +    A E+ + + D +F + E   G ++ +   +  +++R  E +DG E
Sbjct: 489 MKLFQLDEDIE----AYEIGKKSLDYVFANFESKVGPFYESYHGNEDLIVRTIEGYDGVE 544

Query: 687 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 746
           PSGNS ++++L  L   +   K D  ++ A    A F   L   +++ P M  A      
Sbjct: 545 PSGNS-TILHLFYLLFSIGYKKVD-LQKKANSIFAYFLPELTQNSLSYPSMISAFQKFQY 602

Query: 747 PSRKHVVLVGHK 758
           PS++  VLV +K
Sbjct: 603 PSKE--VLVVYK 612


>gi|336120019|ref|YP_004574797.1| hypothetical protein MLP_43800 [Microlunatus phosphovorus NM-1]
 gi|334687809|dbj|BAK37394.1| hypothetical protein MLP_43800 [Microlunatus phosphovorus NM-1]
          Length = 669

 Score =  379 bits (972), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 230/607 (37%), Positives = 314/607 (51%), Gaps = 64/607 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA+  SPYLLQH  NPVDW+ W +EAFAEA +RDVP+FLS+GY+ CHWCHVM  ESFE
Sbjct: 3   NRLASATSPYLLQHKDNPVDWWEWSDEAFAEAERRDVPVFLSVGYAACHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE  A  LN+ FVS+KVDREERPDVD V+M   QAL G GGWP++VFL+PD +P   GTY
Sbjct: 63  DETTAAYLNEHFVSVKVDREERPDVDAVFMAATQALAGQGGWPMTVFLTPDRRPFYAGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPP  + G P F  +L  +  AW  +RD +  S A    +L         + KLP E+ +
Sbjct: 123 FPPRARQGMPAFADVLAAIASAWRDRRDEVLSSVAHISGELERR-----HAPKLPGEVTR 177

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L +    L + +D   GGFG APKFP  + ++ +L    +L D        E   MV 
Sbjct: 178 AGLDVARANLQREFDEVRGGFGGAPKFPPSMVLEGLL----RLGD-------DESMAMVD 226

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            T + MA+GGI+D + GGF RYSVD  W VPHFEKMLYD   L  VY   +  T++    
Sbjct: 227 VTCEAMARGGIYDQLAGGFARYSVDAGWVVPHFEKMLYDNALLLGVYTHWWRRTQNPIGE 286

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            +  + +++L  ++  P G   ++ DADS + +G     EGA+Y W    +  +LGE   
Sbjct: 287 RVVAETVEWLVAELRTPQGGFAASLDADSLDEQG--HSAEGAYYAWDPVGLTAVLGEDDG 344

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
            +    +           ++D      G++ L  L D          P+      L   R
Sbjct: 345 RWAAEVF----------GVTDQGTFEHGRSTLRLLGDPD--------PVR-----LASAR 381

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYM 582
            +L   R +RPRP  DDKV+ +WNG +I+S   A+ +                  R +++
Sbjct: 382 ERLRTTREQRPRPGRDDKVVAAWNGWLIASLVEAAGVFG----------------RPDWL 425

Query: 583 EVAESAASFIRR-HLYDEQTHRLQHSFRNGP-SKAPGFLDDYAFLISGLLDLYEFGSGTK 640
            +A  AA  I R H  D    RL+ + R+G    A G L+DYA +    + L    +   
Sbjct: 426 ALAREAAELIWRVHWVD---GRLRRTSRDGEVGSAAGVLEDYAAMTMAAVRLGCAEADAT 482

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
           WL  A  L       F D  G G+F+T     S+ LR ++  D A PSG S +V  L  L
Sbjct: 483 WLTRAEALAEVILAEFGD--GDGFFDTASGAESLYLRPQDPTDNATPSGLSATVHALALL 540

Query: 701 ASIVAGS 707
           A     S
Sbjct: 541 AETTGRS 547


>gi|51892001|ref|YP_074692.1| hypothetical protein STH863, partial [Symbiobacterium thermophilum
           IAM 14863]
 gi|51855690|dbj|BAD39848.1| conserved hypothetical protein [Symbiobacterium thermophilum IAM
           14863]
          Length = 623

 Score =  379 bits (972), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 240/634 (37%), Positives = 344/634 (54%), Gaps = 64/634 (10%)

Query: 155 VMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDL 214
           +ME ESF D   A+++N  FV IKVDREERPD+D +Y T  Q +   GGWPLSV+L+P+ 
Sbjct: 1   MMERESFADPETAEIMNRHFVCIKVDREERPDLDDIYQTICQLVTRSGGWPLSVWLTPEQ 60

Query: 215 KPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKR---DMLAQSGAFAIEQLSEALSASA 271
           KP   GTYFPP ++YGRPGF+ +L  +  AW +KR   + +A+S A  I Q  E L    
Sbjct: 61  KPFYVGTYFPPVERYGRPGFRQVLLALAQAWREKRQEVEKVAESWARGIAQTDELLP--- 117

Query: 272 SSNKLPD-ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGK 330
            +  +PD  L  +A R  AE++    D + GGFG APKFP  + + +ML H K   D   
Sbjct: 118 PAGPMPDHRLVADAARALAERI----DRQHGGFGGAPKFPNTMALDLMLRHWKATGD--- 170

Query: 331 SGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYL 390
                    +V  TL+ MA+GGI+D +GGGFHRYSVD RW VPHFEKMLYD   L  VYL
Sbjct: 171 ----DLFLHLVTLTLRKMAEGGIYDQLGGGFHRYSVDARWAVPHFEKMLYDNALLPAVYL 226

Query: 391 DAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTS 450
            A+  T +  +  I  + LDY+ R+M  P G  FS  DADS   EG    +EG +YVW  
Sbjct: 227 AAWQATGEPLFRRIVEETLDYVLREMTHPEGGFFSTTDADS---EG----EEGRYYVWDP 279

Query: 451 KEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 509
           +EV  +LG +   L   HY +   GN           E  GK VL     ++  AS LG+
Sbjct: 280 REVTAVLGPDLGALICRHYGVTEAGNF----------ERTGKTVLHIAEPAADLASSLGL 329

Query: 510 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 569
           P+E+    L E RR+L + RS+R  P  D+K++  WNGL+IS+ ARA +IL+        
Sbjct: 330 PVEEVERRLAEGRRRLLEARSRRVPPFRDEKILAGWNGLMISALARAGRILR-------- 381

Query: 570 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 629
                   R +Y E A  AA+F+   L D +   L+  +++G +  PG+L+D+AF+ +GL
Sbjct: 382 --------RPDYAEAARRAATFVLDRLADGEGGLLRR-YKDGHAGIPGYLEDHAFMAAGL 432

Query: 630 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 689
           +DLYE     ++L  A+ L       F D  G  +   +G +P ++ R ++  D + PSG
Sbjct: 433 IDLYECTFDERFLQEAMRLTEETLRRFYDGSGSFHLTQSGAEP-LIHRPRDTTDQSVPSG 491

Query: 690 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADM-LSVPS 748
            +V+V+NL+RL       + D +R+ A+ +       +  +  A   +  A D+ L  P+
Sbjct: 492 AAVAVVNLLRLQPY---RRDDRFREVADTAFRAHRDLMARVPGATATLLQALDLYLDGPT 548

Query: 749 RKHVVLVGHKSSVDFENMLAAAHASYDLNKTVSK 782
              V LVG       E  L A    Y+ N  +++
Sbjct: 549 --EVTLVGDPP----EAWLEALGRRYEPNLVLTR 576


>gi|384567356|ref|ZP_10014460.1| thioredoxin domain-containing protein [Saccharomonospora glauca
           K62]
 gi|384523210|gb|EIF00406.1| thioredoxin domain-containing protein [Saccharomonospora glauca
           K62]
          Length = 670

 Score =  379 bits (972), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 244/672 (36%), Positives = 338/672 (50%), Gaps = 75/672 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   SPYLLQHA NPVDW+ WG EA AEAR+RDVPI LSIGY+ CHWCHVM  ESF 
Sbjct: 2   NRLATATSPYLLQHADNPVDWWPWGPEALAEARRRDVPILLSIGYAACHWCHVMAHESFS 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+ VA  +ND FV+IKVDREERPD+D VYMT  QA+ G GGWP++ FL+PD KP   GTY
Sbjct: 62  DDEVAAFMNDHFVNIKVDREERPDIDAVYMTATQAMTGQGGWPMTCFLTPDGKPFHCGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           +PP   +G P FK +L  V  AW ++RD L +     ++ + E         K     P 
Sbjct: 122 YPPVPAHGMPSFKQVLVAVDQAWRERRDELVEGAGRVVDHIVE-------QTKPLSLRPV 174

Query: 283 NALRLCA--EQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
            A  + A   +L +  D   GGFG APKFP  + ++ +L H    E TG    + E   +
Sbjct: 175 TAETVAAAVSKLRREADPGNGGFGGAPKFPPSMVLEFLLRH---YERTG----SVEALSV 227

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
           V  T + MA+GGI+D + GGF RYSVD  W VPHFEKMLYD   L   Y      T    
Sbjct: 228 VDATAEGMARGGIYDQLAGGFARYSVDAGWVVPHFEKMLYDNALLLRFYAHLARRTGSAL 287

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-E 459
              +  +  ++L RD+  P G   S+ DAD+   EG T       YVWT +++ D+LG E
Sbjct: 288 AYRVAGETAEFLLRDLRTPQGAFASSLDADTEGVEGLT-------YVWTPQQLVDVLGPE 340

Query: 460 HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
                 + + +   G  +           +G + L    D    A        +++ +  
Sbjct: 341 DGAWAAKLFGVTEEGTFE-----------RGASTLQLRRDPDDPA--------RWMRVTS 381

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
              R     R+ RP+P  DDKVI +WNGL I++ A A   L+                R 
Sbjct: 382 ALSR----ARAARPQPARDDKVIAAWNGLAITALAEAGVALR----------------RP 421

Query: 580 EYMEVAESAASFIRR-HLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLISGLLDLYEFGS 637
           E++E A +AA+F+   H+  +    L+ S R+G    A   L+DY  L  GLL L++   
Sbjct: 422 EWVEAAVAAAAFVLDVHVGGDGAEGLRRSSRDGVVGDAAAVLEDYGCLADGLLALHQATG 481

Query: 638 GTKWLVWAIELQNTQDELF-LDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
              WL  A  L +T    F +D   G + +T  +  +++ R  +  D A PSG S     
Sbjct: 482 EPVWLTEATALLDTALRRFGVDGAPGAFHDTAADAEALVHRPSDPTDNASPSGASALAGA 541

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP-----LMCCAADMLSVPSRKH 751
           L+  +++    ++  YR   E +L    +R   +   VP      +  A  +LS P +  
Sbjct: 542 LLTASALAGPERAGAYRAACEEAL----SRAGVLVEQVPRFAGHWLSVAEALLSGPVQVA 597

Query: 752 VVLVGHKSSVDF 763
           VV  G K   + 
Sbjct: 598 VVGAGAKDRAEL 609


>gi|325104043|ref|YP_004273697.1| hypothetical protein [Pedobacter saltans DSM 12145]
 gi|324972891|gb|ADY51875.1| protein of unknown function DUF255 [Pedobacter saltans DSM 12145]
          Length = 669

 Score =  379 bits (972), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 224/603 (37%), Positives = 317/603 (52%), Gaps = 54/603 (8%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRLA E SPYLLQHAHNPVDWF WG+EA  +AR  +  I +S+GYS CHWCHVME ESF
Sbjct: 2   ANRLAQESSPYLLQHAHNPVDWFPWGKEALEKARAENKLILVSVGYSACHWCHVMEHESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           EDE VA+++N+ FV IKVDREERPD+D++YM  VQ + G GGWPL+ F  PD +P+ GGT
Sbjct: 62  EDEEVAQIMNEHFVCIKVDREERPDIDQIYMNAVQLMTGRGGWPLNCFCLPDQRPIYGGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASA--SSNKLPDE 279
           YF  ED      +K IL  +   +  K   L ++  +A+ +L + ++ S   S  K   E
Sbjct: 122 YFQKED------WKNILHNLAGFYANK---LQEAEEYAV-RLMDGINQSERLSFVKEEKE 171

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
             Q  +    +     +D   GG   APKFP P     ++  +  ++D            
Sbjct: 172 YTQEHIENIVKPWKMHFDFSEGGQNRAPKFPMPDNWAFLMKVAHLMKDDA-------AFV 224

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           +   TL  MA GGI+D +GGGF RYSVD  WH+PHFEKMLYD GQL ++Y DA+   K+ 
Sbjct: 225 ITRLTLDKMAAGGIYDQLGGGFARYSVDHEWHIPHFEKMLYDNGQLMSLYADAYKYYKNE 284

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG- 458
            Y  +  +  D+++R+M  P    +SA DADS   EG     EG FY W  +E+E IL  
Sbjct: 285 RYKEVVYETYDWIKREMTSPEYGFYSALDADS---EGV----EGKFYTWDKQEIEKILDK 337

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
           E A +F  +Y +   GN +   +          N L    +    A    + +E+   I+
Sbjct: 338 EQAAIFNAYYAVTDEGNWEEEEI----------NHLWIRKEKQHIAEAFHISIERLDEII 387

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
              + +L + R+KR  P LDDK++ SWN L++     A K    +               
Sbjct: 388 QHSKTQLLEYRNKRIHPGLDDKILTSWNALMLKGLCDAYKAFADQ--------------- 432

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
            +++ +A   A F+  +L  E    L  +++NG +    FLDDYA L    + LYE    
Sbjct: 433 -QFLTLALDNAKFLLNNLCREDG-MLYRNYKNGKATIEAFLDDYALLAQAFISLYEVTFD 490

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
             W+  A  L +   + F D + G +F T+    +++ R  E  D   PS NSV   NL 
Sbjct: 491 EAWIFKAKSLCDYVIKHFSDAQSGMFFYTSDASEALVARKYEIMDNVIPSSNSVMAWNLR 550

Query: 699 RLA 701
           +L+
Sbjct: 551 KLS 553


>gi|428781674|ref|YP_007173460.1| thioredoxin domain-containing protein [Dactylococcopsis salina PCC
           8305]
 gi|428695953|gb|AFZ52103.1| thioredoxin domain protein [Dactylococcopsis salina PCC 8305]
          Length = 678

 Score =  378 bits (971), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 247/665 (37%), Positives = 351/665 (52%), Gaps = 76/665 (11%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRLA   S YL +HA NP+DW+ W  EA  +A+  D PIFLS+GYS+CHWC VME E+F
Sbjct: 2   TNRLAETQSLYLRKHAENPIDWWYWCSEALEKAKNEDKPIFLSVGYSSCHWCTVMEGEAF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGG 220
            D  +A+ LN+ F+ IKVDREERPD+D +YM  +Q + G GGWPL++FL+P D  P  GG
Sbjct: 62  SDSTIAQYLNENFIPIKVDREERPDLDSIYMQALQMMTGQGGWPLNIFLTPHDRVPFYGG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFP E +YGRPGF  IL+ ++  +D++++ L    +F  E ++  L  SA+       L
Sbjct: 122 TYFPLEPRYGRPGFLQILQAIRRFYDQEKEKL---NSFKGEVMT-LLQRSAT-------L 170

Query: 281 PQNALRLCAEQLSKSYDSRFG---GFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEG 337
           P +   L  E L K  ++  G     G+ P FP     Q+    ++  +++    EA   
Sbjct: 171 PSSETPLNRELLIKGLETAVGITSSRGTPPSFPMIPHAQLARRKTQFSDESRYDAEAITT 230

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS--L 395
           Q+ +  TL     GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ+     + +S  +
Sbjct: 231 QRGMDLTL-----GGIYDHVGGGFHRYTVDGTWTVPHFEKMLYDNGQIMEYLANLWSSGV 285

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
            +  F S I   +  +L+R+M  P G  ++++DADS  T      +EGAFYVW+ +E+E 
Sbjct: 286 KEPAFASAIAHAV-QWLQREMTAPEGYFYASQDADSFTTSEEAEPEEGAFYVWSYQELES 344

Query: 456 ILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI-----ELNDSSASASK--- 506
           +L  E     +  + +   GN            F+G NVL      EL+  S +A K   
Sbjct: 345 LLTPEELNALQSEFTVTSEGN------------FEGNNVLQRQTGGELSSPSETALKKLF 392

Query: 507 ------LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 560
                 L  P+  +         K      + P P  D K+I +WN L+IS  ARA    
Sbjct: 393 NARYGNLSSPVTPFPPATNNTEAKQTAWEGRIP-PVTDTKMITAWNSLMISGLARA---- 447

Query: 561 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSFRNGPSKAPGFL 619
                     + V G   K Y E A  AA+FI  + +   + +RL +   +G +      
Sbjct: 448 ----------YAVFG--EKTYWECAVKAANFIGENQWVAGRFYRLNY---DGKATVSAQS 492

Query: 620 DDYAFLISGLLDLY-EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPS-VLLR 677
           +DYA  I  LLDLY      T+WL  A +LQ T DE     E GGYFNT  ++ S +++R
Sbjct: 493 EDYALFIKALLDLYCCHPEQTQWLDQATQLQATFDEYLWSSETGGYFNTAKDNSSDLIIR 552

Query: 678 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 737
            +   D A P+ N V+V NLVRL  +    K+DY   +AE +L  F + ++    A P +
Sbjct: 553 ERTYIDNATPAANGVAVANLVRLFELT--EKTDYV-ASAEKTLQAFSSIMEQSPQACPGL 609

Query: 738 CCAAD 742
               D
Sbjct: 610 FSGLD 614


>gi|408671866|ref|YP_006871614.1| protein of unknown function DUF255 [Emticicia oligotrophica DSM
           17448]
 gi|387853490|gb|AFK01587.1| protein of unknown function DUF255 [Emticicia oligotrophica DSM
           17448]
          Length = 679

 Score =  378 bits (970), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 227/617 (36%), Positives = 325/617 (52%), Gaps = 71/617 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N+L  E SPYLLQHAHNPV+W+ WGEEA  +A++ D PI +SIGYS CHWCHVME ESFE
Sbjct: 3   NKLINETSPYLLQHAHNPVEWYPWGEEALQKAKEEDKPILVSIGYSACHWCHVMERESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +E +A+++N   V IKVDREERPDVD +YM  +QA+   GGWPL+VFL PD KP  GGTY
Sbjct: 63  NEQIAQIMNQHLVCIKVDREERPDVDAIYMDALQAMGLRGGWPLNVFLMPDAKPFYGGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSG-AFAIEQLSEALSASASSNKLPDELP 281
           FPP +      +  ++  + +A+   R+ L +S   F    L +       S +      
Sbjct: 123 FPPRN------WANLVESIANAFKNDREKLQKSAEGFTQNMLVKESDKYRMSVEDTLSFS 176

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           +  L     +L + +D   GG   +PKFP P   + ++ +     D           + +
Sbjct: 177 EEELTTIFNRLHQDFDFEKGGMNRSPKFPMPSIWKFLIRYYSITND-------KRAYQHL 229

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK---- 397
           + TL  +A GGI+D +GGG+ RYS DE W VPHFEKMLYD GQL ++Y +A++LTK    
Sbjct: 230 IHTLNRVALGGIYDTIGGGWTRYSTDEDWKVPHFEKMLYDNGQLISLYAEAYALTKSEGN 289

Query: 398 -DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
            D FY+    + +++L R+M+   G  +SA DADS   EG    +EG FY+W  +E+   
Sbjct: 290 PDNFYAAKVTETIEWLEREMMSKEGGFYSALDADS---EG----EEGKFYIWKKEEIIAA 342

Query: 457 LGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASASKLGMPLEKYL 515
           LGE A  F E +     GN +            G NV+ +E  D   +    G PL    
Sbjct: 343 LGEDAGPFIETFDFTEAGNWE-----------HGNNVVHLEERDFMEN----GWPL---- 383

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
               E ++KLFD R+KR RP LDDK++ SWNGL++     A + L               
Sbjct: 384 --TAEIKQKLFDFRAKRVRPGLDDKILCSWNGLMLKGLVDAYRYL--------------- 426

Query: 576 SDRKEYMEVAESAASFIRRHLY-------DEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 628
            D ++++++A   A FI+  +          +   L H+++NG +    +L+DYA +I  
Sbjct: 427 -DNQKFLDLALKNAHFIKDCMSIKVMNEDGSEARGLWHNYKNGKANIVAYLEDYASVIDA 485

Query: 629 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 688
            L LY+      WL  A  L       F D E   ++ T  +   ++ R KE  D   P+
Sbjct: 486 YLALYQVTFDEVWLHEAEMLAIYTVANFYDDEDEFFYFTDSQGEELIARKKEIFDNVIPA 545

Query: 689 GNSVSVINLVRLASIVA 705
            NS+   NL  L  I+ 
Sbjct: 546 SNSIMATNLYNLGLILG 562


>gi|402773173|ref|YP_006592710.1| thioredoxin domain-containing protein [Methylocystis sp. SC2]
 gi|401775193|emb|CCJ08059.1| Thioredoxin domain protein [Methylocystis sp. SC2]
          Length = 675

 Score =  377 bits (969), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 228/666 (34%), Positives = 345/666 (51%), Gaps = 70/666 (10%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRL  E SPYLLQH HNPV W AW  E  A A++   PI LS GY+ CHWCHVM  ESF
Sbjct: 5   TNRLGQETSPYLLQHQHNPVHWQAWSAETLALAKQTGKPILLSSGYAACHWCHVMAHESF 64

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           E+  +A L+N+ F+++KVDREERPDVD +Y   +  +   GGWPL++FL+P+ +P  GGT
Sbjct: 65  ENPEIAALMNESFINVKVDREERPDVDYLYQQALMMMGQRGGWPLTMFLTPEGQPFWGGT 124

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFPP  + GRPGF  +L+ + + W  + + +  +    + +LS  L++ + +       P
Sbjct: 125 YFPPFAQGGRPGFAELLKTIAELWRARANAIEHN----VAELSAGLASLSETTPGEPVSP 180

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
                +CA QL++  D   GGFG+APKFP+   +  +    K      ++G  S  Q +V
Sbjct: 181 HLVESICA-QLAQRLDRVDGGFGAAPKFPQTTSLDFLWRAWK------RTGRDSLRQAVV 233

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
           L TL  +++GG++DH+GGGF RYS D RW VPHFEKMLYD  QL  +  + +   +   Y
Sbjct: 234 L-TLDHISQGGVYDHLGGGFARYSTDNRWLVPHFEKMLYDNAQLIELLTEVWQDERRELY 292

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHA 461
                + ++++ R+M  PGG   S+ DADS   EG    +EG FY W+  E+ + LG  A
Sbjct: 293 RLRVTETIEWMTREMRAPGGGFASSLDADS---EG----EEGKFYAWSQTEIREALGARA 345

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-----IELNDSSASASKLGMPLEKYLN 516
             F+  Y +   GN +            GK+VL     IEL D    A+        +L 
Sbjct: 346 PFFERAYGVSREGNWE-----------HGKSVLNRLGSIELLDEETEAALARDRAALFL- 393

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
                       R++R RP  DDKV+  WNGL I++ A+A+ +                 
Sbjct: 394 -----------ARARRVRPGCDDKVLADWNGLTIAAIAKAACVF---------------- 426

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
           +R++++++A +A  F++  +  ++  RL HS+R   ++    LDDY  +    L LYE  
Sbjct: 427 EREDWLDIAIAAFDFVKSAMTTDEG-RLLHSWRCARARHMAVLDDYGAMCRAALALYEAA 485

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
               +L  A       +  + DR  GGYF    +  +++ RVK   D A PSGN + +  
Sbjct: 486 GAPSYLECARRWVEHVEHHYRDRT-GGYFYAADDADTLIARVKIAEDSALPSGNGMMLQA 544

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVG 756
           L +L  +   S    YR+ AE     F   +++  +    +    +ML       +V++G
Sbjct: 545 LAQLYYLTGES---VYRERAEAIAQDFAGTIRERILGFSSLLNGMEMLR--EALQIVVIG 599

Query: 757 HKSSVD 762
              + D
Sbjct: 600 ENDAAD 605


>gi|302536490|ref|ZP_07288832.1| conserved hypothetical protein [Streptomyces sp. C]
 gi|302445385|gb|EFL17201.1| conserved hypothetical protein [Streptomyces sp. C]
          Length = 687

 Score =  377 bits (969), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 241/661 (36%), Positives = 336/661 (50%), Gaps = 60/661 (9%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           +NRLA E SPYLLQHA NPVDW+ W  EAFAEAR+RDVP+ LS+GYS+CHWCHVM  ESF
Sbjct: 2   SNRLANETSPYLLQHADNPVDWWPWSPEAFAEARERDVPVLLSVGYSSCHWCHVMAGESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           ED+  A  +N+ FV+IKVDREERPD+D VYM  VQA  G GGWP++VFL+PD +P   GT
Sbjct: 62  EDDLAAAYMNEHFVNIKVDREERPDIDAVYMEAVQAATGQGGWPMTVFLTPDAEPFYFGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EALSASASSNKLPDEL 280
           YFPPE ++G P F  +L  V+ AW  +R+ +++     +  L+   L    +    P+EL
Sbjct: 122 YFPPEPRHGMPSFMQVLEGVRTAWAGRREEVSEVAQRIVRDLAGRQLDYGRAGLPGPEEL 181

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
            +  L      L++ YD+  GGFG APKFP  + ++ +L H  +   TG  G      +M
Sbjct: 182 GRALL-----GLTREYDAARGGFGGAPKFPPSMVLEFLLRHHAR---TGSEG----ALQM 229

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
              T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  VY   +  T    
Sbjct: 230 AADTCEAMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALLCRVYAHLWRATGSDL 289

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
              +  +  D++ R++    G   SA DADS E   + +  EGA+Y WT  E+ ++LGE 
Sbjct: 290 ARRVALETADFMVRELRTEQGGFASALDADS-EDPSSGKHVEGAYYAWTPAELAEVLGEE 348

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
                  Y+    G  +          F+    +++L          G P+ +   +   
Sbjct: 349 DGAVAAAYF----GVTE-------EGTFEHGRSVLQLPQ--------GGPVVEAGKV-AS 388

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R +L   R +RP P  DDKV+ +WNGL +++ A                      +R +
Sbjct: 389 IRERLLAARGRRPAPGRDDKVVAAWNGLAVAALAECGAFF----------------ERPD 432

Query: 581 YMEVAESAASFIRRHLYDEQT--HRLQHSFRNGPSKA-PGFLDDYAFLISGLLDLYEFGS 637
            +E A  AA  + R  +D      RL  + R+G      G L+DY  +  G L L     
Sbjct: 433 LVERAIEAADLLVRVHFDSTAGMARLARTSRDGRVGVNAGVLEDYGDVAEGFLALASVTG 492

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED-HDGAEPSGNSVSVIN 696
              WL +A  L +     F    G G    T  D   L+R  +D  D A PSG + +   
Sbjct: 493 EGVWLEFAGFLVDLVMARFT--AGDGSLYDTAHDAEQLIRRPQDPTDTAAPSGWTAAAGA 550

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVG 756
           L+   S  A + S  +R+ AE +L V           +      A+ L V   + V +VG
Sbjct: 551 LL---SYAAHTGSAPHREAAERALGVVHALGPRAPRFIGHGLAVAEAL-VDGPREVAVVG 606

Query: 757 H 757
           H
Sbjct: 607 H 607


>gi|383830441|ref|ZP_09985530.1| thioredoxin domain containing protein [Saccharomonospora
           xinjiangensis XJ-54]
 gi|383463094|gb|EID55184.1| thioredoxin domain containing protein [Saccharomonospora
           xinjiangensis XJ-54]
          Length = 667

 Score =  377 bits (969), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 228/621 (36%), Positives = 321/621 (51%), Gaps = 63/621 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   SPYLLQHA NPVDW+ WG EA  EAR+RDVPI LSIGY+ CHWCHVM  ESF 
Sbjct: 2   NRLADATSPYLLQHADNPVDWWPWGPEALGEARRRDVPILLSIGYAACHWCHVMAHESFS 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+ VA  +N+ FV+IKVDREERPD+D VYM   QA+ G GGWP++ FL+P+ KP   GTY
Sbjct: 62  DDDVAAFMNEHFVNIKVDREERPDIDAVYMAATQAMTGQGGWPMTCFLTPEGKPFHCGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           +PP   +G P F+ +L  V  AW ++R  L +     +E ++E  +   S++ + ++   
Sbjct: 122 YPPVPAHGMPSFRQVLEAVDQAWRERRAELVEGAGRIVEHIAE-RTTPLSTHPVDEDTVT 180

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
           +A+      L    D   GGFG APKFP  + ++ +L H    E TG    +++   +V 
Sbjct: 181 SAV----ATLRTETDPGHGGFGGAPKFPPSMVLEFLLRH---YERTG----SAQALSIVD 229

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            T + MA+GGI+D + GGF RYSVD  W VPHFEKMLYD   L   Y      T      
Sbjct: 230 LTAEGMARGGIYDQLAGGFARYSVDAGWVVPHFEKMLYDNALLLRFYAHLARRTGSALAH 289

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHA 461
            +  +  ++L RD+  P G   S+ DAD+   EG T       YVWT +++ D+LG +  
Sbjct: 290 RVAGETAEFLLRDLRTPEGGFASSLDADTDGVEGLT-------YVWTPQQLVDVLGRDDG 342

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
           +   E + +   G  +           +G + L    D    A        +++ +    
Sbjct: 343 VWAAETFGVTREGTFE-----------RGASTLQLRRDPDDPA--------RWMRVT--- 380

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
              L + R+ RP+P  DDKVI +WNGL I++ A A   L+                R E+
Sbjct: 381 -SALVEARNARPQPARDDKVIAAWNGLAITALAEAGLALR----------------RPEW 423

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           +E A +A +F+           L  S R+G    A G L+DY  L  GLL L++    + 
Sbjct: 424 VEAAVAAGAFVLD--VHASGDGLLRSSRDGVAGAAAGVLEDYGCLADGLLALHQATGESG 481

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL-RVKEDHDGAEPSGNSVSVINLVR 699
           WLV A  L +T    F      G F+ T ED   L+ R  +  D A PSG S     L+ 
Sbjct: 482 WLVEATSLIDTALRRFGVEGAPGAFHDTAEDAETLVHRPSDPTDNASPSGASALAGALLT 541

Query: 700 LASIVAGSKSDYYRQNAEHSL 720
            +++    ++  YR   E +L
Sbjct: 542 ASALAGPDRAGAYRAACEEAL 562


>gi|289769445|ref|ZP_06528823.1| conserved hypothetical protein [Streptomyces lividans TK24]
 gi|289699644|gb|EFD67073.1| conserved hypothetical protein [Streptomyces lividans TK24]
          Length = 680

 Score =  377 bits (969), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 241/627 (38%), Positives = 330/627 (52%), Gaps = 61/627 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   SPYLLQHA NPVDW+ W  EAF EAR+R VP+ LS+GYS CHWCHVM  ESFE
Sbjct: 3   NRLAQATSPYLLQHAENPVDWWPWEAEAFEEARRRGVPVLLSVGYSACHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D   A+ LN  FVS+KVDREERPDVD VYM  VQA  G GGWP++VFL+PD +P   GTY
Sbjct: 63  DGPTAEYLNSHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMTVFLTPDAEPFYFGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EALSASASSNKLPDELP 281
           FPPE ++G P F+ +L+ V+ AW ++RD + +     +  L+   +S   +     ++L 
Sbjct: 123 FPPEPRHGMPSFRQVLQGVRQAWAERRDEVDEVAGKIVRDLAGREISYGDAEAPGEEQLG 182

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           Q  L      L++ YD R GGFG APKFP  + I+ +L H  +   TG  G      +M 
Sbjct: 183 QALL-----GLTREYDERRGGFGGAPKFPPSMVIEFLLRHHAR---TGAEG----ALQMA 230

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  VY   +  T     
Sbjct: 231 ADTCERMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALLCRVYAHLWRATGSDLA 290

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EH 460
             +  +  D++ R++    G   SA DADS   +G  +  EGA YVWT  ++ ++LG E 
Sbjct: 291 RRVALETADFMVRELRTAEGGFASALDADS--DDGTGKHVEGAHYVWTPAQLTEVLGAED 348

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASASKLGMPLEKYLNILG 519
           A L  +++ +   G  +            G +VL +   +S   A++           + 
Sbjct: 349 AELAAQYFGVTQEGTFE-----------HGASVLQLPQQESVFDAAR-----------IA 386

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
             R +L   R  RP P  DDKV+ +WNGL I++ A            A F  P       
Sbjct: 387 SVRERLLAARDGRPAPGRDDKVVAAWNGLAIAALAET---------GAYFERP------D 431

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAFLISGLLDLYEFGSG 638
                  +A   +R HL DEQ  RL  + ++G + A  G L+DYA +  G L L      
Sbjct: 432 LVEAAVAAADLLVRLHL-DEQV-RLTRTSKDGRAGANAGVLEDYADVAEGFLALASVTGE 489

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
             WL +A  L +     F D E G  ++T  +   ++ R ++  D A PSG S +   L+
Sbjct: 490 GVWLDFAGFLLDHVLTRFTD-ESGSLYDTAADAERLIRRPQDPTDNATPSGWSAAAGALL 548

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFET 725
              S  A + S  +R  AE +L V + 
Sbjct: 549 ---SYAAHTGSAPHRAAAERALGVVKA 572


>gi|52078696|ref|YP_077487.1| hypothetical protein BL00131 [Bacillus licheniformis DSM 13 = ATCC
           14580]
 gi|319649027|ref|ZP_08003236.1| YyaL protein [Bacillus sp. BT1B_CT2]
 gi|52001907|gb|AAU21849.1| conserved protein YyaL [Bacillus licheniformis DSM 13 = ATCC 14580]
 gi|317389021|gb|EFV69839.1| YyaL protein [Bacillus sp. BT1B_CT2]
          Length = 625

 Score =  377 bits (968), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 229/556 (41%), Positives = 311/556 (55%), Gaps = 59/556 (10%)

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           M  ESFEDE VAKLLN+ FVSIKVDREERPDVD +YMT  Q + G GGWPL+VFL+PD K
Sbjct: 1   MAHESFEDEEVAKLLNEKFVSIKVDREERPDVDSIYMTICQMMTGQGGWPLNVFLTPDQK 60

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           P   GTYFP   ++ RPGF  +++++ D + K R+ +        E+ +  L   A S+ 
Sbjct: 61  PFYAGTYFPKTSRFNRPGFVEVVKQLSDTFAKNREHVEDIA----EKAANNLRIKAKSDA 116

Query: 276 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHSKKLEDTGKSGEA 334
             D L ++ LR   +QL  S+D+ +GGFGSAPKFP P  +  +L YH         SGE 
Sbjct: 117 -GDSLGEDILRRTYQQLINSFDAAYGGFGSAPKFPIPHMLTFLLRYHQ-------YSGEE 168

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
           +     V+ TL  MA GGI+DHVG GF RYS D+ W VPHFEKMLYD   L   Y +A+ 
Sbjct: 169 N-ALYSVMKTLDSMANGGIYDHVGYGFARYSTDDEWLVPHFEKMLYDNALLLIAYTEAYQ 227

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
           +TK+  Y  I   I+ ++RR+M    G  +SA DAD   TEG     EG +YVW+ +EV 
Sbjct: 228 ITKNERYKQISEQIITFVRREMTDEKGAFYSALDAD---TEGV----EGKYYVWSKEEVL 280

Query: 455 DILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKN----VLIELNDSSASASKLGM 509
           + LG E   L+   Y +   GN            F+G N    +   L D      +  +
Sbjct: 281 ETLGDELGELYCAVYNITQEGN------------FEGHNIPNLIYTRLEDIK---DEFAL 325

Query: 510 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 569
             E+  N L E R KLF+ R +R  PH+DDKV+ SWN L+I+  A+A+K+         +
Sbjct: 326 TDEELQNKLEEARTKLFEKRQERTYPHVDDKVLTSWNALMIAGLAKAAKV---------Y 376

Query: 570 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 629
           N P       EY+E+A +AA FI   L   Q  R+   +R+G  K  GF+DDYAFL+   
Sbjct: 377 NAP-------EYLEMARAAAEFIENKLI--QDGRIMVRYRDGEVKNKGFIDDYAFLLWAY 427

Query: 630 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 689
           ++LYE       L  A +L+     LF D E GG++ T  +  ++++R KE +DGA PSG
Sbjct: 428 IELYEASLDLTDLRKAKKLEADMKGLFWDEEHGGFYFTGSDAEALIVRDKEVYDGALPSG 487

Query: 690 NSVSVINLVRLASIVA 705
           N V  + L RL  +  
Sbjct: 488 NGVLAVQLSRLGRLTG 503


>gi|374369685|ref|ZP_09627707.1| hypothetical protein OR16_29084 [Cupriavidus basilensis OR16]
 gi|373098764|gb|EHP39863.1| hypothetical protein OR16_29084 [Cupriavidus basilensis OR16]
          Length = 683

 Score =  377 bits (967), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 246/625 (39%), Positives = 338/625 (54%), Gaps = 78/625 (12%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRLA E SPYL QHA NPVDW+ W E AF  AR+ D P+ LS+GY+ CHWCHVM  ESF
Sbjct: 2   TNRLATETSPYLRQHAANPVDWYPWSEAAFRRAREDDKPVLLSVGYAACHWCHVMAHESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           E+  +A L+N  F+SIKVDR+ERPD+D +Y      +  GGGWPL+VFL+P  +P  GGT
Sbjct: 62  ENPRIAGLMNARFISIKVDRQERPDIDDIYQKVPLMMGQGGGWPLTVFLTPQGEPFFGGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKK----RDMLAQ-SGAFAIEQLSEALSASASSNKL 276
           YFPP+D+YGRPGF  +L  + +AW  +    RDM+ Q    F    L +    +A    L
Sbjct: 122 YFPPDDRYGRPGFVRVLLSLSEAWTHRRGELRDMIEQFRLGFRQLDLVDLGREAAEVEDL 181

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
           P +         A  L++  D   GG G APKFP      ++L   +  + TG+    + 
Sbjct: 182 PAQ--------TARALAQDTDPTHGGLGGAPKFPNASGYDLVL---RICQRTGEPVLLAA 230

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
            ++    TL  MA GGIHD +GGGF RYSVDERW VPHFEKMLYD GQL  +Y DA+ LT
Sbjct: 231 LER----TLDGMAAGGIHDQLGGGFARYSVDERWAVPHFEKMLYDNGQLVTLYADAYRLT 286

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
               +  +  + + Y+ RDM  P G  ++ EDADS   EG    +EG FYVWT  EV  +
Sbjct: 287 GKPAWRRVFEEAIAYIVRDMTHPDGCFYAGEDADS---EG----EEGRFYVWTPAEVRAV 339

Query: 457 LG--EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 514
           LG  E A+             C    ++D  N  +G +VL    + +A+      P ++ 
Sbjct: 340 LGASEGAL------------ACRAYGVTDGGNFARGTSVL----NRAATLD----PFDE- 378

Query: 515 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 574
              L + R +LF  R++R RP  DD ++  WNGL+I     A +             P +
Sbjct: 379 -ARLEDWRGRLFAARARRARPARDDNILTGWNGLMIQGLCAAYQATGCP--------PHL 429

Query: 575 GSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE 634
            + R+    + E         + D   +R   ++++G +K PGFL+DYA L + L+DLYE
Sbjct: 430 AAARRAASAIQEKLT------MPDGGVYR---AWKDGTAKVPGFLEDYALLANALIDLYE 480

Query: 635 FGSGTKWLVWAIELQNTQDELFLD--REGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 692
                ++L  A+EL      L LD  R+ G YF     +P ++ R +  HD A PSG S 
Sbjct: 481 SCFDKRYLDRAVELV----ALILDKFRDDGLYFTPRDGEP-LVHRPRAPHDSAWPSGIST 535

Query: 693 SVINLVRLASIVAGSKSDYYRQNAE 717
           SV   +RL ++   +  D YR  AE
Sbjct: 536 SVFAFLRLHAL---TGRDVYRDLAE 557


>gi|325845722|ref|ZP_08169003.1| hypothetical protein HMPREF9402_0744 [Turicibacter sp. HGF1]
 gi|325488252|gb|EGC90680.1| hypothetical protein HMPREF9402_0744 [Turicibacter sp. HGF1]
          Length = 614

 Score =  377 bits (967), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 229/608 (37%), Positives = 321/608 (52%), Gaps = 73/608 (12%)

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           ME ESFEDE VA  LN+ F+SIKVDREERPD+D VYM+  QAL G GGWPL++F++P  +
Sbjct: 1   MEHESFEDEDVATYLNEHFISIKVDREERPDIDTVYMSICQALTGQGGWPLTIFMTPTQQ 60

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
               GTYFP   +YGRPGF  +L+ +   W+  R  +            +        + 
Sbjct: 61  AFYAGTYFPKTSRYGRPGFLDVLKTIDFNWNHHRAKVTDITKQIASHFKDLEGIETEGDS 120

Query: 276 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
           L   + QN +     QL +SYD RFGGFG+APKFP P ++  +L + ++ +D        
Sbjct: 121 LSMAIIQNGVN----QLKQSYDPRFGGFGTAPKFPTPHKLMFLLRYDEQTKDKSV----- 171

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
             Q MV  TL  M KGGI DH+G GF RYS DE W VPHFEKMLYD   L   Y +A+ +
Sbjct: 172 --QDMVTQTLDHMYKGGIFDHLGYGFSRYSTDEIWLVPHFEKMLYDNALLMISYTEAYQV 229

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           T++  Y  I     +Y+   +  P G  + AEDADS   EG    +EG FYV+T  E+  
Sbjct: 230 TREPRYLSIAMQTAEYVLTQLTSPEGGFYCAEDADS---EG----EEGKFYVFTPAEIIQ 282

Query: 456 ILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 514
           ILG E    F E Y +   GN            F+GKN+L  L+            LE  
Sbjct: 283 ILGPEKGHWFNEFYNVTEEGN------------FEGKNILNRLHHKK---------LELD 321

Query: 515 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 574
           +  L  CR  L   R +R   H DDK++ SWNGL+I++FA+                 + 
Sbjct: 322 IKELEACRETLLTYRLERTHLHKDDKILTSWNGLMIAAFAK-----------------LY 364

Query: 575 GSDRKE-YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
           G  +K  Y++ A  A +FI++HL+DE   RL   +R G S    +LDDYAFL  GL++L+
Sbjct: 365 GQTQKMIYLDAASKAVTFIKQHLFDET--RLLARYREGESHFKAYLDDYAFLSYGLIELH 422

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           +  +  ++L  AI+L     +LF D E GG++ T  +  +++LR KE +DGA PSGNSV+
Sbjct: 423 QSTAEVEYLELAIQLNKEMLDLFKD-EAGGFYLTGHDAETLMLRPKELYDGAMPSGNSVA 481

Query: 694 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA---------DML 744
             NL+RLA +   +    +   AE  +     ++K   M       AA          M+
Sbjct: 482 AYNLIRLAKLTGDT---LFETEAEKQIQYLAKQVKHYEMNHTFYLIAALFALSDTKELMI 538

Query: 745 SVPSRKHV 752
           +VP ++ +
Sbjct: 539 TVPKQEQI 546


>gi|383775980|ref|YP_005460546.1| hypothetical protein AMIS_8100 [Actinoplanes missouriensis 431]
 gi|381369212|dbj|BAL86030.1| hypothetical protein AMIS_8100 [Actinoplanes missouriensis 431]
          Length = 688

 Score =  376 bits (966), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 239/630 (37%), Positives = 345/630 (54%), Gaps = 63/630 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL + +SPYLLQHA NPVDW+ WG++AFAEA++RDVP+ +S+GYS+CHWCHVM  ESFE
Sbjct: 2   NRLGSANSPYLLQHADNPVDWWPWGDDAFAEAKRRDVPLLISVGYSSCHWCHVMAHESFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  +A  +N+ FVS+KVDREERPDVD VYMT  QA+ G GGWP++VF +PD  P   GTY
Sbjct: 62  DAAIAAQMNEGFVSVKVDREERPDVDAVYMTATQAMTGQGGWPMTVFATPDGDPFFCGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           F P D++GR     +L  V  AW  +RD + + GA  +E +  A        + P  +  
Sbjct: 122 F-PRDQFGR-----LLASVTTAWRDQRDDVLKQGAAVVEAVGGAQMIGGP--RAP--ISG 171

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
           + L   A+ L+K  D  +GGFG APKFP  + +  +L H ++   TG    +++  ++V 
Sbjct: 172 DLLAAAAQGLAKEQDQTYGGFGGAPKFPPHMNLLFLLRHHER---TG----SADALEIVR 224

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
              + MA+GGI+D + GGF RY+VDE W VPHFEKMLYD   L  VY   + LT D+F  
Sbjct: 225 HACERMARGGIYDQLAGGFARYAVDETWTVPHFEKMLYDNALLLRVYTQLWRLTGDLFAR 284

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHA 461
            I  +   +L RD+    G + SA DAD++  EG T       Y WT  E+ + LG E  
Sbjct: 285 RIADETAAFLLRDLGTAQGGLASALDADTSGVEGLT-------YAWTPAELAEALGAEDG 337

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFK--------GKNVLIELNDSSASASKLGMPLEK 513
               + + +   G    +  S P +           GK+VL+   D   +   +   +E+
Sbjct: 338 AWAADLFRVTEPGTFAHNSASAPIDGAADRMKGVEHGKSVLVLARDIDEADPAI---VER 394

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
           + ++    R++L   R+ RP+P  DDKV+ SWNGL I++ A    +L   A S       
Sbjct: 395 WRDV----RQRLLTARNGRPQPARDDKVVASWNGLAITALAE-HGVLTGSAGS------- 442

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP-GFLDDYAFLISGLLDL 632
               R   + +AE  A    RHL D    RL+   R+G +  P G L+DY  +    L +
Sbjct: 443 ----RDAAVALAEVLAD---RHLVD---GRLRRVSRDGVAGEPAGVLEDYGSVAEAFLAV 492

Query: 633 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 692
           ++  +  +WL  A EL +     F   + GG+++T  +   +L R  +  D A PSG SV
Sbjct: 493 HQVTASPRWLTLAGELLDVALARFGSGD-GGFYDTADDAEKLLTRPADPTDNATPSGLSV 551

Query: 693 SVINLVRLASIVAGSKSDYYRQNAEHSLAV 722
               LV  A++   S S  +R+ A+ +LA 
Sbjct: 552 VCAALVSYAAL---SGSTAHREAADAALAT 578


>gi|21223348|ref|NP_629127.1| hypothetical protein SCO4975 [Streptomyces coelicolor A3(2)]
 gi|20520976|emb|CAD30960.1| conserved hypothetical protein [Streptomyces coelicolor A3(2)]
          Length = 686

 Score =  376 bits (966), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 240/627 (38%), Positives = 330/627 (52%), Gaps = 61/627 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   SPYLLQHA NPVDW+ W  EAF EAR+R VP+ LS+GYS CHWCHVM  ESFE
Sbjct: 9   NRLAQATSPYLLQHAENPVDWWPWEAEAFEEARRRGVPVLLSVGYSACHWCHVMAHESFE 68

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D   A+ LN  FVS+KVDREERPDVD VYM  VQA  G GGWP++VFL+PD +P   GTY
Sbjct: 69  DGPTAEYLNSHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMTVFLTPDAEPFYFGTY 128

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EALSASASSNKLPDELP 281
           FPPE ++G P F+ +L+ V+ AW ++RD + +     +  L+   +S   +     ++L 
Sbjct: 129 FPPEPRHGMPSFRQVLQGVQQAWAERRDEVDEVAGKIVRDLAGREISYGDAEAPGEEQLG 188

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           Q  L      L++ YD R GGFG APKFP  + I+ +L H  +   TG  G      +M 
Sbjct: 189 QALL-----GLTREYDERRGGFGGAPKFPPSMVIEFLLRHHAR---TGAEG----ALQMA 236

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  VY   +  T     
Sbjct: 237 ADTCERMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALLCRVYAHLWRATGSDLA 296

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EH 460
             +  +  D++ R++    G   SA DADS   +G  +  EGA YVWT  ++ ++LG E 
Sbjct: 297 RRVALETADFMVRELRTAEGGFASALDADS--DDGTGKHVEGAHYVWTPAQLTEVLGAED 354

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASASKLGMPLEKYLNILG 519
           A L  +++ +   G  +            G +VL +   +S   A++           + 
Sbjct: 355 AELAAQYFGVTQEGTFE-----------HGASVLQLPQQESVFDAAR-----------IA 392

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
             R +L   R  RP P  DDKV+ +WNGL +++ A            A F  P       
Sbjct: 393 SVRERLLAARDGRPAPGRDDKVVAAWNGLAVAALAET---------GAYFERP------D 437

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAFLISGLLDLYEFGSG 638
                  +A   +R HL DEQ  RL  + ++G + A  G L+DYA +  G L L      
Sbjct: 438 LVEAAVAAADLLVRLHL-DEQV-RLTRTSKDGRAGANAGVLEDYADVAEGFLALASVTGE 495

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
             WL +A  L +     F D E G  ++T  +   ++ R ++  D A PSG S +   L+
Sbjct: 496 GVWLDFAGFLLDHVLTRFTD-ESGSLYDTAADAERLIRRPQDPTDNATPSGWSAAAGALL 554

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFET 725
              S  A + S  +R  AE +L V + 
Sbjct: 555 ---SYAAHTGSAPHRAAAERALGVVKA 578


>gi|421744678|ref|ZP_16182637.1| thioredoxin domain-containing protein [Streptomyces sp. SM8]
 gi|406686908|gb|EKC90970.1| thioredoxin domain-containing protein [Streptomyces sp. SM8]
          Length = 675

 Score =  375 bits (964), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 240/626 (38%), Positives = 331/626 (52%), Gaps = 62/626 (9%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRLA   SPYLLQHA NPVDW  WG EAF EAR+RDVP+ LS+GYS CHWCHVM  ESF
Sbjct: 2   ANRLAQSTSPYLLQHADNPVDWHPWGPEAFEEARRRDVPVLLSVGYSACHWCHVMAHESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           EDE  A ++N  FV++KVDREERPDVD VYM  VQA  G GGWP++VFL+P+ +P   GT
Sbjct: 62  EDEATAAVMNAGFVNVKVDREERPDVDAVYMEAVQAATGQGGWPMTVFLTPEGEPFYFGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLP--DE 279
           YFPPE ++G PGF+ +L  V+ AW ++R  + +     +  L E   A     +LP  +E
Sbjct: 122 YFPPEPRHGMPGFREVLEGVRVAWAERRGEVDEVAGKIVADLRERRLALGEP-RLPGAEE 180

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
             Q  L      L++ YD   GGFG APKFP  + ++ +L H  +   TG  G      +
Sbjct: 181 AAQALL-----GLTREYDPVNGGFGGAPKFPPSMVLEFLLRHYAR---TGAEG----ALQ 228

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           M   T   MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  VY+  +  T   
Sbjct: 229 MAADTAGRMARGGIYDQLGGGFARYSVDREWIVPHFEKMLYDNALLCRVYVHLWRATGSE 288

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 459
               +  +  +++ RD+  P G   SA DADSA+  G  R  EGA+YVWT  ++ ++LGE
Sbjct: 289 QARRVALETAEFMVRDLGTPQGGFASALDADSADASG--RMVEGAYYVWTPAQLVEVLGE 346

Query: 460 H-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
               +   H+ +   G             F+    ++ L     +    G         +
Sbjct: 347 EDGRIAAAHFGVTEEGT------------FEEGASVLRLPQEDGAVQDAGR--------I 386

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
              R +L++ R +RP P  DDKV+ +WNGL I++ A A                    +R
Sbjct: 387 ASIRERLYEARLRRPEPGRDDKVVAAWNGLAIAALAEAGACF----------------ER 430

Query: 579 KEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLISGLLDLYEFG 636
            + ++ A +AA   +R HL D    RL  + R+G  S   G L+DYA +  G L L    
Sbjct: 431 PDLVDAAVTAADLLVRLHLDDHA--RLTRTSRDGRASGNAGVLEDYADVAEGFLALASVT 488

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
               WL +A  L +   + F D E G  ++T  +   ++ R ++  D A PSG + +   
Sbjct: 489 GEGVWLDFAGLLLDGVLDRFTD-ESGALYDTASDAEQLIRRPQDPTDNATPSGWTAAAGA 547

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAV 722
              L    A + S+ +R  AE +L V
Sbjct: 548 ---LLGYAAQTGSEPHRTAAERALGV 570


>gi|336172537|ref|YP_004579675.1| hypothetical protein [Lacinutrix sp. 5H-3-7-4]
 gi|334727109|gb|AEH01247.1| hypothetical protein Lacal_1399 [Lacinutrix sp. 5H-3-7-4]
          Length = 679

 Score =  375 bits (964), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 220/626 (35%), Positives = 328/626 (52%), Gaps = 58/626 (9%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K+TN L  E SPYLLQHAHNP+ W AW       A+K +  I +S+GY+ CHWCHVME E
Sbjct: 4   KYTNDLINETSPYLLQHAHNPIHWKAWNSNTLELAKKENKLIIISVGYAACHWCHVMEHE 63

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFE+E VA ++N  F++IK+DREERPD+D+VYM  VQ + G GGWP++V   PD +P+ G
Sbjct: 64  SFENEDVAIVMNSNFINIKIDREERPDIDQVYMNAVQLMTGSGGWPMNVVALPDGRPVWG 123

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS--ASSNKLP 277
           GTYF  E       +   L ++ D + K  D L +       +L++ + A      N   
Sbjct: 124 GTYFKKEQ------WVNALNQISDLYKKNPDKLYEYAT----KLAKGIKAMDLIKPNTNE 173

Query: 278 DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEG 337
            +     L+      S  +D+  GG G  PKF  P   Q +L          + G   + 
Sbjct: 174 PKFDTTFLKEIIADWSVYFDTNKGGIGKEPKFMMPNNYQFLL----------RYGYQKQD 223

Query: 338 QKMVLF---TLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
           +K++ F   TL  MA GGI+D +GGGF RYSVD++WHVPHFEKMLYD  QL ++Y +AF+
Sbjct: 224 KKILDFVNTTLTKMAYGGIYDQIGGGFSRYSVDDKWHVPHFEKMLYDNAQLVSLYAEAFA 283

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
           LTK+  Y  +  + L++++R++ G  G  +S+ DADS   +     +EGA+YVW  +E++
Sbjct: 284 LTKNELYENVVIETLEFIKRELTGTNGIFYSSLDADSLTEDNVL--EEGAYYVWKKEELQ 341

Query: 455 DILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 514
            +L +   LF  +Y +   G  +       H  +    VLI   +     ++  + LEK 
Sbjct: 342 TLLKDDFKLFSTYYNVNNYGYWE-------HKNY----VLIRDKNDLKFTNQENITLEKL 390

Query: 515 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 574
                  +  L   R KR  P LDDK + SWN L++  +  A ++L+ E           
Sbjct: 391 KEKKKRWKSILLKEREKRNLPRLDDKTLTSWNALMLKGYVDAYRVLQDE----------- 439

Query: 575 GSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE 634
                 Y++ A   A FI  +   E    L H+++NG S   GFL+DYA  I   L LY+
Sbjct: 440 -----NYLDCAIKNAEFILNNQLKEDG-SLYHNYKNGASSINGFLEDYATTIDAFLALYQ 493

Query: 635 FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
             S  KWL  A  L +   + F D E   +F T+ +D  ++++  E  D   P+ NS+  
Sbjct: 494 VTSTIKWLDNAKALTDYCFDTFFDTESQLFFFTSNQDKKLIVQTIEYRDNVIPASNSIMA 553

Query: 695 INLVRLASIVAGSKSDYYRQNAEHSL 720
             L  L+       ++YY + +++ L
Sbjct: 554 NCLYMLSHFY---NNNYYLKTSKNML 576


>gi|432954000|ref|XP_004085500.1| PREDICTED: spermatogenesis-associated protein 20-like, partial
           [Oryzias latipes]
          Length = 393

 Score =  375 bits (964), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 177/322 (54%), Positives = 221/322 (68%), Gaps = 4/322 (1%)

Query: 95  SHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCH 154
           S S +KH+NRLA E SPYLLQHAHNPVDW+ WG EAF +A+  D PIFLS+GYSTCHWCH
Sbjct: 73  SSSPHKHSNRLAREKSPYLLQHAHNPVDWYPWGHEAFEKAKTEDKPIFLSVGYSTCHWCH 132

Query: 155 VMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDL 214
           VME ESF+DE V K+LN  FV IK+DREERPDVDKVYMT+VQA  GGGGWP+SV+L+PDL
Sbjct: 133 VMERESFQDEDVGKILNQHFVCIKLDREERPDVDKVYMTFVQATSGGGGWPMSVWLTPDL 192

Query: 215 KPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSN 274
           +P +GGTYFPP D+  RPGF T+L ++ D W   R  L   G   +  L +  S +A+  
Sbjct: 193 RPFIGGTYFPPRDQGRRPGFITVLTRIIDQWQNNRPSLESGGEKILSALKKGTSITANGG 252

Query: 275 KLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEA 334
           + P   P  A R C +QL+ SY+  +GGF  APKFP PV +  ++        T    E 
Sbjct: 253 EGPPLAPDVADR-CFQQLAHSYEEEYGGFREAPKFPSPVNLMFLMTFWWTNRST---SEG 308

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
            E  +M   TL+ MA GGIHDHV  GFHRYS D  WHVPHFEKMLYDQ QLA  Y+ AF 
Sbjct: 309 LEALQMATHTLRMMALGGIHDHVAQGFHRYSTDSSWHVPHFEKMLYDQAQLAVAYITAFQ 368

Query: 395 LTKDVFYSYICRDILDYLRRDM 416
           ++ +  ++ + +D+L Y+ RD+
Sbjct: 369 VSGERLFADVAKDVLQYVSRDL 390


>gi|291451582|ref|ZP_06590972.1| conserved hypothetical protein [Streptomyces albus J1074]
 gi|291354531|gb|EFE81433.1| conserved hypothetical protein [Streptomyces albus J1074]
          Length = 675

 Score =  375 bits (964), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 240/626 (38%), Positives = 331/626 (52%), Gaps = 62/626 (9%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRLA   SPYLLQHA NPVDW  WG EAF EAR+RDVP+ LS+GYS CHWCHVM  ESF
Sbjct: 2   ANRLAQSTSPYLLQHADNPVDWHPWGPEAFEEARRRDVPVLLSVGYSACHWCHVMAHESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           EDE  A ++N  FV++KVDREERPDVD VYM  VQA  G GGWP++VFL+P+ +P   GT
Sbjct: 62  EDEATAAVMNAGFVNVKVDREERPDVDAVYMEAVQAATGQGGWPMTVFLTPEGEPFYFGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLP--DE 279
           YFPPE ++G PGF+ +L  V+ AW ++R  + +     +  L E   A     +LP  +E
Sbjct: 122 YFPPEPRHGMPGFREVLEGVRVAWAERRGEVDEVAGKIVADLRERRLALGEP-RLPGAEE 180

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
             Q  L      L++ YD   GGFG APKFP  + ++ +L H  +   TG  G      +
Sbjct: 181 AAQALL-----GLTREYDPVNGGFGGAPKFPPSMVLEFLLRHYAR---TGAEG----ALQ 228

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           M   T   MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  VY+  +  T   
Sbjct: 229 MAADTAGRMARGGIYDQLGGGFARYSVDREWIVPHFEKMLYDNALLCRVYVHLWRATGSE 288

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 459
               +  +  +++ RD+  P G   SA DADSA+  G  R  EGA+YVWT  ++ ++LGE
Sbjct: 289 QARRVALETAEFMVRDLGTPQGGFASALDADSADASG--RMVEGAYYVWTPAQLVEVLGE 346

Query: 460 H-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
               +   H+ +   G             F+    ++ L     +    G         +
Sbjct: 347 EDGRIAAAHFGVTEEGT------------FEEGASVLRLPQEDGAVQDAGR--------I 386

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
              R +L++ R +RP P  DDKV+ +WNGL I++ A A                    +R
Sbjct: 387 ASIRERLYEARLRRPEPGRDDKVVAAWNGLAIAALAEAGACF----------------ER 430

Query: 579 KEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLISGLLDLYEFG 636
            + ++ A +AA   +R HL D    RL  + R+G  S   G L+DYA +  G L L    
Sbjct: 431 PDLVDAAVTAADLLVRLHLDDHA--RLTRTSRDGRASGNAGVLEDYADVAEGFLALASVT 488

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
               WL +A  L +   + F D E G  ++T  +   ++ R ++  D A PSG + +   
Sbjct: 489 GEGVWLDFAGLLLDGVLDRFTD-ESGALYDTASDAEQLIRRPQDPTDNATPSGWTAAAGA 547

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAV 722
              L    A + S+ +R  AE +L V
Sbjct: 548 ---LLGYAAQTGSEPHRTAAERALGV 570


>gi|23014746|ref|ZP_00054548.1| COG1331: Highly conserved protein containing a thioredoxin domain
           [Magnetospirillum magnetotacticum MS-1]
          Length = 671

 Score =  375 bits (964), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 247/656 (37%), Positives = 340/656 (51%), Gaps = 66/656 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLAAE SPYLLQHAHNPV W+AWG EA A A+  + PI LS+GYS CHWCHVM  ESFE
Sbjct: 4   NRLAAETSPYLLQHAHNPVHWWAWGPEALAAAKAANKPILLSVGYSACHWCHVMAHESFE 63

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DEG+A L+ND F++IKVDREERPD+D +Y   +  +   GGWPL++FL+PD +P  GGTY
Sbjct: 64  DEGIAGLMNDLFINIKVDREERPDLDALYQNALGLIGQHGGWPLTMFLTPDAEPFWGGTY 123

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP + +YGR  F  +L  +  ++ K  D +  +    + ++ E+L   A S   P  L  
Sbjct: 124 FPAQARYGRAAFPDVLEGISHSFHKDPDKIGHN----VARIRESLEQMARSPG-PLSLDM 178

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             + L A Q  +  D   GG   APKFP+P   +  L+HS       ++G +S  +  V 
Sbjct: 179 EVVDLGAAQCLRLIDFEDGGTVGAPKFPQPGLFR-FLWHSYL-----RTGNSSL-KDAVT 231

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  + +GGI+DH+GGGF RYS DE W VPHFEKMLYD  QL ++    +  T    Y 
Sbjct: 232 VTLDHICQGGIYDHLGGGFMRYSTDETWLVPHFEKMLYDNAQLVSLLTKVWKQTGSPLYR 291

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHA 461
               + + +L RDM+  GG   +A DADS   EG    +EG FY WTS+E+  +L  E A
Sbjct: 292 ARIFETVGWLLRDMMAEGGAFAAALDADS---EG----EEGLFYTWTSEELSALLDIETA 344

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
             F   Y ++  GN            ++G+N+L   N                 + L E 
Sbjct: 345 TRFGHLYGVQAHGN------------WEGRNIL-HRNHPRGGGDD---------HDLAEA 382

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           +  L   R KR  P  DDKV+  WN ++I++ A A+                   DR ++
Sbjct: 383 KMVLLAERDKRIWPGRDDKVLADWNAMMITALAEAALTF----------------DRPDW 426

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
           +  AE A   I   +      R  HS   G ++    LDDYA+ I   L LYE  +G ++
Sbjct: 427 LAAAEHAFQVITTRMV-RPDGRPAHSLCRGRAETNAVLDDYAWAIFAALTLYETTTGPEY 485

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           L  AI           D +GGGYF +  +   V++R K   D A PSGN V    L RL 
Sbjct: 486 LDQAIAWAEQVHAHHWDGQGGGYFLSADDATDVVIRTKPAFDSAVPSGNGVMAEVLARL- 544

Query: 702 SIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRK-HVVLVG 756
            +V G +   +R+ A+   AV +     M   +P M    D  ++ +    VV+VG
Sbjct: 545 WLVTGEER--WRERAQ---AVIDAFGAAMPEQIPHMTSLLDAFAILAEPLQVVIVG 595


>gi|307154410|ref|YP_003889794.1| hypothetical protein Cyan7822_4611 [Cyanothece sp. PCC 7822]
 gi|306984638|gb|ADN16519.1| protein of unknown function DUF255 [Cyanothece sp. PCC 7822]
          Length = 685

 Score =  375 bits (964), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 248/686 (36%), Positives = 352/686 (51%), Gaps = 90/686 (13%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   S YL +HA+NP+DW++W +EA   A+  + PIFLSIGYS+CHWC VME E+F 
Sbjct: 3   NRLAQVKSLYLRKHANNPIDWWSWCDEALNTAKAENRPIFLSIGYSSCHWCTVMEGEAFS 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGGT 221
           D  +A+ +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL++FL+P DL P  GGT
Sbjct: 63  DAAIAEYMNTHFLPIKVDREERPDLDSIYMQALQMMIGQGGWPLNIFLTPDDLVPFYGGT 122

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA-LSASASSNKLPDEL 280
           YFP E +Y RPGF  +L+ V+  +D ++D L       +E L  A +     +N + ++L
Sbjct: 123 YFPVEPRYNRPGFLQVLQSVRHFYDNEKDKLKSFKKEILEVLQSATVLPLGDANLVSNDL 182

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK---KLEDTGKSGEASEG 337
               +      ++ S +     FG  P FP      + L  S+   + ++ GK      G
Sbjct: 183 FYRGIETNTAVITNSAND----FGR-PSFPMIPYANLTLQGSRFEFQSQNDGKQAAIQRG 237

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
           + + L        GGI+DH+GGGFHRY+VD  W VPHFEKMLYD GQ+     + +S   
Sbjct: 238 EDLAL--------GGIYDHIGGGFHRYTVDSTWTVPHFEKMLYDNGQIVEYLANLWS--S 287

Query: 398 DVFYSYICRDI---LDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
           +V    + R I   + +L+R+M  P G  ++A+DADS  T      +EGAFYVW+  +++
Sbjct: 288 EVQKPSLARAIAGTVQWLKREMTAPEGYFYAAQDADSFTTPEDVEPEEGAFYVWSYSDIQ 347

Query: 455 DILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
            +L    +   K  + + P GN            F+GKNVL       AS  K     E 
Sbjct: 348 QLLSTDELEALKTAFTVTPEGN------------FEGKNVL-----QRASEGKFAEDFEA 390

Query: 514 YLNILGECR--------------RKLFDVRS----KRPRPHLDDKVIVSWNGLVISSFAR 555
            L+ L   R              R   + +S     R  P  D K+IV+WN L+IS  AR
Sbjct: 391 VLDKLFAVRYGASSSTLDRFPPARNNAEAKSGNWPGRIPPVTDTKMIVAWNSLMISGLAR 450

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSFRNGPSK 614
           A  + +          P+       Y E+A  A  FI  H + + + HRL +    G + 
Sbjct: 451 AYGVFRE---------PL-------YWELAVGATEFIFTHQWKNGRLHRLNYE---GETG 491

Query: 615 APGFLDDYAFLISGLLDLYEFGSG-TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPS 673
                +DYAFLI  LLDL       T+WL  AI +Q   D LF   E GGY+N + ++  
Sbjct: 492 VLAQSEDYAFLIKALLDLQTASPAETEWLNKAISVQQEFDNLFWSVEMGGYYNNSTDNSQ 551

Query: 674 VLLRVKEDH--DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMA 731
            L+ VKE    D A PS N V+V NL+RLA +    +   Y   AE +L  F + LK   
Sbjct: 552 DLI-VKERSYIDNATPSANGVAVTNLIRLARLTENLE---YLSQAEQTLQAFSSILKQSP 607

Query: 732 MAVPLMCCAADM----LSVPSRKHVV 753
            A P +  A D     +SV S+  ++
Sbjct: 608 QACPSLFTALDWYRYSISVRSKPDIL 633


>gi|440700552|ref|ZP_20882794.1| hypothetical protein STRTUCAR8_07071 [Streptomyces turgidiscabies
           Car8]
 gi|440276815|gb|ELP65027.1| hypothetical protein STRTUCAR8_07071 [Streptomyces turgidiscabies
           Car8]
          Length = 677

 Score =  375 bits (963), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 230/590 (38%), Positives = 315/590 (53%), Gaps = 55/590 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYLLQHA NPVDW+ W EEAFAEAR    P+ LS+GYS+CHWCHVM  ESFE
Sbjct: 3   NRLAHETSPYLLQHADNPVDWWPWSEEAFAEARSSGKPVLLSVGYSSCHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+  A  LN+ FVS+KVDREERPDVD VYM  VQA  G GGWP++VFL+PD +P   GTY
Sbjct: 63  DQATADYLNENFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMTVFLTPDAEPFYFGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPPE + G P F+ +L  V+ AW  +RD +A+     +  L+        + + P E  Q
Sbjct: 123 FPPEPRSGMPSFREVLEGVRSAWTDRRDEVAEVAQKIVRDLA-GREIGYGATEAPTEEDQ 181

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
               L    L++ YD++ GGFG APKFP  + ++ +L H  +   TG  G      +M  
Sbjct: 182 ARALLG---LTREYDAQRGGFGGAPKFPPSMVLEFLLRHGAR---TGSEG----ALQMAQ 231

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  VY   +  T      
Sbjct: 232 DTCERMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALLCRVYAHLWRATGSELAR 291

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHA 461
            +  +  D+L R++    G   SA DADS   +G  +  EGA+YVWT  ++ ++LG E A
Sbjct: 292 RVALETADFLVRELRTAEGGFASALDADS--DDGTGKHVEGAYYVWTPAQLTEVLGAEDA 349

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASASKLGMPLEKYLNILGE 520
            L  +++ +   G  +           +G +VL +  ++    A K+             
Sbjct: 350 ELAAQYFGVTADGTFE-----------EGASVLQLPQHEGVFDAEKVDY----------- 387

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            + +L   R +RP P  DDKV+ +WNGL I++ A            A F  P        
Sbjct: 388 VKARLLAARGERPAPGRDDKVVAAWNGLAIAALAET---------GAYFERP------DL 432

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAFLISGLLDLYEFGSGT 639
                 +A   +R HL D++ H L  + ++G   A  G L+DYA +  G L L       
Sbjct: 433 VDAALAAADLLVRVHL-DDRAH-LARTSKDGQVGANAGVLEDYADVAEGFLALASVTGEG 490

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 689
            WL +A  L +     F+D E G  F+T  +   ++ R ++  D A PSG
Sbjct: 491 VWLEFAGFLLDHVLVRFVDEESGALFDTASDAEQLIRRPQDPTDNAVPSG 540


>gi|359145694|ref|ZP_09179393.1| hypothetical protein StrS4_07994 [Streptomyces sp. S4]
          Length = 675

 Score =  375 bits (963), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 240/626 (38%), Positives = 331/626 (52%), Gaps = 62/626 (9%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRLA   SPYLLQHA NPVDW  WG EAF EAR+RDVP+ LS+GYS CHWCHVM  ESF
Sbjct: 2   ANRLAQSTSPYLLQHADNPVDWHPWGPEAFEEARRRDVPVLLSVGYSACHWCHVMAHESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           EDE  A ++N  FV++KVDREERPDVD VYM  VQA  G GGWP++VFL+P+ +P   GT
Sbjct: 62  EDEATAAVMNAGFVNVKVDREERPDVDAVYMEAVQAATGQGGWPMTVFLTPEGEPFYFGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLP--DE 279
           YFPPE ++G PGF+ +L  V+ AW ++R  + +     +  L E   A     +LP  +E
Sbjct: 122 YFPPEPRHGMPGFREVLEGVRVAWAERRGEVDEVAGKIVADLRERRLALGEP-RLPGAEE 180

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
             Q  L      L++ YD   GGFG APKFP  + ++ +L H  +   TG  G      +
Sbjct: 181 AAQALL-----GLTREYDPVNGGFGGAPKFPPSMVLEFLLRHYAR---TGAEG----ALQ 228

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           M   T   MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  VY+  +  T   
Sbjct: 229 MAADTAGRMARGGIYDQLGGGFARYSVDREWIVPHFEKMLYDNALLCRVYVHLWRATGSE 288

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 459
               +  +  +++ RD+  P G   SA DADSA+  G  R  EGA+YVWT  ++ ++LGE
Sbjct: 289 QARRVALETAEFMVRDLGTPQGGFASALDADSADASG--RMVEGAYYVWTPAQLVEVLGE 346

Query: 460 H-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
               +   H+ +   G             F+    ++ L     +    G         +
Sbjct: 347 EDGRVAAAHFGVTEEGT------------FEEGASVLRLPQEDGAVQDAGR--------I 386

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
              R +L++ R +RP P  DDKV+ +WNGL I++ A A                    +R
Sbjct: 387 ASIRERLYEARLRRPEPGRDDKVVAAWNGLAIAALAEAGACF----------------ER 430

Query: 579 KEYMEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLISGLLDLYEFG 636
            + ++ A +AA   +R HL D    RL  + R+G  S   G L+DYA +  G L L    
Sbjct: 431 PDLVDAAVTAADLLVRLHLDDHA--RLTRTSRDGRASGNAGVLEDYADVAEGFLALASVT 488

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
               WL +A  L +   + F D E G  ++T  +   ++ R ++  D A PSG + +   
Sbjct: 489 GEGVWLDFAGLLLDGVLDRFTD-ESGALYDTASDAEQLIRRPQDPTDNATPSGWTAAAGA 547

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAV 722
              L    A + S+ +R  AE +L V
Sbjct: 548 ---LLGYAAQTGSEPHRTAAERALGV 570


>gi|300789899|ref|YP_003770190.1| hypothetical protein AMED_8085 [Amycolatopsis mediterranei U32]
 gi|384153415|ref|YP_005536231.1| hypothetical protein RAM_41535 [Amycolatopsis mediterranei S699]
 gi|399541779|ref|YP_006554441.1| hypothetical protein AMES_7963 [Amycolatopsis mediterranei S699]
 gi|299799413|gb|ADJ49788.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
 gi|340531569|gb|AEK46774.1| hypothetical protein RAM_41535 [Amycolatopsis mediterranei S699]
 gi|398322549|gb|AFO81496.1| hypothetical protein AMES_7963 [Amycolatopsis mediterranei S699]
          Length = 879

 Score =  375 bits (963), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 229/623 (36%), Positives = 326/623 (52%), Gaps = 73/623 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA+  SPYLLQHA NPVDW+ WG EA AEA++R+VPI LS+GY+ CHWCHVM  ESFE
Sbjct: 226 NRLASATSPYLLQHADNPVDWWPWGPEALAEAKRRNVPILLSVGYAACHWCHVMAHESFE 285

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D G A L+N  FV+IKVDREERPD+D VYM   QA+ G GGWP++ FL+PD +P   GTY
Sbjct: 286 DAGTAALMNANFVTIKVDREERPDIDAVYMAATQAMTGQGGWPMTCFLTPDGEPFHCGTY 345

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           +PP  + G P F+ +L  V  +W ++ D L       +  L+E       +  L + +  
Sbjct: 346 YPPSPRPGMPSFRQLLVAVVQSWQERPDELVDGAKQIVAHLAE------QTGPLKESVVD 399

Query: 283 NALRLCA-EQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
            A+   A  +L +  D   GGFG APKFP  + ++ +L H ++   TG +   S    +V
Sbjct: 400 EAVLAGAVGKLQQEADRVNGGFGRAPKFPPSMVLEFLLRHHER---TGSAVALS----LV 452

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             T + MA+GG++D + GGF RYSVD  W VPHFEKMLYD   L   Y   +  T     
Sbjct: 453 DSTAEAMARGGLYDQLAGGFARYSVDAEWIVPHFEKMLYDNALLLRFYAHLWRRTGSATA 512

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHA 461
             +     ++L   +  P G   S+ DAD+   EG T       YVWT  ++ +++G+ +
Sbjct: 513 LRVATGTAEFLFESLRTPEGGFASSLDADTEGVEGLT-------YVWTPAQLREVVGDDS 565

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
               E + +   G  +           +G + L    D       L  P+          
Sbjct: 566 A--AELFGVTKEGTFE-----------EGASTLRLFGD-------LPEPM---------- 595

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           R KL + R+KRP+P  DDKVI SWNGL I++ A A   L                DR ++
Sbjct: 596 RVKLLEARAKRPQPGRDDKVIASWNGLAITALAEAGVAL----------------DRPQW 639

Query: 582 MEVAESAASFIRR-HLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLISGLLDLYEFGSGT 639
           +E A  AA  + R H+ D    RL+ S R+G   ++ G L+DYA +  G L L++     
Sbjct: 640 IEWAREAAELLLRVHVVD---GRLRRSSRDGVVGESAGVLEDYACVADGFLALHQATGAA 696

Query: 640 KWLVWAIELQNTQDELFLDRE-GGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
           KWL  A  L +     F   +  G YF+T  +  +++ R  +  D A PSG S     L+
Sbjct: 697 KWLTEATRLLDLALAHFASPDVPGAYFDTADDAETLVQRPADPGDNASPSGASALAGALL 756

Query: 699 RLASIVAGSKSDYYRQNAEHSLA 721
             +++   + S  YR+ AE +L+
Sbjct: 757 TASALAGHADSGRYREAAERALS 779


>gi|292493652|ref|YP_003529091.1| hypothetical protein Nhal_3684 [Nitrosococcus halophilus Nc4]
 gi|291582247|gb|ADE16704.1| protein of unknown function DUF255 [Nitrosococcus halophilus Nc4]
          Length = 694

 Score =  375 bits (963), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 241/625 (38%), Positives = 342/625 (54%), Gaps = 55/625 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L  E SPYLLQHA NPVDW+ W EEA A A + + PI LSIGYS CHWCHVM  ESFE
Sbjct: 8   NHLQGETSPYLLQHADNPVDWYPWSEEALARAHRENKPIVLSIGYSACHWCHVMAHESFE 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYG-GGGWPLSVFLSPDLK-PLMGG 220
              +A  +N+ F++IKVDREERPD+D++Y    Q L G  GGWPL++FL P+ + P  GG
Sbjct: 68  SPEIAAAMNEHFINIKVDREERPDLDQIYQLAQQMLTGRPGGWPLTMFLEPENQVPFFGG 127

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFPPE ++G PGFK +L ++ + +   R+ +    +  +    E  + +++    P+ L
Sbjct: 128 TYFPPEGRHGLPGFKDLLERIAEFFHAHREEIQSQNSRLLAAFEELDTRTSAVE--PEML 185

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE-GQK 339
               L+   +QL++S+D R+GGF  APKFP P  I+  L   + +     S EA +    
Sbjct: 186 GPAPLKAAQQQLAQSFDPRYGGFKGAPKFPNPSSIERCL---RDVRGEHLSAEARQKALD 242

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           +   TL+ MA+GGI+D +GGGF RY+VD +W +PHFEKMLYD GQL  +Y DA+ L    
Sbjct: 243 LARLTLEQMAQGGIYDQLGGGFCRYAVDSQWRIPHFEKMLYDNGQLLALYADAYEL---- 298

Query: 400 FYSYICRDILD----YLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           + S  CR +L+    +  R+M  P G  +S+ DADS   EG    +EG FYVWT ++V+ 
Sbjct: 299 WGSERCRRVLEETGHWAIREMQSPEGGYYSSLDADS---EG----REGKFYVWTREQVQA 351

Query: 456 ILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
           +L E        Y+           +  P N F+G   L       A A +L +      
Sbjct: 352 LLEEDEYPLVARYF----------GLDQPAN-FEGHWHLYGAITPEALAQELNLSPRILE 400

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
             L   ++KLF  R +R RP  DDK++ SWNGL+I   A A + L   A           
Sbjct: 401 ETLATAKQKLFAAREERIRPGRDDKILTSWNGLMIKGMAAAGQALAEPA----------- 449

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
                ++  AE A  F+R HL+ E   RL  S+++G  + PG+LDDYAFL+  LL L + 
Sbjct: 450 -----FIASAERALDFVRGHLWREG--RLLVSYKDGRVQHPGYLDDYAFLLDALLALLQA 502

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
                 L +A+EL       F D   GG++ T  +  +++ R     D A P+GN V   
Sbjct: 503 RWREGDLAFAVELAEAALAHFEDPAQGGFYFTADDHETLIHRPVPLMDNATPAGNGVLAW 562

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSL 720
           +L RL  ++   +   Y + AE +L
Sbjct: 563 SLQRLGHLLGEMR---YLKAAERTL 584


>gi|402494465|ref|ZP_10841206.1| thioredoxin domain-containing protein [Aquimarina agarilytica ZC1]
          Length = 706

 Score =  375 bits (963), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 216/637 (33%), Positives = 340/637 (53%), Gaps = 49/637 (7%)

Query: 87  AERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIG 146
            ++ P   +H   + TN L  E SPYLLQHAHNPV+W AW  E   EA+++   + +S+G
Sbjct: 20  TQKDPIMETH---EFTNDLIHETSPYLLQHAHNPVNWKAWHPETLKEAKEKKKLMLISVG 76

Query: 147 YSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPL 206
           Y+ CHWCHVME ESFED  VA ++N  +++IK+DREERPD+D+VYM+ VQ + G GGWPL
Sbjct: 77  YAACHWCHVMEHESFEDSTVAAVMNKNYINIKIDREERPDIDQVYMSAVQLMTGRGGWPL 136

Query: 207 SVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEA 266
           +V   PD +P+ GGTY+P  +  G       L++++  ++     L +      E +   
Sbjct: 137 NVIALPDGRPVWGGTYYPKAEWMGA------LQQIQKIYEDDPSKLEEYATKLTEGIQSV 190

Query: 267 LSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE 326
              + + N L  E   + +    E  +K +D + GG   APKF  P     +L ++ +  
Sbjct: 191 SLVTPNPNALKFE--NSTIESAVETWAKKFDYKKGGLDYAPKFMMPNNYHFLLRYAHQTN 248

Query: 327 DTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLA 386
           +        + +  V+ TL  ++ GG++DHVGGGF RY+ DE+WHVPHFEKMLYD  QL 
Sbjct: 249 N-------EKLKDYVITTLNQISYGGVYDHVGGGFARYATDEKWHVPHFEKMLYDNAQLV 301

Query: 387 NVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFY 446
           ++Y DA+ LTK+ +Y  +  + LD+++R++    G  +S+ DADS    G  + +EGAFY
Sbjct: 302 SLYSDAYLLTKNEWYKQVVYETLDFVQRELTNAEGVFYSSLDADSVTHSG--KLEEGAFY 359

Query: 447 VWTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 505
           VW    +E  LG E   LF ++Y +   G  +       HN +    VLI     +    
Sbjct: 360 VWQKPALETALGVEDFKLFADYYNVNAYGIWE-------HNNY----VLIRNESDADFIE 408

Query: 506 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 565
           K  +    +L    + +++L  +RSKR RP LDDK + SWN L++  +A A  +      
Sbjct: 409 KHKLDKGDFLQKQKKWKQRLLSIRSKRERPRLDDKTLTSWNALMLKGYADAYSVF----- 463

Query: 566 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 625
                      +   +++VA + A+FI+         +L H+++ G S   G+L+DYA  
Sbjct: 464 -----------NDANFLKVALTNAAFIKNKQM-ASNGQLMHNYKEGKSTINGYLEDYAAT 511

Query: 626 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 685
           I   + LY+     +WL  +  + +   + F D   G +F T+ ED +++ R  E  D  
Sbjct: 512 IDAFIALYQVTFDQQWLDLSKTMTDYVFDHFYDDASGLFFFTSDEDAALVTRNIESSDNV 571

Query: 686 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 722
            P+ NS+   NL +L+   +  K   + Q   H++ V
Sbjct: 572 IPASNSMMAKNLYKLSHYFSNKKYLEHSQKMLHNIQV 608


>gi|284033485|ref|YP_003383416.1| hypothetical protein Kfla_5611 [Kribbella flavida DSM 17836]
 gi|283812778|gb|ADB34617.1| protein of unknown function DUF255 [Kribbella flavida DSM 17836]
          Length = 670

 Score =  375 bits (962), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 227/610 (37%), Positives = 318/610 (52%), Gaps = 71/610 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L+   SPYL QHA NPV W  WGE AFAEAR+RDVP+FLS+GYS CHWCHVM  ESFE
Sbjct: 4   NELSTSTSPYLRQHADNPVAWKQWGEAAFAEARERDVPVFLSVGYSACHWCHVMAHESFE 63

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+  A  LN+ FV +KVDREERPDVD +YM    A+ G GGWP+SVFL+P  +P   GTY
Sbjct: 64  DDATAAYLNEHFVCVKVDREERPDVDAIYMEATVAMTGHGGWPMSVFLTPAGEPFFCGTY 123

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP + ++G   F+ +L  + DAW  KR+ +   GA  ++QL       A    + + +  
Sbjct: 124 FPLDPRHGMASFRQVLESLVDAWRTKREQIDGIGASVVQQL------GARQPAVGEAVDA 177

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L      L   +D   GGFG APKFP  + +  +L H ++   TG    + E   MV 
Sbjct: 178 AVLDRAVALLQGDFDPVDGGFGQAPKFPPSMVLDFLLRHHRR---TG----SEEALAMVT 230

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            T + MA+GG++D + GGF RYSVD++W VPHFEKMLYD   L +VY   +++T      
Sbjct: 231 HTCERMARGGMYDQLAGGFARYSVDKQWIVPHFEKMLYDNALLLDVYTHWWTVTGSPLAE 290

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            +  +  D+L  ++  P G   SA DAD   TEG    +EG +YVW+  E+ ++LGE A 
Sbjct: 291 RVALETADFLLAELRTPEGGFASALDAD---TEG----EEGRYYVWSPTELRELLGEDAD 343

Query: 463 LFKEHYYLKPT---GNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
              E   +  T   G   L   SDP +                        L+++  I  
Sbjct: 344 WVIELCDVTGTFEHGTSVLQLRSDPDD------------------------LDRWNRI-- 377

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
             R  L D R++R  P  DDKV+ +WNGL I++  RA  +L                DR 
Sbjct: 378 --RSVLRDARARRTYPGRDDKVVAAWNGLAITALTRAGLVL----------------DRP 419

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP-SKAPGFLDDYAFLISGLLDLYEFGSG 638
           EY+E A  AA  + R ++ + + RL  + R+G    A G L+DYA      L L      
Sbjct: 420 EYVEAAVKAAELV-RDVHVDGSGRLHRTSRDGAVGTAHGVLEDYAAYAQACLTLLAATRD 478

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
             WL  A  L +   + F+    G +F+T  +  ++  R ++  D A P+G S++     
Sbjct: 479 DSWLTLAQRLLDRVLQQFV--ADGTFFDTAADAETLAWRPQDATDNASPAGVSLAAEAFS 536

Query: 699 RLASIVAGSK 708
            LAS+   ++
Sbjct: 537 TLASVTGEAR 546


>gi|428772641|ref|YP_007164429.1| hypothetical protein Cyast_0808 [Cyanobacterium stanieri PCC 7202]
 gi|428686920|gb|AFZ46780.1| protein of unknown function DUF255 [Cyanobacterium stanieri PCC
           7202]
          Length = 686

 Score =  375 bits (962), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 238/658 (36%), Positives = 348/658 (52%), Gaps = 72/658 (10%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TN L    S YL +HAHNP++W+ WGEEA  +A++   PIFLSIGYS+CHWC VME E+F
Sbjct: 2   TNNLVNAQSLYLRKHAHNPINWYPWGEEALNKAKQEQKPIFLSIGYSSCHWCTVMEGEAF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGG 220
            D  +A  LN  F++IKVDREERPD+D +YM  +Q + G GGWPL++FL+P DL P  GG
Sbjct: 62  SDGAIADYLNQNFIAIKVDREERPDIDSIYMQGLQMMTGQGGWPLNIFLTPHDLVPFYGG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASAS-SNKLPDE 279
           TYFP E +YGRPGF  IL  + + + ++ D L       +  L   ++ + S  N L  +
Sbjct: 122 TYFPLEPRYGRPGFLQILESIHNFYHQQTDKLNALKEEIVSILENNINLNPSIENHLNTK 181

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLE--DTGKSGEASEG 337
           L    L   ++ L +   + +GG    P+FP      MM Y +  L    T     A + 
Sbjct: 182 LLIQGLEKNSQILGR---NEYGG----PRFP------MMPYSNTTLTAIHTLPPETAQKA 228

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
            ++ +     +  GGI+DHVGGGFHRY+VD  W VPHFEKMLYD G +     + +S  K
Sbjct: 229 HQLGIQRGIDLVNGGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNGLIMEFLANLWSSGK 288

Query: 398 -DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
            +  Y   C   L +L R+M+ P G  +SA+DAD+         +EG FYVW   +++ I
Sbjct: 289 ENPQYHIACEGTLQWLEREMVAPEGYFYSAQDADNFGNIQDEEPEEGEFYVWHYLDLQQI 348

Query: 457 LG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
           L  E  I  +E + +   GN            F+GKNVL +  D  A    +   L+K  
Sbjct: 349 LSHEELIALQEVFTISNEGN------------FEGKNVLQKHPD-KAITPMVKNALDKLF 395

Query: 516 NI-LGECRRKLFDVRSKRPR-------------PHLDDKVIVSWNGLVISSFARASKILK 561
            +  G+   +L      R               P  D K+IV+WN L+IS  ARA  + K
Sbjct: 396 TMRYGQTPERLTTFPPARNNHEAKSLEWLGRIPPVTDTKMIVAWNSLMISGLARAYGVFK 455

Query: 562 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQ-THRLQHSFRNGPSKAPGFLD 620
           +E                +Y+E+AESA  FI ++ ++ Q  +RL +  +          +
Sbjct: 456 NE----------------KYLELAESAVKFILKNQWENQRLYRLNYGNK---VSVLAQSE 496

Query: 621 DYAFLISGLLDLYE--FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPS-VLLR 677
           DYAFL+  LLDL +    +G  WL  AI++Q   D+   D++ GGY+N   ++ S +L++
Sbjct: 497 DYAFLVKALLDLQQNSLNAGNYWLEKAIKVQQEFDDYCYDQKNGGYYNNAYDNSSDLLIK 556

Query: 678 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 735
            K   D A PS N V+V NL+RL  +      DY+ + AE +L +F  ++ +  ++ P
Sbjct: 557 EKGYIDNATPSPNGVAVANLLRLGLMT--DNLDYFEK-AEQTLKIFADKMVNSPVSCP 611


>gi|383785408|ref|YP_005469978.1| hypothetical protein LFE_2175 [Leptospirillum ferrooxidans C2-3]
 gi|383084321|dbj|BAM07848.1| hypothetical protein LFE_2175 [Leptospirillum ferrooxidans C2-3]
          Length = 694

 Score =  375 bits (962), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 244/670 (36%), Positives = 353/670 (52%), Gaps = 62/670 (9%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           +N L+ E SPYLLQHA NPV+W+ WG EA + A + + PI LSIGYS CHWCHVM  ESF
Sbjct: 2   SNLLSRETSPYLLQHAENPVNWYPWGPEALSLAHETNRPILLSIGYSACHWCHVMAHESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVY-MTYVQALYGGGGWPLSVFLSPDLKPLMGG 220
           ED   A ++N+ F++IKVDREERPD+D +Y M +       GGWPL++FL+PD  P  GG
Sbjct: 62  EDPETASVMNESFINIKVDREERPDLDHIYQMAHTVITKRNGGWPLTMFLTPDQVPFAGG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFP   ++G PGF ++L +++  +D+ ++ L+ +     E LS + +    +N  P  L
Sbjct: 122 TYFPKSPRFGLPGFISVLHQIRQFYDENKEALSGTKHPVTELLSRSDALGEGANPDPSSL 181

Query: 281 ---PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEG 337
              P+  LR   + L   +DS  GGF  APKFP P++I      +  L +  + GE  + 
Sbjct: 182 TIEPEARLR---DSLRARFDSEDGGFTPAPKFPHPMDI------AACLREYEREGEVFD- 231

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
             M   TL+ MA GGI+D +GGGF RYSVD  W +PHFEKMLYD   L  VY +   L++
Sbjct: 232 LWMARHTLERMASGGIYDQIGGGFSRYSVDGTWTIPHFEKMLYDNALLLCVYAEGAHLSE 291

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           D   + +C  I+ +L R+M    G   +A DADS   EG    +EG +YVWT +EV  IL
Sbjct: 292 DAGLASVCDGIVTWLFREMRDSSGAFHAALDADS---EG----EEGKYYVWTREEVSRIL 344

Query: 458 G-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFK--GKNVLIELNDSSASASKLGMPLEKY 514
             E   +    Y L  T N +        +EF    KN+       S  AS+L +    +
Sbjct: 345 TPEEYQVVSLTYGLSETPNFE--------HEFWHFRKNLPF-----SEVASRLSLTEGPF 391

Query: 515 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 574
            ++L   + KL  VRS+R  P  DDKV+  WNGL+     RA +IL              
Sbjct: 392 HSLLSSAKEKLLSVRSQRIPPGKDDKVLTGWNGLLARGLIRAGRIL-------------- 437

Query: 575 GSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE 634
             DR E++   +     +R  L+      L      G S+   +LDDYA+++  L++   
Sbjct: 438 --DRPEWIMEGQKILDILRETLW--TGDHLLAVRTKGESRLNAYLDDYAYVLDALVESLA 493

Query: 635 FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
                  L WA+ L +     F D   GG+  T+ +   ++ R K  HD A PSG++V+ 
Sbjct: 494 TVYRPSDLAWALSLADVLVSKFWDDAAGGFHFTSHDHEQLIHRPKSGHDAAIPSGSAVTC 553

Query: 695 INLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA-ADMLSVPSRKHVV 753
             L RLA +    + D+  +    +LA++   + +  M    M  A  + LS P    +V
Sbjct: 554 RALNRLAHL--SGRMDWL-EKVGRTLALYSKPMLEQPMGYASMIMALGEYLSPPV---IV 607

Query: 754 LVGHKSSVDF 763
           LV  KSS+++
Sbjct: 608 LVRGKSSLEW 617


>gi|158426331|ref|YP_001527623.1| highly protein [Azorhizobium caulinodans ORS 571]
 gi|158333220|dbj|BAF90705.1| highly conserved protein [Azorhizobium caulinodans ORS 571]
          Length = 657

 Score =  374 bits (961), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 244/662 (36%), Positives = 347/662 (52%), Gaps = 65/662 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL AE SPYLLQH  NPV W+ WG EA AEA++   P+ LS+GY+ CHWCHVM  ESFE
Sbjct: 4   NRLGAETSPYLLQHKDNPVHWWPWGPEALAEAKRSGRPVLLSVGYAACHWCHVMAHESFE 63

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D   A L+N  FV+IKVDREERPDVD++YM  +  L   GGWPL++FL+ D  P  GGTY
Sbjct: 64  DAETADLMNALFVNIKVDREERPDVDQIYMNALHELGEQGGWPLTMFLNADGAPFWGGTY 123

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS-ASASSNKLPDELP 281
           FP    YGRPGFK +L +V  A+ +  + +A +    + +L+ A   A   +  L D   
Sbjct: 124 FPKTASYGRPGFKDVLWQVSQAYRETPEKVAHNTDAILSRLAAAAKPAGGVALTLAD--- 180

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
              L   A+Q++  +D   GG   APKFP+   ++++     +  D        + + +V
Sbjct: 181 ---LDKAAQQIAGLFDRAHGGLRGAPKFPQAGLLELLWRAGDRTGD-------PQLKAVV 230

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
            FTL  M +GGI+DHVGGGF RYSVDERW VPHFEKMLYD  QL  +   A+  T D  +
Sbjct: 231 AFTLNRMCEGGIYDHVGGGFSRYSVDERWLVPHFEKMLYDNAQLLELLALAYQETGDELF 290

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EH 460
               R+ + +L+R+M+   G   ++ DADS   EG     EG FYVWT+ E+  +LG E 
Sbjct: 291 LLRARETVSWLKREMVTADGAFAASLDADS---EG----HEGKFYVWTADEIVAVLGKED 343

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
           A  F   Y +   GN            ++G+ +L     +  S   + M  E  L  + E
Sbjct: 344 AAEFAAFYDVTDEGN------------WEGQTIL-----NRTSFGDVSMVEEARLRPMKE 386

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
              KL   R++R RP LDDKV+  WNGL+I++ ARA  +                 D  E
Sbjct: 387 ---KLLAARAQRVRPGLDDKVLADWNGLMIAALARAGAL----------------LDEPE 427

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           ++++A +A   + R +   +  RL HS+R G    PG   D A +    + L+E      
Sbjct: 428 WVDLAATAFDAVVRLMV--KDGRLGHSYREGRLVLPGLASDLAAMARAGIALHEAAGDEA 485

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
            L  A +  N  +  +LD + G YF T  + P++++R     D A P+ NSV+   L+RL
Sbjct: 486 PLAHAEDFLNRLEADYLDPQSGAYFLTAADAPALVMRPLSSLDEALPNYNSVAADALIRL 545

Query: 701 ASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSS 760
           A++   +  D  R  A+  +           +A P +  A D  +      +V VG +S 
Sbjct: 546 AAL---TGQDGLRARADRLIGALTGAAAQNPLAHPSLLNALD--TRLRLAEIVAVGARSV 600

Query: 761 VD 762
            D
Sbjct: 601 RD 602


>gi|154245776|ref|YP_001416734.1| hypothetical protein Xaut_1832 [Xanthobacter autotrophicus Py2]
 gi|154159861|gb|ABS67077.1| protein of unknown function DUF255 [Xanthobacter autotrophicus Py2]
          Length = 669

 Score =  374 bits (960), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 235/619 (37%), Positives = 325/619 (52%), Gaps = 61/619 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL+ E SPYLLQH  NPV W+AWG EAFAEA+    PI LS+GY+ CHWCHVM  ESFE
Sbjct: 4   NRLSRETSPYLLQHKDNPVHWWAWGPEAFAEAQATGKPILLSVGYAACHWCHVMAHESFE 63

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +  VA L+N  FV+IKVDREERPDVD++YM+ +Q L   GGWPL++FL P+ KP  GGTY
Sbjct: 64  NADVAGLMNALFVNIKVDREERPDVDQIYMSALQQLGQSGGWPLTMFLDPEGKPFWGGTY 123

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPP   YGRPGF  +L++V   + + +D + ++ A  + +L +A +  A +    ++L  
Sbjct: 124 FPPAASYGRPGFTDVLQQVSTVFTQNKDKVEKNTATILARLKKAATPVAGAAIGREDLND 183

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
            A RL A      +D   GG   APKFP+   ++ +     + +D          + +V 
Sbjct: 184 AAARLPA-----MFDPVHGGLKGAPKFPQSGLLEFLWRVGTRRKDDAL-------KAIVA 231

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  M +GGI+DH+GGGF RYSVDE W VPHFEKMLYD   L  +   A+S T D  + 
Sbjct: 232 LTLNRMCEGGIYDHLGGGFARYSVDEIWFVPHFEKMLYDNALLLELLALAYSDTGDALFL 291

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHA 461
              R+ + +L+R+M+ P G   ++ DAD   TEG     EG FYVW+  E+  +LG E A
Sbjct: 292 TRARETVGWLKREMLTPEGAFAASLDAD---TEG----HEGRFYVWSEAEITAVLGAEDA 344

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
             F   Y +   GN ++             N+L        SA             L   
Sbjct: 345 AFFNRLYDVSRAGNWEVG------------NILNRTEAGVVSAEDEAR--------LAPL 384

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           R KL   R KR RP  DDKV+  WNGL+I++ ARA   L                   E+
Sbjct: 385 REKLLLAREKRVRPGRDDKVLADWNGLMIAALARAGGFLG----------------EAEW 428

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
           + +A+ A   +  H+  E   RL HS+       PG   D A +    + L+E     + 
Sbjct: 429 VALAQRAFDAVVSHMVVEG--RLAHSWCGTKIVLPGLASDLAAMARAGIALHEATGAPEP 486

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           L  A       +    D E G YF T  +  S++LR    HD A P+ N+V+   L+RLA
Sbjct: 487 LAQAAHFLEVLETHHRDPETGAYFLTAYDGDSLILRPLATHDEAVPNANAVAADALIRLA 546

Query: 702 SIVAGSKSDYYRQNAEHSL 720
           ++   + +D +R  A+  L
Sbjct: 547 AL---TGNDAFRTRADRVL 562


>gi|421076735|ref|ZP_15537717.1| hypothetical protein JBW_0882 [Pelosinus fermentans JBW45]
 gi|392525347|gb|EIW48491.1| hypothetical protein JBW_0882 [Pelosinus fermentans JBW45]
          Length = 628

 Score =  374 bits (960), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 231/613 (37%), Positives = 326/613 (53%), Gaps = 51/613 (8%)

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           ME E FED+ VA LLN  F++IKVDREERPDVD +YM+  QAL G GGWPL++ ++PD K
Sbjct: 1   MERECFEDQEVADLLNQHFIAIKVDREERPDVDGIYMSVCQALTGQGGWPLTIIMAPDKK 60

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           P   GTYFP   K GR G   +L  +   W+K R  + ++G   +  L      S     
Sbjct: 61  PFFAGTYFPKHRKMGRMGLLELLTTLHQHWEKNRSEILKAGNEIVNILQRPKPPSGEGQI 120

Query: 276 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
             D L Q  L     +L  SYD ++GGFGSAPKFP P +I  +L + +  ++        
Sbjct: 121 GEDLLKQAYL-----ELENSYDPQYGGFGSAPKFPTPHKITFLLRYWQHFKE-------P 168

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
           +   MV  TL  M +GGI+DH+G GF RYS D++W VPHFEKMLYD   L   YL+A+  
Sbjct: 169 KALAMVEKTLMSMWQGGIYDHLGYGFARYSTDQKWLVPHFEKMLYDNALLCTSYLEAYQC 228

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           T +  ++ I  DIL Y+ RDM+   G  +SAEDADS   EG     EG FYV+T K+V +
Sbjct: 229 TGNQEFARIAEDILTYVMRDMMDKNGGFYSAEDADS---EGV----EGKFYVFTRKQVVE 281

Query: 456 ILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 514
           ILG E   LF + Y++   GN +    S  H    G+N+          A  +   +E  
Sbjct: 282 ILGEEEGALFADFYHISSHGNFEHG-TSILH--LIGRNL-------EEYARVVNKTVENL 331

Query: 515 LNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVV 574
             +L + R KL+ VR  R  P+ DDK++ +WNGL+I++FA+A+++LK             
Sbjct: 332 SEVLKKGREKLYQVREARIHPYKDDKILTAWNGLMIAAFAKAARVLK------------- 378

Query: 575 GSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYE 634
              + +Y +VAE   +FI   L      RL   +R G +    +LDDYAFL+  L+++YE
Sbjct: 379 ---QSKYAKVAEQGIAFIYEKLMGSNG-RLLARYREGEAAHLAYLDDYAFLLMALIEVYE 434

Query: 635 FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
                 +L  A  L     ELF DR  GG++    +   ++ R KE +DGA PSGNSV+ 
Sbjct: 435 TTCNDYYLQQAAILAKDMGELFGDRTEGGFYFYGNDGEELIARPKEIYDGAIPSGNSVAA 494

Query: 695 INLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVL 754
             L +LA +   ++   +   AE  L  F   +   A        A D     + K +V+
Sbjct: 495 FALQKLADM---TEDRSFSDTAERLLGHFAGEVSRYAAGYTYFMMAVDYYLADNTK-IVI 550

Query: 755 VGHKSSVDFENML 767
           VG K + D ++M 
Sbjct: 551 VGDKEAADTKSMF 563


>gi|334338370|ref|YP_004543522.1| hypothetical protein Isova_2944 [Isoptericola variabilis 225]
 gi|334108738|gb|AEG45628.1| protein of unknown function DUF255 [Isoptericola variabilis 225]
          Length = 658

 Score =  374 bits (959), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 233/609 (38%), Positives = 321/609 (52%), Gaps = 72/609 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   SPYLLQHA NPVDW+ WG +AFAEAR+RDVP+ LS+GY+ CHWCHVM  ESFE
Sbjct: 3   NRLAHATSPYLLQHADNPVDWWEWGADAFAEARRRDVPVLLSVGYAACHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+ VA  L D FV+IKVDREERPDVD VYM    AL G GGWP++ FL+PD +P   GTY
Sbjct: 63  DDDVAAALADRFVAIKVDREERPDVDAVYMGATTALTGQGGWPMTCFLTPDGEPFFAGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           +P      R  F  +L  V +AW ++RD + + GA     L+EA+ A  S+   PD L +
Sbjct: 123 YP------REHFLQVLDAVWEAWTERRDAVERQGA----ALTEAI-ARTSARLTPDVLDE 171

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
            AL      +++  D   GGFG APKFP  + ++ +L H  +  D           ++V 
Sbjct: 172 AALERSVRLVARDADPEHGGFGGAPKFPPSMTLEHLLRHHARTGD-------PSALELVE 224

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            T + MA+GGI+D + GGF RY+VD  W VPHFEKMLYD  QL  VYL  +  T      
Sbjct: 225 RTCEAMARGGIYDQLAGGFARYAVDAAWVVPHFEKMLYDNAQLLRVYLHWYRATGSPLAE 284

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            + R+  ++LR D+  P G   SA DAD+   EG T       YVWT++++ D+LG    
Sbjct: 285 RVVRETAEFLRADLRTPEGGFASALDADTDGVEGLT-------YVWTAEQLADVLG---- 333

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDS-SASASKLGMPLEKYLNILGE 520
                                P +  +   VL + L  +     S L +  +        
Sbjct: 334 ---------------------PADGARAAEVLSVTLEGTFEHGTSTLQLREDPDPEWWTG 372

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R +L + R+ RP+P  DDKV+ +WNGL I++ A A ++L           P    D ++
Sbjct: 373 VRARLAEARAGRPQPARDDKVVTAWNGLAIAALAEAGELL---------GVPGYVDDARD 423

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLISGLLDLYEFGSGT 639
             ++       +R H+ D    RL+ + R G    APG   D+  L  GLL L++    T
Sbjct: 424 CADL------LLRLHVVD---GRLRRASRGGVVGTAPGVAADHGDLAEGLLALHQATGET 474

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
           +WL  A EL     E F D   GG+++   +   ++ R K+  DG EPSG S     L  
Sbjct: 475 RWLDAAGELLEVALERFGD-GAGGFYDVADDAERLVSRPKDPTDGPEPSGQSSLAGALAT 533

Query: 700 LASIVAGSK 708
            A++   S+
Sbjct: 534 YAALTGSSR 542


>gi|289209063|ref|YP_003461129.1| hypothetical protein TK90_1902 [Thioalkalivibrio sp. K90mix]
 gi|288944694|gb|ADC72393.1| protein of unknown function DUF255 [Thioalkalivibrio sp. K90mix]
          Length = 677

 Score =  374 bits (959), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 243/659 (36%), Positives = 347/659 (52%), Gaps = 60/659 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   SPYLLQHA NPVDW+ WGE+A   AR+ D PI LSIGYS CHWCHVM  ESFE
Sbjct: 2   NRLAGASSPYLLQHADNPVDWYPWGEDALERARREDKPILLSIGYSACHWCHVMAHESFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQAL-YGGGGWPLSVFLSPDLKPLMGGT 221
           D   A+++N  F++IKVDREERPD+D++Y      L    GGWPL+VFL+PD  P   GT
Sbjct: 62  DPATAEVMNRRFINIKVDREERPDLDRIYQNAHMLLSQRPGGWPLTVFLTPDQVPFFAGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSA--SASSNKLPDE 279
           YFP   ++G P F  ++ +V D   +  D + +      E L +AL+     +   +P  
Sbjct: 122 YFPSTPRHGLPSFVDLMNRVADFLAEHPDEIQRQN----ESLQQALARIYRPAGGAIP-- 175

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
                L     +L++++D +FGGFG APKFP P  ++ + +H+ +  D       +E ++
Sbjct: 176 -AIGVLDKARAELAQTFDDQFGGFGDAPKFPHPASLEWLAWHAARHND-------AEAER 227

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           M+  TL  MA GGI D VGGGF RYSVD RW +PHFEKMLYD G L  +Y +  +   D 
Sbjct: 228 MLERTLAAMAAGGIFDQVGGGFCRYSVDARWMIPHFEKMLYDNGPLLGLYAERAAAGDDR 287

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 459
               +    + +L R+M  P G  +S+ DADS   EG    +EG FYVW  + VE +L E
Sbjct: 288 -ARRVAEQTVAWLEREMRDPSGAFYSSLDADS---EG----EEGRFYVWDPEMVEGLLPE 339

Query: 460 HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
              +     +           ++ P N F+G+  L E+   +  A  LG+   +    LG
Sbjct: 340 DEWVVASRVW----------GLNGPAN-FEGRWHLHEVAPIATVADALGIDESEAETRLG 388

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
             R +L   R +R RPH DDK++ +WN L+I+  ARA++ L                +R 
Sbjct: 389 RARERLLAAREQRVRPHRDDKILGAWNALMINGLARAARAL----------------ERH 432

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGP-SKAP-GFLDDYAFLISGLLDLYEFGS 637
           +++ +A +A   +R  L+ +   RL  SFR G  S+ P  +LDD+A L+   L L E   
Sbjct: 433 DWLGLARAAMRAVRERLWHDG--RLFASFREGATSELPRAYLDDHALLLEATLALLEVEW 490

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
               L WA  L       F D E GG+F T  +  +++ R K   D A  +GN ++   L
Sbjct: 491 DGDLLGWATTLAEALLADFEDTEHGGFFYTARDHEALIQRPKVYADDAMAAGNGIAAQAL 550

Query: 698 VRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVG 756
            +L  ++A  +   Y + AE +LA     ++   +    +  A DM   P    VVL G
Sbjct: 551 QKLGYLLAEPR---YLEAAERTLANAGPMIEQAPLGHMSLLVALDMHQQPP-PLVVLRG 605


>gi|441511562|ref|ZP_20993411.1| hypothetical protein GOAMI_01_00780 [Gordonia amicalis NBRC 100051]
 gi|441453542|dbj|GAC51372.1| hypothetical protein GOAMI_01_00780 [Gordonia amicalis NBRC 100051]
          Length = 674

 Score =  373 bits (958), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 230/636 (36%), Positives = 320/636 (50%), Gaps = 72/636 (11%)

Query: 89  RTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYS 148
           RTP  +S       N L +  SPYL QHA NPV W  W + A + AR RDVP+ LS+GY+
Sbjct: 6   RTPDGSS-------NTLGSATSPYLRQHADNPVHWQEWSDAALSRARDRDVPVLLSVGYA 58

Query: 149 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 208
            CHWCHVM  ESFEDE  A  +N  FV IKVDREERPD+D +YM    A+ G GGWP++ 
Sbjct: 59  ACHWCHVMAHESFEDETTAAQMNRDFVCIKVDREERPDIDAIYMAATVAMTGQGGWPMTC 118

Query: 209 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 268
           FL+PD  P   GTY+PP  +   P F+ +L  V +AW ++R  L  + A   E +    S
Sbjct: 119 FLTPDSDPFYTGTYYPPRPRGQMPSFRQVLTAVTEAWTQRRADLDDTAAKVREHIVVNTS 178

Query: 269 A-SASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 327
              A +  + D L  + +R   ++     D   GGFG APKFP    +  ++ H+++  D
Sbjct: 179 PLPAGTVPVDDRLLAHGVRTVLDE----EDREHGGFGGAPKFPPSALLDALIRHTERTGD 234

Query: 328 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 387
           T     A         T+  M +GGI+D +GGGF RYSVD  W VPHFEKMLYD  QL  
Sbjct: 235 TAAIEAAGR-------TMHAMGRGGIYDQLGGGFARYSVDAGWVVPHFEKMLYDNAQLLR 287

Query: 388 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 447
            Y      T D     +  + + +LRRD+  PGG   S+ DAD+   EG+T       YV
Sbjct: 288 AYAHLARRTGDALAHRVVEETVTFLRRDLRVPGG-FASSLDADAGGVEGST-------YV 339

Query: 448 WTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKL 507
           WT  E+ ++LG  A       +                       V+ E        S L
Sbjct: 340 WTPDELAEVLGPEAGRRAAELF-----------------------VVTEQGTFEHGRSTL 376

Query: 508 GMPLE-KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 566
            +P + +  + LG  R  LFD R++R +P  DDKV+ +WN + I++ A A   L    E+
Sbjct: 377 QLPADPEDRDRLGTVRAALFDARARRVQPTRDDKVVTAWNAMTITALAEAGAGL---GET 433

Query: 567 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 626
              +  V  +D              +R HL      RL+ S   G   A G LDD+A L 
Sbjct: 434 GFVDDAVRCAD------------ELLRGHLVG---GRLRRSSLGGAVGADGGLDDHAALS 478

Query: 627 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG-GGYFNTTGEDPSVLLRVKEDHDGA 685
           + LL L++    T+WL   + L +T  ELF D E  G +F+ TGE   ++ R ++  DGA
Sbjct: 479 TALLTLFQVTGETRWLGAGLGLLDTAIELFADPEAPGAWFDATGE--GLIARPRDPIDGA 536

Query: 686 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLA 721
            PSG S+    L+  + +    ++  Y +  EHSL+
Sbjct: 537 TPSGASLMAEALLTASMLADPERAVGYAELLEHSLS 572


>gi|298293757|ref|YP_003695696.1| hypothetical protein Snov_3807 [Starkeya novella DSM 506]
 gi|296930268|gb|ADH91077.1| protein of unknown function DUF255 [Starkeya novella DSM 506]
          Length = 672

 Score =  373 bits (958), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 236/616 (38%), Positives = 327/616 (53%), Gaps = 62/616 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL    SPYLLQH+ NPVDW+ W  EAF EAR+   PI LSIGY+ CHWCHVM  ESFE
Sbjct: 3   NRLQHAASPYLLQHSDNPVDWWQWQPEAFEEARRSGRPILLSIGYAACHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE  A ++N+ FV+IKVDREERP+VD++YM+ +Q L   GGWP+++FL  +  P  GGTY
Sbjct: 63  DEATAAVMNELFVNIKVDREERPEVDQIYMSALQQLGVQGGWPMTMFLDAEGAPFWGGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP E +YG+P F  +L+ + +A+      +A +    + +L +  +        P+EL  
Sbjct: 123 FPKEARYGQPAFTDVLKTMANAYGSGDPRIASNREALLARLRQKAAPVGKVTIGPNELDD 182

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
            A R+         DS+ GG   +PKFP    ++++    +  E TG+       +   L
Sbjct: 183 VAGRILG-----IMDSQHGGLQGSPKFPNTPFLELLW---RAWERTGR----QRLRDAAL 230

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
             L  M++GGI+DHVGGG+ RYSVDERW VPHFEKMLYD  Q+  +   A+S T    + 
Sbjct: 231 HALDGMSEGGIYDHVGGGYARYSVDERWLVPHFEKMLYDNAQILELLGLAYSETLADLFR 290

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHA 461
               + + +L+R+M+   G   ++ DADS   EG     EG +YVWT K+V D LG E A
Sbjct: 291 ARAEETVGWLQREMLTTSGAFAASLDADS---EG----HEGRYYVWTLKQVLDALGAEDA 343

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
             F  HY + P GN +   +S P       N L E+  S A   +L M            
Sbjct: 344 EFFARHYDIAPFGNWE--GVSIP-------NRLKEMERSPADEMRLAM-----------L 383

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           R KL  VR  R  P  DDKV+  WNGL+I++ A  +              P  G  R E+
Sbjct: 384 RDKLLKVRETRVPPGRDDKVLADWNGLMIAALANVA--------------PRFG--RPEW 427

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
           +E+A  A  FI   +  E   RL HS+R G    PG   DYA +I   L L++      +
Sbjct: 428 VELAARAFRFIAESMAREG--RLGHSWREGRLVFPGLSSDYAAMIGAALALHQATGEASY 485

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
              A+  Q  Q E     E GGY+ T  +   ++LR     D A  + N++   NLVRLA
Sbjct: 486 FDHAVAWQ-AQLEAHHAAEDGGYYLTADDAEGLILRPDAAADDAVTNPNALIARNLVRLA 544

Query: 702 SIVAGSKSDYYRQNAE 717
           ++   +  D YR+ A+
Sbjct: 545 AV---TGDDGYRERAD 557


>gi|372222108|ref|ZP_09500529.1| hypothetical protein MzeaS_07308 [Mesoflavibacter
           zeaxanthinifaciens S86]
          Length = 701

 Score =  373 bits (958), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 214/605 (35%), Positives = 330/605 (54%), Gaps = 47/605 (7%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K+TN L  E SPYLLQHAHNPVDW AW  E    A+  + PI +SIGY+ CHWCHVME E
Sbjct: 28  KYTNALVEETSPYLLQHAHNPVDWNAWKPEVLERAKAENKPILISIGYAACHWCHVMEEE 87

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
            FE+E VAKL+N+ F++IK+DREERPDVD++YM  +Q + G GGWPL++   PD +P  G
Sbjct: 88  CFENEEVAKLMNENFINIKIDREERPDVDQIYMDAIQMMTGNGGWPLNIVALPDGRPFWG 147

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS-ASASSNKLPD 278
            TY P ++      +   L+ + D +    + + Q  A  +EQ  +A++     ++K+  
Sbjct: 148 ATYLPKDN------WTKSLKSLIDLYHNDPEKV-QEYAGKLEQGIQAINLVENKTSKI-- 198

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
              +  L L  +  S S+D+  GG+  APKF  P  ++ +L+++        + +     
Sbjct: 199 HFTKEELDLAVQNWSTSFDTYLGGYKRAPKFMMPNNLEYLLHYA-------TANKNDTIL 251

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
           + V  TL  MA GGI D + GGF RY+VD +WHVPHFEKMLYD GQL ++Y  A+++TK+
Sbjct: 252 EYVNTTLTRMAYGGIFDPIDGGFSRYAVDVKWHVPHFEKMLYDNGQLISLYSKAYAVTKN 311

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             Y       + +   +++   G  +S+ DADS    G  + +EGA+YVWT KE++ ILG
Sbjct: 312 SLYKETVEKSVGFATLELLDTNGGFYSSLDADSKNNSG--KLEEGAYYVWTEKELDSILG 369

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
             + +FK +Y +   G  +           + K VLI     +  A  LG+        +
Sbjct: 370 SESSVFKTYYNINSYGYWE-----------EDKYVLIRDASDNELADSLGIATTNLTQQI 418

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
            +  ++L  VR +R +P LDDK++ SWNGL++     A + L+++               
Sbjct: 419 AKNLKQLKKVRGQREKPRLDDKILTSWNGLMLKGLTDAYRYLQND--------------- 463

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
            +Y+++A   A+F+ + +  +    +  + +NG S   GFLDDYA LI G + LYE    
Sbjct: 464 -KYLQLALKNANFLEQEIIQDD-FSVYRNHKNGKSSINGFLDDYATLIDGFIGLYEVTFD 521

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
            +WL  A  L +     F D+E   ++ T+  D  ++ R  E +D    + NS+   NL 
Sbjct: 522 DRWLTLAKNLTDYAITHFKDQESNMFYYTSDLDDKLIRRSIETNDNVISASNSIMANNLY 581

Query: 699 RLASI 703
           +L  +
Sbjct: 582 KLHKV 586


>gi|305665308|ref|YP_003861595.1| hypothetical protein FB2170_03390 [Maribacter sp. HTCC2170]
 gi|88710063|gb|EAR02295.1| hypothetical protein FB2170_03390 [Maribacter sp. HTCC2170]
          Length = 703

 Score =  373 bits (958), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 226/642 (35%), Positives = 350/642 (54%), Gaps = 78/642 (12%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TN L  E SPYLLQHAHNPV+W  W +E F EA K D  + +SIGYS+CHWCHVME E+F
Sbjct: 38  TNDLVKETSPYLLQHAHNPVNWKPWSDEIFEEATKEDKLVIISIGYSSCHWCHVMEEETF 97

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           EDE VA+++N+ F+S+KVDREERPDVD+VYMT VQ + G  GWPL+V + P+ KPL GGT
Sbjct: 98  EDEKVAEIMNNDFISVKVDREERPDVDQVYMTAVQLMSGNAGWPLNVIVLPNGKPLYGGT 157

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWD---KKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
           Y      +    +  +L K+ + +     K +  A   +  I+ ++    +  +S     
Sbjct: 158 Y------HTNAQWSQVLEKINNLYKDDPTKANEYADMVSKGIQDVNLIEPSEENS----- 206

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
           E+  + L+    Q   ++D   GG     KF  P  +  +L       D  +       +
Sbjct: 207 EISLDILKEGVTQWKPNWDLERGGNMGPEKFMLPGSLDFLL-------DYAELSNDESVR 259

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
             +  TL  MAKGGI+DH+ GGF+RYS D  W++PHFEKMLYD  QL ++Y  A+++ KD
Sbjct: 260 SYIKTTLDQMAKGGIYDHIAGGFYRYSTDPNWNIPHFEKMLYDNAQLISLYSKAYTIFKD 319

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             Y  I  + + +L+++M    G  F+A DADS   EG    +EG +YVWT++E+   + 
Sbjct: 320 PVYKQIVLETVAFLQKEMKNTTGGYFAALDADS---EG----EEGKYYVWTNEELRSTIN 372

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN-DSSASASKLGMPLEKYLNI 517
            +  LF ++Y             ++   + +G  +++  N +    AS+  + +EK   +
Sbjct: 373 NNQELFSKYY------------STEISTKMEGDKIVLRKNQNDEVFASENEISIEKLQEL 420

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
             E ++KL +VR+ R +P +DDK+IVSWN L+I+ +  A              F   G  
Sbjct: 421 NKEWKKKLVEVRADRVKPRIDDKIIVSWNALLINGYVDA--------------FKAFGET 466

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGS 637
           R  ++  AES  + I  + Y +  ++L HSF+ G ++  GFL+DY+FL +  L+LY    
Sbjct: 467 R--FLVEAESIFTTIHENAYSD--NQLVHSFKKGSNRTEGFLEDYSFLANASLNLYSASM 522

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGY-FNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
              +L +A +L  T  + F D +   Y FN++    S++ ++ ++ DG  PS N+V   N
Sbjct: 523 NPDYLNFAQQLIKTTQKRFKDDDSDFYKFNSSN---SLIAKIIKNDDGVIPSPNAVMAHN 579

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV-PLM 737
           L+ L  I      +Y +  A HS        K+M +++ PL+
Sbjct: 580 LLTLGHI------EYNKDYAAHS--------KNMLISIQPLL 607


>gi|428319651|ref|YP_007117533.1| hypothetical protein Osc7112_4848 [Oscillatoria nigro-viridis PCC
           7112]
 gi|428243331|gb|AFZ09117.1| hypothetical protein Osc7112_4848 [Oscillatoria nigro-viridis PCC
           7112]
          Length = 695

 Score =  373 bits (957), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 244/674 (36%), Positives = 350/674 (51%), Gaps = 82/674 (12%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRLA   S YL +HA NP+DW+ W +EA   AR  + PIFLSIGYS+CHWC VME E+F
Sbjct: 2   VNRLAQSQSLYLRKHAENPIDWWPWCDEALETARSENKPIFLSIGYSSCHWCTVMEGEAF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK-PLMGG 220
            D  +A+ +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL+VFL+PD + P  GG
Sbjct: 62  SDRAIAQYMNSHFIPIKVDREERPDIDSIYMQTLQMMTGQGGWPLNVFLTPDERVPFYGG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFP E +YGRPGF  +L+ ++  +D ++  +    A  +  L ++ + S  + +L  EL
Sbjct: 122 TYFPVEPRYGRPGFLEVLQAIRRFYDTEKGKVEAFKAEILSNLQQSAALSGVTAELNREL 181

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
            Q  L +    ++        G    P FP      M+ Y    L  T  + E+    K 
Sbjct: 182 FQKGLEINTGIVA--------GHNPGPSFP------MIPYAELALRGTRFNFESKYDSKQ 227

Query: 341 VLFTLQC-MAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS--LTK 397
           V       +A GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ+     + +S  + +
Sbjct: 228 VCTQRGLDLALGGIYDHVGGGFHRYTVDATWTVPHFEKMLYDNGQIVEYLANLWSAGIQE 287

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
             F + I   + ++L+R+MI P G  ++A+DADS  T      +EGAFYVWT  E+E +L
Sbjct: 288 PAFETAIAGTV-EWLKREMIAPTGYFYAAQDADSFNTSEEVEPEEGAFYVWTYAELEQLL 346

Query: 458 -GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI-----ELNDSSASA-SKL--- 507
             E     K  + +  +GN            F+GKNVL       L+D+  +A +KL   
Sbjct: 347 TAEELAEIKAQFTVSRSGN------------FEGKNVLQRRHPGRLSDTVETALAKLFAV 394

Query: 508 ---GMP-LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 563
              G P   K        +    D    R     D K+I +WN L+IS  ARA+ +  + 
Sbjct: 395 RYGGNPNTVKTFPPARNNQEAKNDSWPGRIPAVTDTKMIAAWNSLMISGLARAAAVFGN- 453

Query: 564 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 623
                           EY+E+A  AA+FI  + + E   R Q    +G S      +DYA
Sbjct: 454 ---------------LEYLELAVKAANFILDNQWTE--GRFQRLNYDGQSAVTAQSEDYA 496

Query: 624 FLISGLLDLYE----FGSGTK---------WLVWAIELQNTQDELFLDREGGGYFNTTGE 670
             +  LLDL++     G+G +         WL  A+++Q   DE     E GGY+N T +
Sbjct: 497 LFVKALLDLHQASLTLGNGEEAKQLPNSQFWLEKALQVQEEFDEFLWSVELGGYYN-TAQ 555

Query: 671 DPS--VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLK 728
           D S  +L+R +   D A P+ N +++ +LVRLA  + G   +Y  + AE  L  F + ++
Sbjct: 556 DASGDLLVRERSYIDNATPAANGIAIASLVRLA--LLGPNLEYLDR-AEQGLQAFSSIVQ 612

Query: 729 DMAMAVPLMCCAAD 742
           D   A P +  A D
Sbjct: 613 DSPQACPSLLSAID 626


>gi|428201584|ref|YP_007080173.1| thioredoxin domain-containing protein [Pleurocapsa sp. PCC 7327]
 gi|427979016|gb|AFY76616.1| thioredoxin domain protein [Pleurocapsa sp. PCC 7327]
          Length = 685

 Score =  373 bits (957), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 248/667 (37%), Positives = 346/667 (51%), Gaps = 76/667 (11%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           +NRLA+  S YL +HA NP+DW+ W EEA   A+ +D PIFLSIGYS+CHWC VME E+F
Sbjct: 2   SNRLASAQSLYLRKHADNPIDWWPWCEEALETAKAQDKPIFLSIGYSSCHWCTVMEREAF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGG 220
            D  +A+ +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL++FL P DL P  GG
Sbjct: 62  SDSAIAEYMNANFLPIKVDREERPDIDSIYMQALQMMTGQGGWPLNIFLIPGDLVPFYGG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFP E +YGRPGF  +L+ ++  +D +++ L      A++Q  E L     S  LP   
Sbjct: 122 TYFPLEPRYGRPGFLQVLQSIRRFYDVEKEKLD-----ALKQ--EILGGLKQSTILPIST 174

Query: 281 PQNALRLCAEQLSKSYDSRFGGFG-SAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS-EGQ 338
             +   L  E L +  ++  G     A  F RP    M+ Y S  L+ +    E+  +G+
Sbjct: 175 SDS---LSKELLYRGVETNTGVISIGASDFGRP-SFPMIPYASLALQGSRFQFESRYDGR 230

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL-TK 397
           ++     + +A GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ+     + +S   K
Sbjct: 231 QLSARRGEDLALGGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNGQILEYLSNLWSAGMK 290

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           +  +       + +L+R+M  P G  ++A+DADS  +  A+  +EGAFYVW   E+E IL
Sbjct: 291 EPAFERAIAGTVAWLKREMTTPEGYFYAAQDADSFTSTEASEPEEGAFYVWRYDELEKIL 350

Query: 458 GEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
               +   K  + +   GN            F+G NVL         + KL   LE  L+
Sbjct: 351 TADELEELKAAFTITEKGN------------FEGSNVL-----QRKESGKLSDSLEAILD 393

Query: 517 ILGECR--RKLFDVRSKRPRPH----------------LDDKVIVSWNGLVISSFARASK 558
            L E R   K  ++ +  P  +                 D K+I +WN L IS  ARA  
Sbjct: 394 KLFEVRYGAKSTEIETFVPARNNQEAKTGNWKGRIPAVTDTKMIAAWNSLTISGLARA-- 451

Query: 559 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE-QTHRLQHSFRNGPSKAPG 617
                   A+F  P        Y E+A  AA FI  + + E + HRL +    G +    
Sbjct: 452 -------YAVFGEP-------SYWELATRAAKFILEYQWIEGRFHRLNY---EGQATVLA 494

Query: 618 FLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPS-VL 675
             +DYAF I  LLDL     + T WL  A+E+Q   DE F   E GGYFNT  +D   +L
Sbjct: 495 QSEDYAFFIKALLDLQAASPTETFWLEKAVEVQQEFDEFFWSLEMGGYFNTAADDSGDLL 554

Query: 676 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 735
           +R +   D A P+ N V++ NL+R+A +    +   Y   AE  L  F   L+    A P
Sbjct: 555 VRSRSYIDNATPAANGVAIANLIRIALLTENLE---YLDRAEQGLQAFSAVLQQSPQACP 611

Query: 736 LMCCAAD 742
            +  A D
Sbjct: 612 SLFAALD 618


>gi|359774323|ref|ZP_09277696.1| hypothetical protein GOEFS_115_01140 [Gordonia effusa NBRC 100432]
 gi|359308634|dbj|GAB20474.1| hypothetical protein GOEFS_115_01140 [Gordonia effusa NBRC 100432]
          Length = 654

 Score =  373 bits (957), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 223/630 (35%), Positives = 325/630 (51%), Gaps = 79/630 (12%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL    SPYL QHA NPV W  W  +AFAEA  RDVP+ LS+GY+ CHWCHVM  E FE
Sbjct: 2   NRLTNSTSPYLRQHADNPVHWREWSNDAFAEAVARDVPVLLSVGYAACHWCHVMAHECFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +E +A  +N  FV IKVDREERPD+D +YM    A+ G GGWP++ FL+P  +P   GTY
Sbjct: 62  NEQIAAQMNAEFVCIKVDREERPDIDAIYMNATVAMTGQGGWPMTCFLTPAGEPFYCGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE--L 280
           FPP  + G+PGF  ++  + D W  +RD + + G    ++L+  L  SA+S  LPD   +
Sbjct: 122 FPPSPRNGQPGFTELMSAITDTWINRRDEVTRVG----KELTGHL--SAASGGLPDAQFV 175

Query: 281 PQNALRL-CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
             +AL +  + +L    D   GGFG APKFP   +++ +L H ++  D        E   
Sbjct: 176 LDDALAIHASNELVAQEDRAHGGFGGAPKFPPSAQLEALLRHYERTGD-------REALG 228

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD- 398
           +V  T Q MA+GGI+D +GGGF RY+VD  W +PHFEKMLYD  QL  VY     +  D 
Sbjct: 229 VVERTAQAMARGGIYDQLGGGFSRYAVDIAWAIPHFEKMLYDNAQLLRVYAHLACVASDA 288

Query: 399 -VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
               + +  + +D+L  D+   GG   S+ DAD+   EGAT       YVWT +E +++L
Sbjct: 289 SAMAARVTAETVDFLATDLRVEGG-FASSLDADTDGVEGAT-------YVWTRREFDELL 340

Query: 458 GEHAILFKEHYYLKPTGNCD-----LSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 512
           G  +    E + +  TG  +     L    DP N                        ++
Sbjct: 341 GSDSDWAAELFTVTETGTFEHGTSTLQLPVDPDN------------------------VQ 376

Query: 513 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 572
           ++  ++   R      R KRP+P  D KV+ +WNG+ I+    A   L            
Sbjct: 377 RFAAVVDRLRA----AREKRPQPGRDGKVVTAWNGMTITGLVEAGTAL------------ 420

Query: 573 VVGSDRKEYMEVAE-SAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLD 631
               +R E++++A   A   + RH+ + +  R   S        PG LDD+A L++GLL 
Sbjct: 421 ----NRPEWVDLAAWCADELLSRHIVEGELRRT--SLDGVVGTTPGMLDDHAALVTGLLG 474

Query: 632 LYEFGSGTKWLVWAIELQNTQDELFLDREG-GGYFNTTGEDPSVLLRVKEDHDGAEPSGN 690
           L+   +  +WL  AI L +    LF D +  G +F+       ++ R ++  DGA PSG 
Sbjct: 475 LFAATAQERWLDAAIALLDKAIGLFGDPDAQGSWFDAPAGATGLITRPRDPADGATPSGG 534

Query: 691 SVSVINLVRLASIVAGSKSDYYRQNAEHSL 720
           S+    L+  + + A  K+  Y + A+ +L
Sbjct: 535 SLMAEALLTASMLAAPEKAGSYLELADATL 564


>gi|390452556|ref|ZP_10238084.1| hypothetical protein PpeoK3_00885 [Paenibacillus peoriae KCTC 3763]
          Length = 628

 Score =  373 bits (957), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 234/634 (36%), Positives = 333/634 (52%), Gaps = 61/634 (9%)

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           ME ESFEDE +A++LN  +VSIKVDREERPDVD +YM+  Q + G GGWPL++ ++PD K
Sbjct: 1   MERESFEDEEIAEILNRDYVSIKVDREERPDVDHIYMSICQTMTGHGGWPLTILMTPDQK 60

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           P   GTY P E K+GR G   +L KV   W ++ + L       +E   + L+     + 
Sbjct: 61  PFFAGTYLPKEQKFGRIGLLELLDKVGTRWKEQPEEL-------VELSEQVLTEHERQDM 113

Query: 276 LP---DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSG 332
           L     EL + +L     Q S ++D  +GGFG APKFP P  +  +L +++       SG
Sbjct: 114 LAGYRGELDEQSLNKAFHQYSHTFDKEYGGFGEAPKFPAPHNLSFLLRYAQ------HSG 167

Query: 333 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 392
              +  +M   TL  M +GGI+DHVG GF RYSVDE+W VPHFEKMLYD   LA  Y + 
Sbjct: 168 N-QQALEMAEKTLDAMYRGGIYDHVGMGFSRYSVDEKWLVPHFEKMLYDNALLAIAYTET 226

Query: 393 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 452
           + +T    Y  I   I  Y+ RDM   GG  +SAEDADS   EG    +EG FYVW   E
Sbjct: 227 WQVTGKGLYRQIAEQIFTYIARDMTDVGGAFYSAEDADS---EG----EEGRFYVWNEAE 279

Query: 453 VEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELNDSSASASKLGM 509
           +  +LG+  A  F + Y + P GN            F+G N+  LI++N   A   K  +
Sbjct: 280 IRAVLGDRDAAFFNDLYGITPYGN------------FEGHNIPNLIDIN-LEAYGLKHDL 326

Query: 510 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 569
             ++  + + E R KLF VR KR  PH DDK++ SWNGL+I++ A+A +           
Sbjct: 327 TKQELEDRVRELRDKLFAVREKRVHPHKDDKILTSWNGLMIAALAKAGQAFGD------- 379

Query: 570 NFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 629
              V+      Y E A+ A SF+  HL      RL   +R+G +  PG+LDDYAF + GL
Sbjct: 380 ---VI------YTERAQKAESFLWNHL-RRANGRLLARYRDGDAAYPGYLDDYAFYVWGL 429

Query: 630 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 689
           ++LY+     ++L  A+ L     +LF D E  G F    +   ++ + KE +DGA PSG
Sbjct: 430 IELYQATFDVQYLQRALTLNQNMIDLFWDEEHHGLFFYGKDSEQLIAKPKEIYDGAIPSG 489

Query: 690 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSR 749
           NS++  NLVRLA +   ++ + Y   A      F   +     A   +  +  + +  + 
Sbjct: 490 NSIAAHNLVRLARLTGEARLEDY---AAKQFKAFGGMVSYDPSAYSALLSSL-LYATGTT 545

Query: 750 KHVVLVGHKSSVDFENMLAAAHASYDLNKTVSKK 783
           K +V+VG +        + A  A +  N  V  K
Sbjct: 546 KEIVVVGQRDDPQTLQFIRAIQAGFRPNTVVILK 579


>gi|428777664|ref|YP_007169451.1| hypothetical protein PCC7418_3117 [Halothece sp. PCC 7418]
 gi|428691943|gb|AFZ45237.1| hypothetical protein PCC7418_3117 [Halothece sp. PCC 7418]
          Length = 677

 Score =  372 bits (956), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 242/656 (36%), Positives = 353/656 (53%), Gaps = 60/656 (9%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRLA   S YL +HA NP+DW+ W  EA  +A+  D PIFLS+GYS+CHWC VME E+F
Sbjct: 2   TNRLAETESLYLRKHAENPIDWWYWCPEALEKAKTEDKPIFLSVGYSSCHWCTVMEGEAF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK-PLMGG 220
            D  +A+ LND FV IKVDREERPD+D +YM  +Q + G GGWPL++FL+PD + P  GG
Sbjct: 62  SDSAIAQYLNDNFVPIKVDREERPDLDSIYMQALQMMTGQGGWPLNIFLTPDDRVPFYGG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFP E ++GRPGF  IL+ ++  +D++++ L     F  E +   L  SA+       L
Sbjct: 122 TYFPIEPRFGRPGFLDILKAIRRFYDQEKEKL---NTFKSEVMG-LLQQSAT-------L 170

Query: 281 PQNALRLCAEQLSKSYDSRFGGF---GSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEG 337
           P+    L ++ L+K  ++  G     G+ P FP      M+ Y    L  T  + E+   
Sbjct: 171 PETQTNLNSDLLTKGIETGVGITSHRGTPPSFP------MIPYAQLALRGTRFNYESRYD 224

Query: 338 QKMVLFTLQC-MAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS-- 394
            K V       +A GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ+     + +S  
Sbjct: 225 AKDVAQQRGYDLALGGIYDHVGGGFHRYTVDGTWTVPHFEKMLYDNGQIVEYLANLWSSG 284

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
           + +  F S I + + ++L+R+M  P G  ++++DADS  T  A   +EGAFYVW+ +E+E
Sbjct: 285 VEEPAFKSAIAQTV-EWLQREMTAPEGYFYASQDADSFTTSEADEPEEGAFYVWSDRELE 343

Query: 455 DIL-GEHAILFKEHYYLKPTGNCD----LSRMSDPHNEFKGKNVLIELNDSSASASKLGM 509
            +L  E     +  + +   GN +    L R +  +   + KN L +L ++    S +  
Sbjct: 344 TLLTAEELQALQSEFTVTAEGNFEGSNVLQRQNGGNLSNEAKNALKKLFNARYGNSSIAT 403

Query: 510 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 569
                 N   E +   ++ R     P  D K+I +WN L+IS  ARA             
Sbjct: 404 FPPATNN--SEAKTTAWEGRIP---PVTDTKMITAWNSLMISGLARA------------- 445

Query: 570 NFPVVGSDRKEYMEVAESAASFIRRHLYDE-QTHRLQHSFRNGPSKAPGFLDDYAFLISG 628
            + V G   K Y + A  A +FI  + + E + HRL +   NG +      +DYA  I  
Sbjct: 446 -YAVFG--EKTYWDCAVKATNFIWENQWVEGRFHRLNY---NGKATVSAQSEDYALFIKA 499

Query: 629 LLDLYE-FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPS-VLLRVKEDHDGAE 686
           LLDL+       +WL  A++LQ   DE     E GGYFNT  ++ + +++R +   D A 
Sbjct: 500 LLDLHACHPEQPQWLDQAVQLQAEFDEYLWSVETGGYFNTANDNSNDLIVRERTYIDNAT 559

Query: 687 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD 742
           P+ N V+V NLV+L  I    ++DY   +AE +L  F + ++    A P +    D
Sbjct: 560 PAANGVAVANLVQLFEIT--EQTDYL-ASAEKTLNAFSSIMEKSPQACPGLFSGLD 612


>gi|218437933|ref|YP_002376262.1| hypothetical protein PCC7424_0938 [Cyanothece sp. PCC 7424]
 gi|218170661|gb|ACK69394.1| protein of unknown function DUF255 [Cyanothece sp. PCC 7424]
          Length = 687

 Score =  372 bits (956), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 247/674 (36%), Positives = 344/674 (51%), Gaps = 92/674 (13%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   S YL +HA NP+DW++W +EA + A+  + PIFLSIGYS+CHWC VME E+F 
Sbjct: 3   NRLAQVKSLYLRKHADNPIDWWSWCDEALSSAKAENKPIFLSIGYSSCHWCTVMEGEAFS 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGGT 221
           D  +A+ +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL++FL+P DL P  GGT
Sbjct: 63  DGAIAEYMNANFLPIKVDREERPDLDSIYMQALQMMIGQGGWPLNIFLTPDDLVPFYGGT 122

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL-SEALSASASSNKLPDEL 280
           YFP E +Y RPGF  +L+ V+  +D +++ L       +E L +  +   + +N    EL
Sbjct: 123 YFPVEPRYNRPGFLQVLQSVRHFYDTEKEKLKSFKQEILEVLHNSTILPLSDTNLQAHEL 182

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK-KLEDTGKSGEASEGQK 339
               L+   + ++KS     G FG  P FP      ++L  S+ K E      +A+E + 
Sbjct: 183 FYRGLKTNTQVITKS----VGDFGR-PSFPMIPYASLILQGSRFKFESDYDGKQAAEARG 237

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
             L      A GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ+     + +S     
Sbjct: 238 ADL------ALGGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNGQIIEYLANLWSSGSQ- 290

Query: 400 FYSYICRDI---LDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
            Y    R I     +L+R+M  P G  ++A+DAD+         +EGAFYVW   ++E +
Sbjct: 291 -YPSFQRAIAGTAQWLKREMTAPEGYFYAAQDADNFVHSEDAEPEEGAFYVWRYSDLEKL 349

Query: 457 LGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
           L E  +   K  + + P GN            F+G NVL          ++ G   E + 
Sbjct: 350 LSEDELEALKTAFTITPEGN------------FEGSNVL--------QRTQEGTFTEDFE 389

Query: 516 NILGECRRKLFDVR-------------------------SKRPRPHLDDKVIVSWNGLVI 550
            IL     KLF VR                           R  P  D K+IV+WN L+I
Sbjct: 390 EILD----KLFGVRYGASSQDIEHFPPARNNQEAKTGNWQGRIPPVTDTKMIVAWNSLMI 445

Query: 551 SSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRN 610
           S  ARA  + +          P+       Y E+A  AA FI ++ +  Q  RL      
Sbjct: 446 SGLARAYGVFRE---------PL-------YWELATGAAEFICQNQW--QNGRLHRLNYE 487

Query: 611 GPSKAPGFLDDYAFLISGLLDLYE-FGSGTKWLVWAIELQNTQDELFLDREGGGYFNT-T 668
           G +      +DYAFLI  LLDL   F S T+WL  AIE+Q   D LF   E GGY+N  T
Sbjct: 488 GQATVLAQSEDYAFLIKALLDLQTAFPSKTEWLNKAIEIQEEFDNLFCSVEMGGYYNNAT 547

Query: 669 GEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLK 728
                +L+R +   D A PS N +++ NL+RL  +   +++  Y + AE +L  F + L 
Sbjct: 548 DNSEDLLVRERSYLDNATPSANGIAITNLIRLGRL---TENLSYFEQAERALQAFSSILS 604

Query: 729 DMAMAVPLMCCAAD 742
               A P +  A D
Sbjct: 605 QSPQACPSLFTALD 618


>gi|427728058|ref|YP_007074295.1| hypothetical protein Nos7524_0793 [Nostoc sp. PCC 7524]
 gi|427363977|gb|AFY46698.1| highly conserved protein containing a thioredoxin domain [Nostoc
           sp. PCC 7524]
          Length = 688

 Score =  372 bits (955), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 242/673 (35%), Positives = 347/673 (51%), Gaps = 90/673 (13%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRLA   S YL +HA NP+DW+ W +EAFA AR  D PIFLSIGYS+CHWC VME E+F
Sbjct: 2   TNRLAQAQSLYLRKHAENPIDWWPWCDEAFATARAEDKPIFLSIGYSSCHWCTVMEGEAF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGG 220
            D+ +A+ +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL+VFL+P DL P   G
Sbjct: 62  SDQALAEYMNANFLPIKVDREERPDIDSIYMQALQMMSGQGGWPLNVFLTPEDLVPFYAG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL--SEALSASASSNKLPD 278
           TYFP E +Y RPGF  +L+ ++  +D +++ L Q  A  +E L  S  L   A+      
Sbjct: 122 TYFPLEPRYNRPGFLQVLQALRRYYDTEKEELRQRKAVILESLLTSAVLQGDATQEAEAQ 181

Query: 279 ELPQNALRLCAEQLSKSYDSRFG-----GFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGE 333
           EL           L + +++  G      +G++  FP      M+ Y    L  T  +  
Sbjct: 182 EL-----------LGRGWETSTGIITPNQYGNS--FP------MIPYAELALRGTRFNFP 222

Query: 334 AS-EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 392
           +  + Q++       +A GGI+DHV GGFHRY+VD  W VPHFEKMLYD GQ+     + 
Sbjct: 223 SRYDAQQVCTQRGLDLALGGIYDHVAGGFHRYTVDPTWTVPHFEKMLYDNGQIVEFLANL 282

Query: 393 FSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSK 451
           +S   ++  ++      +++L+R+M  P G  ++A+DADS      T  +EGAFYVW+  
Sbjct: 283 WSAGIQEPAFTRAVAGTIEWLQREMTAPEGYFYAAQDADSFTNPAETEPEEGAFYVWSYT 342

Query: 452 EVEDILGEHAIL-FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 510
           E+ ++L    +   ++ + + P GN            F+GKNVL   N       +L + 
Sbjct: 343 ELAELLSPTELAELQQQFTVTPNGN------------FEGKNVLQRRN-----PGQLSIT 385

Query: 511 LEKYLNILGECR--------------RKLFDVRSK----RPRPHLDDKVIVSWNGLVISS 552
           LE  L+ L   R              R   + ++     R     D K+IV+WN L+IS 
Sbjct: 386 LETALDKLFTARYGAAPDALETFPPARDNQEAKTSNWPGRIPSVTDTKMIVAWNSLMISG 445

Query: 553 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRH-LYDEQTHRLQHSFRNG 611
            ARA         +A+F  P+ G       ++A  AA FI +H L + + HRL +    G
Sbjct: 446 LARA---------AAVFQEPIYG-------DIAARAAKFILQHQLVNGRFHRLNY---QG 486

Query: 612 PSKAPGFLDDYAFLISGLLDLYEFGSGTK-WLVWAIELQNTQDELFLDREGGGYFNTTGE 670
                   +DYAF I  LLDL       + WL  AI LQ   +E     E GGYFNT  +
Sbjct: 487 QPTVLAQSEDYAFFIKALLDLQACSPEQRFWLENAIALQTEFNEFLWSVELGGYFNTASD 546

Query: 671 -DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKD 729
               +++R +   D A PS N V++ NLVRL  +   +   +Y   AE  L  F + ++ 
Sbjct: 547 ASQELIVRERSYADNATPSANGVAIANLVRLTLL---TDDLHYLDLAEQGLKAFNSVMQQ 603

Query: 730 MAMAVPLMCCAAD 742
              A P +  A D
Sbjct: 604 APQACPSLFTALD 616


>gi|307107988|gb|EFN56229.1| hypothetical protein CHLNCDRAFT_145019 [Chlorella variabilis]
          Length = 648

 Score =  372 bits (955), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 203/465 (43%), Positives = 277/465 (59%), Gaps = 30/465 (6%)

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           M  F+L+ MA GG+ DHVGGGFHRYSVDE WHVPHFEKMLYD  QLA  YL AF +T+D 
Sbjct: 114 MATFSLRQMAAGGMWDHVGGGFHRYSVDEYWHVPHFEKMLYDNPQLAATYLAAFQITRDA 173

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG- 458
            Y+ + R I DYL R M  PGG +F+AEDADS +   +  KKEG FYVW+ +E++ +LG 
Sbjct: 174 QYAGVARGIFDYLLRGMTHPGGGLFAAEDADSLDP-ASGDKKEGWFYVWSWEELQQLLGP 232

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
           E A  F  HYY K  GNCDLS  SDPH EF G N LI+    + +A+            L
Sbjct: 233 EDAPAFCAHYYAKQGGNCDLSPRSDPHGEFVGLNCLIQRQSLAQTAAAAARGEADTAAAL 292

Query: 519 GECRRKLFDVRSKRPRPHLDDK-----------------------VIVSWNGLVISSFAR 555
             CR KLF  R +RPRPH DDK                       ++ +WNG+ IS++A 
Sbjct: 293 AACREKLFRARERRPRPHRDDKARARGRGGAWPRILSNPWQHRLLIVAAWNGMAISAYAL 352

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA 615
           AS+IL  E   A   FPV G    +Y++ A  AA+F+R+HL+D +T RL+  F  GPS  
Sbjct: 353 ASRILPHEQPPAARCFPVEGRPPGDYLQAALQAAAFVRQHLWDGETGRLRRCFTTGPSAV 412

Query: 616 PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL 675
            GF DDYA++++GLLDL+          WA++LQ T DE+  D  GG YF+    D S+L
Sbjct: 413 EGFADDYAWMVAGLLDLHSTTGD-----WALQLQGTMDEVLWDEAGGAYFSGVAGDASIL 467

Query: 676 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 735
           LR+KED+DGAEP+ +S+++ NL RLA +    +S  +R+ A    A F  RL +  +A+P
Sbjct: 468 LRMKEDYDGAEPAASSIALANLWRLAGLCGTEESARWRERAAKCAAAFAERLGEAPVALP 527

Query: 736 LMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
            M  +  +L++   + V++ G + + D + +L AA  S+  +  V
Sbjct: 528 QMAASLHLLTLGHPRQVIIAGAQGAPDTQALLDAAFYSFTPDMVV 572



 Score =  153 bits (387), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 68/88 (77%), Positives = 74/88 (84%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRL+ E SPYLLQHAHNPVDW+ WGEEAF  ARK D PIFLS+GYSTCHWCHVME ESF
Sbjct: 17  TNRLSKEESPYLLQHAHNPVDWYPWGEEAFERARKEDKPIFLSVGYSTCHWCHVMERESF 76

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDK 189
           E E  A L+N  FV++KVDREERPDVDK
Sbjct: 77  ESEETAALMNQLFVNVKVDREERPDVDK 104


>gi|126659475|ref|ZP_01730608.1| hypothetical protein CY0110_07109 [Cyanothece sp. CCY0110]
 gi|126619209|gb|EAZ89945.1| hypothetical protein CY0110_07109 [Cyanothece sp. CCY0110]
          Length = 686

 Score =  372 bits (954), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 249/666 (37%), Positives = 351/666 (52%), Gaps = 76/666 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   S YL +HA NP+DW+ W EEA   A++ + PIFLSIGYS+CHWC VME E+F 
Sbjct: 3   NRLANTQSLYLRKHAENPIDWWYWCEEALEAAKQENKPIFLSIGYSSCHWCTVMEGEAFS 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGGT 221
           D+ +A  LND F+ IKVDREERPD+D +YM+ +Q +   GGWPL++FL+P DL P  GGT
Sbjct: 63  DQAIATYLNDNFLPIKVDREERPDLDSIYMSSLQMMGIQGGWPLNIFLTPGDLVPFYGGT 122

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP E +YGRPGF  +L+ ++  +D +++ L     F  E L + L  SA+       LP
Sbjct: 123 YFPVEPRYGRPGFLQVLQSIRHFYDVEKEKL---NGFKQEIL-KGLQQSAT-------LP 171

Query: 282 QNALRLCAEQL-SKSYDSRFGGFG-SAPKFPRPVEIQMMLYHSKKLEDTG-KSGEASEGQ 338
            + + +   QL  +  D        +A  F RP    M+ Y +  LE T    GE  E Q
Sbjct: 172 MSEIDVNNAQLIYRGVDVNTKIIQVTAEDFGRPC-FPMIPYSNLALEGTRFLFGEPEERQ 230

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ----LANVYLDAFS 394
           K+V+   Q +A GGI DHVGGGFHRY+VD  W VPHFEKMLYD GQ    LAN++ +   
Sbjct: 231 KLVIQRGQDLALGGIFDHVGGGFHRYTVDSTWTVPHFEKMLYDNGQIMEYLANLWSNG-- 288

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
             ++  +       + +L+R+M  P G  ++A+DADS  T+     +EG FYVW  +++E
Sbjct: 289 -QQEPAFERAIALTVQWLQREMTSPEGYFYAAQDADSFATKEDKEPEEGTFYVWKYEQLE 347

Query: 455 DILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
            +L    +    E + + P GN            F+GKNVL   N S  S S + + L+K
Sbjct: 348 QLLNTKKLEELTEVFTITPEGN------------FEGKNVLQRRNGSKFSDS-IEIILDK 394

Query: 514 -YLNILGECRRKL---FDVRSKRPRPHL----------DDKVIVSWNGLVISSFARASKI 559
            +    G  R  L      ++ +    +          D K+IV+WN L+IS  ARA  I
Sbjct: 395 LFQERYGTSRNNLETFLPAKNNQEAQEINWPGRIPAVTDTKMIVAWNSLMISGLARAYAI 454

Query: 560 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFI-RRHLYDEQTHRLQHSFRNGPSKAPGF 618
            K          P+       Y ++  +A  FI  +   + + HR+ +    G       
Sbjct: 455 FKQ---------PL-------YWQLGCNATQFILNKQWLNGRLHRINYE---GNPSILAQ 495

Query: 619 LDDYAFLISGLLDLYEFGSG-TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPS-VLL 676
            +DY FLI  LLDL+   +  T+WL  AIE+Q   DE F   E GGY+N   ++ + +L+
Sbjct: 496 SEDYGFLIKALLDLHAANAQETQWLDKAIEIQQEFDEFFWSLEMGGYYNNAADNSNDLLV 555

Query: 677 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 736
           R +   D A PS N +++ NLVRLA +        Y   AE  L  F   L +   A P 
Sbjct: 556 RERSYIDNATPSANGIAISNLVRLARLTDNLD---YLDKAEQGLQAFSHILSESPRACPS 612

Query: 737 MCCAAD 742
           +  A D
Sbjct: 613 LLTALD 618


>gi|145593487|ref|YP_001157784.1| hypothetical protein Strop_0929 [Salinispora tropica CNB-440]
 gi|145302824|gb|ABP53406.1| protein of unknown function DUF255 [Salinispora tropica CNB-440]
          Length = 699

 Score =  371 bits (953), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 221/599 (36%), Positives = 307/599 (51%), Gaps = 44/599 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   SPYL+QH  NPVDW+ W  EAFAEA +RDVPI +S+GY+ CHWCHVM  ESF 
Sbjct: 2   NRLAGATSPYLIQHKDNPVDWWPWCAEAFAEAHRRDVPIMISVGYAACHWCHVMAHESFA 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA LLN+ FV+IKVDREERPDVD VYMT  QA+ G GGWP++VF +PD  P   GTY
Sbjct: 62  DEQVAALLNEGFVAIKVDREERPDVDAVYMTATQAMTGQGGWPMTVFAAPDGTPFFCGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP      +P F  +L+ V  AW  +R  + Q GA  +E +  A +    S  L  +L  
Sbjct: 122 FP------KPNFLRLLQSVTTAWQDQRSAVLQQGAAVVEAIGGAQAVGGPSAPLTVDL-- 173

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L   A++L + YD   GGFG APKFP  + +  +L   ++  D           ++V 
Sbjct: 174 --LDAAADRLGEEYDEANGGFGGAPKFPPHLNLLFLLRRYQRTGD-------QRSLEIVR 224

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            T + MA+GG+HD + GGF RY VD +W VPHFEKMLYD   L  VY   + LT D    
Sbjct: 225 HTAEAMARGGLHDQLAGGFARYCVDGQWAVPHFEKMLYDNALLLRVYTHLWRLTGDPMAR 284

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            + RD   +L  ++  PG    SA DAD+   EG T       YVWT  ++ + LGE   
Sbjct: 285 RVARDTARFLADELHRPGEGFASALDADADGVEGLT-------YVWTPAQLVEALGEEDG 337

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL---- 518
            +    +            + P  E +      E    SAS  +L   ++     +    
Sbjct: 338 RWAADLFAVTEQGSFTPHAASPPGEARSG---AEAAAQSASVLRLARDVDDATPEVQARW 394

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
            E   +L  VR  RP+P  DDKV+ +WNGL I++ A   ++    AE A    P   ++ 
Sbjct: 395 QEIAHRLLVVRDARPQPARDDKVVAAWNGLAITAIAEFQQVAAGYAEDA----PGPDANL 450

Query: 579 KEYMEVA------ESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 632
            E + +       ++A    R HL   +  R     R G  +A G L+DY  +      +
Sbjct: 451 MEGVTIVADGAMRDAAEHLARVHLVAGRLRRTSRDGRVG--EAAGVLEDYGCVAEAFCAM 508

Query: 633 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNS 691
           ++     +WL+ A +L +   E F   + G +++T  +   ++ R  +  D A PSG S
Sbjct: 509 HQLTGEGRWLILAGQLLDVALERFAAPQ-GSFYDTADDAERLVSRPADPTDNATPSGRS 566


>gi|300770884|ref|ZP_07080761.1| thymidylate kinase [Sphingobacterium spiritivorum ATCC 33861]
 gi|300762157|gb|EFK58976.1| thymidylate kinase [Sphingobacterium spiritivorum ATCC 33861]
          Length = 672

 Score =  371 bits (953), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 220/618 (35%), Positives = 317/618 (51%), Gaps = 67/618 (10%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           +N+L  EHSPYL QHAHNPV W  WGEEA  +A+  +  I +SIGYS CHWCHVME ESF
Sbjct: 2   SNQLQYEHSPYLKQHAHNPVHWMPWGEEALTKAKTENKLIIISIGYSACHWCHVMERESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           E++ +A+ +N ++VS+K+DREERPD+D++YMT VQ +   GGWPL+    PD +P+ GGT
Sbjct: 62  ENDAIAQTMNKFYVSVKIDREERPDIDQIYMTAVQLMTNAGGWPLNCICLPDGRPIYGGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIE---QLSEALSASA--SSNKL 276
           YF P D      ++ IL ++   W+       Q    AIE   +L++ +  S     N +
Sbjct: 122 YFKPHD------WQNILLQIAQMWE-------QQPLVAIEYATKLTDGIQQSERLPINPI 168

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
           PD+     L          +D++ GG+  APKFP P     +L          + G  + 
Sbjct: 169 PDQYNTADLSAIITPWVALFDTKDGGYNRAPKFPLPNNWLFLL----------RYGVLAG 218

Query: 337 GQKM---VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
            +K+   V FTLQ MA GGI+D +GGGF RYSVD  WH+PHFEKMLYD GQL +++ +A+
Sbjct: 219 DEKIIDHVHFTLQKMACGGIYDQIGGGFARYSVDPYWHIPHFEKMLYDNGQLLSLFSEAY 278

Query: 394 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
                 FY  + ++ + +  R+M+      + A DADS   EG     EG +Y ++  E+
Sbjct: 279 QQRPLPFYKRVVQETIHWANREMLAANNGFYCALDADS---EGV----EGKYYSFSKSEI 331

Query: 454 EDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
           E ILGE A LF  ++ +   GN             +  N+ I   D+   A + G   E+
Sbjct: 332 EKILGEDAPLFISYFNITAEGNWTE----------ESTNIPILDPDADLMALEAGYSAEE 381

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
           +   L E + KL+  R  R RP LD K + +WN L++     A ++              
Sbjct: 382 WETCLAEAKEKLYRYRETRIRPGLDHKQLATWNALMLKGLTDAYRVF------------- 428

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
              D   Y++ A   A FI   L  +   R+ H  ++   +  GFLDDYAF     + LY
Sbjct: 429 ---DNSSYLDTAIKNAHFIIDELI-KSDGRILHQPKDANREIFGFLDDYAFTTEAFIALY 484

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           E     KWL  A +L +   ELF D     ++ T      ++ R  E  D   P+  S  
Sbjct: 485 EATFDEKWLDLARQLADKALELFYDSHQKTFYYTADSSGELIARKSEIMDNVIPASTSAI 544

Query: 694 VINLVRLASIVAGSKSDY 711
           V+ L +L  +    K DY
Sbjct: 545 VLQLKKLGLLF--DKEDY 560


>gi|159036527|ref|YP_001535780.1| hypothetical protein Sare_0871 [Salinispora arenicola CNS-205]
 gi|157915362|gb|ABV96789.1| protein of unknown function DUF255 [Salinispora arenicola CNS-205]
          Length = 699

 Score =  371 bits (953), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 223/600 (37%), Positives = 311/600 (51%), Gaps = 46/600 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   SPYLLQH  NPVDW+ W  EAFAEA +RDVP+ +S+GYS CHWCHVM  ESF 
Sbjct: 2   NRLADATSPYLLQHKDNPVDWWPWCAEAFAEAERRDVPVLISVGYSACHWCHVMAHESFA 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE V  LLN+ FV+IKVDREERPDVD VYMT  QA+ G GGWP++VF +PD  P   GTY
Sbjct: 62  DEQVGALLNENFVAIKVDREERPDVDAVYMTATQAMTGQGGWPMTVFATPDGTPFFCGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP      +P F  +L+ V  AW  +R  + + GA  +E +  A +    S  L  EL  
Sbjct: 122 FP------KPNFLRLLQSVAAAWRDQRAAVLRQGAAVVEAIGGAQAVGGPSAPLTAEL-- 173

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L   A++L++ YD   GGFG APKFP  + +  +L   ++ + TG    A    +++ 
Sbjct: 174 --LDAAADRLAEEYDETNGGFGGAPKFPPHLNLLFLL---RQYQRTG----AQRSLEIIR 224

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            T + MA+GG+HD + GGF RYSVD RW VPHFEKMLYD   L  VY   + LT D    
Sbjct: 225 HTCEAMARGGLHDQLAGGFARYSVDGRWAVPHFEKMLYDNALLLRVYTHLWRLTGDQLAR 284

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            + RD   +L  ++  PG    SA DAD+   EG T       YVWT  ++ + LGE   
Sbjct: 285 RVARDTARFLADELHRPGEGFASALDADTDGVEGLT-------YVWTPAQLVEALGEEDG 337

Query: 463 LFKEHYY-LKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE----KYLNI 517
            +    + +   G+      + P       +      D   S  +L   ++    +    
Sbjct: 338 RWAADLFDVTEEGSFTPHAAAPPGEALTAADA----TDQPTSVLRLARDVDDAAPEVRTR 393

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
             E   +L  VR  RP+P  DDKV+ +WNGL I++ A   ++    AE A    P   ++
Sbjct: 394 WQEVAHRLLVVRDARPQPARDDKVVAAWNGLAITAIAEFQQVAAGYAEDA----PGQDAN 449

Query: 578 RKEYMEVA------ESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLD 631
             E + +       ++A    + HL D +  R     R G  +A G L+DY  +      
Sbjct: 450 LMEGVTIVADGAMRDAAEHLAQVHLVDGRLRRTSRDGRVG--EAAGVLEDYGCVAEAFCA 507

Query: 632 LYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNS 691
           +++     +WLV A  L +   E F   + G +++T  +   ++ R  +  D A PSG S
Sbjct: 508 MHQVTGEGRWLVLAGRLLDVALERFAAPD-GSFYDTADDAERLVSRPADPTDNATPSGRS 566


>gi|288917991|ref|ZP_06412350.1| protein of unknown function DUF255 [Frankia sp. EUN1f]
 gi|288350646|gb|EFC84864.1| protein of unknown function DUF255 [Frankia sp. EUN1f]
          Length = 669

 Score =  371 bits (952), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 228/655 (34%), Positives = 329/655 (50%), Gaps = 54/655 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N+LA + SPYLLQHA NPVDW+ WG EAFAEA  R VP+ LS+GY++CHWCHVM  ESFE
Sbjct: 3   NKLAEQTSPYLLQHADNPVDWWPWGPEAFAEATARGVPVLLSVGYASCHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  +A  +N+ FV+IKVDREERPDVD VYM    AL G GGWP++VFL+P  +P   GTY
Sbjct: 63  DAQIAAYMNEHFVNIKVDREERPDVDSVYMDVTVALTGHGGWPMTVFLTPAAEPFFAGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE--ALSASASSNKLPDEL 280
           FPP  + G+  F  +L  V DAW ++R+ + ++GA    +L+E  AL    +  +   +L
Sbjct: 123 FPPRPRQGQTSFPQLLTAVSDAWTQRREEIEEAGADIARRLAEVVALPGGTAGGEGGPQL 182

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
             + L      L+  +D+R GGFG  PKFP  +  +++L H  +  D           +M
Sbjct: 183 GADLLDGAVAGLAGRFDARHGGFGPKPKFPPSMVAELLLRHWARTGD-------DRALEM 235

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
           V  T + MA+GGI+D + GGF RYSVD  W VPHFEKMLYD  QL  VYL  +  T    
Sbjct: 236 VRVTCERMARGGIYDQLAGGFARYSVDATWTVPHFEKMLYDNAQLLRVYLHLWRATGSAL 295

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAET-EGATRKKEGAFYVWTSKEVEDILGE 459
              + R+ +++L  D+  P G   SA DAD+    +     +EGA Y WT  ++ D+LG 
Sbjct: 296 AERVVRETVEFLLTDLRTPEGGFASALDADAVPAGQPNAHPEEGASYSWTPAQLADVLGP 355

Query: 460 HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
               +             +  +++      G +VL+   D    A               
Sbjct: 356 EDGAWA----------AGVLGVTEAGTFEHGTSVLMLPADPDDPAR------------FA 393

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
             R  L   RS RP+P  DDK++ +WN            I       A+   P   +   
Sbjct: 394 RVRSALAAARSSRPQPARDDKIVAAWN---------GLAIAALAEAGALLAEPAWIAAAT 444

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
              E+          HL+D +  R     R GP+   G L+DY  +  G L L++  +  
Sbjct: 445 RAAELLRDV------HLHDGRLWRTSRDGRRGPNA--GVLEDYGCVADGYLALHQVTADP 496

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
           +WL  A EL +     F   + GG+F+T  +  ++L R +E  D A PSG +     ++ 
Sbjct: 497 RWLTLAGELLDVVRARFAAPD-GGFFDTADDAEALLRRPRESSDSATPSGQAAVAGAMLT 555

Query: 700 LASIVAGSKSDYYRQNAEHSLAVFETRL-KDMAMAVPLMCCAADMLSVPSRKHVV 753
            A++   ++   +R  A  ++ +    L KD   A      A  +L+ P+   VV
Sbjct: 556 FAALTGSAE---HRDAAVATVGLLMPLLAKDARYAGWAGAVAEAVLAGPAEVAVV 607


>gi|291437584|ref|ZP_06576974.1| conserved hypothetical protein [Streptomyces ghanaensis ATCC 14672]
 gi|291340479|gb|EFE67435.1| conserved hypothetical protein [Streptomyces ghanaensis ATCC 14672]
          Length = 677

 Score =  371 bits (952), Expect = e-99,   Method: Compositional matrix adjust.
 Identities = 239/627 (38%), Positives = 332/627 (52%), Gaps = 62/627 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA E SPYLLQHA NPVDW+ W + AFAEAR+R+VP+ LS+GYS+CHWCHVM  ESFE
Sbjct: 3   NRLANETSPYLLQHADNPVDWWPWSDGAFAEARERNVPVLLSVGYSSCHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D   A  LN  FVS+KVDREERPDVD VYM  VQA  G GGWP++VFL+PD +P   GTY
Sbjct: 63  DRTTADYLNGHFVSVKVDREERPDVDAVYMEAVQAATGHGGWPMTVFLTPDAEPFYFGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLP--DEL 280
           FPPE ++G P F  +L+ +  AW ++RD +          L+     S    K+P   EL
Sbjct: 123 FPPEPRHGMPSFLQVLQGIHQAWQERRDEVTDVAGKITRDLA-GREISYGDAKVPGEQEL 181

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
            Q  L      L++ YD + GGFG APKFP  + ++ +L H  +   TG  G      +M
Sbjct: 182 AQALL-----GLTREYDPQRGGFGGAPKFPPSMVLEFLLRHHAR---TGAEG----ALQM 229

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
              T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  VY   +  T    
Sbjct: 230 AQDTCERMARGGIYDQLGGGFARYSVDRDWVVPHFEKMLYDNALLCRVYAHLWRATGSEL 289

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
              +  +  D++ R++  P G   SA DADS   +G  R  EGA+YVWT  ++ ++LGE 
Sbjct: 290 ARRVALETADFMVRELRTPEGGFASALDADS--DDGTGRHVEGAYYVWTPAQLREVLGEE 347

Query: 461 -AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-IELNDSSASASKLGMPLEKYLNIL 518
            A L   ++ +   G  +           +G +VL +   D    A++           +
Sbjct: 348 DADLAARYFGVTEEGTFE-----------EGASVLQLPQRDEVFDAAR-----------V 385

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
              R +L   R+ RP P  DDKV+ +WNGL +++ A                      DR
Sbjct: 386 DGVRERLLAARAARPAPGRDDKVVAAWNGLAVAALAETGAYF----------------DR 429

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAFLISGLLDLYEFGS 637
            + +E A +A   + R  +DE   R+  + ++G   A  G L+DYA +  G L L     
Sbjct: 430 PDLVEAAVAAGDLLVRLHFDEHA-RIARTSKDGHVGANAGVLEDYADVAEGFLALASVTG 488

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
              WL +A  L +     F D + G  ++T  +   ++ R ++  D A PSG S +   L
Sbjct: 489 EGVWLEFAGLLLDHVLARFTDPDSGALYDTAADAERLIRRPQDPTDNAVPSGWSAAAGAL 548

Query: 698 VRLASIVAGSKSDYYRQNAEHSLAVFE 724
           +   S  A + S+ +R  AE +L V +
Sbjct: 549 L---SYAAHTGSEPHRTAAERALGVVK 572


>gi|410479889|ref|YP_006767526.1| thioredoxin [Leptospirillum ferriphilum ML-04]
 gi|406775141|gb|AFS54566.1| conserved hypothetical protein containing a thioredoxin domain
           [Leptospirillum ferriphilum ML-04]
          Length = 699

 Score =  371 bits (952), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 231/679 (34%), Positives = 348/679 (51%), Gaps = 53/679 (7%)

Query: 94  TSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWC 153
           T H      NRL  E SPYL QHA NPVDW+ WG+EAF +AR  + P+ LSIGY+ CHWC
Sbjct: 4   TFHEGGIVANRLKEETSPYLRQHADNPVDWYPWGKEAFEKARLEEKPVLLSIGYAACHWC 63

Query: 154 HVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVY-MTYVQALYGGGGWPLSVFLSP 212
           HVM  ESFE   +A ++N++FV+IKVDREERPD+D++Y M +       GGWPL++FL+P
Sbjct: 64  HVMAHESFERPDIASVMNEFFVNIKVDREERPDLDQIYQMAHTMITRRNGGWPLTMFLTP 123

Query: 213 DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASAS 272
              P  GGTYFP + ++G PGF  +L +++D +   R+ L +     ++ L +    + S
Sbjct: 124 SQVPFAGGTYFPAQPRFGLPGFVQVLEQIRDFYRDHREGLEKEDHPILQYLGQTNPVADS 183

Query: 273 SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSG 332
                D  P  AL      L   +D  FGGFG APKFP  +++  +    ++ +  G S 
Sbjct: 184 REFELDLSPSEAL---VNNLKSRFDPEFGGFGGAPKFPHAMDLSYLF---RRFQRKGDST 237

Query: 333 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 392
            A     M   TL  M +GGI D VGGGF RYSVDERW +PHFEKMLYD   L       
Sbjct: 238 AA----HMATLTLSSMKRGGIWDQVGGGFARYSVDERWLIPHFEKMLYDNALLLEALSLG 293

Query: 393 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 452
            S++K+  YS    +++ +L R+M    G  +S+ DADS   EG    +EG FYV+ ++E
Sbjct: 294 ASVSKNPVYSRTAEELVGWLFREMRSDDGVYYSSLDADS---EG----EEGRFYVFQAEE 346

Query: 453 VEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV-LIELNDSSASASKLGMPL 511
           V  IL +        YY           +S P N F+G    L E       + +  +  
Sbjct: 347 VRSILSDEEYRVVSKYY----------GLSGPPN-FEGHAWNLYEARSIGELSKEFHLSE 395

Query: 512 EKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNF 571
                 +   R+KLF  RS R RP LDDKV+ SWN L+              A++ +F+ 
Sbjct: 396 SDIERRIESARQKLFAYRSTRVRPGLDDKVLASWNALM--------------AKALLFSG 441

Query: 572 PVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLD 631
            ++G  ++E++        ++ R ++  +   L   +       P +LDDYAFL+  +L+
Sbjct: 442 RILG--KQEWISAGRKTIDYMHRKMW--KNGLLMAVYSKKEPFLPAYLDDYAFLLLAVLE 497

Query: 632 LYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNS 691
                   + L +A  + +     F D E GG++ T     +++ R K  HDGA PSGN+
Sbjct: 498 SMRIDFRPEDLSFATTIADVLLAEFYDPESGGFYFTGKNHEALIHRPKNGHDGALPSGNA 557

Query: 692 VSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKH 751
            +V  L+ L ++        Y   A+ +L ++  ++K+       M  A +  S    + 
Sbjct: 558 AAVQGLLWLGTLTGHLP---YTSAADKTLRLYFAQMKEQPAGYTTMISALETYS--DSQP 612

Query: 752 VVLVGHKSSVDFENMLAAA 770
           VV +    + D+++ ++  
Sbjct: 613 VVFLAGPQAGDWKDKISCG 631


>gi|386845926|ref|YP_006263939.1| Spermatogenesis-associated protein 20 [Actinoplanes sp. SE50/110]
 gi|359833430|gb|AEV81871.1| Spermatogenesis-associated protein 20 [Actinoplanes sp. SE50/110]
          Length = 663

 Score =  370 bits (951), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 226/608 (37%), Positives = 319/608 (52%), Gaps = 63/608 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   SPYL QH  NPVDW+ W  EAFAEAR+R+VP+ +S+GY+ CHWCHVM  ESFE
Sbjct: 3   NRLANATSPYLQQHRDNPVDWWEWSAEAFAEARRREVPVLISVGYAACHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+ VA  LN  FV+IKVDREERPDVD VYMT  QA+ G GGWP++VF +PD  P   GTY
Sbjct: 63  DDAVAAQLNADFVAIKVDREERPDVDAVYMTATQAMTGQGGWPMTVFATPDGDPFYCGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP +       F  +L  V  AW  +RD + + GA  ++ +  A +       +  E+  
Sbjct: 123 FPKQQ------FTRLLTSVTAAWRDERDGVLKQGAAVVQAVGGAQAVGGPVAAVTAEMLA 176

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
            A    A++    +D  +GGFG APKFP  + +  +L H   LE TG    ++E  ++V 
Sbjct: 177 AAAAGLAQE----HDQTYGGFGGAPKFPPHMNLLFLLRH---LERTG----SAEALELVR 225

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            T + MA+GGI+D + GGF RY+VDE W VPHFEKMLYD   L  VY   + LT DV   
Sbjct: 226 HTAERMARGGIYDQLAGGFARYAVDEHWTVPHFEKMLYDNALLLRVYTQLWRLTGDVPAR 285

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            +  +  ++L RD+  P G + SA DAD+   EG T       Y WT  E+ ++LG    
Sbjct: 286 RVADETAEFLLRDLATPAGGLASALDADTDGVEGLT-------YAWTPAELTEVLGPDDG 338

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFK-GKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
            +            DL R++ P   F+ G++VL+   D  A+   L   ++++ ++    
Sbjct: 339 AWA----------ADLFRVT-PDGTFEHGRSVLVLARDIDAADPAL---VDRWRDV---- 380

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           R +L D R KRP+P  DDKV+ SWNGL I++ A    +  S A                 
Sbjct: 381 RARLLDARGKRPQPARDDKVVASWNGLAITALAEHGALTGSTASREAAV----------- 429

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP-GFLDDYAFLISGLLDLYEFGSGTK 640
                 A     RHL D    RL+   R+G    P G L+DY  +    L +++  +  +
Sbjct: 430 ----ALAGVLADRHLID---GRLRRVSRDGVVGDPAGVLEDYGCVAEAFLAVHQITADPR 482

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
           W   A  L +     F     GG+++T  +   ++ R  +  D A PSG +     LV  
Sbjct: 483 WSRLAGRLLDVALARF-GTGSGGFYDTADDAEKLVTRPADPTDNATPSGLAAVCAALVTY 541

Query: 701 ASIVAGSK 708
           A++   ++
Sbjct: 542 AALTGETR 549


>gi|11499326|ref|NP_070565.1| hypothetical protein AF1737 [Archaeoglobus fulgidus DSM 4304]
 gi|2648814|gb|AAB89512.1| conserved hypothetical protein [Archaeoglobus fulgidus DSM 4304]
          Length = 642

 Score =  370 bits (950), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 223/623 (35%), Positives = 326/623 (52%), Gaps = 64/623 (10%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRL    SPYL + A+ PV+WF WGEEAFA+A+K D PI LSIG   CHWCHVM  ESF
Sbjct: 2   VNRLINSRSPYLRKAANQPVEWFEWGEEAFAKAKKEDKPILLSIGGVWCHWCHVMAKESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           E+E +A+++N  FV+IKVDR+ERPD+DK Y  +V A  G GGWPL+VFL+PD KP  GGT
Sbjct: 62  ENEEIAEMINRNFVAIKVDRDERPDIDKRYQEFVMATTGSGGWPLTVFLTPDGKPFFGGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFPPED+Y  PGFKT+LRK+ + W   R+ L +S     E+L+EA+   A  +    ++ 
Sbjct: 122 YFPPEDRYHLPGFKTVLRKIAEMWRHDRERLLKSA----EELTEAVRRYAEGS-FKGDVD 176

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           +  L    E +    D   GGFGSAPKF     ++++L H     D        E  K  
Sbjct: 177 EKLLDKGIEAVLDQTDYVNGGFGSAPKFHHAKAVELLLTHHFFTGD-------EEVLKAA 229

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             TL  MA+GGI+DH+ GGF RYS D +W  PH+EKMLYD  +L  +Y  A++LT    Y
Sbjct: 230 EITLDAMARGGIYDHLLGGFFRYSTDAKWVTPHYEKMLYDNAELLYLYSIAYALTGKRLY 289

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHA 461
             I   I++Y R+      G  ++++DAD  E +      EG +Y+++ +E+++IL E  
Sbjct: 290 QKIADGIVEYYRKFGCSNEGGFYASQDADIGELD------EGGYYLFSDRELKEILDERE 343

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASK-LGMPLEKYLNILGE 520
                 YY                 + +G+  L  +  +    SK LG+ +E+    +  
Sbjct: 344 FRIATLYY-----------------DIQGERKLPRIFLTEEEISKILGVSVEEVERAVNS 386

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            RRK+ + R +R  P++D  +   WNGL+I +     K+                     
Sbjct: 387 ARRKMLEFREQREMPYIDTTIYAGWNGLMIEALCMHHKVFGDNWS--------------- 431

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
            +E+AE  A+ + +  +D     L H+         G  +DY F   GLL L+E     +
Sbjct: 432 -LEMAEKTANRLLKEFWD--GRELLHT-----HNVEGLSEDYIFFARGLLALFEVTQRHE 483

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
           +L    E+ ++  E F D E GG+F++  E   + +R+K  HD    S N  +   L+ L
Sbjct: 484 YLEKCFEIVDSAVEKFWDGEDGGFFDS--ERAVLGIRLKNFHDSPTQSVNGSAPQLLLAL 541

Query: 701 ASIVAGSKSDYYRQNAEHSLAVF 723
           ++I    +   Y + A   L  F
Sbjct: 542 SAITGERR---YEELAVEGLRTF 561


>gi|158312686|ref|YP_001505194.1| hypothetical protein Franean1_0830 [Frankia sp. EAN1pec]
 gi|158108091|gb|ABW10288.1| protein of unknown function DUF255 [Frankia sp. EAN1pec]
          Length = 669

 Score =  370 bits (949), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 244/660 (36%), Positives = 331/660 (50%), Gaps = 64/660 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N+LA + SPYLLQHA NPVDW+ WG EAFAEA  R VP+ LS+GY+ CHWCHVM  ESFE
Sbjct: 3   NKLAEQTSPYLLQHADNPVDWWPWGPEAFAEATTRGVPVLLSVGYAACHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  +A  +N  FV+IKVDREERPDVD VYM    AL G GGWP++VFL+P  +P   GTY
Sbjct: 63  DPEIAAYMNQHFVNIKVDREERPDVDSVYMDVTVALTGHGGWPMTVFLTPAAEPFFAGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSE--ALSASASSNKLPDEL 280
           FPP    G   F  ++  + DAW  +R  + QSGA    QL+E  A   +AS      ++
Sbjct: 123 FPPRPMRGSASFPQVMAAIVDAWTARRAEVEQSGADIARQLAEAVAPGGAASGGGATTQI 182

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
             + L      L+  +DS  GGFG APKFP  +  +M+L    +  D    G       M
Sbjct: 183 TADLLDRAVAGLADRFDSVHGGFGGAPKFPPSMVAEMLLRSWARTGDGRALG-------M 235

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
           V  T + MA+GG++D +GGGF RYSVDE W VPHFEKMLYD  QL  VYL  +  T    
Sbjct: 236 VRETCERMARGGMYDQLGGGFARYSVDESWTVPHFEKMLYDNAQLLRVYLHLWRATGLPL 295

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADS--AETEGATRKKEGAFYVWTSKEVEDILG 458
              + R+   +L  D+  P G   SA DAD+  A + G    +EGA Y WT  ++ D+LG
Sbjct: 296 AERVVRETAAFLLADLRTPEGGFASALDADAVPAGSPGG-HPEEGASYSWTPAQLVDVLG 354

Query: 459 -EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
            +   L      +   G+ +            G +VL+   D    A             
Sbjct: 355 PDDGALAARVLGVTAEGSFE-----------HGTSVLMLPADPEDPARFA---------- 393

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
               R  L   R+ RP+P  DDK++ +WNGLVI + A A  +L                 
Sbjct: 394 --RVRAALAAARATRPQPARDDKIVAAWNGLVIGALAEAGALLGE--------------- 436

Query: 578 RKEYMEVAESAASFIRR-HLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
              ++  AE AA  +R  HL++ +  R     R GP+   G L+DY  +  G L L++  
Sbjct: 437 -PSWVGAAERAAELLRDVHLHEGRLWRTSRDGRRGPNA--GVLEDYGCVAEGFLTLHQVT 493

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
               WL  A EL +     F   + GGYF+T  +  ++L R ++  D A PSG +     
Sbjct: 494 GAAGWLALAGELLDVVRARFAAPD-GGYFDTADDAEALLRRPRDASDSATPSGQAAVAGA 552

Query: 697 LVRLASIVAGSK-SDYYRQNAEHSLAVF--ETRLKDMAMAVPLMCCAADMLSVPSRKHVV 753
           L+  A++   +   D  R   E    +   + R    A AV     A  +L+ P+   VV
Sbjct: 553 LLTYAALTGSADHRDSARATVEQLTPLLSRDARFAGWAGAV-----AEALLAGPAEVAVV 607


>gi|403380657|ref|ZP_10922714.1| hypothetical protein PJC66_12642 [Paenibacillus sp. JC66]
          Length = 547

 Score =  370 bits (949), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 226/594 (38%), Positives = 327/594 (55%), Gaps = 53/594 (8%)

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           M  ESFEDE VA  LN  ++++KVDREERPDVDK+YM+  QA+ G GGWPL+V ++PD K
Sbjct: 1   MAQESFEDEKVAAWLNAHYIAVKVDREERPDVDKLYMSVCQAMTGQGGWPLTVLMTPDKK 60

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           P   GTYFP   +YG+PG   I+ +V   W ++R+ L        E+++E +  +     
Sbjct: 61  PFFVGTYFPKTSQYGKPGVIDIVSQVHQKWTEQREELLDIA----EEIAETVR-NRQETA 115

Query: 276 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
           L  EL  + L +  E  S+++DS++GGFG APKFP P ++  +L + K+   TG+     
Sbjct: 116 LSGELSADMLDMAYELFSQAFDSQYGGFGDAPKFPSPHQLSFLLRYYKR---TGEQDALD 172

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
             +K    TL+ M +GG++DH+G GF R S DERW VPHFEKMLYD   LA VYL+A+ +
Sbjct: 173 MAEK----TLEGMHRGGMYDHIGYGFARCSADERWLVPHFEKMLYDNALLAAVYLEAYEV 228

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           T    Y+ I   I  Y++RDM    G  FSAE + S   EGA    E  FY+WT +EV  
Sbjct: 229 TGKQEYAEIAEQIFAYVKRDMTSSEGFFFSAEGSHS---EGA----EEQFYLWTPEEVNA 281

Query: 456 ILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG-MPLEK 513
           +LGE    LF + + ++  G  D            G +V   L  + ++ ++L  M   +
Sbjct: 282 VLGEEDGELFCDVFDIQEDGPVD------------GYSVPNLLGLTRSTFARLQRMDPAE 329

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
               L   R KLF  R +R RPH DDK++ +WNGL+I + A+ +K+L+            
Sbjct: 330 RERRLERSRVKLFQHRERRARPHKDDKMLTAWNGLMIMALAKGAKVLQ------------ 377

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
               + E+ + A+ A  FI + L  E   RL   +R+G +  P +LDDYAFL+ GL++LY
Sbjct: 378 ----KAEHADAAQKAVGFILQRLVREDG-RLLARYRDGDAAIPAYLDDYAFLVWGLIELY 432

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           E    T++L  A+        LF D E GG++ +  +   +L R KE HDG  PSGNS +
Sbjct: 433 EATRETEYLHQAVRFNQEMIRLFWDDESGGFYFSGIDGEKLLARSKEIHDGDMPSGNSAA 492

Query: 694 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVP 747
            +NL+RLAS+   +K     Q A   L  F   ++       +  CA D +  P
Sbjct: 493 AMNLLRLASLTEDTK---LLQLAHRQLRSFAAVVEQYPAGFSMYLCALDSILPP 543


>gi|414164591|ref|ZP_11420838.1| hypothetical protein HMPREF9697_02739 [Afipia felis ATCC 53690]
 gi|410882371|gb|EKS30211.1| hypothetical protein HMPREF9697_02739 [Afipia felis ATCC 53690]
          Length = 684

 Score =  369 bits (948), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 226/612 (36%), Positives = 321/612 (52%), Gaps = 65/612 (10%)

Query: 101 HTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVES 160
           H NRLA E SPYLLQH HNPVDW+ WG  A AEA+K   PI LSIGY+ CHWCHVM  ES
Sbjct: 7   HKNRLAGETSPYLLQHQHNPVDWWPWGPPALAEAQKTGKPILLSIGYAACHWCHVMAHES 66

Query: 161 FEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGG 220
           FEDE  A ++N+ FV+IKVDREERPD+D++YM  +  L   GGWPL++FL+PD  P+ GG
Sbjct: 67  FEDEATAAVMNEQFVAIKVDREERPDIDQIYMNALHLLGQQGGWPLTMFLTPDGAPIWGG 126

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFP + +YGR  F  ++++    +  + D +A +       L+E  SA  +S  L    
Sbjct: 127 TYFPKQAQYGRASFIDVMQQFMRIYRDEPDKIAANKEAIARSLNERHSADTASIGL---- 182

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
             N L   A  ++++ D   GG   APKFP+             LE   ++G  +  ++ 
Sbjct: 183 --NELDNAAGSIARATDPDNGGLRGAPKFPQ----------CSMLEFLWRAGARTGDERY 230

Query: 341 VLFT---LQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
            + T   L  M++GGI+DH+GGG+ RYSVDERW VPHFEKMLYD  Q+ ++     +   
Sbjct: 231 FITTNLALTRMSQGGIYDHLGGGYARYSVDERWLVPHFEKMLYDNAQILDMLALEHARAP 290

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           +  Y     + + +L+R+M+   G   S+ DADS   EG    +EG FYVW+  ++  +L
Sbjct: 291 NELYLQRAEETVGWLKREMLTKEGGFSSSLDADS---EG----EEGRFYVWSQSDIAQLL 343

Query: 458 G-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
           G + A  F   Y +   GN            F+G N+L  L+D S +A++          
Sbjct: 344 GPDDATFFAAKYGVSAEGN------------FEGHNILNRLDDGSDTATE--------AE 383

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
            L   R  LF  R KR  P LDDKV+  WNGL+I++             +  FN      
Sbjct: 384 QLAALRAILFRAREKRVHPGLDDKVLADWNGLMIAA---------LAHAAGAFN------ 428

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
            R +++ +A +   F+   +   +  RL HS+R G    P    D A +I   L L+E  
Sbjct: 429 -RPDWLTLACTVFGFVTTTM--SRHDRLGHSWRAGKLLQPALASDNAAMIRAALALHEAT 485

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
               +L  AI  Q   D  + D + GGYF T  +   ++LR     D A P+   ++  N
Sbjct: 486 GDHLFLDQAILWQADLDTHYGDPQHGGYFLTADDAEGLILRPHSSVDDAIPNHIGLTAQN 545

Query: 697 LVRLASIVAGSK 708
           L RLA +    +
Sbjct: 546 LARLAVLTGDER 557


>gi|144899665|emb|CAM76529.1| Protein of unknown function DUF255 [Magnetospirillum
           gryphiswaldense MSR-1]
          Length = 650

 Score =  369 bits (948), Expect = 3e-99,   Method: Compositional matrix adjust.
 Identities = 228/631 (36%), Positives = 319/631 (50%), Gaps = 67/631 (10%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRLA E SPYL QH  NPV W++WG+ A AEA     P+ LSIGYS CHWCHVM  ESF
Sbjct: 7   TNRLAGETSPYLRQHQDNPVHWWSWGDAALAEAHSSGRPLLLSIGYSACHWCHVMAHESF 66

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           E+  +A L+N  FV++K+DREERPD+D +Y   +Q +   GGWPL++F +PD KP  GGT
Sbjct: 67  ENPEIAALMNRLFVNVKIDREERPDLDAIYQQALQHMGQHGGWPLTMFCTPDGKPFWGGT 126

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFPP  +YGRPGF  +L+ + D W + RD +  +    +  L EAL+     +  P  L 
Sbjct: 127 YFPPAPRYGRPGFPEVLQAIHDLWQRDRDRVDHN----VAALVEALAHDGGGDASP--LT 180

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
              L   A+ +    D   GG G APKFP+P     +   +K+   TG SG      + V
Sbjct: 181 LEMLDRGAKAILSHVDMEHGGLGGAPKFPQPGLFDYLWRSAKR---TGNSGL----HQAV 233

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             TL  + +GGI DH+GGGF RYS D+ W  PHFEKMLYD GQL ++    +  T++  +
Sbjct: 234 TLTLDRICQGGITDHLGGGFMRYSTDDVWLAPHFEKMLYDNGQLIDLLTLVWQDTQNPLF 293

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EH 460
                + + ++ R+M+    E  +   A  A++EG     EG FY W ++E+ D+LG E 
Sbjct: 294 QTRIEECITWVSREML---AEGAAFAAALDADSEG----HEGRFYTWKAQEIIDLLGPET 346

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
           A +F + Y +   GN            ++G N+   LN S            ++   L +
Sbjct: 347 ARIFAQAYDVSIQGN------------WEGVNI---LNRSKPQG-------HEHEEQLAQ 384

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R  L   R+ R RP  DDKV+  WNG++I+  ARA  +                  R +
Sbjct: 385 ARTILLAARANRIRPGRDDKVLADWNGMMIAGLARAGFVFI----------------RPD 428

Query: 581 YMEVAESAASFI--RRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
           ++++AE A + I  +  L D+   RL HS     +   GF DD A +    L LY+    
Sbjct: 429 WLDMAERAFAVITDKMTLADD---RLAHSLCQEQASHVGFADDLAHMARAALALYQATGK 485

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
             +L WA       D    D+  GGYF        V++R K   D A PS N   V  L 
Sbjct: 486 ADYLTWAETWVAAADRHHWDKAKGGYFQVAHSASDVIVRTKTVMDAAVPSANGTMVQVLA 545

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFETRLKD 729
            LA I   +    Y   A+  + VF  +  D
Sbjct: 546 ILAQI---TDKPAYADRAQAVVTVFMDQFND 573


>gi|75906768|ref|YP_321064.1| hypothetical protein Ava_0545 [Anabaena variabilis ATCC 29413]
 gi|75700493|gb|ABA20169.1| Protein of unknown function DUF255 [Anabaena variabilis ATCC 29413]
          Length = 711

 Score =  369 bits (948), Expect = 3e-99,   Method: Compositional matrix adjust.
 Identities = 242/664 (36%), Positives = 341/664 (51%), Gaps = 72/664 (10%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRLA   S YL +HA NP+DW+ W +EA A A+ +D PIFLSIGYS+CHWC VME E+F
Sbjct: 28  TNRLAQTKSLYLRKHAENPIDWWPWCDEALATAKSQDKPIFLSIGYSSCHWCTVMEGEAF 87

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGG 220
            D+ +A+ +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL+VFLSP DL P   G
Sbjct: 88  SDQAIAEYMNANFLPIKVDREERPDIDSIYMQALQMMSGQGGWPLNVFLSPEDLVPFYAG 147

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL--SEALSASASSNKLPD 278
           TYFP E KY RPGF  +L  ++  +D +++ L Q  A  +E L  S  L   A+      
Sbjct: 148 TYFPLEPKYNRPGFLQVLEALRRYYDTEKEDLRQRKALIVESLLTSAVLKGEATQEAEES 207

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKS-GEASEG 337
           EL ++        +++   + +G       FP      M+ Y    L  T  +     EG
Sbjct: 208 ELLRSGWETNTGVITR---NEYGN-----SFP------MIPYAELALRGTRFNFASRYEG 253

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL-T 396
           +++       +A GGI+DHV GGFHRY+VD  W VPHFEKMLYD GQ+     + +S   
Sbjct: 254 EQISTQRGLDLALGGIYDHVAGGFHRYTVDPTWTVPHFEKMLYDNGQIVEYLANLWSAGV 313

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
           ++  ++      + +L+R+M  P G  ++A+DADS  T   T  +EGAFYVW+  E+E +
Sbjct: 314 QEPSFARAVTGTVAWLQREMTAPAGYFYAAQDADSFTTPTDTEPEEGAFYVWSYAELEQL 373

Query: 457 LGEHAIL-FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA---SKLGM--- 509
           L    +   ++ + + P GN            F+GKNVL   +    SA   + LG    
Sbjct: 374 LTPTELTELQQQFTVSPQGN------------FEGKNVLQRRHQWELSATIETALGKLFV 421

Query: 510 --------PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 561
                    LE +         K      + P    D K+IV+WN L+IS  ARA     
Sbjct: 422 ARYGSAADTLETFPPAQDNQEAKTTHWPGRIPSV-TDTKMIVAWNSLMISGLARA----- 475

Query: 562 SEAESAMFNFPVVGSDRKEYMEVAESAASFI-RRHLYDEQTHRLQHSFRNGPSKAPGFLD 620
               +A+F  P+ G       E+A  AA+FI      D + +RL +    G +      +
Sbjct: 476 ----AAVFQQPLAG-------ELAAKAANFILENQFVDGRFYRLNY---RGEAAVLAQSE 521

Query: 621 DYAFLISGLLDLYEFGSGTK-WLVWAIELQNTQDELFLDREGGGYFNTTGE-DPSVLLRV 678
           DYA  I  LLDL+      + WL  AI LQ   DE     E GGYFNT  +    +++R 
Sbjct: 522 DYALFIKALLDLHAATPENRFWLEKAIALQQQFDEFLWSIELGGYFNTASDASQDLIIRE 581

Query: 679 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 738
           +   D A PS N V++ NLVRL+ +   +   +Y   AE  L  F+T +     A P + 
Sbjct: 582 RSYMDNATPSANGVAIANLVRLSLL---TDDLHYLDLAEAGLKAFKTVMSSAPQACPSLF 638

Query: 739 CAAD 742
            A D
Sbjct: 639 TALD 642


>gi|427707072|ref|YP_007049449.1| hypothetical protein Nos7107_1658 [Nostoc sp. PCC 7107]
 gi|427359577|gb|AFY42299.1| hypothetical protein Nos7107_1658 [Nostoc sp. PCC 7107]
          Length = 685

 Score =  369 bits (947), Expect = 3e-99,   Method: Compositional matrix adjust.
 Identities = 239/669 (35%), Positives = 348/669 (52%), Gaps = 82/669 (12%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRLA   S YL +HA NP+DW+ W +EA A A+  + PIFLSIGYS+CHWC VME E+F
Sbjct: 2   TNRLAQAQSLYLRKHAENPIDWWPWCDEALATAKAENKPIFLSIGYSSCHWCTVMEGEAF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGG 220
            D  +A  +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL+ FLSP DL P   G
Sbjct: 62  SDGAIADYMNTNFLPIKVDREERPDIDSIYMQALQMMTGQGGWPLNTFLSPEDLVPFYAG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFP + +YGRPGF  +L+ ++  +D +++ L Q  A  ++ L   L+++   N  P E+
Sbjct: 122 TYFPVDPRYGRPGFLQVLQALRRYYDTEKEDLRQRKAVILDSL---LTSAVLQNSDPQEV 178

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGS---APKFPRPVEIQMMLYHSKKLEDTGKSGEAS-E 336
            ++ L      L K +++  G   S      FP      M+ Y    L  T  +  +  +
Sbjct: 179 QEHEL------LGKGWETSTGIITSNQYGNSFP------MIPYSELALRGTRFNLPSRYD 226

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL- 395
           G+++       +A GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ+     + +S  
Sbjct: 227 GKQICTQRGLDLALGGIYDHVGGGFHRYTVDPTWTVPHFEKMLYDNGQIVEYLANLWSAG 286

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
            ++  ++      + +L+R+MI P G  ++A+DADS     A   +EGAFYVW+  ++E 
Sbjct: 287 IQEPAFARAIAGTVQWLQREMIAPEGYFYAAQDADSFTNSDAVEPEEGAFYVWSYSDLEQ 346

Query: 456 IL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 514
           +L  E     ++ + +   GN            F+  NVL   N       +L   +E+ 
Sbjct: 347 LLTSEELTQLQQEFTVSSQGN------------FESLNVLQRRN-----VGQLSAEIERI 389

Query: 515 LNILGECRR-------KLFDV--RSKRPRPH---------LDDKVIVSWNGLVISSFARA 556
           L  L   R        K+F     ++  + H          D K+IV+WN L+IS  ARA
Sbjct: 390 LAKLFTARYGDKAESLKIFPPARNNQEAKTHNWPGRIPSVTDTKMIVAWNSLMISGLARA 449

Query: 557 SKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSFRNGPSKA 615
             +         F  P+       Y+E+A  AA+FI  H + D + HRL +    G +  
Sbjct: 450 GGV---------FQEPL-------YLELAAQAANFILEHQFVDGRFHRLNY---QGEATV 490

Query: 616 PGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGE-DPS 673
               +DYAF I  LLDL        +WL  AI +Q   DE     E GGYFNT+ +    
Sbjct: 491 LAQSEDYAFFIKALLDLQACSPDDQQWLENAIAIQAEFDEFLWSVELGGYFNTSSDASQD 550

Query: 674 VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA 733
           +++R +   D A PS N V++ NLVRL+ +   + + +Y   AE  L  F + +     A
Sbjct: 551 LIIRERSYTDNATPSANGVAIANLVRLSLL---TDNLHYLDLAEQGLKAFRSVMSSHPQA 607

Query: 734 VPLMCCAAD 742
            P +  A D
Sbjct: 608 CPSLFTALD 616


>gi|17228732|ref|NP_485280.1| hypothetical protein all1237 [Nostoc sp. PCC 7120]
 gi|17130584|dbj|BAB73194.1| all1237 [Nostoc sp. PCC 7120]
          Length = 685

 Score =  369 bits (947), Expect = 4e-99,   Method: Compositional matrix adjust.
 Identities = 241/668 (36%), Positives = 336/668 (50%), Gaps = 80/668 (11%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRLA   S YL +HA NP+DW+ W +EA A A+ +D PIFLSIGYS+CHWC VME E+F
Sbjct: 2   TNRLAQTKSLYLRKHAENPIDWWPWCDEALATAKTQDKPIFLSIGYSSCHWCTVMEGEAF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGG 220
            D+ +A  +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL+VFLSP DL P   G
Sbjct: 62  SDQAIADYMNANFLPIKVDREERPDIDSIYMQALQMMSGQGGWPLNVFLSPEDLVPFYAG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL--SEALSASASSNKLPD 278
           TYFP E KY RPGF  IL  ++  +D +++ L Q  A  +E L  S  L   A+      
Sbjct: 122 TYFPIEPKYNRPGFLQILEALRRYYDTEKEDLRQRKALIVESLLTSAVLKGEATQEAEES 181

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKS-GEASEG 337
           EL +         +++   + +G       FP      M+ Y    L  T  +     +G
Sbjct: 182 ELLKRGWETNTSVITR---NEYGN-----SFP------MIPYAELALRGTRFNFASRYDG 227

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL-T 396
           Q++       +A GGI+DHV GGFHRY+VD  W VPHFEKMLYD GQ+     + +S   
Sbjct: 228 QQVSTQRGLDLALGGIYDHVAGGFHRYTVDPTWTVPHFEKMLYDNGQIVEYLANLWSAGV 287

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
           K+  ++      + +L+R+M  P G  ++A+DADS  T      +EGAFYVW+  E+E +
Sbjct: 288 KEPAFARAVTGTVVWLQREMTAPAGYFYAAQDADSFTTPTDVEPEEGAFYVWSYAELEQL 347

Query: 457 LGEHAIL-FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
           +    +   ++ + + P GN            F+GKNVL           +LG  +E  L
Sbjct: 348 VTPTELTELQQQFTVSPQGN------------FEGKNVL-----QRRQPGELGATIETAL 390

Query: 516 NILGECRR-KLFDVRSKRPRPH-----------------LDDKVIVSWNGLVISSFARAS 557
             L   R     D     P                     D K+IV+WN L+IS  ARA+
Sbjct: 391 GKLFAARYGSAADTLETFPPAQDNQEAKTTHWPGRIPSVTDTKMIVAWNSLMISGLARAA 450

Query: 558 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFI-RRHLYDEQTHRLQHSFRNGPSKAP 616
            +         F  P+ G       E+A  AA+FI      D + HRL +    G +   
Sbjct: 451 GV---------FQQPLAG-------ELAAKAANFILENQFVDGRFHRLNY---RGEAAVL 491

Query: 617 GFLDDYAFLISGLLDLYEFGSGTK-WLVWAIELQNTQDELFLDREGGGYFNTTGE-DPSV 674
              +DYA  I  LLDL+      + WL  AI LQ+  DE     E GGYFNT  +    +
Sbjct: 492 AQSEDYALFIKALLDLHTAEPENRFWLEKAIALQHQFDEFLWSIELGGYFNTASDASQDL 551

Query: 675 LLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV 734
           ++R +   D A PS N V++ NLVRL+ +   +   +Y   AE  L  F++ +     A 
Sbjct: 552 IIRERSYMDNATPSANGVAIANLVRLSLL---TDDLHYLDLAEQGLKAFKSVMSSAPQAC 608

Query: 735 PLMCCAAD 742
           P +  A D
Sbjct: 609 PSLFTALD 616


>gi|312138733|ref|YP_004006069.1| hypothetical protein REQ_12910 [Rhodococcus equi 103S]
 gi|311888072|emb|CBH47384.1| conserved hypothetical protein [Rhodococcus equi 103S]
          Length = 674

 Score =  369 bits (946), Expect = 5e-99,   Method: Compositional matrix adjust.
 Identities = 226/626 (36%), Positives = 321/626 (51%), Gaps = 63/626 (10%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
            +  N L    SPYL QHA NPV W  WG +A A AR+RDVP+ LSIGY+ CHWCHVM  
Sbjct: 6   GRERNTLGEATSPYLRQHADNPVHWHQWGPDALAWARERDVPVLLSIGYAACHWCHVMAH 65

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFED+  A ++N+ FV IKVDREERPD+D VYM    A+ G GGWP++ FL+PD  P  
Sbjct: 66  ESFEDDATAAVMNEHFVCIKVDREERPDLDAVYMNATVAMTGQGGWPMTCFLTPDGAPFY 125

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTY+P E + G P F  +L  V D W  +R  +  + A  + +L  + S +  +   P 
Sbjct: 126 CGTYYPREPRGGMPSFVQLLHAVTDTWRSRRGDVDDAAASVVAELRRS-SGALPAGGAPI 184

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
           ++P   L      + +  D   GGFG APKFP  + ++ +L   ++         A    
Sbjct: 185 DVPL--LSGAVANVLRDEDRDHGGFGGAPKFPPSMLLEGLLRSYERT-------SAGPTL 235

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
           + V  T + MA+GGI+D +GGGF RYSVD +W VPHFEKMLYD   L   Y      T  
Sbjct: 236 RAVERTAEAMARGGIYDQLGGGFARYSVDTQWVVPHFEKMLYDNALLVRFYAHLARRTGS 295

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
                +  + +D+L RD+    G   SA DAD       T  +EG  Y WT +++ D++G
Sbjct: 296 ALARRVTEETVDFLLRDLRTAAGAFASALDAD-------TDGEEGLTYAWTPQQIADVVG 348

Query: 459 -EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
            +      E + +  TG  +           +G +VL    D          PL+   + 
Sbjct: 349 DDDGRWAAETFAVTDTGTFE-----------RGTSVLQLPAD----------PLDA--DR 385

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
           L + R +L   R++RP+P  DDKV+ +WNGL I++ A A   L                 
Sbjct: 386 LADVRSRLLAARTRRPQPARDDKVVTAWNGLAITALAEAGAALG---------------- 429

Query: 578 RKEYMEVAESAASFI-RRHLYDEQTHRLQHSFRNGPSKAP-GFLDDYAFLISGLLDLYEF 635
           R +++E AE  A  +   HL D    RL+ +   G    P G L+DY  L +GL  L++ 
Sbjct: 430 RADWVEAAEECAHMVLSTHLVD---GRLRRASLGGTVGEPAGILEDYGALAAGLSTLHQV 486

Query: 636 GSGTKWLVWAIELQNTQDELFLD-REGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
               +WL  A  L +T  + F D  E G +F+T  +  +++ R ++  DGA PSG SV+ 
Sbjct: 487 TGAAEWLEAATGLLDTAIDHFADPDEPGSWFDTADDAETLVARPRDPLDGATPSGASVTT 546

Query: 695 INLVRLASIVAGSKSDYYRQNAEHSL 720
             L+  +S+VA  +S  Y   A  SL
Sbjct: 547 EALLTASSLVAADRSARYAVAAADSL 572


>gi|86742579|ref|YP_482979.1| hypothetical protein Francci3_3900 [Frankia sp. CcI3]
 gi|86569441|gb|ABD13250.1| protein of unknown function DUF255 [Frankia sp. CcI3]
          Length = 673

 Score =  368 bits (945), Expect = 6e-99,   Method: Compositional matrix adjust.
 Identities = 243/663 (36%), Positives = 336/663 (50%), Gaps = 66/663 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N+LA + SPYLLQHA NPVDW+ W   AFAEA +R VP+ LS+GY++CHWCHVM  ESFE
Sbjct: 3   NKLAEQTSPYLLQHADNPVDWWPWSPAAFAEAARRGVPVLLSVGYASCHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D   A+ +ND FV+IKVDREERPDVD VYM    AL G GGWP++VFL+P  +P   GTY
Sbjct: 63  DAATAEYMNDHFVNIKVDREERPDVDSVYMDVTVALTGHGGWPMTVFLTPTAEPFFAGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPP  + G   F+ +L  V +AW  +RD + +SGA    +L+EA +   +S  L  E+  
Sbjct: 123 FPPRPRPGMGSFRQVLTAVTEAWRTRRDEIEESGADIARRLAEAATRGPASG-LAAEITP 181

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L      LS  +D+R GGFG APKFP  +  +M+L HS +  D       +   +MV 
Sbjct: 182 ALLDTAVAGLSARFDARHGGFGGAPKFPPSMVAEMLLRHSARTGD-------ARSLEMVA 234

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            T + MA+GGI+D + GGF RYSVD  W VPHFEKMLYD   L  VYL  +  T      
Sbjct: 235 VTCERMARGGIYDQLAGGFARYSVDATWTVPHFEKMLYDNALLLRVYLHLWRATGSALAE 294

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDAD---------SAETEGATRKKEGAFYVWTSKEV 453
            + R+   +L  D+  P G   SA DAD         SA   GA   +EGA Y WT  + 
Sbjct: 295 RVVRETAAFLLADLRTPQGGFASALDADAVPADAVPASAAPAGA-HPEEGASYAWTPAQF 353

Query: 454 EDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 512
             +LG E        + +   G+ +           +G +VL    D    A    +   
Sbjct: 354 VAVLGPEDGRWAAGVFGVTEQGSFE-----------RGTSVLRLPADPDDPARFAAVRAA 402

Query: 513 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 572
                            + RP+P  DDKV+ +WNGL I++ A A  +             
Sbjct: 403 LAAAR------------ATRPQPARDDKVVAAWNGLAIAALAEAGALF------------ 438

Query: 573 VVGSDRKEYMEVAESAASFIRR-HLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLD 631
               D  +++  AE AA  +R  HL + +  R     R G +   G L+DY  +  GLL 
Sbjct: 439 ----DEPDWVRAAEQAAVLLRDVHLVNGRLRRTSRDGRVGVNA--GVLEDYGDVAEGLLT 492

Query: 632 LYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNS 691
           L++     +WL  A  L +   + F   + GG+F+T  +   +L R ++D D A PSG +
Sbjct: 493 LHQVTGDPEWLALAGTLLDIVRDRFAASD-GGFFDTADDAEVLLRRPRDDSDSATPSGQA 551

Query: 692 VSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL-KDMAMAVPLMCCAADMLSVPSRK 750
                LV  A++   + S  +R  AE ++A     L +D   A      A  +L+ P+  
Sbjct: 552 AVAGALVSYAAL---TGSTEHRSAAETTVARVAPLLARDARFAGWAGAVAEALLAGPAEV 608

Query: 751 HVV 753
            VV
Sbjct: 609 AVV 611


>gi|387900736|ref|YP_006331032.1| hypothetical protein MUS_4478 [Bacillus amyloliquefaciens Y2]
 gi|387174846|gb|AFJ64307.1| conserved hypothetical protein YyaL [Bacillus amyloliquefaciens Y2]
          Length = 629

 Score =  368 bits (945), Expect = 6e-99,   Method: Compositional matrix adjust.
 Identities = 233/628 (37%), Positives = 338/628 (53%), Gaps = 58/628 (9%)

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           M  ESFEDE +A +LND F++IKVDREERPDVD VYM   Q + G GGWPL+VF++PD K
Sbjct: 1   MAHESFEDEEIAGMLNDKFIAIKVDREERPDVDSVYMRICQLMTGQGGWPLNVFVTPDQK 60

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           P   GTYFP   KY RPGF  +L  + + +   R          +E ++E  +A      
Sbjct: 61  PFYAGTYFPKTSKYNRPGFIDVLEHLSETFANDRQ--------HVEDIAENAAAHLEVKI 112

Query: 276 LPDE--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGE 333
            P E  L + A+     QL+  +D+ +GGFG APKFP P    M+++  +    TGK  +
Sbjct: 113 HPAEGMLGEQAVHDTYRQLAGGFDTVYGGFGQAPKFPMP---HMLMFLLRYYSYTGKE-Q 168

Query: 334 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
           A  G   V  TL  MA GGI DH+G GF RYS D  W VPHFEKMLYD   L   Y +A+
Sbjct: 169 ALAG---VTKTLDGMANGGIFDHIGFGFARYSTDNEWLVPHFEKMLYDNALLLPAYTEAY 225

Query: 394 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
            +T +  Y  I   I+ +++R+M+   G  FSA DAD   TEG    +EG +Y+W+ KE+
Sbjct: 226 QVTGNERYKQIAMQIVTFIQREMMHEDGSFFSALDAD---TEG----REGKYYIWSKKEI 278

Query: 454 EDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 512
            ++LG E   L+ + Y +   GN +   +  PH  F  +  ++E  ++  + ++L   LE
Sbjct: 279 MNLLGDELGPLYCKVYNITDQGNFEGENI--PHLIFTRREAILE--ETGLTGNELAERLE 334

Query: 513 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 572
                  E R KL + R  R  PH DDKV+ SWN L+I+  A+A+K+         F+ P
Sbjct: 335 -------EARTKLLEARENRSYPHTDDKVLTSWNALMIAGLAKAAKV---------FHEP 378

Query: 573 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 632
                  +++ +AE+A  F+ RHL  +   R+   +R G  K  GF+DDYAFLI   L+L
Sbjct: 379 -------DFLSMAETAIRFLERHLMPDG--RVMVRYREGEVKNKGFIDDYAFLIWAYLEL 429

Query: 633 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 692
           YE G    +L  A  L  +  ELF D   GG+F T  +  ++L+R KE +DGA PSGNS 
Sbjct: 430 YEAGFHPSYLQKAKTLCTSMLELFWDERHGGFFFTGNDAETLLVREKEVYDGAVPSGNSA 489

Query: 693 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHV 752
           + + L+RL  +  G  S    + AE   +VF+  ++    +      +    ++P +K +
Sbjct: 490 AAVQLLRLGRLT-GDIS--LIEKAEAMFSVFKREIEAYPSSNAFFMQSVLAHTMP-QKEI 545

Query: 753 VLVGHKSSVDFENMLAAAHASYDLNKTV 780
           V+ G K   D +  + A    +    T+
Sbjct: 546 VVFGSKDDPDRKRFIEALQEHFTPAYTI 573


>gi|434397636|ref|YP_007131640.1| protein of unknown function DUF255 [Stanieria cyanosphaera PCC
           7437]
 gi|428268733|gb|AFZ34674.1| protein of unknown function DUF255 [Stanieria cyanosphaera PCC
           7437]
          Length = 684

 Score =  368 bits (945), Expect = 6e-99,   Method: Compositional matrix adjust.
 Identities = 232/661 (35%), Positives = 336/661 (50%), Gaps = 67/661 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL +  S YL +HA NP+DW+ W +EA ++A + D PI LSIGYS+CHWC VME E+F 
Sbjct: 3   NRLTSTQSLYLRKHADNPIDWWYWCDEALSKAEREDKPILLSIGYSSCHWCTVMEGEAFS 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGGT 221
           D+ +A+ LN  FV+IKVDREERPD+D +YM  VQ + G GGWPL++FL+P DL P  GGT
Sbjct: 63  DQAIAEYLNVNFVAIKVDREERPDLDSIYMQAVQMMTGQGGWPLNIFLTPGDLVPFYGGT 122

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP + +Y RPGF  +L+ V   + + +  L     F  E LS    ++    + PD L 
Sbjct: 123 YFPLQPRYNRPGFLDVLQAVLRFYQEDKAKLEH---FKTEILSHLQQSTVLPLETPDSLT 179

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           +  L    E  +        G  S P  P            ++     +      G+ +V
Sbjct: 180 KQLLFAGIETNTGVISPNDLGRPSFPMIPYATLALQGSRFKQEFRYNPQELSWQRGKDLV 239

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL-TKDVF 400
           L        GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ+     + +S   ++  
Sbjct: 240 L--------GGIYDHVGGGFHRYTVDPTWTVPHFEKMLYDNGQILEYLANLWSAGCQEPE 291

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL-GE 459
            +    + +++L+R+M  P G  ++A+DADS     A   +EG+FYVW  +E+ D L  E
Sbjct: 292 IALAVTETVNWLKREMTAPNGYFYAAQDADSFVDVDAVEPEEGSFYVWNYQELADNLTAE 351

Query: 460 HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI-L 518
                +  + +   GN            F+GKNVL      + S S L   LEK   I  
Sbjct: 352 ELTELQTEFTVSVEGN------------FEGKNVLQRRQSGNLSDS-LTNTLEKLFTIRY 398

Query: 519 GECRRKLFDVRSKRPR-------------PHLDDKVIVSWNGLVISSFARASKILKSEAE 565
           G+ +  L      R               P  D K+IV+WN +VIS  AR   +  ++  
Sbjct: 399 GQAKESLAIFTPARNNHEAKTTPWQGRIPPVTDTKMIVAWNSIVISGLARVYAVFGNQL- 457

Query: 566 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSFRNGPSKAPGFLDDYAF 624
                          Y+++A +A +FI +H + DE+ HRL +   +G ++ P   +DYA 
Sbjct: 458 ---------------YLDLAVTATNFILQHQWLDERFHRLNY---DGLAQVPAQSEDYAL 499

Query: 625 LISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH- 682
            I  LLDL       ++WL  A+ +Q   D+L    E GGY+N++  D +  L ++E   
Sbjct: 500 FIKALLDLQAATPEKSQWLEQAVRIQTEFDQLLWSNEMGGYYNSSNTDANQELLIQERSY 559

Query: 683 -DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAA 741
            D A P+ N V+V NLVRL+ +    +   Y   AE +L  F + +     A P +  A 
Sbjct: 560 IDNATPAANGVAVTNLVRLSLLTDNLE---YLDRAEQALQAFSSVMTRSPQACPTLFVAL 616

Query: 742 D 742
           D
Sbjct: 617 D 617


>gi|424867573|ref|ZP_18291355.1| hypothetical protein C75L2_00200010 [Leptospirillum sp. Group II
           'C75']
 gi|124516649|gb|EAY58157.1| protein of unknown function [Leptospirillum rubarum]
 gi|387221885|gb|EIJ76392.1| hypothetical protein C75L2_00200010 [Leptospirillum sp. Group II
           'C75']
          Length = 689

 Score =  368 bits (945), Expect = 6e-99,   Method: Compositional matrix adjust.
 Identities = 229/670 (34%), Positives = 346/670 (51%), Gaps = 53/670 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPVDW+ WG+EAF +AR  + P+ LSIGY+ CHWCHVM  ESFE
Sbjct: 3   NRLKEETSPYLRQHADNPVDWYPWGKEAFEKARLEEKPVLLSIGYAACHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVY-MTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
              +A ++N++FV+IKVDREERPD+D++Y M +       GGWPL++FL+P   P  GGT
Sbjct: 63  RPDIASVMNEFFVNIKVDREERPDLDQIYQMAHTMITRRNGGWPLTMFLTPSQVPFAGGT 122

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP + ++G PGF  +L +++D +   R+ L +     ++ L +    + S     D  P
Sbjct: 123 YFPAQPRFGLPGFVQVLEQIRDFYRDHREGLEKEDHPILQYLGQTNPVADSREFELDLSP 182

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
             AL      L   +D  FGGFG APKFP  +++  +    ++ +  G S  A     M 
Sbjct: 183 SEAL---VNNLKSRFDPEFGGFGGAPKFPHAMDLSYLF---RRFQRKGDSTAA----HMA 232

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             TL  M +GGI D VGGGF RYSVDERW +PHFEKMLYD   L        S++K+  Y
Sbjct: 233 TVTLSSMKRGGIWDQVGGGFARYSVDERWLIPHFEKMLYDNALLLEALALGASVSKNPVY 292

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHA 461
           S    +++ +L R+M    G  +S+ DADS   EG    +EG FYV+ ++EV  IL +  
Sbjct: 293 SRTAEELVGWLFREMRSDDGVYYSSLDADS---EG----EEGRFYVFQAEEVRSILSDEE 345

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV-LIELNDSSASASKLGMPLEKYLNILGE 520
                 YY           +S P N F+G    L E       + +  +        +  
Sbjct: 346 YRVVSKYY----------GLSGPPN-FEGHAWNLYEARSIGELSKEFHLSESDIERRIES 394

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R+KLF  RS R RP LDDKV+ SWN L+              A++ +F+  ++G  ++E
Sbjct: 395 ARQKLFAYRSTRVRPGLDDKVLASWNALM--------------AKALLFSGRILG--KQE 438

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           ++        ++ R ++  +   L   +       P +LDDYAFL+  +L+        +
Sbjct: 439 WISAGRKTIDYMHRKMW--KNGLLMAVYSKKEPFLPAYLDDYAFLLLAVLESMRIDFRPE 496

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
            L +A  + +     F D E GG++ T     +++ R K  HDGA PSGN+ +V  L+ L
Sbjct: 497 DLSFATTIADVLLAEFYDPESGGFYFTGKNHEALIHRPKNGHDGALPSGNAAAVQGLLWL 556

Query: 701 ASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSS 760
            ++        Y   A+ +L ++  ++K+       M  A +  S    + VV +    +
Sbjct: 557 GTLTGHLP---YTSAADKTLRLYFAQMKEQPAGYTTMISALETYS--DSQPVVFLAGPQA 611

Query: 761 VDFENMLAAA 770
            D+++ ++  
Sbjct: 612 GDWKDKISCG 621


>gi|297192427|ref|ZP_06909825.1| conserved hypothetical protein [Streptomyces pristinaespiralis ATCC
           25486]
 gi|297151361|gb|EDY61872.2| conserved hypothetical protein [Streptomyces pristinaespiralis ATCC
           25486]
          Length = 678

 Score =  368 bits (944), Expect = 7e-99,   Method: Compositional matrix adjust.
 Identities = 229/589 (38%), Positives = 310/589 (52%), Gaps = 72/589 (12%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRLA   SPYLLQHA NPVDW+ W   AF EAR+RDVP+FLS+GYS+CHWCHV+  ESF
Sbjct: 8   ANRLAQATSPYLLQHADNPVDWWQWEPAAFEEARRRDVPVFLSVGYSSCHWCHVLAHESF 67

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           ED   A  +N+ FV+IKVDREERPDVD VYM  VQA  G GGWP+SV+++ D +P   GT
Sbjct: 68  EDAETAAYMNEHFVNIKVDREERPDVDAVYMEAVQAATGQGGWPMSVWMTADGEPFYFGT 127

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLP--DE 279
           YFPP  ++G P F+ +L  V DAW  +RD + +        L+ A S     + +P  +E
Sbjct: 128 YFPPAPRHGMPSFRQVLEGVSDAWTGRRDEVGEVAQRIASDLA-ARSLVVGGDGVPGEEE 186

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
           L Q  L      L++ YD R GGFG APKFP  + ++ +L H  +   TG  G      +
Sbjct: 187 LAQALL-----GLTRDYDERHGGFGGAPKFPPSMVLEFLLRHHAR---TGAEG----ALQ 234

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           M   T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  VY   +  T   
Sbjct: 235 MAADTCEAMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALLCRVYAHLWRATGSD 294

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 459
               +  +  D+L R++    G   SA DADS   +G     EGAFYVWT  ++ ++LGE
Sbjct: 295 LARRVALETADFLVRELRTSEGGFASALDADSDTADGG--HAEGAFYVWTPAQLREVLGE 352

Query: 460 H-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
                  E + +   G             F+  + ++ L    A A              
Sbjct: 353 EDGARAAELFAVTEEGT------------FEEGSSVLRLPHGEADA-------------- 386

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
            + R++L   R +RPRP  DDKV+ +WNGL I++ A            A F        R
Sbjct: 387 -DLRQRLLAAREERPRPGRDDKVVAAWNGLAIAALAET---------GAFFG-------R 429

Query: 579 KEYMEVAESAAS-FIRRHL-YDEQTHRLQHSFRNGPSKA-PGFLDDYAFLISGLLDLYEF 635
            + +E A  AA   +R H+ ++    RL  + ++G   A  G L+DYA +  G L L   
Sbjct: 430 PDLVERATEAADLLVRVHMDFEAGGVRLHRTSKDGRLGANAGVLEDYADVAEGFLALAAV 489

Query: 636 GSGTKWLVWAIELQNTQDELFLDR---EGGGYFNTTGEDPSVLLRVKED 681
           G    WL +A  L +    + +DR   EG   ++T   D   L+R  +D
Sbjct: 490 GGEGSWLEFAGFLLD----MVMDRFTGEGCALYDTA-HDAEPLIRRPQD 533


>gi|428281760|ref|YP_005563495.1| hypothetical protein BSNT_06256 [Bacillus subtilis subsp. natto
           BEST195]
 gi|291486717|dbj|BAI87792.1| hypothetical protein BSNT_06256 [Bacillus subtilis subsp. natto
           BEST195]
          Length = 629

 Score =  368 bits (944), Expect = 7e-99,   Method: Compositional matrix adjust.
 Identities = 217/578 (37%), Positives = 316/578 (54%), Gaps = 68/578 (11%)

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           M  ESFEDE +A+LLN+ FV+IKVDREERPDVD VYM   Q + G GGWPL+VF++PD K
Sbjct: 1   MAHESFEDEEIARLLNERFVAIKVDREERPDVDSVYMRICQLMTGQGGWPLNVFITPDQK 60

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           P   GTYFP   K+ RPGF  +L  + + +   R+ +      A + L    +A +    
Sbjct: 61  PFYAGTYFPKTSKFNRPGFVDVLEHLSETFANDREHVEDIAENAAKHLQTKTAAKSGEG- 119

Query: 276 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
               L ++A+    +QL+  +D+ +GGFG APKFP P    M++Y  +   +TG+     
Sbjct: 120 ----LSESAIHRTFQQLASGFDTIYGGFGQAPKFPMP---HMLMYLLRYHHNTGQENALY 172

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
              K    TL  MA GGI+DH+G GF RYS D+ W VPHFEKMLYD   L   Y +A+ +
Sbjct: 173 NVTK----TLDSMANGGIYDHIGYGFARYSTDDEWLVPHFEKMLYDNALLLTAYTEAYQV 228

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           T++  Y  IC  I+ +++R+M    G  FSA DAD   TEG    +EG +YVW+ +E+  
Sbjct: 229 TQNSRYKEICEQIITFIQREMTHEDGSFFSALDAD---TEG----EEGKYYVWSKEEILK 281

Query: 456 ILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV--LIELN------DSSASASK 506
            LG+    L+ + Y +   GN            F+GKN+  LI         D+  +  +
Sbjct: 282 TLGDDLGTLYCQVYDITEEGN------------FEGKNIPNLIHTKWEQIKADAGLTEKE 329

Query: 507 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 566
           L + LE       E R++L   R +R  PH+DDKV+ SWN L+I+  A+A+K+ +     
Sbjct: 330 LSLKLE-------EARQQLLKTREERTYPHVDDKVLTSWNALMIAGLAKAAKVYQ----- 377

Query: 567 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 626
                        +Y+ +A+ A +FI   L  +   R+   +R+G  K  GF+DDYAFL+
Sbjct: 378 -----------EPKYLSLAKDAITFIENKLIIDG--RVMVRYRDGEVKNKGFIDDYAFLL 424

Query: 627 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 686
              LDLYE      +L  A +L +    LF D E GG++ T  +  ++++R KE +DGA 
Sbjct: 425 WAYLDLYEASFDLSYLQKAKKLTDDIISLFWDEEHGGFYFTGHDAEALIVREKEVYDGAV 484

Query: 687 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 724
           PSGNSV+ + L+RL  +   S      + AE   +VF+
Sbjct: 485 PSGNSVAAVQLLRLGQVTGDSS---LIEKAETMFSVFK 519


>gi|119715292|ref|YP_922257.1| hypothetical protein Noca_1052 [Nocardioides sp. JS614]
 gi|119535953|gb|ABL80570.1| protein of unknown function DUF255 [Nocardioides sp. JS614]
          Length = 652

 Score =  368 bits (944), Expect = 8e-99,   Method: Compositional matrix adjust.
 Identities = 229/612 (37%), Positives = 314/612 (51%), Gaps = 80/612 (13%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRLA   SPYLLQHA NPVDW+ WG EAF EAR+R VP+ LS+GY+ CHWCHVM  ESF
Sbjct: 2   VNRLATATSPYLLQHAQNPVDWWEWGPEAFEEARRRGVPVLLSVGYAACHWCHVMAHESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           EDE  A  LN+ FVS+KVDREERPDVD VYM    ++ G GGWP++V L  +  P   GT
Sbjct: 62  EDEATAAYLNEHFVSVKVDREERPDVDAVYMQATTSMTGHGGWPMTVVLDHEGSPFFAGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP   ++G+P F+ +L  + DAW  + D + +  A   E LS    A+A +      + 
Sbjct: 122 YFPDRPRHGQPAFRQVLEALADAWQNRSDEVRRVAANLREHLSSTSLATAGA-----PIT 176

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           +  L      L+  YD+   GFG APKFP  + ++ +  H ++              +M+
Sbjct: 177 RAVLDGAVRTLALEYDADAAGFGGAPKFPPSMVLEFLRRHGER--------------EML 222

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             TL+ MA+GGIHD +GGGF RYSVD  W VPHFEKMLYD   L  VY +  +       
Sbjct: 223 GATLEAMARGGIHDQLGGGFARYSVDTDWVVPHFEKMLYDNALLLRVYAEWDTPVG---- 278

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHA 461
            +    I D+L  ++  P G   SA DADS   EGA    EG +YVWT  ++ ++LG   
Sbjct: 279 VWAAEGIADFLLGELRTPEGGFASALDADS---EGA----EGTYYVWTPAQLTEVLGPED 331

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
             +             L  ++D      G + L    D           L+++ +    C
Sbjct: 332 GPWAAR----------LLGVTDAGTFEHGTSTLQLRQDPD--------DLDRWFD----C 369

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           +R+L + RS R RP  DDKV+ +WNGL IS   RA          A+   P       EY
Sbjct: 370 QRRLREARSHRERPARDDKVVAAWNGLAISGLCRA---------GALIGLP-------EY 413

Query: 582 MEVAESAASFI-RRHLYDEQTHRLQHSFRNGPSKAP-GFLDDYAFLISGLLDLYEFGSGT 639
           +  A +A   + R HL D    RL+   R+G   AP G L+D   + +G LDL +     
Sbjct: 414 VAAATAAGQLLWRVHLVD---GRLRRVSRDGVVGAPAGVLEDNGCVAAGFLDLLQATGDA 470

Query: 640 KWLVWA---IELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
            WL  A   +EL  T        E GG+F+T  +  +++ R ++  D A PSG +  V  
Sbjct: 471 VWLERAGAILELALTH----FAAEDGGFFDTADDAEALVARPRDPSDNASPSGLASMVHA 526

Query: 697 LVRLASIVAGSK 708
           L   A++    +
Sbjct: 527 LSTYAALTGSGR 538


>gi|389572654|ref|ZP_10162736.1| yyaL [Bacillus sp. M 2-6]
 gi|388427679|gb|EIL85482.1| yyaL [Bacillus sp. M 2-6]
          Length = 627

 Score =  367 bits (943), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 212/554 (38%), Positives = 307/554 (55%), Gaps = 55/554 (9%)

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           M  ESFED+ VA +LN+ F+SIKVDREERPD+D +YM+  Q + G GGWPL+VF++PD K
Sbjct: 1   MAHESFEDQQVADILNEHFISIKVDREERPDIDSMYMSVCQMMTGQGGWPLNVFVTPDQK 60

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS---AS 272
           P   GTYFP    YGRPGF   L ++ DA+   RD         IE L+E  + +    +
Sbjct: 61  PFYAGTYFPKRSAYGRPGFIEALTQLLDAYHSDRD--------HIESLAEKATNNLRIKA 112

Query: 273 SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSG 332
           + +  + L Q ++     QL  S+D+ +GGFGSAPKFP P    M+ +  +  E TG+  
Sbjct: 113 AGQTENTLTQESIHKAYYQLMSSFDTLYGGFGSAPKFPAP---HMLTFLMRYFEWTGQEN 169

Query: 333 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 392
                 K    TL  MA GGI+DH+G GF RYS DE+W VPHFEKMLYD   L + Y +A
Sbjct: 170 ALYAVTK----TLNGMANGGIYDHIGSGFTRYSTDEKWLVPHFEKMLYDNALLIDAYTEA 225

Query: 393 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 452
           + +T+   Y  + +D++ +++RDM+   G  +SA DADS   EG    KEG +YVWT KE
Sbjct: 226 YQITQHPEYEKLVQDLIQFIKRDMMNRDGSFYSAIDADS---EG----KEGQYYVWTKKE 278

Query: 453 VEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPL 511
           +   LG+    LF   Y++   GN +   +  PH       +    +D  A+ S   +  
Sbjct: 279 IMTHLGDDLGTLFCAVYHITEEGNFEGQNI--PH------TISTSFDDIKAAYS---IDD 327

Query: 512 EKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNF 571
           +   + L   R  L  VR +RP P +DDKV+ SWN L+IS+ A+A  +   E        
Sbjct: 328 QTLYSKLQSARNILLTVRQQRPAPLIDDKVLTSWNALMISALAKAGSVFHEE-------- 379

Query: 572 PVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLD 631
                   E + +A+ A SF+  HL   Q  RL   +R G  K  GF++DYA +++  + 
Sbjct: 380 --------EAIRMAKQAMSFLETHLV--QQERLMVRYREGDVKHLGFIEDYAHMLTAYMS 429

Query: 632 LYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNS 691
           LYE      WL  A  +     ELF D + GG+F +  +  ++++R KE +DGA PSGNS
Sbjct: 430 LYEATFDLDWLTKARAVGENMFELFWDEQIGGFFFSGSDAETLIVREKEVYDGAMPSGNS 489

Query: 692 VSVINLVRLASIVA 705
            ++  L++L+ ++ 
Sbjct: 490 TALQQLLKLSRMIG 503


>gi|375364488|ref|YP_005132527.1| hypothetical protein BACAU_3798 [Bacillus amyloliquefaciens subsp.
           plantarum CAU B946]
 gi|371570482|emb|CCF07332.1| conserved hypothetical protein YyaL [Bacillus amyloliquefaciens
           subsp. plantarum CAU B946]
          Length = 629

 Score =  367 bits (943), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 230/628 (36%), Positives = 334/628 (53%), Gaps = 58/628 (9%)

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           M  ESFEDE +A +LND F+++KVDREERPDVD VYM   Q + G GGWPL+VF++PD K
Sbjct: 1   MAHESFEDEEIAGMLNDKFIAVKVDREERPDVDSVYMRICQLMTGQGGWPLNVFVTPDQK 60

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           P   GTYFP   K+ RPGF  +L  + + +   R          +E ++E  +A      
Sbjct: 61  PFYAGTYFPKTSKFNRPGFIDVLEHLSETFANDRQ--------HVEDIAENAAAHLEVKV 112

Query: 276 LPDE--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGE 333
            P E  L + A+     QL+  +D+ +GGFG APKFP P    M+++  +    TGK  +
Sbjct: 113 HPAEGMLGEQAVHDTYRQLAGGFDTVYGGFGQAPKFPMP---HMLMFLLRYYSYTGKE-Q 168

Query: 334 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
           A  G   V  TL  MA GGI DH+G GF RYS D  W VPHFEKMLYD   L   Y +A+
Sbjct: 169 ALAG---VTKTLDGMANGGIFDHIGFGFARYSTDNEWLVPHFEKMLYDNALLLTAYTEAY 225

Query: 394 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
            +T +  Y  I   I+ +++R+M+   G  FSA DAD   TEG    +EG +Y+W+ KE+
Sbjct: 226 QVTGNERYKQIAMQIVTFIQREMMHEDGSFFSALDAD---TEG----REGKYYIWSKKEI 278

Query: 454 EDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 512
            ++LG E   L+ + Y +   GN +   +  PH  F  +  ++E  ++  +  +L   LE
Sbjct: 279 MNLLGDELGPLYCKVYNITDQGNFEGENI--PHLIFTRREAILE--ETGLTGHELAERLE 334

Query: 513 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 572
                  E R KL + R  R  PH DDKV+ SWN L+I+  A+A+K+         F+ P
Sbjct: 335 -------EARTKLLEARENRSYPHTDDKVLTSWNALMIAGLAKAAKV---------FHEP 378

Query: 573 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 632
                  +++ +AE+A  F+ RHL  +   R+   +R G  K  GF DDYAFLI G L+L
Sbjct: 379 -------DFLSMAETAIRFLERHLMPDG--RVMVRYREGEVKNKGFNDDYAFLIWGYLEL 429

Query: 633 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 692
           YE G    +L  A  L     ELF D   GG+F T  +  ++L+R KE +DGA PSGNS 
Sbjct: 430 YEAGFHPSYLQKAKTLCTNMLELFWDERHGGFFFTGNDAETLLVREKEVYDGAVPSGNSA 489

Query: 693 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHV 752
           + + L+RL  +          + AE   +VF+  ++    +      +    ++P +K +
Sbjct: 490 AAVQLLRLGRLTGDVS---LIEKAEAMFSVFKREIEAYPSSSAFFMQSVLAHTMP-QKEI 545

Query: 753 VLVGHKSSVDFENMLAAAHASYDLNKTV 780
           V+ G K   D +  + A    +    T+
Sbjct: 546 VVFGRKDDPDRKRFIEALQEHFTPAYTI 573


>gi|399928052|ref|ZP_10785410.1| hypothetical protein MinjM_13607 [Myroides injenensis M09-0166]
          Length = 665

 Score =  367 bits (943), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 222/625 (35%), Positives = 322/625 (51%), Gaps = 52/625 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L  E SPYLLQHA NP+ W AW E+    A+K +  I +SIGYSTCHWCHVME ESFE
Sbjct: 2   NELHKETSPYLLQHASNPIHWKAWSEKTLELAKKSNKLIAISIGYSTCHWCHVMEHESFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  VA L+N+ F+SIK+DREE PD+D  YM  VQ +   GGWPL+V   PD +P+ GGTY
Sbjct: 62  DNKVATLMNNHFISIKIDREEFPDIDAFYMKAVQIMTKQGGWPLNVVCLPDGRPIWGGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP      +  +   L ++ + +  K + +     FA EQL E +S   SS  + +   +
Sbjct: 122 FP------KQTWLDSLTQLNELYQTKPETVID---FA-EQLHEGISL-LSSGPIENSETR 170

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L +  E+ SKS+D   GG+G APKF  P     +LY    L+  G      +  + + 
Sbjct: 171 FNLEVLIEKWSKSFDWENGGYGRAPKFMMPSN---LLY----LQKLGVYSHTKDILEYID 223

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  MA GG+ D V GGF RYSVD RWH+PHFEKMLYD  QL  VY DA+  TK+  Y 
Sbjct: 224 LTLTKMAWGGLFDTVEGGFSRYSVDMRWHIPHFEKMLYDNAQLLTVYADAYKRTKNNLYK 283

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            +    + Y+  +     G  +SA DADS   +   + KEGA+YVWT KE++DI+ +   
Sbjct: 284 EVIAKTITYIENNWANKEGGYYSALDADSLNHDN--QLKEGAYYVWTEKELQDIINKEYD 341

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
           +FK+ + +   G  +           +   VLI+  D  + A++  +     + +  E  
Sbjct: 342 IFKQVFNINDNGYWE-----------ENNYVLIQTQDLHSIANQNNIEYSHLVTLKKEWE 390

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYM 582
             L   R  R  P LDDK + SWN + I+    +   L                + KEY+
Sbjct: 391 ELLLQARKNRKAPRLDDKTLTSWNAMYINGLLNSYTAL----------------NNKEYL 434

Query: 583 EVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWL 642
            +A     FI   L+DE    L H+++NG      +LDDYA+ IS  ++LYE      +L
Sbjct: 435 VLAIKTFDFITAKLWDEDK-GLYHTYKNGQKTIKAYLDDYAYYISAAIELYEHTGEDNYL 493

Query: 643 VWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLAS 702
             A    +   + F D +   +F +      ++  + E  D   PS N++  +NL +LA 
Sbjct: 494 TIAKNCTDYVFDHFYDDKTKFFFYSQDIQEYIIKNI-ETEDNVIPSSNAIMCLNLQKLAV 552

Query: 703 IVAGSKSDYYRQNAEHSLAVFETRL 727
           +       +YR  + + L + +T++
Sbjct: 553 LYDNL---HYRNTSINMLEIIKTQI 574


>gi|328541699|ref|YP_004301808.1| Thioredoxin domain protein [Polymorphum gilvum SL003B-26A1]
 gi|326411451|gb|ADZ68514.1| Thioredoxin domain protein [Polymorphum gilvum SL003B-26A1]
          Length = 670

 Score =  367 bits (942), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 238/665 (35%), Positives = 332/665 (49%), Gaps = 76/665 (11%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRLA   SPYLLQH  NPV W  WGE+A AEAR  D PI LS+GY+ CHWCHVM  ESF
Sbjct: 3   ANRLADATSPYLLQHKDNPVHWHPWGEKALAEARSLDKPILLSVGYAACHWCHVMAHESF 62

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           ED   A+++N  FV+IKVDREERPD+D++YM  + AL   GGWPL++FL+PD +P  GGT
Sbjct: 63  EDPATAEVMNRLFVNIKVDREERPDIDQIYMNALHALGEQGGWPLTMFLTPDGEPFWGGT 122

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP E ++GRP F  IL  V   +  +R  + ++    ++ L +    +A        L 
Sbjct: 123 YFPKEARWGRPAFVDILEAVAATYRSERSRIDRNRTGLMQVLKQRAQPAAP-------LD 175

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
              L L  ++L   +D   GG   APKFP+   + ++     +   TG        ++  
Sbjct: 176 SAILVLAGDRLLSLFDPEHGGIRGAPKFPQASILDLVWRAGLR---TGNPA----ARETF 228

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
           L TL+ ++ GGI+DH+ GG  RYSVDERW VPHFEKMLYD  Q     L A+  T +  +
Sbjct: 229 LHTLRQISNGGIYDHLKGGIARYSVDERWLVPHFEKMLYDNAQYLQHLLTAWLATGEDLF 288

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHA 461
                + + +L  +M  P G   S+ DADS   EG    +EG FYVWT+ EV ++LG  A
Sbjct: 289 RCRIDETVGWLLDEMRLPEGGFASSLDADS---EG----EEGRFYVWTAAEVAEVLGADA 341

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
             F   Y +   GN            ++G  +L  L  ++AS      P E+  N L   
Sbjct: 342 AFFARFYDISAAGN------------WEGVTILNRLTGTAAS------PEEE--NRLAAL 381

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           R KL   R+ R RP LDDKV+  WNGL+I++ ARA +I+                 R+ +
Sbjct: 382 RAKLLSRRASRVRPALDDKVLADWNGLLIAALARAGRIVS----------------RESW 425

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
           +  AE A  FI   +      RL H++R G    PGF  D+A ++   + L E       
Sbjct: 426 IAAAEQAFRFIAESM--TGGGRLGHAWRAGRLVFPGFASDHAAMMQAAIALAEARP---- 479

Query: 642 LVWAIELQNTQDELFLD-------REGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
             W  +      E F D         GGG++ T  +   ++LR     D A P+ NSV+ 
Sbjct: 480 --WDAQHYLRIAEGFADALVRHYAAPGGGFYMTADDATDLILRPLSSADEAVPNANSVAA 537

Query: 695 INLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVL 754
               RL  +    +   +R  A+     F   +     A   + CA D   +  R  VV+
Sbjct: 538 DAFARLYLLTGDRR---HRDVADAVFHAFAGDVPKNLFATASLLCAFDT-RINGRLAVVV 593

Query: 755 VGHKS 759
             + S
Sbjct: 594 APNGS 598


>gi|119488064|ref|ZP_01621508.1| hypothetical protein L8106_11722 [Lyngbya sp. PCC 8106]
 gi|119455353|gb|EAW36492.1| hypothetical protein L8106_11722 [Lyngbya sp. PCC 8106]
          Length = 688

 Score =  367 bits (942), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 237/683 (34%), Positives = 355/683 (51%), Gaps = 109/683 (15%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   S YL +HA NP+DW+ W +EA  +A+++D PIFLSIGYS+CHWC VME E+F 
Sbjct: 3   NRLAQSKSLYLRKHAENPIDWWPWCDEALEQAKRQDKPIFLSIGYSSCHWCTVMEGEAFS 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGGT 221
           D  VA+ +N+ F+SIKVDREERP++D +YM  +Q + G GGWPL++FLSP DL P +GGT
Sbjct: 63  DGAVAQYMNEHFISIKVDREERPEIDSIYMQALQMMTGQGGWPLNIFLSPDDLVPFVGGT 122

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASA--SSNKLPDE 279
           YFP + +YG+PGF  +LR+V+  ++ ++  L        +++  AL  S   S+++L + 
Sbjct: 123 YFPVQPRYGQPGFLEVLRRVRGFYNTEKTRLQN----LKQEIRNALVQSTVLSASQLNEG 178

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS-EGQ 338
           L Q  L      +++   +  GG    P+FP      M+ Y    L D     E+  + Q
Sbjct: 179 LLQQGLTTNTAVITR---NDLGG----PRFP------MIPYADTALHDVRFDFESPYDSQ 225

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS--LT 396
           +        +A GGI+DHV GGFHRY+VD  W VPHFEKMLYD GQ+     + +S  +T
Sbjct: 226 QACTQRGTDLASGGIYDHVAGGFHRYTVDPTWTVPHFEKMLYDNGQIVEYLANLWSAGIT 285

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
           K  F   I   +  +L+R+M  P G  ++++DAD+  T      +EG FYVW  +++E+I
Sbjct: 286 KPAFERSISGTV-SWLKREMTAPKGHFYASQDADNFTTPEDVEPEEGEFYVWNWQDLEEI 344

Query: 457 LG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
           +  E     +  + +  +GN            F+GKNVL   N        L  P+E  L
Sbjct: 345 VSPEEFGELQAQFSITKSGN------------FEGKNVLQRWN-----CDALSQPIESAL 387

Query: 516 NILGECRRKLFDVR-------------------------SKRPRPHLDDKVIVSWNGLVI 550
                   KLF VR                         S R  P  D K+IV+WN L+I
Sbjct: 388 -------AKLFAVRYGAKPQDLETFPPATNNQEAKSKNWSGRIPPVTDTKMIVAWNSLMI 440

Query: 551 SSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSFR 609
           S  ARA+ + +                + EY+++A +AA FI  + + D + HR+ +   
Sbjct: 441 SGLARAATVFQ----------------QPEYLKIATTAAQFILENQWVDGRLHRVNY--- 481

Query: 610 NGPSKAPGFLDDYAFLISGLLDLYE-------FGSGTKWLVWAIELQNTQDELFLDREGG 662
           +G        +DYA  I  L+DL++       F     W   A+++Q   D+     E G
Sbjct: 482 DGNPDVLAQSEDYALFIKALIDLHQASLIESSFQLPEYWFEKAVKVQQEFDQFLWSVELG 541

Query: 663 GYFNT---TGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHS 719
           GY+N    TG++  +L+R +   D A P+ N V++ NLVRL   +   + DY  + AE  
Sbjct: 542 GYYNIGTDTGQE--LLMRERSYTDNATPAANGVAMANLVRL--FLLTEQLDYLDK-AEQG 596

Query: 720 LAVFETRLKDMAMAVPLMCCAAD 742
           +  F + ++    A P +  A D
Sbjct: 597 IQAFSSIMEKSPQACPSLFVALD 619


>gi|296445985|ref|ZP_06887935.1| protein of unknown function DUF255 [Methylosinus trichosporium
           OB3b]
 gi|296256503|gb|EFH03580.1| protein of unknown function DUF255 [Methylosinus trichosporium
           OB3b]
          Length = 679

 Score =  367 bits (941), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 239/675 (35%), Positives = 344/675 (50%), Gaps = 67/675 (9%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           +NRL+ E SPYLLQH  NPV W AW  E  A A+ +  PI LS GY+ CHWCHVM  ESF
Sbjct: 4   SNRLSEETSPYLLQHKDNPVHWRAWSAETLALAKAQGKPILLSSGYAACHWCHVMAAESF 63

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           E++ +A L+N  F+++KVDREERPD+D +Y   +Q L   GGWPL++FL+PD +P  GGT
Sbjct: 64  ENDRIAALMNANFINVKVDREERPDIDHLYQQALQMLGRRGGWPLTMFLTPDGEPFWGGT 123

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQS-GAFA--IEQLSEALSASASSNKLPD 278
           YFPPE ++G PGF  IL+ V + W +K  ++ ++ GA A  +++L+E+  A   S  L  
Sbjct: 124 YFPPEPRHGMPGFADILQAVAELWREKPAVVTRNVGAIANGLDRLAESAPAEPISPVL-- 181

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
                 L    E+L +  D   GG   APKFP+P  ++ +    K      ++G AS  +
Sbjct: 182 ------LETITERLEELIDREHGGIRGAPKFPQPPSLEFLWRAWK------RTGRASL-R 228

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
           + VL TL  + +GGI+DH+GGGF RYS DERW  PHFEKMLYD GQL  +    +   + 
Sbjct: 229 EAVLTTLDHICQGGIYDHIGGGFARYSTDERWLAPHFEKMLYDNGQLVELLTLVWQDERK 288

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             Y+    + +D+  R+M  P G   S+ DADS         +EG FYVW++ E++  LG
Sbjct: 289 PLYAARVEETIDWALREMRLPEGVFASSLDADS-------EHEEGKFYVWSAAEIDAALG 341

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
           E A  F+  Y +   GN +         E    N L+E+   SA A          L  L
Sbjct: 342 ERAGAFRAAYDVTEAGNWE---------EKNIPNRLLEMALGSAEAEAALAADRAALLAL 392

Query: 519 GECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDR 578
            E R           RP  DDK +  WNGL+I++ A A++                   R
Sbjct: 393 RETRV----------RPGRDDKALADWNGLMIAALAAAAQAFA----------------R 426

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
            +++ VA +A  FI   +      RL HS+R G +K    LDDYA L    L L+E    
Sbjct: 427 PDWLAVATAAFDFIATSMTTADG-RLLHSYRAGRAKHMAVLDDYADLCRAALTLHEATGD 485

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
             +L    E     +  + D   GGYF T  +  +++ R K   D   PSGN      L 
Sbjct: 486 DAYLTRCREWAEIVETHYRD-PAGGYFFTADDAEALIRRAKIAEDAPLPSGNGAMTQVLA 544

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHK 758
           RL  +   +    YR+ AE +L  F   ++   +    +   A++L       +V++G +
Sbjct: 545 RLYHLTGETA---YRERAEATLTAFAGTVRRGLLGYSTLLSGAEILR--DGLQIVIIGAR 599

Query: 759 SSVDFENMLAAAHAS 773
           ++ D   +L   H +
Sbjct: 600 AAEDTAALLRVLHET 614


>gi|452857673|ref|YP_007499356.1| Uncharacterized protein yyaL [Bacillus amyloliquefaciens subsp.
           plantarum UCMB5036]
 gi|452081933|emb|CCP23707.1| Uncharacterized protein yyaL [Bacillus amyloliquefaciens subsp.
           plantarum UCMB5036]
          Length = 629

 Score =  367 bits (941), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 232/628 (36%), Positives = 336/628 (53%), Gaps = 58/628 (9%)

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           M  ESFEDE +A +LND F++IKVDREERPDVD VYM   Q + G GGWPL+VF++PD K
Sbjct: 1   MAHESFEDEEIAGILNDKFIAIKVDREERPDVDSVYMRICQLMTGQGGWPLNVFVTPDQK 60

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           P   GTYFP   K+ RPGF  +L  + + +   R          +E ++E  +A      
Sbjct: 61  PFYAGTYFPKTSKFNRPGFIDVLEHLSETFANDRQH--------VEDIAENAAAHLEVKV 112

Query: 276 LPDE--LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGE 333
            P E  L + A+     QL+  +D+ +GGFG APKFP P    M+++  +    TGK  +
Sbjct: 113 HPAEGMLGEQAVHDTYRQLAGGFDTVYGGFGQAPKFPMP---HMLMFLLRYYSYTGKE-Q 168

Query: 334 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
           A  G   V  TL  MA GGI DH+G GF RYS D  W VPHFEKMLYD   L   Y +A 
Sbjct: 169 ALAG---VTKTLDGMANGGIFDHIGFGFARYSTDNEWLVPHFEKMLYDNALLLTAYTEAC 225

Query: 394 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
            +T +  Y  I   I+ +++R+M+   G  FSA DAD   TEG    +EG +Y+W+ KE+
Sbjct: 226 QVTGNERYKQIAMQIVTFIQREMMHEDGSFFSALDAD---TEG----REGKYYIWSKKEI 278

Query: 454 EDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 512
            ++LG E   L+ + Y +   GN +   +  PH  F  +  ++E  ++  +  +L   LE
Sbjct: 279 MNLLGDELGPLYCKVYNITDQGNFEGENI--PHLIFTRREAILE--ETGLTGHELAERLE 334

Query: 513 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 572
                  E R KL + R  R  PH DDKV+ SWN L+I+  A+A+K+         F+ P
Sbjct: 335 -------EARTKLLEARENRSYPHTDDKVLTSWNALMIAGLAKAAKV---------FHEP 378

Query: 573 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 632
                  +++ +AE+A  F+ RHL  +   R+   +R G  K  GF+DDYAFLI   L+L
Sbjct: 379 -------DFLSMAETAIRFLERHLMPDG--RVMVRYREGEVKNKGFIDDYAFLIWAYLEL 429

Query: 633 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 692
           YE G    +L  A  L  +  ELF D   GG+F T  +  ++L+R KE +DGA PSGNS 
Sbjct: 430 YEAGFNPSYLQKAKTLCTSMLELFWDERHGGFFFTGNDAETLLVREKEVYDGAVPSGNSA 489

Query: 693 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHV 752
           + + L+RL  +  G  S    + AE   +VF+  ++    +      +    ++P +K +
Sbjct: 490 AAVQLLRLGRLT-GDIS--LIEKAEAMFSVFKREIEAYPSSNAFFMQSVLAHTMP-QKEI 545

Query: 753 VLVGHKSSVDFENMLAAAHASYDLNKTV 780
           V+ G K   D +  + A    +    T+
Sbjct: 546 VVFGRKDDPDRKRFIEALQEHFTPAYTI 573


>gi|302865439|ref|YP_003834076.1| N-acylglucosamine 2-epimerase [Micromonospora aurantiaca ATCC
           27029]
 gi|302568298|gb|ADL44500.1| N-acylglucosamine 2-epimerase [Micromonospora aurantiaca ATCC
           27029]
          Length = 678

 Score =  366 bits (940), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 228/622 (36%), Positives = 326/622 (52%), Gaps = 54/622 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   SPYLLQHA NPVDW+ W +EAFAEA++RDVP+ +S+GY+ CHWCHVM  ESFE
Sbjct: 2   NRLAEATSPYLLQHADNPVDWWPWCDEAFAEAKRRDVPVLISVGYAACHWCHVMAHESFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +E VA+L+ND FV +KVDREERPDVD VYMT  QA+ G GGWP++VF +PD  P   GTY
Sbjct: 62  NEAVARLMNDDFVCVKVDREERPDVDAVYMTATQAMTGQGGWPMTVFATPDGTPFFCGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP      R  F  +L  V  AW  +R+ + + G   +E +  A +    +  L  EL  
Sbjct: 122 FP------RANFIRLLGSVATAWRDQREAVLRQGTAVVEAIGGAQAVGGVTAPLTAEL-- 173

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L   A +L+  YD   GGFG APKFP  + +  +L H ++   TG    ++   ++V 
Sbjct: 174 --LDAAASRLAGEYDETNGGFGGAPKFPPHMNLLFLLRHHQR---TG----SARSLEIVR 224

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            T + MA+GG++D + GGF RYSVD  W VPHFEKMLYD   L  VY   + LT D    
Sbjct: 225 HTCEAMARGGLNDQLAGGFARYSVDGHWTVPHFEKMLYDNALLLRVYTQLWRLTGDRLAR 284

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            + RD   +L  ++   G    SA DAD+   EG T       YVWT  ++ ++LGE   
Sbjct: 285 RVARDTARFLADELHRAGEGFASALDADTEGVEGLT-------YVWTPDQLVEVLGEDDG 337

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
            F            DL  ++       G +VL    D   +  ++     ++ +++G   
Sbjct: 338 RFA----------ADLFEVTADGTFEHGTSVLRLARDVDDADPEV---RARWQDVVG--- 381

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR----ASKILKSEAESAMFNFPVVGSDR 578
            +L   R  RP+P  DDKV+ +WNGL I++ A     AS ++  + E A     V+    
Sbjct: 382 -RLLAARDTRPQPARDDKVVAAWNGLAITAIAEFQQVASLLVSPDDEDANLMDGVLIVSD 440

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
               + AE  A+    HL D +  R+      G  +  G L+DY  +      +++    
Sbjct: 441 GAMRDAAEHLATV---HLVDGRLRRVSRDKVVG--QPAGVLEDYGCVAEAFCAMHQLTGE 495

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
            +WL  A EL +     F   + G +++T  +   ++ R  +  D A PSG S  V  LV
Sbjct: 496 GRWLTLAGELLDVALARFAGPD-GAFYDTADDAERLVTRPADPTDNATPSGRSAIVAALV 554

Query: 699 RLASIVAGSKSDYYRQNAEHSL 720
             A++   ++   YR+ AE +L
Sbjct: 555 AYAALTGETR---YREAAEKTL 573


>gi|218246233|ref|YP_002371604.1| hypothetical protein PCC8801_1388 [Cyanothece sp. PCC 8801]
 gi|218166711|gb|ACK65448.1| protein of unknown function DUF255 [Cyanothece sp. PCC 8801]
          Length = 688

 Score =  366 bits (939), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 247/664 (37%), Positives = 342/664 (51%), Gaps = 70/664 (10%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           +NRLA   S YL +HA NP+DW+ W EEA   A++ + PIFLSIGYS+CHWC VME E+F
Sbjct: 2   SNRLATAQSLYLRKHADNPIDWWYWCEEALLTAKQSNRPIFLSIGYSSCHWCTVMEGEAF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGG 220
            D+ +A  LND F+ IK+DREERPD+D +YM  VQ +   GGWPL++FL+P DL P  GG
Sbjct: 62  SDQAIAAYLNDNFLPIKLDREERPDLDSLYMQAVQMMGIQGGWPLNIFLTPDDLVPFYGG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFP E +YGRPGF  +L+ ++  +D ++D L    +F      E L     S  LP   
Sbjct: 122 TYFPIEPRYGRPGFLQVLQSIRRFYDTEKDKL---NSFK----HEILDTLQKSAILP--- 171

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPK-FPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
             NA  L  E   +   +        P+ F RP    M+ Y +  L+ +  + ++ E Q 
Sbjct: 172 VTNAELLNNELFYRGITANTEVIIVNPQDFNRPC-FPMIPYANLALQGSRFAFQSQENQA 230

Query: 340 MVLFTL-QCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL--T 396
            V +   + +A GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ+     + +S    
Sbjct: 231 TVTYQRGEDLALGGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNGQIVEYLANLWSQGHQ 290

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
           +  F   I R + ++L+R+M  P G  ++A+DAD+  T      +EGAFYVW  +E+ED 
Sbjct: 291 EPAFKRAIARTV-EWLQREMTAPQGYFYAAQDADNFTTPDEKEPEEGAFYVWKYQELEDC 349

Query: 457 L-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
           L  E   L +  + L   GN            F+G NVL        S + L + L+K  
Sbjct: 350 LTSEELKLLEATFSLTAEGN------------FEGSNVLQRRMGGEFSEA-LEVILDKLF 396

Query: 516 NI-LGECRRKLF-------------DVRSKRPRPHLDDKVIVSWNGLVISSFARASKILK 561
            I  G  R+ L                   R  P  D K+IV+WN L+IS  ARA  +  
Sbjct: 397 MIRYGSSRKTLTTFPPAKNNQEAKNQTWPGRIPPVTDTKMIVAWNSLMISGLARAYGV-- 454

Query: 562 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSFRNGPSKAPGFLD 620
                  F  P+       Y E+A +A  FI +  + + + +RL +    G        +
Sbjct: 455 -------FGDPL-------YWELAINATEFILQEQWVNNRLYRLNYE---GQPSVLAQAE 497

Query: 621 DYAFLISGLLDLYEFGSGTK-WLVWAIELQNTQDELFLDREGGGYFNTTGEDPS-VLLRV 678
           DYAF I  LLDL +     + WL  A E+Q   DE F   EGGGY+N   ++   +L+R 
Sbjct: 498 DYAFFIKALLDLQKANPWERQWLEKAKEVQEEFDEFFWSIEGGGYYNNASDNSGDLLIRE 557

Query: 679 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 738
           +   D A PS N V++ NLVRL+ +        Y   AE  L  F + L     A P + 
Sbjct: 558 RSYIDNATPSANGVALSNLVRLSRLTDDLD---YLHRAEQGLQTFSSVLSQSPKACPSLF 614

Query: 739 CAAD 742
            A D
Sbjct: 615 VALD 618


>gi|386383690|ref|ZP_10069151.1| hypothetical protein STSU_12230 [Streptomyces tsukubaensis
           NRRL18488]
 gi|385668865|gb|EIF92147.1| hypothetical protein STSU_12230 [Streptomyces tsukubaensis
           NRRL18488]
          Length = 672

 Score =  366 bits (939), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 243/627 (38%), Positives = 327/627 (52%), Gaps = 68/627 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   SPYLLQHA NPVDW+ W   AF EAR+RDVP+ LS+GYS+CHWCHVM  ESFE
Sbjct: 2   NRLADSQSPYLLQHADNPVDWWPWSPGAFEEARRRDVPVLLSVGYSSCHWCHVMAHESFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE  A  LN+ FVS+KVDREERPDVD VYM  VQA  G GGWP++VFL+ D +P   GTY
Sbjct: 62  DEATAAYLNEHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMTVFLNADGEPFYFGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLP--DEL 280
           FPPE ++G   F+ +L  V  AW  +R+ + +  A     L+   +A+     LP  DEL
Sbjct: 122 FPPEPRHGMASFRQVLEGVTAAWRDRREEVGEVAAKITRDLA-GRAAAHGGEGLPGEDEL 180

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
            Q  L      L++ YD R+GGF  APKFP  + ++ +L H  +   TG  G       M
Sbjct: 181 SQALL-----GLTRDYDERYGGFAGAPKFPPSMVLEFLLRHYAR---TGARG----ALDM 228

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
              T + MA+GG++D +GGGF RYSVD  W VPHFEKMLYD   L  VY   +       
Sbjct: 229 AAGTCEAMARGGLYDQLGGGFARYSVDREWIVPHFEKMLYDNALLCRVYAHLWRADGSPL 288

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
              I  +  D+L R++    G   SA DADS +  G     EGAFYVWT  ++ + LGE 
Sbjct: 289 ARRIALETADFLVRELRTAEGGFASALDADSHDPAG--EHGEGAFYVWTPAQLTEALGE- 345

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
                           D  R ++ +        + E       AS L +P E    +   
Sbjct: 346 ---------------ADGRRAAEIYG-------VTEEGTFERGASVLRLPGEDDPAL--- 380

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R +LF+ R +RPRP  DDKV+ +WNGL I++ A                      DR +
Sbjct: 381 -RARLFEARERRPRPERDDKVVAAWNGLAIAALAETGAFF----------------DRPD 423

Query: 581 YMEVAESAAS-FIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLISGLLDLYEFGSG 638
            +E A  AA   +R HL D    RL  + ++G     PG L+DYA +  G + L      
Sbjct: 424 LVERATEAADLLVRVHLGDGA--RLTRTSKDGVAGHNPGVLEDYADVAEGFIALAGVTGE 481

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
             WL +A  L +   +LF   E G  F+T  +   ++ R ++  D A P+G + +   L+
Sbjct: 482 GVWLDFAGVLLDLVIDLFTG-ENGTLFDTAHDAERLIRRPQDPTDNATPAGWTAAAGALL 540

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFET 725
              S  A + S+ +R  AE +L V + 
Sbjct: 541 ---SYAAHTGSEPHRAAAERALGVVKA 564


>gi|315501987|ref|YP_004080874.1| n-acylglucosamine 2-epimerase [Micromonospora sp. L5]
 gi|315408606|gb|ADU06723.1| N-acylglucosamine 2-epimerase [Micromonospora sp. L5]
          Length = 678

 Score =  365 bits (938), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 228/622 (36%), Positives = 326/622 (52%), Gaps = 54/622 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   SPYLLQHA NPVDW+ W +EAFAEA++RDVP+ +S+GY+ CHWCHVM  ESFE
Sbjct: 2   NRLAEATSPYLLQHADNPVDWWPWCDEAFAEAKRRDVPVLISVGYAACHWCHVMAHESFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +E VA+L+ND FV +KVDREERPDVD VYMT  QA+ G GGWP++VF +PD  P   GTY
Sbjct: 62  NEAVARLMNDDFVCVKVDREERPDVDAVYMTATQAMTGQGGWPMTVFATPDGTPFFCGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP      R  F  +L  V  AW  +R+ + + G   +E +  A +    +  L  EL  
Sbjct: 122 FP------RANFIRLLGSVATAWRDQREAVLRQGTAVVEAIGGAQAVGGVTAPLTAEL-- 173

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L   A +L+  YD   GGFG APKFP  + +  +L H ++   TG    ++   ++V 
Sbjct: 174 --LDAAASRLAGEYDETNGGFGGAPKFPPHMNLLFLLRHHQR---TG----SARSLEIVR 224

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            T + MA+GG++D + GGF RYSVD  W VPHFEKMLYD   L  VY   + LT D    
Sbjct: 225 HTCEAMARGGLNDQLAGGFARYSVDGHWTVPHFEKMLYDNALLLRVYTQLWRLTGDRLAR 284

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            + RD   +L  ++   G    SA DAD+   EG T       YVWT  ++ ++LGE   
Sbjct: 285 RVARDTARFLADELHRAGEGFASALDADTEGVEGLT-------YVWTPGQLVEVLGEDDG 337

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
            F            DL  ++       G +VL    D   +  ++     ++ +++G   
Sbjct: 338 RFA----------ADLFEVTADGTFEHGTSVLRLARDVDDADPEV---RARWQDVVG--- 381

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFAR----ASKILKSEAESAMFNFPVVGSDR 578
            +L   R  RP+P  DDKV+ +WNGL I++ A     AS ++  + E A     V+    
Sbjct: 382 -RLLAARDTRPQPARDDKVVAAWNGLAITAIAEFQQVASLLVSPDDEDANLMDGVLIVSD 440

Query: 579 KEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
               + AE  A+    HL D +  R+      G  +  G L+DY  +      +++    
Sbjct: 441 GAMRDAAEHLATV---HLVDGRLRRVSRDKVVG--QPAGVLEDYGCVAEAFCAMHQLTGE 495

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
            +WL  A EL +     F   + G +++T  +   ++ R  +  D A PSG S  V  LV
Sbjct: 496 GRWLTLAGELLDVALARFAGPD-GAFYDTADDAERLVTRPADPTDNATPSGRSAIVAALV 554

Query: 699 RLASIVAGSKSDYYRQNAEHSL 720
             A++   ++   YR+ AE +L
Sbjct: 555 AYAALTGETR---YREAAEKTL 573


>gi|376005318|ref|ZP_09782832.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|375326245|emb|CCE18585.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
          Length = 686

 Score =  365 bits (938), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 234/677 (34%), Positives = 345/677 (50%), Gaps = 97/677 (14%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           +NRLA   S YL +HA NP+DW+ W +EA  ++R  D PIFLSIGYS+CHWC VME E+F
Sbjct: 2   SNRLAQSSSLYLRKHADNPIDWWPWCDEALEKSRTEDKPIFLSIGYSSCHWCTVMEGEAF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGG 220
            D  +A+ +N  F+ IKVDREERP++D +YM  +Q + G GGWPL+VFL+P D  P  GG
Sbjct: 62  SDAAIAEYMNANFIPIKVDREERPEIDSIYMQALQMMTGQGGWPLNVFLTPGDRIPFYGG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFP E +YGRPGF  +L+ + + +   ++ L       + QL +++         P EL
Sbjct: 122 TYFPIEPRYGRPGFLDLLKAIHNFYQTDKNKLETVTEEILTQLRQSMILP------PSEL 175

Query: 281 PQNALRLCAEQLSKSYD-SRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
            ++ L+   E  +     + +GG    P+FP  +    M +   +L  + K     +G+ 
Sbjct: 176 TEDLLKQGLETNTGVVGRNNYGG----PRFPM-IPYADMAWRGTRLISSPK----VDGKA 226

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS-LTKD 398
             L   + +  GGI+DHV GGFHRY+VD  W VPHFEKMLYD GQ+     D +S   K 
Sbjct: 227 ACLQRGKDLVTGGIYDHVAGGFHRYTVDPTWTVPHFEKMLYDNGQILEFLADLWSDGEKQ 286

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             Y       +++L+R+M  P G  ++A+DADS  T      +EGAFYVWT++E+E  L 
Sbjct: 287 PAYQRAINGTVEWLKREMTAPEGYFYAAQDADSFVTSQDKEPEEGAFYVWTNQELETFLS 346

Query: 459 EHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
                  +  + +  +GN            F+GK VL   N       +L   +E  L  
Sbjct: 347 PAEFGELQAQFTVTKSGN------------FEGKTVLQRWN-----CDELDPLIETALT- 388

Query: 518 LGECRRKLFDVRSKRPRPHL-------------------------DDKVIVSWNGLVISS 552
                 KLF VR   P   +                         D K+IV+WN L+IS 
Sbjct: 389 ------KLFAVRYGAPPAEVTTFPVAENNQAAKERDWPGRIPAVTDTKMIVAWNALMISG 442

Query: 553 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSFRNG 611
            A+A+++L                D  EY+E+A  AA F+  H + D++ HR+ +   +G
Sbjct: 443 LAKAARVL----------------DNSEYLELATKAAKFVLEHQWVDDRFHRVNY---DG 483

Query: 612 PSKAPGFLDDYAFLISGLLDLYEFGSGTK-----WLVWAIELQNTQDELFLDREGGGYFN 666
                   +DYA LI  L+DL++           WL  A+++QN  D+     E GGYFN
Sbjct: 484 KVAVLSQSEDYALLIKALIDLHQASLQQPELADFWLTNAVQVQNEFDQYLWSVELGGYFN 543

Query: 667 TTGEDP-SVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 725
           T  +D  ++L+R +   D A P+ N V++ NLVRL  +   ++   Y   A  +L  F +
Sbjct: 544 TALDDAETLLIRERSYMDNATPAANGVAIANLVRLFLL---TEDLNYLDRALQALEAFAS 600

Query: 726 RLKDMAMAVPLMCCAAD 742
            ++    A P +  A D
Sbjct: 601 VMRQSPQACPSLFVAFD 617


>gi|338812196|ref|ZP_08624385.1| hypothetical protein ALO_08830 [Acetonema longum DSM 6540]
 gi|337275852|gb|EGO64300.1| hypothetical protein ALO_08830 [Acetonema longum DSM 6540]
          Length = 633

 Score =  365 bits (938), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 226/621 (36%), Positives = 334/621 (53%), Gaps = 53/621 (8%)

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           ME ESFED+ VA LLN  +++IKVDREERPDVD +YM   QAL G GGWPL++ ++PD  
Sbjct: 1   MERESFEDQEVADLLNQDYIAIKVDREERPDVDHIYMQVCQALTGQGGWPLTIMMTPDKS 60

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           P   GTYFP   K+GRPG   IL  +   W ++RD L        E++ +++ A    + 
Sbjct: 61  PFFAGTYFPKNSKWGRPGLMAILTALSQQWRQQRDSLNDYA----EEILKSIDAREPGSP 116

Query: 276 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
               L +  +      L++ +DS +GGF SAPKFP P  +  ++ + +       +GEA 
Sbjct: 117 Y-SLLSEEQVHAAFHGLARYFDSEYGGFSSAPKFPTPHNLLFLMRYWR------HTGEA- 168

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
           +   MV  TLQ M +GGI+DH+G GF RYSVD +W VPHFEKMLYD   L  +Y +AF  
Sbjct: 169 KAMDMVEKTLQSMRRGGIYDHLGFGFARYSVDHQWLVPHFEKMLYDNALLCYIYAEAFQA 228

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           T +  Y+ +  +I+ Y++RDM GP G  +SAEDADS   EG    +EG FY+WT +E+  
Sbjct: 229 TGNKEYAQVAEEIIAYVQRDMTGPAGGFYSAEDADS---EG----EEGKFYLWTKEEILR 281

Query: 456 ILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN-DSSASASKLGMPLEK 513
            LG     +F ++Y++   GN D            G ++L  +  +    A+K+GM  ++
Sbjct: 282 ALGWTQGTIFADYYHVTAEGNFD-----------AGSSILHTIGREPGEYAAKVGMKPDE 330

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
           +  +L + R KL ++R++R  P  DDKV+ SWN L+I++ A+A+++L             
Sbjct: 331 FQAMLQDGREKLRELRNQRVHPFKDDKVLTSWNALMIAALAKAARVL------------- 377

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
              D+ +Y+  A  A +FI  HL   Q  RL    R G S    +LDDYA+L+  +++LY
Sbjct: 378 ---DKPQYLFAASQALNFIEIHL-TRQDGRLLARHRAGESAYLAYLDDYAYLLWAVIELY 433

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           E      +L  A  L     ELF D + GG+F T  +   ++ R KE +DGA PSGNS +
Sbjct: 434 ETTLSAAYLEMAKGLAGNMVELFWDEKQGGFFFTGSDAEKLISRPKEIYDGATPSGNSAA 493

Query: 694 VINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVV 753
              L+RLA I   +         E     F   +     A      A D   +P  ++++
Sbjct: 494 AYALLRLARITEDAD---LLTVVERLFEYFAGEVSQAPRAFTFFLMAFDYYLMPP-QNII 549

Query: 754 LVGHKSSVDFENMLAAAHASY 774
           + G K  +   ++L  A   Y
Sbjct: 550 IAGVKDDIATVSLLKQARKYY 570


>gi|336176843|ref|YP_004582218.1| hypothetical protein [Frankia symbiont of Datisca glomerata]
 gi|334857823|gb|AEH08297.1| hypothetical protein FsymDg_0782 [Frankia symbiont of Datisca
           glomerata]
          Length = 690

 Score =  365 bits (937), Expect = 5e-98,   Method: Compositional matrix adjust.
 Identities = 243/656 (37%), Positives = 326/656 (49%), Gaps = 92/656 (14%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA + SPYLLQHA NPVDW+ WG  AFAEA  RDVP+ LS+GY++CHWCHVM  ESFE
Sbjct: 3   NRLAEQTSPYLLQHADNPVDWWPWGPSAFAEATARDVPVLLSVGYASCHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D   A ++N++FV++KVDREERPDVD VYM    AL G GGWP++VFL+P  +P   GTY
Sbjct: 63  DPDTAAIMNEYFVNVKVDREERPDVDAVYMDVTVALTGHGGWPMTVFLTPAGEPFFAGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPP  + G   F+ +L  V  AW  +RD +A SGA    +++ A   SA     P  L  
Sbjct: 123 FPPAPRPGMSSFRQLLAAVTHAWRTRRDEVAASGADITRRIAAAALGSAGP---PAGLTG 179

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
           + L     ++++S+D   GGFGSAPKFP    ++M+L H  +  D           +MV 
Sbjct: 180 DLLDTAVAKVARSFDPEHGGFGSAPKFPPSALLEMLLRHHARTGDAAS-------LRMVT 232

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            T + MA+GGI+D + GGF RYSVD  W VPHFEKMLYD  QL  VYL  +  T      
Sbjct: 233 TTCERMARGGIYDQLAGGFARYSVDATWTVPHFEKMLYDNAQLLRVYLHLWRATGSPLAE 292

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEG-----------------------ATR 439
            + R+   +L RD+    G   SA DAD+    G                          
Sbjct: 293 RVARETAAFLLRDLGTTEGGFASALDADTVVPAGPGSGGDESPGHNAGGHNAGGHNAGGH 352

Query: 440 KKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPT------GNCDLSRMSDPHNEFKGKNV 493
             EGA YVWT  E+ D+LG     +    +          G+  L   +DP +  +  +V
Sbjct: 353 GAEGATYVWTPAELVDVLGPADGAWAADVFGVTAAGTFEHGSSVLRLPADPDDPGRFASV 412

Query: 494 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSF 553
                                       R +L   R+ RP+P  DDK++ +WNGL I++ 
Sbjct: 413 ----------------------------RERLARARAARPQPARDDKIVAAWNGLAIAAL 444

Query: 554 ARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRR-HLYDEQTHRLQHSFRNGP 612
           A A  +L   A                ++  A SAA+ +R  HL D +  R     R G 
Sbjct: 445 AEAGALLAEPA----------------WVTAATSAATLLRDVHLVDGRLRRTSRHGRVGT 488

Query: 613 SKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDP 672
           +   G L+DY  +  GLL LY+     +WL  A +L       F   +GG  F+ T +D 
Sbjct: 489 NA--GVLEDYGDVAEGLLALYQVTGDEQWLALAGDLLAVVRARFAADDGG--FHDTADDA 544

Query: 673 SVLLRVKED-HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 727
             LLR   D  D   PSG +     L+  A++ A   SD +R+ AEH L V    L
Sbjct: 545 ERLLRRPRDPSDSPTPSGQAAVAGALLTYAALTA---SDEHRRAAEHVLEVLAPLL 597


>gi|186686249|ref|YP_001869445.1| hypothetical protein Npun_R6218 [Nostoc punctiforme PCC 73102]
 gi|186468701|gb|ACC84502.1| protein of unknown function DUF255 [Nostoc punctiforme PCC 73102]
          Length = 685

 Score =  365 bits (937), Expect = 6e-98,   Method: Compositional matrix adjust.
 Identities = 237/664 (35%), Positives = 342/664 (51%), Gaps = 72/664 (10%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRLA   S YL +HA NP+DW+ W +EA A AR ++ PIFLSIGYS+CHWC VME E+F
Sbjct: 2   TNRLAEAKSLYLRKHAENPIDWWPWCDEALATARAQNKPIFLSIGYSSCHWCTVMEGEAF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGG 220
            D  +A  +N  ++ IKVDREERPD+D +YM  +Q + G GGWPL++FLSP DL P   G
Sbjct: 62  SDSAIADYMNANYLPIKVDREERPDLDSIYMQALQMMSGQGGWPLNIFLSPEDLVPFYAG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFP + +YGRPGF  +L+ ++  +D ++  L Q  A  IE L   L+++   +   DEL
Sbjct: 122 TYFPVDPRYGRPGFLQVLQALRRYYDTEKAELQQRKALIIESL---LTSAVLQDGTTDEL 178

Query: 281 PQNALRLCAEQLSKSYDSRFGGFG---SAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS-E 336
               L      L + +++  G      S   FP      M+ Y    L  T  + E+  +
Sbjct: 179 EDREL------LRQGWETSTGVITPGQSGNSFP------MIPYTELALRGTRFNFESRYD 226

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL- 395
           G+++       +A GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ+     + +S  
Sbjct: 227 GKQVCTQRGLDLALGGIYDHVGGGFHRYTVDPTWTVPHFEKMLYDNGQIVEYIANLWSAG 286

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
            ++  +       + +L+R+M  P G  ++++DADS     A   +EGAFYVW+  EV+ 
Sbjct: 287 VQEPAFERAVAVTVQWLKREMTAPEGYFYASQDADSFTEPTAVEPEEGAFYVWSYSEVQQ 346

Query: 456 ILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS--------- 505
           +L  E     ++ + + P GN            F+G+NVL   N    SA+         
Sbjct: 347 LLTPEELTELQQQFTVTPNGN------------FEGRNVLQRRNSGKLSATLETSLSKLF 394

Query: 506 --KLGMPLEKYLNILGECRRKLFDVRSKRPR-PHL-DDKVIVSWNGLVISSFARASKILK 561
             + G+  E        C  +     +   R P + D K+IV+WN L+IS  A+A+ +  
Sbjct: 395 TARYGVSSELLETFPPACNNQEAKTTNWPGRIPSVTDTKMIVAWNSLMISGLAKAAGV-- 452

Query: 562 SEAESAMFNFPVVGSDRKEYMEVAESAASFI-RRHLYDEQTHRLQHSFRNGPSKAPGFLD 620
                  F  P+       Y+E+A  AA+FI      D +  RL +    G        +
Sbjct: 453 -------FQQPL-------YLELAARAANFILENQFVDGRFQRLNY---QGEPTVLAQSE 495

Query: 621 DYAFLISGLLDLYEFGSGTK-WLVWAIELQNTQDELFLDREGGGYFNTTGEDPS-VLLRV 678
           DYAF +  LLDL       K WL  AI +Q+   E     E GGYFNT+ +    +++R 
Sbjct: 496 DYAFFVKALLDLQASNPEHKQWLENAIAIQDEFTEFLWSVELGGYFNTSSDSSQDLIVRE 555

Query: 679 KEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMC 738
           +   D A PS N +++ NLVRLA +        Y   AE  L  F++ +     A P + 
Sbjct: 556 RSYADNATPSANGIAIANLVRLALLTDNLD---YLDLAELGLKAFKSVMHRAPQACPSLF 612

Query: 739 CAAD 742
            A D
Sbjct: 613 TALD 616


>gi|452972836|gb|EME72663.1| hypothetical protein BSONL12_20380 [Bacillus sonorensis L12]
          Length = 627

 Score =  365 bits (936), Expect = 7e-98,   Method: Compositional matrix adjust.
 Identities = 237/643 (36%), Positives = 339/643 (52%), Gaps = 87/643 (13%)

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           M  ESFEDE VA+LLN+ FVSIKVDREERPDVD +YMT  Q + G GGWPL+VFL+P+ K
Sbjct: 1   MAHESFEDEEVAQLLNEKFVSIKVDREERPDVDSIYMTICQMMTGQGGWPLNVFLTPEQK 60

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           P   GTYFP   +Y RPGF  +L+++   + K RD +        E+ +  L   A SN 
Sbjct: 61  PFYAGTYFPKTSRYNRPGFVEVLKQLSATFAKNRDHVEDIA----EKAANNLRIKAKSNA 116

Query: 276 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML-YHSKKLEDTGKSGEA 334
             + L ++ L+   +QL  S+D+ +GGFGSAPKFP P  +  +L YH         SGE 
Sbjct: 117 -GEALGEDILKRTYQQLINSFDTAYGGFGSAPKFPIPHMLTFLLRYHQ-------YSGEE 168

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
           +     V  TL  MA GGI+DH+G GF RYS D+ W VPHFEKMLYD   L   Y +A+ 
Sbjct: 169 N-ALYSVTKTLDSMANGGIYDHIGYGFARYSTDQEWLVPHFEKMLYDNALLLMAYTEAYQ 227

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
           +TK   Y  I   I+ ++RR+M    G  FSA DAD   TEG     EG +Y+W+  E+ 
Sbjct: 228 VTKRERYKRISEQIIAFIRREMTDERGAFFSALDAD---TEGV----EGKYYIWSKDEIT 280

Query: 455 DILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA-SKLGMPLE 512
           + LG E   L+           C +  ++D  N F+G N+   +  S      +  +   
Sbjct: 281 ETLGDELGSLY-----------CAVYDITDEGN-FEGFNIPNLIYTSFEQVRDEFSLTET 328

Query: 513 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 572
           +  N L   R+KLF+ R  R  PH+DDKV+ SWN L+I+  A+ASK+ ++          
Sbjct: 329 ELQNKLEAARQKLFEKRRGRIYPHVDDKVLTSWNALMIAGLAKASKVFEA---------- 378

Query: 573 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 632
                  EY+E+A +A SFI   L   +  R+   +R+G  K  GF+DDYAFL+   L+L
Sbjct: 379 ------PEYLEMARTALSFIEDELI--KDGRVMVRYRDGEVKNKGFIDDYAFLLWSYLEL 430

Query: 633 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 692
           YE       L  A EL     +LF D + GG++ T  +  ++++R KE +DGA PSGN V
Sbjct: 431 YEASLNLPDLRKAKELAGDMIDLFWDEDHGGFYFTGKDAEALIVRDKEVYDGALPSGNGV 490

Query: 693 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPS---- 748
           + + L RL  +                L++ + R+ DM  A        D+ + PS    
Sbjct: 491 AAVQLFRLGRLTG-------------DLSLID-RVSDMFSAF-----HGDVSAYPSGHTN 531

Query: 749 -----------RKHVVLVGHKSSVDFENMLAAAHASYDLNKTV 780
                      +K +V++G +   + +N++ A   ++  N  V
Sbjct: 532 FLQSLLSQMMPQKEIVILGKRDDPNRQNIIRALQQAFQPNYAV 574


>gi|427718285|ref|YP_007066279.1| hypothetical protein Cal7507_3032 [Calothrix sp. PCC 7507]
 gi|427350721|gb|AFY33445.1| hypothetical protein Cal7507_3032 [Calothrix sp. PCC 7507]
          Length = 690

 Score =  365 bits (936), Expect = 7e-98,   Method: Compositional matrix adjust.
 Identities = 244/675 (36%), Positives = 346/675 (51%), Gaps = 89/675 (13%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRLA   S YL +HA NP+DW++W +EA A A+  + PIFLSIGYS+CHWC VME E+F
Sbjct: 2   TNRLAKAQSLYLRKHAENPIDWWSWCDEALATAKADNKPIFLSIGYSSCHWCTVMEGEAF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGG 220
            D  +A+ +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL+VFLSP DL P   G
Sbjct: 62  SDLAIAQYMNTNFLPIKVDREERPDLDSIYMQALQMMNGQGGWPLNVFLSPEDLVPFYAG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL--SEALSASASSNKLPD 278
           TYFP E +YGRPGF  +L+ ++  +D + + L Q  A  +E L  S  L   ++ +   +
Sbjct: 122 TYFPLEPRYGRPGFLQVLQAIRRYYDTETEDLRQRKAVIVESLLTSAVLQDGSTQDIQEN 181

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS--E 336
           EL +     C   ++               FP      M+ Y    L  T +   AS  +
Sbjct: 182 ELLRQGWETCTGVITPHQQGN--------SFP------MIPYAELALRGT-RFNFASHYD 226

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS-- 394
           G+++       +A GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ+     + +S  
Sbjct: 227 GKQICQQRGLDLALGGIYDHVGGGFHRYTVDPTWTVPHFEKMLYDNGQIVEYLANLWSAG 286

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
           + +  F   I + + ++L+R+M  P G  ++A+DADS     A   +EGAFYVWT  E+ 
Sbjct: 287 VQEPAFARAIAKTV-EWLQREMTAPAGYFYAAQDADSFINPTAVEPEEGAFYVWTYSELA 345

Query: 455 DILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
            +L  E     ++ + + P GN            F+ KNVL  L+     + +L   LEK
Sbjct: 346 KLLTPEELTELQQQFTVTPHGN------------FESKNVLQRLH-----SGELSKTLEK 388

Query: 514 YLNILGECRRKL-------FDVRSK-----------RPRPHLDDKVIVSWNGLVISSFAR 555
            L  L + R  +       F   S            R     D K+IV+WN L+IS  AR
Sbjct: 389 ALGKLFKARYGITPESLDTFPPASNNQEAKTNNWPGRIPSVTDTKMIVAWNSLMISGLAR 448

Query: 556 ASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFI-RRHLYDEQTHRLQHSFRNGPSK 614
           AS +         F  P+       Y+++A  AA+FI      D + HRL +    G   
Sbjct: 449 ASGV---------FQQPL-------YLQIAARAANFIWDNQFVDGRFHRLNYV---GQPN 489

Query: 615 APGFLDDYAFLISGLLDLYEFG------SGTKWLVWAIELQNTQDELFLDREGGGYFNTT 668
                +DYA  I  LLDL++        S + WL  AI LQ+  D      E GGY+N +
Sbjct: 490 VLAQSEDYALFIKALLDLHQATLLIGNESASFWLEKAIALQDEFDAYLWSVELGGYYNAS 549

Query: 669 GE-DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 727
            +    +++R +   D A PS N V++ NLVRL  +   + + +Y   AE  L  F+T +
Sbjct: 550 IDASQDLIVRERSYADNATPSANGVAIANLVRLTLL---TDNLHYLDLAEQGLKAFKTVM 606

Query: 728 KDMAMAVPLMCCAAD 742
                A P +  A D
Sbjct: 607 SRSPQACPSLFTALD 621


>gi|354566297|ref|ZP_08985470.1| hypothetical protein FJSC11DRAFT_1676 [Fischerella sp. JSC-11]
 gi|353546805|gb|EHC16253.1| hypothetical protein FJSC11DRAFT_1676 [Fischerella sp. JSC-11]
          Length = 691

 Score =  365 bits (936), Expect = 7e-98,   Method: Compositional matrix adjust.
 Identities = 245/674 (36%), Positives = 346/674 (51%), Gaps = 86/674 (12%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRLA   S YL +HA NP+DW+ W +EA + A+ ++ PIFLSIGYS+CHWC VME E+F
Sbjct: 2   TNRLAEAKSLYLRKHAENPIDWWPWCDEALSTAKAQNKPIFLSIGYSSCHWCTVMEGEAF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGG 220
            D G+A+ +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL+ FLSP DL P   G
Sbjct: 62  SDPGIAEYMNANFIPIKVDREERPDIDSIYMQALQMMSGQGGWPLNAFLSPDDLVPFYAG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL--SEALSASASSNKLPD 278
           TYFP E +YGRPGF  +L+ ++  +D ++  L    A  +E L  S  L    ++     
Sbjct: 122 TYFPVEPRYGRPGFLQVLQAIRHYYDTEKQDLRDRKAVILESLLTSAVLQQQGTTATQDK 181

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
           EL           ++    +++G       FP     ++ L    + E T +     +G+
Sbjct: 182 ELLHKGRETSTGIITP---NQYGN-----SFPMIPYAELAL-RGTRFEVTSE----YDGK 228

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS--LT 396
           ++       +A GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ+     + +S  + 
Sbjct: 229 QVCTQRGLDLALGGIYDHVGGGFHRYTVDPTWTVPHFEKMLYDNGQIVEYLANLWSAGIE 288

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADS------AETEGATRKKEGAFYVWTS 450
           +  F   I   +  +L+R+M  P G  ++A+DADS         +G +  +EGAFYVWT 
Sbjct: 289 EPAFKRAIAGTV-QWLKREMTAPEGYFYAAQDADSFTPPYQGGDKGGSEPEEGAFYVWTF 347

Query: 451 KEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGM 509
            E+E +L  E  I  ++ + +   GN            F+ KNVL        SA+    
Sbjct: 348 SELEQLLTAEELIELQQQFTVTANGN------------FESKNVLQRRRSGELSAT---- 391

Query: 510 PLEKYLNILGECR--------------RKLFDVRSK----RPRPHLDDKVIVSWNGLVIS 551
            +E  L  L   R              R   + +S+    R     D K+IV+WN L+IS
Sbjct: 392 -VETALKKLFVARYGATPESLETFPPARNNQEAKSRHWPGRIPAVTDTKMIVAWNSLMIS 450

Query: 552 SFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSFRN 610
             ARA          A+F  PV       Y+E+A +AA FI  H + D + HRL  ++ N
Sbjct: 451 GLARA---------YAVFREPV-------YLELATTAADFIVNHQFVDGRFHRL--NYEN 492

Query: 611 GPSKAPGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREGGGYFNTTG 669
            P+      +DYAF I  LLDL        KWL  AI LQ   DE     E GGY+NT+ 
Sbjct: 493 QPT-VLAQSEDYAFFIKALLDLQTCSPEQNKWLERAIALQEEFDEYLWSVELGGYYNTSS 551

Query: 670 E-DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLK 728
           +    +++R +   D A PS N V++ NLVRLA     + + +Y   AE  L  F + + 
Sbjct: 552 DASQDLIVRERSYVDNATPSANGVAIANLVRLALF---TDNLHYLDLAEQGLNAFRSVMN 608

Query: 729 DMAMAVPLMCCAAD 742
               A P +  A D
Sbjct: 609 STPQACPSLFTALD 622


>gi|428224685|ref|YP_007108782.1| hypothetical protein GEI7407_1235 [Geitlerinema sp. PCC 7407]
 gi|427984586|gb|AFY65730.1| hypothetical protein GEI7407_1235 [Geitlerinema sp. PCC 7407]
          Length = 682

 Score =  364 bits (935), Expect = 9e-98,   Method: Compositional matrix adjust.
 Identities = 246/706 (34%), Positives = 360/706 (50%), Gaps = 86/706 (12%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRLA   S YL +HA NP+DW+ W +EA A+AR+ + PIFLSIGYS+CHWC VME E+F
Sbjct: 2   TNRLAHAKSLYLRKHAENPIDWWPWCDEAIAKARQENKPIFLSIGYSSCHWCTVMEGEAF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGG 220
            +  +A  +ND+FV IKVDREERPD+D +YM  +Q + G GGWPL+VFL+P DL P  GG
Sbjct: 62  SNGAIAAYMNDFFVPIKVDREERPDLDSIYMQSLQLMVGQGGWPLNVFLAPDDLVPFYGG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFP + +YGRPGF  +L+ ++  +D ++D ++      +E L EA S            
Sbjct: 122 TYFPVDPRYGRPGFLQVLQAIRRHFDTEKDKVSAVKQEILEHLQEAGSLE---------- 171

Query: 281 PQNALRLCAEQLSKSYDSRFG---GFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEG 337
           P     L  + L+KS +   G     G  P FP      M+ Y       T  S E  + 
Sbjct: 172 PGQGSDLTHDLLAKSLEYSTGILSARGPGPSFP------MIPYGEAAQRATRLSLERYDA 225

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ----LANVYLDAF 393
             +     + +A GGI+DHV GGFHRY+VD  W VPHFEKMLYD GQ    LAN +  A 
Sbjct: 226 GTICQQRGEHLALGGIYDHVAGGFHRYTVDPTWTVPHFEKMLYDNGQILEYLANEW--AR 283

Query: 394 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
            +T+  F   I   +  +L+R+M    G  ++A+DAD+  +  A   +EG FYVW   E+
Sbjct: 284 GVTEPAFERAIAGTV-TWLKREMTDAQGYFYAAQDADNFTSPEALEPEEGDFYVWRYDEL 342

Query: 454 EDIL--GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS-------- 503
             +L   E A L +E + + P+GN            F+G+NVL    + S S        
Sbjct: 343 AALLTPAELAAL-QEEFTVTPSGN------------FEGRNVLQRSREGSLSEVAEAALA 389

Query: 504 ---ASKLGMPLEKYLNILGECRRKLFDVRS--KRPRPHLDDKVIVSWNGLVISSFARASK 558
              A + G P             ++   ++   R  P  D K+I +WN L+IS  ARA+ 
Sbjct: 390 KLFAVRYGAPPVAVPTFPPAPSAQVAKTQTWPGRIPPVTDTKMIAAWNSLMISGLARAAA 449

Query: 559 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE-QTHRLQHSFRNGPSKAPG 617
           + +                R+EY ++A  AA F+  H + E + HRL +   +G +    
Sbjct: 450 VWQ----------------REEYYQLAAGAARFLLAHQWVEGRFHRLNY---DGEASVLA 490

Query: 618 FLDDYAFLISGLLDLYEFGSGTK-WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 676
             +DYA  I  L+DL +   G + W+  A+++Q   D L    EGG Y         +++
Sbjct: 491 QSEDYALFIKALIDLDQARPGAEDWIEQAVKVQREFDALLGAEEGGYYNAARDRSQDLVI 550

Query: 677 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 736
           R +   D A P+ NS+++ NLVRLA +   ++   Y   AE +L  F   +     A P 
Sbjct: 551 RERSYADNATPAPNSIAIANLVRLALL---TEDLSYLDRAEKALQSFSAPMARSPQACPS 607

Query: 737 MCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKTVSK 782
           M  A D+     R H+++   +++ D    LAA +    + K   +
Sbjct: 608 MFGALDLY----RNHLLI---RATPDVLQTLAARYCPTAVYKVADE 646


>gi|220935906|ref|YP_002514805.1| hypothetical protein Tgr7_2744 [Thioalkalivibrio sulfidophilus
           HL-EbGr7]
 gi|219997216|gb|ACL73818.1| conserved hypothetical protein [Thioalkalivibrio sulfidophilus
           HL-EbGr7]
          Length = 676

 Score =  364 bits (934), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 240/667 (35%), Positives = 351/667 (52%), Gaps = 57/667 (8%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
            + +NRLA E SPYLLQHA NPVDW+ WG EA  +A+  D PI LSIGYS CHWCHVM  
Sbjct: 3   EQTSNRLANETSPYLLQHADNPVDWYPWGPEALDKAKAEDKPILLSIGYSACHWCHVMAH 62

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMT-YVQALYGGGGWPLSVFLSPDLKPL 217
           ESFED   A+++N  +V+IKVDREERPD+DK+Y T +       GGWPL++FL+PD  P 
Sbjct: 63  ESFEDPATAQVMNRLYVNIKVDREERPDLDKIYQTAHFMLSQRSGGWPLTMFLTPDQVPF 122

Query: 218 MGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLP 277
            GGTYFP   ++G P F+ +L ++   + ++RD + +  A     L  AL+   S     
Sbjct: 123 FGGTYFPDAPRHGLPAFRDLLERIAGFYHERRDEIERQNA----SLQGALTGLFSPRGH- 177

Query: 278 DELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEG 337
           D L    L      +++ +D R GGFG+ PKFP P  ++ +L H  +  D          
Sbjct: 178 DPLNSAVLDTVRSAIAQQFDERDGGFGTPPKFPHPSTLERLLRHHAQTHD-------ERA 230

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
           + M  FTL+ MA+GG++D + GGF RYS D +W +PHFEKMLYD G L  +Y  A++ T 
Sbjct: 231 RYMACFTLEKMARGGLNDQLAGGFCRYSTDGQWMIPHFEKMLYDNGPLLALYAQAYAATG 290

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           D +++ +      +  + M  P G  +SA DADS   EG    +EG +YVW  +EV  ++
Sbjct: 291 DAYFADVAGRTAAWAVQTMQSPEGGFYSALDADS---EG----EEGRYYVWQPEEVRKLV 343

Query: 458 GEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
            E    +F   Y L    N            F+G+  L         A + G        
Sbjct: 344 PEEVYPVFARVYGLDRGPN------------FEGRWHLHSFVTPEQLAKESGTDEATIEA 391

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
           ++   R  L   R KR  P LDDK++ SWN L+I   A A++ L                
Sbjct: 392 MIEAARAPLLAARDKRVPPGLDDKILTSWNALMIRGLAVAARHLG--------------- 436

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
            R E+++ A  A  FIR  L+  +  RL  +++NG ++   +LDD+A+L+  LL+L +  
Sbjct: 437 -RSEWVDAASRALDFIRAQLW--RDGRLLATYKNGSARLSAYLDDHAYLLDALLELLQVR 493

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
             T+ LV+A E+       F D E GG+F T  +  +++ R K   D A PSGN V+ + 
Sbjct: 494 WRTEDLVFAREIAEILLAHFEDSEHGGFFFTADDHEALIQRPKTFADEAMPSGNGVAALA 553

Query: 697 LVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA-VPLMCCAADMLSVPSRKHVVLV 755
           L RL  ++   +   Y + AE ++ +  T +    MA   L+    + L +P  K V+L 
Sbjct: 554 LNRLGHLLGEPR---YVEAAERTVRLATTLMDQAPMAHASLISAFEEQLYLP--KLVILR 608

Query: 756 GHKSSVD 762
           G    ++
Sbjct: 609 GEAQRIE 615


>gi|407975443|ref|ZP_11156348.1| hypothetical protein NA8A_14074 [Nitratireductor indicus C115]
 gi|407429071|gb|EKF41750.1| hypothetical protein NA8A_14074 [Nitratireductor indicus C115]
          Length = 673

 Score =  364 bits (934), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 231/616 (37%), Positives = 325/616 (52%), Gaps = 68/616 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L  E SPYLLQH  NPV W  W + A  EAR+ + PI LS+GY+ CHWCHVM  ESFE
Sbjct: 8   NLLGEETSPYLLQHKDNPVHWRPWSKAALDEARELNRPILLSVGYAACHWCHVMAHESFE 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           ++ VA ++N  FV+IKVDREERP++D++YM  + A    GGWPL++FLSPD KP  GGTY
Sbjct: 68  NDQVADVMNRLFVNIKVDREERPEIDQIYMAALSATGEQGGWPLTMFLSPDGKPFWGGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAW-DKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           FPP+ +YGRPGF  +L  V  AW +K RD+   SG  + E+L + + A  S        P
Sbjct: 128 FPPQQRYGRPGFIEVLNAVHTAWLEKNRDL---SG--SAERLHDHVKARLSPPSAEGFDP 182

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           Q+A+   AE++    D   GG   APKFP    IQ++      L+   +S   S     V
Sbjct: 183 QSAVTDLAERIHGMIDQDMGGLRGAPKFPNMPFIQILWL--SWLQTGNQSHRDS-----V 235

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
           + +L+ M  GGI+DHVGGG  RYS D  W VPHFEKMLYD  QL  +    F  T+D  +
Sbjct: 236 ITSLKRMLSGGIYDHVGGGLARYSTDANWLVPHFEKMLYDNAQLLRLLSWVFGETEDELF 295

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHA 461
                +++++L RDM   GG   S+ DADS   EGA    EG  Y+W+  ++E +LG   
Sbjct: 296 RIRIEEVINFLLRDMRVNGGAFASSLDADS---EGA----EGKAYLWSRLQIEAVLGSRT 348

Query: 462 ILFKEHYYL-KPT---GNCDLSRMSDPHNEFKGKNVLIEL-NDSSASASKLGMPLEKYLN 516
             F   + L KP    G+  L R++  H EF+G +    L ND +A              
Sbjct: 349 EAFLSTFELTKPDDWHGDPVLHRLA--HPEFQGTDTENALRNDLNA-------------- 392

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
                   L   R+ R +P  DDKV+V WNGL I++ A  ++  +               
Sbjct: 393 --------LLSTRAGRIQPGRDDKVLVDWNGLAIAAIANCARQFQ--------------- 429

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFG 636
            R+++++ A++A  F+   +   ++ RL HS R G    P    DYA +IS    LY+  
Sbjct: 430 -RQDWLDAAKAAFHFVCESM---ESRRLPHSIRLGKRLFPALSSDYAAMISAATALYQAT 485

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVIN 696
               +L  A E   T      D E  G++ T+ +   V LR++ D D A PS  ++ +  
Sbjct: 486 RKRGFLDQASEWFETLKSWNADEENAGFYLTSSDASDVPLRIRGDVDEAMPSATALIIEA 545

Query: 697 LVRLASIVAGSKSDYY 712
           +  LA++    K + Y
Sbjct: 546 MCGLAALSGDDKVEEY 561


>gi|407980032|ref|ZP_11160833.1| thioredoxin [Bacillus sp. HYC-10]
 gi|407413294|gb|EKF35013.1| thioredoxin [Bacillus sp. HYC-10]
          Length = 627

 Score =  364 bits (934), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 213/558 (38%), Positives = 305/558 (54%), Gaps = 63/558 (11%)

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           M  ESFED+ VA +LN+ F+SIKVDREERPD+D +YM+  Q + G GGWPL+VF++PD K
Sbjct: 1   MAHESFEDQQVADILNEHFISIKVDREERPDIDSMYMSVCQMMTGQGGWPLNVFVTPDQK 60

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSAS---AS 272
           P   GTYFP    YGRPGF   L ++ DA+   RD         IE L+E  + +    +
Sbjct: 61  PFYAGTYFPKRSAYGRPGFIEALTQLLDAYHNDRD--------HIESLAEKATNNLRIKA 112

Query: 273 SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSG 332
           + +  + L Q ++     QL  S+D+ +GGFGSAPKFP P    M+ +  +  E TG+  
Sbjct: 113 AGQTENTLTQESIHKAYYQLMSSFDTLYGGFGSAPKFPAP---HMLSFLMRYFEWTGQEN 169

Query: 333 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 392
                 K    TL  MA GGI+DH+G GF RYS DE+W VPHFEKMLYD   L + Y +A
Sbjct: 170 ALYAVTK----TLNGMANGGIYDHIGSGFTRYSTDEKWLVPHFEKMLYDNALLIDAYTEA 225

Query: 393 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 452
           + +T+   Y  + +D++ +++RDM+   G  +SA DADS   EG    KEG +YVWT +E
Sbjct: 226 YQITQHPEYEKLVQDLIQFIKRDMMNRDGSFYSAIDADS---EG----KEGQYYVWTKEE 278

Query: 453 VEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKN----VLIELNDSSASASKL 507
           +   LG+    LF   Y++   GN            F+G+N    +    +D  A+ S  
Sbjct: 279 IMTHLGDDLGTLFCAVYHITEEGN------------FEGQNIPHTISTSFDDIKAAYSID 326

Query: 508 GMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESA 567
              L   L      R  L  VR +RP P +DDKV+ SWN L+IS+ A+A  +   E    
Sbjct: 327 DKTLHSKLQ---SARHILLTVRQQRPAPLIDDKVLTSWNALMISALAKAGSVFHVE---- 379

Query: 568 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 627
                       E + +A+ A SF+  HL   Q  RL   +R G  K  GF++DYA +++
Sbjct: 380 ------------EAIRMAKQAMSFLETHLV--QQERLMVRYREGDVKHLGFIEDYAHMLT 425

Query: 628 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 687
             + LYE      WL  A        ELF D + GG+F +  +  ++++R KE +DGA P
Sbjct: 426 AYMSLYEATFDLDWLTKARAAAENMFELFWDEQIGGFFFSGSDAEALIVREKEVYDGAMP 485

Query: 688 SGNSVSVINLVRLASIVA 705
           SGNS ++  L++L+ ++ 
Sbjct: 486 SGNSTALQKLLKLSRMIG 503


>gi|291569597|dbj|BAI91869.1| hypothetical protein [Arthrospira platensis NIES-39]
          Length = 686

 Score =  364 bits (934), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 232/666 (34%), Positives = 348/666 (52%), Gaps = 75/666 (11%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           +NRLA   S YL +HA NP+DW+ W +EA  ++R  D PIFLSIGYS+CHWC VME E+F
Sbjct: 2   SNRLAQSSSLYLRKHADNPIDWWPWCDEALEKSRTEDKPIFLSIGYSSCHWCTVMEGEAF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGG 220
            D  +A+ +N  F+ IKVDREERP++D +YM  +Q + G GGWPL+VFL+P D  P  GG
Sbjct: 62  SDAAIAEYMNANFIPIKVDREERPEIDSIYMQALQMMTGQGGWPLNVFLTPGDRIPFYGG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFP E +YGRPGF  +L+ + + +   ++ L       + QL +++         P EL
Sbjct: 122 TYFPIEPRYGRPGFLDLLKAIHNFYHTDKNKLETVTEEILTQLRQSVILP------PSEL 175

Query: 281 PQNALRLCAEQLSKSYD-SRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
            ++ L+   E  +     + +GG    P+FP      M    S+ +  +   G+A+  Q+
Sbjct: 176 TEDLLKQGLETNTGVVGRNNYGG----PRFPMIPYADMAWRGSRLISSSKVDGKAACLQR 231

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL-TKD 398
                 + +  GGI+DHV GGFHRY+VD  W VPHFEKMLYD GQ+     D +S   K 
Sbjct: 232 G-----KDLVTGGIYDHVAGGFHRYTVDPTWTVPHFEKMLYDNGQILEFLADLWSEGEKQ 286

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL- 457
             +       +++L+R+M  P G  ++A+DADS  T      +EGAFYVWT++E+E  L 
Sbjct: 287 PAFQRSINGTVEWLKREMTAPQGYFYAAQDADSFVTSQDKEPEEGAFYVWTNQELETFLT 346

Query: 458 GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV-----------LIELNDSSASASK 506
            E     +  + +  +GN            F+GK V           LIE   +   A +
Sbjct: 347 SEEFGELQAQFTVTKSGN------------FEGKTVLQRWNCDELDPLIETALAKLFAVR 394

Query: 507 LGMPLEKYLNI-LGECRR--KLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 563
            G P E+     + E  +  K  D   + P    D K+IV+WN L+IS  A+A+++    
Sbjct: 395 YGAPPEEVKTFPVAENNQGAKQRDWPGRIP-AVTDTKMIVAWNALMISGLAKAARVF--- 450

Query: 564 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSFRNGPSKAPGFLDDY 622
                        D  EY+E+A +AA FI +H + D++ HR+ +   +G        +DY
Sbjct: 451 -------------DNSEYLELATTAAKFILKHQWVDDRFHRVNY---DGQVAVLSQAEDY 494

Query: 623 AFLISGLLDLYEFGSGTK-----WLVWAIELQNTQDELFLDREGGGYFNTTGEDP-SVLL 676
           A  +  L+DL++           WL  A+ +Q+  DE     E GGYFNT  +D  ++L+
Sbjct: 495 ALFVKALIDLHQASLQQPELAEFWLTNAVNVQSELDEYLWSMELGGYFNTALDDAETLLI 554

Query: 677 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 736
           R +   D A P+ N V++ NLVRL  +   ++   Y   A  +L  F + ++    A P 
Sbjct: 555 RERSYMDNATPAANGVAIANLVRLFLL---TEDLNYLDRAGQALEAFASIMRQSPQACPS 611

Query: 737 MCCAAD 742
           +  A D
Sbjct: 612 LFVAFD 617


>gi|121604944|ref|YP_982273.1| hypothetical protein Pnap_2043 [Polaromonas naphthalenivorans CJ2]
 gi|120593913|gb|ABM37352.1| protein of unknown function DUF255 [Polaromonas naphthalenivorans
           CJ2]
          Length = 610

 Score =  364 bits (934), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 228/649 (35%), Positives = 347/649 (53%), Gaps = 52/649 (8%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           +NRLA++ S YLLQHA  PVDW+ WG+EA A AR+R +PI LSIGY+ CHWCHVM  ESF
Sbjct: 2   SNRLASQQSAYLLQHAGQPVDWYPWGDEALALARRRGLPILLSIGYAACHWCHVMAAESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQAL-YGGGGWPLSVFLSPDLKPLMGG 220
            D  +A L+N+ FV+IKVDREERPD+D VY    Q L   GGGWPL++FLSP   P   G
Sbjct: 62  SDPAIAALMNEGFVNIKVDREERPDLDAVYQMAHQLLRRTGGGWPLTIFLSPQGVPFYSG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFP     G+  F+ +L  V   W ++R  LA+      +Q   A  A+++  +    +
Sbjct: 122 TYFPSAAPEGQATFQAVLGSVSAVWREQRPALARQ-----DQALLAALAASAPRRDDAAV 176

Query: 281 PQNALRLCA-EQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
           P  A+R  A +QL+ ++D   GGFG+APKFP P ++  +L  +++  D       ++ ++
Sbjct: 177 PGAAVRAQALQQLATAFDPAQGGFGAAPKFPHPSDLAFLLRRAREEGD-------AQARE 229

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           M L TL+ MA+GG++D +GGGF RYSVD +W +PHFEKML D G L  +Y DA +LT + 
Sbjct: 230 MALLTLRKMAEGGLYDQIGGGFFRYSVDAQWRIPHFEKMLCDNGVLLALYADALALTGEP 289

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 459
            +  +  D   +  R+M    G   ++  AD A+       +EG FYVW S+ +   L  
Sbjct: 290 LFRRVVEDTASWALREMQSSAGGFHASLAADDAQ------GREGRFYVWESEPLRLALSP 343

Query: 460 HAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS-ASKLGMPLEKYLNI 517
           +   +   H+ L          +  P   F+G++  + +  ++   A  L  P  +   +
Sbjct: 344 NEWDVCAAHWGL----------VDGPG--FEGRHWHLRVARAAGPLAVTLRRPEAQVEEL 391

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
           +   R KL   R KR RP  D K++  W  L+++  ARAS + +                
Sbjct: 392 IASARPKLLAERDKRERPARDAKLLTGWTALMMTGLARASAVCQ---------------- 435

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGS 637
           R E++  A SA  F++   + +      H     P +A  FLDD+AFL+  +L L++   
Sbjct: 436 RPEWLLAARSALRFVQAGRWQDDGRTSGHLLAL-PGQA-AFLDDHAFLLEAVLALHDADP 493

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
               L +A  +       F DR+ GG+F T  + P+++ R+K   D A PSGN  + + L
Sbjct: 494 QPGDLPFAQAIAKAMLAQFEDRDAGGFFFTRHDAPALIHRLKTGLDAATPSGNGTAALAL 553

Query: 698 VRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSV 746
           + L+  +   ++  YR  AE  + VF   + +   + P +  AA++L  
Sbjct: 554 LALSGKLDAPQAAAYRLAAERCVRVFAATVLNDPASFPRLLQAAELLQA 602


>gi|423065340|ref|ZP_17054130.1| hypothetical protein SPLC1_S240900 [Arthrospira platensis C1]
 gi|406713250|gb|EKD08422.1| hypothetical protein SPLC1_S240900 [Arthrospira platensis C1]
          Length = 686

 Score =  363 bits (933), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 234/677 (34%), Positives = 345/677 (50%), Gaps = 97/677 (14%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           +NRLA   S YL +HA NP+DW+ W +EA  ++R  D PIFLSIGYS+CHWC VME E+F
Sbjct: 2   SNRLAQSSSLYLRKHADNPIDWWPWCDEALEKSRTEDKPIFLSIGYSSCHWCTVMEGEAF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGG 220
            D  +A+ +N  F+ IKVDREERP++D +YM  +Q + G GGWPL+VFL+P D  P  GG
Sbjct: 62  SDAAIAEYMNANFIPIKVDREERPEIDSIYMQALQMMTGQGGWPLNVFLTPGDRIPFYGG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFP E +YGRPGF  +L+ + + +   ++ L       + QL +++         P EL
Sbjct: 122 TYFPIEPRYGRPGFLDLLKAIHNFYQTDKNKLETVTEEILTQLRQSMILP------PSEL 175

Query: 281 PQNALRLCAEQLSKSYD-SRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
            ++ L+   E  +     + +GG    P+FP  +    M +   +L  + K     +G+ 
Sbjct: 176 TEDLLKQGLETNTGVVGRNNYGG----PRFPM-IPYADMAWRGTRLISSPK----VDGKA 226

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS-LTKD 398
             L   + +  GGI+DHV GGFHRY+VD  W VPHFEKMLYD GQ+     D +S   K 
Sbjct: 227 ACLQRGKDLVTGGIYDHVAGGFHRYTVDPTWTVPHFEKMLYDNGQILEFLADLWSDGEKQ 286

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             Y       +++L+R+M  P G  ++A+DADS  T      +EGAFYVWT++E+E  L 
Sbjct: 287 PAYQRAINGTVEWLKREMTAPEGYFYAAQDADSFVTSQDKEPEEGAFYVWTNQELETFLS 346

Query: 459 EHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
                  +  + +  +GN            F+GK VL   N       +L   +E  L  
Sbjct: 347 PAEFGELQAQFTVTKSGN------------FEGKTVLQRWN-----CDELEPLIETAL-- 387

Query: 518 LGECRRKLFDVRSKRPRPHL-------------------------DDKVIVSWNGLVISS 552
                 KLF VR   P   +                         D K+IV+WN L+IS 
Sbjct: 388 -----AKLFAVRYGAPPAEVTTFPVAENNQAAKERDWPGRIPAVTDTKMIVAWNALMISG 442

Query: 553 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSFRNG 611
            A+A+++L                D  EY+E+A  AA F+  H + D++ HR+ +   +G
Sbjct: 443 LAKAARVL----------------DNSEYLELATKAAKFVLEHQWVDDRFHRVNY---DG 483

Query: 612 PSKAPGFLDDYAFLISGLLDLYEFG-----SGTKWLVWAIELQNTQDELFLDREGGGYFN 666
                   +DYA LI  L+DL++           WL  A+++QN  D+     E GGYFN
Sbjct: 484 KVAVLSQSEDYALLIKALIDLHQASLQHPELADFWLTNAVKVQNEFDQYLWSVELGGYFN 543

Query: 667 TTGEDP-SVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 725
           T  +D  ++L+R +   D A P+ N V++ NLVRL  +   ++   Y   A  +L  F +
Sbjct: 544 TALDDAETLLIRERSYMDNATPAANGVAIANLVRLFLL---TEDLNYLDRALQALEAFAS 600

Query: 726 RLKDMAMAVPLMCCAAD 742
            ++    A P +  A D
Sbjct: 601 VMRQSPQACPSLFVAFD 617


>gi|294814700|ref|ZP_06773343.1| DUF255 domain-containing protein [Streptomyces clavuligerus ATCC
           27064]
 gi|326443082|ref|ZP_08217816.1| hypothetical protein SclaA2_18553 [Streptomyces clavuligerus ATCC
           27064]
 gi|294327299|gb|EFG08942.1| DUF255 domain-containing protein [Streptomyces clavuligerus ATCC
           27064]
          Length = 675

 Score =  363 bits (933), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 235/627 (37%), Positives = 326/627 (51%), Gaps = 67/627 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL+ E SPYLLQHA NPVDW+ W  EAF EAR+R VP+ LS+GY++CHWCHVM  ESFE
Sbjct: 3   NRLSHETSPYLLQHADNPVDWWPWTREAFDEARERGVPVLLSVGYASCHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D   A  LN+ FVS+KVDREERPDVD VYM  VQA  G GGWP++VF++ + +P   GTY
Sbjct: 63  DGATAAYLNEHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMTVFMTAEGEPFYFGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPPE ++G P F+ +L  V  AW  +RD + +  A     L+   S +   + +P    Q
Sbjct: 123 FPPEPRHGMPSFRQVLEGVTAAWTGRRDEVDEVAARIRRDLA-GRSLAHGGDGVPGAEEQ 181

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
               +    LS+ YD R GGFG APKFP  + ++ +L H  +   TG   EA+   +M  
Sbjct: 182 ARALIG---LSREYDERHGGFGGAPKFPPSMVLEFLLRHHAR---TGS--EAA--LQMAA 231

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  VY   + LT      
Sbjct: 232 ETAEAMARGGIYDQLGGGFARYSVDREWIVPHFEKMLYDNALLCRVYARLWRLTGAPLAR 291

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            +  +  D++ R++    G   SA DADS   +G   + EGAFYVWT  ++ ++LGE   
Sbjct: 292 RVALETADFMVRELRTAEGGFASALDADSTGADGV--RAEGAFYVWTPAQLTEVLGEE-- 347

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
                         +L  ++D      G +VL    D                      R
Sbjct: 348 --------DGRRAAELYGVTDEGTFEHGTSVLRLPGDDPGPG----------------IR 383

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYM 582
           ++L   R  R RP  DDKV+ +WNGL I++ A                      DR + +
Sbjct: 384 QRLLASRELRERPERDDKVVAAWNGLAIAALAETGAYF----------------DRPDLV 427

Query: 583 EVAESAASFIRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
           E A  AA  + R L+ + + RL  + R+G   +  G L+DY  +  G L L        W
Sbjct: 428 ERATEAADLLVR-LHLDGSARLTRTSRDGRAGRNAGVLEDYGDVAEGFLALASVTGEGVW 486

Query: 642 LVWAIELQNTQDELFLDR---EGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
           L +A  L +    + LDR   E G  ++T  +   ++ R ++  D A PSG + +   L+
Sbjct: 487 LEFAGLLLD----IVLDRFTGENGTLYDTAHDAEQLIRRPQDPTDNAAPSGWTAAAGALL 542

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFET 725
              S  A + S+ +R  AE +L V + 
Sbjct: 543 ---SYAAHTGSEAHRTAAERALGVVKA 566


>gi|257059286|ref|YP_003137174.1| hypothetical protein Cyan8802_1422 [Cyanothece sp. PCC 8802]
 gi|256589452|gb|ACV00339.1| protein of unknown function DUF255 [Cyanothece sp. PCC 8802]
          Length = 688

 Score =  363 bits (933), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 247/680 (36%), Positives = 347/680 (51%), Gaps = 76/680 (11%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           +NRLA   S YL +HA NP+DW+ W EEA   A++ + PIFLSIGYS+CHWC VME E+F
Sbjct: 2   SNRLATAQSLYLRKHADNPIDWWYWCEEALLTAKQSNRPIFLSIGYSSCHWCTVMEGEAF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGG 220
            D+ +A  LND F+ IK+DREERPD+D +YM  VQ +   GGWPL++FL+P DL P  GG
Sbjct: 62  SDQAIAAYLNDNFLPIKLDREERPDLDSLYMQAVQMMGIQGGWPLNIFLTPDDLVPFYGG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFP E +YGRPGF  +L+ ++  +D ++D L    +F      E L     S  LP   
Sbjct: 122 TYFPIEPRYGRPGFLQVLQSIRRFYDTEKDKL---NSFK----HEILDTLQKSAILP--- 171

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPK-FPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
             NA  L  E   +   +        P+ F RP    M+ Y +  L+ +  + ++ E Q 
Sbjct: 172 VTNAELLNNELFYRGITANTEVIIVNPQDFNRPC-FPMIPYANLALQGSRFAFQSQENQA 230

Query: 340 MVLFTL-QCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ----LANVYLDAFS 394
            V +   + +A GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ    LAN++   + 
Sbjct: 231 TVTYQRGEDLALGGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNGQIVEYLANLWSQGYQ 290

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
             +  F   I R + ++L+R+M  P G  ++A+DAD+  T      +EGAFYVW  +E+E
Sbjct: 291 --EPAFKRAIARTV-EWLQREMTAPQGYFYAAQDADNFTTPDEKEPEEGAFYVWKFQELE 347

Query: 455 DILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
           + L  E   L +  + L   GN            F+G NVL        S +   +  + 
Sbjct: 348 EYLNSEEFKLLEATFSLTAEGN------------FEGSNVLQRRMGGEFSEALEAILDKL 395

Query: 514 YLNILGECRRKLF-------------DVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 560
           ++   G  R+ L                   R  P  D K+IV+WN L+IS  ARA  + 
Sbjct: 396 FMIRYGSSRKTLTTFPPAKNNQEAKNQTWPGRIPPVTDTKMIVAWNSLMISGLARAYGV- 454

Query: 561 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSFRNGPSKAPGFL 619
                   F  P+       Y E+A +A  FI +  + + + +RL +    G        
Sbjct: 455 --------FGDPL-------YWELAINATEFILQEQWVNNRLYRLNYE---GQPSVLAQA 496

Query: 620 DDYAFLISGLLDLYEFGSGTK-WLVWAIELQNTQDELFLDREGGGYFNTTGEDPS-VLLR 677
           +DYAF I  LLDL       + WL  A E+Q   DE F   EGGGY+N   ++   +L+R
Sbjct: 497 EDYAFFIKALLDLQRANPWERQWLEKAKEVQEEFDEFFWSIEGGGYYNNASDNSGDLLIR 556

Query: 678 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 737
            +   D A PS N V++ NLVRL+ +        Y   AE  L  F + L     A P +
Sbjct: 557 ERSYIDNATPSANGVALSNLVRLSRLTDDLD---YLHRAEQGLQTFSSVLSQSPKACPSL 613

Query: 738 CCAADML----SVPSRKHVV 753
             A D      SV + K ++
Sbjct: 614 FVALDWYRFGNSVQTTKEIL 633


>gi|334119055|ref|ZP_08493142.1| hypothetical protein MicvaDRAFT_2721 [Microcoleus vaginatus FGP-2]
 gi|333458526|gb|EGK87143.1| hypothetical protein MicvaDRAFT_2721 [Microcoleus vaginatus FGP-2]
          Length = 695

 Score =  363 bits (933), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 243/678 (35%), Positives = 351/678 (51%), Gaps = 90/678 (13%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRLA   S YL +HA NP+DW+ W +EA   AR  + PIFLSIGYS+CHWC VME E+F
Sbjct: 2   VNRLAQSQSLYLRKHAENPIDWWPWCDEALEAARSENKPIFLSIGYSSCHWCTVMEGEAF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK-PLMGG 220
            D  +A+ +N  F+ +KVDREERPD+D +YM  +Q + G GGWPL+VFL+PD + P  GG
Sbjct: 62  SDRAIAEYMNSHFIPVKVDREERPDIDSIYMQTLQMMTGQGGWPLNVFLTPDERVPFYGG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFP E +YGRPGF  +L+ ++  +D ++  +    A  +  L +  + S  + +L  E+
Sbjct: 122 TYFPVEPRYGRPGFLEVLQAIRRFYDTEKGKVEAFKAEILGNLQQTAALSGVTAELNREI 181

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
            Q  L L    ++        G    P FP      M+ Y    L  T  + E+    K 
Sbjct: 182 FQKGLELNTGIVA--------GHNPGPSFP------MIPYAELALRGTRFNFESKYDSKQ 227

Query: 341 VLFTLQC-MAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ----LANVYLDAFSL 395
           V       +A GGI+D VGGGFHRY+VD  W VPHFEKMLYD GQ    LAN++     +
Sbjct: 228 VCTQRGLDLALGGIYDQVGGGFHRYTVDPTWTVPHFEKMLYDNGQIVEYLANLW--GAGI 285

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
            +  F + I   + ++L+R+M  P G  ++A+DADS  T      +EGAFYVWT  E+E 
Sbjct: 286 QEPAFETAIAGTV-EWLKREMTAPTGYFYAAQDADSFNTSEEVEPEEGAFYVWTYAELEQ 344

Query: 456 ILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI-----ELNDSSASA-SKL- 507
           +L  E     K H+ +  +GN            F+GKNVL      +L+D+  +A +KL 
Sbjct: 345 LLTPEELAEIKAHFTVSRSGN------------FEGKNVLQRRHPGKLSDTVKTALAKLF 392

Query: 508 -----GMPLEKYLNILGECRRKLFDVRSKRPR--PHL-DDKVIVSWNGLVISSFARASKI 559
                G P    +      R          P   P + D K+I +WN LVIS  ARA+ +
Sbjct: 393 QVRYGGNP--DSVKTFPPARNNQEAKNESWPGRIPAVTDTKMIAAWNSLVISGLARAAAV 450

Query: 560 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFL 619
             +                 EY+E+A  AA+FI  + + +   R Q    +G S      
Sbjct: 451 FGN----------------WEYLELAVKAANFILDNQWTD--GRFQRLNYDGHSAVTAQS 492

Query: 620 DDYAFLISGLLDLYE----FGSGTK---------WLVWAIELQNTQDELFLDREGGGYFN 666
           +DYA  +  LLDL++     G+G +         WL  A+++Q   DE     E GGY+N
Sbjct: 493 EDYALFVKALLDLHQASLTLGNGEEAKQLPNSQFWLNKAVQVQEEFDEFLWSVELGGYYN 552

Query: 667 TTGEDPS--VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 724
            T +D S  +L+R +   D A P+ N +++ +LVRLA +  G   +Y  + A+  L  F 
Sbjct: 553 -TAKDASGDLLVRERSYIDNATPAANGIAIASLVRLALL--GPNLEYLDR-AQQGLQAFS 608

Query: 725 TRLKDMAMAVPLMCCAAD 742
           + ++D   A P +  A D
Sbjct: 609 SIVQDAPQACPSLLSAID 626


>gi|347535413|ref|YP_004842838.1| hypothetical protein FBFL15_0482 [Flavobacterium branchiophilum
           FL-15]
 gi|345528571|emb|CCB68601.1| Protein of unknown function YyaL [Flavobacterium branchiophilum
           FL-15]
          Length = 674

 Score =  363 bits (932), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 215/621 (34%), Positives = 320/621 (51%), Gaps = 52/621 (8%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           +N L  E SPYLLQHA NP+ W AW   A  ++   +  + +SIGYS CHWCHVME ESF
Sbjct: 2   SNLLHLESSPYLLQHAQNPIHWNAWNNHALQKSINENKLMIVSIGYSACHWCHVMEHESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           E+  VA+++N  FV+IK+DREERPD+D +YM  +Q + G GGWPL++   PD +P+ GGT
Sbjct: 62  ENLEVAQVMNSHFVNIKIDREERPDLDALYMKALQIMTGQGGWPLNMVCLPDGRPVWGGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL- 280
           YF  ED      + T L+++++ ++ + + +        E+L + +       +  D+L 
Sbjct: 122 YFRKED------WTTALKQIQEVFENQPERMLDYA----EKLQKGIDTIGFKPQFHDDLV 171

Query: 281 -PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
             +  L     +  +S+D  FGG   APKF  P    ++L ++ + +D        E   
Sbjct: 172 FSKKTLEDLISKWKRSFDLDFGGMARAPKFMMPNNYVLLLRYADQNQD-------EELLD 224

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
            V  TL  MA GG+ D +GGGF RYSVD +WHVPHFEKMLYD  QL  +Y  AF  T D 
Sbjct: 225 FVHLTLTKMAYGGLFDVLGGGFSRYSVDMKWHVPHFEKMLYDNAQLLFLYAQAFQKTGDP 284

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 459
            Y  +    + ++ ++         +A DADS  ++     +EGAFY+WT  E+  +LG+
Sbjct: 285 LYQEVVEKTIQFIEKEWFTDNKSFCAAYDADSINSQNVL--EEGAFYIWTQDELIALLGD 342

Query: 460 HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
             +LF + + +   G+ +            G  VLI+    +  A K  + L    N   
Sbjct: 343 DYVLFSKIFNINEFGHWE-----------HGHYVLIQNQTLAYWAEKESIDLAVLKNKKQ 391

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
           E  +KL+  R +RP+P LD+KVI SWN L I     A K   +                K
Sbjct: 392 EWEQKLYQKRQQRPKPRLDNKVITSWNALTIKGLVEAYKTFGT----------------K 435

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
           +Y+++A   A FI   L+    H L H ++NG  K  GFL+DYAF+I   + +YE     
Sbjct: 436 KYLQMALQNAQFIAHTLWSPDGH-LWHIYQNGTCKINGFLEDYAFVIEAFIHIYEVTFDE 494

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
            WL+ A  L +   + F D     +   + +DP ++ +  E  D   PS NSV   NL  
Sbjct: 495 DWLLKAKTLTDYTFDYFFDTSKQMFRFNSRKDPELIAQHFEIEDNVIPSSNSVMAHNLNY 554

Query: 700 LASIVAGSKSDYYRQNAEHSL 720
           L+       + YY++ A + L
Sbjct: 555 LS---LAFDNLYYQKTAHNML 572


>gi|13473777|ref|NP_105345.1| hypothetical protein mlr4484 [Mesorhizobium loti MAFF303099]
 gi|14024528|dbj|BAB51131.1| mlr4484 [Mesorhizobium loti MAFF303099]
          Length = 671

 Score =  363 bits (932), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 223/603 (36%), Positives = 313/603 (51%), Gaps = 56/603 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N LA E SPYL QH+ NPV W AW   +  EAR  D PI LS+GY+ CHWCHVM  ESFE
Sbjct: 7   NLLAEEASPYLQQHSGNPVHWRAWSPASLEEARTLDRPILLSVGYAACHWCHVMAHESFE 66

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           ++GVA ++N  FV+IKVDREERPD+D++YM  + ++   GGWPL++FL+PD KP  GGTY
Sbjct: 67  NDGVAAVMNRLFVNIKVDREERPDIDQIYMAALSSMGEQGGWPLTMFLTPDGKPFWGGTY 126

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP E +YGRPGF  ++  V  AW +KRD L QS     + L+  + A  S       L +
Sbjct: 127 FPREARYGRPGFIQVMEAVDKAWREKRDSLHQSA----DGLTSHVEARLSGTHARQSLDR 182

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
            AL   A ++    D   GG   APKFP      + L+ S       + G A+  +  VL
Sbjct: 183 GALTDLAGRIDGMVDRDLGGLRGAPKFPN-APFMLTLWLSWL-----RDGNAAH-RDDVL 235

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            +L+ M  GGI+DH+GGG  RYS D  W VPHFEKMLYD  +L      AFS + +  + 
Sbjct: 236 VSLERMLAGGIYDHIGGGLSRYSTDAEWLVPHFEKMLYDNAELIRFCNWAFSASGNDLFR 295

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
               + +D+L R+M   GG   ++ DADS         +EG FY W  +E++ +LG+ + 
Sbjct: 296 IRIEETVDWLLREMRVEGGAFAASLDADS-------DGEEGLFYTWNRQEIKTVLGDDSA 348

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
           LF +++ L           S PH  ++GK V+ +     A         EK + +    +
Sbjct: 349 LFFKYFTL-----------SAPHG-WEGKPVIHQTRTQQAQGVA---DREKLIPL----K 389

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYM 582
            +L  VR +R RP LD K +  WNGL+I++ A A + L                 R E++
Sbjct: 390 ARLLAVREERVRPGLDAKTLTDWNGLMIAALAEAGRSLG----------------RPEWI 433

Query: 583 EVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWL 642
           E A+ A + I     D    RL HS        P    DYA + +  + L+E      ++
Sbjct: 434 EAADKAFAHISGASRD---GRLPHSMLGTRKLFPALSSDYAAMANAGISLFEASGDWSYI 490

Query: 643 VWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLAS 702
             A +     D  + D  G GY+ T  +   V +R++ D D A  S  S  +  LVRLAS
Sbjct: 491 DQAKQFIEQLDHWYPDPAGTGYYLTASDSTDVPIRIRGDVDEAISSATSQIIAALVRLAS 550

Query: 703 IVA 705
           +  
Sbjct: 551 VTG 553


>gi|288818675|ref|YP_003433023.1| hypothetical protein HTH_1371 [Hydrogenobacter thermophilus TK-6]
 gi|384129427|ref|YP_005512040.1| hypothetical protein [Hydrogenobacter thermophilus TK-6]
 gi|288788075|dbj|BAI69822.1| conserved hypothetical protein [Hydrogenobacter thermophilus TK-6]
 gi|308752264|gb|ADO45747.1| protein of unknown function DUF255 [Hydrogenobacter thermophilus
           TK-6]
          Length = 648

 Score =  363 bits (931), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 222/632 (35%), Positives = 337/632 (53%), Gaps = 53/632 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL    SPYL + A+ PVDW+ W EEAF +A++ D P+ LSIG   CHWCHVM  ESFE
Sbjct: 5   NRLINARSPYLRKSAYQPVDWYEWCEEAFEKAKREDKPVLLSIGGVWCHWCHVMAKESFE 64

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  +AK++N+ FV+IKVDR+ERPD+D+ Y   V AL G GGWPL+ FL+PD K   GGTY
Sbjct: 65  DPEIAKIINENFVAIKVDRDERPDIDRRYQETVIALTGSGGWPLTAFLTPDGKLFFGGTY 124

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPPED++GRPG K++L ++   W ++++ + +S      +L      + SS    D + +
Sbjct: 125 FPPEDRWGRPGLKSLLLRISQLWREEKERILKSADHIFLELQ-----NYSSMTFKDFVDE 179

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L+     L  S D   GG GSAPKF      +++LYH    ++          ++ ++
Sbjct: 180 ELLKRGIGALLSSVDYEKGGIGSAPKFHHAKAFELLLYHYYFTKE-------EIVKRAII 232

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            +L  MAKGGI+DH+ GGF RYS D+ W++PHFEKMLYD  +L  +Y  A+ + ++  Y 
Sbjct: 233 SSLDAMAKGGIYDHLLGGFFRYSTDDTWNIPHFEKMLYDNAELLRLYSLAYQVFENPLYE 292

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
           Y+ + I++Y +       G  ++++DAD    +      EG  Y +TS E+  +L    +
Sbjct: 293 YVAKGIVNYYKLYGSDQEGGFYASQDADIGVLD------EGGHYTFTSDELRLLLDPEEL 346

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
              + Y+    G     RM  PH++   KNVL    D+   +  L +P EK   +L   +
Sbjct: 347 KVVKLYF----GIDTRGRM--PHHQH--KNVLFINMDAQQVSKVLDIPKEKVEELLKSAK 398

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYM 582
            K+   R+ R  P++D  +   WNGL+I +     K+ + E    M              
Sbjct: 399 EKMLSYRNSREIPYIDKTIYTGWNGLMIDALCVYYKVFQDEWSLLM-------------- 444

Query: 583 EVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWL 642
             AE  A+ + +  Y + +  L H+  +G S   G+ +DY +L  GLL L+E      +L
Sbjct: 445 --AEKTANRLIKERYRDGS--LDHT--DGVS---GYSEDYIYLSQGLLSLFEITQNRTYL 495

Query: 643 VWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL-RVKEDHDGAEPSGNSVSVINLVRLA 701
             A EL +   ELF D +G G+F+T  +   +LL + K   D    S N  S   L+ + 
Sbjct: 496 DMAKELLDKAIELFWDDQGWGFFDTHQKGEGLLLIKHKPIQDTPIQSVNGTSPYLLLLME 555

Query: 702 SIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA 733
           +I   +K   Y + AE +L  F   +++M MA
Sbjct: 556 AITGDTK---YGEYAEKNLMAFSRFMREMPMA 584


>gi|295132488|ref|YP_003583164.1| six-hairpin glycosidase [Zunongwangia profunda SM-A87]
 gi|294980503|gb|ADF50968.1| six-hairpin glycosidase [Zunongwangia profunda SM-A87]
          Length = 678

 Score =  362 bits (930), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 218/632 (34%), Positives = 323/632 (51%), Gaps = 48/632 (7%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TN L  E SPYLLQHAHNPVDW AW +    +A+K +  + +S+GYS CHWCHVME ESF
Sbjct: 5   TNDLIYETSPYLLQHAHNPVDWKAWHKTVLEDAKKTNKLLLISVGYSACHWCHVMEHESF 64

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           ED  VA ++N  ++SIKVDREERPD+D+VYM  VQ + G GGWP+++   PD +P+ GGT
Sbjct: 65  EDPEVADIMNAHYISIKVDREERPDIDQVYMQAVQLMTGSGGWPMNIVALPDGRPVWGGT 124

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YF  E       +K+ L +++  + K+   L        E L +       +N    E  
Sbjct: 125 YFRKEQ------WKSALLQIQQIYKKESTQLTNYANKLKEGLQQLNLIDIGNNSY--EFS 176

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           Q  L    E      D + GG  +APKF  P  +  +L ++ + +D        + Q+ V
Sbjct: 177 QKRLGEFIEIWKPYLDMKLGGTKNAPKFMMPTNLDFLLRYAYQFKD-------KKLQEYV 229

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
           L +L  ++ GG  DH+GGGF RYSVD+RWHVPHFEKMLYD  QL ++Y  A+ LT+D +Y
Sbjct: 230 LHSLDKISFGGTFDHIGGGFARYSVDDRWHVPHFEKMLYDNAQLLSLYSKAYKLTQDHWY 289

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHA 461
             + +    ++  ++    G  +SA DADS   +G   ++EGAFY W  +E+E++L    
Sbjct: 290 KEVIKKTARFIETELTDSTGAFYSALDADSENAKG--NQEEGAFYTWKKEELEELLASEF 347

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
            LF  ++ +   G  +            G  +L +         K  + LE+        
Sbjct: 348 DLFSAYFNINARGYWE-----------NGNYILYKTEKDDDFTKKHNISLEELYQKKSNW 396

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
            + L + R KR +P LDDK + SWN L ++ FA A                   + +  Y
Sbjct: 397 TKILSEARKKRKKPGLDDKTLTSWNALSLNGFAEA----------------YTATGKNHY 440

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
           + +A   A FI ++  +   + L HS++N  SK   +L+DYAF I   L LYE     KW
Sbjct: 441 LNIALKNAEFIIQNQLNPD-YSLFHSYKNKQSKINAYLEDYAFTIEAFLKLYEVTFDKKW 499

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           +  +  L     E F ++E   +  T+ +D +++    E  D   P+ NSV   NL RL 
Sbjct: 500 IDISSHLTKYCFENFYNQENTLFNFTSKKDDALISTPIELTDNVIPASNSVMANNLFRLG 559

Query: 702 SIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA 733
            +   S+   Y + +E  L V   ++    M 
Sbjct: 560 RLTGTSR---YLEVSEKMLQVISGKIGSYPMG 588


>gi|390440171|ref|ZP_10228522.1| Six-hairpin glycosidase-like [Microcystis sp. T1-4]
 gi|389836455|emb|CCI32648.1| Six-hairpin glycosidase-like [Microcystis sp. T1-4]
          Length = 692

 Score =  362 bits (930), Expect = 4e-97,   Method: Compositional matrix adjust.
 Identities = 242/669 (36%), Positives = 343/669 (51%), Gaps = 82/669 (12%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N LAA  S YL +HA NP+DW+ W + A   AR+ D PIFLSIGYS+CHWC VME E+F 
Sbjct: 3   NHLAASESLYLRKHAENPIDWWYWCDSALEIARREDKPIFLSIGYSSCHWCTVMEGEAFS 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPD-LKPLMGGT 221
           D  +A  LN +F+ IKVDREERPD+D +YM  +Q + G GGWPL+VFL+PD L P  GGT
Sbjct: 63  DRAIADYLNQYFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLNVFLTPDSLIPFYGGT 122

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP + ++ RPGF  +L+ V+  +D++++ L++   F  E L  AL  SA   +    L 
Sbjct: 123 YFPVQPRFNRPGFLQVLQSVRRYYDEEKEKLSK---FTAEMLG-ALRQSAILPRAETNLA 178

Query: 282 QNALRLCA-EQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK---KLEDTGKSGEASEG 337
           + +L     E  +K        +G  P FP      + L  S+     +D+ +      G
Sbjct: 179 EPSLLATGIETNTKVIRVNPNNYGR-PSFPMIPYSHLALQGSRFGDDFDDSLRQAAYQRG 237

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL-T 396
           + + L        GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ+     + +S   
Sbjct: 238 EDLAL--------GGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNGQIVEYLANLWSAGN 289

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
           ++  +    +  +++L+R+M  P G  ++A+DADS E       +EGAFYVW+ + + D 
Sbjct: 290 REAAFERGIKGTVNWLKREMTAPEGYFYAAQDADSFEKATDGEPEEGAFYVWSDRSLRDY 349

Query: 457 LG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
           L  E   L + ++ +   GN            F+G+NVL           KLG  +E  L
Sbjct: 350 LSTEELGLLQANFTVTAEGN------------FEGRNVL-----QRRQGGKLGKEIENML 392

Query: 516 NIL-----GECRRKLFDVRSKRPRPH-------------LDDKVIVSWNGLVISSFARAS 557
           + L     G  + +L      R                  D K+IV+WN L+IS  ARA 
Sbjct: 393 DKLFIRRYGSSQSQLALFPPARDNQEAKTVSWPGRIPAVTDTKMIVAWNSLMISGLARA- 451

Query: 558 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSFRNGPSKAP 616
                    A+F  P+       Y ++A  AA FI +H + D +  RL +    G +   
Sbjct: 452 --------FAVFGEPL-------YWQMATVAAEFILKHQWLDGRFQRLNY---QGQASVL 493

Query: 617 GFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL 675
              +D+A+ I  LLDL       T WL  AI+LQ   D  F   + GGYFN T  D S+ 
Sbjct: 494 AQSEDFAYFIKALLDLQTANPQETGWLEAAIDLQGEFDRWFWAEDEGGYFN-TASDHSLD 552

Query: 676 LRVKED--HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA 733
           L V+E    D A PS N +++ NL+RL+ +    +   Y   AE +L  F T L+    A
Sbjct: 553 LIVRERGYTDNATPSANGIAIANLLRLSRLTENLE---YLDRAEKALQSFTTILEQSPTA 609

Query: 734 VPLMCCAAD 742
            P +  A D
Sbjct: 610 CPSLFVALD 618


>gi|422304439|ref|ZP_16391784.1| Six-hairpin glycosidase-like [Microcystis aeruginosa PCC 9806]
 gi|389790409|emb|CCI13705.1| Six-hairpin glycosidase-like [Microcystis aeruginosa PCC 9806]
          Length = 692

 Score =  362 bits (930), Expect = 4e-97,   Method: Compositional matrix adjust.
 Identities = 241/668 (36%), Positives = 339/668 (50%), Gaps = 80/668 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N LA   S YL +HA NP+DW+ W + A   AR+ D PIFLSIGYS+CHWC VME E+F 
Sbjct: 3   NHLAESESLYLRKHAENPIDWWYWCDSALEIARREDKPIFLSIGYSSCHWCTVMEGEAFS 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPD-LKPLMGGT 221
           D  +A  LN +F+ IKVDREERPD+D +YM  +Q + G GGWPL+VFL+PD L P  GGT
Sbjct: 63  DRAIADYLNQYFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLNVFLTPDSLIPFYGGT 122

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP + ++ RPGF  +L+ V+  +D++++ L++   F  E L  AL  SA   +    L 
Sbjct: 123 YFPVQPRFNRPGFLQVLQSVRRYYDEEKEKLSK---FTAEMLG-ALRQSAILPRAETNLA 178

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK---KLEDTGKSGEASEGQ 338
             +L     + + +           P FP      + L  S+     ED+ +      G+
Sbjct: 179 APSLLATGIETNTAVIRVNPNNYGRPSFPMIPYANLALQGSRFGDDFEDSLRQAAYQRGE 238

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL-TK 397
            + L        GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ+     + +S   +
Sbjct: 239 DLAL--------GGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNGQIVEYLANLWSAGNR 290

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           +  +    +  +++L+R+M  P G  ++A+DADS E       +EGAFYVW+  E+ D L
Sbjct: 291 EAAFERGIKGTVNWLKREMTAPEGYFYAAQDADSFEKATDREPEEGAFYVWSHLELRDYL 350

Query: 458 G-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
             E   L + ++ +   GN            F+G+NVL           KLG  +E  L+
Sbjct: 351 STEELGLLQANFTVTAEGN------------FEGRNVL-----QRRQGGKLGKDIENMLD 393

Query: 517 IL-----GECRRKLFDVRSKRPRPH-------------LDDKVIVSWNGLVISSFARASK 558
            L     G  + +L      R                  D K+IV+WN L+IS  ARA  
Sbjct: 394 KLFIRRYGSSQSQLALFPPARDNQEAKTVSWPGRIPAVTDTKMIVAWNSLMISGLARA-- 451

Query: 559 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSFRNGPSKAPG 617
                   A+F  P+       Y ++A  AA FI +H + D +  RL +    G +    
Sbjct: 452 -------FAVFGEPL-------YWQMATVAAEFILKHQWLDGRFQRLNY---QGQASVLA 494

Query: 618 FLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 676
             +D+A+ I  LLDL       T WL  AIELQ   D  F   + GGYFN T  D S+ L
Sbjct: 495 QSEDFAYFIKALLDLQTANPQETGWLEAAIELQGEFDRWFWAEDEGGYFN-TASDHSLDL 553

Query: 677 RVKED--HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV 734
            V+E    D A PS N +++ NL+RL+ +    +   Y   AE +L  F T L+    A 
Sbjct: 554 IVRERGYTDNATPSANGIAIANLLRLSRLTENLE---YLDRAEKALQSFSTILEQSPTAC 610

Query: 735 PLMCCAAD 742
           P +  A D
Sbjct: 611 PSLFVALD 618


>gi|209523771|ref|ZP_03272324.1| protein of unknown function DUF255 [Arthrospira maxima CS-328]
 gi|209495803|gb|EDZ96105.1| protein of unknown function DUF255 [Arthrospira maxima CS-328]
          Length = 686

 Score =  362 bits (929), Expect = 4e-97,   Method: Compositional matrix adjust.
 Identities = 233/677 (34%), Positives = 344/677 (50%), Gaps = 97/677 (14%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           +NRLA   S YL +HA NP+DW+ W +EA  ++R  D PIFLSIGYS+CHWC VME E+F
Sbjct: 2   SNRLAQSSSLYLRKHADNPIDWWPWCDEALEKSRTEDKPIFLSIGYSSCHWCTVMEGEAF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGG 220
            D  +A+ +N  F+ IKVDREERP++D +YM  +Q + G GGWPL+VFL+P D  P  GG
Sbjct: 62  SDAAIAEYMNANFIPIKVDREERPEIDSIYMQALQMMTGQGGWPLNVFLTPGDRIPFYGG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFP E +YGRPGF  +L+ + + +   ++ L       + QL +++         P EL
Sbjct: 122 TYFPIEPRYGRPGFLDLLKAIHNFYQTDKNKLETVTEEILTQLRQSMILP------PSEL 175

Query: 281 PQNALRLCAEQLSKSYD-SRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
            ++ L+   E  +     + +GG    P+FP  +    M +   +L  + K     +G+ 
Sbjct: 176 TEDLLKQGLETNTGVVGRNNYGG----PRFPM-IPYADMAWRGTRLISSPK----VDGKA 226

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS-LTKD 398
             L   + +  GGI+DHV GGFHRY+VD  W VPHFEKMLYD GQ+     D +S   K 
Sbjct: 227 ACLQRGKDLVTGGIYDHVAGGFHRYTVDPTWTVPHFEKMLYDNGQILEFLADLWSDGEKQ 286

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
             Y       +++L+R+M  P G  ++A+DADS  T      +EGAFYVWT++E+E  L 
Sbjct: 287 PAYQRAINGTVEWLKREMTAPEGYFYAAQDADSFVTSQDKEPEEGAFYVWTNQELETFLS 346

Query: 459 EHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
                  +  + +  +GN            F+GK VL   N       +L   +E  L  
Sbjct: 347 PAEFGELQAQFTVTKSGN------------FEGKTVLQRWN-----CDELEPLIETAL-- 387

Query: 518 LGECRRKLFDVRSKRPRPHL-------------------------DDKVIVSWNGLVISS 552
                 KLF VR   P   +                         D K+IV+WN L+IS 
Sbjct: 388 -----AKLFAVRYGAPPAEVTTFPVAENNQAAKERDWPGRIPAVTDTKMIVAWNALMISG 442

Query: 553 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSFRNG 611
            A+A+++L                D  EY+E+A  AA F+  H + D++ HR+ +   +G
Sbjct: 443 LAKAARVL----------------DNSEYLELATKAAKFVLEHQWVDDRFHRVNY---DG 483

Query: 612 PSKAPGFLDDYAFLISGLLDLYEFG-----SGTKWLVWAIELQNTQDELFLDREGGGYFN 666
                   +DYA  I  L+DL++           WL  A+++QN  D+     E GGYFN
Sbjct: 484 KVAVLSQSEDYALFIKALIDLHQASLQHPELADFWLTNAVKVQNEFDQYLWSVELGGYFN 543

Query: 667 TTGEDP-SVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 725
           T  +D  ++L+R +   D A P+ N V++ NLVRL  +   ++   Y   A  +L  F +
Sbjct: 544 TALDDAETLLIRERSYMDNATPAANGVAIANLVRLFLL---TEDLNYLDRALQALEAFAS 600

Query: 726 RLKDMAMAVPLMCCAAD 742
            ++    A P +  A D
Sbjct: 601 VMRQSPQACPSLFVAFD 617


>gi|86606925|ref|YP_475688.1| hypothetical protein CYA_2291 [Synechococcus sp. JA-3-3Ab]
 gi|86555467|gb|ABD00425.1| conserved hypothetical protein [Synechococcus sp. JA-3-3Ab]
          Length = 701

 Score =  362 bits (929), Expect = 4e-97,   Method: Compositional matrix adjust.
 Identities = 233/663 (35%), Positives = 332/663 (50%), Gaps = 60/663 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   S YL +HA NPVDW+ W  EA  +AR  D PIFLSIGYS+CHWC VME E+F 
Sbjct: 3   NRLATCSSLYLRKHAENPVDWWPWIPEALEKARAEDKPIFLSIGYSSCHWCTVMEGEAFS 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGGT 221
           D  +A  LN  F+ IKVDREERPD+D +YM  +Q + G GGWPL+VFL+P DL P   GT
Sbjct: 63  DPEIAAFLNAHFLPIKVDREERPDLDSIYMQALQLMSGQGGWPLNVFLTPDDLVPFYAGT 122

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP E ++GRPGF T+L+++   + +++D +       +  L+  LS     + +P +L 
Sbjct: 123 YFPVEPRFGRPGFLTVLQRILQFYRQEKDKIEDMKGQILAALT-TLSDLVPEDHIPPDLL 181

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           ++ +      L+ +        G+  +FP     Q++L  ++     G  G  +  ++  
Sbjct: 182 RSGIPKIQPLLANA--------GAVQQFPMMPYAQLVLRSARFDPPEGIPGSPTALERAK 233

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL-TKDVF 400
              +  +  GGI DHV GGFHRY+VD  W VPHFEKMLYD GQ+     + ++   +D  
Sbjct: 234 ERGM-ALVLGGIFDHVAGGFHRYTVDPTWTVPHFEKMLYDNGQILEFLSELWAHGIQDAA 292

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
                R  ++++ R+M  P G  ++A+DADS         +EG FYVW  +E++D+L E 
Sbjct: 293 IERAVRLTVEWVAREMTAPAGYFYAAQDADSFARREDAEPEEGEFYVWRWQELQDLLDEE 352

Query: 461 AI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
                ++ ++L P GN        P      +    EL     +A    +   +Y    G
Sbjct: 353 TFRALQQAFFLLPGGNFP----DRPGCIVLQRRQGGELPPEVETALTTHLFRARY----G 404

Query: 520 ECRRKL-----FDVRSKRPR-------PHLDDKVIVSWNGLVISSFARASKILKSEAESA 567
              R+       D +S R +       P  D K+IVSWNGL+IS  ARA ++   E    
Sbjct: 405 STERRTPFPLAVDAQSARRQSWPGRIPPVTDTKMIVSWNGLMISGLARAYQVFGEE---- 460

Query: 568 MFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 627
                       +Y+ +A  AA FI       QT  L     +G ++ P   +DYA LI 
Sbjct: 461 ------------DYLRLALRAAQFILSQQRHPQTGSLLRLNYDGTAQVPAQSEDYALLIK 508

Query: 628 GLLDLYEF-------GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGED-PSVLLRVK 679
            LLDL++         S   WL  AI LQ   D    D   GGYF +  +  P +L+R K
Sbjct: 509 ALLDLHQACLPRTGDPSSQYWLEAAIRLQQEMDTRLWDEARGGYFVSDAQSTPELLVREK 568

Query: 680 EDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCC 739
           E  D A P+ N V+V NLVRLA+I        Y + AE +L  F   +       P +  
Sbjct: 569 EFQDNATPAANGVAVANLVRLAAITGDLD---YLERAEQALKTFAHIMSTQPRVCPSLFV 625

Query: 740 AAD 742
             D
Sbjct: 626 GLD 628


>gi|434393621|ref|YP_007128568.1| hypothetical protein Glo7428_2913 [Gloeocapsa sp. PCC 7428]
 gi|428265462|gb|AFZ31408.1| hypothetical protein Glo7428_2913 [Gloeocapsa sp. PCC 7428]
          Length = 687

 Score =  362 bits (929), Expect = 4e-97,   Method: Compositional matrix adjust.
 Identities = 249/711 (35%), Positives = 354/711 (49%), Gaps = 119/711 (16%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRLA   S YL +HA NP+DW+ W +EA A A+ ++ PIFLSIGYS+CHWC VME E+F
Sbjct: 2   TNRLAQAQSLYLRKHAENPIDWWTWCDEALATAKAQNKPIFLSIGYSSCHWCTVMEGEAF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGG 220
            D  +A  +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL++F++P DL P  GG
Sbjct: 62  SDLAIADYMNAHFLPIKVDREERPDLDSIYMQALQMMVGQGGWPLNIFIAPDDLVPFYGG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWD-KKRDMLAQSGAF--AIEQLSEALSASASSNKLP 277
           TYFP E +YGRPGF  +L+ ++  +D +K+D+LA+  A   AI+Q     SA     +  
Sbjct: 122 TYFPVEPRYGRPGFLQVLQAIRRYYDTEKQDLLARKAAILEAIQQ-----SAVLPKTQQS 176

Query: 278 DELPQNALRLCAEQLSKSYDSRFG-----GFGSAPKFPRPVEIQMMLYHSKKLEDTGKSG 332
           DE          + L K  ++  G      +G+  +FP     ++ L  ++      +  
Sbjct: 177 DE----------DLLKKGIETNTGVITPHDYGT--QFPMIPYAELALRGTRFNYSAWRYD 224

Query: 333 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 392
                Q+  L     +A GGI+DHV GGFHRY+VD  W VPHFEKMLYD GQ+     + 
Sbjct: 225 IPQVCQQRGL----DLALGGIYDHVAGGFHRYTVDPTWTVPHFEKMLYDNGQIVEYLANL 280

Query: 393 FSLTKDVFYSYICRDI---LDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 449
           +S    V    I R I   + +L+R+M  P G  ++A+DADS  +      +EGAFYVW+
Sbjct: 281 WS--NGVQEPAIERAIALTVQWLKREMTAPEGYFYAAQDADSFTSPYEAEPEEGAFYVWS 338

Query: 450 SKEVEDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 508
             E++ IL  E     ++ + +   GN            F+G+ VL   +  S S     
Sbjct: 339 YSELQQILSSEELSALEQQFTITSQGN------------FEGQIVLQRRHPGSLS----- 381

Query: 509 MPLEKYLNILGECRRKLFDVR-------------------------SKRPRPHLDDKVIV 543
                  +I  +   KLF VR                         S R     D K+IV
Sbjct: 382 -------DITEQALSKLFTVRYGATPESLDVFPPARNNQEAKTQNWSGRIPAVTDTKMIV 434

Query: 544 SWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRH-LYDEQTH 602
           +WN L+IS  ARA  + K                + EY+E+A S+A FI  H   D + H
Sbjct: 435 AWNSLMISGLARAYAVFK----------------KSEYLEIALSSARFILNHQQVDGRFH 478

Query: 603 RLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF----GSGTKWLVWAIELQNTQDELFLD 658
           RL +    G +      +DYA  I  LLDLY+      +   WL  AI LQ   DE    
Sbjct: 479 RLNY---EGQTSVIAQSEDYALFIKALLDLYQVTLKDANSQHWLEQAIALQAEFDEYLWS 535

Query: 659 REGGGYFNTTGE-DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAE 717
            E GGY+NT  +    +++R +   D A P+ N V++ NLVRLA +   ++   Y   AE
Sbjct: 536 IELGGYYNTASDASRDLIVRERSYADNATPAANGVAIANLVRLALL---TEKLSYLDRAE 592

Query: 718 HSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLA 768
            +L  F + +     A P +  A D       ++  LV   +S   E +LA
Sbjct: 593 QALQAFTSVMDSAPQACPSLFTALDWY-----RNCTLV-RTTSTTLETVLA 637


>gi|254409993|ref|ZP_05023773.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196183029|gb|EDX78013.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 695

 Score =  362 bits (929), Expect = 5e-97,   Method: Compositional matrix adjust.
 Identities = 234/674 (34%), Positives = 347/674 (51%), Gaps = 83/674 (12%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRLA   S YL +HA NP+DW+ W +EA   A+  + PIFLSIGYS+CHWC VME E+F
Sbjct: 2   TNRLAQCQSLYLRKHAENPIDWWPWSDEALFTAKAENKPIFLSIGYSSCHWCTVMEGEAF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGG 220
            D  +A+ +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL++FL+P D  P  GG
Sbjct: 62  SDPAIAQYMNANFLPIKVDREERPDIDSIYMQALQMMTGQGGWPLNIFLTPEDRVPFYGG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFP E +YGRPGF  +L+ ++  +D ++  L       +  L +++   AS      +L
Sbjct: 122 TYFPVEPRYGRPGFLQVLQAIRRFYDVEKTKLQNFKDEILGHLQQSVLLPASG-----QL 176

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSG-EASEGQK 339
               LR   ++  +  DS  G +G  P FP      + L   +  E T     +AS  + 
Sbjct: 177 TAELLRQGMDKTIRIVDS--GSYG--PSFPMIPYADLALRGIRFQEMTEVDAYQASRSRG 232

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL-TKD 398
           + L      AKGGI+DHV GGFHRY+VD  W VPHFEKMLYD GQ+     + +S+  K+
Sbjct: 233 LDL------AKGGIYDHVAGGFHRYTVDATWTVPHFEKMLYDNGQIVEYLANLWSVGIKE 286

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL- 457
             +       + +L R+M    G  ++A+DADS     A   +EGAFYVW+  E++ +L 
Sbjct: 287 AAFERAISGTVQWLTREMTASSGYFYAAQDADSFTEPSAAEPEEGAFYVWSYAELQQLLT 346

Query: 458 GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI-----ELNDSSASA------SK 506
            E     +E + + P GN            F+G+NVL      +L+D+  +A      ++
Sbjct: 347 AEELAELQEQFTVTPEGN------------FEGQNVLQRRYSDQLSDTLETALAKLFTAR 394

Query: 507 LGMP---LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 563
            G P   LE +         K  +   + P    D K+IV+WN L+IS  ARA  + +  
Sbjct: 395 YGSPPDSLETFPPAQNNQEAKTKNWSGRIP-AVTDTKMIVAWNSLMISGLARAYGVFR-- 451

Query: 564 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSFRNGPSKAPGFLDDY 622
                         + EY+E+A +AA FI  + + D++ HRL +    G +      +DY
Sbjct: 452 --------------KPEYLELATTAAKFILENQWVDQRFHRLNY---EGEASILAQSEDY 494

Query: 623 AFLISGLLDLYEFGSGT-------------KWLVWAIELQNTQDELFLDREGGGYFNTTG 669
           A  I  LLDL++   G               WL  AI++Q+  DE     E  GY+N   
Sbjct: 495 ALFIKALLDLHQASLGLATAQESSQSPIPDSWLEEAIKVQDEFDEYLWSVELAGYYNAAN 554

Query: 670 EDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLK 728
           +    +L+R +   D A P+ N V++ NLVRL  +   +++  Y   AE +L  F + + 
Sbjct: 555 DSSGDLLIRERSYTDNATPAANGVAIANLVRLTLL---TENLAYLDRAEVALNAFSSVMN 611

Query: 729 DMAMAVPLMCCAAD 742
             + + P +  A D
Sbjct: 612 QSSQSCPSLFTALD 625


>gi|440682478|ref|YP_007157273.1| hypothetical protein Anacy_2941 [Anabaena cylindrica PCC 7122]
 gi|428679597|gb|AFZ58363.1| hypothetical protein Anacy_2941 [Anabaena cylindrica PCC 7122]
          Length = 693

 Score =  362 bits (929), Expect = 5e-97,   Method: Compositional matrix adjust.
 Identities = 241/675 (35%), Positives = 354/675 (52%), Gaps = 86/675 (12%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRLA E S YL +HA NP+DW+ W +EA   AR ++ PIFLSIGYS+CHWC VME E+F
Sbjct: 2   TNRLAEEKSLYLRKHAENPIDWWPWCDEALETARVQNKPIFLSIGYSSCHWCTVMEGEAF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGG 220
            D  +A+ +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL+VFL+  DL P   G
Sbjct: 62  SDLEIAQYMNTNFLPIKVDREERPDLDSIYMQTLQFMSGQGGWPLNVFLAADDLVPFYAG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD-E 279
           TYFP + +YGRPGF  +L  ++  +D +++ L Q  A  +    EAL  SA   K+ + E
Sbjct: 122 TYFPVDPRYGRPGFLQVLEALRRYYDTEKEELRQRKALIV----EALLTSAVMQKVTNQE 177

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGS---APKFPRPVEIQMMLYHSKKLEDTGKSGEAS- 335
           +  N L      L K +++  G   S      FP      M+ Y    L  T  + +   
Sbjct: 178 VADNQL------LQKGWETCTGIITSKQVGNSFP------MIPYAEFALRGTRFNYQFQY 225

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS- 394
           +GQ++       +A GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ+     + +S 
Sbjct: 226 DGQQVCTQRGLDLALGGIYDHVGGGFHRYTVDPTWTVPHFEKMLYDNGQIIEYLANLWSG 285

Query: 395 -LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
            + +  F   +   +  +L+R+M   GG  ++A+DADS     A   +EGAFYVW+ +E+
Sbjct: 286 GIQEPAFERAVAGTV-KWLQREMTAQGGYFYAAQDADSFINSTAIEPEEGAFYVWSYREL 344

Query: 454 EDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-----------IELNDSS 501
           + +L  E     ++ + +   GN            F+G+ VL           +E+  S 
Sbjct: 345 QQLLTTEELNELQQQFAVTANGN------------FEGQIVLQRSHPGELSQTLEIALSK 392

Query: 502 ASASKLGMPLEKYLNILGECRRKLFDVRSKRPR--PHL-DDKVIVSWNGLVISSFARASK 558
              ++ G   E   N     R      ++  P   P + D K+IV+WN L+IS  ARA++
Sbjct: 393 LFTARYGATPESLSN-FPPARDNQEAKKTNWPGRIPAVTDTKMIVAWNSLMISGLARAAE 451

Query: 559 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSFRNGPSKAPG 617
           + +                +  Y+E+A  AA FI  H + D + HRL +    G +    
Sbjct: 452 VFQ----------------QPNYLELAAQAARFILDHQFVDGRFHRLNYE---GEATVLA 492

Query: 618 FLDDYAFLISGLLDLYEFGSG---------TKWLVWAIELQNTQDELFLDREGGGYFNTT 668
             +DYAF I  LLDL++   G         + WL  A+ LQ+  DE     E GGYFNT+
Sbjct: 493 QSEDYAFFIKALLDLHQATLGQLDHVSSQNSDWLEKAVSLQDEFDEFLWSIELGGYFNTS 552

Query: 669 GEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 727
            ++   +++R +   D A PS N +++ NLVRLA +   + + +Y   AE  L  F+  +
Sbjct: 553 SDNSQDLIVRERSYIDNATPSANGIAIANLVRLALL---TDNLHYLDLAEQGLTAFKGVM 609

Query: 728 KDMAMAVPLMCCAAD 742
            +   A P +  A D
Sbjct: 610 SNSPQACPSLFTALD 624


>gi|254381981|ref|ZP_04997344.1| conserved hypothetical protein [Streptomyces sp. Mg1]
 gi|194340889|gb|EDX21855.1| conserved hypothetical protein [Streptomyces sp. Mg1]
          Length = 686

 Score =  362 bits (929), Expect = 5e-97,   Method: Compositional matrix adjust.
 Identities = 229/625 (36%), Positives = 320/625 (51%), Gaps = 59/625 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   SPYLLQHA NPVDW+ W   AF EAR+R+VP+ LS+GYS CHWCHVM  ESFE
Sbjct: 2   NRLAGVTSPYLLQHADNPVDWWPWEPAAFEEARRRNVPVLLSVGYSACHWCHVMAHESFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D   A  +N+ FV++KVDREERPDVD VYM  VQA  G GGWP++VFL+ D +P   GTY
Sbjct: 62  DGATAAYMNEHFVNVKVDREERPDVDAVYMEAVQAATGQGGWPMTVFLTADAEPFYFGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS-EALSASASSNKLPDELP 281
           FPPE ++G P F  +L  V  AW  + + + +     +  L+        ++   P+EL 
Sbjct: 122 FPPEPRHGMPSFPQVLEGVHTAWTGRPEEVTEVARRIVGDLAGRRPDYGKAAVPGPEELA 181

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
              L      L++ YD+  GGFG APKFP  + ++ +L H  +   TG  G      +M 
Sbjct: 182 GALL-----GLTREYDAAHGGFGGAPKFPPSMVLEFLLRHHAR---TGSEG----ALQMA 229

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             T + MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  VY   +  T     
Sbjct: 230 ADTCEAMARGGIYDQLGGGFARYSVDREWVVPHFEKMLYDNALLCRVYAHLWRATGSELA 289

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE-H 460
             +  +  D++ R++    G   SA DADS E E   +  EGA+Y WT  ++ ++LGE  
Sbjct: 290 RRVALETADFMVRELRTREGGFASALDADSEEPE-TGKHVEGAYYAWTPDQLREVLGEAD 348

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
             L    + +   G  +            G +VL    D  A      +  E++ +I   
Sbjct: 349 GELAAGCFGVTEEGTFE-----------HGTSVLRLPQDGPA------VDAERFASI--- 388

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R +L   R  RP P  DDKV+ +WNGL I++ A                      +R +
Sbjct: 389 -RARLLAARGGRPAPGRDDKVVAAWNGLAIAALAECGAYF----------------ERPD 431

Query: 581 YMEVAESAASFIRRHLYDEQTH--RLQHSFRNGPSKA-PGFLDDYAFLISGLLDLYEFGS 637
            +E A  AA  + R  +D      RL  + ++G + A  G L+DY  +  G L L     
Sbjct: 432 LIERATEAADLLVRVHFDAAAGGPRLARTSKDGRAGANAGVLEDYGDVAEGFLALAAVTG 491

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
              WL +A  L +   +LF   E G  ++T  +   ++ R ++  D A PSG + +   L
Sbjct: 492 EGVWLEFAGFLVDLVLDLFT-AEDGSLYDTAHDAERLIRRPQDPTDSAAPSGWTAAAGAL 550

Query: 698 VRLASIVAGSKSDYYRQNAEHSLAV 722
           +   S  A + S  +R  AE +L V
Sbjct: 551 L---SYAAHTGSQAHRTAAERALGV 572


>gi|443327996|ref|ZP_21056601.1| thioredoxin domain containing protein [Xenococcus sp. PCC 7305]
 gi|442792405|gb|ELS01887.1| thioredoxin domain containing protein [Xenococcus sp. PCC 7305]
          Length = 682

 Score =  362 bits (928), Expect = 5e-97,   Method: Compositional matrix adjust.
 Identities = 239/669 (35%), Positives = 335/669 (50%), Gaps = 79/669 (11%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TN LA   S YL +HA NP+DW+ W +EA + A   + PIFLSIGYS+CHWC VME E+F
Sbjct: 2   TNHLAESRSLYLQKHAENPIDWWYWCDEALSIAAAENKPIFLSIGYSSCHWCTVMEGEAF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGG 220
            D  +A  LN+ FV IKVDREERPD+D +YM  +Q + G GGWPL++FL+P DL P  GG
Sbjct: 62  SDNAIADYLNNNFVPIKVDREERPDIDSIYMQALQMMTGQGGWPLNIFLTPGDLVPFYGG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFP   +Y RP F  IL+ V+  +D + + L       +  L  + S   + + L  EL
Sbjct: 122 TYFPVTPRYNRPSFIDILKSVRRFYDVETEKLEGFKTEILFNLQRSTSLETTEDALTSEL 181

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKS-GEASEGQK 339
               L      LS     R       P FP      M+ Y +  L+ +  +     +  K
Sbjct: 182 LDQGLETNTAVLSSGDPGR-------PNFP------MIPYATAALQGSRLNFNNRYDADK 228

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           + L   Q +  GGI DHV GGFHRY+VD  W VPHFEKMLYD GQ+     + +S  +  
Sbjct: 229 LCLQRGQDLVLGGICDHVAGGFHRYTVDHTWTVPHFEKMLYDNGQILEYLANLWSCQRHF 288

Query: 400 F-YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL- 457
                    I+++L+R+M+ P G  ++++DAD+  T  A   +EG FYVW+  E+E++L 
Sbjct: 289 LTIEDAIAGIVNWLKREMLAPQGYFYASQDADNFATAEAAEPEEGLFYVWSYNELENLLS 348

Query: 458 GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
            E     +  + + P GN            F+G NVL   N    S S     LE+ L  
Sbjct: 349 AEELAELQAEFSITPQGN------------FEGSNVLQRFNHEELSPS-----LEQTLQK 391

Query: 518 LGECR--------------RKLFDVRSK----RPRPHLDDKVIVSWNGLVISSFARASKI 559
           L   R              +   + ++K    R  P  D K+I +WN L+IS  ARA+ +
Sbjct: 392 LFAARYGEKQTGIDTFPVAKNNREAKTKPWPGRIPPVTDTKMITAWNSLIISGLARAASV 451

Query: 560 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE-QTHRLQHSFRNGPSKAPGF 618
           L                    Y ++AE+ A+FI +  + E + HRL +   +G +     
Sbjct: 452 LGI----------------TNYQQLAENTANFILQQQWLEGRLHRLNY---DGQATVLAQ 492

Query: 619 LDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGED--PSVL 675
            +DYA  I  LLDL++      +WL  AI LQ   D LF    GGGY+N  G D   ++L
Sbjct: 493 SEDYALFIKALLDLHQSSPQNPQWLDSAIALQAEFDRLFWSEMGGGYYN-NGSDVGDNLL 551

Query: 676 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 735
           +R +   D A P+ N V++ NLVRL  +    +   YR  AE  L  F   +K    A P
Sbjct: 552 IRERSYMDNATPAANGVAMANLVRLFLLTDNLE---YRDRAEQGLQAFAGIMKSSPQACP 608

Query: 736 LMCCAADML 744
            +  A D L
Sbjct: 609 SLFVALDWL 617


>gi|239627004|ref|ZP_04670035.1| conserved hypothetical protein [Clostridiales bacterium 1_7_47_FAA]
 gi|239517150|gb|EEQ57016.1| conserved hypothetical protein [Clostridiales bacterium 1_7_47FAA]
          Length = 638

 Score =  362 bits (928), Expect = 5e-97,   Method: Compositional matrix adjust.
 Identities = 214/560 (38%), Positives = 298/560 (53%), Gaps = 63/560 (11%)

Query: 148 STCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLS 207
           STCHWCHVME ESFE+EG+A +LN  ++ IKVDREERPDVD VYM+  QA+ G GGWPL+
Sbjct: 7   STCHWCHVMERESFENEGIAGILNRDYICIKVDREERPDVDSVYMSVCQAMNGQGGWPLT 66

Query: 208 VFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 267
           + ++PD +P   GTYFPP+ +YGR G + +L  V   W   R+ L + GA  IE   +  
Sbjct: 67  IIMTPDCRPFFSGTYFPPKARYGRVGLEELLAAVSAQWKGGRERLLE-GAGRIEAFLKEQ 125

Query: 268 SASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLED 327
             +  S +   E+   A RL        +D + GGFG APKFP P  I  ++ +  +   
Sbjct: 126 EQADVSAEPGLEVVHRAFRL----FGDGFDKKNGGFGQAPKFPTPHNIMFLMEYGVRENK 181

Query: 328 TGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLAN 387
            G          M + TL  M +GGI DH+GGGF RYS DE+W VPHFEKMLYD   LA 
Sbjct: 182 PGAV-------DMAMDTLVQMYRGGIFDHIGGGFSRYSTDEQWLVPHFEKMLYDNALLAM 234

Query: 388 VYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYV 447
            Y  A+ LT    Y+ + + IL Y+  ++    G  +  +DADS          EG +YV
Sbjct: 235 AYAKAYGLTGRGLYARVVQRILGYVEAELTHASGGFYCGQDADSDGV-------EGRYYV 287

Query: 448 WTSKEVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL-NDSSASAS 505
           +T +E++ +LG E    F   + +   GN            F+GKN+   L N+   +A 
Sbjct: 288 FTPEEIKQVLGPEDGADFCSQFGITGIGN------------FEGKNIPNLLGNEDYETAG 335

Query: 506 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 565
           K               RRKL++ R +R   H DDK++VSWNG +I + A A  +L +   
Sbjct: 336 KEA------------SRRKLYEYRIRRAHLHKDDKILVSWNGWMICACAMAGAVLGA--- 380

Query: 566 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFL 625
                         +Y+++A  A +FIR HL  +   RL   +R+G +   G LDDYA  
Sbjct: 381 -------------GQYVDMAVRAEAFIRTHLVKD--GRLLVRYRDGDAAGQGKLDDYACY 425

Query: 626 ISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 685
           +  LL+LYE   GT +L  A+    T    F DRE GG++    +   +++R KE +DGA
Sbjct: 426 VLALLELYEVTFGTGYLEQAVYWAKTMVLQFFDRERGGFYLYAEDGEQLIVRTKEAYDGA 485

Query: 686 EPSGNSVSVINLVRLASIVA 705
            PSGNS +   L +LA I  
Sbjct: 486 VPSGNSAAARVLQQLAQITG 505


>gi|428211294|ref|YP_007084438.1| thioredoxin domain-containing protein [Oscillatoria acuminata PCC
           6304]
 gi|427999675|gb|AFY80518.1| thioredoxin domain protein [Oscillatoria acuminata PCC 6304]
          Length = 691

 Score =  361 bits (927), Expect = 7e-97,   Method: Compositional matrix adjust.
 Identities = 236/668 (35%), Positives = 343/668 (51%), Gaps = 80/668 (11%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TN LA   S YL +HA NP+DW+ W +EA A A+ ++ PIFLSIGYS+CHWC VME E+F
Sbjct: 2   TNHLAQTQSLYLRKHAENPIDWWPWCDEALATAKAQNKPIFLSIGYSSCHWCTVMEGEAF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGG 220
             E +A  +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL++FL+P DL P  GG
Sbjct: 62  SSEAIASYMNANFLPIKVDREERPDIDSIYMQALQMMTGQGGWPLNIFLTPDDLIPFYGG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFP E +YGRPGF  +L+ ++  +D ++  LA      +  L +A +   + + LP+EL
Sbjct: 122 TYFPVEPRYGRPGFLELLQAIRRYYDLEKGKLAAFKEEIMGHLQQAATLPGTED-LPEEL 180

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
               L      ++      +G     P FP      MM Y    L+ T    E+   ++ 
Sbjct: 181 LWKGLETSVTVIAH---REYG-----PSFP------MMPYAQVVLQSTRFDRESEYDERS 226

Query: 341 VLFTLQC-MAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ----LANVYLDAFSL 395
            +      +A GGI+D V GGFHRY+VD  W VPHFEKMLYD GQ    LAN++ +    
Sbjct: 227 AIAQRGIDLASGGIYDAVAGGFHRYTVDPTWTVPHFEKMLYDNGQIVEFLANLWSEGI-- 284

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
            ++  + +     + +L+R+M  P G  ++A+DADS  T      +EGAFYVWT +E+E 
Sbjct: 285 -QEPGFEWAVAGTIQWLKREMTAPEGYFYAAQDADSFITPEDKEPEEGAFYVWTYQELER 343

Query: 456 ILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS--------- 505
           +L  E      + ++L P GN            F+GK VL   N  + S +         
Sbjct: 344 LLTVEEFTALNQEFFLSPEGN------------FEGKIVLKRTNLQALSPTVETALAKLF 391

Query: 506 --KLGMPLEKYLNILGECRR---KLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKIL 560
             + G   E        C     K  +   + P P  D K+IV+WN L+IS  ARA+ + 
Sbjct: 392 KVRYGALPEAVKTFPPACNNHEAKTHNWPGRIP-PVTDPKMIVAWNSLMISGLARAAVVF 450

Query: 561 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE-QTHRLQHSFRNGPSKAPGFL 619
            +                 EY  +A +AA+FI  H + E + HRL +   +G +      
Sbjct: 451 GN----------------GEYATLATTAANFILDHQWVEGRFHRLNY---DGQAAVLAQS 491

Query: 620 DDYAFLISGLLDLYEF----GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPS-V 674
           +DYA  I  LLDL +      S + WL  AI++Q   DE     E GGYFNT  +  S +
Sbjct: 492 EDYALFIKALLDLEQMEQVHPSNSNWLEKAIQVQEEFDEFLWSVELGGYFNTAKDSSSDL 551

Query: 675 LLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV 734
           ++R +   D A P+ N V++ +L+RL+     ++   Y   A ++L  F   +     A 
Sbjct: 552 IVRERSYTDNATPAANGVAIASLIRLSMF---TEDLSYLDRAFNALKSFGAIMDRAPSAC 608

Query: 735 PLMCCAAD 742
           P +  A D
Sbjct: 609 PSLFAALD 616


>gi|402820063|ref|ZP_10869630.1| hypothetical protein IMCC14465_08640 [alpha proteobacterium
           IMCC14465]
 gi|402510806|gb|EJW21068.1| hypothetical protein IMCC14465_08640 [alpha proteobacterium
           IMCC14465]
          Length = 751

 Score =  361 bits (927), Expect = 8e-97,   Method: Compositional matrix adjust.
 Identities = 237/710 (33%), Positives = 357/710 (50%), Gaps = 76/710 (10%)

Query: 94  TSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWC 153
           T++S     NRL+ E SPYL QH  NPV W  W  +A A A++++ PI LSIGYS CHWC
Sbjct: 5   TTNSHIVLENRLSHEASPYLQQHKDNPVHWQPWDAKALASAQEQNKPILLSIGYSACHWC 64

Query: 154 HVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPD 213
           HVM  ESFE+E +A ++ND FV+IKVDREERPD+D +YM+ +  +   GGWPL++FL PD
Sbjct: 65  HVMAHESFENEDIASVMNDLFVNIKVDREERPDIDDIYMSALHMMGEQGGWPLTMFLLPD 124

Query: 214 LKPLMGGTYFPPEDKYGRPGFKTILR-----------KVKDAWDKKRDMLAQSGAFAIEQ 262
            +P  GGTYFPP  K+GRPGF  I R           KV++  DK    L      A + 
Sbjct: 125 GRPFWGGTYFPPIAKFGRPGFPDICREIARICTEETDKVQENADKLTQALQNKNNAAFKA 184

Query: 263 LSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS 322
            ++  +    S  LP  LP++     +E L++  D  +GG   APKFP+P+  +++    
Sbjct: 185 ANQKTALEQLSPNLPLGLPEDLASEASENLARQIDLTYGGMQGAPKFPQPLIYELL---- 240

Query: 323 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 382
              +D  ++G     ++ VL TL  +  GGI DH+ GGF RYSVDE W VPHFEKM+YD 
Sbjct: 241 --WQDWLRNGR-DVSREAVLITLSGLCHGGIFDHIRGGFSRYSVDEEWLVPHFEKMIYDN 297

Query: 383 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMI-------GPGGEIFSAED------A 429
           G + ++  + +  T+D   +      +D+L  DM+         G    S +D      A
Sbjct: 298 GLILDLMGNVWKSTRDPMLTDRISKTVDWLLDDMLTNATNNSTDGAAALSKDDTPKPPAA 357

Query: 430 DSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFK 489
            +A  +  +  +EG +YVWT  E+  +LGE+   F   Y +   GN        P     
Sbjct: 358 FAASLDADSEGEEGKYYVWTVAELTSLLGENFPDFARTYRVTDAGNF-------PEGGGA 410

Query: 490 GKNVLIELNDSSASASKLGMPLE----KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSW 545
           G NV I LN    S    G   E    + LNIL +        ++ R RP  DDK++  W
Sbjct: 411 GDNVNI-LNRLPPSLHNEGFDEEARHAQSLNILAQA-------QALRTRPERDDKILADW 462

Query: 546 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQ--THR 603
           NGLVI++ AR S + ++                K+++E AE A   + + +  E+    +
Sbjct: 463 NGLVIAALARLSPVFQN----------------KKWLETAERAYRDVMQTMSYEEGGCLK 506

Query: 604 LQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGG 663
           L H+ R          +DY+ +    L L+       +L  A  L  T ++ + D + GG
Sbjct: 507 LAHAARGESKLNISMAEDYSNMADAALALFSATGTASYLASAEALTKTLEQFYTD-DVGG 565

Query: 664 YFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 723
           ++ T+ +  +++ R    +DGA P+ N  ++I + R  ++  G +   YR + E   A+ 
Sbjct: 566 FYMTSSQAETLITRPHTSYDGATPNANG-TMIGVYRRLAVFTGKQD--YRDSLE---ALI 619

Query: 724 ETRLKDMAMAVPLMC-CAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHA 772
           +T         P M     +  +   +   V+VG  S  DF+ +L  AHA
Sbjct: 620 KTHAIAAIKHYPQMPRYLTETENTRHQASCVIVGDPSDNDFKLLLETAHA 669


>gi|408826725|ref|ZP_11211615.1| hypothetical protein SsomD4_06008 [Streptomyces somaliensis DSM
           40738]
          Length = 651

 Score =  361 bits (927), Expect = 8e-97,   Method: Compositional matrix adjust.
 Identities = 223/599 (37%), Positives = 312/599 (52%), Gaps = 62/599 (10%)

Query: 129 EAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVD 188
           EAF EA++RD P+FLS+GYS CHWCHVM  ESFEDE  A  LN+ FVS+KVDREERPDVD
Sbjct: 3   EAFEEAKRRDAPVFLSVGYSACHWCHVMAHESFEDEATAAYLNEHFVSVKVDREERPDVD 62

Query: 189 KVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKK 248
            VYM  VQA  G GGWP+SVF++PD +P   GTYFPPE ++G P F+ +L  V  AW  +
Sbjct: 63  AVYMEAVQAATGQGGWPMSVFMTPDGEPFYFGTYFPPEARHGMPSFRQVLEGVHHAWTSR 122

Query: 249 RDMLAQSGAFAIEQLSEALSASASSNKLPDEL-PQNALRLCAEQLSKSYDSRFGGFGSAP 307
           RD + +     + +LS    A       P E  P  AL      L++ YD R GGFG AP
Sbjct: 123 RDEVDEVAGSIVRELSGRSLALGGDGGAPGEAEPAQALL----ALTREYDERHGGFGGAP 178

Query: 308 KFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVD 367
           KFP  + ++ +L H  +   TG  G      +M   T + MA+GGI+D +GGGF RYSVD
Sbjct: 179 KFPPSMVVEFLLRHHAR---TGSEG----ALQMAADTCEAMARGGIYDQLGGGFARYSVD 231

Query: 368 ERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAE 427
             W VPHFEKMLYD   L  VY   +  T       +  +  D++ R++  P G   SA 
Sbjct: 232 REWVVPHFEKMLYDNALLCRVYTHLWRATGSDLARRVALETADFMVRELRTPEGGFASAL 291

Query: 428 DADSAETEGATRKKEGAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNE 487
           DADS   +G  R  EGA+YVWT  ++ ++LGE    +   ++           +++    
Sbjct: 292 DADS--DDGTGRHVEGAYYVWTPAQLREVLGEEDAAYAARFH----------GVTEEGTF 339

Query: 488 FKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNG 547
            +G +VL    D+  + +      E+   I    RR+L   R +R RP  DDK++ +WNG
Sbjct: 340 EEGASVLRLPVDAGVAGA------ERLAGI----RRRLLAARDERARPGRDDKIVAAWNG 389

Query: 548 LVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHS 607
           L +++ A                      DR + +E A  AA  + R   DE   RL  +
Sbjct: 390 LAVAALAETGACF----------------DRPDLVERATEAADLLVRVHLDEGG-RLART 432

Query: 608 FRNGPSKA-PGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR---EGGG 663
            ++G + A  G L+DY  +  G L L        WL +A  L +      LDR   E G 
Sbjct: 433 SKDGRAGANAGVLEDYGDVAEGFLALAAVTGEGVWLEFAGLLLDG----VLDRFRGEDGE 488

Query: 664 YFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAV 722
            ++T  +   ++ R ++  D A PSG + +   L+   S  A + S+ +R  AE +L V
Sbjct: 489 LYDTAHDAEQLIRRPQDPTDNAAPSGWTAAAGALL---SYAAHTGSEAHRSAAERALGV 544


>gi|54302332|ref|YP_132325.1| hypothetical protein PBPRB0652 [Photobacterium profundum SS9]
 gi|46915754|emb|CAG22525.1| conserved hypothetical protein [Photobacterium profundum SS9]
          Length = 784

 Score =  361 bits (926), Expect = 9e-97,   Method: Compositional matrix adjust.
 Identities = 216/609 (35%), Positives = 325/609 (53%), Gaps = 66/609 (10%)

Query: 101 HTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVES 160
           +TNRL  E+SPYLLQHAHNPV+W+AWG+EAF  AR+ + PIFLSIGYSTCHWCHVME ES
Sbjct: 57  YTNRLILENSPYLLQHAHNPVNWYAWGKEAFDAARRENKPIFLSIGYSTCHWCHVMEAES 116

Query: 161 FEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGG 220
           F++E VA++LN +F+SIKVDR+ RPD+D  Y+       G  GWP+S FL+ D KP    
Sbjct: 117 FDNEEVARILNKYFISIKVDRDLRPDIDDFYIKAALVFSGKAGWPVSSFLTHDSKPFFVA 176

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKL-PDE 279
           +YF       RP F  +L +V+D W      L +S     +++ E    ++ ++ + P  
Sbjct: 177 SYF------SRPDFVDLLEQVQDKWTNNHQFLLKSAIEIYQEIQEQQKVASVADTISPSL 230

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
           + Q  +++ +     S D R+GG    PKFPR + + ++L   K ++D   S E     +
Sbjct: 231 IDQTIIKILS-----SEDKRWGGIDQIPKFPRELILMLLLRKLKTVDDFALSRE----WE 281

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
            +   L  + +GGI+D V GGFHRY+ D+ W +PHFEKML++Q  LA++Y +A+  + D 
Sbjct: 282 FISRELDALLQGGIYDQVAGGFHRYATDKAWRIPHFEKMLFNQALLADIYTNAWFYSGDN 341

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL-- 457
            Y  I  + L+Y+  +M       +SA DADS         +EG FY+W  +E+  +   
Sbjct: 342 EYKRIVIETLNYVLNEMRSDKACFYSATDADS-------ENEEGKFYLWHDREIASLFTP 394

Query: 458 GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
           GE   + ++ Y ++  GN            F  KN+    N   + A    +  +  L  
Sbjct: 395 GETDFV-RKLYGIRQEGN------------FNHKNIPYLPNGLESVAEANDVDYQILLTK 441

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
           +   R+KL+  R++R  P  D K +V W+ L+IS+ A +  +         FN P     
Sbjct: 442 IAGIRQKLYQKRAERIPPFKDKKQVVEWSALMISALANSGLV---------FNTP----- 487

Query: 578 RKEYMEVAESAASFIRRHLYDEQTHRLQHSFR---NGPSKAPGFLDDYAFLISGLLDLYE 634
             EY+ VA+  A  I +H  ++Q      SFR   +  + A   L DY   I  +L LY+
Sbjct: 488 --EYIRVADQCAEAIWQHAINDQG----SSFRLIDSNKASASATLGDYGHYIQAMLTLYD 541

Query: 635 FGSGTKWLVWA--IELQNTQDELFLDREGGGYFNTT-GEDPSVLLRVKEDHDGAEPSGNS 691
                 WL  +  I LQ  +  +F D++ GG+FNT   ++  + LR K   D    SGNS
Sbjct: 542 VTDKDIWLTRSHLIYLQAVR--MFQDKKSGGFFNTAFDQNEQLFLRSKNVTDNTVASGNS 599

Query: 692 VSVINLVRL 700
             ++ +V L
Sbjct: 600 AMLMAMVML 608


>gi|409198348|ref|ZP_11227011.1| thioredoxin domain-containing protein [Marinilabilia salmonicolor
           JCM 21150]
          Length = 675

 Score =  361 bits (926), Expect = 9e-97,   Method: Compositional matrix adjust.
 Identities = 226/669 (33%), Positives = 331/669 (49%), Gaps = 66/669 (9%)

Query: 97  SRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVM 156
           + N+ TN L    SPYLLQHAHNPVDW  W EE   +AR +D  + +SIGYS CHWCHVM
Sbjct: 2   TTNQDTNHLIHSTSPYLLQHAHNPVDWHPWNEETLDKARAQDKLMLVSIGYSACHWCHVM 61

Query: 157 EVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKP 216
             E FEDE  A+L+N+ F+ IKVDREERPDVD  ++T VQ +   GGWPL+V   PD +P
Sbjct: 62  AHECFEDEETARLMNEHFICIKVDREERPDVDNFFITAVQLMGAQGGWPLNVVTLPDGQP 121

Query: 217 LMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKL 276
             GGTYFP +       +K IL K+   +   R+ L          + +    S+  +++
Sbjct: 122 FWGGTYFPKDQ------WKEILIKINKLFHSDREKLTHHAHQLTTGIQQTSMISSEQSEV 175

Query: 277 PD--ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML----YHSKKLEDTGK 330
           PD  E+   AL    E+ S  +D + GG    PKFP PV ++ +L    +H +K+     
Sbjct: 176 PDLSEVINEAL----ERWSAQWDLQLGGSLGKPKFPMPVNLEFLLHLHFHHPQKM----- 226

Query: 331 SGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYL 390
                     +  TLQ MA+GGI+D  GGGF RYSVDE W VPHFEKMLYD  QL  +Y 
Sbjct: 227 ------FSDFLNTTLQQMARGGIYDQAGGGFARYSVDEFWKVPHFEKMLYDNAQLIELYS 280

Query: 391 DAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTS 450
            A++ +    Y  + ++ + ++   ++ P G  FSA DADS   EG    +EG +YVWT 
Sbjct: 281 HAYAHSGIKEYRDVVKETIAFVENKLMHPSGAFFSALDADS---EG----EEGKYYVWTE 333

Query: 451 KEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 510
           +E+ +I G    LF +++ +   G+ +            G  +L+        A K  M 
Sbjct: 334 EELLNIFGRDFPLFADYFNVNENGHWE-----------NGNYILLRTGSDEEFAHKHKMT 382

Query: 511 LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
           LE+    +   ++ L + R KR RP LDDK I SWN L+      A K +          
Sbjct: 383 LEEVEKRVSVWKKDLVNRRKKRIRPGLDDKTITSWNALMTKGLVEAHKAVSD-------- 434

Query: 571 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLL 630
                     + ++A     FI   L  +    L  ++++G +   GF++DYA +IS  +
Sbjct: 435 --------SHFRKLALKNGEFICHSLISKDG-SLFRTWKDGRASVTGFMEDYASVISAFI 485

Query: 631 DLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGN 690
            LYE     KW+  +  L +  ++ F D+  G +         +     +  D   PS N
Sbjct: 486 GLYEITGDEKWIEQSSRLADYAEKAFYDKATGQFHYMEKNQTELPANHFDTQDNVIPSAN 545

Query: 691 SVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRK 750
           S+    L +LA++       +YR+ AE  L     + K+             M+  PS +
Sbjct: 546 SMMGHALFKLAALTG---DQHYRETAEKMLNQMLLQFKNYPWGFAHWGSLMLMIHKPSFE 602

Query: 751 HVVLVGHKS 759
            VV+ G K+
Sbjct: 603 -VVVAGSKT 610


>gi|425459385|ref|ZP_18838871.1| Similar to tr|Q8YXH6|Q8YXH6 [Microcystis aeruginosa PCC 9808]
 gi|389822926|emb|CCI29290.1| Similar to tr|Q8YXH6|Q8YXH6 [Microcystis aeruginosa PCC 9808]
          Length = 692

 Score =  361 bits (926), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 236/668 (35%), Positives = 342/668 (51%), Gaps = 80/668 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N LAA  S YL +HA NP+DW+ W + A   AR+ D PIFLSIGYS+CHWC VME E+F 
Sbjct: 3   NHLAASESLYLRKHAENPIDWWYWCDSALEIARREDKPIFLSIGYSSCHWCTVMEGEAFS 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPD-LKPLMGGT 221
           D  +A  LN +F+ IKVDREERPD+D +YM  +Q + G GGWPL+VFL+PD L P  GGT
Sbjct: 63  DRAIADYLNQYFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLNVFLTPDSLIPFYGGT 122

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP + ++ RPGF  +L+ V+  +D++++ L++  A  +  L ++     +   L D   
Sbjct: 123 YFPVQPRFNRPGFLQVLQSVRRYYDEEKEKLSKFTAEMLGALRQSAILPRAETNLADP-- 180

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK---KLEDTGKSGEASEGQ 338
            + L    E  +         +G  P FP      + L  S+     ED+ +      G+
Sbjct: 181 -SLLATGIETNTAVIQVNPNNYGR-PSFPMIPYSHLALQGSRFGDDFEDSLRQAAYQRGE 238

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL-TK 397
            + L        GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ+     + +S   +
Sbjct: 239 DLAL--------GGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNGQIVEYLANLWSAGDR 290

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           +  +    +  +++L+R+M  P G  ++A+DADS E       +EGAFYVW+ + + D L
Sbjct: 291 EAAFERGIKGTVNWLKREMTAPEGYFYAAQDADSFEKATDGEPEEGAFYVWSDRSLRDYL 350

Query: 458 GEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
               + L + ++ +   GN            F+G+NVL           +LG  +E  L+
Sbjct: 351 STEELGLLQANFTVTAEGN------------FEGRNVL-----QRRQGGELGKEIENLLD 393

Query: 517 IL-----GECRRKLFDVRSKRPRPH-------------LDDKVIVSWNGLVISSFARASK 558
            L     G  + +L      R                  D K+IV+WN L+IS  ARA  
Sbjct: 394 KLFIRRYGSSQAQLALFPPARDNQEAKTVSWPGRIPAVTDTKMIVAWNSLMISGLARA-- 451

Query: 559 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSFRNGPSKAPG 617
                   A+F+ P+       Y +++  AA FI +H + D +  RL +    G +    
Sbjct: 452 -------FAVFSEPL-------YWQMSTQAAEFILQHQWLDGRFQRLNY---QGQASVLA 494

Query: 618 FLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 676
             +D+A+ I  LLDL       T+WL  AI+LQ   D  F   + GGYFN T  D S+ L
Sbjct: 495 QSEDFAYFIKALLDLQTAKPQETRWLEAAIDLQGEFDRWFWAGDEGGYFN-TASDHSLDL 553

Query: 677 RVKED--HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV 734
            V+E    D A PS N +++ NLVRL+ +    +   Y   AE +L  F T L+    A 
Sbjct: 554 IVRERGYTDNATPSANGIAIANLVRLSRLTENLE---YLDRAEKALQSFSTILEQSPTAC 610

Query: 735 PLMCCAAD 742
           P +  A D
Sbjct: 611 PSLFVALD 618


>gi|425465473|ref|ZP_18844782.1| Six-hairpin glycosidase-like [Microcystis aeruginosa PCC 9809]
 gi|389832278|emb|CCI24243.1| Six-hairpin glycosidase-like [Microcystis aeruginosa PCC 9809]
          Length = 692

 Score =  360 bits (925), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 235/667 (35%), Positives = 342/667 (51%), Gaps = 78/667 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N LAA  S YL +HA NP+DW+ W + A   AR+ D PIFLSIGYS+CHWC VME E+F 
Sbjct: 3   NHLAASESLYLRKHAENPIDWWYWCDSALEIARREDKPIFLSIGYSSCHWCTVMEGEAFS 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPD-LKPLMGGT 221
           D+ +A  LN +F+ IKVDREERPD+D +YM  +Q + G GGWPL+VFL+PD L P  GGT
Sbjct: 63  DQAIADYLNQYFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLNVFLTPDSLIPFYGGT 122

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP + ++ RPGF  +L+ V+  +D++++ L++   F  E L  AL  SA   +    L 
Sbjct: 123 YFPVQPRFNRPGFLQVLQSVRRYYDEEKEKLSK---FTAEMLG-ALRQSAILPRAETNLA 178

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK---KLEDTGKSGEASEGQ 338
             +L     + + +           P FP      + L  S+     +D+ +      G+
Sbjct: 179 APSLLATGIETNTAVIRVNPNNYGRPSFPMIPYANLALQGSRFGDDFDDSLRQAAYQRGE 238

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL-TK 397
            + L        GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ+     + +S   +
Sbjct: 239 DLAL--------GGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNGQIVEYLANLWSAGNR 290

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           +  +    +  +++L+R+M  P G  ++A+DADS E       +EGAFYVW+  E+ D L
Sbjct: 291 EAAFERGIKGTVNWLKREMTAPEGYFYAAQDADSFEKATDGEPEEGAFYVWSDLELRDYL 350

Query: 458 GEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
               + L + ++ +   GN            F+G+NVL           +LG  +E  L+
Sbjct: 351 STEELGLLQANFTVTAEGN------------FEGRNVL-----QRRQGGELGKEIEDMLD 393

Query: 517 IL-----GECRRKLFDVRSKRPRPH-------------LDDKVIVSWNGLVISSFARASK 558
            L     G  + +L      R                  D K+IV+WN L+IS  ARA  
Sbjct: 394 KLFIRRYGSSQAQLALFPPARDNQEAKTVSWPGRIPAVTDTKMIVAWNSLMISGLARA-- 451

Query: 559 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSFRNGPSKAPG 617
                   A+F+ P+       Y ++A  AA FI +H + D +  RL +    G +    
Sbjct: 452 -------FAVFSEPL-------YWQMATVAAEFILKHQWLDGRFQRLNY---QGQASVLA 494

Query: 618 FLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDP-SVL 675
             +D+A+ I  LLDL       T WL  AI+LQ   D  F   + GGYFNT  +    ++
Sbjct: 495 QSEDFAYFIKALLDLQTAKPQETGWLEAAIDLQGEFDRWFWAGDEGGYFNTASDHSLDLI 554

Query: 676 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 735
           LR +   D A PS N +++ NL+RL+ +    +   Y   AE +L  F T L++   A P
Sbjct: 555 LRERGYTDNATPSANGIAIANLLRLSRLTENLE---YLDRAEKALQSFSTILEESPTACP 611

Query: 736 LMCCAAD 742
            +  A D
Sbjct: 612 SLFVALD 618


>gi|407778219|ref|ZP_11125484.1| hypothetical protein NA2_09603 [Nitratireductor pacificus pht-3B]
 gi|407299900|gb|EKF19027.1| hypothetical protein NA2_09603 [Nitratireductor pacificus pht-3B]
          Length = 668

 Score =  360 bits (925), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 217/606 (35%), Positives = 313/606 (51%), Gaps = 64/606 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N LA E SPYL QH  NPV W AW  EA AEA+  D PI LSIGY+ CHWCHVM  ESFE
Sbjct: 6   NLLAEETSPYLQQHRDNPVHWRAWSPEALAEAQALDRPILLSIGYAACHWCHVMAHESFE 65

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           ++ VA ++N  F++IKVDREERP++D++YM  + A    GGWPL++FL+PD  P  GGTY
Sbjct: 66  NDAVAAVMNRLFINIKVDREERPEIDQIYMAALAATGEQGGWPLTMFLTPDGSPFWGGTY 125

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPPE ++GRPGF  +L+ +  AW +KR  L +S       +  +L+        PD +  
Sbjct: 126 FPPEPRFGRPGFVQVLQAIDAAWREKRHELTKSAGNLKAHVQASLAPPPGEPPEPDAM-- 183

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             LR  A ++    D   GG   APKFP    ++++     +  D  +        + V 
Sbjct: 184 --LRDLAARVHGMIDPALGGLRGAPKFPNAPFMKILWLDGIQHGDRTRI-------EAVA 234

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            +L+ M  GGI+DHVGGG  RY+VD+RW VPHFEKMLYD  QL  +    ++ T D  + 
Sbjct: 235 DSLRHMLSGGIYDHVGGGLARYAVDDRWVVPHFEKMLYDNAQLLQLLCWVYARTHDQLFR 294

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
               + +D+L R+M   GG   S+ DAD       T  +EG  YVW+ +E+ ++LG  A 
Sbjct: 295 IRIEETVDWLLREMRVDGGGFASSLDAD-------TDGEEGKTYVWSRQELGEVLGSEAG 347

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
            F + + L+        + +D H +     +L  LN  +A+       +   L+      
Sbjct: 348 AFLDVFTLE--------KPADWHRD----PILHRLNHPAATDPASETRMRTLLD------ 389

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYM 582
            +L   R  RP+P  DDK++V WNG+ I++ A A ++L                DR ++ 
Sbjct: 390 -RLLVARQARPQPGRDDKLLVDWNGMTITALATAGRLL----------------DRPDWT 432

Query: 583 EVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWL 642
           + A +A  F+   +   +  RL HS R      P    DYA +IS    LY   S    L
Sbjct: 433 QAARTAFRFVCESM---ENGRLPHSIRGDKQLFPALSSDYAAMISAATALYGATSDDALL 489

Query: 643 V----WAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
                WA +LQ        D+ G G++ +  +   V +R++ D D A PS  S  +  L 
Sbjct: 490 QQARKWAGQLQRWHQ----DKAGSGFYMSASDSGDVPMRIRGDVDEAIPSATSQVIEALA 545

Query: 699 RLASIV 704
            LA++ 
Sbjct: 546 ALATLT 551


>gi|425439757|ref|ZP_18820072.1| Six-hairpin glycosidase-like [Microcystis aeruginosa PCC 9717]
 gi|389719932|emb|CCH96294.1| Six-hairpin glycosidase-like [Microcystis aeruginosa PCC 9717]
          Length = 692

 Score =  360 bits (925), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 239/669 (35%), Positives = 342/669 (51%), Gaps = 80/669 (11%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TN LAA  S YL +HA NP+DW+ W + A   AR+ D PIFLSIGYS+CHWC VME E+F
Sbjct: 2   TNHLAASESLYLRKHAENPIDWWYWCDSALEIARREDKPIFLSIGYSSCHWCTVMEGEAF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPD-LKPLMGG 220
            D  +A  LN +F+ IKVDREERPD+D +YM  +Q + G GGWPL+VFL+PD L P  GG
Sbjct: 62  SDRAIADYLNHYFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLNVFLTPDSLIPFYGG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFP + ++ RPGF  +L+ V+  +D++++ L++   F  E L  AL  SA   +    L
Sbjct: 122 TYFPVQPRFNRPGFLQVLQSVRRYYDEEKEKLSK---FTAEMLG-ALRQSAILPRAETNL 177

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK---KLEDTGKSGEASEG 337
               L     + + +           P FP      + L  S+     +D+ +      G
Sbjct: 178 AAPYLLATGIETNTAVIRVNPNNYGRPSFPMIPYSHLALQGSRFGDDFDDSLRQAAYQRG 237

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL-T 396
           + + L        GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ+     + +S   
Sbjct: 238 EDLAL--------GGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNGQIVEYLANLWSAGD 289

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
           ++  +    +  +++L+R+M  P G  ++A+DADS E       +EGAFYVW+  E+ D 
Sbjct: 290 REAAFERGIKGTVNWLKREMTAPEGYFYAAQDADSFEKATDGEPEEGAFYVWSDLELRDY 349

Query: 457 LGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
           L    + L + ++ +   GN            F+G+NVL           +LG  +E  L
Sbjct: 350 LSTEELGLLQANFTVTAEGN------------FEGRNVL-----QRRQGGELGEEIENML 392

Query: 516 NIL-----GECRRKLFDVRSKRPRPH-------------LDDKVIVSWNGLVISSFARAS 557
           + L     G  + +L      R                  D K+IV+WN L+IS  ARA 
Sbjct: 393 DKLFIRRYGSSQAQLALFPPARDNQEAKTVSWPGRIPAVTDTKMIVAWNSLMISGLARA- 451

Query: 558 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSFRNGPSKAP 616
                    A+F+ P+       Y ++A  AA FI +H + D +  RL +    G +   
Sbjct: 452 --------FAVFSEPL-------YWQMATQAAEFILKHQWLDGRFQRLNY---QGQASVL 493

Query: 617 GFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL 675
              +D+A+ I  LLDL       T WL  AI+LQ   D  F   + GGYFN T  D S+ 
Sbjct: 494 AQSEDFAYFIKALLDLQTAKPQETGWLEAAIDLQGEFDRWFWAEDEGGYFN-TASDHSLD 552

Query: 676 LRVKED--HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA 733
           L V+E    D A PS N +++ NL+RL+ +    +   Y   AE +L  F T L++   A
Sbjct: 553 LIVRERGYTDNATPSANGIAIANLLRLSRLTENLE---YLDRAEKALQSFSTILEESPTA 609

Query: 734 VPLMCCAAD 742
            P +  A D
Sbjct: 610 CPSLFVALD 618


>gi|111225552|ref|YP_716346.1| hypothetical protein FRAAL6208 [Frankia alni ACN14a]
 gi|111153084|emb|CAJ64831.1| Conserved hypothetical protein [Frankia alni ACN14a]
          Length = 676

 Score =  360 bits (924), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 247/665 (37%), Positives = 334/665 (50%), Gaps = 67/665 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N+LA + SPYLLQHA NPVDW+ W  EAFAEA +R VP+ LS+GY++CHWCHVM  ESFE
Sbjct: 3   NKLAEQTSPYLLQHADNPVDWWPWCPEAFAEAARRGVPVLLSVGYASCHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D   A  +ND FV+IKVDREERPDVD VYM    AL G GGWP++VFL+P  +P   GTY
Sbjct: 63  DVVTAAYMNDHFVNIKVDREERPDVDSVYMDVTVALTGHGGWPMTVFLTPTAEPFFAGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGA--------FAIEQLSEALSASASSN 274
           FPP  + G   F+ +L  V  AW  +R  + +SGA         A       L+AS +S 
Sbjct: 123 FPPRPRPGMGSFRQVLEAVVAAWQTRRAEIEESGADIARRLAEAAARGPVAGLAASPTSG 182

Query: 275 KLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEA 334
            + DEL    L      LS  +D+R GGFG APKFP  +  +M+L H+ +  D G S E 
Sbjct: 183 -VADELSPPLLDTAVAGLSARFDARHGGFGGAPKFPPSMVAEMLLRHAARTGD-GHSLE- 239

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
                MV  T + MA+GG++D + GGF RYSVD  W VPHFEKMLYD  QL  VYL  + 
Sbjct: 240 -----MVALTCERMARGGMYDQLAGGFARYSVDATWTVPHFEKMLYDNAQLLRVYLHLWR 294

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET---EGATRKKEGAFYVWTSK 451
            T       + R+   +L  D+  P G   SA DAD+          + +EGA Y WT  
Sbjct: 295 ATGSPLAQRVVRETAAFLLADLRTPQGGFASALDADAVPAGVPAAHAQPEEGASYSWTPA 354

Query: 452 EVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 510
            +   LG +      E + +   G  +            G +VL    D   +A      
Sbjct: 355 GLRAALGADDGAWAAEIFGVTAEGTFE-----------HGTSVLQLPADPPDAARFA--- 400

Query: 511 LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
                      R  L   R+ RP+P  DDKV+ +WNGL I++ A A  +           
Sbjct: 401 ---------AVRAALAAARAGRPQPARDDKVVAAWNGLAIAALAEAGAL----------- 440

Query: 571 FPVVGSDRKEYMEVAESAASFIRR-HLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 629
                 D   ++  AE AA  +R  HL   +  R     R G +   G L+DY  +  GL
Sbjct: 441 -----LDEPAWIRAAEDAAVLLRDVHLVAGRLRRTSRDGRVGTNA--GVLEDYGDVAEGL 493

Query: 630 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 689
           L L++     +WL  A EL       F   + GG+F+T  +  ++L R ++D D A PSG
Sbjct: 494 LTLHQVTGDPEWLTLAGELLEVVRARFAAPD-GGFFDTADDAEALLRRPRDDSDSATPSG 552

Query: 690 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL-KDMAMAVPLMCCAADMLSVPS 748
            +     L+  A++   + S  +R  AE ++A F   L +D   A      A  +L+ P+
Sbjct: 553 QAAVAGALLTYAAL---TGSAEHRSTAEATVARFAPLLSRDARFAGWAGAVAEALLAGPA 609

Query: 749 RKHVV 753
              VV
Sbjct: 610 EVAVV 614


>gi|427733870|ref|YP_007053414.1| thioredoxin domain-containing protein [Rivularia sp. PCC 7116]
 gi|427368911|gb|AFY52867.1| thioredoxin domain protein [Rivularia sp. PCC 7116]
          Length = 691

 Score =  360 bits (924), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 237/677 (35%), Positives = 345/677 (50%), Gaps = 93/677 (13%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRLA+  S YL +HA NP+DW++W +EA + A +++ PIFLSIGYS+CHWC VME E+F
Sbjct: 2   TNRLASAQSLYLRKHAENPIDWWSWCDEALSTAVEQNKPIFLSIGYSSCHWCTVMEGEAF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGG 220
            D  VA+ +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL+ FLSP DL P   G
Sbjct: 62  SDLEVAEYMNANFIPIKVDREERPDIDSIYMQALQMMSGQGGWPLNAFLSPDDLVPFYAG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL--SEALSASASSNKLPD 278
           TYFPPE++Y RPGF  +L+ ++  +D ++  L +  A  +E L  S  L   A++    +
Sbjct: 122 TYFPPEERYNRPGFLQVLKAIRHYYDTEKQDLQKRKAVILESLLTSAVLQTEATAETQDN 181

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
           +L Q    +    ++ +             FP     QM L  S+    +    +    Q
Sbjct: 182 QLLQKGWEIFTGIIAPNEQGN--------SFPTIPYAQMALQGSRFNFTSRYDCKQICTQ 233

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS--LT 396
           + +      +A GGI DHV GGFHRY+VD  W VPHFEKMLYD GQ+     + +S  + 
Sbjct: 234 RGL-----DLALGGIFDHVAGGFHRYTVDPTWTVPHFEKMLYDNGQIVEYLANLWSAGVK 288

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
           +  F + I + +  +L+R+M  P G  ++A+DADS  T+     +EGAFYVW   ++E +
Sbjct: 289 EPAFETAIAKTV-KWLQREMTAPNGYFYAAQDADSFITQEDVEPEEGAFYVWGFSDLEQL 347

Query: 457 LGEHAIL-FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
           L    +   ++++ + P GN            F+ +NVL + N     + +L   LE  L
Sbjct: 348 LTRAELTELQQNFTVTPNGN------------FENQNVLQKRN-----SDRLSNTLEATL 390

Query: 516 NILGECRR-------KLF-----DVRSK------RPRPHLDDKVIVSWNGLVISSFARAS 557
             L   R        K F     + ++K      R  P  D K+IV+WN ++IS  ARA 
Sbjct: 391 EKLFTARYGDDSSTIKTFAPARNNAQAKSHNWQGRIPPVTDTKMIVAWNAIMISGLARAY 450

Query: 558 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFI-RRHLYDEQTHRLQHSFRNGPSKAP 616
            +                  + EY+E+A  AA F+      D + +RL +  +      P
Sbjct: 451 AVFS----------------QLEYLEMATQAAKFVLENQFVDGRFYRLNYEGK------P 488

Query: 617 GFL---DDYAFLISGLLDLYE------FGSGTKWLVWAIELQNTQDELFLDREGGGYFNT 667
           G L   +DYA  I  LLDL++       G    WL  A+ LQ   ++     E  GYFN 
Sbjct: 489 GVLAQSEDYALFIKALLDLHQACFKADTGKPAFWLEKAVSLQEEFNDYLWSVELHGYFN- 547

Query: 668 TGEDPSVLLRVKEDH--DGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 725
           T  D S  L V+E +  D A PS N +++ NLVRL  +    +   Y   AE +L  F  
Sbjct: 548 TASDASKELIVRERNYIDSATPSANGIALCNLVRLTLVTDNLQ---YLNLAEQALTAFRG 604

Query: 726 RLKDMAMAVPLMCCAAD 742
            + D   A P +  A D
Sbjct: 605 VMNDATQACPSLFVALD 621


>gi|269468817|gb|EEZ80421.1| hypothetical protein Sup05_0857 [uncultured SUP05 cluster
           bacterium]
          Length = 753

 Score =  360 bits (924), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 216/641 (33%), Positives = 329/641 (51%), Gaps = 72/641 (11%)

Query: 89  RTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYS 148
           RT       R K  N L  E SPYLLQHAHNPV+W+A+ +EAF +A+  + P+F+SIGY+
Sbjct: 35  RTQHLDKQGRAKFVNHLILESSPYLLQHAHNPVNWYAFSDEAFDKAKAENKPVFISIGYA 94

Query: 149 TCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSV 208
           TCHWCHVME ESF+D  VA+ LN  F+SIKVDRE RPDVD  YM   Q + G GGWPL+ 
Sbjct: 95  TCHWCHVMEEESFDDVKVAEFLNKHFISIKVDREIRPDVDATYMNVSQLINGSGGWPLNA 154

Query: 209 FLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALS 268
            +  D K    GTYFP      +     IL +++  W  +++ +          + + L+
Sbjct: 155 VILSDGKAFFAGTYFP------KKQLLDILLQIQTLWKNEQNKVINQA----HDIDKILN 204

Query: 269 ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDT 328
            S  + K+   + +N +    + +  ++D   GGFG APKFP    + +++       D 
Sbjct: 205 KSTVTTKVG--INKNIVSKAIQAILDNFDELEGGFGEAPKFPHETMLLLLI-------DE 255

Query: 329 GKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 388
            K     +    +  TL  MA GG +D VGGGFHRYS D  W +PHFEKMLY+Q QL+ +
Sbjct: 256 QKRNPTDDLLNAITTTLDTMASGGFYDTVGGGFHRYSTDNSWLIPHFEKMLYNQAQLSLI 315

Query: 389 YLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVW 448
           Y  A+ LT+   Y  I +  LDY  R+M    G  FSA DADS +       +EG F+VW
Sbjct: 316 YTRAYQLTQKPLYKRIAKQTLDYTLREMQDTNGGFFSATDADSED-------EEGTFFVW 368

Query: 449 TSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL--IELNDSSASASK 506
           +  E++++L +      + Y+       DLS  +D    F+G +V+   ++ND + +  K
Sbjct: 369 SITELKNVLNKEEFKRFDQYF-------DLSTYTD----FEGNHVIRFKDVNDINENDYK 417

Query: 507 LGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAES 566
                      +     KL+ +R KR  P  D+KV++SWN L+I S   A  +       
Sbjct: 418 K----------IDALLTKLYKLRIKREPPLTDNKVLLSWNALMIPSLLEAGDVF------ 461

Query: 567 AMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLI 626
                     +  +Y +   + A ++     + Q +R+     N   +     +DYA+L 
Sbjct: 462 ----------NETKYTDAGVALARYLDNFNKNGQLYRVS---INNELQTIALSEDYAYLA 508

Query: 627 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 686
           +  L ++++   + WL   ++L +   + F D++  G FN T +   +    KE +DGA 
Sbjct: 509 NAYLSVFDYTHESIWLDKTVQLIDDMMQKFWDKKKFG-FNMTQDKKYLNTNYKESYDGAI 567

Query: 687 PSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 727
           PS N V+   LV+L   V G K  +++Q A+  L+ F   +
Sbjct: 568 PSANGVAYKVLVKLNYRVNGQK--FFKQ-AQQLLSAFSAEI 605


>gi|325676575|ref|ZP_08156253.1| thymidylate kinase [Rhodococcus equi ATCC 33707]
 gi|325552753|gb|EGD22437.1| thymidylate kinase [Rhodococcus equi ATCC 33707]
          Length = 674

 Score =  360 bits (924), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 220/610 (36%), Positives = 315/610 (51%), Gaps = 63/610 (10%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
            +  N L    SPYL QHA NPV W  WG +A A AR+RDVP+ LSIGY+ CHWCHVM  
Sbjct: 6   GRERNTLGEATSPYLRQHADNPVHWHQWGPDALAWARERDVPVLLSIGYAACHWCHVMAH 65

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFED+  A ++N+ FV IKVDREERPD+D VYM    A+ G GGWP++ FL+PD  P  
Sbjct: 66  ESFEDDATAAVMNEHFVCIKVDREERPDLDAVYMNATVAMTGQGGWPMTCFLTPDGAPFY 125

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTY+P E + G P F  +L  V D W  +R  +  + A  + +L  + S +  +   P 
Sbjct: 126 CGTYYPREPRGGMPSFVQLLHAVTDTWRSRRGDVDDAAASVVAELRRS-SGALPAGGAPI 184

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
           ++P   L      + +  D   GGFG APKFP  + ++ +L   ++         A    
Sbjct: 185 DVPL--LSGAVANVLRDEDRDHGGFGGAPKFPPSMLLEGLLRSYERT-------SAGPTL 235

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
           + V  T + MA+GGI+D +GGGF RYSVD +W VPHFEKMLYD   L   Y      T  
Sbjct: 236 RAVERTAEAMARGGIYDQLGGGFARYSVDTQWVVPHFEKMLYDNALLVRFYAHLARRTGS 295

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
                +  + +D+L RD+    G   SA DAD       T  +EG  Y WT++++ D++G
Sbjct: 296 ALARRVTEETVDFLLRDLRTAAGAFASALDAD-------TDGEEGLTYAWTAQQIADVVG 348

Query: 459 -EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
            +      E + +  TG  +           +G +VL    D          PL+   + 
Sbjct: 349 DDDGRWAAETFAVTDTGTFE-----------RGTSVLQLPAD----------PLDA--DR 385

Query: 518 LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSD 577
           L + R +L   R++RP+P  DDKV+ +WNGL I++ A A   L                 
Sbjct: 386 LADIRSRLLAARTRRPQPARDDKVVTAWNGLAITALAEAGAALG---------------- 429

Query: 578 RKEYMEVAESAASFI-RRHLYDEQTHRLQHSFRNGPSKAP-GFLDDYAFLISGLLDLYEF 635
           R +++E AE  A  +   HL D    RL+ +   G    P G L+DY  L +GL  L++ 
Sbjct: 430 RADWVEAAEECAHMVLSTHLVD---GRLRRASLGGTVGEPAGILEDYGALATGLSTLHQV 486

Query: 636 GSGTKWLVWAIELQNTQDELFLD-REGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
               +WL  A  L +T  + F D  E G +F+T  +  +++ R ++  DGA PSG SV+ 
Sbjct: 487 TGVAEWLEVATGLLDTAIDHFADPDEPGSWFDTADDAETLVARPRDPLDGATPSGASVTT 546

Query: 695 INLVRLASIV 704
             L+  +S+V
Sbjct: 547 EALLTASSLV 556


>gi|166365023|ref|YP_001657296.1| six-hairpin glycosidase-like [Microcystis aeruginosa NIES-843]
 gi|166087396|dbj|BAG02104.1| six-hairpin glycosidase-like [Microcystis aeruginosa NIES-843]
          Length = 692

 Score =  360 bits (924), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 239/668 (35%), Positives = 342/668 (51%), Gaps = 80/668 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N LAA  S YL +HA NP+DW+ W + A   AR+ D PIFLSIGYS+CHWC VME E+F 
Sbjct: 3   NHLAASESLYLRKHAENPIDWWYWCDSALEIARREDKPIFLSIGYSSCHWCTVMEGEAFS 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPD-LKPLMGGT 221
           D+ +A  LN +F+ IKVDREERPD+D +YM  +Q + G GGWPL+VFL+PD L P  GGT
Sbjct: 63  DQAIADYLNQYFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLNVFLTPDSLIPFYGGT 122

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP + ++ RPGF  +L+ V+  +D++++ L++   F  E L  AL  SA   +    L 
Sbjct: 123 YFPVQPRFNRPGFLQVLQSVRRYYDEEKEKLSK---FTAEMLG-ALRQSAILPRSETNLA 178

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK---KLEDTGKSGEASEGQ 338
             +L     + + +           P FP      + L  S+     +D+ +      G+
Sbjct: 179 APSLLATGIETNTAVIRVNPNNYGRPSFPMIPYSHLALQGSRFGDDFDDSLRQAAYQRGE 238

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL-TK 397
            + L        GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ+     + +S   +
Sbjct: 239 DLAL--------GGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNGQIVEYLANLWSAGNR 290

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           +  +    +  +++L+R+M  P G  ++A+DADS E       +EGAFYVW+  E+ D L
Sbjct: 291 EAAFERGIKGTVNWLKREMTAPEGYFYAAQDADSFEKATDGEPEEGAFYVWSDLELRDYL 350

Query: 458 G-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
             E   L + ++ +   GN            F+G+NVL           +LG  +E  L+
Sbjct: 351 STEELGLLQANFTVTAEGN------------FEGRNVL-----QRRQGGELGKEIENMLD 393

Query: 517 IL-----GECRRKLFDVRSKRPRPH-------------LDDKVIVSWNGLVISSFARASK 558
            L     G  + +L      R                  D K+IV+WN L+IS  ARA  
Sbjct: 394 KLFIRRYGSSQAQLALFPPARDNQEAKTVSWPGRIPAVTDTKMIVAWNSLMISGLARA-- 451

Query: 559 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSFRNGPSKAPG 617
                   A+F  P+       Y ++A  AA FI +H + D +  RL +    G +    
Sbjct: 452 -------FAVFGEPL-------YWQMATVAAEFILKHQWLDGRFQRLNY---QGQASVLA 494

Query: 618 FLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 676
             +D+A+ I  LLDL       T WL  AI+LQ   D  F   + GGYFN T  D S+ L
Sbjct: 495 QSEDFAYFIKALLDLQTAKPQETGWLEAAIDLQGEFDRWFWAEDEGGYFN-TASDHSLDL 553

Query: 677 RVKED--HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV 734
            V+E    D A PS N +++ NL+RL+ +    +   Y   AE +L  F T L++   A 
Sbjct: 554 IVRERGYTDNATPSANGIAIANLLRLSRLTENLE---YLDRAEKALQSFSTILEESPTAC 610

Query: 735 PLMCCAAD 742
           P +  A D
Sbjct: 611 PSLFVALD 618


>gi|343087024|ref|YP_004776319.1| hypothetical protein [Cyclobacterium marinum DSM 745]
 gi|342355558|gb|AEL28088.1| protein of unknown function DUF255 [Cyclobacterium marinum DSM 745]
          Length = 682

 Score =  360 bits (924), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 239/670 (35%), Positives = 329/670 (49%), Gaps = 61/670 (9%)

Query: 96  HSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHV 155
           H+     N L    S YL QHA+NPV+W+ W +EA  +A+  + PI +SIGYS CHWCHV
Sbjct: 4   HTEVMKANHLIKSKSIYLQQHAYNPVEWYPWSKEALEKAKLENKPILVSIGYSACHWCHV 63

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           ME ESFE + VAKL+N  F+ IK+DREERPD+D +YM  VQ +   GGWPL+VFL P+ K
Sbjct: 64  MEGESFEAKDVAKLMNAHFICIKIDREERPDLDNIYMEAVQVMGLQGGWPLNVFLLPNQK 123

Query: 216 PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNK 275
           P  GGTYF  E       +  +L  V  A+ ++ D L +S     + +  ++       K
Sbjct: 124 PFYGGTYFSKEQ------WIQVLSGVAQAFSQQYDDLVKSAEGFGQSIERSVIEKYGLKK 177

Query: 276 LPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
              +     +R  A+ L    D  +GG    PKFP PV I   L     L+D    GE  
Sbjct: 178 GKSKFFPETIRQIAKDLIGKIDPVWGGMKRVPKFPMPV-IWSFLLDMAILDDHEDLGEK- 235

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
                V FTL+ MA GGI+DH+GGGF RYSVD  W  PHFEKMLYD GQL ++Y  A+  
Sbjct: 236 -----VCFTLEKMAMGGIYDHLGGGFCRYSVDGEWFAPHFEKMLYDNGQLLSLYSKAYQY 290

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           + +  +     + + +L  DM GP    +SA DADS         +EG FY WT  E++D
Sbjct: 291 SANALFREKITETISWLLNDMCGPEMGFYSALDADS-------DGEEGRFYTWTFSELKD 343

Query: 456 ILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
           +LG+    F + Y +K  GN +            GKN+L +           G   E  L
Sbjct: 344 LLGDDLNWFCQLYGIKEQGNWE-----------AGKNILYQTLPYVEVGENFGFTQEALL 392

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
           + L E + KL + R  R RP LDDK+I  WNG VI     A   L  E            
Sbjct: 393 SKLREVKLKLKEKRESRTRPGLDDKIISGWNGWVIKGLCDAYLALGEE------------ 440

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
               E    A    +FI  H+  E  + L  S++ G +  P FL+DYA +I   + LY+ 
Sbjct: 441 ----EIRNTAVRTGNFIWHHMVIE--NELYRSYKGGQAYTPAFLEDYAAVIQSFISLYKI 494

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
              + WL  A  L       F D E   ++    +   ++   KE  D   PS NSV   
Sbjct: 495 SFDSFWLRRAELLAQRVLRNFHDEEDEMFYFNDPKIEKLIANKKELFDNVIPSSNSVMAR 554

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP--LMCCAADML--SVPSRKH 751
           NL +L   +    +D Y   A+  L +    + DM +  P  L   A+  L  SVP+ + 
Sbjct: 555 NLHQLGLYLY---NDTYLAQAKSMLQL----VSDMLIKEPDFLANWASFYLEQSVPTAE- 606

Query: 752 VVLVGHKSSV 761
           +V+ G ++S 
Sbjct: 607 IVIAGKEAST 616


>gi|403723313|ref|ZP_10945570.1| hypothetical protein GORHZ_074_00090 [Gordonia rhizosphera NBRC
           16068]
 gi|403206090|dbj|GAB89901.1| hypothetical protein GORHZ_074_00090 [Gordonia rhizosphera NBRC
           16068]
          Length = 670

 Score =  360 bits (923), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 223/628 (35%), Positives = 322/628 (51%), Gaps = 78/628 (12%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   SPYLLQHA NPVDW+ WG EAF EAR+RD P+ LS+GY+ CHWCHVM  ESFE
Sbjct: 3   NRLANATSPYLLQHASNPVDWWEWGPEAFEEARRRDTPVLLSVGYAACHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D   A ++N  FV +KVDREERPD+D +YM    A+ G GGWP++ FL+P  +P   GTY
Sbjct: 63  DAATAAVMNREFVCVKVDREERPDIDAIYMNATVAMTGQGGWPMTCFLTPSGEPFYCGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSN-KLPDELP 281
           FP   + G P    I+  V +AW ++RD +   GA   E L++  +A  S+   + DEL 
Sbjct: 123 FPSSPRGGMPSLTQIMLAVAEAWTQRRDEVDAMGAQVREHLTDHTAALPSTEVTVDDELL 182

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
            +A+   A  L    D   GGFG APKFP    ++ +L   +  E TG     +     V
Sbjct: 183 AHAV---ASALHDE-DRVAGGFGGAPKFPPSALLEGLL---RSWESTGD----TRALDAV 231

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             T   MA+GGI+D + GGF RY+VD  W +PHFEKMLYD  QL  VY      T D   
Sbjct: 232 GRTCTAMARGGIYDQLAGGFARYAVDNDWVIPHFEKMLYDNAQLLRVYGHLARRTGDRLA 291

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHA 461
             I  + + +L RD+   GG   S+ DAD+   EG+T       YVW+  E+ ++LG+  
Sbjct: 292 LRITEETVRFLDRDLRVAGG-FASSLDADADGVEGST-------YVWSPSELREVLGDDD 343

Query: 462 ILF-KEHYYLKPTGNCDLSRMS-----DPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
            L+  E + +  TG  +  R +     DP +  +  +V +                    
Sbjct: 344 GLWAAELFGVTATGTFEHGRSTLQLRRDPDDPVRFTSVAV-------------------- 383

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
                   +L   R+ RP+P  DDKV+  WN L +++ A A                  G
Sbjct: 384 --------RLLSARASRPQPARDDKVVTGWNALAVTALAEAG----------------AG 419

Query: 576 SDRKEYMEVAESAA-SFIRRHLYDEQTHRLQHSFRNGPSKAP-GFLDDYAFLISGLLDLY 633
             R E++++  S A S +  H+ D    RL+ S   G   AP   L+D+A L++ LL L+
Sbjct: 420 LGRPEWIDLGASCARSLVDHHIVD---GRLRRSSLGGTVGAPMAALEDHAALVTALLTLH 476

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREG-GGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 692
           +    T W    + L ++  E+F D E  G +F+  G+   ++ R ++  DGA P+G S+
Sbjct: 477 QVTGETSWRDEGLALLDSAVEVFADPEAPGTWFDAVGD--GLIARPRDPIDGATPAGASL 534

Query: 693 SVINLVRLASIVAGSKSDYYRQNAEHSL 720
               L+  +++     +  Y +  E +L
Sbjct: 535 MTEALLIASAVAPFGPATRYAEVLEQTL 562


>gi|425435449|ref|ZP_18815900.1| Similar to tr|Q8YXH6|Q8YXH6 [Microcystis aeruginosa PCC 9432]
 gi|389679973|emb|CCH91261.1| Similar to tr|Q8YXH6|Q8YXH6 [Microcystis aeruginosa PCC 9432]
          Length = 692

 Score =  360 bits (923), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 240/668 (35%), Positives = 344/668 (51%), Gaps = 80/668 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N LAA  S YL +HA NP+DW+ W + A   AR+ D PIFLSIGYS+CHWC VME E+F 
Sbjct: 3   NHLAASESLYLRKHAENPIDWWYWCDSALEIARREDKPIFLSIGYSSCHWCTVMEGEAFS 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPD-LKPLMGGT 221
           D  +A  LN +F+ IKVDREERPD+D +YM  +Q + G GGWPL+VFL+PD L P  GGT
Sbjct: 63  DRAIADYLNQYFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLNVFLTPDSLIPFYGGT 122

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP + ++ RPGF  +L+ V+  +D++++ L++   F  E L  AL  SA   +    L 
Sbjct: 123 YFPVQPRFNRPGFLQVLQSVRRYYDEEKEKLSK---FTAEMLG-ALRQSAILPRAETNLA 178

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK---KLEDTGKSGEASEGQ 338
             +L     + + +           P FP      + L  S+     ED+ +      G+
Sbjct: 179 DPSLLATGIETNTAVIQVNPNNYGRPSFPMIPYSHLALQGSRFGDDFEDSLQQAAYQRGE 238

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL-TK 397
            + L        GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ+     + +S   +
Sbjct: 239 DLAL--------GGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNGQIVEYLANLWSAGDR 290

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           +  +    +  +++L+R+M  P G  ++A+DADS E       +EGAFYVW+ + + D L
Sbjct: 291 EAAFERGIKGTVNWLKREMTAPEGYFYAAQDADSFEKATDGEPEEGAFYVWSDRSLRDYL 350

Query: 458 G-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
             E   L + ++ +   GN            F+G+NVL           +LG  +E  L+
Sbjct: 351 STEELGLLQANFTVTAEGN------------FEGRNVL-----QRRQGGELGKEIENILD 393

Query: 517 IL-------GECRRKLF-DVRSKRPRPHL----------DDKVIVSWNGLVISSFARASK 558
            L        + +  LF   R  +   ++          D K+IV+WN L+IS  ARA  
Sbjct: 394 KLFIRRYGSSQAQLALFPPARDNQEAKNVSWPGRIPAVTDTKMIVAWNSLMISGLARA-- 451

Query: 559 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSFRNGPSKAPG 617
                   A+F+ P+       Y ++A  AA FI +H + D +  RL +    G +    
Sbjct: 452 -------FAVFSEPL-------YWQMATQAAEFILQHQWLDGRFQRLNY---QGQASVLA 494

Query: 618 FLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 676
             +D+A+ I  LLDL       T WL  AI+LQ   D  F   + GGYFN T  D S+ L
Sbjct: 495 QSEDFAYFIKALLDLQTAKPQETGWLEAAIDLQGEFDRWFWAGDEGGYFN-TASDHSLDL 553

Query: 677 RVKED--HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV 734
            V+E    D A PS N +++ NLVRL+ +    +   Y   AE +L  F T L+    A 
Sbjct: 554 IVRERGYTDNATPSANGIAIANLVRLSRLTENLE---YLDRAEKALQSFSTILEQSPTAC 610

Query: 735 PLMCCAAD 742
           P +  A D
Sbjct: 611 PSLFVALD 618


>gi|374850591|dbj|BAL53576.1| hypothetical conserved protein [uncultured Bacteroidetes bacterium]
          Length = 676

 Score =  359 bits (922), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 222/593 (37%), Positives = 310/593 (52%), Gaps = 48/593 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N+LAAE S YL QHA NPV W  WGEEAFA AR+    +FLSIGYS CHWCHVME ESF 
Sbjct: 3   NQLAAERSLYLRQHADNPVPWMPWGEEAFARARREQKLVFLSIGYSACHWCHVMEEESFA 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  VA LL  W++ IKVDREERPDVD +YM+  QA+ G GGWPL+V L+P+ + +  GTY
Sbjct: 63  DPEVAALLERWYIPIKVDREERPDVDALYMSICQAMTGQGGWPLTVILTPEREVIFAGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP      R G   +L ++   W +   ML  S    +E+++  L ++ S +     +  
Sbjct: 123 FPKRSTPYRIGLIELLERIAALWQQDGQMLRSSAHALMERIAPHLRSAHSGH-----ITA 177

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             +    EQL K +D R+GGFG+ PKFP    +  +L    +         ++    +  
Sbjct: 178 GTITAALEQLDKLFDRRYGGFGTRPKFPMAAALWFLLIAGPR--------TSTRALDIAT 229

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL+ M  GGI DHVG GFHRYS DERW +PHFEKMLYDQ  L  VY +A  +TK   + 
Sbjct: 230 ATLEAMRWGGIWDHVGFGFHRYSTDERWFLPHFEKMLYDQALLLLVYAEAARITKRRLFE 289

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
               +I  YL R ++   G   ++EDAD       T   EGAFY W  +++  ++  H  
Sbjct: 290 ITAMEIAAYLDRTLLLEHGAFAASEDAD-------TPDGEGAFYQWRYEDLRRLIPSHEF 342

Query: 463 -LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
              +  ++L P GN        P     G+N+L     +     + G  LE++L      
Sbjct: 343 ERMRAIFHLSPEGNAHDEATGQP----TGRNILSAGTRTEDVLERFGGTLEEFLAWWEPL 398

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           R++L  VR+ R RP  D+KV+  WN LV+++ ARA ++L+          P +       
Sbjct: 399 RQRLETVRNSRARPARDEKVLCDWNALVVAALARAGRLLRQ---------PTL------- 442

Query: 582 MEVAESAASFIRR-HLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           +E A    S++ R H++ + T  L H   +G     GFLDDYAF     L+LY       
Sbjct: 443 IERARRTWSYLERVHVHADGT--LAHCSYSGEPAIDGFLDDYAFAAWAALELYHATGAND 500

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           +L     L ++  E F+D  G G   T     + +L + E  DGA  SG  ++
Sbjct: 501 FLEHVEHLLHSITERFVD--GDGIVRTAAS--ADVLPLTEPSDGATVSGIGIT 549


>gi|300864691|ref|ZP_07109547.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
 gi|300337297|emb|CBN54695.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
          Length = 694

 Score =  359 bits (922), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 239/679 (35%), Positives = 344/679 (50%), Gaps = 93/679 (13%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRLA   S YL +HA NP+DW+ W +EA   A + + PIFLS+GYS+CHWC VME E+F
Sbjct: 2   TNRLAQSQSLYLRKHAENPIDWWPWCDEALEIASRENKPIFLSVGYSSCHWCTVMENEAF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGG 220
            +  +A+ +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL++FL P D  P  GG
Sbjct: 62  SNAAIAEYMNAHFIPIKVDREERPDLDSIYMQALQMMTGQGGWPLNIFLDPIDRIPFYGG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFP   +YGRPGF  +L  ++  +D ++  L    AF  E L+    ++A S    ++L
Sbjct: 122 TYFPVYPRYGRPGFLEVLHAIRRFYDLEKGKLQ---AFKEEILAHFQQSAALSGT--EKL 176

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEA-SEGQK 339
               LR   E  +    +R  G    P FP      MM Y    L     + E  S+ Q+
Sbjct: 177 SGKLLRRGLETSTAIISAREYG----PSFP------MMPYSESALRGMRFNLEGKSDSQQ 226

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL-TKD 398
           +       +A GGI+DHV GGFHRY+VD  W VPHFEKMLYD GQ+     + +S   ++
Sbjct: 227 VCTQRGLDLALGGIYDHVAGGFHRYTVDGTWTVPHFEKMLYDNGQIVEYLANLWSAGVRE 286

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL- 457
             +       +++L+R+MI P G  ++A+DAD+      T  +EGAFYVW+  E+E++L 
Sbjct: 287 PAFERAVAGTVEWLQREMIAPAGYFYAAQDADNFTNIEETEPEEGAFYVWSYSELENLLE 346

Query: 458 GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
            +     +E + +  TGN            F+ KNVL           KL   LE  L  
Sbjct: 347 ADEFRELQEQFTVTQTGN------------FEAKNVL-----QRRHPGKLSSTLETALAK 389

Query: 518 LGECR-------------------RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 558
           L + R                    K +D   + P    D K+IV+WN L+IS  ARA+ 
Sbjct: 390 LFKVRYGAVPESVKVFPPARNNQEAKSYDWPGRIP-AVTDTKMIVAWNSLMISGLARATA 448

Query: 559 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSFRNGPSKAPG 617
           +                  + EY+E+A  AA+FI  + + D + HRL +   +G S    
Sbjct: 449 VFH----------------KSEYLELAAKAANFILDNQWIDGRFHRLNY---DGKSAVMA 489

Query: 618 FLDDYAFLISGLLDLYEFGSG---TK----------WLVWAIELQNTQDELFLDREGGGY 664
             +DYA  +  LLDL++   G   TK          WL  A+++Q   DE     E GGY
Sbjct: 490 QSEDYALFLKALLDLHQVSEGWLETKPDSFNLKPEVWLEKAVKIQEEFDEFLWSIEVGGY 549

Query: 665 FNTTGEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVF 723
           +NT  +  + +L+R +   D A P+ N V++ NLVRL  +    +   Y   AE  L  F
Sbjct: 550 YNTASDASADLLVRERSYTDNATPAANGVAIANLVRLTLLTEDLQ---YLDRAEQGLQAF 606

Query: 724 ETRLKDMAMAVPLMCCAAD 742
            + ++D   A P +  A D
Sbjct: 607 SSVMQDSPQACPSLFAALD 625


>gi|425470696|ref|ZP_18849556.1| Similar to tr|Q8YXH6|Q8YXH6 [Microcystis aeruginosa PCC 9701]
 gi|389883513|emb|CCI36064.1| Similar to tr|Q8YXH6|Q8YXH6 [Microcystis aeruginosa PCC 9701]
          Length = 692

 Score =  359 bits (921), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 237/668 (35%), Positives = 345/668 (51%), Gaps = 80/668 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N LA   S YL +HA NP+DW+ W + A   AR+ D PIFLSIGYS+CHWC VME E+F 
Sbjct: 3   NHLAESESLYLRKHAENPIDWWYWCDSALEIARREDKPIFLSIGYSSCHWCTVMEGEAFS 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPD-LKPLMGGT 221
           D  +A  LN +F+ IKVDREERPD+D +YM  +Q + G GGWPL+VFL+PD L P  GGT
Sbjct: 63  DRAIADYLNQYFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLNVFLTPDSLIPFYGGT 122

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP + ++ RPGF  +L+ V+  +D++++ L++   F  E L  AL  SA   +    L 
Sbjct: 123 YFPVQPRFNRPGFLQVLQSVRRYYDEEKEKLSK---FTDEMLG-ALRQSAILPRAETNLA 178

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK---KLEDTGKSGEASEGQ 338
           + +L     + + +           P FP      + L  S+     ED+ +      G+
Sbjct: 179 EPSLLATGIETNTAVIRVNPNNYGRPSFPMIPYSHLALQGSRFGDDFEDSLRQAAYQRGE 238

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL-TK 397
            + L        GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ+     + +S   +
Sbjct: 239 DLAL--------GGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNGQIVEYLANLWSAGNR 290

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           +  +    +  +++L+R+M  P G  ++A+DADS E       +EGAFYVW+ + + D L
Sbjct: 291 EAAFERGIKGTVNWLKREMTAPEGYFYAAQDADSFEKATDGEPEEGAFYVWSDRSLRDYL 350

Query: 458 GEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
               + L + ++ +   GN            F+G+NVL           +LG  +E  L+
Sbjct: 351 STEELGLLQANFTVTAEGN------------FEGRNVL-----QRRQGGELGKEIENILD 393

Query: 517 IL-------GECRRKLF-DVRSKRPRPHL----------DDKVIVSWNGLVISSFARASK 558
            L        + +  LF   R  +   ++          D K+IV+WN L+IS  ARA  
Sbjct: 394 KLFIRRYGSSQAQLALFPPARDNQEAKNVSWPGRIPAVTDTKMIVAWNSLMISGLARA-- 451

Query: 559 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSFRNGPSKAPG 617
                   A+F+ P+       Y ++A  AA FI +H + D +  RL +    G +    
Sbjct: 452 -------FAVFSEPL-------YWQMATVAAEFILQHQWLDGRFQRLNY---QGQASVLA 494

Query: 618 FLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 676
             +D+A+ I  LLDL       T WL  AI+LQ   D  F   + GGYFN T  D S+ L
Sbjct: 495 QSEDFAYFIKALLDLQTANPQETGWLEAAIDLQGEFDRWFWAEDEGGYFN-TASDHSLDL 553

Query: 677 RVKED--HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV 734
            V+E    D A PS N +++ NL+RL+ +    +   Y   AE +L  F T L++   A 
Sbjct: 554 IVRERGYTDNATPSANGIAIANLLRLSRLTENLE---YLDRAEKALQSFSTILEESPTAC 610

Query: 735 PLMCCAAD 742
           P +  A D
Sbjct: 611 PSLFVALD 618


>gi|183221169|ref|YP_001839165.1| hypothetical protein LEPBI_I1783 [Leptospira biflexa serovar Patoc
           strain 'Patoc 1 (Paris)']
 gi|189911260|ref|YP_001962815.1| hypothetical protein LBF_1730 [Leptospira biflexa serovar Patoc
           strain 'Patoc 1 (Ames)']
 gi|167775936|gb|ABZ94237.1| Conserved hypothetical protein containing a thioredoxin domain
           [Leptospira biflexa serovar Patoc strain 'Patoc 1
           (Ames)']
 gi|167779591|gb|ABZ97889.1| Conserved hypothetical protein [Leptospira biflexa serovar Patoc
           strain 'Patoc 1 (Paris)']
          Length = 690

 Score =  359 bits (921), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 236/668 (35%), Positives = 348/668 (52%), Gaps = 63/668 (9%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
           +K  NRL  E SPYLLQHAHNPVDWF WG EAF +A+K D  I LSIGYSTCHWCHVME 
Sbjct: 5   SKKPNRLVHEKSPYLLQHAHNPVDWFPWGTEAFEKAKKEDKIILLSIGYSTCHWCHVMER 64

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFED   A++LN  FV IK+DREERPD+DK+YM  + A+   GGWPL++FL+P+ +P++
Sbjct: 65  ESFEDISTAEVLNRDFVCIKLDREERPDIDKIYMDALHAMGTQGGWPLNMFLTPEKEPIL 124

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLP- 277
           GGTYFPPE++YG+  FK +LR V  AW +++  L Q+       L E      +  K+P 
Sbjct: 125 GGTYFPPENRYGKRSFKEVLRLVTKAWKEQKGELLQAANELSNYLREN-QTRTNDGKVPG 183

Query: 278 -DELPQNALRLCAEQLSKSYDSRFGGF--GSAPKFPRPVEIQMML-YHSKKLEDTGKSGE 333
            + L QN  R       + YD  F GF   +  KFP  + +  +L Y+S       K   
Sbjct: 184 TEILVQNFNRYW-----QVYDQEFFGFKTNTINKFPPSMALIFLLDYYS-----IHKDNR 233

Query: 334 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
           A E   M   T   M  GGI+D VGGG +RY+ D  W VPHFEKMLYD           +
Sbjct: 234 ALE---MAYNTGYAMKSGGIYDQVGGGIYRYATDHEWLVPHFEKMLYDNALYVEFLAKLY 290

Query: 394 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
            +T ++F+     +I+ Y++RDM    G I SAEDADS   EG    +EG FY+W   E+
Sbjct: 291 QITGEIFFLEALMEIISYIQRDMRLDIGGIASAEDADS---EG----EEGKFYLWKESEI 343

Query: 454 EDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
              L E  ++   ++ +   GN + +  +  +   KGKN   E           G+  + 
Sbjct: 344 LSELTEEEVI--GYWNVTEEGNFE-NNQNILNVAIKGKNPYQE-----------GIHFKD 389

Query: 514 YLNI-LGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 572
              I L   +  L+ +R++R RP  DDK++ SWN L I +               + +F 
Sbjct: 390 GFKIKLERSKEILYQLRNQRIRPLRDDKILTSWNCLWIRAL--------------LASFE 435

Query: 573 VVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDL 632
             G     ++  +++   F+  +L  E    +   FR G +K  G L DY+ LI     L
Sbjct: 436 ATGDPL--FLNQSKTIYEFLFTYLVKEDG-SVYRRFREGETKFFGTLPDYSELIWVSFRL 492

Query: 633 YEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSV 692
           ++     ++ +  +++    +  FL  + G YF +   D  ++ R  + +DG EPSGNS 
Sbjct: 493 FQLVGDKQYFLQGLQIFKYVETHFLS-DMGPYFESAAGDEELIARTIDGYDGVEPSGNS- 550

Query: 693 SVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHV 752
           +++++      +    +D  ++ A    + F   L   +++ P M  A      PS+  V
Sbjct: 551 TILHIFYFLHSLGFLHADILKK-ANAIFSYFLPELTQNSLSYPSMLSAFQKFQTPSK--V 607

Query: 753 VLVGHKSS 760
           V+V H++ 
Sbjct: 608 VIVLHRNQ 615


>gi|389645929|ref|XP_003720596.1| spermatogenesis-associated protein 20 [Magnaporthe oryzae 70-15]
 gi|351637988|gb|EHA45853.1| spermatogenesis-associated protein 20 [Magnaporthe oryzae 70-15]
          Length = 865

 Score =  358 bits (920), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 237/714 (33%), Positives = 361/714 (50%), Gaps = 129/714 (18%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NR     SPY+  H   PV W    ++A A A+ ++  IF++IG+  CH+C +   ESF 
Sbjct: 49  NRAGDSESPYIQAHQDTPVAWQLLDKDAVALAKSQNKLIFMNIGFKACHYCRLTTQESFR 108

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           ++ VA LLN  F+ I VDREERPD+D +YM Y+QA+   GGWPL+VFL+P+L+P+ GGTY
Sbjct: 109 NKNVAALLNSSFIPILVDREERPDIDSIYMNYIQAVNSAGGWPLNVFLTPELEPVFGGTY 168

Query: 223 FPP---------EDKYGRPGFKTILRKVKDAWDKK--------RDMLAQSGAFAIE---- 261
           +P          ED      F  IL+K++  W ++        +D++ Q   FA E    
Sbjct: 169 WPGPGRSTSSAVEDGEEPLDFLGILKKLQKVWTEQEAKCRKEAQDIVLQLREFAAEGTMG 228

Query: 262 -----------------QLSEALSASASSNKLPD------------ELPQNALRLCAEQL 292
                             +S  ++A  +S + P             ++  + L      +
Sbjct: 229 VGNTEKVPSVATTGATVNISTGVAAPTTSTETPKKTVTASASATDLDVDLDQLEEAYANI 288

Query: 293 SKSYDSRFGGFGSAPKFPRPVEIQMML---YHSKKLED-TGKSGEASEGQKMVLFTLQCM 348
           S+S+D   GGF  +PKFP P ++  +L   +   ++ D  G   E +    M L TL+ +
Sbjct: 289 SRSFDRVNGGFNLSPKFPTPPKLSFLLRLAHLPPEVGDIVGGPEEIARATHMALATLRAL 348

Query: 349 AKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF---------SLTKDV 399
             GG+ DH+G GFHRYSV   W VPHFEKM+ D   L  VYLDA+         + T + 
Sbjct: 349 RDGGLRDHIGAGFHRYSVTADWSVPHFEKMIADNALLLGVYLDAWLGQAAKEGRAPTLED 408

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFS-----------AEDADSAETEGATRKKEGAFYVW 448
            ++ +  ++ DYL      PG E  S           +E +DS + +     +EGAFY+W
Sbjct: 409 EFADVVLELGDYLGN----PGSEFGSSSTCQDSLLPTSEASDSYQRKSDKHMREGAFYLW 464

Query: 449 TSKEVEDIL----------GEH-----AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNV 493
           T +E +  +          G+H     A +   ++ +K  GN  +    DPH+EF  +NV
Sbjct: 465 TRREFDATVSNTEDGDLTNGKHDGDFYARVAAAYWNVKEHGN--IPEEQDPHDEFINQNV 522

Query: 494 LIELNDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISS 552
           L  +   +  ++  G+ +++   IL E RRKL   R S R RP +D+K +V++N + +S+
Sbjct: 523 LRVVKTPAELSTSFGIAVDEVNQILAEARRKLRARRDSDRVRPEVDEKQVVAYNAMAMSA 582

Query: 553 FARASKILKSEAESAMFNFPVVGSDR---KEYMEVAESAASFIRRHLYDEQTHRL-QHSF 608
            ARA  +L S            G D+     +M  A+ AA  ++  LYD++T +L +H F
Sbjct: 583 LARAGVVLWS-----------TGLDKHRGSAWMMCAKQAAIEMKGRLYDQETGKLSRHWF 631

Query: 609 RNGPSKAPGFLDDYAFLISGLLDLYE-FGSGTKWLVWAIELQNTQDELFLDREG------ 661
           RN  S      +DYAFLI  LLDLY+  G  + +L WA +LQ+ Q E+F DR        
Sbjct: 632 RNKKSSTDALAEDYAFLIEALLDLYDATGDESAYLDWAKQLQDKQIEMFYDRVAPSSQNL 691

Query: 662 -----------GGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIV 704
                      GG+++T  E P V+LR+K+  D ++PS N+VS  NL RLA I+
Sbjct: 692 DSDAAKTKSGSGGFYSTAEEAPDVILRLKDGMDTSQPSTNAVSASNLFRLALIL 745


>gi|411116326|ref|ZP_11388814.1| thioredoxin domain-containing protein [Oscillatoriales
           cyanobacterium JSC-12]
 gi|410713817|gb|EKQ71317.1| thioredoxin domain-containing protein [Oscillatoriales
           cyanobacterium JSC-12]
          Length = 698

 Score =  358 bits (920), Expect = 5e-96,   Method: Compositional matrix adjust.
 Identities = 250/708 (35%), Positives = 360/708 (50%), Gaps = 94/708 (13%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N LA  +S YL +HA NP+DW+ W +EA A AR+ D PIFLS+GYS+CHWC VME E+F 
Sbjct: 15  NHLANANSLYLRKHADNPIDWWYWCDEALAIARQEDKPIFLSVGYSSCHWCTVMEGEAFS 74

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGGT 221
           D+ +AK +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL++FL+P DL P  GGT
Sbjct: 75  DQEIAKFMNTNFLPIKVDREERPDLDSIYMQALQMMTGQGGWPLNIFLTPDDLVPFYGGT 134

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLP--DE 279
           YFP E +YGRP F  +L  V+  +D+++  L    A       E LS   SS  LP  + 
Sbjct: 135 YFPVEPRYGRPSFLQVLEGVRRFYDQEKTKLQSVKA-------EILSNLQSSTLLPAVEA 187

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
           LP++      E  +    S+  G    P FP       M+ ++   +   +    S    
Sbjct: 188 LPRDVFLHGLEYNTGVISSKSVG----PSFP-------MIPYADVAQRAMRFLAKSRYNA 236

Query: 340 MVLFTLQC--MAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
           + + T +   +A GGI DHVGGGFHRY+VD  W VPHFEKMLYD GQ+     + +S   
Sbjct: 237 LEVSTQRGIDLALGGIFDHVGGGFHRYTVDPTWTVPHFEKMLYDNGQIMEYLANQWS--A 294

Query: 398 DVFYSYICRDI---LDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
           DV      R I   +++L+R+M  P G  ++A+DADS  +  AT  +EGAFYVW   E+ 
Sbjct: 295 DVQEPAFKRAIALTVEWLQREMTAPEGYFYAAQDADSFTSPDATEPEEGAFYVWGYDELT 354

Query: 455 DILGEHAIL-FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
            +L E  +   +    +   GN            F+G NVL +   S   +  +   L+K
Sbjct: 355 TLLTEKELREMQTQLTITEKGN------------FEGVNVL-QRRHSGQLSEAIETALDK 401

Query: 514 YLNI---LGECRRKLF-DVRSKRPR----------PHLDDKVIVSWNGLVISSFARASKI 559
              I   +G  R K F   R+ R            P  D K+IV+WN L+IS  ARA+ +
Sbjct: 402 LFQIRYGIGTDRIKPFPPARNNREAQEMPWAGRIPPVTDTKMIVAWNSLMISGLARAAAV 461

Query: 560 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFI-RRHLYDEQTHRLQHSFRNGPSKAPGF 618
            ++ +                ++E+A +A  FI  R   + + HR+ +   NG       
Sbjct: 462 FQNCS----------------WLELAVNATQFILERQWVENRLHRVNY---NGQPSVLAQ 502

Query: 619 LDDYAFLISGLLDLYE-------FGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGED 671
            +DYA  I  LLDL++         + + +L  A+ +Q   DE     E GGYFN T   
Sbjct: 503 SEDYALFIKALLDLHQAYQSLDSVAALSSFLDAAVRVQAELDEFLWSVELGGYFN-TDRT 561

Query: 672 PSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMA 731
           P +L+R +   D A P+ N V+V NLVRLA +   ++   Y   AE +L  F + ++   
Sbjct: 562 PDLLVRERSYMDNATPAANGVAVANLVRLALL---TEDLSYLDRAEQTLKAFGSVMERSP 618

Query: 732 MAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAAAHASYDLNKT 779
            A P +    D        H  LV  +++ D   +LAA +    + KT
Sbjct: 619 QACPSLFVGMDWF-----LHQTLV--RATPDAIALLAAQYQPTVMYKT 659


>gi|425450832|ref|ZP_18830655.1| Similar to tr|Q8YXH6|Q8YXH6 [Microcystis aeruginosa PCC 7941]
 gi|389768138|emb|CCI06653.1| Similar to tr|Q8YXH6|Q8YXH6 [Microcystis aeruginosa PCC 7941]
          Length = 692

 Score =  358 bits (919), Expect = 6e-96,   Method: Compositional matrix adjust.
 Identities = 240/665 (36%), Positives = 342/665 (51%), Gaps = 74/665 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N LAA  S YL +HA NP+DW+ W + A   AR+ D PIFLSIGYS+CHWC VME E+F 
Sbjct: 3   NHLAASESLYLRKHAENPIDWWYWCDSALEIARREDKPIFLSIGYSSCHWCTVMEGEAFS 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPD-LKPLMGGT 221
           D  +A  LN +F+ IKVDREERPD+D +YM  +Q + G GGWPL+VFL+PD L P  GGT
Sbjct: 63  DRAIADYLNQYFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLNVFLTPDSLIPFYGGT 122

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP + ++ RPGF  +L+ V+  + ++++ L++   F  E L  AL  SA   +    L 
Sbjct: 123 YFPVQPRFNRPGFLQVLQSVRRYYGEEKEKLSK---FTAEMLG-ALRQSAILPRAETNLA 178

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
             +L     + + +           P FP      + L  S+  +D   S + +  Q+  
Sbjct: 179 DPSLLATGIETNTAVIQVNPNNYGRPSFPMIPYSHLALQGSRFGDDFDDSLQQAAYQRG- 237

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL-TKDVF 400
               + +A GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ+     + +S   ++  
Sbjct: 238 ----EDLALGGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNGQIVEYLANLWSAGDREAA 293

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-E 459
           +    +  +++L+R+M  P G  ++A+DADS E       +EGAFYVW+  E+ D L  E
Sbjct: 294 FERGIKGTVNWLKREMTAPEGYFYAAQDADSFEKARDREPEEGAFYVWSDLELRDYLSTE 353

Query: 460 HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL- 518
              L + ++ +   GN            F+G+NVL           +LG  +E  L+ L 
Sbjct: 354 ELGLLQANFTVTAEGN------------FEGRNVL-----QRRQGGELGKEIENILDKLF 396

Query: 519 ----GECRRKLFDVRSKRPRPH-------------LDDKVIVSWNGLVISSFARASKILK 561
               G  + +L      R                  D K+IV+WN L+IS  ARA     
Sbjct: 397 IRRYGSSQAQLALFPPARDNQEAKTVSWPGRIPAVTDTKMIVAWNSLMISGLARA----- 451

Query: 562 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSFRNGPSKAPGFLD 620
                A+F+ P+       Y ++A  AA FI +H + D +  RL +    G +      +
Sbjct: 452 ----FAVFSEPL-------YWQMATQAAEFILQHQWLDGRFQRLNY---QGQASVLAQSE 497

Query: 621 DYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK 679
           D+A+ I  LLDL       T WL  AI+LQ   D  F   + GGYFN T  D S+ L V+
Sbjct: 498 DFAYFIKALLDLQTAKPQETGWLEAAIDLQGEFDRWFWSEDEGGYFN-TASDHSLDLIVR 556

Query: 680 ED--HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 737
           E    D A PS N +++ NLVRL+ +    +   Y   AE +L  F T L+    A P +
Sbjct: 557 ERGYTDNATPSANGIAIANLVRLSRLTENLE---YLDRAEKALQSFSTILEQSPTACPSL 613

Query: 738 CCAAD 742
             A D
Sbjct: 614 FVALD 618


>gi|326331060|ref|ZP_08197358.1| hypothetical protein NBCG_02497 [Nocardioidaceae bacterium Broad-1]
 gi|325951101|gb|EGD43143.1| hypothetical protein NBCG_02497 [Nocardioidaceae bacterium Broad-1]
          Length = 655

 Score =  358 bits (918), Expect = 9e-96,   Method: Compositional matrix adjust.
 Identities = 232/656 (35%), Positives = 325/656 (49%), Gaps = 72/656 (10%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           +NRLA+  SPYLLQHA NPVDW+ WG +AF +AR+RDVP+ LS+GY+ CHWCHVM  ESF
Sbjct: 2   SNRLASATSPYLLQHAQNPVDWWEWGPDAFEDARRRDVPVLLSVGYAACHWCHVMAHESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           EDE  A  +N+ FV+IKVDREERPDVD VYM    A+ G GGWP++V L  D  P   GT
Sbjct: 62  EDETTAAYMNEHFVNIKVDREERPDVDAVYMAATTAMTGSGGWPMTVVLDHDGNPFFAGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP   ++G+P F  +L+ + +AW ++R  +        + L+     + ++        
Sbjct: 122 YFPDMPRHGQPAFTQVLQALSEAWTQRRSEIGAVADNVRQHLANISGVAGAAGDW----- 176

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           Q  +    E L+  +D   GGFG APKFP  + ++ +   +  L      G  S    M+
Sbjct: 177 QVDVDAVVETLAGEFDPMAGGFGGAPKFPPSMVLEFLRRAAGAL------GADSRVSHML 230

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             T+  MA GGI+D VGGGF RY+VD  W VPHFEKMLYD  QL  +Y    +   D   
Sbjct: 231 SRTVAAMAGGGIYDQVGGGFARYAVDRGWVVPHFEKMLYDNAQLIGLYARLGTELGD--- 287

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EH 460
             + R+  D++ R++    G   SA DADS   EG     EG FYVWT  E+ ++LG E 
Sbjct: 288 -RVARESADWMIRELGTAEGGFASALDADS---EGV----EGKFYVWTPAELVEVLGAED 339

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
                + + +   G             F+     ++L        +           L  
Sbjct: 340 GAWAAQVFEVTEAGT------------FEEGASTLQLRHRPDDTER-----------LES 376

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            + +L   R +R RP  DDKV+ +WNGL IS    A  +L              G  R  
Sbjct: 377 VKARLLAAREERVRPARDDKVVAAWNGLAISGLVDAGLLL--------------GEPR-- 420

Query: 581 YMEVAESAASFI-RRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAFLISGLLDLYEFGSG 638
           Y++ A +AA  + R H+ D    RL    R+G + A  G L+DY  + SG L L +    
Sbjct: 421 YIDAAVAAAELLWRVHVQDA---RLLRVSRDGVAGAHAGVLEDYGCVASGFLSLTQATGA 477

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
             WL  A  L +T    F   E GG+++T  +  +++ R ++  D A P G S  +  LV
Sbjct: 478 ATWLDRATSLLDTALTHF-RAEDGGFYDTGDDAEALVTRPRDASDNASPGGTSAMLHALV 536

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFETRL-KDMAMAVPLMCCAADMLSVPSRKHVV 753
              ++    +   YR  AE +L    T + K    A   +  AA M   P    VV
Sbjct: 537 TAHALTGEGR---YRTAAEEALGATSTLMTKAPRFAGWSLAAAATMAEGPLEIAVV 589


>gi|428770863|ref|YP_007162653.1| hypothetical protein Cyan10605_2528 [Cyanobacterium aponinum PCC
           10605]
 gi|428685142|gb|AFZ54609.1| protein of unknown function DUF255 [Cyanobacterium aponinum PCC
           10605]
          Length = 676

 Score =  357 bits (917), Expect = 9e-96,   Method: Compositional matrix adjust.
 Identities = 234/667 (35%), Positives = 346/667 (51%), Gaps = 81/667 (12%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TN L    S YL +HAHNP++W+ W +EA   A++ D PIFLSIGYS+CHWC VME E+F
Sbjct: 2   TNHLINTQSLYLQKHAHNPINWWYWCDEALNLAKQEDKPIFLSIGYSSCHWCTVMEGEAF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGG 220
            D  +A  LND F+SIKVDREERPD+D +YMT +Q + G GGWPL++FLSP DL P  GG
Sbjct: 62  SDGAIASYLNDNFISIKVDREERPDIDSIYMTALQMMTGQGGWPLNIFLSPDDLVPFYGG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQL-SEALSASASSNKLPDE 279
           TYFP E +YGRPGF  IL+ ++D +  K D         ++ L + +     S N+L  E
Sbjct: 122 TYFPIEPRYGRPGFLQILQALRDFYHDKSDKFISLKNEIVKGLETNSNIIFTSENQLTPE 181

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
           L Q  +   ++ ++++       +GS P+FP      MM Y +  L+   K     +   
Sbjct: 182 LLQQGIANNSKVIARN------DYGS-PRFP------MMPYSNITLQGGVKDKNYRD--- 225

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ----LANVYLDAFSL 395
           + +     +  GGI+DHVGGGFHRY+VD  W VPHFEKMLYD G     LAN++ +   +
Sbjct: 226 LAIRRALDLVNGGIYDHVGGGFHRYTVDATWTVPHFEKMLYDNGLIMEFLANLWANGVEI 285

Query: 396 TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVED 455
           ++       C  I D+L+R+M    G  ++A+DAD+         +EG FYVW+ +++++
Sbjct: 286 SE---IKRACEGIKDWLKREMTSEKGYFYAAQDADNFADIHHIEPEEGEFYVWSYQQLKE 342

Query: 456 IL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 514
           IL  E    F + + +   GN            F+ KNVL +  D S +   +   L+K 
Sbjct: 343 ILSAEEFNAFIDTFIISEDGN------------FESKNVLQKREDKSIN-EIINNALDKL 389

Query: 515 LNI-LGECRRKL--------------FDVRSKRPRPHLDDKVIVSWNGLVISSFARASKI 559
             +  GE R  L              F    + P P  D K+I++WN L+IS  A A  +
Sbjct: 390 FKVRYGEERNSLEKFSPAKNNQEAKTFQWLGRIP-PVTDTKMILAWNSLMISGLATAYGV 448

Query: 560 LKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE-QTHRLQHSFRNGPSKAPGF 618
            +  +                Y+++AE A  FI  H ++  + HRL +    G       
Sbjct: 449 FQDVS----------------YLDLAEKATEFILNHQWENGRLHRLNYE---GNVAVFAQ 489

Query: 619 LDDYAFLISGLLDLYEFGSGTK--WLVWAIELQNTQDELFLDREGGGYFNTTGEDPS-VL 675
            +DY+  I  LLDL +        +L  AI++Q   ++   D+E GGY+N   ++ S +L
Sbjct: 490 SEDYSLFIKALLDLAQNHPTNTGFYLDQAIKIQAEFNQFCQDKEQGGYYNNAHDNSSDLL 549

Query: 676 LRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 735
           +R K   D A PS N +++ NLVRL       K   Y   AE +L +F   +   + + P
Sbjct: 550 IREKSYIDNATPSPNGIAIANLVRLHLFTDEEK---YLDEAEKTLKLFSDIMNKASTSCP 606

Query: 736 LMCCAAD 742
            +  A +
Sbjct: 607 SLFTALN 613


>gi|389862702|ref|YP_006364942.1| hypothetical protein MODMU_0997 [Modestobacter marinus]
 gi|388484905|emb|CCH86447.1| conserved protein of unknown function [Modestobacter marinus]
          Length = 668

 Score =  357 bits (917), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 218/587 (37%), Positives = 290/587 (49%), Gaps = 63/587 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLAA  SPYLLQHA NPVDW  WG +AFAEAR RDVP+ +S+GY+ CHWCHVM  ESFE
Sbjct: 3   NRLAAATSPYLLQHADNPVDWQEWGADAFAEARARDVPVLVSVGYAACHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D   A  LN  FV +KVDREERPDVD VY+   QAL G GGWP++VF +PD  P   GTY
Sbjct: 63  DAATAAQLNAGFVCVKVDREERPDVDSVYLAATQALTGQGGWPMTVFTTPDGAPFYCGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
            PP    G P F+ +L  V DAW  +R  L  +G   +E +S  L   A     P  L  
Sbjct: 123 LPPRPHPGMPSFRQVLDAVTDAWTHRRAGLQDAGQRIVEGISGRLDLGA-----PTPLTA 177

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
           + L      L+  YD   GGFG APKFP  + ++ +L    +  D    G       M  
Sbjct: 178 DLLDGAVRALADRYDREAGGFGGAPKFPPSMVLEFLLRAHARRGDEDALG-------MAR 230

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            T + MA+GGI D + GGF RYSVD  W VPHFEKMLYD   L   Y   +  T   +  
Sbjct: 231 HTAEAMARGGICDQLAGGFARYSVDAGWVVPHFEKMLYDNALLLRAYSHLWRTTGADWAR 290

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            +  +   +L RD+    G   SA DAD   TEG     EG  YVWT  ++ ++LG+   
Sbjct: 291 RVADETARFLIRDLGTAEGGFASALDAD---TEGV----EGLSYVWTPAQLREVLGDDDG 343

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
            +    +              P   F+     ++L        +           L   R
Sbjct: 344 SWAAQVF-----------GVTPEGTFEEGASTLQLRRDPDDGER-----------LARVR 381

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYM 582
             L   R++RP+P  DDKV+ +WNGL I++ A    +                +   E +
Sbjct: 382 AALLQARARRPQPARDDKVVTAWNGLAIAALADHGAL----------------TGDTELV 425

Query: 583 EVAESAASFIRR-HLYDEQTHRLQHSFRNGPSKA-PGFLDDYAFLISGLLDLYEFGSGTK 640
             A  AA  + R H  D    RL+ + R G   A  G L+D+  L  GLL L+   +  +
Sbjct: 426 RAAGRAADLLHRVHWVD---GRLRRASRGGVVGAHAGVLEDHGDLAEGLLALHAATAEPR 482

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEP 687
           WL WA EL +     F+D + G +++T  +  +++ R  +  DG  P
Sbjct: 483 WLRWAGELLDVVAARFVDAD-GRWYDTAADAEALVHRPFDPADGPTP 528


>gi|340385830|ref|XP_003391411.1| PREDICTED: uncharacterized protein yyaL-like [Amphimedon
           queenslandica]
          Length = 642

 Score =  357 bits (917), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 217/571 (38%), Positives = 306/571 (53%), Gaps = 46/571 (8%)

Query: 94  TSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWC 153
           T  S     N L  E SPYLLQHA NPVDW  WG EA   A++ D PI LSIGYS CHWC
Sbjct: 2   TDSSSGPRANALGRETSPYLLQHADNPVDWRPWGAEALERAKREDKPILLSIGYSACHWC 61

Query: 154 HVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQAL-YGGGGWPLSVFLSP 212
           HVM  ESFEDE  A+L+ND +++IKVDREERPD+DK+Y T  Q L    GGWPL+V L+P
Sbjct: 62  HVMAHESFEDEPTARLMNDLYINIKVDREERPDIDKIYQTAHQLLSRRPGGWPLTVILAP 121

Query: 213 DLK-PLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASA 271
           D + P   GTYFP   ++G P F+ +L +V+  + ++R+ + +  A  ++ L++  +AS 
Sbjct: 122 DDQAPFFAGTYFPDAPRHGMPSFRQVLVEVERLYRERREDIRRQNASLMDALADLDNASP 181

Query: 272 SSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKS 331
                 D L    L      L +S+DSR GGFG APKFP P  I+ ++     L     S
Sbjct: 182 GEEG--DSLSAQPLEAARAALLRSHDSRHGGFGGAPKFPHPTWIERLMRDRASLP---PS 236

Query: 332 GEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLD 391
            +      +  F+L  M  GG++DH GGGF+RY+VDE W +PHFEKMLYD G L  +   
Sbjct: 237 PDTDAALSIARFSLSKMCLGGLYDHAGGGFYRYTVDEMWMIPHFEKMLYDNGPLLEIAAR 296

Query: 392 AFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSK 451
            + LT D  +    ++   +  R+M  P G  +S  DADS       + +EG FY+WT +
Sbjct: 297 MYRLTGDELFVRAAKETAAWAMREMQSPQGGFWSTLDADS-------QGEEGKFYLWTPE 349

Query: 452 EVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDS-SASASKLGMP 510
           EV   + +      E+  L P    D      P N F+  +  + ++ S    A + G+ 
Sbjct: 350 EVRSHVPD-----DEYIALAPRFGLD-----RPPN-FESTHWHLHVDSSIEEVARQTGLS 398

Query: 511 LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
             +    +     +LF+ RSKR  P  D+KVI SWNGL+I   A A  IL S+A      
Sbjct: 399 ESESAARIDRALARLFEARSKRVYPGRDEKVIASWNGLMIKGMAVAGSILGSQA------ 452

Query: 571 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLL 630
                      ++ A  A  FIR  ++ +   RL  ++++G ++   +LDD+A LI G+L
Sbjct: 453 ----------MIDSAARAVDFIRNAMWIDG--RLLATYKDGRARFNAYLDDHACLIDGIL 500

Query: 631 DLYEFGSGTKWLVWAIELQNTQDELFLDREG 661
            L       + L +AI+L   +  L   REG
Sbjct: 501 ALLAARWSAENLSFAIDL--VERTLIAAREG 529


>gi|425446506|ref|ZP_18826509.1| Six-hairpin glycosidase-like [Microcystis aeruginosa PCC 9443]
 gi|389733246|emb|CCI02963.1| Six-hairpin glycosidase-like [Microcystis aeruginosa PCC 9443]
          Length = 689

 Score =  357 bits (917), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 241/666 (36%), Positives = 343/666 (51%), Gaps = 76/666 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N LA   S YL +HA NP+DW+ W + A   AR+ D PIFLSIGYS+CHWC VME E+F 
Sbjct: 3   NHLAESESLYLRKHAENPIDWWYWCDSALEIARREDKPIFLSIGYSSCHWCTVMEGEAFS 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPD-LKPLMGGT 221
           D  +A  LN +F+ IKVDREERPD+D +YM  +Q + G GGWPL+VFL+PD L P  GGT
Sbjct: 63  DRAIADYLNQYFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLNVFLTPDSLIPFYGGT 122

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP + ++ RPGF  +L+ V+  +D++++ L++   F  E L  AL  SA   +    L 
Sbjct: 123 YFPVQPRFNRPGFLQVLQSVRRYYDEEKEKLSK---FTAEMLG-ALRQSAILPRAETNLA 178

Query: 282 QNALRLCA-EQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
             +L     E+ +         +G  P FP      + L  S+  ED   S   +  Q+ 
Sbjct: 179 APSLLATGIEKNTAVIRVNPNNYGR-PSFPMIPYSHLALQGSRFGEDFDDSLRQAAYQRG 237

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL-TKDV 399
                + +A GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ+     + +S   ++ 
Sbjct: 238 -----EDLALGGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNGQIVEYLANLWSAGDREA 292

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG- 458
            +    +  +++L+R+M  P G  ++A+DADS E       +EGAFYVW+  E+ D L  
Sbjct: 293 AFERGIKGTVNWLKREMTAPEGYFYAAQDADSFEKATDGEPEEGAFYVWSDLELRDYLST 352

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
           E   + + ++ +   GN            F+G+NVL           +LG  +E  L+ L
Sbjct: 353 EELGVLQANFTVTAEGN------------FEGRNVL-----QRRQGGELGEEIENMLDKL 395

Query: 519 -----GECRRKLFDVRSKRPRPH-------------LDDKVIVSWNGLVISSFARASKIL 560
                G  + +L      R                  D K+IV+WN L+IS  ARA    
Sbjct: 396 FIRRYGSSQAQLALFPPARDNQEAKTVSWPGRIPAVTDTKMIVAWNSLMISGLARA---- 451

Query: 561 KSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSFRNGPSKAPGFL 619
                 A+F  P+       Y ++A  AA FI +H + D +  RL +    G +      
Sbjct: 452 -----FAVFGEPL-------YWQMAAQAAEFILKHQWLDGRFQRLNY---QGQASVLAQS 496

Query: 620 DDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRV 678
           +D+A+ I  LLDL       T+WL  AI+LQ   D  F   + GGYFNT   D S+ L V
Sbjct: 497 EDFAYFIKALLDLQTAKPQETRWLEAAIDLQGEFDRWFWAEDEGGYFNTAS-DHSLDLIV 555

Query: 679 KED--HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPL 736
           +E    D A PS N +++ NL+RL+ +    +   Y   AE +L  F T L+    A P 
Sbjct: 556 RERGYTDNATPSANGIAIANLLRLSRLTENLE---YLDRAEKALQSFSTILEQSPTACPS 612

Query: 737 MCCAAD 742
           +  A D
Sbjct: 613 LFVALD 618


>gi|392946294|ref|ZP_10311936.1| thioredoxin domain-containing protein [Frankia sp. QA3]
 gi|392289588|gb|EIV95612.1| thioredoxin domain-containing protein [Frankia sp. QA3]
          Length = 676

 Score =  357 bits (917), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 242/665 (36%), Positives = 335/665 (50%), Gaps = 67/665 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N+LA + SPYLLQHA NPVDW+ W  EAFA+A +R VP+ LS+GY++CHWCHVM  ESFE
Sbjct: 3   NKLAEQTSPYLLQHADNPVDWWPWCPEAFADAARRGVPVLLSVGYASCHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D   A  +ND FV+IKVDREERPDVD VYM    AL G GGWP++VFL+P  +P   GTY
Sbjct: 63  DVVTAAYMNDHFVNIKVDREERPDVDSVYMDVTVALTGHGGWPMTVFLTPTAEPFFAGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLS--------EALSASASSN 274
           FPP  + G   F+ +L  V  AW  +R  + +SGA    +L+          L+AS +S 
Sbjct: 123 FPPRPRPGMGSFRQVLEAVVAAWQTRRAEIEESGADIARRLAEAAARGPVAGLAASPTSG 182

Query: 275 KLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEA 334
            + DEL    L      LS  +D+R GGFG APKFP  +  +M+L H+ +  D       
Sbjct: 183 -VADELTPQLLDTAVAGLSARFDARHGGFGGAPKFPPSMVAEMLLRHAARTGD------- 234

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
               +MV  T + +A+GG++D + GGF RYSVD  W VPHFEKMLYD  QL  VYL  + 
Sbjct: 235 EHSLEMVALTCERIARGGMYDQLAGGFARYSVDATWTVPHFEKMLYDNAQLLRVYLHLWR 294

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAET---EGATRKKEGAFYVWTSK 451
            T       + R    +L  D+  P G   SA DAD+          + +EGA Y WT  
Sbjct: 295 ATGSPLAQRVVRQTAAFLLADLRTPQGGFASALDADAVPAGVPAAHAQPEEGASYSWTPA 354

Query: 452 EVEDILG-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 510
            +   LG +      E + +   G  +            G +VL    D   +A      
Sbjct: 355 GLRAALGADDGAWAAEIFGVTAEGTFE-----------HGTSVLQLPADPPDAARFA--- 400

Query: 511 LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
                      R  L   R+ RP+P  DDKV+ +WNGL I++ A A  +           
Sbjct: 401 ---------AVRAALAAARADRPQPARDDKVVAAWNGLAIAALAEAGAL----------- 440

Query: 571 FPVVGSDRKEYMEVAESAASFIRR-HLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGL 629
                 D   ++  AE AA  +R  HL   +  R     R G +   G L+DY  +  GL
Sbjct: 441 -----LDEPAWIRAAEDAAVLLRDVHLVAGRLRRTSRDGRVGTNA--GVLEDYGDVAEGL 493

Query: 630 LDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSG 689
           L L++     +WL  A EL +     F   + GG+F+T  +  ++L R ++D D A PSG
Sbjct: 494 LTLHQVTGDPEWLTLAGELLDVVRARFAAPD-GGFFDTADDAEALLRRPRDDSDSATPSG 552

Query: 690 NSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL-KDMAMAVPLMCCAADMLSVPS 748
            +     L+  A++   + S  +R+ AE ++A F   L +D   A      A  +L+ P+
Sbjct: 553 QAAVAGALLTYAAL---TGSAEHRRAAEETVARFAPLLSRDARFAGWAGAVAEALLAGPA 609

Query: 749 RKHVV 753
              VV
Sbjct: 610 EVAVV 614


>gi|443651764|ref|ZP_21130697.1| hypothetical protein C789_1237 [Microcystis aeruginosa DIANCHI905]
 gi|159027460|emb|CAO89425.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
 gi|443334405|gb|ELS48917.1| hypothetical protein C789_1237 [Microcystis aeruginosa DIANCHI905]
          Length = 692

 Score =  357 bits (915), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 239/668 (35%), Positives = 339/668 (50%), Gaps = 80/668 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N LA   S YL +HA NP+DW+ W + A   AR+ D PIFLSIGYS+CHWC VME E+F 
Sbjct: 3   NHLAESESLYLRKHAENPIDWWYWCDSALEIARREDKPIFLSIGYSSCHWCTVMEGEAFS 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPD-LKPLMGGT 221
           D  +A  LN +F+ IKVDREERPD+D +YM  +Q + G GGWPL+VFL+PD L P  GGT
Sbjct: 63  DRAIADYLNQYFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLNVFLTPDSLIPFYGGT 122

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP + ++ RPGF  +L+ V+  ++++++ L++   F  E L  AL  SA   +    L 
Sbjct: 123 YFPVQPRFNRPGFLQVLQSVRRYYEEEKEKLSK---FTAEMLG-ALRQSAILPRAETNLA 178

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK---KLEDTGKSGEASEGQ 338
             +L     + + +           P FP      + L  S+     ED+ +      G+
Sbjct: 179 DPSLLATGIETNTAVIRVNPNNYGRPSFPMIPYSHLALQGSRFGDDFEDSLRQAAHQRGE 238

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL-TK 397
            + L        GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ+     + +S   +
Sbjct: 239 DLAL--------GGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNGQIVEYLANLWSAGDQ 290

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
           +  +    +  +++L+R+M  P G  ++A+DADS E       +EGAFYVW+  E+ D L
Sbjct: 291 EAAFERGIKGTVNWLKREMTAPEGYFYAAQDADSFEKATDGEPEEGAFYVWSDLELRDYL 350

Query: 458 G-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
             E   L + ++ +   GN            F+G+NVL           +LG  +E  L+
Sbjct: 351 STEELGLLQANFTVTAEGN------------FEGRNVL-----QRRQGGELGKEIENILD 393

Query: 517 IL-----GECRRKLFDVRSKRPRPH-------------LDDKVIVSWNGLVISSFARASK 558
            L     G  + +L      R                  D K+IV+WN L+IS  ARA  
Sbjct: 394 KLFIRRYGSSQAQLALFPPARDNQEAKTVSWPGRIPAVTDTKMIVAWNSLMISGLARA-- 451

Query: 559 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSFRNGPSKAPG 617
                   A+F  P+       Y ++A  AA FI +H + D +  RL +    G +    
Sbjct: 452 -------FAVFGEPL-------YWQMATVAAEFILKHQWLDGRFQRLNY---QGQASVLA 494

Query: 618 FLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL 676
             +D+A+ I  LLDL       T WL  AI+LQ   D  F   + GGYFN T  D S+ L
Sbjct: 495 QSEDFAYFIKALLDLQTANPQETGWLEAAIDLQGEFDRWFWAEDEGGYFN-TASDHSLDL 553

Query: 677 RVKED--HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAV 734
            V+E    D A PS N +++ NLVRL+ +    +   Y   AE +L  F T L+    A 
Sbjct: 554 IVRERGYTDNATPSANGIAIANLVRLSRLTENLE---YLDRAEKALQSFSTILEQSPTAC 610

Query: 735 PLMCCAAD 742
           P +  A D
Sbjct: 611 PSLFVALD 618


>gi|150026141|ref|YP_001296967.1| hypothetical protein FP2103 [Flavobacterium psychrophilum JIP02/86]
 gi|149772682|emb|CAL44165.1| Protein of unknown function YyaL [Flavobacterium psychrophilum
           JIP02/86]
          Length = 686

 Score =  356 bits (913), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 217/617 (35%), Positives = 318/617 (51%), Gaps = 54/617 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N+L  E SPYLLQHA+NP+ W AW +   A A+K +  I +SIGYS CHWCHVME ESFE
Sbjct: 16  NQLNLETSPYLLQHANNPIHWQAWSKNTLATAQKENKLIIISIGYSACHWCHVMEHESFE 75

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           ++ VA ++N  F+SIKVDREERPDVD +YM  VQ +   GGWPL+V   PD +P+ GGTY
Sbjct: 76  NQEVASVMNLNFISIKVDREERPDVDAIYMKAVQMMTNRGGWPLNVVCLPDGRPIWGGTY 135

Query: 223 FPPEDKYGRPGFKTILRKVKDAW---DKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           F  E+      +   L+++ + +    +K    AQ     I+ L      +A      ++
Sbjct: 136 FQKEE------WTNTLQQLHELYVSNPQKIIKYAQKLHQGIQVLGTIQHHTAQ-----EQ 184

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
              N ++   E+ SKS+D  +GG+  APKF       MM  +   L+  G   ++ E   
Sbjct: 185 NHTNNIKPLVEKWSKSFDWEYGGYARAPKF-------MMPNNYLFLQRYGYQTKSQELLN 237

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
            V  TL  MA GGI D + GGF RYSVD RWH+PHFEKMLYD GQL ++Y  A+  T++ 
Sbjct: 238 FVDLTLTKMAHGGIFDTIAGGFSRYSVDIRWHIPHFEKMLYDNGQLVSLYAQAYKRTQNP 297

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 459
            Y  +    L ++ R+ +      ++A DADS         +EGAFYVWT  E+++IL  
Sbjct: 298 LYKEVIEKTLTFVEREFLNSDNGFYAALDADSLNQNNEL--EEGAFYVWTKTELQEILKN 355

Query: 460 HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
              +F   Y +   G  +     D H       VLI+   S + ASK G+   +  N   
Sbjct: 356 DFEIFSHLYNVNDFGFWE----HDNH-------VLIQNQPSKSIASKFGLTENELQNKRK 404

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
              + LF  R KRP+P LDDK + SWN +++  +  A   L ++                
Sbjct: 405 NWEQLLFTKREKRPKPRLDDKSLTSWNAIMLKGYTDAYNALGNQ---------------- 448

Query: 580 EYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGT 639
           +Y+ +AE  A FI    +  +   L  S++   S   GFL+DYAF I   + LY+     
Sbjct: 449 KYLAIAEKNAQFITTKQWSAEGF-LYRSYKKNKSTIEGFLEDYAFTIDAFISLYQATLNE 507

Query: 640 KWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVR 699
           K+L  A +L +   + F + +   +   + +   ++ +  E  D   P+ NSV   NL  
Sbjct: 508 KYLQQAKQLTDYCFDNFYNEKQHFFAFNSRKSAQLIAQHFETEDNVMPASNSVMANNLYV 567

Query: 700 LASIVAGSKSDYYRQNA 716
           L  + +   ++YY + A
Sbjct: 568 LGLLFS---NNYYEKIA 581


>gi|290957891|ref|YP_003489073.1| hypothetical protein SCAB_34251 [Streptomyces scabiei 87.22]
 gi|260647417|emb|CBG70522.1| conserved hypothetical protein [Streptomyces scabiei 87.22]
          Length = 691

 Score =  355 bits (912), Expect = 4e-95,   Method: Compositional matrix adjust.
 Identities = 227/630 (36%), Positives = 318/630 (50%), Gaps = 61/630 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   SPYLLQHA NPVDW  W   AF EAR+RDVP+FLS+GYS CHWCHVM  ESFE
Sbjct: 9   NRLAHATSPYLLQHADNPVDWRPWEPAAFEEARRRDVPVFLSVGYSACHWCHVMAKESFE 68

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+G A  LN+ FVS+KVDREERPDVD VYM  VQA  G GGWP+SVF++P  +P   GTY
Sbjct: 69  DKGTAAYLNEHFVSVKVDREERPDVDAVYMEAVQAATGQGGWPMSVFMTPAAEPFYFGTY 128

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPP  + G P F+ +L  V  AW  +R  +A         L+E  +  A S+ LP    Q
Sbjct: 129 FPPGPRQGMPSFRQVLEGVHHAWSSRRQEVADVAVKITRDLAE-RALGAGSDGLPTGETQ 187

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
               L   QL++  DS  G F  + KFP  + ++ +L H  +   TG        ++M  
Sbjct: 188 AQALL---QLTRDVDSTSGWFKGSTKFPPSMVVEFLLRHHAR---TGSVA----AREMAE 237

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSV---DERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
                MA+  ++D VGGGFHRY +    +   VPHFEKMLYD   L  VY   +  T   
Sbjct: 238 GLCGAMARSSLYDQVGGGFHRYVLLAHADGPLVPHFEKMLYDNALLCRVYAHLWRATGSE 297

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 459
               +  +  D++ R++    G   SA DADS +  G+ +  EGA+YVWT +++ ++LGE
Sbjct: 298 PARRVALETADFMVRELRTNEGGFASALDADSDDGTGSGKHVEGAYYVWTPEQLTEVLGE 357

Query: 460 HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL---N 516
                   Y+                        + E       AS L +P ++ +    
Sbjct: 358 EDAALAVRYF-----------------------GVTEEGTFEEGASVLQLPQQEGVFDAE 394

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
            +   R +L   RS+RP P  DDKV+ +WNGL +++ A                      
Sbjct: 395 RIESVRERLLAARSRRPAPGRDDKVVAAWNGLAVAALAETGAYF---------------- 438

Query: 577 DRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKA-PGFLDDYAFLISGLLDLYEF 635
           DR + ++ A +AA  + R   DE+  RL  + R+G + A  G L+DYA +  G L L   
Sbjct: 439 DRPDLVDAAITAADLLVRLHLDERA-RLTRTSRDGQAGANAGVLEDYADVAEGFLALASV 497

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVI 695
                WL +A  L +     F+D   G  ++T  +   ++ R ++  D A PSG S +  
Sbjct: 498 TGEGVWLEFAGFLLDHVLARFVDEGSGALYDTASDAEKLIRRPQDPTDNATPSGWSAAAG 557

Query: 696 NLVRLASIVAGSKSDYYRQNAEHSLAVFET 725
               L    A + S+ +R+ AE +L V + 
Sbjct: 558 A---LLGYAAQTGSEPHRRAAERALGVVKA 584


>gi|409401428|ref|ZP_11251213.1| thymidylate kinase [Acidocella sp. MX-AZ02]
 gi|409129779|gb|EKM99602.1| thymidylate kinase [Acidocella sp. MX-AZ02]
          Length = 654

 Score =  355 bits (912), Expect = 4e-95,   Method: Compositional matrix adjust.
 Identities = 216/629 (34%), Positives = 318/629 (50%), Gaps = 67/629 (10%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRL    SPYLLQH  NPV W  WGE AFAEA+ R+VP+ LSIGY+ CHWCHVM  ESF
Sbjct: 2   TNRLQDASSPYLLQHKDNPVHWQQWGEAAFAEAKARNVPVLLSIGYAACHWCHVMAHESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           E+E +A LLN+ FV+IKVDREERPD+D+ YM  + A+   GGWPL++ L+P+  P  GGT
Sbjct: 62  ENEQIAGLLNERFVAIKVDREERPDIDQTYMAALHAMGEQGGWPLTMVLTPEGAPFWGGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFPP  ++GRP F  +L  +  AW  +++ +A+S       L+E     A++ K  D   
Sbjct: 122 YFPPTPRHGRPSFPQVLVALSQAWANEQEQIARSAGAIRRALAE-----AAATKPGDAPG 176

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
              L    E   +  D   GG   APKFP  + +   L+   +L D       + G++ V
Sbjct: 177 PELLHAVQEAFLRGMDWELGGLAGAPKFPN-IPVFRFLW---QLGD-------ARGREAV 225

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
              L+ M++GGI+DH+GGG+ RY+ D+ W VPHFEKMLYD   +  +   A +   +  Y
Sbjct: 226 HLLLERMSQGGIYDHLGGGYARYATDDAWLVPHFEKMLYDNALILELLAYAQADKPNPLY 285

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHA 461
           +   R+ + +L RDM   G    ++EDADS   EG    +EG FYV+T  E+E  LG+ A
Sbjct: 286 AARARETVGWLTRDMAAEGA-FAASEDADS---EG----EEGKFYVFTRAEIEAALGDDA 337

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
             F+  Y +   GN            ++G+ +L           +  +P       L  C
Sbjct: 338 RFFETAYPMPAAGN------------WEGRIIL-----------ERRLPFNGDETRLAAC 374

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           R KL  +R  R RP  DDK++  WN L IS+  +A  + +                   +
Sbjct: 375 RAKLKALRDTRIRPGRDDKILADWNALAISALVKAGIVFQEPG----------------W 418

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
           + + E   + + + +  E+  R+ H+ R+G   A G L+D A +I   + LY+    + +
Sbjct: 419 IALGERIFTTLIQAM-GEEDGRIAHAMRDGKISAAGLLEDQAAMIRAGIALYQATDKSAY 477

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           LV +  +    +  F D EG  Y +          R +   DG  PSG  +        A
Sbjct: 478 LVLSETILAATEARFGDGEGAFYISADDAQDVYAPRGRSIQDGPTPSGTGMMAQA---YA 534

Query: 702 SIVAGSKSDYYRQNAEHSLAVFETRLKDM 730
           S+   +  D YR   +  L  +  R + +
Sbjct: 535 SLFHLTGKDEYRAKTQAVLRAYGGRARAL 563


>gi|297559081|ref|YP_003678055.1| hypothetical protein Ndas_0098 [Nocardiopsis dassonvillei subsp.
           dassonvillei DSM 43111]
 gi|296843529|gb|ADH65549.1| protein of unknown function DUF255 [Nocardiopsis dassonvillei
           subsp. dassonvillei DSM 43111]
          Length = 677

 Score =  355 bits (911), Expect = 6e-95,   Method: Compositional matrix adjust.
 Identities = 224/599 (37%), Positives = 308/599 (51%), Gaps = 69/599 (11%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           +NRL+   SPYLLQHA NPV+W+ WGEEA AEAR+RDVP+ +S+GY+ CHWCHVM  ESF
Sbjct: 2   SNRLSDATSPYLLQHADNPVEWWPWGEEALAEARRRDVPLLVSVGYAACHWCHVMAHESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           EDE  A L+N  FV++KVDREERPDVD VYM   QA+ G GGWP++VF +PD  P   GT
Sbjct: 62  EDEATAALMNSLFVNVKVDREERPDVDAVYMEATQAMTGQGGWPMTVFATPDGAPFYCGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP      R  F+ +LR V DAW  +R  L   GA  +E LS   + +A+     D L 
Sbjct: 122 YFP------REHFQRLLRGVADAWRDQRTELVGQGARVVEALSGPRTLAAAPPPSADRL- 174

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
                L    L + YDS  GGFG+APKFP  + +  +    ++      + E++    M 
Sbjct: 175 ----DLAVRALVRDYDSAHGGFGTAPKFPPSMLLSFLTAQDERTRPLQSADESTPAWLMA 230

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYL---------DA 392
             T   MA+GG++D +GGGF RYSVD  W VPHFEKMLYD   L   Y            
Sbjct: 231 SGTALAMAQGGMYDQLGGGFARYSVDREWTVPHFEKMLYDNALLLRAYARMGRRPSGPGV 290

Query: 393 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 452
                      +  +  D++ RD+  P G   SA DADS   EG    +EG +YVWT  +
Sbjct: 291 SDAATHALLRRVAGETADWMLRDLRTPEGGFASALDADS---EG----EEGTYYVWTPAQ 343

Query: 453 VEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 512
           + ++LGE    F    +           +++     +G +VL +L    A A +      
Sbjct: 344 LREVLGEEDAAFAAEVF----------GVTEEGTFERGASVL-QLPAPPADAWR------ 386

Query: 513 KYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFP 572
            Y  +    R  L   R++R  P  DDKV+ +WNGL +++ A A  +L            
Sbjct: 387 -YQRV----REALLAARAERVAPARDDKVVAAWNGLAVAALAEAGVLL------------ 429

Query: 573 VVGSDRKEYMEVAESAASFIRR-HLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLD 631
               +R + +E A +AA  + R HL D +  R     R G S   G L+DYA +  GLL 
Sbjct: 430 ----ERPDLVEAARAAADLLLRVHLRDGRLVRTSRDGRAGTSA--GVLEDYADVAEGLLV 483

Query: 632 LYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGN 690
           L+      ++   A  L +T  E F D  GG +++T  +   +  R ++  D   PSG 
Sbjct: 484 LHGVTGEARYAHEAGRLLDTVLERFGDGSGG-FYDTADDAERLFNRPQDPTDNVTPSGR 541


>gi|433772248|ref|YP_007302715.1| thioredoxin domain protein [Mesorhizobium australicum WSM2073]
 gi|433664263|gb|AGB43339.1| thioredoxin domain protein [Mesorhizobium australicum WSM2073]
          Length = 675

 Score =  355 bits (911), Expect = 6e-95,   Method: Compositional matrix adjust.
 Identities = 222/604 (36%), Positives = 309/604 (51%), Gaps = 58/604 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N LA E SPYL QH+ NPV W  W   +  EA+  D PI LS+GY+ CHWCHVM  ESFE
Sbjct: 10  NLLADEASPYLQQHSGNPVHWRGWSPASLEEAKALDRPILLSVGYAACHWCHVMAHESFE 69

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           ++ VA ++N  FV+IKVDREERPD+D++YM  + ++   GGWPL++FL+PD KP  GGTY
Sbjct: 70  NDDVAAVMNRLFVNIKVDREERPDIDQIYMAALSSMGEQGGWPLTMFLTPDGKPFWGGTY 129

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP E +YGRPGF  ++  V  AW +KR  L QS       +   LSA+ S   L  ++  
Sbjct: 130 FPREPRYGRPGFIQVMEAVDKAWREKRTSLHQSADGLTSHVEARLSATHSKALLDRDM-- 187

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L   A ++S   D   GG   APKFP    +Q +           + G A+  +  VL
Sbjct: 188 --LSDLAGRVSGMIDRDRGGLAGAPKFPNAPFMQTLWLSWL------RDGNAAH-RDDVL 238

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            +L+ M  GGI+DH+GGG  RYS D  W VPHFEKMLYD  QL      A + T +  + 
Sbjct: 239 VSLEHMLSGGIYDHIGGGLSRYSTDAEWLVPHFEKMLYDNAQLIRFCNWALAATGNDLFR 298

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
               D + +L R+M   GG   ++ DADS         +EG FY W+  E+E +LG+ + 
Sbjct: 299 VRIEDTVGWLLREMRVEGGAFAASLDADS-------DGEEGLFYTWSRGEIESVLGDDST 351

Query: 463 LFKEHYYL-KPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
           LF +++ L  P G             ++GK VL +    + S    G+   + L  L   
Sbjct: 352 LFFKYFSLSSPPG-------------WEGKPVLHQ----TLSQQAFGVADRERLVPL--- 391

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           + +L  VR +R RP LD K +  WNGL+I++ A A + L                 R ++
Sbjct: 392 KTRLLTVREQRVRPGLDAKTLTDWNGLMIAALAEAGRSLA----------------RPDW 435

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
           +E A  A + I +   D    RL HS        P    DYA + +  + L+E      +
Sbjct: 436 IEAAAKAFAHIGKAGRD---GRLPHSMLGVRKLFPALSSDYAAMTNAAISLFEATEDWSY 492

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           +  A +     D    D EG GY+ T  +   V +R++ D D A PS  S  +   VRLA
Sbjct: 493 VEQASQFLGQLDHWHADVEGTGYYLTASDSTDVPIRIRGDVDEAIPSATSQIIEAQVRLA 552

Query: 702 SIVA 705
           SI  
Sbjct: 553 SITG 556


>gi|302497930|ref|XP_003010964.1| hypothetical protein ARB_02862 [Arthroderma benhamiae CBS 112371]
 gi|291174510|gb|EFE30324.1| hypothetical protein ARB_02862 [Arthroderma benhamiae CBS 112371]
          Length = 714

 Score =  355 bits (910), Expect = 6e-95,   Method: Compositional matrix adjust.
 Identities = 222/614 (36%), Positives = 332/614 (54%), Gaps = 60/614 (9%)

Query: 156 MEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK 215
           ME ESF    VA +LN  F+ IK+DREERPD+D VYM YVQA  G GGWPL+VFL+PDL+
Sbjct: 1   MEKESFMSAEVAAILNKSFIPIKLDREERPDIDDVYMNYVQATTGSGGWPLNVFLTPDLE 60

Query: 216 PLMGGTYFPPEDKYGRP--------GFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL 267
           P+ GGTY+P  +    P        GF  +L K++D W+ ++    +S      QL E  
Sbjct: 61  PVFGGTYWPGPNATPLPKLGGEEPVGFIDVLEKLRDVWNTQQLRCRESAKEITRQLREFA 120

Query: 268 S-----ASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS 322
                 +  + ++  ++L  + L       +  YD+  GGF  +PKFP PV +  +L  S
Sbjct: 121 EEGTHLSQVNKSEQEEDLEVDLLEEAFTHFAARYDATNGGFSGSPKFPTPVNLSFLLRLS 180

Query: 323 K---KLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKML 379
           +   ++ D     E  +  +M + T+  +A+GGI D +G GF RYSV   W +PHFEKML
Sbjct: 181 RYPEEVMDIVGREECVKATEMAVNTMIKVARGGIRDQIGYGFSRYSVTPDWSLPHFEKML 240

Query: 380 YDQGQLANVYLDAFSLTKDVFYSYICRDILDYLRRD-MIGPGGEIFSAEDADSAETEGAT 438
           YDQ QL +V++D F  + +        D++ Y+    ++ P G  +S+EDADS  +   T
Sbjct: 241 YDQAQLLDVFIDGFEASHEPELLGAIYDLVTYITSTPILSPMGCFYSSEDADSQPSPEDT 300

Query: 439 RKKEGAFYVWTSKEVEDILGEH-AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIEL 497
            K+EGA+YVWT KE++ ILG+  A +   H+ + P GN  ++R++DPH+EF  +NVL   
Sbjct: 301 EKREGAYYVWTLKELKQILGQRDADVCARHWGVLPDGN--VARVNDPHDEFMNRNVLRIA 358

Query: 498 NDSSASASKLGMPLEKYLNILGECRRKLFDVR-SKRPRPHLDDKVIVSWNGLVISSFARA 556
              +  A + G+  E+ + IL   R KL + R +KR RP LDDK+IV+WNGLVI + A+ 
Sbjct: 359 TTPTQVAKEFGLNEEETIRILKTSRVKLREYRETKRVRPELDDKIIVAWNGLVIGALAKC 418

Query: 557 SKILKS-EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFR-NGPSK 614
           + +L+  +AE +           K   ++A +A  FI+ +L+D ++ +L   +R +    
Sbjct: 419 AILLEDIDAEKS-----------KHCRQMASNAVKFIKENLFDAESGQLWRIYRADSRGD 467

Query: 615 APGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQ--------------NTQ--DELFLD 658
            PGF DDYA+LISGLL LYE       L +A +LQ              N +  ++ F+ 
Sbjct: 468 TPGFADDYAYLISGLLQLYEATFDDAHLQFADKLQLCGKGKGVWLTARLNAEYLNKYFIS 527

Query: 659 REGG------GYFNTTGE----DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSK 708
                     G++ T  E     P  L R+K   D A PS N V   NL+RL+S++    
Sbjct: 528 VSASDSSICTGFYMTPSEAVTDTPGALFRLKTGTDSATPSTNGVIAQNLLRLSSLLEDES 587

Query: 709 SDYYRQNAEHSLAV 722
                +   H+ AV
Sbjct: 588 YKLKARQTCHAFAV 601


>gi|423129587|ref|ZP_17117262.1| hypothetical protein HMPREF9714_00662 [Myroides odoratimimus CCUG
           12901]
 gi|371648637|gb|EHO14125.1| hypothetical protein HMPREF9714_00662 [Myroides odoratimimus CCUG
           12901]
          Length = 706

 Score =  354 bits (909), Expect = 9e-95,   Method: Compositional matrix adjust.
 Identities = 216/625 (34%), Positives = 323/625 (51%), Gaps = 48/625 (7%)

Query: 83  VVAMAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIF 142
           +V +   T        N   N L  E SPYLLQHA+NP+ W AW +E    A + D  + 
Sbjct: 21  IVKIHLTTFVKQQQYHNLIMNLLHLESSPYLLQHANNPIYWKAWNKETLTLAEQEDKLLI 80

Query: 143 LSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGG 202
           +SIGYSTCHWCHVME ESFE++ VA L+N  F+SIKVDREE P +D  YM  +Q +   G
Sbjct: 81  ISIGYSTCHWCHVMEKESFENQEVADLMNQHFISIKVDREELPHLDNFYMKAIQIMTKQG 140

Query: 203 GWPLSVFLSPDLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQ 262
           GWPL+V   PD +P+ GGTYF       R  +   L ++   + +KRD +     FA  Q
Sbjct: 141 GWPLNVVCLPDGRPIWGGTYFK------RQNWIDSLSQLHHLYKEKRDTVLD---FAT-Q 190

Query: 263 LSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHS 322
           L E +S  + +    +E   N   L  E   KS+D  +GG+  APKF  P     +LY  
Sbjct: 191 LQEGISILSQAPIAQEESRFNT-DLVLENWKKSFDWEYGGYTRAPKFMMPTN---LLYLQ 246

Query: 323 KKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ 382
           KK    G      +  + +  TL  MA GG+ D V GGF RYSVD +WH+PHFEKMLYD 
Sbjct: 247 KK----GVLHRDQQLLEYIDLTLTRMAWGGLFDTVEGGFSRYSVDHKWHIPHFEKMLYDN 302

Query: 383 GQLANVYLDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKE 442
            QL +VY D +  T +  Y  +    ++++  +     G  +SA DADS ++    + +E
Sbjct: 303 AQLLSVYADGYKRTHNKLYKEVIDKTINFITNNWANGEGGYYSALDADSLDSHN--QLEE 360

Query: 443 GAFYVWTSKEVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA 502
           GAFY+WT +E+++++ +   LF   + +   G+ +       +N++    VLI+  +   
Sbjct: 361 GAFYIWTIEELKELVQQDFPLFSTVFNINSFGHWE-------NNQY----VLIQTRELID 409

Query: 503 SASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKS 562
            A++  +PLE   N   +    L   R+ RP+P LDDK + SWN + I+    A    ++
Sbjct: 410 IANENNIPLEDLENKKKQWETALRQYRANRPKPRLDDKTLTSWNAMYITGLLDAYTATQN 469

Query: 563 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 622
            A                Y+E A++   FI  +L+ E+   L+ ++++G +K   FLDDY
Sbjct: 470 TA----------------YLEQAKALHLFIHNNLWCEERGLLR-TYKDGNAKIEAFLDDY 512

Query: 623 AFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDH 682
           AF I GL+ L+E     +++  A  L +   + FLD E   ++ +       +    E  
Sbjct: 513 AFYIQGLIYLFEHTEEQQYITEAKNLMDYSLDHFLDHESKFFYFSKHNQEDTITPAIETE 572

Query: 683 DGAEPSGNSVSVINLVRLASIVAGS 707
           D   PS N++  INL +L  +   S
Sbjct: 573 DNVIPSSNAIMAINLYKLGLLYENS 597


>gi|296131254|ref|YP_003638504.1| hypothetical protein Cfla_3431 [Cellulomonas flavigena DSM 20109]
 gi|296023069|gb|ADG76305.1| protein of unknown function DUF255 [Cellulomonas flavigena DSM
           20109]
          Length = 682

 Score =  354 bits (909), Expect = 9e-95,   Method: Compositional matrix adjust.
 Identities = 222/589 (37%), Positives = 306/589 (51%), Gaps = 59/589 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLAA  SPYLLQHA NPVDW+ WG++AFAEAR+RDVP+ +S+GY+ CHWCHVM  ESFE
Sbjct: 3   NRLAASTSPYLLQHADNPVDWWEWGDDAFAEARRRDVPLLISVGYAACHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D   A  +N+ FV +KVDREERPDVD VYM   QA+ G GGWP++V  +PD +P   GTY
Sbjct: 63  DPATAAFMNEHFVCVKVDREERPDVDAVYMAATQAMTGSGGWPMTVVATPDGRPFFCGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKR-DMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           FPP      P F  +L  V  AW  +R ++L+ + A A    +        S    D + 
Sbjct: 123 FPPRRVQQVPSFPEVLAAVAAAWTGRRAEVLSSADAIADALAARPGPTDGPSGD--DRVD 180

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
           +  +      LS S+DSR GGFG APKFP  + ++ +L H  +  D    G       M 
Sbjct: 181 ERVVARALGALSASFDSRDGGFGGAPKFPPSMVLEWLLRHHARTGDADALG-------MA 233

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             TL  MA+GG++D + GG+ RYSVD  W VPHFEKMLYD   L  V+L A+ +T D   
Sbjct: 234 RRTLDAMARGGVYDQLAGGYARYSVDATWTVPHFEKMLYDNALLLRVHLHAWRMTGDALD 293

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHA 461
             +  +  D+L  D+    G   SA DADS   EG    +EGAFY WT  ++ ++LG+  
Sbjct: 294 RRVVEETADWLLTDLRTAEGGFASALDADS---EG----REGAFYAWTPAQLREVLGDDD 346

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
             +  H      G  D          F+    +++L +  A  +       +Y ++    
Sbjct: 347 GAWAAHVL----GVTDA-------GTFEHGASVLQLREDPADVA-------RYADV---- 384

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           R +L   R +RPRP  DDKV+ +WNGL I++ A A  +                 DR ++
Sbjct: 385 RARLRAAREQRPRPARDDKVVSAWNGLAIAALAEAGAL----------------LDRPDW 428

Query: 582 MEVAESAASF---IRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLISGLLDLYEFGS 637
           ++ A + A     +      +   RL  + R+G   +APG L+DYA +  G L L     
Sbjct: 429 LDAARACARLLADLHTRPGPDGGDRLVRTSRDGVAGRAPGVLEDYADVAEGYLALAAVTG 488

Query: 638 GTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 686
              W  WA  L  T    F D +GG Y     E  +VL  ++   D A+
Sbjct: 489 EHVWTTWARRLLATVLAHFGDGDGGLYDTADDETDAVLGALRRPQDVAD 537


>gi|172036954|ref|YP_001803455.1| putative six-hairpin glycosidase familly protein [Cyanothece sp.
           ATCC 51142]
 gi|354554754|ref|ZP_08974058.1| putative six-hairpin glycosidase familly protein [Cyanothece sp.
           ATCC 51472]
 gi|171698408|gb|ACB51389.1| putative six-hairpin glycosidase familly protein [Cyanothece sp.
           ATCC 51142]
 gi|353553563|gb|EHC22955.1| putative six-hairpin glycosidase familly protein [Cyanothece sp.
           ATCC 51472]
          Length = 686

 Score =  354 bits (908), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 242/662 (36%), Positives = 337/662 (50%), Gaps = 68/662 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   S YL +HA NP+DW+ W EEA   A+  + PIFLSIGYS+CHWC VME E+F 
Sbjct: 3   NRLANTQSLYLRKHAENPIDWWYWCEEALEIAKNENKPIFLSIGYSSCHWCTVMEGEAFC 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGGT 221
           D  +A  LND F+ IKVDREERPD+D +YM+ +Q +   GGWPL++FL+P DL P  GGT
Sbjct: 63  DLAIATYLNDNFLPIKVDREERPDLDSIYMSSLQMMGIQGGWPLNIFLTPGDLVPFYGGT 122

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP E +YGRPGF  +L+ ++  +D +++ L     F  +++   L  SA        LP
Sbjct: 123 YFPVEPRYGRPGFLQVLQSIRRFYDVEKEKL---NGFK-QEIVNTLQQSAI-------LP 171

Query: 282 QNALRLCAEQL-SKSYDSRFGGFG-SAPKFPRPVEIQMMLYHSKKLEDTG-KSGEASEGQ 338
           +  + +   QL  +  D        +A  F RP    M+ Y +  L+ T    GE  E  
Sbjct: 172 KTDINVNNAQLIYRGVDVNTKIIQVTAEDFGRPC-FPMIPYSNLALQGTRFLFGEPEERH 230

Query: 339 KMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKD 398
            +V+   Q +A GGI D VGGGFHRY+VD  W VPHFEKMLYD GQ+     + +S  + 
Sbjct: 231 ILVIQRGQDLALGGIFDQVGGGFHRYTVDSTWTVPHFEKMLYDNGQIVEYLANLWSSGQQ 290

Query: 399 --VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
              F   I   +  +L+R+M  P G  ++A+DADS  T+     +EGAFYVW  +++E +
Sbjct: 291 EPAFERAIALTV-QWLQREMTAPDGYFYAAQDADSFATKEDKEPEEGAFYVWEYEQLEQL 349

Query: 457 LGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYL 515
           L    +    + + + P GN            F+GKNVL   N    S S   +  + + 
Sbjct: 350 LTSTELEALTDVFTITPEGN------------FEGKNVLQRRNKEKLSDSIETILDKLFK 397

Query: 516 NILGECRRKLFDVRSK-------------RPRPHLDDKVIVSWNGLVISSFARASKILKS 562
              G  R  L   ++              R  P  D K+IV+WNGL+IS  ARA  + K 
Sbjct: 398 ERYGTSRNNLDTFQAAKNNQDAKTIHWPGRIPPVTDTKMIVAWNGLMISGLARAYAVFKQ 457

Query: 563 EAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 622
                    P+       Y ++A +A  FI    +     R Q     G        +DY
Sbjct: 458 ---------PL-------YWQLACNATQFILEKQW--VNGRFQRINYQGNPSILAQSEDY 499

Query: 623 AFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL-RVKE 680
           AF I  LLDL       T+WL  A+E+Q   DE F   + GGY+N   ++ + LL R + 
Sbjct: 500 AFFIKALLDLQAANPQDTQWLDKAMEIQQEFDEYFWSVDTGGYYNNADDNNNDLLVRERS 559

Query: 681 DHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCA 740
             D A PS N +++ NLVRLA +        Y   AE +L  F   L++   A P +  A
Sbjct: 560 YIDNATPSANGIAISNLVRLARLTDNLD---YLDKAEQALQAFSYVLRESPRACPSLLTA 616

Query: 741 AD 742
            D
Sbjct: 617 LD 618


>gi|423133250|ref|ZP_17120897.1| hypothetical protein HMPREF9715_00672 [Myroides odoratimimus CIP
           101113]
 gi|371649306|gb|EHO14787.1| hypothetical protein HMPREF9715_00672 [Myroides odoratimimus CIP
           101113]
          Length = 667

 Score =  354 bits (908), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 216/606 (35%), Positives = 319/606 (52%), Gaps = 50/606 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L  E SPYLLQHA+NP+ W AW +E    A + D  + +SIGYSTCHWCHVME ESFE
Sbjct: 2   NLLHLESSPYLLQHANNPIYWKAWNKETLTLAEQEDKLLIISIGYSTCHWCHVMEKESFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           ++ VA L+N+ F+SIKVDREE P +D  YM  +Q +   GGWPL+V   PD +P+ GGTY
Sbjct: 62  NQEVADLMNEHFISIKVDREELPHLDNFYMKAIQIMTKQGGWPLNVVCLPDGRPIWGGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           F       R  +   L ++   + +KRD +     FA  QL E +S   S   +  E  +
Sbjct: 122 FK------RQNWIDSLSQLHHLYKEKRDTVLD---FAT-QLQEGISI-LSQAPIAQEDSR 170

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
               L  E   KS+D  +GG+   PKF  P     +LY  KK    G      +  + + 
Sbjct: 171 FNTELVLENWKKSFDWEYGGYTRTPKFMMPTN---LLYLQKK----GVLHRDQQLLEYID 223

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  MA GG+ D V GGF RYSVD +WH+PHFEKMLYD  QL +VY D +  T +  Y 
Sbjct: 224 LTLTRMAWGGLFDTVEGGFSRYSVDHKWHIPHFEKMLYDNAQLLSVYADGYKRTHNKLYK 283

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            +    +D++  +     G  +SA DADS ++    + +EGAFYVWT +E+++++ +   
Sbjct: 284 EVIDKTIDFITNNWANGEGGYYSALDADSLDSHN--QLEEGAFYVWTIEELKELVQQDFP 341

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
           LF   + +   G+ + S+            VLI+  +    A++  +PLE   N   +  
Sbjct: 342 LFSTVFNINSFGHWENSQY-----------VLIQTRELIDIANENNIPLEDLENKKKQWE 390

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYM 582
             L   R+ RP+P LDDK + SWN + I+    A    ++ A                Y+
Sbjct: 391 TALRQYRANRPKPRLDDKTLTSWNAMYITGLLDAYTATQNTA----------------YL 434

Query: 583 EVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWL 642
           E A++   FI  +L+ E+   L+ ++++G +K   FLDDYAF I GL+ L+E     +++
Sbjct: 435 EQAKALHLFIHNNLWCEERGLLR-TYKDGNAKIEAFLDDYAFYIQGLIYLFEHTEEQQYI 493

Query: 643 VWAIELQNTQDELFLDREGG-GYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
             A  L +   + FLD E    YFN   ++ ++   + E  D   PS N++  +NL +L 
Sbjct: 494 TEAKNLMDYSLDHFLDHESKFFYFNKHNQEDTITPAI-ETEDNVIPSSNAIMAMNLYKLG 552

Query: 702 SIVAGS 707
            +   S
Sbjct: 553 LLYENS 558


>gi|423328847|ref|ZP_17306654.1| hypothetical protein HMPREF9711_02228 [Myroides odoratimimus CCUG
           3837]
 gi|404604409|gb|EKB04043.1| hypothetical protein HMPREF9711_02228 [Myroides odoratimimus CCUG
           3837]
          Length = 667

 Score =  354 bits (908), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 213/605 (35%), Positives = 316/605 (52%), Gaps = 48/605 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L  E SPYLLQHA+NP+ W AW +E    A + D  I +SIGYSTCHWCHVME ESFE
Sbjct: 2   NLLHLESSPYLLQHANNPIYWKAWNKETLTRAEQEDKLIIISIGYSTCHWCHVMEKESFE 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           ++ VA ++N  F+SIKVDREE P +D  YM  +Q +   GGWPL+V   PD +P+ GGTY
Sbjct: 62  NQEVADIMNQHFISIKVDREELPHLDNFYMKAIQIMTKQGGWPLNVVCLPDGRPIWGGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           F  E       +   L ++   + +KRD +     FA  QL E +S   S   +  E  +
Sbjct: 122 FKKE------AWIDSLSQLHHLYKEKRDTVLD---FAT-QLQEGISI-LSQAPIAQEDSR 170

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
               L  E   KS+D  +GG+   PKF  P     +LY  KK    G      +  + + 
Sbjct: 171 FNTELVLENWKKSFDWEYGGYTRTPKFMMPTN---LLYLQKK----GVLHRDQQLLEYID 223

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  MA GG+ D V GGF RYSVD +WH+PHFEKMLYD  QL +VY D +  T +  Y 
Sbjct: 224 LTLTRMAWGGLFDTVEGGFSRYSVDHKWHIPHFEKMLYDNAQLLSVYADGYKRTHNKLYK 283

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            +    +D++  +     G  +SA DADS ++    + +EGAFY+WT +E+++++ +   
Sbjct: 284 EVIDKTIDFITNNWANGEGGYYSALDADSLDSHN--QLEEGAFYIWTIEELKELVQQDFP 341

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
           LF   + +   G+ +       +N++    VLI+  +    A++  +PLE   N   +  
Sbjct: 342 LFSTVFNINSFGHWE-------NNQY----VLIQTRELIDIANENNIPLEDLENKKKQWE 390

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYM 582
             L   R+ RP+P LDDK + SWN + I+    A    ++ A                Y+
Sbjct: 391 TALRQYRANRPKPRLDDKTLTSWNAMYITGLLDAYTATQNTA----------------YL 434

Query: 583 EVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWL 642
           E A++   FI  +L+ E+   L+ ++++G +K   FLDDYAF I GL+ L+E     +++
Sbjct: 435 EQAKALHLFIHNNLWCEERGLLR-TYKDGNAKIEAFLDDYAFYIQGLIYLFEHTEEQQYI 493

Query: 643 VWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLAS 702
             A  L +   + FLD E   ++ +       +    E  D   PS N++  INL +L  
Sbjct: 494 TEAKNLMDYSLDHFLDHESKFFYFSKHNQEDTITPAIETEDNVIPSSNAIMAINLYKLGL 553

Query: 703 IVAGS 707
           +   S
Sbjct: 554 LYENS 558


>gi|54026795|ref|YP_121037.1| hypothetical protein nfa48210 [Nocardia farcinica IFM 10152]
 gi|54018303|dbj|BAD59673.1| hypothetical protein [Nocardia farcinica IFM 10152]
          Length = 687

 Score =  353 bits (907), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 231/637 (36%), Positives = 322/637 (50%), Gaps = 84/637 (13%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLAA  SPYL QHA NPV W+ W   A A A++RDVPI LSIGY++CHWCHVM  ESF 
Sbjct: 8   NRLAAATSPYLRQHADNPVHWWEWEPAALAAAKERDVPILLSIGYASCHWCHVMAHESFA 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D   A L+N+ FV +KVDREERPD+D VYM    A+ G GGWP++ FL+PD +P   GTY
Sbjct: 68  DPATAALMNENFVCVKVDREERPDLDAVYMNATVAMTGQGGWPMTCFLTPDGEPFYCGTY 127

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           +P   + G P F  +L  V D W  +RD + ++ A    Q++EAL A +S       LP+
Sbjct: 128 YPKTPRGGMPSFTQLLTAVTDTWRNRRDEVDRASA----QVAEALRAQSSG------LPE 177

Query: 283 NALRLCAEQLS-------KSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS 335
             LR+  E L        +  D  +GGFG APKFP    ++ +L   ++  D    G   
Sbjct: 178 GELRIAPELLDHAVAAVVREEDRAYGGFGGAPKFPPSALLEGLLRSWERTRDPAVYG--- 234

Query: 336 EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL 395
               +V  T + MA+GGI+D + GGF RYSVDERW VPHFEKMLYD  QL   Y      
Sbjct: 235 ----VVSRTAEAMARGGIYDQLRGGFARYSVDERWLVPHFEKMLYDNAQLLRAYAHLARR 290

Query: 396 T---KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADS-AETEGATRKKEGAFYVWTSK 451
           T   +    + + R+   +L  D+    G   SA DAD+  E +G     EGA YVWT  
Sbjct: 291 TVPDRSDLAARVARETAGFLLDDLGTEHGGFASALDADTHLEPDGP--GVEGATYVWTPA 348

Query: 452 EVEDILGEHAILFKEHYYLKPT------GNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 505
           E+   LG     +    +   T      G   L+R ++P +  + + V            
Sbjct: 349 ELVAELGPQDGAWAAEVFGVTTAGTFEQGTSVLTRRAEPDDPERFERV------------ 396

Query: 506 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 565
                           R  L   R +RP+P  DDKV+ +WNG+ I++ A     L   A 
Sbjct: 397 ----------------RAVLRAARDRRPQPARDDKVVTAWNGMAITALAEGGAALGEPA- 439

Query: 566 SAMFNFPVVGSDRKEYMEVAESAASF-IRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 624
                          ++E A + A F +  H+ D +  R       G S  PG L+DYA+
Sbjct: 440 ---------------WIEAAAACARFLLAEHVRDGRVRRASLGGTAGTS--PGVLEDYAW 482

Query: 625 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREG-GGYFNTTGEDPSVLLRVKEDHD 683
           L++GLL LY+      WL  A  L ++    F D E  G +F+T  +  +++ R ++  D
Sbjct: 483 LVTGLLALYQATGQADWLEPAQVLLDSAIAHFADPEAPGNWFDTADDAETLVARPRDPID 542

Query: 684 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 720
           GA P+G S     L+  A++    ++  YR+ AE +L
Sbjct: 543 GATPAGASALAEALLTAAALADPERAVRYREAAEQTL 579


>gi|425456902|ref|ZP_18836608.1| Six-hairpin glycosidase-like [Microcystis aeruginosa PCC 9807]
 gi|389801878|emb|CCI18996.1| Six-hairpin glycosidase-like [Microcystis aeruginosa PCC 9807]
          Length = 692

 Score =  353 bits (907), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 236/665 (35%), Positives = 341/665 (51%), Gaps = 74/665 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N LA   S YL +HA NP+DW+ W + A   AR+ D PIFLSIGYS+CHWC VME E+F 
Sbjct: 3   NHLAKSESLYLRKHAENPIDWWYWCDSALEIARREDKPIFLSIGYSSCHWCTVMEGEAFS 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPD-LKPLMGGT 221
           D+ +A  LN +F+ IKVDREERPD+D +YM  +Q + G GGWPL+VFL+PD L P  GGT
Sbjct: 63  DQAIADYLNQYFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLNVFLTPDSLIPFYGGT 122

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP + ++ RPGF  +L+ V+  +D++++ L++   F  E L  AL  SA   +    L 
Sbjct: 123 YFPVQPRFNRPGFLQVLQSVRRYYDEEKEKLSK---FTAEMLG-ALRQSAILPRSETNLA 178

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
             +L     + + +           P FP      + L  S+  +D   S + +  Q+  
Sbjct: 179 APSLLTTGIETNTAVIRVNPNNYGRPSFPMIPYSHLALQGSRFGDDFDDSLQQAAYQRG- 237

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL-TKDVF 400
               + +A GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ+     + +S   ++  
Sbjct: 238 ----EDLALGGIYDHVGGGFHRYTVDSTWTVPHFEKMLYDNGQIVEYLANLWSAGDREAA 293

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
           +    +  +++L+R+M  P G  ++A+DADS E       +EGAFYVW+  E+ D L   
Sbjct: 294 FERGIKGTVNWLKREMTAPEGYFYAAQDADSFEKATDGEPEEGAFYVWSDLELRDYLSTE 353

Query: 461 AI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL- 518
            + L + ++ +   GN            F+G+NVL           +LG  +E  L+ L 
Sbjct: 354 ELGLLQANFTVTAEGN------------FEGRNVL-----QRRQGGELGKEIENMLDKLF 396

Query: 519 ----GECRRKLFDVRSKRPRPH-------------LDDKVIVSWNGLVISSFARASKILK 561
               G  + +L      R                  D K+IV+WN L+IS  ARA     
Sbjct: 397 IRRYGSSQAQLALFPPARDNQEAKTVSWPGRIPAVTDTKMIVAWNSLMISGLARA----- 451

Query: 562 SEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSFRNGPSKAPGFLD 620
                A+F  P+       Y ++A  A  FI ++ + D +  RL +    G +      +
Sbjct: 452 ----FAVFGEPL-------YWQMATVATEFILKYQWLDGRFQRLNY---QGQASVLAQSE 497

Query: 621 DYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVK 679
           D+A+ I  LLDL       T WL  AI+LQ   D  F   + GGYFN T  D S+ L V+
Sbjct: 498 DFAYFIKALLDLQTAKPQETGWLEAAIDLQGEFDRWFWAEDEGGYFN-TASDHSLDLIVR 556

Query: 680 ED--HDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 737
           E    D A PS N +++ NL+RL+ +    +   Y   AE +L  F T L+    A P +
Sbjct: 557 ERGYTDNATPSANGIAIANLLRLSRLTENLE---YLDRAEKALQSFSTILEQSPTACPSL 613

Query: 738 CCAAD 742
             A D
Sbjct: 614 FVALD 618


>gi|83313656|ref|YP_423920.1| hypothetical protein amb4557 [Magnetospirillum magneticum AMB-1]
 gi|82948497|dbj|BAE53361.1| Highly conserved protein containing a thioredoxin domain
           [Magnetospirillum magneticum AMB-1]
          Length = 671

 Score =  353 bits (907), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 236/656 (35%), Positives = 337/656 (51%), Gaps = 66/656 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLAAE SPYLLQHAHNPV W+AWG EA AEA+  + PI LS+GYS CHWCHVM  ESFE
Sbjct: 4   NRLAAETSPYLLQHAHNPVHWWAWGPEALAEAKASNKPILLSVGYSACHWCHVMAHESFE 63

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D G+A L+N  FV+IKVDREERPD+D +Y   +  +   GGWPL++FL+PD +P  GGTY
Sbjct: 64  DAGIAGLMNRLFVNIKVDREERPDLDALYQNALGLMGQHGGWPLTMFLTPDAEPFWGGTY 123

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP   +YGR  F  +L  +  ++ +  + ++ +    +E++ E+L   A S   P  L  
Sbjct: 124 FPATTRYGRAAFPDVLEGIAHSFHRDPEKISHN----VERIRESLEKMARSPG-PLALDM 178

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             + L A Q  +  D   GG   APKFP+P   +  L+HS       ++G +S  +  V 
Sbjct: 179 EVVDLGAAQCLRLIDFEDGGTVGAPKFPQPGLFRF-LWHSYL-----RTGNSSL-KDAVT 231

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  + +GGI+DH+GGGF RYS DE W VPHFEKMLYD  QL ++    +  T    Y 
Sbjct: 232 VTLNHICQGGIYDHLGGGFMRYSTDEFWLVPHFEKMLYDNAQLLSLLTKVWKHTGSPLYR 291

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHA 461
               + + +L RDM+  G    +A DADS   EG    +EG FY WTS+E+  ++  + A
Sbjct: 292 TRIFETVGWLLRDMMAEGDAFAAALDADS---EG----EEGLFYTWTSEELSALMDMDTA 344

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
           I F   Y ++  GN            ++G+ +L   N                   L E 
Sbjct: 345 IRFGTLYDVRAHGN------------WEGRTIL-HRNHPRGGGDD---------GDLAEA 382

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           +  L   R KR  P  DDKV+  WN + IS+ A AS                +  DR ++
Sbjct: 383 KAVLLAARDKRIWPGRDDKVLADWNAMAISALAEAS----------------LAFDRPDW 426

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
           +  A  A   I   +      R  HS   G ++    LDDYA+LI   L L+E  +  ++
Sbjct: 427 LTAARKAFEVITTRM-TRPDGRPAHSLCQGRAETAAVLDDYAWLILAALSLHEATAAPEY 485

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           L  A+   +       D   GGYF +  +   V++R K   D A PSGN +    L RL 
Sbjct: 486 LERALVWADQVHAHHWDGAEGGYFLSADDAGDVVIRTKPAFDSAVPSGNGMMAEALARL- 544

Query: 702 SIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRK-HVVLVG 756
            +V G ++  +R+ ++  +  F   + +    +P M    +  ++ +    VV+VG
Sbjct: 545 WLVTGDEA--WRERSQAVIDAFGAAIPEQ---IPHMTSLLEAFAILAEPLQVVIVG 595


>gi|427723011|ref|YP_007070288.1| hypothetical protein Lepto7376_1084 [Leptolyngbya sp. PCC 7376]
 gi|427354731|gb|AFY37454.1| hypothetical protein Lepto7376_1084 [Leptolyngbya sp. PCC 7376]
          Length = 681

 Score =  353 bits (907), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 237/669 (35%), Positives = 336/669 (50%), Gaps = 86/669 (12%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRLA   S YL +HA NP+DW+ W +EA  +A+  + PIFLSIGYS+CHWC VME E+F
Sbjct: 2   TNRLADTKSLYLRKHAENPIDWWYWCDEALEKAKAENKPIFLSIGYSSCHWCTVMEGEAF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGG 220
            D+ +A  LN  F+ IKVDREERPD+D +YM  +Q + G GGWPL++FL+P DL P  GG
Sbjct: 62  SDQAIADYLNANFLPIKVDREERPDIDSIYMQALQLMTGQGGWPLNIFLTPDDLIPFYGG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFP   +Y RPGF  +L  ++  +D + + L +      E++   L  S +       L
Sbjct: 122 TYFPVSPRYNRPGFLDVLSSIRHFYDDEPERLKEIK----EEIFTILDRSVT-------L 170

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGS---APKFPRPVEIQMMLYHSKKLEDTGKSGEASEG 337
           P   L L    L KS ++  G  G     P FP      + L  S+  E+T   G A   
Sbjct: 171 PTTELSLDQTLLEKSIEACTGVVGRVSHGPSFPMIPYAAIALQGSRFTENTKHDGSAITK 230

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQ----LANVYLDAF 393
           ++ +      +A GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ    LAN++ +  
Sbjct: 231 KRGL-----DLALGGIYDHVGGGFHRYTVDPNWTVPHFEKMLYDNGQITEFLANLWANG- 284

Query: 394 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
             T +  +       +++L R+M  P G  ++A+DADS    G    +EG FYVW   E+
Sbjct: 285 --TTEPSFKTALEGTVEWLSREMTAPQGYFYAAQDADSFLDAGHVEPEEGTFYVWDFDEL 342

Query: 454 EDILGEHAIL-FKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLE 512
           +    + A    +E+++++P GN            F+GK VL        +++++   L+
Sbjct: 343 QTQFSDTAFQELQENFFIEPDGN------------FEGKIVL-----KRRASTEIPESLQ 385

Query: 513 KYLNIL-----GECRRKLFDVRSKRPRPH-------------LDDKVIVSWNGLVISSFA 554
             LN L     G  R+ L      R                  D K+IV+WN L+IS  A
Sbjct: 386 ATLNQLFAERYGGDRQSLETFPPARDNAEAKNTDWAGRIPAVTDTKLIVAWNALMISGLA 445

Query: 555 RASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSK 614
           R   +L  E                +  ++A +  +FI    + E  H  + +F   P  
Sbjct: 446 RIYGVLSLE----------------KAWDLAVNCVNFILETQWQE-GHLYRLNFGEEPDG 488

Query: 615 APGFLDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPS 673
                +DYAFLI  LLDL     + T WL  AI LQ+  D  F   E  GYFN T E   
Sbjct: 489 VAQ-SEDYAFLIKALLDLQANNPTETHWLDKAITLQSEFDAKFWSAETKGYFNNT-EAKE 546

Query: 674 VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMA 733
           +L++ +   D A PS N ++V NL+RL  +   ++   Y   AE +L  F   L   +  
Sbjct: 547 LLIKERSYQDNATPSANGIAVTNLIRLFLL---TEDLAYLDKAEQALQTFAVVLDKSSQQ 603

Query: 734 VPLMCCAAD 742
            P +  A D
Sbjct: 604 APSLIAALD 612


>gi|359457589|ref|ZP_09246152.1| hypothetical protein ACCM5_02608 [Acaryochloris sp. CCMEE 5410]
          Length = 695

 Score =  353 bits (907), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 237/678 (34%), Positives = 333/678 (49%), Gaps = 99/678 (14%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   S YL +HA NP+DW+ W EEA   A + + PIFLS+GYS+CHWC VME E+F 
Sbjct: 12  NRLAHSASLYLRKHADNPIDWWPWCEEALERAAQENKPIFLSVGYSSCHWCTVMEGEAFS 71

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGGT 221
           +  +AK +N  ++ IKVDREERPD+D +YM  VQA+ G GGWPL++FLSP DL P  GGT
Sbjct: 72  NSEIAKYMNAQYIPIKVDREERPDIDSIYMQAVQAMTGQGGWPLNMFLSPGDLVPFYGGT 131

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP E KYGRPGF  +L  ++  +D +++ L        E+LS  L +S   N + D  P
Sbjct: 132 YFPEEPKYGRPGFLQVLEAIRSFYDTEKEKLDTQK----EKLSGHLQSSTVLNPIGDLQP 187

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG-KSGEASEGQKM 340
           +   +  A+  +   +   G     P FP      MM Y +  L  +   + E  + Q+ 
Sbjct: 188 ELLSKGIAKNTTVLINKMPG-----PSFP------MMPYATIALHGSRFSTSEQEQAQQA 236

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS--LTKD 398
                  +A GGI+DHV GGFHRY+VD  W VPHFEKMLYD GQ+     + +S  + + 
Sbjct: 237 CRQRGLDLALGGIYDHVAGGFHRYTVDPTWTVPHFEKMLYDNGQIVEYLANLWSTGVEEP 296

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG 458
            F   I   +  +L+R+M    G  ++A+DAD+  T      +EG FY WT  E+  +L 
Sbjct: 297 AFKRAIAVTVA-WLQREMTAEAGYFYAAQDADNFVTTADIEPEEGRFYTWTDSELTHLLT 355

Query: 459 -EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNI 517
            E      E + L   GN +            G  VL        S +            
Sbjct: 356 PEEYAAMAEIFNLSVQGNFE-----------DGLTVLQRQQPGVISET------------ 392

Query: 518 LGECRRKLFDVR-SKRPR------------------------PHLDDKVIVSWNGLVISS 552
           + E  +KLF VR   RP                         P  D K+IV+WN L+IS 
Sbjct: 393 VEEALQKLFQVRYGDRPESLKTFPPATHNQVAKTHPWPGRIPPVTDTKMIVAWNSLMISG 452

Query: 553 FARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE-QTHRLQHSFRNG 611
            ARA+ + +                + +Y+ +A  AASFI    + E + HR+ +   +G
Sbjct: 453 LARAAAVFQ----------------QPDYLALATKAASFILDQQWSEGRLHRVNY---DG 493

Query: 612 PSKAPGFLDDYAFLISGLLDLYE------FGSGTKWLVWAIELQNTQDELFLDREGGGYF 665
                   +DYA LI   LDL++       G  ++WL  A   Q   DE     EGGGYF
Sbjct: 494 EIAVIAQSEDYALLIKAFLDLHQACQSLAVGQASRWLEAAQTTQAEFDEHLWAVEGGGYF 553

Query: 666 NTTGE-DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 724
           NT  E    +L+R +   D A P+ N V++ NL+RL+      +++Y  Q AE +L  F 
Sbjct: 554 NTGSEISEELLIRERSWLDNATPAANGVAIANLIRLSLFC--DRTEYLSQ-AEQALQTFG 610

Query: 725 TRLKDMAMAVPLMCCAAD 742
             +     A P +  A D
Sbjct: 611 QVMDSSTQACPSLFVALD 628


>gi|373108743|ref|ZP_09523024.1| hypothetical protein HMPREF9712_00617 [Myroides odoratimimus CCUG
           10230]
 gi|371645988|gb|EHO11505.1| hypothetical protein HMPREF9712_00617 [Myroides odoratimimus CCUG
           10230]
          Length = 681

 Score =  353 bits (907), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 213/605 (35%), Positives = 318/605 (52%), Gaps = 48/605 (7%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L  E SPYLLQHA+NP+ W AW +E    A + D  + +SIGYSTCHWCHVME ESFE
Sbjct: 16  NLLHLESSPYLLQHANNPIYWKAWNKETLTLAEQEDKLLIISIGYSTCHWCHVMEKESFE 75

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           ++ VA L+N  F+SIKVDREE P +D  YM  +Q +   GGWPL+V   PD +P+ GGTY
Sbjct: 76  NQEVADLMNQHFISIKVDREELPHLDNFYMKAIQIMTKQGGWPLNVVCLPDGRPIWGGTY 135

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           F       R  +   L ++   + +KRD +     FA  QL E +S  + +    +E   
Sbjct: 136 FK------RQNWIDSLSQLHHLYKEKRDTVLD---FAT-QLQEGISILSQAPIAQEESRF 185

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
           N   L  E   KS+D  +GG+  APKF  P     +LY  KK    G      +  + + 
Sbjct: 186 NT-DLVLENWKKSFDWEYGGYTRAPKFMMPTN---LLYLQKK----GVLHRDQQLLEYID 237

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  MA GG+ D V GGF RYSVD +WH+PHFEKMLYD  QL +VY D +  T +  Y 
Sbjct: 238 LTLTRMAWGGLFDTVEGGFSRYSVDHKWHIPHFEKMLYDNAQLLSVYADGYKRTHNKLYK 297

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            +    ++++  +     G  +SA DADS ++    + +EGAFY+WT +E+++++ +   
Sbjct: 298 EVIDKTINFITNNWANGEGGYYSALDADSLDSHN--QLEEGAFYIWTIEELKELVQQDFP 355

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
           LF   + +   G+ +       +N++    VLI+  +    A++  +PLE   N   +  
Sbjct: 356 LFSTVFNINSFGHWE-------NNQY----VLIQTRELIDIANENNIPLEDLENKKKQWE 404

Query: 523 RKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYM 582
             L   R+ RP+P LDDK + SWN + I+    A    ++ A                Y+
Sbjct: 405 TALRQYRANRPKPRLDDKTLTSWNAMYITGLLDAYTATQNTA----------------YL 448

Query: 583 EVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWL 642
           E A++   FI  +L+ E+   L+ ++++G +K   FLDDYAF I GL+ L+E     +++
Sbjct: 449 EQAKALHLFIHNNLWCEERGLLR-TYKDGNAKIEAFLDDYAFYIQGLIYLFEHTEEQQYI 507

Query: 643 VWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLAS 702
             A  L +   + FLD E   ++ +       +    E  D   PS N++  INL +L  
Sbjct: 508 TEAKNLMDYSLDHFLDHESKFFYFSKHNQEDTITPAIETEDNVIPSSNAIMAINLYKLGL 567

Query: 703 IVAGS 707
           +   S
Sbjct: 568 LYENS 572


>gi|453075692|ref|ZP_21978475.1| hypothetical protein G419_10417 [Rhodococcus triatomae BKS 15-14]
 gi|452762572|gb|EME20867.1| hypothetical protein G419_10417 [Rhodococcus triatomae BKS 15-14]
          Length = 671

 Score =  353 bits (907), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 223/623 (35%), Positives = 304/623 (48%), Gaps = 80/623 (12%)

Query: 99  NKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEV 158
            +H N L    SPYL QHA NPV W  WG +A   AR+RDVP+ LSIGY+ CHWCHVM  
Sbjct: 3   TRHRNALGEATSPYLRQHADNPVHWQQWGTDALEWARERDVPVLLSIGYAACHWCHVMAH 62

Query: 159 ESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLM 218
           ESFEDE  A ++N+ FV IKVDREERPD+D +YM    A+ G GGWP++ FL+ D +P  
Sbjct: 63  ESFEDEATAAVMNEHFVCIKVDREERPDLDAIYMNATVAMTGQGGWPMTCFLTADGEPFY 122

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPD 278
            GTYFPP  + G P F  +L  + D W  +RD + Q+ A    +L  A  A  +     D
Sbjct: 123 CGTYFPPSPRGGMPSFTQLLEAIDDTWRTRRDDVLQASASITTELRRAGGALPAGAAPLD 182

Query: 279 ELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQ 338
                 L      +    D   GGFG APKFP    ++ ML   ++            G 
Sbjct: 183 ---GPLLDAAVAAVRADEDVERGGFGGAPKFPPSALLEGMLRSHER-----------TGS 228

Query: 339 KMVL----FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
            MVL     T + MA+GG+ D +GGGF RYSVD  W VPHFEKMLYD  QL  VY     
Sbjct: 229 AMVLDSVTRTAEAMARGGLFDQLGGGFARYSVDADWVVPHFEKMLYDNAQLLRVYAHLAR 288

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
            T       +     +++ RD+    G   SA DAD+   EG T       Y WT +++ 
Sbjct: 289 RTGSDLAFRVTEATAEFMLRDLRTDTGCFASALDADTEGIEGLT-------YAWTPEQLI 341

Query: 455 DILG------EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLG 508
           ++LG         +L          G   L   SDP +  + ++V               
Sbjct: 342 EVLGFEDGVWAAGLLAVSSAGTFEAGTSVLQFPSDPDDWTRWESV--------------- 386

Query: 509 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 568
                        RR LFD RS RP+P  DDKV+ +WNGL I++ A A            
Sbjct: 387 -------------RRSLFDARSNRPQPARDDKVVTAWNGLAITALAEAG----------- 422

Query: 569 FNFPVVGSDRKEYMEVAESAA-SFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLIS 627
                 G  R E++  AE  A S +  HL D +  R   S  +    A   LDD+A L +
Sbjct: 423 -----AGLGRPEWIGAAERCARSLLDEHLVDGRLRR--ASLGSVVGDASAVLDDHAALAT 475

Query: 628 GLLDLYEFGSGTKWLVWAIELQNTQDELFLDR-EGGGYFNTTGEDPSVLLRVKEDHDGAE 686
           GLL L +     +WL  A ++ +   + F D  E G +F+T  +  +++ R ++  DGA 
Sbjct: 476 GLLTLQQVTGDAEWLARAQQILDLALDHFADENEPGSWFDTADDAETLIARPRDPVDGAT 535

Query: 687 PSGNSVSVINLVRLASIVAGSKS 709
           PSG S S+   + LAS+++ + +
Sbjct: 536 PSGTS-SMAEALLLASVLSSADT 557


>gi|86608794|ref|YP_477556.1| hypothetical protein CYB_1320 [Synechococcus sp. JA-2-3B'a(2-13)]
 gi|86557336|gb|ABD02293.1| conserved hypothetical protein [Synechococcus sp. JA-2-3B'a(2-13)]
          Length = 701

 Score =  353 bits (906), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 231/676 (34%), Positives = 337/676 (49%), Gaps = 78/676 (11%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRLA   S YL +HA NPVDW+ W  EA  +AR  D PIFLSIGYS+CHWC VME E+F
Sbjct: 2   ANRLATSSSLYLRKHAENPVDWWPWIPEALEKARAEDRPIFLSIGYSSCHWCTVMEGEAF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGG 220
            +  +A  LN  F+ IKVDREERPD+D +YM  +Q + G GGWPL+VFL+P DL P   G
Sbjct: 62  SNPEIAAFLNAHFLPIKVDREERPDLDSIYMQALQLMSGQGGWPLNVFLTPDDLVPFYAG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFP E ++GRPGF  +L+++   + ++++ + +     +  L+  LS     + +P +L
Sbjct: 122 TYFPVEPRFGRPGFLALLQRILQFYRQEKEKIEEMKGQILTALT-TLSDLVPEDHIPADL 180

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEAS----- 335
            ++ +      LS +        G+  +FP     Q++L  ++     G  G  S     
Sbjct: 181 LRSGIPKIQPLLSNA--------GAVQQFPMMPYAQLVLRSARFDPPEGIPGSMSALERA 232

Query: 336 --EGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
              G  +VL        GGI DHV GGFHRY+VD  W VPHFEKMLYD GQ+     D +
Sbjct: 233 KERGMALVL--------GGIFDHVAGGFHRYTVDPTWTVPHFEKMLYDNGQILEFLSDLW 284

Query: 394 S-LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 452
           +   +D       R  ++++ R+M  P G  ++A+DADS         +EG FYVW  +E
Sbjct: 285 AHGIQDPAIERAVRLTVEWVAREMTAPAGYFYAAQDADSFARAEDREPEEGEFYVWRWQE 344

Query: 453 VEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPL 511
           ++++LGE      ++ + L P GN       D     +   ++++     A   ++   L
Sbjct: 345 LQELLGEETFRALQQAFDLSPGGN-----FPD-----RPGCIVLQRQQGGALPPEVEAAL 394

Query: 512 EKYL--NILGECRRKL-----FDVRSKRPR-------PHLDDKVIVSWNGLVISSFARAS 557
             +L     G   R++      D +S R +       P  D K+IVSWN L+IS  ARA 
Sbjct: 395 TTHLFQARYGSADRRVPFPPAVDAQSARLQSWPGRIPPVTDTKMIVSWNALMISGLARAY 454

Query: 558 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPG 617
           ++  +                 +Y++ A  AA FI       +T  L     +G ++ P 
Sbjct: 455 QVFGN----------------ADYLQFALRAAQFILSQQRHPETGSLLRLNYDGTAQVPA 498

Query: 618 FLDDYAFLISGLLDLYE-----FGSGTK--WLVWAIELQNTQDELFLDREGGGYFNTTGE 670
             +DYA LI  LLDL +      G  T   WL  A++LQ   D    D   GGYF +  +
Sbjct: 499 KSEDYALLIKALLDLQQACLPLVGDPTPQDWLQAALQLQQEMDAQLWDPARGGYFVSDAQ 558

Query: 671 D-PSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKD 729
             P +L+R KE  D A P+ N V++ NLVRLA++        Y + AE +L  F   +  
Sbjct: 559 SAPELLVREKEFQDNATPAANGVAIANLVRLAALTGDLD---YLERAEQALKTFAHIMST 615

Query: 730 MAMAVPLMCCAADMLS 745
                P +    D  S
Sbjct: 616 QPRTCPSLFAGLDWYS 631


>gi|377558272|ref|ZP_09787883.1| hypothetical protein GOOTI_036_00590 [Gordonia otitidis NBRC
           100426]
 gi|377524607|dbj|GAB33048.1| hypothetical protein GOOTI_036_00590 [Gordonia otitidis NBRC
           100426]
          Length = 665

 Score =  353 bits (906), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 220/608 (36%), Positives = 308/608 (50%), Gaps = 69/608 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N+L+   SPYL QHA NPVDW  W + A  EA  RDVPI LS+GY+ CHWCHVM  ESFE
Sbjct: 3   NQLSESSSPYLRQHADNPVDWREWSDAALEEAVHRDVPILLSVGYAACHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +   A  +N  FV IKVDREERPD+D +YM    A+   GGWP++ FL+P   P   GTY
Sbjct: 63  NVDTATQMNRDFVCIKVDREERPDIDAIYMNATVAMTRQGGWPMTCFLTPAGDPFYCGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP   + G P F+ IL  V +AW  +R  +   G+   E LS+A SA  +   + DE   
Sbjct: 123 FPDTPRGGMPSFRQILAAVTEAWTTRRSEIESMGSRVREALSDAASALPNGGVVVDE--- 179

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L           D   GGFG APKFP    ++ +L H ++  D           + V+
Sbjct: 180 RLLDYAVASALGDEDQTAGGFGGAPKFPPSALLEGLLRHYERTSDAAP-------LQSVM 232

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            T   MA+GGI+D +GGGF RY+VD  W VPHFEKMLYD  QL   Y     +  D    
Sbjct: 233 RTADAMARGGIYDQLGGGFARYAVDNDWVVPHFEKMLYDNAQLLRAYGHLARIVDDPLAG 292

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            +  +I+++LRRD+   GG   S+ DAD+A  EG+T       YVWT +++ D+LG+   
Sbjct: 293 RVAEEIVEFLRRDLRVVGG-FASSLDADAAGVEGST-------YVWTPEQLRDVLGDD-- 342

Query: 463 LFKEHYYLKPTGN--CDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
                      GN    L  ++D      G + L    D   SA        +Y +I   
Sbjct: 343 ----------DGNWAAALFGVTDAGTFEHGTSTLQLRQDPDDSA--------RYADI--- 381

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            RR+L D RS RP+P  DDKV+ +WN + +++ A A                   S   +
Sbjct: 382 -RRRLLDARSARPQPARDDKVVTAWNAMAVTALAEAG----------------AASGHPD 424

Query: 581 YMEVA-ESAASFIRRHLYDEQTHRLQHSFRNGPSKAP-GFLDDYAFLISGLLDLYEFGSG 638
           ++E+A E     +  HL D     L+ S   G    P   LDD+A LI+ +L +Y+    
Sbjct: 425 WVELAVEVLTELLDNHLVD---GVLRRSSLGGLVGTPVAALDDHAALITAMLTVYQITGE 481

Query: 639 TKWLVWAIELQNTQDELFLD-REGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINL 697
            +W    + L +T  + F D  E G +F+   +  S++ R ++  DGA P+G S+ +   
Sbjct: 482 QRWCEQGLALLDTTIDTFADPDEQGAWFDAASD--SLIARPRDPADGATPAGASL-IAEA 538

Query: 698 VRLASIVA 705
             +AS +A
Sbjct: 539 ALIASAIA 546


>gi|434405724|ref|YP_007148609.1| thioredoxin domain protein [Cylindrospermum stagnale PCC 7417]
 gi|428259979|gb|AFZ25929.1| thioredoxin domain protein [Cylindrospermum stagnale PCC 7417]
          Length = 688

 Score =  353 bits (906), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 237/665 (35%), Positives = 340/665 (51%), Gaps = 71/665 (10%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRLA   S YL +HA NP+DW+ W +EA A A+  + PIFLSIGYS+CHWC VME E+F
Sbjct: 2   TNRLAETKSLYLRKHAENPIDWWPWCDEALATAKTENKPIFLSIGYSSCHWCTVMEGEAF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGG 220
            D  +A  +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL+ FLSP DL P   G
Sbjct: 62  SDSAIADYMNANFLPIKVDREERPDLDSIYMQALQMMSGQGGWPLNAFLSPDDLVPFYAG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFP E +YGRPGF  +L+ ++  +D +++ L    A  IE L   L+++   +   DEL
Sbjct: 122 TYFPLEPRYGRPGFLQVLQALRRYYDTEKEDLRDRKASIIESL---LTSAVLQDGAADEL 178

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKS-GEASEGQK 339
             N L      L   +++  G     PK P      M+ Y    L  T  +     +G++
Sbjct: 179 QDNQL------LRHGWETTTGII--TPK-PSGNSFPMIPYAELALRGTRFNFASQYDGKQ 229

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL-TKD 398
           +       +A GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ+       +S   K+
Sbjct: 230 VCTQRGLELALGGIYDHVGGGFHRYTVDPTWTVPHFEKMLYDNGQILEYLASLWSAGVKE 289

Query: 399 VFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT-SKEVEDIL 457
             +       + +L+R+M  P G  ++A+DADS     A   +EGAFYVW+ S+  + + 
Sbjct: 290 PAFVRAVAGTVQWLQREMTAPEGYFYAAQDADSFFNSTAVEPEEGAFYVWSYSELEQLLT 349

Query: 458 GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVL-----------IELNDSSASASK 506
            E     ++ + + P GN            F+GKNVL           +E+       ++
Sbjct: 350 LEELTELQQQFTVTPNGN------------FEGKNVLQRRHAGELSQKLEVALGKLFTAR 397

Query: 507 LGMPLEKYLNILGECRRKLFDVRSKRPR--PHL-DDKVIVSWNGLVISSFARASKILKSE 563
            G P +  L      R  L    +  P   P + D K+IV+WN L+IS  ARA+ + +  
Sbjct: 398 YGAPPDS-LATFPPARDNLEAKTTNWPGRIPSVTDTKMIVAWNSLMISGLARAAGVFR-- 454

Query: 564 AESAMFNFPVVGSDRKEYMEVAESAASFI-RRHLYDEQTHRLQHSFRNGPSKAPGFLDDY 622
                         +  Y+E+A  AA+FI      D +  RL +    G +      +DY
Sbjct: 455 --------------QPLYLELAAKAANFILDNQFVDGRFQRLNY---GGEATVLAQSEDY 497

Query: 623 AFLISGLLDLYEF----GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPS-VLLR 677
           AF I  LLDL +        T WL  A+ LQ    E     E GGY+NT+ ++   +++R
Sbjct: 498 AFFIKALLDLSQVSLDSNQRTFWLEKAVTLQEEFAEFLWSVELGGYYNTSSDNSQDLIVR 557

Query: 678 VKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLM 737
            +   D A PS N +++ NLVRLA +   + + +Y   AE  L  F + +     A P +
Sbjct: 558 ERSYVDNATPSANGIAIANLVRLALL---TDNLHYLDLAEQGLKAFRSVMSSAPQACPSL 614

Query: 738 CCAAD 742
             A D
Sbjct: 615 FTALD 619


>gi|254421197|ref|ZP_05034915.1| conserved hypothetical protein [Synechococcus sp. PCC 7335]
 gi|196188686|gb|EDX83650.1| conserved hypothetical protein [Synechococcus sp. PCC 7335]
          Length = 700

 Score =  353 bits (905), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 238/691 (34%), Positives = 345/691 (49%), Gaps = 99/691 (14%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRLA   S YL +HA NPVDW+ W EEA   A++ + PIFLSIGYS+CHWC VME E+F
Sbjct: 2   TNRLANSSSLYLRKHAENPVDWWPWCEEALTTAQRENKPIFLSIGYSSCHWCTVMEGEAF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK-PLMGG 220
            D+ +A  LN  F+ IKVDREERPD+D +YM  +Q + G GGWPL++FL+PD + P  GG
Sbjct: 62  SDDAIATYLNANFLPIKVDREERPDIDSIYMQALQMMVGQGGWPLNIFLTPDDQVPFYGG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFP E +YGRPGF  +L  +K  +D   + ++   +  +  LS+  S+  ++  L   L
Sbjct: 122 TYFPVEARYGRPGFLRVLTALKKLYDTDSEQISSVKSQILAGLSQ--SSELAAGALDKTL 179

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK---KLEDTGKSGEASE- 336
               ++ CA  L          F    +FP    ++ +L   +    L   GK   +SE 
Sbjct: 180 LPRGVQACARTLMP--------FDMGNRFPMIPYVRWVLQGDRLVQTLPALGKDEASSEV 231

Query: 337 --------GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANV 388
                   G  +     + +  GGI DHV GGFHRY+VD  W VPHFEKMLYD G +   
Sbjct: 232 SAGEVPIDGWHLSKQRARNLVTGGIFDHVAGGFHRYTVDATWTVPHFEKMLYDNGLIMEF 291

Query: 389 YLDAFSLTKDVFYSYICRDI---LDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAF 445
             + +   K      I R +   +D+L+R+M  P G  ++A+DAD+  +E A   +EG F
Sbjct: 292 LAECWQ--KGERTPAIARAVDKTVDWLKREMRSPAGFFYAAQDADNFTSEEAIEPEEGDF 349

Query: 446 YVWTSKEVEDILGEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASA 504
           YVW+  E+  +L E  +      + L   GN            F+GKNVL        + 
Sbjct: 350 YVWSYAELASVLSEAELDEMASAFTLSKAGN------------FEGKNVL-----QRQAT 392

Query: 505 SKLGMPLEKYLNILGECRRKLFDVRS-------------------KRPRPHLDDKVIVSW 545
            +L   LE  L+ L   R   +  ++                   KR  P  D K+IV+W
Sbjct: 393 DELSDSLEASLDKLFRVRYGSYASQTPTFEPAVDAQMAKGRVWPGKRIPPVTDTKLIVAW 452

Query: 546 NGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRH-LYDEQTHRL 604
           N L+IS  A+A         +A FN       RK+Y+ +A   A +I+++   D   +RL
Sbjct: 453 NALMISGLAKA---------AAAFN-------RKDYLVLAIETAGYIQQYQQVDGMLYRL 496

Query: 605 QHSFRNGPSKAPGFLDDYAFLISGLLDLYEF--------GSGTKWLVWAIELQNTQDELF 656
            +    G ++ P   +DYA LI  L+D+ +         G    WL   I LQ TQ +  
Sbjct: 497 SY---EGNAEVPAQSEDYALLIKALIDIQQACLAFAEYRGMAADWLAAVIALQ-TQFDQT 552

Query: 657 LDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNA 716
           L  E GGY N T E   ++++ +   D A P+ N V++ +LVRL  +   ++   Y   A
Sbjct: 553 LSSEQGGYLNATSE--RLIVQERSYQDSAIPAANGVAIASLVRLFLL---TEDLDYLPKA 607

Query: 717 EHSLAVFETRLKDMAMAVPLMCCAADMLSVP 747
           E ++  F T L+    A P +  A D  + P
Sbjct: 608 ESAIQSFSTVLQKSPRACPSLLQAFDWFTHP 638


>gi|158334352|ref|YP_001515524.1| hypothetical protein AM1_1172 [Acaryochloris marina MBIC11017]
 gi|158304593|gb|ABW26210.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
          Length = 686

 Score =  353 bits (905), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 234/677 (34%), Positives = 334/677 (49%), Gaps = 97/677 (14%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   S YL +HA NP+DW+ W EEA   A + + PIFLS+GYS+CHWC VME E+F 
Sbjct: 3   NRLAHSASLYLRKHADNPIDWWPWCEEALERAAQENKPIFLSVGYSSCHWCTVMEGEAFS 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGGT 221
           +  +AK +N  ++ IKVDREERPD+D +YM  VQA+ G GGWPL++FLSP DL P  GGT
Sbjct: 63  NSEIAKYMNAQYIPIKVDREERPDIDSIYMQAVQAMTGQGGWPLNMFLSPGDLVPFYGGT 122

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP E +YGRPGF  +L  ++  +D +++ L        E+LS  L +S   N + D  P
Sbjct: 123 YFPEEPRYGRPGFLQVLEAIRSFYDTEKEKLDTQK----EKLSGHLQSSTVLNPIGDLQP 178

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK-KLEDTGKSGEASEGQKM 340
           +    L ++ ++K+           P FP      + L+ S+    D  K+ +A   + +
Sbjct: 179 E----LLSKGIAKNTTVLINKM-PGPSFPMMPYAAIALHGSRFSTPDQEKAQQACRQRGL 233

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSL-TKDV 399
            L      A GGI+DHV GGFHRY+VD  W VPHFEKMLYD GQ+     + +S   K+ 
Sbjct: 234 DL------ALGGIYDHVAGGFHRYTVDPTWTVPHFEKMLYDNGQIVEYLANLWSAGVKEP 287

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG- 458
            +       + +L+R+M    G  ++A+DAD+  T      +EG FY WT  E+  +L  
Sbjct: 288 AFERAIAGTVAWLQREMTAEAGYFYAAQDADNFVTTADIEPEEGRFYTWTDSELTHLLTT 347

Query: 459 EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNIL 518
           E      E + L   GN +            G  VL        S +            +
Sbjct: 348 EEYAAMAEIFNLSAQGNFE-----------DGLTVLQRQQPGVISET------------V 384

Query: 519 GECRRKLFDVR-SKRPR------------------------PHLDDKVIVSWNGLVISSF 553
            E  RKLF VR  +RP                         P  D K+IV+WN L+IS  
Sbjct: 385 EEALRKLFQVRYGERPESLTTFPPATNNQVAKTHPWPGRIPPVTDTKMIVAWNSLMISGL 444

Query: 554 ARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE-QTHRLQHSFRNGP 612
           ARA+ + +                + +Y+ +A  AA FI    + E + HR+ +   +G 
Sbjct: 445 ARAAAVFQ----------------QPDYLALATKAARFILDQQWSEGRLHRVNY---DGE 485

Query: 613 SKAPGFLDDYAFLISGLLDLYE------FGSGTKWLVWAIELQNTQDELFLDREGGGYFN 666
                  +DYA LI   LDL++          ++WL  A   Q   DE     EGGGYFN
Sbjct: 486 IAVIAQSEDYALLIKAFLDLHQASQSLAVDQASRWLEAAQTTQAEFDEHLWAVEGGGYFN 545

Query: 667 TTGE-DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFET 725
           T  E    +L+R +   D A P+ N V++ NL+RL+ +    +++Y  Q AE +L  F  
Sbjct: 546 TGSEMSEELLIRERSWLDNATPAANGVAIANLIRLSLVC--DRTEYLSQ-AEQALQTFGQ 602

Query: 726 RLKDMAMAVPLMCCAAD 742
            +     A P +  A D
Sbjct: 603 VMGSSTQACPSLFVALD 619


>gi|374599798|ref|ZP_09672800.1| hypothetical protein Myrod_2291 [Myroides odoratus DSM 2801]
 gi|423324955|ref|ZP_17302796.1| hypothetical protein HMPREF9716_02153 [Myroides odoratimimus CIP
           103059]
 gi|373911268|gb|EHQ43117.1| hypothetical protein Myrod_2291 [Myroides odoratus DSM 2801]
 gi|404606964|gb|EKB06498.1| hypothetical protein HMPREF9716_02153 [Myroides odoratimimus CIP
           103059]
          Length = 665

 Score =  353 bits (905), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 222/639 (34%), Positives = 324/639 (50%), Gaps = 79/639 (12%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L    +PYL QHA NP+ W AW    F +A++++  + +SIGYSTCHWCHVME ESF 
Sbjct: 2   NELQHASNPYLRQHASNPIHWKAWHPTVFEQAQEQNKLVIVSIGYSTCHWCHVMEEESFT 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +  VA+++N  F+SIKVDREE PDVD  YM  VQ +   GGWPL+V   PD +P+ GGTY
Sbjct: 62  NPAVAEVMNQDFISIKVDREEHPDVDAYYMKAVQLMTKQGGWPLNVVCLPDGRPIWGGTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQ--------SGAFAIEQLSEALSASASSN 274
           FP                 K  W      LAQ        +  FA  +L E +     + 
Sbjct: 122 FP-----------------KQTWVNALTQLAQLHQNKPEATLEFAT-KLQEGVYIMGLA- 162

Query: 275 KLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEA 334
            + +E  +  L +  E+  +S+D  +GG+  APKF  P     +LY    L+  G     
Sbjct: 163 PVANEESRFNLDIVLEKWKQSFDLEYGGYQRAPKFMMPTN---LLY----LQKVGDLTRD 215

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
            +    +  TL  MA GGI D + GGF RYSVD +WH+PHFEKMLYD  QL +VY DA+ 
Sbjct: 216 KDLLHYIDLTLTQMAWGGIFDVLEGGFSRYSVDFKWHIPHFEKMLYDNAQLLSVYSDAYK 275

Query: 395 LTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVE 454
            T +  Y  +    + +++R+ +   G I+SA DADS   +G +  +EGA+YVWT   + 
Sbjct: 276 RTANPLYLEVITKTIQFIQRNWLSDWGGIYSALDADSVNDKGIS--QEGAYYVWTEATLR 333

Query: 455 DILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKY 514
            ILG+   LF + + +   G  +           +G  VLI+ N   AS +         
Sbjct: 334 RILGDDFSLFAQIFNVNAYGYWE-----------EGHFVLIQ-NQPLASIATANQ----- 376

Query: 515 LNILGECRRK------LFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 568
           L++     RK      L + R  RP+PHLDDK+I SWN ++I+    A            
Sbjct: 377 LDVFDLQERKKKWEQLLLEERDHRPKPHLDDKIICSWNAMLITGLLDAYS---------- 426

Query: 569 FNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 628
                  ++   Y++ AES   +I+ +L DE+   L HS  N  +   G+LDDYAF I  
Sbjct: 427 ------ATNETSYLQQAESIYHYIQTYLLDEE-RGLFHSSHNQNAHTLGYLDDYAFYIQA 479

Query: 629 LLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPS 688
           L+ L+E  +   +L  A  L +   +LFLD +   ++       + +LR  E  D   PS
Sbjct: 480 LIRLFEHTANQDYLWQAKRLMDLTLDLFLDEKSKFFYFNQASQANHILRSIETEDNVIPS 539

Query: 689 GNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 727
            N+V  ++L++L       +  +Y Q A+H + V ++ L
Sbjct: 540 ANAVLCMSLLQLG---VAFEHAHYTQLAQHMIEVMQSNL 575


>gi|226365325|ref|YP_002783108.1| hypothetical protein ROP_59160 [Rhodococcus opacus B4]
 gi|226243815|dbj|BAH54163.1| hypothetical protein [Rhodococcus opacus B4]
          Length = 671

 Score =  353 bits (905), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 220/633 (34%), Positives = 310/633 (48%), Gaps = 79/633 (12%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           +  N L    SPYL QHA NPV W  WG EA A AR+RDVPI LSIGYS CHWCHVM  E
Sbjct: 4   REHNTLGGSTSPYLRQHADNPVHWQQWGPEATAWARERDVPILLSIGYSACHWCHVMAHE 63

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFEDE VA L+N+ FV +KVDREERPD+D VYM    A+ G GGWP++ FL+PD  P   
Sbjct: 64  SFEDEQVASLMNEHFVCVKVDREERPDLDAVYMNATVAMTGQGGWPMTCFLTPDGAPFYC 123

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           GTY+P + + G P F  +L  + D W  +R  +  + A  + +L                
Sbjct: 124 GTYYPAQPRGGMPSFTQLLGAIADTWRDRRGDVDDAAASVVAELRRGAGG---------- 173

Query: 280 LPQNALRLCAEQLS-------KSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSG 332
           +P+  +R+ A  L        +  D+  GGFG APKFP    ++ +L   ++  D    G
Sbjct: 174 IPEGEVRVTAALLDAAAGTVLRDEDAERGGFGGAPKFPPSALLEGLLRTYERSGDADVLG 233

Query: 333 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 392
                  +V  T   MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD  QL   Y   
Sbjct: 234 -------VVSRTASAMARGGIYDQLGGGFARYSVDAAWVVPHFEKMLYDNAQLLRAYAHL 286

Query: 393 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 452
              T       +  + +++L RD+    G   SA DAD+   EG T       YVWT ++
Sbjct: 287 GRRTGSEMALRVTEETVEFLLRDLRTDNGSFASALDADTEGVEGLT-------YVWTPQQ 339

Query: 453 VEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELN---DSSASASKLGM 509
           + ++LG                            E+  +   +  +   ++ AS  +L  
Sbjct: 340 LVEVLGSE------------------------DGEWAARVFAVTADGTFEAGASVLQLSR 375

Query: 510 PLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMF 569
             + + + +   R  L   R+ RP+P  DDKV+ +WNGL I++ A A             
Sbjct: 376 DPDDW-DRMRRIRDTLLARRATRPQPGRDDKVVTAWNGLAITALAEAG------------ 422

Query: 570 NFPVVGSDRKEYME-VAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISG 628
                G  R ++++  AE A + +  H+ D +  R       G S   G L+DYA L +G
Sbjct: 423 ----AGLGRPDWVDAAAECARAVLELHVVDGRLRRASLGASVGDSA--GVLEDYACLATG 476

Query: 629 LLDLYEFGSGTKWLVWAIELQNTQDELFLDRE-GGGYFNTTGEDPSVLLRVKEDHDGAEP 687
           LL LY+   G +WL  A  L +     F D E  G +F+T  +  +++ R ++  DGA P
Sbjct: 477 LLALYQATGGAEWLAHAQSLLDRALIHFADDERPGSWFDTADDAETLVTRPRDPVDGATP 536

Query: 688 SGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 720
           +G S     L+  +++     S  Y   A  SL
Sbjct: 537 AGASCLAEALLTASAVADVDASGRYATAAAASL 569


>gi|48478494|ref|YP_024200.1| thymidylate kinase [Picrophilus torridus DSM 9790]
 gi|48431142|gb|AAT44007.1| thymidylate kinase [Picrophilus torridus DSM 9790]
          Length = 614

 Score =  353 bits (905), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 230/603 (38%), Positives = 314/603 (52%), Gaps = 79/603 (13%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L  E SPYLLQHA NPVDW+ W E+AF +AR     IFLSIGYS+CHWCHVME ESF+
Sbjct: 2   NHLKNERSPYLLQHASNPVDWYPWSEQAFEKARSEGKLIFLSIGYSSCHWCHVMENESFK 61

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D+ VA+ +N  FVSIKVDREE PD+D  Y+T  Q + G  GWPL+  LSP+ KPL   TY
Sbjct: 62  DDLVARKMNKTFVSIKVDREEMPDIDNYYITLSQLMTGQAGWPLNFILSPEKKPLFAFTY 121

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
            P E +    G   +   +   W+ KRD L ++   AI  +   +         P+ +  
Sbjct: 122 IPRETRNNMIGMLDLCDTIDYLWNNKRDELLENANKAINAIKNEIK--------PERIDY 173

Query: 283 N-ALRLCAEQLSKSYDSRFGGFGSAPKFPRPVE-IQMMLYHSKKLEDTGKSGEASEGQKM 340
           N A+      L +++D  +GGFGSAPKFP   + I +MLYH     D            M
Sbjct: 174 NEAIENTFYSLKRTFDIEYGGFGSAPKFPEYHKLIFIMLYHKYFHGDI----------HM 223

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
            + TL  M  GGI+DHV GGFHRYS D  W VPHFEKM+YDQ      Y  A+ LT    
Sbjct: 224 AVKTLTEMRLGGIYDHVSGGFHRYSTDSMWIVPHFEKMMYDQAFAVLAYTQAYQLTGKKL 283

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
           +     +I D++  +  G     ++A DAD        +  EG +Y W   +++DI+ + 
Sbjct: 284 FMDTVHEITDFVNNEFFGEA--FYTAIDAD-------YKNIEGYYYTWDYNDIKDIIDDD 334

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
            I        KP GN    ++S       G+N+L        S  KL    EK + IL +
Sbjct: 335 FINDFNI---KPEGNFISDKIS-------GRNILY-----LKSEDKLN---EKNMKILKK 376

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            + K  D       P  D K++   NG+ I +F+ A  + K               DRK 
Sbjct: 377 LKEKRVD------SPFKDKKILCDVNGMAIKAFSYAYSVFK---------------DRK- 414

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
            +++A SAA FI   +Y  Q  +L HS+ NG      F DD+AF ISGL++LY   +  K
Sbjct: 415 MLDMARSAADFILYEMY--QDGKLYHSYMNGLGPLANF-DDHAFFISGLIELYNITNEKK 471

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
           ++  A++L     +LF D  G G+FN+TG+      R+KE +D A PSG S  + NL+ L
Sbjct: 472 YIDAAVQLNKKCIDLFYD--GNGFFNSTGD-----FRMKEYYDSAVPSGLSAELQNLILL 524

Query: 701 ASI 703
           + I
Sbjct: 525 SFI 527


>gi|342883561|gb|EGU84024.1| hypothetical protein FOXB_05444 [Fusarium oxysporum Fo5176]
          Length = 870

 Score =  353 bits (905), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 228/702 (32%), Positives = 359/702 (51%), Gaps = 100/702 (14%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NR AA  SPY+   A + V W    +EA   +RK +  IF+ IGY  CH+C +M +E+F 
Sbjct: 167 NRAAASQSPYIRGQAESLVSWQLLDDEAVERSRKENKLIFMHIGYKACHFCRLMSIETFS 226

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +   A +LN+ F+ + VDREERPD+D +YM YVQA+   GGWPL+VFL+P+L+P+ GGTY
Sbjct: 227 NPDSASVLNESFIPVIVDREERPDLDAIYMNYVQAVSNVGGWPLNVFLTPNLEPVFGGTY 286

Query: 223 FPPEDKYGRPGFK--------------TILRKVKDAW--------DKKRDMLAQSGAFAI 260
           +     +G  G +              TI +KV+D W         +  +++ Q   FA 
Sbjct: 287 W-----FGPAGRRHLSDDSTEEVLDSLTIFKKVRDIWIDQEARCRKEATEVVGQLKEFAA 341

Query: 261 EQL----------------------SEALSASASSNKLPDELPQNALRLCAEQLSKSYDS 298
           E                        S A +A   S  + +EL  + L      ++ ++D 
Sbjct: 342 EGTLGTRSISAPSALGPAGWGAPAPSHASTAKEKSTAVSEELDLDQLEEAYTHIAGTFDP 401

Query: 299 RFGGFGSAPKFPRPVEIQMMLYHSKK---LEDTGKSGEASEGQKMVLFTLQCMAKGGIHD 355
            FGGFG APKF  P ++  +L   K    ++D     E     ++ L T++ +  G +HD
Sbjct: 402 VFGGFGLAPKFLTPPKLAFLLGLLKSPGAVQDVVGEAECKHATEIALDTMRHIRDGALHD 461

Query: 356 HVGG-GFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT----KDVFYSYICRDILD 410
           H+GG GF R SV   W +P+FEK++ D  QL ++Y+DA+ ++    KD F   +  ++ +
Sbjct: 462 HIGGTGFSRCSVTADWSIPNFEKLVTDNAQLLSLYIDAWKVSGGGEKDEFLDVVL-ELAE 520

Query: 411 YLRRD-MIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE----HAILFK 465
           YL    ++ P G   S+E ADS   +G   K+EGA+YVWT +E + +L E     + +  
Sbjct: 521 YLTSSPIVLPEGGFASSEAADSYYRQGDKEKREGAYYVWTRREFDSVLDEIDSHMSPILA 580

Query: 466 EHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECRRKL 525
            ++ +   GN +    SDP+++F  +N+L   +     +++   P+EK    + + RR L
Sbjct: 581 SYWNVNQDGNVE--EESDPNDDFIDQNILRVKSTIEQLSTQFSTPVEKIKEYIEQGRRAL 638

Query: 526 FDVRSK-RPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEV 584
              R + R RP LDDK++V WNGLVIS+ ++A+  LK+          +      +   +
Sbjct: 639 RKRREQERVRPDLDDKIVVGWNGLVISALSKAASSLKT----------LRPEQSSKCRAI 688

Query: 585 AESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVW 644
           AE AA+ IR+ L+D    R+ +   +G      F DDYA++I GLLDL E     ++L +
Sbjct: 689 AEQAAACIRKKLWD-GNERILYRIWSGGRGNTAFADDYAYMIQGLLDLLELTGNQEYLEF 747

Query: 645 AIELQ-------------------NTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGA 685
           A  LQ                    TQ  LF D + G +F+T    P  +LR+K+  D +
Sbjct: 748 ADILQRESSQFPSHLTHPADHAITETQTSLFYDAD-GAFFSTQANSPYTILRLKDGMDTS 806

Query: 686 EPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRL 727
            PS N+VSV NL RLA++++   +D     A  ++  FE  +
Sbjct: 807 LPSTNAVSVANLFRLANLLS---NDDLAAKARQTINAFEVEV 845


>gi|377573232|ref|ZP_09802302.1| hypothetical protein MOPEL_013_00090 [Mobilicoccus pelagius NBRC
           104925]
 gi|377538035|dbj|GAB47467.1| hypothetical protein MOPEL_013_00090 [Mobilicoccus pelagius NBRC
           104925]
          Length = 681

 Score =  352 bits (904), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 219/605 (36%), Positives = 302/605 (49%), Gaps = 75/605 (12%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL    SPYL QHA NPVDW+ W +EA AEAR+RDVPI LSIGY+ CHWCHVM  E FE
Sbjct: 3   NRLVDATSPYLRQHADNPVDWWPWCDEALAEARERDVPILLSIGYAACHWCHVMAHEVFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DEGVA  L D FV+IKVDREERPD+D VYM+   AL G GGWP++  L+PD +P    TY
Sbjct: 63  DEGVASALADGFVAIKVDREERPDLDAVYMSATVALTGRGGWPMTCLLTPDGRPFFAATY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
            P      RP F  +L    +AW ++RD + +S     E L   + A A    +  + P+
Sbjct: 123 VP------RPQFLHLLASAHEAWTERRDEVEESADRIAEALRGQVDAQAQLAPVLGDTPE 176

Query: 283 ---------NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGE 333
                     AL    E+ + ++D   GGFG+APKFP  + +  +L H  +         
Sbjct: 177 AQGADDVLRAALDAAEERTASTFDWERGGFGTAPKFPPSMTLSWLLRHHDRT-------T 229

Query: 334 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
                +MV  T + MA+GG++D + GGF RYS D  W VPHFEKMLYD   L +VY D F
Sbjct: 230 TPRALQMVEATCEAMARGGMYDQLAGGFTRYSTDADWVVPHFEKMLYDNALLLSVYTDWF 289

Query: 394 SLTKDVFYSYICRDILDYLRRDMIGPGGEIFSA--EDADSAETEGATRKKEGAFYVWTSK 451
            ++       + R+  ++L RD+  P G   S+   D+ +A       + EGA YVWT  
Sbjct: 290 RVSGSPLAERVARETAEFLLRDLRTPEGAFASSLDADSPAAPDAPPALEGEGAAYVWTPA 349

Query: 452 EVEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSA---SASKLG 508
           ++  +LGE                           +     +L+ + ++      AS L 
Sbjct: 350 QLTAVLGE--------------------------EDAATAALLLGVTEAGTFEHGASVLQ 383

Query: 509 MPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAM 568
             ++         R +L   R  RP+P  DDKV+ +WNGL I++ A AS  L        
Sbjct: 384 RRVDPDPAWWTSARERLLRARLTRPQPARDDKVVTAWNGLAIAALADASVAL-------- 435

Query: 569 FNFPVVGSDRKEYMEVAESAASF-IRRHLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLI 626
                   D    +E A + A F +  H+ D    R + + R+G    A G  +D+  L 
Sbjct: 436 --------DDPRLLEAAVACAEFVVATHVVD---GRCRRTSRDGVVGDALGVAEDHGDLA 484

Query: 627 SGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAE 686
            GL+ L+       WL  A  L +   +LF D   GG+F+T  +   +LLR + D D AE
Sbjct: 485 HGLVRLHAATGEQVWLDAAGALLDVATDLF-DAPDGGFFDTGSDAAELLLRPRSDTDNAE 543

Query: 687 PSGNS 691
           P G S
Sbjct: 544 PCGAS 548


>gi|355570877|ref|ZP_09042147.1| protein of unknown function DUF255 [Methanolinea tarda NOBI-1]
 gi|354826159|gb|EHF10375.1| protein of unknown function DUF255 [Methanolinea tarda NOBI-1]
          Length = 711

 Score =  352 bits (904), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 236/673 (35%), Positives = 332/673 (49%), Gaps = 45/673 (6%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA +PVDW+ WG+EAF  AR+ D PIFLSIGY+TCHWCHVM  ESF 
Sbjct: 16  NRLIKEVSPYLRQHAFDPVDWYPWGDEAFIRAREEDKPIFLSIGYATCHWCHVMREESFS 75

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  V + LN+ FV IK+DREERPD+D+ YM    A  G GGWPLS+FL+P   P    +Y
Sbjct: 76  DPEVGRFLNENFVCIKLDREERPDLDQYYMDACIAFTGRGGWPLSIFLTPGGVPFFATSY 135

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQ-LSEALSASASSNKLPDELP 281
            P     G  G   +L  +   W + RD      A ++ + +SE +   A  +     LP
Sbjct: 136 IPRTRTGGNYGILEVLAAIAAYWKEHRD-----DALSLARDISENI-VRARDHAYSGPLP 189

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
                +  + L   +DS+ GGFG  P+FP       +L +       G     +    + 
Sbjct: 190 AGTAGMVYDHLVSIHDSKNGGFGPPPRFPLFHLHLFLLRY-------GIIHRTTAPIDLS 242

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             TL  MA+GG++D +GGGFHRY+ DERW VPHFEKMLYDQ   A  Y +A++LT +   
Sbjct: 243 CHTLLSMARGGVYDQLGGGFHRYATDERWLVPHFEKMLYDQALAALAYSEAYTLTGNAVL 302

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHA 461
             + R  ++Y+ RD+  P G  ++ EDADS          EG FY WT  E+E +L    
Sbjct: 303 GNVARGCMEYICRDLQAPDGGFYAGEDADSG-------GGEGLFYTWTRDEIESVLSPEE 355

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
                  +   + NC  +  S      +   VL      + +A  LGM       +L   
Sbjct: 356 NRIASSVF---SLNCIDTPGSAGGTSAREAGVLSRARQPADAARLLGMAPGDVERVLETM 412

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           + KL   R+ RP P  D  V+  WNGL IS+ + AS+ L   A                +
Sbjct: 413 KEKLLSARNTRPHPPRDTLVLTDWNGLAISALSVASRTLGDPA----------------F 456

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
           +  A  AA F+   +       + H +  G +   G   DYA +I GLLDL+       +
Sbjct: 457 LAAARRAAGFVLGQMRSPDGG-IYHRWMAGDAAIQGMSADYASVIMGLLDLFLATREPTF 515

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLA 701
           L  AIEL++   + F D++ GGY+ T  +   V +R KE  DG+ PS NS+S  NLVRL 
Sbjct: 516 LSAAIELEDYHFQNFWDKDKGGYYWTRDDQKDVPVRQKEFLDGSIPSSNSLSFSNLVRL- 574

Query: 702 SIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSV 761
            I+ G  S  Y + A      +   ++    +   M  A  +++      VV+ G ++  
Sbjct: 575 HILTGETS--YMERAGQVAGYYPPLVRQYPSSC-TMFFAGHLVTEGRAGTVVVTGDETDP 631

Query: 762 DFENMLAAAHASY 774
            +  ML     +Y
Sbjct: 632 LYVRMLGILDRNY 644


>gi|37521713|ref|NP_925090.1| hypothetical protein gll2144 [Gloeobacter violaceus PCC 7421]
 gi|35212711|dbj|BAC90085.1| gll2144 [Gloeobacter violaceus PCC 7421]
          Length = 650

 Score =  352 bits (903), Expect = 4e-94,   Method: Compositional matrix adjust.
 Identities = 244/673 (36%), Positives = 334/673 (49%), Gaps = 81/673 (12%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E S YL +HA+NP+DW  WG EA A+A   D P+F+SIGYS+CHWC VME E+F 
Sbjct: 8   NRLLHEKSLYLRKHAYNPIDWLPWGPEALAKAEHEDKPLFVSIGYSSCHWCTVMENEAFS 67

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGGT 221
           D  +A  +N  FV+IKVDREERPD+D +YM  +Q +   GGWPL++FL+P DL P  GGT
Sbjct: 68  DPEIAGFMNAHFVAIKVDREERPDIDAIYMQALQLMNQQGGWPLNIFLTPGDLVPFYGGT 127

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP +D+YGRPGF  +L  + D +  +R+ L        E++  AL A+     L  ELP
Sbjct: 128 YFPVQDRYGRPGFLRVLEAIHDYYRGQRERLGDHK----ERMLGALEAATRLQPL-SELP 182

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
            + LR     L     +     G  P FP        L   + LE          G+   
Sbjct: 183 PDPLRRAVPPLR----ALLARDGMGPSFPMIPHAGFALRMGRFLEVELAQSACERGED-- 236

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV-F 400
                 +A GGI DHVGGGFHRY+VD  W VPHFEKMLYD GQ+     D ++    +  
Sbjct: 237 ------LATGGIFDHVGGGFHRYTVDGTWTVPHFEKMLYDNGQIVEFLSDLWASGLHIPA 290

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL-GE 459
           +         +L R+M    G  ++A+DADS   EG    +EG FYVW++ E+++IL GE
Sbjct: 291 FERAVEFTHRWLLREMTDGRGYFYAAQDADS---EG----EEGKFYVWSASELQEILSGE 343

Query: 460 HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILG 519
                +  ++L   GN            F+G+  +++      S   L   +E  L    
Sbjct: 344 ELAALESAFFLSAEGN------------FEGRTTVLQRR----SGDVLAPVVETALT--- 384

Query: 520 ECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRK 579
               KLF VRS+R     D K+IVSWN L+I+   RA+ +                  R 
Sbjct: 385 ----KLFGVRSRRVPAATDTKLIVSWNALMIAGLNRAADVF----------------GRP 424

Query: 580 EYMEVAESAASFIRRHLYDE-QTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
           EY E A  AA FI  H     + +RL +   +G    P   +DYA  I  L+DLY     
Sbjct: 425 EYRETAVGAARFILEHQRAPGEFYRLNY---DGEPAIPAHAEDYACFIKALIDLYVSTQQ 481

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLV 698
            +WL  A  LQ   DE   D E GGYF+     P +L+R K+  D A P+ N ++  NLV
Sbjct: 482 GEWLEAARALQQQMDERLWDLEMGGYFSAPS-GPDLLIREKDFQDSATPAANGLAAANLV 540

Query: 699 RLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAAD-------MLSVPSRKH 751
           RL  +   +    Y + AE  L  F   L ++  A P +    D       + S P R  
Sbjct: 541 RLFLL---TDEPAYLEAAEALLRQFARILAEVPRAGPSLLAGYDWYRNQVLVQSDPERIA 597

Query: 752 VVLVGHKSSVDFE 764
            +L G+  +  F+
Sbjct: 598 ELLRGYWPTAVFK 610


>gi|297626872|ref|YP_003688635.1| thioredoxin [Propionibacterium freudenreichii subsp. shermanii
           CIRM-BIA1]
 gi|296922637|emb|CBL57214.1| Conserved protein containing thioredoxin domain [Propionibacterium
           freudenreichii subsp. shermanii CIRM-BIA1]
          Length = 894

 Score =  352 bits (903), Expect = 5e-94,   Method: Compositional matrix adjust.
 Identities = 221/614 (35%), Positives = 317/614 (51%), Gaps = 69/614 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL AE SPYL  HA + +DW+ WG  A AEAR+R +P+ LS+GY++CHWCHVM  ESF 
Sbjct: 3   NRLVAESSPYLRGHADDLIDWWPWGPRALAEARRRQLPVLLSVGYASCHWCHVMAQESFR 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  VA+ +ND FV+I VDREERPDVD+V+M   QAL G GGWP++VF +PD +P   GTY
Sbjct: 63  DPQVAQFVNDNFVAIAVDREERPDVDQVFMNATQALTGQGGWPMTVFCTPDGEPFFAGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP + + G+P F  + + +  AW ++RD + +SGA    QL++  SA+  +     E P 
Sbjct: 123 FPSQARVGQPSFLQVCQTLARAWAERRDEVVESGAHIASQLADQASAADPAGDQTGE-PP 181

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
            A  L A  L+   D   GGFG+APKFP+P  +  ++           +GE  +    V 
Sbjct: 182 AADELLARALAL-VDPDNGGFGTAPKFPQPASLDALMV----------TGEPHQ-IGAVQ 229

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQ----GQLANVYLDAFSLTKD 398
            +L+ + +GGIHD VGGGFHRY+VD  W VPHFEKML D     G L   +      T D
Sbjct: 230 LSLEHIVRGGIHDIVGGGFHRYAVDAAWAVPHFEKMLDDNALLLGTLTRAWRRTGPETGD 289

Query: 399 V--FYSYICRDILDYLRRDM---IGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
           +   +    R I+ +L R+M      G    S +DADS + +G  ++ EGAFY+WT  +V
Sbjct: 290 LREHFELAIRGIVGWLSREMAITTDAGTAFASGQDADSLDADG--QRVEGAFYLWTPHQV 347

Query: 454 EDILGEHAILFKEH-YYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP-L 511
           E +      LF +  ++L P G                      + D S++    G P  
Sbjct: 348 EAVFNRRDALFAQAVFHLTPKGT---------------------MPDHSSTLRLHGDPDP 386

Query: 512 EKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNF 571
           ++   ILGE R    +VR++RP P  DDKV+  WNGL+  S   A+ +         F  
Sbjct: 387 DRLKRILGELR----EVRARRPAPARDDKVVAGWNGLLADSLTSAAMV---------FGE 433

Query: 572 PVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLD 631
           P       E++ +A S   ++    + +  H  + S       AP  L+DYA    G   
Sbjct: 434 P-------EWLTMARSVLDYLWSVHHFDTDHAARSSLAGVAGPAPAVLEDYAGFALGAAR 486

Query: 632 LYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNS 691
           L      T+ L  A+ +     ELF   + GG+F+    D ++  R ++  D   PS  S
Sbjct: 487 LAGATGDTELLDRAVTVLGRGVELF-GADDGGFFDAQ-HDEALFTRARQLADEGGPSATS 544

Query: 692 VSVINLVRLASIVA 705
           + V  L  +A +  
Sbjct: 545 IMVTALQVVAGLTG 558


>gi|428209785|ref|YP_007094138.1| hypothetical protein Chro_4890 [Chroococcidiopsis thermalis PCC
           7203]
 gi|428011706|gb|AFY90269.1| hypothetical protein Chro_4890 [Chroococcidiopsis thermalis PCC
           7203]
          Length = 698

 Score =  352 bits (902), Expect = 6e-94,   Method: Compositional matrix adjust.
 Identities = 246/681 (36%), Positives = 356/681 (52%), Gaps = 88/681 (12%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRLA   S YL +HA NP+DW+ W +EA A+A+  + PIFLSIGYS+CHWC VME E+F
Sbjct: 2   TNRLAQTQSLYLRKHAENPIDWWFWCDEALAKAKAENKPIFLSIGYSSCHWCTVMEGEAF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGG 220
            D  VA  LN  F+ IKVDREERPD+D +YM  +Q + G GGWPL++FL+P DL P  GG
Sbjct: 62  SDLAVAAYLNANFLPIKVDREERPDIDSIYMQALQMMTGQGGWPLNIFLAPEDLVPFYGG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWD-KKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           TYFP E +YGRPGF  +L+ ++  +D +K+D+ ++      E + EA+  +A    LP+ 
Sbjct: 122 TYFPLEPRYGRPGFLQVLQALRRYYDTEKQDLRSRQ-----EAILEAIQQAAI---LPNT 173

Query: 280 LPQNALRLCAEQLSKSYDSRFGG-FGSAPKFPRPVEIQMMLYHSKKL----EDTGKSGEA 334
            P N+  L  + +  S     GG +G+  KFP      + L   + L    ++   +   
Sbjct: 174 QPLNS-ALLRQGIETSTGIITGGDYGT--KFPMIPYADLALRGWRFLPVWKDNFRYNLPE 230

Query: 335 SEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS 394
           S  Q+ +      +A GGI+DHVGGGFHRY+VD  W VPHFEKMLYD GQ+     + +S
Sbjct: 231 SCTQRGI-----DLALGGIYDHVGGGFHRYTVDPTWTVPHFEKMLYDNGQIVEYLANLWS 285

Query: 395 L-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEV 453
              K+  +       + +L+R+M  P G  ++A+DADS         +EGAFYVW+  E+
Sbjct: 286 AGVKEPAFERAIALTVKWLQREMTAPEGYFYAAQDADSFIHPEEAEPEEGAFYVWSYSEL 345

Query: 454 EDIL-GEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIE-----LNDSSASASK- 506
           E+IL  E     +  + + P GN            F+GKNVL       L+++  SA K 
Sbjct: 346 ENILTSEELTAIQAEFTVTPQGN------------FEGKNVLQRRQVGILSETVESALKK 393

Query: 507 -----LGMPLEKYLNILGECRRKL---FDVRSKRPRPHLDDKVIVSWNGLVISSFARASK 558
                 G  +E+ L I    R          + R     D K+IV+WN L+IS  ARA+ 
Sbjct: 394 LFQVRYGSTVEE-LEIFPPARNNQEAKTQTWAGRIPAVTDTKMIVAWNSLMISGLARAAI 452

Query: 559 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLY-DEQTHRLQHSFRNGPSKAPG 617
           + +                + +Y+++A  AA+FI  + + D + HRL +   +G +    
Sbjct: 453 VFQ----------------QNDYLDLAVRAANFILENQWVDGRFHRLNY---DGKAAVMA 493

Query: 618 FLDDYAFLISGLLDLYEFG------------SGTKWLVWAIELQNTQDELFLDREGGGYF 665
             +DYA  I  LLDL++              +   WL  AI +Q   DEL    E GGYF
Sbjct: 494 QSEDYAQFIKALLDLHQASLVETLHVETLHVTSLHWLEKAIAVQTEFDELLWSVELGGYF 553

Query: 666 NTTGE-DPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFE 724
           NT  +    +++R +   D A P+ N V++ NLVRLA +   ++   Y   AE  L  F 
Sbjct: 554 NTAKDASQELIIRERSYMDNATPAANGVAIANLVRLALL---TEDLTYLDRAEQGLQAFS 610

Query: 725 TRLKDMAMAVPLMCCAADMLS 745
           + +     A P +  A D  S
Sbjct: 611 SAMHQHPQACPSLFTAFDWYS 631


>gi|225871957|ref|YP_002753411.1| hypothetical protein ACP_0267 [Acidobacterium capsulatum ATCC
           51196]
 gi|225793798|gb|ACO33888.1| conserved hypothetical protein [Acidobacterium capsulatum ATCC
           51196]
          Length = 702

 Score =  351 bits (901), Expect = 7e-94,   Method: Compositional matrix adjust.
 Identities = 203/597 (34%), Positives = 303/597 (50%), Gaps = 51/597 (8%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N LA   S YL    H PV W +WG++AFA A + D P+ L IG   CHWCHVM+ ES+E
Sbjct: 6   NALAHSSSAYLRSAMHQPVRWHSWGDDAFALAAQEDKPVLLDIGAVWCHWCHVMDRESYE 65

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           +  +A ++N+ F++IKVDR+ERPDVD  Y   VQA+ G GGWPL+  L+P+ KP  GGTY
Sbjct: 66  NPAIAAVINEHFIAIKVDRDERPDVDSRYQAAVQAMAGQGGWPLTAILTPEGKPFFGGTY 125

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPPED+YGRPGF+ +LR + D W  +R    ++    +  +    S +  S  L   + +
Sbjct: 126 FPPEDRYGRPGFERVLRSLADVWQNRRGEALETANSVLGAIEHGESFAGRSGTLSISIVE 185

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             +    +Q    +D+R+GGFGS PKFP P  + M++       DT         ++   
Sbjct: 186 KLVSSAVQQ----FDARYGGFGSQPKFPHPSAMDMLI-------DTASRTGNERVREAAT 234

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL+ MA GG++D + GGFHRYSVDE+W VPHFEKMLYD   L + Y+ AF    +  ++
Sbjct: 235 VTLRKMAAGGVYDQLAGGFHRYSVDEQWIVPHFEKMLYDNAGLLSNYVHAFQSFVEPEFA 294

Query: 403 YICRDILDYLRRDMIG-PGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHA 461
            +  DI+ ++   +     G  ++++DAD           +G ++ WT  E   +L    
Sbjct: 295 AVAVDIIRWMDECLSDRERGGFYASQDAD------INLDDDGDYFTWTLAEARAVLSNEE 348

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
           +     Y+       D+  M D H+  + KNVL      +  A+ L +  E+    L   
Sbjct: 349 LAVAASYF-------DIGEMGDMHHNPQ-KNVLHSKRTLAEVAAALSLSAEEAQKKLDSA 400

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           + KL   R +RP P +D  +  SWN L IS++ +A+++L          F ++  DR   
Sbjct: 401 KSKLLAARRERPTPFIDTTIYTSWNALAISAYLQAARVLDLPHAR---TFALLTLDR--- 454

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP-----GFLDDYAFLISGLLDLYEFG 636
                     I R  + E T  L H       K+P     G LDDYAFL    L+ +E  
Sbjct: 455 ----------ILREAWSE-TSGLSHVVAYADGKSPAAWVAGVLDDYAFLTDACLEAWEST 503

Query: 637 SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDP---SVLLRVKEDHDGAEPSGN 690
              K+   A ++ +     F D+  G +F+T  +     ++  R K   D   P+GN
Sbjct: 504 GDRKYYDAAAQIADAMIARFYDQTSGAFFDTEIQGSKLGALAARRKPLQDTPTPAGN 560


>gi|84498558|ref|ZP_00997321.1| hypothetical protein JNB_20238 [Janibacter sp. HTCC2649]
 gi|84381091|gb|EAP96976.1| hypothetical protein JNB_20238 [Janibacter sp. HTCC2649]
          Length = 663

 Score =  351 bits (901), Expect = 8e-94,   Method: Compositional matrix adjust.
 Identities = 243/691 (35%), Positives = 333/691 (48%), Gaps = 85/691 (12%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   SPYLLQHA NPVDW+ WG +AFAEAR+RDVP+ LS+GY+ CHWCHVM  ESFE
Sbjct: 3   NRLAQSTSPYLLQHADNPVDWWEWGPDAFAEARRRDVPVLLSVGYAACHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D GVA  +N  FV++KVDREERPDVD VYM    AL G GGWP++  L+PD  P   GTY
Sbjct: 63  DVGVADAINANFVAVKVDREERPDVDAVYMNATTALTGHGGWPMTCVLTPDGDPFFAGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP      R  F  +L  V   W ++R  +  SGA    QL +  + ++SS+  P  L  
Sbjct: 123 FP------RQQFLALLANVTKVWTEQRADVVASGAHIAGQLRDMTAPASSSSITPQTLAG 176

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
                    L ++YD   GGFG APKFP  + ++ ++ H  +  D       ++   M  
Sbjct: 177 -----AVTNLRQNYDLARGGFGGAPKFPPSMALEFLIRHHARTGD-------ADALAMAR 224

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            T   MA+GGI+D + GGF RY+VD  W VPHFEKMLYD  QL  V+   +  T D    
Sbjct: 225 RTCDAMARGGIYDQLAGGFARYAVDADWVVPHFEKMLYDNTQLLRVHTHLWRSTGDPLAH 284

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
            I  +  D++ RD+    G   SA DAD+   +GA+   EGA Y WT  ++ ++LG    
Sbjct: 285 RIACETADFIIRDLGTSEGCFASALDADTV-IDGAS--VEGATYAWTPAQLVEVLGSQDG 341

Query: 463 LFKEHYYLKPT------GNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
           +         T      G   L   SDP +E     V                       
Sbjct: 342 VRAAELLSVTTEGTFEHGASTLQLRSDPEDEQWWSGV----------------------- 378

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
                R +L D R  R +P  DDKV+ SWNGL I+  A A  +L                
Sbjct: 379 -----RTRLLDARLGRAQPARDDKVVTSWNGLAIAGLADAGALL---------------- 417

Query: 577 DRKEYMEVAESAASF-IRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
           DR ++++ A   A F +  H+ D +  R           A G  DD+  L  GLL L++ 
Sbjct: 418 DRPDFVDAAVRCAEFVVGSHVVDGRLRRASRDGVV--GAAAGVADDHGNLAEGLLALHQA 475

Query: 636 GSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLL-RVKEDHDGAEPSGNSVSV 694
               +WL  A  + +     F D E  G  + T +D   L  R +   D AEPSG S   
Sbjct: 476 TGDPRWLAEAGTILDVALTHFRDAE--GVVHDTADDAEQLFTRPRSQADNAEPSGVSSLA 533

Query: 695 INLVRLASIVAGSKSDYYRQNAEHSLA-VFETRLKDMAMAVPLMCCAADMLSVPSRKHVV 753
              +  A++   ++   +R+ A+ +LA V     +D   A   +  A    + P    V 
Sbjct: 534 GAWLTYAALTGSTR---HREAADAALASVGALAARDPRFAGWSLAVAEAAAAGP--LQVA 588

Query: 754 LVGHKSSVDFENMLAAAHASYDLNKTVSKKS 784
           +VGH S+   E + A A AS      +++ +
Sbjct: 589 IVGHGSTA--EALFATARASTSPGLVIARGA 617


>gi|443321849|ref|ZP_21050889.1| thioredoxin domain containing protein [Gloeocapsa sp. PCC 73106]
 gi|442788465|gb|ELR98158.1| thioredoxin domain containing protein [Gloeocapsa sp. PCC 73106]
          Length = 684

 Score =  351 bits (901), Expect = 8e-94,   Method: Compositional matrix adjust.
 Identities = 239/659 (36%), Positives = 336/659 (50%), Gaps = 77/659 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   S YL +HA NP+DW+ W EEA A A+  + PIFLSIGYS+CHWC VME E+F 
Sbjct: 3   NRLAKVKSLYLRKHAENPIDWWYWCEEAIATAKADNKPIFLSIGYSSCHWCTVMEGEAFS 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSP-DLKPLMGGT 221
           D+ +A  LN+ F+ IKVDREERPD+D +YM  +Q + G GGWPL++FL+P DL P  GGT
Sbjct: 63  DQAIADYLNENFLPIKVDREERPDIDSIYMQALQMISGQGGWPLNIFLTPDDLIPFYGGT 122

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKL-PDEL 280
           YFP E +YGRPGF  +LR ++  +D+++  L        +Q+   L  S   N + P+ L
Sbjct: 123 YFPVEPRYGRPGFLDVLRSLRHFYDQEKSKLNS----IKDQVRSGLEQSTMLNVVEPNHL 178

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG-KSGEASEGQK 339
               L     + + S  SR      +P FP      M+ Y    L+ +  K     +G++
Sbjct: 179 INKELLYKGIETNTSVISR--NSPGSPSFP------MIPYADLALKGSYLKFNSRYDGRE 230

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS--LTK 397
           +     + +A GGI DHVGGGFHRY+VD  W VPHFEKMLYD G +     + +S  +++
Sbjct: 231 LAKQRGKDLALGGICDHVGGGFHRYTVDPTWTVPHFEKMLYDNGLIVEYLANLWSGGISE 290

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
             F   I   +  +L+R+M  P    ++A+DADS  T  A   +EGAFYVW   E+E +L
Sbjct: 291 PSFERAIALTV-QWLKREMTAPESYFYAAQDADSFPTSDALEPEEGAFYVWNYSELESLL 349

Query: 458 GEHAI-LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
             + +      + +   GN            F+G NVL         + KL   LE  L 
Sbjct: 350 TPYELNQLGAEFTVSSEGN------------FEGSNVL-----QRRQSGKLSSSLETILA 392

Query: 517 ILGECR----RKLFD----VRSKRPRPHL----------DDKVIVSWNGLVISSFARASK 558
            L E R     K  D     R+ +    L          D K+IV+WN L+IS  ARA  
Sbjct: 393 KLFETRYGRSSKEIDCFPPARNNQEAKFLSWEGRIPAVTDTKMIVAWNSLMISGLARA-- 450

Query: 559 ILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGF 618
                   A+F+ P        Y ++A  A  FI  + + E   R Q    +G +  P  
Sbjct: 451 -------YAVFSEP-------SYWDLAVGATKFILNNQWVE--GRFQRLNYDGEAAVPAQ 494

Query: 619 LDDYAFLISGLLDLYEFG-SGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVL-L 676
            +DY   I  LLDLY      T WL  A+ +Q   D LF   + GGY+N   ++ + L L
Sbjct: 495 AEDYTLFIKALLDLYAAKPEETNWLDRALAVQQELDRLFWCSDSGGYYNNGSDNGATLPL 554

Query: 677 RVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVP 735
           R +  +D A PS N V++ NLVRL+ +    +   +   A+  LA+F   L+      P
Sbjct: 555 RERSYNDNAIPSANGVAIANLVRLSLLTDNLE---HLDRAQEILAIFGNVLQKYPQTCP 610


>gi|340975510|gb|EGS22625.1| hypothetical protein CTHT_0010970 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 785

 Score =  351 bits (900), Expect = 9e-94,   Method: Compositional matrix adjust.
 Identities = 244/734 (33%), Positives = 363/734 (49%), Gaps = 112/734 (15%)

Query: 74  SHRPIHPYKVVAMAERTPASTSHSRNKHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAE 133
           S +P+ P      AE TP +T        NR     SPY+ +HA  PV W    E     
Sbjct: 15  SSQPVQP-----PAEETPQNTLPPLR---NRAGESDSPYVRRHADTPVAWQLLDEATIER 66

Query: 134 ARKRDVPIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMT 193
           A++ + PIF+ IG+   H+CH+   +SF +  VA+ LN  F+ I +DREERPD+D ++  
Sbjct: 67  AKEENKPIFMHIGFLADHFCHLTTQDSFSNPAVAEFLNQSFIPILIDREERPDLDTIFQN 126

Query: 194 YVQALYGGGGWPLSVFLSPDLKPLMGGTYF-----------------------PPEDKYG 230
           Y +A+   GGWPL++FL+PDL P+ GGTY+                       P ED YG
Sbjct: 127 YSEAVNATGGWPLNLFLTPDLYPIFGGTYWPGPGTEHSTLGSDRASESAIAGEPGEDSYG 186

Query: 231 RPGFKTILRKVKDAWDKKR--------DML------AQSGAFAIEQLSEALSASASSNKL 276
              F  I +K+   W  +         +ML      AQ G F+    S + +++A+ N  
Sbjct: 187 --DFLAIAKKIHGFWVTQEERCRREAFEMLHKLQDFAQEGTFSTPVGSGSAASAAADNS- 243

Query: 277 PDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK---KLEDTGKSGE 333
             +L  + L     +++K +D  + GFG+ PKFP P  +  +L  +K   ++ D     E
Sbjct: 244 --DLDLDQLDEALTRIAKMFDPVYHGFGT-PKFPNPARLSFLLRLAKFPTEVSDVIGERE 300

Query: 334 ASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAF 393
              G  M L TL+ +  GG+HDH+G GF R+SV + W +PHFEKM+ +   L  V+LDA+
Sbjct: 301 VENGTAMALKTLRRIRDGGLHDHLGAGFMRFSVTKNWGLPHFEKMVCENALLLGVFLDAW 360

Query: 394 ----------SLTKDVFYSYICRDILDYLRRDMI-GPGGEIFSAEDADSAETEGATRKKE 442
                     SL  +  ++ +  ++ DYL   +I  P G   ++E ADS    G    +E
Sbjct: 361 LGYTAGPKGPSLQDE--FADVVVEVADYLTGPIIRTPQGGFVTSEAADSYYRRGDKHMRE 418

Query: 443 GAFYVWTSKEVEDILG-------EHAILFKEHYY-LKPTGNCDLSRMSDPHNEFKGKNVL 494
           GA+Y+WT +E + ++G       +HA+     Y+ +   GN  + + +DP +EF  +NVL
Sbjct: 419 GAYYLWTRREFDQVVGGSGTSSDDHALAVAAAYWNVLEDGN--VPQENDPFDEFINQNVL 476

Query: 495 IELNDSSASASKLGMPLEKYLNILGECRRKLFDVRSK-RPRPHLDDKVIVSWNGLVISSF 553
               D    + + GMP  +   ++ + R KL   R K R RP  D+KV+VS NG+VIS+ 
Sbjct: 477 CVNRDVVELSRQFGMPQAEIRRVVDDARAKLRAHREKERVRPERDEKVVVSTNGMVISAL 536

Query: 554 ARASKILKSEAESAMFNFPVVGSDR-KEYMEVAESAASFIRRHLYDEQT---HRLQHSFR 609
           AR +  LK            V  +R   Y++ AE AASFI+  L+DE+    + L+  + 
Sbjct: 537 ARTAAALKG-----------VDDERAARYLKAAEQAASFIKEKLWDEKQTAGNPLRRFWY 585

Query: 610 NGPSKAPGFLDDYAFLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLD----------- 658
             PS    F DDYAFLI GLLDLY      KW  WA +LQ+ Q  LF D           
Sbjct: 586 QRPSDTKAFADDYAFLIEGLLDLYTTTLDKKWADWAKQLQDAQIRLFYDPIVPATTGAQP 645

Query: 659 -----REGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYR 713
                  GG Y N        +LR+K   D ++PS N+V+  NL RL ++ A   S  Y 
Sbjct: 646 SPRQAYSGGFYSNELAAISPTILRLKSGMDKSQPSTNAVAAANLFRLGALFA---SKEYT 702

Query: 714 QNAEHSLAVFETRL 727
             A  ++  FE  +
Sbjct: 703 SLARETVNAFEAEV 716


>gi|282897059|ref|ZP_06305061.1| Protein of unknown function DUF255 [Raphidiopsis brookii D9]
 gi|281197711|gb|EFA72605.1| Protein of unknown function DUF255 [Raphidiopsis brookii D9]
          Length = 657

 Score =  351 bits (900), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 230/673 (34%), Positives = 349/673 (51%), Gaps = 78/673 (11%)

Query: 134 ARKRDVPIFLSIGYSTCHWCHVMEVESFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMT 193
           A+  D PIFLSIGYS+CHWC VME E+F D  +A+ +N  F+ IKVDREERPD+D +YM 
Sbjct: 2   AKTEDKPIFLSIGYSSCHWCTVMEGEAFSDLAIAEYMNANFIPIKVDREERPDIDSIYMQ 61

Query: 194 YVQALYGGGGWPLSVFLSP-DLKPLMGGTYFPPEDKYGRPGFKTILRKVKDAWDKKRDML 252
            +Q + G GGWPL+ FLSP DL P   GTYFP   +YGRPGF  +L+ ++  +D +++  
Sbjct: 62  SLQMMTGQGGWPLNAFLSPDDLVPFYAGTYFPVSPRYGRPGFLEVLQAIRHYYDHQKEDF 121

Query: 253 AQSGAFAIEQLSEALSASASSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRP 312
            Q  A  +E L   LS++   N    +   +      +Q  ++             FP  
Sbjct: 122 RQRKASILESL---LSSTVLQNHGSGQFAHSQFHRFLKQGWETAIGVITPRQMGNSFPMI 178

Query: 313 VEIQMMLYHSKKLEDTGKSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHV 372
              Q++L  ++          A++G +M       +A GGI+DHVGGGFHRY+VD  W V
Sbjct: 179 PYCQLVLQGTRF-----NYPSANDGLEMATQRGLDLALGGIYDHVGGGFHRYTVDATWTV 233

Query: 373 PHFEKMLYDQGQLANVYLDAFSL-TKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADS 431
           PHFEKMLYD GQ+     + +S   +++ +       + +L R+MI P G  ++A+DADS
Sbjct: 234 PHFEKMLYDNGQIVEYLANLWSAGVEELAFKRAVAGTVSWLEREMISPTGYFYAAQDADS 293

Query: 432 AETEGATRKKEGAFYVWTSKEVEDILGEHAIL-FKEHYYLKPTGNCDLSRMSDPHNEFKG 490
                    +EGAFYVW+  E++++L +  +L  KEH+ +   GN            F+G
Sbjct: 294 FNYSTDMEPEEGAFYVWSYGELQELLSDQELLELKEHFSVSLEGN------------FEG 341

Query: 491 KNVLIELNDSSASASKLGMPLEKYLNILGECR--------------RKLFDVRSK----R 532
           KNVL  L     SA +LG  LE  L  L   R              R  ++ ++     R
Sbjct: 342 KNVLQRL-----SAGELGSSLELILGRLFLSRYGQTAETLTIFPPARNNYEAKTNPWHGR 396

Query: 533 PRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEYMEVAESAASFI 592
             P  D K+IV+WN L+IS  ARAS++ +                +  Y+++A  A  FI
Sbjct: 397 IPPVTDTKMIVAWNSLMISGLARASQVFQ----------------QPSYLKLAVKATRFI 440

Query: 593 -RRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG-TKWLVWAIELQN 650
             R   + + HRL +   +G        +DYA  I  LLDL++  SG + WL  AI LQ+
Sbjct: 441 LDRQFVNGRFHRLNY---DGEPTVLAQSEDYALFIKALLDLHQADSGSSSWLEQAIALQD 497

Query: 651 TQDELFLDREGGGYFNTTGEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKS 709
             +E  L  E GGYFNT+ ++   +++R +   D A PS N V++ NL++L+ +   + +
Sbjct: 498 EFNEFLLSVELGGYFNTSSDNSQDLIIRERNFVDNATPSANGVAIANLIKLSLL---TDN 554

Query: 710 DYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLAA 769
            YY   AE +L  F T ++    + P +  A+D       ++  LV  +S++D   +LA+
Sbjct: 555 LYYLDLAESALKAFSTMIEKSPQSCPSLLIASDWY-----RNSTLV--RSNIDNIKILAS 607

Query: 770 AHASYDLNKTVSK 782
            +    +   +SK
Sbjct: 608 QYLPTTVFDVISK 620


>gi|83594951|ref|YP_428703.1| hypothetical protein Rru_A3622 [Rhodospirillum rubrum ATCC 11170]
 gi|386351716|ref|YP_006049964.1| hypothetical protein F11_18535 [Rhodospirillum rubrum F11]
 gi|83577865|gb|ABC24416.1| Protein of unknown function DUF255 [Rhodospirillum rubrum ATCC
           11170]
 gi|346720152|gb|AEO50167.1| hypothetical protein F11_18535 [Rhodospirillum rubrum F11]
          Length = 680

 Score =  350 bits (899), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 231/639 (36%), Positives = 323/639 (50%), Gaps = 70/639 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYLLQH  NPV W  WGEEAFAEAR  + P+ LSIGYS CHWCHVM  ESFE
Sbjct: 4   NRLGEETSPYLLQHKDNPVHWLPWGEEAFAEARALNRPVLLSIGYSACHWCHVMAHESFE 63

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
              VA+++N  FVS+KVDREE PDVD +Y   +  +   GGWPL++FL+P+ +P+ GGTY
Sbjct: 64  HPQVAEVMNALFVSVKVDREEHPDVDALYQGALALMGEQGGWPLTLFLTPEGEPVTGGTY 123

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP E +YGRPGF  +LR+V + +    D +A++ A   E L+E ++A   +  +   LPQ
Sbjct: 124 FPREPRYGRPGFVQVLRQVSEIFRSAPDKVAETAARLREALAE-MNAGDRAGGVALSLPQ 182

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L   A  L    D   GG   APKFP P     +     +  D       ++ +  V 
Sbjct: 183 --LDDAARALLSHIDGVAGGLSGAPKFPMPFVFDFLWRAYLRTGD-------AKLRAAVT 233

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL+ MA+GGI+DH+ GGF RYS D  W  PHFEKMLYD  QL  +    +  T+    +
Sbjct: 234 LTLERMAQGGIYDHLAGGFARYSTDSLWLAPHFEKMLYDNAQLIALMTLVWKTTRSPLLA 293

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHAI 462
                 + + + +MIG  G   ++ DADS   EG     EG FYVW   E++  LGE A 
Sbjct: 294 RRIAQTVAWAQSEMIGDNGAFAASLDADS---EGG----EGRFYVWDEAEIDAALGEQAA 346

Query: 463 LFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGECR 522
           LFK+ Y + P GN            ++G+ +L        + + L  P      +L E +
Sbjct: 347 LFKQAYDVTPQGN------------WEGRTIL--------NRATLSQPPTHASGLLDEGK 386

Query: 523 RKLF------------DVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
                           D R +RPRP  DDKV+  WNGL+I++ A A +            
Sbjct: 387 EDAIDAALAPARALLLDRRGQRPRPGRDDKVLADWNGLMIAALAEAGEA----------- 435

Query: 571 FPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLL 630
                  R E++ +  SA   +   +  E   RL H++  G       L+DYA +I   L
Sbjct: 436 -----LSRPEWVALGRSAFDAVVATMSREDG-RLGHAWCAGRLGETALLEDYAGMIHAAL 489

Query: 631 DLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGN 690
            L+       +L  A     T +  + D   GGYF +  +  ++++R +   D A+PSGN
Sbjct: 490 ALHGISGEAAFLTQAQVWAETVERQYRDPR-GGYFQSAADASALIVRTRGLIDSAQPSGN 548

Query: 691 SVSVINLVRLASIVAGSKSDYYRQNAEHSLAVFETRLKD 729
            +    L RL  +   +  + YRQ AE  LA +   L +
Sbjct: 549 GLLAQGLARLFLL---TGKELYRQRAEDILASYGASLSE 584


>gi|433602620|ref|YP_007034989.1| hypothetical protein BN6_07870 [Saccharothrix espanaensis DSM
           44229]
 gi|407880473|emb|CCH28116.1| hypothetical protein BN6_07870 [Saccharothrix espanaensis DSM
           44229]
          Length = 655

 Score =  350 bits (899), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 219/592 (36%), Positives = 296/592 (50%), Gaps = 85/592 (14%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           TNRLA+  SPYLLQHA NPV W  WG EAFAEAR+R VP+ LS+GY+ CHWCHVM  ESF
Sbjct: 2   TNRLASATSPYLLQHADNPVHWHPWGPEAFAEARERGVPVLLSVGYAACHWCHVMAHESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           ED   A+ +N+ FV++KVDREERPDVD VYM   QAL G GGWP++ FL+   +P   GT
Sbjct: 62  EDAVTAEYMNEHFVNVKVDREERPDVDAVYMAVTQALSGHGGWPMTCFLTTAGEPFYAGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE-L 280
           Y+PP  + G P F+ +L  +  AW ++ D + +S A  + QL        +   LP   +
Sbjct: 122 YYPPTPRPGMPSFRQVLEAITHAWREQGDEVRESAASIVSQL--------AFKPLPQSTV 173

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
             + L      L   +D   GGFG APKFP  + ++ +L   +  E TG    + E   M
Sbjct: 174 DADVLDGAVVSLLGHFDRANGGFGGAPKFPPSMVLEFLL---RDHERTG----SVEALSM 226

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
           V  T   MA GG++D + GGF RYSVD  W VPHFEKMLYD   L  VY           
Sbjct: 227 VRATCDAMANGGLYDQLAGGFARYSVDAGWVVPHFEKMLYDNALLLRVYTHLSRRDPAER 286

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-- 458
           Y  + R+  ++L R +  P G   ++ DAD+   EG+T       YVWT  ++ D+LG  
Sbjct: 287 YRAVVRETAEFLLRTLGTPQGGFAASLDADTDGVEGST-------YVWTPAQLADVLGPV 339

Query: 459 ---EHAILF--KEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
                A+L+   E    +  G   L  + DP  E  G                       
Sbjct: 340 EGARAAVLYGVTEEGTFE-DGASTLRLLGDPDPEIAG----------------------- 375

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
                     KL  VR +RP+P  DDKV+ +WNGL I++ A A  +         F  P 
Sbjct: 376 ----------KLLAVREQRPQPGRDDKVVTAWNGLAIAALAEAGSV---------FGEP- 415

Query: 574 VGSDRKEYMEVAESAASFIRR-HLYDEQTHRLQHSFRNG-PSKAPGFLDDYAFLISGLLD 631
                  ++  AE AA  +   HL      RL  + R+G    A G L+DY     GLL 
Sbjct: 416 ------RWVVAAERAADLLLDVHLVG---GRLLRTSRDGVAGTAAGVLEDYGCFADGLLA 466

Query: 632 LYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 683
           L++     +WL  A EL +T    F   E G Y++T  +  +++ R  +  D
Sbjct: 467 LHQATGSQRWLTVACELLDTALARFAGAEPGVYYDTADDAEALVQRPSDPSD 518


>gi|348170966|ref|ZP_08877860.1| hypothetical protein SspiN1_10719, partial [Saccharopolyspora
           spinosa NRRL 18395]
          Length = 621

 Score =  350 bits (899), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 215/583 (36%), Positives = 306/583 (52%), Gaps = 61/583 (10%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRLA   SPYLLQHA NPVDW+ W  E FAEAR+RDVP+ LS+GY+ CHWCHVM  ESF
Sbjct: 2   ANRLANATSPYLLQHADNPVDWWPWSPEVFAEARRRDVPVLLSVGYAACHWCHVMVHESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           ED  +A+++N+ FV++KVDREERPD+D VYM   QA+ G GGWP++ FL+PD +P   GT
Sbjct: 62  EDPEIARVMNENFVNVKVDREERPDIDSVYMEATQAMTGQGGWPMTCFLTPDGEPFHCGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE-L 280
           Y+PP+   G P F  +L  V  AW  + + + ++    +EQL      +A    LP+  L
Sbjct: 122 YYPPQPMSGMPSFGQLLHAVAQAWSGRGEEVRKAATRVVEQL------AAQRAPLPESIL 175

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
            ++ L     +L   +D+  GGFG APKFP  + ++ +L H +++   G   +     ++
Sbjct: 176 DEDVLAGAVSRLHAEFDAVHGGFGGAPKFPPSMVLEFLLRHHERV---GMPEDGHSALEL 232

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
              T   MA+GGI+D + GGF RYSVD  W VPHFEKMLYD   L   Y+    L  +  
Sbjct: 233 AEATCSAMARGGIYDQLAGGFARYSVDAAWVVPHFEKMLYDNALLLRTYVHLARL-GNSL 291

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
              + R   ++L  D+  P G   ++ DAD   TEGA    EG  YVWT  ++ ++LG  
Sbjct: 292 GERVARATAEFLLHDLRTPEGGFAASLDAD---TEGA----EGLTYVWTPDQLREVLGP- 343

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
                    +       L  +++      G + L   +D    A                
Sbjct: 344 ---------VDGEWAVQLFEVTEAGTFENGASTLQLKHDPDDPARWR------------R 382

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            R +L + R +RP+P  DDKV+ +WNG+ I++ A A+++L                D   
Sbjct: 383 VRERLREARDQRPQPDKDDKVVTAWNGMAITALAEAAEVL----------------DEPR 426

Query: 581 YME-VAESAASFIRRHLYDEQTHRLQHSFRNGP-SKAPGFLDDYAFLISGLLDLYEFGSG 638
           +++  A++A   + RHL D    RL+ + RNG    A G LDDY     GLL L++    
Sbjct: 427 WIDAAAKAAELLLERHLID---GRLRRTSRNGAVGTAAGVLDDYGCFADGLLALHQATGE 483

Query: 639 TKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKED 681
            +WL  A  L +T  E F D +  G F  T  D   L+R   D
Sbjct: 484 PRWLEAACSLLDTALEQFADADHPGMFYDTAADAESLVRRPSD 526


>gi|363422908|ref|ZP_09310981.1| hypothetical protein AK37_19808 [Rhodococcus pyridinivorans AK37]
 gi|359732625|gb|EHK81638.1| hypothetical protein AK37_19808 [Rhodococcus pyridinivorans AK37]
          Length = 664

 Score =  350 bits (899), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 222/610 (36%), Positives = 309/610 (50%), Gaps = 71/610 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLA   SPYL QHA NPV W  WG++A AEAR+RDVPI LSIGY+ CHWCHVM  ESFE
Sbjct: 3   NRLADALSPYLRQHADNPVHWQEWGDDALAEARERDVPILLSIGYAACHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE  A ++N+ FV IKVDREERPD+D VYM    A+ G GGWP++ FL+PD  P   GTY
Sbjct: 63  DEATAAVMNENFVCIKVDREERPDIDAVYMNATVAMAGQGGWPMTCFLTPDGSPFYCGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLP-DELP 281
           +P   + G P F  +L  +   W  +RD ++Q+      +L        SS  LP  E  
Sbjct: 123 YPNTPRGGMPSFVQLLEAITQTWHNRRDEVSQAADAVATELRR------SSGGLPVGEAA 176

Query: 282 QNALRL--CAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTGKSGEASEG 337
             A+ L   A  ++   D   GGFG APKFP    ++ +L  Y   +  DT         
Sbjct: 177 VEAVLLDAAAAAIATDEDREHGGFGGAPKFPPSNLLEGLLRGYERTRSADT--------- 227

Query: 338 QKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTK 397
             +V  T   MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD   L  +Y     +T 
Sbjct: 228 LGLVERTTDAMARGGIYDQLGGGFARYSVDAAWTVPHFEKMLYDNALLLRLYAHLARVTG 287

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
               + + R+  ++L RD++   G   SA DAD+   EG T       YVWT  ++ ++L
Sbjct: 288 AELPTRVTRETAEFLLRDLLTTDGGFASALDADTDGVEGLT-------YVWTPDQLVEVL 340

Query: 458 G-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLN 516
           G +      E + + P G             F+    +++L D     ++          
Sbjct: 341 GADDGRWAAEAFTVTPGGT------------FEHGTSVLQLLDEPDDPAR---------- 378

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
            L + R +LF  R  R +P  DDKV+ +WNG  I++ A A   L   A            
Sbjct: 379 -LADVRARLFAARQDRAQPGRDDKVVTAWNGFAITALAEAGIALGEPA------------ 425

Query: 577 DRKEYMEVAESAASF-IRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
               +++ A   A F + RHL D +  R   S         G L+DY  L++ LL +++ 
Sbjct: 426 ----WIDAAARCARFLLDRHLVDGRLRR--ASLGGVVGSPVGVLEDYGALVTALLAVHQG 479

Query: 636 GSGTKWLVWAIELQNTQDELFLDRE-GGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
                W+  A EL +     F D E  G +F+T  +  S++ R ++  DGA PSG S+  
Sbjct: 480 TGDRSWVERARELADVALTQFADPERPGSWFDTAHDAESLVARPRDPVDGATPSGASLIA 539

Query: 695 INLVRLASIV 704
             L+ L+++V
Sbjct: 540 EALLGLSALV 549


>gi|317125355|ref|YP_004099467.1| hypothetical protein Intca_2231 [Intrasporangium calvum DSM 43043]
 gi|315589443|gb|ADU48740.1| protein of unknown function DUF255 [Intrasporangium calvum DSM
           43043]
          Length = 661

 Score =  350 bits (898), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 228/612 (37%), Positives = 308/612 (50%), Gaps = 81/612 (13%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           +NRLAA  SPYLLQH  NPVDW  WGE AFAEAR+R+VP+ LS+GY+ CHWCHVM  ESF
Sbjct: 2   SNRLAAATSPYLLQHRDNPVDWQEWGESAFAEARERNVPVLLSVGYAACHWCHVMAHESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           EDE VA+ LN+ FVS+KVDREERPD+D VYM  V A  G GGWP++ FL+P+ +P   GT
Sbjct: 62  EDEAVAQALNERFVSVKVDREERPDIDAVYMAAVTATTGHGGWPMTCFLTPEGEPFFCGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEAL-SASASSNKLPDEL 280
           YFP      R  F  ++  V +AW  + + +  SG      L E L S    +  L D  
Sbjct: 122 YFP------RDHFLQLVAAVDEAWRTREEEVRASGLHITSALREGLASPEPYAAGLAD-- 173

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
               L      LS  +DS  GGFG APKFP  + ++ +L H       G++G+      M
Sbjct: 174 ----LDRAVTLLSGQFDSGAGGFGGAPKFPPSMVLEFLLRHH------GRTGD-DVSLAM 222

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
              TL+ MA+ G++D VGGGF RYSVD +W VPHFEKMLYD   L  VY   + L ++  
Sbjct: 223 ADRTLEAMARSGMYDQVGGGFARYSVDAKWVVPHFEKMLYDNALLLRVYAHWWRLGQNPL 282

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE- 459
              + R+  ++L  ++    G   S+ DAD+   EG T       YVWT  ++ ++LGE 
Sbjct: 283 AEKVARETAEFLLTELRTTDGGFASSLDADTQGVEGLT-------YVWTPAQLAEVLGEA 335

Query: 460 ----HAILF--KEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEK 513
                A LF   EH   +  G   L  ++DP +    ++V                    
Sbjct: 336 DGARAADLFSVSEHGTFE-HGTSTLQLLTDPDDREWFRDV-------------------- 374

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
                   R +L   R+KRP+P  DDKV+ SWNGL I++ A A          A+F  P 
Sbjct: 375 --------RTRLAQARAKRPQPGRDDKVVTSWNGLAITALAEA---------GAIFEEPA 417

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
             +       +       +  H+ D    R   S       A G  DDY  +  GLL L+
Sbjct: 418 WVAAAVASANL------VLDLHVVDGGLRRA--SRDGRAGAAAGVADDYGNVAEGLLSLH 469

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
           +     +WL  A  L     + F   E GG+ +T  +   + LR +   D AEPSG S  
Sbjct: 470 QATGEARWLTVAGHLLRQARDRF-GAEDGGFHDTAADAEQLFLRPRSGADNAEPSGQSAI 528

Query: 694 VINLVRLASIVA 705
            + LV L ++  
Sbjct: 529 AVALVTLGALTG 540


>gi|378717042|ref|YP_005281931.1| hypothetical protein GPOL_c15160 [Gordonia polyisoprenivorans VH2]
 gi|375751745|gb|AFA72565.1| protein of unknown function DUF255 [Gordonia polyisoprenivorans
           VH2]
          Length = 678

 Score =  349 bits (896), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 214/589 (36%), Positives = 301/589 (51%), Gaps = 60/589 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L A  SPYL QHA NPV W  WG+ A AEA +RDVP+ LS+GY+ CHWCHVM  ESFE
Sbjct: 10  NELGAATSPYLRQHADNPVHWREWGDGALAEAARRDVPVLLSVGYAACHWCHVMAHESFE 69

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE  A  +N  FV IKVDREERPD+D +YM    A+ G GGWP++ FL+P  +P   GTY
Sbjct: 70  DEATAAQMNAEFVCIKVDREERPDIDAIYMNATVAMTGQGGWPMTCFLTPGGEPFYCGTY 129

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP   + G P F+ +L  + +AW ++RD ++  G    + L    +A  +      E+  
Sbjct: 130 FPDSPRNGMPSFRQLLTAITEAWTQRRDEVSDVGRKVRDHLHANAAALPAGAL---EVDD 186

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L      +    D   GGFG APKFP    ++ +L H+   E TG      E      
Sbjct: 187 RLLAHAVNTVLGDEDRESGGFGGAPKFPPSALLEALLRHT---EYTGT----PEALDAAH 239

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            T + MA+GGIHD + GGF RY+VD  W VPHFEKMLYD  QL  VY     +T D   +
Sbjct: 240 RTCEAMARGGIHDQLAGGFARYAVDNDWVVPHFEKMLYDNAQLLRVYAHLARITGDPLAT 299

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE-HA 461
            +  +I+++LRRD+  PGG   SA DAD+A  EG+T       YVWT  ++ ++LG+   
Sbjct: 300 RVTGEIVEFLRRDLQVPGG-FASALDADAAGVEGST-------YVWTPTQLTEVLGDADG 351

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
               E + +  TG  +            G + L    D        G           + 
Sbjct: 352 QWAAELFGVTATGTFE-----------HGTSTLQFRLDPD------GFDTPAVRERFDDV 394

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           RR+L   R+ RP+P  DDKV+  WN + +++ A A                  G    E+
Sbjct: 395 RRRLLAARADRPQPARDDKVVTGWNAIAVTALAEAG----------------AGLGHPEW 438

Query: 582 MEVA-ESAASFIRRHLYDEQTHRLQHSFRNGPSKAP-GFLDDYAFLISGLLDLYEFGSGT 639
           +++A E A + +  H+ D    RL+ +   G    P   LDD+A L++ LL L++     
Sbjct: 439 IDLAREVAVTLLAEHVRD---GRLRRASLGGIVGDPVAALDDHAALVTALLTLHQVTGEI 495

Query: 640 KWLVWAIELQNTQDELFLDR-EGGGYFNTTGEDPSVLLRVKEDHDGAEP 687
                A+EL +T  E+F D  E G +++  G D  ++ R ++  DGA P
Sbjct: 496 SHRDQALELLDTTIEIFADADEPGSWYDAAGTD--LIARPRDPIDGATP 542


>gi|424851297|ref|ZP_18275694.1| transcriptional regulator [Rhodococcus opacus PD630]
 gi|356665962|gb|EHI46033.1| transcriptional regulator [Rhodococcus opacus PD630]
          Length = 671

 Score =  349 bits (896), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 228/637 (35%), Positives = 306/637 (48%), Gaps = 87/637 (13%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           +  N L    SPYL QHA NPV W  WG  A   AR+RDVPI LSIGYS CHWCHVM  E
Sbjct: 4   RAQNTLGGSTSPYLRQHADNPVHWQQWGPAATEWARERDVPILLSIGYSACHWCHVMAHE 63

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFEDE VA L+N+ FV +KVDREERPD+D VYM    A+ G GGWP++ FL+PD  P   
Sbjct: 64  SFEDEAVASLMNEHFVCVKVDREERPDLDAVYMNATVAMTGQGGWPMTCFLTPDGAPFYC 123

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           GTY+P E + G P F  +L  + D W  +R  +  + A  + +L                
Sbjct: 124 GTYYPAEPRGGMPSFTQLLGAIADTWRDRRGDVDDAAASVVAELRRGAGG---------- 173

Query: 280 LPQNALR-------LCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSG 332
           +P+  +R         A  + +  D+  GGFG APKFP    ++ +L   ++  D    G
Sbjct: 174 IPEGDVRVDAALLDAAAGAVLRDEDADRGGFGGAPKFPPSALMEGLLRTYERSGDDDVLG 233

Query: 333 EASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDA 392
                  +V  T   MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD  QL  VY   
Sbjct: 234 -------VVARTASAMARGGIYDQLGGGFARYSVDAAWVVPHFEKMLYDNAQLLRVYAHL 286

Query: 393 FSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKE 452
              T       +  + +++L RD+    G   SA DAD+   EG T       YVWT ++
Sbjct: 287 GRRTGSDLAVRVTEETVEFLLRDLRTDNGSFASALDADTEGVEGLT-------YVWTPEQ 339

Query: 453 VEDILGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS--ASASKLGMP 510
           + ++LG                         P +      V     D +  A  S L +P
Sbjct: 340 LVEVLG-------------------------PEDGEWAARVFAVTADGTFEAGTSVLQLP 374

Query: 511 -----LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 565
                 +++  I G     L D R+ RP+P  DDKV+ +WNGL I++ A A         
Sbjct: 375 RDPDDWDRWRRIRG----TLLDQRATRPQPGRDDKVVTAWNGLTITALAEAG-------- 422

Query: 566 SAMFNFPVVGSDRKEYMEVAESAASFI-RRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAF 624
                    G  R E++  A   A  +   H+ D +  R       G S   G L+DYA 
Sbjct: 423 --------AGLGRPEWVAAAADCARAVLGLHVVDGRLRRASLGTSVGESA--GVLEDYAC 472

Query: 625 LISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDRE-GGGYFNTTGEDPSVLLRVKEDHD 683
           L +GLL LY+    T WL  A  L +     F D E  G +F+T  +  +++ R ++  D
Sbjct: 473 LATGLLALYQATGDTAWLTHAQALLDRALIHFADDERPGTWFDTADDAETLVTRPRDPVD 532

Query: 684 GAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEHSL 720
           GA PSG S  V  LV  A++ +   S  Y   A  SL
Sbjct: 533 GATPSGASCLVEALVTAAAVTSADASGRYASAAAESL 569


>gi|309811967|ref|ZP_07705733.1| conserved hypothetical protein [Dermacoccus sp. Ellin185]
 gi|308434025|gb|EFP57891.1| conserved hypothetical protein [Dermacoccus sp. Ellin185]
          Length = 697

 Score =  349 bits (896), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 220/608 (36%), Positives = 299/608 (49%), Gaps = 73/608 (12%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
            NRLAA  SPYL QHA NPVDW  WG+EAFAEAR RDVP+ LS+GY+ CHWCHVM  ESF
Sbjct: 2   ANRLAASLSPYLRQHADNPVDWHEWGDEAFAEARHRDVPVLLSVGYAACHWCHVMAHESF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           ED  +A  L   FV++KVDREERPDVD VYM   QAL G GGWP++V L+PD +P   GT
Sbjct: 62  EDAAIAAQLAKGFVAVKVDREERPDVDAVYMNVTQALTGHGGWPMTVLLTPDGEPFYAGT 121

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASAS--------- 272
           YFP E       F ++L  + + W   R  +  +    +E +     A A+         
Sbjct: 122 YFPREQ------FSSLLHSIGELWRDDRARVEGAARSIVEAMQTRSRADATGLGPGGDDL 175

Query: 273 ---SNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTG 329
               ++   +L    L      L + +D   GGFG APKFP  + ++ +L H  +  D  
Sbjct: 176 LGQGDRAERQLVGVDLTRAVVGLRRQFDDSRGGFGGAPKFPPSMTLEHLLRHHARTGD-- 233

Query: 330 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 389
                ++   M   T + MA+GG++D + GGF RYSVD  W VPHFEKMLYD  QL  VY
Sbjct: 234 -----ADALAMARRTGEAMARGGMYDQLDGGFARYSVDADWVVPHFEKMLYDNAQLLRVY 288

Query: 390 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 449
              +  T D +   +  +  D++ R +    G   SA DAD+   EG T       YVW 
Sbjct: 289 AHLWRATGDDWARRVTYETADFIMRRLGTSEGAFASALDADTDGVEGLT-------YVWN 341

Query: 450 SKEVEDILGEH------AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSAS 503
           ++E+ ++LG         +L    +     G   L    DP   F  +     L D S  
Sbjct: 342 AEELVEVLGRSDGARAAELLGVTRHGTFEDGRSTLQLRRDPAELFSPEV----LGDRSPD 397

Query: 504 ASKLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSE 563
           A               + R +L  VR++RP+P  DDKV+ SWNGL I++ A A  IL+  
Sbjct: 398 A------------WWSDVRARLRSVRAERPQPARDDKVVTSWNGLAIAALAEAGMILEQP 445

Query: 564 AESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYA 623
           +  A                  E+A   +  H+ D +  R   S +   S+A    DDY 
Sbjct: 446 SWVAAAR---------------EAADVVLATHVVDGRLRRA--SLKGRVSEALACADDYG 488

Query: 624 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHD 683
            L  GLL L++    T+    AI L +    LF D  G   ++T  +   + +R + D D
Sbjct: 489 NLAEGLLVLHQANGETRHAEVAIGLLDDAARLFFD--GDTVYDTGSDASQLFIRPRSDGD 546

Query: 684 GAEPSGNS 691
            AEP G S
Sbjct: 547 NAEPCGAS 554


>gi|409389284|ref|ZP_11241136.1| hypothetical protein GORBP_039_00820 [Gordonia rubripertincta NBRC
           101908]
 gi|403200576|dbj|GAB84370.1| hypothetical protein GORBP_039_00820 [Gordonia rubripertincta NBRC
           101908]
          Length = 662

 Score =  349 bits (896), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 215/627 (34%), Positives = 310/627 (49%), Gaps = 78/627 (12%)

Query: 105 LAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFEDE 164
           L +  SPYL QHA NPV W  W + A  +AR+RDVP+ LS+GY+ CHWCHVM  ESFED+
Sbjct: 2   LGSATSPYLRQHADNPVHWQEWSDAALKQARERDVPVLLSVGYAACHWCHVMAHESFEDD 61

Query: 165 GVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTYFP 224
             A  +N  FV +KVDREERPD+D +YM+   A+ G GGWP++ FL+PD  P   GTY+P
Sbjct: 62  ATAAQMNRDFVCVKVDREERPDIDAIYMSATVAMTGQGGWPMTCFLTPDGDPFYAGTYYP 121

Query: 225 PEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQNA 284
           P      P F+ +L  V++AW ++R  L  + A   E +       A+++ LP+      
Sbjct: 122 PRPHGQIPSFRQVLTAVREAWTQRRADLDDTAAKVREHI------VANTSPLPEGTVAVD 175

Query: 285 LRLCAEQLSKSY---DSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
            RL A  +       D+  GGFG APKFP    ++ ++ H+++  D      A       
Sbjct: 176 DRLLAHGVRTVLDEEDTELGGFGGAPKFPPSALLEALIRHTERTGDAAAIEAAGR----- 230

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             T+  M +GGI+D + GGF RYSVD  W VPHFEKMLYD  QL   Y      T D   
Sbjct: 231 --TMHAMGRGGIYDQLAGGFARYSVDAGWVVPHFEKMLYDNAQLLRAYAHLARRTGDPLA 288

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EH 460
             +  + + ++RRD+  PGG   S+ DAD+ E EG+T       YVWT  E+ ++LG E 
Sbjct: 289 RRVVEETIAFIRRDLRVPGG-FASSLDADADEVEGST-------YVWTPAELAEVLGPET 340

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPL-----EKYL 515
                E + +   G  +  R                        S L +P      E++ 
Sbjct: 341 GRWAAELFVVTEQGTFEHGR------------------------STLQLPADPDDRERFD 376

Query: 516 NILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVG 575
            +    R  L + R +R +P  DDKV+  WN + I++ A A   L               
Sbjct: 377 TV----RAALLEARDRRVQPARDDKVVTVWNAMTITALAEAGAGL--------------- 417

Query: 576 SDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
            D     E    A   +  HL   +  R   S      ++ G LDD+A L + LL L++ 
Sbjct: 418 GDVSYVDEAIRCADELLTNHLVGGRLRR--SSLGGDVGESDGGLDDHAALSTALLTLFQV 475

Query: 636 GSGTKWLVWAIELQNTQDELFLDREG-GGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
              T+WL   + L ++  E F D E  G +F+ TGE   ++ R ++  DGA PSG S+  
Sbjct: 476 TGETRWLGAGLGLLDSAVERFADPEAPGAWFDATGE--GLIARPRDPIDGATPSGASLMA 533

Query: 695 INLVRLASIVAGSKSDYYRQNAEHSLA 721
             L+  + +   +K+  Y +  EHSL+
Sbjct: 534 EALLTASMLADSAKAVGYAELLEHSLS 560


>gi|384261487|ref|YP_005416673.1| hypothetical protein RSPPHO_01077 [Rhodospirillum photometricum DSM
           122]
 gi|378402587|emb|CCG07703.1| Putative uncharacterized protein [Rhodospirillum photometricum DSM
           122]
          Length = 742

 Score =  348 bits (893), Expect = 6e-93,   Method: Compositional matrix adjust.
 Identities = 215/599 (35%), Positives = 290/599 (48%), Gaps = 55/599 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QH  +PV W  WG EAFAEAR    PI LS+GY+ CHWCHVM  ESFE
Sbjct: 88  NRLGEETSPYLRQHRTHPVHWAPWGPEAFAEARATHRPILLSVGYAACHWCHVMAHESFE 147

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D  VA ++N  FV +KVDREERPD+D  Y   + A    GGWPL+VFL+P+ KP  GGTY
Sbjct: 148 DPAVADIVNALFVPVKVDREERPDIDAFYQAALAATGQPGGWPLTVFLTPEGKPFAGGTY 207

Query: 223 FPPEDKYGRPGFKTILRKVKD-AWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           FPPE + GRPGF  +L+ V + A     DM  Q+ A            +    +L D   
Sbjct: 208 FPPEPRQGRPGFVEVLKMVSNFARSHPEDMDRQADALTEALRPHPPEGAREGGRLED--- 264

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMV 341
              L      L    D   GG G APKFP P    +M   + + +D G           V
Sbjct: 265 ---LDAAVRALLAHIDPEHGGLGGAPKFPMPAVFALMHRVAHRTDDPGLG-------HAV 314

Query: 342 LFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFY 401
             +L  MA+GG++DH+ GGF RY+ D  W +PHFEKMLYD   L  +  + +  T+D   
Sbjct: 315 THSLTRMAQGGLYDHLAGGFARYATDAAWQIPHFEKMLYDNALLIELMTEVWRSTRDPLL 374

Query: 402 SYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEHA 461
           +   R  + +L R+M    G   ++ DAD+          EG F +W+  E++ +LG  A
Sbjct: 375 ARRVRQTVAWLDREMSAENGAFAASLDADN-------EAGEGGFALWSVGEIKALLGPLA 427

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
             F E Y + P G  +       HN       L++ +  +A        LE++L      
Sbjct: 428 PAFMEAYGVTPEGTWE------GHNILHRAGPLLDADAETA--------LEEHL---ASA 470

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           R  L+  R  RPRP  DDKV+  WNGLVI++ ARA  +    A         +   R  +
Sbjct: 471 RDLLWRAREHRPRPARDDKVLADWNGLVIAALARAGLVFGEPA--------WIARARHAW 522

Query: 582 MEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTKW 641
             +    A+  R        HRL HS  +G  +A   L+DYA L+   L LYE      +
Sbjct: 523 EGIL---ATMTR------PDHRLGHSLCHGRLQAEAMLEDYAGLMRAGLALYEITGEAPF 573

Query: 642 LVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
           L   +   NT +  + D +  GY  T      +  R +   D A PSG  + +    RL
Sbjct: 574 LEQVLAWANTVEGDYRDDDSPGYCQTARSAQDLPWRPRSFTDTATPSGTGLLLQAYARL 632


>gi|182678267|ref|YP_001832413.1| hypothetical protein Bind_1283 [Beijerinckia indica subsp. indica
           ATCC 9039]
 gi|182634150|gb|ACB94924.1| protein of unknown function DUF255 [Beijerinckia indica subsp.
           indica ATCC 9039]
          Length = 687

 Score =  348 bits (893), Expect = 6e-93,   Method: Compositional matrix adjust.
 Identities = 220/618 (35%), Positives = 308/618 (49%), Gaps = 61/618 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L+   SPYLLQHAHNPV W  W + A  EA+  + PI LS+GY+ CHWCHVM  ESFE
Sbjct: 4   NELSQAASPYLLQHAHNPVHWRMWTKAALEEAQALNKPILLSVGYAACHWCHVMAHESFE 63

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D   A ++N+ FV+IKVDREERPD+D +YM+ +QA    GGWPL++FL+P  +P  GGTY
Sbjct: 64  DPETAAVMNELFVNIKVDREERPDIDHIYMSALQAFGERGGWPLTMFLTPKGEPFWGGTY 123

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP  + +GRP F T+L+ V +A+DK+ + + ++     E L +  +    +      L  
Sbjct: 124 FPKVESFGRPAFVTVLKTVAEAFDKQPERITKNTEVVREGLGKRPAGEEGA-----ALSL 178

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             +   A Q+    D   GG   +PKFP     + +     ++            + +V 
Sbjct: 179 EQMNNLAPQMVNFIDQVDGGLRGSPKFPNTPIFEFLWRAGARISKVPY-------RDLVR 231

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            TL  M++GGI+DH+GGG+ RYS DERW VPHFEKMLYD  Q+  +    F    D  + 
Sbjct: 232 HTLDRMSEGGIYDHLGGGYARYSTDERWLVPHFEKMLYDNAQILELLALCFREFNDELFL 291

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHA 461
              R+ + +L R+M  P G   SA DADS   EG     EG FYVW  +E+   LG E A
Sbjct: 292 TRARETVGWLHREMTSPEGAFCSALDADS---EGV----EGKFYVWVWEELVQTLGVEDA 344

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
           I F + Y        +  R+ +   E  G  V I LN   +       P ++    L   
Sbjct: 345 IYFGKFY--------NAGRIGNWAEEKHGAMVTI-LNRLESH-----RPSDEEEERLAPM 390

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           R+KLF VR KR  P LDDK++  WNGL+I+S   A+                   D  E+
Sbjct: 391 RQKLFAVREKRVHPGLDDKIMADWNGLMIASLVNAATTF----------------DAPEW 434

Query: 582 MEVAESAASFI--RRHLYDEQ-THRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSG 638
           + +A  A  FI    H  D+Q   RL HS+R G    P    DYA +    + L+E  + 
Sbjct: 435 ITIAAKAYDFIISTMHFIDDQGIKRLAHSWRAGVLVTPAMALDYAAMTRAAIALHEVRNH 494

Query: 639 TK--------WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGN 690
                     +L  AI      +    D E G       +   V+LR+    D A P+ +
Sbjct: 495 PAVSDILIRDYLADAITFAEQLETYHQDPESGLLCMAAKDANDVILRLSPTSDDAIPNAH 554

Query: 691 SVSVINLVRLASIVAGSK 708
            V +  L+RLA +    +
Sbjct: 555 PVFLTALIRLAGLTGDDR 572


>gi|359768980|ref|ZP_09272745.1| hypothetical protein GOPIP_085_00790 [Gordonia polyisoprenivorans
           NBRC 16320]
 gi|359313677|dbj|GAB25578.1| hypothetical protein GOPIP_085_00790 [Gordonia polyisoprenivorans
           NBRC 16320]
          Length = 678

 Score =  348 bits (893), Expect = 7e-93,   Method: Compositional matrix adjust.
 Identities = 220/600 (36%), Positives = 303/600 (50%), Gaps = 82/600 (13%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L A  SPYL QHA NPV W  WG+ A AEA +RDVP+ LS+GY+ CHWCHVM  ESFE
Sbjct: 10  NELGAATSPYLRQHADNPVHWREWGDGALAEAARRDVPVLLSVGYAACHWCHVMAHESFE 69

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE  A  +N  FV IKVDREERPD+D +YM    A+ G GGWP++ FL+P  +P   GTY
Sbjct: 70  DEATAAQMNAEFVCIKVDREERPDIDAIYMNATVAMTGQGGWPMTCFLTPGGEPFYCGTY 129

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP   + G P F+ +L  + +AW ++RD ++  G    + L    +A  +      E+  
Sbjct: 130 FPDSPRNGMPSFRQLLTAITEAWTQRRDEVSDVGRKVRDHLHANAAALPAGAL---EVDD 186

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L      +    D   GGFG APKFP    ++ +L H+   E TG      E      
Sbjct: 187 RLLAHAVNTVLGDEDRESGGFGGAPKFPPSALLEALLRHT---EYTGT----PEALDAAR 239

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            T   MA+GGIHD + GGF RY+VD  W VPHFEKMLYD  QL  VY     +T D   +
Sbjct: 240 RTCDAMARGGIHDQLAGGFARYAVDNDWVVPHFEKMLYDNAQLLRVYAHLARITGDPLAT 299

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE--- 459
            +  +I+++LRRD+  PGG   SA DAD+A  EG+T       YVWT  ++ ++LG+   
Sbjct: 300 RVTGEIVEFLRRDLRVPGG-FASALDADAAGVEGST-------YVWTPIQLTEVLGDADG 351

Query: 460 --HAILFK-------EHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 510
              A LF        EH      G   L    DP + F    V    +D           
Sbjct: 352 QWAAELFGVTASGTFEH------GTSTLQFRLDP-DGFDTPAVRERFDD----------- 393

Query: 511 LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
                      RR+L   R++RP+P  DDKV+  WN + +++ A A              
Sbjct: 394 ----------VRRRLLAARAERPQPARDDKVVTGWNAIAVTALAEAG------------- 430

Query: 571 FPVVGSDRKEYMEVA-ESAASFIRRHLYDEQTHRLQHSFRNGPSKAP-GFLDDYAFLISG 628
               G    E+ ++A E AA+ +  H+ D    RL+ +   G    P   LDD+A L++ 
Sbjct: 431 ---AGLGHPEWTDLAREVAATLLAEHVRD---GRLRRASLGGIVGDPVAALDDHAALVTA 484

Query: 629 LLDLYEFGSGTKWLVWAIELQNTQDELFLDR-EGGGYFNTTGEDPSVLLRVKEDHDGAEP 687
           LL L++          A+EL +T  E+F D  E G +++  G D  ++ R ++  DGA P
Sbjct: 485 LLTLHQVTGEISHRDQALELLDTTIEIFADADEPGSWYDAAGTD--LIARPRDPIDGATP 542


>gi|148560433|ref|YP_001259868.1| hypothetical protein BOV_1983 [Brucella ovis ATCC 25840]
 gi|148371690|gb|ABQ61669.1| conserved hypothetical protein [Brucella ovis ATCC 25840]
          Length = 666

 Score =  348 bits (893), Expect = 7e-93,   Method: Compositional matrix adjust.
 Identities = 225/615 (36%), Positives = 309/615 (50%), Gaps = 62/615 (10%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           +NRLA E S YL QHA+NPV W  WG +A   A++ D PI LSIGY+TCHWCHVM  ESF
Sbjct: 6   SNRLAGEPSAYLRQHANNPVHWQPWGRKALDAAKELDRPILLSIGYATCHWCHVMAHESF 65

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           ED+ VA ++N +F+++KVDREERPD+D++YM  + A+   GGWPL++FL PD KP  GGT
Sbjct: 66  EDDDVAAVMNAFFINVKVDREERPDIDQIYMAALGAMGQQGGWPLTMFLRPDGKPFWGGT 125

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP   ++  PGF  IL  V + W + +D +  +     + L   L  +A S  L +E+ 
Sbjct: 126 YFPRHARHNMPGFVDILHAVNNLWHRDKDKINHNAEAVFDHLEGRL--AAQSQPLQNEIS 183

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPR-PVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
           +      A ++    D + GG    PKFP  P    + L    +  +T +          
Sbjct: 184 R--FDDLANRIGSLIDPQRGGIEGVPKFPNAPFMDTLWLSWLYRHNETHRDN-------- 233

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
            L +L+ M +GGI+DH+GGG  RYS D  W VPHFEKMLYD  Q       AF+ T D  
Sbjct: 234 FLLSLKTMLQGGIYDHLGGGLCRYSTDAEWLVPHFEKMLYDNAQFIRHANYAFAETGDDL 293

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
           +     + +D+L R+M  P G   S+ DADS   EG    +EG FYVWT  E++ +LG  
Sbjct: 294 FRIRIEETVDWLIREMQLPDGCFASSLDADS---EG----EEGKFYVWTEDEIDAVLGTD 346

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
           A +FK  Y + P GN            ++GKN+L  L+  +A+ +    PL      +  
Sbjct: 347 AEVFKTFYAVTPGGN------------WEGKNILNRLH--AAAETPTPPPL------VEA 386

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            RRKL   R  R RP  DDK +  WNGL I + A A +                   R +
Sbjct: 387 ARRKLLAHRETRIRPGRDDKALTDWNGLAIRALAEAGRSFA----------------RTD 430

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           ++E A  A   I       Q  R+ H    G    P    DYA +I+  L LYE      
Sbjct: 431 WLEHAVQAYRSIGSSF---QDGRIAHCRMEGAFLYPALATDYAAMINAALALYEATGEFA 487

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
           ++  A + +   D    D  G    +  G D  V+L    D+D A PS  S  +  L RL
Sbjct: 488 YIDDARKFKRALDGSHRDSAGNYRLSALGAD-DVILHAYGDYDEAIPSATSQIIEALTRL 546

Query: 701 ASIVAGSKSDYYRQN 715
              +A   S  Y +N
Sbjct: 547 --FLATGDSALYEEN 559


>gi|311743136|ref|ZP_07716944.1| thioredoxin [Aeromicrobium marinum DSM 15272]
 gi|311313816|gb|EFQ83725.1| thioredoxin [Aeromicrobium marinum DSM 15272]
          Length = 697

 Score =  348 bits (893), Expect = 7e-93,   Method: Compositional matrix adjust.
 Identities = 225/620 (36%), Positives = 312/620 (50%), Gaps = 69/620 (11%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRLAA  SPYLLQHA NPVDW+ W +EA AEAR+RDVP+ LS+GY+ CHWCHVM  ESFE
Sbjct: 42  NRLAAATSPYLLQHADNPVDWWEWCDEALAEARRRDVPVLLSVGYAACHWCHVMAHESFE 101

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D   A  +ND FV++KVDREERPDVD VYM   QA+ G GGWP++  L+PD +P   GTY
Sbjct: 102 DATTAAYMNDHFVNVKVDREERPDVDAVYMRATQAMSGHGGWPMTCVLTPDGEPFFAGTY 161

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FPPE + G P F  +L+ + +AW ++RD +   G   +  L E      ++    D L  
Sbjct: 162 FPPEPRGGHPAFTQVLQALSEAWAERRDEVLTVGRDVVAHLRE------TTEPAGDRLGT 215

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L   A  L+  +D    GFG++PKFP  + ++ +L H+       ++G AS    MV 
Sbjct: 216 ADLDAAATALAGQFDDDAAGFGASPKFPPSMVLEFLLRHAD------RTGSASS-IAMVE 268

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            T + MA+GG++D + GGF RYSVD  W VPHFEKMLYD  QL  VYL  +  T      
Sbjct: 269 RTAEAMARGGLYDQLAGGFARYSVDRFWRVPHFEKMLYDNAQLVRVYLHLWRATGSPLAE 328

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILG-EHA 461
            + R+  D+L  ++    G   SA DADS          EG FYVW   ++   LG    
Sbjct: 329 RVVRETADFLLTELRTAEGGFASALDADS-------DGHEGTFYVWNPDQLLKTLGAADG 381

Query: 462 ILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGEC 521
               E   +  TG  +          F    +  + +D            E++  +    
Sbjct: 382 AWATELLQVSATGTFE--------RGFSTLQLPTDPDDP-----------ERWDRV---- 418

Query: 522 RRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKEY 581
           R +L   RS R RP  DDKV+ +WNGL +S+ A A  +L                D  EY
Sbjct: 419 RARLLAARSTRTRPDRDDKVVAAWNGLAVSALAEAGVLL----------------DVPEY 462

Query: 582 MEVAESAASFIRR-HLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           ++ A  AA  +   H       R       GP    G L+D+  +    L L       +
Sbjct: 463 VDAAVVAAELLATVHTAGGYLLRTSRDGVAGPHA--GVLEDHGAVAEAYLVLLGVTGDLR 520

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
           W   A  L +     F D   GG+F+T  +D  +++R ++  D A PSG S +   L+  
Sbjct: 521 WWQRAEPLLDRVLTDFAD-PSGGFFDTAEDD--LVVRPRDTSDNAYPSGTSAAAAALLTA 577

Query: 701 ASIVAGSKSDYYRQNAEHSL 720
           A++    +   +R+ AE +L
Sbjct: 578 AAVTGEQR---WREGAESAL 594


>gi|397736226|ref|ZP_10502910.1| hypothetical protein JVH1_7484 [Rhodococcus sp. JVH1]
 gi|396928069|gb|EJI95294.1| hypothetical protein JVH1_7484 [Rhodococcus sp. JVH1]
          Length = 671

 Score =  348 bits (892), Expect = 8e-93,   Method: Compositional matrix adjust.
 Identities = 227/626 (36%), Positives = 304/626 (48%), Gaps = 65/626 (10%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           +  N L    SPYL QHA NPV W  WG EA   AR+RDVPI LSIGYS CHWCHVM  E
Sbjct: 4   RAQNTLGGSTSPYLRQHADNPVHWQQWGPEATEWARERDVPILLSIGYSACHWCHVMAHE 63

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMG 219
           SFEDE VA L+N+ FV +KVDREERPD+D VYM    A+ G GGWP++ FL+PD  P   
Sbjct: 64  SFEDESVASLMNEHFVCVKVDREERPDLDAVYMNATVAMTGQGGWPMTCFLTPDGAPFYC 123

Query: 220 GTYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDE 279
           GTY+P E + G P F  +L  + D W  +R  +  + A  + +L          +    +
Sbjct: 124 GTYYPAEPRGGMPSFTQLLSAIADTWRDRRGDVDDAAASVVAELRRGAGGIPEGDV---Q 180

Query: 280 LPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQK 339
           +    L   A  + +  D+  GGFG APKFP    ++ +L   +  E +G    A E   
Sbjct: 181 VDAALLDAAAGAVLRDEDADRGGFGGAPKFPPSALMEGLL---RTYERSG----AEEVLG 233

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDV 399
           +V  T   MA+GGI+D +GGGF RYSVD  W VPHFEKMLYD  QL   Y      T   
Sbjct: 234 VVARTASAMARGGIYDQLGGGFARYSVDAAWVVPHFEKMLYDNAQLLRAYAHLGRRTGSD 293

Query: 400 FYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGE 459
               +  + +++L RD+    G   SA DAD+   EG T       YVWT  ++ ++LG 
Sbjct: 294 LAVRVTEETVEFLLRDLRTDNGSFASALDADTEGVEGLT-------YVWTPAQLVEVLG- 345

Query: 460 HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSS--ASASKLGMPLE-KYLN 516
                                   P +      V     D +  A  S L +P +    +
Sbjct: 346 ------------------------PEDGEWAARVFAVTADGTFEAGTSVLQLPRDPDDWD 381

Query: 517 ILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGS 576
                R  L   R+ RP+P  DDKV+ +WNGL I++ A A                  G 
Sbjct: 382 RWSRIRGTLLAQRATRPQPGRDDKVVTAWNGLTITALAEAG----------------AGL 425

Query: 577 DRKEYMEVAESAASFI-RRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEF 635
            R E++  A   A  +   H+ D +  R       G S   G L+DYA L +GLL LY+ 
Sbjct: 426 GRPEWVAAAADCARAVLGLHVVDGRLRRASLGTSVGESA--GVLEDYACLATGLLALYQA 483

Query: 636 GSGTKWLVWAIELQNTQDELFLDRE-GGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSV 694
              ++WL  A  L +     F D E  G +F+T  +  S++ R ++  DGA PSG S  V
Sbjct: 484 TGDSEWLTHAQALLDRALIHFADDERPGSWFDTADDAESLVTRPRDPVDGATPSGASCLV 543

Query: 695 INLVRLASIVAGSKSDYYRQNAEHSL 720
             L+  A++  G  S  Y   A  SL
Sbjct: 544 EALLTAAAVADGEASGRYATAAAESL 569


>gi|379729659|ref|YP_005321855.1| hypothetical protein SGRA_1536 [Saprospira grandis str. Lewin]
 gi|378575270|gb|AFC24271.1| hypothetical protein SGRA_1536 [Saprospira grandis str. Lewin]
          Length = 689

 Score =  348 bits (892), Expect = 8e-93,   Method: Compositional matrix adjust.
 Identities = 224/630 (35%), Positives = 333/630 (52%), Gaps = 72/630 (11%)

Query: 100 KHTNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVE 159
           K++NRL  E SPYL QHAHNPVDW+ WG+EA  +A+  +  I LSIGYSTCHWCHVME E
Sbjct: 2   KYSNRLQKESSPYLQQHAHNPVDWYPWGQEALDKAKAENKMILLSIGYSTCHWCHVMEKE 61

Query: 160 SFEDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGG-GGWPLSVFLSPDLKPLM 218
           SFED  V + +N  FVSIKVDREERPD+D +YM  VQ + GG GGWPL+ FL P+ +P  
Sbjct: 62  SFEDPRVGEFMNQHFVSIKVDREERPDLDHIYMEAVQLVTGGQGGWPLNCFLLPNGRPFF 121

Query: 219 GGTYFPPEDKYGRPGFKTILRKVKDAWDKK-RDMLAQSGAF------AIEQLSEALSASA 271
           GGTYFPP     R  +  +L  +   W ++ + ++ Q+           ++++E +    
Sbjct: 122 GGTYFPPRRMQNRNSWMEVLGNLSKVWQEQPKTIIDQADKLYNFLQKGEDKMTEGIDFGQ 181

Query: 272 SSNKLPDELPQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMML--YHSKKLEDTG 329
           + +        +    C +QL+ ++D + GGFG +PKFP  + ++ +L  Y+ +K     
Sbjct: 182 NGDS---PFKASDWNYCLDQLADNFDEQAGGFGHSPKFPSVMSLRYLLNSYYYEK----- 233

Query: 330 KSGEASEGQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVY 389
                 +  + + F+L  M  GGI+D +GGGF RY+VD  W +PHFEKMLYD   L  + 
Sbjct: 234 ----DQKAMQQLQFSLDAMIYGGIYDQLGGGFARYTVDRYWKIPHFEKMLYDNALLIGLL 289

Query: 390 LDAFSLTKDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWT 449
            D++ LT+   Y+    +  ++L+ +M  P G  +SA DADS   EG    +EG FYVW 
Sbjct: 290 ADSYKLTQKPLYAQTIAECWNWLQSEMQSPEGTYYSALDADS---EG----EEGKFYVWN 342

Query: 450 SKEVEDILGE----HAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASAS 505
            +E++  L         +F + Y   P GN            ++GK +L      +  A 
Sbjct: 343 WEELQRALANWPQPWKQIFLDFYDASPAGN------------WEGKIILRRPQSLAGFAQ 390

Query: 506 KLGMPLEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAE 565
              +  E+    L + +  L D+R++R RP  D+K+I+SWN L+ S+  +A + ++    
Sbjct: 391 SRKLDPEELQQELDKIKAHLLDIRAQRIRPGRDEKIILSWNALLASALLKAYQAIR---- 446

Query: 566 SAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAP--GFLDDYA 623
                 P       EY + A      I + L +E+   L HS+  G   AP   F DDYA
Sbjct: 447 -----LP-------EYKKAALGILEQIEKRLQNEKGQLL-HSYA-GDKIAPQLAFSDDYA 492

Query: 624 FLISGLLDLYEFGSGTKWLVWAIELQNTQDELFLDR--EGGGYFNTTGEDPSVLLRVKED 681
           FLI   L  YE     K L  A +L         D   E G ++ ++ +   +L R K+ 
Sbjct: 493 FLIEAHLLAYEVSFEEKHLQRADQLMQA---CIADHSAEAGLFYYSSAQQTDILYRKKDL 549

Query: 682 HDGAEPSGNSVSVINLVRLASIVAGSKSDY 711
           +D A PSGNS  + NL +L  ++   K++Y
Sbjct: 550 YDSATPSGNSSLMHNLEQLGILL--DKAEY 577


>gi|251771511|gb|EES52088.1| protein of unknown function DUF255 [Leptospirillum
           ferrodiazotrophum]
          Length = 674

 Score =  347 bits (891), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 235/601 (39%), Positives = 314/601 (52%), Gaps = 66/601 (10%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           NRL  E SPYL QHA NPVDW+ WGEEA+ E+ +   P+ LSIGY+ CHWCHVM  ESFE
Sbjct: 3   NRLKDETSPYLRQHAENPVDWYPWGEEAWEESARSGRPVLLSIGYAACHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQAL-YGGGGWPLSVFLSPDLKPLMGGT 221
           D   A  +N  FV+IKVDREERPD+D +Y T  Q L   GGGWPL+VFL+    P   GT
Sbjct: 63  DPETAAQMNRDFVNIKVDREERPDLDLIYQTAHQILARRGGGWPLTVFLTSRKVPFAAGT 122

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP   ++G PGF  +L +++  +D+ R  L       +  + E+L+        P    
Sbjct: 123 YFPRTSRFGLPGFTEVLGRIRGFYDEHRSELESPENRQVVDILESLT--------PRRRG 174

Query: 282 QNALRLCAEQ-----LSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASE 336
           +++L L   Q     L + +D  FGGFG APKFP          HS+ L     S EAS+
Sbjct: 175 ESSLSLAPVQSFLAHLRQVFDRDFGGFGGAPKFP----------HSQGLSFLLDSSEASD 224

Query: 337 GQKMVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLT 396
            ++M   TL+ MA+GG+ D +GGGF RYSVD+RW +PHFEKMLYD G L  +Y  A ++T
Sbjct: 225 -REMAFLTLRKMARGGLFDQIGGGFARYSVDDRWEIPHFEKMLYDNGPLLGLYARAHAMT 283

Query: 397 KDVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDI 456
            D F+  +      + +R+M    G  FS+ DADS   EG    +EG FY W+  EVE+ 
Sbjct: 284 GDPFFREVAERTALWAQREMRSQEGMYFSSLDADS---EG----EEGRFYRWSRTEVEES 336

Query: 457 LGEHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKN---VLIELNDSSASASKLGMPLEK 513
           L       +    L   G         P N F+G +   VL +  +  A    L  P E 
Sbjct: 337 LSGR----ERQAALACLG------FDRPPN-FEGHHWHAVLAKTPEEWAREEGLS-PFEA 384

Query: 514 YLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPV 573
              + G  R  LF  RS R RP LDDK++ SWN L       A + L  E          
Sbjct: 385 SEALRG-ARETLFRRRSSRVRPGLDDKMLTSWNALWARGLLEAGRHLGRE---------- 433

Query: 574 VGSDRKEYMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLY 633
               R+E  E+  +    IRRH++ E   RL      G S+   +LDDYAFL+  LL+  
Sbjct: 434 --DWRQEGREILRA----IRRHMWHEG--RLLAVRAGGKSRLGAYLDDYAFLLEALLEEL 485

Query: 634 EFGSGTKWLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVS 693
                 + L +A+ +     ELF D E GG+F T  +  S+ +R K  HD + PSGN  +
Sbjct: 486 SSEFSEETLDFALSVARALQELFEDPEEGGFFFTARDHESLPVRTKPGHDQSLPSGNGSA 545

Query: 694 V 694
            
Sbjct: 546 A 546


>gi|161619977|ref|YP_001593864.1| spermatogenesis-associated protein 20 [Brucella canis ATCC 23365]
 gi|260567466|ref|ZP_05837936.1| conserved hypothetical protein [Brucella suis bv. 4 str. 40]
 gi|376275351|ref|YP_005115790.1| thioredoxin domain-containing protein [Brucella canis HSK A52141]
 gi|161336788|gb|ABX63093.1| Spermatogenesis-associated protein 20 precursor [Brucella canis
           ATCC 23365]
 gi|260156984|gb|EEW92064.1| conserved hypothetical protein [Brucella suis bv. 4 str. 40]
 gi|363403918|gb|AEW14213.1| thioredoxin domain-containing protein [Brucella canis HSK A52141]
          Length = 666

 Score =  347 bits (891), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 224/615 (36%), Positives = 309/615 (50%), Gaps = 62/615 (10%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           +NRLA E S YL QHA+NPV W  WG +A   A++ D PI LSIGY+ CHWCHVM  ESF
Sbjct: 6   SNRLAGEPSAYLRQHANNPVHWQPWGRKALDAAKELDRPILLSIGYAACHWCHVMAHESF 65

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           ED+ VA ++N +F+++KVDREERPD+D++YM  + A+   GGWPL++FL PD KP  GGT
Sbjct: 66  EDDDVAAVMNAFFINVKVDREERPDIDQIYMAALGAMGQQGGWPLTMFLRPDGKPFWGGT 125

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP   ++  PGF  IL  V + W + +D +  +     + L   L  +A S  L +E+ 
Sbjct: 126 YFPRHARHNMPGFVDILHAVNNLWHRDKDKINHNAEAVFDHLEGRL--AAQSQPLQNEIS 183

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPR-PVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
           +      A ++    D + GG    PKFP  P    + L    +  +T +          
Sbjct: 184 R--FDDLANRIGSLIDPQRGGIEGVPKFPNAPFMDTLWLSWLYRHNETHRDN-------- 233

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
            L +L+ M +GGI+DH+GGG  RYS D  W VPHFEKMLYD  Q       AF+ T D  
Sbjct: 234 FLLSLKTMLQGGIYDHLGGGLCRYSTDAEWLVPHFEKMLYDNAQFIRHANYAFAETGDDL 293

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
           +     + +D+L R+M  P G   S+ DADS   EG    +EG FYVWT  E++ +LG +
Sbjct: 294 FRIRIEETVDWLIREMQLPDGCFASSLDADS---EG----EEGKFYVWTEDEIDAVLGTY 346

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
           A +FK  Y + P GN            ++GKN+L  L+  +A+ +    PL      +  
Sbjct: 347 AEVFKTFYAVTPGGN------------WEGKNILNRLH--AAAETPTPPPL------VEA 386

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            RRKL   R  R RP  DDK +  WNGL I + A A +                   R +
Sbjct: 387 ARRKLLAHRETRIRPGRDDKALTDWNGLAIRALAEAGRSFA----------------RTD 430

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           ++E A  A   I       Q  R+ H    G    P    DYA +I+  L LYE      
Sbjct: 431 WLEHAVQAYRSIGSSF---QDGRIAHCRMEGAFLYPALATDYAAMINAALALYEATGEFA 487

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
           ++  A + +   D    D  G    +  G D  V+L    D+D A PS  S  +  L RL
Sbjct: 488 YIDDARKFKRALDGSHRDSAGNYRLSALGAD-DVILHAYGDYDEAIPSATSQIIEALTRL 546

Query: 701 ASIVAGSKSDYYRQN 715
              +A   S  Y +N
Sbjct: 547 --FLATGDSALYEEN 559


>gi|377562896|ref|ZP_09792262.1| hypothetical protein GOSPT_007_00380 [Gordonia sputi NBRC 100414]
 gi|377529874|dbj|GAB37427.1| hypothetical protein GOSPT_007_00380 [Gordonia sputi NBRC 100414]
          Length = 667

 Score =  347 bits (890), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 226/618 (36%), Positives = 310/618 (50%), Gaps = 96/618 (15%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N+L+A  SPYL QHA NPVDW  W + A  E+ +RDVPI LS+GY+ CHWCHVM  ESFE
Sbjct: 3   NQLSASSSPYLRQHADNPVDWREWTDAALEESVRRDVPILLSVGYAACHWCHVMAHESFE 62

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           D   A  +N  FV IKVDREERPD+D +YM    A+   GGWP++ FL+P  +P   GTY
Sbjct: 63  DADTAAQMNRDFVCIKVDREERPDIDAIYMNATVAMTRQGGWPMTCFLTPSGEPFYCGTY 122

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP   + G P F+ IL  V  AW  +R  +   GA   E LS+A SA  +     DE   
Sbjct: 123 FPDTPRGGMPSFRQILSAVTQAWTTRRSEIESMGARVREALSDAASALPAGGVDVDE--- 179

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
             L      +    D   GGFG  PKFP    ++ +L H +      +SG+A+  Q  V+
Sbjct: 180 RLLDYAVTTVLGDEDQAAGGFGGPPKFPPSALLEGLLRHYE------RSGDAAPLQA-VM 232

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
            T   MA+GGI+D +GGGF RY+VD  W VPHFEKMLYD  QL  VY     +  D    
Sbjct: 233 RTTDAMARGGIYDQLGGGFSRYAVDNDWVVPHFEKMLYDNAQLLRVYGHLARIVDDPLSG 292

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH-- 460
            I  +I+D+LRRD+   GG   S+ DAD+A  EG+T       YVWT  ++ ++LG+   
Sbjct: 293 RIAEEIVDFLRRDLRVVGG-FASSLDADAAGVEGST-------YVWTPAQLREVLGDEDG 344

Query: 461 ---AILFK-------EHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMP 510
              A LF        EH      G   L   +DP                          
Sbjct: 345 DWAAALFGVTEAGTFEH------GASTLQLRTDP-------------------------- 372

Query: 511 LEKYLNILGECRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFN 570
            ++Y ++    RR+L   R+ RP+P  DDKV+  WN + +++ A A   L          
Sbjct: 373 -DRYADV----RRRLLTARASRPQPPRDDKVVTGWNAMAVTALAEAGAALG--------- 418

Query: 571 FPVVGSDRKEYMEVA-ESAASFIRRHLYDEQTHRLQHSFRNGPSKAP-GFLDDYAFLISG 628
                    +++++A E     +  HL D Q   L+ S   G   AP   LDD+A L++ 
Sbjct: 419 -------HSDWVDLAVEVLTELVDSHLVDGQ---LRRSSLGGVVGAPLAALDDHAALVTA 468

Query: 629 LLDLYEFGSGTKWLVWAIELQNTQDELFLD-REGGGYFNTTGEDPSVLLRVKEDHDGAEP 687
           +L +Y+    T W    + L +   E F D  E G +F+      +++ R ++  DGA P
Sbjct: 469 MLTVYQVTGETSWCDKGLALLDEAIETFADPDEAGAWFDAA--QGTLIARPRDPADGATP 526

Query: 688 SGNSVSVINLVRLASIVA 705
           SG S     LV  A++VA
Sbjct: 527 SGAS-----LVAEATLVA 539


>gi|154251723|ref|YP_001412547.1| hypothetical protein Plav_1270 [Parvibaculum lavamentivorans DS-1]
 gi|154155673|gb|ABS62890.1| protein of unknown function DUF255 [Parvibaculum lavamentivorans
           DS-1]
          Length = 676

 Score =  347 bits (890), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 235/661 (35%), Positives = 341/661 (51%), Gaps = 63/661 (9%)

Query: 103 NRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESFE 162
           N L AE SPYLLQHA NPV W  WGE A   A+K   PI LS+GY+ CHWCHVM  ESFE
Sbjct: 4   NFLDAETSPYLLQHADNPVHWRPWGEAALDAAKKEKKPILLSVGYAACHWCHVMAHESFE 63

Query: 163 DEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGTY 222
           DE VA ++N+ FV+IKVDREERPD+D +YM+ +  L   GGWPL++FL+P+ +P  GGTY
Sbjct: 64  DESVAAVMNEHFVNIKVDREERPDIDAIYMSALHLLGQQGGWPLTMFLTPEGEPFWGGTY 123

Query: 223 FPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELPQ 282
           FP E  YGRPGF  +L +V   + ++   + ++    ++ L E  SA+A   +   ++P 
Sbjct: 124 FPKEPNYGRPGFVQVLEEVARIFREEPAKVYKNRTALVKALEEQ-SATARPGEPTPQVPI 182

Query: 283 NALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSKKLEDTGKSGEASEGQKMVL 342
               + AE+L +  D   GG   APKFP+ V +  +L+ +     TG+   A+     V 
Sbjct: 183 ----VVAEKLREIMDPVHGGIRGAPKFPQ-VPLLTLLWRAHL--RTGREDLAAP----VS 231

Query: 343 FTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVFYS 402
             L  M++GGI+DH+GGG+ RYSVDE W  PHFEKMLYD   L ++    +  T+   Y 
Sbjct: 232 RALDHMSEGGIYDHLGGGYARYSVDEFWLAPHFEKMLYDNALLIDLLTLVWQETRKPLYE 291

Query: 403 YICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL--GEH 460
              R+ +++L R+M+  GG   ++ DADS   EG     EG FYVW+  E++++L  GE 
Sbjct: 292 RRIRETVEWLAREMVTEGGGFAASLDADS---EGV----EGKFYVWSEAEIDNLLTPGE- 343

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
           A LFK+ Y +   GN            ++  N+L  L  + A  +       +    L  
Sbjct: 344 AELFKQVYNVSGEGN------------WEETNILNRLARADAPFTA------EEEAALEP 385

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            + +LF  R  R  P  DDKV+  WNGL+I++ ARA                        
Sbjct: 386 LKARLFLERDLRVHPGFDDKVLADWNGLMIAALARAGAAFGEAG---------------- 429

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           + E+A +A  F+   +   +  RL H++R G  +     DD A +    L LYE     +
Sbjct: 430 WTEMAAAAFRFVMTEM--RKDGRLHHAWRAGKLQHIAMADDLANMADAALALYEATGEAE 487

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
           +L  A  L       + D   GGYF T  + P++++R +   D A P+ N      L RL
Sbjct: 488 YLQAAESLAAELGAHYRDETNGGYFFTADDAPALIVRRRTVADDATPAANGTMPGVLARL 547

Query: 701 ASIVAGSKSDYYRQNAEHSLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSS 760
           A +    K DY  + A+  +  F   L+      PL    A + +      +VL+G K+ 
Sbjct: 548 ALMT--GKQDYLAR-ADELIRAFAGELQQNIF--PLGSYIASLDTRLKPVQIVLIGSKAE 602

Query: 761 V 761
            
Sbjct: 603 T 603


>gi|163844081|ref|YP_001628485.1| spermatogenesis-associated protein 20 [Brucella suis ATCC 23445]
 gi|163674804|gb|ABY38915.1| Spermatogenesis-associated protein 20 precursor [Brucella suis ATCC
           23445]
          Length = 666

 Score =  347 bits (890), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 224/615 (36%), Positives = 308/615 (50%), Gaps = 62/615 (10%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           +NRLA E S YL QHA+NPV W  WG +A   A++ D PI LSIGY+ CHWCHVM  ESF
Sbjct: 6   SNRLAGEPSAYLRQHANNPVHWQPWGRKALDAAKELDRPILLSIGYAACHWCHVMAHESF 65

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           ED+ VA ++N +F+++KVDREERPD+D++YM  + A+   GGWPL++FL PD KP  GGT
Sbjct: 66  EDDDVAAVMNAFFINVKVDREERPDIDQIYMAALGAMGQQGGWPLTMFLRPDGKPFWGGT 125

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP   ++  PGF  IL  V + W + +D +  +     + L   L  +A S  L +E+ 
Sbjct: 126 YFPRHARHNMPGFVDILHAVNNLWHRDKDKINHNAEAVFDHLEGRL--AAQSQPLQNEIS 183

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPR-PVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
           +      A ++    D + GG    PKFP  P    + L    +  +T +          
Sbjct: 184 R--FDDLANRIGSLIDPQRGGIEGVPKFPNAPFMDTLWLSWLYRHNETHRDN-------- 233

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
            L +L+ M +GGI+DH+GGG  RYS D  W VPHFEKMLYD  Q       AF+ T D  
Sbjct: 234 FLLSLKTMLQGGIYDHLGGGLCRYSTDAEWLVPHFEKMLYDNAQFIRHANYAFAETGDDL 293

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
           +     + +D+L R+M  P G   S+ DADS   EG    +EG FYVWT  E++ +LG  
Sbjct: 294 FRIRIEETVDWLIREMQVPDGCFASSLDADS---EG----EEGKFYVWTEDEIDAVLGTD 346

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
           A +FK  Y + P GN            ++GKN+L  L+  +A+ +    PL      +  
Sbjct: 347 AEVFKTFYVVTPGGN------------WEGKNILNRLH--AAAETPTPPPL------VEA 386

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            RRKL   R  R RP  DDK +  WNGL I + A A +                   R +
Sbjct: 387 ARRKLLAHRETRIRPGRDDKALTDWNGLAIRALAEAGRSFA----------------RTD 430

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           ++E A  A   I       Q  R+ H    G    P    DYA +I+  L LYE      
Sbjct: 431 WLEHAVQAYRSIGSSF---QDGRIAHCRMEGAFLYPALATDYAAMINAALALYEATGEFA 487

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
           ++  A + +   D    D  G    +  G D  V+L    D+D A PS  S  +  L RL
Sbjct: 488 YIDDARKFKRALDGSHRDSAGNYRLSALGAD-DVILHAYGDYDEAIPSATSQIIEALTRL 546

Query: 701 ASIVAGSKSDYYRQN 715
              +A   S  Y +N
Sbjct: 547 --FLATGDSALYEEN 559


>gi|428313155|ref|YP_007124132.1| thioredoxin domain-containing protein [Microcoleus sp. PCC 7113]
 gi|428254767|gb|AFZ20726.1| thioredoxin domain protein [Microcoleus sp. PCC 7113]
          Length = 702

 Score =  347 bits (890), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 240/710 (33%), Positives = 345/710 (48%), Gaps = 106/710 (14%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           +NRLA   S YL +HA NP+DW+ W +EA   A+  + PIFLSIGYS+CHWC VME E+F
Sbjct: 2   SNRLAHSQSLYLRKHAENPIDWWPWCDEALETAKVANKPIFLSIGYSSCHWCTVMEGEAF 61

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLK-PLMGG 220
            +  +A+ +N  F+ IKVDREERPD+D +YM  +Q + G GGWPL+VFL+PD + P  GG
Sbjct: 62  SNSAIAEYMNANFLPIKVDREERPDIDSIYMQALQMMTGQGGWPLNVFLTPDDRVPFYGG 121

Query: 221 TYFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDEL 280
           TYFP   +YGRPGF  +L+ V+  +D ++  L       +  L +A S    +  L ++L
Sbjct: 122 TYFPVTPRYGRPGFLQVLQAVRRFYDLEKTKLQTFKEEILTNLQQA-SVPPGTEPLSEDL 180

Query: 281 PQNALRLCAEQLSKSYDSRFGGFGSAPKFPRPVEIQMMLYHSK-KLEDTGKSGEASEGQK 339
            +  +      +S       G +G  P FP     +++L  S+ K E    S +A   + 
Sbjct: 181 LERGIETNTGVVSA------GNYG--PSFPMMPYAELVLRGSRFKFESKYDSFQAVRLRG 232

Query: 340 MVLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFS--LTK 397
           + L      AKGGI+DHV GGFHRY+VD  W VPHFEKMLYD GQ+     + +S  +T+
Sbjct: 233 LDL------AKGGIYDHVAGGFHRYTVDATWTVPHFEKMLYDNGQIVEYLANLWSAGITE 286

Query: 398 DVFYSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDIL 457
             F   I   +  +L+R+M  P G  ++A+DADS     A   +EGAFYVW+  E+E +L
Sbjct: 287 PAFKRAIAGTV-QWLKREMTSPQGFFYAAQDADSFSEPNAAEPEEGAFYVWSYGELEQLL 345

Query: 458 G-EHAILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLI-----ELNDSSASASKLGMPL 511
             E     KE + +   GN            F+G NVL      EL+D+  +A      L
Sbjct: 346 TPEELTELKEQFTITAEGN------------FEGTNVLQRRHSEELSDTVEAA------L 387

Query: 512 EKYLNILGECRRKLFDV--------------RSKRPRPHLDDKVIVSWNGLVISSFARAS 557
            K   +    +  + D                  R     D K+IV+WN L+IS  AR+ 
Sbjct: 388 AKLFAVRYGSKPDVLDTFPPARNNQEAKGNNWQGRIPAVTDTKMIVAWNSLMISGLARSY 447

Query: 558 KILKSEAESAMFNFPVVGSDRKEYMEVAESAASFIRRHLYDE-QTHRLQHSFRNGPSKAP 616
            +                  + EY ++A  AA FI    + + + HRL +   +G     
Sbjct: 448 SVFH----------------QPEYWQLAADAAQFILNSQWVQGRFHRLNY---DGQPSVL 488

Query: 617 GFLDDYAFLISGLLDLYEFG-----------------SGTKWLVWAIELQNTQDELFLDR 659
              +DYA  I  LLDL++                     + WL  AI +Q   DE     
Sbjct: 489 AQSEDYALFIKALLDLHQASWSFSKMHLESSNPPSNLQPSDWLEKAIRVQEEFDEFLWSV 548

Query: 660 EGGGYFNTTGEDPS-VLLRVKEDHDGAEPSGNSVSVINLVRLASIVAGSKSDYYRQNAEH 718
           E GGY+N   +    +L+R +   D A PS N +++ NLVRLA +    +   Y   AE 
Sbjct: 549 ELGGYYNAASDGSGELLVRERSYADNATPSANGIAIANLVRLALLTEDLQ---YLDQAEQ 605

Query: 719 SLAVFETRLKDMAMAVPLMCCAADMLSVPSRKHVVLVGHKSSVDFENMLA 768
           +L  F   +       P +  A D        H  L+  +SS DF   L+
Sbjct: 606 ALQAFSRVMNQSPQVCPSLFTALDWYC-----HCTLI--RSSDDFLTSLS 648


>gi|225626442|ref|ZP_03784481.1| Spermatogenesis-associated protein 20 precursor [Brucella ceti str.
           Cudo]
 gi|225618099|gb|EEH15142.1| Spermatogenesis-associated protein 20 precursor [Brucella ceti str.
           Cudo]
          Length = 682

 Score =  347 bits (889), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 224/615 (36%), Positives = 308/615 (50%), Gaps = 62/615 (10%)

Query: 102 TNRLAAEHSPYLLQHAHNPVDWFAWGEEAFAEARKRDVPIFLSIGYSTCHWCHVMEVESF 161
           +NRLA E S YL QHA+NPV W  WG +A   A++ D PI LSIGY+ CHWCHVM  ESF
Sbjct: 22  SNRLAGEPSAYLRQHANNPVHWQPWGRKALDAAKELDRPILLSIGYAACHWCHVMAHESF 81

Query: 162 EDEGVAKLLNDWFVSIKVDREERPDVDKVYMTYVQALYGGGGWPLSVFLSPDLKPLMGGT 221
           ED+ VA ++N +F+++KVDREERPD+D++YM  + A+   GGWPL++FL PD KP  GGT
Sbjct: 82  EDDDVAAVMNAFFINVKVDREERPDIDQIYMAALGAMGQQGGWPLTMFLRPDGKPFWGGT 141

Query: 222 YFPPEDKYGRPGFKTILRKVKDAWDKKRDMLAQSGAFAIEQLSEALSASASSNKLPDELP 281
           YFP   ++  PGF  IL  V + W + +D +  +     + L   L  +A S  L +E+ 
Sbjct: 142 YFPRHARHNMPGFVDILHAVNNLWHRDKDKINHNAEAVFDHLEGRL--AAQSQPLQNEIS 199

Query: 282 QNALRLCAEQLSKSYDSRFGGFGSAPKFPR-PVEIQMMLYHSKKLEDTGKSGEASEGQKM 340
           +      A ++    D + GG    PKFP  P    + L    +  +T +          
Sbjct: 200 R--FDDLANRIGSLIDPQRGGIEGVPKFPNAPFMDTLWLSWLYRHNETHRDN-------- 249

Query: 341 VLFTLQCMAKGGIHDHVGGGFHRYSVDERWHVPHFEKMLYDQGQLANVYLDAFSLTKDVF 400
            L +L+ M +GGI+DH+GGG  RYS D  W VPHFEKMLYD  Q       AF+ T D  
Sbjct: 250 FLLSLKTMLQGGIYDHLGGGLCRYSTDAEWLVPHFEKMLYDNAQFIRHANYAFAETGDDL 309

Query: 401 YSYICRDILDYLRRDMIGPGGEIFSAEDADSAETEGATRKKEGAFYVWTSKEVEDILGEH 460
           +     + +D+L R+M  P G   S+ DADS   EG    +EG FYVWT  E++ +LG  
Sbjct: 310 FRIRIEETVDWLIREMQLPDGCFASSLDADS---EG----EEGKFYVWTEDEIDAVLGTD 362

Query: 461 AILFKEHYYLKPTGNCDLSRMSDPHNEFKGKNVLIELNDSSASASKLGMPLEKYLNILGE 520
           A +FK  Y + P GN            ++GKN+L  L+  +A+ +    PL      +  
Sbjct: 363 AEVFKTFYAVTPGGN------------WEGKNILNRLH--AAAETPTPPPL------VEA 402

Query: 521 CRRKLFDVRSKRPRPHLDDKVIVSWNGLVISSFARASKILKSEAESAMFNFPVVGSDRKE 580
            RRKL   R  R RP  DDK +  WNGL I + A A +                   R +
Sbjct: 403 ARRKLLAHRETRIRPGRDDKALTDWNGLAIRALAEAGRSFA----------------RTD 446

Query: 581 YMEVAESAASFIRRHLYDEQTHRLQHSFRNGPSKAPGFLDDYAFLISGLLDLYEFGSGTK 640
           ++E A  A   I       Q  R+ H    G    P    DYA +I+  L LYE      
Sbjct: 447 WLEHAVQAYRSIGSSF---QDGRIAHCRMEGAFLYPALATDYAAMINAALALYEATGEFA 503

Query: 641 WLVWAIELQNTQDELFLDREGGGYFNTTGEDPSVLLRVKEDHDGAEPSGNSVSVINLVRL 700
           ++  A + +   D    D  G    +  G D  V+L    D+D A PS  S  +  L RL
Sbjct: 504 YIDDARKFKRALDGSHRDSAGNYRLSALGAD-DVILHAYGDYDEAIPSATSQIIEALTRL 562

Query: 701 ASIVAGSKSDYYRQN 715
              +A   S  Y +N
Sbjct: 563 --FLATGDSALYEEN 575


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.134    0.405 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 12,754,527,029
Number of Sequences: 23463169
Number of extensions: 563428584
Number of successful extensions: 1170385
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1671
Number of HSP's successfully gapped in prelim test: 106
Number of HSP's that attempted gapping in prelim test: 1159972
Number of HSP's gapped (non-prelim): 2212
length of query: 784
length of database: 8,064,228,071
effective HSP length: 151
effective length of query: 633
effective length of database: 8,816,256,848
effective search space: 5580690584784
effective search space used: 5580690584784
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 81 (35.8 bits)